U.S. patent application number 09/071035 was filed with the patent office on 2002-04-18 for enterococcus faecalis polynucleotides and polypeptides.
This patent application is currently assigned to HUMAN GENOME SCIENCES, INC.. Invention is credited to BAILEY, CAMELLA, CHOI, GIL H., HROMOCKYJ, ALEX, KUNSCH, CHARLES A..
Application Number | 20020045737 09/071035 |
Document ID | / |
Family ID | 27366427 |
Filed Date | 2002-04-18 |
United States Patent
Application |
20020045737 |
Kind Code |
A1 |
CHOI, GIL H. ; et
al. |
April 18, 2002 |
ENTEROCOCCUS FAECALIS POLYNUCLEOTIDES AND POLYPEPTIDES
Abstract
The present invention relates to novel genes from E. faecalis
and the polypeptides they encode. Also provided as are vectors,
host cells, antibodies and methods for producing the same. The
invention further relates to screening methods for identifying
agonists and antagonists of E. faecalis polypeptide activity. The
invention additionally relates to diagnostic methods for detecting
Enterococcus nucleic acids, polypeptides and antibodies in a
biological sample. The present invention further relates to novel
vaccines for the prevention or attenuation of infection by
Enterococcus.
Inventors: |
CHOI, GIL H.; (ROCKVILLE,
MD) ; BAILEY, CAMELLA; (TAKOMA PARK, MD) ;
HROMOCKYJ, ALEX; (N. POTOMAC, MD) ; KUNSCH, CHARLES
A.; (NORCROSS, GA) |
Correspondence
Address: |
HUMAN GENOME SCIENCES INC
9410 KEY WEST AVENUE
ROCKVILLE
MD
20850
|
Assignee: |
HUMAN GENOME SCIENCES, INC.
|
Family ID: |
27366427 |
Appl. No.: |
09/071035 |
Filed: |
May 4, 1998 |
Current U.S.
Class: |
536/23.1 |
Current CPC
Class: |
C07K 14/315 20130101;
A61P 31/04 20180101; C07K 14/24 20130101 |
Class at
Publication: |
536/23.1 |
International
Class: |
C07H 021/02; C07H
021/04 |
Claims
What is claimed is:
1. An isolated nucleic acid molecule comprising a polynucleotide
having a nucleotide sequence selected from the group consisting of:
(a) a nucleotide sequence encoding any one of the amino acid
sequences of the polypeptides shown in Table 1; or (b) a nucleotide
sequence complementary to any one of the nucleotide sequences in
(a). (c) a nucleotide sequence at least 95% identical to any one of
the nucleotide sequences shown in Table 1; or, (d) a nucleotide
sequence at least 95% identical to a nucleotide sequence
complementary to any one of the nucleotide sequences shown in Table
1.
2. An isolated nucleic acid molecule of claim 1 comprising a
polynucleotide which hybridizes under stringent hybridization
conditions to a polynucleotide having a nucleotide sequence
identical to a nucleotide sequence in (a) or (b) of claim 1.
3. An isolated nucleic acid molecule of claim 1 comprising a
polynucleotide which encodes an epitope-bearing portion of a
polypeptide in (a) of claim 1.
4. The isolated nucleic acid molecule of claim 3, wherein said
epitope-bearing portion of a polypeptide comprises an amino acid
sequence listed in Table 4.
5. A method for making a recombinant vector comprising inserting an
isolated nucleic acid molecule of claim 1 into a vector.
6. A recombinant vector produced by the method of claim 5.
7. A host cell comprising the vector of claim 6.
8. A method of producing a polypeptide comprising: (a) growing the
host cell of claim 7 such that the protein is expressed by the
cell; and (b) recovering the expressed polypeptide.
9. An isolated polypeptide comprising a polypeptide selected from
the group consisting of: (a) a polypeptide consisting of one of the
complete amino acid sequences of Table 1; (b) a polypeptide
consisting of one the complete amino acid sequences of Table 1
except the N-terminal residue; (c) a fragment of the polypeptide of
(a) having biological activity; and (d) a fragment of the
polypeptide of (a) which binds to an antibody specific for the
polypeptide of (a).
10. An isolated antibody specific for the polypeptide of claim
9.
11. A polypeptide produced according to the method of claim 8.
12. An isolated polypeptide comprising an amino acid sequence at
least 95% identical to a sequence selected from the group
consisting of an amino acid sequence of any one of the polypeptides
in Table 1.
13. An isolated polypeptide antigen comprising an amino acid
sequence of an E. faecalis epitope shown in Table 4.
14. An isolated nucleic acid molecule comprising a polynucleotide
with a nucleotide sequence encoding a polypeptide of claim 9.
15. A hybridoma which produces an antibody of claim 10.
16. A vaccine, comprising: (1) one or more E. faecalis polypeptides
selected from the group consisting of a polypeptide of claim 9; and
(2) a pharmaceutically acceptable diluent, carrier, or excipient;
wherein said polypeptide is present, in an amount effective to
elicit protective antibodies in an animal to a member of the
Enterococcus genus.
17. A method of preventing or attenuating an infection caused by a
member of the Enterococcus genus in an animal, comprising
administering to said animal a polypeptide of claim 9, wherein said
polypeptide is administered in an amount effective to prevent or
attenuate said infection.
18. A method of detecting Enterococcus nucleic acids in a
biological sample comprising: (a) contacting the sample with one or
more nucleic acids of claim 1, under conditions such that
hybridization occurs, and (b) detecting hybridization of said
nucleic acids to the one or more Enterococcus nucleic acid
sequences present in the biological sample.
19. A method of detecting Enterococcus nucleic acids in a
biological sample obtained from an animal, comprising: (a)
amplifying one or more Enterococcus nucleic acid sequences in said
sample using polymerase chain reaction, and (b) detecting said
amplified Enterococcus nucleic acid.
20. A kit for detecting Enterococcus antibodies in a biological
sample obtained from an animal, comprising (a) a polypeptide of
claim 9 attached to a solid support; and (b) detecting means.
21. A method of detecting Enterococcus antibodies in a biological
sample obtained from an animal, comprising (a) contacting the
sample with a polypeptide of claim 9; and (b) detecting
antibody-antigen complexes.
Description
[0001] This application claims benefit of 35 U.S.C. section 119(e)
based on copending U.S. Provisional Application No. 60/046,655,
filed May 16, 1997; 60/044,031, filed May 16, 1997; and 60/066,099,
filed Nov. 14, 1997. Provisional Application No. 60/066,099, filed
Nov. 14, 1997 is herein incorporated by reference in its
entirety.
FIELD OF THE INVENTION
[0002] The present invention relates to novel Enterococcus faecalis
genes (E. faecalis) nucleic acids and polypeptides. Also provided
are vectors, host cells and recombinant methods for producing the
same. Further provided are diagnostic methods for detecting
Enterococcus faecalis using probes, primers, and antibodies to the
E. faecalis nucleic acids and polypeptides of the present
invention. The invention further relates to screening methods for
identifying agonists and antagonists of E. faecalis polypeptide
activity and to vaccines using E. faecalis nucleic acids and
polypeptides.
BACKGROUND OF THE INVENTION
[0003] Enterococci have been recognized as being pathogenic for
humans since the turn of the century when they were first described
by Thiercelin in 1988 as microscopic organisms. The genus
Enterococcus includes the species Enterococcus faecalis or E.
faecalis which is the most common pathogen in the group, accounting
for 80-90 percent of all enterococcal infections. See Lewis et al.
(1990) Eur J. Clin Microbiol Infect Dis.9:111-117.
[0004] The incidence of enterococcal infections has increased in
recent years and enterococci are now the second most frequently
reported nosocomial pathogens. Enterococcal infection is of
particular concern because of its resistance to antibiotics. Recent
attention has focused on enterococci not only because of their
increasing role in nosocomial infections, but also because of their
remarkable and increasing resistance to antimicrobial agents. These
factors are mutually reinforcing since resistance allows
enterococci to survive in an environment in which antimicrobial
agents are heavily used; the hospital setting provides the
antibiotics which eliminate or suppress susceptible bacteria,
thereby providing a selective advantage for resistant organisms,
and the hospital also provides the potential for dissemination of
resistant enterococci via the usual routes of hand and
environmental contamination.
[0005] Antimicrobial resistance can be divided into two general
types, inherent or intrinsic property and that which is acquired.
The genes for intrinsic resistance, like other species
characteristics, appear to reside on the chromosome. Acquired
resistance results from either a mutation in the existing DNA or
acquisition of new DNA. The various inherent traits expressed by
enterococci include resistance to semisynthetic
penicillinase-resistant penicillins, cephalosporins, low levels of
aminoglycosides, and low levels of clindamycin. Examples of
acquired resistance include resistance to chloramphenicol,
erythromycin, high levels of clindamycin, tetracycline, high levels
of aminoglycosides, penicillin by means of penicillinase,
fluoroquinolones, and vancomycin. Resistance to high levels of
penicillin without penicillinase and resistance to fluoroquinolones
are not known to be plasmid or transposon mediated and presumably
are due to mutation(s).
[0006] Although the main reservoir for enterococci in humans is the
gastrointestinal tract, the bacteria can also reside in the
gallbladder, urethra and vagina.
[0007] E. faecalis has emerged as an important pathogen in
endocarditis, bacteremia, urinary tract infections (UTIs),
intraabdominal infections, soft tissue infections, and neonatal
sepsis. See Lewis et al. (1990) supra. In the 1970s and 1980s
enterococci became firmly established as major nosocomial
pathogens. They are now the fourth leading cause of
hospital-acquired infection and the third leading cause of
bacteremia in the United States. Fatality ratios for enterococcal
bactermia range from 12% to 68%, with death due to enterococcal
sepsis in 4 to 50% of these cases. See T. G. Emori (1993) Clin.
Microbiol. Rev. 6:428-442.
[0008] The ability of enterococci to colonize the gastrointestinal
tract, plus the many intrinsic and acquired resistance traits,
means that these organisms, which usually seem to have relatively
low intrinsic virulence, are given an excellent opportunity to
become secondary invaders. Since nosocomial isolates of enterococci
have displayed resistance to essentially every useful antimicrobial
agent, it will likely become increasingly difficult to successfully
treat and control enterococcal infections. Particularly when the
various resistance genes come together in a single strain, an event
almost certain to occur at some time in the future.
[0009] The etiology of diseases mediated or exacerbated by
Enterococcus faecalis, involves the programmed expression of E.
faecalis genes, and that characterizing these genes and their
patterns of expression would dramatically add to our understanding
of the organism and its host interactions. Knowledge of the E.
faecalis gene and genomic organization would improve our
understanding of disease etiology and lead to improved and new ways
of preventing, treating and diagnosing diseases. Thus, there is a
need to characterize the genome of E. faecalis and for
polynucleotides of this organism.
SUMMARY OF THE INVENTION
[0010] The present invention provides for isolated E. faecalis
polynucleotides and polypeptides shown in Table 1 and SEQ ID NO:1
through SEQ ID NO:496 (polynucleotide sequences having odd SEQ ID
NOs and polypeptide sequences having even SEQ ID NOs). One aspect
of the invention provides isolated nucleic acid molecules
comprising polynucleotides having a nucleotide sequence selected
from the group consisting of: (a) a nucleotide sequence shown in
Table 1; (b) a nucleotide sequence encoding any of the amino acid
sequences of the polypeptides shown in Table 1; and (c) a
nucleotide sequence complementary to any of the nucleotide
sequences in (a) or (b). The invention further provides for
fragments of the nucleic acid molecules of (a), (b) & (c)
above.
[0011] Further embodiments of the invention include isolated
nucleic acid molecules that comprise a polynucleotide having a
nucleotide sequence at least 90% identical, and more preferably at
least 95%, 96%, 97%, 98% or 99% identical, to any of the nucleotide
sequences in (a), (b) or (c) above, or a polynucleotide which
hybridizes under stringent hybridization conditions to a
polynucleotide in (a), (b) or (c) above. Additional nucleic acid
embodiments of the invention relate to isolated nucleic acid
molecules comprising polynucleotides which encode the amino acid
sequences of epitope-bearing portions of a E. faecalis polypeptide
having an amino acid sequence in (a) above.
[0012] The present invention also relates to recombinant vectors,
which include the isolated nucleic acid molecules of the present
invention, and to host cells containing the recombinant vectors, as
well as to methods of making such vectors and host cells. The
present invention further relates to the use of these vectors in
the production of E. faecalis polypeptides or peptides by
recombinant techniques.
[0013] The invention further provides isolated E. faecalis
polypeptides having an amino acid sequence selected from the group
consisting of an amino acid sequence of any of the polypeptides
described in Table 1 or fragments thereof.
[0014] The polypeptides of the present invention also include
polypeptides having an amino acid sequence with at least 70%
similarity, and more preferably at least 75%, 80%, 85%, 90%, 95%,
96%, 97%, 98%, or 99% similarity to those described in Table 1, as
well as polypeptides having an amino acid sequence at least 70%
identical, more preferably at least 75% identical, and still more
preferably 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% identical to
those above; as well as isolated nucleic acid molecules encoding
such polypeptides.
[0015] The present invention further provides a single or
multi-component vaccine comprising one or more of the E. faecalis
polynucleotides or polypeptides described in Table 1, or fragments
thereof, together with a pharmaceutically acceptable diluent,
carrier, or excipient, wherein the E. faecalis polypeptide(s) are
present in an amount effective to elicit an immune response to
members of the Enterococcus genus. or at least E. faecalis, in an
animal. The E. faecalis polypeptides of the present invention may
further be combined with one or more immunogens of one or more
other Enterococcal or non-Enterococcal organisms to produce a
multi-component vaccine intended to elicit an immunological
response against members of the Enterococcus genus and, optionally,
one or more non-Enterococcal organisms.
[0016] The vaccines of the present invention can be administered in
a DNA form, e.g., "naked" DNA, wherein the DNA encodes one or more
Enterococcal polypeptides and, optionally, one or more polypeptides
of a non-Enterococcal organism. The DNA encoding one or more
polypeptides may be constructed such that these polypeptides are
expressed as fusion proteins.
[0017] The vaccines of the present invention may also be
administered as a component of a genetically engineered organism or
host cell. Thus, a genetically engineered organism or host cell
which expresses one or more E. faecalis polypeptides may be
administered to an animal. For example, such a genetically
engineered organism or host cell may contain one or more E.
faecalis polypeptides of the present invention intracellularly, on
its cell surface, or in its periplasmic space. Further, such a
genetically engineered organism or host cell may secrete one or
more E. faecalis polypeptides. The vaccines of the present
invention may also be co-administered to an animal with an immune
system modulator (e.g., CD86 and GM-CSF).
[0018] The invention also provides a method of inducing an
immunological response in an animal to one or more members of the
Enterococcus genus, preferably one or more isolates of the E.
faecalis species, comprising administering to the animal a vaccine
as described above.
[0019] The invention further provides a method of inducing a
protective immune response in an animal, sufficient to prevent,
attenuate, or control an infection by members of the Enterococcus
genus, preferably at least E. faecalis species, comprising
administering to the animal a composition comprising one or more of
the polynucleotides or polypeptides described in Table 1, or
fragments thereof. Further, these polypeptides, or fragments
thereof, may be conjugated to another immunogen and/or administered
in admixture with an adjuvant.
[0020] The invention further relates to antibodies elicited in an
animal by the administration of one or more E. faecalis
polypeptides of the present invention and to methods for producing
such antibodies and fragments thereof. The invention further
relates to recombinant antibodies and fragments thereof and to
methods for producing such antibodies and fragments thereof.
[0021] The invention also provides diagnostic methods for detecting
the expression of the polynucleotides of Table 1 by members of the
Enterococcits genus in an animal. One such method involves assaying
for the expression of a polynucleotide encoding E. faecalis
polypeptides in a sample from an animal. This expression may be
assaved either directly (e.g., by assaying polypeptide levels using
antibodies elicited in response to amino acid sequences described
in Table 1) or indirectly (e.g., by assaying for antibodies having
specificity for amino acid sequences described in Table 1). The
expression of polynucleotides can also be assayed by detecting the
nucleic acids of Table 1. An example of such a method involves the
use of the polymerase chain reaction (PCR) to amplify and detect
Enterococcus nucleic acid sequences.
[0022] The present invention also relates to nucleic acid probes
having all or part of a nucleotide sequence described in Table 1
(odd SEQ ID NOs) which are capable of hybridizing under stringent
conditions to Enterococcus nucleic acids. The invention further
relates to a method of detecting one or more Enterococcus nucleic
acids in a biological sample obtained from an animal, said one or
more nucleic acids encoding Enterococcus polypeptides, comprising:
(a) contacting the sample with one or more of the above-described
nucleic acid probes, under conditions such that hybridization
occurs, and (b) detecting hybridization of said one or more probes
to the Enterococczis nucleic acid present in the biological
sample.
[0023] Other uses of the polypeptides of the present invention
include: inter alia, to detect E. faecalis immunoassays, as epitope
tags, as molecular weight markers on SDS-PAGE gels, as molecular
weight markers for molecular sieve gel filtration columnns, to
generate antibodies that specificaly bind E. faecalis polypeotides
of the present invention for the detection E. faecalis in
irrununoassays, to generate an immune response against E. faecalis
and other Enterococczus species, and as vaccines against E.
faecalis, other Enterococcus species and other bacteria
genuses.
[0024] Isolated nucleic acid molecules of the present invention,
particularly DNA molecules, are useful as probes for gene mapping
and for identifying E. faecalis in a biological samples, for
instance, by Southern and Northern blot analysis. Polynucleotides
of the present invention are also useful in detecting E. faecalis
by PCR using primers for a particular E. faecalis polynucleotide.
Isolated polynucleotides of the present invention are also useful
in making the polypeptides of the present invention.
DETAILED DESCRIPTION
[0025] The present invention relates to recombinant E. faecalis
nucleic acids and fragments thereof. The present invention further
relates to recombinant E. faecalis polypeptides and fragments
thereof. The invention also relates to methods for using these
polypeptides to produce immunological responses and to confer
immunological protection to disease caused by members of the genus
Entejococcus, at least isolates of the E. faecalis genus. The
invention further relates to nucleic acid sequences which encode
antigenic E. faecalis polypeptides and to methods for detecting E.
faecalis nucleic acids and polypeptides in biological samples. The
invention also relates to antibodies specific for the polypeptides
and peptides of the present invention and methods for detecting
such antibodies produced in a host animal.
[0026] Definitions
[0027] The following definitions are provided to clarify the
subject matter which the inventors consider to be the present
invention.
[0028] As used herein, the phrase "pathogenic agent" means an agent
which causes a disease state or affliction in an animal. Included
within this definition, for examples, are bacteria, protozoans,
fungi, viruses and metazoan parasites which either produce a
disease state or render an animal infected with such an organism
susceptible to a disease state (e.g., a secondary infection).
Further included are species and strains of the genus Enterococcus
which produce disease states in animals.
[0029] As used herein, the term "organism" means any living
biological system, including viruses, regardless of whether it is a
pathogenic agent.
[0030] As used herein, the term "Enterococczs" means any species or
strain of bacteria which is members of the genus Enterococcus. Such
species and strains are known to those of skill in the art, and
include those that are pathogenic and those that are not.
[0031] As used herein, the phrase "one or more E. faecalis
polypeptides of the present invention" means polypeptides
comprising the amino acid sequence of one or more of the E.
faecalis polypeptides described in Table 1 (even SEQ ID NOs). These
polypeptides may be expressed as fusion proteins wherein the E.
faecalis polypeptides of the present invention are linked to
additional amino acid sequences which may be of Enterococcal or
non-Enterococcal origin. This phrase further includes polypeptide
comprising fragments of the E. faecalis polypeptides of the present
invention. Additional definitions are provided throughout the
specification.
[0032] Explanation of Table 1
[0033] Table 1, below, provides information describing genes which
encode polypeptides of E. faecalis. The table lists the gene
identifier which consists of the letters EF, which denote E.
faecalis, followed immediately by a three digit numeric code, which
arbitrarily number the E. faecalis genes of the present invention.
A number from 1 through 4 follows the three digit number. A number
1 represents the full length open reading frame of the gene
specified by the preceeding three digit number. A number 2
represents the full length potypeptide encoded by the gene
specified the preceeding three digit number. A number 3 represents
a polynucleotide fragment, of the gene represented by the
preceeding three digit number, used to produce an antigenic
polypeptide. A number 4 represents an antigenic polypeptide
fragment, of the gene represented by the preceeding three digit
number, used to stimulate an immune response or as a vaccine. The
nucleotide and amino acid sequences of each gene and fragment are
also shown in the Sequence Listing under the SEQ ID NO listed in
Table 1.
[0034] Explanation of Table 2
[0035] Table 2 lists accession numbers for the closest matching
sequences between the polypeptides of the present invention and
those available through GenBank and Derwent databases. These
reference numbers are the database entry numbers commonly used by
those of skill in the art, who will be familar with their
denominations. The descriptions of the numenclature for GenBank are
available from the National Center for Biotechnology Information.
Column 1 lists the gene or ORF of the present invention. Column 2
lists the accession number of a "match" gene sequence in GenBank or
Derwent databases. Column 3 lists the description of the "match"
gene sequence. Columns 4 and 5 are the high score and smallest sum
probability, respectively, calculated by BLAST. Polypeptides of the
present invention that do not share significant identity/similarity
with any polypeptide sequences of GenBank and Derwent are not
represented in Table 2. Polypeptides of the present invention that
share significant identity/similarity with more than one of the
polypeptides of GenBank and Derwent are represented more than
once.
[0036] Explanation of Table 3.
[0037] The E. faecalis polypeptides of the present invention may
include one or more conservative amino acid substitutions from
natural mutations or human manipulation as indicated in Table 3.
Changes are preferably of a minor nature, such as conservative
amino acid substitutions that do not significantly affect the
folding or activity of the protein. Residues from the following
groups, as indicated in Table 3, may be substituted for one
another: Aromatic, Hydrophobic, Polar, Basic, Acidic, and
Small,
[0038] Explanation of Table 4
[0039] Table 4 lists residues comprising antigenic epitopes of
antigenic epitope-bearing fragments present in each of the full
length E. faecalis polypeptides described in Table 1 as predicted
by the inventors using the algorithm of Jameson and Wolf, (1988)
Comp. Appl. Biosci. 4:181-186. The Jameson-Wolf antigenic analysis
was performed using the computer program PROTEAN (Version 3.11 for
the Power MacIntosh, DNASTAR, Inc., 1228 South Park Street Madison,
Wis.). E. faecalis polypeptide shown in Table 1 may one or more
antigenic epitopes comprising residues described in Table 4. It
will be appreciated that depending on the analytical criteria used
to predict antigenic determinants, the exact address of the
determinant may vary slightly. The residues and locations shown
described in Table 4 correspond to the amino acid sequences for
each full length gene sequence shown in Table 1 and in the Sequence
Listing. Polypeptides of the present invention that do not have
antigenic epitopes recognized by the Jameson-Wolf algorithm are not
represented in Table 2.
[0040] Selection of Nucleic Acid Sequences Encoding Antigenic E.
Faecalis Polypeptides
[0041] Sequenced E. faecalis genomic DNA was obtained from the E.
faecalis strain V586. The E. faecalis strain V586 was deposited May
2, 1997 at the ATCC. 10801 University Blvd. Manassas, Va20110-2209,
and given accession number 55969.
[0042] Some ORFs contained in the subset of fragments of the E.
faecalis genome disclosed herein were derived through the use of a
number of screening criteria detailed below. The ORFs are bounded
at the amino terminus by a methionine or valine residue and usually
at the carboxy terminus by a stop codon.
[0043] Most of the selected sequences consist of complete ORFs. The
polypeptides that do not comprise a complete ORF can be determined
by determining whether the corresponding polynucleotide sequence
comprises a stop codon after the codon for the last amino acid
residue in the polypeptide sequence. It is not always preferred to
express a complete ORF in a heterologous system. It may be
challenging to express and purify a highly hydrophobic protein by
common laboratory methods. Some of the polypeptide vaccine
candidates described herein have been modified slightly to simplify
the production of recombinant protein. For example, nucleotide
sequences which encode highly hydrophobic domains, such as those
found at the amino terminal signal sequence, have been excluded
from some constructs used for expression of the polypeptides.
Furthermore, any highly hydrophobic amino acid sequences occurring
at the carboxy terminus have also been excluded from the
recombinant expression constructs. Thus, in one embodiment, a
polypeptide which represents a truncated or modified ORF may be
used as an antigen.
[0044] While numerous methods are known in the art for selecting
potentially immunogenic polypeptides, many of the ORFs disclosed
herein were selected on the basis of screening Enterococcus
faecalis ORFs for several aspects of potential immunogenicity. One
set of selection criteria are as follows:
[0045] 1. Type I signal sequence: An amino terminal type I signal
sequence generally directs a nascent protein across the plasma and
outer membranes to the exterior of the bacterial cell. Experimental
evidence obtained from studies with Escherichia coli suggests that
the typical type I signal sequence consists of the following
biochemical and physical attributes (Izard, J. W. and Kendall, D.
A. Mol. Microbiol. 13:765-773 (1994)). The length of the type I
signal sequence is approximately 15 to 25 primarily hydrophobic
amino acid residues with a net positive charge in the extreme amino
terminus. In addition, the central region of the signal sequence
adopts an alpha-helical conformation in a hydrophobic environment.
Finally, the region surrounding the actual site of cleavage is
ideally six residues long, with small side-chain amino acids in the
-1 and -3 positions.
[0046] 2. Type IV signal sequence: The type IV signal sequence is
an example of the several types of functional signal sequences
which exist in addition to the type I signal sequence detailed
above. Although functionally related, the type IV signal sequence
possesses a unique set of biochemical and physical attributes
(Strom, M. S. and Lory, S., J Bacteriol. 174:7345-7351 (1992)).
These are typically six to eight amino acids with a net basic
charge followed by an additional sixteen to thirty primarily
hydrophobic residues. The cleavage site of a type IV signal
sequence is typically after the initial six to eight amino acids at
the extreme amino terminus. In addition, type IV signal sequences
generally contain a phenylalanine residue at the +1 site relative
to the cleavage site.
[0047] 3. Lipoprotein: Studies of the cleavage sites of twenty-six
bacterial lipoprotein precursors has allowed the definition of a
consensus amino acid sequence for lipoprotein cleavage. Nearly
three-fourths of the bacterial lipoprotein precursors examined
contained the sequence L--(A,S)--(G,A)--C at positions -3 to +1,
relative to the point of cleavage (Hayashi, S. and Wu, H. C., J
Bioenerg. Biomembr. 22:451-471 (1 990)).
[0048] 4. LPXTG motif: It has been experimentally determined that
most anchored proteins found on the surface of gram-positive
bacteria possess a highly conserved carboxy terminal sequence. More
than fifty such proteins from organisms such as S. pyogenes, S.
mutans, E. fa ecalis, S. pneumoniae, and others, have been
identified based on their extracellular location and carboxy
terminal amino acid sequence (Fischetti, V. A., ASM News 62:405-410
(1996)). The conserved region consists of six charged amino acids
at the extreme carboxy terminus coupled to 15-20 hydrophobic amino
acids presumed to function as a transmembrane domain. Immediately
adjacent to the transmembrane domain is a six amino acid sequence
conserved in nearly all proteins examined. The amino acid sequence
of this region is L-P-X-T-G-X, where X is any amino acid.
[0049] An algorithm for selecting antigenic and immunogenic
Enterococcis faecalis polypeptides including the foregoing criteria
was developed. The algorithm is similar to that described in U.S.
patent application Ser. No. 08/781,986, filed Jan. 3, 1997, which
is fully incorporated by reference herein. Use of the algorithm by
the inventors to select immunologically useful Enterococcus
faecalis polypeptides resulted in the selection of a number of the
disclosed ORFs. Polypeptides comprising the polypeptides identified
in this group may be produced by techniques standard in the art and
as further described herein.
[0050] Nucleic Acid Molecules
[0051] Sequenced E. faecalis genomic DNA was obtained from the E.
faecalis strain V586. As discussed elsewhere hererin,
polynucleotides of the present invention readily may be obtained by
routine application of well known and standard procedures for
cloning and sequencing DNA. Detailed methods for obtaining
libraries and for sequencing are provided below, for instance. A
wide variety of Enterococcus faecalis strains that can be used to
prepare E. faecalis genomic DNA for cloning and for obtaining
polynucleotides and polypeptides of the present invention. A wide
variety of Enterococcus faecalis strains are available to the
public from recognized depository institutions, such as the
American Type Culture Collection (ATCC). It is recognized that
minor variation is the nucleic acid and amino acid sequence may be
expected from E. faecalis strain to strain. The present invention
provides for genes, including both polynucleotides and
polypeptides, of the of the present invention from all the
Enterococcus faecalis strains.
[0052] Unless otherwise indicated, all nucleotide sequences
determined by sequencing a DNA molecule herein were determined
using an automated DNA sequencer (such as the Model 373 from
Applied Biosystems, Inc., Foster City, Calif.), and all amino acid
sequences of polypeptides encoded by DNA molecules determined
herein were predicted by translation of a DNA sequence determined
as above. Therefore, as is known in the art for any DNA sequence
determined by this automated approach, any nucleotide sequence
determined herein may contain some errors. Nucleotide sequences
determined by automation are typically at least about 90%
identical, more typically at least about 95% to at least about
99.9% identical to the actual nucleotide sequence of the sequenced
DNA molecule. The actual sequence can be more precisely determined
by other approaches including manual DNA sequencing methods well
known in the art. As is also known in the art, a single insertion
or deletion in a determined nucleotide sequence compared to the
actual sequence will cause a frame shift in translation of the
nucleotide sequence such that the predicted amino acid sequence
encoded by a determined nucleotide sequence will be completely
different from the amino acid sequence actually encoded by the
sequenced DNA molecule, beginning at the point of such an insertion
or deletion. In case of conflict between Table 1 and either the
nucleic acid sequence of the clones listed in Table 1 or the amino
acid sequence of the protein expressed by the clones listed in
Table 1, the clones listed in Table 1 are controlling. By
"nucleotide sequence" of a nucleic acid molecule or polynucleotide
is intended to mean either a DNA or RNA sequence.Using the
information provided herein, such as the nucleotide sequence in
Table 1, a nucleic acid molecule of the present invention encoding
a E. faecalis polypeptide may be obtained using standard cloning
and screening procedures, such as those for cloning DNAs using
genomic DNA as starting material. See, e.g., Sambrook et al.
MOLECULAR CLONING: A LABORATORY MANUAL (Cold Spring Harbor, N.Y.
2nd ed. 1989); Ausubel et al., CURRENT PROTOCALS IN MOLECULAR
BIOLOGY (John Wiley and Sons, N.Y. 1989). Illustrative of the
invention, the nucleic acid molecule described in Table 1 was
discovered in a DNA library derived from a E. faecalis genomic
DNA.
[0053] Nucleic acid molecules of the present invention may be in
the form of RNA, such as mRNA, or in the form of DNA, including,
for instance, DNA and genomic DNA obtained by cloning or produced
synthetically. The DNA may be double-stranded or single-stranded.
Single-stranded DNA or RNA may be the coding strand, also known as
the sense strand, or it may be the non-coding strand, also referred
to as the anti-sense strand.
[0054] By "isolated" nucleic acid molecule(s) is intended a nucleic
acid molecule. DNA or RNA, which has been removed from its native
environment. This includes segments of DNA comprising the E.
faecalis polynucleotides of the present invention isolated from the
native chromosome. These fragments include both isolated fragments
consisting only of E. faecalis DNA and fragments comprising
heterologous sequences such as vector sequences or other foreign
DNA. For example, recombinant DNA molecules contained in a vector
are considered isolated for the purposes of the present invention.
Further examples of isolated DNA molecules include recombinant DNA
molecules maintained in heterologous host cells or purified
(partially or substantially) DNA molecules in solution. Isolated
RNA molecules include in vivo or in vitro RNA transcripts of the
DNA molecules of the present invention. Isolated nucleic acid
molecules according to the present invention further include such
molecules produced synthetically.
[0055] In addition, isolated nucleic acid molecules of the
invention include DNA molecules which comprise a sequence
substantially different from those described above but which, due
to the degeneracy of the genetic code, still encode a E. faecalis
polypeptides and peptides of the present invention (e.g.
polypeptides of Table 1). That is, all possible DNA sequences that
encode the E. faecalis polypeptides of the present invention. This
includes the genetic code and species-specific codon preferences
known in the art. Thus, it would be routine for one skilled in the
art to generate the degenerate variants described above, for
instance, to optimize codon expression for a particular host (e.g.,
change codons in the bacteria mRNA to those preferred by a
mammalian or other bacterial host such as E. coli).
[0056] The invention further provides isolated nucleic acid
molecules having the nucleotide sequence shown in Table 1 or a
nucleic acid molecule having a sequence complementary to one of the
above sequences. Such isolated molecules, particularly DNA
molecules, are useful as probes for gene mapping and for
identifying E. faecalis in a biological sample, for instance, by
PCR, Southern blot, Northern blot, or other form of hybridization
analysis.
[0057] The present invention is further directed to nucleic acid
molecules encoding portions or fragments of the nucleotide
sequences described herein. Fragments include portions of the
nucleotide sequences of Table 1, or the E. faecalis nucleotide
sequences contained in the plasimd clones listed in Table 1, at
least 10 contiguous nucleotides in length selected from any two
integers, one of which representing a 5' nucleotide position and a
second of which representing a 3' nucleotide position, where the
first nucleotide for each nucleotide sequence in Table 1 is
position 1. That is, every combination of a 5' and 3' nucleotide
position that a fragment at least 10 contiguous nucleotides in
length could occupy is included in the invention. At least means a
fragment may be 10 contiguous nucleotide bases in length or any
integer between 10 and the length of an entire nucleotide sequence
of Table 1 minus 1. Therefore, included in the invention are
contiguous fragments specified by any 5' and 3' nucleotide base
positions of a nucleotide sequences of Table 1 wherein the
contiguous fragment is any integer between 10 and the length of an
entire nucleotide sequence minus 1.
[0058] Further, the invention includes polynucleotides comprising
fragments specified by size, in nucleotides, rather than by
nucleotide positions. The invention includes any fragment size, in
contiguous nucleotides, selected from integers between 10 and the
length of an entire nucleotide sequence minus 1. Preferred sizes of
contiguous nucleotide fragments include 20 nucleotides, 30
nucleotides, 40 nucleotides, 50 nucleotides. Other preferred sizes
of contiguous nucleotide fragments, which may be useful as
diagnostic probes and primers, include fragments 50-300 nucleotides
in length which include, as discussed above, fragment sizes
representing each integer between 50-300. Larger fragments are also
useful according to the present invention corresponding to most, if
not all, of the nucleotide sequences shown in Table 1 or of the E.
faecalis nucleotide sequences of the plasimd clones listed in Table
1. The preferred sizes are, of course, meant to exemplify not limit
the present invention as all size fragments, representing any
integer between 10 and the length of an entire nucleotide sequence
minus 1, are included in the invention. Additional preferred
nucleic acid fragments of the present invention include nucleic
acid molecules encoding epitope-bearing portions of E. faecalis
polypeptides identified in Table 4.
[0059] The present invention also provides for the exclusion of any
fragment, specified by 5' and 3' base positions or by size in
nucleotide bases as described above for any nucleotide sequence of
Table 1 or the plasmid clones listed in Table 1. Any number of
fragments of nucleotide sequences in Table 1 or the plasimd clones
listed in Table 1, specified by 5' and 3' base positions or by size
in nucleotides, as described above, may be excluded from the
present invention.
[0060] In another aspect, the invention provides an isolated
nucleic acid molecule comprising a polynucleotide which hybridizes
under stringent hybridization conditions to a portion of a
polynucleotide in a nucleic acid molecules of the invention
described above, for instance, nucleotide sequences of Table 1 or
the E. faecalis sequences of the plasimd clones listed in Table 1.
By "stringent hybridization conditions" is intended overnight
incubation at 42.degree. C. in a solution comprising: 50%
formamide, 5.times. SSC (150 mM NaCl, 15 mM trisodium citrate), 50
mM sodium phosphate (pH 7.6), 5.times. Denhardt's solution, 10%
dextran sulfate, and 20 .mu.g/ml denatured, sheared salmon sperm
DNA, followed by washing the filters in 0.1.times. SSC at about
65.degree. C.
[0061] By a polynucleotide which hybridizes to a "portion" of a
polynucleotide is intended a polynucleotide (either DNA or RNA)
hybridizing to at least about 15 nucleotides bases, and more
preferably at least about 20 nucleotides bases, still more
preferably at least about 30 nucleotides bases, and even more
preferably about 30-70 (e.g., 50) nucleotides bases of the
reference polynucleotide. These are useful as diagnostic probes and
primers as discussed above. By a portion of a polynucleotide of "at
least 20 nucleotides bases in length," for example, is intended 20
or more contiguous nucleotides bases nucleotides from the
nucleotide sequence of the reference polynucleotide (e.g., the
nucleotide sequence as shown in Table 1). Portions of a
polynucleotide which hybridizes to a nucleotide sequence in Table
1, which can be used as probes and primers, may also be precisely
specified by 5' and 3' base positions or by size in nucleotide
bases as described above or precisely excluded in the same
manner.
[0062] The nucleic acid molecules of the present invention include
those encoding the full length E. faecalis polypeptides of Table 1
and portions of the E. faecalis polypeptides of Table 1. Also
included in the present invention are nucleic acids encoding the
above full length sequences and further comprise additional
sequences, such as those encoding an added secretory leader
sequence such as a pre-, or pro- or prepro- protein sequence.
Further included in the present invention are nucleic acids
encoding the above full length sequences and portions thereof and
further comprise additional heterologous amino acid sequences
encoded by nucleic acid sequences from a different source.
[0063] Also included in the present invention are nucleic acids
encoding the above protein sequences together with additional,
non-coding sequences, including for example, but not limited to
non-coding 5' and 3' sequences. These sequences include
transcribed, non-translated sequences that may play a role in
transcription, and mRNA processing, for example, ribosome binding
and stability of mRNA. Also included in the present invention are
additional coding sequences which provide additional
functionalities.
[0064] Thus, a nucleotide sequence encoding a polypeptide may be
fused to a marker sequence, such as a sequence encoding a peptide
which facilitates purification of the fused polypeptide. In certain
preferred embodiments of this aspect of the invention, the marker
amino acid sequence is a hexa-histidine peptide, such as the tag
provided in a pQE vector (QIAGEN, Inc., 9259 Eton Avenue,
Chatsworth, Calif., 91311), among others, many of which are
commercially available. For instance, hexa-histidine provides for
convenient purification of the fusion protein. See Gentz et al.
(1989) Proc. Natl. Acad. Sci. 86:821-24. The "HA" tag is another
peptide useful for purification which corresponds to an epitope
derived from the influenza hemagglutinin protein. See Wilson et al.
(1 984) Cell 37:767. As discussed below, other such fusion proteins
include the E. faecalis polypeptides of the present invention fused
to Fc at the N- or C-terminus.
[0065] Variant and Mutant Polynucleotides
[0066] The present invention further relates to variants of the
nucleic acid molecules which encode portions, analogs or
derivatives of a E. faecalis polypeptides of Table 1 and variant
polypeptides thereof including portions, analogs, and derivatives
of the E. faecalis polypeptides. Variants may occur naturally, such
as a natural allelic variant.
[0067] By an "allelic variant" is intended one of several alternate
forms of a gene occupying a given locus on a chromosome of an
organism. See, e.g., B. Lewin, Genes IV (1990). Non-naturally
occurring variants may be produced using art-known mutagenesis
techniques.
[0068] Such nucleic acid variants include those produced by
nucleotide substitutions, deletions, or additions. The
substitutions, deletions, or additions may involve one or more
nucleotides. The variants may be altered in coding regions,
non-coding regions, or both. Alterations in the coding regions may
produce conservative or non-conservative amino acid substitutions,
deletions or additions. Especially preferred among these are silent
substitutions, additions and deletions, which do not alter the
properties and activities of a E. faecalis protein of the present
invention or portions thereof. Also especially preferred in this
regard are conservative substitutions.
[0069] Such polypeptide variants include those produced by amino
acid substitutions, deletions or additions. The substitutions,
deletions, or additions may involve one or more residues.
Alterations may produce conservative or non-conservative amino acid
substitutions, deletions, or additions. Especially preferred among
these are silent substitutions, additions and deletions, which do
not alter the properties and activities of a E. faecalis protein of
the present invention or portions thereof. Also especially
preferred in this regard are conservative substitutions.
[0070] The present invention also relates to recombinant vectors,
which include the isolated nucleic acid molecules of the present
invention, and to host cells containing the recombinant vectors, as
well as to methods of making such vectors and host cells and for
using them for production of E. faecalis polypeptides or peptides
by recombinant techniques.
[0071] The present application is directed to nucleic acid
molecules at least 90%, 95%, 96%, 97%, 98% or 99% identical to a
nucleic acid sequence shown in Table 1. The above nucleic acid
sequences are included irrespective of whether they encode a
polypeptide having E. faecalis activity. This is because even where
a particular nucleic acid molecule does not encode a polypeptide
having E. faecalis activity, one of skill in the art would still
know how to use the nucleic acid molecule, for instance, as a
hybridization probe. Uses of the nucleic acid molecules of the
present invention that do not encode a polypeptide having E.
faecalis activity include, inter alia, isolating an E. faecalis
gene or allelic variants thereof from a DNA library, and detecting
E. faecalis mRNA expression samples, environmental samples,
suspected of containing E. faecalis by Northern Blot analysis.
[0072] Preferred, are nucleic acid molecules having sequences at
least 90%, 95%, 96%, 97%, 98% or 99% identical to the nucleic acid
sequence shown in Table 1, which do, in fact, encode a polypeptide
having E. faecalis protein activity By "a polypeptide having E.
faecalis activity" is intended polypeptides exhibiting activity
similar, but not necessarily identical, to an activity of the E.
faecalis protein of the invention, as measured in a particular
biological assay suitable for measuring activity of the specified
protein.
[0073] Due to the degeneracy of the genetic code, one of ordinary
skill in the art will immediately recognize that a large number of
the nucleic acid molecules having a sequence at least 90%, 95%,
96%, 97%, 98%, or 99% identical to the nucleic acid sequences shown
in Table 1 will encode a polypeptide having E. faecalis protein
activity. In fact, since degenerate variants of these nucleotide
sequences all encode the same polypeptide, this will be clear to
the skilled artisan even without performing the above described
comparison assay. It will be further recognized in the art that,
for such nucleic acid molecules that are not degenerate variants, a
reasonable number will also encode a polypeptide having E. faecalis
protein activity. This is because the skilled artisan is fully
aware of amino acid substitutions that are either less likely or
not likely to significantly effect protein function (e.g.,
replacing one aliphatic amino acid with a second aliphatic amino
acid), as further described below.
[0074] The biological activity or function of the polypeptides of
the present invention are expected to be similar or identical to
polypeptides from other bacteria that share a high degree of
structural identity/similarity. Tables 2 lists accession numbers
and descriptions for the closest matching sequences of polypeptides
available through Genbank and Derwent databases. It is therefore
expected that the biological activity or function of the
polypeptides of the present invention will be similar or identical
to those polypeptides from other bacterial genuses, species, or
strains listed in Table 2.
[0075] By a polynucleotide having a nucleotide sequence at least,
for example, 95% "identical" to a reference nucleotide sequence of
the present invention, it is intended that the nucleotide sequence
of the polynucleotide is identical to the reference sequence except
that the polynucleotide sequence may include up to five point
mutations per each 100 nucleotides of the reference nucleotide
sequence encoding the E. faecalis polypeptide. In other words, to
obtain a polynucleotide having a nucleotide sequence at least 95%
identical to a reference nucleotide sequence, up to 5% of the
nucleotides in the reference sequence may be deleted, inserted, or
substituted with another nucleotide. The query sequence may be an
entire sequence shown in Table 1, the ORF (open reading frame), or
any fragment specified as described herein.
[0076] As a practical matter, whether any particular nucleic acid
molecule or polypeptide is at least 90%, 95%. 96%, 97%, 98% or 99%
identical to a nucleotide sequence of the presence invention can be
determined conventionally using known computer programs. A
preferred method for determining the best overall match between a
query sequence (a sequence of the present invention) and a subject
sequence, also referred to as a global sequence alignment, can be
determined using the FASTDB computer program based on the algorithm
of Brutlag et al. See Brutlag et al. (1990) Comp. App. Biosci.
6:237-245. In a sequence alignment the query and subject sequences
are both DNA sequences. An RNA sequence can be compared by first
converting U's to T's. The result of said global sequence alignment
is in percent identity. Preferred parameters used in a FASTDB
alignment of DNA sequences to calculate percent identity are:
Matrix=Unitary, k-tuple=4, Mismatch Penalty=1, Joining Penalty=30.
Randomization Group Length=0, Cutoff Score=1, Gap Penalty=5, Gap
Size Penalty 0.05, Window Size=500 or the lenght of the subject
nucleotide sequence, whichever is shorter.
[0077] If the subject sequence is shorter than the query sequence
because of 5' or 3' deletions, not because of internal deletions, a
manual correction must be made to the results. This is because the
FASTDB program does not account for 5' and 3' truncations of the
subject sequence when calculating percent identity. For subject
sequences truncated at the 5' or 3' ends, relative to the query
sequence, the percent identity is corrected by calculating the
number of bases of the query sequence that are 5' and 3' of the
subject sequence, which are not matched/aligned, as a percent of
the total bases of the query sequence. Whether a nucleotide is
matched/aligned is determined by results of the FASTDB sequence
alignment. This percentage is then subtracted from the percent
identity, calculated by the above FASTDB program using the
specified parameters, to arrive at a final percent identity score.
This corrected score is what is used for the purposes of the
present invention. Only nucleotides outside the 5' and 3'
nucleotides of the subject sequence, as displayed by the FASTDB
alignment, which are not matched/aligned with the query sequence,
are calculated for the purposes of manually adjusting the percent
identity score.
[0078] For example, a 90 nucleotide subject sequence is aligned to
a 100 nucleotide query sequence to determine percent identity. The
deletions occur at the 5' end of the subject sequence and
therefore, the FASTDB alignment does not show a matched/alignment
of the first 10 nucleotides at 5' end. The 10 unpaired nucleotides
represent 10% of the sequence (number of nucleotides at the 5' and
3' ends not matched/total number of nucleotides in the query
sequence) so 10% is subtracted from the percent identity score
calculated by the FASTDB program. If the remaining 90 nucleotides
were perfectly matched the final percent identity would be 90%. In
another example, a 90 nucleotide subject sequence is compared with
a 100 nucleotide query sequence. This time the deletions are
internal deletions so that there are no nucleotides on the 5' or 3'
of the subject sequence which are not matched/aligned with the
query. In this case the percent identity calculated by FASTDB is
not manually corrected. Once again, only nucleotides 5' and 3' of
the subject sequence which are not matched/aligned with the query
sequence are manually corrected for. No other manual corrections
are to made for the purposes of the present invention.
[0079] Vectors and Host Cell
[0080] The present invention also relates to vectors which include
the isolated DNA molecules of the present invention, host cells
comprising the recombinant vectors, and the production of E.
faecalis polypeptides and peptides of the present invention
expressed by the host cells.
[0081] Recombinant constructs may be introduced into host cells
using well known techniques such as infection, transduction,
transfection, transvection, electroporation and transformation. The
vector may be, for example, a phage, plasmid, viral or retroviral
vector. Retroviral vectors may be replication competent or
replication defective. In the latter case, viral propagation
generally will occur only in complementing host cells.
[0082] The polynucleotides may be joined to a vector containing a
selectable marker for propagation in a host. Generally, a plasmid
vector is introduced in a precipitate, such as a calcium phosphate
precipitate, or in a complex with a charged lipid. If the vector is
a virus, it may be packaged in vitro using an appropriate packaging
cell line and then transduced into host cells.
[0083] Preferred are vectors comprising cis-acting control regions
to the polynucleotide of interest. Appropriate trans-acting factors
may be supplied by the host, supplied by a complementing vector or
supplied by the vector itself upon introduction into the host.
[0084] In certain preferred embodiments in this regard, the vectors
provide for specific expression, which may be inducible and/or cell
type-specific. Particularly preferred among such vectors are those
inducible by environmental factors that are easy to manipulate,
such as temperature and nutrient additives.
[0085] Expression vectors useful in the present invention include
chromosomal-, episomal- and virus-derived vectors, e.g., vectors
derived from bacterial plasmids, bacteriophage, yeast episomes,
yeast chromosomal elements, viruses such as baculoviruses, papova
viruses, vaccinia viruses, adenoviruses, fowl pox viruses,
pseudorabies viruses and retroviruses, and vectors derived from
combinations thereof, such as cosmids and phagemids.
[0086] The DNA insert should be operatively linked to an
appropriate promoter, such as the phage lambda PL promoter, the E.
coli lac, Irp and tac promoters, the SV40 early and late promoters
and promoters of retroviral LTRs, to name a few. Other suitable
promoters will be known to the skilled artisan. The expression
constructs will further contain sites for transcription initiation,
termination and, in the transcribed region, a ribosome binding site
for translation. The coding portion of the mature transcripts
expressed by the constructs will preferably include a translation
initiating site at the beginning and a termination codon (UAA, UGA
or UAG) appropriately positioned at the end of the polypeptide to
be translated.
[0087] As indicated, the expression vectors will preferably include
at least one selectable marker. Such markers include dihydrofolate
reductase or neomycin resistance for eukaryotic cell culture and
tetracycline, kanamycin, or ampicillin resistance genes for
culturing in E. coli and other bacteria. Representative examples of
appropriate hosts include, but are not limited to, bacterial cells,
such as E. coli, Streptomyces and Salmonella typhimurium cells;
fungal cells, such as yeast cells; insect cells such as Drosophila
S2 and Spodoptera Sf9 cells; animal cells such as CHO, COS and
Bowes melanoma cells; and plant cells. Appropriate culture mediums
and conditions for the above-described host cells are known in the
art.
[0088] Among vectors preferred for use in bacteria include pQE70,
pQE60 and pQE9, pQE10 available from Qiagen; pBS vectors,
Phagescript vectors, Bluescript vectors, pNH8A, pNH16a, pNH18A,
pNH46A available from Stratagene; pET series of vectors available
from Novagen; and ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5
available from Pharmacia. Among preferred eukaryotic vectors are
pWLNEO, pSV2CAT, pOG44, pXT1 and pSG available from Stratagene; and
pSVK3, pBPV, pMSG and pSVL available from Pharmacia. Other suitable
vectors will be readily apparent to the skilled artisan.
[0089] Among known bacterial promoters suitable for use in the
present invention include the E. coli lacl and lacZ promoters, the
T3, T5 and T7 promoters, the gpt promoter, the lambda PR and PL
promoters and the trp promoter. Suitable eukaryotic promoters
include the CMV immediate early promoter, the HSV thymidine kinase
promoter, the early and late SV40 promoters, the promoters of
retroviral LTRs, such as those of the Rous sarcoma virus (RSV), and
metallothionein promoters, such as the mouse metallothionein-I
promoter.
[0090] Introduction of the construct into the host cell can be
effected by calcium phosphate transfection, DEAE-dextran mediated
transfection, cationic lipid-mediated transfection,
electroporation, transduction, infection or other methods. Such
methods are described in many standard laboratory manuals (for
example, Davis, et al., Basic
[0091] Methods In Molecular Biology (1986)).
[0092] Transcription of DNA encoding the polypeptides of the
present invention by higher eukaryotes may be increased by
inserting an enhancer sequence into the vector. Enhancers are
cis-acting elements of DNA, usually about from 10 to 300
nucleotides that act to increase transcriptional activity of a
promoter in a given host cell-type. Examples of enhancers include
the SV40 enhancer, which is located on the late side of the
replication origin at nucleotides 100 to 270, the cytomegalovirus
early promoter enhancer, the polyoma enhancer on the late side of
the replication origin, and adenovirus enhancers.
[0093] For secretion of the translated polypeptide into the lumen
of the endoplasmic reticulum, into the periplasmic space or into
the extracellular environment, appropriate secretion signals may be
incorporated into the expressed polypeptide, for example, the amino
acid sequence KDEL. The signals may be endogenous to the
polypeptide or they may be heterologous signals.
[0094] The polypeptide may be expressed in a modified form, such as
a fusion protein, and may include not only secretion signals, but
also additional heterologous functional regions. For instance, a
region of additional amino acids, particularly charged amino acids,
may be added to the N-terminus of the polypeptide to improve
stability and persistence in the host cell, during purification, or
during subsequent handling and storage. Also, peptide moieties may
be added to the polypeptide to facilitate purification. Such
regions may be removed prior to final preparation of the
polypeptide. The addition of peptide moieties to polypeptides to
engender secretion or excretion, to improve stability and to
facilitate purification, among others, are familiar and routine
techniques in the art. A preferred fusion protein comprises a
heterologous region from immunoglobulin that is useful to
solubilize proteins. For example, EP-A-O 464 533 (Canadian
counterpart 2045869) discloses fusion proteins comprising various
portions of constant region of immunoglobulin molecules together
with another human protein or part thereof. In many cases, the Fc
part in a fusion protein is thoroughly advantageous for use in
therapy and diagnosis and thus results, for example, in improved
pharmacokinetic properties (EP-A 0232 262). On the other hand, for
some uses it would be desirable to be able to delete the Fc part
after the fusion protein has been expressed, detected and purified
in the advantageous manner described. This is the case when Fc
portion proves to be a hindrance to use in therapy and diagnosis,
for example when the fusion protein is to be used as antigen for
immunizations. In drug discovery, for example, human proteins, such
as, hIL5-receptor has been fused with Fc portions for the purpose
of high-throughput screening assays to identify antagonists of
hIL-5. See Bennett, D. et al. (1995) J. Molec. Recogn. 8:52-58 and
Johanson, K. et al. (1995) J. Biol. Chem. 270 (16):9459-9471.
[0095] The E. faecalis polypeptides can be recovered and purified
from recombinant cell cultures by well-known methods including
ammonium sulfate or ethanol precipitation, acid extraction, anion
or cation exchange chromatography, phosphocellulose chromatography,
hydrophobic interaction chromatography, affinity chromatography,
hydroxylapatite chromatography, lectin chromatography and high
performance liquid chromatography ("HPLC") is employed for
purification. Polypeptides of the present invention include
naturally purified products, products of chemical synthetic
procedures, and products produced by recombinant techniques from a
prokaryotic or eukaryotic host. including, for example, bacterial,
yeast, higher plant, insect and mammalian cells.
[0096] Polypeptides and Fragments
[0097] The invention further provides an isolated E. faecalis
polypeptide having an amino acid sequence in Table 1, or a peptide
or polypeptide comprising a portion of the above polypeptides.
[0098] Variant and Mutant Polypeptides
[0099] To improve or alter the characteristics of E. faecalis
polypeptides of the present invention, protein engineering may be
employed. Recombinant DNA technology known to those skilled in the
art can be used to create novel mutant proteins or muteins
including single or multiple amino acid substitutions, deletions,
additions, or fusion proteins. Such modified polypeptides can show,
e.g., enhanced activity or increased stability. In addition, they
may be purified in higher yields and show better solubility than
the corresponding natural polypeptide, at least under certain
purification and storage conditions.
[0100] N-Terminal and C-Terminal Deletion Mutiants
[0101] It is known in the art that one or more amino acids may be
deleted from the N-terminus or C-terminus without substantial loss
of biological function. For instance, Ron et al. J. Biol. Chem.,
268:2984-2988 (1993), reported modified KGF proteins that had
heparin binding activity even if 3, 8, or 27 N-terminal amino acid
residues were missing. Accordingly, the present invention provides
polypeptides having one or more residues deleted from the amino
terminus of the amino acid sequence of the E. faecalis polypeptides
shown in Table 1, and polynucleotides encoding such
polypeptides.
[0102] Similarly, many examples of biologically functional
C-terminal deletion muteins are known. For instance, Interferon
gamma shows up to ten times higher activities by deleting 8-10
amino acid residues from the carboxy terminus of the protein See,
e.g., Dobeli, et al. (1988) J. Biotechnology 7:199-216.
Accordingly, the present invention provides polypeptides having one
or more residues from the carboxy terminus of the amino acid
sequence of the E. faecalis polypeptides shown in Table 1. The
invention also provides polypeptides having one or more amino acids
deleted from both the amino and the carboxyl termini as described
below.
[0103] The present invention is further directed to polynucleotide
encoding portions or fragments of the amino acid sequences
described herein as well as to portions or fragments of the
isolated amino acid sequences described herein. Fragments include
portions of the amino acid sequences of Table 1, are at least 5
contiguous amino acid in length, are selected from any two
integers, one of which representing a N-terminal position. The
initiation codon of the polypeptides of the present inventions
position 1. Every combination of a N-terminal and C-terminal
position that a fragment at least 5 contiguous amino acid residues
in length could occupy, on any given amino acid sequence of Table 1
is included in the invention. At least means a fragment may be 5
contiguous amino acid residues in length or any integer between 5
and the number of residues in a full length amino acid sequence
minus 1. Therefore, included in the invention are contiguous
fragments specified by any N-terminal and C-terminal positions of
amino acid sequence set forth in Table 1 wherein the contiguous
fragment is any integer between 5 and the number of residues in a
full length sequence minus 1.
[0104] Further, the invention includes polypeptides comprising
fragments specified by size, in amino acid residues, rather than by
N-terminal and C-terminal positions. The invention includes any
fragment size, in contiguous amino acid residues, selected from
integers between 5 and the number of residues in a full length
sequence minus 1. Preferred sizes of contiguous polypeptide
fragments include about 5 amino acid residues, about 10 amino acid
residues, about 20 amino acid residues, about 30 amino acid
residues, about 40 amino acid residues, about 50 amino acid
residues, about 100 amino acid residues, about 200 amino acid
residues, about 300 amino acid residues, and about 400 amino acid
residues. The preferred sizes are, of course, meant to exemplify,
not limit, the present invention as all size fragments representing
any integer between 5 and the number of residues in a full length
sequence minus 1 are included in the invention. The present
invention also provides for the exclusion of any fragments
specified by N-terminal and C-terminal positions or by size in
amino acid residues as described above. Any number of fragments
specified by N-terminal and C-terminal positions or by size in
amino acid residues as described above may be excluded.
[0105] The above fragments need not be active since they would be
useful, for example, in immunoassays, in epitope mapping, epitope
tagging, to generate antibodies to a particular portion of the
protein, as vaccines, and as molecular weight markers.
[0106] Other Mutants
[0107] In addition to N- and C-terminal deletion forms of the
protein discussed above, it also will be recognized by one of
ordinary skill in the art that some amino acid sequences of the E.
faecalis polypeptide can be varied without significant effect of
the structure or function of the protein. If such differences in
sequence are contemplated, it should be remembered that there will
be critical areas on the protein which determine activity.
[0108] Thus, the invention further includes variations of the E.
faecalis polypeptides which show substantial E. faecalis
polypeptide activity or which include regions of E. faecalis
protein such as the protein portions discussed below. Such mutants
include deletions, insertions, inversions, repeats, and type
substitutions selected according to general rules known in the art
so as to have little effect on activity. For example, guidance
concerning how to make phenotypically silent amino acid
substitutions is provided. There are two main approaches for
studying the tolerance of an amino acid sequence to change. See,
Bowie, J. U. et al. (1990), Science 247:1306-1310. The first method
relies on the process of evolution, in which mutations are either
accepted or rejected by natural selection. The second approach uses
genetic engineering to introduce amino acid changes at specific
positions of a cloned gene and selections or screens to identify
sequences that maintain functionality.
[0109] These studies have revealed that proteins are surprisingly
tolerant of amino acid substitutions. The studies indicate which
amino acid changes are likely to be permissive at a certain
position of the protein. For example, most buried amino acid
residues require nonpolar side chains, whereas few features of
surface side chains are generally conserved. Other such
phenotypically silent substitutions are described by Bowie et al.
(supra) and the references cited therein. Typically seen as
conservative substitutions are the replacements, one for another,
among the aliphatic amino acids Ala, Val, Leu and Ile; interchange
of the hydroxyl residues Ser and Thr, exchange of the acidic
residues Asp and Glu, substitution between the amide residues Asn
and Gln, exchange of the basic residues Lys and Arg and
replacements among the aromatic residues Phe, Tyr.
[0110] Thus, the fragment, derivative, analog, or homolog of the
polypeptide of Table 1, or that encoded by the plaimds listed in
Table 1, may be: (i) one in which one or more of the amino acid
residues are substituted with a conserved or non-conserved amino
acid residue (preferably a conserved amino acid residue) and such
substituted amino acid residue may or may not be one encoded by the
genetic code: or (ii) one in which one or more of the amino acid
residues includes a substituent group: or (iii) one in which the E.
faecalis polypeptide is fused with another compound, such as a
compound to increase the half-life of the polypeptide (for example,
polyethylene glycol): or (iv) one in which the additional amino
acids are fused to the above form of the polypeptide, such as an
IgG Fc fusion region peptide or leader or secretory sequence or a
sequence which is employed for purification of the above form of
the polypeptide or a proprotein sequence. Such fragments,
derivatives and analogs are deemed to be within the scope of those
skilled in the art from the teachings herein.
[0111] Thus, the E. faecalis polypeptides of the present invention
may include one or more amino acid substitutions, deletions, or
additions, either from natural mutations or human manipulation. As
indicated, changes are preferably of a minor nature, such as
conservative amino acid substitutions that do not significantly
affect the folding or activity of the protein (see Table 3).
[0112] Amino acids in the E. faecalis proteins of the present
invention that are essential for function can be identified by
methods known in the art, such as site-directed mutagenesis or
alanine-scanning mutagenesis. See, e.g., Cunningham et al. (1989)
Science 244:1081-1085. The latter procedure introduces single
alanine mutations at every residue in the molecule. The resulting
mutant molecules are then tested for biological activity using
assays appropriate for measuring the function of the particular
protein.
[0113] Of special interest are substitutions of charged amino acids
with other charged or neutral amino acids which may produce
proteins with highly desirable improved characteristics, such as
less aggregation. Aggregation may not only reduce activity but also
be problematic when preparing pharmaceutical formulations, because
aggregates can be immunogenic. See, e.g., Pinckard et al., (1967)
Clin. Exp. Immunol. 2:331-340; Robbins, et al., (1987) Diabetes
36:838-845; Cleland, et al., (1993) Crit. Rev. Therapeutic Drug
Carrier Systems 10:307-377.
[0114] The polypeptides of the present invention are preferably
provided in an isolated form, and preferably are substantially
purified. A recombinantly produced version of the E. faecalis
polypeptide can be substantially purified by the one-step method
described by Smith et al. (1988) Gene 67:31-40. Polypeptides of the
invention also can be purified from natural or recombinant sources
using antibodies directed against the polypeptides of the invention
in methods which are well known in the art of protein
purification.
[0115] The invention further provides for isolated E. faecalis
polypeptides comprising an amino acid sequence selected from the
group consisting of: (a) the amino acid sequence of a full-length
E. faecalis polypeptide having the complete amino acid sequence
shown in Table 1; (b) the amino acid sequence of a full-length E.
faecalis polypeptide having the complete amino acid sequence shown
in Table 1 excepting the N-terminal methionine; (c) the complete
amino acid sequence encoded by the plaimds listed in Table 1; and
(d) the complete amino acid sequence excepting the N-terminal
methionine encoded by the plaimds listed in Table 1. The
polypeptides of the present invention also include polypeptides
having an amino acid sequence at least 80% identical, more
preferably at least 90% identical, and still more preferably 95%,
96%, 97%, 98% or 99% identical to those described in (a), (b), (c),
and (d) above.
[0116] Further polypeptides of the present invention include
polypeptides which have at least 90% similarity, more preferably at
least 95% similarity, and still more preferably at least 96%, 97%,
98% or 99% similarity to those described above.
[0117] A further embodiment of the invention relates to a
polypeptide which comprises the amino acid sequence of a E.
faecalis polypeptide having an amino acid sequence which contains
at least one conservative amino acid substitution, but not more
than 50 conservative amino acid substitutions, not more than 40
conservative amino acid substitutions, not more than 30
conservative amino acid substitutions, and not more than 20
conservative amino acid substitutions. Also provided are
polypeptides which comprise the amino acid sequence of a E.
faecalis polypeptide, having at least one, but not more than 10, 9,
8, 7, 6, 5, 4, 3, 2 or 1 conservative amino acid substitutions.
[0118] By a polypeptide having an amino acid sequence at least, for
example, 95% "identical" to a query amino acid sequence of the
present invention, it is intended that the amino acid sequence of
the subject polypeptide is identical to the query sequence except
that the subject polypeptide sequence may include up to five amino
acid alterations per each 100 amino acids of the query amino acid
sequence. In other words, to obtain a polypeptide having an amino
acid sequence at least 95% identical to a query amino acid
sequence, up to 5% of the amino acid residues in the subject
sequence may be inserted, deleted, (indels) or substituted with
another amino acid. These alterations of the reference sequence may
occur at the amino or carboxy terminal positions of the reference
amino acid sequence or anywhere between those terminal positions,
interspersed either individually among residues in the reference
sequence or in one or more contiguous groups within the reference
sequence.
[0119] As a practical matter, whether any particular polypeptide is
at least 90%, 95%, 96%, 97%, 98% or 99% identical to, for instance,
the amino acid sequences shown in Table 1 or to the amino acid
sequence encoded by the plaimds listed in Table 1 can be determined
conventionally using known computer programs. A preferred method
for determining the best overall match between a query sequence (a
sequence of the present invention) and a subject sequence, also
referred to as a global sequence alignment, can be determined using
the FASTDB computer program based on the algorithm of Brutlag et
al., (1990) Comp. App. Biosci. 6:237-245. In a sequence alignment
the query and subject sequences are both amino acid sequences. The
result of said global sequence alignment is in percent identity.
Preferred parameters used in a FASTDB amino acid alignment are:
Matrix=PAM 0, k-tuple=2, Mismatch Penalty=1, Joining Penalty=20,
Randomization Group Length=0, Cutoff Score=1, Window Size=sequence
length, Gap Penalty=5, Gap Size Penalty=0.05, Window Size=500 or
the length of the subject amino acid sequence, whichever is
shorter.
[0120] If the subject sequence is shorter than the query sequence
due to N- or C-terminal deletions, not because of internal
deletions, the results, in percent identity, must be manually
corrected. This is because the FASTDB program does not account for
N- and C-terminal truncations of the subject sequence when
calculating global percent identity. For subject sequences
truncated at the N- and C-termini, relative to the query sequence,
the percent identity is corrected by calculating the number of
residues of the query sequence that are N- and C-terminal of the
subject sequence, which are not matched/aligned with a
corresponding subject residue, as a percent of the total bases of
the query sequence. Whether a residue is matched/aligned is
determined by results of the FASTDB sequence alignment. This
percentage is then subtracted from the percent identity, calculated
by the above FASTDB program using the specified parameters, to
arrive at a final percent identity score. This final percent
identity score is what is used for the purposes of the present
invention. Only residues to the N- and C-termini of the subject
sequence, which are not matched/aligned with the query sequence,
are considered for the purposes of manually adjusting the percent
identity score. That is. only query amino acid residues outside the
farthest N- and C-terminal residues of the subject sequence.
[0121] For example, a 90 amino acid residue subject sequence is
aligned with a 100 residue query sequence to determine percent
identity. The deletion occurs at the N-terminus of the subject
sequence and therefore, the FASTDB alignment does not match/align
with the first 10 residues at the N-terminus. The 10 unpaired
residues represent 10% of the sequence (number of residues at the
N- and C-termini not matched/total number of residues in the query
sequence) so 10% is subtracted from the percent identity score
calculated by the FASTDB program. If the remaining 90 residues were
perfectly matched the final percent identity would be 90%. In
another example, a 90 residue subject sequence is compared with a
100 residue query sequence. This time the deletions are internal so
there are no residues at the N- or C-termini of the subject
sequence which are not matched/aligned with the query. In this case
the percent identity calculated by FASTDB is not manually
corrected. Once again, only residue positions outside the N- and
C-terminal ends of the subject sequence, as displayed in the FASTDB
alignment, which are not matched/aligned with the query sequence
are manually corrected. No other manual corrections are to made for
the purposes of the present invention.
[0122] The above polypeptide sequences are included irrespective of
whether they have their normal biological activity. This is because
even where a particular polypeptide molecule does not have
biological activity, one of skill in the art would still know how
to use the polypeptide, for instance, as a vaccine or to generate
antibodies. Other uses of the polypeptides of the present invention
that do not have E. faecalis activity include, inter alia, as
epitope tags, in epitope mapping, and as molecular weight markers
on SDS-PAGE gels or on molecular sieve gel filtration columns using
methods known to those of skill in the art.
[0123] As described below, the polypeptides of the present
invention can also be used to raise polyclonal and monoclonal
antibodies, which are useful in assays for detecting E. faecalis
protein expression or as agonists and antagonists capable of
enhancing or inhibiting E. Faecalis protein function. Further, such
polypeptides can be used in the yeast two-hybrid system to
"capture" E. faecalis protein binding proteins which are also
candidate agonists and antagonists according to the present
invention. See, e.g., Fields et al. (1989) Nature 340:245-246.
[0124] Epitope-Bearing Portions
[0125] In another aspect, the invention provides peptides and
polypeptides comprising epitope-bearing portions of the E. faecalis
polypeptides of the present invention. These epitopes are
immunogenic or antigenic epitopes of the polypeptides of the
present invention. An "immunogenic epitope" is defined as a part of
a protein that elicits an antibody response when the whole protein
or polypeptide is the immunogen. These immunogenic epitopes are
believed to be confined to a few loci on the molecule. On the other
hand, a region of a protein molecule to which an antibody can bind
is defined as an "antigenic determinant" or "antigenic epitope."
The number of immunogenic epitopes of a protein generally is less
than the number of antigenic epitopes. See, e.g., Geysen, et al.
(1983) Proc. Natl. Acad. Sci. USA 81:3998-4002. Predicted antigenic
epitopes are shown in Table 4, below. It is pointed out that Table
4 only lists amino acid residues comprising epitopes predicted to
have the highest degree of antigenicity. The polypeptides not
listed in Table 4 and portions of polypeptides not listed in Table
4 are not considered non-antigenic. This is because they may still
be antigenic in vivo but merely not recognized as such by the
particular algorithm used. Thus, Table 4 lists the amino acid
residues comprising preferred antigenic epitopes but not a complete
list. Amino acid residues comprising other antigenic epitopes may
be determined by algorithms similar to the Jameson-Wolf analysis or
by in vivo testing for an antigenic response using the methods
described herein or those known in the art.
[0126] As to the selection of peptides or polypeptides bearing an
antigenic epitope (i.e., that contain a region of a protein
molecule to which an antibody can bind), it is well known in that
art that relatively short synthetic peptides that mimic part of a
protein sequence are routinely capable of eliciting an antiserum
that reacts with the partially mimicked protein. See. e.g.,
Sutcliffe, et al., (1983) Science 219:660-666. Peptides capable of
eliciting protein-reactive sera are frequently represented in the
primary sequence of a protein, can be characterized by a set of
simple chemical rules, and are confined neither to immunodominant
regions of intact proteins (i.e., immunogenic epitopes) nor to the
amino or carboxyl terminals. Peptides that are extremely
hydrophobic and those of six or fewer residues generally are
ineffective at inducing antibodies that bind to the mimicked
protein; longer, peptides, especially those containing proline
residues, usually are effective. See. Sutcliffe, et al., supra, p.
661. For instance, 18 of 20 peptides designed according to these
guidelines, containing 8-39 residues covering 75% of the sequence
of the influenza virus hemagglutinin HA1 polypeptide chain, induced
antibodies that reacted with the HA1 protein or intact virus; and
12/12 peptides from the MuLV polymerase and 18/18 from the rabies
glycoprotein induced antibodies that precipitated the respective
proteins.
[0127] Antigenic epitope-bearing peptides and polypeptides of the
invention are therefore useful to raise antibodies, including
monoclonal antibodies, that bind specifically to a polypeptide of
the invention. Thus, a high proportion of hybridomas obtained by
fusion of spleen cells from donors immunized with an antigen
epitope-bearing peptide generally secrete antibody reactive with
the native protein. See Sutcliffe, et al., supra, p. 663. The
antibodies raised by antigenic epitope-bearing peptides or
polypeptides are useful to detect the mimicked protein, and
antibodies to different peptides may be used for tracking the fate
of various regions of a protein precursor which undergoes
post-translational processing. The peptides and anti-peptide
antibodies may be used in a variety of qualitative or quantitative
assays for the mimicked protein, for instance in competition assays
since it has been shown that even short peptides (e.g., about 9
amino acids) can bind and displace the larger peptides in
immunoprecipitation assays. See, e.g., Wilson, et al., (1984) Cell
37:767-778. The anti-peptide antibodies of the invention also are
useful for purification of the mimicked protein, for instance, by
adsorption chromatography using methods known in the art.
[0128] Antigenic epitope-bearing peptides and polypeptides of the
invention designed according to the above guidelines preferably
contain a sequence of at least seven, more preferably at least nine
and most preferably between about 10 to about 50 amino acids (i.e.
any integer between 7 and 50) contained within the amino acid
sequence of a polypeptide of the invention. However, peptides or
polypeptides comprising a larger portion of an amino acid sequence
of a polypeptide of the invention, containing about 50 to about 100
amino acids, or any length up to and including the entire amino
acid sequence of a polypeptide of the invention, also are
considered epitope-bearing peptides or polypeptides of the
invention and also are useful for inducing antibodies that react
with the mimicked protein. Preferably, the amino acid sequence of
the epitope-bearing peptide is selected to provide substantial
solubility in aqueous solvents (i.e., the sequence includes
relatively hydrophilic residues and highly hydrophobic sequences
are preferably avoided); and sequences containing proline residues
are particularly preferred.
[0129] Non-limiting examples of antigenic polypeptides or peptides
that can be used to generate an enterococcal-specific immune
response or antibodies include portions of the amino acid sequences
identified in Table 1. More specifically, Table 4 discloses a list
of non-limiting residues that are involved in the antigenicity of
the epitope-bearing fragments of the present invention. Therefore,
the present inventions provides for isolatd and purified antigenic
epitope-bearing fragements of the polypeptides of the present
invention comprising a peptide sequences of Table 4. The antigenic
epitope-bearing fragments comprising a peptide sequence of Table 4
preferably contain a sequence of at least seven, more preferably at
least nine and most preferably between about 10 to about 50 amino
acids (i.e. any integer between 7 and 50) of a polypeptide of the
present invention. That is, included in the present invention are
antigenic polypeptides between the integers of 7 and 50 amino acid
in length comprising one or more of the sequences of Table 4.
Therefore, in most cases, the polypeptides of Table 4 make up only
a portion of the antigenic polypeptide. All combinations of
sequences between the integers of 7 and 50 amino acid in length
comprising one or more of the sequences of Table 4 are included.
The antigenic epitope-bearing fragements may be specified by either
the number of contiguous amino acid residues or by specific
N-terminal and C-terminal positions as described above for the
polypeptide fragements of the present invention, wherein the
initiation codon is residue 1. Any number of the described
antigenic epitope-bearing fragements of the present invention may
also be excluded from the present invention in the same manner.
[0130] The epitope-bearing peptides and polypeptides of the
invention may be produced by any conventional means for making
peptides or polypeptides including recombinant means using nucleic
acid molecules of the invention. For instance, an epitope-bearing
amino acid sequence of the present invention may be fused to a
larger polypeptide which acts as a carrier during recombinant
production and purification, as well as during immunization to
produce anti-peptide antibodies. Epitope-bearing peptides also may
be synthesized using known methods of chemical synthesis. For
instance, Houghten has described a simple method for synthesis of
large numbers of peptides, such as 10-20 mg of 248 different 13
residue peptides representing single amino acid variants of a
segment of the HA1 polypeptide which were prepared and
characterized (by ELISA-type binding studies) in less than four
weeks (Houghten, R. A. Proc. Natl. Acad. Sci. USA 82:5131-5135
(1985)). This "Simultaneous Multiple Peptide Synthesis (SMPS)"
process is further described in U.S. Pat. No. 4,631.211 to Houghten
and coworkers (1986). In this procedure the individual resins for
the solid-phase synthesis of various peptides are contained in
separate solvent-permeable packets, enabling the optimal use of the
many identical repetitive steps involved in solid-phase methods. A
completely manual procedure allows 500-1000 or more syntheses to be
conducted simultaneously (Houghten et al. (1985) Proc. Natl. Acad.
Sci. 82:5131-5135 at 5134.
[0131] Epitope-bearing peptides and polypeptides of the invention
are used to induce antibodies according to methods well known in
the art. See, e.g., Sutcliffe, et al., supra;; Wilson, et al.,
supra;; and Bittle, et al. (1985) J. Gen. Virol. 66:2347-2354.
Generally, animals may be immunized with free peptide; however,
anti-peptide antibody titer may be boosted by coupling of the
peptide to a macromolecular carrier, such as keyhole limpet
hemacyanin (KLH) or tetanus toxoid. For instance, peptides
containing cysteine may be coupled to carrier using a linker such
as m-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS), while other
peptides may be coupled to carrier using a more general linking
agent such as glutaraldehyde. Animals such as rabbits, rats and
mice are immunized with either free or carrier-coupled peptides,
for instance, by intraperitoneal and/or intradermal injection of
emulsions containing about 100 .mu.g peptide or carrier protein and
Freund's adjuvant. Several booster injections may be needed, for
instance, at intervals of about two weeks, to provide a useful
titer of anti-peptide antibody which can be detected, for example,
by ELISA assay using free peptide adsorbed to a solid surface. The
titer of anti-peptide antibodies in serum from an immunized animal
may be increased by selection of anti-peptide antibodies, for
instance, by adsorption to the peptide on a solid support and
elution of the selected antibodies according to methods well known
in the art.
[0132] Immunogenic epitope-bearing peptides of the invention, i.e.,
those parts of a protein that elicit an antibody response when the
whole protein is the immunogen, are identified according to methods
known in the art. For instance, Geysen, et al., supra, discloses a
procedure for rapid concurrent synthesis on solid supports of
hundreds of peptides of sufficient purity to react in an ELISA.
Interaction of synthesized peptides with antibodies is then easily
detected without removing them from the support. In this manner a
peptide bearing an immunogenic epitope of a desired protein may be
identified routinely by one of ordinary skill in the art. For
instance, the immunologically important epitope in the coat protein
of foot-and-mouth disease virus was located by Geysen et al. supra
with a resolution of seven amino acids by synthesis of an
overlapping set of all 208 possible hexapeptides covering the
entire 213 amino acid sequence of the protein. Then, a complete
replacement set of peptides in which all 20 amino acids were
substituted in turn at every position within the epitope were
synthesized, and the particular amino acids conferring specificity
for the reaction with antibody were determined. Thus, peptide
analogs of the epitope-bearing peptides of the invention can be
made routinely by this method. U.S. Pat. No. 4,708,781 to Geysen
(1987) further describes this method of identifying a peptide
bearing an immunogenic epitope of a desired protein.
[0133] Further still, U.S. Pat. No. 5,194,392, to Geysen (1990),
describes a general method of detecting or determining the sequence
of monomers (amino acids or other compounds) which is a topological
equivalent of the epitope (i.e., a "mimotope") which is
complementary to a particular paratope (antigen binding site) of an
antibody of interest. More generally, U.S. Pat. No. 4,433,092, also
to Geysen (1989), describes a method of detecting or determining a
sequence of monomers which is a topographical equivalent of a
ligand which is complementary to the ligand binding site of a
particular receptor of interest. Similarly, U.S. Pat. No. 5,480,971
to Houghten, R. A. et al. (1996) discloses linear
C.sub.1-C.sub.7-alkyl peralkylated oligopeptides and sets and
libraries of such peptides, as well as methods for using such
oligopeptide sets and libraries for determining the sequence of a
peralkylated oligopeptide that preferentially binds to an acceptor
molecule of interest. Thus, non-peptide analogs of the
epitope-bearing peptides of the invention also can be made
routinely by these methods. The entire disclosure of each document
cited in this section on "Polypeptides and Fragments" is hereby
incorporated herein by reference.
[0134] As one of skill in the art will appreciate, the polypeptides
of the present invention and the epitope-bearing fragments thereof
described above can be combined with parts of the constant domain
of immunoglobulins (IgG), resulting in chimeric polypeptides. These
fusion proteins facilitate purification and show an increased
half-life in vivo. This has been shown, e.g., for chimeric proteins
consisting of the first two domains of the human CD4-polypeptide
and various domains of the constant regions of the heavy or light
chains of mammalian immunoglobulins. (EPA 0,394,827; Traunecker et
al. (1988) Nature 331:84-86. Fusion proteins that have a
disulfide-linked dimeric structure due to the IgG part can also be
more efficient in binding and neutralizing other molecules than a
monomeric E. faecalis polypeptide or fragment thereof alone. See
Fountoulakis et al. (1995) J. Biochem. 270:3958-3964. Nucleic acids
encoding the above epitopes of E. faecalis polypeptides can also be
recombined with a gene of interest as an epitope tag to aid in
detection and purification of the expressed polypeptide.
[0135] Antibodies
[0136] E. faecalis protein-specific antibodies for use in the
present invention can be raised against the intact E. faecalis
protein or an antigenic polypeptide fragment thereof, which may be
presented together with a carrier protein, such as an albumin, to
an animal system (such as rabbit or mouse) or, if it is long enough
(at least about 25 amino acids), without a carrier.
[0137] As used herein, the term "antibody" (Ab) or "monoclonal
antibody" (Mab) is meant to include intact molecules, single chain
whole antibodies, and antibody fragments. Antibody fragments of the
present invention include Fab and F(ab')2 and other fragments
including single-chain Fvs (scFv) and disulfide-linked Fvs (sdFv).
Also included in the present invention are chimeric and humanized
monoclonal antibodies and polyclonal antibodies specific for the
polypeptides of the present invention. The antibodies of the
present invention may be prepared by any of a variety of methods.
For example, cells expressing a polypeptide of the present
invention or an antigenic fragment thereof can be administered to
an animal in order to induce the production of sera containing
polyclonal antibodies. For example, a preparation of E. faecalis
polypeptide or fragment thereof is prepared and purified to render
it substantially free of natural contaminants. Such a preparation
is then introduced into an animal in order to produce polyclonal
antisera of greater specific activity.
[0138] In a preferred method, the antibodies of the present
invention are monoclonal antibodies or binding fragments thereof.
Such monoclonal antibodies can be prepared using hybridoma
technology. See, e.g. Harlow et al., ANTIBODIES: A LABORATORY
MANUAL, (Cold Spring Harbor Laboratory Press, 2nd ed. 1988);
Hammerling, et al., in: MONOCLONAL ANTIBODIES AND T-CELL HYBRIDOMAS
563-681 (Elsevier, N.Y., 1981). Fab and F(ab')2 fragments may be
produced by proteolytic cleavage, using enzymes such as papain (to
produce Fab fragments) or pepsin (to produce F(ab')2 fragments).
Alternatively, E. faecalis polypeptide-binding fragments, chimeric,
and humanized antibodies can be produced through the application of
recombinant DNA technology or through synthetic chemistry using
methods known in the art.
[0139] Alternatively, additional antibodies capable of binding to
the polypeptide antigen of the present invention may be produced in
a two-step procedure through the use of anti-idiotypic antibodies.
Such a method makes use of the fact that antibodies are themselves
antigens, and that, therefore, it is possible to obtain an antibody
which binds to a second antibody. In accordance with this method,
E. faecalis polypeptide-specific antibodies are used to immunize an
animal, preferably a mouse. The splenocytes of such an animal are
then used to produce hybridoma cells, and the hybridoma cells are
screened to identify clones which produce an antibody whose ability
to bind to the E. faecalis polypeptide-specific antibody can be
blocked by the E. faecalis polypeptide antigen. Such antibodies
comprise anti-idiotypic antibodies to the E. faecalis
polypeptide-specific antibody and can be used to immunize an animal
to induce formation of further E. faecalis polypeptide-specific
antibodies.
[0140] Antibodies and fragements thereof of the present invention
may be described by the portion of a polypeptide of the present
invention recognized or specifically bound by the antibody.
Antibody binding fragements of a polypeptide of the present
invention may be described or specified in the same manner as for
polypeptide fragements discussed above., i.e, by N-terminal and
C-terminal positions or by size in contiguous amino acid residues.
Any number of antibody binding fragments, of a polypeptide of the
present invention, specified by N-terminal and C-terminal positions
or by size in amino acid residues, as described above, may also be
excluded from the present invention. Therefore, the present
invention includes antibodies the specifically bind a particuarlly
discribed fragement of a polypeptide of the present invention and
allows for the exclusion of the same.
[0141] Antibodies and fragements thereof of the present invention
may also be described or specified in terms of their
cross-reactivity. Antibodies and fragements that do not bind
polypeptides of any other species of Enterococcus other than E.
faecalis are included in the present invention. Likewise,
antibodies and fragements that bind only species of Enterococcus,
i.e. antibodies and fragements that do not bind bacteria from any
genus other than Enterococcus, are included in the present
invention.
[0142] Diagnostic Assays
[0143] The present invention further relates to methods for
assaying staphylococcal infection in an animal by detecting the
expression of genes encoding staphylococcal polypeptides of the
present invention. The methods comprise analyzing tissue or body
fluid from the animal for Enterococcus-specific antibodies, nucleic
acids, or proteins. Analysis of nucleic acid specific to
Enterococcus is assayed by PCR or hybridization techniques using
nucleic acid sequences of the present invention as either
hybridization probes or primers. See, e.g., Sambrook et al.
Molecular cloning: A Laboratory Manual (Cold Spring Harbor
Laboratory Press, 2nd ed., 1989, page 54 reference); Eremeeva et
al. (1994) J. Clin. Microbiol. 32:803-810 (describing
differentiation among spotted fever group Rickettsiae species by
analysis of restriction fragment length polymorphism of
PCR-amplified DNA) and Chen et al. 1994 J. Clin. Microbiol.
32:589-595 (detecting B. burgdorferi nucleic acids via PCR).
[0144] Where diagnosis of a disease state related to infection with
Enterococcus has already been made, the present invention is useful
for monitoring progression or regression of the disease state
whereby patients exhibiting enhanced Enterococcus gene expression
will experience a worse clinical outcome relative to patients
expressing these gene(s) at a lower level.
[0145] By "biological sample" is intended any biological sample
obtained from an animal, cell line, tissue culture, or other source
which contains Enterococcus polypeptide, MRNA, or DNA. Biological
samples include body fluids (such as saliva, blood, plasma, urine,
mucus, synovial fluid, etc.) tissues (such as muscle, skin, and
cartilage) and any other biological source suspected of containing
Enterococcus polypeptides or nucleic acids. Methods for obtaining
biological samples such as tissue are well known in the art.
[0146] The present invention is useful for detecting diseases
related to Enterococcus infections in animals. Preferred animals
include monkeys, apes, cats, dogs, birds, cows, pigs, mice, horses,
rabbits and humans. Particularly preferred are humans.
[0147] Total RNA can be isolated from a biological sample using any
suitable technique such as the single-step
guanidinium-thiocyanate-phenol- -chloroform method described in
Chomczynski et al. (1987) Anal. Biochem. 162:156-159. mRNA encoding
Enterococcus polypeptides having sufficient homology to the nucleic
acid sequences identified in Table 1 to allow for hybridization
between complementary sequences are then assayed using any
appropriate method. These include Northern blot analysis, S1
nuclease mapping, the polymerase chain reaction (PCR), reverse
transcription in combination with the polymerase chain reaction
(RT-PCR), and reverse transcription in combination with the ligase
chain reaction (RT-LCR).
[0148] Northern blot analysis can be performed as described in
Harada et al. (1990) Cell 63:303-312. Briefly, total RNA is
prepared from a biological sample as described above. For the
Northern blot, the RINA is denatured in an appropriate buffer (such
as glyoxaludimethyl sulfoxide/sodium phosphate buffer), subjected
to agarose gel electrophoresis, and transferred onto a
nitrocellulose filter. After the RNAs have been linked to the
filter by a UV linker, the filter is prehybridized in a solution
containing formamide, SSC, Denhardt's solution, denatured salmon
sperm, SDS, and sodium phosphate buffer. A E. faecalis
polynucleotide sequence shown in Table 1 labeled according to any
appropriate method (such as the .sup.32P-multiprimed DNA labeling
system (Amersham)) is used as probe. After hybridization overnight,
the filter is washed and exposed to x-ray film. DNA for use as
probe according to the present invention is described in the
sections above and will preferably at least 15 nucleotides in
length.
[0149] S1 mapping can be performed as described in Fujita et al.
(1987) Cell 49:357-367. To prepare probe DNA for use in S1 mapping,
the sense strand of an above-described E. faecalis DNA sequence of
the present invention is used as a template to synthesize labeled
antisense DNA. The antisense DNA can then be digested using an
appropriate restriction endonuclease to generate further DNA probes
of a desired length. Such antisense probes are useful for
visualizing protected bands corresponding to the target mRNA (i.
e., mRNA encoding Enterococcus polypeptides).
[0150] Levels of mRNA encoding Enterococcus polypeptides are
assayed, for e.g., using the RT-PCR method described in Makino et
al. (1990) Technique 2:295-301. By this method, the radioactivities
of the "amplicons" in the polyacrylamide gel bands are linearly
related to the initial concentration of the target mRNA. Briefly,
this method involves adding total RNA isolated from a biological
sample in a reaction mixture containing a RT primer and appropriate
buffer. After incubating for primer annealing, the mixture can be
supplemented with a RT buffer, dNTPs, DTT, RNase inhibitor and
reverse transcriptase. After incubation to achieve reverse
transcription of the RNA, the RT products are then subject to PCR
using labeled primers. Alternatively, rather than labeling the
primers, a labeled dNTP can be included in the PCR reaction
mixture. PCR amplification can be performed in a DNA thermal cycler
according to conventional techniques. After a suitable number of
rounds to achieve amplification, the PCR reaction mixture is
electrophoresed on a polyacrylamide gel. After drying the gel, the
radioactivity of the appropriate bands (corresponding to the mRNA
encoding the Enterococcus polypeptides of the present invention)
are quantified using an imaging analyzer. RT and PCR reaction
ingredients and conditions, reagent and gel concentrations, and
labeling methods are well known in the art. Variations on the
RT-PCR method will be apparent to the skilled artisan. Other PCR
methods that can detect the nucleic acid of the present invention
can be found in PCR PRIMER: A LABORATORY MANUAL (C. W. Dieffenbach
et al. eds., Cold Spring Harbor Lab Press, 1995).
[0151] The polynucleotides of the present invention, including both
DNA and RNA, may be used to detect polynucleotides of the present
invention or Enterococcal species including E. faecalis using bio
chip technology. The present invention includes both high density
chip arrays (>1000 oligonucleotides per cm.sup.2) and low
density chip arrays (<1000 oligonucleotides per cm.sup.2). Bio
chips comprising arrays of polynucleotides of the present invention
may be used to detect Enterococcal species, including E. faecalis,
in biological and environmental samples and to diagnose an animal,
including humans, with an E. faecalis or other Enterococcal
infection. The bio chips of the present invention may comprise
polynucleotide sequences of other pathogens including bacteria,
viral, parasitic, and fungal polynucleotide sequences, in addition
to the polynucleotide sequences of the present invention, for use
in rapid diffenertial pathogenic detection and diagnosis. The bio
chips can also be used to monitor an E. faecalis or other
Enterococcal infections and to monitor the genetic changes
(deletions, insertions, mismatches, etc.) in response to drug
therapy in the clinic and drug development in the laboratory. The
bio chip technology comprising arrays of polynucleotides of the
present invention may also be used to simultaneously monitor the
expression of a multiplicity of genes, including those of the
present invention. The polynucleotides used to comprise a selected
array may be specified in the same manner as for the fragements,
i.e, by their 5' and 3' positions or length in contigious base
pairs and include from. Methods and particular uses of the
polynucleotides of the present invention to detect Enterococcal
species, including E. faecalis, using bio chip technology include
those known in the art and those of: U.S. Pat. Nos. 5510270,
5545531, 5445934, 5677195, 5532128, 5556752, 5527681, 5451683,
5424186, 5607646, 5658732 and World Pat. Nos. WO/9710365.
WO/9511995, WO/9743447, WO/9535505, each incorporated herein in
their entireties.
[0152] Biosensors using the polynucleotides of the present
invention may also be used to detect, diagnose, and monitor E.
faecalis or other Enterococcal species and infections thereof.
Biosensors using the polynucleotides of the present invention may
also be used to detect particular polynucleotides of the present
invention. Biosensors using the polynucleotides of the present
invention may also be used to monitor the genetic changes
(deletions, insertions, mismatches, etc.) in response to drug
therapy in the clinic and drug development in the laboratory.
Methods and particular uses of the polynucleotides of the present
invention to detect Enterococcal species, including E. faecalis,
using biosenors include those known in the art and those of: U.S.
Pat. Nos 5721102, 5658732, 5631170, and World Pat. Nos. WO97/3501
1, WO/9720203, each incorporated herein in their entireties.
[0153] Thus, the present invention includes both bio chips and
biosensors comprising polynucleotides of the present invention and
methods of their use.
[0154] Assaying Enterococcius polypeptide levels in a biological
sample can occur using any art-known method, such as antibody-based
techniques. For example, Enterococcus polypeptide expression in
tissues can be studied with classical immunohistological methods.
In these, the specific recognition is provided by the primary
antibody (polyclonal or monoclonal) but the secondary detection
system can utilize fluorescent, enzyme, or other conjugated
secondary antibodies. As a result, an immunohistological staining
of tissue section for pathological examination is obtained. Tissues
can also be extracted, e.g., with urea and neutral detergent, for
the liberation of Enterococcus polypeptides for Western-blot or
dot/slot assay. See, e.g., Jalkanen, M. et al. (1985) J. Cell.
Biol. 101:976-985; Jalkanen, M. et al. (1987) J. Cell . Biol.
105:3087-3096. In this technique, which is based on the use of
cationic solid phases, quantitation of a Enterococcus polypeptide
can be accomplished using an isolated Enterococcus polypeptide as a
standard. This technique can also be applied to body fluids.
[0155] Other antibody-based methods useful for detecting
Enterococcus polypeptide gene expression include immunoassays, such
as the ELISA and the radjoimmunoassay (RIA). For example, a
Enterococcus polypeptide-specific monoclonal antibodies can be used
both as an immunoabsorbent and as an enzyme-labeled probe to detect
and quantify a Enterococcus polypeptide. The amount of a
Enterococcus polypeptide present in the sample can be calculated by
reference to the amount present in a standard preparation using a
linear regression computer algorithm. Such an ELISA is described in
lacobelli et al. (1988) Breast Cancer Research and Treatment
11:19-30. In another ELISA assay, two distinct specific monoclonal
antibodies can be used to detect Enterococcus polypeptides in a
body fluid. In this assay, one of the antibodies is used as the
immunoabsorbent and the other as the enzyme-labeled probe.
[0156] The above techniques may be conducted essentially as a
"one-step" or "two-step" assay. The "one-step" assay involves
contacting the Enterococcus polypeptide with immobilized antibody
and, without washing, contacting the mixture with the labeled
antibody. The "two-step" assay involves washing before contacting
the mixture with the labeled antibody. Other conventional methods
may also be employed as suitable. It is usually desirable to
immobilize one component of the assay system on a support, thereby
allowing other components of the system to be brought into contact
with the component and readily removed from the sample. Variations
of the above and other immunological methods included in the
present invention can also be found in Harlow et al., ANTIBODIES: A
LABORATORY MANUAL, (Cold Spring Harbor Laboratory Press, 2nd ed.
1988).
[0157] Suitable enzyme labels include, for example, those from the
oxidase group, which catalyze the production of hydrogen peroxide
by reacting with substrate. Glucose oxidase is particularly
preferred as it has good stability and its substrate (glucose) is
readily available. Activity of an oxidase label may be assayed by
measuring the concentration of hydrogen peroxide formed by the
enzyme-labeled antibody/substrate reaction. Besides enzymes, other
suitable labels include radioisotopes, such as iodine (.sup.125I,
.sup.121I), carbon (.sup.14C), sulphur (.sup.35S), tritium
(.sup.3H), indium (.sup.112In), and technetium (.sup.99mTc), and
fluorescent labels, such as fluorescein and rhodamine, and
biotin.
[0158] Further suitable labels for the Enterococcus
polypeptide-specific antibodies of the present invention are
provided below. Examples of suitable enzyme labels include malate
dehydrogenase, Enterococcal nuclease, delta-5-steroid isomerase,
yeast-alcohol dehydrogenase, alpha-glycerol phosphate
dehydrogenase, triose phosphate isomerase, peroxidase, alkaline
phosphatase, asparaginase, glucose oxidase, beta-galactosidase,
ribonuclease, urease, catalase, glucose-6-phosphate dehydrogenase,
glucoamylase, and acetylcholine esterase.
[0159] Examples of suitable radioisotopic labels include .sup.3H,
.sup.111In, .sup.125I, .sup.131I, .sup.32P, .sup.35S, .sup.14C,
.sup.51Cr, .sup.57To, .sup.58Co, .sup.59Fe, .sup.75Se, .sup.152Eu,
.sup.90Y, .sup.67Cu, .sup.217Ci, .sup.211At, .sup.212Pb, .sup.47Sc,
.sup.109Pd, etc. .sup.111In is a preferred isotope where in vivo
imaging is used since its avoids the problem of dehalogenation of
the .sup.251I or .sup.131I-labeled monoclonal antibody by the
liver. In addition, this radionucleotide has a more favorable gamma
emission energy for imaging. See, e.g., Perkins et al. (1985) Eur.
J. Nucl. Med. 10:296-301; Carasquillo et al. (1987) J. Nucl. Med.
28:281-287. For example, .sup.111In coupled to monoclonal
antibodies with 1 (P isothiocyanatobenzyl)-DPTA has shown little
uptake in non-tumors tissues, particularly the liver, and therefore
enhances specificity of tumor localization. See, Esteban et al.
(1987) J. Nucl. Med. 28:861-870.
[0160] Examples of suitable non-radioactive isotopic labels include
.sup.157Gd, .sup.55Mn, .sup.162Dy, .sup.52Tr, and .sup.56Fe.
[0161] Examples of suitable fluorescent labels include an
.sup.152Eu label, a fluorescein label, an isothiocyanate label, a
rhodamine label, a phycoerythrin label, a phycocyanin label, an
allophycocyanin label, an o-phthaldehyde label, and a fluorescamine
label.
[0162] Examples of suitable toxin labels include, Pseudomonas
toxin, diphtheria toxin, ricin, and cholera toxin.
[0163] Examples of chemiluminescent labels include a luminal label,
an isoluminal label, an aromatic acridinium ester label. an
imidazole label. an acridinium salt label, an oxalate ester label,
a luciferin label, a luciferase label, and an aequorin label.
[0164] Examples of nuclear magnetic resonance contrasting agents
include heavy metal nuclei such as Gd, Mn, and iron.
[0165] Typical techniques for binding the above-described labels to
antibodies are provided by Kennedy et al. (1976) Clin. Chim. Acta
70:1-31, and Schurs et al. (1977) Clin. Chim. Acta 81:1-40.
Coupling techniques mentioned in the latter are the glutaraldehyde
method, the periodate method, the dimaleimide method, the
m-maleimidobenzyl-N-hydroxy- -succinimide ester method, all of
which methods are incorporated by reference herein.
[0166] In a related aspect, the invention includes a diagnostic kit
for use in screening serum containing antibodies specific against
E. faecalis infection. Such a kit may include an isolated E.
faecalis antigen comprising an epitope which is specifically
immunoreactive with at least one anti-E. faecalis antibody. Such a
kit also includes means for detecting the binding of said antibody
to the antigen. In specific embodiments, the kit may include a
recombinantly produced or chemically synthesized peptide or
polypeptide antigen. The peptide or polypeptide antigen may be
attached to a solid support.
[0167] In a more specific embodiment, the detecting means of the
above-described kit includes a solid support to which said peptide
or polypeptide antigen is attached. Such a kit may also include a
non-attached reporter-labeled anti-human antibody. In this
embodiment, binding of the antibody to the E. faecalis antigen can
be detected by binding of the reporter labeled antibody to the
anti-E. faecalis polypeptide antibody.
[0168] In a related aspect, the invention includes a method of
detecting E. faecalis infection in a subject. This detection method
includes reacting a body fluid, preferably serum, from the subject
with an isolated E. faecalis antigen, and examining the antigen for
the presence of bound antibody. In a specific embodiment, the
method includes a polypeptide antigen attached to a solid support,
and serum is reacted with the support. Subsequently, the support is
reacted with a reporter-labeled anti-human antibody. The support is
then examined for the presence of reporter-labeled antibody.
[0169] The solid surface reagent employed in the above assays and
kits is prepared by known techniques for attaching protein material
to solid support material, such as polymeric beads, dip sticks,
96-well plates or filter material. These attachment methods
generally include non-specific adsorption of the protein to the
support or covalent attachment of the protein, typically through a
free amine group, to a chemically reactive group on the solid
support, such as an activated carboxyl, hydroxyl, or aldehyde
group. Alternatively, streptavidin coated plates can be used in
conjunction with biotinylated antigen(s).
[0170] The polypeptides and antibodies of the present invention,
including fragments thereof, may be used to detect Enterococcal
species including E. faecalis using bio chip and biosensor
technology. Bio chip and biosensors of the present invention may
comprise the polypeptides of the present invention to detect
antibodies, which specifically recognize Enterococcal species,
including E. faecalis. Bio chip and biosensors of the present
invention may also comprise antibodies which specifically recognize
the polypeptides of the present invention to detect Enterococcal
species, including E. faecalis or specific polypeptides of the
present invention. Bio chips or biosensors comprising polypeptides
or antibodies of the present invention may be used to detect
Enterococcal species, including E. faecalis, in biological and
environmental samples and to diagnose an animal, including humans,
with an E. faecalis or other Enterococcal infection. Thus, the
present invention includes both bio chips and biosensors comprising
polypeptides or antibodies of the present invention and methods of
their use.
[0171] The bio chips of the present invention may further comprise
polypeptide sequences of other pathogens including bacteria, viral,
parasitic, and fungal polypeptide sequences, in addition to the
polypeptide sequences of the present invention, for use in rapid
diffenertial pathogenic detection and diagnosis. The bio chips of
the present invention may further comprise antibodies or fragements
thereof specific for other pathogens including bacteria, viral,
parasitic, and fungal polypeptide sequences, in addition to the
antibodies or fragements thereof of the present invention, for use
in rapid diffenertial pathogenic detection and diagnosis. The bio
chips and biosensors of the present invention may also be used to
monitor an E. faecalis or other Enterococcal infection and to
monitor the genetic changes (amio acid deletions, insertions,
substitutions, etc.) in response to drug therapy in the clinic and
drug development in the laboratory. The bio chip and biosensors
comprising polypeptides or antibodies of the present invention may
also be used to simultaneously monitor the expression of a
multiplicity of polypeptides, including those of the present
invention. The polypeptides used to comprise a bio chip or
biosensor of the present invention may be specified in the same
manner as for the fragements, i.e, by their N-terminal and
C-terminal positions or length in contigious amino acid residue.
Methods and particular uses of the polypeptides and antibodies of
the present invention to detect Enterococcal species, including E.
faecalis, or specific polypeptides using bio chip and biosensor
technology include those known in the art, those of the U.S. Pat.
Nos. and World Pat. Nos. listed above for biochips and biosensors
using polynucleotides of the present invention, and those of: U.S.
Pat. Nos. 5658732, 5135852, 5567301, 5677196, 5690894 and World
Pat. Nos. WO9729366, WO9612957, each incorporated herein in their
entireties.
[0172] Treatment
[0173] Agonists and Antagonists--Assays and Molecules
[0174] The invention also provides a method of screening compounds
to identify those which enhance or block the biological activity of
the E. faecalis polypeptides of the present invention. The present
invention further provides where the compounds kill or slow the
growth of E. faecalis. The ability of E. faecalis antagonists,
including E. faecalis ligands, to prophylactically or
therapeutically block antibiotic resistance may be easily tested by
the skilled artisan. See, e.g., Straden et al. (1997) J. Bacteriol.
179(1):9-16.
[0175] An agonist is a compound which increases the natural
biological function or which functions in a manner similar to the
polypeptides of the present invention, while antagonists decrease
or eliminate such functions. Potential antagonists include small
organic molecules, peptides, polypeptides, and antibodies that bind
to a polypeptide of the invention and thereby inhibit or extinguish
its activity.
[0176] The antagonists may be employed for instance to inhibit
peptidoglycan cross bridge formation. Antibodies against E.
faecalis may be employed to bind to and inhibit E. faecalis
activity to treat antibiotic resistance. Any of the above
antagonists may be employed in a composition with a
pharmaceutically acceptable carrier.
[0177] Vaccines
[0178] The present invention also provides vaccines comprising one
or more polypeptides of the present invention. Heterogeneity in the
composition of a vaccine may be provided by combining E. faecalis
polypeptides of the present invention. Multi-component vaccines of
this type are desirable because they are likely to be more
effective in eliciting protective immune responses against multiple
species and strains of the Enterococcus genus than single
polypeptide vaccines.
[0179] Multi-component vaccines are known in the art to elicit
antibody production to numerous immunogenic components. See, e.g.,
Decker et al. (1996) J. Infect. Dis. 174:S270-275. In addition, a
hepatitis B, diphtheria, tetanus, pertussis tetravalent vaccine has
recently been demonstrated to elicit protective levels of
antibodies in human infants against all four pathogenic agents.
See, e.g., Aristegui, J. et al. (1997) Vaccine 15:7-9.
[0180] The present invention in addition to single-component
vaccines includes multi-component vaccines. These vaccines comprise
more than one polypeptide, immunogen or antigen. Thus, a
multi-component vaccine would be a vaccine comprising more than one
of the E. faecalis polypeptides of the present invention.
[0181] Further within the scope of the invention are whole cell and
whole viral vaccines. Such vaccines may be produced recombinantly
and involve the expression of one or more of the E. faecalis
polypeptides described in Table 1. For example, the E. faecalis
polypeptides of the present invention may be either secreted or
localized intracellular, on the cell surface, or in the periplasmic
space. Further, when a recombinant virus is used, the E. faecalis
polypeptides of the present invention may, for example, be
localized in the viral envelope, on the surface of the capsid, or
internally within the capsid. Whole cells vaccines which employ
cells expressing heterologous proteins are known in the art. See,
e.g., Robinson, K. et al. (1997) Nature Biotech. 15:653-657;
Sirard, J. et al. (1997) Infect. Immun. 65:2029-2033; Chabalgoity,
J. et al. (1997) Infect. Immun. 65:2402-2412. These cells may be
administered live or may be killed prior to administration.
Chabalgoity, J. et al., supra, for example, report the successful
use in mice of a live attenuated Salmonella vaccine strain which
expresses a portion of a platyhelminth fatty acid-binding protein
as a fusion protein on its cells surface.
[0182] A multi-component vaccine can also be prepared using
techniques known in the art by combining one or more E. faecalis
polypeptides of the present invention, or fragments thereof, with
additional non-Enterococcal components (e.g., diphtheria toxin or
tetanus toxin, and/or other compounds known to elicit an immune
response). Such vaccines are useful for eliciting protective immune
responses to both members of the Enterococcus genus and
non-Enterococcal pathogenic agents.
[0183] The vaccines of the present invention also include DNA
vaccines. DNA vaccines are currently being developed for a number
of infectious diseases. See, et al., Boyer, et al. (1997) Nat. Med.
3:526-532; reviewed in Spier, R. (1996) Vaccine 14:1285-1288. Such
DNA vaccines contain a nucleotide sequence encoding one or more E.
faecalis polypeptides of the present invention oriented in a manner
that allows for expression of the subject polypeptide. For example,
the direct administration of plasmid DNA encoding B. burgdorgeri
OspA has been shown to elicit protective immunity in mice against
borrelial challenge. See, Luke et al. (1997) J. Infect. Dis.
175:91-97.
[0184] The present invention also relates to the administration of
a vaccine which is co-administered with a molecule capable of
modulating immune responses. Kim et al. (1997) Nature Biotech.
15:641-646, for example, report the enhancement of immune responses
produced by DNA immunizations when DNA sequences encoding molecules
which stimulate the immune response are co-administered. In a
similar fashion, the vaccines of the present invention may be
co-administered with either nucleic acids encoding immune
modulators or the immune modulators themselves. These immune
modulators include granulocyte macrophage colony stimulating factor
(GM-CSF) and CD86.
[0185] The vaccines of the present invention may be used to confer
resistance to Enterococcal infection by either passive or active
immunization. When the vaccines of the present invention are used
to confer resistance to Enterococcal infection through active
immunization, a vaccine of the present invention is administered to
an animal to elicit a protective immune response which either
prevents or attenuates a Enterococcal infection. When the vaccines
of the present invention are used to confer resistance to
Enterococcal infection through passive immunization, the vaccine is
provided to a host animal (e.g., human, dog, or mouse), and the
antisera elicited by this antisera is recovered and directly
provided to a recipient suspected of having an infection caused by
a member of the Enierococcus genus.
[0186] The ability to label antibodies, or fragments of antibodies,
with toxin molecules provides an additional method for treating
Enterococcal infections when passive immunization is conducted. In
this embodiment, antibodies, or fragments of antibodies, capable of
recognizing the E. faecalis polypeptides disclosed herein, or
fragments thereof, as well as other Enterococcus proteins, are
labeled with toxin molecules prior to their administration to the
patient. When such toxin derivatized antibodies bind to
Enterococcus cells, toxin moieties will be localized to these cells
and will cause their death.
[0187] The present invention thus concerns and provides a means for
preventing or attenuating a Enterococcal infection resulting from
organisms which have antigens that are recognized and bound by
antisera produced in response to the polypeptides of the present
invention. As used herein, a vaccine is said to prevent or
attenuate a disease if its administration to an animal results
either in the total or partial attenuation (i e., suppression) of a
symptom or condition of the disease, or in the total or partial
immunity of the animal to the disease.
[0188] The administration of the vaccine (or the antisera which it
elicits) may be for either a "prophylactic" or "therapeutic"
purpose. When provided prophylactically, the compound(s) are
provided in advance of any symptoms of Enterococcal infection. The
prophylactic administration of the compound(s) serves to prevent or
attenuate any subsequent infection. When provided therapeutically,
the compound(s) is provided upon or after the detection of symptoms
which indicate that an animal may be infected with a member of the
Enterococcus genus. The therapeutic administration of the
compound(s) serves to attenuate any actual infection. Thus, the E.
faecalis polypeptides, and fragments thereof, of the present
invention may be provided either prior to the onset of infection
(so as to prevent or attenuate an anticipated infection) or after
the initiation of an actual infection.
[0189] The polypeptides of the invention, whether encoding a
portion of a native protein or a functional derivative thereof, may
be administered in pure form or may be coupled to a macromolecular
carrier. Example of such carriers are proteins and carbohydrates.
Suitable proteins which may act as macromolecular carrier for
enhancing the immunogenicity of the polypeptides of the present
invention include keyhole limpet hemacyanin (KLH) tetanus toxoid,
pertussis toxin, bovine serum albumin, and ovalbumin. Methods for
coupling the polypeptides of the present invention to such
macromolecular carriers are disclosed in Harlow et al., ANTIBODIES:
A LABORATORY MANUAL, (Cold Spring Harbor Laboratory Press, 2nd ed.
1988).
[0190] A composition is said to be "pharmacologically or
physiologically acceptable" if its administration can be tolerated
by a recipient animal and is otherwise suitable for administration
to that animal. Such an agent is said to be administered in a
"therapeutically effective amount" if the amount administered is
physiologically significant. An agent is physiologically
significant if its presence results in a detectable change in the
physiology of a recipient patient.
[0191] While in all instances the vaccine of the present invention
is administered as a pharmacologically acceptable compound, one
skilled in the art would recognize that the composition of a
pharmacologically acceptable compound varies with the animal to
which it is administered. For example, a vaccine intended for human
use will generally not be co-administered with Freund's adjuvant.
Further, the level of purity of the E. faecalis polypeptides of the
present invention will normally be higher when administered to a
human than when administered to a non-human animal.
[0192] As would be understood by one of ordinary skill in the art,
when the vaccine of the present invention is provided to an animal,
it may be in a composition which may contain salts, buffers,
adjuvants, or other substances which are desirable for improving
the efficacy of the composition. Adjuvants are substances that can
be used to specifically augment a specific immune response. These
substances generally perform two functions: (1) they protect the
antigen(s) from being rapidly catabolized after administration and
(2) they nonspecifically stimulate immune responses.
[0193] Normally, the adjuvant and the composition are mixed prior
to presentation to the immune system, or presented separately, but
into the same site of the animal being immunized. Adjuvants can be
loosely divided into several groups based upon their composition.
These groups include oil adjuvants (for example, Freund's complete
and incomplete), mineral salts (for example, AlK(SO.sub.4).sub.2,
AlNa(SO.sub.4).sub.2, AlNH.sub.4(SO.sub.4), silica, kaolin, and
carbon), polynucleotides (for example, poly IC and poly AU acids),
and certain natural substances (for example, wax D from
Mycobacterium tuberculosis, as well as substances found in
Corynebacterium parvum, or Bordetella pertussis, and members of the
genus Brucella. Other substances useful as adjuvants are the
saponins such as, for example, Quil A. (Superfos A/S, Denmark).
Preferred adjuvants for use in the present invention include
aluminum salts, such as AlK(SO.sub.4).sub.2, AlNa(SO.sub.4).sub.2,
and AlNH.sub.4(SO.sub.4). Examples of materials suitable for use in
vaccine compositions are provided in REMINGTON'S PHARMACEUTICAL
SCIENCES 1324-1341 (A. Osol, ed, Mack Publishing Co, Easton, Pa.,
(1980) (incorporated herein by reference).
[0194] The therapeutic compositions of the present invention can be
administered parenterally by injection, rapid infusion,
nasopharyngeal absorption (intranasopharangeally), dermoabsorption,
or orally. The compositions may alternatively be administered
intramuscularly, or intravenously. Compositions for parenteral
administration include sterile aqueous or non-aqueous solutions,
suspensions, and emulsions. Examples of non-aqueous solvents are
propylene glycol, polyethylene glycol, vegetable oils such as olive
oil, and injectable organic esters such as ethyl oleate. Carriers
or occlusive dressings can be used to increase skin permeability
and enhance antigen absorption. Liquid dosage forms for oral
administration may generally comprise a liposome solution
containing the liquid dosage form. Suitable forms for suspending
liposomes include emulsions, suspensions, solutions, syrups, and
elixirs containing inert diluents commonly used in the art, such as
purified water. Besides the inert diluents, such compositions can
also include adjuvants, wetting agents, emulsifying and suspending
agents, or sweetening, flavoring, or perfuming agents.
[0195] Therapeutic compositions of the present invention can also
be administered in encapsulated form. For example, intranasal
immunization using vaccines encapsulated in biodegradable
microsphere composed of poly(DL-lactide-co-glycolide). See, Shahin,
R. et al. (1995) Infect. Immun. 63:1195-1200. Similarly, orally
administered encapsulated Salmonella typhimurium antigens can also
be used. Allaoui-Attarki, K. et al. (1997) Infect. Immun.
65:853-857. Encapsulated vaccines of the present invention can be
administered by a variety of routes including those involving
contacting the vaccine with mucous membranes (e.g., intranasally,
intracolonicly, intraduodenally).
[0196] Many different techniques exist for the timing of the
immunizations when a multiple administration regimen is utilized.
It is possible to use the compositions of the invention more than
once to increase the levels and diversities of expression of the
immunoglobulin repertoire expressed by the immunized animal.
Typically, if multiple immunizations are given, they will be given
one to two months apart.
[0197] According to the present invention, an "effective amount" of
a therapeutic composition is one which is sufficient to achieve a
desired biological effect. Generally, the dosage needed to provide
an effective amount of the composition will vary depending upon
such factors as the animal's or human's age, condition, sex, and
extent of disease, if any, and other variables which can be
adjusted by one of ordinary skill in the art.
[0198] The antigenic preparations of the invention can be
administered by either single or multiple dosages of an effective
amount. Effective amounts of the compositions of the invention can
vary from 0.01-1,000 .mu.g/ml per dose, more preferably 0.1-500
.mu.g/ml per dose, and most preferably 10-300 .mu.g/ml per
dose.
Examples
Example 1
[0199] Isolation of a Selected DNA Clone From the Deposited Sample
of E. Faecalis
[0200] Three approaches can be used to isolate a E. faecalis clone
comprising a polynucleotide of the present invention from any E.
faecalis genomic DNA library. The E. faecalis strain V586 has been
deposited as a convienent source for obtaining a E. faecalis strain
although a wide varity of strains E. faecalis strains can be used
which are known in the art.
[0201] E. faecalis genomic DNA is prepared using the following
method. A 20 ml overnight bacterial culture grown in a rich medium
(e.g., Trypticase Soy Broth, Brain Heart Infusion broth or Super
broth), pelleted, washed two times with TES (3 OmM Tris-pH 8.0, 25
mM EDTA, 50 mM NaCl), and resuspended in 5 ml high salt TES (2.5 M
NaCl). Lysostaphin is added to final concentration of approx 50
.mu.g/ml and the mixture is rotated slowly 1 hour at 37 C to make
protoplast cells. The solution is then placed in incubator (or
place in a shaking water bath) and warmed to 55 C. Five hundred
micro liter of 20% sarcosyl in TES (final concentration 2%) is then
added to lyse the cells. Next, guanidine HCl is added to a final
concentration of 7M (3.69 g in 5.5 ml). The mixture is swirled
slowly at 55 C for 60-90 min (solution should clear). A CsCl
gradient is then set up in SW41 ultra clear tubes using 20 ml 5.7 M
CsCl and overlaying with 2.85 M CsCl. The gradient is carefully
overlayed with the DNA-containing GuHCl solution. The gradient is
spun at 30,000 rpm, 20 C for 24 hr and the lower DNA band is
collected. The volume is increased to 5 ml with TE buffer. The DNA
is then treated with protease K (10 .mu.g/ml) overnight at 37 C,
and precipitated with ethanol. The precipitated DNA is resuspended
in a desired buffer.
[0202] In the first method, a plasmid is directly isolated by
screening a plasmid E. faecalis genomic DNA library using a
polynucleotide probe corresponding to a polynucleotide of the
present invention. Particularly, a specific polynucleotide with
30-40 nucleotides is synthesized using an Applied Biosystems DNA
synthesizer according to the sequence reported. The oligonucleotide
is labeled, for instance, with .sup.32P-.gamma.-ATP using T4
polynucleotide kinase and purified according to routine methods.
(See, e.g., Maniatis et al., Molecular Cloning: A Laboratory
Manual, Cold Spring Harbor Press, Cold Spring, N.Y. (1982).) The
library is transformed into a suitable host, as indicated above
(such as XL-1 Blue (Stratagene)) using techniques known to those of
skill in the art. See, e.g., Sambrook et al. MOLECULAR CLONING: A
LABORATORY MANUAL (Cold Spring Harbor, N.Y. 2nd ed. 1989); Ausubel
et al., CURRENT PROTOCALS IN MOLECULAR BIOLOGY (John Wiley and
Sons, N.Y. 1989). The transformants are plated on 1.5% agar plates
(containing the appropriate selection agent, e.g., ampicillin) to a
density of about 150 transfornants (colonies) per plate. These
plates are screened using Nylon membranes according to routine
methods for bacterial colony screening. See, e.g., Sambrook et al.
MOLECULAR CLONING: A LABORATORY MANUAL (Cold Spring Harbor, N.Y.
2nd ed. 1989); Ausubel et al., CURRENT PROTOCALS IN MOLECULAR
BIOLOGY (John Wiley and Sons, N.Y. 1989) or other techniques known
to those of skill in the art.
[0203] Alternatively, two primers of 15-25 nucleotides derived from
the 5' and 3' ends of a polynucleotide of Table 1 are synthesized
and used to amplify the desired DNA by PCR using a E. faecalis
genomic DNA prep as a template. PCR is carried out under routine
conditions, for instance, in 25 .mu.l of reaction mixture with 0.5
.mu.g of the above DNA template. A convenient reaction mixture is
1.5-5 mM MgCl.sub.2, 0.01% (w/v) gelatin, 20 .mu.M each of dATP,
dCTP, dGTP, dTTP, 25 pmol of each primer and 0.25 Unit of Taq
polymerase. Thirty five cycles of PCR (denaturation at 94.degree.
C. for 1 min; annealing at 55.degree. C. for 1 min; elongation at
72.degree. C. for 1 min) are performed with a Perkin-Elmer Cetus
automated thermal cycler. The amplified product is analyzed by
agarose gel electrophoresis and the DNA band with expected
molecular weight is excised and purified. The PCR product is
verified to be the selected sequence by subcloning and sequencing
the DNA product.
[0204] Finally, overlapping oligos of the DNA sequences of Table 1
can be chemically synthesized and used to generate a nucleotide
sequence of desired length using PCR methods known in the art.
Example 2(a)
[0205] Expression and Purification Enterococcal polypeptides in E.
Coli
[0206] The bacterial expression vector pQE60 was used for bacterial
expression of some of the polypeptide fragements used in the soft
tissue and systemic infection models discussed below. (QIAGEN,
Inc., 9259 Eton Avenue, Chatsworth, Calif., 91311). pQE60 encodes
ampicillin antibiotic resistance ("Ampr") and contains a bacterial
origin of replication ("ori"), an IPTG inducible promoter, a
ribosome binding site ("RBS"), six codons encoding histidine
residues that allow affinity purification using
nickel-nitrilo-tri-acetic acid ("Ni-NTA") affinity resin (QIAGEN,
Inc., supra) and suitable single restriction enzyme cleavage sites.
These elements are arranged such that an inserted DNA fragment
encoding a polypeptide expresses that polypeptide with the six His
residues (i.e., a "6.times. His tag") covalently linked to the
carboxyl terminus of that polypeptide.
[0207] The DNA sequence encoding the desired portion of a E.
faecalis protein of the present invention was amplified from E.
faecalis genomic DNA using PCR oligonucleotide primers which anneal
to the 5' and 3' sequences coding for the portions of the E.
faecalis polynucleotide shown in Table 1. Additional nucleotides
containing restriction sites to facilitate cloning in the pQE60
vector are added to the 5' and 3' sequences, respectively.
[0208] For cloning the mature protein, the 5' primer has a sequence
containing an appropriate restriction site followed by nucleotides
of the amino terminal coding sequence of the desired E. faecalis
polynucleotide sequence in Table 1. One of ordinary skill in the
art would appreciate that the point in the protein coding sequence
where the 5' and 3' primers begin may be varied to amplify a DNA
segment encoding any desired portion of the complete protein
shorter or longer than the mature form. The 3' primer has a
sequence containing an appropriate restriction site followed by
nucleotides complementary to the 3' end of the polypeptide coding
sequence of Table 1, excluding a stop codon, with the coding
sequence aligned with the restriction site so as to maintain its
reading frame with that of the six His codons in the pQE60
vector.
[0209] The amplified E. faecalis DNA fragment and the vector pQE60
were digested with restriction enzymes which recognize the sites in
the primers and the digested DNAs were then ligated together. The
E. faecalis DNA was inserted into the restricted pQE60 vector in a
manner which places the E. faecalis protein coding region
downstream from the IPTG-inducible promoter and in-frame with an
initiating AUG and the six histidine codons.
[0210] The ligation mixture was transformed into competent E. coli
cells using standard procedures such as those described by Sambrook
et al., supra. E. coli strain M15/rep4, containing multiple copies
of the plasmid pREP4, which expresses the lac repressor and confers
kanamycin resistance ("Kanr"), was used in carrying out the
illustrative example described herein. This strain, which was only
one of many that are suitable for expressing a E. faecalis
polypeptide, is available commercially (QIAGEN, Inc., supra).
Transformants were identified by their ability to grow on LB agar
plates in the presence of ampicillin and kanamycin. Plasmid DNA was
isolated from resistant colonies and the identity of the cloned DNA
confirmed by restriction analysis, PCR and DNA sequencing.
[0211] Clones containing the desired constructs were grown
overnight ("O/N") in liquid culture in LB media supplemented with
both ampicillin (100 .mu.g/ml) and kanamycin (25 .mu.g/ml). The O/N
culture was used to inoculate a large culture, at a dilution of
approximately 1:25 to 1:250. The cells were grown to an optical
density at 600 nm ("OD600") of between 0.4 and 0.6.
Isopropyl-.beta.-D-thiogalactopyranoside ("IPTG") was then added to
a final concentration of 1 mM to induce transcription from the lac
repressor sensitive promoter, by inactivating the lacd repressor.
Cells subsequently were incubated further for 3 to 4 hours. Cells
then were harvested by centrifugation.
[0212] The cells were then stirred for 3-4 hours at 4.degree. C. in
6 M guanidine-HCl, pH 8. The cell debris was removed by
centrifugation, and the supernatant containing the E. faecalis
polypeptide was loaded onto a nickel-nitrilo-tri-acetic acid
("Ni-NTA") affinity resin column (QIAGEN, Inc., supra). Proteins
with a 6.times. His tag bind to the Ni-NTA resin with high affinity
were purified in a simple one-step procedure (for details see: The
QlAexpressionist, 1995, QIAGEN, Inc., supra). Briefly the
supernatant was loaded onto the column in 6 M guanidine-HCl, pH 8,
the column was first washed with 10 volumes of 6 M guanidine-HCl,
pH 8, then washed with 10 volumes of 6 M guanidine-HCl pH 6, and
finally the E. faecalis polypeptide was eluted with 6 M
guanidine-HCI, pH 5.
[0213] The purified protein was then renatured by dialyzing it
against phosphate-buffered saline (PBS) or 50 mM Na-acetate, pH 6
buffer plus 200 mM NaCl. Alternatively, the protein could be
successfully refolded while immobilized on the Ni-NTA column. The
recommended conditions are as follows: renature using a linear
6M-1M urea gradient in 500 mM NaCl, 20% glycerol, 20 mM Tris/HCl pH
7.4, containing protease inhibitors. The renaturation should be
performed over a period of 1.5 hours or more. After renaturation
the proteins can be eluted by the addition of 250 mM immidazole.
Immidazole was removed by a final dialyzing step against PBS or 50
mM sodium acetate pH 6 buffer plus 200 mM NaCl. The purified
protein was stored at 4.degree. C. or frozen at -80.degree. C.
[0214] Some of the polypeptide of the present invention were
prepared using a non-denaturing protein purification method. For
these polypeptides, the cell pellet from each liter of culture was
resuspended in 25 mls of Lysis Buffer A at 4.degree. C. (Lysis
Buffer A=50 mM Na-phosphate, 300 mM NaCl, 10 mM 2-mercaptoethanol,
10% Glycerol, pH 7.5 with 1 tablet of Complete EDTA-free protease
inhibitor cocktail (Boehringer Mannheim #1873580) per 50 ml of
buffer). Absorbance at 550 nm was approximately 10-20 O.D./ml. The
suspension was then put through three freeze/thaw cycles from
-70.degree. C. (using a ethanol-dry ice bath) up to room
temperature. The cells were lysed via sonication in short 10 sec
bursts over 3 minutes at approximately 80 W while kept on ice. The
sonicated sample was then centrifuged at 15,000 RPM for 30 minutes
at 4.degree. C. The supernatant was passed through a column
containing 1.0 ml of CL-4B resin to pre-clear the sample of any
proteins that may bind to agarose non-specifically, and the
flow-through fraction was collected.
[0215] The pre-cleared flow-through was applied to a
nickel-nitrilo-tri-acetic acid ("Ni-NTA") affinity resin column
(Quiagen, Inc., supra). Proteins with a 6.times. His tag bind to
the Ni-NTA resin with high affinity and can be purified in a simple
one-step procedure. Briefly, the supernatant was loaded onto the
column in Lysis Buffer A at 4.degree. C., the column was first
washed with 10 volumes of Lysis Buffer A until the A280 of the
eluate returns to the baseline. Then, the column was washed with 5
volumes of 40 mM Imidazole (92% Lysis Buffer A/8% Buffer B) (Buffer
B=50 mM Na-Phosphate, 300 mM NaCl, 10% Glycerol, 10 mM
2-mercaptoethanol, 500 mM Imidazole, pH of the final buffer should
be 7.5). The protein was eluted off of the column with a series of
increasing Imidazole solutions made by adjusting the ratios of
Lysis Buffer A to Buffer B. Three different concentrations were
used: 3 volumes of 75 mM Imidazole, 3 volumes of 150 mM Imidazole,
5 volumes of 500 mM Imidazole. The fractions containing the
purified protein were analyzed using 8%, 10% or 14% SDS-PAGE
depending on the protein size. The purified protein was then
dialyzed 2.times. against phosphate-buffered saline (PBS) in order
to place it into an easily workable buffer. The purified protein
was stored at 4.degree. C. or frozen at -80.degree..
[0216] The following alternative method may be used to purify E.
faecalis expressed in E. coli when it is present in the form of
inclusion bodies. Unless otherwise specified, all of the following
steps are conducted at 4-10.degree. C.
[0217] Upon completion of the production phase of the E. coli
fermentation, the cell culture is cooled to 4-1 0.degree. C. and
the cells are harvested by continuous centrifugation at 15,000 rpm
(Heraeus Sepatech). On the basis of the expected yield of protein
per unit weight of cell paste and the amount of purified protein
required, an appropriate amount of cell paste, by weight, is
suspended in a buffer solution containing 100 mM Tris. 50 mM EDTA,
pH 7.4. The cells are dispersed to a homogeneous suspension using a
high shear mixer.
[0218] The cells are then lysed by passing the solution through a
microfluidizer (Microfuidics, Corp. or APV Gaulin, Inc.) twice at
4000-6000 psi. The homogenate is then mixed with NaCl solution to a
final concentration of 0.5 M NaCl, followed by centrifugation at
7000.times. g for 15 min. The resultant pellet is washed again
using 0.5 M NaCl, 100 mM Tris, 50 mM EDTA, pH 7.4.
[0219] The resulting washed inclusion bodies are solubilized with
1.5 M guanidine hydrochloride (GuHCl) for 2-4 hours. After
7000.times. g centrifugation for 15 min., the pellet is discarded
and the E. faecalis polypeptide-containing supernatant is incubated
at 4.degree. C. overnight to allow further GuHCI extraction.
[0220] Following high speed centrifugation (30,000.times. g) to
remove insoluble particles, the GuHCI solubilized protein is
refolded by quickly mixing the GuHCl extract with 20 volumes of
buffer containing 50 mM sodium, pH 4.5, 150 mM NaCl, 2 mM EDTA by
vigorous stirring. The refolded diluted protein solution is kept at
4.degree. C. without mixing for 12 hours prior to further
purification steps.
[0221] To clarify the refolded E. faecalis polypeptide solution, a
previously prepared tangential filtration unit equipped with 0.16
.mu.m membrane filter with appropriate surface area (e.g.,
Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is
employed. The filtered sample is loaded onto a cation exchange
resin (e.g., Poros HS-50, Perseptive Biosystems). The column is
washed with 40 mM sodium acetate, pH 6.0 and eluted with 250 mM,
500 mM, 1000 mM, and 1500 mM NaCl in the same buffer, in a stepwise
manner. The absorbance at 280 mm of the effluent is continuously
monitored. Fractions are collected and further analyzed by
SDS-PAGE.
[0222] Fractions containing the E. faecalis polypeptide are then
pooled and mixed with 4 volumes of water. The diluted sample is
then loaded onto a previously prepared set of tandem columns of
strong anion (Poros HQ-50, Perseptive Biosystems) and weak anion
(Poros CM-20, Perseptive Biosystems) exchange resins. The columns
are equilibrated with 40 mM sodium acetate, pH 6.0. Both columns
are washed with 40 mM sodium acetate, pH 6.0, 200 mM NaCl. The
CM-20 column is then eluted using a 10 column volume linear
gradient ranging from 0.2 M NaCl, 50 mM sodium acetate, pH 6.0 to
1.0 M NaCl, 50 mM sodium acetate, pH 6.5. Fractions are collected
under constant A.sub.280 monitoring of the effluent. Fractions
containing the E. faecalis polypeptide (determined, for instance,
by 16% SDS-PAGE) are then pooled.
[0223] The resultant E. faecalis polypeptide exhibits greater than
95% purity after the above refolding and purification steps. No
major contaminant bands are observed from Commassie blue stained
16% SDS-PAGE gel when 5 .mu.g of purified protein is loaded. The
purified protein is also tested for endotoxin/LPS contamination,
and typically the LPS content is less than 0.1 ng/ml according to
LAL assays.
Example 2(b)
[0224] Alternative Expression and Purification
Enterococcalpolypeptides in E. Coli
[0225] the vector pQE10 was alternatively used to clone and express
some of the polypeptides of the present invention for use in the
soft tissue and systemic infection models discussed below. The
difference being such that an inserted DNA fragment encoding a
polypeptide expresses that polypeptide with the six His residues
(i.e., a "6.times. His tag") covalently linked to the amino
terminus of that polypeptide. The bacterial expression vector pQE10
(QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, Calif., 91311) was
used in this example. The components of the pQE10 plasmid are
arranged such that the inserted DNA sequence encoding a polypeptide
of the present invention expresses the polypeptide with the six His
residues (i.e., a "6.times. His tag")) covalently linked to the
amino terminus.
[0226] The DNA sequences encoding the desired portions of a
polypeptide of Table 1 were amplified using PCR oligonucleotide
primers from genomic E. faecalis DNA. The PCR primers anneal to the
nucleotide sequences encoding the desired amino acid sequence of a
polypeptide of the present invention. Additional nucleotides
containing restriction sites to facilitate cloning in the pQE 10
vector were added to the 5' and 3' primer sequences,
respectively.
[0227] For cloning a polypeptide of the present invention, the 5'
and 3' primers were selected to amplify their respective nucleotide
coding sequences. One of ordinary skill in the art would appreciate
that the point in the protein coding sequence where the 5' and 3'
primers begins may be varied to amplify a DNA segment encoding any
desired portion of a polypeptide of the present invention. The 5'
primer was designed so the coding sequence of the 6.times. His tag
is aligned with the restriction site so as to maintain its reading
frame with that of E. faecalis polypeptide. The 3' was designed to
include an stop codon. The amplified DNA fragment was then cloned,
and the protein expressed, as described above for the pQE60
plasmid.
[0228] The DNA sequences encoding the amino acid sequences of Table
1 may also be cloned and expressed as fusion proteins by a protocol
similar to that described directly above, wherein the pET-32b(+)
vector (Novagen, 601 Science Drive, Madison, Wis. 53711) is
preferentially used in place of pQE10.
[0229] The above methods are not limited to the polypeptide
fragements actually produced. The above method, like the methods
below, can be used to produce either full length polypeptides or
desired fragements therof.
Example 2(c)
[0230] Alternative Expression and Pitriflcation of Enterococcal
polypeptides in E. Coli
[0231] The bacterial expression vector pQE60 is used for bacterial
expression in this example (QIAGEN, Inc., 9259 Eton Avenue,
Chatsworth, Calif., 91311). However, in this example, the
polypeptide coding sequence is inserted such that translation of
the six His codons is prevented and, therefore, the polypeptide is
produced with no 6.times. His tag.
[0232] The DNA sequence encoding the desired portion of the E.
faecalis amino acid sequence is amplified from an E. faecalis
genomic DNA prep the deposited DNA clones using PCR oligonucleotide
primers which anneal to the 5' and 3' nucleotide sequences
corresponding to the desired portion of the E. faecalis
polypeptides. Additional nucleotides containing restriction sites
to facilitate cloning in the pQE60 vector are added to the 5' and
3' primer sequences.
[0233] For cloning a E. faecalis polypeptides of the present
invention, 5' and 3' primers are selected to amplify their
respective nucleotide coding sequences. One of ordinary skill in
the art would appreciate that the point in the protein coding
sequence where the 5' and 3' primers begin may be varied to amplify
a DNA segment encoding any desired portion of a polypeptide of the
present invention. The 3' and 5' primers contain appropriate
restriction sites followed by nucleotides complementary to the 5'
and 3' ends of the coding sequence respectively. The 3' primer is
additionally designed to include an in-frame stop codon.
[0234] The amplified E. faecalis DNA fragments and the vector pQE60
are digested with restriction enzymes recognizing the sites in the
primers and the digested DNAs are then ligated together. Insertion
of the E. faecalis DNA into the restricted pQE60 vector places the
E. faecalis protein coding region including its associated stop
codon downstream from the IPTG-inducible promoter and in-frame with
an initiating AUG. The associated stop codon prevents translation
of the six histidine codons downstream of the insertion point.
[0235] The ligation mixture is transformed into competent E. coli
cells using standard procedures such as those described by Sambrook
et al. E. coli strain M15/rep4, containing multiple copies of the
plasmid pREP4, which expresses the lac repressor and confers
kanamycin resistance ("Kanr"), is used in carrying out the
illustrative example described herein. This strain, which is only
one of many that are suitable for expressing E. faecalis
polypeptide, is available commercially (QIAGEN, Inc., supra).
Transformants are identified by their ability to grow on LB plates
in the presence of ampicillin and kanamycin. Plasmid DNA is
isolated from resistant colonies and the identity of the cloned DNA
confirmed by restriction analysis, PCR and DNA sequencing.
[0236] Clones containing the desired constructs are grown overnight
("O/N") in liquid culture in LB media supplemented with both
ampicillin (100 .mu.g/ml) and kanamycin (25 .mu.g/ml). The O/N
culture is used to inoculate a large culture, at a dilution of
approximately 1:25 to 1:250. The cells are grown to an optical
density at 600 nm ("OD600") of between 0.4 and 0.6.
isopropyl-b-D-thiogalactopyranoside ("IPTG") is then added to a
final concentration of 1 mM to induce transcription from the lac
repressor sensitive promoter, by inactivating the lacl repressor.
Cells subsequently are incubated further for 3 to 4 hours. Cells
then are harvested by centrifugation.
[0237] To purify the E. faecalis polypeptide, the cells are then
stirred for 3-4 hours at 4.degree. C. in 6 M guanidine-HCl, pH 8.
The cell debris is removed by centrifugation, and the supernatant
containing the E. faecalis polypeptide is dialyzed against 50 mM
Na-acetate buffer pH 6, supplemented with 200 mM NaCl.
Alternatively, the protein can be successfully refolded by
dialyzing it against 500 mM NaCl, 20% glycerol, 25 mM Tris/HCl pH
7.4, containing protease inhibitors. After renaturation the protein
can be purified by ion exchange, hydrophobic interaction and size
exclusion chromatography. Alternatively, an affinity chromatography
step such as an antibody column can be used to obtain pure E.
faecalis polypeptide. The purified protein is stored at 4.degree.
C. or frozen at -80.degree. C.
[0238] The following alternative method may be used to purify E.
faecalis polypeptides expressed in E. coli when it is present in
the form of inclusion bodies. Unless otherwise specified, all of
the following steps are conducted at 4-10.degree. C.
[0239] Upon completion of the production phase of the E. coli
fermentation, the cell culture is cooled to 4-10.degree. C. and the
cells are harvested by continuous centrifugation at 15,000 rpm
(Heraeus Sepatech). On the basis of the expected yield of protein
per unit weight of cell paste and the amount of purified protein
required, an appropriate amount of cell paste, by weight, is
suspended in a buffer solution containing 100 mM Tris, 50 mM EDTA,
pH 7.4. The cells are dispersed to a homogeneous suspension using a
high shear mixer.
[0240] The cells ware then lysed by passing the solution through a
microfluidizer (Microfuidics, Corp. or APV Gaulin, Inc.) twice at
4000-6000 psi. The homogenate is then mixed with NaCi solution to a
final concentration of 0.5 M NaCl, followed by centrifugation at
7000.times. g for 15 min. The resultant pellet is washed again
using 0.5 M NaCl, 100 mM Tris, 50 mM EDTA, pH 7.4.
[0241] The resulting washed inclusion bodies are solubilized with
1.5 M guanidine hydrochloride (GuHCl) for 2-4 hours. After
7000.times. g centrifugation for 15 min., the pellet is discarded
and the E. faecalis polypeptide-containing supernatant is incubated
at 4.degree. C. overnight to allow further GuHCl extraction.
[0242] Following high speed centrifugation (30,000.times. g) to
remove insoluble particles, the GuHCI solubilized protein is
refolded by quickly mixing the GuHCl extract with 20 volumes of
buffer containing 50 mM sodium, pH 4.5, 150 mM NaCl, 2 mM EDTA by
vigorous stirring. The refolded diluted protein solution is kept at
4.degree. C. without mixing for 12 hours prior to further
purification steps.
[0243] To clarify the refolded E. faecalis polypeptide solution, a
previously prepared tangential filtration unit equipped with 0.16
.mu.m membrane filter with appropriate surface area (e.g.,
Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is
employed. The filtered sample is loaded onto a cation exchange
resin (e.g., Poros HS-50, Perseptive Biosystems). The column is
washed with 40 mM sodium acetate. pH 6.0 and eluted with 250 mM,
500 mM, 1000 mM, and 1500 mM NaCl in the same buffer, in a stepwise
manner. The absorbance at 280 mm of the effluent is continuously
monitored. Fractions are collected and further analyzed by
SDS-PAGE.
[0244] Fractions containing the E. faecalis polypeptide are then
pooled and mixed with 4 volumes of water. The diluted sample is
then loaded onto a previously prepared set of tandem columns of
strong anion (Poros HQ-50, Perseptive Biosystems) and weak anion
(Poros CM-20, Perseptive Biosystems) exchange resins. The columns
are equilibrated with 40 mM sodium acetate, pH 6.0. Both columns
are washed with 40 mM sodium acetate, pH 6.0, 200 mM NaCl. The
CM-20 column is then eluted using a 10 column volume linear
gradient ranging from 0.2 M NaCl, 50 mM sodium acetate, pH 6.0 to
1.0 M NaCl, 50 mM sodium acetate, pH 6.5. Fractions are collected
under constant A.sub.280 monitoring of the effluent. Fractions
containing the E. faecalis polypeptide (determined, for instance,
by 16% SDS-PAGE) are then pooled.
[0245] The resultant E. faecalis polypeptide exhibits greater than
95% purity after the above refolding and purification steps. No
major contaminant bands are observed from Commassie blue stained
16% SDS-PAGE gel when 5 .mu.g of purified protein is loaded. The
purified protein is also tested for endotoxin/LPS contamination,
and typically the LPS content is less than 0.1 ng/ml according to
LAL assays.
Example 2(d)
[0246] Cloning and Expression of E. Faecalis in Other Bacteria
[0247] E. faecalis polypeptides can also be produced in: E.
faecalis using the methods of S. Skinner et al., (1988) Mol.
Microbiol. 2:289-297 or J. I. Moreno (1996) Protein Expr. Purif.
8(3):332-340; Lactobacillus using the methods of C. Rush et al.,
1997 Appl. Microbiol. Biotechnol. 47(5):537-542; or in Bacillus
suibtilis using the methods Chang et al., U.S. Pat. No.
4,952,508.
Example 3
[0248] Cloning and Expression in COS Cells
[0249] A E. faecalis expression plasmid is made by cloning a
portion of the DNA encoding a E. faecalis polypeptide into the
expression vector pDNAI/Amp or pDNAIII (which can be obtained from
Invitrogen, Inc.). The expression vector pDNAI/amp contains: (1) an
E. coli origin of replication effective for propagation in E. coli
and other prokaryotic cells; (2) an ampicillin resistance gene for
selection of plasmid-containing prokaryotic cells; (3) an SV40
origin of replication for propagation in eukaryotic cells; (4) a
CMV promoter, a polylinker, an SV40 intron; (5) several codons
encoding a hemagglutinin fragment (i.e., an "HA" tag to facilitate
purification) followed by a termination codon and polyadenylation
signal arranged so that a DNA can be conveniently placed under
expression control of the CMV promoter and operably linked to the
SV40 intron and the polyadenylation signal by means of restriction
sites in the polylinker. The HA tag corresponds to an epitope
derived from the influenza hemagglutinin protein described by
Wilson et al. 1984 Cell 37:767. The fusion of the HA tag to the
target protein allows easy detection and recovery of the
recombinant protein with an antibody that recognizes the HA
epitope. pDNAIII contains, in addition, the selectable neomycin
marker.
[0250] A DNA fragment encoding a E. faecalis polypeptide is cloned
into the polylinker region of the vector so that recombinant
protein expression is directed by the CMV promoter. The plasmid
construction strategy is as follows. The DNA from a E. faecalis
genomic DNA prep is amplified using primers that contain convenient
restriction sites, much as described above for construction of
vectors for expression of E. faecalis in E. coli. The 5' primer
contains a Kozak sequence, an AUG start codon, and nucleotides of
the 5' coding region of the E. faecalis polypeptide. The 3' primer,
contains nucleotides complementary to the 3' coding sequence of the
E. faecalis DNA. a stop codon, and a convenient restriction
site.
[0251] The PCR amplified DNA fragment and the vector, pDNAI/Amp,
are digested with appropriate restriction enzymes and then ligated.
The ligation mixture is transformed into an appropriate E. coli
strain such as SURE.TM. (Stratagene Cloning Systems, La Jolla,
Calif. 92037), and the transformed culture is plated on ampicillin
media plates which then are incubated to allow growth of ampicillin
resistant colonies. Plasmid DNA is isolated from resistant colonies
and examined by restriction analysis or other means for the
presence of the fragment encoding the E. faecalis polypeptide
[0252] For expression of a recombinant E. faecalis polypeptide, COS
cells are transfected with an expression vector, as described
above, using DEAE-dextran, as described, for instance, by Sambrook
et al. (supra). Cells are incubated under conditions for expression
of E. faecalis by the vector.
[0253] Expression of the E. faecalis-HA fusion protein is detected
by radiolabeling and immunoprecipitation, using methods described
in, for example Harlow et al., supra. To this end, two days after
transfection, the cells are labeled by incubation in media
containing .sup.35S-cysteine for 8 hours. The cells and the media
are collected, and the cells are washed and the lysed with
detergent-containing RIPA buffer: 150 mM NaCl, 1% NP-40, 0.1% SDS,
1% NP-40, 0.5% DOC, 50 mM TRIS, pH 7.5, as described by Wilson et
al. (supra ). Proteins are precipitated from the cell lysate and
from the culture media using an HA-specific monoclonal antibody.
The precipitated proteins then are analyzed by SDS-PAGE and
autoradiography. An expression product of the expected size is seen
in the cell lysate, which is not seen in negative controls.
Example 4
[0254] Cloning and Expression in CHO Cells
[0255] The vector pC4 is used for the expression of E. faecalis
polypeptide in this example. Plasmid pC4 is a derivative of the
plasmid pSV2-dhfr (ATCC Accession No. 37146). The plasmid contains
the mouse DHFR gene under control of the SV40 early promoter.
Chinese hamster ovary cells or other cells lacking dihydrofolate
activity that are transfected with these plasmids can be selected
by growing the cells in a selective medium (alpha minus MEM, Life
Technologies) supplemented with the chemotherapeutic agent
methotrexate. The amplification of the DHFR genes in cells
resistant to methotrexate (MTX) has been well documented. See,
e.g., Alt et al., 1978, J. Biol. Chem. 253:1357-1370; Hamlin et
al., 1990, Biochem. et Biophys. Acta, 1097:107-143; Page et al.,
1991, Biotechnology 9:64-68. Cells grown in increasing
concentrations of MTX develop resistance to the drug by
overproducing the target enzyme, DHFR, as a result of amplification
of the DHFR gene. If a second gene is linked to the DHFR gene, it
is usually co-amplified and over-expressed. It is known in the art
that this approach may be used to develop cell lines carrying more
than 1,000 copies of the amplified gene(s). Subsequently, when the
methotrexate is withdrawn, cell lines are obtained which contain
the amplified gene integrated into one or more chromosome(s) of the
host cell.
[0256] Plasmid pC4 contains the strong promoter of the long
terminal repeat (LTR) of the Rouse Sarcoma Virus, for expressing a
polypeptide of interest, Cullen, et al. (1985) Mol. Cell. Biol.
5:438-447; plus a fragment isolated from the enhancer of the
immediate early gene of human cytomegalovirus (CMV), Boshart, et
al., 1985, Cell 41:521-530. Downstream of the promoter are the
following single restriction enzyme cleavage sites that allow the
integration of the genes: Bam HI, Xba I, and Asp 718. Behind these
cloning sites the plasmid contains the 3' intron and
polyadenylation site of the rat preproinsulin gene. Other high
efficiency promoters can also be used for the expression, e.g., the
human .beta.-actin promoter, the SV40 early or late promoters or
the long terminal repeats from other retroviruses, e.g., HIV and
HTLVI. Clontech's Tet-Off and Tet-On gene expression systems and
similar systems can be used to express the E. faecalis polypeptide
in a regulated way in mammalian cells (Gossen et al., 1992, Proc.
Natl. Acad. Sci. USA 89:5547-5551. For the polyadenylation of the
mRNA other signals, e.g., from the human growth hormone or globin
genes can be used as well. Stable cell lines carrying a gene of
interest integrated into the chromosomes can also be selected upon
co-transfection with a selectable marker such as gpt, G418 or
hygromycin. It is advantageous to use more than one selectable
marker in the beginning, e.g., G418 plus methotrexate.
[0257] The plasmid pC4 is digested with the restriction enzymes and
then dephosphorylated using calf intestinal phosphates by
procedures known in the art. The vector is then isolated from a 1%
agarose gel. The DNA sequence encoding the E. faecalis polypeptide
is amplified using PCR oligonucleotide primers corresponding to the
5' and 3' sequences of the desired portion of the gene. A 5' primer
containing a restriction site, a Kozak sequence, an AUG start
codon, and nucleotides of the 5' coding region of the E. faecalis
polypeptide is synthesized and used. A 3' primer, containing a
restriction site, stop codon, and nucleotides complementary to the
3' coding sequence of the E. faecalis polypeptides is synthesized
and used. The amplified fragment is digested with the restriction
endonucleases and then purified again on a 1% agarose gel. The
isolated fragment and the dephosphorylated vector are then ligated
with T4 DNA ligase. E. coli HB101 or XL-1 Blue cells are then
transformed and bacteria are identified that contain the fragment
inserted into plasmid pC4 using, for instance, restriction enzyme
analysis.
[0258] Chinese hamster ovary cells lacking an active DHFR gene are
used for transfection. Five .mu.g of the expression plasmid pC4 is
cotransfected with 0.5 .mu.g of the plasmid pSVneo using a
lipid-mediated transfection agent such as Lipofectin.TM. or
LipofectAMINE.TM. (LifeTechnologies Gaithersburg, MD). The plasmid
pSV2-neo contains a dominant selectable marker, the neo gene from
Tn5 encoding an enzyme that confers resistance to a group of
antibiotics including G418. The cells are seeded in alpha minus MEM
supplemented with 1 mg/ml G418. After 2 days, the cells are
trypsinized and seeded in hybridoma cloning plates (Greiner,
Germany) in alpha minus MEM supplemented with 10, 25, or 50 ng/ml
of methotrexate plus 1 mg/ml G418. After about 10-14 days single
clones are trypsinized and then seeded in 6-well petri dishes or 10
ml flasks using different concentrations of methotrexate (50 nM,
100 nM, 200 nM, 400 nM, 800 nM). Clones growing at the highest
concentrations of methotrexate are then transferred to new 6-well
plates containing even higher concentrations of methotrexate (1
.mu.M, 2 .mu.M, 5 .mu.M, 10 mM, 20 mM). The same procedure is
repeated until clones are obtained which grow at a concentration of
100-200 .mu.M. Expression of the desired gene product is analyzed,
for instance, by SDS-PAGE and Western blot or by reversed phase
HPLC analysis.
Example 5
[0259] Quantitative Murine Soft Tissue Infection Model for E.
Faecalis
[0260] Compositions of the present invention, including
polypeptides and peptides, are assayed for their ability to
function as vaccines or to enhance/stimulate an immune response to
a bacterial species (e.g., E. faecalis) using the following
quantitative murine soft tissue infection model. Mice (e.g., NIH
Swiss female mice, approximately 7 weeks old) are first treated
with a biologically protective effective amount, or immune
enhancing/stimulating effective amount of a composition of the
present invention using methods known in the art, such as those
discussed above. See,e.g., Harlow et al., ANTIBODIES: A LABORATORY
MANUAL, (Cold Spring Harbor Laboratory Press, 2nd ed. 1988). An
example of an appropriate starting dose is 20 .mu.g per animal.
[0261] The desired bacterial species used to challenge the mice,
such as E. faecalis, is grown as an overnight culture. The culture
is diluted to a concentration of 5.times.10.sup.8 cfu/ml, in an
appropriate media, mixed well, serially diluted, and titered. The
desired doses are further diliuted 1:2 with sterilized Cytodex 3
microcarrier beads preswollen in sterile PBS (3 g/100 ml). Mice are
anesthetize briefly until docile, but still mobile and injected
with 0.2 ml of the Cytodex 3 bead/bacterial mixture into each
animal subcutaneously in the inguinal region. After four days,
counting the day of injection as day one, mice are sacrificed and
the contents of the abscess is excised and placed in a 15 ml
conical tube containing 1.0 ml of sterile PBS. The contents of the
abscess is then enzymatically treated and plated as follows.
[0262] The abscess is first disrupted by vortexing with sterilized
glass beads placed in the tubes. 3.Omls of prepared enzyme mixture
(1.0 ml Collagenase D (4.0 mg/ml), 1.0 ml Trypsin (6.0 mg/ml) and
8.0 mls PBS) is then added to each tube followed by a 20 min.
incubation at 37 C. The solution is then centrifuged and the
supernatant drawn off. 0.5 ml dH20 is then added and the tubes are
vortexed and then incubated for 10 min. at room temperature. 0.5 ml
media is then added and samples are serially diluted and plated
onto agar plates, and grown overnight at 37 C. Plates with distinct
and separate colonies are then counted, compared to positive and
negative control samples, and quantified. The method can be used to
identify composition and determine appropriate and effective doses
for humans and other animals by comparing the effective doses of
compositions of the present invention with compositions known in
the art to be effective in both mice and humans. Doses for the
effective treatment of humans and other animals, using compositions
of the present invention, are extrapolated using the data from the
above experiments of mice. It is appreciated that further studies
in humans and other animals may be needed to determine the most
effective doses using methods of clinical practice known in the
art.
Example 6
[0263] Murine Systemic Neutropenic Model for E. Faecalis
Infection
[0264] Compositions of the present invention, including
polypeptides and peptides, are assayed for their ability to
function as vaccines or to enhance/stimulate an immune response to
a bacterial species (e.g., E. faecalis) using the following
qualitative murine systemic neutropenic model. Mice (e.g., NIH
Swiss female mice, approximately 7 weeks old) are first treated
with a biologically protective effective amount, or immune
enhancing/stimulating effective amount of a composition of the
present invention using methods known in the art, such as those
discussed above. See, e.g., Harlow et al., ANTIBODIES: A LABORATORY
MANUAL, (Cold Spring Harbor Laboratory Press, 2nd ed. 1988). An
example of an appropriate starting dose is 20 .mu.g per animal.
Mice are then injected with 250-300 mg/kg cyclophosphamide
intraperitonially. Counting the day of C.P. injection as day one,
the mice are left untreated for 5 days to begin recovery of
PMNL'S.
[0265] The desired bacterial species used to challenge the mice,
such as E. faecalis, is grown as an overnight culture. The culture
is diluted to a concentration of 5.times.10.sup.8 cfu/ml, in an
appropriate media, mixed well, serially diluted, and titered. The
desired doses are further diliuted 1:2 in 4% Brewer's yeast in
media. Mice are injected with the bacteria/brewer's yeast challenge
intraperitonially. The Brewer's yeast solution alone is used as a
control. The mice are then monitered twice daily for the first week
following challenge, and once a day for the next week to ascertain
morbidity and mortality. Mice remaining at the end of the
experiment are sacrificed. The method can be used to identify
compositions and determine appropriate and effective doses for
humans and other animals by comparing the effective doses of
compositions of the present invention with compositions known in
the art to be effective in both mice and humans. Doses for the
effective treatment of humans and other animals, using compositions
of the present invention, are extrapolated using the data from the
above experiments of mice. It is appreciated that further studies
in humans and other animals may be needed to determine the most
effective doses using methods of clinical practice known in the
art.
[0266] The disclosure of all publications (including patents,
patent applications, journal articles, laboratory manuals, books,
or other documents) cited herein are hereby incorporated by
reference in their entireties.
[0267] The present invention is not to be limited in scope by the
specific embodiments described herein, which are intended as single
illustrations of individual aspects of the invention. Functionally
equivalent methods and components are within the scope of the
invention, in addition to those shown and described herein and will
become apparant to those skilled in the art from the foregoing
description and accompanying drawings. Such modifications are
intended to fall within the scope of the appended claims.
1TABLE 1 Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
EF001-1 (SEQ ID NO:1) TGAAAGAATA TTGCCAGAAC GTGGCGAGCA AATTGTTTTA
TAAATTTTTT TAAGGGAGAG AAAAAAATGA AGTTCAAAAC TCTAGCAACA ACAGTGTTAG
CAACCGCAGC TATTTTCGCA TTGGGGGCTT GTGGTAACGG TAATGGGGCC AAAGAATCAA
ACGATATTGT GAAAGAAGTG AAGGAAGATA CGACAATCAC TTTCTCGCAT GCAATGAATG
GGGTTCAAGA AGAAGCGTTA ACAAAATTAA CGAAAGACTT CATGAAAGAA AATCCAAAAA
TTAAACTGGA ATTACAAAAT CAATCTGCTT ACCCTGATTT ACAAGCCAAA ATCAATTCGA
CTTTAACTTC ACCAAAAGAT TTACCAACAA TTACGCAAGC GTACCCAGGC TGGTTATGGA
ATGCTGCACA AGATGAAATG TTAGTGGACT TAAAACCATA TATGGATGAT GACACAATCG
GCTGGAAAGA TGCAGAGCCA ATTCGTGAAG TATTGTTAGA CGGCGCCAAA ATCGACGGCA
AACAATACGG CATTCCATTT AATAAATCGA CAGAAATGTT ATTCTATAAT GCTGATTTGT
TGAAAGAATA TGGTGTTGAA GTACCGAAAA CATTAGAGGA ATTAAAAGAA GCTTCTAAAA
CAATTTACGA AAAATCCAAC AAAGAAGTCG TTGGTGCTGG TTTTGACTCG TTAAATAACT
ATTACGCAAT TGGAATGAAA AACAAAGGCG TTGATTTTAA TAAAGACTTA GATTTAACAA
GCAAAGATTC ACAAGAAGTC GTGGACTATT ACCGTGATGG TATCGAAGCA GGTTACTTCC
GCACAGCTGG TTCAGATAAA TATTTATCTG GCCCATTTGC AAACAAAAAG GTAGCAATGT
TTGTCGGTAG TATTGCTGGT GCTGGTTTTG TTCAAAAAGA TGCTGAAGCT GGTGGCTATG
AATACGGTGT TGCACCACGT CCTGAAAAAA TCAACTTACA ACAAGGAACA GATATTTATA
TGTTCGATAG TGCTACGCCA GAACAACGGA CAGCGGCATT TGAATTCATG AAATTCTTAG
CTACTCCTGA TTCACAATTG TACTGGGCAC AACAAACAGG TTATATGCCA ATTTTAGAAT
CTGTTTTACA CAGTGATGAG TACAAAAATT CTAAGACAAC CAAAGTACCT GCACAACTTG
AAAACGCAGT AAAAGATTTA TTCGCTATCC CAGTAGAAGA AAATGCTGAT TCAGCCTATA
ATGAAATGCG GACAATTATG GAAAGTATTT TTGCTTCATC AAATAAAGAC ACGAGAAAAT
TATTGAAAGA TGCAACATCA CAATTTGAAC AAGCATGGAA CCAATAA EF001-2 (SEQ ID
NO:2) MKFKTLATT VLATAAIFAL GACGNGNGAK ESNDIVKEVK EDTTITFWHA
MNGVQEEALT KLTKDFMKEN PKIKVELQNQ SAYPDLQAKI NSTLTSPKDL PTITQAYPGW
LWNAAQDEML VDLKPYMDDD TIGWKDAEPI REVLLDGAKI DGKQYGIPFN KSTEMLFYNA
DLLKEYGVEV PKTLEELKEA SKTIYEKSNK EVVGAGFDSL NNYYAIGMKN KGVDFNKDLD
LTSKDSQEVV DYYRDGIEAG YFRTAGSDKY LSGPFANKKV AMFVGSIAGA GFVQKDAEAG
GYEYGVAPRP EKINLQQGTD IYMFDSATPE QRTAAFEFMK FLATPDSQLY WAQQTGYMPI
LESVLHSDEY KNSKTTKVPA QLENAVKDLF AIPVEENADS AYNEMRTIME SIFASSNKDT
RKLLKDATSQ FEQAWNQ EF001-3 (SEQ ID NO:3) TT GTGGTAACGG TAATGGGGCC
AAAGAATCAA ACGATATTGT GAAAGAAGTG AAGGAAGATA CGACAATCAC TTTCTGGCAT
GCAATGAATG GGGTTCAAGA AGAAGCGTTA ACAAAATTAA CGAAAGACTT CATGAAAGAA
AATCCAAAAA TTAAAGTGGA ATTACAAAAT CAATCTGCTT ACCCTGATTT ACAAGCCAAA
ATCAATTCGA CTTTAACTTC ACCAAAAGAT TTACCAACAA TTACGCAAGC GTACCCAGGC
TGGTTATGGA ATGCTGCACA AGATGAAATG TTAGTGGACT TAAAACCATA TATGGATGAT
GACACAATCG GCTGGAAAGA TGCAGAGCCA ATTCGTGAAG TATTGTTAGA CGGCGCCAAA
ATCGACGGCA AACAATACGG CATTCCATTT AATAAATCGA CAGAAATGTT ATTCTATAAT
GCTGATTTGT TGAAAGAATA TGGTGTTGAA GTACCGAAAA CATTAGAGGA ATTAAAAGAA
GCTTCTAAAA CAATTTACGA AAAATCCAAC AAAGAAGTCG TTGGTGCTGG TTTTGACTCG
TTAAATAACT ATTACGCAAT TGGAATGAAA AACAAAGGCG TTGATTTTAA TAAAGACTTA
GATTTAACAA GCAAAGATTC ACAAGAAGTC GTGGACTATT ACCGTGATGG TATCGAAGCA
GGTTACTTCC GCACAGCTGG TTCAGATAAA TATTTATCTG GCCCATTTGC AAACAAAAAG
GTAGCAATGT TTGTCGGTAG TATTGCTGGT GCTGGTTTTG TTCAAAAAGA TGCTGAAGCT
GGTGGCTATG AATACGGTGT TGCACCACGT CCTGAAAAAA TCAACTTACA ACAAGGAACA
GATATTTATA TGTTCGATAG TGCTACGCCA GAACAACGGA CAGCGGCATT TGAATTCATG
AAATTCTTAG CTACTCCTGA TTCACAATTG TACTGGGCAC AACAAACAGG TTATATGCCA
ATTTTAGAAT CTGTTTTACA CAGTGATGAG TACAAAAATT CTAAGACAAC CAAAGTACCT
GCACAACTTG AAAACGCAGT AAAAGATTTA TTCGCTATCC CAGTAGAAGA AAATGCTGAT
TCAGCCTATA ATGAAATGCG GACAATTATG GAAAGTATTT TTGCTTCATC AAATAAAGAC
ACGAGAAAAT TATTGAAAGA TGCAACATCA CAATTTGAAC AAGCATGGAA CCAA EF001-4
(SEQ ID NO:4) CGNGNGAK ESNDTVKEVK EDTTITFWHA MNGVQEEALT KLTKDFMKEN
PKIKVELQNQ SAYPDLQAKI NSTLTSPKDL PTITQAYPGW LWNAAQDEML VDLKPYMDDD
TIGWKDAEPI REVLLDGAKI DGKQYGIPFN KSTEMLFYNA DLLKEYGVEV PKTLEELKEA
SKTTYEKSNK EVVGAGFDSL NNYYALGMKN KGVDFNKDLD LTSKDSQEVV DYYRDGTEAG
YFRTAGSDKY LSGPFANKKV AMFVGSIAGA GPVQKDAEAG GYEYGVAPRP EKINLQQGTD
IYMFDSATPE QRTAAFEFMK FLATPDSQLY WAQQTGYNPI LESVLHSDEY KNSKTTKVPA
QLENAVKDLF AIPVEENADS AYNEMRTIME SIFASSNKDT RKLLKDATSQ FEQAWNQ
EF002-1 (SEQ ID NO:5) TAAATAGCGG AGGTAGTACA AATGAAATTT TGGAAAAAAG
GCTTAACAGC GGCAGCGCTG TTAGCAGTGG CGGCAGTAAC TTTAACAGCA TGTCGTGGTT
CAAGTGAAAA GAAAGCAACT GAAAAGAGTG AAGATGGCAA AACAAAATTA ACAGTAACTA
CTTGGAATTA TGACACGACC CCAGAATTTG AGAAATTATT CAGAGCTTTT GAAGCGGAAA
ATCCTGATAT CACTATTGAA CCGGTGGACA TTGCTTCAGA TGATTATGAC ACAAAAGTAA
CAACGATGCT TTCATCAGGA GATACGACGG ATATTTTAAC CATGAAAAAC TTACTTTCAT
ATTCTAATTA CGCGCTACGC AATCAATTGG TGGATTTAAC CGATCACGTT AAAGATTTAG
ATATCGAACC TGCCAAAGCA AGTTACGAGA TGTATGAAAT CGATGGTAAA ACCTATGCTC
AGCCTTACCG TACAGATTTC TGGGTATTGT ATTACAATAA AM,AATGTTT GATGAAGCCG
GAATTGCCTA TCCCGATAAC TTAACTTGGG ATGAATATGA AGCGTTAGCG AAAAAATTAT
CTAAACCAGA AGAACAAGTA TATCGTGCCT ATCAACATAC TTGGCGCTCA ACCGTTCAAG
CGATTGCTGC TGCTCAAAAC AATGCCAATT TGATTGAACC AAAATACAAT TATATGGAAA
CTTATTATGA TCGCGCATTG AGAATGCAAA AAGATCAATC ACAAATGGAT TTTGGAACAG
CAAAATCAAC AAAAGTAACG TATCAATCAC AATTTGAAAA TTCAAAAGCG GCGATGATGT
ACATGGGTAG CTGGTACATG GGGACTTTAT TAACAAACAT TGATGATGGC AAAACAAATG
TCGAATGGGG GATTGCCGAA ATACCACAAC AAGAAAAAGG CAAAGCAACT ACCTTTGGCT
CACCGACAAG TTTTGCAATT AATAAAAACA GTAAAAAACA AAAAGCTGCT CAAAAATTCT
TAGACTTTGC TTCAGGTAAA GAAGGTGCAA AACTTTTAGC AGAAGTAGGG GTGGTTCCTT
CTTATAAAAC AGATGAAATT GATAAAATCT ACTTTGCAAG AAAAGGAATG CCTTCAGACG
AGTCTCACAA AAACCCTTTA ACCCAGATAC AATTAATTTA G EF002-2 (SEQ ID NO:6)
MKFW KKGLTAAALL AVAAVTLTAC GGSSEKKATE KSEDGKTKLT VTTWNYDTTP
EFEKLFRAFE AENPDITIEP VDIASDDYDT KVTTMLSSGD TTDILTMKNL LSYSNYALRN
QLVDLTDHVK DLDIEPAKAS YEMYEIDGKT YAQPYRTDFW VLYYNKKMFD EAGIAYPDNL
TWDEYEALAK KLSKPEEQVY GAYQHTWRST VQAIAAAQNN ANLIEPKYNY METYYDRALR
MQKDQSQMDF GTAKSTKVTY QSQFENSKAA MMYMGSWYMG TLLTNIDDCK TNVEWGIAEI
PQQEKGKATT FGSPTSFAIN KNSKKQKAAQ KFLDFASGKE GAKLLAEVGV VPSYKTDEID
KIYFARKGMP SDESHKKPLT QIQLT EF002-3 (SEQ ID NO:7) A TGTGGTGGTT
CAAGTGAAAA GAAAGCAACT GAAAAGAGTG AAGATGGCAA AACAAAATTA ACAGTAACTA
CTTGGAATTA TGACACCACC CCAGAATTTG AGAAATTATT CAGAGCTTTT GAAGCGGAAA
ATCCTGATAT CACTATTGAA CCGGTGGACA TTGCTTCAGA TGATTATGAC ACAAAAGTAA
CAACGATGCT TTCATCAGGA GATACGACGG ATATTTTAAC CATGAAAAAC TTACTTTCAT
ATTCTAATTA CGCGCTACGC AATCAATTGG TGGATTTAAC CGATCACGTT AAAGATTTAG
ATATCGAACC TGCCAAAGCA AGTTACGAGA TGTATGAAAT CGATGGTAAA ACCTATGCTC
AGCCTTACCG TACAGATTTC TGGGTATTGT ATTACAATAA AAAAATGTTT GATGAAGCCG
GAATTGCCTA TCCCGATAAC TTAACTTGGG ATGAATATGA AGCGTTAGCG AAAAAATTAT
CTAAACCAGA AGAACAAGTA TATGGTGCCT ATCAACATAC TTGGCGCTCA ACCGTTCAAG
CGATTGCTGC TGCTCAAAAC AATGCCAATT TGATTGAACC AAAATACAAT TATATGGAAA
CTTATTATGA TCGCGCATTG AGAATGCAAA AAGATCAATC ACAAATGCAT TTTGGAACAG
CAAAATCAAC AAAAGTAACG TATCAATCAC AATTTGAAAA TTCAAAAGCG GCGATGATGT
ACATGGGTAG CTGGTACATG GGGACTTTAT TAACAAACAT TGATGATGGC AAAACAAATG
TCGAATGGGG GATTGCCGAA ATACCACAAC AAGAAAAAGG CAAAGCAACT ACCTTTGGCT
CACCGACAAG TTTTGCAATT AATAAAAACA GTAAAAAACA AAAAGCTGCT CAAAAATTCT
TAGACTTTGC TTCAGGTAAA CAAGGTGCAA AACTTTTAGC AGAAGTAGGG GTGGTTCCTT
CTTATAAAAC AGATGAAATT GATAAAATCT ACTTTGCAAG AAAAGGAATG CCTTCAGACG
AGTCTCACAA AAAGCCTTTA ACCCAGATAC AATTAATT EF002-4 (SEQ ID NO:8) C
GGSSEKKATE KSEDGKTKLT VTTWNYDTTP EFEKLFRAFE AENPDITIEP VDIASDDYDT
KVTTMLSSGD TTDILTMKNL LSYSNYALRN QLVDLTDHVK DLDIEPAKAS YEMYEIDGKT
YAQPYRTDFW VLYYNKKMFD EAGIAYPDNL TWDEYEALAK KLSKPEEQVY GAYQHTWRST
VQAIAAAQNN ANLIEPKYNY METYYDRALR MQKDQSQMDF GTAKSTKVTY QSQFENSKAA
MMYMGSWYMG TLLTNIDDGK TNVEWGIAEI PQQEKGKATT FGSPTSFAIN KNSKKQKAAQ
KFLDFASGKE GAKLLAEVGV VPSYKTDEID KTYFARKGMP SDESHKKPLT QIQLI
EF003-1 (SEQ ID NO:9) TAGGAGGACA AAAGAATGAA GAAGTTTTAT TTAGCNACAT
TCGCTGTTAT TGCAACAGTT ATTTTAGCTG CCTGTGGGGG AAATAAACAA GCAGACCAGA
AAGAAGACAA GGAGATTACC GTTGCCGTGC AATTGGAATC TTCAAAAGAT ATCTTGGAGA
TTGCCAAGAA AGAAGCTGAG AAAAAAGGGT ACAAAATTAA CATTATGGAA GTGAGCGACA
ATGTTGCCTA CAACGATGCC GTGCAACATG ACGAAGCGGA TGCTAATTTT GCGCAACATC
AACCCTTCAT GGAAATGTTT AACAAAGAGA AAAAAGCTGA TTTAGTGGCT GTGCAACCGA
TTTATTATTT TGCTGGTGGT TTCTATTCAA AAGAATACCA AGATGCGAAA GATTTACCTG
AAAATGCCAA AGTGGGGATT CCTAGCGATC CAACCAATGA AGGTCGTGCT TTAGCAATTT
TAAATGCAAA CGGCGTGATT AAATTAAAAG AAGGTGTCGG CTTTAACGGC ACGGTGGCAG
ATGTCGTGGA AAATCCTAAA AACATCACTT TTGAAAGCAT TGATTTACTG AATTTAGCTA
AAGCCTATGA TGAAAAAGAC ATCGCTATGG TGTTCTGCTA CCCAGCCTAC TTAGAACCTG
CTGGTTTAAC AACGAAAGAT GCGATCTTGT TAGAAGATAA AGAAGCAAGT AAACATTACG
CATTGCAAGT TGTGACACGC AAAGGCGAAA AACATAGCGA AAAAATCAAG GTTTTAAAAG
AAGCGATGAC AACAAAAGAA GTTGCTGAAT ACATCAAGAA AAATTCTAAA GGCGCCAATA
TTCCTGCGTT TTAA EF003-2 (SEQ ID NO:10) MKKFYL ATFAVIATVI LAACGGNKQA
DQKEDKEITV AVQLESSKDI LEIAKKEAEK KGYKINIMEV SDNVAYNDAV QHDEADANFA
QHQPFMEMFN KEKKADLVAV QPIYYFAGGF YSKEYQDAKD LPENAKVCIP SDPTNEGRAL
AILNANGVIK LKEGVGFNGT VADVVENPKN ITFESTDLLN LAKAYDEKUT AMVFCYPAYL
EPAGLTTKDA ILLEDKEASK HYALQVVTRK GEKDSEKTKV LKEAMTTKEV AEYTKKNSKG
ANIPAF EF003-3 (SEQ ID NO:11) CTGTGGGGG AAATAAACAA GCAGACCAGA
AAGAAGACAA GGAGATTACC GTTGCCGTGC AATTGGAATC TTCAAAAGAT ATCTTGGAGA
TTGCCAAGAA AGAAGCTGAG AAAAAAGGGT ACAAAATTAA CATTATGGAA GTGAGCGACA
ATCTTGCCTA CAACGATGCC GTGCAACATG ACGAAGCGGA TGCTAATTTT GCGCAACATC
AACCCTTCAT GGAAATGTTT AACAAAGAGA AAAAAGCTGA TTTAGTGGCT GTGCAACCGA
TTTATTATTT TGCTGGTGGT TTCTATTCAA AAGAATACCA AGATGCGAAA GATTTACCTG
AAAATGCCAA AGTGGGGATT CCTAGCGATC CAACCAATGA AGGTCGTGCT TTAGCAATTT
TAAATGCAAA CGGCGTGATT AAATTAAAAG AAGGTGTCGG CTTTAACGGC ACGGTGGCAG
ATGTCGTGGA AAATCCTAAA AACATCACTT TTGAAAGCAT TGATTTACTG AATTTAGCTA
AAGCCTATGA TGAAAAAGAC ATCGCTATGG TGTTCTGCTA CCCAGCCTAC TTAGAACCTG
CTGGTTTAAC AACGAAAGAT GCGATCTTGT TAGAAGATAA AGAAGCAAGT AAACATTACG
CATTGCAAGT TGTGACACGC AAAGGCGAAA AAGATAGCGA AAAAATCAAG GTTTTAAAAG
AAGCGATGAC AACAAAAGAA GTTGCTGAAT ACATCAAGAA AAATTCTAAA GGCGCCAATA
TTCCTGCGTT T EF003-4 (SEQ ID NO:12) CGGNKQA DQKEDKEITV AVQLESSKDI
LEIAKKEAEK KGYKINTMEV SDNVAYNDAV QHDEADANFA QHQPFNEMFN KEKKADLVAV
QPIYYFAGGF YSKEYQDAKD LPENAKVGIP SDPTNEGRAL AILNANGVIK LKEGVGFNGT
VADVVENPKN ITFESIDLLN LAKAYDEKDI ANVECYPAYL EPAGLTTKDA ILLEDKEASK
HYALQVVTRK GEKDSEKIKV LKEANTTKEV AEYIKKNSKG ANIPAF EF004-1 (SEQ ID
NO:13) TAAATCGAAA GAAGGATGAT AGAAATGAAA AAAATGATTA AATTTGCAGG
CATTGCTCTT ATTTTTGCAG CTCTTCTCTC TGCCTGTAGC AACGCAAAAA ATAATACACA
AAAGAAAGCC GAAACTGCTG CCCAGTCAAG CACTATTGAA GCTTCAGACA GTAACGAAAA
CGAGCCTAAT ACAGAAAACA TAACCCAAGC AGTTAAACAG TTAGAAGAAA AATTTAACTC
TGACGAGAAA TTAGTAAAAA TAGATGTTAA AAATAATGTT AAAGATGACA CATCAGATAA
CCCTCACGCT GTCATTACGG TTAAGGTAAT TAATGATGAA GCAAAAAAAA ATATGGAAGA
AATGCAGACT GCGATAGATT CCAACTCAGG TACAGAGGCA CAAAAGACTG CCATATACGG
AATTCAATTA AATGTTGAAG AAGTAGCCAA AACATTAGAA AATGATAACG ATGTTATTTC
TTTCATCACA CCTTACACGA ATGGGAACGA CAGAACCATA GCAAAATCAA CTAAAAATGA
AAATATTATT CCGTTAGTAA AATAA EF004-2 (SEQ ID NO:14) MKK MIKFAGIALI
FAALLSACSN AKNNTQKKAE TAAQSSTIEA SDSNENEPNT ENITQAVKQL EEKFNSDEKL
VKIDVKNNVK DDTSDNPHAV ITVKVINDEA KKNMEEMQTA IDSNSGTEAQ KTAIYGIQLN
VEEVAKTLEN DNDVISFITP YTNGNDRTIA KSTKNENIIP LVK EF004-3 (SEQ ID
NO:15) CTGTAGC AACGCAAAAA ATAATACACA AAAGAAAGCC GAAACTGCTG
CCCAGTCAAG CACTATTGAA GCTTCAGACA GTAACGAAAA CGAGCCTAAT ACAGAAAACA
TAACCCAAGC ACTTAAACAG TTAGAAGAAA AATTTAACTC TGACGAGAAA TTAGTAAAAA
TAGATGTTAA AAATAATGTT AAAGATGACA CATCAGATAA CCCTCACGCT GTCATTACGG
TTAAGGTAAT TAATGATGAA GCAAAAAAAA ATATGGAAGA AATGCAGACT GCGATAGATT
CCAACTCAGG TACAGAGOCA CAAAAGACTG CCATATACGG AATTCAATTA AATGTTGAAG
AAGTAGCCAA AACATTAGAA AATGATAACG ATGTTATTTC TTTCATCACA CCTTACACGA
ATGGGAACGA CAGAACCATA GCAAAATCAA CTAAAAATGA AAATATTATT CCGTTAGTAA
AA EF004-4 (SEQ ID NO:16) CSN AKNNTQKKAE TAAQSSTIEA SDSNENEPNT
ENITQAVKQL EEKFNSDEKL VKIDVKNNVK DDTSDNPHAV ITVKVINDEA KKNMEEMQTA
IDSNSGTEAQ KTAIYGIQLN VEEVAKTLEN DNDVISFITP YTNGNDRTIA KSTKNENIIP
LVK EF005-1 (SEQ ID NO:17) TAAAAAATGA AAAAACGATT GACGATTGTG
GGGATGCTTT TTCTGGCCAT TTTAGTAATG GTTGGTTGTG GTAAAAATCA GCAAGCAACG
ACAAAAGAAA AAGAGACAAA ACCTGAAGAA CTAACTCTTT ACATTGTGCG CCACGGAAAA
ACCATGTTAA ATACGACGGA CCGCGTACAA GGATGGTCAG ATGCGGTCCT AACACCAGAA
GGTGAAAAAG TTGTGACAGC AACTGGGATT GGACTGAAAG ATGTTGCCTT TCAAAATGCA
TATAGTAGTG ATAGTGGCCG CGCCTTGCAA ACTGCTCAAC TTATTTTAGA TCAAAATAAA
GCAGGCAAAG ACCTTGAAGT CGTGCGTGAC CCAGATTTAC GTGAATTTAA TTTTGGTAGC
TATGAAGGGG ATTTAAATAA GACAATGTGG CAGGATATTG CTGATGATCA AGGTGTTTCC
TTAGAAGAAT TTATGAAAAA CATGACTCCT GAATCCTTTG CCAATAGTGT AGCTAAACTG
GATCAACAGC GCGAGGAAAG CAAGAATAAC TGGCCTGCAG AAGACTATGC TACAATTACT
AAACGTTTGA AAAAAGGCTT AGATAAAATT GTTGCCACAG AATCAGCCAA TTCTGGGAAT
GGCAATGTTT TAGTGGTCTC TCATGGCTTG AGTATTTCAG CGTTGTTAGC AACTTTATTT
GATGATTTTA AAGTCCCAGA AGGCGGTTTG AAGAATGCTA GTGTCACAAC AATTCATTAC
AAAAATGGCG AATATACTTT GGATAAAGTC AATCATGTCA GCTACTTAGA AGCAGGCGAA
AAAGAATCAA AATAA EF005-2 (SEQ ID NO:18) MKKRLTIVG MLFLAILVMV
GCGKNQQATT KEKETKPEEL TLYIVRHGKT MLNTTDRVQG WSDAVLTPEG EKVVTATGTC
LKDVAFQNAY SSDSGRALQT AQLILDQNKA GKDLEVVRDP DLREFNFGSY EGDLNKTMWQ
DIADDQGVSL EEFMKNMTPE SFANSVAKLD QQREESKNNW PAEDYATITK RLKKGLDKIV
ATESANSGNG NVLVVSHGLS ISALLATLED DFKVPEGGLK NASVTTIHYK NGEYTLDKVN
DVSYLEAGEK ESK EF005-3 (SEQ ID NO:19) TTGTG GTAAAAATCA GCAAGCAACG
ACAAAAGAAA AAGAGACAAA ACCTGAAGAA CTAACTCTTT ACATTGTGCG CCACGGAAAA
ACCATGTTAA ATACGACGGA CCGCGTACAA GGATGGTCAG ATGCGGTCCT AACACCAGAA
GGTGAAAAAG TTGTGACAGC AACTGGGATT GGACTGAAAG ATGTTGCCTT TCAAAATCCA
TATAGTAGTG ATAGTGGCCG CGCCTTGCAA ACTGCTCAAC TTATTTTAGA TCAAAATAAA
GCAGGCAAAG ACCTTGAAGT CGTGCGTGAC CCAGATTTAC GTGAATTTAA TTTTGGTAGC
TATGAAGGGG ATTTAAATAA GACAATGTCG CAGGATATTG CTGATGATCA AGGTGTTTCC
TTAGAAGAAT TTATGAAAAA CATGACTCCT GAATCCTTTG CCAATAGTGT AGCTAAACTG
GATCAACAGC GCGAGGAAAG CAAGAATAAC TGGCCTGCAG AAGACTATGC TACAATTACT
AAACGTTTGA AAAAAGGCTT AGATAAAATT GTTCCCACAG AATCAGCCAA TTCTCGGAAT
GGCAATGTTT TAGTGGTCTC TCATGGCTTG AGTATTTCAG CGTTGTTAGC AACTTTATTT
GATGATTTTA AAGTCCCAGA AGGCGGTTTG AAGAATGCTA GTGTCACAAC AATTCATTAC
AAAAATGGCG AATATACTTT GGATAAAGTC AATGATGTCA GCTACTTAGA AGCAGGCGAA
AAAGAATCAA AA EF005-4 (SEQ ID NO:20) CGKNQQATT KEKETKPEEL
TLYIVRHGKT MLNTTDRVQG WSDAVLTPEG EKVVTATGIG LKDVAFQNAY SSDSCRALQT
AQLILDQNKA GKDLEVVRDP DLREFNFGSY EGDLNKTMWQ DIADDQGVSL EEFMKNMTPE
SFANSVAKLD QQREESKNNW PAEDYATITK RLKKGLDKTV ATESANSGNG NVLVVSHGLS
ISALLATLED DFKVPEGGLK NASVTTIHYK NGEYTLDKVN DVSYLEAGEK ESK EF006-1
(SEQ ID NO:21) TAAACGATAA ATGGAGGGAA TAAGATGAAA AAACGTACAT
TATCGTCAGT AATTACTGTA GCAGTACCTG TCTTAGTTTT AGGGGCTTGC GGCAATAAAA
AGAGTGATGA CTCCGTCTTG AAAGTTGGAG CTTCACCAGT TCCACATGCA GAGATTTTAG
AACATGTAAA ACCTTTATTA GAAAAAGAAG GCGTAAAATT AGAAGTGACG ACTTATACAG
ATTACGTCCT ACCTAACAAG GCGTTGGAAA GTGGCGATAT CGATGCCAAC TATTTCCAAC
ATGTGCCGTT CTTTAATGAA GCGGTTAAAG AAAATGATTA TGACTTTGTG AATGCAGGTG
CGATTCATTT AGAACCAGTT
GGGCTTTACT CGAAAAAATA CAAATCGTTA CAAGAAATTC CTGATGGTTC AACGATTTAC
GTTACCTCTT CCGTTTCAGA TTGGCCACGC GTATTAACTA TCTTAGAAGA TGCTGGTTTA
ATCACGCTGA AAGAAGGGGT AGACCGGACA ACTGCTACTT TCGATGATAT TGATAAAAAT
ACTAAAAAGT TGAAATTCAA TCATGAAAGT GATCCAGCAA TCATGACCAC TCTTTATGAC
AXTGAAGAAC GGGCTGCGGT TTTAATTAAC TCAAACTTTG CCGTGGATCA AGGATTAAAT
CCGAAAAAAG ATGCGATTGC CTTAGAAAAA GAAAGTTCAC CTTATGCCAA TATTATTGCG
GTTCGTAAAG AAGACGAAAA CAACGAAAAT GTAAAAAAAT TAGTCAAAGT GTTACCTAGC
AAAGAAGTCC AAGATTGGAT TACGAAAAAA TGGAACGGCG CTATTGTTCC AGTCAATGAA
TAA EF006-2 (SEQ ID NO:22) MKK RTLWSVITVA VAVLVLGACG NKKSDDSVLK
VGASPVPHAE ILEHVKPLLE KEGVKLEVTT YTDYVLPNKA LESGDIDANY FQHVPFFNEA
VKENDYDFVN AGAIHLEPVG LYSKKYKSLQ EIPDGSTIYV SSSVSDWPRV LTILEDAGLI
TLKEGVDRTT ATFDDIDKNT KKLKFNHESD PAIMTTLYDN EEGAAVLINS NFAVDQGLNP
KKDAIALEKE SSPYANIIAV RKEDENNENV KKLVKVLRSK EVQDWITKKW NGAIVPVNE
EF006-3 (SEQ ID NO:23) TTGC GGCAATAAAA AGAGTGATGA CTCGCTCTTG
AAAGTTGGAG CTTCACCAGT TCCACATGCA GAGATTTTAG AACATGTAAA ACCTTTATTA
GAAAAAGAAG GCGTAAAATT AGAAGTGACG ACTTATACAG ATTACGTGCT ACCTAACAAG
GCGTTGGAAA GTGGCGATAT CGATGCCAAC TATTTCCAAC ATGTGCCGTT CTTTAATGAA
GCGGTTAAAG AAAATGATTA TGACTTTGTG AATGCAGGTG CGATTCATTT AGAACCAGTT
GGGCTTTACT CGAAAAAATA CAAATCGTTA CAAGAAATTC CTGATGGTTC AACGATTTAC
GTTAGCTCTT CCGTTTCAGA TTGCCCACGC CTATTAACTA TCTTAGAAGA TGCTCGTTTA
ATCACGCTGA AAGAAGGGGT AGACCGGACA ACTGCTACTT TCGATGATAT TGATAAAAAT
ACTAAAAAGT TGAAATTCAA TCATGAAAGT GATCCAGCAA TCATGACCAC TCTTTATGAC
AATGAAGAAG CGGCTGCGGT TTTAATTAAC TCAAACTTTG CCGTGGATCA AGGATTAAAT
CCGAAAAAAG ATGCGATTGC CTTAGAAAAA GAAAGTTCAC CTTATGCCAA TATTATTGCG
GTTCGTAAAG AAGACGAAAA CAACGAAAAT GTAAAAAAAT TAGTCAAAGT GTTACGTAGC
AAAGAAGTCC AAGATTGGAT TACGAAAAAA TGGAACGGCC CTATTGTTCC AGTCAATGAA
EF006-4 (SEQ ID NO:24) CC NKKSDDSVLK VGASPVPHAE ILEHVKPLLE
KEGVKLEVTT YTDYVLPNKA LESGDIDANY FQHVPFFNEA VKENDYDFVN AGAIHLEPVG
LYSKKYKSLQ EIPDCSTIYV SSSVSDWPRV LTILEDAGLI TLKEGVDRTT ATFDDIDKNT
KKLKFNHESD PAIMTTLYDN EEGAAVLTNS NFAVDQGLNP KKDAIALEKE SSPYANIIAV
RKEDENNENV KKLVKVLRSK EVQDWTTKKW NGAIVPVNE EF008-1 (SEQ ID NO:25)
TAAACCCTGA GAAAGAAATC GAGOAATCAA CCAATGAAAA AATTTAGTTT ATTTTTTTTA
ACACTTTTAG CAGGGTTAAC GTTAGCTGCT TCCGCGAATC AAGCCGCTGA AAAGAAAGAA
AAATTAGCAA TTGTGACAAC GAACTCGATC CTATCTGATT TAGTCAAAAA TGTTGCCCAA
GACAAAATTG AGCTGCATAG TATTGTGCCA ATTGGGACAG ACCCTCACGA ATATGAACCG
TTACCAGAAG ACATTGCGAA AGCTTCTGAA GCCGACATTT TATTCTTTAA CGGCTTGAAC
TTAGAAACAG GCGGAAATGG CTGGTTTAAC AAATTAATGA AAACGGCCAA AAAAGTTGAG
AATAAAGATT ACTTTTCTAC AAGCAAAAAT GTTACGCCAC AATATTTAAC AAGTGCCGGT
CAACAACAAA CAGAAGATCC ACATGCTTGG TTAGACATTG AAAATGGCAT TAAATATGTA
GAAAACATTC GTGACGTGTT AGTAGAAAAA CATCCAAAAA ATAAAGATTT CTATACAGAA
AACGCGAAAA ATTATACCGA AAAACTTAGC AAACTACATG AGGAAGCCAA AGCTAAATTT
GCTCATATTC CTGATGATAA AAAATTATTA GTTACAAGTG AAGGTGCCTT TAAATATTTC
TCCAAAGCTT ATGATTTAAA TGCCGCTTAT ATTTGGGAAA TTAACACAGA AAGTCAACGN
ACACCTCAAC AAATGACCAC GATTATTGAT ACCATTAAGA AATCAAAAGC ACCTGTGTTA
TTTGTTGAAA CCAGTGTCGA TAAACGTAGT ATGGAACGCG TCTCAAAAGA ACTGAAACGA
CCAATTTACG ATACACTTTT CACAGACTCT CTTGCCAAAG AAGGAACAGA AGGCCATACG
TACTACACCA TGATGAACTG GAATTTAACA AAAATCCATG ATGGCTTAAT GAGTAAATAA
EF008-2 (SEQ ID NO:26) MKKFSLFFLT LLAGLTLAAC CNQAAEKKEK LAIVTTNSIL
SDLVKNVGQD KIELHSIVPI GTDPHEYEPL PEDIAKASSA DILFFNGLNL ETGCNGWFNK
LMKTAKKVEN KDYFSTSKNV TPQYLTSAGQ EQTEDPHAWL DIENGIKYVE NIRDVLVEKD
PKNKDFYTEN AKNYTEKLSK LHEEAKAKFA DIPDDKKLLV TSEGAFKYFS KAYDLNAAYI
WEINTESQGT PEQMTTIIDT IKKSKAPVLF VETSVDKRSM ERVSKEVKRP IYDTLFTDSL
AKEGTEGDTY YSMMNWNLTK IHDGLMSK EF008-3 (SEQ ID NO:27) T TGCGGGAATC
AAGCCGCTGA AAAGAAAGAA AAATTAGCAA TTGTGACAAC GAACTCGATC CTATCTGATT
TACTGAAAAA TGTTGGGCAA GACAAAATTG AGCTGCATAG TATTGTGCCA ATTGGGACAG
ACCCTCACGA ATATGAACCC TTACCAGAAG ACATTGCGAA AGCTTCTGAA GCGGACATTT
TATTCTTTAA CGGCTTGAAC TTAGAAACAG CCGGAAATCG CTGGTTTAAC AAATTAATGA
AAACGGCCAA AAAAGTTGAG AATAAAGATT ACTTTTCTAC AAGCAAAAAT CTTACGCCAC
AATATTTAAC AAGTGCCGGT CAAGAACAAA CAGAAGATCC ACATGCTTGG TTAGACATTG
AAAATCGCAT TAAATATCTA CAAAACATTC GTGACGTGTT AGTAGAAAAA GATCCAAAAA
ATAAAGATTT CTATACAGAA AACGCCAAAA ATTATACCGA AAAACTTAGC AAACTACATG
AGGAAGCCAA AGCTAAATTT GCTGATATTC CTGATCATAA AAAATTATTA GTTACAAGTG
AAGGTGCCTT TAAATATTTC TCCAAAGCTT ATGATTTAAA TGCCGCTTAT ATTTGGGAAA
TTAACACAGA AAGTCAAGGN ACACCTGAAC AAATGACCAC GATTATTGAT ACCATTAAGA
AATCAAAAGC ACCTGTGTTA TTTGTTGAAA CCAGTGTCGA TAAACGTAGT ATGGAACGGG
TCTCAAAAGA AGTGAAACGA CCAATTTACG ATACACTTTT CACAGACTCT CTTGCCAAAG
AAGGAACAGA AGGCGATACG TACTACAGCA TGATGAACTG GAATTTAACA AAAATCCATG
ATGGCTTAAT GAGTAAA EF008-4 (SEQ ID NO:28) c GNQAAEKKEK LAIVTTNSIL
SDLVKNVGQD KIELHSIVPI GTDPHEYEPL PEDIAKASSA DILFFNGLNL ETOGNOWENK
LMKTAKKVEN KDYESTSKNV TPQYLTSAGQ EQTEDPHAWL DIENGIKYVE NIRDVLVEKD
PKNKDFYTEN AKNYTEKLSK LHEEAKAKFA DIPDDKKLLV TSECAFKYFS KAYDLNAAYI
WEINTESQOT PEQMTTIIDT IKKSKAPVLF VETSVDKRSM ERVSKEVKRP IYDTLFTDSL
AKEGTEGDTY YSMMNWNLTK IHDGLMSK EF009-1 (SEQ ID NO:29) TGACAAATGA
AAAAATTTAG TAAATTAATT GGACTTATTG GGCTATTAGC TTTTACGATT GCAGGTTGTG
CATCGGGGTC TGTGAAGGAT ACTAAGACAG AAACCGTTAA ACTAGGGGTT GTAGGAACAA
AAAATGATGA ATGGGAATCG GTCAAAGACC GTTTGAAAAA GAAAAATATT GATTTACAAT
TGGTAGAATT TACAGACTAT ACGCAACCAA ACGCAGCATT AGCAGAAAAA GAAATTGATT
TAAATGCCTT TCAGCATCAA ATCTTTTTAG ACAATTACAA TAAAGAGCAT GGAACGAAAT
TAGTATCAAT TGGCAATACA GTCAATGCAC CATTGGGAAT TTACGCTAAT AAATTGAAAG
ATATCACGAA AATTAAAGAC GGCGGAGAAA TTGCTATTCC TAATGACCCA ACGAATGGCG
GGCGGGCGTT AATTTTATTA CAAACTGCAG GACTGATAAA ACTAGATOCT GCGAAACAGC
AACTACCGAC TGTCAGTGAT ATTACTGAAA ATAAACGCCA ATTGAAAATA ACTGAATTAG
ATOCTACOCA AACAGCGCGC GCTTTACAAG ATGTCGATGC TTCAGTGATT AATAGCGGCA
TGGCTGTCGA TGCTGGGTAT ACACCAGATA AAGATGCTAT TTTCTTAGAA CCTGTAAACG
AAAAAGCGAA ACCTTATGTG AACATTGTCG TGGCCCGAGA AGAAGATCAA GAGAATAAAC
TTTATCAAAA AGTTGTAGAA GAATATCAAC AAGAAGAAAC GAAAAAGGTC ATTGCAGAAA
CATCAAAAGG CGCCAATGTT CCAGCCTGCG AAACATTTCG TAAAAAATAA EF009-2 (SEQ
ID NO:30) MKKFSKLIG LIGVLAFTIA GCASGSVKDT KTETVKLGVV GTKNDEWESV
KDRLKKKNID LQLVEFTDYT QPNAALAEKE IDLNAFQHQI FLDNYNKEHG TKLVSIGNTV
NAPLGIYANK LKDTTKIKDG GEIAIPNDPT NGGEALILLQ TAGLIKVDPA KQQLPTVSDI
TENKRQLKIT ELDATQTARA LQDVDASVIN SGMAVDAGYT PDKDAIELEP VNEKAKPYVN
IVVAREEDQE NKLYQKVVEE YQQEETKKVI AETSKGANVP AWETFGKK EF009-3 (SEQ
ID NO:31) TTGTG CATCGGGGTC TGTGAAGGAT ACTAAGACAG AAACCGTTAA
ACTAGGGGTT GTAGGAACAA AAAATGATGA ATGGGAATCG GTCAAAGACC GTTTGAAAAA
GAAAAATATT GATTTACAAT TGGTAGAATT TACAGACTAT ACGCAACCAA ACGCAGCATT
AGCAGAAAAA GAAATTGATT TAAATGCCTT TCAGCATCAA ATCTTTTTAG ACAATTACAA
TAAAGAGCAT GGAACGAAAT TAGTATCAAT TGGCAATACA GTCAATGCAC CATTGGGAAT
TTACGCTAAT AAATTGAAAG ATATCACGAA AATTAAAGAC GGCGGAGAAA TTGCTATTCC
TAATGACCCA ACGAATGGCG GGCGGGCGTT AATTTTATTA CAAACTGCAG GACTGATAAA
AGTAGATCCT GCGAAACAGC AACTACCGAC TGTCAGTGAT ATTACTGAAA ATAAACGCCA
ATTGAAAATA ACTGAATTAG ATGCTACGCA AACAGCGCGC GCTTTACAAG ATGTCGATGC
TTCAGTGATT AATAGCGGCA TGGCTGTCGA TGCTGGGTAT ACACCAGATA AAGATGCTAT
TTTCTTAGAA CCTGTAAACG AAAAAGCGAA ACCTTATGTG AACATTGTCG TGGCCCGAGA
AGAAGATCAA GAGAATAAAC TTTATCAAAA AGTTGTAGAA GAATATCAAC AAGAAGAAAC
GAAAAAGGTC ATTGCAGAAA CATCAAAAGC CCCCAATGTT CCAGCCTGGG AAACATTTGG
TAAAAAA EF009-4 (SEQ ID NO:32) CASGSVKDT KTETVKLOVV GTKNDEWESV
KDRLKKKNID LQLVEFTDYT QPNAALAEKE IDLNAFQHQI FLDNYNKEHG TKLVSIGNTV
NAPLGIYANK LKDITKTKDG GEIAIPNDPT NGGRALILLQ TAGLIKVDPA KQQLPTVSDI
TENKRQLKIT ELDATQTAEA LQDVDASVIN SGMAVDAGYT PDKDAIFLEP VNEKAKPYVN
IVVAREEDQE NKLYQKVVEE YQQEETKKVT AETSKGANVP AWETFGKK EF010-1 (SEQ
ID NO:33) TGAAAGAATA AAATTGTACA GGAGGAAATA AGGAATGAAA AAATGGCAAA
AAGGATTAGC CGTAGCTGGC GCACAGCTTT AGCTGTAGGA CTAAGCGCGT GCGGTAAATC
TTCAAAAGAT GCAGCGTCAA AAGGTGATGA TAGTACACCA ACCTTATTAA TGTATCGTGT
TGGGGACAAA CCAGATAATT ATGACCAATT AATCGATAAT GCGAATAAAA TTATCGAGAA
AAAAATTGGG GCAAAATTAA AAATGGAATT TGTTGGTTGG GGCGATTGGG ACCAAAAAAT
GTCAACAATC GTTCCTTCTG GTGAAAGCTA TGATATTTCA TTAGCACAAA ATTATGCAAC
GAATGCACAA AAAGGCGCCT ATCCTGATTT AACTGATTTA GCACCTAAAT ATGCCAAAGA
AGCCTATGAT CAATTGCCAG ATAACTATAT TAAAGGAAAT ACGATTAATG GAAAACTGTA
TGCGTTCCCA ATTTTAGGTA ACTCTTACGC TCAACAAGTT TTAACTTTTA ATAAAGAATA
TGTCGATAAA TACAATTTAG ATATTAGTAA AGTCGATGGT AGTTATGAAA GTGCAACGGA
AGTTCTAAAA GAATTCCNTA AAAANGANCC AAATATTGCT GCTTTTGCTA TCGGCCAAAC
ATTCTTTGCA ACAGGTAATT ATGACTTCCC TATTGGTAAC CAATATCCAT TTGCAGTAAA
AACAACTGAT ACTGGCTCAC CAAAAATTAT TAACCAATAT GCCGACAAAG ACATGATTAA
TAACTTAAAA GTCTTGCATC AATGGTATAA AGATGGCTTG ATTCCAACAC ATGCTGCTAC
AAGTACAACA CCATATGACT TAAATACCAA TACTTGGTTT ATGCGTCAAG AAACACAAGG
ACCTATGGAT TATGGTCATA CAATCTTAAC ACAACCTGCT CGCAAACCAC TTGTTTCTCG
TCCACTAACA GAACCATTAA AAACAACACC TCAAGCGCAA ATCGCTAACT ATGTTGTTGC
AAACACGTCT AAAAACAAAG AAAAATCTGT TGAATTGTTA CGTTTATTAA ACAGCAATCC
AGAATTGTTA AACGGACTTG TTTATGGTGA ACAAGGCAAA CAATATGAAA AACTTGGCGA
TGATCGTGTG AAATTGTTGA AAGATTACAC ACCAACAACT CATTTGAGTG CTTGGAACAC
AGGAAACAAC TTAATCATTT CGCCAGAAGA ATCTGTCACT GAAGAAATGG TTAAAGAACG
TGATAAGAGC ATCGAAGAAG CAAAAGATTC ACCAATTCTT GGTTTTACTT TTGTAAATGA
TAAAGTGAAA ACTGAAATCA CTAACGTTCC TACAGTTATG AACCGTTACG CAGCAACCTT
AAATACAGGA ACTGTTCATC CAGAAGAAAC ACTTCCAAAA TTAATGGATG ACCTAAAAAC
AGCTGGCTGG CATAAAGTTC AAAAAGAAAT GCAAACACAA TTAGACGAAT ATATCCAATC
TCAAAAATAA EF010-2 (SEQ ID NO:34) MAKRISR SWRTALAVGL SACGKSSKDA
ASKGDDSTPT LLMYRVGDKP DNYDQLIDNA NKIIEKKIGA KLKMEFVGWG DWDQKNSTIV
ASGESYDISL AQNYATNAQK GAYADLTDLA PKYAKEAYDQ LPDNYIKGNT INGKLYAFPI
LCNSYGQQVL TFNKEYVDKY NLDISKVDGS YESATEVLKE FXKXXPNIAA FAIGQTFFAT
GNYDFPIGNQ YPFAVKTTDT GSPKIINQYA DKDMINNLKV LHQWYKDGLI PTDAATSTTP
YDLNTNTWFM RQETQGPMDY GDTILTQAAG KPLVSRPLTE PLKTTAQAQM ANYVVANTSK
NKEKSVELLG LLNSNPELLN GLVYCEEGKQ YEKVGDDRVK LLKDYTPTTH LSAWNTGNNL
IIWPEESVTE EMVKERDKSI EEAKDSPILG FTFVNDKVKT EITNVATVHN RYAASLNTGT
VDPEETLPKL MDDLKTAGWD KVQKEMQTQL DEYIQSQK EF010-3 (SEQ ID NO:35) GT
GCGGTAAATC TTCAAAAGAT GCAGCGTCAA AAGGTGATGA TAGTACACCA ACGTTATTAA
TGTATCGTGT TGGGGACAAA CCAGATAATT ATGACCAATT AATCGATAAT GCGAATAAAA
TTATCGAGAA AAAAATTGGG GCAAAATTAA AAATGGAATT TCTTGGTTGG GGCGATTGGG
ACCAAAAAAT GTCAACAATC GTTGCTTCTG GTGAAAGCTA TGATATTTCA TTAGCACAAA
ATTATGCAAC GAATGCACAA AAAGGCGCCT ATGCTGATTT AACTGATTTA GCACCTAAAT
ATGCCAAAGA AGCCTATGAT CAATTGCCAG ATAACTATAT TAAAGGAAAT ACGATTAATG
GAAAACTGTA TGCGTTCCCA ATTTTAGGTA ACTCTTACGG TCAACAAGTT TTAACTTTTA
ATAAAGAATA TGTCGATAAA TACAATTTAG ATATTAGTAA AGTCGATGGT AGTTATGAAA
GTGCAACGGA AGTTCTAAAA GAATTCCNTA AAAANGANCC AAATATTGCT GCTTTTGCTA
TCGGCCAAAC ATTCTTTGCA ACAGGTAATT ATGACTTCCC TATTGGTAAC CAATATCCAT
TTGCAGTAAA AACAACTGAT ACTGGCTCAC CAAAAATTAT TAACCAATAT GCCGACAAAC
ACATGATTAA TAACTTAAAA GTCTTGCATC AATGGTATAA AGATGGCTTG ATTCCAACAG
ATGCTGCTAC AAGTACAACA CCATATGACT TAAATACCAA TACTTGGTTT ATGCGTCAAG
AAACACAAGG ACCTATGGAT TATGGTGATA CAATCTTAAC ACAAGCTGCT GGCAAACCAC
TTGTTTCTCG TCCACTAACA GAACCATTAA AAACAACAGC TCAAGCGCAA ATGGCTAACT
ATGTTGTTGC AAACACGTCT AAAAACAAAG AAAAATCTGT TGAATTGTTA GGTTTATTAA
ACAGCAATCC AGAATTCTTA AACGGACTTG TTTATGGTGA AGAAGGCAAA CAATATGAAA
AAGTTGGCGA TGATCGTGTG AAATTGTTGA AAGATTACAC ACCAACAACT CATTTGAGTG
CTTGGAACAC AGGAAACAAC TTAATCATTT GGCCAGAAGA ATCTGTCACT GAAGAAATGG
TTAAAGAACG TGATAAGAGC ATCGAAGAAG CAAAAGATTC ACCAATTCTT GGTTTTACTT
TTGTAAATGA TAAAGTGAAA ACTGAAATCA CTAACGTTGC TACAGTTATG AACCGTTACG
CAGCAAGCTT AAATACAGGA ACTGTTGATC CAGAAGAAAC ACTTCCAAAA TTAATGGATG
ACCTAAAAAC AGCTGGCTGG GATAAAGTTC AAAAAGAAAT GCAAACACAA TTAGACGAAT
ATATCCAATC TCAAAAA EF010-4 (SEQ ID NO:36) CGKSSKDA ASKGDDSTPT
LLMYRVGDKP DNYDQLIDNA NKIIEKKIGA KLKMEFVGWG DWDQKMSTIV ASGESYDISL
AQNYATNAQK GAYADLTDLA PKYAKEAYDQ LPDNYIKGNT INGKLYAFPI LGNSYGQQVL
TFNKEYVDKY NLDISKVDGS YESATEVLKE FXKXXPNIAA FAIGQTFFAT GNYDFPIGNQ
YPFAVKTTDT GSPKIINQYA DKDMINNLKV LHQWYKDGLI PTDAATSTTP YDLNTNTWFM
RQETQGPMDY GDTILTQAAG KPLVSRPLTE PLKTTAQAQM ANYVVANTSK NKEKSVELLG
LLNSNPELLN GLVYGEEGKQ YEKVGDDRVK LLKDYTPTTH LSAWNTGNNL IIWPEESVTE
EMVKERDKSI EEAKDSPILG FTFVNDKVKT EITNVATVMN RYAASLNTGT VDPEETLPKL
MDDLKTAGWD KVQKEMQTQL DEYIQSQK EF011-1 (SEQ ID NO:37) TAACGTTTTT
GGAGGAAAAG AATGAAAAAG AAATTTTTAG CAATGATGGC AGTTTCAATG ATGGGACTGT
TAATGTTAAG TGCTTGTCAA ACAAATAAAA AAACAGCAGA TTCTGCAACA ACAGAAACAA
CAGCTAAAAC GGAAGTCACA GTCAAAGACA CCAATGGTCA ATTAACCGTT CCCAAAAATC
CTAAGAAAGT CGTTGTTTTT GATAATGGTT CCTTGGATAC AATGGATGCA CTAGGTGTCG
GTGACCGCGT GGTAGGTGCG CCAACTAAAA ATATCCCTGC GTATTTGAAA AAATACCAAA
AAGTTCAATC AGCAGGCGGC ATTAAAGAAC CAGATTTAGA AAAAATCAAT CAACTAAAAC
CAGACTTAAT TATTATTTCT GGTCGTCAAC AAGATTATCA AGAACAATTA AAAGCCATTG
CGCCAACCAT TTACTTAGCT GTAGATGCCA AAAATCCTTG GGCATCAACG AAACAAAATA
TCGAAACGTT AGGCACTATT TTTGATAAAG AAGAGGTAGC TAAAGAAAAA ATAACTGCCT
TAGAAAAAGA AATTGCTGAC GTGAAAAAAC AAGCAGAAGC TAGCGCGAAT AATGCGCTTG
TTGTGTTAGT TAACGAAGGA CAACTTTCCG CTTACGGAAA AGGCTCTCGT TTCGGTTTAA
TTCATGATAC ATTTGGCTTC AAAGCAGCAG ACGATAAGAT TGAAGCTTCC ACTCATGGGC
AAAGTGTTTC TTACGAATAT GTTTTAGAAA AAAATCCTGG GATTCTCTTT GTGGTAGATC
GCACCAAAGC AATTGGTGGC GACGATTCAA AAGATAACGT CGCTGCAAAC GAATTGATTC
AAAAAACCGA TGCTGGTAAA AATGATAAAG TCATTATGCT TCAACCAGAT GTTTGGTATC
TAAGCGGTGG TGGATTAGAA TCAATGCATT TGATGATAGA AGATGTTAAA AAAGGATTAG
AGTAA EF011-2 (SEQ ID NO:38) MKKK FLANMAVSMM GLLMLSACQT NKKTADSATT
ETTAKTEVTV KDTNGQLTVP KNPKKVVVFD NGSLDTMDAL GVGDRVVGAP TKNIPAYLKK
YQKVESAGGI KEPDLEKINQ LKPDLIIISG RQQDYQEQLK AIAPTIYLAV DAKNPWASTK
QNIETLGTIF DKEEVAKEKI TGLEKEIADV KKQAEASANN ALVVLVNEGQ LSAYGKGSRF
GLIHDTFGFK AADDKIEAST HGQSVSYEYV LEKNPGILFV VDRTKAIGGD DSKDNVAANE
LIQKTDAGKN DKVIMLQPDV WYLSGGGLES MHLMIEDVKK GLE EF011-3 (SEQ ID
NO:39) TTGTCAA ACAAATAAAA AAACAGCAGA TTCTGCAACA ACAGAAACAA
CAGCTAAAAC GGAAGTCACA GTCAAAGACA CCAATGGTCA ATTAACCGTT CCCAAAAATC
CTAAGAAAGT CGTTGTTTTT GATAATGGTT CCTTGGATAC AATGGATGCA CTAGGTGTCG
GTGACCGCGT GGTAGGTGCG CCAACTAAAA ATATCCCTGC GTATTTGAAA AAATACCAAA
AAGTTGAATC AGCACGCCGC ATTAAAGAAC CAGATTTAGA AAAAATCAAT CAACTAAAAC
CAGACTTAAT TATTATTTCT GGTCGTCAAC AAGATTATCA AGAACAATTA AAAGCCATTG
CGCCAACCAT TTACTTAGCT GTAGATGCCA AAAATCCTTG GGCATCAACG AAACAAAATA
TCGAAACGTT AGGCACTATT TTTGATAAAG AAGAGGTAGC TAAAGAAAAA ATAACTGGCT
TAGAAAAAGA AATTGCTGAC GTGAAAAAAC AAGCAGAAGC TAGCGCGAAT AATGCGCTTG
TTGTGTTAGT TAACGAAGGA CAACTTTCCG CTTACGGAAA AGGCTCTCGT TTCGGTTTAA
TTCATGATAC ATTTGGCTTC AAAGCAGCAG ACGATAAGAT TGAAGCTTCC ACTCATGGGC
AAAGTGTTTC TTACGAATAT GTTTTAGAAA AAAATCCTGG GATTCTCTTT GTGGTAGATC
GCACCAAAGC AATTGGTGGC GACGATTCAA AAGATAACGT CGCTGCAAAC GAATTGATTC
AAAAAACCGA TGCTGGTAAA AATGATAAAG TCATTATGCT TCAACCAGAT GTTTGGTATC
TAAGCGGTGG TGGATTAGAA TCAATGCATT TGATGATAGA AGATGTTAAA AAAGGATTAG
AG EF011-4 (SEQ ID NO:40) CQT NKKTADSATT ETTAKTEVTV KDTNGQLTVP
KNPKKVVVFD NGSLDTMDAL GVGDRVVGAP TKNIPAYLKK YQKVESAGGI
KEPDLEKINQ
LKPDLIIISG RQQDYQEQLK AIAPTIYLAV DAKNPWASTK QNIETLGTIF DKEEVAKEKI
TGLEKEIADV KKQAEASANN ALVVLVNEGQ LSAYGKGSRF GLIHDTFGFK AADDKIEAST
HGQSVSYEYV LEKNPGILFV VDRTKAIGGD DSKDNVAANE LIQKTDAGKN DKVIMLQPDV
WYLSGGGLES MHLMIEDVKK GLE EF012-1 (SEQ ID NO:41) TGAGGGGGCA
ACAACATGAA ATTGGGGAAA AAAGTAGTAG GTTTGATTGC AACAGGGTTT CTTTTAGCCG
CATGTGGCGG AACCAAAGAA GCGGCAGAGA AAGTAGATTC GGGAAATTTA GCAGCTGAAC
AAAAAATCAG TATTAGTTCA CCTGCACCAA TCTCAACATT GGATACAACA CAAACAACAG
ATAAAAATAC CTTTACAATG GCACAACATT TATTTGAAGG CCTTTATCGG TTTGATGATG
ATAGTGCCAC GGTGCCAGCT CTAGCTAAAG ATGTCAAGAT TAGTGACGAT GGGCGCAAGT
ACCACTTTAC CTTGCGGGAG GGGATTAAGT GGAGCAACGG CGAGCCAATC ACGGCCCAAG
ATTTTGTTTA TTCTTGGAAA AAACTGGTGA CACCAGCGAC GATTGGACCG AATGCCTATT
TACTAGACAG TGTTAAAAAT AGTTTTGAAA TACGCAACGG TGAAAAGTCA GTCGATGAAT
TAGGGATTTC AGCCCCGAAT GACAAAGAAT TCATTGTTGA ATTAAAACAG GCCCAACCTT
CCTTCTTAGC AGTCGTTTCG ATTGCTTGGT TAGCGCCACA AAATCAAAAA TTTGTCGAAG
CGCAAGGCAA AGATTACGCC TTGGATAGTG AACATTTACT TTATAGCGGG CCATTTACGC
TAGCCAATTG GGATGCGACT TCAGATACTT GGACATTGAA AAAAAATCCA GAATACTATG
ATGCGGATCA AGTGAAACTG GAAGAAGTTG CGGTTAGCAC AATCAAAGAA GATAATACTG
GGATTAACTT ATATCAAGTG AATGAACTAG ACTTAGTTCG CATTAACGGA CAATATGTTC
AACAATATCA AGATGATCCA GGCTATGTCA GTCATCCAGA TGTGGCCAAC TACTTCTTAG
ATTTCAACAA AAAAGAAGGA ACGCCATTAG CGAATGTTCA TTTACGAAAA GCGATTGGCC
AAGCAATTGA TAAAGAAGCC TTAACACAAA GTGTCTTAAA CGATGGGTCA AAACCCCTTA
ACGGATTGAT TCCAAGTAAA CTTTATGCGA ATCCAGAAAC GGATGAAGAT TTCCGAGCTT
ACAGTGGCGA ATATTTGAAA AATGACGTCA AAAAAGCTCA AGCTGAATGG ACGAAAGCCC
AAGCGGATGT CGGTAAAAAA GTGAAACTTT CATTGCTGGC GGCAGACACA GATCAAGGAA
AACGAATTGC TGAATATGTT CAAAGTCAGT TGCAAGAAAA TCTGCCAGGT TTAGAAATTA
CCATTTCATC GCAACCAAGT AATAATGTGA ACCAATCGCG ACGTGAAAAA AATTATGAGT
TGTCTCTTTC AGGATGGATT GCCGGCAGTA GTGAATTAGA CTCTTACTTT AACTTATATG
CAGGAGAATC AAGTTACAAT TACGGCAATT ATCATAATGC CAAATACGAC CAATTGGTAG
AAGAGGCACG AACGATTAAT GCCAATAATC CAGAGAAACA GTTTGCAGAA TACAAAGAAG
CGGAAGACAT CTTGTTGAAC CAAGATGCTG CCCAAGTACC GCTGTATCAA AGTGCCTCAA
ATTATCTAAT CAATCCTAAA TTGAAAGGCA TTAGTTATCA CTTGTATGGG GATTATTTCC
ACTTGCGCAA TGCCTATTTA ACAGAATGA EF012-2 (SEQ ID NO:42) MKLGKK
WGLIATGFL LAACGGTKEA AEKVDSGNLA AEQKISISSP APISTLDTTQ TTDKNTFTMA
QHLFEGLYRF DDDSATVPAL AKDVKISDDG RKYHFTLREG IKWSNGEPIT AQDFVYSWKK
LVTPATIGPN AYLLDSVKNS FEIRNGEKSV DELGISAPND KEFIVELKQA QPSFLAVVSI
AWLAPQNQKF VEAQGKDYAL DSEHLLYSGP FTLANWDATS DTWTLKKNPE YYDADQVKLE
EVAVSTIKED NTGINLYQVN ELDLVRINGQ YVQQYQDDPG YVSHPDVANY FLDFNKKEGT
PLANVHLRKA IGQAIDKEAL TQSVLNDGSK PLNGLIPSKL YANPETDEDF PAYSGEYLKN
DVKKAQAEWT KAQADVGKKV KLSLLAADTD QGKRIAEYVQ SQLQENLPGL EITISSQPSN
NVNQSRREKN YELSLSGWIA GSSELDSYFN LYAGESSYNY GNYHNAKYDQ LVEEARTINA
NNPEKQFAEY KEAEDILLNQ DAAQVPLYQS ASNYLINPKL KGISYHLYGD YFHLRNAYLT S
EF012-3 (SEQ ID NO:43) ATGTGGCGG AACCAAAGAA GCGGCAGAGA AAGTAGATTC
GGGAAATTTA GCAGCTGAAC AAAAAATCAG TATTAGTTCA CCTGCACCAA TCTCAACATT
GGATACAACA CAAACAACAG ATAAAAATAC CTTTACAATG GCACAACATT TATTTGAAGG
CCTTTATCGG TTTGATGATG ATAGTGCCAC GGTGCCAGCT CTAGCTAAAG ATGTCAAGAT
TAGTGACGAT GGGCGCAAGT ACCACTTTAC CTTGCGGGAG GGGATTAAGT GGAGCAACGG
CGAGCCAATC ACCGCCCAAC ATTTTGTTTA TTCTTGGAAA AAACTGGTGA CACCAGCGAC
GATTGGACCG AATGCCTATT TACTAGACAG TGTTAAAAAT AGTTTTGAAA TACGCAACGG
TGAAAAGTCA GTCGATGAAT TAGGGATTTC AGCCCCGAAT GACAAAGAAT TCATTGTTGA
ATTAAAACAG GCCCAACCTT CCTTCTTAGC AGTCGTTTCG ATTGCTTGGT TAGCGCCACA
AAATCAAAAA TTTGTCGAAG CGCAAGGCAA AGATTACGCC TTGGATAGTG AACATTTACT
TTATAGCGGG CCATTTACGC TAGCCAATTG GGATGCGACT TCAGATACTT GGACATTGAA
AAAAAATCCA GAATACTATG ATGCGGATCA AGTGAAACTG GAAGAAGTTG CGGTTAGCAC
AATCAAAGAA GATAATACTG GGATTAACTT ATATCAAGTG AATGAACTAG ACTTAGTTCG
CATTAACGGA CAATATGTTC AACAATATCA AGATGATCCA GGCTATGTCA GTCATCCAGA
TGTGGCCAAC TACTTCTTAG ATTTCAACAA AAAAGAAGGA ACGCCATTAG CGAATGTTCA
TTTACGAAAA GCGATTGGCC AAGCAATTGA TAAAGAAGCC TTAACACAAA GTGTCTTAAA
CGATGGGTCA AAACCCCTTA ACGGATTGAT TCCAAGTAAA CTTTATGCGA ATCCAGAAAC
CGATGAAGAT TTCCGAGCTT ACAGTGGCGA ATATTTGAAA AATGACGTCA AAAAAGCTCA
AGCTGAATGG ACGAAAGCCC AAGCGGATGT CGGTAAAAAA GTGAAACTTT CATTGCTGGC
GGCAGACACA GATCAAGGAA AACGAATTGC TGAATATGTT CAAAGTCAGT TGCAAGAAAA
TCTGCCAGGT TTAGAAATTA CCATTTCATC GCAACCAAGT AATAATGTGA ACCAATCGCG
ACGTGAAAkA AATTATGAGT TGTCTCTTTC AGGATGGATT GCCGGCAGTA GTGAATTAGA
CTCTTACTTT AACTTATATG CAGGAGAATC AAGTTACAAT TACGGCAATT ATCATAATGC
CAAATACGAC CAATTGGTAG AAGAGGCACG AACGATTAAT GCCAATAATC CAGAGAAACA
GTTTGCAGAA TACAAAGAAG CGGAAGACAT CTTGTTGAAC CAAGATGCTG CCCAAGTACC
GCTGTATCAA AGTGCCTCAA ATTATCTAAT CAATCCTAAA TTGAAAGGCA TTAGTTATCA
CTTGTATGGG GATTATTTCC ACTTGCGCAA TGCCTATTTA ACAGAA EF012-4 (SEQ ID
NO:44) CGGTKEA AEKVDSGNLA AEQKISISSP APISTLDTTQ TTDKNTFTMA
QHLFEGLYRF DDDSATVPAL AKDVKISDDG RKYHFTLREG IKWSNGEPIT AQDFVYSWKK
LVTPATIGPN AYLLDSVKNS FEIRNGEKSV DELGISAPND KEFTVELKQA QPSFLAVVST
AWLAPQNQKF VEAQGKDYAL DSEHLLYSGP FTLANWDATS DTWTLKKNPE YYDADQVKLE
EVAVSTIKED NTGINLYQVN ELDLVRINGQ YVQQYQDDPG YVSHPDVANY FLDFNKKEGT
PLANVHLRKA IGQAIDKEAL TQSVLNDGSK PLNGLIPSKL YANPETDEDF RAYSGEYLKN
DVKKAQAEWT KAQADVGKKV KLSLLAADTD QGKRIAEYVQ SQLQENLPGL EITISSQPSN
NVNQSRREKN YELSLSGWIA GSSELDSYFN LYAGESSYNY GNYHNAKYDQ LVEEARTINA
NNPEKQFAEY KEAEDILLNQ DAAQVPLYQS ASNYLINPKL KGISYHLYGD YFHLRNAYLT E
EF013-1 (SEQ ID NO:45) TAACGAAAAA TGAAAAAAAT TGCTTTGTTC AGTATGTTAA
CGTTCAGTGT ATTGTCTTTA AGTCTAGCAG GATGTGGAAA CAAAAAAACA GCAAGCACAA
ATGATTCTAA GCCAAAGCAA GAAACAAAGA AAGCCACGCA GAAATCCTCT AGCCAACAAG
AAATGAAAAG TAGTCATTCG TCTGTCACGG GTCAAAATTC TAATGTGACA GGGGAAAATC
CGTCAGAAAA TGCCACGCAG CCTTCTGCAG GAACTGATGA AACGAATGAA GTCCCTCAAA
ACCAAGCACC TGATACAAAC ATTACAATTA CCAATGTTGT TTTCAATCCT GAAAGAAATG
AAATTAATGG TACTACATTA CCTAATGCAA CCATTACAGC AACGGTAGTC GGTGATGCTT
CTGCACAAGC AGGTGTTTTT TATGCGGATG CCAATGGCAA TTTTACAGTA ATTAGTCCCA
GAGCGGGAGC GACTACTCAA TTAATCGCAA CCGTTGATCA ACGGAATAGT GCACCTGTCC
AAATTGATAT TCCAAGTTCA GGACAAGAAG CAGCGCTTTC TTTTAGCAAT ATTACGATTG
ATCCGAAACA AGGGACAATT TCTGGTAAAA CAGCACCGAA TGCAACTATT TTAGTGTCAC
GTGCAGATGA TGCGCGGGTG ATTTTAGCAA GTTTTACTGC GGATGCCCAA GGGAATTTCA
CAGCCAGTAA TTTAGTTCCC GGCACAAAAA ATCGCTTAGA TGTTACGTTA AATGGAGAAA
TAGGGACACC TTACTTGTTT GATTTACCAA ATTAA EF013-2 (SEQ ID NO:46)
MKKIALFS MLTFSVLSLS LAGCGNKKTA STNDSKPKQE TKKATQKSSS QQEMKSSHSS
VTGQNSNVTG ENPSENATQP SAGTDETNEV PQNQAPDTNI TITNVVFNPE RNEINGTTLP
NATITATVVG DASAQAGVFY ADANGNFTVI SPRAGATTQL IATVDQRNSA PVQIDIPSSG
QEAALSFSNI TIDPKQGTIS GKTAPNATTL VSRADDARVI LASFTADAQG NFTASNLVPG
TKNRLDVTLN GEIGTPYLFD LPN EF013-3 (SEQ ID NO:47) ATGTGGAAA
CAAAAAAACA GCAAGCACAA ATGATTCTAA GCCAAAGCAA GAAACAAAGA AAGCCACGCA
GAAATCCTCT AGCCAACAAG AAATGAAAAG TAGTCATTCG TCTGTCACGG GTCAAAATTC
TAATGTGACA GGGGAAAATC CGTCAGAAAA TGCCACGCAG CCTTCTGCAG GAACTGATGA
AACGAATGAA GTCCCTCAAA ACCAAGCACC TGATACAAAC ATTACAATTA CCAATGTTGT
TTTCAATCCT GAAAGAAATG AAATTAATGG TACTACATTA CCTAATGCAA CCATTACAGC
AACGGTAGTC GGTGATGCTT CTGCACAAGC AGGTGTTTTT TATGCGGATG CCAATCGCAA
TTTTACAGTA ATTAGTCCCA GAGCGGGAGC GACTACTCAA TTAATCGCAA CCGTTGATCA
ACGGAATAGT GCACCTGTCC AAATTGATAT TCCAAGTTCA GGACAAGAAG CAGCGCTTTC
TTTTAGCAAT ATTACGATTG ATCCGAAACA AGGGACAATT TCTGGTAAAA CAGCACCGAA
TGCAACTATT TTAGTGTCAC GTGCAGATGA TGCGCGGGTG ATTTTAGCAA GTTTTACTGC
GGATGCCCAA GGGAATTTCA CAGCCAGTAA TTTAGTTCCC GGCACAAAAA ATCGCTTAGA
TGTTACGTTA AATGGAGAAA TAGGGACACC TTACTTGTTT GATTTACCAA AT EF013-4
(SEQ ID NO:48) CCNKKTA STNDSKPKQE TKKATQKSSS QQEMKSSHSS VTGQNSNVTG
ENPSENATQP SAGTDETNEV PQNQAPDTNI TITNVVFNPE RNEINGTTLP NATITATVVG
DASAQAGVFY ADANGNFTVI SPRAGATTQL IATVDQRNSA PVQIDIPSSG QEAALSFSNI
TIDPKQGTTS GKTAPNATIL VSRADDARVI LASFTADAQG NFTASNLVPG TKNRLDVTLN
GEIGTPYLFD LPN EF014-1 (SEQ ID NO:49) TGATGGTGGA GACTTTTTAA
GAGAGAGGAA GTACAGCCAA TGAGTAGGAA GCGAAAAATC ACCTTAATTA GTTTAGTCAT
CATTTTGGTT TTTGTCACAG TCGGCTCAGC ATACTTTGCT GTAGCGGGTA GCTATTTAAA
GAAAACAATT GATAAAGGCT ATGTTCCCAT AAAAAATGAT TATAATGAAG CCCAAAATAA
AGATAGTCAA TCGTTTTTGA TTATGGGGCT AGACAATACA ATTGAACGGA AATTAGGCAC
AACTAGGACT GATGCTATGA TGGTGATTAC CGTGAATAAC AAGACGAAGA AAATAACCTA
TTTAAGTTTG CCACGGGATA GTTTTGTTCA AATTGATGCG AAAAATTACC AAGGGATGCA
GCGAATTGAA GCCGCCTATA CCTACGATGG ACCAACAGCT TCTGTTAACA CAGTTGAGAA
ATTATTGAAT ATTCCAATCA ATCATTACGT TGTGTTTAAC TTTTTATCTT TTATTAAGTT
AATTGATGCG GTTCGCGGCA TAGATGTCAA TGTCAAGCAG GCGTTTGATG GTGTCACCAA
AGACGGCCCA GGATCCATTC ATTTTGATGC AGGGAAACAG CATTTAGATG GTACGAAAGC
TTTATCTTAT GCCCGTGAAA GACATAGCGA TAACGATATT ATGCGTGGAT TCCGACAACA
AGAAATTATT CAAGCAGTTG AAGACAAGTT GAAATCTCGT CAATCAATCA TGAAAATAAT
GGACATTATT GATTCGTTAA ATGGAAACAT TCAAACTGAT GTGGATTCCA ATGAATTGAC
TCATTTAGTC AAAGAAGGTT TCACTTGGAC CAATTATGAT AAACAACAGC TTTCTTTTGA
CTGGCGCACT TTTAGTAATG AAGGGCGCAG TATGGTTGAA CTATACCCAG ATAGTATTGA
AAATGTCCGT CATCAATTAC GTGTGTCTTT AAATTTAGAA AAGCCAGATG AACCAGATCA
AGACGGCTAT GTCTTCCATA CGAACGGTGA ATTTTTATAT CAAAGTGATT ATACCGTTCA
AGATGAAGCA GCTGAGGAAA ACGAAATGAC TTCCATCAAC CGCAATACGT ATATTGGTGT
TCCTGCTAAT ACACAGACCO GCCCGTTGCC ATCAGTTAAA ACGGAAAATG GCTTTATAAA
ATAA EF014-2 (SEQ ID NO:50) MSRKRKIS LISLVIILVF VTVGSAYFAV
AGSYLKKTID KGYVPIKNDY NEAQNKDSQS FLIMGLDNTI ERKLGTTRTD ANMVITVNNK
TKKITYLSLP RDSFVQIDAK NYQGMQRIEA AYTYDGPTAS VNTVEKLLNI PINHYVVFNF
LSFIKLIDAV GGIDVNVKQA FDGVTKDGPG SIHFDAGKQH LDGTKALSYA RERHSDNDIM
RGFRQQEIIQ AVEDKLKSGQ SIMKIMDIID SLNGNIQTDV DSNELTHLVK EGLTWTNYDK
QQLSFDWRTF SNEGRSMVEL YPDSIENVRH QLRVSLNLEK PDERDQDGYV FHTNGEFLYQ
SDYTVQDEAA EENEMTSING NTYIGVPGNT QTGPLPSVKT ENGFIK EF014-3 (SEQ ED
NO:51) TGCT GTAGCGCGTA GCTATTTAAA GAAAACAATT GATAAAGGCT ATGTTCCCAT
AAAAAATGAT TATAATGAAG CGCAAAATAA ACATAGTCAA TCGTTTTTGA TTATGGGGCT
AGACAATACA ATTGAACGGA AATTAGGCAC AACTAGGACT GATGCTATGA TGGTGATTAC
CGTGAATAAC AAGACGAAGA AAATAACCTA TTTAAGTTTG CCACGGGATA GTTTTGTTCA
AATTGATGCG AAAAATTACC AAGGGATGCA GCGAATTGAA GCCGCCTATA CCTACGATGG
ACCAACAGCT TCTGTTAACA CAGTTGAGAA ATTATTGAAT ATTCCAATCA ATCATTACGT
TGTGTTTAAC TTTTTATCTT TTATTAAGTT AATTGATGCG GTTGGCCGCA TAGATGTCAA
TGTCAAGCAG GCGTTTGATG GTGTCACCAA AGACGGGCCA GGATCCATTC ATTTTGATGC
ACGGAAACAG CATTTAGATG GTACGAAAGC TTTATCTTAT GCCCGTGAAA GACATAGCGA
TAACGATATT ATGCGTGGAT TCCGACAACA AGAAATTATT CAAGCAGTTG AAGACAAGTT
GAAATCTGGT CAATCAATCA TGAAAATAAT GGACATTATT GATTCGTTAA ATGGAAACAT
TCAAACTGAT GTGGATTCCA ATGAATTGAC TCATTTAGTC AAAGAAGGTT TGACTTGGAC
CAATTATGAT AAACAACAGC TTTCTTTTGA CTGGCGCACT TTTAGTAATG AAGGGCGCAG
TATGGTTGAA CTATACCCAG ATAGTATTGA AAATGTCCGT CATCAATTAC GTGTGTCTTT
AAATTTAGAA AAGCCAGATG AACGAGATCA AGACGGCTAT GTCTTCCATA CGAACGGTGA
ATTTTTATAT CAAAGTGATT ATACCGTTCA AGATGAAGCA GCTGAGGAAA ACGAAATGAC
TTCCATCAAC GGCAATACGT ATATTGGTGT TCCTGGTAAT ACACAGACCG GCCCGTTGCC
ATCAGTTAAA ACGGAAAATG GCTTTATAAA A EF014-4 (SEQ ID NO:52) AV
AGSYLKKTID KGYVPIKNDY NEAQNKDSQS FLIMGLDNTI ERKLGTTRTD AMMVITVNNK
TKKITYLSLP RDSFVQIDAK NYQGMQRIEA AYTYDGPTAS VNTVEKLLNI PINHYVVFNF
LSFIKLIDAV GGIDVNVKQA FDCVTKDGPG SIHFDAGKQH LDGTKALSYA RERHSDNDIM
RGFRQQEIIQ AVEDKLKSGQ SIMKIMDIID SLNGNIQTDV DSNELTHLVK EGLTWTNYDK
QQLSFDWRTF SNEGRSMVEL YPDSIENVRH QLRVSLNLEK PDERDQDGYV FHTNGEFLYQ
SDYTVQDEAA EENEMTSING NTYIGVPGNT QTGPLPSVKT ENGFIK EF015-1 (SEQ ID
NO:53) TAATTAAAAA TGTGTAAAAA GGGTCTGATG AAAAAAGGAG ACATAATAGT
TATTATCTTT TTAATAGCTA TCTCTTTTTC TCCATATTTT ATTTTTTTTC ACAATAATCC
ATTTAACTCC AAAAGTTTTG ACGACACTAA ATATGCTGTG GTCAAGATAG ATGGGAAAGA
GATTGAGCGT ATAAATTTAG ATGATTCAAA AGAATTTATC AAAACATATT ATCCATCAAA
AGGGCAATAT AATACTATAG AAGTTAAAAA TGGGCACGTT CGTGTAAAAA AAGATAATAG
TCCAGATCAA ATTGCGGTGA AAACAGGATG GATATCAGAA CCAGGGCNAA CTACTATCTG
TATTCCTCAC AGATTCATTT TAGAAATTGT TCAACAATAT TCTAAGGATT ATTATATTTA
CTAA EF015-2 (SEQ ID NO:54) MK KGDIIVIIFL TAISESPYFI FFHNNPFNSK
SFDDTKYAVV KIDGKEIERIL NLDDSKEFIK TYYPSKGQYN TIEVKNGHVR VKKDNSPDQI
AVKTGWISEP GXTSICTPHR FILEIEVQQYS KDYYIY EF015-3 (SEQ ID NO:55)
CAATAATCC ATTTAACTCC AAAAGTTTTG ACGACACTAA ATATGCTGTG GTCAAGATAG
ATGGGAAAGA GATTGAGCGT ATAAATTTAG ATGATTCAAA AGAATTTATC AAAACATATT
ATCCATCAAA AGGGCAATAT AATACTATAG AAGTTAAAAA TGGGCACGTT CGTGTAAAAA
AAGATAATAG TCCAGATCAA ATTGCGGTGA AAACAGGATG GATATCAGAA CCAGGCCNAA
CTAGTATCTG TATTCCTCAC AGATTCATTT TAGAAATTGT TCAACAATAT TCTAAGGATT
ATTATATTTA C EF015-4 (SEQ ID NO:56) NNPFNSK SFDDTKYAVV KTDGKEIERI
NLDDSKEFIK TYYPSKGQYN TIEVKNGHVR VKKDNSPDQT AVKTGWISEP GXTSICIPHR
FILEIVQQYS KDYYIY EF016-1 (SEQ ID NO:57) TGACGGTTGC CCCCGTCCAA
TAGAAAGGAG TTTATGATGA AAAAGAAATA TTCTTTAGCC TTGCTGGTTA TCTGTTGTAG
TTTACTCCTA TTTGCAGGTT GTGGTAAAAG AAAAAGCAAC GAAGATCAAT GGACACGGAT
TAACGAAGAA AAACGGATTA TTATTGGCTT AGATGACTCC TTTGTGCCCA TGGGTTTTCA
AGATAAATCA GGCAAAATTG TCGGCTTTGA TGTCGACTTA GCCAAAGCGG TTTTTAAACT
TTATGGCATT TCCGTTGACT TCCAACCGAT TGATTGGTCT ATGAAAGAAA CAGAATTACA
AAATCAAACC ATTGATCTTA TTTGGAACGG CTACACTAAA ACGAGCGAGC GGGCCGAAAA
AGTTCAATTC ACACAACCTT ACATGACGAA CGACCAAGTA CTTGTTTCTT TAAAAGAAAA
AAACATTGCA ACAGCGAGCG ACATGCAAGG CAAAATTTTA GGCGTTCAAA ACGGCTCTTC
TGGCTATGAT GGCTTCGAAA GTCAGCCTGA CGTTTTGAAA AAATTTGTTA AAGACCAAAC
ACCTATTTTA TATGACGGCT TTAATGAAGC TTTCTTAGAT TTAAAATCTG GTCGAATTGA
CGGACTCCTA ATCGATCGCG TTTACGCCAA CTACTATCTT TCCCACGAAG ATAATTTAAA
AAACTATACT ATTTCTCATG TAGGCTATGA CAATGAAGAT TTTGCTGTGG GCGTCCGCAA
ATCAGACAAT CAATTAGTCC AAAAAATCAA TACTGCCTTT GAAACGTTAC GAAAAGATGG
CACCCTTACT AAAATTTCTC AAAAATGGTT TGGAGAGGAC GTTACAAATA ACACAAAAAT
AAACTAA EF016-2 (SEQ ID NO:58) MMKKKYSLAL LVICCSLLLF AGCGKRKSNE
DQWTRINEEK RIIIGLDDSF VPMGFQDKSG KIVGFDVDLA KAVFKLYGIS VDFQPIDWSM
KETELQNQTI DLIWNGYTKT SERAEKVQFT QPYMTNDQVL VSLKEKNIAT ASDMQGKILG
VQNGSSGYDG FESQPDVLKK FVKDQTPILY DGFNEAFLDL KSGRIDGLLI DRVYANYYLS
HEDNLKNYTI SHVGYDNEDF AVGVRKSDNQ LVQKINTAFE TLRKDGTLSK TSQKWFGEDV
TNNTKIN EF016-3 (SEQ ID NO:59) AAGCAAC GAAGATCAAT GGACACGGAT
TAACGAAGAA AAACGGATTA TTATTGGCTT AGATGACTCC TTTGTGCCCA TGGGTTTTCA
AGATAAATCA GGCAAAATTG TCGGCTTTGA TGTCGACTTA GCCAAAGCGG TTTTTAAACT
TTATGGCATT TCCGTTGACT TCCAACCGAT TGATTGGTCT ATGAAAGAAA CAGAATTACA
AAATCAAACC ATTGATCTTA TTTGGAACGG CTACACTAAA ACGAGCGAGC GGGCCGAAAA
AGTTCAATTC ACACAACCTT ACATGACGAA CGACCAAGTA CTTGTTTCTT TAAAAGAAAA
AAACATTGCA ACAGCGAGCG ACATGCAAGG CAAAATTTTA GGGGTTCAAA ACGGCTCTTC
TGGCTATGAT GGCTTCGAAA GTCAGCCTGA CGTTTTGAAA AAATTTGTTA AAGACCAAAC
ACCTATTTTA TATGACGGCT TTAATGAAGC TTTCTTAGAT TTAAAATCTG GTCGAATTGA
CGGACTCCTA ATCGATCGCG TTTACGCCAA CTACTATCTT TCCCACGAAG ATAATTTAAA
AAACTATACT ATTTCTCATG TAGGCTATGA CAATGAAGAT TTTGCTGTGG GCGTCCGCAA
ATCAGACAAT CAATTAGTCC AAAAAATCAA TACTGCCTTT GAAACGTTAC GAAAAGATGG
CACCCTTAGT AAAATTTCTC AAAAATGGTT TGGAGAGGAC GTTACAAATA ACACAAAAAT
AAAC EF016-4 (SEQ ID NO:60) SNE DQWTRINEEK RIIIGLDDSF VPMGFQDKSG
KIVGFDVDLA KAVFKLYGIS
VDFQPIDWSM KETELQNQTI DLIWNGYTKT SERAEKVQFT QPYMTNDQVL VSLKEKNIAT
ASDMQGKILG VQNGSSGYDG FESQPDVLKK FVKDQTPILY DGFNEAFLDL KSGRIDGLLI
DRVYANYYLS HEDNLKNYTI SHVGYDNEDF AVGVRKSDNQ LVQKINTAFE TLRKDGTLSK
ISQKWFGEDV TNNTKIN EF017-1 (SEQ ID NO:61) TGAGGTGTTT TTATGAAAAG
CGCAACAAAG CAAAGGCTGT CTTTGGCAGC AATCATGGTT CTACTTCTCT CGGGCTGTGG
AAGTGTTGGG AAAGAAACCA AAAAGCAAGA ACAACAGGTA TTACGGGTCG GGATTGATTC
GGAATTATCA ACGGCAGACG TGTCGTTGGC AATGGATAAT ACCGCAGCAG ATGTAATGAG
CCAAGTAGGG GAGGGACTTT TCTCCTTTGA CGAAAAAGGA GAAGCGAAAC CAGCATTGGC
AACTGAAAAA GTACAGCCCT CCAATGATGG TTTAAGCTAT ACTTTTACGA TTCGAAAAGA
TGCAAAATGG AGTAACGGCG AGCCAATCAC AGCAAATGAT TTTGAATACT CTTGGAAGCG
CACAGTGGAC CCAAAAACAG CTTCCCCGCA AGCGTATTAC TTTGAAGGGT TAAAAAATTA
TCGTGCTATT GTTGACGGTA GCAAATCTAA AGAAGAGTTA GGGGTAACAG CCATTGATGA
CCATACCTTG GAAGTAGAGC TAAGCTATCC TATGAGTTAT TTTCAACAAT TATTGGCGGT
ACCAGCTTTT TATCCTTTAA ATGAAGCATT TGTCGAAAAA ACGGGCAAAA ACTATGGTAC
ATCAGCTGAG TCAACACTTT ACAATGGCGC CTTCACATTA GAAGGTTGGG ATGGCACGAA
TAATACTTGG TCCTATGTGA AGAATAAAAA TTATTGGGAT CAAGCGAATG TTTCGCTAGA
TAAGGTGGAT GTCCAAGTAG TTAAAGAAGT CAATACTGGG AAAAATCTTT TCGAAGGGAA
AGAATTAGAT GTTGTAAAAA TTTCTGGAGA AATTGTTGCA CAAGAACAAG GCAATGCAGC
TTTGAAAATT CGTGAAATTC CTGGAACGTA TTATATCCAA TTAAATACGC AAAAAGATCT
TTTGGCAAAT AAGAATGCAC GTCGAGCAAT AGCATTATCA TTGAATTCTG AGCGTTTAGC
TAAAAATGTT TTAAATGATG GCTCAAAAAA AGCACTTGGC TTCGTGCCAA CAGGTTTCAC
TAATCAACAA ACCCAAAAAG ATTTTGCAGA GGAATTAGGA GATTTAAATC CTAGTGAACC
AGAAAAAGCG AAAGAGTTAT GGCAAACGGC TAAAAAAGAA TTAGGAATTG AAAAAGCGGA
GCTAACGATT TTAAGTTCGG ATACAGAAAA TGCTAAAAAA ATCAGTGAGT ATGTTCAAGG
AGCTTTAGCA GATAATTTAG AAAATTTAAC AGTCAATGTT TCACCAGTTC CTTTTAATAA
TCGTTTAGAA AAAAGTCGCA GCGGAGATTT CGACATTGTG GTTGGTGGCT GGACGCCAGT
ATATGCTGAT CCAATCGATT TCTTAAACTT ACTGCAATCA AAAAATTCCA ATAATTTTGG
TAAATGGTCT AATAAGACCT TTGATCAGTT GCTTCAAGAA CCAAACGTAA CTTATGCAAA
TAAATATGAA GAACGTTGGA AAACATTACA AAAAGCGGAT CAATTGGTTG CGGAAGAAGC
CCCCCTAGTT CCTCTTTATC AATTAACAGA AGCACGCTTA GTGGCCGATT CTGTCCAAAA
TTTAGTCTAT GGTCCATTAG GTTCAGGCTA TTACAAATCA GTCTCTATCG GCGACAAGTA A
EF017-2 (SEQ ID NO:62) MKRATKQ RLSLAAIMVL LLSGCGSVGK ETKKQEQQVL
RVGIDSELST ADVSLAMDNT AADVMSQVGE GLFSFDEKGE AKPALATEKV QPSNDGLSYT
FTIRKDAKWS NGEPITANDF EYSWKRTVDP KTASPQAYYF EGLKNYRAIV DGSKSKEELG
VTAIDDHTLE VELSYPMSYF QQLLAVPAFY PLNEAFVEKT GKNYGTSAES TLYNGAFTLE
GWDGTNNTWS YVKNKNYWDQ ANVSLDKVDV QVVKEVNTGK NLFEGKELDV VKISGEIVAQ
EQGNAALKIR EIPGTYYIQL NTQKDLLANK NARPAIALSL NSERLAKNVL NDGSKKALGF
VPTGFTNQET QKDFAEELGD LNPSEPEKAK ELWQTAKKEL GIEKAELTIL SSDTENAKKI
SEYVQGALAD NLENLTVNVS PVPFNNRLEK SRSGDFDIVV GGWTPVYADP IDFLNLLQSK
NSNNFGKWSN KTFDQLLQEA NVTYANKYEE RWKTLQKADQ LVAEEAPLVP LYQLTEARLV
ADSVQNLVYG PLGSGYYKSV SIGDK EF017-3 (SEQ ID NO:63) CTGTGG
AAGTGTTGGG AAAGAAACCA AAAAGCAAGA ACAACAGGTA TTACGGGTCG GGATTGATTC
GGAATTATCA ACGGCAGACG TGTCGTTGGC AATGGATAAT ACCGCAGCAG ATGTAATGAG
CCAAGTAGGG GAGGGACTTT TCTCCTTTGA CGAAAAAGCA GAAGCGAAAC CAGCATTGGC
AACTGAAAAA GTACAGCCCT CCAATGATGG TTTAAGCTAT ACTTTTACGA TTCGAAAAGA
TGCAAAATGG AGTAACGGCG AGCCAATCAC AGCAAATGAT TTTGAATACT CTTGGAAGCG
CACAGTGGAC CCAAAAACAG CTTCCCCGCA AGCGTATTAC TTTGAAGGGT TAAAAAATTA
TCGTGCTATT GTTGACGGTA GCAAATCTAA AGAAGAGTTA GGGGTAACAG CCATTGATGA
CCATACCTTG GAAGTAGAGC TAAGCTATCC TATGAGTTAT TTTCAACAAT TATTGGCGGT
ACCAGCTTTT TATCCTTTAA ATGAAGCATT TGTCGAAAAA ACGGGCAAAA ACTATGGTAC
ATCAGCTGAG TCAACACTTT ACAATGGCGC CTTCACATTA GAAGGTTGGG ATGGCACGAA
TAATACTTGG TCCTATGTGA AGAATAAAAA TTATTGGGAT CAAGCGAATC TTTCGCTAGA
TAAGGTGGAT GTCCAAGTAG TTAAAGAAGT CAATACTGGG AAAAATCTTT TCGAAGGGAA
AGAATTAGAT GTTGTAAAAA TTTCTGGAGA AATTGTTGCA CAACAACAAG GCAATGCAGC
TTTGAAAATT CGTGAAATTC CTGGAACGTA TTATATCCAA TTAAATACGC AAAAAGATCT
TTTGGCAAAT AAGAATGCAC GTCGAGCAAT AGCATTATCA TTGAATTCTG AGCGTTTAGC
TAAAAATGTT TTAAATGATG GCTCAAAAAA AGCACTTGGC TTCGTGCCAA CAGGTTTCAC
TAATCAAGAA ACGCAAAAAG ATTTTGCAGA GGAATTAGGA GATTTAAATC CTAGTGAACC
AGAAAAAGCG AAAGAGTTAT GGCAAACGGC TAAAAAAGAA TTAGGAATTG AAAAACCGGA
GCTAACGATT TTAAGTTCGG ATACAGAAAA TGCTAAAAAA ATCAGTGAGT ATGTTCAAGG
AGCTTTAGCA GATAATTTAG AAAATTTAAC AGTCAATGTT TCACCAGTTC CTTTTAATAA
TCGTTTAGAA AAAAGTCGCA GCGGAGATTT CGACATTGTG GTTGGTGGCT GGACGCCAGT
ATATGCTGAT CCAATCGATT TCTTAAACTT ACTGCAATCA AAAAATTCCA ATAATTTTGG
TAAATGGTCT AATAAGACCT TTGATCAGTT GCTTCAAGAA GCAAACGTAA CTTATGCAAA
TAAATATGAA GAACGTTGCA AAACATTACA AAAAGCGGAT CAATTGGTTG CGGAAGAAGC
CCCCCTAGTT CCTCTTTATC AATTAACAGA AGCACGCTTA GTGGCCGATT CTGTCCAAAA
TTTAGTCTAT GGTCCATTAG GTTCAGGCTA TTACAAATCA GTCTCTATCG GCGACAAG
EF017-4 (SEQ ID NO:64) CGSVGK ETKKQEQQVL RVGIDSELST ADVSLAIADNT
AADVMSQVGE GLFSFDEKGE AKPALATEKV QPSNDGLSYT FTIRKDAKWS NGEPITANDF
EYSWKRTVDP KTASPQAYYF EGLKNYRAIV DGSKSKEELG VTAIDDHTLE VELSYPMSYF
QQLLAVPAFY PLNEAFVEKT GKNYGTSAES TLYNGAFTLE GWDGTNNTWS YVKNKNYWDQ
ANVSLDKVDV QVVKEVNTGK NLFEGKELDV VKISGEIVAQ EQGNAALKIR EIPGTYYIQL
NTQKDLLANK NARRAIALSL NSERLAKNVL NDGSKKALGF VPTGFTNQET QKDFAEELGD
LNPSEPEKAK ELWQTAKKEL GIEKAELTIL SSDTENAKKI SEYVQGALAD NLENLTVNVS
PVPFNNRLEK SRSGDFDIVV GGWTPVYADP IDFLNLLQSK NSNNFGKWSN KTFDQLLQEA
NVTYANKYEE RWKTLQKADQ LVAEEAPLVP LYQLTEARLV ADSVQNLVYG PLGSGYYKSV
SIGDK EF018-1 (SEQ ID NO:65) TGTCATTACA ACGATACCAA TTTTAATCAT
TTATCCATTA CTACAAAAAC ACTTTATCGG CGGTATGATG GCCGGTGCAG TAAAAGAATA
AAGAAAGTAG GGAACAATAT GAAAAAAGTT TTAGGCGGTT TATTGGTGGC AACGGCGGTC
GTTAGTTTAG CGGCCTGTAG CGGTGGGGAA AAGAAAGCTA GCTCAGATGT CTCAATTAAG
GATCGGTATG AATTAGATGA AAAGACGCCT GCTTGGAAGT TACATAAGAA GAAAGAACCG
ACCAAGATTA AATGGTATAT TAACTCAGAT TGGACGGCGC TCCCTTTTGG AAAAGACGTG
ACCACTGCGC AGATTAAAAA AGACTTAAAT GTGGATATTG AATTTATTTC CGGCGATGAT
TCAAAATTAA ATGCCATGAT TTCAAGTGGA GATATGCCTG ATATCGTGAC ATTAACTGAA
AAAACTGGAC AAGCAGCATT GAAAGCAGAT TCTTGGGCCT ATTCTTTAAA CGATTTAGCT
AAAAAATATG ACCCCTATTT AATGAAAGTT GTTAACCAAG ATACGTTTAA ATGGTATGCC
TTAGAGGATG GAAAAACATA TGGTTACCCT AATTACTCTA ATACAAAAGC GGATTATGAA
AGTGGAAATA TCCCAGTAAA TGATAATTTT GTTATTCGTG AAGATGTCTA TAATGCATTA
GGCAAGCCAG ACGTTTCAAC ACCAGAAAAT TTTGAAAAAG TCATGCAACA GATTAAAGAA
AAATATCCTG AGATGACCCC AATGGGCTTC ACCACAGTGG GCGATGGTGC AGGACCATTT
TTAGACAAAT TACAAGACTT CTTAGGTGTT CCTTTAGAGG ATAAAAATGG TAAATACTAT
GATCGAAATT TAGATAAAGA ATATTTAGAA TGGTTAAAAA CATTTAATGA TGTTTACCGA
GCAGGCAATA TTAGTGATGA TAGCTTCACA GATGATGGGG CAACGTTTGA TGAAAAAGTG
AAACAAGGAA ATTATGCAAC CATGCTCGTT GCTGGAACCA GTGGTCAAGG TGGGAACTTC
ACAGAATTTA TGAAAAAATC TGGCACACGT TATATAGCCA TTGATGGACC AAGTAGCACT
TCTGGCCGAA AACCAACATT AAATCAAACC GGCATTTCAG GTTGGTTAAG TAATTACATT
ACGAAAGATG CGAAAGATCC AGCAAAAGTC ACTCAACTGT TCACATATTT AATTGATGAA
CCGGGACAAA TTTTAACAAA ATATGGCGTT GAAGGAGTTA CTTATGCGTA CAATGATCAA
GGAAAAATTG ATTATTTACC AGAAGTGAAA AAATTAGAAC AAACAGACAA TGATGCCTAC
AACAAAAAAT ATGGCATTAG TCGTTTCCTA TACTTTAACA ACGACCGTGT CAATAAACTA
AAAGTACCAA TGGAAAGTGC TTTAACGCAA ATGCAAGAAT GCGGCAAAGG AAAATTAGTC
CCACATTTCG TAATTGAAAA TATTAATCCA GATGCAGGAA CGCCGGAAGC TCGTGCGAAT
GAAGCGATTG AAACCAAACT AAATACAACC GTTATTTCAA TGATTCGTGC GAAAGATGAT
AAAGCCTTTG ACAAATCTTT AGAAGACTAC AAAGCATTCT TAAAATCAAA TAAATCGGAT
GCAATTGAAA AAATAAAATC TGAGAAAATG GCGGAAAACA GAGACAAACT TAAGTAA
EF018-2 (SEQ ID NO:66) MKKV LGGLLVATAV VSLAACSGGE KKASSDVSIK
DRYELDEKTP AWKLDKKKEP TKIKWYTNSD WTALPFGKDV TTAQIKKDLN VDIEFISGDD
SKLNANISSG DMPDIVTLTE KTGQAALKAD SWAYSLNDLA KKYDPYLMKV VNQDTFKWYA
LEDGKTYGYP NYSNTKADYE SGNIPVNDNF VIREDVYNAL GKPDVSTPEN FEKVMQQIKE
KYPEMTPMGF TTVGDGAGPF LDKLQDFLGV PLEDKNGKYY DRNLDKEYLE WLKTFNDVYR
AGNTSDDSFT DDGATFDEKV KQGNYATMLV AGTSGQGGNF TEFMKKSGTR YIAIDGPSST
SGRKPTLNQT GISGWLSNYI TKDAKDPAKV TQLFTYLIDE PGQILTKYGV EGVTYAYNDQ
GKIDYLPEVK KLEQTDNDAY NKKYGISRFL YFNNDRVNKL KVPMESALTQ MQEWGKGKLV
PHFVIENINP DAGTPEARAN EAIETKLNTT VISMTRAKDD KAFDKSLEDY KAFLKSNKWD
AIEKIKSEKM AENRDKLK EF018-3 (SEQ ID NO:67) CTGTAG CGGTGGGGAA
AAGAAAGCTA GCTCACATGT CTCAATTAAG GATCGGTATG AATTAGATGA AAAGACGCCT
GCTTGGAAGT TAGATAAGAA GAAAGAACCG ACCAAGATTA AATGGTATAT TAACTCAGAT
TGGACGGCGC TGCCTTTTGG AAAAGACGTG ACCACTGCGC ACATTAAAAA AGACTTAAAT
GTGGATATTG AATTTATTTC CGGCGATGAT TCAAAATTAA ATGCCATGAT TTCAAGTGGA
GATATGCCTG ATATCGTGAC ATTAACTGAA AAAACTGGAC AAGCAGCATT GAAAGCAGAT
TCTTGGGCCT ATTCTTTAAA CGATTTAGCT AAAAAATATG ACCCCTATTT AATGAAAGTT
GTTAACCAAG ATACGTTTAA ATGGTATGCC TTAGAGGATG GAAAAACATA TGGTTACCCT
AATTACTCTA ATACAAAAGC GGATTATGAA AGTGGAAATA TCCCAGTAAA TGATAATTTT
GTTATTCGTG AAGATGTCTA TAATCCATTA CGCAAGCCAG ACGTTTCAAC ACCAGAAAAT
TTTGAAAAAG TCATGCAACA GATTAAAGAA AAATATCCTG AGATGACCCC AATGGGCTTC
ACCACAGTGG GCGATGGTGC AGGACCATTT TTAGACAAAT TACAAGACTT CTTAGGTGTT
CCTTTAGAGG ATAAAAATGG TAAATACTAT GATCGAAATT TAGATAAAGA ATATTTAGAA
TGGTTAAAAA CATTTAATGA TGTTTACCGA GCAGGCAATA TTAGTGATGA TAGCTTCACA
GATGATGGGG CAACGTTTGA TGAAAAAGTG AAACAAGGAA ATTATGCAAC CATGCTCGTT
GCTGGAACCA GTGGTCAAGG TGGGAACTTC ACAGAATTTA TGAAAAAATC TGGCACACGT
TATATAGCCA TTGATGGACC AAGTACCACT TCTCGCCGAA AACCAACATT AAATCAAACC
GGCATTTCAG GTTGGTTAAG TAATTACATT ACGAAAGATG CGAAAGATCC AGCAAAAGTC
ACTCAACTGT TCACATATTT AATTGATGAA CCGGGACAAA TTTTAACAAA ATATGGCGTT
GAAGGAGTTA CTTATGCGTA CAATGATCAA GGAAAAATTG ATTATTTACC AGAAGTGAAA
AAATTAGAAC AAACAGACAA TGATGCCTAC AACAAAAAAT ATGGCATTAG TCGTTTCCTA
TACTTTAACA ACGACCGTGT CAATAAACTA AAAGTACCAA TGGAAAGTGC TTTAACGCAA
ATGCAAGAAT GGGGCAAAGG AAAATTAGTC CCACATTTCG TAATTGAAAA TATTAATCCA
GATGCACGAA CGCCGGAAGC TCGTGCGAAT GAAGCGATTG AAACCAAACT AAATACAACC
GTTATTTCAA TGATTCGTGC GAAAGATGAT AAAGCCTTTG ACAAATCTTT AGAAGACTAC
AAAGCATTCT TAAAATCAAA TAAATGGGAT CCAATTGAAA AAATAAAATC TCAGAAAATG
GCGGAAAACA GAGACAAACT TAAG EF018-4 (SEQ ID NO:68) CSGGE KKASSDVSIK
DRYELDEKTP AWKLDKKKEP TKIKWYINSD WTALPFGKDV TTAQIKKDLN VDIEFISGDD
SKLNAMTSSG DMPDIVTLTE KTGQAALKAD SWAYSLNDLA KKYDPYLMKV VNQDTFKWYA
LEDGKTYGYP NYSNTKADYE SGNIPVNDNF VIREDVYNAL GKPDVSTPEN FEKVMQQIKE
KYPEMTPMGF TTVGDGAGPF LDKLQDFLGV PLEDKNGKYY DRNLDKEYLE WLKTFNDVYR
AGNISDDSFT DDGATFDEKV KQGNYATMLV AGTSGQGGNF TEFMKKSGTR YIAIDGPSST
SGRKPTLNQT GISGWLSNYI TKDAKDPAKV TQLFTYLIDE PGQILTKYGV EGVTYAYNDQ
GKIDYLPEVK KLEQTDNDAY NKKYGISRFL YFNNDRVNKL KVPMESALTQ MQEWGKGKLV
PHFVIENINP DAGTPEARAN EAIETKLNTT VISMIRAKDD KAFDKSLEDY KAFLKSNKWD
AIEKIKSEKM AENRDKLK SF019-1 (SEQ ID NO:69) TAAAGGAGTT ACACAATGAA
ACTTTTAAAA AAGACGGTCC TAATTGGTAC AACCCTTCTT CTTGGTTCAT TCTTACTCGC
AGCTTGTGGT AATACGAATA AAGAAGCCAA CAACGCTGAC AAAACACATG AAGTAACAGA
TACOTTAGGO AATAAAGTAA CCGTCCCCGC GAAACCCAAA CGGATTATTG CGAGTTATTT
AGAAGATTAT CTAGTTGCAT TAGGAGAAAA ACCAGTGGCA CAATGGACAG TTGGACAAGG
CAGCATTCAA GATTATTTAG CGAAAGAATT GAAAGATGTC CCCACTATTT CCTATGACTT
GCCATATGAA GCGGTTCTAA AATTTGAACC TGACTTATTA TTAATCAGTT CATCTGCTCT
AGTTGAAGGC GGTAAATACA AAGAATACAG TAAAATTGCG CCAACTTATG TAGTCAAAAA
CGGCGAAAAT GTCACCTGGC GTGATCAATT GGAAGATATT GCCACTGTTT TAGATAAAAA
AGAACAAGCG AAAAAAGTGT TAGAAGATTA TGATACCTTA ACCAAAGGCG TCCAAGAATA
TCTTGGCAAA AAAGATGCTG GCAAATCTGC GGCAGTCTTA TGGGTAACCA ACAACCAAGT
CTTTATGGTT AGCGATAATC GCTCAAGCGG AACCGTGCTC TATCAGGACT TAGGCCTCCA
AGTTCCAAAA TTAGTGGAAG AAATTTCTAA AAACGCTACT GCGGATTGGA ATCAAGTTTC
TTTAGAAAAA TTAGCTGAGC TTGACGCAGA CCACATTTTC CTTGTAAACA GCGATGAATC
AGCACCTCTT TTCCAAGAAG CAATTTGGAA GAACTTACCT GCTGTGAAAA ATAACCAAGT
TCATACCTAT GATAAAAAAA GTAGTTGGTT ATACAACGGA CCTATTGCGA ATACTCAAAT
TGTTGAAGAT GTAAAAAAAG CGCTCTTAAA TTAA 5F019-2 ((SEQ ID NO:70)
MKLLKK TVLIGTTLLL GSFLLAACGN TNKEANNADK THEVTDTLGN KVTVPAKPKR
IIASYLEDYL VALGEKPVAQ WTVGQGSIQD YLAKELKDVP TISYDLPYEA VLKFEPDLLL
TSSSALVEGG KYKEYSKIAP TYVVKNGENV TWRDQLEDIA TVLDKKEQAK KVLEDYDTLT
KGVQEYLGKK DAGKSAAVLW VTNNQVFMVS DNRSSGTVLY QDLGLQVPKL VEEISKNATA
DWNQVSLEKL AELDADHIEL VNSDESAPLF QEAIWKNLPA VKNNQVHTYD KKSSWLYNGP
IANTQIVEDV KKALLN EF019-3 (SEQ ID NO:71) TTGTGGT AATACGAATA
AAGAAGCCAA CAACGCTGAC AAAACACATG AAGTAACAGA TACCTTAGGC AATAAAGTAA
CCGTCCCCGC GAAACCCAAA CGGATTATTG CGAGTTATTT AGAAGATTAT CTAGTTGCAT
TAGGAGAAAA ACCAGTGGCA CAATGGACAG TTGGACAAGG CAGCATTCAA GATTATTTAG
CGAAAGAATT GAAAGATGTC CCCACTATTT CCTATGACTT GCCATATGAA GCGGTTCTAA
AATTTGAACC TGACTTATTA TTAATCAGTT CATCTGCTCT AGTTGAAGGC GGTAAATACA
AAGAATACAG TAAAATTGCG CCAACTTATG TAGTCAAAAA CGGCGAAAAT GTCACCTGGC
GTGATCAATT GGAAGATATT GCCACTGTTT TAGATAAAAA AGAACAAGCG AAAAAAGTGT
TAGAAGATTA TGATACCTTA ACCAAAGGCG TCCAAGAATA TCTTGGCAAA AAAGATGCTG
GCAAATCTGC GGCAGTCTTA TGGGTAACCA ACAACCAAGT CTTTATGGTT AGCGATAATC
GCTCAAGCGG AACCGTGCTC TATCAGGACT TAGGCCTCCA AGTTCCAAAA TTAGTGGAAG
AAATTTCTAA AAACGCTACT GCGGATTGGA ATCAAGTTTC TTTAGAAAAA TTAGCTGAGC
TTGACGCAGA CCACATTTTC CTTGTAAACA GCGATGAATC AGCACCTCTT TTCCAAGAAG
CAATTTGGAA GAACTTACCT GCTGTGAAAA ATAACCAAGT TCATACCTAT GATAAAAAAA
GTAGTTGGTT ATACAACGGA CCTATTGCGA ATACTCAAAT TGTTGAAGAT GTAAAAAAAG
CGCTCTTAAA T EF019-4 (SEQ ID NO:72) CGN TNKEANNADK THEVTDTLGN
KVTVPAKPKR IIASYLEDYL VALGEKPVAQ WTVGQGSIQD YLAKELKDVP TISYDLPYEA
VLKFEPDLLL ISSSALVEGG KYKEYSKIAP TYVVKNGENV TWRDQLEDIA TVLDKKEQAK
KVLEDYDTLT KGVQEYLGKK DAGKSAAVLW VTNNQVFMVS DNRSSGTVLY QDLGLQVPKL
VEEISKNATA DWNQVSLEKL AELDADHIEL VNSDESAPLF QEATWKNLPA VKNNQVHTYD
KKSSWLYNGP IANTQIVEDV KKALLN EF020-1 (SEQ ID NO:73) TGAGGAGATG
AGAAAATGAA AAAGGTAGTT TCAATTTTGT TGATGGTTGT TGCAGTCTTC ACATTAACTG
CATGTAATGG TTCTAAATTA GATAAAACAG GTGAAGAATT TAAAAATTCT ATAATGAAAG
ATTCTTCATA TGGTGATGAA TATTCAGAAG ATGGTTTTAG TTTTTTAATA TATAAAGATA
AAGACACTAA TCGTTATTTG GCTGATGTTT GGGTTCCTGT TAAAGATGAA ACTAGCGCAT
TGGAGTATTT TTATTATTAT GATGAAGATA AGCGATTAGA TAGTACTAAA AGTAAAGTAA
CCTTTGATGA TATGAAAGCT AGTGGAAACT ATGAAGTAGT GTATAAATCA GGGAAATTTA
AATAA EF020-2 (SEQ ID NO:74) MKKVVS ILLMVVAVFT LTACNGSKLD
KTGEEFKNSI MKDSSYGDEY SEDGFSFLIY KDKDTNRYLA DVWVPVKDET SALEYFYYYD
EDKRLDSTKS KVTFDDMKAS GNYEVVYKSG KFK EF020-3 (SEQ ID NO:75)
ATGTAATGG TTCTAAATTA GATAAAACAG GTGAAGAATT TAAAAATTCT ATAATGAAAG
ATTCTTCATA TGGTGATGAA TATTCAGAAG ATGGTTTTAG TTTTTTAATA TATAAAGATA
AAGACACTAA TCGTTATTTG GCTGATCTTT GGGTTCCTGT TAAAGATGAA ACTAGCGCAT
TGGAGTATTT TTATTATTAT GATGAAGATA AGCGATTAGA TAGTACTAAA AGTAAAGTAA
CCTTTGATCA TATGAAAGCT AGTGGAAACT ATGAAGTAGT GTATAAATCA GGGAAATTTA
AA EF020-4 (SEQ ID NO:76) CNGSKLD KTGEEFKNSI MKDSSYGDEY SEDGFSFLILY
KDKDTNRYLA DVWVPVKDET SALEYFYYYD EDKRLDSTKS KVTFDDMKAS GNYEVVYKSG
KFK EF021-1 (SEQ ID NO:77) TAGTTGTTTA AATACATTAA ACTATTTTTA
GGAGGCTTTA CAGAAATGAA AAAAGCAAAA TTATTCGGTT TTAGTTTGAT TGCATTAGGT
TTATCAGTTT CACTTGCAGC ATGTGGTGGT GGCAAAGGCA AAACCGCTGA AAGCGGCGGT
GGCAAAGGGG ATGCAGCGCA TAGTGCTGTA ATOATTACAG ATACAGGCGG CGTGGATGAC
AAGTCGTTCA ACCAATCTTC TTGGGAAGGA TTGCAAGCTT GGGGTAAAGA ACATGATTTA
CCAGAAGGTT CAAAAGGGTA TGCATATATT CAATCGAATG ATGCAGCTGA CTATACAACC
AATATTGACC AAGCGGTATC AAGTAAATTC AACACAATCT TTGGTATTGG CTACTTGCTA
AAAGATGCAA TTTCTTCTGC
AGCAGATGCC AACCCTGATA CAAACTTTGT TTTAATCGAT GATCAAATCG ATGGCAAAAA
GAATGTCGTT TCTGCAACAT TTAGAGATAA TGAAGCAGCT TACTTAGCCG GTGTTGCTGC
TGCAAATGAA ACAAAAACGA ACAAAGTCCG TTTTGTTGGT GCTGAAGAAG GGGTCGTAAT
TGACCGTTTC CAAGCTGGTT TTGAAAAAGG TGTGGCTGAT GCTGCGAAAG AATTAGGTAA
AGAAATTACT GTTGATACGA AATATGCGGC TTCATTTGCT GATCCTGCCA AAGGGAAAGC
TTTAGCTGCT GCAATGTACC AAAACGGCGT TGATATCATC TTCCATGCTT CTGGTGCGAC
TGGACAAGGG GTCTTCCAAG AAGCAAAAGA CTTGAATGAA TCAGGTTCTG GCGACAAAGT
TTGGGTAATC GGCGTTGACC GCGATCAAGA TGCTGATGGC AAGTACAAAA CAAAAGACGG
CAAAGAAGAC AACTTCACGT TAACTTCAAC GCTTAAAGGT GTCGGCACAG CGGTTCAAGA
TATTGCCAAC CGTGCGTTAG AAGACAAATT CCCTGGTGGC GAACATTTAG TTTATGGATT
AAAAGATGGT GGCGTTGACT TAACAGACGG CTATTTAAAC GACAAAACAA AAGAAGCTGT
TAAAACAGCA AAAGATAAAG TAATCTCAGG TGACGTAAAA GTCCCAGAAA AACCAGAATA A
EF021-2 (SEQ ID NO:78) MKKAKL FGFSLIALGL SVSLAACGGG KGKTAESGGG
KGDAAHSAVI ITDTGGVDDK SFNQSSWEGL QAWGKEHDLP EGSKGYAYIQ SNDAADYTTN
IDQAVSSKFN TIFGIGYLLK DAISSAADAN PDTNFVLIDD QIDGKKNVVS ATFRDNEAAY
LAGVAAANET KTNKVGFVGG EEGVVIDRFQ AGFEKGVADA AKELGKEITV DTKYAASFAD
PAKGKALAAA MYQNGVDIIF HASGATGQGV FQEAKDLNES GSGDKVWVIG VDRDQDADGK
YKTKDGKEDN FTLTSTLKGV GTAVQDIANR ALEDKFPGGE HLVYGLKDGG VDLTDGYLND
KTKEAVKTAK DKVISGDVKV PEKPE EF021-3 (SEQ ID NO:79) ATGTGGTGGT
GGCAAAGGCA AAACCGCTGA AAGCGGCGGT GGCAAAGGGG ATCCAGCGCA TAGTGCTGTA
ATCATTACAG ATACAGGCGG CGTGGATGAC AAGTCGTTCA ACCAATCTTC TTGGGAAGGA
TTGCAAGCTT CGGGTAAAGA ACATGATTTA CCAGAAGGTT CAAAAGGCTA TGCATATATT
CAATCGAATG ATGCAGCTGA CTATACAACC AATATTGACC AAGCGGTATC AAGTAAATTC
AACACAATCT TTGGTATTGG CTACTTGCTA AAAGATGCAA TTTCTTCTGC AGCAGATGCC
AACCCTGATA CAAACTTTGT TTTAATCGAT GATCAAATCG ATGGCAAAAA GAATGTCGTT
TCTGCAACAT TTAGAGATAA TGAAGCAGCT TACTTAGCCG GTGTTGCTGC TGCAAATGAA
ACAAAAACGA ACAAAGTCGG TTTTGTTGGT GGTGAAGAAG GGGTCGTAAT TGACCGTTTC
CAAGCTGGTT TTGAAAAAGG TGTGGCTGAT GCTGCGAAAG AATTAGGTAA AGAAATTACT
GTTGATACGA AATATGCGCC TTCATTTGCT GATCCTGCCA AAGGGAAAGC TTTAGCTGCT
GCAATGTACC AAAACGGCGT TGATATCATC TTCCATGCTT CTCGTGCGAC TCGACAAGGG
GTCTTCCAAG AAGCAAAAGA CTTGAATGAA TCAGGTTCTG GCGACAAAGT TTGGGTAATC
GGCGTTGACC GCGATCAAGA TGCTGATGGC AAGTACAAAA CAAAAGACGG CAAAGAAGAC
AACTTCACGT TAACTTCAAC GCTTAAAGGT GTCGGCACAG CGGTTCAAGA TATTGCCAAC
CGTGCGTTAG AAGACAAATT CCCTGGTGGC GAACATTTAG TTTATGGATT AAAAGATGGT
GGCGTTGACT TAACAGACGG CTATTTAAAC GACAAAACAA AAGAAGCTGT TAAAACAGCA
AAAGATAAAG TAATCTCAGG TGACGTAAAA GTCCCAGAAA AACCAGAA EF021-4 (SEQ
ID NO:80) CGGG KCKTAESGGG KGDAAHSAVI ITDTGGVDDK SFNQSSWEGL
QAWGKEHDLP EGSKGYAYIQ SNDAADYTTN IDQAVSSKFN TIFGIGYLLK DAISSAADAN
PDTNFVLIDD QIDGKKNVVS ATFRDNEAAY LAGVAAANET KTNKVGFVGG EEGVVIDRFQ
AGFEKCVADA AKELGKEITV DTKYAASFAD PAKGKALAAA MYQNGVDIIF HASGATGQGV
FQEAKDLNES GSGDKVWVIG VDRDQDADGK YKTKDGKEDN FTLTSTLKGV GTAVQDIANR
ALEDKFPGGE HLVYGLKDGG VDLTDGYLND KTKEAVKTAK DKVISGDVKV PEKPE
EF022-1 (SEQ ID NO:81) TAAGAGCATA AAAAAATGAA GAGTTATACG AGAAACAAGA
TGAAAAAGTA TTTAAAAATC ACAATGGTTT GTATTTTATT GGTAGCATTT TTAGCTGGGT
GTACCAATAA AAATGAAAAT AAAAAGAAAC AGAAAAATAC CAAAGAAGCC GTTCAACTGA
TGTCACCCTC GGAATTAACA ACGCTCAACA CCTCTGTATT ATTGGATTTT CCACATGCTA
TTGTCCAAAC TGCAGCGTTT GAAGGGTTAT ATAGTTTAGA TGAACAAGAC CAATTGGTAC
CAGCCGTAGC AAAAGCATTG CCGATGATTT CAGAAGATGG AAAAACCTAC ACGATTTCTT
TGAGAAAAGA AGCGGTTTGG AGTAACGATG ATCCTGTCAC AGCACATGAT TTTGAATATG
CTTGGAAAAA AATGATTGAT CCTAAAAACG GCTTTGTTTA TAGCTTCCTC ATCGTTGAAA
CAATTCAAAA TGGTGCAGAA ATCTCAGCGG GGAAATTAGC ACCCAATGAA CTAGGTGTCA
CAGCTGTGGA TGATTATACA TTAAAGGTGA CCCTCAAAGA GCCAAAACCG TACTTTACGT
CCTTGTTAGC TTTTCCGACA TTTTTCCCGC AAAATCNAAA AGTAGTCGAA CAATTTGGTG
CGGACTATGG AACTGCTAGT GATAAAGTCG TCTATAATGG TCCGTTCGTG GTAAAAGATT
GGCAGCAAAC AAAGATGGAC TGCCAACTAG CAAAAAATAA TCGCTATTGG GATCACCAGA
ACGTGCGCTC AGACATTATC AATTATACAG TTATCAAAGA AACATCTACC GCATTGAATC
TTTTTGAAGA TGGACAATTA GATGTGGCTA CACTAAGTGG TGAACTCGCG CAACAGAATA
AAAATAATAC GTTGTATCAT TCGTATCCAA CAGCGACAAT GAACTATTTG CGCTTAAATC
AAAAACGGNA AGGGCAAGCN ACGCCGCTTG CAAACGAAAA CCTGCGTAAA GCATTGGCTT
TAGGAATAGA TAAAGAAAAT CTAGTCAATA ATATTATTGC AGATGGTTCT AAAGCCCTAC
ATGGTGCGAT TACGGAAGGC TTTGTGGCGA ATCCCACAAC GGGTCTCGAT TTTCGTCAAG
AAGCAGGTAA TTTAATGGTT TATAACAAAG AAAAAGCGCA AAGTTATTGG AAAAAAGCAC
AAGCAGAATT AGGAGAAAAG GTTAACGTTG AATTGATGGT AACAGATGAT GGTTCTTACA
AAAAAATTGG TGAAAGTTTG CAAGGCTCGC TACAAGAATT GTTTCCTGGT TTGACAATAG
AGCTAACCGC ATTGCCGACT GAAGCTGCAT TGAACTTTGG GCCACAAAGT GACTATGATT
TATTCTTAAT TTACTGGACA CCAGACTATC AAGACCCTAT TTCTACCCTG ATGACTTTAT
ACAAGGGCAA TGATCGCAAT TATCAGAACC CTGTCTATGA CAAATTATTA GATGAAGCAG
CCACAACCTA TGCCTTAGAG CCAGAAAAAA GATGGGCGAC ACTGATTGCA GCTGAAAAAG
AAGTGATTGA AACGACTGCT GGCATGATTC CACTTAGCCA AAATGAACAA ACAGTCCTGC
AAAATGATAA AGTCAAAGGC TTGAATTTTC ATACCTTTGG CGCTCCATTA ACGTTAAAAA
ATGTTTATAA GGAAAAATAA EF022-2 (SEQ ID NO:82) MKKYLKIET MVCILLVGFL
AGCTNKNENK KKQKNTKEAV QLMSPSELTT LNTSVLLDFP DAIVQTAAFE GLYSLDEQDQ
LVPAVAKALP MISEDGKTYT ISLRKEAVWS NDDPVTAHDF EYAWKKMIDP KNGFVYSFLI
VETIQNGAEI SAGKLAPNEL GVTAVDDYTL KVTLKEPKPY FTSLLAFPTF FPQNXKVVEQ
FGADYGTASD KVVYNGPFVV KDWQQTKMDW QLAKNNRYWD HQNVRSDIIN YTVIKETSTA
LNLFEDCQLD VATLSGELAQ QNKNNTLYHS YPTATHNYLR LNQKRXGQAT PLANENLRKA
LALGIDKENL VNNIIADGSK ALHGAITEGF VANPTTGLDF RQEAGNLMVY NKEKAQSYWK
KAQAELGEKV NVELMVTDDG SYKKICESLQ GSLQELFPGL TIELTALPTE AALNFGRESD
YDLFLIYWTP DYQDPISTLM TLYKGNDRNY QNPVYDKLLD EAATTYALEP EKRWATLIAA
EKEVIETTAG MIPLSQNEQT VLQNDKVKGL NFHTFGAPLT LKNVYKEK EF022-3 (SEQ
ID NO:83) CT GTACCAATAA AAATGAAAAT AAAAAGAAAC ACAAAAATAC CAAAGAAGCC
GTTCAACTGA TGTCACCCTC GGAATTAACA ACGCTCAACA CCTCTGTATT ATTGGATTTT
CCAGATGCTA TTGTCCAAAC TGCAGCGTTT GAAGGGTTAT ATAGTTTAGA TGAACAAGAC
CAATTGCTAC CACCCGTAGC AAAAGCATTG CCGATGATTT CAGAAGATGG AAAAACCTAC
ACGATTTCTT TGAGAAAAGA AGCGGTTTGG AGTAACGATG ATCCTGTCAC AGCACATGAT
TTTCAATATG CTTCGAAAAA AATGATTGAT CCTAAAAACG GCTTTGTTTA TAGCTTCCTC
ATCGTTGAAA CAATTCAAAA TGGTGCAGAA ATCTCAGCGG GGAAATTAGC ACCCAATGAA
CTAGGTGTCA CAGCTGTGGA TGATTATACA TTAAAGGTGA CGCTCAAAGA GCCAAAACCG
TACTTTACGT CCTTGTTAGC TTTTCCGACA TTTTTCCCGC AAAATCNAAA AGTAGTCGAA
CAATTTGGTG CGGACTATGG AACTGCTAGT GATAAAGTCG TCTATAATGG TCCGTTCGTG
GTAAAAGATT GGCAGCAAAC AAAGATGGAC TGGCAACTAG CAAAAAATAA TCGCTATTGG
GATCACCAGA ACGTGCGCTC AGACATTATC AATTATACAG TTATCAAAGA AACATCTACC
GCATTGAATC TTTTTGAAGA TGGACAATTA GATGTGGCTA CACTAAGTGG TGAACTGGCG
CAACAGAATA AAAATAATAC GTTGTATCAT TCGTATCCAA CACCGACAAT GAACTATTTG
CGCTTAAATC AAAAACGGNA AGGGCAAGCN ACGCCGCTTC CAAACGAAAA CCTGCGTAAA
GCATTGGCTT TAGGAATAGA TAAAGAAAAT CTAGTCAATA ATATTATTCC AGATGGTTCT
AAAGCGCTAC ATGGTGCGAT TACGGAAGGC TTTGTGGCGA ATCCCACAAC GGGTCTCGAT
TTTCGTCAAG AAGCAGGTAA TTTAATGGTT TATAACAAAG AAAAAGCGCA AAGTTATTGG
AAAAAAGCAC AAGCAGAATT AGGAGAAAAG GTTAACGTTG AATTGATGGT AACAGATGAT
GGTTCTTACA AAAAAATTGG TGAAAGTTTG CAAGGCTCGC TACAAGAATT GTTTCCTGGT
TTGACAATAG AGCTAACCGC ATTGCCGACT GAAGCTGCAT TGAACTTTGG GCGAGAAAGT
GACTATGATT TATTCTTAAT TTACTGGACA CCAGACTATC AAGACCCTAT TTCTACCCTG
ATGACTTTAT ACAAGGGCAA TGATCGCAAT TATCAGAACC CTGTCTATGA CAAATTATTA
GATGAAGCAG CCACAACCTA TGCCTTAGAG CCACAAAAAA GATGGGCGAC ACTGATTGCA
GCTGAAAAAG AAGTGATTGA AACGACTGCT GGCATGATTC CACTTAGCCA AAATGAACAA
ACAGTCCTGC AAAATGATAA AGTCAAAGGC TTGAATTTTC ATACCTTTGG CGCTCCATTA
ACGTTAAAAA ATGTTTATAA CGAAAAA EF022-4 (SEQ ID NO:84) CTNKNENK
KKQKNTKEAV QLMSPSELTT LNTSVLLDFP DAIVQTAAFE GLYSLDEQDQ LVPAVAKALP
MISEDGKTYT ISLRKEAVWS NDDPVTAHDF EYAWKKMIDP KNGFVYSFLI VETIQNGAEI
SAGKLAPNEL GVTAVDDYTL KVTLKEPKPY FTSLLAFPTF FPQNXKVVEQ FGADYGTASD
KVVYNGPFVV KDWQQTKMDW QLAKNNRYWD HQNVRSDIIN YTVIKETSTA LNLFEDGQLD
VATLSGELAQ QNKNNTLYHS YPTATMNYLR LNQKRXGQAT PLANENLEKA LALGTDKENL
VNNIIADGSK ALHGAITEGE VANPTTGLDF RQEAGNLMVY NKEKAQSYWK KAQAELGEKV
NVELMVTDDG SYKKIGESLQ GSLQELFPGL TIELTALPTE AALNFGRESD YDLFLIYWTP
DYQDPISTLM TLYKGNDENY QNPVYDKLLD EAATTYALEP EKRWATLIAA EKEVIETTAG
MIPLSQNEQT VLQNDKVKGL NFHTFCAPLT LKNVYKEK EF023-1 (SEQ ID NO:85)
TAAAATGGAG GGATCGGTAT GAAGAAATTA AAAATGTTAG GATGCGTCGG GTTCCTTTTA
GCTTTAACGG CTTGTCAGGC GGGAACGGGA AACTCGGCTG ATAGTAACAA AGCAGCGGAA
CAAAAAATTG CAATTAGTTC TGAAGCCGCT ATTTCGACAA TGGAACCACA CACAGCGGGG
GATACGACCT CGACTTTAGT CATCAATCAA GTTTATGAAG GACTCTATGT TTTAGGTAAA
GAAGATGAAT TAGAGTTGGG GGTCGCTGCG GAAGAACCAG CGATTTCTGA AGATGAAACC
CTTTATACAT TTAACATTAG AGAAGATCCC AAATGGTCGA ATGATGATCC AGTAACAGCA
AACGACTTTG TTTATGCATG GCAACAAGTT GCTTCCCCTA AATCAGGATC GATTCATCAA
GCTTTATTTT TTCATGTCAT TAAAAATCCT AACCAAATTG CTTTAGAAGG CGCAGATGTG
AATACTCTTG GGCTTAAGGC GCTAGATGAT AAAACGTTAG AAATAACTTT AGAACGGCCC
ACCCCTTATT TGAAATCATT ACTTTCGTTT CCTGTTTTGT TTCCACAAAA TGAAAAATAT
ATCAAAGAAC AAGGGGATAA ATATGCTACT GATGCAGAAC ATTTGATTTA TAATGGTCCT
TTTAAATTGA AAGAATGGGA TAATGCCTCT TCTGATGACT GGACCTACCA AAAAAATGAT
ACOTATTOGO ATGCTGAAAA AGTTAAATTA ACAGAAGCGA AAGTTTCAGT AATTAAGAGC
CCAACGACAC CGGTGAATTT GTTTGACTCC AATGAATTCG ATGTAGTGAA TAAGCTAAGT
GCTGAATTTA TTCCTGGTTA TGTTGATAAT CCACCCTTTC TTTCAATTCC TCAATTCGTC
ACATACTTTT TAAAAATGAA CAGCGTTCGT GATGGAAAAG AAAATCCGGC TTTAGCGAAC
AACAATATTC GTAAAGCGTT GGCACAAGCT TTTGATAAAG AAAGTTTTGT AAAAGAAGTC
TTGCAAGATC AATCAACGGC TACACATCAA GTAATTCCGC CGGGACAAAC GATTGCGCCA
GATCCAACAG ATTTCACAAA ACTAGCTGCT AAGAAAAATA ACTACTTAAC CTACGATACA
CCGAAACCAA AACAATTCTC GGAAAAAGGG AAAAAAGAAA TTGGGCTCCA TAAAATCAAA
TTAGAATTTT TAACAGATGA TACAGACAGC GCCAAAAAAG CTGCTGAGTT TTTCCAATTT
CAATTGGAAG AAAATCTAGA TGGATTAGAA GTGAATGTTA CTCAAGTTCC TTTTACTATT
CGTGTTGATC GTCATCAAAC GAGACACTAT CATTTAGAAT TATCTCCTTG GGGAACCGAT
TATCGTGATC CATTAACAGT TATGCGCATC TTTACTTCGG ATAGTACCTT GGGCGGCGTA
ACGTTCAAGA GTGATACGTA TGATCAATTA ATTCAAGAAA CTAGAACAAC ACATGCGGCT
GATCAAGAGG CTCGTTTAAA TGACTTTGCT CAAGCACAAG ATATTTTGGT GAATCACGAA
ACCGTTTTAG CACCAATCTA CAATCGAAGC ATTTCTGTAT TAGCTAATCA AAAAATCAAG
GATCTGTATT GGCATTCATT TGGACCCACG TACAGTTTAA AATGGGCTTA TGTTAACTAA
EF023-2 (SEQ ID NO:86) MKKLK MLGCVGLLLA LTACQAGTGN SADSNKAAEQ
KIAISSEAAI STNEPHTAGD TTSTLVMNQV YECLYVLCKE DELELGVAAE EPAISEDETV
YTFKIREDAK WSNDDPVTAN DFVYAWQQVA SPKSCSIHQA LFFDVIKNAK EIALEGADVN
TLGVKALDDK TLEITLERPT PYLKSLLSFP VLFPQNEKYI KEQGDKYATD AEHLIYNCPF
KLKEWDNASS DDWTYEKNDT YWDAEKVKLT EAKVSVIKSP TTAVNLFDSN ELDVVNKLSG
EFTPGYVDNP AFLSIPQFVT YELKNNSVRD GKENPALANN NIRKALAQAF DKESFVKEVL
QDQSTATDQV IPPGQTIAPD GTDFTKLAAK KNNYLTYDTA KAKEFWEKGK KEIGLDKIKL
EFLTDDTDSA KKAAEFFQFQ LEENLDGLEV NVTQVPFTIR VDRDQTRDYD LELSGWGTDY
RDPLTVMRIF TSDSTLGGVT FKSDTYDQLI QETRTTHAAD QEARLNDFAQ AQDILVNQET
VLAPTYNRSI SVLANQKIKD LYWHSFGPTY SLKWAYVN EF023-3 (SEQ ID NO:87)
GGCAACGGGA AACTCGGCTG ATAGTAACAA AGCAGCGGAA CAAAAAATTG CAATTACTTC
TGAAGCGGCT ATTTCGACAA TGGAACCACA CACAGCCGGG GATACGACCT CGACTTTAGT
CATGAATCAA GTTTATGAAG GACTCTATGT TTTACGTAAA GAAGATGAAT TAGAGTTGGG
GGTCGCTGCC GAAGAACCAG CGATTTCTGA ACATGAAACC GTTTATACAT TTAAGATTAG
AGAAGATGCC AAATGGTCGA ATGATGATCC AGTAACAGCA AACGACTTTG TTTATGCATG
GCAACAAGTT GCTTCCCCTA AATCAGGATC GATTCATCAA GCTTTATTTT TTGATGTCAT
TAAAAATGCT AAGGAAATTG CTTTAGAAGG CGCAGATGTG AATACTCTTG GCCTTAAGGC
GCTAGATGAT AAAACGTTAG AAATAACTTT AGAACCGCCC ACCCCTTATT TGAAATCATT
ACTTTCGTTT CCTGTTTTGT TTCCACAAAA TGAAAAATAT ATCAAAGAAC AAGGGGATAA
ATATGCTACT GATGCAGAAC ATTTGATTTA TAATGGTCCT TTTAAATTGA AAGAATGGGA
TAATGCCTCT TCTGATGACT GGACCTACGA AAAAAATGAT ACGTATTGGG ATGCTGAAAA
AGTTAAATTA ACAGAAGCGA AAGTTTCAGT AATTAAGACC CCAACGACAG CGGTGAATTT
GTTTGACTCG AATGAATTGG ATGTAGTGAA TAAGCTAAGT GGTGAATTTA TTCCTCGTTA
TGTTGATAAT CCAGCCTTTC TTTCAATTCC TCAATTCGTC ACATACTTTT TAAAAATGAA
CAGCGTTCGT GATGGAAAAG AAAATCCGGC TTTAGCGAAC AACAATATTC GTAAAGCGTT
GGCACAAGCT TTTGATAAAG AAAGTTTTGT AAAAGAAGTC TTGCAAGATC AATCAACGGC
TACAGATCAA GTAATTCCGC CGGGACAAAC CATTGCGCCA GATGGAACAG ATTTCACAAA
ACTAGCTGCT AAGAAAAATA ACTACTTAAC CTACGATACA GCGAAAGCAA AAGAATTCTG
GGAAAAAGGG AAAAAAGAAA TTGGGCTGGA TAAAATCAAA TTAGAATTTT TAACAGATGA
TACAGACAGC GCCAAAAAAG CTGCTGAGTT TTTCCAATTT CAATTCGAAG AAAATCTAGA
TGGATTAGAA GTGAATGTTA CTCAAGTTCC TTTTACTATT CGTGTTGATC GTGATCAAAC
GAGAGACTAT GATTTAGAAT TATCTGGTTG GGGAACCGAT TATCGTGATC CATTAACAGT
TATGCGCATC TTTACTTCGG ATAGTACCTT GGGCGGCGTA ACGTTCAAGA GTGATACGTA
TGATCAATTA ATTCAAGAAA CTAGAACAAC ACATGCGGCT GATCAAGAGG CTCGTTTAAA
TGACTTTGCT CAAGCACAAG ATATTTTGGT GAATCAGGAA ACGGTTTTAG CACCAATCTA
CAATCGAAGC ATTTCTGTAT TAGCTAATCA AAAAATCAAG GATCTGTATT GGCATTCATT
TGGACCCACG TACAGTTTAA AATGGGCTTA TGTTAAC EF023-4 (SEQ ID NO:88)
GTGN SADSNKAAEQ KIAISSEAAI STMEPHTAGD TTSTLVMNQV YEGLYVLGKE
DELELGVAAE EPATSEDETV YTFKIREDAK WSNDDPVTAN DFVYAWQQVA SPKSGSIHQA
LFFDVIKNAK EIALEGADVN TLGVKALDDK TLEITLERFT PYLKSLLSFP VLFPQNEKYT
KEQGDKYATD AEHLIYNGPF KLKEWDNASS DDWTYEKNDT YWDAEKVKLT EAKVSVIKSP
TTAVNLFDSN ELDVVNKLSG EFIPGYVDNP AFLSIPQFVT YFLKNNSVRD GKENPALANN
NIRKALAQAF DKESFVKEVL QDQSTATDQV IPPGQTIAPD GTDFTKLAAK KNNYLTYDTA
KAKEEWEKOK KEIGLDKTKL EFLTDDTDSA KKAAEFFQFQ LEENLDGLEV NVTQVPFTIR
VDRDQTRDYD LELSGWGTDY RDPLTVMRIF TSDSTLGGVT FKSDTYDQLI QETRTTHAAD
QEARLNDFAQ AQDILVNQET VLAPIYNRSI SVLANQKIKD LYWHSFGPTY SLKWAYVN
EF024-1 (SEQ ID NO:89) TAATGGCCGT TTCGTCTACT AATAAAGAGG ATGAAGCTAC
TCAAATGGCG TTGGCAATGG AACAAGGATC ATAAAAAAGG AGAAGTGAGC ATGAAAAAAG
TACTACCTTT TATTGCCTTA GTCGGCTTGT TATTGTTGTC AGGTTGTGGA ACAGATATGA
AAAAGATATT GACTGCCGAT GGTGGTAAAT GGAAAGTGGA AGAAACACGT GCAACTTACA
CTTTTTTTGA TGACGGTAAA TTTTCAGCTA ATGACTCAGA GGATACTGTT AGTGGGACAT
ACACTTATGA TGAAAAAAAT AAAAAAATAA CCTTTGACNT TACTAGCAGN AACTCTTTCA
TTATGGAAAA AGTNGANTNC AANGNTANCA AGATTACAGG GGAAATTGGC GAAAAACAAA
GAACACTTAT AAAACAAAAA ACAGAATAA 5F024-2 (SEQ ID NO:90) M KKVLPFIALV
GLLLLSGCGT DMKKILTADG GKWKVEETRA TYTFFDDGKF SANDSEDSVS GTYTYDEKNK
KITFDXTSXN SFIMEKVXXX XXKITGEIGE KQRTLIKQKT E EF024-3 (SEQ ID
NO:91) ATT GACTGCCGAT GGTGGTAAAT GGAAAGTGGA AGAAACACGT GCAACTTACA
CTTTTTTTGA TGACGGTAAA TTTTCAGCTA ATGACTCAGA GGATAGTGTT AGTGGGACAT
ACACTTATGA TGAAAAAAAT AAAAAAATAA CCTTTGACNT TACTAGCAGN AACTCTTTCA
TTATGGAAAA AGTNGANTNC AANGNTANCA AGATTACAGO GGAAATTGGC GAAAAACAAA
GAACACTTAT AAAACAAAAA ACAGAA EF024-4 (SEQ lED NO:92) LTADG
GKWKVEETRA TYTFFDDGKF SANDSEDSVS GTYTYDEKNK KITFDXTSXN SFIMEKVXXX
XXKIETGEIGE KQRTLIKQKT E SF025-1 (SEQ ID NO:93) TGAATGAAAC
ATATTAAAGG AATGTTGGTT TTTATCGGAT TATTTATTTT GGTTGGTTGT GCGCCAGATC
AAGAGCCAAC GAAACAAACA ACAAGTGGTC CGCAAGAGAC AAAGCAAGTG AAGCAAGTTA
CCGTCACCAA TCAAACGACT TCTGCGGTGG AAAAACAAGC GCCGACTAAA AATGACGAAC
TGATTGCTAA TCAATTGACT TTTGATTCTC ATGAATACAC GTACGAAGTG GTTACAGGGG
CCACACAAAC GACATTTGGA ACAACCCCAC CAGCAAAATA TACACCGGAA GAAAAAAAGA
AAAAAATGTT TTGGTCCAAT CAACCGCCTT TGGGATTAAT GACGGGTAAC TATTATAAAA
ATGAAGGTGT ATTTACTGGC GGAAATTACG GCATTGTAGA GATTATTACG GAACCTGAAA
CGCAAAGGAT TCTGAATGTT GAGTTTACAG AGTTTGCTAG TGATCCTTAT TATGATACAC
GCTATTCGGG
TGTCAACAAA CGCCTGTCGG ATTATCCTGA ATTTCAAGCA AGCAACACGC GTACAGACGA
TACGTTAGTC ACCGTTGTTA ATGGTATTAC TTATGTAGAA AAACAAATGC GTGACGAAAA
TCGTGTTACA GGTAATTTTT ATACGGTACG CGGTTCATCA ACTTCTGCGC GTGAAGGATT
AATCCCTTTA GCAGCAGAGA TGGACACTTG GCTAAAAGAG CCATCGAAAG AAACCTATAT
CGGTTACGCA GAAGATTTAG GCAATGGCCT AATCGCTCGA CTTCAAGTGA TAACAGAAGA
CCAGAAAATA AAACATGTCA GCTATGATGA ATACTTTTCA GATGAACAGG AAAAAATCAC
AGAAACAGCC TGCGGCCTTT TTATCGTCAA TCGAAATATT ATTCACCAGG ATACAATAAA
CAAACCAACA ATTCTTTTAT TCATTTTGTA G EF025-2 (SEQ ID NO:94)
MKHIKGMLVF IGLFILVGCA PDQEPTKQTT SGPQETKQVK QVTVTNQTTS AVEKQAPTKN
DELIANQLTF DSHEYTYEVV TGATQTTFGT TPPAKYTPEE KKKKMFWSNQ PPLGLMTGNY
YKNEGVFTGG NYGIVEIITE PETQRTLNVE FTEFASDPYY DTRYSGVNKR LSDYPEFQAS
NTRTDDTLVT VVNGITYVEK QMRDENRVTG NFYTVRGSST SAREGLMPLA AEMDTWLKEP
SKETYIGYAE DLGNGLIARL QVITEEQKIK HVSYDEYFSD EQEKITETAC GLFIVNRNII
HQDTINKPTT LLFIL EF025-3 (SEQ ID NO:95) AAC GAAACAAACA ACAAGTGGTC
CGCAAGAGAC AAAGCAAGTG AAGCAAGTTA CCGTCACCAA TCAAACGACT TCTGCGGTGG
AAAAACAAGC GCCGACTAAA AATGACGAAC TGATTGCTAA TCAATTGACT TTTGATTCTC
ATGAATACAC GTACGAAGTC GTTACAGGGG CCACACAAAC GACATTTGGA ACAACCCCAC
CAGCAAAATA TACACCGGAA GAAAAAAAGA AAAAAATGTT TTGGTCCAAT CAACCGCCTT
TGGGATTAAT GACGGGTAAC TATTATAAAA ATGAAGGTGT ATTTACTGGC GGAAATTACG
GCATTGTAGA GATTATTACG GAACCTGAAA CGCAAAGGAT TCTGAATGTT GAGTTTACAG
ACTTTGCTAG TGATCCTTAT TATGATACAC GCTATTCCCG TGTCAACAAA CGCCTGTCGG
ATTATCCTGA ATTTCAAGCA ACCAACACGC GTACAGACGA TACGTTAGTC ACCGTTGTTA
ATGGTATTAC TTATGTAGAA AAACAAATGC GTGACGAAAA TCGTGTTACA GGTAATTTTT
ATACGGTACG CGGTTCATCA ACTTCTGCGC GTGAAGGATT AATGCCTTTA GCAGCAGAGA
TGGACACTTG GCTAAAAGAG CCATCGAAAG AAACGTATAT CGGTTACGCA GAAGATTTAG
GCAATGGCCT AATCGCTCGA CTTCAAGTGA TAACAGAAGA GCAGAAAATA AAACATGTCA
GCTATGATGA ATACTTTTCA GATGAACAGG AAAAAATCAC AGAAACAGCC TGCGGCCTTT
TTATCGTCAA TCGAAATATT ATTCACCAGG ATACAATAAA CAAACCAACA ATTCTTTTAT
TCATTTTG EF025-4 (SEQ ID NO:96) TKQTT SGPQETKQVK QVTVTNQTTS
AVEKQAPTKN DELIANQLTF DSHEYTYEVV TGATQTTFGT TPPAKYTPEE KKKKNFWSNQ
PPLGLMTGNY YKNEGVFTGG NYGIVEITTE PETQRILNVE FTEFASDPYY DTRYSGVNKP
LSDYPEFQAS NTRTDDTLVT VVNGITYVEK QMRDENRVTG NFYTVRGSST SAREGLMPLA
AEMDTWLKEP SKETYTGYAE DLGNGLIARL QVITEEQKIK HVSYDEYFSD EQEKITETAC
GLFTVNRNII HQDTINKPTI LLFIL EF026-1 (SEQ ID NO:97) TGAGTGTATG
ATTACTCATT TCCCTTTGAA TCAGTTATGA TAAAGGAAGA AATAAATAAA TTTTTTGGAG
GGATTTTCAT GAAAATGTCT AAAGTACTCA CCACTGTTTT GACGGCAACT GCTGCTCTTG
TGTTGCTTAG TGCTTGTTCA TCTGATAAAA AAACAGATAG TAGTTCTAGT AGCAAAGAAA
CAGCTAATTC AAGTACAGAA GTAGTCTCTG CTCCTTCAAT TAGTGCCAAG CCTGAAGAGC
TCGAAATGGC GTTAAGTGAT AAAGGAAATT GGATTGTCGC AGCTACTGAC AATGTCACTT
TTGATAAAGA GGTAACAGTT GCTGGTACTT TCCATGATAA GGGGAAAGAT TCCAACGATG
TCTATCGTAA ATTAGCACTT TATTCCCAAG ATGATAATAA AAAAGTAACT GCTGAATATG
AAATCACGGT TCCTAAGCTA ATCGTTTCTT CTGAAAATTT CAACATCGTT CACGGGACTG
TCAAAGGTGA TATTGAGGTG AAAGCAAATG GCTTTACTTT AAATGGTACC AAAGTTAATG
GCAATATTAC TTTTGATAAA CAAGAATACA AAGATTCTGC TGACTTAGAA AAAGATGGTG
CCACTGTTAC TGGTGAAGTC ACCGTAGCCA ATAATTAA EF026-2 (SEQ ID NO:98)
MKMSK VLTTVLTATA ALVLLSACSS DKKTDSSSSS KETANSSTEV VSGASISAKP
EELEMALSDK GNWIVAATDN VTFDKEVTVA GTFHDKGKDS NDVYRKLALY SQDDNKKVTA
EYEITVPKLI VSSENFNIVH GTVKGDIEVK ANGFTLNGTK VNGNITFDKQ EYKDSADLEK
DGATVTGEVT VANN EF026-3 (SEQ ID NO:99) AACAGATAG TAGTTCTAGT
AGCAAAGAAA CAGCTAATTC AAGTACAGAA GTAGTCTCTG GTGCTTCAAT TAGTGCCAAG
CCTGAAGAGC TCGAAATGGC GTTAAGTGAT AAAGGAAATT GGATTGTCGC AGCTACTGAC
AATGTCACTT TTGATAAAGA GGTAACAGTT GCTGGTACTT TCCATGATAA GGGGAAAGAT
TCCAACGATG TCTATCGTAA ATTAGCACTT TATTCCCAAG ATGATAATAA AAAAGTAACT
GCTGAATATG AAATCACGGT TCCTAAGCTA ATCGTTTCTT CTGAAAATTT CAACATCGTT
CACGGGACTG TCAAAGGTGA TATTGAGGTG AAAGCAAATG GCTTTACTTT AAATGGTACC
AAAGTTAATG GCAATATTAC TTTTGATAAA CAAGAATACA AAGATTCTGC TGACTTAGAA
AAAGATGGTG CCACTGTTAC TGGTGAAGTC ACCGTAGCCA ATAAT EF026-4 (SEQ ID
NO:100) TDSSSSS KETANSSTEV VSGASISAKP EELEMALSDK GNWIVAATDN
VTFDKEVTVA GTFHDKGKDS NDVYRKLALY SQDDNKKVTA EYEITVPKLI VSSENFNIVH
GTVKGDIEVK ANGFTLNGTK VNGNITFDKQ EYKDSADLEK DGATVTCEVT VANN EF027-1
(SEQ ID NO:101) TTTGGTATGA AACAGAAAAA GTGGTTAATC GGACTTGTTG
CACTGGGCTT GGTTTTAGCA GCATGTGGAA GTGGCGGTTC GAAAACGACC TCAAACGAAC
CAGCTACACA GAAAATTAAC GTCGCATCTG GTGGTGAACT CTCGACATTA GACAGCGCTC
ATTATACAGA TGTCTATAGT TCCGATATGA TTGGTCAAGT AGTTGAAGGC TTGTATCGAC
AAGATAAAAA CGGAGATCCT GAGCTAGCTA TGGCGAAAGC AGAGCCACAA GTTAGTGAAG
ACGGGTTAGT CTATACATTC AAGTTACGAG AAGCAAAATG GACAAACGGG GATCCAGTTA
AAGCAGGGGA TTTTGTAGTT GCGTTTAGAA ACGTGGTCGA TCCAGCATAC GGTTCAAGTA
GCAGTAATCA AATGGATATT TTTAAAAATG GGCGTGCGGT GCGGGAAGGA CAAGCCACGA
TGGAAGAATT TGGTGTCAAA GCAATCGATG ACCAGACACT AGAACTAACA TTGGAAAATC
CAATTCCTTA TTTAGCCCAA GTCTTGGTTG GGACACCTTT TATGCCTAAA AATGAAGCCT
TTGCCAAAGA AAAAGGTACT GCCTATGGGA CTTCTGCAGA TAATTTTGTT GGCAATGGGC
CGTTTGTAAT TTCAGGTTGG GATGGCAATT CCGAAACTTG GAAATTGAAG AAGAATGATC
ATTATTGGGA TAAAGAACAC GTAAAATTGA ATGAAATTGA TGTTCAAGTA GTGAAAGAAA
TTGGCACAGG AGCCAATCTT TTTGATAATG GCGACTTAGA TTACACTGTT TTAGCAGATA
CTTATGCACT TCAGTATAAA GAGTCAAAAC AAGCGCATTT TGTACCTAAA GCCATGGTGG
GTTATTTAAG CCCCAATCAT CGCCGTGAAA TTACCGGCAA CGAACATGTT CGAAAAGCTT
TTTTACAAGC GATTGACAAA GAAACTTTTG CAAAAGAAAT TTTAGGAGAT GGCTCGACAG
CTTTAAATGG NTTTGTACCA GCTAATTTTG CAAAAATCCA GATACAGGTG AAGATTTCCG
CAAAGAAAAT GGTGATTTAT TGCCATATAA TATTAAAGAA GCCCAAGCTA ACTGGAACAA
TT EF027-2 (SEQ ID NO:102) MKQKKWLI GLVALGLVLA ACGSGGSKTT
SNEPATQKIN VASGGELSTL DSAHYTDVYS SDMIGQVVEG LYRQDKNGDP ELAMAKAEPQ
VSEDGLVYTF KLREAKWTNG DPVKAGDFVV AFRNVVDPAY GSSSSNQMDI FKNGRAVREG
QATMEEFGVK AIDDQTLELT LENPIPYLAQ VLVGTPFMPK NEAFAKEKGT AYGTSADNFV
GNGPFVISGW DGNSETWKLK KNDHYWDKEH VKLNEIDVQV VKEIGTGANL FDNGDLDYTV
LADTYALQYK ESKQAHFVPK AMVGYLSPNH RREITGNEHV RKAFLQAIDK ETFAKEILGD
GSTALNGFVP ANFAKIQIQV KISAKKNVIY CHIILKKPKL TCTI EF027-3 (SEQ ID
NO:103) AACGACC TCAAACGAAC CAGCTACACA GAAAATTAAC GTCGCATCTG
GTGGTGAACT CTCGACATTA GACAGCGCTC ATTATACAGA TGTCTATAGT TCCGATATGA
TTGGTCAAGT AGTTGAAGGC TTGTATCGAC AAGATAAAAA CGGAGATCCT GAGCTAGCTA
TGGCGAAAGC AGAGCCACAA GTTAGTGAAG ACGGGTTAGT CTATACATTC AAGTTACGAG
AACCAAAATG GACAAACGGG GATCCAGTTA AAGCAGGGGA TTTTGTAGTT GCGTTTAGAA
ACGTGGTCGA TCCAGCATAC GGTTCAAGTA GCAGTAATCA AATGGATATT TTTAAAAATG
GGCGTGCGGT GCGGGAAGGA CAAGCCACGA TGGAAGAATT TGGTGTCAAA GCAATCGATG
ACCAGACACT AGAACTAACA TTGGAAAATC CAATTCCTTA TTTAGCCCAA GTCTTGGTTG
GGACACCTTT TATGCCTAAA AATGAAGCCT TTGCCAAAGA AAAAGGTACT GCCTATGGGA
CTTCTGCAGA TAATTTTGTT GGCAATGGGC CGTTTCTAAT TTCAGGTTGG GATGGCAATT
CCGAAACTTG GAAATTGAAG AAGAATGATC ATTATTGGGA TAAAGAACAC GTAAAATTGA
ATGAAATTGA TGTTCAAGTA GTGAAAGAAA TTGGCACAGG AGCCAATCTT TTTGATAATG
GCGACTTAGA TTACACTGTT TTAGCAGATA CTTATGCACT TCAGTATAAA GAGTCAAAAC
AAGCGCATTT TGTACCTAAA GCCATGGTGG GTTATTTAAG CCCCAATCAT CGCCGTGAAA
TTACCGGCAA CGAACATGTT CGAAAAGCTT TTTTACAAGC GATTGACAAA GAAACTTTTG
CAAAAGAAAT TTTAGGAGAT GGCTCGACAG CTTTAAATGG NTTTGTACCA GCTAATTTTG
CAAAAATCCA GATACAGGTG AAGATTTCCG CAAAGAAAAT GGTGATTTAT TGCCATATAA
TATTAAAGAA GCCCAAGCTA A EF027-4 (SEQ ID NO:104) TT SNEPATQKIN
VASGGELSTL DSAHYTDVYS SDMIGQVVEG LYRQDKNGDP ELANAKAEPQ VSEDGLVYTF
KLREAKWTNG DPVKAGDFVV AFRNVVDPAY GSSSSNQMDI FKNGRAVREG QATMEEFGVK
AIDDQTLELT LENPIPYLAQ VLVGTPFMPK NEAFAKEKGT AYGTSADNFV GNGPFVISGW
DGNSETWKLK KNDHYWDKEH VKLNEIDVQV VKEIGTGANL FDNGDLDYTV LADTYALQYK
ESKQAHFVPK AMVGYLSPNH RREITGNEHV RKAFLQAIDK ETFAKEILGD GSTALNGFVP
ANFAKIQTQV KISAKKMVIY CHIILKKPKL EF028-1 (SEQ ID NO:105) TAACAGAAGC
AATACAACAA CTTAACACTT TGTTTACTTG TTATTTATCA GAAATCAACT AAGACTTGTT
ATAGTCAATG TATGGGTAGA TATGAAGGAG GAAACAAGGA AATGAAGAAA AGAGCTTTGC
TAGGGGTTAC CTTATTAACA TTCACAACAT TAGCGGGTTG TACAAATTTA TCTGAACAGA
AAAGCGGCGA AAAACAAACA GAGGTTGCTG AAGCGAAGGC AACTGAATCT GAAAAAGCAT
CAGTAAAAAA TGTTATTTTT ATGATTGGAG ATGCCATGGG GAATCCGTAT ACAACGGGCT
ATCGCTATTT CAAAGCCAAT CACTCAGACA AGCGTGTTCC CCAAACAGCT TTTGATACCT
ATTTGGTCGG ACAGCAAGCC ACTTATCCAG AAGATGAAGA AGAGAATGTC ACCGATTCAG
CTTCCGCAGC GACAGCGATG GCTGCCGGAG TGAAAACCTA TAATAATGCT ATTGCACTCG
ATAATGACAA GTCCAAAACA GAAACAGTGC TCGAACGTGC GAAAAAAGTG GGGAAATCAA
CGGGTCTTGT AGCAACATCT GAAATAACAC ATGCAACCCC TGCTGCATAT GGCGCACATA
ATGTTTCACG CAAAAATATG GCAGAAATCG CCGATGACTA TTTTGATGAT CAAATCGACG
CACAACACAA AGTCGATGTG TTACTTGGCG GCGGCTCCGA ATTATTTGCC CGGAAAGATC
GTGATTTAGT CAAAGAATTT TCCCAAGCGG GTTATGGTCA TGTCACAGAC AAAAACTCGT
TAAATGAGAA CCAAGACGAC AAAATTTTAG GCTTGTTTGC ACCAGGCGGG CTACCTAAAA
TGATTGACCG AACGGAAGAA GTCCCTTCAT TAGCTGATAT GACAGAAGCG GCTCTTCAAC
GGTTAGATAA AAATGAAAAA GGTTTCTTTT TAATGGTTGA AGGTAGTCAA ATTGATTGGG
CCGGGCATAG CAATGATATT GTTGGCGCGA TGAGCGAAAT GCAAGACTTC GAAGCGGCGT
TTCAAAAGGC CATCGATTTT GCCAAAAAAG ATGGTGAACA TTGGTGGTTA CAACTGCAGA
TCATTCAACA GGGGGCTTGT CTTTAG EF028-2 (SEQ ID NO:106) MKKR
ALLGVTLLTF TTLAGCTNLS EQKSGEKQTE VAEAKATESE KASVKNVIFM IGDGMGNPYT
TGYRYFKANH SDKRVPQTAF DTYLVCQQAT YPEDEEENVT DSASAATAMA AGVKTYNNAI
ALDNDKSKTE TVLERAKKVG KSTGLVATSE ITHATPAAYG AHNVSRKNNA EIADDYFDDQ
IDGQHKVDVL LGGGSELFAR KDRDLVKEFS QAGYGHVTDK KSLNENQDDK ILGLFAPGGL
PKNIDRTEEV PSLADMTEAA LQRLDKNEKG FFLMVEGSQI DWAGHSNDIV GAMSEMQDFE
AAFEKAIDFA KKDGEHWWLQ LQTTQQGACL EF028-3 (SEQ ID NO:107) ACAGA
AAAGCGGCGA AAAACAAACA GAGGTTGCTG AAGCGAAGGC AACTGAATCT GAAAAAGCAT
CAGTAAAAAA TGTTATTTTT ATGATTGGAG ATGGCATGGG GAATCCGTAT ACAACGGGCT
ATCGCTATTT CAAAGCCAAT CACTCAGACA AGCGTGTTCC CCAAACAGCT TTTGATACCT
ATTTGGTCGG ACAGCAAGCC ACTTATCCAG AAGATGAAGA AGAGAATGTC ACCGATTCAG
CTTCCGCAGC GACAGCGATG GCTGCCGGAG TGAAAACCTA TAATAATGCT ATTGCACTCG
ATAATGACAA GTCCAAAACA GAAACAGTGC TCGAACGTGC GAAAAAAGTG GGGAAATCAA
CGGGTCTTGT AGCAACATCT GAAATAACAC ATGCAACCCC TGCTGCATAT GGCGCACATA
ATGTTTCACG CAAAAATATG GCAGAAATCG CCGATGACTA TTTTGATGAT CAAATCGACG
GACAACACAA AGTCGATGTG TTACTTGGCG GCGGCTCCGA ATTATTTGCC CGGAAAGATC
GTGATTTAGT CAAAGAATTT TCCCAAGCGG GTTATGGTCA TGTCACAGAC AAAAAGTCGT
TAAATGAGAA CCAAGACGAC AAAATTTTAG GCTTGTTTGC ACCAGGCGGG CTACCTAAAA
TGATTGACCG AACGGAAGAA GTCCCTTCAT TAGCTGATAT GACAGAAGCG GCTCTTCAAC
GGTTAGATAA AAATGAAAAA GGTTTCTTTT TAATGGTTGA AGGTAGTCAA ATTGATTGGG
CCGGGCATAG CAATGATATT GTTGGCGCGA TGAGCGAAAT GCAAGACTTC GAAGCGGCGT
TTGAAAAGGC CATCGATTTT GCCAAAAAAG ATGGTGAACA TTGGTGGTTA CAACTGCAGA
TCATTCAACA GGGGGCTTGT CTT EF028-4 (SEQ ID NO:108) QKSGEKQTE
VAEAKATESE KASVKNVIFM IGDGMGNPYT TGYRYFKANH SDKRVPQTAF DTYLVGQQAT
YPEDEEENVT DSASAATAMA AGVKTYNNAT ALDNDKSKTE TVLERAKKVG KSTGLVATSE
ITHATPAAYG AHNVSRKNMA EIADDYFDDQ IDGQHKVDVL LGGGSELFAR KDRDLVKEFS
QAGYGHVTDK KSLNENQDDK ILGLFAPGGL PKMIDRTEEV PSLADMTEAA LQRLDKNEKG
FFLMVEGSQI DWAGHSNDIV GAMSEMQDPE AAFEKAIDFA KKDGEHWWLQ LQI IQQGACL
EF029-1 (SEQ ID NO:109) TGAAGGAGGG AGAAAATGAA AAAGTTAATC GGTAAAAAGT
GGCTGCTGCT TACAGCAGTA GCCACTTTTT TATTATCAGG ATGCGCAAGT CTTGAACAAA
AAGCACAGGA TAGTGTAAAA GAAGTTACTG AAAATGTTAC TCAAACTATT TCAAACGATC
AACGTATACC AGCTGATTTT GTTAGGCACG TGGATGGCGA TACCACAGTA TTAAAAATTG
ACGGAAAAGA ACAAAAACTT CGGTTTTTAT TAATTGACAC ACCCGAGACT GTGAAACCGA
AAACAAAAGT TCAGCCGTTC GGATTGGAAG CTAGCAAACG CACAAAAGAG CTTTTGTCTA
CTGCTTCAGA AATTACGTTT GAATATGATA AGGGCGATAA AACAGATCGT TACGGACGAG
CGTTGGGCTA CATATTCGTA GATGGAACAT TACTACAAAA AACGCTTGTA AGTCAAGGAT
TAGCTCGTGT TGCCTATGTA AAAGACCCTA CAACTAAGTA TTTGGCAGAA CTAGAGCAAG
CCCAAGAACA GGCTAAAAAT GAGTCACTCG GAATCTGGAG CATACCAGGT TATGTGACAC
AACGGGGGTT TAGTAAATAA EF029-2 (SEQ ID NO:110) MKKLIG KKWLLLTAVA
TFLLSGCASL EQKAQDSVKE VTENVTQTIS NDQRIPADFV RHVDGDTTVL KIDGKEQKVR
FLLIDTPETV KPKTKVQPFG LEASKRTKEL LSTASEITFE YDKGDKTDRY GRALGYIFVD
GTLLQKTLVS EGLARVAYVK EPTTKYLAEL EQAQEQAKNE SLGIWSIPGY VTQRGFSK
EF029-3 (SEQ ID NO:111) AAATGTTAC TCAAACTATT TCAAACGATC AACGTATACC
AGCTGATTTT GTTAGGCACG TGGATGGCGA TACCACAGTA TTAAAAATTG ACGGAAAAGA
ACAAAAAGTT CGGTTTTTAT TAATTGACAC ACCCGAGACT GTCAAACCGA AAACAAAAGT
TCAGCCGTTC GGATTGGAAG CTAGCAAACG CACAAAAGAG CTTTTGTCTA CTGCTTCAGA
AATTACGTTT GAATATGATA AGGGCGATAA AACAGATCGT TACGGACGAG CGTTGGGCTA
CATATTCGTA GATGGAACAT TACTACAAAA AACGCTTGTA AGTGAACGAT TAGCTCGTCT
TGCCTATGTA AAAGAGCCTA CAACTAAGTA TTTGGCAGAA CTAGAGCAAG CCCAAGAACA
GGCTAAAAAT GAGTCACTCG GAATCTGGAG CATACCAGGT TATGTGACAC AACGCGGGTT
TAGTAAA EF029-4 (SEQ ID NO:112) NVTQTIS NDQRIPADFV RHVDGDTTVL
KIDGKEQKVR FLLTDTPETV KPKTKVQPFG LEASKRTKEL LSTASEITFE YDKGDKTDRY
GRALGYIFVD GTLLQKTLVS EGLARVAYVK EPTTKYLAEL EQAQEQAKNE SLGIWSIPGY
VTQRGFSK EF030-1 (SEQ ID NO:113) TGATTGACAC ATAGGGGGAA TAGTATGAAA
AAGTTAAAAA TGATGGGGAT TATGTTATTT GTTAGTACGG TCTTGGTAGG TTGTGGCACA
ACAGCAGANA CAAAAATAGA CGAGAAAGCA ACTGAGAAAA CCAGTGTCTC GAAAAAAGTT
TTAAATTTAA TGGAGAACTC GGAAATCGGT TCAATGGATT CTATTTTTAC ACAAGATGAA
GCCAGTATTA ACGCACAGTC CAATGTCTTT GAAGGGTTAT ATCAATTGGA TGAAAAAGAT
CAACTAATAC CTGCTGCTGC TAAAGAGATG CCAGAAATTT CTGAGGATGG CAAACGATAT
ACCATTAAAC TAAGAGAAGA TCGCAAGTGG TCCAATGGTG ATGCTGTAAC AGCCAATGAT
TTCGTTTTTG CTTGGCGTAA ATTAGCGAAT CCCAAAAACC AAGCCAATTA CTTTTTCTTG
TTAGAAGGAA CGATTCTGAA CGGAACAGCT ATTACAAAAG AGGAAAAAGC ACCAGAGGAA
TTGGGTGTCA AAGCGCTTGA TGATTATACT TTGGAGGTTA CTTTAGAAAA GCCTGTACCA
TATTTTACGT CGTTATTGGC ATTTTCTCCA TTTTTCCCAC AAAACGAAGC ATTCGTGAAA
GAAAAAGGAC AAGCCTATGG CACTTCTAGT GAAATGATTG TATCTAATGG TCCGTTTTTA
ATGAAAAATT GGGATCAGTC AGCGATGTCG TGGGATTTTG TGCGTAATCC CTACTATTAC
GATAAACAAA AAGTAAAATC AGAAACGATT CATTTTGAAG TTCTTAAAGA AACCAATACC
GTTTATAATT TGTACGAATC AGGTGAATTA GATGTGGCTG TCTTAACAGG AGATTTTGCT
AAACAAAATC GAGACAACCC AGACTATGAA GCAATCGAAC GGTCAAAAGT CTATTCCTTA
CGTTTAAACC AAAAAAGAAA CGAAAAACCA TCCATTTTTG CAAATGAGAA TGTCCGCAAA
GCTTTAGCTT ATGCTTTGGA TAAAAAAAGT TTAGTCGATA ATATTTTAGC AGATGGCTCA
AAAGAAATTT ATGGGTACAT TCCAGAAAAA TTTGTATATA ACCCAGAAAC GAATGAAGAT
TTTCGTCAAG AAGCAGGCGC TCTTGTCAAA ACAGACGCCA AAAAAGCCAA AGAGTATTTA
GATAAAGCAA AAGCAGAGCT AAACGGAGAT GTAGCCATTG AACTTCTTTC AAGAGATGGT
GATAGTGACC GA EF030-2 (SEQ ID NO:114) MKK LKMMGIMLFV STVLVGCGTT
AXTKIDEKAT EKTSVSKKVL NLMENSEIGS MDSIFTQDEA SINAQSNVFE GLYQLDEKDQ
LIPAAAKEMP EISEDGKRYT IKLREDGKWS NGDAVTANDF VFAWRKLANP KNQANYFFLL
EGTILNGTAI TKEEKAPEEL GVKALDDYTL EVTLEKPVPY FTSLLAFSPF FPQNEAFVKE
KGQAYGTSSE MIVSNGPFLM KNWDQSAMSW DFVRNPYYYD KEKVKSETIH FEVLKETNTV
YNLYESGELD VAVLTGDFAK QNRDNPDYEA IERSKVYSLR LNQKRNEKPS IFANENVRKA
LAYALDKKSL VDNILADGSK EIYGYIPEKF VYNPETNEDF RQEAGALVKT DAKKAKEYLD
KAKAELNGDV AIELLSRDGD SDR EF030-3 (SEQ ID NO:115) GAGAAAGCA
ACTGAGAAAA CCAGTGTCTC GAAAAAAGTT TTAAATTTAA TGGAGAACTC GGAAATCGGT
TCAATGGATT CTATTTTTAC ACAAGATCAA GCCAGTATTA ACGCACAGTC CAATGTCTTT
GAAGGGTTAT ATCAATTGGA TGAAAAAGAT CAACTAATAC CTGCTGCTGC TAAAGAGATG
CCACAAATTT CTGAGGATGG CAAACGATAT ACCATTAAAC TAAGAGAAGA TCCCAAGTGG
TCCAATGGTG ATGCTGTAAC
AGCCAATGAT TTCGTTTTTG CTTGGCGTAA ATTAGCGAAT CCCAAAAACC AAGCCAATTA
CTTTTTCTTG TTAGAAGGAA CGATTCTGAA CGGAACACCT ATTACAAAAC AGGAAAAAGC
ACCACACGAA TTGGGTGTCA AAGCGCTTGA TOATTATACT TTGGAGGTTA CTTTAGAAAA
GCCTGTACCA TATTTTACGT CGTTATTGGC ATTTTCTCCA TTTTTCCCAC AAAACGAAGC
ATTCGTGAAA GAAAAAGGAC AACCCTATGG CACTTCTAGT GAAATCATTG TATCTAATGG
TCCCTTTTTA ATGAAAAATT GGGATCAGTC AGCGATGTCG TGGGATTTTC TCCGTAATCC
CTACTATTAC CATAAAGAAA AACTAAAATC AGAAACGATT CATTTTGAAG TTCTTAAAGA
AACCAATACC GTTTATAATT TGTACGAATC AGGTGAATTA GATGTGGCTG TCTTAACAGG
AGATTTTGCT AAACAAAATC GAGACAACCC AGACTATGAA GCAATCCAAC CGTCAAAAGT
CTATTCCTTA CGTTTAAACC AAAAAAGAAA CGAAAAACCA TCCATTTTTG CAAATGAGAA
TGTCCGCAAA CCTTTAGCTT ATGCTTTGGA TAAAAAAACT TTACTCGATA ATATTTTACC
AGATGGCTCA AAAGAAATTT ATOGOTACAT TCCAGAAAAA TTTGTATATA ACCCAGAAAC
GAATGAAGAT TTTCGTCAAG AAGCAGGCGC TCTTGTCAAA ACAGACGCCA AAAAAGCCAA
AGAGTATTTA GATAAAGCAA AAGCAGAGCT AAACGGAGAT GTAGCCATTG AACTTCTTTC
AAGAGATGGT EF030-4 (SEQ ID NO:116) EKAT EKTSVSKKVL NLMENSETGS
MDSIFTQDEA SINAQSNVFE GLYQLDEKDQ LIPAAAKEMP EISEDGKRYT IKLREDGKWS
NGDAVTANDF VFAWRKLANP KNQANYFFLL EGTILNGTAI TKEEKAPEEL GVKALDDYTL
EVTLEKPVPY FTSLLAFSPF FPQNEAFVKE KGQAYGTSSE MIVSNGPFLM KNWDQSAMSW
DFVRNPYYYD KEKVKSETTH FEVLKETNTV YNLYESGELD VAVLTGDFAK QNRDNPDYEA
IERSKVYSLR LNQKRNEKPS IFANENVRKA LAYALDKKSL VDNILADCSK EIYGYIPEKF
VYNPETNEDE RQEAGALVKT DAKKAKEYLD KAKAELNGDV AIELLSRDG EF031-1 (SEQ
ID NO:117) TGAGAAATTA GTTATTTTAG AAAAATAAAA ACCATTTTGG ACGAAGATTT
AAAAATGAAA AAACGCGTAA TTTTAGGGAC ATTAGTCGCT GCAACGTTAT TAATCACTGC
TTGTGGAAAC AGCGAAGCAA CTACGAAAAG CGAGAGCAAA GGTGGAAGTA ATGCTTTAGT
CGTTTCAACT TTCGGATTAA GTGAAGATAT TGTCAAAAAA GACATTATCG CTCCATTTGA
AAAAGAGAAT GAAGCGAAAG TTACCTTAGA AGTAGGCAAT ACCCCAGACC GCTTTACGAA
ATTAAAAAAT AATCCCAATG CGGGAATTGA TGTCATTGAA TTAGCACAAG CAAATGCAGC
ACAAGGTGGA AAAGATCCGT TATTTGAAAA AATTACAGAA AAAGAAGTAC CTAATTTAAG
TCAGTTAACC CCGGGAGCAA AAGAGGTTTT TGAAAGTGGT GCTGGCGTAC CAATCGCTGT
AAACAGTATC GGGATTGTTT ACAACAAAGA AAAATTAGGC AAACAAATTA AAAACTGGGA
TGACTTATGG TCAGCTGATT TGAAAGGTAA AATTTCTGTT CCACACGTTG CCACGACGGC
AGGTCCTTTA ATCTTATACG TTGCTAGTGA ACATGCTGGT CAAGATATTA CAAAAGATAA
CGGGAAGGCC GCTTTTGAAC CCATGAAAGA ATTAAAACCA AACGTTGTTA AAACGTATTC
AAAATCGTCA GACTTAGCNA ATATGTTCCA ATCTGGTGAA ATTGAAGCAG CTGTGGTTGC
TGATTTTGCG GTTGATATTA TTCAAGGCGC ACAGAAAACG TGA EF0031-2 (SEQ ID
NO:118) MKK RVTLGTLVAA TLLMTACGNS EATTKSESKG GSNALVVSTF GLSEDIVKKD
IIAPFEKENE AKVTLEVGNS ADRFTKLKNN PNAGIDVIEL AQANAAQGCK DGLFEKITEK
EVPNLSQLTP GAKEVFESGA GVPTAVNSIG IVYNKEKLCK ETKNWDDLWS ADLKGKISVP
DVATTACPLM LYVASEHACQ DITKDNCKAA FEAMKELKPN VVKTYSKSSD LANMFQSGEI
EAAVVADFAV DIIQGAQKT EF031-3 (SEQ ID NO:119) AA CTACGAAAAG
CGAGAGCAAA GGTC-GAAGTA ATGCTTTAGT CGTTTCAACT TTCGGATTAA GTGAAGATAT
TGTCAAAAAA GACATTATCG CTCCATTTGA AAAAGAGAAT GAAGCGAAAG TTACCTTAGA
AGTAGGCAAT AGCGCAGACC GCTTTACGAA ATTAAAAAAT AATCCCAATG CGGGAATTGA
TGTCATTGAA TTAGCACAAG CAAATGCAGC ACAAGGTGGA AAAGATGGGT TATTTGAAAA
AATTACACAA AAAGAAGTAC CTAATTTAAG TCAGTTAACG CCGGGAGCAA AAGAGGTTTT
TGAAAGTGGT GCTGGCGTAC CAATCGCTGT AAACAGTATC GGGATTGTTT ACAACAAAGA
AAAATTAGGC AAAGAAATTA AAAACTGGGA TGACTTATGG TCAGCTGATT TGAAAGGTAA
AATTTCTGTT CCAGACGTTG CCACGACGGC AGGTCCTTTA ATGTTATACG TTGCTAGTGA
ACATGCTGGT CAAGATATTA CAAAAGATAA CGGGAAGGCC CCTTTTGAAG CGATGAAAGA
ATTAAAACCA AACGTTGTTA AAACGTATTC AAAATCGTCA GACTTAGCNA ATATGTTCCA
ATCTGGTGAA ATTGAAGCAG CTGTGGTTGC TGATTTTGCC GTTGATATTA TTCAAGGCGC
ACAGAAAA EF031-4 (SEQ ID NO:120) TTKSESKG GSNALVVSTF GLSEDIVKKD
IIAPFEKENE AKVTLEVGNS ADRFTKLKNN PNAGIDVIEL AQANAAQGCK DGLFEKTTEK
EVPNLSQLTP GAKEVFESGA GVPIAVNSIG IVYNKEKLGK EIKNWDDLWS ADLKGKISVP
DVATTAGPLM LYVASEHAGQ DITKDNGKAA FEAMKELKPN VVKTYSKSSD LANMFQSGEI
EAAVVADFAV DIIQGAQK EF032-1 (SEQ ID NO:121) TGAATAAATT ATTTAGGAGG
AATTATGATG AAAAAATTAA TTAGTTTAGG ATTGGTTTGT GTTTGTGGTA TTTCACTACT
TACTGCTTGT NCGGGAAATA ATGATAATAA AGATACTGAA AAGTCAACCA GTCAATCTAG
CAGCACAGTT AAACAACCGA ATTCAAAAGA CTTTGTTGCG TCAGGGGAAT ATTCAGTTGG
AAAAGATATT GATCCTGGAG ATTACTATGC TGTATTAACT CAACTAGATG ATAAATCGAG
CATAGTTCTT ATTACCGTCA AATCAGGCGG AGAAAATAGT AACCATGACT TATACGGAGT
GGGAAACAAG AAAAAAGTAT CTCTTAAAAA GGGAGATACT CTCACATTCG AAACTGCCGA
CAAAGATTTT GTTGTTAGAT TTTTAAATGA AAAAGATTTT CAAGAATATA TGAAAAATCC
AGTATCNAGT ACTGAAACTA GCAAACANAA AACAGTAAAC TCTGATGTTT CTAAAAGTAG
TAGCCAAGAT AATAAACAAT CTGATGTATC TGAAAAAAAA GAAGTAAGTA CTGAAGCGAA
GTCTGATGTA GCTACTAATA CTTTACCGAG CGAAGATAAA AATACTAATC ACATTACTAA
GCTAGCACAT GAGCCAACCT TAGAACAACA AACCGTCTTA GATACTTTAG CTAAGCATCA
ATTTAATGAT ATGTATCCTT ATAAAGGAAG CAAAATGCAT TCAATTATCG GCGTCATCCC
AACCATGGAC GCAAAAAGAT GGTAA EF032-2 (SEQ ID NO:122) MK KLISLGLVCV
CGISLLTACX GNNDNKDTEK STSQSSSTVK QPNSKDFVAS GEYSVGKDID PGDYYAVLTQ
LDDKSSIVLI TVKSGGENSN HDLYGVGNKK KVSLKKGDTL TFETADKDFV VRFLNEKDFQ
EYMKNPVSST ETSKXKTVNS DVSKSSSQDN KQSDVSEKKE VSTEAKSDVA TNTLPSEDKN
TNDITKLADE PTLEQQTVLD TLAKHQFNDM YPYKGSKMHS IIGVIPTMDA KRW EF032-3
(SEQ ID NO:123) TA ATGATAATAA AGATACTGAA AAGTCAACCA GTCAATCTAG
CAGCACAGTT AAACAACCGA ATTCAAAAGA CTTTGTTGCG TCAGGGGAAT ATTCAGTTGG
AAAAGATATT GATCCTGGAG ATTACTATGC TGTATTAACT CAACTAGATG ATAAATCGAG
CATAGTTCTT ATTACCGTCA AATCAGGCGG AGAAAATAGT AACCATGACT TATACGGAGT
GGGAAACAAG AAAAAAGTAT CTCTTAAAAA GGGAGATACT CTCACATTCG AAACTGCCGA
CAAAGATTTT GTTGTTAGAT TTTTAAATGA AAAAGATTTT CAAGAATATA TGAAAAATCC
AGTATCNAGT ACTGAAACTA GCAAACANAA AACAGTAAAC TCTGATGTTT CTAAAAGTAG
TAGCCAAGAT AATAAACAAT CTGATGTATC TGAAAAAAAA CAAGTAAGTA CTGAAGCGAA
GTCTGATGTA GCTACTAATA CTTTACCGAG CGAAGATAAA AATACTAATG ACATTACTAA
GCTAGCAGAT GAGCCAACCT TAGAACAACA AACCGTCTTA GATACTTTAG CTAAGCATCA
ATTTAATGAT ATGTATCCTT ATAAAGGAAG CAAAATGCAT TCAATTATCG CCGTCATCCC
AACCATGGAC GCAAAAAGAT GG EF032-4 (SEQ ID NO:124) NDNKDTEK
STSQSSSTVK QPNSKDFVAS GEYSVCKDID PGDYYAVLTQ LDDKSSIVLI TVKSGGENSN
HDLYGVGNKK KVSLKKGDTL TFETADKDFV VRFLNEKDFQ EYMKNPVSST ETSKXKTVNS
DVSKSSSQDN KQSDVSEKKE VSTEAKSDVA TNTLPSEDKN TNDITKLADE PTLEQQTVLD
TLAKHQFNDM YPYKGSKMHS IIGVIPTMDA KRW EF033-1 (SEQ ID NO:125)
TCACTGCTTT TTTTCTATTG GAGAAAAAAG TGGTTTTTTT GTATTGTTTT GACGTTGAGA
CAAAGGAGGT TCATTTCAGA AAATTTTCCC CAAAATAAAA TAGACGAATG CGAGGATGAA
AAAATGAAAA AATTTACTTT AACAATGATC ACTTTAGGTT TAGTAGCAAC ACTTGGCTTA
GCAGGATGTG GTAAACAGGA AAAGAAAGCA ACTACCTCTT CTGAAAAAAC AGAAGTAACG
TTACCAACCA AAGACCGTAG CGGCAAAGAA ATTACTTTAC CCAAAGAACC AACCAAAATT
ATTTCCCTAG TGCCATCAAC AACAGAAGTG ATTGAAGACT TAGGTAAAAC CGACCAATTA
ATCGCAGTTG ATACTCAAAG TAGTACAATG ATGACTGATT TAAAAAAATT ACCACAAATG
GATATGATGG CTGTCGATGC CGAAAAATTG ATTGCCTTGA AACCACAAAT TGTTTATGTG
AATGACATCA ATTTAGCTAG CTCAGAAAGT GTTTGGAAGC AAGTGGAAGA TGCTGGAATT
ACAGTCCTTA ATATCCCCAC TAGTACAAGC ATCAAAGCAA TCAAAGAACA CGTCCAATTC
ATCGCTGATA GCTTATCTGA ACATGAAAAA GGACAAAAGT TAATCAAAAC AATGGATCAA
GAAATCGACG AGTAG EF033-2 (SEQ ID NO:126) MKKFTLTMMT LGLVATLGLA
GCGKQEKKAT TSSEKTEVTL PTKDRSGKEI TLPKEATKII SLVPSTTEVI EDLGKTDQLI
AVDTQSSTMM TDLKKLPQMD MMAVDAEKLI ALKPQIVYVN DINLASSESV WKQVEDAGIT
VVNIPTSTSI KAIKEDVQFI ADSLSEHEKG QKLIKTHDQE IDE EF033-3 (SEQ ID
NO:127) CTCTT CTCAAAAAAC AGAAGTAACC TTACCAACCA AAGACCGTAG
CGGCAAAGAA ATTACTTTAC CCAAAGAAGC AACCAAAATT ATTTCCCTAG TGCCATCAAC
AACAGAAGTC ATTGAAGACT TAGGTAAAAC CGACCAATTA ATCGCAGTTG ATACTCAAAG
TAGTACAATG ATGACTGATT TAAAAAAATT ACCACAAATG GATATGATGG CTGTCGATGC
CCAAAAATTG ATTGCCTTGA AACCACAAAT TGTTTATGTG AATGACATCA ATTTAGCTAG
CTCAGAAAGT GTTTGGAAGC AAGTGGAAGA TGCTGGAATT ACAGTCGTTA ATATCCCCAC
TAGTACAAGC ATCAAAGCAA TCAAAGAAGA CGTCCAATTC ATCGCTGATA GCTTATCTGA
ACATGAAAAA GGACAAAAGT TAATCAAAAC AATGGATCAA GAAATCGACG AGTAG
EF033-4 (SEQ ID NO:128) SSEKTEVTL PTKDRSGKEI TLPKEATKII SLVPSTTEVI
EDLGKTDQLT AVDTQSSTMM TDLKKLPQMD MMAVDAEKLI ALKPQIVYVN DINLASSESV
WKQVEDAGIT VVNIPTSTSI KAIKEDVQFI ADSLSEHEKG QKLIKTMDQE IDE EF034-1
(SEQ ID NO:129) TACGAGGGAG TAATCATGAA AAAAATCGGG TATTTTAGTT
GTATTATTTT TTTCATGTTT TTGGTAGGTT GTAGTAATAA CAAAAAAGAA AACGGCAATC
TTTTGAATGC CAGTTCCTTT CCTTTAATAC TCACCACGAT TATTCAAAAA GAACAAGACC
TAACGAAAGG TTCAATTTTT TTCAACAAGG ATAAAACCAT GACGCTTGAA AAAGAATATT
TAGTTAATCC CAATAATGAA GACACAAAAA AAACAAGTAG AACAGAAAAA AACGTATATA
AAAATATTAA AATACAAGAA AATAAAGAGA GCTATGAAAT TATAGGTCAA TTGGACAAAA
AAACGAAAAA AATAGAGTTT AAAAAAGTTG ATGAAGGTAA ACGTATATCT GATGCAGAAG
GTAATGTGTA TGGTGATTTT GGTGGTAAAT AG EF034-2 (SEQ ID NO:130) MKKIGY
FSCIIFFMFL VGCSNNKKEN GNLLNASSFP LILTTIIEKE EDLTKGSIFF NKDKTMTLEK
EYLVNPNNED TKKTSRTEKK VYKNIKIQEN KESYEIIGQL DKKTKKIEFK KXTDEGKRISD
AEGNVYGDFG GK EF034-3 (SEQ ID NO:131) AGAA AACGGCAATC TTTTGAATGC
CAGTTCGTTT CCTTTAATAC TCACCACGAT TATTCAAAAA GAAGAAGACC TAACGAAAGG
TTCAATTTTT TTCAACAAGG ATAAAACCAT GACGCTTGAA AAAGAATATT TAGTTAATCC
CAATAATGAA GACACAAAAA AAACAAGTAC AACAGAAAAA AAGGTATATA AAAATATTAA
AATACAAGAA AATAAAGAGA CCTATGAAAT TATAGGTCAA TTGGACAAAA AAACGAAAAA
AATAGAGTTT AAAAAAGTTG ATGAAGGTAA ACGTATATCT GATGCAGAAG GTAATGTGTA
TGCTGATTTT GGTGGTAAAT AC EF034-4 (SEQ ID NO:132) KEN GNLLNASSFP
LILTTIIEKE EDLTKGSIFF NKDKTNTLEK EYLVNPNNED TKKTSRTEKK VYKNIKIQEN
KESYEIIGQL DKKTKKIEFK KVDECKRISD AEGNVYGDFG GK EF035-1 (SEQ ID
NO:133) TAAACGAGAG GTGAGTTTAT GAAAACAAAA ATCCGAAAAA CAGTTATCTT
GTCAGCATTT TTATTCACAA GTTTCCTTTT ACTGAGTGGT TGTACCTCGG CTGGCGAAGA
GATGGAAAAA ACAATTGATC GACAGAAAGA AAAAGTCGAT AAAACGGTCG ATAAGCAGAA
ACATAAAAAT GAAAATTCCA TGGAAAGTTA CGACGAAAAA GTTGACCGTT CTTTAGATAG
TCAAGAAGAC AAAATCC-ATA CTACTGAGTA A EF035-2 (SEQ ID NO:134) NKTKI
GKTVILSAFL FTSFLLLSGC TSAGEEMEKT IDRQKEKVDK TVDKQKHKNE NSMESYDEKV
DRSLDSQEDK IDTTE EF035-3 (SEQ ID NO:135) GATGGAAAAA ACAATTGATC
GACAGAAAGA AAAAGTCGAT AAAACGGTCG ATAAGCAGAA ACATAAAAAT GAAAATTCCA
TGGAAAGTTA CGACGAAAAA GTTGACCGTT CTTTAGATAG TCAAGAAGAC AAAATCGATA
CTACTGAG EF035-4 (SEQ ID NO:136) MEKT IDRQKEKVDK TVDKQKHKNE
NSMESYDEKV DRSLDSQEDK IDTTE EF036-1 (SEQ ID NO:137) TAATTTTCAA
GTCCTACATA TAATGGTAAA ATAGAATGGA TTGAAATTAA TTGGAGGAAT AATGAATCGA
TGAAAAAAAG ATTGCTATTA TTTATTGGTT TGGCAAGTAT ACTTACTTTG ACAGGATGTG
CAAAATGGAT TGATCGTGGT GAATCCATCA CAGCGGTAGG CTCATCAGCT TTACAACCAT
TAGTAGAGAC AGCGAGTGAG GAATATCAAA GCCAAAATCC GGGAAGATTT ATTAATGTCC
AAGGTGGCGG AAGCGGAACA GGTCTGAGTC AAGTCCAATC TGGCGCGGTA GACATTGGTA
ATTCTGATTT ATTTGCAGAA GAGAAAAAGG GCATCAAAGC GGAAGACTTA ATTGATCATA
AAGTTGCTGT CGTTGGGATT ACACCAATCG TTAACAAAAA TGTCGCTGTC AAAGATATCT
CAATGGAAAA TTTAAAGAAA ATCTTTTTAG GTGAAGTAAC AAACTGGAAA GAACTTCGCG
GGAAAGACCA AAAAATTGTT ATTTTGAATA GAGCCGCCGG TAGTCGTACG CGTGCGACTT
TTGAAAAGTG GGTCTTGGGA GATAAAACAG CCATTCGTGC GCAAGAACAA GATTCCAGCG
GCATGGTTCG TTCCATTGTT TCTGATACAC CAGGAGCGAT TAGTTATACC GCATTTTCAT
ATGTTACTGA TGAAGTAGCT ACGTTAAGTA TTGATGGTGT TCAGCCAACA GATGAAAATG
TAATGAACAA TAAATGGATT ATTTGGTCTT ATGAACACAT GTACACTCGT AAAAATCCAA
GTGATTTAAC CAAAGAGTTT TTAGACTTTA TGTTGTCAGA TGATATCCAA GAACGTGTGA
TTGGTCAATT AGGGTATATT CCTGTTTCGA AAATGGAAAT TGAACGGGAT TGGCAAGGAA
ATGTCATTAA ATAA EF-36-2 (SEQ ID NO:138) MKKRLLLF IGLASILTLT
GCAKWIDRGE SITAVGSSAL QPLVETASEE YQSQNPGRFI NVQGGGSGTG LSQVQSGAVD
IGNSDLFAEE KKGIKAEDLI DHKVAVVGIT PTVNKNVGVK DISMENLKKI FLGEVTNWKE
LGGKDQKTVT LNRAAGSGTR ATFEKWVLGD KTAIRAQEQD SSGMVRSIVS DTPGAISYTA
FSYVTDEVAT LSIDGVQPTD ENVMNNKWII WSYEHMYTRK NPSDLTKEFL DFMLSDDIQE
RVIGQLGYIP VSKNEIERDW QGNVIK EF036-3 (SEQ ID NO:139) GAT TGATCGTGGT
GAATCCATCA CAGCGGTAGG CTCATCAGCT TTACAACCAT TACTAGAGAC AGCGAGTGAG
GAATATCAAA GCCAAAATCC GGGAAGATTT ATTAATGTCC AAGGTGGCGG AAGCGGAACA
GGTCTGAGTC AAGTCCAATC TGGCGCGGTA GACATTGGTA ATTCTGATTT ATTTGCAGAA
GAGAAAAAGG GCATCAAAGC GGAAGACTTA ATTGATCATA AAGTTGCTGT CGTTGGGATT
ACACCAATCG TTAACAAAAA TGTCGGTGTC AAAGATATCT CAATGGAAAA TTTAAAGAAA
ATCTTTTTAG GTGAAGTAAC AAACTGGAAA GAACTTGGCG GGAAAGACCA AAAAATTGTT
ATTTTGAATA GAGCGGCCGG TAGTGGTACG CGTGCGACTT TTGAAAAGTG GGTCTTGGGA
GATAAAACAG CCATTCGTGC GCAAGAACAA GATTCCAGCG GCATGGTTCG TTCCATTGTT
TCTGATACAC CAGGAGCGAT TAGTTATACC GCATTTTCAT ATCTTACTGA TGAAGTAGCT
ACGTTAAGTA TTGATCGTGT TCAGCCAACA GATGAAAATG TAATGAACAA TAAATGGATT
ATTTGGTCTT ATGAACACAT GTACACTCGT AAAAATCCAA GTGATTTAAC CAAAGAGTTT
TTAGACTTTA TGTTGTCAGA TGATATCCAA GAACGTGTGA TTGGTCAATT AGGGTATATT
CCTGTTTCGA AAATGGAAAT TGAACGGGAT TGGCAAGGAA ATGTCATTAA A EF036-4
(SEQ ID NO:140) IDEGE SITAVGSSAL QPLVETASEE YQSQNPGRFT NVQGGGSGTG
LSQVQSGAVD IGNSDLFAEE KKGIKAEDLI DHKVAVVGIT PTVNKNVGVK DISMENLKKI
FLGEVTNWKE LGGKDQKTVI LNRAACSGTR ATFEKWVLGD KTATEAQEQD SSGMVRSIVS
DTPGAISYTA FSYVTDEVAT LSIDGVQPTD ENVHNNKWTI WSYEHMYTRK NPSDLTKEFL
DFMLSDDIQE RVIGQLGYIP VSKMEIERDW QGNVIK EF037-1 (SEQ ID NO:141)
TGAGTGTATG ATTACTCATT TCCCTTTGAA TCAGTTATGA TAAAGGAAGA AATAAATAAA
TTTTTTGGAG GGATTTTCAT GAAAATGTCT AAAGTACTCA CCACTGTTTT GACGGCAACT
GCTGCTCTTG TGTTGCTTAG TGCTTGTTCA TCTGATAAAA AAACAGATAG TAGTTCTAGT
ACCAAAGAAA CAGCTAATTC AAGTACAGAA GTAGTCTCTG GTGCTTCAAT TAGTGCCAAG
CCTGAAGAGC TCCAAATGGC GTTAAGTGAT AAAGGAAATT GGATTGTCGC AGCTACTGAC
AATGTCACTT TTGATAAAGA GGTAACAGTT GCTGGTACTT TCCATCATAA GGGGAAAGAT
TCCAACGATG TCTATCGTAA ATTAGCACTT TATTCCCAAG ATGATAATAA AAAAGTAACT
GCTGAATATG AAATCACGGT TCCTAAGCTA ATCGTTTCTT CTGAAAATTT CAACATCGTT
CACGGGACTG TCAAAGGTGA TATTGAGGTG AAAGCAAATG GCTTTACTTT AAATGGTACC
AAAGTTAATG GCAATATTAC TTTTCATAAA CAAGAATACA AAGATTCTGC TGACTTAGAA
AAAGATGGTG CCACTGTTAC TGGTCAAGTC ACCGTAGCCA ATAA EF037-2 (SEQ ID
NO:142) MKMSK VLTTVLTATA ALVLLSACSS DKKTDSSSSS KETANSSTEV
VSGASISAKP EELEMALSDK CNWIVAATDN VTFDKEVTVA GTFHDKGKDS NDVYRKLALY
SQDDNKKVTA EYEITVPKLI VSSENFNTVH GTVKGDIEVK ANGFTLNGTK VNGNITFDKQ
EYKDSADLEK DGATVTGEVT VANN EF037-3 (SEQ ID NO:143) AACAGATAG
TAGTTCTAGT AGCAAAGAAA CAGCTAATTC AAGTACAGAA GTAGTCTCTG CTGCTTCAAT
TAGTCCCAAG CCTGAAGAGC TCGAAATGGC GTTAAGTGAT AAAGGAAATT GGATTGTCGC
AGCTACTGAC AATGTCACTT TTGATAAAGA GCTAACAGTT GCTGGTACTT TCCATGATAA
GGGGAAAGAT TCCAACGATG TCTATCGTAA ATTAGCACTT TATTCCCAAG ATGATAATAA
AAAAGTAACT GCTGAATATG AAATCACGGT TCCTAAGCTA ATCGTTTCTT CTGAAAATTT
CAACATCGTT CACGGGACTG TCAAAGGTGA TATTGAGGTG AAAGCAAATG GCTTTACTTT
AAATGGTACC AAACTTAATC CCAATATTAC TTTTCATAAA CAAGAATACA AAGATTCTCC
TGACTTAGAA AAAGATGGTG CCACTGTTAC TGGTGAAGTC ACCGTAGCCA A EF037-4
(SEQ ID NO:144) TDSSSSS KETANSSTEV VSGASISAKP EELEMALSDK GNWIVAATDN
VTFDKEVTVA GTFHDKGKDS NDVYRKLALY SQDDNKKVTA EYEITVPKLI VSSENFNIVH
GTVKGDIEVK ANGFTLNGTK VNGNITFDKQ EYKDSADLEK DGAPVTGEVT VANN EF038-1
(SEQ ID NO:145) TAATGGCCAT TTCGTCTACT AATAAAGAGG ATGAAGCTAC
TCAAATGGCG TTGGCAATGG AACAAGGATC
ATAAAAAAGG AGAAGTGAGC ATGAAAAAAG TACTACCTTT TATTGCCTTA GTCGGCTTGT
TATTGTTGTC AGGTTGTGGA ACAGATATGA AAAAGATATT GACTGCCGAT GGTGGTAAAT
GGGAACTAGA AAATAAAAGT CCAACTACTA CTTACACTTT TTTTGATGAT GAAACTTTTT
CGAGGTATAA TTCAAAAATT AGTGATAGTG GAACCTACTC TTACGATGAA AATAATAAAA
AACTCACTTT GGATATAAAA AATAAAGAAC AATTAATAAT GGAAAATGTT GAATATAAAG
ACGGTAAATT AAAAGGTGAA ATTGGACCCG AGAAGGACTC TGATAAAAAA TNGAATAAGA
GGTGTCTTTG A EF038-2 (SEQ ID NO:146) M KLLKWRWQWN KDHKKGEVSM
KKVLPFIALV GLLLLSGCGT DMKKILTADG GKWELENKSP TTTYTFFDDE TFSRYNSKIS
DSGTYSYDEN NKKLTLDIKN KEQLTMENVE YKDGKLKGEI GGEKDSDKKX NKRCL
EF038-3 (SEQ ID NO:147) TTGTGGA ACAGATATGA AAAAGATATT GACTGCCGAT
GGTGGTAAAT GGGAACTAGA AAATAAAAGT CCAACTACTA CTTACACTTT TTTTGATGAT
GAAACTTTTT CGAGGTATAA TTCAAAAATT AGTGATAGTG GAACGTACTC TTACGATGAA
AATAATAAAA AACTCACTTT GGATATAAAA AATAAAGAAC AATTAATAAT GGAAAATGTT
CAATATAAAG ACGGTAAATT AAAAGGTGAA ATTGGAGGCG AGAAGGACTC TGATAAAAAA
TNGAATAAGA GGTGTCTTTG A EF038-4 (SEQ ID NO:148) CGT DMKKILTADG
GKWELENKSP TTTYTFFDDE TFSRYNSKIS DSGTYSYDEN NKKLTLDIKN KEQLIMENVE
YKDGKLKGEI GGEKDSDKKX NKRCL EF039-l (SEQ ID NO:149) TAAATATATC
AAAAAGAAAA AAGGGGATTA CCAACCATGA AAAAGAAAAA AGTTTTTAGT CCGCTTACCT
TATTAACCTT TAGTACGTTG TTCATTGCAG GCTGTGCTGG CGGAGCCAAC TCTGCAACAG
ATAAATCAAG TGCAGCTAGC TCAAGCACTG CAGTCTCTAG TTCAGCAGAA GCACCTAAAG
AGCAATCAAA AGGACAAGAA TTAACAGAAA TTTTATCCAG TACTGATTGG CAAGGCACAA
AAGTTTACGA CAAAAATNAT AATAATTTAA CAGCAGAAAA TGCTAATTTT ATTCGTTTAG
CAAAATATGA TGGTGAAACA GGTTTTTATG AATTTTTCGA CAAAGAAACA GGTGAAACCC
GTGGCGATGA AGGCACATTC TTTGTGACAG ACGATGGCGA AAAGCGTATC TTAATTTCGC
ATACACAAAA CTATCAAGCG GTGGTCGATT TAACGGAAGT GACGAAAGAT AAATTTACCT
ATAAGCGAAT GGGTAAAGAT AAACACGGGA AAGATGTAGA AGTCTTTGTA GAACATATCC
CTTATTCTGA CGAGAAATTA ACCTTTACGA ACGGCCGTAA AGATTTAGAA ACAGAAACTG
GCAAGATTGT TACCAATGAA CCTGGGGATG ACATTTTAGG GGCCACATTA TGGAATGGCA
CGAAAGTTTT AGATGAAGAC GGTAACGATG TTACTGAAGC AAATAAAATG TTTATTAGTT
TAGCGAAATT TGATAATAAA ACAAGTAAAT ATGAATTCTT TGATTTAGAA ACGGGTAAAA
CACGTGGAGA TTTTGGTTAC TTCCAAGTAA TTGATAATAA CAAAATCCGT GCTCACGTTT
CAATTGGTGA CAATAAATAT GGAGCTGCAT TAGAATTAAC AGAATTAAAT GATAAACGTT
TTACGTATAC ACGAATGCGT AAAGACAACA ATGGCAAAGA AATTAAAGTC TTTGTAGAAC
ATGAACCATA TGAAGGAGAC TTTACGCCAG ACTTCACGTT CTAA EF039-2 (SEQ ID
NO:150) MKKKKVFSA LTLLTFSTLL IAGCAGGANS ATDKSSAASS STAVSSSAEA
AKEQSKGQEL TEILSSTDWQ GTKVYDKNXN NLTAENANFI GLAKYDGETG FYEFFDKETG
ETRGDEGTFF VTDDGEKRIL ISDTQNYQAV VDLTEVTKDK FTYKRMGKDK DGKDVEVPVE
HIPYSDEKLT FTNGRKDLET ETGKIVTNEP GDDILGATLW NGTKVLDEDG NDVTEANKMF
ISLAKFDNKT SKYEFFDLET GKTRGDFGYF QVIDNNKIRA HVSIGDNKYG AALELTELND
KRFTYTRMGK DNNGKEIKVF VEHEPYEGDF TPDFTF EF039-3 (SEQ ID NO:151)
TGCAACAG ATAAATCAAG TGCAGCTAGC TCAAGCACTG CAGTCTCTAG TTCAGCAGAA
GCAGCTAAAG AGCAATCAAA AGGACAAGAA TTAACAGAAA TTTTATCCAG TACTGATTGG
CAAGGCACAA AAGTTTACGA CAAAAATNAT AATAATTTAA CAGCAGAAAA TGCTAATTTT
ATTGGTTTAG CAAAATATGA TGGTGAAACA GGTTTTTATG AATTTTTCGA CAAAGAAACA
GCTGAAACCC GTGGCGATGA AGGCACATTC TTTGTGACAG ACGATGGCGA AAAGCGTATC
TTAATTTCGG ATACACAAAA CTATCAAGCG GTGGTCGATT TAACGGAAGT GACGAAAGAT
AAATTTACCT ATAAGCGAAT GGGTAAAGAT AAAGACGGGA AAGATGTAGA AGTCTTTGTA
GAACATATCC CTTATTCTGA CGAGAAATTA ACCTTTACGA ACGGCCGTAA AGATTTAGAA
ACAGAAACTG GCAAGATTGT TACCAATGAA CCTGGGGATG ACATTTTAGG GGCCACATTA
TGGAATGGCA CGAAAGTTTT AGATGAAGAC GGTAACGATG TTACTGAAGC AAATAAAATG
TTTATTAGTT TAGCGAAATT TGATAATAAA ACAAGTAAAT ATGAATTCTT TGATTTAGAA
ACGGGTAAAA CACGTGGAGA TTTTGGTTAC TTCCAAGTAA TTGATAATAA CAAAATCCGT
GCTCACGTTT CAATTGGTGA CAATAAATAT GGAGCTGCAT TAGAATTAAC AGAATTAAAT
GATAAACGTT TTACGTATAC ACGAATGGGT AAAGACAACA ATGGCAAAGA AATTAAAGTC
TTTGTACAAC ATGAACCATA TGAAGGAGAC TTTACGCCAG ACTTCACGTT CTAA EF039-4
(SEQ ID NO:152) ATDKSSAASS STAVSSSAEA AKEQSKGQEL TEILSSTDWQ
GTKVYDKNXN NLTAENANFI GLAKYDGETG FYEFFDKETG ETRGDEGTFF VTDDGEKRIL
ISDTQNYQAV VDLTEVTKDK FTYKRMGKDK DGKDVEVFVE HIPYSDEKLT FTNGRKDLET
ETGKIVTNEP GDDILGATLW NGTKVLDEDG NDVTEANKMF ISLAKFDNKT SKYEFFDLET
GKTRGDFGYF QVIDNNKIRA HVSIGDNKYG AALELTELND KRFTYTRMGK DNNGKEIKVF
VEHEPYEGDF TPDFTF EF040-1 (SEQ ID NO:153) TAGATTAGAA CCACTGGAGA
AAAATCTCAT ATTTCTCTCG AGGAAAGGAA GTTGAGCACA ATGAACAAAA AAATTTTAAT
GGGGCTATTA AGTGTCGTGA CCATTCCATT ACTTGCTGCG TGTCAAGGAG GAGAAACACC
TTCCGCAGCG TCAAAAAATA GTCAAACGGT GACTACTCAA AGTAGTGCAA AAACTGAAAG
CACCAGTACA ACCCGTTCGG TAGCTCAAAC AACATCAAAA GAGGAAGTGA AAGAACCGAT
GAAGACCTAT GAAGTGGGTG CGCTTTTAGA AGCAGCCAAT CAACGAGATA CGAAGAAGGT
CAAGGAAATT TTACAAGATA CTACTTATCA AGTGGATGAA GTCGACACAG AAGGCAACAC
ACCGCTCAAT ATCGCTGTTC ACAATAATGA CATTGAGATT GCAAAAGCGT TGATTGATCG
GGGTGCCGAT ATTAATCTGC AAAACAGCAT TAGTGATAGT CCCTATCTTT ATGCGGGAGC
GCAAGGACGT ACGGAGATTT TAGCGTATAT GTTAAAACAT GCGACCCCAG ATTTAAATAA
CCATAACCGT TACGGTGGCA ATGCGTTAAT TCCGGCAGCT GAAAAAGGAC ATATTGACAA
TGTGAAGCTC TTGTTAGAAG ATGGACGAGA AGACATAGAT TTCCAAAATG ACTTTGGCTA
TACAGCATTG ATTGAGGCAG TGGGGTTACG TGAAGGGAAC CAACTTTACC AAGATATTGT
AAAATTGTTA ATGGAAAATG GTGCGGATCA ATCCATTAAA GACAATTCTG GTCGAACAGC
AATGGACTAT GCCAATCAAA AAGGTTATAC GGAAATTAGT AAAATTTTAG CACAGTACAA
CTAA EF040-2 (SEQ ID NO:154) M NKKILMGLLS VVTIPLLAAC QGGETPSAAS
KNSQTVTTQS SAKTESTSTT RSVAQTTSKE EVKEPMKTYE VGALLEAANQ RDTKKVKEIL
QDTTYQVDEV DTEGNTPLNI AVHNNDIEIA KALTDRGADI NLQNSISDSP YLYAGAQGRT
EILAYMLKHA TPDLNKHNRY GGNALIPAAE KGHIDNVKLL LEDGREDIDF QNDFGYTALI
EAVGLREGNQ LYQDIVKLLM ENGADQSIKD NSGRTAMDYA NQKGYTEISK ILAQYN
EF040-3 (SEQ ID NO:155) AGCG TCAAAAAATA GTCAAACGGT GACTACTCAA
AGTAGTGCAA AAACTGAAAG CACCAGTACA ACCCGTTCGG TAGCTCAAAC AACATCAAAA
GAGGAAGTGA AAGAACCGAT GAAGACCTAT GAAGTGGGTG CGCTTTTAGA AGCAGCCAAT
CAACGAGATA CGAAGAAGGT CAAGGAAATT TTACAAGATA CTACTTATCA AGTGGATGAA
GTCGACACAG AAGGCAACAC ACCGCTCAAT ATCGCTGTTC ACAATAATGA CATTGAGATT
GCAAAAGCGT TGATTGATCG GGGTGCCGAT ATTAATCTGC AAAACAGCAT TAGTGATAGT
CCCTATCTTT ATGCGGGAGC GCAAGGACGT ACGGAGATTT TAGCGTATAT GTTAAAACAT
GCGACCCCAG ATTTAAATAA GCATAACCGT TACGGTGGCA ATGCGTTAAT TCCGGCAGCT
GAAAAAGGAC ATATTGACAA TGTGAAGCTC TTGTTAGAAG ATGGACGAGA AGACATAGAT
TTCCAAAATG ACTTTGGCTA TACAGCATTG ATTGAGGCAG TGGGGTTACG TGAAGGGAAC
CAACTTTACC AAGATATTGT AAAATTGTTA ATGGAAAATG GTGCGGATCA ATCCATTAAA
GACAATTCTG GTCGAACAGC AATGGACTAT GCCAATCAAA AAGGTTATAC GGAAATTAGT
AAAATTTTAG CACAGTACAA C EF040-4 (SEQ ID NO:156) AS KNSQTVTTQS
SAKTESTSTT RSVAQTTSKE EVKEPMKTYE VGALLEAANQ RDTKKVKEIL QDTTYQVDEV
DTEGNTPLNT AVHNIWIEIA KALTDRGADI NLQNSISDSP YLYAGAQGRT ETLAYMLKHA
TPDLNKHNRY GGNALIPAAE KGHIDNVKLL LEDGREDIDF QNDFGYTALI EAVGLREGNQ
LYQDIVKLLM ENGADQSIKD NSGRTAMDYA NQKGYTEISK ILAQYN EF041-1 (SEQ ID
NO:157) TAATTATTAA NTTCTGATTT TTCAGAAAAT ACAGATTGCA TTATTTTAGG
AGGCAACACT ATGAAATTGA AAAAGTCATT AACATTCGGT GTGATTACAT TATTTAGCGT
AACAACTTTA GCCGCTTGTG GAGGCGGCGG AACGTCAGAT AGCTCAAGCG CGTCTGGTGG
CGGTAAGGCA AGTGGCGAAC AAGTTTTACG TGTCACAGAA CAACAAGAAA TGCCAACAGC
TGATTTATCA CTAGCAACAG NCAGAATTAG TTTTATTGCA TTAAATAATG TATATGAAGG
AATTTATCGT TTAGACAAAG ATAACAAAGT CCAACCTGCA GGTGCAGCGG AAAAAGCAGA
AGTTTCTGAA GATGGACTAA CATACAAAAT TAAATTAAAT AAAGATGCAA AATGGTCAGA
CGGTAAACCA GTGACTGCTA ATGACTATGT TTACGGATGG CAACGAACAG TTGATCCAGC
GACAGCTTCT GAATATGCTT ATCTGTATGC CTCTGTAAAA AATGGTGATG CCATTGCTAA
AGGGGAAAAA GATAAATCAG AATTAGGAAT TAAAGCAGTC AGTGATACAG AATTAGAAAT
CACTTTAGAA AAAGCAACAC CATACTTTGA TTACTTATTA GCTTTCCCAT CATTCTTCCC
GCAACGTCAA GACATTGTGG AAAAATATGG TAAAAATTAT GCATCAAACA GCGAAAGTGC
TGTCTACAAT GGTCCATTCG TCTTAGACGG CTTTGATGGT CCTGGTACAG ATACAAAATG
GTCATTCAAG AAAAACGATC AATATTGGGA TAAAGATACT GTGAAACTGG ACTCAGTAGA
TGTGAATGTC GTGAAAGAAT CACCAACCGC GTTGAACTTG TTCCAAGATG GACAAACAGA
CGATGTCGTT CTTTCTGGTG AATTAGCCCA ACAAATGGCC AATGACCCAG CTTTTGTTAG
TCAAAAAGAA GCATCAACAC AATATATGGA ACTAAATCAA CGTGATGAAA AATCACCATT
TAGAAATGCG AACTTACGTA AAGCAATTTC TTACTCAATC GACCGTAAAG CGTTAGTTGA
ATCAATCCTT AGGCGATGG EF041-2 (SEQ ID NO:158) M KLKKSLTFGV
ITLFSVTTLA ACGGGGTSDS SSASGGGKAS GEQVLRVTEQ QEMPTADLSL ATXRISFIAL
NNVYEGIYRL DKDNKVQPAG AAEKAEVSED GLTYKIKLNK DAKWSDGKPV TANDYVYGWQ
RTVDPATASE YAYLYASVKN GDAIAKGEKD KSELGIKAVS DTELEITLEK ATPYFDYLLA
FPSFFPQRQD IVEKYGKNYA ENSESAVYNG PFVLDGFDGP GTDTKWSFKK NDQYWDKDTV
KLDSVDVNVV KESPTALNLF QDGQTDDVVL SGELAQQMAN DPAFVSQKEA STQYMELNQR
DEKSPFRNAN LRKAISYSID RKALVESILR GW EF041-3 (SEQ ID NO:159) TTGTG
GAGGCGGCGG AACCTCAGAT AGCTCAAGCG CGTCTGGTGG CGGTAAGGCA AGTGGCGAAC
AAGTTTTACG TGTCACAGAA CAACAAGAAA TGCCAACAGC TGATTTATCA CTAGCAACAG
NCAGAATTAG TTTTATTGCA TTAAATAATG TATATGAAGG AATTTATCGT TTAGACAAAG
ATAACAAAGT CCAACCTGCA GGTGCAGCGG AAAAAGCAGA AGTTTCTGAA GATGGACTAA
CATACAAAAT TAAATTAAAT AAAGATGCAA AATGGTCAGA CGGTAAACCA GTGACTGCTA
ATGACTATGT TTACGGATGG CAACGAACAG TTGATCCAGC GACAGCTTCT GAATATGCTT
ATCTGTATGC CTCTGTAAAA AATGGTGATG CCATTGCTAA AGGGGAAAAA GATAAATCAG
AATTAGGAAT TAAAGCAGTC AGTCATACAG AATTAGAAAT CACTTTAGAA AAAGCAACAC
CATACTTTGA TTACTTATTA GCTTTCCCAT CATTCTTCCC GCAACGTCAA GACATTGTGG
AAAAATATGG TAAAAATTAT GCATCAAACA GCGAAAGTGC TGTCTACAAT GGTCCATTCG
TCTTAGACGG CTTTGATGGT CCTGGTACAG ATACAAAATG GTCATTCAAG AAAAACCATC
AATATTGGGA TAAAGATACT GTGAAACTGG ACTCAGTAGA TGTGAATGTC GTGAAAGAAT
CACCAACCGC GTTGAACTTG TTCCAAGATG GACAAACAGA CGATGTCGTT CTTTCTGGTG
AATTAGCCCA ACAAATGGCC AATGACCCAG CTTTTGTTAG TCAAAAAGAA GCATCAACAC
AATATATGGA ACTAAATCAA CGTGATGAAA AATCACCATT TAGAAATGCG AACTTACGTA
AAGCAATTTC TTACTCAATC GACCGTAAAG CGTTAGTTGA ATCAATCCTT AGGGGATGG
EF041-4 (SEQ ID NO:160) CGCGGTSDS SSASGGGKAS GEQVLRVTEQ QEMPTADLSL
ATXRISFIAL NNVYEGIYRL DKDNKVQPAG AAEKAEVSED GLTYKTKLNK DAKWSDGKPV
TANDYVYGWQ RTVDPATASE YAYLYASVKN GDAIAKGEKD KSELGIKAVS DTELEITLEK
ATPYFDYLLA FPSFFPQRQD IVEKYGKNYA SNSESAVYNG PFVLDGFDGP GTDTKWSFKK
NDQYWDKDTV KLDSVDVNVV KESPTALNLF QDGQTDDVVL SGELAQQMAN DPAFVSQKEA
STQYMELNQR DEKSPFRNAN LRKAISYSTD RKALVESILR OW EF044-1 (SEQ ID
NO:161) TAAGATAAAA TTAGTTATAG CGTCTATAGG AGGAATAGTA TGAAAAAATT
AGTTTGTGTT ATTTTAGTTA TTTTTTTAAC AGGTTGTAGT TCTCAAAAAG CGAATGAACC
TAAAAAACAA GAAAATTCTA CCAATCATAC AACATCAATA AAAAGCAGTA CTAATCATTA
CAGTTCTAGC ATAGAAACAA GCTCTAATAA TAAACTAAAA GAAACTTCAG AAAGTGCCAG
CACCACTCAA ACTTCGTCAA AGTCGAAAAA TGAAGTATCT ACAAATGTCG AAGAAGCAAA
TTCTTTAGAA GCAACACCTT ATGCTGTCGA TCTTAGTAGC TTAAACAATC CACTCGTATT
TAATTTTAAA GGAATGAATG TGCCAACTTC AATTACGTTA GAGAACTTAA ATTCAACACC
AACTGCTACC TTCCGAACTA AATTGTTTGG GGCTGAAAAT GGTCAAGTGA AAGAAGCCAT
TAATAAATAT GAGCTATCTA TAAATACAAT TCCTACAAAA GAGATTAGAA TATTTTCAGC
GGCCGATAAC AGTATTCGCA CCGTTAAAGT AAATACAGAA TTAATTTTAG GAACTAATAT
TTCTTCAAAC GATGAACAAA ATAGATCGGG CACTTTATAC TTATTCAACA ATAAAAATGG
TTCGATATCT TTAATCACTC CTAACTACGC TGGCAATGTT ACGGATGATC AAAAAGACGT
TATGCTAGAA GTAATTCAAT AA EF044-2 (SEQ ID NO:162) MKKLVCVI
LVIFLTGCSS QKANEPKKQE NSTNHTTSIK SSTNHYSSSI ETSSNNKLKE TSESASTTQT
SSKSKNEVST NVEEANSLEA TPYAVDLSSL NNPLVFNFKG MNVPTSITLE NLNSTPTATF
RTKLFGAENG QVKEAINKYE LSTNTIPTKE IRIFSAADNS IRTVKVNTEL ILGTNTSSND
EQNRSGTLYL FNNKNGSISL ITPNYAGNVT DDQKDVMLEV IQ EF044-3 (SEQ ID
NO:163) TTGTAGT TCTCAAAAAC CGAATGAACC TAAAAAACAA GAAAATTCTA
CCAATCATAC AACATCAATA AAAAGCAGTA CTAATCATTA CAGTTCTAGC ATAGAAACAA
GCTCTAATAA TAAACTAAAA GAAACTTCAG AAAGTGCCAG CACCACTCAA ACTTCGTCAA
AGTCGAAAAA TGAAGTATCT ACAAATGTCG AAGAAGCAAA TTCTTTAGAA GCAACACCTT
ATGCTGTCGA TCTTAGTAGC TTAAACAATC CACTCGTATT TAATTTTAAA GGAATGAATG
TGCCAACTTC AATTACGTTA GAGAACTTAA ATTCAACACC AACTGCTACC TTCCGAACTA
AATTGTTTGG GGCTGAAAAT GGTCAAGTGA AAGAAGCCAT TAATAAATAT GAGCTATCTA
TAAATACAAT TCCTACAAAA GAGATTAGAA TATTTTCAGC GGCCGATAAC AGTATTCGCA
CCGTTAAAGT AAATACAGAA TTAATTTTAG GAACTAATAT TTCTTCAAAC GATGAACAAA
ATAGATCGGG CACTTTATAC TTATTCAACA ATAAAAATGG TTCGATATCT TTAATCACTC
CTAACTACGC TGGCAATGTT ACGGATGATC AAAAAGACGT TATGCTAGAA GTAATTCAA
EF044-4 (SEQ ID NO:164) CSS QKANEPKKQE NSTNHTTSIK SSTNHYSSSI
ETSSNNKLKE TSESASTTQT SSKSKNEVST NVEEANSLEA TPYAVDLSSL NNPLVFNFKG
MNVPTSITLE NLNSTPTATF RTKLFGAENG QVKEAINKYE LSINTIPTKE IRIFSAADNS
TRTVKVNTEL ILGTNISSND EQNRSGTLYL FNNKNGSISL ITPNYAGNVT DDQKDVMLEV
IQ EF045-1 (SEQ ID NO:165) TAGCCAAAAA ATGAGGCAGG AAAAGAGATG
AACAAGAAAC GGATTTTAGG TGCAATCACG TTAGCTTCTG TGTTAGTATT CGGGTTAGCT
GCATGTGGTG GCGGCAATAA AGGCGGGGGC AATAAAGCAA CGGAAACAGA AGACATTTCA
AAAATGCCAA TCGCTGTTAA AAATGATAAA AAAGCAATTG ATGGCGGTAC ATTAGATGTC
GCTGTAGTTA TGGATACACA ATTCCAAGGA CTTTTCCAGC AACAATTTTA TCAAGACAAC
TATGATGCAC AATACATGCT TCCAACGGTA CAGCCATTAT TTAACAATGA TGCAGACTTT
AAGATTGTCG ATGGGGGTCC TGCGGATCTG AAATTAGATG AAGATCCCAA TACAGCAACC
ATTAAATTAC GTGACAATTT GAAATGGTCT GACGGTAAAG ATGTGACAGC CGATGACCTG
ATTTTCTCTT ATGAAGTCAT TGGTCATAAA GACTATACAG GGATTCGTTA TGATGATAAC
TTTACGAATA TTGTTGGCAT GGAAGACTAC CATGATGGTA AATCGCCAAC CATTTCTGGC
ATAGAAAAAG TCAATGATAA AGAAGTTAAA ATCACTTATA AAGAAGTTCA CCCAGGAATG
CAACAATTAG GTGGCGGTGT TTGGGGCTCA GTTTTACCAA AACATGCCTT TCAAGGAATT
GCTGTTAAAG ACATGGAATC AAGCGATGCA GTTCGTAAAA ACCCTGTGAC TATTGGACCA
TACTACATGA GTAATATTGT GACAGGTGAA TCTGTTGAAT ACCTACCAAA TGAGCATTAC
TACGGTGGTA AACCTAAATT AGATAAATTA GTGTTCAAAT CTGTTCCTTC TGCGAGCATT
GTAGAAGCGA TGAAACCGAA ACAATACGAT ATTGCATTAT CAATGCCAAC AGATACGTAT
CCAACATACA AAGATACTGA AGGGTATCAA ATCTTAGGAC GTCCCGAACA AGCCTACACG
TATATTGGCT TTAAAATGGG TACGTTTGAC AAAGAAACAA ATACAGTGAA ATACAATCCA
AAAGCTAAAA TGGCAGATAA AAGCTTACGT CAAGCCATGG GCTATGCAAT TGACAATGAT
GCAGTCGGCC AAAAATTCTA CAACGGCTTA CGAACAGGGG CAACAACGTT AATCCCACCA
GTCTTCAAGA GCTTGCATGA TAGCGAAGCG AAAGGCTATA CGCTTGATTT AGACAAAGCG
AAAAAATTAT TAGACCATOC TGGTTATAAA GACGTAGACG GCGATGGCAT TCGCGAAGAC
AAAGAAGGCA AACCACTACA AATCAAGTTT CCTTCAATGT CAGGCGCCGA AACTGCACAA
CCACTTGCTC ATTACTATGT CCAACAATGG AAAGAAATTG GCTTAAACCT AACGTATACA
ACAGGACGCT TAATTGATTT CCAAGCATTC TATGATAAAT TCAAAAATGA TGACCCAGAA
GTACATATCT ATCAAGGCGC CTGGGGCACA GGTTCAGATC CTTCACCAAC CGGCTTATAT
GGTCCAAACT CACCCTTTAA CTATACACGT TTTGACTCAG AAGAAAATAC TAAATTACTT
GATGCGATTG ATTCAAAAGC ATCATTTGAT GAACAAAAAC GTAAAAAAGC CTTCTACGAT
TGGCAAGAGT ATGCCATTGA TGAAGCGTTT GTAATCCCAA CGCTTTACAG AAATGAAGTC
TTGCCTGTCA ACGACCGTGT AGTTGACTTT ACTTGGCCAG TTGATACGAA AGATAATCCA
TCCGCAACGG TGCGTGTCAC AGCAGACTCA CGGAAATAA EF045-2 (SEQ ID NO:166)
MN KKRILCAITL ASVLVFGLAA CGGGNKGGGN KATETEDISK MPIAVKNDKK
ATDGGTLDVA VVMDTQFQGL FQQEFYQDNY DAQYNLPTVQ PLFNNDADFK IVDGGPADLK
LDEDANTATI KLRDNLKWSD GKDVTADDVI FSYEVIGHKD YTGIRYDDNF TNIVGMEDYH
DGKSPTTSGI EKVNDKEVKI TYKEVHPGMQ QLGGGVWGSV LPKHAFEGIA VKDMESSDAV
RKNPVTIGPY YMSNTVTGES VEYLPNEHYY GGKPKLDKLV FKSVPSASIV EAMKAKQYDI
ALSMPTDTYP TYKDTEGYQT LGRPEQAYTY IGFKNGTFDK ETNTVKYNPK AKMADKSLRQ
AMGYAIDNDA VCQKFYNGLR TGATTLIPPV FKSLHDSEAK GYTLDLDKAK KLLDDAGYKD
VDGDGIREDK EGKPLEIKFA
SMSGGETAQP LADYYVQQWK EIGLNVTYTT GRLIDFQAFY DKLKNDDPEV DIYQCAWGTG
SDPSPTGLYC PNSAFNYTRF ESEENTKLLD ATDSKASFDE EKRKKAFYDW QEYAIDEAFV
IPTLYRNEVL PVNDRVVDFT WAVDTKDNPW ATVGVTADSR K EF045-3 (SEQ ID
NO:167) ATGTGGTG CCGCCAATAA AGGCGGCCGC AATAAAGCAA CGGAAACAGA
AGACATTTCA AAAATGCCAA TCGCTGTTAA AAATGATAAA AAAGCAATTG ATGGCCGTAC
ATTAGATGTC GCTGTAGTTA TGGATACACA ATTCCAAGGA CTTTTCCAGC AAGAATTTTA
TCAACACAAC TATGATCCAC AATACATGCT TCCAACGGTA CAGCCATTAT TTAACAATGA
TGCAGACTTT AAGATTGTCG ATGGGGGTCC TGCGGATCTG AAATTACATC AAGATCCCAA
TACAGCAACC ATTAAATTAC CTGACAATTT GAAATGGTCT GACCGTAAAG ATGTGACAGC
CGATGACGTG ATTTTCTCTT ATGAAGTCAT TGCTCATAAA GACTATACAG GGATTCGTTA
TGATGATAAC TTTACGAATA TTGTTGGCAT GGAAGACTAC CATGATGGTA AATCGCCAAC
CATTTCTGGC ATAGAAAAAG TCAATGATAA AGAAGTTAAA ATCACTTATA AAGAAGTTCA
CCCAGGAATC CAACAATTAG GTGGCGGTGT TTGGGGCTCA GTTTTACCAA AACATGCCTT
TGAAGGAATT GCTGTTAAAG ACATGCAATC AAGCGATGCA GTTCGTAAAA ACCCTGTGAC
TATTGGACCA TACTACATGA CTAATATTGT GACAGGTGAA TCTGTTGAAT ACCTACCAAA
TGAGCATTAC TACGGTGGTA AACCTAAATT AGATAAATTA GTGTTCAAAT CTGTTCCTTC
TGCGACCATT GTAGAAGCGA TGAAAGCGAA ACAATACGAT ATTGCATTAT CAATGCCAAC
AGATACGTAT CCAACATACA AAGATACTGA AGGGTATCAA ATCTTAGGAC GTCCCGAACA
AGCCTACACG TATATTGGCT TTAAAATGGG TACGTTTCAC AAAGAAACAA ATACAGTGAA
ATACAATCCA AAACCTAAAA TGGCAGATAA AAGCTTACGT CAAGCCATGG GCTATGCAAT
TGACAATGAT GCAGTCGGCC AAAAATTCTA CAACGGCTTA CGAACAGGGG CAACAACGTT
AATCCCACCA GTCTTCAAGA GCTTGCATGA TAGCGAAGCG AAAGGCTATA CGCTTGATTT
AGACAAAGCG AAAAAATTAT TAGACGATGC TGGTTATAAA GACGTAGACG GCGATGGCAT
TCGCGAAGAC AAAGAAGGCA AACCACTAGA AATCAAGTTT GCTTCAATGT CACGCGGCGA
AACTGCACAA CCACTTGCTG ATTACTATGT CCAACAATGG AAAGAAATTG GCTTAAACGT
AACGTATACA ACAGGACGCT TAATTGATTT CCAAGCATTC TATGATAAAT TGAAAAATGA
TGACCCAGAA GTAGATATCT ATCAAGGCGC GTGOCGCACA GGTTCAGATC CTTCACCAAC
CGGCTTATAT GGTCCAAACT CAGCCTTTAA CTATACACGT TTTGAGTCAC AAGAAAATAC
TAAATTACTT GATGCGATTG ATTCAAAAGC ATCATTTGAT GAAGAAAAAC GTAAAAAAGC
CTTCTACGAT TGGCAAGAGT ATGCCATTGA TGAAGCGTTT GTAATCCCAA CGCTTTACAG
AAATGAAGTC TTGCCTGTCA ACGACCGTGT AGTTGACTTT ACTTGGGCAG TTGATACGAA
AGATAATCCA TGGGCAACGG TGGGTGTCAC AGCAGACTCA CGGAAA EF045-4 (SEQ ID
NO:168) CGGGNKCGGN KATETEDISK MPTAVKNDKK AIDGGTLDVA VVMDTQFQGL
FQQEFYQDNY DAQYMLPTVQ PLFNNDADFK IVDGGPADLK LDEDANTATT KLRDNLKWSD
GKDVTADDVT FSYEVIGHKD YTGIRYDDNF TNIVGMEDYH DGKSPTISGI EKVNDKEVKI
TYKEVHPGMQ QLGGGVWGSV LPKHAFEGIA VKDMESSDAV RKNPVTIGPY YMSNIVTGES
VEYLPNEHYY GGKPKLDKLV FKSVPSASIV EANKAKQYDI ALSMPTDTYP TYKDTEGYQI
LGRPEQAYTY IGFKMGTFDK ETNTVKYNPK AKMADKSLRQ AMGYAIDNDA VGQKFYNGLR
TGATTLTPPV FKSLHDSEAK GYTLDLDKAK KLLDDAGYKD VDGDGIREDK EGKPLEIKFA
SMSGGETAQP LADYYVQQWK EIGLNVTYTT GRLIDFQAFY DKLKNDDPEV DIYQGAWGTG
SDPSPTGLYG PNSAFNYTRF ESEENTKLLD AIDSKASFDE EKRKKAFYDW QEYAIDEAFV
IPTLYRNEVL PVNDRVVDFT WAVDTKDNPW ATVGVTADSR K EF046-1 (SEQ ID
NO:169) TAGGAGGATA TAATGAAAAA AAAACTTATT GTACTATTGT TAGCCTTATT
TTTAACGGCA TGTAGTAATA ATACTGGGGG AAAAAATAGC GACGCTTCAT CTACTGAAGT
ATCAACTAAG CAGCAAACTA CCCAGTCTTC TAAAAAAGAT AGTAGTAATC CGGACACAAC
ACCAACTTCT ACATCATCTA TAACAATTGA AACAACCGAG AATTTAAAGA ATAGAGAATT
GAATCCAACA GATGATGTTT CAAAAACTAG ACGACAATTG TATGAACAAG GAATTAACAG
TTCAACAATT ACGGATAAAG AACTAAAGGA ATATATATCA GAGGCTAAAG AACAAAAGAA
AGATGTCATT AATTATATTA AGCAAAAA EF046-2 (SEQ ID NO:170) MKKKLTV
LLLALFLTAC SNNTGGKNSD ASSTEVSTKQ QTTQSSKKDS SNPDTTPTST SSITIETTEN
LKNRELNPTD DVSKTRRQLY EQGINSSTIT DKELKEYISE AKEQKKDVIN YIKQK
EF046-3 (SEQ ID NO:171) A TGTAGTAATA ATACTGGGGG AAAAAATAGC
GACGCTTCAT CTACTGAAGT ATCAACTAAG CAGCAAACTA CCCAGTCTTC TAAAAAAGAT
AGTAGTAATC CGGACACAAC ACCAACTTCT ACATCATCTA TAACAATTGA AACAACCGAG
AATTTAAAGA ATAGAGAATT GAATCCAACA GATGATGTTT CAAAAACTAG ACGACAATTG
TATGAACAAG GAATTAACAG TTCAACAATT ACGGATAAAG AACTAAAGGA ATATATATCA
GAGGCTAAAG AACAAAAGAA AGATGTCATT AATTATATTA AGCAAAAA EF046-4 (SEQ
ID NO:172) C SNNTGGKNSD ASSTEVSTKQ QTTQSSKKDS SNPDTTPTST SSITIETTEN
LKNRELNPTD DVSKTRRQLY EQGINSSTIT DKELKEYISE AKEQKKDVIN YIKQK
EF047-1 (SEQ ID NO:173) TAGGGAAAAC AAGGAGGAAT TCTTATGAAA AAGATAGGGC
TTATTTCTAG TGCTTTTCTT TTAACCCTTG CTTTAGCAGC ATGCGGCGGC GGAAAAAGTA
CAGAAAATAC GGATAGTCGT TCCAGTGCTG CGGAAAGTAC CACAGTCGAG AGTACAAAAG
CATCTGCTAC AAAAGAATCA AGTAGCAAAG CAACAACAAA ATCTAGTGAT GCGAAACCGT
CAGGAACAAC AACAGCTGAT TCGAAAGCAA CAGCTTCTTC TACGAAGGAA GCGGCAAATA
ATGGCTCAGC AGAGAAGCAA TCACCAGCGA AAAATGCGAA TCCAGATGAC CAAGCCAACC
AAGTGCTTAA CCAGCTAGCA AACATGTTTC CTGGTCAAGG CTTACCGCAG GCAATTTTAA
CGAGTCAAAC GAATAACTTT TTAACTGCAG CGACAACTTC ACAAGCGGAT CAAAACAATT
TCCGTGTTTT ATATTATGCA GAAAAAGAAG CGATTCCACT GAATGATGCA CGTGTCAATC
AGTTAACGCC AATTAGTTCT TTTGAGAAAA AAACATATGG CTCTGATGCC GAACCAAAAA
ATGCAGTGAA CCAAATCATT GACAATGGCG GTCAACCAGT AGATTTAGGT TACAATATTA
CTGGGTATAA ACAAGGGGCG GCAGGTTCTA GTTACTTATC TTGGCAAGAA GGCAATTGGA
GTTTAGTCGT ACGGGCCTCA AATATCAATG GTGAATCGCC TGATGATTTA GCGAAAAATG
TTGTCAACAT TTTGGAACAA CAAACATTAC CAGCACCGAA TACCGTTGGT CAAATCACAC
TGAACGTGGC AGGAACCACT GACTATAATC GAAACTCAGT AGTTTGGCAA GCCGGTACAG
TCGTTTACTC TGTCCATCAT TTTGACCCAA TTCAAGCAGT GAAGATGGCA ACATCAATGT
AA EF047-2 (SEQ ID NO:174) MKK IGLISSAFLL TLALAACGGG KSTENTDSRS
SAAESTTVES TKASATKESS SKATTKSSDA KPSGTTTADS KATASSTKEA ANNGSAEKQS
PAKNANPDDQ ANQVLNQLAN MFPGQGLPQA ILTSQTNNFL TAATTSQADQ NNFRVLYYAE
KEAIPVNDAR VNQLTPISSF EKKTYGSDAE AKNAVNQIID NGGQPVDLGY NITGYKQGAA
GSSYLSWQEG NWSLVVRASN INGESPDDLA KNVVNILEQE TLPAPNTVGQ ITLNVAGTTD
YNRNSVVWQA GTVVYSVHHF DPIQAVKMAT SM EF047-3 (SEQ ID NO:175)
ATGCGGCGGC GGAAAAAGTA CACAAAATAC GGATAGTCGT TCCAGTGCTG CGGAAAGTAC
CACAGTCGAG AGTACAAAAG CATCTGCTAC AAAAGAATCA AGTAGCAAAG CAACAACAAA
ATCTACTGAT GCGAAACCGT CAGGAACAAC AACAGCTGAT TCGAAAGCAA CAGCTTCTTC
TACGAAGGAA GCGGCAAATA ATGGCTCAGC AGAGAAGCAA TCACCAGCGA AAAATGCGAA
TCCAGATGAC CAAGCCAACC AAGTGCTTAA CCAGCTAGCA AACATGTTTC CTGGTCAAGG
CTTACCGCAG GCAATTTTAA CGAGTCAAAC GAATAACTTT TTAACTGCAG CGACAACTTC
ACAAGCGGAT CAAAACAATT TCCGTGTTTT ATATTATGCA GAAAAAGAAG CGATTCCAGT
GAATGATGCA CGTGTCAATC AGTTAACGCC AATTAGTTCT TTTGAGAAAA AAACATATGG
CTCTGATGCC GAAGCAAAAA ATGCAGTGAA CCAAATCATT GACAATGGCG GTCAACCAGT
AGATTTAGGT TACAATATTA CTGGGTATAA ACAAGGGGCG GCAGGTTCTA GTTACTTATC
TTGGCAAGAA GGCAATTGGA GTTTAGTCGT ACGGGCCTCA AATATCAATG GTGAATCGCC
TGATGATTTA GCGAAAAATG TTGTCAACAT TTTGGAACAA GAAACATTAC CAGCACCGAA
TACCGTTGGT CAAATCACAC TGAACGTGGC AGGAACCACT GACTATAATC GAAACTCAGT
AGTTTGGCAA GCCGGTACAG TCGTTTACTC TGTCCATCAT TTTGACCCAA TTCAACCAGT
GAAGATGGCA ACATCAATGT AA EF047-4 (SEQ ID NO:176) CGGG KSTENTDSRS
SAAESTTVES TKASATKESS SKATTKSSDA KPSGTTTADS KATASSTKEA ANNGSAEKQS
PAKNANPDDQ ANQVLNQLAN MFPGQGLPQA ILTSQTNNFL TAATTSQADQ NNFRVLYYAE
KEATPVNDAR VNQLTPTSSF EKKTYGSDAE AKNAVNQIID NGGQPVDLCY NITGYKQGAA
GSSYLSWQEG NWSLVVRASN INGESPDDLA KNVVNILEQE TLPAPNTVGQ ITLNVAGTTD
YNRNSVVWQA GTVVYSVHHF DPIQAVKMAT SM EF048-1 (SEQ ID NO:177)
TAAGGAGAAA AGTTCATGAA AAAAAGAAAG GTTTTATTTA CAGCAGTTAT GGTATTGGCA
GGATTACAGT TGCTAAGTGG TTGCGGCAAA ACAGAAGCTT CGGCAAATGA TACGGTAGTC
TTGCGCTATG CGTATGCTAG TAATAGCCAA CCAGTTATCG ATTCTATGAA GAAATTCGGT
GAATTAGTAG AGGAAAAAAC AGATGGTAAA GTTCAAATTG AATATTTTCC AGATGGTCAA
TTAGGAGGAC AAACAGAACT AATTGAATTA ACACAAACAG GTGCAATTCA TTTTGCAAAG
GTCAGTGGAT CAGCATTAGA AAGTTTTTCT AAAGATTATT CTGTATTTGC CATTCCGTAT
ATTTTTGATA ATGAAAAACA TTTTTTTAAA GTAATGCATA ATCAAGCGCT AATGCAACCA
GTGTATGATT CTACAAAAAA ATTAGGATTT GTTGCTTTAA CTTATTATGA CTCTGGTCAA
CGAAGTTTTT ATATGAGCAA AGGGCCTGTT ACATCTCCAG ATGATTTGAA AGGTAAAAAA
ATTCGGCTCA TGCAAACTGA AACCGCCATC AAAATGGTAG AACTTTTAGG GGCTTCGCCA
GTACCTATGG GTAGTTCGGA AGTATATACT TCTCTACAAT CTAATCTAAT CAACGGTGCA
GAGAATAATC AGTTCGTTTT ATATACACCT GGTCATGCTG GTGTGGCTAA GTATTATTCT
TATGATCAGC ATACTCGAGT GCCAGATATT GTGATTATGA ACGAGGGAAC AAAAGAACGT
TTGACAGCGA AACAAGAACA ACCGATTGAA GAAGCAGCAA AACAATCGAC CGCTTTTGAA
AAAACGGTCT TTAAAGAAGC GGTTGAAGAA GAAAAGAAAA AAGCACAAGC AGAATATGGC
GTTGTGTTCA ATCAAGTAGA CAGTGAACCA TTCCAAAAAC TTGTTCAACC CTTGCATCAA
TCATTCAAAA ATAGCTCAGA ACATGGCGAA CTGTATCAGG CTATTCGCCA GTTGGCGGAC
TAA EF048-2 (SEQ ID NO:178) MKKRKV LFTAVMVLAG LQLLSGCGKT EASANDTVVL
RYAYASNSQP VIDSMKKFGE LVEEKTDGKV QIEYFPDGQL GOETELTELT QTGATDFAKV
SGSALESFSK DYSVFAIPYI FDNEKHFFKV MDNQALMQPV YDSTKKLGEV GLTYYDSGQR
SFYMSKGPVT SPDDLKGKKI RVMQSETAIK MVELLGGSPV PMGSSEVYTS LQSNLINGAE
NNEFVLYTAG HGGVAKYYSY DEHTRVPDIV IMNEGTKERL TAKQEQATEE AAKESTAFEK
TVFKEAVEEE KKKAQAEYGV VFNQVDSEPF QKLVQPLHES FKNSSEHGEL YQAIRQLAD
EF04S-3 (SEQ ID NO:179) TTGCGGCAAA ACAGAAGCTT CGGCAAATGA TACGGTAGTC
TTGCGCTATG CGTATGCTAG TAATAGCCAA CCAGTTATCG ATTCTATGAA GAAATTCGGT
GAATTAGTAG AGGAAAAAAC AGATGGTAAA GTTCAAATTG AATATTTTCC AGATGGTCAA
TTAGGACGAC AAACAGAACT AATTGAATTA ACACAAACAG GTGCAATTGA TTTTGCAAAG
GTCAGTGGAT CACCATTAGA AAGTTTTTCT AAAGATTATT CTGTATTTGC CATTCCGTAT
ATTTTTCATA ATGAAAAACA TTTTTTTAAA GTAATGGATA ATCAAGCGCT AATGCAACCA
GTGTATGATT CTACAAAAAA ATTAGGATTT GTTGGTTTAA CTTATTATGA CTCTGGTCAA
CGAAGTTTTT ATATGAGCAA AGGGCCTGTT ACATCTCCAG ATGATTTGAA AGGTAAAAAA
ATTCGGGTCA TGCAAAGTGA AACCGCCATC AAAATGGTAG AACTTTTAGG GGGTTCGCCA
GTACCTATGG GTAGTTCCGA ACTATATACT TCTCTACAAT CTAATCTAAT CAACGGTGCA
GAGAATAATG AGTTCGTTTT ATATACAGCT GGTCATGGTG GTGTGGCTAA GTATTATTCT
TATGATGAGC ATACTCGAGT GCCAGATATT GTGATTATGA ACGAGGGAAC AAAAGAACGT
TTGACAGCGA AACAAGAACA AGCGATTGAA GAAGCAGCAA AAGAATCGAC CGCTTTTGAA
AAAACGGTCT TTAAAGAAGC GGTTGAAGAA GAAAAGAAAA AAGCACAAGC AGAATATGGC
GTTCTGTTCA ATCAAGTAGA CAGTOAACCA TTCCAAAAAC TTGTTCAACC GTTGCATGAA
TCATTCAAAA ATAGCTCAGA ACATGGCGAA CTGTATCAGG CTATTCGCCA GTTGGCGCAC
TAA EF048-4 (SEQ ID NO:180) CGKT EASANDTVVL RYAYASNSQP VIDSMKKFGE
LVEEKTDGKV QIEYFPDGQL OGETELIELT QTGAIDFAKV SCSALESFSK DYSVFAIPYI
FDNEKHFFKV MDNQALMQPV YDSTKKLGFV GLTYYDSGQR SFYMSKGPVT SPDDLKGKKI
RVMQSETAIK MVELLGGSPV PMGSSEVYTS LQSNLINGAE NNEFVLYTAG HCGVAKYYSY
DEHTRVPDIV IMNEGTKERL TAKQEQATEE AAKESTAFEK TVFKEAVEEE KKKAQAEYGV
VFNQVDSEPF QKLVQPLHES FKNSSEHGEL YQAIRQLAD FF049-1 (SEQ ID NO:181)
TGAGACTCTT TCTTTTTCAA AATGAGGTAT GGTATAGTTA TAACAGANAT AAAACTANAA
AAAACAGGAG TGCATAAGAG AATGAAGAAA AAACTAATCT TAGCTGCAGC GGGCGCAATG
GCCGTTTTTA GTTTAGCACC GTGTTCAAGC GCTTCAAAAC ATATCCCAAC AATGAAAGGT
TCAACAATTA CTGTTGATGA TTTTTATAAC CAAATTAAAG AACAAAGCAC TAGCCAACAA
GCGTTTAGCC AAATGGTTAT TTATAAAGTC TTTGAAGAAA AATATGGCGA CAAAGTAACT
GACAAAGANA TTCAAAAAAA CTTTGACGAA GCCAAAGAAC AAGTAGAAGC ACAAGGCGGA
AAGTTCTCTG ATGCATTAAA ACAAGCTCGT TTAACTGAAA AAACATTCAA GAAACAGTTA
AAACAAAGAG CAGCCTATGA TGCAGGTCTA AAAGCCCACT TAAAAATTAC AGATGAAGAC
TTAAAAACAG CTTGGGCAAG TTTCCATCCA CAAGTAGAAG CACAAATTAT CCAAGTTGCT
TCAGAAGATG ATGCCAAAGC TGTCAAGAAA GAAATCACTG ACGGCGGCGA TTTCACAAAA
ATTCCTAAAG AAAAATCAAC AGATACTGCT ACGAAAAAAG ATGGCGGTAA AATTAAATTT
GATTCACAAG CAACAACTGT TCCTCCCGAA GTTAAAGAAG CTGCCTTCAA ATTAAAAGAT
GGCGAAGTGT CAGAACCAAT TGCTGCAACA AATATGCAAA CCTACCAAAC AACCTACTAT
GTAGTCAAAA TGACGAAAAA CAAAGCAAAA GGCAATCACA TGAAACCTTA TGAAAAAGAC
ATCAAGAAAA TTGCTGAAGA AACAAAATTA GCCGATCAAA CATTTGTTTC GAAAGTCATT
AGTGACGAAT TAAAAGCGGC CAATGTGAAA ATTAAAGATG ATGCCTTCAA GAACGCTTTA
GCAGGCTACA TGCAAACTGA ATCTTCAAGC GCTTCTTCAG AGAAAAAAGA ATCAAAATCA
AGTGATTCTA AAACAAGCGA TACCAAAACA AGCGACTCTG AAAAAGCAAC AGATTCTTCA
AGCAAAACAA CAGAATCTTC TTCTAAATAA EF049-2 (SEQ ID NO:182) MKKK
LILAAAGAIMA VFSLAACSSG SKDIATMKGS TITVDDFYNQ IKEQSTSQQA FSQMVIYKVF
EEKYGDKVTD KXTQKNFDEA KEQVEAQGGK FSDALKQAGL TEKTFKKQLK QEAAYDAGLK
AHLKITDEDL KTAWASFHPE VEAQIIQVAS EDDAKAVKKE ITDGGDFTKI AKEKSTDTAT
KKDGGKIKFD SQATTVPAEV KEAAFKLKDG EVSEPTAATN MQTYQTTYYV VKNTKNKAKG
NDMKPYEKEI KKTAEETKLA DQTFVSKVTS DELKAANVKI KDDAFKNALA GYMQTESSSA
SSEKKESKSS DSKTSDTKTS DSEKATDSSS KTTES SSK EF049-3 (SEQ ID NO:183)
GTGTTCAAGC GGTTCAAAAG ATATCGCAAC AATGAAAGGT TCAACAATTA CTGTTGATGA
TTTTTATAAC CAAATTAAAG AACAAAGCAC TAGCCAACAA GCGTTTAGCC AAATGGTTAT
TTATAAAGTC TTTGAAGAAA AATATGGCGA CAAAGTAACT GACAAAGANA TTCAAAAAAA
CTTTGACGAA GCCAAAGAAC AAGTAGAAGC ACAAGGCGGA AAGTTCTCTG ATGCATTAAA
ACAAGCTGGT TTAACTGAAA AAACATTCAA GAAACACTTA AAACAAAGAG CAGCCTATGA
TGCAGGTCTA AAAGCCCACT TAAAAATTAC AGATGAAGAC TTAAAAACAG CTTGGGCAAG
TTTCCATCCA GAAGTAGAAG CACAAATTAT CCAAGTTGCT TCAGAAGATG ATGCCAAAGC
TGTCAAGAAA GAAATCACTG ACGGCGGCGA TTTCACAAAA ATTGCTAAAG AAAAATCAAC
AGATACTGCT ACGAAAAAAG ATGGCGGTAA AATTAAATTT GATTCACAAG CAACAACTGT
TCCTGCCGAA GTTAAAGAAG CTGCCTTCAA ATTAAAAGAT GGCGAAGTGT CAGAACCAAT
TGCTGCAACA AATATGCAAA CCTACCAAAC AACCTACTAT GTAGTGAAAA TGACGAAAAA
CAAAGCAAAA GGCAATGACA TGAAACCTTA TGAAAAAGAG ATCAAGAAAA TTGCTGAAGA
AACAAAATTA GCCGATCAAA CATTTGTTTC GAAAGTCATT AGTGACGAAT TAAAAGCGGC
CAATGTGAAA ATTAAAGATG ATGCCTTCAA GAACGCTTTA GCAGGCTACA TGCAAACTGA
ATCTTCAAGC GCTTCTTCAG AGAAAAAAGA ATCAAAATCA AGTGATTCTA AAACAAGCGA
TACCAAAACA AGCGACTCTG AAAAAGCAAC AGATTCTTCA AGCAAAACAA CAGAATCTTC
TTCTAAATAA EF049-4 (SEQ lED NO:184) CSSG SKDIATMKGS TITVDDFYNQ
IKEQSTSQQA FSQMVIYKVF EEKYGDKVTD KXIQKNFDEA KEQVEAQGGK FSDALKQAGL
TEKTFKKQLK QRAAYDAGLK AHLKITDEDL KTAWASFHPE VEAQIIQVAS EDDAKAVKKE
ITDGGDFTKI AKEKSTDTAT KKDGGKIKFD SQATTVPAEV KEAAFKLKDG EVSEPIAATN
MQTYQTTYYV VKNTKNKAKG NDMKPYEKEI KKIAEETKLA DQTFVSKVIS DELKAANVKI
KDDAFKNALA GYMQTESSSA SSEKKESKSS DSKTSDTKTS DSEKATDSSS KTTESSSK
EF050-1 (SEQ ID NO:185) TAGGGTCTGG AAAAGCAGTC AACTGACTTC TTTTCCAAGC
CCTTTTTTAG TTCATCGCAG AAAGGATGNA AAAAAATGAA CATGCCCAAA AATATCNGTT
ATTTTTCTTT GCTAATGGGT CTTGTTCTAT TATTAAGTGC TTGCCAAATT GGGGCAACTA
CGAAGGATGA CAACCAAGCC GCCACAAAAG AAGCAACTGT TGAGTTAAAC CGCACAACAA
CACCAACGCT TTTTTTTCAT GGTTACGCAG GAACTAAAAA TTCGTTTGGC TCCTTACTGC
ATCGCTTGGA GAAACAAGGT GCCACAACTC AAGAATTAGT GCTACTCGTT AAACCTGATG
GGACCGTGGT TAAAGAGCGA GGAGCTTTAA GTGGCAAAGC GACGAATCCC AGTGTTCAAG
TTCTATTTGA AGATAATAAA AACAATGAAT GGAATCAAAC AGAATGGATA AAAAACACAT
TACTCTATTT ACAAAAAAAT TATCAAGTGA ACAAAGCCAA TATTGTCGGG CACTCTATGG
GTGGTGTTAG TGGTTTACGT TATTTAGGAA CCTATGGGCA AGATACATCG TTACCTAAAA
TTGAAAAATT CGTCAGCATT GGAGCACCTT TCAATGATTT TATTGATACG AGTCAACAGC
AAACCATCGA AACGGAACTA GAAAACGGCC CCACAGAAAA AAGTAGCCGC TATTTGGATT
ATCAAGAGAT GATTAATGTT GTTCCAGAAA AACTGCCCAT TTTATTAATT GGTGGTCAAT
TAAGTCCAAC AGATTTAAGT GATGGAACGG TGCCGTTATC TAGTGCCTTA GCAGTCAACG
CCTTGCTAAG ACAGCGAGGA ACTCAAGTCA CTAGCCAGAT TATTAAAGGA GAAAATGCAC
AACATAGTCA ATTACATGAA AATCCTGAAG TAGATCAATT GCTAATCGAA TTTCTATGGC
CGAGTAAAAA ATAG EF050-2 (SEQ ID NO:186) HNMPKN IXYFSLLMGL
VLLLSACQIG ATTKDDNQAA TKEATVELNR TTTPTLFFHG YAGTKNSFGS LLHRLEKQGA
TTQELVLLVK PDGTVVKERG ALSGKATNPS VQVLFEDNKN NEWNQTEWIK NTLLYLQKNY
QVNKANIVGH SMGGVSGLRY LGTYGQDTSL PKIEKFVSIG APFNDFIDTS QQQTIETELE
NGPTEKSSRY LDYQEMINVV PEKLPILLIG GQLSPTDLSD
GTVPLSSALA VNALLRQRGT QVTSQIIKGE NAQHSQLHEN PEVDQLLIEF LWPSKK
EF050-3 (SEQ ID NO:187) TTGCCAAATT GGGGCAACTA CGAAGGATGA CAACCAAGCC
GCCACAAAAG AAGCAACTGT TCAGTTAAAC CGCACAACAA CACCAACGCT TTTTTTTCAT
GGTTACGCAG GAACTAAAAA TTCGTTTGGC TCGTTACTCC ATCGCTTGGA GAAACAAGGT
GCCACAACTC AAGAATTAGT GCTACTCGTT AAACCTGATG GGACCGTCGT TAAAGAGCGA
GGAGCTTTAA GTGGCAAAGC GACGAATCCC AGTGTTCAAG TTCTATTTGA AGATAATAAA
AACAATGAAT GGAATCAAAC AGAATGGATA AAAAACACAT TACTCTATTT ACAAAAAAAT
TATCAAGTGA ACAAAGCCAA TATTGTCGGG CACTCTATGG GTGGTGTTAG TCGTTTACGT
TATTTAGGAA CCTATGGGCA AGATACATCG TTACCTAAAA TTGAAAAATT CGTCAGCATT
GGAGCACCTT TCAATGATTT TATTGATACG AGTCAACAGC AAACCATCGA AACGGAACTA
GAAAACGGCC CCACAGAAAA AAGTAGCCGC TATTTGGATT ATCAAGAGAT GATTAATGTT
GTTCCAGAAA AACTGCCCAT TTTATTAATT GGTGGTCAAT TAAGTCCAAC AGATTTAAGT
GATGGAACGG TGCCGTTATC TAGTGCCTTA GCAGTCAACG CCTTGCTAAG ACAGCGAGGA
ACTCAAGTCA CTAGCCAGAT TATTAAAGGA GAAAATGCAC AACATAGTCA ATTACATGAA
AATCCTGAAG TAGATCAATT GCTAATCGAA TTTCTATGCC CGAGTAAAAA ATAG EF050-4
(SEQ ID NO:188) CQIG ATTKDDNQAA TKEATVELNR TTTPTLFFHG YAGTKNSFGS
LLHRLEKQGA TTQELVLLVK PDGTVVKERG ALSGKATNPS VQVLFEDNKN NEWNQTEWIK
NTLLYLQKNY QVNKANIVGH SMGGVSGLRY LGTYGQDTSL PKIEKFVSIG APFNDFIDTS
QQQTIETELE NGPTEKSSRY LDYQEMINVV PEKLPILLIG GQLSPTDLSD GTVPLSSALA
VNALLRQRGT QVTSQIIKGE NAQHSQLHEN PEVDQLLIEF LWPSKK EF051-1 (SEQ ID
NO:189) TAAAAGAAAA GAGGCGTTCA AATGTCTAAA CAAAAAAAGG CTGTGTTCCT
GCTTAGTTTA TTCAGTTTAG TTGCCCTAAT TGCTGCATGT ACAAATCAGC CGCAAAAAGA
AACAGTTTCA ACAAAAAAAG AAGAAATAAC CCTTGCGGCA GCAGCTAGCT TAGAATCAGT
CATGGAGAAG AAAATTATTC CAGCCTTTGA AAAAGAGCAT CCAGATATTC AGGTAACTGG
AACCTATGAT AGTTCTGGAA AATTACAGAT GCAAATTGAA AAAGGCCTAA AAGCCGATGT
ATTTTTCTCA GCTTCGACAA AACAAATGAA TGCATTGGTT GCAGAAAAAC TAATTAATAA
AAAAAGTGTC GTTCCTTTAT TGGAAAACCA GCTCGTTCTT ATTGTGCCTA ACCAAGATCA
AGCAAAGTGG CATGATTTTT CTGATTTAAA AAAAGCCCAA ATGATAGCAA TTGGTGATCC
TGCAAGTGTT CCAGCTGGTC AATATGCCGA AGAAGGCTTA AAAGCTTTAG GCGCTTGGTC
TTATGTAGAA AAACACGCAA GCTTTGGCAC GAATGTAACA GAAGTCCTTG AATGGGTAGC
TAATGCAAGT GCAGAAGCTG GCTTAGTTTA TGCGACAGAT GCAGCAACCA ATTCAAAAGT
AGCGATTGTT GCGGCCATGC CTGAAGCTGT TTTGAAAAAG CCAATTATCT ATCCAGTTGG
TAAAGTTGCC GCCTCTAAGA AACAAAAATC AGCAGATGCT TTTTTAAATT TTTTACAGAG
TCAACAATGC AGAAAATATT TTGANAATAT TGGCTTTAAG TTAACAAAGT AG EF051-2
(SEQ ID NO:190) MSKQ KKAVFLLSLF SLVALIAACT NQPQKETVST KKEEITLAAA
ASLESVMEKK IIPAFEKEHP DIQVTGTYDS SGKLQMQIEK GLKADVFFSA STKQMNALVA
EKLINKKSVV PLLENQLVLI VPNQDQAKWH DFSDLKKAQM IAIGDPASVP AGQYAEEGLK
ALGAWSYVEK HASFGTNVTE VLEWVANASA EAGLVYATDA ATNSKVAIVA AMPEAVLKKP
IIYPVGKVAA SKKQKSADAF LNFLQSQQCR KYFXNIGFKL TK EF051-3 (SEQ ID
NO:191) ATGT ACAAATCAGC CGCAAAAAGA AACAGTTTCA ACAAAAAAAG AAGAAATAAC
CCTTGCGGCA GCAGCTAGCT TAGAATCAGT CATGGAGAAG AAAATTATTC CAGCCTTTGA
AAAAGAGCAT CCAGATATTC AGGTAACTGG AACCTATGAT AGTTCTGGAA AATTACAGAT
GCAAATTGAA AAAGGCCTAA AAGCCGATGT ATTTTTCTCA GCTTCGACAA AACAAATGAA
TGCATTGGTT GCAGAAAAAC TAATTAATAA AAAAAGTGTC GTTCCTTTAT TGGAAAACCA
GCTCGTTCTT ATTGTGCCTA ACCAAGATCA AGCAAAGTGG CATGATTTTT CTGATTTAAA
AAAAGCCCAA ATGATAGCAA TTGGTGATCC TGCAAGTGTT CCAGCTGGTC AATATGCCGA
AGAAGGCTTA AAAGCTTTAG GCGCTTGGTC TTATGTAGAA AAACACGCAA GCTTTGGCAC
GAATGTAACA GAAGTCCTTG AATGGGTAGC TAATGCAAGT GCAGAAGCTG GCTTAGTTTA
TGCGACAGAT GCAGCAACCA ATTCAAAAGT AGCGATTGTT GCGGCCATGC CTGAAGCTGT
TTTGAAAAAG CCAATTATCT ATCCAGTTGG TAAAGTTGCC GCCTCTAAGA AACAAAAATC
AGCAGATGCT TTTTTAAATT TTTTACAGAG TCAACAATGC AGAAAATATT TTGANAATAT
TGGCTTTAAG TTAACAAAGT AG EF051-4 (SEQ ID NO:192) CT NQPQKETVST
KKEEITLAAA ASLESVMEKK IIPAFEKEHP DIQVTGTYDS SGKLQMQIEK GLKADVFFSA
STKQMNALVA EKLINKKSVV PLLENQLVLI VPNQDQAKWH DFSDLKKAQM IAIGDPASVP
AGQYAEEGLK ALGAWSYVEK HASFGTNVTE VLEWVANASA EAGLVYATDA ATNSKVAIVA
AMPEAVLKKP IIYPVGKVAA SKKQKSADAF LNFLQSQQCR KYFXNIGFKL TK EF052-1
(SEQ ID NO:193) TAAAGTAGGA GAAGCGCAAG CGAAAAAAGT GAATCAATCG
GCAGCGTATC AAGTAGTGAT CCCACAATGG GTACCATGGG TAGCATTATC TTTGACAGTA
GCACTTGCTG GATTGATTGC TTACTTAGTT CGTCGTGGAG AGAAGTGGAA AAACGAAGGG
GAAGTGACAT AATGAGANGA NGAAATCTTC NGTTTTTATT ATTGTTGGTT CTATTAATTT
ATATTCCTCA AACAACTTAT GCAGAAAATA GGGAGACCAC AGAAGTCGGA ATCGGGTTTA
CAAAAACTTC AGACATACCA TCAAAAAAAA ATCCAGTTGT GAATGTATTG CCGCAAACAA
CCATTCAATC GCTATCAATC GTTCGTAGCA GAACGCAAAT AAAAAGATTA CCTAAAACTG
GTGACAATCG AATAACTTGG CTAAGCTGGT TTGGCATATT GTTTTTAATA AGTAGTTTTT
GGCTGTTTCT ATTTAGACAA TTATGTAGAA AAGGAGAATA A EF052-2 (SEQ ID
NO:194) MRXX NLXFLLLLVL LIYIPQTTYA ENRETTEVOT GFTKTSDIPS KKNPVVNVLP
QTTIQSLSIV RSRTQIKRLP KTGDNRITWL SWFGILFLIS SFWLFLFRQL CRKGE
EF052-3 (SEQ ID NO:195) AGAAAATA GGGAGACCAC AGAAGTCGGA ATCGGGTTTA
CAAAAACTTC AGACATACCA TCAAAAAAAA ATCCAGTTGT GAATGTATTG CCGCAAACAA
CCATTCAATC GCTATCAATC GTTCGTAGCA GAACGCAAAT AAAAAGAT EF052-4 (SEQ
ID NO:196) ENRETTEVGI GFTKTSDIPS KKNPVVNVLP QTTIQSLSIV RSRTQIKR
EF053-1 (SEQ ID NO:197) TAGTCATGGC ACCATAACAA GGAGGAGAGA AGTGAGATGA
AAAAATACCT TTTGCTTAGT TGTTTTTTAG GTCTTTTCAG CTTCTGTCAT TCACACACTG
CGTTTGGAGA AGCAGCTTAT GAAAATAGTG GTGTTGTCTC CTTTTATGGA ACGTATGAAT
ATCCCACAGA AGAGTCGACA ACAGCGACTA GTAATTCTTC CACAACGACC GAACCCACCA
AGCCAGCTGA CCGAGGCGCT TCATCCGTCC TTTCTTCTGG CGTATATGGA TCGCGACAAG
GAAGATTACC AGCGACAGGT ACCACCAATC AAGCACCATT TATTTATTTG GGAATCAGCC
TTATCACTAT ACGCATATTA TTTATTAAAA GGAGAAGAGA AGATGAAAAA AACAGTATTA
GCAGTAGTAG GGATTGTAGG ATTTAG EF053-2 (SEQ ID NO:198) MKKYLLLSC
FLGLFSFCHS DTAFGEAAYE NSGVVSFYGT YEYPTEESTT ATSNSSTTTE PTKPADGGAS
SVLSSGVYGS RQGRLPATGT TNQAPFIYLG ISLITIGILF IKRRREDEKN SISSSRDCRI
EF053-3 (SEQ lED NO:199) TTTGGACA AGCAGCTTAT GAAAATAGTG GTGTTGTCTC
CTTTTATGGA ACGTATGAAT ATCCCACAGA AGAGTCGACA ACAGCGACTA GTAATTCTTC
CACAACGACC GAACCCACCA AGCCAGCTGA CGGAGGCGCT TCATCCGTCC TTTCTTCTGG
CGTATATCGA TCGCGACAAG GAAGA EF053-4 (SEQ ID NO:200) FGEAAYE
NSGVVSFYGT YEYPTEESTT ATSNSSTTTE PTKPADGGAS SVLSSGVYGS RQGR EF054-1
(SEQ ID NO:201) TAAATAAAAA ATTATTTGGA GGAAATTACA ATGAAAAAAA
TTATTTTATC AAGCTTGTTT AGTGCAGTAC TAGTATTCGG TGGCGGAAGT ATAACAGCAT
TCGCTGACGA TTTAGGACCA ACAGATCCAG CAACTCCACC AATTACCGAA CCAACTGATT
CTAGTGAACC TACGAATCCT ACTGACCCGG TGGATCCTGC AGAACCGCCA GTAATACCAA
CTGATCCAAC AGAACCAAGC AAGCCAACCG AGCCTACAAC ACCGAGTGAG CCAGAAAAGC
CAACAGAACC AACAACGCCA ATTGATCCTG GAACGCCGGT TGAACCGACT GAACCAAGCG
AGCCAACAGA ACCTAGTCAA CCAACCGAGC CTACAACACC AAGCGAACCA GAAAAACCTG
TTACTCCAGA ACAACCGAAA GAACCAACTC AACCAGTGAT TCCAGAAAAA CCAGCAGAAC
CAGAAACACC AAAAACTCCT GAACAGCCCA CTAAACCAAT AGACGTAGTC GTTACACCTA
GTGGAGAAAT TGATAAAACG AATCAATCGG CAGGAACACA ACCAAGTATT CCTATTGAAA
CAAGCAACTT AGCGGAGGTA ACACATGTAC CAAGTGAAAC TACTCCAATT ACAACAGAAG
CTGGGGAAGA AATTGTAGCA GTAGATAAAG GTGTTCCGTT AACCAAAACA CCAGAAGGAT
TAAAACCAAT TAGCAGCTCG TATAAGGTTT TACCTAGCGG AAACGTTGAG GTAAAAGCAA
GTGATGGAAA AATGAAAGTA TTGCCACATA CAGGAGAGAA ATTCACACTC CTTTTCTCTG
TATTGGGAAG CTTCTTTGTA TTAATTTCAG GATTCTTTTT CTTTAAAAAG AATAAGAAAA
AAGCTTAA EF054-2 (SEQ ID NO:202) M KKIILSSLFS AVLVFGGGSI TAFADDLGPT
DPATPPITEP TDSSEPTNPT EPVDPAEPPV IPTDPTEPSK PTEPTTPSEP EKPTEPTTPI
DPGTPVEPTE PSEPTEPSQP TEPTTPSEPE KPVTPEQPKE PTQPVIPEKP AEPETPKTPE
QPTKPIDVVV TPSGEIDKTN QSAGTQPSIP IETSNLAEVT HVPSETTPIT TEAGEEIVAV
DKGVPLTKTP EGLKPISSSY KVLPSGNVEV KASDGKMKVL PHTGEKFTLL FSVLGSFFVL
ISGFFFFKKN KKKA EF054-3 (SEQ ID NO:203) A ACAGATCCAG CAACTCCACC
AATTACCGAA CCAACTGATT CTAGTGAACC TACGAATCCT ACTGAGCCGG TGGATCCTGC
AGAACCGCCA GTAATACCAA CTGATCCAAC AGAACCAAGC AAGCCAACCG AGCCTACAAC
ACCGAGTGAC CCAGAAAAGC CAACAGAACC AACAACGCCA ATTGATCCTG GAACGCCGGT
TGAACCGACT GAACCAAGCG AGCCAACAGA ACCTAGTCAA CCAACCGAGC CTACAACACC
AAGCGAACCA GAAAAACCTG TTACTCCAGA ACAACCGAAA CAACCAACTC AACCAGTGAT
TCCAGAAAAA CCAGCAGAAC CAGAAACACC AAAAACTCCT GAACAGCCCA CTAAACCAAT
AGACGTAGTC GTTACACCTA GTGGAGAkAT TGATAAAACG AATCAATCGG CAGGAACACA
ACCAAGTATT CCTATTGAAA CAAGCAACTT AGCGGAGGTA ACACATGTAC CAAGTGAAAC
TACTCCAATT ACAACAGAAG CTGGGGAAGA AATTGTAGCA GTAGATAAAG GTGTTCCGTT
AACCAAAACA CCAGAAGGAT TAAAACCAAT TAGCAGCTCG TATAAGGTTT TACCTAGCGG
AAACGTTGAG GTAAAAGCAA GTGATGGAAA AATGAAAGTA T EF054-4 (SEQ ID
NO:204) DDLGPT DPATPPITEP TDSSEPTNPT EPVDPAEPPV IPTDPTEPSK
PTEPTTPSEP EKPTEPTTPI DPGTPVEPTE PSEPTEPSQP TEPTTPSEPE KPVTPEQPKE
PTQPVIPEKP AEPETPKTPE QPTKPIDVVV TPSGEIDKTN QSAGTQPSTP IETSNLAEVT
HVPSETTPIT TEAGEEIVAV DKGVPLTKTP EGLKPISSSY KVLPSGNVEV KASDGKMKV
EF055-1 (SEQ ID NO:205) TAACAAAAGG TTGTTTTGTC TTTCTTGTGT AAAAGGGCAA
GAAAGGCTAG CGAGTTAAAA GGAGGTTTTT CAATGAAAAA AAAGCGTTAT TTAATGATTG
TGTGTCTACT ATCTTCTCCT AGTTTTTTTA TAAATGTTGA AGCGTCTGAT GGTCGTTCTA
GTTCGGTGGG GATTGAATTT TACCAAAATC CGAGAACACC CGCTCCTAAA GATCCCCCAC
CGAAAACAGA TGCGCCAGCT GCTGATCCCA AGGAACCAGC TGGTCCTCCG CAAGGAGATC
AACGAAGTGG TGGTTCGACA CAGACCACCA CAACTGGCTC AACGCTCCCT CGTACAGGGA
GCAAGAGTCA GGCAAATTTG AGCATTCTCN GNTTCGCCTT AATCGGTTTG GCGGGAATCG
TACATAGAAA GAAGGGACGA CATGAAGCAA ACTAA EF055-2 (SEQ ID NO:206)
MKKKRYL MIVCLLSSPS FFINVEASDG GSSSVGIEFY QNPRTPAPKD PPPKTDAPAA
DPKEPAGPPQ GDQRSGGSTQ TTTTGSTLPR TGSKSQANLS ILXFALIGLA GIVHRKKGRH
EAN EF055-3 (SEQ ID NO:207) AGCGTCTGAT GGTGGTTCTA GTTCGGTGGG
CATTGAATTT TACCAAAATC CGAGAACACC CGCTCCTAAA GATCCCCCAC CCAAAACAGA
TGCGCCAGCT GCTGATCCCA AGGAACCAGC TGGTCCTCCG CAAGGAGATC AACGAAGTGG
TGGTTCGACA CAGACCACCA CAACTCGCTC AACG EE055-4 (SEQ ID NO:208) SDG
GSSSVGIEEY QNPRTPAPKD PPPKTDAPAA DPKEPAGPPQ GDQRSGGSTQ TTTTGST
EF056-1 (SEQ ID NO:209) TAAATGAAAA AAAAGCGTTA TTTAATAATT GCGTGTTTAC
TATTTTCCCC TAGTTTTTTT ATAAATGTTG AAGCATCTGA GGGTGGTTCT AGTTCGCTGG
GAATTGAATT TTACCAAAAT CCGGCAACAC CCGCTCCTAA AGATGCCCCA CCGAAAACAG
ATGAGCCAGC TGCGGATCCC AAGGAACCAG CTGGTCCTCT GCAAGGAGAT CAACGAAGTG
GTGGTTCGAC ACAGACCACC ACAGCTGGCT CGCAGCTCCC TCGTACAGGA AGCAAGAGTC
AGGCAAACCT GAGCATTCTT GGTCTTGTCT TGATTGGTCT TGTCGGAATG GTCCAGAGAA
AGAAGGGACG ACATGAAGCA AACTAA EF056-2 (SEQ ID NO:210) MKKKRYLIIA
CLLFSPSFFI NVEASEGGSS SVGIEFYQNP ATPAPKDAPP KTDEPAADPK EPAGPLQGDQ
RSGGSTQTTT AGSQLPRTGS KSQANLSTLG LVLLGLVGMV QRKKGRHEAN EF056-3 (SEQ
ID NO:211) ATCTGA GGGTC-GTTCT AGTTCGGTGG GAATTGAATT TTACCAAAAT
CCGGCAACAC CCGCTCCTAA AGATGCCCCA CCGAAAACAG ATGAGCCAGC TGCGGATCCC
AAGGAACCAG CTGGTCCTCT GCAAGGAGAT CAACGAAGTG GTGGTTCGAC ACAGACCACC
ACAGCTGGCT CGCAG EF056-4 (SEQ ID NO:212) SEGGSS SVGIEFYQNP
ATPAPKDAPP KTDEPAADPK EPAGPLQGDQ RSGGSTQTTT AGSQ EF057-1 (SEQ ID
NO:213) TAATGTTTAT TGGCTGGCCC AGTCAATGTT GAAAATGGGG AAGCAGGAAT
TCAGATGAAA ATCATAAAAA GGTTTAGTTT GGTATGTTTA GGGCTATTGA TCATTGGGTT
GCNAACAAAA AGCGNTATGG CTGAAGAAAA TAATTATGAA TCAAATGGTC AAGCGAGCTT
CTATGGTACC TACGTTTATG AGAATGAAAA AGAGTCAAAT GACGTAGCGT ATACCCAACA
ATCAGAAGAA CAGGGAAGAA ACAATTTAGC TGCTTCTGGA CAAGCAGTTT TACCTAAAAC
AGCCGAGTCT GAAAATCCGC TGTATTCCTT GATAGGAGTT AGTTTGTTGG GGATAGTCAT
TTATTTAATT AATAAAATGA AACGAGAGAA GGAGTTTATT TAA EF057-2 (SEQ ID
NO:214) MKI IKRFSLVCLG LLIIGLXTKS XMAEENNYES NGQASFYGTY VYENEKESND
VAYTQQSEEQ GRNNLAASGQ AVLPKTGESE NPLYSLIGVS LLGIVIYLIN KMKREKEFI
EF057-3 (SEQ ID NO:215) AAA TAATTATGAA TCAAATGGTC AAGCGAGCTT
CTATGGTACC TACGTTTATG AGAATGAAAA AGAGTCAAAT GACGTAGCGT ATACCCAACA
ATCAGAAGAA CAGGGAAGAA ACAATTTAGC TGCTTCTGGA CAAGCAGTTT EF057-4 (SEQ
ID NO:216) EENNYES NGQASFYGTY VYENEKESND VAYTQQSEEQ GRNNLAASGQ AV
EF058-1 (SEQ ID NO:217) TGAAGAACGT TCTATTTGGT TGACGATTGC AGGCCTGCTA
ATCATTGGGA TGCTAGTCAT TTGGCTATTT TATCAAAAAC AAAAAAGAGG AGAGAGAAAA
TGAAGCAATT AAAAAAAGTT TGGTACACCG TTAGTACCTT GTTACTAATT TTGCCACTTT
TCACAAGTGT ATTAGGGACA ACAACTGCAT TTGCAGAAGA AAATGGGGAG AGCGCACAGC
TCGTGATTCA CAAAAACAAA ATGACGGATT TACCAGATCC GCTTATTCAA AATAGCGGGA
AAGAAATGAG CGAGTTTGAT AAATATCAAC GACTGGCAGA TGTGACGTTT AGTATTTATA
ACGTGACGAA CGAATTTTAC GAGCAACGAG CGGCAGGCGC AAGCGTTGAT GCAGCTAAAC
AAGCTGTCCA AAGTTTAACT CCTGGGAAAC CTGTTGCTCA AGOAACCACC GATGCAAATG
GGAATGTCAC TGTTCAGTTA CCTAAAAAAC AAAATGGTAA AGATGCAGTG TATACCATTA
AAGAAGAACC AAAAGAGGGT GTAGTTGCTG CTACGAATAT GGTGGTGGCG TTCCCAGTTT
ACGAAATGAT CAAGCAAACA GATGGTTCCT ATAAATATGG AACAGAAGAA TTAGCGGTTG
TTCATATTTA TCCTAAAAAT GTGGTAGCCA ATGATGGTAG TTTACATGTG AAAAAAGTAG
GAACTGCTGA AAATGAAGGA TTAAATGGCG CAGAATTTGT TATTTCTAAA AGCGAAGGCT
CACCAGGCAC AGTAAAATAT ATCCAAGGAG TCAAAGATGG ATTATATACA TGGACAACGG
ATAAAGAACA AGCAAAACGC TTTATTACTG GGAAAAGTTA TGAAATTGGC GAAAATGATT
TCACAGAAGC AGAGAATGGA ACGGGAGAAT TAACAGTTAA AAATCTTGAG GTTGGTTCGT
ATATTTTAGA AGAAGTAAAA GCTCCAAATA ATGCAGAATT AATTGAAAAT CAAACAAAAA
CACCATTTAC AATTGAAGCA AACAATCAAA CACCTGTTGA AAAAACAGTC AAAAATGATA
CCTCTAAAGT TGATAAAACA ACACCAAGCT TAGATGGTAA AGATGTGGCA ATTGGCGAAA
AAATTAAATA TCAAATTTCT GTAAATATTC CATTGGGGAT TGCAGACAAA GAAGGCGACG
CTAATAAATA CGTCAAATTC AATTTAGTTG ATAAACATGA TGCAGCCTTA ACTTTTGATA
ACGTGACTTC TGGAGAGTAT GCTTATGCGT TATATGATGG GGATACAGTG ATTGCTCCTG
AAAATTATCA AGTGACTGAA CAAGCAAATG GCTTCACTGT CGCCGTTAAT CCAGCGTATA
TTCCTACGCT AACACCAGGC GGCACACTAA AATTCGTTTA CTTTATGCAT TTAAATGAAA
AAGCAGATCC TACGAAAGGC TTTAAAAATG AGGCGAATGT TGATAACGGT CATACCGACG
ACCAAACACC ACCAACTGTT GAAGTTGTCA CAGGTGGGAA ACGTTTCATT AAAGTCGATG
GCGATGTGAC AGCGACACAA GCCTTGGCGG GAGCTTCCTT TGTCGTCCGT GATCAAAACA
GCGACACAGC AAATTATTTG AAAATCGATG AAACAACGAA AGCAGCAACT TGGGTCAAAA
CAAAAGCTGA AGCAACTACT TTTACAACAA CGGCTGATGG ATTAGTTGAT ATCACAGGGC
TTAAATACGG TACCTATTAT TTAGAAGAAA CTGTAGCTCC TGATGATTAT GTCTTGTTAA
CAAATCGGAT TGAATTTGTG GTCAATGAAC AATCATATGG CACAACAGAA AACCTAGTTT
CACCAGAAAA AGTACCAAAC AAACACAAAG GTACCTTACC TTCAACAGGT GGCAAAGGAA
TCTACGTTTA CTTAGGAAGT GGCGCAGTCT TGCTACTTAT TGCAGGAGTC TACTTTGCTA
GACGTAGAAA AGAAAATGCT TAA EF058-2 (SEQ ID NO:218) MKQLKKVW
YTVSTLLLIL PLFTSVLGTT TAFASENGES AQLVIHKKKM TDLPDPLIQN SGKEMSEFDK
YQGLADVTFS IYNVTNEFYE QRAAGASVDA AKQAVQSLTP GKPVAQGTTD ANGNVTVQLP
KKQNGKDAVY TIKEEPKEGV VAATNNVVAF PVYEMIKQTD GSYKYGTEEL AVVHIYPKNV
VANDGSLHVK KVGTAENEGL NGAEFVISKS EGSPGTVKYI QGVKDGLYTW TTDKEQAKRF
ITGKSYEIGE NDFTEAENGT GELTVKNLEV GSYILEEVKA PNNAELIENQ TKTPFTIEAN
NQTPVEKTVK NDTSKVDKTT PSLDGKDVAI GEKIKYQISV NIPLGIADKE GDANKYVKFN
LVDKHDAALT FDNVTSGEYA YALYDGDTVI APENYQVTEQ ANGFTVAVNP AYIPTLTPGG
TLKFVYFMHL NEKADPTKGF KNEANVDNGH TDDQTPPTVE VVTGGKRFIK VDGDVTATQA
LAGASFVVRD QNSDTANYLK IDETTKAATW VKTKAEATTF TTTADGLVDI TGLKYGTYYL
EETVAPDDYV LLTNRIEFVV NEQSYGTTEN LVSPEKVPNK HKGTLPSTGG KGIYVYLGSG
AVLLLIAGVY FARRRKENA EF058-3 (SEQ ID NO:219) AGAAGA AAATGGGGAG
AGCGCACAGC TCGTGATTCA CAAAAAGAAA ATGACGGATT TACCAGATCC GCTTATTCAA
AATAGCGGGA AAGAAATGAG CGAGTTTCAT AAATATCAAG
GACTGGCAGA TGTGACGTTT AGTATTTATA ACGTGACGAA CGAATTTTAC GAGCAACGAG
CGGCAGGCGC AAGCGTTCAT GCACCTAAAC AAGCTGTCCA AAGTTTAACT CCTGGGAAAC
CTGTTGCTCA AGGAACCACC GATGCAAATG GGAATGTCAC TGTTCAGTTA CCTAAAAAAC
AAAATGGTAA AGATGCAGTG TATACCATTA AAGAAGAACC AAAAGAGGGT GTAGTTGCTG
CTACGAATAT GGTGGTGGCG TTCCCAGTTT ACGAAATGAT CAAGCAAACA GATOOTTOCT
ATAAATATGG AACAGAAGAA TTAGCGGTTG TTCATATTTA TCCTAAAAAT OTOCTAGOCA
ATGATGGTAG TTTACATGTG AAAAAAGTAG GAACTGCTGA AAATGAAGGA TTAAATGGCG
CAGAATTTGT TATTTCTAAA ACCGAAGGCT CACCAGGCAC AGTAAAATAT ATCCAAGGAG
TCAAAGATGG ATTATATACA TGGACAACGG ATAAAGAACA AGCAAAACGC TTTATTACTG
GGAAAAGTTA TGAAATTGGC GAAAATGATT TCACAGAAGC AGAGAATGGA ACGGGAGAAT
TAACAGTTAA AAATCTTGAG GTTGGTTCGT ATATTTTAGA AGAAGTAAAA GCTCCAAATA
ATGCAGAATT AATTGAAAAT CAAACAAAAA CACCATTTAC AATTGAAGCA AACAATCAAA
CACCTGTTGA AAAAACAGTC AAAAATGATA CCTCTAAAGT TGATAAAACA ACACCAAGCT
TAGATGGTAA AGATGTGGCA ATTGGCGAAA AAATTAAATA TCAAATTTCT GTAAATATTC
CATTGGGGAT TGCAGACAAA GAAGGCGACG CTAATAAATA CGTCAAATTC AATTTAGTTG
ATAAACATGA TGCAGCCTTA ACTTTTGATA ACGTGACTTC TGGAGAGTAT GCTTATGCGT
TATATGATGG GGATACAGTG ATTGCTCCTG AAAATTATCA AGTGACTGAA CAAGCAAATG
GCTTCACTGT CGCCGTTAAT CCAGCGTATA TTCCTACGCT AACACCAGGC GGCACACTAA
AATTCGTTTA CTTTATGCAT TTAAATGAAA AAGCAGATCC TACGAAAGGC TTTAAAAATG
AGGCGAATGT TGATAACGGT CATACCGACG ACCAAACACC ACCAACTGTT GAAGTTGTGA
CAGGTGGGAA ACGTTTCATT AAAGTCGATG GCGATGTGAC AGCGACACAA GCCTTGGCGG
GAGCTTCCTT TCTCGTCCCT GATCAAAACA GCGACACAGC AAATTATTTG AAAATCGATG
AAACAACCAA AGCAGCAACT TGGGTGAAAA CAAAAGCTGA AGCAACTACT TTTACAACAA
CGGCTGATGG ATTAGTTGAT ATCACAGGGC TTAAATACGG TACCTATTAT TTAGAAGAAA
CTGTAGCTCC TGATGATTAT GTCTTGTTAA CAAATCGGAT TGAATTTGTG GTCAATGAAC
AATCATATGG CACAACAGAA AACCTAGTTT CACCAGAAAA AGTACCAAAC AAACACAAAG
GTACCTTACC T EF058-4 (SEQ ID NO:220) FENGES AQLVIHKKKM TDLPDPLJIQN
SGKENSEFDK YQGLADVTFS IYNVTNEFYE QEAAGASVDA AKQAVQSLTF GKPVAQGTTD
ANGNVTVQLP KKQNGKDAVY TIKEEPKEGV VAATNMVVAF PVYEMIKQTD GSYKYGTEEL
AVVHIYPKNV VANDGSLHVK KVGTAENEGL NGAEFVISKS EGSPGTVKYI QGVKDGLYTW
TTDKEQAKRF ITGKSYEIGE NDFTEAENGT GELTVKNLEV GSYILEEVKA PNNAELIENQ
TKTPFTIEAN NQTPVEKTVK NDTSKVDKTT PSLDGKDVAI GEKIKYQISV NTPLGIADKE
GDANKYVKFN LVDKHDAALT FDNVTSGEYA YALYDGDTVI APENYQVTEQ ANGFTVAVNP
AYIPTLTFGG TLKFVYFMHL NEKADPTKGF KNEANVDNGH TDDQTPPTVE VVTGGKRFIK
VDGDVTATQA LAGASFVVRD QNSDTANYLK IDETTKAATW VKTKAEATTF TTTADGLVDT
TGLKYGTYYL EETVAPDDYV LLTNRTEFVV NEQSYGTTEN LVSPEKVPNK HKGT FF059-1
(SEQ ID NO:221) TAGATTGGAA GAATGAAAAT GAAAAAAATG ATTATTATTG
CCTTATTCAG TACAAGCCTT TTAGCAGGGG GAAGCAGTGT TTCTGCTTAT GCGCAAGAAT
CAGAAGGAAA TCTTGGTGAA ACAACAGGGA GTGTTTTACC AGATGAACCG AATGTACCAA
CTGACCCAAT AACGCCAAGT GAGCCAGAGC AACCAACAGA GCCAAGTACA CCAGAGCAAC
CATCGGAACC GTCAACACCA ACCGAACCTA GTGAGCCTTC AAAACCGACG GATCCTTCGT
TACCAGACGA ACCGAGCGTA CCAACAGAGC CAACAACGCC AAGTAAGCCA GAGCAACCAA
CAGAGCCAAC AACGCCAAGT GTACCAGAGC AACCAACAGA GCCAAGTGTA CCAGAAAAAC
CAGTAGAACC AAATAAACCA ACCGAGCCAG AAAAGCCTGT GCCAGTTGTT CCTGAAAAAC
CAGTTGTACC ACAACAACCA GAGCAACCAA CAGATGTGGT GGTAAAGCCA AATGGAGAAA
TTGCAACAGG AGAATCTACA CAACAGCCAA CTGTTCCAAT TGAAACGAAT AACCTTTCAG
AAGTAACACA TGTCCCAACT GTGACGACAC CGATTGAAAC AGCAAGCGGA GAAGCAATTG
TCGCAGTGGA TAAGGGCGTT CCTTTAACAC AAACGGCTGA TGGATTAAAA CCGATTAAAA
GTGAATATAA AGTATTACCA AGTGGCAATG TACAAGTGAA AAGTGCTGAC GGAAAAATGA
AAGTACTTCC TTACACTGGT GAAAAAATGG GCATAATTGG GTCAATCCCT GGTGTATGTT
TGACTGTTTT ATCAGGAATC TTAATTTATA AAAAACGTAA AGTGTAG EF059-2 (SEQ ID
NO:222) MKKMI IIALFSTSLL AGGSSVSAYA QESEGNLGET TGSVLPDEPN
VPTDPITPSE PEQPTEPSTP EQPSEPSTPT EPSEPSKPTD PSLPDEPSVP TEPTTPSKPE
QPTEPTTPSV PEQPTEPSVP EKPVEPNKPT EPEKPVPVVP EKPVVPQQPE QPTDVVVKPN
GEIATGESTQ QPTVPIETNN LSEVTHVPTV TTPIETASGE AIVAVDKGVP LTQTADGLKP
IKSEYKVLPS GNVQVKSADC KMKVLPYTGE KMGIIGSIAG VCLTVLSGIL IYKKRKV
EF059-3 (SEQ ID NO:223) AGAAGGAAA TCTTGGTGAA ACAACAGGGA GTGTTTTACC
AGATGAACCG AATGTACCAA CTGACCCAAT AACGCCAAGT GAGCCAGAGC AACCAACAGA
GCCAAGTACA CCAGAGCAAC CATCGGAACC GTCAACACCA ACCGAACCTA GTGAGCCTTC
AAAACCGACG GATCCTTCGT TACCAGACGA ACCGAGCGTA CCAACAGACC CAACAACGCC
AAGTAAGCCA GAGCAACCAA CAGAGCCAAC AACGCCAAGT GTACCAGAGC AACCAACAGA
GCCAACTGTA CCAGAAAAAC CACTAGAACC AAATAAACCA ACCGAGCCAG AAAAGCCTGT
GCCAGTTGTT CCTGAAAAAC CAGTTGTACC ACAACAACCA GAGCAACCAA CAGATGTGGT
CGTAAAGCCA AATGGAGAAA TTGCAACAGG AGAATCTACA CAACAGCCAA CTGTTCCAAT
TGAAACGAAT AACCTTTCAG AAGTAACACA TGTCCCAACT GTGACGACAC CGATTGAAAC
AGCAAGCGGA GAAGCAATTG TCGCAGTGGA TAAGGGCGTT CCTTTAACAC AAACGGCTGA
TGGATTAAAA CCGATTAAAA GTGAATATAA AGTATTACCA AGTGGCAATG TACAAGTGAA
AAGTGCTGAC GGAAAAATGA AAGTAC EF059-4 (SEQ ID NO:224) EGNLGET
TGSVLPDEPN VPTDPITPSE PEQPTEPSTP EQPSEPSTPT EPSEPSKPTD PSLPDEPSVP
TEPTTPSKPE QPTEPTTPSV PEQPTEPSVP EKPVEPNKPT EPEKPVPVVP EKPVVPQQPE
QPTDVVVKPN GEIATGESTQ QPTVPIETNN LSEVTHVPTV TTPIETASGE AIVAVDKGVP
LTQTADGLKP IKSEYKVLPS GNVQVKSADG KNKV EF060-1 (SEQ ID NO:225)
TGAAAAATAG ACAAGGAGCA CGCGATGATG ACAATGAAAA GTAAAGGGTC ACTTCTGGTG
ACGTTGGGAA TACTTTTAAC CGTTGGCATT GCGAGTCTAA TTGTTTCTTC TGAGAGTTTT
GCAGAAGAAG TAGGGCAAAC GAATATCGGT GTAACGTTCT ATGGAGGAAA AGAGCCACTA
AAAACGGAAG CTGTCATTAA GCCAATAGAG CAACCAGTCA CTC-ATAAAGA TAAAAAAACG
TCACAACAAC AAGACAAAGT GAGCAGAAAA ACCACTGCTA AAACGAATCC GACTAATGCA
CAGACGTCAT TACCAAGGAC AGGTGAACGA AATAGCACGT GGCTTTACAG CCTTGGTATT
GCCTGTTTAC TCGTAGTACT AACAAGTTTC TATTATTTGA ATAAAAAAAG GAAAAAGGAA
AAATAA EF060-2 (SEQ ID NO:226) MMT MKSKGSLLVT LGILLTVGIA SLIVSSESFA
EEVGQTNIGV TFYGGKEPLK TEGVIKPIEQ PVTDKDKKTS QQQDKVSRKT TAKTNPTNAQ
TSLPRTGERN STWLYSLGIA CLLVVLTSFY YLNKKRKKEK EF060-3 (SEQ ID NO:227)
AGAAGAAG TAGCGCAAAC CAATATCGGT GTAACGTTCT ATGGAGCAAA AGAGCCACTA
AAAACGGAAG GTGTCATTAA GCCAATAGAG CAACCAGTCA CTGATAAAGA TAAAAAAACG
TCACAACAAC AAGACAAAGT GAGCAGAAAA ACCACTGCTA AAACGAATCC GACTAATGCA
CAGACCTCAT EE060-4 (SEQ ID NO:228) EEVGQTNIGV TFYGGKEPLK TEGVIKPIEQ
PVTDKDKKTS QQQDKVSRKT TAKTNPTNAQ TS EF061-1 (SEQ ID NO:229)
TAATGGAACG ACCCACAGAA GAAGATTTTG AACTTACAAA TTAAAATTAA AATGGAGGAA
ATAATGATGA AAAAAATTCT TTTTGCTAGT TTATTTAGTG CCACACTACT ATTTGGGGGA
AGTGAAATTT CTGCTTTTGC ACAAGAAATT ATCCCTGATG ATACTACGAC ACCGCCCATT
GAAGTACCAA CACAACCAAG TACACCAGAA AAGCCAACAG ATCCAACACC GCCAATTGAG
CCACCTGTAG ACCCTGTAGA GCCACCTATT ACACCAACGG AGCCAACAGA ACCGACAGAG
CCGACAACAC CAACAGAACC TACAACTCCT ACAGAGCCAA GTGAACCAGA ACAACCAACG
GAGCCAAGTA AACCAGTAGA ACCTGAAAAA CCAGTTACAC CAAGCAAACC AGCAGAACCC
GAAAAAACTG TGACACCAAC TAAACCAACA GAATCTGAAA AACCAGTACA ACCAGCAGAA
CCAAGCAAGC CAATCGACGT TGTTGTAACG CCAACAGGGG AATTAAATCA CGCTGGAAAT
GGTACACAAC AGCCAACAGT CCCTATTGAA ACAAGTAATT TGGCAGAAAT CACGCACGTG
CCTAGTGTAA CAACACCTAT TACAACTACA GACGGAGAAA ACATTGTAGC TGTAGAAAAA
GGTCTTCCAC TTACACAAAC AGCAGAAGGG TTAAAACCTA TTCAATCNAG TTACAAAGTA
TTGCCTAGCG GAAATGTAGA AGTAAAAGGT AAGGACGGTA AAATGAAGGT TTTACCATAC
ACAGGTGAAC AAATGAATAT CTTTTTATCT GCCGTAGCGG TATCTTGTCT GTAG EF061-2
(SEQ ID 140:230) MKKKTLFASL FSATLLFGGS EISAFAQEII PDDTTTPPIE
VPTEPSTPEK PTDPTPPIEP PVDPVEPPIT PTEPTEPTEP TTPTEPTTPT EPSEPEQPTE
PSKPVEPEKP VTPSKPAEPE KTVTPTKPTE SEKPVQPAEP SKPIDVVVTP TGELNHAGNG
TQQPTVPIET SNLAEITHVP SVTTPITTTD GENIVAVEKO VPLTQTAEGL KPIQSSYKVL
PSGNVEVKGK DGKNKVLPYT GEEMNIELSA VAVSCL EF061-3 (SEQ ID 140:231)
GAAATTT CTGCTTTTGC ACAAGAAATT ATCCCTGATG ATACTAGGAC ACCGCCCATT
GAAGTACCAA CAGAACCAAG TACACCAGAA AAGCCAACAG ATCCAACACC GCCAATTGAG
CCACCTGTAG ACCCTGTAGA GCCACCTATT ACACCAACGG AGCCAACAGA ACOGACACAG
CCGACAACAC CAACAGAACC TACAACTCCT ACAGAGCCAA GTGAACCAGA ACAACCAACG
GAGCCAAGTA AACCAGTAGA ACCTGAAAAA CCAGTTACAC CAAGCAAACC AGCAGAACCC
GAAAAAACTG TGACACCAAC TAAACCAACA GAATCTGAAA AACCAGTACA ACCAGCAGAA
CCAAGCAAGC CAATCGACGT TGTTGTAACG CCAACAGGGG AATTAAATCA CGCTGGAAAT
GGTACACAAC AGCCAACAGT CCCTATTGAA ACAAGTAATT TGGCAGAAAT CACGCACGTG
CCTAGTGTAA CAACACCTAT TACAACTACA GACGGAGAAA ACATTGTAGC TGTAGAAAAA
GGTGTTCCAC TTACACAAAC AGCAGAAGGG TTAAAACCTA TTCAATCNAG TTACAAAGTA
TTGCCTAGCG GAAATGTAGA AGTAAAAGGT AAGGACGGTA AAATGAAGGT TT EF061-4
(SEQ ID NO:232) QEII PDDTTTPPIE VPTEPSTPEK PTDPTPPIEP PVDPVEPPIT
PTEPTEPTEP TTPTEPTTPT EPSEPEQPTE PSKPVEPEKP VTPSKPAEPE KTVTPTKPTE
SEKPVQPAEP SKPIDVVVTP TGELNHAGNG TQQPTVPIET SNLAEITHVP SVTTPTTTTD
GENIVAVEKG VPLTQTAEGL KPIQSSYKVL PSGNVEVKGK DGKMKV EF062-1 (SEQ ID
NO:233) TGATTCTTGA AGCAACAAAT GAAAGCAAAA AAACAATATA AGACATATAA
AGCTAAGAAT CACTGGGTAA CTGTCCCTAT TCTTTTTCTA AGTGTGTTAG GAGCCGTAGG
ATTAGCTACT GATAATGTAC AAGCCGCGGA ATTAGATACG CAACCAGAAA CAACGACGGT
TCAACCCAAT AACCCCGACC TGCAGTCAGA AAAGGAAACA CCTAAAACGG CAGTATCTGA
AGAAGCAACA GTACAAAAAG ACACTACTTC TCAACCGACC AAAGTAGAAG AAGTAGCGCC
AGAAAATAAA GGTACTGAAC AAAGTTCAGC TACCCCAAAT GATACCACAA ACGCGCAACA
ACCAACAGTA GGACCTGAAA AATCAGCACA AGAACAACCA GTAGTAAGCC CTGAAACAAC
CAATGAACCT CTAGGGCACC CAACAGAAGT TGCACCAGCT GAAAATGAAG TGAATAAATC
AACGTCCATT CCTAAAGAAT TTGAAACACC AGACGTTGAT AAAGCAGTTG ATGAAGTAAA
AAAAGATCCA AACATTACCG TTGTTGAAAA ACCAGCAGAA GACTTAGGCA ACGTTTCTTC
TAAAGATTTA GCTGCAAAAG AAAAAGAAGT AGACCAACTA CAAAAAGAAC AAGCGAAAAA
GATTCCCCAA CAAGCAGCTG AATTAAAAGC CAAAAATGAA AAAATTGCCA AAGAAAATGC
AGAAATTGCG GCAAAAAACA AAGCNGAAAA AGAGCGNTAN GANAAAGAAG TCGCNGAATA
CAACAAGCAT AAGAACGAAA ACAGCTATGT CAATGAAGCG ATTAGTAAAA ACCTAGTGTT
CGATCAATCT GTCGTGACGA AAGACACTAA AATTTCGTCG ATTAAAGGCG GAAAATTTAT
CAAAGCAACT GATTTTAATA AAGTAAATGC AGGGGATTCA AAAGATATCT TTACAAAATT
ACGGAAAGAT ATGGGNGGGA AAGNTACTGG CAACTTCCAG AATTCCTTTG TAAAAGAGGC
AAATCTTGGG TCTAATGGTG GGTATGCGGT TCTTTTAGAA AAAAATAAAC CAGTGACACT
GACCTATACA GGACTAAACG CTAGTTATTT AGGACGTAAA ATTACAAAAG CAGAATTTGT
TTATGAACTA CAATCCTCAC CAAGCCAAAG TGGAACGTTA AATGCAGTAT TTTCAAACGA
TCCGATTATC ACNGCTTTTA TTGGTACAAA CAGAGTCAAT GGTAAGGATG TTAAAACACG
CTTAACGATT AAGTTCTTTG ATGCGTCAGG TAAAGAAGTA CTACCAGATA AAGATAGTCC
ATTTGCGTAT GCGCTGTCTT CTTTAAATTC AAGTTTAACG AATAAAGGTG GCCATGCGGA
ATTTGTTTCT GATTTTGGGG CNAACAATGC GTTCAAATAC ATTAATGGNT CNTATGTGAA
AAAACAAGCG GATGGAAAAT TTTACTCACC GGAAGATATT GACTATGGCA CAGGACCTTC
TGGATTGAAA AATAGTGATT GGGACGCTGT AGGTCACAAG AATGCCTACT TTGGTTCAGG
TGTAGGTCTA GCNAATGGNC GTATTTCCTT TTCTTTTGGT ATGACAACAA AAGGAAAAAG
TAATGTGCCT GTATCTAGTG CGCAATGGTT TGCCTTTAGN ACTAACTTAA ATGCGCAATC
AGTGAAGCCT ATTTTCAATT ATGGGAATCC AAAAGAACCA GAAAAAGCAA CGATTGAATT
CAATNGATAC AAAGCCAATG TCGTTCCTGT NCTTGTGCCN AATAAAGAAG TCACTCATGG
NCAGAAAAAT NTCAATGATT TAAATGTGAA NCGTGGCGAT TCTTTACAAT ACATTGTGAC
AGGGGATACG ACAGAACTTG CCAAAGTAGA TCCAAAAACA GTAACNAAAC AAGGGATTCG
AGATACNTTT GATGCAGAAA AACTGACGAT TGATTTATCC AAAGTGAAAG TTTATCAAGC
AGACGCAAGT CTNAACGANA AAGACTNAAA AGCTGTTGCT GCAGCNATTA ATTCAGGAAN
AGCTAAAGAC GTGACTGCTT CTTATGANCT CAATTTAGAT CAAAACACCG TCACAGCAAT
GATGAAAACC AACGCNGACG GNTCNGTTGT TTTAGCAATG GGGTATAAAT ATTTACTTGT
CTTGCCGTTT GTAGTGAAAA ATGTAGAAGG CGATTTTGAA AATACAGCTG TTCAGCTGAC
AAANGATGGN GAAACGGTAA CAAATACAGT GATTAACCAT GTGCCAGGTA GTAATCCTTC
CAAAGATGTA AAAGCAGATA AAAACGGTAC AGTTGGCAGT GTTTCTCTAC ATGATAAAGA
TATTCCGTTA CAAACAAAAA TTTATTATGA AGTGAAATCT TCCGAACGTC CAGCNAACTA
TGGCGGAATN ACNGAAGAAT GGGGCATGAA TGATGTCTTG GACACGACCC ATGATCGTTT
CACAGGNAAA TGGCACGCTA TTACNAANTA TGACCTTAAA GTAGGGGANA AAACGTTAAA
AGCAGGAACA GATATTTCTG CCTACATTCT TTTAGAAAAC AAAGACAATA AAGACTTGAC
GTTTACNATG AATCAAGCAT TATTGGCNGC NTTAAATGAA GGAAGCAATA AAGTAGGCAA
ACAAGCTTGG TCTGTGTATC TGCAACTCGA ACGGATNAAA ACAGOTOACO TAGAAAACAC
GCAAACAGAA AACTACAACA AAGAGCTTGT NCGTTCTAAT ACNGTCCTCA CGCATACNCC
TGATGATCCA AAACCAACCA AAGCCGTTCA TAACAAGAAA CGGGAACANA TTAANCATGG
AAAAGTNGCT CGTGGTGATG TTCTTTCTTA TGAAATGACN TGGGACTTAA AAGGGTACGA
TAAAGACTTT GCCTTTGATA CACTOGATOT TOCCACAGOC GTTTCTTTCT TCGATGATTA
CGATGAAACG AANGTGACAC CAATCAAAGA CTTACTTCGT GTCAAAGATT CTAAAGGGGN
AGACATTACO AACCAGTTCA CGATCTCNTG GGACGATGCC AAAGGCACGG TGACNATNTC
TGCCAAAGAC CCACAAGCCT TTATTCTAGC GNATGGTGGG CAAGAATTGC GTGTAACNCT
CCCTACAAAA GTCAAAGCCG ATGTTTCTGG NGATGTTTAT AATTCAGCGG AACAAAATAC
ATTTGGNCAA CGAATTAAAA CCAATACNGT TGTCAACCAT ATTCCAAAAG TGAANCCTAA
AAAAGACGTG GTTATTAAAG TNGGTGACAA ACAAAGTCAA AATGGNGCCA CAATCAAATT
AGGGGAGAAN TTCTTCTATG AATTTACAAG TAGTGACATT CCTCCAGAAT ACGCTGGNGT
TGTGGAAGAA TGGTCGATTA GCGATAAACT AGACGTCAAA CATGACAAAT TTAGTGGCCA
ATGGTCTGTG TTTGCCAATT CTAATTTTGT TTTAGCAGAC GGAACCAAAG TGAATAAAGG
GGACGACATT TCGAAACTAT TCACGATGAC CTTTGAACAA GGGGTAGTGA AAATCACGGC
CAGTCAAGCC TTTTTNGATG CGATGAATCT AAAAGAAAAC AAAAACGTTC CACACTCATG
GAAAGCGTTC ATTGGTGTAG AACGAATTGC GGCAGGAGAC GTTTACAACA CAATCGAAGA
ATCTTTCAAC AATGAGAAGA TTAAAACNAA TACGGTAGTG ACNCATACGC CAGAAAAACC
ACAAACNCCA CCAGAAAAAA CAGTCATTGT ACCACCAACA CCAAAAACAC CGCAAGCACC
ACTAGAGOCA TTAGTGGTAG AAAAGGCAAG TGTNGTGCCA GAATTGCCGC AAACAGGCGA
AAAACAAAAT GTCTTATTAA CGGTAGCTGG TAGTTTAGCC GCAATGCTTG GCTTAGCAGG
CTTAGGCTTT AAACGTAGAA AAGAAACAAA ATAA EF062-2 (SEQ ID NO:234) MKAKK
QYKTYKAKNH WVTVPILFLS VLCAVGLATD NVQAAELDTQ PETTTVQPNN PDLQSEKETP
KTAVSEEATV QKDTTSQPTK VEEVAPENKG TEQSSATPND TTNAQQPTVG AEKSAQEQPV
VSPETTNEPL GQPTEVAPAE NEVNKSTSIP KEFETPDVDK AVDEVKKDPN ITVVEKPAED
LGNVSSKDLA AKEKEVDQLQ KEQAKKIAQQ AAELKAKNEK IAKENAEILAA KNKAEKERXX
KEVAEYNKHK NENSYVNEAI SKNLVFDQSV VTKDTKISSI KGGKFIKATD FNKVNAGDSK
DIFTKLRKDM GGKXTGNFQN SFVKEANLGS NGGYAVLLEK NKPVTVTYTG LNASYLGRKI
TKAEFVYELQ SSPSQSGTLN AVFSNDPIIT AFIGTNRVNG KDVKTRLTIK FFDASGKEVL
PDKDSPFAYA LSSLNSSLTN KGGHAEFVSD FGANNAFKYI NGSYVKKQAD GKFYSPEDTD
YGTGPSGLKN SDWDAVGHKN AYFGSGVGLA NGRISFSFGM TTKGKSNVPV SSAQWFAFXT
NLNAQSVKPI FNYGNPKEPE KATIEFNXYK ANVVPVLVPN KEVTDGQKNX NDLNVXRGDS
LQYIVTGDTT ELAKVDPKTV TKQGIRDTFD AEKVTIDLSK TKVYQADASL NXKDXKAVAA
AINSGXAKDV TASYXLNLDQ NTVTAMMKTN ADGSVVLANG YKYLLVLPFV VKNVEGDFEN
TAVQLTXDGE TVTNTVINHV PGSNPSKDVK ADKNGTVGSV SLHDKDIPLQ TKIYYEVKSS
ERPANYGGXT EEWGMNDVLD TTHDRFTGKW HATTXYDLKV GXKTLKAGTD ISAYILLENK
DNKDLTFTNN QALLAALNEG SNKVGKQAWS VYLEVERXKT GDVENTQTEN YNKELVRSNT
VVTHTPDDPK PTKAVHNKKG EXIXHGKVAR GDVLSYEMTW DLKGYDKDFA FDTVDLATGV
SFFDDYDETX VTPIKDLLRV KDSKGXDTTN QFTISWDDAK GTVTXSAKDP QAFILAXGGQ
ELRVTLPTKV KADVSGDVYN SAEQNTFGQR IKTNTVVNHI PKVXPKKDVV IKVGDKQSQN
GATIKLGEXF FYEFTSSDTP AEYAGVVEEW SISDKLDVYH DKFSGQWSVF ANSNFVLADG
TKVNKGDDIS KLFTMTFEQG VVKITASQAF XDAMNLKENK NVAHSWKAFI GVERIAAGDV
YNTIEESFNN EKIKTNTVVT HTPEKPQTPP EKTVIVPPTP KTPQAPVEPL VVEKASVVPE
LPQTGEKQNV LLTVAGSLAA MLGLAGLGFK RRKETK EF062-3 (SEQ ID NO:235)
TGATTCTTGA AGCAACAAAT GAAAGCAAAA AAACAATATA AGACATATAA AGCTAAGAAT
CACTGGGTAA CTGTCCCTAT TCTTTTTCTA AGTGTGTTAG GAGCCGTAGG ATTAGCTACT
GATAATGTAC AAGCCGCGCA ATTAGATACG CAACCAGAAA CAACGACGGT TCAACCCAAT
AACCCCGACC TGCAGTCAGA AAAGGAAACA CCTAAAACGG CAGTATCTGA AGAAGCAACA
GTACAAAAAG ACACTACTTC TCAACCGACC AAAGTAGAAG AAGTAGCGCC AGAAAATAAA
GGTACTGAAC AAAGTTCAGC TACCCCAAAT GATACCACAA ACGCGCAACA ACCAACAGTA
GGAGCTGAAA AATCAGCACA AGAACAACCA GTAGTAAGCC CTGAAACAAC
CAATGAACCT CTAGGGCAGC CAACAGAAGT TGCACCAGCT GAAAATGAAG TGAATAAATC
AACGTCCATT CCTAAAGAAT TTGAAACACC AGACGTTGAT AAAGCAGTTG ATCAAGTAAA
AAAAGATCCA AACATTACCG TTGTTGAAAA ACCAGCAGAA GACTTAGGCA ACGTTTCTTC
TAAAGATTTA GCTGCAAAAG AAAAAGAAGT AGACCAACTA CAAAAAGAAC AAGCGAAAAA
CATTGCCCAA CAAGCAGCTG AATTAAAAGC CAAAAATGAA AAAATTGCCA AAGAAAATGC
AGAAATTGCG GCAAAAAACA AAGCNGAAAA AGAGCGNTAN CANAAAGAAG TCGCNGAATA
CAACAAGCAT AAGAACGAAA ACAGCTATCT CAATGAAGCG ATTAGTAAAA ACCTAGTGTT
CGATCAATCT GTCGTGACGA AAGACACTAA AATTTCGTCG ATTAAACGCG GAAAATTTAT
CAAAGCAACT GATTTTAATA AAGTAAATGC AGGGGATTCA AAAGATATCT TTACAAAATT
ACGGAAAGAT ATGGGNGGGA AAGNTACTGG CAACTTCCAG AATTCCTTTG TAAAAGAGGC
AAATCTTGGG TCTAATGGTG GGTATGCGGT TCTTTTAGAA AAAAATAAAC CAGTGACAGT
GACCTATACA GGACTAAACG CTAGTTATTT AGGACGTAAA ATTACAAAAG CAGAATTTGT
TTATGAACTA CAATCCTCAC CAAGCCAAAG TGGAACGTTA AATGCAGTAT TTTCAAACGA
TCCGATTATC ACNGCTTTTA TTGGTACAAA CAGAGTCAAT GGTAAGGATG TTAAAACACG
CTTAACGATT AAGTTCTTTG ATGCGTCAGG TAAAGAAGTA CTACCAGATA AAGATAGTCC
ATTTGCGTAT GCGCTGTCTT CTTTAAATTC AAGTTTAACG AATAAAGGTG GCCATGCGGA
ATTTGTTTCT GATTTTGGGG CNAACAATGC GTTCAAATAC ATTAATGGNT CNTATGTGAA
AAAACAAGCG GATGGAAAAT TTTACTCACC GGAAGATATT GACTATGCCA CACGACCTTC
TGGATTGAAA AATAGTGATT GGGACGCTGT AGGTCACAAG AATGCCTACT TTGGTTCAGG
TGTAGGTCTA GCNAATGGNC GTATTTCCTT TTCTTTTGGT ATGACAACAA AAGGAAAAAG
TAATGTCCCT GTATCTAGTG CGCAATGGTT TGCCTTTAGN ACTAACTTAA ATGCGCAATC
AGTGAAGCCT ATTTTCAATT ATGGGAATCC AAAAGAACCA GAAAAAGCAA CGATTGAATT
CAATNGATAC AAAGCCAATG TCGTTCCTGT NCTTGTGCCN AATAAAGAAG TCACTGATGG
NCAGAAAAAT NTCAATGATT TAAATGTGAA NCGTGGCGAT TCTTTACAAT ACATTGTGAC
AGGGGATACG ACAGAACTTC CCAAAGTAGA TCCAAAAACA GTAACNAAAC AAGGGATTCG
AGATACNTTT GATGCAGAAA AAGTGACGAT TGATTTATCC AAAGTGAAAG TTTATCAAGC
AGACGCAAGT CTNAACGANA AAGACTNAAA ACCTGTTGCT GCAGCNATTA ATTCAGGAAN
AGCTAAAGAC GTGACTGCTT CTTATGANCT CAATTTAGAT CAAAACACCG TCACAGCAAT
GATGAAAACC AACGCNGACG GNTCNGTTGT TTTAGCAATG GGGTATAAAT ATTTACTTGT
CTTGCCGTTT GTAGTGAAAA ATGTAGAAGG CGATTTTGAA AATACAGCTG TTCAGCTGAC
AAANGATGGN GAAACGGTAA CAAATACAGT GATTAACCAT GTGCCAGGTA GTAATCCTTC
CAAAGATGTA AAAGCAGATA AAAACGGTAC AGTTGGCAGT GTTTCTCTAC ATGATAAAGA
TATTCCGTTA CAAACAAAAA TTTATTATGA AGTGAAATCT TCCGAACGTC CAGCNAACTA
TGGCGGAATN ACNGAAGAAT GGGGCATGAA TGATGTCTTG GACACGACCC ATGATCGTTT
CACAGGNAAA TGGCACGCTA TTACNAANTA TGACCTTAAA GTAGGGGANA AAACGTTAAA
AGCAGGAACA GATATTTCTG CCTACATTCT TTTAGAAAAC AAAGACAATA AAGACTTGAC
GTTTACNATG AATCAAGCAT TATTGGCNGC NTTAAATGAA GGAAGCAATA AAGTAGGCAA
ACAAGCTTGG TCTGTGTATC TGGAAGTCGA ACGGATNAAA ACAGGTGACG TAGAAAACAC
GCAAACAGAA AACTACAACA AAGAGCTTGT NCGTTCTAAT ACNGTGGTGA CGCATACNCC
TGATGATCCA AAACCAACCA AAGCCGTTCA TAACAAGAAA GGGGAAGANA TTAANCATCG
AAAAGTNGCT CGTGGTGATG TTCTTTCTTA TGAAATGACN TGGGACTTAA AAGGGTACGA
TAAAGACTTT GCCTTTGATA CAGTCGATCT TGCGACAGGC GTTTCTTTCT TCGATGATTA
CGATGAAACG AANGTGACAC CAATCAAAGA CTTACTTCGT GTCAAAGATT CTAAAGGGGN
AGACATTACG AACCAGTTCA CGATCTCNTG GGACGATGCC AAAGGCACCG TGACNATNTC
TGCCAAAGAC CCACAAGCCT TTATTCTAGC GNATGGTGGG CAAGAATTGC GTGTAACNCT
CCCTACAAAA GTCAAAGCCG ATGTTTCTGG NGATGTTTAT AATTCAGCGG AACAAAATAC
ATTTGGNCAA CGAATTAAAA CCAATACNGT TGTCAACCAT ATTCCAAAAG TGAANCCTAA
AAAAGACGTG GTTATTAAAG TNGGTGACAA ACAAAGTCAA AATGGNGCCA CAATCAAATT
AGGGGAGAAN TTCTTCTATG AATTTACAAG TAGTGACATT CCTGCAGAAT ACGCTGGNGT
TGTGGAAGAA TCGTCGATTA GCGATAAACT AGACGTCAAA CATGACAAAT TTAGTGGCCA
ATGGTCTGTG TTTGCCAATT CTAATTTTGT TTTAGCAGAC GGAACCAAAG TGAATAAAGC
GGACCACATT TCGAAACTAT TCACGATGAC CTTTGAACAA GGGGTAGTGA AAATCACGGC
CAGTCAAGCC TTTTTNGATG CGATGAATCT AAAAGAAAAC AAAAACGTTG CACACTCATG
GAAAGCGTTC ATTGGTGTAC AACGAATTGC GOCAGGAGAC GTTTACAACA CAATCGAAGA
ATCTTTCAAC AATGAGAAGA TTAAAACNAA TACGGTAGTG ACNCATACCC CAGAAAAACC
ACAAACNCCA CCAGAAAAAA CAGTGATTGT ACCACCAACA CCAAAAACAC CGCAAGCACC
AGTAGAGCCA TTAGTGGTAG AAAAGGCAAC TG EF062-4 (SEQ ID NO:236) AELDTQ
PETTTPVQPNN PDLQSEKETP KTAVSEEATV QKDTTSQPTK VEEVAPENKC TEQSSATPND
TTNAQQPTVG AEKSAQEQPV VSPETTNEPL GQPTEVAPAE NEVNKSTSIP KEFETPDVDK
AVDEVKKDPN ITVVEKPAED LGNVSSKDLA AKEKEVDQLQ KEQAKKIAQQ AAELKAKNEK
IAKENAEIAA KNKAEKERXX KEVAEYNKHK NENSYVNEAI SKNLVFDQSV VTKDTKISSI
KGGKFIKATD ENKVNAGDSK DIFTKLRKDM GGKXTGNFQN SFVKEANLGS NGGYAVLLEK
NKPVTVTYTG LNASYLGRKI TKAEFVYELQ SSPSQSGTLN AVFSNDPIIT AFIGTNRVNG
KDVKTRLTIK FFDASGKEVL PDKDSPEAYA LSSLNSSLTN KGGHAEEVSD FGANNAFKYI
NGSYVKKQAD GKFYSPEDID YGTGPSGLKN SDWDAVGHKN AYFGSGVGLA NGRISFSFGM
TTKGKSNVPV SSAQWFAFXT NLNAQSVKPI FNYGNPKEPE KATIEFNXYK ANVVPVLVPN
KEVTDGQKNX NDLNVXRGDS LQYIVTGDTT ELAKVDPKTV TKQGTRDTFD AEKVTIDLSK
VKVYQADASL NXKDXKAVAA AINSGXAKDV TASYXLNLDQ NTVTAMMKTN ADGSVVLANG
YKYLLVLPFV VKNVEGDFEN TAVQLTXDGE TVTNTVINHV PGSNPSKDVK ADKNGTVGSV
SLHDKDIPLQ TKIYYEVKSS ERPANYGGXT EEWGMNDVLD TTHDRETGKW HAITXYDLKV
GXKTLKAGTD ISAYILLENK DNKDLTFTMN QALLAALNEG SNKVGKQAWS VYLEVERXKT
CDVENTQTEN YNKELVRSNT VVTHTPDDPK PTKAVHNKKG EXIXHGKVAR GDVLSYEMTW
DLKGYDKDFA FDTVDLATGV SEFDDYDETX VTPIKDLLRV KDSKGXDITN QFTISWDDAK
GTVTXSAKDP QAFILAXGGQ ELRVTLPTKV KADVSGDVYN SAEQNTFGQR IKTNTVVNHI
PKVXPKKDVV IKVGDKQSQN GATTKLGEXF FYEFTSSDIP AEYAGVVEEW SISDKLDVKH
DKFSGQWSVF ANSNFVLADG TKVNKGDDIS KLFTMTFEQG VVKITASQAF XDAMNLKENK
NVAHSWKAFT GVERIAAGDV YNTTEESFNN EKIKTNTVVT HTPEKPQTPP EKTVIVPPTP
KTPQAPVEPL VVEKASV SF063-1 (SEQ ID NO:237) TGATTCTTGA AGCAACAAAT
GAAAGCAAAA AAACAATATA AGACATATAA AGCTAAGAAT CACTGGGTAA CTGTCCCTAT
TCTTTTTCTA AGTGTGTTAG GAGCCGTACG ATTAGCTACT GATAATGTAC AAGCCGCGGA
ATTAGATACO CAACCAGAAA CAACGACGGT TCAACCCAAT AACCCCGACC TGCAGTCAGA
AAAGGAAACA CCTAAAACGG CAGTATCTGA AGAAGCAACA GTACAAAAAG ACACTACTTC
TCAACCGACC AAAGTAGAAG AAGTAGCGCC AGAAAATAAA GGTACTGAAC AAAGTTCAGC
TACCCCAAAT GATACCACAA ACGCGCAACA ACCAACAGTA GGAGCTGAAA AATCAGCACA
AGAACAACCA GTAGTAAGCC CTGAAACAAC CAATGAACCT CTAGGGCAGC CAACAGAAGT
TGCACCAGCT GAAAATGAAG TGAATAAATC AACGTCCATT CCTAAAGAAT TTGAAACACC
AGACGTTGAT AAAGCAGTTG ATGAAGTAAA AAAAGATCCA AACATTACCG TTGTTGAAAA
ACCAGCAGAA GACTTAGGCA ACGTTTCTTC TAAAGATTTA GCTGCAAAAG AAAAAGAAGT
AGACCAACTA CAAAAAGAAC AAGCGAAAAA GATTGCCCAA CAAGCAGCTG AATTAAAAGC
CAAAAATGAA AAAATTGCCA AAGAAAATGC AGAAATTGCG GCAAAAAACA AAGCNGAAAA
AGAGCGNTAN GANAAAGAAG TCGCNGAATA CAACAAGCAT AAGAACGAAA ACAGCTATGT
CAATGAAGCG ATTAGTAAAA ACCTAGTGTT CGATCAATCT GTCGTGACGA AAGACACTAA
AATTTCGTCG ATTAAAGGCG GAAAATTTAT CAAAGCAACT GATTTTAATA AAGTAAATGC
AGGGGATTCA AAAGATATCT TTACAAAATT ACGGAAAGAT ATGGGNGGGA AAGNTACTGG
CAACTTCCAG AATTCCTTTG TAAAAGAGGC AAATCTTGGG TCTAATGGTG GGTATGCGGT
TCTTTTAGAA AAAAATAAAC CAGTGACAGT GACCTATACA GGACTAAACG CTAGTTATTT
AGGACGTAAA ATTACAAAAG CAGAATTTGT TTATGAACTA CAATCCTCAC CAAGCCAAAG
TGGAACGTTA AATGCAGTAT TTTCAAACGA TCCGATTATC ACNGCTTTTA TTGGTACAAA
CAGAGTCAAT GGTAAGGATG TTAAAACACG CTTAACGATT AAGTTCTTTG ATGCGTCAGG
TAAAGAAGTA CTACCAGATA AAGATAGTCC ATTTGCGTAT GCGCTGTCTT CTTTAAATTC
AAGTTTAACG AATAAAGGTG GCCATGCGGA ATTTGTTTCT GATTTTGGGG CNAACAATGC
GTTCAAATAC ATTAATGGNT CNTATGTGAA AAAACAAGCG GATGGAAAAT TTTACTCACC
GGAAGATATT GACTATGGCA CAGGACCTTC TGGATTGAAA AATAGTGATT GGGACGCTGT
ACGTCACAAG AATGCCTACT TTCGTTCAGG TGTAGGTCTA GCNAATGGNC GTATTTCCTT
TTCTTTTGGT ATGACAACAA AAGGAAAAAG TAATGTGCCT GTATCTAGTG CGCAATCGTT
TCCCTTTAGN ACTAACTTAA ATGCGCAATC AGTGAAGCCT ATTTTCAATT ATGGGAATCC
AAAAGAACCA GAAAAAGCAA CGATTGAATT CAATNGATAC AAAGCCAATG TCGTTCCTGT
NCTTGTGCCN AATAAAGAAG TCACTGATGG NCAGAAAAAT NTCAATGATT TAAATGTCAA
NCGTGGCGAT TCTTTACAAT ACATTGTGAC AGGGGATACG ACAGAACTTG CCAAAGTAGA
TCCAAAAACA GTAACNAAAC AAGGGATTCG AGATACNTTT GATGCAGAAA AAGTGACCAT
TGATTTATCC AAAGTGAAAG TTTATCAAGC AGACGCAAGT CTNAACGANA AAGACTNAAA
AGCTGTTGCT GCAGCNATTA ATTCAGGAAN AGCTAAAGAC GTGACTGCTT CTTATGANCT
CAATTTAGAT CAAAACACCG TCACAGCAAT GATGAAAACC AACGCNCACG GNTCNGTTGT
TTTAGCAATG GGGTATAAAT ATTTACTTGT CTTGCCGTTT GTAGTGAAAA ATGTAGAAGG
CCATTTTGAA AATACAGCTG TTCAGCTGAC AAANGATGGN GAAACGGTAA CAAATACAGT
GATTAACCAT GTGCCAGGTA GTAATCCTTC CAAAGATGTA AAAGCAGATA AAAACGGTAC
AGTTGGCAGT GTTTCTCTAC ATGATAAAGA TATTCCGTTA CAAACAAAAA TTTATTATGA
AGTGAAATCT TCCGAACGTC CAGCNAACTA TGGCGGAATN ACNGAAGAAT GGGGCATGAA
TGATGTCTTG GACACGACCC ATGATCGTTT CACAGGNAAA TGGCACGCTA TTACNAANTA
TGACCTTAAA GTAGGGGANA AAACGTTAAA AGCAGGAACA GATATTTCTG CCTACATTCT
TTTAGAAAAC AAAGACAATA AAGACTTGAC GTTTACNATG AATCAAGCAT TATTGGCNGC
NTTAAATGAA GGAAGCAATA AAGTAGGCAA ACAAGCTTGG TCTGTGTATC TGGAAGTCGA
ACGGATNAAA ACAGGTGACG TAGAAAACAC GCAAACAGAA AACTACAACA AAGAGCTTGT
NCGTTCTAAT ACNGTGGTGA CGCATACNCC TGATGATCCA AAACCAACCA AAGCCGTTCA
TAACAAGAAA GGGGAAGANA TTAANCATGG AAAAGTNGCT CGTGCTGATG TTCTTTCTTA
TGAAATGACN TGGGACTTAA AAGGGTACGA TAAAGACTTT GCCTTTGATA CAGTCGATCT
TGCGACAGGC GTTTCTTTCT TCGATGATTA CGATGAAACG AANGTGACAC CAATCAAAGA
CTTACTTCGT GTCAAAGATT CTAAAGGGGN AGACATTACG AACCAGTTCA CGATCTCNTG
GGACGATGCC AAAGGCACGG TGACNATNTC TGCCAAAGAC CCACAAGCCT TTATTCTAGC
GNATGGTGGG CAAGAATTGC GTGTAACNCT CCCTACAAAA GTCAAAGCCG ATGTTTCTGG
NGATGTTTAT AATTCAGCGG AACAAAATAC ATTTGGNCAA CCAATTAAAA CCAATACNGT
TGTCAACCAT ATTCCAAAAG TGAANCCTAA AAAAGACGTG GTTATTAAAG TNGGTGACAA
ACAAAGTCAA AATGGNGCCA CAATCAAATT ACGGGACAAN TTCTTCTATG AATTTACAAG
TAGTGACATT CCTGCACAAT ACGCTGGNGT TGTGGAAGAA TGGTCGATTA GCGATAAACT
AGACGTCAAA CATGACAAAT TTAGTGGCCA ATGGTCTGTG TTTGCCAATT CTAATTTTGT
TTTAGCAGAC GGAACCAAAG TGAATAAAGG GGACGACATT TCGAAACTAT TCACGATCAC
CTTTGAACAA GGGGTAGTGA AAATCACGGC CAGTCAAGCC TTTTTNGATG CGATGAATCT
AAAAGAAAAC AAAAACGTTG CACACTCATG GAAAGCGTTC ATTGGTGTAG AACGAATTGC
GGCAGGAGAC GTTTACAACA CAATCGAAGA ATCTTTCAAC AATGAGAAGA TTAAAACNAA
TACGGTAGTG ACNCATACGC CAGAAAAACC ACAAACNCCA CCAGAAAAAA CAGTGATTGT
ACCACCAACA CCAAAAACAC CGCAAGCACC AGTAGAGCCA TTAGTGGTAG AAAAGGCAAG
TGTNGTGCCA GAATTGCCGC AAACAGGCGA AAAACAAAAT GTCTTATTAA CGGTAGCTGG
TAGTTTAGCC GCAATGCTTG GCTTAGCAGG CTTAGGCTTT AAACGTAGAA AAGAAACAAA
ATAA EF063-2 (SEQ ID NO:238) MKAKK QYKTYKAKNH WVTVPILFLS VLGAVGLATD
NVQAAELDTQ PETTTVQPNN PDLQSEKETP KTAVSEEATV QKDTTSQPTK VEEVAPENKG
TEQSSATPND TTNAQQPTVG AEKSAQEQPV VSPETTNEPL GQPTEVAPAE NEVNKSTSIP
KEFETPDVDK AVDEVKKDPN ITVVEKPAED LGNVSSKDLA AKEKEVDQLQ KEQAKKIAQQ
AAELKAKNEK IAKENAEIAA KNKAEKERXX KEVAEYNKHK NENSYVNEAT SKNLVFDQSV
VTKDTKISSI KGGKFTKATD FNKVNAGDSK DIFTKLRKDM GGKXTGNFQN SFVKEANLGS
NGGYAVLLEK NKPVTVTYTG LNASYLGRKI TKAEFVYELQ SSPSQSGTLN AVFSNDPTIT
AFIGTNRVNG KDVKTRLTIK FFDASGKEVL PDKDSPFAYA LSSLNSSLTN KGGHAEFVSD
FGANNAFKYI NGSYVKKQAD OKEYSPEDID YGTGPSGLKN SDWDAVGHKN AYFGSGVGLA
NGRISFSFGM TTKGKSNVPV SSAQWFAFXT NLNAQSVKPI FNYGNPKEPE KATIEFNXYK
ANVVPVLVPN KEVTDGQKNX NDLNVXRGDS LQYIVTGDTT ELAKVDPKTV TKQGIRDTFD
AEKVTIDLSK VKVYQADASL NXKDXKAVAA AINSGXAKDV TASYXLNLDQ NTVTANMKTN
ADGSVVLAMG YKYLLVLPFV VKNVEGDFEN TAVQLTXDGE TVTNTVINHV PGSNPSKDVK
ADKNGTVGSV SLHDKDIPLQ TKIYYEVKSS ERPANYGGXT EEWGMNDVLD TTHDRFTGKW
HAITXYDLKV GXKTLKAGTD ISAYILLENK DNKDLTFTMN QALLAALNEG SNKVGKQAWS
VYLEVERXKT GDVENTQTEN YNKELVRSNT VVTHTPDDPK PTKAVHNKKG EXTXHGKVAR
GDVLSYEMTW DLKGYDKDFA FDTVDLATGV SFFDDYDETX VTPIKDLLRV KDSKGXDITN
QFTISWDDAK GTVTXSAKDP QAFILAXGGQ ELRVTLPTKV KADVSGDVYN SAEQNTFGQR
IKTNTVVNHI PKVXPKKDVV IKVGDKQSQN GATIKLGEXF FYEFTSSDIP AEYAGVVEEW
SISDKLDVKH DKFSGQWSVF ANSNFVLADG TKVNKGDDIS KLFTMTFEQC VVKITASQAF
XDAMNLKENK NVAHSWKAFT GVERIAAGDV YNTIEESFNN EKIKTNTVVT HTPEKPQTPP
EKTVIVPPTP KTPQAPVEPL VVEKASVVPE LPQTGEKQNV LLTVAGSLAA MLGLAGLGFK
RRKETK EF063-3 (SEQ ID NO:239) GGA ATTACATACG CAACCAGAAA CAACGACGGT
TCAACCCAAT AACCCCGACC TGCAGTCAGA AAAGGAAACA CCTAAAACGG CAGTATCTGA
AGAAGCAACA GTACAAAAAG ACACTACTTC TCAACCGACC AAAGTAGAAC AAGTAGCGCC
AGAAAATAAA GGTACTGAAC AAAGTTCAGC TACCCCAAAT GATACCACAA ACGCGCAACA
ACCAACAGTA GGAGCTGAAA AATCAGCACA AGAACAACCA GTAGTAAGCC CTGAAACAAC
CAATGAACCT CTAGGGCAGC CAACAGAAGT TGCACCAGCT GAAAATGAAG TGAATAAATC
AACGTCCATT CCTAAAGAAT TTGAAACACC AGACGTTGAT AAAGCAGTTG ATGAACTAAA
AAAAGATCCA AACATTACCG TTGTTGAAAA ACCAGCAGAA GACTTAGGCA ACGTTTCTTC
TAAAGATTTA GCTGCAAAAG AAAAAGAAGT AGACCAACTA CAAAAAGAAC AAGCGAAAAA
GATTGCCCAA CAAGCAGCTG AATTAAAAGC CAAAAATGAA AAAATTGCCA AAGAAAATCC
AGAAATTGCG GCAAAAAACA AAGCNGAAAA AGAGCGNTAN GANAAAGAAG TCGCNGAATA
CAACAAGCAT AAGAACGAAA ACAGCTATGT CAATGAAGCG ATTAGTAAAA ACCTAGTGTT
CGATCAATCT GTCGTGACGA AAGACACTAA AATTTCGTCG ATTAAAGGCG GAAAATTTAT
CAAAGCAACT GATTTTAATA AAGTAAATGC AGGGGATTCA AAAGATATCT TTACAAAATT
ACGGAAAGAT ATGGGNGGGA AAGNTACTGG CAACTTCCAG AATTCCTTTG TAAAAGAGGC
AAATCTTGGG TCTAATGGTG GGTATGCGGT TCTTTTAGAA AAAAATAAAC CAGTGACAGT
GACCTATACA GGACTAAACG CTAGTTATTT AGGACGTAAA ATTACAAAAG CAGAATTTGT
TTATGAACTA CAATCCTCAC CAAGCCAAAG TGGAACGTTA AATGCAGTAT TTTCAAACGA
TCCGATTATC ACNGCTTTTA TTGGTACAAA CAGAGTCAAT GGTAAGGATG TTAAAACACG
CTTAACGATT AAGTTCTTTG ATGCGTCAGG TAAAGAAGTA CTACCAGATA AAGATAGTCC
ATTTGCGTAT GCGCTGTCTT CTTTAAATTC AAGTTTAACG AATAAAGCTG GCCATGCGGA
ATTTGTTTCT GATTTTGGGG CNAACAATGC GTTCAAATAC ATTAATGGNT CNTATGTGAA
AAAACAAGCG GATGCAAAAT TTTACTCACC GGAAGATATT GACTATGGCA CAGGACCTTC
TGGATTGAAA AATAGTGATT GGGACGCTGT AGGTCACAAG AATGCCTACT TTGGTTCAGG
TGTAGGTCTA GCNAATGGNC GTATTTCCTT TTCTTTTGGT ATGACAACAA AAGGAAAAAG
TAATGTGCCT GTATCTAGTG CGCAATGGTT TGCCTTTAGN ACTAACTTAA ATGCGCAATC
AGTGAAGCCT ATTTTCAATT ATGGGAATCC AAAAGAACCA GAAAAAGCAA CGATTGAATT
CAATNCATAC AAAGCCAATG TCGTTCCTGT NCTTGTGCCN AATAAAGAAG TCACTGATGG
NCAGAAAAAT NTCAATGATT TAAATGTGAA NCGTGGCGAT TCTTTACAAT ACATTGTGAC
AGGGGATACG ACAGAACTTG CCAAAGTAGA TCCAAAAACA GTAACNAAAC AAGGGATTCG
AGATACNTTT GATGCAGAAA AAGTGACGAT TGATTTATCC AAAGTG EF063-4 (SEQ ID
NO:240) ELDTQ PETTTVQPNN PDLQSEKETP KTAVSEEATV QKDTTSQPTK
VEEVAPENKG TEQSSATPND TTNAQQPTVG AEKSAQEQPV VSPETTNEPL GQPTEVAPAE
NEVNKSTSIP KEFETPDVDK AVDEVKKDPN ITVVEKPAED LGNVSSKDLA AKEKEVDQLQ
KEQAKKIAQQ AAELKAKNEK IAKENAEIAA KNKAEKERXX KEVAEYNKHK NENSYVNEAT
SKNLVFDQSV VTKDTKISSI KGGKFIKATD FNKVNAGDSK DIFTKLRKDM GGKXTGNFQN
SFVKEANLGS NGCYAVLLEK NKPVTVTYTG LNASYLGRKI TKAEFVYELQ SSPSQSGTLN
AVFSNDPIIT AFIGTNRVNG KDVKTRLTIK FFDASGKEVL PDKDSPFAYA LSSLNSSLTN
KGGHAEFVSD FGANNAFKYI NGSYVKKQAD GKFYSPEDID YGTGPSGLKN SDWDAVGHKN
AYFGSGVGLA NGRISFSFGM TTKGKSNVPV SSAQWFAFXT NLNAQSVKPI FNYGNPKEPE
KATIEFNXYK ANVVPVLVPN KEVTDGQKNX NDLNVXRGDS LQYIVTGDTT ELAKVDPKTV
TKQGIRDTFD AEKVTIDLSK V EF064-1 (SEQ ID NO:241) TCATTCTTGA
ACCAACAAAT GAAAGCAAAA AAACAATATA AGACATATAA AGCTAAGAAT CACTGGGTAA
CTGTCCCTAT TCTTTTTCTA AGTGTGTTAG GAGCCGTAGG ATTAGCTACT GATAATCTAC
AAGCCGCGGA ATTAGATACG CAACCAGAAA CAACGACGGT TCAACCCAAT AACCCCGACC
TGCAGTCAGA AAAGGAAACA CCTAAAACGG CAGTATCTGA AGAAGCAACA GTACAAAAAG
ACACTACTTC TCAACCGACC AAAGTAGAAG AAGTAGCGCC AGAAAATAAA GGTACTGAAC
AAAGTTCAGC TACCCCAAAT GATACCACAA ACGCGCAACA ACCAACAGTA GGAGCTGAAA
AATCAGCACA AGAACAACCA GTAGTAACCC CTGAAACAAC CAATGAACCT CTAGGGCAGC
CAACAGAAGT TGCACCAGCT GAAAATGAAG TGAATAAATC AACGTCCATT CCTAAAGAAT
TTGAAACACC AGACGTTGAT AAAGCAGTTG ATGAAGTAAA AAAAGATCCA AACATTACCG
TTCTTGAAAA ACCAGCAGAA GACTTAGGCA ACGTTTCTTC TAAAGATTTA GCTGCAAAAG
AAAAAGAAGT AGACCAACTA CAAAAAGAAC AAGCGAAAAA GATTGCCCAA
CAAGCAGCTG AATTAAAAGC CAAAAATGAA AAAATTGCCA AAGAAAATGC AGAAATTGCG
GCAAAAAACA AAGCNGAAAA AGAGCGNTAN GANAAAGAAG TCGCNGAATA CAACAAGCAT
AAGAACGAAA ACACCTATGT CAATGAAGCG ATTAGTAAAA ACCTAGTGTT CGATCAATCT
GTCGTGACGA AAGACACTAA AATTTCGTCG ATTAAAGGCG GAAAATTTAT CAAAGCAACT
GATTTTAATA AAGTAAATGC AGGGGATTCA AAAGATATCT TTACAAAATT ACGGAAAGAT
ATGGGNGGGA AAGNTACTGG CAACTTCCAG AATTCCTTTG TAAAAGACGC AAATCTTGCG
TCTAATGGTG GGTATGCGGT TCTTTTAGAA AAAAATAAAC CAGTGACAGT GACCTATACA
GGACTAAACG CTAGTTATTT AGGACGTAAA ATTACAAAAG CAGAATTTGT TTATGAACTA
CAATCCTCAC CAAGCCAAAG TGGAACGTTA AATGCAGTAT TTTCAAACGA TCCGATTATC
ACNGCTTTTA TTGGTACAAA CAGAGTCAAT GGTAAGGATG TTAAAACACG CTTAACGATT
AAGTTCTTTG ATGCGTCAGG TAAAGAAGTA OTACCAGATA AAGATAGTCC ATTTGCGTAT
GCGCTGTCTT CTTTAAATTC AAGTTTAACG AATAAAGGTG GCCATGCGGA ATTTGTTTCT
GATTTTGGGG CNAACAATGC GTTCAAATAC ATTAATGGNT CNTATGTGAA AAAACAAGCG
GATGGAAAAT TTTACTCACC GGAAGATATT GACTATGGCA CAGGACCTTC TGGATTGAAA
AATAGTGATT GGGACGCTGT AGGTCACAAG AATGCCTACT TTGGTTCAGG TGTAGGTCTA
GCNAATGGNC GTATTTCCTT TTCTTTTGGT ATGACAACAA AAGGAAAAAG TAATGTGCCT
GTATCTAGTG CGCAATGGTT TGCCTTTAGN ACTAACTTAA ATGCGCAATC AGTGAAGCCT
ATTTTCAATT ATCGGAATCC AAAAGAACCA GAAAAAGCAA CGATTGAATT CAATNGATAC
AAAGCCAATG TCGTTCCTGT NCTTGTGCCN AATAAAGAAG TCACTGATGG NCAGAAAAAT
NTCAATGATT TAAATGTGAA NCGTGGCGAT TCTTTACAAT ACATTGTGAC AGGGGATACG
ACAGAACTTG CCAAAGTAGA TCCAAAAACA GTAACNAAAC AAGGGATTCG AGATACNTTT
GATGCAGAAA AAGTGACGAT TGATTTATCC AAAGTGAAAG TTTATCAAGC AGACGCAAGT
CTNAACGANA AAGACTNAAA AGCTGTTGCT GCAGCNATTA ATTCAGGAAN AGCTAAAGAC
GTGACTGCTT CTTATGANCT CAATTTAGAT CAAAACACCG TCACAGCAAT GATGAAAACC
AACGCNGACG GNTCNGTTGT TTTAGCAATG GGGTATAAAT ATTTACTTGT CTTGCCGTTT
GTAGTGAAAA ATGTAGAAGG CGATTTTGAA AATACAGCTG TTCAGCTGAC AAANGATGGN
CAAACGGTAA CAAATACAGT GATTAACCAT GTGCCAGGTA GTAATCCTTC CAAAGATGTA
AAAGCAGATA AAAACGGTAC AGTTGGCAGT GTTTCTCTAC ATGATAAAGA TATTCCGTTA
CAAACAAAAA TTTATTATGA AGTGAAATCT TCCGAACGTC CAGCNAACTA TGGCGGAATN
ACNGAAGAAT GGGGCATGAA TGATGTCTTG CACACGACCC ATGATCGTTT CACAGGNAAA
TGCCACGCTA TTACNAANTA TGACCTTAAA GTAGGGGANA AAACGTTAAA AGCAGGAACA
GATATTTCTG CCTACATTCT TTTAGAAAAC AAAGACAATA AAGACTTGAC GTTTACNATC
AATCAAGCAT TATTGGCNGC NTTAAATGAA GGAAGCAATA AAGTAGGCAA ACAAGCTTGG
TCTGTGTATC TGGAACTCGA ACGGATNAAA ACAGGTGACG TAGAAAACAC GCAAACAGAA
AACTACAACA AAGAGCTTGT NCGTTCTAAT ACNGTGGTGA CGCATACNCC TGATGATCCA
AAACCAACCA AAGCCGTTCA TAACAAGAAA GGGGAAGANA TTAANCATGG AAAAGTNGCT
CGTGGTGATG TTCTTTCTTA TGAAATGACN TGGGACTTAA AAGGGTACGA TAAAGACTTT
GCCTTTGATA CAGTCGATCT TGCGACAGGC GTTTCTTTCT TCGATGATTA CGATGAAACG
AANGTGACAC CAATCAAAGA CTTACTTCGT GTCAAAGATT CTAAAGGGCN AGACATTACG
AACCAGTTCA CGATCTCNTG GGACGATGCC AAAGGCACGG TGACNATNTC TGCCAAAGAC
CCACAAGCCT TTATTCTAGC GNATGGTGGG CAAGAATTGC GTCTAACNCT CCCTACAAAA
GTCAAAGCCG ATGTTTCTGG NGATGTTTAT AATTCAGCGG AACAAAATAC ATTTGGNCAA
CGAATTAAAA CCAATACNGT TGTCAACCAT ATTCCAAAAG TGAANCCTAA AAAAGACGTG
GTTATTAAAG TNGGTGACAA ACAAAGTCAA AATGGNGCCA CAATCAAATT AGGGGAGAAN
TTCTTCTATG AATTTACAAG TAGTGACATT CCTGCAGAAT ACGCTGGNGT TGTGGAAGAA
TGGTCGATTA GCGATAAACT AGACGTCAAA CATGACAAAT TTAGTGGCCA ATGGTCTGTG
TTTGCCAATT CTAATTTTGT TTTAGCAGAC GGAACCAAAG TGAATAAAGG GGACGACATT
TCGAAACTAT TCACGATGAC CTTTGAACAA GGGGTAGTGA AAATCACGGC CAGTCAAGCC
TTTTTNGATG CGATGAATCT AAAAGAAAAC AAAAACGTTG CACACTCATG GAAAGCGTTC
ATTGGTGTAG AACCAATTGC GOCAGGAGAC GTTTACAACA CAATCGAAGA ATCTTTCAAC
AATGAGAAGA TTAAAACNAA TACGGTAGTG ACNCATACGC CAGAAAAACC ACAAACNCCA
CCAGAAAAAA CAGTGATTGT ACCACCAACA CCAAAAACAC CGCAAGCACC AGTAGAGCCA
TTAGTGGTAG AAAAGGCAAG TGTNGTGCCA GAATTGCCGC AAACAGGCGA AAAACAAAAT
GTCTTATTAA CGGTAGCTGG TAGTTTAGCC GCAATGCTTG GCTTAGCAGG CTTAGGCTTT
AAACGTAGAA AAGAAACAAA ATAA EF064-2 (SEQ ID NO:242) MKAKK QYKTYKAKNH
WVTVPILFLS VLGAVGLATD NVQAAELDTQ PETTTVQPNN PDLQSEKETP KTAVSEEATV
QKDTTSQPTK VEEVAPENKG TEQSSATPND TTNAQQPTVG AEKSAQEQPV VSPETTNEPL
GQPTEVAPAE NEVNKSTSIP KEFETPDVDK AVDEVKKDPN ITVVEKPAED LGNVSSKDLA
AKEKEVDQLQ KEQAKKIAQQ AAELKAKNEK IAKENAEIAA KNKAEKERXX KEVAEYNKHK
NENSYVNEAI SKNLVFDQSV VTKDTKISSI KGGKFIKATD FNKVNAGDSK DIFTKLRKDM
GGKXTGNFQN SFVKEANLGS NGGYAVLLEK NKPVTVTYTG LNASYLGRKI TKAEFVYELQ
SSPSQSGTLN AVFSNDPITT AFIGTNRVNG KDVKTRLTIK FFDASGKEVL PDKDSPFAYA
LSSLNSSLTN KGGHAEFVSD FGANNAFKYI NGSYVKKQAD GKFYSPEDID YGTGPSGLKN
SDWDAVGHKN AYFGSGVGLA NGRISFSFGM TTKGKSNVPV SSAQWFAFXT NLNAQSVKPI
FNYGNPKEPE KATIEFNXYK ANVVPVLVPN KEVTDGQKNX NDLNVXRGDS LQYIVTGDTT
ELAKVDPKTV TKQGIRDTFD AEKVTIDLSK VKVYQADASL NXKDXKAVAA AINSGXAKDV
TASYXLNLDQ NTVTAMMKTN ADGSVVLAMG YKYLLVLPFV VKNVEGDFEN TAVQLTXDGE
TVTNTVINHV PGSNPSKDVK ADKNGTVGSV SLHDKDIPLQ TKIYYEVKSS ERPANYGGXT
EEWGHNDVLD TTHDRFTGKW HAITXYDLKV GXKTLKAGTD ISAYILLENK DNKDLTFTMN
QALLAALNEG SNKVGKQAWS VYLEVERXKT GDVENTQTEN YNKELVRSNT VVTHTPDDPK
PTKAVHNKKG EXIXHGKVAR GDVLSYEMTW DLKGYDKDFA FDTVDLATGV SFFDDYDETX
VTPIKDLLRV KDSKGXDITN QFTISWDDAK GTVTXSAKDP QAFILAXGGQ ELRVTLPTKV
KADVSGDVYN SAEQNTFGQR IKTNTVVNHI PKVXPKKDVV IKVGDKQSQN GATIKLGEXF
FYEFTSSDIP AEYAGVVEEW SISDKLDVKH DKFSGQWSVF ANSNFVLADG TKVNKGDDIS
KLFTMTFEQG VVKITASQAF XDANNLKENK NVAHSWKAFT GVERIAAGDV YNTLEESFNN
EKIKTNTVVT HTPEKPQTPP EKTVIVPPTP KTPQAPVEPL VVEKASVVPE LPQTGEKQNV
LLTVAGSLAA MLGLAGLGFK RRKETK EF064-3 (SEQ ID NO:243) AGTGACGAT
TGATTTATCC AAAGTGAAAG TTTATCAAGC AGACGCAAGT CTNAACGANA AAGACTNAAA
AGCTGTTGCT CCAGCNATTA ATTCAGGAAN AGCTAAAGAC GTGACTGCTT CTTATGANCT
CAATTTAGAT CAAAACACCC TCACAGCAAT GATGAAAACC AACGCNGACG GNTCNGTTGT
TTTAGCAATG GGGTATAAAT ATTTACTTGT CTTGCCGTTT GTAGTGAAAA ATGTAGAAGG
CGATTTTGAA AATACAGCTG TTCAGCTGAC AAANGATGGN GAAACGGTAA CAAATACAGT
GATTAACCAT CTGCCAGGTA GTAATCCTTC CAAAGATGTA AAAGCAGATA AAAACGGTAC
AGTTGGCAGT GTTTCTCTAC ATGATAAAGA TATTCCGTTA CAAACAAAAA TTTATTATGA
AGTGAAATCT TCCGAACGTC CAGCNAACTA TGGCGGAATN ACNGAAGAAT GGGGCATGAA
TGATGTCTTG GACACGACCC ATGATCGTTT CACAGGNAAA TGGCACGCTA TTACNAANTA
TGACCTTAAA GTAGGGGANA AAACGTTAAA AGCAGGAACA GATATTTCTG CCTACATTCT
TTTAGAAAAC AAAGACAATA AAGACTTGAC GTTTACNATG AATCAAGCAT TATTGGCNGC
NTTAAATGAA GGAAGCAATA AAGTAGGCAA ACAAGCTTGG TCTGTGTATC TGGAAGTCGA
ACGGATNAAA ACAGGTGACG TACAAAACAC GCAAACAGAA AACTACAACA AAGAGCTTGT
NCGTTCTAAT ACNGTGGTGA CGCATACNCC TGATGATCCA AAACCAACCA AAGCCGTTCA
TAACAAGAAA GGGGAAGANA TTAANCATGG AAAAGTNGCT CGTGGTGATG TTCTTTCTTA
TGAAATGACN TCGGACTTAA AAGGGTACGA TAAAGACTTT GCCTTTGATA CAGTCGATCT
TGCGACAGGC GTTTCTTTCT TCGATGATTA CGATGAAACG AANGTGACAC CAATCAAAGA
CTTACTTCGT GTCAAAGATT CTAAAGGGGN AGACATTACG AACCAGTTCA CGATCTCNTG
GGACGATGCC AAAGGCACGG TGACNATNTC TGCCAAAGAC CCACAAGCCT TTATTCTAGC
GNATGGTGGG CAAGAATTGC GTGTAACNCT CCCTACAAAA GTCAAAGCCG ATGTTTCTGG
NGATGTTTAT AATTCAGCGG AACAAAATAC ATTTGGNCAA CGAATTAAAA CCAATACNGT
TGTCAACCAT ATTCCAAAAG TGAANCCTAA AAAAGACGTG GTTATTAAAG TNGGTGACAA
ACAAAGTCAA AATGGNGCCA CAATCAAATT AGGGGAGAAN TTCTTCTATG AATTTACAAG
TAGTGACATT CCTGCAGAAT ACGCTGGNGT TGTGGAAGAA TGGTCGATTA GCGATAAACT
AGACGTCAAA CATGACAAAT TTAGTGGCCA ATGGTCTGTG TTTGCCAATT CTAATTTTGT
TTTAGCAGAC CGAACCAAAG TGAATAAAGG GGACGACATT TCGAAACTAT TCACGATGAC
CTTTGAACAA GGGGTAGTGA AAATCACGGC CAGTCAAGCC TTTTTNGATG CGATGAATCT
AAAACAAAAC AAAAACGTTG CACACTCATG GAAAGCGTTC ATTGGTGTAG AACGAATTGC
GGCAGGAGAC GTTTACAACA CAATCGAAGA ATCTTTCAAC AATGAGAAGA TTAAAACNAA
TACCGTAGTG ACNCATACGC CAGAAAAACC ACAAACNCCA CCAGAAAAAA CAGTGATTGT
ACCACCAACA CCAAAAACAC CGCAAGCACC AGTAGAGCCA TTAGTGGTAG AAAAGGCAAG
TGTNGTGCCA GAATTGCCGC AAACAGGCGA AAAACAAAAT GTCTTATTAA CGGTAGCTCG
TAGTTTAGCC CCAATGCTTG GCTTAC-CAGG CTTAGGCTTT AAACGTAGAA AAGAAACAAA
ATAA EF064-4 (SEQ ID NO:244) VTIDLSK VKVYQADASL NXKDXKAVAA
AINSGXAKDV TASYXLNLDQ NTVTANMKTN ADGSVVLAMG YKYLLVLPFV VKNVEGDFEN
TAVQLTXDGE TVTNTVINHV PGSNPSKDVK ADKNGTVGSV SLHDKDIPLQ TKIYYEVKSS
ERPANYGGXT EEWGMNDVLD TTHDRFTGKW HATTXYDLKV GXKTLKAGTD ISAYILLENK
DNKDLTFTMN QALLAALNEG SNKVGKQAWS VYLEVERXKT GDVENTQTEN YNKELVRSNT
VVTHTPDDPK PTKAVHNKKG EXIXHGKVAR GDVLSYEMTW DLKGYDKDFA FDTVDLATGV
SFFDDYDETX VTPIKDLLRV KDSKGXDITN QFTISWDDAK GTVTXSAKDP QAFILAXGGQ
ELRVTLPTKV KADVSGDVYN SAEQNTFGQR IKTNTVVNHI PKVXPKKDVV IKVGDKQSQN
GATIKLGEXF FYEFTSSDIP AEYAGVVEEW SISDKLDVKH DKFSGQWSVF ANSNFVLADG
TKVNKGDDTS KLFTMTFEQG VVKITASQAF XDAMNLKENK NVAHSWKAFI GVERIAAGDV
YNTTEESFNN EKIKTNTVVT HTPEKPQTPP EKTVIVPPTP KTPQAPVEPL VVEKASV
EF065-1 (SEQ ID NO:245) TAGCGAAAGA AAATAGGGAG GATTAAAATG TTTAAGAAAG
CAACGAAATT ATTATCGACA ATGGTGATTG TCGCTGGAAC AGTTGTCGGA AATTTCAGTC
CCACATTGGC TTTAGCTGAA GAAGCGGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA
CGGTGAAAGT GAAAGACGAC AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA
CTTTCCCAGT TCAAGCGGGT GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA
GTGGTAGTTT TTCAGATAAT TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG
GCTTATCTGA TCAAGACCGC GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT
TTCCTGATGG AAAGACAAAA ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA
CGCAATTGAC TTATGATAAA AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACCTATG
GTGGTACGCC AACCGCCCCA CGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG
GAGATTTAAC GAATCGAAAA ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC
GTTTAGATGG TTACTTGCAT AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC
CAAGACATCC TCTTCAAGTC TCAGTGGAAT ATAGTAATCA CTACCAAGGT GCAGCAGCAG
AAGTTTTAGC GTTAAACCAA GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT
GGGAAAGTGT TGAATCTTTA AGTTCAGTGA ATTCATACTT TCATAAATAT AAAACAGAAC
TGGCTCCTTT TGTAAAACAA GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA
CAAGCCAATC TATTGATGAT TTTACAACCC AATTAAAACA AATTGTCAAA CATCGTCTGG
CGCAATCGAC ACCAGCAACA GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG
CGACCGCTAC GGACGATGCT GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT
CAGCAACTAG TACAGAAGGT TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA
ATACAGCGAT TGATGCAGCA ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG
CTAAGGAATT TCCAGAAGCG ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA
CGCCAGAAGA TCCAACGATT ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA
ATCGTGAAGA TAGTTTCGAT TGGCATGTCA AAACAGCCTT TGOCAACGAA ACCAGTACTT
GGACCCAAGC CAGCATGGTG GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG
TCACCGACGA AAATGGTAAA GATGTTACAG CTAACCGCAC AGTAACACAA GAAAATAACA
AAGTAACTTT TGAAATGAAC AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT
ATACAATGAC TATOACCACT AAAATTAAAA CTGACGCAAC GGACGAACAA TTAGCGCCTT
ACATTGAACA AGGCGGGATT CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG
TGTTACATTC CAACAAACCA ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG
ACGTAGAAGG ACAAGAACAT TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGCAACG
TCAAAACAGC TTTCGGTAAC GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA
TTAATAAAGT GTTAGACATC ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA
CAGCTAACGG CAAAGTAACA CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG
CNGACAGCTA TGACTATTTA AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA
AAGCTAGCGC AACGCACGAA GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC
AAGCCGACTT GAACTTTGGC AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA
CACCACCTCC ACCAACGCCA GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC
ATTTAGATTT AACCAACCGT GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCCGTA
ACGAAACAAG CACATGGACC CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA
TCACAGACGT GAAAGTTNCT GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA
CACAAGAAAA TAACAAAGTA ACTTTTACTA TCAACAAAAA AGATGACAGC TACTCTTACT
TAGCTGGTCA TACATACACA ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG
AAGAATTAGC GCCTTATATT GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG
GCAACGAAGG TGACGTGTTG CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC
CAGAAGACCC AAAAAAACCT GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC
CTACAAATCA TCAAGCACCA ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA
TTCATTTACC AATGACTAAT ACAACAGTAA ATCCACTTTA CATGATCGCA GGTTTAATTG
TCCTTATAGT GGCTATTAGC TTTGGCATAA CAAAAAATAA AAAAAGAAAA AATTAG
EF065-2 (SEQ ID NO:246) ME KKATKLLSTM VIVAGTVVGN FSPTLALAEE
AVKAGDTEGM TNTVKVKDDS LADCKRILEG QATFPVQAGE TEPVDLVVVE DASGSFSDNF
PHVRQAIDEV VQGLSDQDRV MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS
QFVSGFGDVR TYGGTPTAPG LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK
TNTNDSINEY PDPRHPLQVS VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS
SVNSYFDKYK TEVGPFVKQE LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA
SLTIANQFDI QSATATDDAG NDVPVQINGQ TISATSTEGY VGNITTHYEV KENTAIDAAT
LVSSGTMNQG TIAKEEPEAT IPKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW
HVKTAFGNET STWTQASMVD DINKVLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK
QADSYDYLSG HTYTMTITTK TKTDATDEEL APYTEQGGIP NQADLNFGNE GDVLHSNKPT
VTPPPVDPNI AKDVEGQEHL DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT
DVKVTDENGK DVTANGKVTQ ENNKVTFEHN XQADSYDYLS GHTYTMTITT KIKASATDEE
LAPYIEQGGI PNQADLNFGN EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD
QEFKWNVKTA FGNETSTWTQ ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT
FTMNKKDDSY SYLAGHTYTM TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH
SNKPTVTPPA PTPEDPKKPE PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGTHLPMTNT
TVNPLYMIAG LIVLIVAISF GITKNKKRKN EF065-3 (SEQ ID NO:247) GGTTA
AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC AGTCTGGCTG
ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT GAAACGGAAC
CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT TTTCCACATG
TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC GTGATGCTGG
CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA ATTAATTCAG
CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA AGCCAATTTG
TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA GGATTGAAAC
TCGCTTTAGA TACCTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA ACGTATTTCC
TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT AAGACCAATA
CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC TCAGTGGAAT
ATAGTAATGA CTACCAAGGT GCAGCAGCAC AAGTTTTAGC GTTAAACCAA GAAATTACTA
ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA AGTTCACTGA
ATTCATACTT TGATAAATAT AAAACAGAAC TGGGTCCTTT TGTAAAACAA GAGTTGCAAC
AAGGCTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT TTTACAACCC
AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA GCTTCATTAA
CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT GGAAATGATG
TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT TACGTAGGAA
ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGATCCAGCA ACCCTTGTAA
GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG ACGATTCCTA
AAAATGACAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT ACAAAAGATA
TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT TGGCATGTCA
AAACACCCTT TGCCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG GATGACATTA
ATAAAGTGCT ACATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA GATGTTACAG
CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC AAACAAGCAG
ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT AAAATTAAAA
CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT CCCAACCAAG
CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA ACCGTAACAC
CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT TTAGATTTAA
CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC GAAACAAGCA
CTTGGACCCA AGCCAGCATG GTAGATGACA TTAATAAAGT GTTAGACATC ACTGATCTAA
AAGTCACAGA TGAAAATGGT AAAGATCTTA CAGCTAACGG CAAAGTAACA
CAAGAAAATA
ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA AGTGGTCATA
CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA GAATTAGCAC
CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC AACGAAGGTG
ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA GAAGATCCAA
CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT GACCAAGAAT
TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC CAAGCCAGCA
TGGTGCATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT GANGAAAATG
GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA ACTTTTACTA
TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA ATGACTATTA
CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT GAACAAGGCG
GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG CATTCCAACA
AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT GAACCTAAAC
AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA ACGAACCCAG
TCAATTTTGG AAAATCAGCA AGTAAAGGAA TT EF065-4 (SEQ ID NO:248)
AVKAGDTEGM TNTVKVKDDS LADCKRILEG QATFPVQAGE TEPVDLVVVE DASGSFSDNF
PHVRQAIDEV VQGLSDQDRV MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS
QFVSGFGDVR TYGGTPTAPG LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK
TNTNDSINEY PDPRHPLQVS VEYSNDYQGA AAEVLALNQE ITNQGYEMTN AYWESVESLS
SVNSYFDKYK TEVGPFVKQE LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA
SLTIANQFDT QSATATDDAG NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAIDAAT
LVSSGTMNQG TTAKEFPEAT IPKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW
HVKTAFGNET STWTQASMVD DINKVLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK
QADSYDYLSG HTYTMTITTK IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT
VTPPPVDPNI AKDVEGQEHL DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT
DVKVTDENGK DVTANGKVTQ ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE
LAPYIEQGGI PNQADLNFGN EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD
QEFKWNVKTA FGNETSTWTQ ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT
FTMNKKDDSY SYLAGHTYTM TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH
SNKPTVTPPA PTPEDPKKPE PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIH EF066-1
(SEQ ID NO:249) TAGCGAAAGA AAATAGGGAG GATTAAAATG TTTAAGAAAG
CAACGAAATT ATTATCGACA ATGGTGATTG TCGCTGGAAC AGTTGTGGGA AATTTCAGTC
CCACATTGGC TTTAGCTGAA GAAGCGGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA
CGGTGAAAGT GAAAGACGAC AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA
CTTTCCCAGT TCAAGCGGGT GAAACGCAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA
GTGGTAGTTT TTCAGATAAT TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG
GCTTATCTCA TCAAGACCGC GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT
TTCCTGATGG AAAGACAAAA ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA
CGCAATTCAC TTATGATAAA AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG
GTGGTACGCC AACCGCCCCA GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG
GAGATTTAAC GAATCGAAAA ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC
GTTTAGATGG TTACTTGCAT AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC
CAAGACATCC TCTTCAAGTC TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG
AAGTTTTAGC GTTAAACCAA GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT
GGGAAAGTGT TGAATCTTTA AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG
TGGGTCCTTT TGTAAAACAA CAGTTGCAAC AACGGTCTAG CACACCAGAA GATTTTATTA
CAACCCAATC TATTGATGAT TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG
CGCAATCGAC ACCAGCAACA GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG
CCACCGCTAC CCACGATGCT GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT
CAGCAACTAG TACAGAAGGT TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA
ATACAGOGAT TGATGCAGCA ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG
CTAAGCAATT TCCAGAAGCG ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA
CGCCAGAAGA TCCAACGATT ACAAAAGATA TCGAAAATCA ACAACACTTA GATTTAACCA
ATCGTGAAGA TAGTTTCGAT TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT
GCACCCAAGC CAGCATGGTG GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG
TCACCGACGA AAATGGTAAA GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA
AAGTAACTTT TGAAATGAAC AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT
ATACAATGAC TATCACCACT AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT
ACATTGAACA AGGCGGGATT CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG
TGTTACATTC CAACAAACCA ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG
ACGTAGAAGG ACAAGAACAT TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG
TCAAAACAGC TTTCCGTAAC GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA
TTAATAAAGT GTTAGACATC ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA
CAGCTAACGG CAAAGTAACA CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG
CNGACAGCTA TGACTATTTA AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA
AAGCTAGCGC AACGGACGAA GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC
AAGCCCACTT GAACTTTGGC AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA
CACCACCTGC ACCAACGCCA GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC
ATTTAGATTT AACCAACCGT GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA
ACGAAACAAG CACATGGACC CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA
TCACAGACGT GAAAGTTNCT GANGAAAATC GCAAAGATCT TACAGATAAT GGCATAGTAA
CACAAGAAAA TAACAAAGTA ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT
TAGCTGGTCA TACATACACA ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG
AAGAATTAGC GCCTTATATT GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG
GCAACGAAGG TGACGTGTTG CATTCCAACA AGCCAACCCT AACACCGCCT GCACCAACGC
CAGAAGACCC AAAAAAACCT GAACCTAAAC AACCGCTAAA ACCCAAAAAA CCGTTGACGC
CTACAAATCA TCAAGCACCA ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA
TTCATTTACC AATGACTAAT ACAACAGTAA ATCCACTTTA CATGATCGCA GGTTTAATTG
TCCTTATAGT GGCTATTAGC TTTGGCATAA CAAAAAATAA AAAAAGAAAA AATTAG
EF066-2 (SEQ ID NO:250) MF KKATKLLSTM VIVAGTVVGN FSPTLALAEE
AVKAGDTEGM TNTVKVKDDS LADCKRILEG QATFPVQAGE TEPXTDLVVVE DASGSFSDNF
PHVRQAIDEV VQGLSDQDRV MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VKTQLTYDKS
QFVSGFGDVR TYGGTPTAPC LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK
TNTNDSINEY PDPRHPLQVS VEYSNDYQGA AAEVLALNQE ITNQGYEMTN AYWESVESLS
SVNSYFDKYK TEVGPFVKQE LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA
SLTIANQFDI QSATATDDAG NDVPVQTNGQ TISATSTEGY VGNITIHYEV KENTAIDAAT
LVSSGTMNQG TIAKEFFEAT IPKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW
HVKTAFGNET STWTQASMVD DINKVLDIID VKVTDENGKD VTANGTVTQE NNKVTFEHNK
QADSYDYLSG HTYTMTTTTK IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT
VTPPPVDPNI AKDVEGQEHL DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDTNKVLDTT
DVKVTDENGK DVTANGKVTQ ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE
LAPYIEQGGI PNQADLNFGN EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD
QEFKWNVKTA FGNETSTWTQ ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT
FTMNKKDDSY SYLAGHTYTM TITTKIKTDA TDEELAPYTE QGGTPNQADL NFGNEGDVLH
SNKPTVTPPA PTPEDPKKPE PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIHLPMTNT
TVNPLYMIAG LIVLIVAISF GITKNKKRKN EF066-3 (SEQ ID NO:251) GGTTA
AAGCAGGAGA TACAGAACGA ATGACCAATA CGGTGAAAGT GAAAGACGAC AGTCTGGCTG
ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT GAAACGGAAC
CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT TTTCCACATC
TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC GTGATGCTGG
CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA ATTAATTCAG
CTGATTATGA TATGAATGTG CCCGTCAATA CGCAATTGAC TTATGATAAA AGCCAATTTG
TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA GGATTGAAAC
TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA ACGTATTTCC
TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT AAGACCAATA
CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC TCAGTGGAAT
ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA GAAATTACTA
ACCAAGGCTA TGAAATGATT AATGCGTATT GGCAAAGTGT TGAATCTTTA AGTTCAGTGA
ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA GAGTTGCAAC
AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT TTTACAACCC
AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA GCTTCATTAA
CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT GGAAATGATG
TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT TACGTAGGAA
ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGATGCAGCA ACCCTTGTAA
GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG ACCATTCCTA
AAAATGACAA TGCGCATGCC TGTGACGTGA CGCCAGAAGA TCCAACGATT ACAAAAGATA
TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT TGGCATGTCA
AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG GATGACATTA
ATAAAGTGCT AGATATCATT GATGTGAAAG TCA EF066-4 (SEQ ID NO:252)
AVKAGDTEGM TNTVKVKDDS LADCKRTLEG QATFPVQAGE TEPVDLVVVE DASGSFSDNF
PHVRQAIDEV VQGLSDQDRV MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS
QFVSGFGDVR TYGGTPTAPG LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK
TNTNDSINEY PDPRHPLQVS VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS
SVNSYFDKYK TEVGPFVKQE LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA
SLTIANQFDI QSATATDDAG NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAIDAAT
LVSSGTMNQG TIAKEFPEAT IPKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW
HVKTAFGNET STWTQASMVD DINKVLDIID VKVT EF067-1 (SEQ ID NO:253)
TAGCGAAAGA AAATAGGGAG GATTAAAATG TTTAAGAAAG CAACGAAATT ATTATOGACA
ATGGTGATTG TCGCTGGAAC AGTTGTGGGA AATTTCAGTC CCACATTGGC TTTAGCTGAA
GAAGCGGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC
AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT
TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA
ACGTATTTCC TATTAGTCAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT
AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC
TCAGTGGAAT ATAGTAATGA CTACCAAGGT CCAGCAGCAG AAGTTTTAGC GTTAAACCAA
GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CCCAATCGAC ACCAGCAACA
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT
TACCTAGGAA ACATCACGAT TCACTACCAA GTCAAACAAA ATACAGCGAT TGATCCAGCA
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAACGAATT TCCAGAAGCG
ACGATTCCTA AAAATGACAA TGCOCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT
TGGCATGTCA AAACAGCCTT TGGCAACCAA ACCAGTACTT GGACCCAAGC CAGCATGGTG
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA
GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC
AAACAAGCAG ACACCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT
AAAATTAAAA CTGACGCAAC GCACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT
CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC
GAAACAAGCA CTTGGACCCA ACCCAGCATC GTAGATGACA TTAATAAAGT GTTAGACATC
ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACACCTA TGACTATTTA
AGTCGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA
GAATTAGCAC CTTATATTGA ACAACGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACCCCA
GAAGATCCAA CCATTACAAA AGATATCGAA CCCCAAGAAC ATTTACATTT AACCAACCGT
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC
CAACCCAGCA TGGTGGATGA CATTAATAAA GTCTTAGACA TCACAGACGT GAAAGTTNCT
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTOTTACT TAGCTGGTCA TACATACACA
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTACC GCCTTATATT
GAACAACGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTCTTG
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA
ACGAACCCAG TCAATTTTCG AAAATCAGCA AGTAAAGGAA TTCATTTACC AATGACTAAT
ACAACAGTAA ATCCACTTTA CATGATCGCA GGTTTAATTG TCCTTATAGT GGCTATTAGC
TTTGGCATAA CAAAAAATAA AAAAAGAAAA AATTAC EF067-2 (SEQ ID NO:254) ME
KKATKLLJSTM VTVAGTVVGN FSPTLALAEE AVKAGDTEGM TNTVKVKDDS LADCKRILEG
QATFPVQAGE TEPVDLVVVE DASGSFSDNF PHVRQAIDEV VQCLSDQDRV MLASYRGGKQ
FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG LKLALDTYNQ
THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSTNEY PDPRHPLQVS VEYSNDYQGA
AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE LQQGSSTPED
FITSQSIDDF TTQLKQIVKD RLAQSTPATA ELTIANQEDI QSATATDDAG NDVPVQTNGQ
TISATETEGY VCNITIHYEV KENTAIDAAT LVSSCTMNQG TIAKEFPEAT IPKNDNAHAC
DVTPEDPTIT KUTENQEELD LTNREDSFDW HVKTAFGNET STWTQASMVD DINKVLDIID
VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYTMTITTK IKTDATDEEL
APYTEQOGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL DLTNRDQEFK
WNVKTAFGNE TSTWTQASMV DDINKVLDIT DVKVTDENGK DVTANGKVTQ ENNKVTFEMN
XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNEGN ECDVLHSNKP
TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FCNETSTWTQ ASMVDDTNKV
LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM TITTKIKTDA
TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE PKQPLKPKKP
LTPTNHQAPT NPVNFGKSAS KGIHLPMTNT TVNPLYMIAG LIVLIVAISF GITKNKKRKN
EF067-3 (SEQ ID NO:255) GOT AGATATCATT GATGTGAAAG TCACCGACGA
AAATGGTAAA GATOTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT
TGAAATGAAC AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC
TATOACCACT AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA
AGGCGGGATT CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TOTTACATTO
CAACAAACCA ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG
ACAAGAACAT TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC
TTTCGGTAAC GAAACAAGCA OTTOGACOCA AGCCAGCATG GTAGATGACA TTAATAAAGT
OTTAGACATO ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG
CAAAGTAACA CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG ONGACAGOTA
TGACTATTTA AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC
AACGGACGAA GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT
GAACTTTGGC AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC
ACCAACGCCA GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT
AACCAACCGT GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGCTA ACGAAACAAG
CACATGGACC CAAGCCAGCA TGGTGGATGA CATTAATAAA OTOTTAGACA TOACAGACOT
GAAAGTTNCT GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA
TAACAAAGTA ACTTTTACTA TGAACAAAAA AGATOACAGO TACTCTTACT TAGCTGGTCA
TACATACACA ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC
GCCTTATATT GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG
TGACGTGTTG CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC
AAAAAAACCT GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA
TCAAGCACCA ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TT EF067-4
(SEQ ID NO:256) VLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG
HTYTMTITTK IKTDATDEEL APYTEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI
AKDVEGQEHL DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDTNKVLDTT DVKVTDENGK
DVTANGKVTQ ENNKVTFEMN XQADSYDYLS GHTYTMTTTT
KIKASATDEE LAPYTEQOGI PNQADLNFGN EGDVLHSNKP TVTPPAPTPE DPTITKDIEG
QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ ASMVDDINKV LDITDVKVXX ENGKDVTDNG
IVTQENNKVT FTMNKKDDSY SYLAGHTYTM TITTKIKTDA TDEELAPYTE QGGIPNQADL
NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS
KGIH EF068-1 (SEQ ID NO:257) TAGGGGAAGC TAATGATCTT GGTATTTATC
GTTTATTTTA AAGAAAAGAG GGACGATCAG ATGAAAAAGA AAATTGTTGA GGATTTTAAT
CGGAAAAGTC AGCATAAAAA ATGGACAAAA CGCAAGATGC TTAATTTAGC AATATCAAGT
GGTTTATTAT TTACGTCATT AGCAATCCCT GTAAGTATAG CTGTTACCTC TGGCACAATC
AGTGCATCAG CAGCGGTCTT GGATATCGAA CTATTATCAA ATGTTACGTC AAATAATGAC
AGTGGCACTT CAACGAGTAA TCGTTGGACA GCCGCAAACC AAAATCAACC AGTTAATTTC
ACGGTTTCTG GTGGCGCTTT AGCAGATGCT TCCGCTGTGT TTAGTGGACA AAAACAAGCG
GTGTTAGTGG TTCCTCCTGA GTTAAGAGGA AATGTAGCTG CAGOAGGOAG CGCAGCAATC
AATACCAATG TCACGATTGA TCTTTCAAAA GTTACTTTTT TGACTGCCGT TTTGAATGCA
GCCAATGATT TAACCAATGT GATTACTCAA ATTACCAGTG GGGCGTTAGG GAATTTAACT
GGTGTTGATA TTGATTTGAC GGAAGTGAAT CGTCAATTGG AATTAGTTAA TAACATTGAA
AACTTAGGTG CTGCTTCATT TACAGCTCCG GAAACGTTAG CAGCTGACGG CTCATACATT
AGTGCACCGA TTAGTGATGG TTTAGGGTTA GTTTTAGCCC AAAATGTTTC AAACATCTTA
CAAGATTTGA ATGCGGCAGT TCAAGCTTTG GAGGCAAAAG GTACCAGTAT CCCAAGTAAT
CTTGTCGCCG CAGCTATAAA TGCAGCCTTG CTTCCTGTCA AAGGCACGGT AAACGTGGCT
GTTTCAGGTG CTTTGCCTTT ATTAGCGGTT GGTGGTTCAG GCGTAAATGA GTTAGTGGAT
GCTTCTTTAC TAGGCACAAC CACGGTTACT TTACCAACTA CCGTTTCAAC ACCTCAAAAT
TTATCCAATA ATTTAGATGC TCGTTTTGTA GGAACAGTCG TTCAAACAGA TCTTTTAGAC
GTTAATTTAT TAGCAACAGC AGACCGTGTA TCCAACATTT ATTTTGCTGC AGGCACTACT
AGTGAAGTAA CCGCACCAAC AATCACAGGA GTAACAGGTA ATTCAACAGC AGGTTACGAA
GTTAAAGGAA CTGCCGATGC CAATGCCACG GTTGAAATCC GAAATGCAGG AGGCACCGTA
ATAGGCACAG GTACCGCTGA TGGGACAGGA GCGTTTACAG TTACCGTTCC CGCAGGTGAA
GCACGCGCCA ATGAAACGTT AACCGCCGTA GCGAAAAACG CCAGCGGNAC AGAAAGNACG
CCAACAACGT TCCAAACNCC AGCGGATGAA GCAACCGTAA CCGCACCAAC AATCACAGGA
GTGACAGGTA ATTCAACGGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG
GTTGAAATCC GAAATGCAGG AGGCACCGTA ATAGOCACAG GTACCGCTGA TGGGACAGGA
GCGTTTACAG TTACCGTTCC CGCAGGTGAA GCAGGTGCCA ATGAAACGTT AACCGCCGTA
GCGAAAAACG CCAGCGGCAC ACAAAGTACG CCAACAACGT TCCAAACACC AGCGGATGAA
GCAACCGTAA CCGCACCAAC AATCACAGGA GTGACAGGTA ATTCAACAGC AGGTTACGAA
GTTAAAGGAA CTGCCGATGC CAATGCCACG GTTGAGATCC GAAATGCAGG AGGTGCCGTG
ATAGGTACAG GTACTGCTGA TGGGACAGGG GCATTTACAG TTACCATTCC CGCAGGTGAA
GCAGGTGCGA ATGAAACGTT AACCGCCGTA GCGAAAAACG CCAGCGGTAC AGAAAGTACG
CCAACAACGT TCCAAACGCC AGCGGATCCT AATACGCCCG TGGCGACGCC AATTGTTGAG
ACTGTAACAG GTAGTACAAC AAAAGGCTAT GAGGTCAAAG GGACTGCTGA AGTTGGCACC
ACCATTGAGG TTCGCGATGC AGCTGGCACG GTCCTTGGTA CTGCAACAAC TGGAACTGAC
CGAAAATATA CAGTGACTTT AGATTCAGGA ACAGCAACAG CAAATCAAAC GCTGAGCGTT
GTAGCGAAAA ACGCTAGTGG CACGGAAAGT CAACCAGCAA CGGCGACAAC ACCAGCTGAT
GTCACTGCAC CAACAGTTGA TAACATCACA GGCAACTCTG GTTCGGGTTA TGAAATTACA
GGAACAGCAG ACCCTAACAC AACAATCGAA GTTCGTGATC CATCTCGGGC AGTCATTGCT
ACAGGTACCT CTGATGCGAA TGGTGATTTT ACTGTAACGC TACCAACGGG AACGACCAAT
CCTGGGGATA CCTTAACAGT GATTGGAAAG GATAACGCGG GAAATGAAAG TCAACCGACT
GAAGTCCTTG TTCCTGCTGA TGCCACGGTT ACAGCACCAA CTGTAACAGG AGTAACAGGT
AATTCAGTTG CTGGTTATCA GGTGACAGGC ACCGCTGATC CGAATGCTAC CATCGAAATT
CGTGATGCAG ATGGGAACGT GATTGCAACA GGGACTGCCG ATGGGACTGG TTCCTTTGCT
GTGAACCTTC CAGCTGGGAC GGCAAATGCG AATCAAACAT TGACAGCGTT AGCCAAAGAT
CCTGCTGGCA ATACAAGTAC ACCGACAACC TTCCAAACAC CAGCAGATGA AGTAGTGGCA
CCGCCAAGTG TCGACAAAGT TACTGGGAAT ACAACACAAG GATATCAAGT GACAGGTACC
GCTGAACTTG GCACCACCAT TGAAGTTCGT GCAACAGACG GAACAGTTTT AGGCACCGCA
ACAACTGGAC CGACTGGCCA ATATACTGTG ACGTTAGCTT CAGGAAAAGC AACAGCTAAA
CAAACAGTGA ATGTAGTTGC TAAAAATGAT ACTGGACTTG AGAGTCAACC AACTACAGCT
ATGACACCCG CTGATGTTAC CACACCAACA ATTGGTGACA TTACTGGAGA TTCAACAACT
GGTTATGAAA TCACTGGGAC CGCGGACCCT AATACCACCA TTGAAGTACG GAACCCAGAT
GGAACAATTA TTGGTACAAC GACAACGGAT GATCAAGGAA ACTTTACTGT GGACCTTCCA
GCGGGAGCCG CTAATCCTGG TGATACATTA ACAGTTGTTG GAAAAGACGG TGACGGCAAT
CAAAGTCAAC CAACGGAAGT GACGGTCCCT GAAGATGCAA CCGTAGCAGC ACCAACTGTG
ACGACTGTTA CAGGAACAAC TGCCACTGGG TATCAAGTAA CCGGCACGGC AGAGCCAAAT
GTCACCATTG AGATTCACAA TGAAGCAGGT TTAGTTATTG CTACGGGAAC GACTGATGGT
GCTGGCGCAT TTACAATCAC TCTTCCGACG GGCACAGCAA CAGCTAACGA AGCCTTAACT
GCCATTGCGA AAGATGCTGC TGGGAAAGAA AGTAATCCGA CTGCTTTCAA AACACCTGCT
GATCCAGATG CACCAGTCGC GACACCTACT GTTGACAAAA TCACTGGTAG CACGACAAAC
GGCTATCAAG TAGTAGGAGC AGCAGAAGTT GGTACAACAG TTGAGGTGCG TGACGCCGAT
GGCACAGTCC TTGGCATGGC AACTACTGGA ACTGATGGCA AATACACAGT GACTTTAGAG
CCAGGGAAGG CCTCAGCTAA CGAAACAATA ACTGTCGTAG CGAAAAATGC AACAGGAAAA
GAAAGTCAGC CAGCTACAGC AACTACACCA GTCGACTTAG CCACACCAAC CATTGATTCT
ATTACCGGAA ATTCTAGTAA AGGTTACGAA ATCACTGGAA CGGCGGAGCC AAAAACCACT
ATTGATGTCC GTGACGCAGA CGGAACCATC ATTGCTGCTA CAACTGCTAA CGAAACCGGC
CAATATACGG TGACTCTACC AGCTGGCGTA GTGACACCAG GAGAAACGAT TACGATTATT
AGCAAAGATG GCGCAGGTAA TGAAAGTCAA CCAGCTACAC CCGTTATTCC AGCGGATGTT
GTTTTAGCGG CGCCAACTAT TACGAAGGTT GAAGGAAACA AAGCCAATGG CTATACAGTC
ACTGGAACTG CTGATCCAAA TGTCACGGTT CAATTTTACA ATAGCAGTGA ACAATTATTG
GCAAGTGCCA ATACAACTAC TGGAGCTACC TTCTCCGTTC ATATTGCAGC AGGGTTAGCA
ACAGAAAAAG AAACGTTAAC CGCACTAACC ACAGATACAC AAGGAAATGT GAGTCCTAAA
ACCACATTTA TGACGCCAGC CGATATTACG GGAGAACCAG AGATTAAAAT TGCGGCACCA
ACTGTTTCTT CAGTTTTAGG AACGTCTAAA GCCGGCTACC TCATCAAAGG AACAGCTGAA
CCAAACCGAA TCATTCAAAT TAGTAACCGA CTATTAAGAA GTGTGATTGC TGTAGGTGCC
ACCGATGCTG AAGGCAACTT CGCTATCCAA TTAACAGCGG GACAAGCGAC TGCTCAACAA
AGTTTACTTG CGACAGCTAC CGATGGCGCA GGACATTACA GTACGGCTAC AACCTTCATG
ACGCCAGCCG ACCCAACGAA TCCTGGAGGA GGCAATGGTA ACACTGGCGG AAATAACGGC
AATACAGGCG GCAATACAGG AAACAATGGC GCAACTGGCG GGAATAATGG GAATGGTTCA
AACACAGGTT CAAATCCAAA TGGAGGTTCT GGTTTAGGCA CAACAGGTTC TGGCTTAGGT
TCACTAGGCA ATGGCCTCGG TACAAATGGT AGTGGCTACC ACCCTAAACT AAGTACCATC
ACTTATGGCA CTGGAAATCA CGGGAAAACA GGCTACTTAC CTAGCACAGG TGAAAAAGAG
TCTTCAGCCG TGACAACAAG TTTGTTTCGC GCCTTTGTCG CACTCCTTGC GAGCATGGGA
ATCATCAAAC GCAAACGTAA AAACTAG EF068-2 (SEQ ID NO:258) M KKKIVEDFNR
KSQHKKWTKR KMLNLAISSG LLFTSLAIPV SIAVTSCTIS ASAAVLDIEL LSNVTSNNDS
GTSTSNRWTA ANQNQPVNFT VSGGALADAS AVFSGQKQAV LVVPPELRGN VAAAGSAAIN
TNVTIDLSKV TFLTAVLNAA NDLTNVITQI TSGALGNLTG VDIDLTEVNR QLELVNNIEN
LGAASFTAPE TLAADGSYIS APISDGLGLV LAQNVSNILQ DLNAAVQALE AKGTSIPSNL
VAAAINAALL PVKGTVNVAV SGALPLLAVG GSGVNELVDA SLLGTTTVTL PTTVSTPQNL
SNNLDARFVG TVVQTDLLDV NLLATADGVS NIYFAAGTTS EVTAPTITGV TGNSTAGYEV
KGTADANATV EIRNAGGTVI GTGTADGTGA FTVTVPAGEA GANETLTAVA KNASGTEXTP
TTFQTPADEA TVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGTVI GTGTADGTGA
FTVTVPAGEA GANETLTAVA KNASGTESTP TTFQTPADEA TVTAPTITGV TGNSTAGYEV
KGTADANATV EIRNAGGAVI GTGTADGTGA FTVTIPAGEA GANETLTAVA KNASGTESTP
TTFQTPADPN TPVATPIVET VTCSTTKGYE VKGTAEVGTT IEVRDAAGTV LGTATTGTDG
KYTVTLDSGT ATANQTLSVV AKNASGTESQ PATATTPADV TAPTVDNITG NSGSGYEITG
TADPNTTIEV RDPSGAVIGT GTSDANGDFT VTLPTGTTNP GDTLTVIGKD NAGNESQPTE
VLVPADATVT APTVTGVTCN SVAGYQVTGT ADPNATIEIR DADGNVIATG TADGTGSFAV
NLPAGTANAN ETLTALAKDP AGNTSTPTTF QTPADEVVAP PSVDKVTGNT TQGYQVTGTA
ELGTTIEVRA TDGTVLGTAT TGPTGQYTVT LASGKATAKQ TVNVVAKNDT GLESQPTTAM
TPADVTTPTI GDITGDSTTG YEITGTADPN TTIEVRNPDG TIIGTTTTDD QGNFTVDLPA
GAANPGDTLT VVGKDGDGNE SQPTEVTVPE DATVAAPTVT TVTGTTATGY QVTGTAEPNV
TIEIHNEAGL VIATGTTDGA CAFTITLPTG TATANEALTA IAKDAAGKES NPTAFKTPAD
PDAPVATPTV DKITGSTTNG YQVVGAAEVG TTVEVRDADG TVLGMATTGT DGKYTVTLEP
GKASANETIT VVAKNATGKE SQPATATTPV DLATPTIDSI TGNSSKGYEI TGTAEPKTTI
DVRDADGTII AATTANETGQ YTVTLPAGVV TPGETITIIS KDGAGNESQP ATAVIPADVV
LAAPTITKVE GNKANGYTVT GTADPNVTVQ FYNSSEQLLA SGNTTTGGTF SVHIAAGLAT
EKETLTALTT DTQGNVSPKT TFMTPADITG EPEIKIAAPT VSSVLGTSKA GYLIKGTAEP
NRIIQISNRL LRSVIAVGAT DAEGNFAIQL TAGQATAQQS LLATATDGAG HYSTATTFMT
PADPTNPGGG NGNTGGNNGN TGGNTGNNGA TGGNNCNGSN TGSNPNGGSG LGTTGSGLGS
LGNGLGTNGS GYHPKLSTIS YGTGNHGKTG YLPSTGEKES SAVTTSLFGA EVALLASMGI
IKRKRKN EF068-3 (SEQ ID NO:259) CTC TGGCACAATC AGTGCATCAG
CAGCGGTCTT GGATATCGAA CTATTATCAA ATGTTACGTC AAATAATGAC AGTGGCACTT
CAACGAGTAA TCGTTGGACA GCCGCAAACC AAAATCAACC AGTTAATTTC ACGGTTTCTG
GTGGCGCTTT AGCAGATGCT TCCGCTGTGT TTAGTGGACA AAAACAAGCG GTGTTAGTGG
TTCCTCCTGA GTTAAGAGGA AATGTAGCTG CAGCAGGCAG CGCAGCAATC AATACCAATG
TCACGATTGA TCTTTCAAAA GTTACTTTTT TGACTGCCGT TTTGAATGCA GCCAATGATT
TAACCAATGT GATTACTCAA ATTACCAGTG GGGCGTTAGG GAATTTAACT GGTGTTGATA
TTGATTTGAC GGAAGTGAAT CGTCAATTCG AATTAGTTAA TAACATTGAA AACTTAGGTG
CTGCTTCATT TACAGCTCCG GAAACGTTAG CAGCTGACGG CTCATACATT AGTGCACCGA
TTAGTGATGG TTTAGGGTTA GTTTTAGCCC AAAATGTTTC AAACATCTTA CAAGATTTGA
ATGCGGCAGT TCAAGCTTTG GAGGCAAAAG GTACCAGTAT CCCAAGTAAT CTTGTCGCCG
CAGCTATAAA TGCAGCCTTG CTTCCTGTCA AAGGCACGGT AAACGTGGCT GTTTCAGGTG
CTTTGCCTTT ATTAGCGGTT GCTGGTTCAG GCGTAAATGA GTTAGTGGAT GCTTCTTTAC
TAGGCACAAC CACGGTTACT TTACCAACTA CCGTTTCAAC ACCTCAAAAT TTATCCAATA
ATTTAGATGC TCGTTTTGTA GGAACAGTCG TTCAAACAGA TCTTTTAGAC GTTAATTTAT
TAGCAACAGC AGACGGTGTA TCCAACATTT ATTTTGCTGC AGGCACTACT AGTGAACTAA
CCGCACCAAC AATCACAGGA GTAACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA
CTGCCGATGC CAATGCCACG GTTGAAATCC GAAATGCAGG AGGCACCGTA ATAGGCACAG
GTACCGCTGA TGGGACAGGA GCGTTTACAG TTACCGTTCC CGCAGGTGAA GCAGGCGCCA
ATGAAACGTT AACCGCCGTA GCGAAAAACG CCAGCGGNAC AGAAAGNACG CCAACAACGT
TCCAAACNCC AGCGGATGAA GCAACCGTAA CCGCACCAAC AATCACAGGA GTGACAGGTA
ATTCAACCGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG GTTGAAATCC
GAAATGCAGG AGGCACCGTA ATAGGCACAG GTACCGCTGA TGGGACAGGA GCGTTTACAG
TTACCGTTCC CGCAGGTGAA GCAGGTGCCA ATGAAACGTT AACCGCCGTA GCGAAAAACG
CCAGCGGCAC AGAAAGTACC CCAACAACGT TCCAAACACC AGCGGATGAA GCAACCGTAA
CCGCACCAAC AATCACAGGA GTGACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA
CTGCCGATGC CAATGCCACG GTTGAGATCC GAAATGCAGG AGGTGCCGTG ATAGGTACAG
GTACTGCTGA TGGGACAGGG GCATTTACAG TTACCATTCC CGCAGGTCAA GCAGGTGCGA
ATGAAACGTT AACCGCCGTA GCGAAAAACG CCAGCGGTAC AGAAAGTACG CCAACAACGT
TCCAAACGCC EF068-4 (SEQ ID NO:260) TSGTIS ASAAVLDIEL LSNVTSNNDS
GTSTSNRWTA ANQNQPXTNFT VSGGALADAS AVFSGQKQAV LVVPPELRGN VAAAGSAAIN
TNVTIDLSKV TFLTAVLNAA NDLTNVITQI TSGALGNLTG VDIDLTEVNR QLELVNNIEN
LGAASFTAPE TLAADGSYIS APISDGLGLV LAQNVSNILQ DLNAAVQALE AKGTSIPSNL
VAAAINAALL PVKGTVNVAV SGALPLLAVG GSGVNELVDA SLLGTTTVTL PTTVSTPQNL
SNNLDARFVG TVVQTDLLDV NLLATADGVS NIYFAAGTTS EVTAPTITGV TGNSTAGYEV
KGTADANATV EIRNAGGTVI GTGTADGTGA FTVTVPAGEA GANETLTAVA KNASGTEXTP
TTFQTP EF069-1 (SEQ ID NO:261) TAGGGGAACC TAATGATCTT GGTATTTATC
GTTTATTTTA AAGAAAAGAG GGACGATCAG ATCAAAAAGA AAATTGTTGA GGATTTTAAT
CGGAAAAGTC AGCATAAAAA ATGGACAAAA CGCAAGATGC TTAATTTAGC AATATCAAGT
GGTTTATTAT TTACGTCATT AGCAATCCCT GTAAGTATAG CTGTTACCTC TGGCACAATC
AGTGCATCAG CAGCGGTCTT GGATATCGAA CTATTATCAA ATGTTACGTC AAATAATGAC
AGTGGCACTT CAACGAGTAA TCGTTGGACA GCCGCAAACC AAAATCAACC AGTTAATTTC
ACGGTTTCTG GTGGCGCTTT AGCAGATGCT TCCGCTGTGT TTAGTGGACA AAAACAAGCG
GTGTTAGTGG TTCCTCCTGA GTTAAGAGGA AATGTAGCTG CAGCAGGCAG CGCAGCAATC
AATACCAATG TCACGATTGA TCTTTCAAAA GTTACTTTTT TGACTGCCGT TTTGAATGCA
GCCAATGATT TAACCAATGT GATTACTCAA ATTACCAGTG GGGCGTTAGG GAATTTAACT
GGTGTTGATA TTGATTTGAC GGAAGTGAAT CGTCAATTGG AATTAGTTAA TAACATTGAA
AACTTAGGTG CTGCTTCATT TACAGCTCCG GAAACGTTAG CAGCTGACGG CTCATACATT
AGTGCACCGA TTAGTGATGG TTTAGGGTTA GTTTTAGCCC AAAATGTTTC AAACATCTTA
CAAGATTTGA ATGCGGCAGT TCAAGCTTTG GAGGCAAAAG GTACCAGTAT CCCAAGTAAT
CTTGTCGCCG CAGCTATAAA TGCAGCCTTG CTTCCTGTCA AAGGCACGGT AAACGTGGCT
GTTTCAGGTG CTTTGCCTTT ATTAGCGGTT GGTGGTTCAG GCGTAAATGA GTTAGTGGAT
GCTTCTTTAC TAGGCACAAC CACGGTTACT TTACCAACTA CCGTTTCAAC ACCTCAAAAT
TTATCCAATA ATTTAGATGC TCGTTTTGTA GGAACAGTCG TTCAAACAGA TCTTTTAGAC
GTTAATTTAT TAGCAACAGC AGACGGTGTA TCCAACATTT ATTTTGCTGC AGGOACTACT
AGTGAAGTAA CCGCACCAAC AATCACAGGA GTAACAGGTA ATTCAACAGC AGGTTACGAA
GTTAAAGGAA CTGCCGATGC CAATGCCACG GTTGAAATCC GAAATGCAGG AGGCACCGTA
ATAGGCACAG GTACCGCTGA TGGGACAGGA GCGTTTACAG TTACCGTTCC CGCAGGTGAA
GCAGGCGCCA ATGAAACGTT AACCGCCGTA GCGAAAAACG CCAGCGGNAC AGAAAGNACG
CCAACAACGT TCCAAACNCC AGCGGATGAA GCAACCGTAA CCGCACCAAC AATCACAGGA
GTGACAGGTA ATTCAACGGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG
GTTGAAATCC GAAATGCAGG AGGCACCGTA ATAGGCACAG GTACCGCTGA TGGGACAGGA
GCGTTTACAG TTACCGTTCC CGCAGGTGAA GCAGGTGCCA ATGAAACGTT AACCGCCGTA
GCGAAAAACG CCAGCGGCAC AGAAAGTACG CCAACAACGT TCCAAACACC AGCGGATGAA
GCAACCGTAA CCGCACCAAC AATCACAGGA GTGACAGGTA ATTCAACAGC AGGTTACGAA
GTTAAAGGAA CTGCCGATGC CAATGCCACG GTTGAGATCC GAAATGCAGG AGGTGCCGTG
ATAGGTACAG GTACTGCTGA TGGGACAGGG GCATTTACAG TTACCATTCC CGCAGGTGAA
GCAGGTGCGA ATGAAACGTT AACCGCCGTA GCGAAAAACG CCAGCGGTAC AGAAAGTACG
CCAACAACGT TCCAAACGCC AGCGGATCCT AATACGCCCG TGGCGACGCC AATTGTTGAG
ACTGTAACAG GTAGTACAAC AAAAGGCTAT GAGGTCAAAG GGACTCCTGA AGTTGGCACC
ACCATTGAGG TTCGCGATGC AGCTGGCACG GTCCTTGGTA CTGCAACAAC TGGAACTGAC
GGAAAATATA CAGTGACTTT AGATTCAGGA ACAGCAACAG CAAATCAAAC GCTGAGCGTT
GTAGCGAAAA ACGCTAGTGG CACGGAAAGT CAACCAGCAA CGGCGACAAC ACCAGCTGAT
GTCACTGCAC CAACAGTTGA TAACATCACA GGCAACTCTG GTTCGGGTTA TGAAATTACA
GGAACAGCAG ACCCTAACAC AACAATCGAA GTTCGTGATC CATCTGGGGC AGTCATTGGT
ACAGGTACCT CTGATGCGAA TGGTGATTTT ACTGTAACGC TACCAACGGG AACGACCAAT
CCTGGGGATA CGTTAACAGT GATTGGAAAG GATAACGCGG GAAATGAAAG TCAACCGACT
GAAGTCCTTG TTCCTGCTGA TGCCACGGTT ACAGCACCAA CTGTAACAGG AGTAACAGGT
AATTCAGTTG CTGGTTATCA GGTGACAGGC ACCGCTGATC CGAATGCTAC CATCGAAATT
CGTGATGCAG ATGGGAACGT GATTGCAACA GGGACTGCCG ATGGGACTGG TTCCTTTGCT
GTGAACCTTC CAGCTGGGAC GGCAAATGCG AATCAAACAT TGACAGCGTT AGCCAAAGAT
CCTGCTGGCA ATACAAGTAC ACCGACAACC TTCCAAACAC CAGCAGATGA AGTAGTGGCA
CCGCCAAGTG TCGACAAAGT TACTGGGAAT ACAACACAAG GATATCAAGT GACAGGTACC
GCTGAACTTG GCACCACCAT TGAAGTTCGT GCAACAGACG GAACAGTTTT AGGCACCGCA
ACAACTGGAC CGACTGGCCA ATATACTGTG ACGTTAGCTT CAGGAAAAGC AACAGCTAAA
CAAACAGTGA ATGTAGTTGC TAAAAATGAT ACTGGACTTG AGAGTCAACC AACTACAGCT
ATGACACCCG CTGATGTTAC CACACCAACA ATTGGTGACA TTACTGGAGA TTCAACAACT
GGTTATGAAA TCACTGGGAC GGCGGACCCT AATACCACCA TTGAAGTACG GAACCCAGAT
GGAACAATTA TTGGTACAAC GACAACGGAT GATCAAGGAA ACTTTACTGT GGACCTTCCA
GCGGGAGCCG CTAATCCTGG TOATACATTA ACAGTTGTTG GAAAAGACGG TGACGGCAAT
GAAAGTCAAC CAACGGAAGT GACGGTCCCT GAAGATGCAA CCGTAGCAGC ACCAACTGTG
ACGACTGTTA CAGGAACAAC TGCCACTGGG TATCAAGTAA CCGGCACGGC AGAGCCAAAT
GTCACCATTG AGATTCACAA TGAAGCAGGT TTAGTTATTG CTACGGGAAC GACTGATGGT
GCTGGCGCAT TTACAATCAC TCTTCCGACG GGCACAGCAA CAGCTAACGA AGCCTTAACT
GCCATTGCGA AAGATGCTGC TGGGAAAGAA AGTAATCCGA CTGCTTTCAA AACACCTGCT
GATCCAGATG CACCAGTCGC GACACCTACT GTTGACAAAA TCACTGGTAG CACGACAAAC
GGCTATCAAG TAGTAGGAGC AGCAGAAGTT GGTACAACAG TTGAGGTGCG TGACGCCGAT
GGCACAGTCC TTGGCATGGC AACTACTGGA ACTGATGCCA AATACACAGT GACTTTAGAG
CCAGGGAAGG CCTCAGCTAA CGAAACAATA ACTGTCGTAG CGAAAAATGC AACAGGAAAA
GAAAGTCAGC CAGCTACAGC AACTACACCA GTCGACTTAG CCACACCAAC CATTGATTCT
ATTACCGGAA ATTCTAGTAA AGGTTACGAA ATCACTGGAA CGGCGGAGCC AAAAACCACT
ATTGATGTCC GTGACGCAGA CGGAACCATC ATTGCTGCTA CAACTGCTAA CGAAACCGGC
CAATATACGG TGACTCTACC AGCTGGCGTA GTGACACCAG GAGAAACGAT TACGATTATT
AGCAAAGATG GCGCAGGTAA TGAAAGTCAA
CCAGCTACAG CCGTTATTCC AGCGGATGTT GTTTTAGCGG CGCCAACTAT TACGAAGGTT
GAAGGAAACA AAGCCAATGG CTATACAGTC ACTGGAACTG CTGATCCAAA TGTCACGGTT
CAATTTTACA ATAGCAGTGA ACAATTATTG GCAAGTGGCA ATACAACTAC TGGAGGTACC
TTCTCCGTTC ATATTCCAGC AGGGTTAGCA ACAGAAAAAG AAACGTTAAC CGCACTAACC
ACAGATACAC AAGGAAATGT GAGTCCTAAA ACCACATTTA TGACGCCAGC CGATATTACG
GGAGAACCAG AGATTAAAAT TGCGGCACCA ACTGTTTCTT CAGTTTTAGG AACGTCTAAA
GCCGGCTACC TCATCAAAGG AACAGCTGAA CCAAACCGAA TCATTCAAAT TAGTAACCGA
CTATTAAGAA GTGTGATTGC TGTAGGTGCC ACCGATGCTG AAGGCAACTT CGCTATCCAA
TTAACAGCGG GACAAGCGAC TGCTCAACAA AGTTTACTTG CGACAGCTAC CGATGGCGCA
GGACATTACA GTACGGCTAC AACCTTCATG ACGCCAGCCG ACCCAACGAA TCCTGGACGA
GGCAATGGTA ACACTGGCGG AAATAACGGC AATACAGGCG GCAATACAGG AAACAATGGC
GCAACTGGCG GGAATAATGG GAATGGTTCA AACACAGGTT CAAATCCAAA TGGAGGTTCT
GGTTTAGGCA CAACAGGTTC TGGCTTAGGT TCACTAGGCA ATGGCCTCGG TACAAATGGT
AGTGGCTACC ACCCTAAACT AAGTACCATC AGTTATGGCA CTGGAAATCA CGGGAAAACA
GGCTACTTAC CTAGCACAGG TGAAAAAGAG TCTTCAGCCG TGACAACAAG TTTGTTTGGC
GCCTTTGTCG CACTCCTTGC GAGCATGCGA ATCATCAAAC GCAAACGTAA AAACTAG
EF069-2 (SEQ ID NO:262) M KKKIVEDFNR KSQHKKWTKR KMLNLAISSG
LLFTSLAIPV SIAVTSGTIS ASAAVLDIEL LSNVTSNNDS GTSTSNRWTA ANQNQPVNFT
VSGGALADAS AVFSGQKQAV LVVPPELRGN VAAAGSAAIN TNVTIDLSKV TFLTAVLNAA
NDLTNVITQI TSGALGNLTG VDIDLTEVNR QLELVNNIEN LGAASFTAPE TLAADGSYIS
APISDGLGLV LAQNVSNTLQ DLNAAVQALE AKGTSIPSNL VAAAINAALL PVKGTVNVAV
SGALPLLAVG GSGVNELVDA SLLGTTTVTL PTTVSTPQNL SNNLDARFVG TVVQTDLLDV
NLLATADGVS NIYFAAGTTS EVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGTVI
GTGTADGTGA PIWTVPAGEA GANETLTAVA KNASGTEXTP TTFQTPADEA TVTAPTITGV
TGNSTAGYEV KGTADANATV EIRNAGGTVI GTGTADGTGA FTVTVPAGEA GANETLTAVA
KNASGTESTP TTFQTPADEA TVTAPTITGV TGNSTAGYEV KGTADANATV ETENAGGAVI
GTGTADGTGA FTVTIPAGEA GANETLTAVA KNASGTESTP TTFQTPADPN TPVATPIVET
VTGSTTKGYE VKGTAEVGTT TEVRDAAGTV LGTATTGTDG KYTVTLDSGT ATANQTLSVV
AKNASGTESQ PATATTPADV TAPTVDNITG NSGSGYEITG TADPNTTIEV RDPSGAVIGT
GTSDANGDFT VTLPTGTTNP GDTLTVIGKD NAGNESQPTE VLVPADATVT APTVTGVTGN
SVAGYQVTGT ADPNATIEIR DADGNVIATG TADGTGSFAV NLPAGTANAN ETLTALAKDP
AGNTSTPTTF QTPADEVVAP PSVDKVTGNT TQGYQVTGTA ELGTTIEVRA TDGTVLGTAT
TGPTGQYTVT LASGKATAKQ TVNVVAKNDT GLESQPTTAN TPADVTTPTI GDITGDSTTG
YEITGTADPN TTIEVENPDG TIIGTTTTDD QGNFTVDLPA GAANPGDTLT VVGKDGDGNE
SQPTEVTVPE DATVAAPTVT TVTGTTATGY QVTGTAEPNV TTETHNEAGL VIATGTTDGA
GAFTITLPTG TATANEALTA IAKDAAGKES NPTAFKTPAD PDAPVATPTV DKITGSTTNG
YQVVGAAEVG TTVEVRDADG TVLGMATTGT DGKYTVTLEP GKASANETIT VVAKNATGKE
SQPATATTPV DLATPTTDSI TGNSSKGYEI TGTAEPKTTI DVRDADGTII AATTANETGQ
YTVTLPAGVV TPGETITIIS KDGAGNESQP ATAVIPADVV LAAPTITKVE GNKANGYTVT
GTADPNVTVQ FYNSSEQLLA SGNTTTGGTF SVHTAAGLAT EKETLTALTT DTQGNVSPKT
TFMTPADITG EPETKIAAPT VSSVLGTSKA GYLIKGTAEP NRIIQISNRL LRSVIAVGAT
DAEGNFAIQL TAGQATAQQS LLATATDGAG HYSTATTFMT PADPTNPGGG NGNTGGNNGN
TGGNTGNNGA TGGNNGNGSN TGSNPNGGSG LGTTGSGLGS LGNGLGTNGS GYHPKLSTTS
YGTGNHGKTG YLPSTGEKES SAVTTSLFGA FVALLASMGI IKRKRKN EF069-3 (SEQ ID
NO:263) AGGTGAA GCAGGTGCGA ATGAAACGTT AACCGCCGTA GCGAAAAACG
CCAGCGGTAC AGAAAGTACG CCAACAACGT TCCAAACGCC AGCGGATCCT AATACGCCCG
TGGCGACGCC AATTGTTGAG ACTGTAACAG GTAGTACAAC AAAAGGCTAT GAGGTCAAAG
GGACTGCTGA AGTTGGCACC ACCATTGAGG TTCGCGATGC AGCTGGCACG GTCCTTGGTA
CTGCAACAAC TGGAACTGAC GGAAAATATA CAGTGACTTT AGATTCAGGA ACAGCAACAG
CAAATCAAAC GCTGAGCGTT GTAGCGAAAA ACGCTAGTGG CACGGAAAGT CAACCAGCAA
CGGCGACAAC ACCAGCTGAT GTCACTGCAC CAACAGTTGA TAACATCACA CGCAACTCTG
GTTCGGGTTA TGAAATTACA GGAACAGCAG ACCCTAACAC AACAATCGAA GTTCGTGATC
CATCTGGGGC AGTCATTGGT ACAGGTACCT CTGATGCGAA TGGTGATTTT ACTGTAACGC
TACCAACGGG AACGACCAAT CCTGGGGATA CGTTAACAGT GATTGGAAAG GATAACGCGG
GAAATGAAAG TCAACCGACT GAAGTCCTTG TTCCTCCTGA TGCCACGGTT ACAGCACCAA
CTGTAACAGC AGTAACAGGT AATTCAGTTG CTGCTTATCA GGTGACAGGC ACCGCTGATC
CGAATGCTAC CATCCAAATT CGTGATGCAG ATGGGAACCT GATTGCAACA GGGACTCCCG
ATGCGACTGG TTCCTTTGCT GTGAACCTTC CAGCTGGGAC GGCAAATGCG AATGAAACAT
TGACAGCGTT AGCCAAAGAT CCTGCTGGCA ATACAAGTAC ACCGACAACC TTCCAAACAC
CAGCAGATGA AGTAGTGGCA CCGCCAAGTG TCGACAAAGT TACTGGGAAT ACAACACAAG
GATATCAAGT GACAGGTACC GCTGAACTTG GOACCACCAT TGAAGTTCGT GCAACAGACG
GAACAGTTTT AGGCACCGCA ACAACTGGAC CGACTGGCCA ATATACTGTG ACGTTAGCTT
CAGGAAAAGC AACAGCTAAA CAAACAGTGA ATGTAGTTGC TAAAAATGAT ACTGCACTTG
AGAGTCAACC AACTACAGCT ATGACACCCG CTGATGTTAC CACACCAACA ATTGGTGACA
TTACTGGAGA TTCAACAACT GGTTATGAAA TCACTGGGAC GGCGGACCCT AATACCACCA
TTGAACTACG GAACCCAGAT GGAACAATTA TTGGTACAAC GACAACGGAT GATCAACGAA
ACTTTACTGT GGACCTTCCA GCGGGAGCCG CTAATCCTGG TGATACATTA ACAGTTGTTG
GAAAAGACCG TGACGGCAAT GAAAGTCAAC CAACGGAAGT GACGGTCCCT CAAGATGCAA
CCGTACCAGC ACCAACTGTC ACGACTGTTA CAGGAA EF069-4 (SEQ ID NO:264)
AGEA GANETLTAVA KNASGTEXTP TTFQTPADEA TVTAPTTTGV TGNSTAGYEV
KGTADANATV EIRNAGGTVI GTGTADGTGA FTVTVPAGEA GANETLTAVA KNASGTESTP
TTFQTPADEA TVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGAVI GTGTADGTGA
FTVTIPAGEA GANETLTAVA KNASGTESTP TTFQTPADPN TPVATPIVET VTGSTTKGYE
VKGTAEVGTT IEVRDAAGTV LGTATTGTDG KYTVTLDSGT ATANQTLSVV AKNASGTESQ
PATATTPADV TAPTVDNITG NSGSGYETTG TADPNTTIEV RDPSGAVIGT GTSDANGDFT
VTLPTGTTNP GDTLTVIGKD NAGNESQPTE VLVPADATVT APTVTGVTGN SVAGYQVTGT
ADPNATIEIR DADGNVIATG TADGTGSFAV NLPAGTANAN ETLTALAKDP AGNTSTPTTF
QTPADEVVAP PSVDKVTGNT TQGYQVTGTA ELGTTIEVRA TDCTVLGTAT TGPTGQYTVT
LASGKATAKQ TVNVVAKNDT GLESQPTTAM TPADVTTPTI GDITGDSTTG YEITCTADPN
TTIEVRNPDG TIIGTTTTDD QGNFTVDLPA GAANPGDTLT VVGKDGDGNE SQPTEVTVPE
DAT VAAPTVT TVTGT EF070-1 (SEQ ID NO:265) TAGGGGAAGC TAATGATCTT
GGTATTTATC GTTTATTTTA AAGAAAAGAG GGACGATCAG ATGAAAAAGA AAATTGTTGA
GGATTTTAAT CGGAAAAGTC AGCATAAAAA ATGGACAAAA CGCAAGATCC TTAATTTAGC
AATATCAAGT GGTTTATTAT TTACGTCATT AGCAATCCCT GTAAGTATAG CTGTTACCTC
TGGCACAATC AGTGCATCAG CAGCGGTCTT GGATATCGAA CTATTATCAA ATGTTACGTC
AAATAATGAC AGTGGCACTT CAACGAGTAA TCGTTGGACA GCCGCAAACC AAAATCAACC
AGTTAATTTC ACGGTTTCTG CTGGCGCTTT ACCAGATGCT TCCGCTGTGT TTAGTGGACA
AAAACAAGCG GTGTTAGTGG TTCCTCCTGA GTTAAGAGGA AATGTAGCTG CAGCAGGCAG
CGCAGCAATC AATACCAATG TCACGATTGA TCTTTCAAAA GTTACTTTTT TGACTGCCGT
TTTGAATGCA GCCAATGATT TAACCAATGT GATTACTCAA ATTACCAGTG GGGCGTTAGG
GAATTTAACT GGTGTTGATA TTGATTTGAC GGAAGTGAAT CCTCAATTGG AATTAGTTAA
TAACATTGAA AACTTAGGTG CTGCTTCATT TACAGCTCCG GAAACGTTAG CAGCTGACGG
CTCATACATT AGTGCACCGA TTAGTGATGG TTTAGGGTTA GTTTTAGCCC AAAATGTTTC
AAACATCTTA CAAGATTTGA ATGCGCCAGT TCAAGCTTTG GAGGCAAAAG GTACCAGTAT
CCCAAGTAAT CTTGTCGCCG CAGCTATAAA TGCAGCCTTG CTTCCTGTCA AAGGCACGGT
AAACGTGGCT GTTTCAGGTG CTTTGCCTTT ATTAGCGGTT GGTGGTTCAG GCGTAAATGA
GTTAGTGGAT GCTTCTTTAC TAGGCACAAC CACGGTTACT TTACCAACTA CCGTTTCAAC
ACCTCAAAAT TTATCCAATA ATTTAGATGC TCGTTTTGTA GGAACAGTCG TTCAAACAGA
TCTTTTAGAC GTTAATTTAT TAGCAACAGC AGACGGTGTA TCCAACATTT ATTTTGCTGC
AGGCACTACT ACTGAAGTAA CCGCACCAAC AATCACAGGA GTAACAGGTA ATTCAACAGC
AGGTTACGAA GTTAAAGGAA CTGCCGATCC CAATGCCACG GTTGAAATCC GAAATCCAGG
AGGCACCGTA ATAGGCACAG GTACCGCTGA TGGGACAGGA GCGTTTACAG TTACCGTTCC
CGCAGGTGAA CCAGGCGCCA ATGAAACGTT AACCGCCGTA GCGAAAAACG CCAGCGGNAC
AGAAAGNACG CCAACAACGT TCCAAACNCC AGCGGATGAA GCAACCGTAA CCGCACCAAC
AATCACAGGA GTGACAGGTA ATTCAACGGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC
CAATGCCACG GTTGAAATCC GAAATGCAGG AGGCACCGTA ATAGGCACAG GTACCGCTGA
TGGGACAGGA GCGTTTACAG TTACCGTTCC CGCAGGTGAA GCAGGTGCCA ATGAAACGTT
AACCGCCGTA GCGAAAAACG CCAGCGGCAC AGAAAGTACG CCAACAACGT TCCAAACACC
AGCGGATGAA GCAACCGTAA CCGCACCAAC AATCACAGGA GTGACAGGTA ATTCAACAGC
AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG GTTGAGATCC GAAATGCAGG
AGGTGCCGTG ATAGGTACAG GTACTGCTGA TGGGACAGGG GCATTTACAG TTACCATTCC
CGCAGGTGAA GCAGGTGCGA ATGAAACGTT AACCGCCCTA GCGAAAAACG CCAGCGGTAC
AGAAAGTACG CCAACAACGT TCCAAACGCC AGCGGATCCT AATACGCCCG TGGCCACGCC
AATTGTTGAG ACTGTAACAG GTAGTACAAC AAAAGGCTAT GAGGTCAAAG GGACTGCTGA
AGTTGGCACC ACCATTGAGG TTCGCGATGC AGCTGGCACG GTCCTTGGTA CTGCAACAAC
TGGAACTGAC GGAAAATATA CAGTGACTTT AGATTCAGGA ACAGCAACAG CAAATCAAAC
GCTGAGCGTT GTAGCGAAAA ACGCTAGTGG CACGGAAAGT CAACCAGCAA CGGCGACAAC
ACCAGCTGAT GTCACTGCAC CAACAGTTGA TAACATCACA GGCAACTCTG GTTCGGGTTA
TGAAATTACA GGAACAGCAG ACCCTAACAC AACAATCGAA GTTCGTGATC CATCTGGGGC
AGTCATTGGT ACAGGTACCT CTGATGCGAA TGGTGATTTT ACTGTAACGC TACCAACGGG
AACGACCAAT CCTGGGGATA CGTTAACAGT GATTGGAAAG GATAACGCGG GAAATGAAAG
TCAACCGACT GAAGTCCTTG TTCCTGCTGA TGCCACGGTT ACAGCACCAA CTGTAACAGG
AGTAACAGGT AATTCACTTG CTGCTTATCA GGTGACAGGC ACCGCTGATC CGAATGCTAC
CATCGAAATT CGTCATGCAG ATGGGAACGT GATTGCAACA GGGACTGCCG ATGGGACTGG
TTCCTTTGCT GTGAACCTTC CAGCTGGGAC GGCAAATGCG AATGAAACAT TGACAGCGTT
AGCCAAAGAT CCTGCTGGCA ATACAAGTAC ACCGACAACC TTCCAAACAC CAGCAGATGA
AGTAGTGGCA CCGCCAAGTG TCGACAAAGT TACTGGGAAT ACAACACAAG GATATCAAGT
GACAGGTACC GCTGAACTTG GCACCACCAT TGAAGTTCGT GCAACAGACG GAACACTTTT
AGGCACCGCA ACAACTGGAC CGACTGGCCA ATATACTGTG ACGTTAGCTT CAGGAAAAGC
AACAGCTAAA CAAACAGTGA ATGTAGTTGC TAAAAATGAT ACTGGACTTG AGAGTCAACC
AACTACAGCT ATGACACCCG CTGATGTTAC CACACCAACA ATTGGTGACA TTACTGGAGA
TTCAACAACT GGTTATGAAA TCACTGGGAC GGCGGACCCT AATACCACCA TTGAAGTACG
GAACCCAGAT GGAACAATTA TTGGTACAAC GACAACGGAT GATCAAGGAA ACTTTACTGT
CGACCTTCCA GCGCGAGCCG CTAATCCTGG TGATACATTA ACAGTTGTTG GAAAAGACGG
TGACGGCAAT GAAAGTCAAC CAACGGAAGT GACGGTCCCT GAAGATGCAA CCGTAGCAGC
ACCAACTGTG ACGACTGTTA CAGGAACAAC TGCCACTGGG TATCAAGTAA CCGGCACGGC
AGAGCCAAAT GTCACCATTG ACATTCACAA TGAAGCAGGT TTAGTTATTG CTACGGGAAC
GACTGATGGT GCTCGCGCAT TTACAATCAC TCTTCCGACG GGCACAGCAA CAGCTAACGA
AGCCTTAACT GCCATTGCGA AAGATGCTGC TGGGAAAGAA AGTAATCCGA CTGCTTTCAA
AACACCTGCT GATCCAGATG CACCAGTCGC GACACCTACT GTTGACAAAA TCACTGGTAG
CACGACAAAC GGCTATCAAG TAGTAGGAGC AGCAGAAGTT GGTACAACAG TTGAGGTGCG
TGACGCCGAT GGCACAGTCC TTGGCATGGC AACTACTGGA ACTGATGGCA AATACACAGT
GACTTTAGAG CCAGGCAAGG CCTCAGCTAA CGAAACAATA ACTGTCGTAG CGAAAAATGC
AACAGGAAAA GAAAGTCAGC CAGCTACAGC AACTACACCA GTCGACTTAG CCACACCAAC
CATTGATTCT ATTACCGGAA ATTCTAGTAA AGGTTACGAA ATCACTGGAA CGGCGGAGCC
AAAAACCACT ATTGATGTCC GTGACGCAGA CGGAACCATC ATTGCTGCTA CAACTGCTAA
CGAAACCGGC CAATATACGG TGACTCTACC AGCTGGCGTA GTGACACCAG GAGAAACGAT
TACGATTATT AGCAAAGATG GCGCAGGTAA TGAAAGTCAA CCAGCTACAG CCGTTATTCC
AGCGGATGTT GTTTTAGCGG CGCCAACTAT TACGAAGCTT GAAGGAAACA AAGCCAATGG
CTATACAGTC ACTGGAACTG CTGATCCAAA TGTCACGGTT CAATTTTACA ATAGCAGTGA
ACAATTATTG GCAAGTGGCA ATACAACTAC TGGAGGTACC TTCTCCGTTC ATATTGCAGC
AGGGTTAGCA ACAGAAAAAG AAACGTTAAC CGCACTAACC ACAGATACAC AAGGAAATGT
GAGTCCTAAA ACCACATTTA TGACGCCAGC CGATATTACG GGAGAACCAG AGATTAAAAT
TGCGGCACCA ACTGTTTCTT CAGTTTTAGG AACGTCTAAA GCCGGCTACC TCATCAAAGG
AACAGCTGAA CCAAACCGAA TCATTCAAAT TAGTAACCGA CTATTAAGAA GTGTGATTGC
TGTAGGTGCC ACCGATGCTG AAGGCAACTT CGCTATCCAA TTAACAGCGG GACAAGCGAC
TGCTCAACAA AGTTTACTTG CGACAGCTAC CGATGGCGCA GGACATTACA GTACGGCTAC
AACCTTCATG ACGCCAGCCG ACCCAACGAA TCCTGGAGGA GGCAATGGTA ACACTGGCGG
AAATAACGGC AATACAGGCG GCAATACAGG AAACAATGGC GCAACTGGCG GGAATAATGG
GAATGGTTCA AACACAGGTT CAAATCCAAA TGGAGGTTCT GGTTTAGGCA CAACAGGTTC
TGGCTTAGGT TCACTAGGCA ATGGCCTCGG TACAAATGGT AGTGGCTACC ACCCTAAACT
AAGTACCATC AGTTATGGCA CTGGAAATCA CGGGAAAACA GGCTACTTAC CTAGCACAGG
TGAAAAAGAG TCTTCAGCCG TGACAACAAG TTTGTTTGGC GCCTTTGTCG CACTCCTTGC
GAGCATGGGA ATCATCAAAC GCAAACGTAA AAACTAG EF070-2 (SEQ ID NO:266) M
KKKIVEDFNR KSQHKKWTKR KMLNLAISSG LLFTSLAIPV SIAVTSGTIS ASAAVLDIEL
LSNVTSNNDS GTSTSNRWTA ANQNQPVNFT VSGGALADAS AVFSGQKQAV LVVPPELRGN
VAAAGSAAIN TNVTIDLSKV TFLTAVLNAA NDLTNVITQI TSGALGNLTG VDIDLTEVNR
QLELVNNIEN LGAASFTAPE TLAADGSYIS APISDGLGLV LAQNVSNILQ DLNAAVQALE
AKGTSIPSNL VAAAINAALL PVKGTVNVAV SGALPLLAVG GSGVNELVDA SLLGTTTVTL
PTTVSTPQNL SNNLDARFVG TVVQTDLLDV NLLATADGVS NIYFAAGTTS EVTAPTITGV
TGNSTAGYEV KGTADANATV EIRNAGGTVI GTGTADGTGA FTVTVPAGEA GANETLTAVA
KNASGTEXTP TTFQTPADEA TVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGTVI
GTGTADGTGA FTVTVPAGEA GANETLTAVA KNASGTESTP TTFQTPADEA TVTAPTITGV
TGNSTAGYEV KGTADANATV EIRNAGGAVI GTGTADGTGA FTVTIPAGEA GANETLTAVA
KNASGTESTP TTFQTPADPN TPVATPIVET VTGSTTKGYE VKGTAEVGTT IEVRDAAGTV
LGTATTGTDG KYTVTLDSGT ATANQTLSVV AKNASGTESQ PATATTPADV TAPTVDNITG
NSGSGYEITG TADPNTTIEV RDPSGAVIGT GTSDANGDFT VTLPTGTTNP GDTLTVTGKD
NAGNESQPTE VLVPADATVT APTVTGVTGN SVAGYQVTGT ADPNATIEIR DADGNVIATG
TADGTGSFAV NLPAGTANAN ETLTALAKDP AGNTSTPTTF QTPADEVVAP PSVDKVTGNT
TQGYQVTGTA ELGTTIEVRA TDGTVLGTAT TGPTGQYTVT LASGKATAKQ TVNVVAKNDT
GLESQPTTAM TPADVTTPTI GDITGDSTTG YEITGTADPN TTIEVRNPDG TIIGTTTTDD
QGNFTVDLPA GAANPGDTLT VVGKDGDCNE SQPTEVTVPE DATVAAPTVT TVTGTTATGY
QVTGTAEPNV TIEIHNEAGL VIATGTTDGA GAFTITLPTG TATANEALTA IAKDAAGKES
NPTAFKTPAD PDAPVATPTV DKITGSTTNG YQVVGAAEVG TTVEVRDADG TVLGMATTGT
DGKYTVTLEP GKASANETIT VVAKNATGKE SQPATATTPV DLATPTTDSI TCNSSKGYEI
TGTAEPKTTI DVRDADGTII AATTANETGQ YTVTLPAGVV TPGETITITS KDGAGNESQP
ATAVIPADVV LAAPTITKVE GNKANGYTVT GTADPNVTVQ FYNSSEQLLA SGNTTTGGTF
SVHIAAGLAT EKETLTALTT DTQGNVSPKT TFMTPADTTG EPEIKTAAPT VSSVLGTSKA
GYLIKGTAEP NRIIQISNRL LRSVIAVGAT DAEGNFAIQL TAGQATAQQS LLATATDGAG
HYSTATTFMT PADPTNPGGG NGNTGGNNGN TGGNTGNNGA TGGNNGNGSN TGSNPNGGSG
LGTTGSGLGS LGNGLGTNGS GYHPKLSTIS YGTGNHGKTG YLPSTGEKES SAVTTSLFGA
FVALLASMGI IKRKRKN EF070-3 (SEQ ID NO:267) CGG TGACGGCAAT
GAAAGTCAAC CAACGGAAGT GACGGTCCCT GAAGATGCAA CCGTAGCAGC ACCAACTGTG
ACGACTGTTA CAGGAACAAC TGCCACTGGG TATCAACTAA CCGGCACGGC AGAGCCAAAT
GTCACCATTG AGATTCACAA TGAAGCAGGT TTAGTTATTG CTACGGGAAC GACTGATGGT
GCTGGCGCAT TTACAATCAC TCTTCCGACC GGCACAGCAA CAGCTAACGA AGCCTTAACT
GCCATTGCGA AAGATGCTGC TGGGAAAGAA AGTAATCCGA CTGCTTTCAA AACACCTGCT
GATCCAGATG CACCAGTCGC GACACCTACT GTTGACAAAA TCACTGGTAG CACGACAAAC
GGCTATCAAG TAGTAGGAGC AGCAGAAGTT GGTACAACAG TTGAGGTGCG TGACGCCGAT
GGCACAGTCC TTGGCATGGC AACTACTGGA ACTGATGGCA AATACACAGT GACTTTAGAG
CCAGGGAAGG CCTCAGCTAA CGAAACAATA ACTGTCGTAG CGAAAAATGC AACAGGAAAA
GAAAGTCAGC CAGCTACAGC AACTACACCA GTCGACTTAG CCACACCAAC CATTGATTCT
ATTACCGGAA ATTCTAGTAA AGGTTACGAA ATCACTGGAA CGGCGGAGCC AAAAACCACT
ATTGATGTCC GTGACGCAGA CGGAACCATC ATTGCTGCTA CAACTGCTAA CGAAACCGGC
CAATATACGG TGACTCTACC AGCTGGCGTA GTCACACCAG GAGAAACGAT TACGATTATT
AGCAAAGATG GCGCAGGTAA TGAAAGTCAA CCAGCTACAG CCGTTATTCC AGCGGATGTT
GTTTTAGCGG CGCCAACTAT TACGAAGGTT GAAGGAAACA AAGCCAATGG CTATACAGTC
ACTGGAACTG CTGATCCAAA TGTCACGGTT CAATTTTACA ATAGCAGTGA ACAATTATTG
GCAAGTGGCA ATACAACTAC TGGAGGTACC TTCTCCGTTC ATATTGCAGC AGGGTTAGCA
ACAGAAAAAG AAACGTTAAC CGCACTAACC ACAGATACAC AAGGAAATGT GAGTCCTAAA
ACCACATTTA TGACGCCAGC CGATATTACG GGAGAACCAG AGATTAAAAT TGCGGCACCA
ACTGTTTCTT CAGTTTTAGG AACGTCTAAA GCCGGCTACC TCATCAAAGG AACAGCTGAA
CCAAACCGAA TCATTCAAAT TAGTAACCGA CTATTAAGAA GTGTGATTGC TGTAGGTGCC
ACCGATGCTG AAGGCAACTT CGCTATCCAA TTAACAGCGG GACAAGCGAC TGCTCAACAA
AGTTTACTTG CGACAGCTAC CGATGGCGCA GGACATTACA GTACGGCTAC AACCTTCATG
ACGCCAGCCG ACCCAACGAA TCCTGGAGGA GGCAATGGTA ACACTGGCGG AAATAACGGC
AATACAGCCG GCAATACAGG AAACAATGGC GCAACTGGCG GGAATAATGG GAATGGTTCA
AACACAGGTT CAAATCCAAA TGGAGGTTCT GGTTTAGGCA CAACAGGTTC TGGCTTAGGT
TCACTACGCA ATGGCCTCGG TACAAATGGT
ACTGGCTACC ACCCTAAACT AAGTACCATC AGTTATGGCA CTGGAAATCA CGGGAAAACA
GGCTACT EF70-4 (SEQ ID NO:268) DGDGNE SQPTEVTVPE DATVAAPTVT
TVTGTTATGY QVTGTAEPNV TIEIHNEAGL VIATGTTDGA GAFTITLPTG TATANEALTA
IAKDAAGKES NPTAFKTPAD PDAPVATPTV DKITGSTTNG YQVVGAAEVG TTVEVRDADG
TVLGMATTGT DGKYTVTLEP GKASANETIT VVAKNATGKE SQPATATTPV DLATPTIDST
TGNSSKGYEI TGTAEPKTTI DVRDADGTII AATTANETGQ YTVTLPAGVV TPGETITIIS
KDGAGNESQP ATAVIPADVV LAAPTITKVE GNKANGYTVT GTADPNVTVQ FYNSSEQLLA
SGNTTTGGTF SVHIAAGLAT EKETLTALTT DTQGNVSPKT TFMTPADITG EPEIKIAAPT
VSSVLGTSKA GYLIKGTAEP NRIIQISNRL LRSVIAVGAT DAEGNFAIQL TAGQATAQQS
LLATATDGAG HYSTATTFMT PADPTNPGGG NGNTGGNNGN TGGNTGNNGA TGGNNGNGSN
TGSNPNGGSG LGTTGSGLGS LGNGLGTNGS GYHPKLSTIS YGTGNHGKTG YL EF071-1
(SEQ ID NO:269) TAAGTAGAAG TGGTCGGGAC AAACGTAGAA CTTTCGCTGA
TTGCCGAAGA AATTACTTCT GTCCCGCCAT TTATCTGCAG GTTTAAGCCG TGGAAGGGAA
GTTATTTTGA CTTTCCTTTC ATCGCTTTTT TAAGAAAGGA GCATGCTATG TTTAAAAAAT
TAATGATTCA ACTTGCTTTA GTGATTCGTT TAAGTTTAAC GATTCCGATG ACGGCTTNCG
CTTACACCAT CGAAGCGGAT CCAATCAACT TTACTTATTT TCCCGGCTCT GCAAGCAATG
AATTAATTGT TTTACATGAA TCTGGAAACG AGCGGAACCT AGGACCACAC AGTTTAGACA
ATGAAGTGGC CTATATGAAA CGAAATTGGT CAAATGCTTA TGTCTCATAT TTTGTCGGAT
CTGGTGGACG AGTGAAACAA TTAGCTCCTG CTGGCCAAAT TCAATATGGC GCAGGTTCTT
TAGCTAATCA AAAAGCCTAT GCGCAAATCG AATTGGCTCG AACGAATAAT GCGGCGACAT
TTAAAAAAGA TTATGCTGCC TATGTTAATT TGGCCCGTGA TTTGGCTCAG AACATTGGTG
CTGATTTTTC TCTGGACGAT GGAACAGGTT ATGGCATAGT CACTCATGAT TGGATTACAA
AAAATTGGTG GGGAGATCAT ACAGATCCTT ATCGTTATTT AGCGCGTGGG GGATTAGTAA
AGCGCATTGG CACNAGATTT ACAACGGGCG TTTCNGNAAC AGGTGAGACT GGTCATTATT
CAGCCAGGTA A EF071-2 (SEQ ID NO:270) MF KKLMIQLALV IGLSLTIPMT
AXAYTIEADP INFTYFPGSA SNELIVLHES GNERNLGPHS LDNEVAYMKR NWSNAYVSYF
VGSGGRVKQL APAGQIQYGA GSLANQKAYA QIELARTNNA ATFKKDYAAY VNLARDLAQN
IGADFSLDDG TGYGIVTHDW ITKNWWGDHT DPYGYLARGG LVKRIGTRFT TGVSXTGETG
HYSAR EF071-3 (SEQ ID NO:271) C TTTAAAAAAT TAATGATTCA ACTTGCTTTA
GTGATTGGTT TAAGTTTAAC GATTCCGATG ACCGCTTNCG CTTACACCAT CGAAGCGGAT
CCAATCAACT TTACTTATTT TCCCGGCTCT GCAAGCAATG AATTAATTGT TTTACATGAA
TCTGGAAACG AGCGGAACCT AGGACCACAC AGTTTAGACA ATGAAGTGGC CTATATGAAA
CGAAATTGGT CAAATGCTTA TGTCTCATAT TTTGTCGGAT CTGGTGGACG AGTGAAACAA
TTAGCTCCTG CTGGCCAAAT TCAATATGGC GCAGGTTCTT TAGCTAATCA AAAAGCCTAT
GCGCAAATCG AATTGGCTCG AACGAATAAT GCGGCGACAT TTAAAAAAGA TTATGCTGCC
TATGTTAATT TGGCCCGTGA TTTGGCTCAG AACATTGGTG CTGATTTTTC TCTGGACGAT
GGAACAGGTT ATGGCATAGT CACTCATGAT TGGATTACAA AAAATTGGTG GGGAGATCAT
ACAGATCCTT ATGGTTATTT AGCGCGTGGG GGATTAGTAA AGCGCATTGG CACNAGATTT
ACAACGGGCG TTTCNGNAAC AGGTGACACT GGTCATTATT CAGCCAGGT EF071-4 (SEQ
ID NO:272) F KKLMIQLALV IGLSLTIPMT AXAYTIEADP INFTYFPGSA SNELIVLHES
GNERNLGPHS LDNEVAYMKR NWSNAYVSYF VGSGGRVKQL APAGQIQYCA GSLANQKAYA
QIELARTNNA ATFKKDYAAY VNLARDLAQN IGADFSLDDG TGYGIVTHDW ITKNWWGDHT
DPYGYLARGG LVKRIGTRFT TGVSXTGETG HYSAR EF072-1 (SEQ ID NO:273)
TAATCAATGA AAAACGCACG TTGGTTAAGT ATTTGCGTCA TGCTACTCGC TCTTTTCGGG
TTTTCACAGC AAGCATTAGC AGAGGCATCG CAAGCAAGCG TTCAAGTTAC GTTGCACAAA
TTATTGTTCC CTGATGGTCA ATTACCAGAA CAGCAGCAAA ACACAGGGGA AGAGGGAACG
CTGCTTCAAA ATTATCGGGG CTTAAATGAC GTCACTTATC AAGTCTATGA TGTGACGGAT
CCGTTTTATC AGCTTCGTTC TGAAGGAAAA ACGGTCCAAG AGGCACAGCG TCAATTAGCA
GAAACCGGTG CAACAAATAG AAAACCGATC GCAGAAGATA AAACACAGAC AATAAATGGA
GAAGATGGAG TGGTTTCTTT TTCATTAGCT AGCAAAGATT CGCAGCAACG AGATAAAGCC
TATTTATTTG TTGAAGCGGA AGCACCAGAA GTGGTAAAGG AAAAAGCTAG CAACCTAGTA
GTGATTTTGC CTGTTCAAGA TCCACAAGGG CAATCGTTAA CCCATATTCA TTTATATCCA
AAAAATCAAG AAAATGCCTA TGACTTACCA CCACTTGAAA AAACGGTACT CGATAAGCAA
CAAGGCTTTA ATCAAGGAGA GCACATTAAC TATCAGTTAA CCACTCAGAT TCCAGCGAAT
ATTTTAGGAT ATCAGCAATT CCGTTTGTCA GATAAGGCGG ATACAACGTT GACACTTTTA
CCAGAATCAA TTGAGGTAAA AGTGGCTGGA AAAACAGTTA CTACAGGTTA CACACTGACG
ACGCAAAAGC ATGGATTTAC GCTTGATTTT TCAATTAAAG ACTTACAAAA CTTTGCAAAT
CAAACAATGA CTGTGTCGTA TCAAATGCGT TTAGAAAAGA CCGCTGAACC TGACACTGCG
ATTAACAACG AAGGACAATT AGTCACGGAC AAACATACCT TGACTAAAAG AGCCACAGTT
CGTACAGGCG GCAAGTCTTT TGTCAAAGTT GATAGTGAAA ATGCGAAAAT CACCTTGCCA
GAGGCTGTTT TTATCGTCAA AAATCAAGCG GGGGAATACC TCAATGAAAC AGCAAACGGG
TATCGTTGGC AAAAAGAAAA AGCATTAGCT AAAAAATTCA CGTCTAATCA AGCCGGTGAA
TTTTCAGTTA AAGGNNTTAA AAGATCGCCA GTACTTCTTG GAAGAAATCT CTCCACCAAA
AGGTTATCTT CTGAATCAAA CAGAAATTCC TTTTACGGTG GGAAAAAATT CTTATGCAAC
GAACGGACAA CGAACAGCAC CGTTACATGT AATCAATAA EF072-2 (SEQ ID NO:274)
MKNARWLSI CVMLLALFGF SQQALAEASQ ASVQVTLHKL LFPDGQLPEQ QQNTGEEGTL
LQNYRGLNDV TYQVYDVTDP FYQLRSEGKT VQEAQRQLAE TGATNRKPIA EDKTQTINGE
DGVVSFSLAS KDSQQRDKAY LFVEAEAPEV VKEKASNLVV ILPVQDPQGQ SLTHIHLYPK
NEENAYDLPP LEKTVLDKQQ GFNQGEHINY QLTTQIPANI LGYQEFRLSD KADTTLTLLP
ESIEVKVAGK TVTTGYTLTT QKHGFTLDFS IKDLQNFANQ TMTVSYQMRL EKTAEPDTAI
NNEGQLVTDK HTLTKRATVR TGGKSFVKVD SENAKITLPE AVFIVKNQAG SYLNETANGY
RWQKEKALAK KFTSNQAGEF SVKGXKRWPV LLGRNLCTKR LSSESNRNSF YGGKKFLCNE
RTTNSTVTCN Q EF072-3 (SEQ ID NO:275) ATTACCAGAA CAGCAGCAAA
ACACAGGGGA AGAGCGAACG CTGCTTCAAA ATTATCGGGG CTTAAATGAC GTCACTTATC
AAGTCTATGA TGTGACGGAT CCGTTTTATC AGCTTCGTTC TGAAGGAAAA ACGGTCCAAG
AGGCACAGCG TCAATTAGCA GAAACCGGTG CAACAAATAG AAAACCGATC GCAGAAGATA
AAACACAGAC AATAAATGGA GAAGATGGAG TGGTTTCTTT TTCATTAGCT AGCAAAGATT
CGCAGCAACG AGATAAAGCC TATTTATTTG TTGAAGCGGA AGCACCAGAA GTGGTAAAGG
AAAAAGCTAG CAACCTAGTA GTGATTTTGC CTGTTCAAGA TCCACAAGGG CAATCGTTAA
CGCATATTCA TTTATATCCA AAAAATGAAG AAAATGCCTA TGACTTACCA CCACTTGAAA
AAACGGTACT CGATAAGCAA CAAGGCTTTA ATCAAGGAGA GCACATTAAC TATCAGTTAA
CGACTCAGAT TCCAGCGAAT ATTTTAGGAT ATCAGGAATT CCGTTTGTCA GATAAGGCGG
ATACAACGTT GACACTTTTA CCAGAATCAA TTGAGGTAAA AGTGGCTGGA AAAACAGTTA
CTACAGGTTA CACACTGACG ACGCAAAAGC ATGGATTTAC GCTTGATTTT TCAATTAAAG
ACTTACAAAA CTTTGCAAAT CAAACAATGA CTGTGTCGTA TCAAATGCGT TTAGAAAAGA
CCGCTGAACC TGACACTGCG ATTAACAACG AAGGACAATT AGTCACGGAC AAACATACCT
TGACTAAAAG AGCCACAGTT CGTACAGGCG GCAAGTCTTT TGTCAAAGTT GATAGTGAAA
ATGCGAAAAT CACCTTGCCA GAGGCTCTTT TTATCGTCAA AAATCAAGCG GGGGAATACC
TCAATGAAAC AGCAAACGGG TATCGTTGGC AAAAAGAAAA AGCATTAGCT AAAAAATTCA
CGTCTAATCA AGCCGGTGAA TTTTCAGTTA AAGGNNTTAA AAGATGGCCA GTACTTCTTG
GAAGAAATCT CTGCACCAAA AGGTTATCTT CTGAATCAAA CAGAAATTCC TTTTACGGTG
GGAAAAAATT CTTATGCAAC GAACGGACAA CGAACAGCAC CGTTACATGT A EF072-4
(SEQ ID NO:276) QLPEQ QQNTGEEGTL LQNYRGLNDV TYQVYDVTDP FYQLRSEGKT
VQEAQRQLAE TGATNRKPIA EDKTQTINGE DGVVSFSLAS KDSQQRDKAY LFVEAEAPEV
VKEKASNLVV ILPVQDPQGQ SLTHIHLYPK NEENAYDLPP LEKTVLDKQQ GFNQGEHINY
QLTTQIPANT LGYQEFRLSD KADTTLTLLP ESIEVKVAGK TVTTGYTLTT QKHGFTLDFS
IKDLQNFANQ TMTVSYQMRL EKTAEPDTAI NNEGQLVTDK HTLTKRATVR TGGKSFVKVD
SENAKITLPE AVFIVKNQAG EYLNETANGY RWQKEKALAK KFTSNQAGEF SVKGXKRWPV
LLGRNLCTKR LSSESNRNSF YGGKKFLCNE RTTNSTVTC EF073-1 (SEQ ID NO:277)
TAAATGAACA AATTAAATAC AAAATTACTG ATTGGCTATA TTCTTTTAGG AGCCTTAATC
ATTGCTGTCG CTAGAGAATA TGGCTTCTTC GCTTTTGTGA TTCTGGTAGG CTTTTTAGTA
TTCGTTCTCT ATCGAAAAAA GAAAAATGCC GCCGACAAAA GCGATCAAAT GCCTTACTTA
ACGAAAGATA AAGAAGCCCA TTATCGTGAG TTGGGGTTAT CTCCACAAGA AATTGATTTT
TTCAGAAGTA CAATGAGCAC ACCCAAAAAA CAAATCATAC AATTGCAAGA AAACATGAAT
CGTTCAACTA AATTACGGGC GATTGACTTA CGTAATGATA CTACGAAGGT TTCTAAAGCT
CTGTTTAAAG AGTTAGTGAA AGAACCTAAA AAGTTACACT TAGCCAATCA CTTTCTCTAT
ACACATTTAC CAAATATCGT TGACTTAACA AGTAAACATT TAGAAATCGA ACAACACGAA
GTAAAAAACA AACAAACGTA TGAAAAATTA GAAGAAAGCG CACAAATCAT TGACCAATTG
TCAAAATTAG TTAAAAATGA TTATGAGGAA ATCGTTTCCG ATGACTTAGA CGATTTAGAT
GTCGAAATGT CGATCGCTAA AAGCAGCTTG TCGCAAAAAG CTGCAACTGA GCAATCACCT
CAAGTAAACG AAGACCAGCA ATAA EF073-2 (SEQ ID NO:278) MNKLNTKLLI
GYILLOALTI AVAREYGFFA FVILVGFLVF VLYRKKKNAA DKSDQMPYLT KDKEAHYREL
GLSPQEIDFF RSTMSTAKKQ IIQLQENHNR STKLRAIDLR NDTTKVSKAL FKELVKEPKK
LHLANHFLYT HLPNIVDLTS KHLEIEQHEV KNKQTYEKLE ESAQIIDQLS KLVKNDYEEI
VSDDLDDLDV EMSIAKSSLS QKAATEESPQ VNEDQQ EF073-3 (SEQ ID NO:279) CT
ATCGAAAAAA GAAAAATGCC GCCGACAAAA GCGATCAAAT GCCTTACTTA ACGAAAGATA
AAGAAGCCCA TTATCGTGAG TTGGGGTTAT CTCCACAAGA AATTGATTTT TTCAGAAGTA
CAATGAGCAC AGCCAAAAAA CAAATCATAC AATTGCAAGA AAACATGAAT CGTTCAACTA
AATTACGGGC GATTGACTTA CGTAATGATA CTACGAAGGT TTCTAAAGCT CTGTTTAAAG
AGTTAGTGAA AGAACCTAAA AAGTTACACT TAGCCAATCA CTTTCTCTAT ACACATTTAC
CAAATATCGT TGACTTAACA AGTAAACATT TAGAAATCGA ACAACACGAA GTAAAAAACA
AACAAACGTA TGAAAAATTA GAAGAAAGCG CACAAATCAT TGACCAATTG TCAAAATTAG
TTAAAAATGA TTATGAGGAA ATCGTTTCCG ATGACTTAGA CGATTTAGAT GTCGAAATGT
CGATCGCTAA AAGCAGCTTG TCGCAAAAAG CTGCAACTGA GGAATCACCT CAAGTAAACG
AAGACCAGCA AT EF073-4 (SEQ ID NO:280) YRKKKNAA DKSDQMPYLT
KUKEAHYREL GLSPQEIDFF RSTMSTAKKQ IIQLQENMNR STKLRAIDLR NDTTKVSKAL
FKELVKEPKK LHLANHFLYT HLPNIVDLTS KHLEIEQHEV KNKQTYEKLE ESAQIIDQLS
KLVKNDYEEI VSDDLDDLDV EMSIAKSSLS QKAATEESPQ VNEDQQ EF074-1 (SEQ ID
NO:281) TAAAGGAGTT CTCAAAAAAT GAAGCTAAAA AAAATAATTC CTGCTTTTCC
CCTTCTTTCA ACCGTTGCAG TTGGCTTGTG GTTAACGCCT ACTCAAGCTT CTGCAGATGC
TGCGGATACG ATGGTAGATA TCTCTGGCAA AAAAGTGTTG GTTGGATATT GGCATAACTG
GGCCTCAAAA GGACGCGATG GTTACAAACA AGGAACATCA GCATCACTAA ACCTTTCAGA
AGTAAATCAA GCCTACAATG TCGTACCGGT TTCCTTCATG AAAAGCGATG GCACGACACG
GATTCCTACG TTCAAGCCTT ATAACCAAAC GGACACTGCC TTCCGACAAG AAGTCGCACA
ATTAAATAGT CAAGGTCGCG CAGTTTTATT GGCACTTGGT GGAGCAGATG CACATATTCA
ATTAGTCAAA GGCGATGAAC AAGCCTTTGC GAATGAAATC ATTCGTCAAG TGGAAACATA
CGGCTTTGAT GGTTTAGACA TCGACTTACA GCAATTGGCG ATTACTGCTG GCGACAACCA
AACCGTCATC CCTGCTACGT TGAAAATAGT CAAAGACCAT TATCGAGCAC AAGGAAAAAA
TTTCATCATT ACGATGGCAC CAGAATTCCC TTATTTAAAA CCTGGTGCCG CTTATGAAAC
ATACATTACT TCCCTAAATG GTTATTATGA TTACATTGCC CCACAATTAT ATAACCAAGG
CGGCGACGGT GTCTGGGTTG ATGAAGTTAT GACTTGGGTT GCTCAAAGCA ACGATGCTCT
AAAATACGAG TTCCTCTATN ATATT EF074-2 (SEQ ID NO:282) MKLKK
IIPAFPLLST VAVGLWLTPT QASADAADTM VDISGKKVLV GYWHNWASKG RDGYKQGTSA
SLNLSEVNQA YNVVPVSFMK SDGTTRIPTF KPYNQTDTAF RQEVAQLNSQ GRAVLLALGG
ADAHIQLVKG DEQAFANEII RQVETYGFDG LDIDLEQLAI TAGDNQTVIP ATLKIVKDHY
FAQGKNFIIT MAPEFPYLKP GAAYETYITS LNGYYDYIAP QLYNQGGDGV WVDEVMTWVA
QSNDALKYEF LYXI EF074-3 (SEQ ID NO:283) TGC TGCGGATACG ATGGTAGATA
TCTCTGGCAA AAAAGTGTTG GTTGGATATT GCCATAACTG GGCCTCAAAA GGACGCGATG
GTTACAAACA AGGAACATCA GCATCACTAA ACCTTTCAGA AGTAAATCAA GCCTACAATG
TCGTACCGCT TTCCTTCATG AAAAGCGATG GCACGACACG GATTCCTACG TTCAAGCCTT
ATAACCAAAC GCACACTGCC TTCCGACAAG AAGTCGCACA ATTAAATAGT CAAGGTCGCG
CAGTTTTATT GGCACTTGGT GGAGCAGATG CACATATTCA ATTAGTCAAA GGCGATGAAC
AAGCCTTTGC GAATGAAATC ATTCGTCAAG TGGAAACATA CGGCTTTGAT GGTTTAGACA
TCGACTTAGA GCAATTGGCG ATTACTGCTG GCGACAACCA AACCGTCATC CCTGCTACGT
TGAAAATAGT CAAAGACCAT TATCGAGCAC AAGGAAAAAA TTTCATCATT ACGATGGCAC
CAGAATTCCC TTATTTAAAA CCTGGTGCCG CTTATGAAAC ATACATTACT TCCCTAAATG
GTTATTATCA TTACATTGCC CCACAATTAT ATAACCAAGG CGGCGACGGT GTCTGGGTTG
ATGAAGTTAT GACTTGGGTT GCTCAAAGCA ACGATGCTCT AAAATACGAG TTCCTCT
EF074-4 (SEQ ID NO:284) AADTM VDISGKKVLV CYWHNWASKG RDGYKQGTSA
SLNLSEVNQA YNVVPVSFMK SDGTTRIPTF KPYNQTDTAF RQEVAQLNSQ GRAVLLALGG
ADAHIQLVKG DEQAFANEIT RQVETYGFDG LDIDLEQLAI TAGDNQTVIP ATLKIVKDHY
RAQGKNFIIT MAPEFPYLKP GAAYETYITS LNGYYDYIAP QLYNQGGDGV WVDEVMTWVA
QSNDALKYEF LY EF075-1 (SEQ ID NO:285) TAACCTATAA GAAAAAAATC
ACAACCTGTG ATAAATTATT GGAGGNAAAA TATGTCAAAA GGGAAGAAAA TTTTTGCCAT
TATCNTTGGA ATTATCTTGG NTCTATTTCT TGCAGTTCTT GGAATGGGAG CAAAACTTTA
TTGGGATGTT TCTAAATCAA TGGATAAAAC CTATGAAACA GTAGAACGAT CTAAAAAAAG
TCAGGTCAAT TTAAACAATA AGGAGCCTTT TTCTGTTTTA TTATTAGGGA TTGATACAGG
CGATGATGGG CGTGTCGAGC AAGGTCGTTC GGATACAACA ATTGTTGCAA CAGTTAATCC
TCGTGACAAG CAAACAACCT TAGTCAGTCT TGCTCGCGAT ACCTATGTTG ATATTCCAGG
TCAAGGAAAA CAAGATAAAT TGAATCACGC CTATGCTTTT GGTGGCGCAT CTTTACCAAT
GGACACAGTT GAAAACTATT TAAACATACC TATTAATCAT TATGTTTCAA TTAATATGGC
TGGTTTAAAA GAATTAGTCA ACGCGGTTCC CGGAATCGAA GTGAACAATA ATCTGACTTT
TTCTCAAGAC GGATATGATT TTACGATTGG TAAAATTTCA TTCGATGGTG AACAAGCACT
CTCCTATTCA AGAATGCGTT ACGAAGACCC TAATGGTGAC TACGGCCGCC AAGAACGTCA
AAGAAAAGTG ATTGAAGCCA TCGTCCAAAA AGTCTTAAGT CTTAACAGCG TAAGCAACTA
TCAAGAAATT TTAACAGCTG TTTCTGATAA TATGAAGACA GATTTAAGTT TTGATGACAT
GAAAAAAATT GCCTTAGATT ATCGCAGTGC CTTTGGTAAA GTGAAACAAG ACCAACTTCA
AGGTACTGGT TTTATGCAAG ATGCTGTTTC CTATCAACGT GTGGATGAAC AAGAATTAAC
TCGTGTCCAA CAAGAGTTGA AAAATCAATT GAATACAAAA TAA EF075-2 (SEQ ID
NO:286) MSKG KKIEFAIIXGI ILXLJFLAVVG MGAKLYWIJVS KSMDKTYETV
ERSKKSQVNL NNKEPFSVLL LGIDTGDDGR VEQGRSDTTI VATVNPRDKQ TTLVSLARDT
YVDIPGQGKQ DKLNHAYAFG GASLANDTVE NYLNIPINHY VSINMAGLKE LVNAVGGIEV
NNNLTFSQDG YDFTICKISL DGEQALSYSR MRYEDPNGDY GRQERQRKVT EGTVQKVLSL
NSVSNYQEIL TAVSDNMKTD LSFDDMKKTA LDYRSAFGKV KQDQLQCTGF MQDCVSYQRV
DEQELTRVQQ ELKNQLNTK EF075-3 (SEQ ID NO:287) ACTTTA TTGGGATGTT
TCTAAATCAA TGCATAAAAC CTATGAAACA GTAGAACGAT CTAAAAAAAG TCAGGTCAAT
TTAAACAATA AGGAGCCTTT TTCTGTTTTA TTATTAGGGA TTGATACAGG CGATGATGGG
CGTGTCGAGC AAGGTCGTTC GGATACAACA ATTGTTGCAA CAGTTAATCC TCGTGACAAG
CAAACAACCT TAGTCAGTCT TGCTCGCGAT ACCTATGTTG ATATTCCAGG TCAAGGAAAA
CAACATAAAT TGAATCACGC CTATGCTTTT GGTCGCGCAT CTTTAGCAAT GGACACAGTT
GAAAACTATT TAAACATACC TATTAATCAT TATGTTTCAA TTAATATGGC TGGTTTAAAA
GAATTAGTCA ACGCGGTTGG CGGAATCGAA GTGAACAATA ATCTGACTTT TTCTCAAGAC
GGATATGATT TTACGATTGG TAAAATTTCA TTGGATGCTG AACAAGCACT CTCCTATTCA
AGAATGCGTT ACGAAGACCC TAATGGTGAC TACGCCCGCC AAGAACGTCA AAGAAAAGTG
ATTCAAGGCA TCGTCCAAAA ACTCTTAAGT CTTAACAGCG TAAGCAACTA TCAAGAAATT
TTAACAGCTG TTTCTGATAA TATGAAGACA GATTTAAGTT TTGATGACAT GAAAAAAATT
CCCTTAGATT ATCGCACTGC CTTTGGTAAA GTGAAACAAG ACCAACTTCA AGGTACTGGT
TTTATGCAAG ATGGTGTTTC CTATCAACGT GTGGATGAAC AAGAATTAAC TCGTGTCCAA
CAAGAGTTGA AAAATCAATT GAATACAAAA EF075-4 (SEQ ID NO:288) KLYWDVS
KSMDKTYETV ERSKKSQVNL NNKEPFSVLL LGIDTGDDGR VEQGRSDTTI VATVNPEDKQ
TTLVSLARDT YVDIPCQGKQ DKLNHAYAFG GASLANDTVE NYLNIPINHY VSINNAGLKE
LVNAVGGIEV NNNLTFSQDG YDFTIGKTSL DGEQALSYSR MRYEDPNGDY GRQERQRKVI
EGIVQKVLSL NSVSNYQEIL TAVSDNNKTD LSEDDMKKIA LDYRSAFGKV KQDQLQGTGF
MQDGVSYQRV DEQELTRVQQ ELKNQLNTK EF076-1 (SEQ ID NO:289) TAGAAAATAA
CAGAGGAGCT GAAGGAAATG AAACCATCAA CAAAAATTCG TATCGGTTTA AGCATTGCTG
CAGTTGCAAG TGTCTCTGTT GCAGTCATCG CTTCTGAAAA AATTATTAAG AAGGTATCTC
ATGTTTCCAA TCGTTATAAA GTTAAAAAGT TTGTAGACGA TAAATTTGAT CGAAACCAAA
AATTATTATC GATTGTCGAT GATTTATCCG ATGATGAATT ACATTCTGTT TTAAATGTTG
TGGATCGTGT GAAAGATGGC GGTTCAAAAT TAGCTGAATA TGGCGAAAAA GTTAAAGACA
ATACAGATTC TTTAAAAGAA CGCTTTTTCA CATTTATTGA AGATGCAATG AACTTAAAAA
AGTCGCCTAG GCCATCTTTT TTTTATAAAA ATAATTCTTT TGTTTCAACA TAA EF076-2
(SEQ ID NO:290) MK ASTKIGIGLS IAAVASVSVA VIASEKIIKK VSHVSNRYKV
KKFVDDKFDG NQKLLSIVDD LSDDELDSVL NVVDRVKDCG SKLAEYGEKV KDNTDSLKER
FFTFIEDAMK
LKKWPRPSFF YKNNSFVST EF076-3 (SEQ D NO:291) CATCG CTTCTGAAAA
AATTATTAAG AAGGTATCTC ATGTTTCCAA TCGTTATAAA GTTAAAAAGT TTGTAGACGA
TAAATTTGAT GGAAACCAAA AATTATTATC GATTGTCGAT GATTTATCCG ATGATGAATT
AGATTCTGTT TTAAATGTTG TGGATCGTGT GAAAGATGGC GCTTCAAAAT TAGCTGAATA
TGGCGAAAAA GTTAAAGACA ATACACATTC TTTAAAAGAA CGCTTTTTCA CATTTATTGA
AGATGCAATG AAGTTAAAAA AGTGGCCTAG GCCATCTTTT TTTTATAAAA ATAATTCTT
EF076-4 (SEQ ID NO:292) VIASEKIIKK VSHVSNRYKV KKFVDDKFDG NQKLLSIVDD
LSDDELDSVL NVVDRVKDGG SKLAEYGEKV KDNTDSLKER FFTFIEDAMK LKKWPRPSFF
YKNNS EF077-1 (SEQ ID NO:293) TAATGTAAAG TGAATGATCG GAGAGAAAAA
GAGATGAAGC ATGTAACAAA ATTGGGGATT ACAATTATAA CAGGAGTTTT GOCATTATTA
TTTGAATTTA TTTTACATCA GCCGAATTGG GCGTATGGCA TTATTTTAAT AACAGGTTCT
GTAATGGCGT TAATGATGTT CTCGGAAATG ATTCAAACCT TACGTGAAGG AAAATATGGT
GTCGATATTT TAGCGATTAC CGCTATCGTT GCAACCTTAG CTGTGGGAGA ATACTGGGCC
AGTTTGATGA TTTTAATTAT GTTGACTGGT GGTGATTCAT TAGAAGACTA TGCCGCTGGA
AAAGCTAACC AAGAGCTGAA GTCATTATTG GATAACTCGC CACAAAAAGC TCATCGCTTG
AATGGCGAAA ATTTAGAAGA TGTTTCTGTT GAGGAAATCA ATGTTGGCGA TGAATTAGTA
GTAAAACCAG GGGAACTAGT TCCAGTTGAT GGCTTGGTAA AAACCGGGAC ATCAACAGTC
GATGAATCTT CATTAACAGG AGAATCAAAA CCAATTGAAA AAAATCCTGG GGATGAATTA
ATGTCGGGTT CCGTGAATGG TGACGGCTCT TTGAAAATGG TTGCTGAAAA AACTGTAGCA
GACAGTCAAT ATCAAACAAT TGTGAACTTA GTGAAAGAAT CTGCGGCGCG TCCAGCTCAT
TTTGTACGTT TAGCAGATCG CTATGCGGTA CCTTTTACAC TAGTTGCCTA CCTAATTGCA
GGTGTTGCTT GGTTTGTTTC AAAAAGTCCG ACACGTTTTG CGGAAGTCTT AGTTGTTGCT
TCGCCGTGTC CTTTAATTCT ATCTGCCCCA ATTGCTTTAG TGGCAGGGAT GGGTCGTTCA
AGTCGTCATG GCGTCGTTAT TAAATCGGGA ACGATGGTCG AAAAATTAGC TTCTGCAAAA
ACGATTGCGT TTGATAAAAC AGGCACGATT ACGCAAGGAC AACTTTCTGT TGATCAAGTC
CAACCAATCA ATGCTGGAAT AACTGCTGCT GAATTAGTGG GATTGGCAGC AAGCGTGGAA
CAAGAATCAA GTCATATTTT AGCTAGATCA ATTGTTGCTT ATGCCAGAAA GCAAGATGTC
CCATTAAAAA ATATTACAGA TCTAGCCGAA GTTTCTGGTG CTGGCGTGAA GGCATTTGTG
GATGGTGCTG AGATACGGGT AGGTAAAAAG AATTTTGTGA CACAAGAGTC TCAAGAAACT
GAAAAAATTG ATAAAACGAC TATTCATATT TCACGTAATG GCACATATTT AGGCCGAATT
ACTTTTACAG ACACTGTACG CCCAGAAGCA AAAGAGACTA TGGAAAAATT ACACCAATTA
CATCTTCAAC GAATTTTAAT GCTGACGGGG GATCAAGAAT CCGTTGCAGA AACGATTGCT
GCAGAAGTAG GAATTACCGA AGTACATGGG GAATGTTTAC CACAAGATAA ATTAACTATT
CTAAAAGAAT TGCCTAAAGA AAATCATCCA GTCATCATGG TAGGAGATGG TGTAAATGAT
GCACCTTCGC TTCCTGCTGC AGACGTAGGT ATTGCTATGG GTGCTCATGG AGCTACTGCG
GCTAGTGAAA CTGCTGACGT TGTTATTTTA AAAGATGACT TAAGTAAAGT CAGCCAAGCG
GTCGAAATTG CCCAAGATAC CATGAAAATT GCCAAACAAT CTGTATTAAT CGGAATTTTT
ATCTGCGTTT TACTAATGTT AATTGCTAGT ACCGGGATCA TTCCGGCGCT AATCGGGGCT
ATGCTACAAG AAGTCGTGGA CACTGTGTCA ATCTTATCTG CTTTGCGTGC TCGTCGAATT
GGCCAGTAA EF077-2 (SEQ ID NO:294) MKHVThLGIT IITGVLALLF EFILHQPNWA
YGIILITGSV MALMNFWEMI QTLREGKYGV DILAITAIVA TLAVGEYWAS LMILIMLTGG
DSLEDYAAGK ANQELKSLLD NSPQKAHRLN GENLEDVSVE EINVGDELVV KPGELVPVDG
LVKTGTSTVD ESSLTGESKP IEKNPGDELM SGSVNGDGSL KMVAEKTVAD SQYQTIVNLV
KESAARPAHF VRLADRYAVP FTLVAYLIAG VAWFVSKSPT RFAEVLWAS PCPLILLSAPI
ALVAGMGRSS RHGVVIKSGT MVEKLASAKT IAFDKTGTIT QGQLSVDQVQ PINAGITAAE
LVGLAASVEQ ESSHILARSI VAYARKQDVP LKNITDLAEV SGAGVKAFVD CAEIRVGKKN
FVTQESQETE KIDKTTIHIS RNGTYLGRIT FTDTVRPEAK ETMEKLHQLH LQRILMLTGD
QESVAETIAA EVGITEVHGE CLPQDKLTIL KELPKENHPV IMVGDGVNDA PSLAAADVGI
AMGAHGATAA SETADVVILK DDLSKVSQAV ETAQDTMKIA KQSVLIGIFI CVLLMLIAST
GIIPALIGAN LQEVVDTVSI LSALRARRTG Q EF077-3 (SEQ ID NO:295) TCA
GCCGAATTGG GCGTATGGCA TTATTTTAAT AACAGGTTCT GTAATGGCGT TAATGATGTT
CTGGGAAATG ATTCAAACCT TACGTGAAGG AAAATATGGT GTCGATATTT TAGCGATTAC
CGCTATCGTT GCAACCTTAG CTGTGGGAGA ATACTGGGCC AGTTTGATGA TTTTAATTAT
GTTGACTGGT GGTGATTCAT TAGAAGACTA TGCCGCTGGA AAAGCTAACC AAGAGCTGAA
GTCATTATTG GATAACTCGC CACAAAAAGC TCATCGCTTG AATGGCGAAA ATTTAGAAGA
TGTTTCTGTT GAGGAAATCA ATGTTGGCGA TGAATTAGTA GTAAAACCAG GGGAACTAGT
TCCAGTTGAT GGCTTGGTAA AAACCGGGAC ATCAACAGTC GATGAATCTT CATTAACAGG
AGAATCAAAA CCAATTGAAA AAAATCCTGG GGATGAATTA ATGTCGGGTT CCGTGAATGG
TGACGGCTCT TTGAAAATGG TTGCTGAAAA AACTGTAGCA GACAGTCAAT ATCAAACAAT
TGTGAACTTA GTGAAAGAAT CTGCGGCGCG TCCAGCTCAT TTTGTACGTT TAGCAGATCG
CTATGCGGTA CCTTTTACAC TAGTTGCCTA CCTAATTGCA GGTGTTGCTT GGTTTGTTTC
AAAAAGTCCG ACACGTTTTG CGGAAGTCTT AGTTGTTGCT TCGCCGTGTC CTTTAATTCT
ATCTGCCCCA ATTGCTTTAG TGGCAGGGAT GGGTCGTTCA AGTCGTCATG GGGTCGTTAT
TAAATCGGGA ACGATGGTCG AAAAATTAGC TTCTGCAAAA ACGATTGCGT TTGATAAAAC
AGGCACGATT ACGCAAGGAC AACTTTCTGT TGATCAAGTC CAACCAATCA ATGCTGGAAT
AACTGCTGCT GAATTAGTGG GATTGGCAGC AAGCGTGGAA CAAGAATCAA CTCATATTTT
AGCTAGATCA ATTGTTGCTT ATGCCAGAAA GCAAGATGTC CCATTAAAAA ATATTACAGA
TCTAGCGGAA GTTTCTGGTG CTGGCGTGAA GGCATTTGTG GATGGTGCTG AGATACGGGT
AGGTAAAAAG AATTTTGTGA CACAAGAGTC TCAAGAAACT GAAAAAATTG ATAAAACGAC
TATTCATATT TCACGTAATG GCACATATTT AGGCCGAATT ACTTTTACAG ACACTGTACG
CCCAGAAGCA AAAGAGACTA TGGAAAAATT ACACCAATTA CATCTTCAAC GAATTTTAAT
GCTGACGGGG GATCAAGAAT CCGTTGCAGA AACGATTGCT GCAGAAGTAG GAATTACCGA
AGTACATGGG GAATGTTTAC CACAAGATAA ATTAACTATT CTAAAAGAAT TGCCTAAAGA
AAATCATCCA GTCATCATGG TAGGAGATOG TGTAAATGAT GCACCTTCGC TTGCTGCTGC
AGACGTAGGT ATTGCTATGG GTGCTCATGG AGCTACTGCG GCTAGTGAAA CTGCTGACGT
TGTTATTTTA AAAGATGACT TAAGTAAAGT CAGCCAAGCG GTCGAAATTG CCCAAGATAC
CATGAAAATT GCCAAACAAT CTGTATTAAT CGGAATTTTT ATCTGCGTTT TACTAATGTT
AATTGCTAGT ACCGGGATCA TTCCGGCGCT AATCGGGGCT ATGCTACAAG AAGTCGTGGA
CACTGTGTCA ATCTTATCTG CTTTGCGTGC TCGTCGAATT GGCC EF077-4 (SEQ ID
NO:296) QPNWA YGITLITGSV MALMMFWEMI QTLREGKYGV DILAITAIVA
TLAVGEYWAS LMILIMLTGG DSLEDYAAGK ANQELKSLLD NSPQKAHRLN GENLEDVSVE
ETNVGDELVV KPGELVPVDG LVKTGTSTVD ESSLTGESKP IEKNPGDELM SGSVNGDGSL
KNVAEKTVAD SQYQTIVNLV KESAARPAHF VRLADRYAVP FTLVAYLIAG VAWFVSKSPT
RFAEVLVVAS PCPLILSAPI ALVAGMGRSS RHGVVIKSGT MVEKLASAKT IAFDKTGTTT
QGQLSVDQVQ PINAGTTAAE LVGLAASVEQ ESSHILARSI VAYARKQDVP LKNITDLAEV
SGAGVKAFVD GAEIRVGKKN FVTQESQETE KIDKTTIHIS RNGTYLGRTT FTDTVRPEAK
ETMEKLHQLH LQRILMLTGD QESVAETIAA EVGTTEVHGE CLPQDKLTIL KELPKENHPV
IMVGDGVNDA PSLAAADVGI AMGAHGATAA SETADVVILK DDLSKVSQAV EIAQDTMKTA
KQSVLIGIFI CVLLMLIAST GIIPALIGAM LQEVVDTVST LSALRARRIG EF079-1 (SEQ
ID NO:297) TAATTTCTAG CATCACCGAA GAAATTTTTA GAAAAACAAA GAGCCTGGGC
CAATCACTGT CCCAGGCTCT CATGCTTTAT TTTTAAGGAG GAAGCAATGA AGTCAAAAAA
GAAACGTCGT ATCATTGATG GTTTTATGAT TCTTTTACTG ATTATTGGAA TAGGTGCATT
TGCGTATCCT TTTGTTAGCG ATGCATTAAA TAACTATCTG GATCAACAAA TTATCGCTCA
TTATCAAGCA AAAGCAAGCC AAGAAAACAC CAAAGAAATG GCTGAACTTC AAGAAAAAAT
GGAAAAGAAA AACCAAGAAT TAGCGAAAAA AGGCAGCAAT CCTGGATTAG ATCCTTTTTC
TGAAACGCAA AAAACAACGA AAAAACCAGA CAAATCCTAT TTTGAAAGTC ATACGATTGG
TGTTTTAACC ATTCCAAAAA TAAATGTCCG TTTACCAATT TTTGATAAAA CGAATGCATT
GCTATTGGAA AAAGGAAGCT CCTTGTTAGA AGGAACCTCC TATCCTACAG GTGGTACGAA
TACACATGCG GTCATTTCAG GCCATCGTGG TCTCCCTCAA GCCAAATTAT TTACAGATTT
GCCAGAATTA AAAAAAGGCG ATGAATTTTA TATCGAAGTC AATGGGAAGA CGCTTGCTTA
TCAAGTAGAT CAAATAAAAA CCGTTGAACC AACTGATACA AAAGATTTAC ACATTGAGTC
TGGCCAAGAT CTCGTCACTT TATTAACTTG CACACCGTAT ATGATAAACA GTCATCGGTT
ATTAGTTCGA GGACATCGTA TCCCATATCA ACCAGAAAAA GCAGCAGCGG GGATGAAAAA
AGTGGCACAA CAACAAAATT TACTATTATG GACATTACTT TTAATTGCCT GTGCGTTAAT
TATTAGCGGC TTCATTATCT GGTACAAGCG ACGGAAAAAG ACGACCAGAA AACCAAAGTA G
EF079-2 (SEQ ID NO:298) MKSKKKRRI IDGFMILLLI IGIGAFAYPE VSDALNNYLD
QQIIAHYQAK ASQENTKEMA ELQEKMEKKN QELAKKGSNP GLDPFSETQK TTKKPDKSYF
ESHTIGVLTI PKINVRLPIF DKTNALLLEK GSSLLEGTSY PTGGTNTHAV ISGHRGLPQA
KLFTDLPELK KGDEFYIEVN GKTLAYQVDQ IKTVEPTDTK DLHIESGQDL VTLLTCTPYM
INSHRLLVRG HRIPYQPEKA AAGMKKVAQQ QNLLLWTLLL IACALIISGF IIWYKEPKKT
TRKPK EF079-3 (SEQ ID NO:299) TCCT TTTGTTAGCG ATGCATTAAA TAACTATCTG
GATCAACAAA TTATCGCTCA TTATCAAGCA AAAGCAAGCC AAGAAAACAC CAAAGAAATG
GCTGAACTTC AAGAAAAAAT GGAAAAGAAA AACCAAGAAT TAGCGAAAAA AGGCAGCAAT
CCTGGATTAG ATCCTTTTTC TGAAACGCAA AAAACAACGA AAAAACCAGA CAAATCCTAT
TTTGAAAGTC ATACGATTGG TGTTTTAACC ATTCCAAAAA TAAATGTCCG TTTACCAATT
TTTGATAAAA CGAATGCATT GCTATTGGAA AAAGGAAGCT CCTTGTTAGA AGGAACCTCC
TATCCTACAG GTGGTACGAA TACACATGCG GTCATTTCAG GCCATCGTGG TCTCCCTCAA
GCCAAATTAT TTACAGATTT GCCAGAATTA AAAAAAGGCG ATGAATTTTA TATCGAAGTC
AATGGGAAGA CGCTTGCTTA TCAAGTAGAT CAAATAAAAA CCGTTGAACC AACTGATACA
AAAGATTTAC ACATTGAGTC TGGCCAAGAT CTCGTCACTT TATTAACTTG CACACCGTAT
ATGATAAACA GTCATCGGTT ATTAGTTCGA GGACATCGTA TCCCATATCA ACCAGAAAAA
GCAGCAGCGG GGATGAAAAA AGTGGCACAA CAACAAAATT TACTATTATO GACATTACTT
TTAATTGCCT GTGCGTTAAT TATTAGCGGC TTCATTATCT GGTACAAGCG ACGGAAAAAG
ACGACCAGAA AACCAA EF079-4 (SEQ ID NO:300) PF VSDALNNYLD QQITAHYQAK
ASQENTKEMA ELQEKNEKKN QELAKKGSNP GLDPFSETQK TTKKPDKSYF ESHTTGVLTT
PKINVRLPIF DKTNALLLEK GSSLLEGTSY PTGGTNTHAV ISGHRGLPQA KLFTDLPELK
KGDEFYTEVN GKTLAYQVDQ IKTVEPTDTK DLHTESGQDL VTLLTCTPYM INSHRLLVRG
HRIPYQPEKA AAGMKKVAQQ QNLLLWTLLL IACALILISGF IIWYKRRKKT TRKP
EF080-1 (SEQ ID NO:301) TAGTTACACT CGTTTAGGGC TAGCAACGTT AGGCATTTTC
GCTCGACTCT TAGCACTCTT TTTATTAGGA GGTTATTTCC TATGAAAAAA CGACTTTTAC
CTATTTTTTT CCTAATACTT CTTACCTTTG GCCTTGCCCT ACCCGTTTCG GCGGCTGAAA
ATTCAATTGA TGATGGCGCA CAATTACTGA CACCTGATCA AATCAACCAA CTAAAGCAAG
AGATACAACC TTTAGAAGAA AAAACAAAAG CCTCTGTCTT TATTGTAACC ACAAATAATA
ATACCTATGG CGATGAACAA GAATATGCAG ATCATTATCT TTTAAATAAA GTTGGCAAGG
ACCAAAATGC GATTCTTTTT CTCATTGATA TGGACTTACG GAAAATCTAC ATCTCTACTT
CTGGAAACAT GATTGATTAT ATGACAGATG CACGAATTGA TGATACCTTA GATAAAATAT
GGGATAATAT GAGTCAAGGA AATTATTTCG CGGCTGCTCA AACCTTTGTT CAGGAAACTC
AAGCATTTGT TAATAAAGGG GTTCCTGGGG GGCACTATCG TGTGGACAGC GAAACAGGTA
AAATCACTCG TTATAAAGTC ATTACCCCGC TGGAAATGGT AATTGCTTTT GCTGCTGCGC
TGATACTCAG TTTGGTCTTC TTAGGCATTA ATATTTCTAA ATATCAATTA AAATTTTCAA
GTTATCAATA TCCCTTTAGG GAAAAAACAA CTTTAAACTT AACCTCCCGC ACAGATCAGT
TAACCAACTC TTTCATCACT ACGCGTCGTA TTCCTAAAAA CAATGGCGGC AGTGGCGGAA
TGGGCGGTGG TGGTAGCACC ACCCACTCAA CTGGCGGCGG CACATTCGGT GGCGGCGGTC
GAAGTTTTTA G EF080-2 (SEQ ID NO:302) MKKR LLPTFFLILL TFGLALPVSA
AENSIDDGAQ LLTPDQINQL KQEIQPLEEK TKASVFIVTT NNNTYGDEQE YADHYLLNKV
GKDQNAILFL IDMDLRKIYI STSGNMIDYM TDARIDDTLD KIWDNMSQGN YFAAAQTFVQ
ETQAFVNKGV PGGHYRVDSE TGKITRYKVI TPLEMVIAFA AALILSLVFL GINISKYQLK
FSSYQYPFRE KTTLNLTSRT DQLTNSFITT RRTPKNNGGS GGMGGGGSTT HSTGGGTFGG
GGRSF EF080-3 (SEQ ID NO:303) GGCTGAAA ATTCAATTGA TGATGGCGCA
CAATTACTGA CACCTGATCA AATCAACCAA CTAAAGCAAG AGATACAACC TTTAGAAGAA
AAAACAAAAG CCTCTGTCTT TATTGTAACC ACAAATAATA ATACCTATGG CGATGAACAA
GAATATGCAG ATCATTATCT TTTAAATAAA GTTGGCAAGG ACCAAAATGC GATTCTTTTT
CTCATTGATA TGGACTTACG GAAAATCTAC ATCTCTACTT CTGGAAACAT GATTGATTAT
ATGACAGATG CACGAATTGA TGATACCTTA GATAAAATAT GGGATAATAT GAGTCAAGGA
AATTATTTCG CGGCTGCTCA AACCTTTGTT CAGGAAACTC AAGCATTTGT TAATAAAGGG
GTTCCTGGGG GGCACTATCG TGTGGACAGC GAAACAGGTA AAATCACTCG TTATAAAGTC
ATTACCCCGC TGGAAATGGT AATTGCTTTT GCTGCTGCGC TGATACTCAG TTTGGTCTTC
TTAGGCATTA ATATTTCTAA ATATCAATTA AAATTTTCAA GTTATCAATA TCCCTTTACG
GAAAAAACAA CTTTAAACTT AACCTCCCGC ACAGATCAGT TAACCAACTC TTTCATCACT
ACGCGTCGTA TTCCTAAAAA CAATGGCGGC AGTGGCGGAA TGGGCGGTGG TGGTAGCACC
ACCCACTCAA CTGGCGGCGG CACATTCGGT GGCGGCGGTC GAAGT EF080-4 (SEQ ID
NO:304) AENS I DDGAQ LLTPDQINQL KQEIQPLEEK TKASVFIVTT NNNTYGDEQE
YADHYLLNKV GKDQNAILFL IDMDLRKIYI STSGNNIDYM TDARIDDTLD KIWDNNSQGN
YFAAAQTFVQ ETQAFVNKGV PGGHYRVDSE TGKITRYKVT TPLEMVIAFA AALILSLVFL
GINISKYQLK FSSYQYPFRE KTTLNLTSRT DQLTNSFITT RRTPKNNGGS GGMGGGGSTT
HSTGGGTFGG GGRS EF081-1 (SEQ ID NO:305) TGAATGGAAC CAAGCAATCG
TAATAAAAAA TCTTCAAAAA AACCACTTAT TCTTGGTGTT TCTGCCTTGG TTCTAATCGC
TGCTGCCGGT GGCGGGTATT ATGCTTATAG TCAATGGCAA GCCAAACAAG AATTAGCCGA
AGCGAAGAAA ACAGCTACTA CATTTTTAAA CGTATTGTCA AAACAGGAAT TTGATAAGTT
ACCGTCCGTT GTTCAAGAAG CTAGCTTAAA GAAAAATGGC TATGATACTA AATCTGTTGT
TGAAAAATAC CAAGCAATTT ATTCAGGGAT TCAAGCAGAA GGAGTCAAAG CTAGTGATGT
TCAAGTCAAA AACGCGAAAG ACAATCAATA CACATTTACC TATAAATTAT CGATGAGCAC
GCCTTTAGGC GAAATGAAAG ATTTGTCTTA TCAATCAAGT ATCGCCAAAA AAGGCGATAC
CTACCAAATC GCTTGGAAGC CATCTTTAAT TTTTCCAGAT ATGTCAGGAA ATGATAAAAT
TTCGATTCAA GTAGATAATG CCAAACGTGG AGAAATTGTC GATCGTAATG GTAGTGGGCT
AGCAATTAAC AAAGTGTTTG ACGAAGTGGG CGTAGTGCCT GGCAAACTCG GTTCTGCCGC
AGAAAAAACA GCCAATATCA AAGCTTTTAG TGATAAATTC GGCGTTTCTG TTGATGAAAT
CAATCAAAAG TTAAGCCAAG GATGGGTCCA AGCAGACTCC TTTGTACCAA TCACAGTCGC
TTCTGAACCA GTGACAGAAT TACCAACAGG GGCTGCGACA AAAGATACAG AGTCACGTTA
TTATCCGCTG GGGGAAGCAN TGCGCAATTA A EF081-2 (SEQ ID NO:306)
MERSNRNKKS SKKPLILGVS ALVLIAAAGG GYYAYSQWQA KQELAEAKKT ATTFLNVLSK
QEFDKLPSVV QEASLKKNGY DTKSVVEKYQ AIYSGIQAEG VKASDVQVKK AKDNQYTFTY
KLSMSTPLGE MKDLSYQSSI AKKGDTYQTA WKPSLIFPDM SGNDKISIQV DNAKRGEIVD
RNGSGLAINK VFDEVGVVPG KLGSGAEKTA NIKAFSDKFG VSVDETNQKL SQGWVQADSF
VPITVASEPV TELPTGAATK DTESRYYPLG EAXRN EF081-3 (SEQ ID NO:307) T
GGCGGGTATT ATOOTTATAG TCAATGGCAA GCCAAACAAG AATTAGCCGA AGCGAAGAAA
ACAGCTACTA CATTTTTAAA CGTATTGTCA AAACAGGAAT TTGATAAGTT ACCGTCCGTT
GTTCAAGAAG CTAGCTTAAA GAAAAATGGC TATGATACTA AATCTGTTGT TGAAAAATAC
CAAGCAATTT ATTCAGGGAT TCAAGCAGAA GGAGTCAAAG CTAGTGATGT TCAAGTCAAA
AAGGCGAAAG ACAATCAATA CACATTTACC TATAAATTAT CGATGAGCAC GCCTTTAGGC
GAAATGAAAG ATTTGTCTTA TCAATCAAGT ATCGCCAAAA AAGGCGATAC CTACCAAATC
GCTTGGAAGC CATCTTTAAT TTTTCCAGAT ATGTCAGGAA ATGATAAAAT TTCGATTCAA
GTAGATAATG CCAAACGTGG AGAAATTGTC GATCGTAATG GTAGTGGGCT AGCAATTAAC
AAAGTGTTTG ACGAAGTGGG CGTAGTGCCT GGCAAACTCG GTTCTGGCGC AGAAAAAACA
GCCAATATCA AAGCTTTTAG TGATAAATTC GGCGTTTCTG TTGATGAAAT CAATCAAAAG
TTAAGCCAAG GATGGGTCCA AGCAGACTCC TTTGTACCAA TCACAGTCGC TTCTGAACCA
GTGACAGAAT TACCAACAGG GGCTGCGACA AAAGATACAG AGTCACGTTA TTATCCGCTG
GGGG EF081-4 (SEQ ID NO:308) G GYYAYSQWQA KQELAEAKKT ATTFLNVLSK
QEFDKLPSVV QEASLKKNGY DTKSVVEKYQ AIYSGIQAEG VKASDVQVKK AKDNQYTFTY
KLSMSTPLGE MKDLSYQSSI AKKGDTYQIA WKPSLIFPDM SGNDKISIQV DNAKRGETVD
RNGSGLAINK VFDEVGVVPG KLGSGAEKTA NIKAFSDKFG VSVDEINQKL SQGWVQADSF
VPITVASEPV TELPTGAATK DTESRYYPLG EF082-1 (SEQ ID NO:309) TAAAAAATGA
AAAAGATCGT GCGCATTTCA AGCATTTTGT TCGTTGCTAC GCCTCTTATG CTTTTAAATA
GTTCAAAAGT TGAAGCAGCT CAAGTCGCTT CTATTCAATC CAACGCTGAT ATTACGTTTG
CTCTTGATAA TACTGTCACG CCACCTGTCA ACCCGACGAA CCCTTCTCAG CCTGTGACAC
CTAATCCTGC TGATCCTCAT CAACCTGGTA CAGCCGGACC CCTTAGTATT GACTATGTTT
CAAATATCCA TTTTGGATCA AAACAAATTC AAGCCGGAAC AGCGATCTAT TCGGCACAAC
TGGATCAAGT GCAAAATAGT ACTGGCGATT TAATTAGCGT GCCAAACTAT GTTCAAGTAA
CTGACAAACG TGGTCTAAAT CTTGGCTGGA AATTATCAGT TAAACAGAGT GCGCAATTTG
CTACAAGTGA TTCAACACCC GCTGTTTTGG ATAATGCATC CTTGACCTTT TTAGCAGCAA
CACCCAATTC AACACAGTTA CTTTCTTTGG CGCCATTAAC GGTCCCAGTA ACCTTGGATC
CAACTGGTGC CGCCACTTCT CCTGTGGCGA CTGCCGCTCT TTCAACAGGA ATGGGCACTT
GGACATTAGC TTTTGGTAGC GGANCGACCG CTGCTCAAGG CATTCAATTA ACTGTTCCTG
CGACAACGAA AAAAGTTGCA GCTAAACAAT ATAAAACAAC GCTTACTTGG
ATTTTGGATG
ATACACCACT TTAA EF082-2 (SEQ ID NO:310) MKKIVRISS ILFVATPLNL
LNSSKVEAAQ VASIQSNADI TFALDNTVTP PVNPTNPSQP VTPNPADPHQ PGTAGPLSID
YVSNIHFGSK QIQAGTAIYS AQLDQVQNST GDLISVPNYV QVTDKRGLNL GWKLSVKQSA
QFATSDSTPA VLDNASLTFL AATPNSTQLL SLAPLTVPVT LDPTGAATSP VATAALSTGM
GTWTLAFGSG XTAAQGIQLT VPATTKKVAA KQYKTTLTWI LDDTPL EF082-3 (SEQ ID
NO:311) AGCT CAAGTCGCTT CTATTCAATC CAACGCTGAT ATTACGTTTG CTCTTGATAA
TACTGTCACG CCACCTGTCA ACCCGACGAA CCCTTCTCAG CCTGTGACAC CTAATCCTGC
TGATCCTCAT CAACCTGGTA CAGCCGGACC CCTTAGTATT GACTATGTTT CAAATATCCA
TTTTGGATCA AAACAAATTC AAGCCCGAAC AGCGATCTAT TCGGCACAAC TGGATCAAGT
GCAAAATAGT ACTGGCGATT TAATTAGCGT GCCAAACTAT GTTCAAGTAA CTGACAAACG
TGGTCTAAAT CTTGGCTGGA AATTATCAGT TAAACAGAGT GCGCAATTTG CTACAAGTGA
TTCAACACCC GCTGTTTTGG ATAATGCATC CTTGACCTTT TTAGCAGCAA CACCCAATTC
AACACAGTTA CTTTCTTTGG CGCCATTAAC GGTCCCAGTA ACCTTGGATC CAACTGGTGC
CGCCACTTCT CCTGTCGCGA CTGCCGCTCT TTCAACAGGA ATGGGCACTT GGACATTAGC
TTTTGGTAGC GGANCGACCG CTGCTCAAGG CATTCAATTA ACTGTTCCTG CGACAACGAA
AAAAGTTGCA GCTAAACAAT ATAAAACAAC GCTTACTTGG ATTTTGGATG ATACACCACT
EF082-4 (SEQ ID NO:312) AQ VASIQSNADI TFALDNTVTP PVNPTNPSQP
VTPNPADPHQ PGTAGPLSID YVSNIHFGSK QIQAGTAIYS AQLDQVQNST GDLISVPNYV
QVTDKRGLNL GWKLSVKQSA QFATSDSTPA VLDNASLTFL AATPNSTQLL SLAPLIVPVT
LDPTGAATSP VATAALSTGM GTWTLAFGSG XTAAQCIQLT VPATTKKVAA KQYKTTLTWI
LDDTP EF083-1 (SEQ ID NO:313) TAATTTAAAA GACAAGGAGA AATAAAAATG
AAAAAGAAAA TTTTAGCAGG AGCGCTTGTC GCTCTGTTTT TTATGCCTAC AGCTATGTTT
GCCGCAAAAG GAGACCAAGG TGTGGATTGG GCGATTTATC AAGGTGAACA AGGTCGCTTT
GGCTATGCAC ATGATAAATT CGCTATTGCC CAGATTGGAG GCTACAATGC TAGCGGTATT
TATGAACAAT ACACATATAA AACGCAAGTG GCAAGTGCTA TTGCCCAAGG TAAACGTGCG
CATACCTATA TTTGGTATGA CACTTGGGGA AACATGGACA TTGCGAAAAC AACAATGGAT
TACTTTTTGC CACGTATTCA AACGCCTAAA AATTCCATCG TTGCATTAGA TTTTGAACAT
GGAGCGTTGG CTAGTGTTCC AGATGGATAT GGAGGATATG TAAGTTCAGA TGCCGAAAAA
GCAGCAAATA CAGAGACAAT TTTGTACGGT ATGCGCAGAA TCAAACAGGC TGGCTATACT
CCAATGTATT ACAGCTATAA GCCATTTACA CTAAATCATG TAAACTATCA ACAAATCATC
AAAGAGTTTC CTAACTCTTT ATGGATTGCT GCGTATCCTA TCGATGGTGT GTCACCATAT
CCATTGTATG CTTATTTCCC AAGCATGGAT GGTATTGGTA TTTGGCAATT CACATCCGCT
TATATTGCAG GTGGTTTAGA TGGTAACGTA GATTTAACAG GAATTACGGA TAGTGGTTAT
ACAGATACCA ATAAACCAGA AACGGATACG CCAGCAACAG ATGCAGGCGA AGAAATTGAA
AAAATACCTA ATTCTGATGT TAAAGTTGGC GATACCGTCA AAGTGAAATT TAATGTAGAT
GCTTGGGCAA CTGGGGAAGC TATTCCGCAA TGGGTAAAAG GAAACAGCTA CAAAGTGCAA
GAAGTAACTG GAAGCAGAGT ATTGCTTGAA GGTATCTTGT CATGGATTAG CAAAGGTGAT
ATTGAATTAT TGCCAGACGC AACAGTCGTC CCTGATAAGC AACCAGAAGC GACTCATGTG
GTACAATACG GAGAAACATT ATCAAGTATT GCTTATCAAT ATGGAACAGA CTATCAAACG
TTGGCGGCAT TAAATGGATT GGCTAATCCA AATCTTATTT ATCCTGGTCA AGTTTTGAAA
GTCAATGGAT CGGCAACAAG TAATGTCTAC ACGGTTAAAT ACGGCGATAA TTTATCTAGT
ATTGCAGCAA AACTTGGCAC TACTTATCAA GCTTTAGCTG CATTAAACGG ATTAGCAAAT
CCTAACTTGA TTTATCCAGG TCAAACATTG AATTATTAA EF083-2 (SEQ ID NO:314)
MK KKILAGALVA LFFMPTAMFA AKGDQGVDWA IYQGEQGRFG YAHDKFAIAQ
IGGYNASGIY EQYTYKTQVA SAIAQGKRAH TYIWYDTWGN MDIAKTTMDY FLPRIQTPKN
SIVALDFEHG ALASVPDGYG GYVSSDAEKA ANTETILYGM RRIKQAGYTP MYYSYKPFTL
NHVNYQQIIK EFPNSLWIAA YPIDGVSPYP LYAYFPSMDG GIWQFTSAY IAGGLDGNVD
LTGITDSGYT DTNKPETDTP ATDAGEEIEK IPNSDVKVGD TVKVKFNVDA WATGEAIPQW
VKGNSYKVQE VTGSRVLLEG ILSWISKGDI ELLPDATVVP DKQPEATHVV QYGETLSSIA
YQYGTDYQTL AALNGLANPN LIYPGQVLKV NGSATSNVYT VKYGDNLSSI AAKLGTTYQA
LAALNGLANP NLIYPGQTLN Y EF083-3 (SEQ ID NO:315) AAAAG GAGACCAAGG
TGTGGATTGG GCGATTTATC AAGGTCAACA AGGTCGCTTT GGCTATGCAC ATGATAAATT
CGCTATTGCC CAGATTGGAG GCTACAATGC TAGCGGTATT TATGAACAAT ACACATATAA
AACGCAAGTG GCAAGTGCTA TTGCCCAAGG TAAACGTGCG CATACCTATA TTTGGTATGA
CACTTGGGGA AACATGGACA TTGCGAAAAC AACAATGGAT TACTTTTTGC CACGTATTCA
AACGCCTAAA AATTCCATCG TTGCATTAGA TTTTGAACAT GGAGCGTTGG CTAGTGTTCC
AGATGGATAT GGACGATATG TAAGTTCAGA TGCCGAAAAA GCAGCAAATA CAGAGACAAT
TTTGTACGGT ATGCGCAGAA TCAAACAGGC TGGCTATACT CCAATGTATT ACAGCTATAA
GCCATTTACA CTAAATCATG TAAACTATCA ACAAATCATC AAAGAGTTTC CTAACTCTTT
ATCGATTGCT GCGTATCCTA TCGATGGTGT GTCACCATAT CCATTGTATG CTTATTTCCC
AAGCATGGAT GGTATTGGTA TTTGGCAATT CACATCCGCT TATATTGCAG GTGGTTTAGA
TGGTAACGTA GATTTAACAG GAATTACGGA TAGTGGTTAT ACAGATACCA ATAAACCAGA
AACGGATACG CCAGCAACAG ATGCAGGCGA AGAAATTGAA AAAATACCTA ATTCTGATGT
TAAAGTTGGC GATACCGTCA AAGTGAAATT TAATGTAGAT GCTTGGGCAA CTGGGGAAGC
TATTCCGCAA TGGGTAAAAG GAAACAGCTA CAAAGTGCAA GAAGTAACTG GAAGCAGAGT
ATTGCTTGAA GGTATCTTGT CATGGATTAG CAAAGGTGAT ATTGAATTAT TGCCAGACGC
AACAGTCGTC CCTGATAAGC AACCAGAAGC GACTCATGTG GTACAATACG GAGAAACATT
ATCAAGTATT GCTTATCAAT ATGGAACAGA CTATCAAACG TTGGCGGCAT TAAATGGATT
GGCTAATCCA AATCTTATTT ATCCTGGTCA AGTTTTGAAA GTCAATGGAT CGGCAACAAG
TAATGTCTAC ACGGTTAAAT ACGGCGATAA TTTATCTAGT ATTGCACCAA AACTTGGCAC
TACTTATCAA GCTTTAGCTG CATTAAACGG ATTAGCAAAT CCTAACTTGA TTTATCCAGG
TCAAACATTG AAT EF083-4 (SEQ ID NO:316) KGDQGVDWA IYQGEQGRFG
YAHDKFAIAQ IGGYNASGIY EQYTYKTQVA SAIAQGKRAH TYIWYDTWGN MDIAKTTMDY
FLPRIQTPKN SIVALDFEHG ALASVPDGYG GYVSSDAEKA ANTETILYGM RRIKQAGYTP
MYYSYKPFTL NHVNYQQIIK EFPNSLWTAA YPIDGVSPYP LYAYFPSMDG IGIWQFTSAY
IAGCLDGNVD LTGITDSGYT DTNKPETDTP ATDAGEEIEK TPNSDVKVGD TVKVKFNVDA
WATGEAIPQW VKGNSYKVQE VTGSRVLLEG ILSWISKGDI ELLPDATVVP DKQPEATHVV
QYGETLSSIA YQYGTDYQTL AALNGLANPN LIYPGQVLKV NGSATSNVYT VKYGDNLSSI
AAKLGTTYQA LAALNGLANP NLIYPGQTLN EF084-1 (SEQ ID NO:317) TAGTCAAACG
TTTATTTTTT CCTTAAATCC AGAAAAAATC CCGTAATTAT GGTACACTAC CTATTGAATT
GGACGAGAAC TATGAAGAAA TTTGATGTAA TTATTGTCGG TGCTGGGACG AGCGGTATGA
TGGCCACGAT TGCGGCCGCC GAAGCAGGCG CTCAAGTATT ATTGATTGAA AAAAATCGCC
GTGTTGGGAA AAAATTATTA ATGACTGGTG GCGGCCGCTG TAATGTAACC AATAATCGGC
CCGCAGAAGA AATCATTTCA TTTATTCCTG GGAATGGAAA ATTTTTATAC AGCGCATTTT
CACAATTTGA TAACTATGAT ATCATGAACT TTTTTGAATC CAATGGTATT CACTTAAAAG
AAGAAGATCA CGGACGCATG TTCCCTGTTA CAGATAAATC GAAGTCAATT GTTGATGCGC
TATTTAACCG CATTAACGAA TTAGGAGTCA CTGTTTTTAC AAAAACACAG GTCACAAAAT
TACTACGAAA AGACGATCAA ATAATTGGCG TTGAAACCGA ACTGGAAAAA ATTTATGCAC
CGTGTGTTGT ATTAACAACT GGCGGCCGCA CTTATCCTTC CACAGGAGCA ACTGGTGATG
GCTATAAACT AGCCAAAAAA ATGGGGCATA CCATCAGCCC GCTCTACCCT ACCGAATCAC
CTATTATTTC TGAAGAACCT TTTATCCTGG ATAAAACGTT GCAAGGTCTC TCTTTACAAG
ATGTTAATTT AACTGTTTTG AACCAAAAAG GAAAACCTTT AGTTAATCAT CAAATGGATA
TGCTGTTTAC ACATTTTGGC ATTTCAGGAC CTGCCGCGCT CCGCTGTTCT AGTTTTATTA
ACCAAGAATT AACTCGCAAC GGTAATCAAC CTGTCACGGT AGCCTTGGAT GTGTTTCCGA
CAAAATCTTT TGAAGAAGTG CCTGCCAAAC AACTAACAGA AAAGCAACCN CTTTCCTTTG
TGGAACTACT GAAAGACTTT CAGTTCACTG TTACGAAAAC ATTGCCTTTG GAAAAATCTT
TTGTCACAGG CGGTGGGATT TCCCTCAAAG AAGTGACCCC TAAAACAATG GAGAGCAAAT
TAGTCAATGG TTTATTTTTT GCTGGTGAAC TTTTAGATAT TAATGGCTAT ACTGGAGGCT
ACAATGTTAC AGCTGCATTT GTCACTGGAC ATGTTGCTGG CTCCCATGCC GCAGAAATTG
CAGAATACAC CTATTTACCA ATTGAAGAAG TCTAA EF084-2 (SEQ ID NO:318) MKKF
DVIIVGAGTS GHMATIAAAE AGAQVLLIEK NRRVGKKLLM TGGGRCNVTN NRPAEEIISF
IPGNGKFLYS AFSQFDNYDI MNFFESNGIH LKEEDHGRMF PVTDKSKSIV DALFNRINEL
GVTVFTKTQV TKLLRKDDQI IGVETELEKI YAPCVVLTTG GRTYPSTGAT GDGYKLAKKN
GHTISPLYPT ESPIISEEPF ILDKTLQGLS LQDVNLTVLN QKGKPLVNHQ MDMLFTHFGI
SGPAALRCSS FINQELTRNG NQPVTVALDV FPTKSFEEVP AKQLTEKQRL SFVELLKDFQ
FTVTKTLPLE KSFVTGGGIS LKEVTPKTME SKLVNGLFFA GELLDINGYT GGYNVTAAFV
TCHVAGSHAA EIAEYTYLPI EEV EF084-3 (SEQ ID NO:319) C GAAGCAGGCG
CTCAAGTATT ATTGATTGAA AAAAATCGCC GTGTTGGGAA AAAATTATTA ATGACTGGTG
GCGGCCGCTG TAATGTAACC AATAATCGGC CCGCAGAAGA AATCATTTCA TTTATTCCTG
GGAATGGAAA ATTTTTATAC AGCGCATTTT CACAATTTGA TAACTATGAT ATCATGAACT
TTTTTGAATC CAATGGTATT CACTTAAAAG AAGAAGATCA CGGACGCATG TTCCCTGTTA
CAGATAAATC GAAGTCAATT GTTGATGCGC TATTTAACCG CATTAACGAA TTAGGAGTCA
CTGTTTTTAC AAAAACACAG GTCACAAAAT TACTACGAAA AGACGATCAA ATAATTGGCG
TTGAAACCGA ACTGGAAAAA ATTTATGCAC CGTGTGTTGT ATTAACAACT GGCGGCCGCA
CTTATCCTTC CACAGGACCA ACTGGTGATG GCTATAAACT AGCCAAAAAA ATGGGGCATA
CCATCAGCCC GCTCTACCCT ACCGAATCAC CTATTATTTC TGAAGAACCT TTTATCCTGG
ATAAAACGTT GCAAGGTCTC TCTTTACAAG ATGTTAATTT AACTGTTTTG AACCAAAAAG
GAAAACCTTT AGTTAATCAT CAAATGGATA TGCTGTTTAC ACATTTTGGC ATTTCAGGAC
CTGCCGCGCT CCGCTGTTCT AGTTTTATTA ACCAAGAATT AACTCGCAAC GGTAATCAAC
CTGTCACGGT AGCCTTGGAT GTGTTTCCGA CAAAATCTTT TGAAGAAGTG CCTGCCAAAC
AACTAACAGA AAAGCAACGN CTTTCCTTTG TGGAACTACT GAAAGACTTT CAGTTCACTG
TTACGAAAAC ATTGCCTTTG GAAAAATCTT TTGTCACAGG CGGTGGGATT TCCCTCAAAG
AAGTGACCCC TAAAACAATG GAGAGCAAAT TAGTCAATGG TTTATTTTTT GCTGGTGAAC
TTTTAGATAT TAATGGCTAT ACTGGAGGCT ACAATGTTAC AGCTGCATTT GTCACTGGAC
ATGTTGCTGG CTCCCATGCC GCAGAAATTG CAGAATACAC CTATTTACCA ATTGAAGAAG
TC EF084-4 (SEQ ID NO:320) E AGAQVLLIEK NRRVGKKLLM TGGGRCNVTN
NRPAEEIISF IPGNGKFLYS AFSQEDNYDI MNEFESNGIH LKEEDHGRNF PVTDKSKSIV
DALFNRINEL GVTVFTKTQV TKLLRKDDQI IGVETELEKI YAPCVVLTTG GRTYPSTGAT
GDGYKLAKKM GHTISPLYPT ESPIISEEPF ILDKTLQGLS LQDVNLTVLN QKGKPLVNHQ
HDMLFTHFGI SGPAALRCSS FINQELTRNG NQPVTVALDV FPTKSFEEVP AKQLTEKQRL
SFVELLKDFQ FTVTKTLPLE KSFVTGGGIS LKEVTPKTME SKLVNGLFFA GELLDINGYT
GGYNVTAAFV TGHVAGSHAA EIAEYTYLPI EEV EF085-1 (SEQ ID NO:321)
TAACCCATGA AATCATTTTG TCCCGCATAT GGGGATATGA CTTTGACGGT GATGGCAGCA
CAGTCCACAC TCATATCAAA AATCTGCGGG CGAACTGCCG GAAAATATCA TCAAAACCAT
CCGCGGTGTA GGTTACCGAT TGGAGGAATC ATTATAATGG AAAGAAAAGG GATTTTCATT
AAGGTTTTTT CCTATACGAT CATTGTCCTG TTACTGCTTG TCGGTGTAAC GGCAACACTG
TTTGCACAGC AATTTGTGTC TTATTTCAGA GCGATGGAAG CACAGCAAAC AGTAAAATCC
TATCAGCCAT TGGTGGAACT GATTCAGAAT AGCGATAGGC TTGATATGCA AGAGGTGGCA
GGGCTGTTTC ACTACAATAA CCAATCCTTT GAGTTTTATA TTGAAGATAA AGAGGGAAGC
GTACTCTATG CCACACCGAA TGCCGATACA TCAAATAGTG TTAGGCCCGA CTTTCTTTAT
GTGGTACATA GAGATGATAA TATTTCGATT GTTGCTCAAA GCAAGGCAGG TGTGGGATTG
CTTTATCAAG GGCTGACAAT TCGGGGAATT GTTATGATTG CGATAATGGT TGTATTCAGC
CTTTTATGCG CGTATATCTT TGCGCGGCAA ATGACAACGC CGATCAAAGC CTTAGCGGAC
AGTGCGAATA AAATGGCAAA CCTGAAAGAA GTACCGCCGC CGCTGGAGCG AAAGGATGAG
CTTGGCGCAC TGGCTCACGA CATGCATTCC ATGTATATCA GGCTGAAAGA AACCATCGCA
AGGCTGGAGG ATGAAATCGC AAGGGAACAT GAGTTGGAGG AAACACAGCG ATATTTCTTT
GCGGCAGCCT CTCATGAGTT AAAAACGCCC ATCGCGGCTG TAAGCGTTCT GTTGGAGGGA
ATGCTTGAAA ATATCGGTGA CTACAAAGAC CATTCTAAGT ATCTGCGCGA ATGCATCAAA
ATGATGGACA GGCAGGGCAA AACCATTTCC GAAATACTGG AGCTTGTCAG CCTGAACGAT
GGGAGAATCG TACCCATAGC CGAACCGCTG GACATAGGGC GCACGGTTGC CGAGCTGCTA
CCCGATTTTC AAACCTTGGC AGAGGCAAAC AACCAGCGGT TCGTCACAGA TATTCCAGCC
GGACAAATTG TCCTGTCCGA TCCGAAGCTG ATCCAAAAGG CGCTATCCAA TGTCATATTG
AATGCGGTTC AGAACACGCC CCAGGGAGGT GAGGTACGGA TATGGAGTGA GCCTGGGGCT
GAAAAATACC GTCTTTCCGT TTTGAACATG GGCGTTCACA TTGATGATAC TGCACTTTCA
AAGCTGTTCA TCCCATTCTA TCGCATTGAT CAGGCGCGAA GCAGCAAAAA GTGGGCGAAG
CGGTTTGGGG CTTGCCATCG TACAAAAAAC GCTGGATGCC ATGAGCCTCC AATATGCGCT
GGAAAACACC TCAGATGGCG TTTTGTTCTG GCTGGATTTA CCGCCCACAT CAACACTATA
AATATTTAA EF085-2 (SEQ ID NO:322) MERKGIFIK VESYTIIVEL LLVGVTATLF
AQQFVSYFRA MEAQQTVKSY QPLVELIQNS DRLDMQEVAG LFHYNNQSFE FYIEDKEGSV
LYATPNADTS NSVRPDFLYV VHRDDNISIV AQSKACVGLL YQGLTIRGIV MIAIMVVFSL
LCAYIFARQM TTPIKALADS ANKMANLKEV PPPLERKDEL GALAHDMHSM YIRLKETIAR
LEDEIAREHE LEETQRYFFA AASHELKTPI AAVSVLLEGM LENIGDYKDH SKYLRECIKM
MDRQGKTISE ILELVSLNDG RIVPIAEPLD IGRTVAELLP DEQTLAEANN QRFVTDIPAG
QIVLSDPKLI QKALSNVILN AVQNTPQGGE VRIWSEPGAE KYRLSVLNMG VHIDDTALSK
LFIPEYRIDQ ARSSKKWAKR FGACHRTKNA GCHEPPICAG KHLRWRFVLA GETAHINTIN I
EF085-3 (SEQ ID NO:323) GC AATTTGTGTC TTATTTCAGA GCGATGGAAG
CACAGCAAAC AGTAAAATCC TATCAGCCAT TGGTGGAACT GATTCAGAAT AGCGATAGGC
TTGATATGCA AGAGGTGGCA CGGCTGTTTC ACTACAATAA CCAATCCTTT GAGTTTTATA
TTGAAGATAA AGAGGGAAGC GTACTCTATG CCACACCGAA TGCCGATACA TCAAATAGTG
TTAGGCCCGA CTTTCTTTAT GTGGTACATA GAGATGATAA TATTTCGATT GTTGCTCAAA
GCAAGGCAGG TGTGGGATTG CTTTATCAAG GGCTGACAAT TCGGGGAATT GTTATGATTG
CGATAATGGT TGTATTCAGC CTTTTATGCG CGTATATCTT TGCGCGGCAA ATGACAACGC
CGATCAAAGC CTTAGCGGAC AGTGCGAATA AAATGGCAAA CCTGAAAGAA GTACCGCCGC
CGCTGGAGCG AAAGGATGAG CTTCGCGCAC TGGCTCACGA CATGCATTCC ATGTATATCA
GGCTGAAACA AACCATCGCA AGGCTGGAGG ATGAAATCGC AAGGGAACAT GAGTTGGAGG
AAACACAGCG ATATTTCTTT GCGGCAGCCT CTCATGAGTT AAAAACGCCC ATCGCGGCTG
TAAGCGTTCT GTTGGAGGGA ATGCTTGAAA ATATCGGTGA CTACAAAGAC CATTCTAAGT
ATCTGCGCGA ATGCATCAAA ATGATGCACA GGCAGGGCAA AACCATTTCC GAAATACTGG
AGCTTGTCAG CCTCAACGAT GGGACAATCG TACCCATAGC CGAACCGCTG GACATAGGGC
GCACGGTTGC CGAGCTGCTA CCCGATTTTC AAACCTTGGC AGAGGCAAAC AACCAGCGCT
TCGTCACAGA TATTCCAGCC GGACAAATTG TCCTGTCCGA TCCGAAGCTG ATCCAAAAGG
CGCTATCCAA TGTCATATTG AATGCGGTTC AGAACACGCC CCAGGGAGGT GAGGTACGGA
TATGGAGTCA GCCTGGGGCT GAAAAATACC GTCTTTCCGT TTTGAACATG GGCGTTCACA
TTGATGATAC TGCACTTTCA AAGCTGTTCA TCCCATTCTA TCGCATTGAT CAGGCGCGAA
C-CAGCAAAAA GTGGGCGAAG CGGTTTGGGG CTTGCCATCG TACAAAAAAC GCTGGATGCC
ATGAGCCTCC AATATGCGCT GGAAAACACC TCAGATGGCG TTTTGTTCTG GCTGGATTTA
CCGCCCACAT CAACACTATA AATATTT EF085-4 (SEQ ID NO:324) QFVSYEPA
MEAQQTVKSY QPLVELIQNS DRLDMQEVAG LFHYNNQSFE FYIEDKEGSV LYATPNADTS
NSVRPDFLYV VHRDDNISIV AQSKAGVGLL YQGLTIRGIV MIAIMVVFSL LCAYIFARQM
TTPIKALADS ANKNANLKEV PPPLERKDEL GALAHDMHSM YIRLKETIAR LEDEIAREHE
LEETQRYFEA AASHELKTPI AAVSVLLEGM LENIGDYKDH SKYLRECIKM MDRQGKTISE
ILELVSLNDG RIVPIAEPLD IGRTVAELLP DFQTLAEANN QRFVTDIPAG QIVLSDPKLI
QKALSNVILN AVQNTPQGGE VRIWSEPGAE KYRLSVLNMG VHIDDTALSK LFIPFYRIDQ
ARSSKKWAKR FGACHRTKNA GCHEPPICAG KHLRWRFVLA GFTAHINTIN I EF086-1
(SEQ ID NO:325) TAACTGGTGG GATTGGCAAA TTGGTTCCGC GCAGCGCTAA
CAGATACATT GATTTTATTA CATGATGACC TATTGAATAC AGATGCAGAA AAATTAAATA
AATTTACTGC TCCGCTGATG CTGTATGCAA AAGATCCAAA CATACAATGG CCAATTTATC
GTGCAACAGG AGCTAACTTA ACAGATATTT CAATCACCGT TTTAGGTACT GGACTTTTGT
TAGAAGATAA TCAACGCCTA GTACAAGTAC AAGAAGCTGT TCCGTCCGTT TTAAAAAGTG
TTTCCTCTGG TGATGGCTTA TATCCTGATG GTTCCTTGAT TCAACATGGT TATTTTCCGT
ACAACGCCAG TTACGGGAAT GAGTTGCTAA AACGGTTTGG ACGAATTCAG ACTATTTTAC
AAGGTTCCGA CTGGGAGATG AATGACCCTA ACATTAGTAA TTTATTTAAT GTTGTGGATA
AAGGTTACTT ACAATTGATG GTAAATGGAA AAATGCCATC GATGGTTTCT GGTAGAAGTA
TTTCCAGAGC GCCAGAAACG AATCCTTTTA CTACAGAGTT TGAATCGGGT AAAGAAACAA
TAGCTAATTT AACCTTAATT GCAAAATTTG CACCAGAAAA TTTAAGAAAT GACATTTATA
CATCTATCCA AACGTGGCTT CAACAAAGTG GGTCATACTA TCATTTCTTT AAAAAACCAA
GAGATTTTGA AGCGTTAATT GACTTGAAAA ATGTAGTGAA TAGTGCGTCA CCTGCCCAAG
CCACACCAAT GCAATCTTTA AATGTATATG GTTCGATGGA TCGAGTCCTA CAGAAAAATA
ACGAATATGC GGTGGGGATC AGTATGTATT CACAACGTGT CGGAAACTAT GAATTTGGGA
ATACGGAAAA TAAAAAAGGC TGGCATACAG CAGACGGCAT GCTTTATTTA TACAATCAAG
ACTTTGCTCA GTTTGATGAA GGATACTGGG CAACGATCGA TCCATATCGA TTACCAGGAA
CGACAGTTGA CACAAGAGAA TTGGCAAATG GTGCTTATAC AGGGAAACGC AGTCCCCAGT
CATGGGTAGG TGGCTCAAAT AATGGACAGG TTGCCTCTAT AGGAATGTTT TTAGATAAAA
GTAATGAAGG AATGAACTTA GTTGCTAAAA AATCTTGGTT CTTATTAGAT CGTCAAATCA
TTAATTTGGG AAGTGGCATT ACTGGTACGA CAGATGCTTC GATTGAAACA ATCCTCGATA
ATCGGATGAT
TCATCCACAG GAAGTGAAGC TTAACCAAGG TTCAGACAAA GATAATTCTT GGATTAGTTT
AAGCGCAGCG ANTCCATTGA ATAACATTGG CTATGTTTTT CCTAATTCNA TGAATACGCT
TGATGTTCAA ATAGAAGAAC GCTCTGGTCG CTACGGAGAT ATTAACGAAT ACTTTGTTAA
TGATAAAACC TATACAAATA CATTTGCTAA AATTAGTAAA AATTATGCCA AGACTGTTGA
AAATGGTACT TACGAATATT TAACAGTGGT TGGGAAAACG AATGAAGAAA TCCCAGCTCT
TTCTAAAAAC AAAGGCTATA CTGTTCTAGA AAATACAGCA AACTTACAAG CCATTGAAGC
AGGTAATTAT GTCATGATGA ATACATGGAA TAATGACCAA GAAATTGCAG GACTGTATGC
GTATGATCCA ATGTCGGTTA TTTCAGAAAA AATTGATAAC GGTGTTTATC GCTTAACTCT
TGCGAATCCT TTACAAAATA ATGCATCCGT TTCTATTGAA TTTGATAAGG GCATTCTTGA
AGTAGTCGCA GOGGACCOAG AAATTTCTGT TGACCAAAAT ATTATCACTT TAAATAGTGC
GGGGTTAAAT GGCAGCTCGC GTTCAATCAT TGTTAAAACA ACTCCTGAAG TAACGAAAGA
AGCGTTAGAA AAATTAATTC AGGAACAAAA AGAACACCAA GAAAAAGACT ACACCGCAAG
CAGCTGGAAA GTCTACAGCG AAGCATTGAA ACAAGCACAA ACTGTGGCAG ATCAAACAAC
AGCAACGCAA GCAGAAGTAG ACCAAGCAGA AACAGAGTTA CGTTCGGCAG TGAAGCAATT
GGTAAAAGTG CCAACTAAAG AAGTAGATAA AACCAACTTG TTGAAAATCA TCAAAGAAAA
CGAGAAACAC CAAGAAAAAG ACTACACCGC AAGCAGTTGG AAAGTCTACA GTGAAGCATT
GAAGCAAGCG CAAACTGTGG CAGATCAAAC AACAGCAACG CAAGCAGAAG TAGACCAACC
AGAAGCAAAA CTACGTTCGG CAGTGAAGCG ATTAACATTG AAAAATAGTG GGGAAAATAA
AAAGGAGCAA AAAAATGGGG GGAATAATGG ACACTTAAAT ACTAGTACAG GAGTTGATCA
AACTGGTACG AAACAAGTTA AGCCATCAAC CCAAGGTGGT TTCAGAAAAG CTAGCCAATT
TTTACCGAGC ACAGGAGAAA AGAAATCGAT CGCGCTTGTG ATTATTGGTC TTCTAGTTAT
CGCCAGTGGG TGTCTTTTAG TTTTTCGTAA AAGTAAATCG AAGAAGTAA EF086-2 (SEQ
ID NO:326) LVGLANWFRA ALTDTLILLH DDLLNTDAEK LNKFTAPLML YAKDPNIQWP
IYRATGANLT DISITVLGTG LLLEDNQRLV QVQEAVPSVL KSVSSGDGLY PDGSLTQHGY
FPYNGSYGNE LLKGFGRIQT ILQGSDWEMN DPNISNLFNV VDKGYLQLMV NGKMPSMVSG
RSISRAPETN PFTTEFESGK ETIANLTLIA KFAPENLRIW IYTSIQTWLQ QSGSYYHFFK
KPRDFEALID LKNVVNSASP AQATPMQSLN VYGSMDRVLQ KNNEYAVGIS MYSQRVGNYE
FGNTENKKGW HTADGMLYLY NQDFAQFDEG YWATIDPYRL PGTTVDTREL ANGAYTGKRS
PQSWVGGSNN GQVASIGMFL DKSNEGMNLV AKKSWFLLDG QIINLGSGIT GTTDASIETI
LDNRMIHPQE VKLNQGSDKD NSWISLSAAX PLNNIGYVFP NSMNTLDVQI EERSGRYGDI
NEYFVNDKTY TNTFAKISKN YGKTVENGTY EYLTVVGKTN EEIAALSKNK GYTVLENTAN
LQAIEAGNYV MMNTWNNDQE IAGLYAYDPM SVISEKIDNG VYRLTLANPL QNNASVSIEF
DKGTLEVVAA DPEISVDQNI ITLNSAGLNG SSRSTIVKTT PEVTKEALEK LIQEQKEHQE
KDYTASSWKV YSEALKQAQT VADQTTATQA EVDQAETELR SAVKQLVKVP TKEVDKTNLL
KIIKENEKHQ EKDYTASSWK VYSEALKQAQ TVADQTTATQ AEVDQAEAKL RSAVKRLTLK
NSGENKKEQK NGGNNGHLNT STGVDQTGTK QVKPSSQCGF RKASQFLPST GEKKSIALVI
IGLLVIASGC LLVFRKSKSK K EF086-3 (SEQ ID NO:327) ACCAGAAAA
TTTAAGAAAT GACATTTATA CATCTATCCA AACGTGGCTT CAACAAAGTG GGTCATACTA
TCATTTCTTT AAAAAACCAA GACATTTTGA AGCGTTAATT GACTTGAAAA ATGTAGTGAA
TAGTGCGTCA CCTGCCCAAG CGACACCAAT GCAATCTTTA AATGTATATG GTTCGATGGA
TCGAGTCCTA CAGAAAAATA ACGAATATGC GGTGGGGATC AGTATGTATT CACAACGTGT
CGGAAACTAT CAATTTGCGA ATACGGAAAA TAAAAAAGGC TGGCATACAG CAGACGGCAT
GCTTTATTTA TACAATCAAG ACTTTGCTCA GTTTGATGAA GGATACTGGG CAACGATCGA
TCCATATCGA TTACCAGGAA CGACAGTTGA CACAAGAGAA TTGGCAAATG GTGCTTATAC
AGGGAAACGC AGTCCCCAGT CATGGGTAGG TGGCTCAAAT AAT EF086-4 (SEQ ID
NO:328) PENLEND IYTSIQTWLQ QSGSYYHFFK KPRDFEALID LKNVVNSASP
AQATPMQSLN VYGSMDRVLQ KNNEYAVGIS MYSQRVGNYE FGNTENKKGW HTADGMLYLY
NQDFAQFDEG YWATIDPYRL PGTTVDTREL ANGAYTGKRS PQSWVGGSNN EF087-1 (SEQ
ID NO:329) TAACTGGTGG GATTGGCAAA TTGGTTCCGC GCAGCGCTAA CAGATACATT
GATTTTATTA CATGATGACC TATTGAATAC AGATGCAGAA AAATTAAATA AATTTACTGC
TCCGCTGATG CTGTATGCAA AAGATCCAAA CATACAATGC CCAATTTATC GTGCAACAGG
AGCTAACTTA ACAGATATTT CAATCACCGT TTTAGGTACT GGACTTTTGT TAGAAGATAA
TCAACGCCTA GTACAAGTAC AAGAAGCTGT TCCGTCCGTT TTAAAAAGTG TTTCCTCTGG
TGATGGCTTA TATCCTGATG GTTCCTTGAT TCAACATGGT TATTTTCCGT ACAACGGCAG
TTACGGGAAT GAGTTGCTAA AAGCGTTTGG ACCAATTCAG ACTATTTTAC AAGGTTCCGA
CTGGGAGATG AATGACCCTA ACATTAGTAA TTTATTTAAT GTTGTGGATA AAGGTTACTT
ACAATTGATG GTAAATGGAA AAATGCCATC GATGGTTTCT GGTACAAGTA TTTCCAGAGC
GCCAGAAACG AATCCTTTTA CTACAGAGTT TGAATCGGGT AAAGAAACAA TAGCTAATTT
AACCTTAATT GCAAAATTTG CACCAGAAAA TTTAAGAAAT GACATTTATA CATCTATCCA
AACGTGGCTT CAACAAAGTG GGTCATACTA TCATTTCTTT AAAAAACCAA GAGATTTTGA
AGCGTTAATT GACTTGAAAA ATGTAGTGAA TAGTGCGTCA CCTGCCCAAG CGACACCAAT
GCAATCTTTA AATGTATATG GTTCGATGGA TCGAGTCCTA CAGAAAAATA ACGAATATGC
GGTGGGGATC AGTATGTATT CACAACGTGT CGGAAACTAT GAATTTGGGA ATACGGAAAA
TAAAAAAGGC TGGCATACAG CAGACGGCAT GCTTTATTTA TACAATCAAG ACTTTGCTCA
GTTTGATGAA GGATACTCGG CAACGATCGA TCCATATCGA TTACCAGGAA CGACAGTTGA
CACAAGAGAA TTGGCAAATG GTGCTTATAC AGGGAAACGC AGTCCCCAGT CATGGGTAGG
TGGCTCAAAT AATGGACAGG TTGCCTCTAT AGGAATGTTT TTAGATAAAA GTAATGAAGG
AATGAACTTA GTTGCTAAAA AATCTTGGTT CTTATTAGAT GGTCAAATCA TTAATTTGGG
AAGTGGCATT ACTGGTACGA CAGATGCTTC GATTGAAACA ATCCTCGATA ATCGGATGAT
TCATCCACAG GAAGTGAAGC TTAACCAACG TTCAGACAAA GATAATTCTT GGATTAGTTT
AAGCGCAGCG ANTCCATTGA ATAACATTCG CTATGTTTTT CCTAATTCNA TGAATACGCT
TGATGTTCAA ATAGAACAAC GCTCTGGTCG CTACGGAGAT ATTAACGAAT ACTTTGTTAA
TGATAAAACC TATACAAATA CATTTCCTAA AATTAGTAAA AATTATGGCA AGACTGTTGA
AAATGGTACT TACGAATATT TAACAGTGGT TGGGAAAACG AATGAAGAAA TCGCAGCTCT
TTCTAAAAAC AAAGGCTATA CTGTTCTAGA AAATACAGCA AACTTACAAG CCATTGAAGC
AGGTAATTAT GTCATGATGA ATACATGGAA TAATGACCAA GAAATTGCAG GACTGTATGC
GTATGATCCA ATGTCGGTTA TTTCAGAAAA AATTGATAAC GGTGTTTATC GCTTAACTCT
TGCGAATCCT TTACAAAATA ATGCATCCGT TTCTATTGAA TTTGATAAGG GCATTCTTGA
AGTAGTCGCA GCGGACCCAG AAATTTCTGT TGACCAAAAT ATTATCACTT TAAATAGTGC
GGGGTTAAAT GGCAGCTCGC GTTCAATCAT TGTTAAAACA ACTCCTGAAG TAACGAAAGA
AGCGTTAGAA AAATTAATTC AGGAACAAAA AGAACACCAA GAAAAAGACT ACACCGCAAG
CAGCTGGAAA GTCTACAGCG AAGCATTGAA ACAAGCACAA ACTGTGGCAG ATCAAACAAC
AGCAACGCAA GCAGAAGTAG ACCAAGCAGA AACAGAGTTA CGTTCGGCAG TGAAGCAATT
GGTAAAAGTG CCAACTAAAG AAGTAGATAA AACCAACTTG TTGAAAATCA TCAAAGAAAA
CGAGAAACAC CAAGAAAAAG ACTACACCGC AAGCAGTTGG AAAGTCTACA GTGAAGCATT
GAAGCAAGCG CAAACTGTCG CAGATCAAAC AACAGCAACG CAAGCAGAAG TAGACCAAGC
AGAAGCAAAA CTACGTTCGG CAGTGAAGCG ATTAACATTG AAAAATAGTG GGGAAAATAA
AAAGGAGCAA AAAAATGGGG GGAATAATGG ACACTTAAAT ACTAGTACAG GAGTTGATCA
AACTGGTACG AAACAAGTTA AGCCATCAAG CCAAGGTGGT TTCAGAAAAG CTAGCCAATT
TTTACCGAGC ACAGGAGAAA AGAAATCGAT CGCGCTTGTG ATTATTGGTC TTCTAGTTAT
CGCCAGTGGG TGTCTTTTAG TTTTTCGTAA AAGTAAATCG AAGAAGTAA EF087-2 (SEQ
ID NO:330) LVGLANWFRA ALTDTLILLH DDLLNTDAEK LNKFTAPLML YAKDPNTQWP
IYRATGANLT DISITVLGTG LLLEDNQRLV QVQEAVPSVL KSVSSGDGLY PDGSLIQHGY
FPYNGSYGNE LLKGFGRIQT ILQGSDWEMN DPNTSNLFNV VDKGYLQLMV NGKNPSMVSG
RSISRAPETN PFTTEFESGK ETIANLTLIA KFAPENLRND IYTSIQTWLQ QSGSYYHFFK
KPRDFEALID LKNVVNSASP AQATPMQSLN VYGSMDRVLQ KNNEYAVGIS MYSQRVGNYE
FGNTENKKGW HTADGMLYLY NQDFAQFDEG YWATIDPYRL PGTTVDTREL ANGAYTGKRS
PQSWVGGSNN GQVASIGMFL DKSNEGMNLV AKKSWFLLDG QIINLGSGIT GTTDASIETI
LDNRMIHPQE VKLNQGSDKD NSWISLSAAX PLNNTGYVFP NSMNTLDVQI EERSGRYGDI
NEYFVNDKTY TNTFAKISKN YGKTVENGTY EYLTVVGKTN EEIAALSKNK GYTVLENTAN
LQAIEAGNYV MMNTWNNDQE IACLYAYDPM SVISEKTDNG VYRLTLANPL QNNASVSIEF
DKGILEVVAA DPEISVDQNI ITLNSAGLNG SSRSIIVKTT PEVTKEALEK LIQEQKEHQE
KDYTASSWKV YSEALKQAQT VADQTTATQA EVDQAETELR SAVKQLVKVP TKEVDKTNLL
KIIKENEKHQ EKDYTASSWK VYSEALKQAQ TVADQTTATQ AEVDQAEAKL RSAVKRLTLK
NSGENKKEQK NGGNNGHLNT STGVDQTGTK QVKPSSQGGF RKASQFLPST GEKKSIALVI
IGLLVIASGC LLVFRKSKSK K EF087-3 (SEQ ID NO:331) A ATCGGATGAT
TCATCCACAG GAAGTGAAGC TTAACCAAGG TTCAGACAAA GATAATTCTT GGATTAGTTT
AAGCGCAGCG ANTCCATTGA ATAACATTGG CTATGTTTTT CCTAATTCNA TGAATACGCT
TGATGTTCAA ATAGAAGAAC GCTCTGGTCG CTACGGAGAT ATTAACGAAT ACTTTGTTAA
TGATAAAACC TATACAAATA CATTTGCTAA AATTAGTAAA AATTATGGCA AGACTGTTCA
AAATGGTACT TACGAATATT TAACAGTGGT TGGGAAAACG AATGAAGAAA TCGCAGCTCT
TTCTAAAAAC AAAGGCTATA CTGTTCTAGA AAATACAGCA AACTTACAAG CCATTGAAGC
AGGTAATTAT GTCATGATGA ATACATGCAA TAATGACCAA GAAATTGCAG GACTGTATGC
CTATGATCCA ATGTCGGTTA TTTCAGAAAA AATTGATAAC GGTGTTTATC GCTTAACTCT
TGCGAATCCT TTACAAAATA ATGCATCC EF087-4 (SEQ ID NO:332) NRI4IHPQE
VKLNQGSDKD NSWISLSAAX PLNNIGYVFP NSHNTLDVQI EERSGRYGDT NEYFVNDKTY
TNTFAKISKN YCKTVENGTY EYLTVVGKTN EEIAALSKNK GYTVLENTAN LQAIEAGNYV
MMNTWNNDQE IAGLYAYDPM SVISEKIDNG VYRLTLANPL QNNAS EF088-1 (SEQ ID
NO:333) TAACTGGTGG GATTGGCAAA TTGGTTCCGC GCAGCGCTAA CAGATACATT
GATTTTATTA CATGATGACC TATTGAATAC AGATGCAGAA AAATTAAATA AATTTACTGC
TCCGCTGATG CTGTATCCAA AAGATCCAAA CATACAATGG CCAATTTATC GTGCAACAGG
AGCTAACTTA ACAGATATTT CAATCACCGT TTTAGGTACT GGACTTTTGT TAGAAGATAA
TCAACGCCTA GTACAAGTAC AAGAAGCTGT TCCGTCCGTT TTAAAAAGTG TTTCCTCTGG
TGATGGCTTA TATCCTGATG GTTCCTTCAT TCAACATGGT TATTTTCCGT ACAACGGCAC
TTACGGGAAT GAGTTGCTAA AAGGGTTTGG ACGAATTCAG ACTATTTTAC AAGGTTCCGA
CTGGGAGATG AATGACCCTA ACATTAGTAA TTTATTTAAT GTTGTGGATA AAGGTTACTT
ACAATTGATG GTAAATGGAA AAATGCCATC GATGGTTTCT GGTAGAAGTA TTTCCAGAGC
GCCAGAAACG AATCCTTTTA CTACAGAGTT TGAATCGGGT AAAGAAACAA TAGCTAATTT
AACCTTAATT GCAAAATTTG CACCAGAAAA TTTAAGAAAT GACATTTATA CATCTATCCA
AACGTGGCTT CAACAAAGTG GGTCATACTA TCATTTCTTT AAAAAACCAA GAGATTTTGA
AGCGTTAATT GACTTGAAAA ATGTAGTCAA TAGTGCGTCA CCTGCCCAAG CGACACCAAT
GCAATCTTTA AATGTATATG GTTCGATGGA TCGAGTCCTA CAGAAAAATA ACGAATATGC
GGTGGGGATC AGTATGTATT CACAACGTGT CGGAAACTAT GAATTTGGGA ATACGGAAAA
TAAAAAAGGC TGGCATACAG CAGACGGCAT GCTTTATTTA TACAATCAAG ACTTTGCTCA
GTTTGATGAA GGATACTGGG CAACGATCCA TCCATATCGA TTACCAGGAA CGACAGTTGA
CACAAGAGAA TTGGCAAATG GTGCTTATAC AGGGAAACGC AGTCCCCAGT CATGGGTAGG
TGGCTCAAAT AATGGACAGG TTGCCTCTAT AGGAATGTTT TTAGATAAAA GTAATGAAGG
AATGAACTTA GTTGCTAAAA AATCTTGGTT CTTATTAGAT GGTCAAATCA TTAATTTGGG
AAGTGGCATT ACTOCTACCA CAGATGCTTC GATTGAAACA ATCCTCGATA ATCGGATGAT
TCATCCACAG GAAGTGAAGC TTAACCAAGG TTCAGACAAA GATAATTCTT GGATTAGTTT
AAGCGCAGCG ANTCCATTGA ATAACATTGG CTATGTTTTT CCTAATTCNA TGAATACGCT
TGATGTTCAA ATAGAAGAAC GCTCTGGTCG CTACGGAGAT ATTAACGAAT ACTTTGTTAA
TGATAAAACC TATACAAATA CATTTGCTAA AATTAGTAAA AATTATGGCA AGACTGTTGA
AAATGGTACT TACGAATATT TAACAGTGGT TGGGAAAACG AATGAAGAAA TCGCAGCTCT
TTCTAAAAAC AAAGGCTATA CTGTTCTAGA AAATACAGCA AACTTACAAG CCATTGAAGC
AGGTAATTAT GTCATGATGA ATACATGGAA TAATGACCAA GAAATTGCAG GACTGTATGC
GTATGATCCA ATGTCGGTTA TTTCAGAAAA AATTGATAAC GGTGTTTATC GCTTAACTCT
TGCGAATCCT TTACAAAATA ATGCATCCGT TTCTATTGAA TTTGATAAGG GCATTCTTGA
AGTAGTCGCA GCGGACCCAG AAATTTCTGT TGACCAAAAT ATTATCACTT TAAATAGTGC
GGGGTTAAAT GGCAGCTCGC GTTCAATCAT TGTTAAAACA ACTCCTCAAG TAACGAAAGA
AGCGTTAGAA AAATTAATTC AGGAACAAAA AGAACACCAA GAAAAAGACT ACACCCCAAG
CAGCTGGAAA GTCTACAGCG AAGCATTGAA ACAACCACAA ACTGTGGCAG ATCAAACAAC
AGCAACGCAA GCAGAAGTAG ACCAAGCAGA AACAGAGTTA CGTTCGGCAG TGAAGCAATT
GGTAAAAGTG CCAACTAAAG AAGTAGATAA AACCAACTTG TTGAAAATCA TCAAAGAAAA
CGAGAAACAC CAACAAAAAG ACTACACCGC AAGCAGTTGG AAAGTCTACA GTGAAGCATT
GAAGCAAGCG CAAACTGTGG CAGATCAAAC AACAGCAACG CAAGCAGAAG TAGACCAAGC
AGAACCAAAA CTACGTTCGG CAGTGAAGCG ATTAACATTG AAAAATAGTG GGGAAAATAA
AAAGGAGCAA AAAAATGGGG GGAATAATGG ACACTTAAAT ACTAGTACAG GAGTTGATCA
AACTGGTACG AAACAAGTTA AGCCATCAAG CCAAGGTGGT TTCAGAAAAG CTAGCCAATT
TTTACCGAGC ACAGGAGAAA AGAAATCGAT CGCGCTTGTG ATTATTGGTC TTCTAGTTAT
CGCCAGTGGG TGTCTTTTAG TTTTTCGTAA AAGTAAATCG AAGAAGTAA EF088-2 (SEQ
ID NO:334) LVGLANWFRA ALTDTLILLH DDLLNTDAEK LNKFTAPLML YAKDPNIQWP
IYRATGANLT DISITVLGTG LLLEDNQRLV QVQEAVPSVL KSVSSCDGLY PDGSLIQHGY
FPYNGSYGNE LLKGFGRIQT ILQGSDWENM DPNISNLFNV VDKGYLQLMV NGKMPSMVSG
RSISRAPETN PFTTEFESGK ETIANLTLIA KFAPENLRND IYTSIQTWLQ QSGSYYHFFK
KPRDFEALID LKNVVNSASP AQATPMQSLN VYGSMDRVLQ KNNEYAVGIS MYSQRVGNYE
FGNTENKKGW HTADGMLYLY NQDFAQFDEG YWATIDPYRL PGTTVDTREL ANGAYTGKRS
PQSWVGGSNN GQVASIGMFL DKSNEGMNLV AKKSWFLLDG QLLNLGSGIT GTTDASIETI
LDNRMIHPQE VKLNQGSDKD NSWISLSAAX PLNNIGYVFP NSHNTLDVQI EERSGRYGDT
NEYFVNDKTY TNTFAKISKN YGKTVENGTY EYLTVVGKTN EEIAALSKNK GYTVLENTAN
LQAIEAGNNV MMNTWNNDQE IAGLYAYDPM SVISEKIDNG VYRLTLANPL QNNASVSIEF
DKGILEVVAA DPEISVDQNI ITLNSAGLNG SSRSIIVKTT PEVTKEALEK LIQEQKEHQE
KDYTASSWKV YSEALKQAQT VADQTTATQA EVDQAETELR SAVKQLVKVP TKEVDKTNLL
KIIKENEKHQ EKDYTASSWK VYSEALKQAQ TVADQTTATQ AEVDQAEAKL RSAVKRLTLK
NSGENKKEQK NGGNNGHLNT STGVDQTGTK QVKPSSQGGF RKASQFLPST GEKKSIALVI
IGLLVIASGC LLVFRKSKSK K EF088-3 (SEQ ID NO:335) A ACTCCTGAAG
TAACGAAAGA AGCGTTAGAA AAATTAATTC AGCAACAAAA AGAACACCAA GAAAAACACT
ACACCGCAAG CAGCTGGAAA GTCTACAGCG AAGCATTGAA ACAAGCACAA ACTGTGGCAG
ATCAAACAAC AGCAACGCAA GCAGAAGTAG ACCAAGCAGA AACAGAGTTA CGTTCGGCAG
TGAAGCAATT GGTAAAAGTG CCAACTAAAG AAGTAGATAA AACCAACTTG TTGAAAATCA
TCAAAGAAAA CGAGAAACAC CAAGAAAAAG ACTACACCGC AAGCAGTTGG AAAGTCTACA
GTGAAGCATT GAAGCAAGCG CAAACTGTGG CAGATCAAAC AACAGCAACG CAAGCAGAAG
TAGACCAAGC AGAAGCAAAA CTACGTTCGG CAGTGAAGCG ATTAACATTG AAAAATAGTG
GGGAAAATAA AAAGGAGCAA AAAAATGGGG GGAATAATGG ACACTTAAAT ACTAGTACAG
GAGTTGATCA AACTGGTACG AAACAAGTTA AGCCATCAAG CCAAGGTGGT TTCAGAAAAG
CTAGCCAATT TTTACCGAGC ACAGGAGAAA AGAAA EF088-4 (SEQ ID NO:336) T
PEVTKEALEK LIQEQKEHQE KDYTASSWKV YSEALKQAQT VADQTTATQA EVDQAETELR
SAVKQLVKVP TKEVDKTNLL KIIKENEKHQ EKDYTASSWK VYSEALKQAQ TVADQTTATQ
AEVDQAEAKL RSAVKRLTLK NSGENKKEQK NGGNNGHLNT STGVDQTGTK QVKPSSQGGF
RKASQFLPST GEKK EF089-1 (SEQ ID NO:337) TGACAGATAC ACCTGCTAAC
ACAGGAAACT AAGAACGACA GCATACACGC AAGATCGGGA TATAGGTCAA AAATTTTTTG
GCTTATCTTT CGGTCTTTTG GTGCTTATAA TACAACAAAG AATGACAGAC ATAGGAGAAT
GAATATGAAC AGATGGAAAG TATATGCAAC GGTAATCGCT TGTATGTTAT TTGGCTGGAT
TGGCGTGGAG GCGCACGCTT CTGAATTTAA TTTTGCGGTC ACACCAACAA TTCCCGAAAA
TCAAGTGGAT AAATCAAAAA CCTACTTTGA CTTAAAAATG GCGCCTCGTG CCAAACAAAC
CGTAGAAATT CAGTTACGCA ATGATACAGA TGAAGACATT ACCATTGAAA ATACGGTGAA
CTCAGCGACA ACAAATTTAA ATGGCGTAGT AGAATATGGC CAAAACGGGA TCAAACCTGA
CAAAACCTTA CGTTTTAACT TAAAAGATTA TGTGGAAGCA CCGAAAGAAA TCATCTTGCC
GAAGCATTCC CAAAAGACCT TACCTTTAAC CATTACGATG CCTAAAGATT CTTTTGATGG
CGTGATGGCT GGCGGTATAA CACTCAAAGA GAAAAAGAAA GAAACAACGA CTTCTGCGGA
TCAATCAAAA GGGTTAGCTA TTAATAATGA ATACTCCTAT GTTGTGGCTA TTATTCTTCA
GCAAAATGAG ACAAAGGTTC AACCAGATTT AAAATTACTG GGGGTTAAAC CAGGCCAAGT
CAACGCGCGA AACGTCATCA ATGTTTCTTT ACAAAACCCA CAAGCGGCCT ATTTAAACCA
ATTACATTTA ATCAACACTG TTTCAAAAGG AGGCGAAACG CTTTACCAAT CCGATACTCA
GGATATGCAA GTGGCGCCAA ACTCTAACTT TAGTTACCCA ATTTCTTTAA AAGGGGAACG
ATTAACGCCA GGAAAATATG TCTTGAAATC AACCGCCTAT GGTGTAAAAG ATGAAAAGGG
CACCTATCAA GTCAAAGGCG CCAATGGTGA AGAACGGTAC CTGTACAAAT GGGAATTTAC
AAAAGAATTT ACTATTTCTG GGGACGTCGC TAAAGAATTA AATGAAAAAG ACGTAACCAT
TAAAGGAACC AATTGGTGGT TGTATCTACT GATTGCATTA ATCATTCTAG CGCTGCTCTT
ATTGATTTTC TTCTTGTATC GTAAAAAGAA AAAAGAGGAA GAACAACAAT CTGAGCAATA A
EF089-2 (SEQ ID NO:338) HNR WKVYATVTAC MLFGWTGVEA HASEFNFAVT
PTTPENQVDK SKTYFDLKMA PGAKQTVEIQ LRNDTDEDIT IENTVNSATT NLNGVVEYGQ
NGIKPDKTLR FNLKDYVEAP KETILPKHSQ KTLPLTTTMP KDSFDGVMAG GITLKEKKKE
TTTSADQSKG LAINNEYSYV VAITLQQNET KVQPDLKLLG VKPCQVNARN VINVSLQNPQ
AAYLNQLHLI NTVSKGGETL YQSDTEDMQV APNSNFSYPI SLKGERLTPG KYVLKSTAYG
VKDEKGTYQV KGANGEERYL YKWEFTKEFT ISGDVAKELN EKDVTIKGTN
WWLYLLIALI
ILALLLLIFF LYRKKKKEEE QQSEQ EF089-3 (SEQ ID NO:339) T CTGAATTTAA
TTTTGCGGTC ACACCAACAA TTCCCGAAAA TCAAGTGGAT AAATCAAAAA CCTACTTTGA
CTTAAAAATG GCGCCTGGTG CCAAACAAAC CGTAGAAATT CAGTTACGCA ATGATACAGA
TGAAGACATT ACCATTGAAA ATACGGTGAA CTCAGCGACA ACAAATTTAA ATGGCGTAGT
AGAATATGGC CAAAACGGGA TCAAACCTGA CAAAACCTTA CGTTTTAACT TAAAAGATTA
TGTGGAAGCA CCGAAAGAAA TCATCTTGCC GAAGCATTCC CAAAAGACCT TACCTTTAAC
CATTACGATG CCTAAAGATT CTTTTGATGG CGTGATGGCT GCCGGTATAA CACTCAAAGA
GAAAAAGAAA GAAACAACGA CTTCTGCGGA TCAATCAAAA GGGTTAGCTA TTAATAATGA
ATACTCCTAT GTTGTGGCTA TTATTCTTCA GCAAAATGAG ACAAAGCTTC AACCAGATTT
AAAATTACTG GGGGTTAAAC CAGGCCAAGT CAACGCGCGA AACGTCATCA ATGTTTCTTT
ACAAAACCCA CAAGCGGCCT ATTTAAACCA ATTACATTTA ATCAACACTG TTTCAAAAGG
AGGCGAAACG CTTTACCAAT CCGATACTGA GGATATGCAA GTGGCGCCAA ACTCTAACTT
TACTTACCCA ATTTCTTTAA AAGGGGAACG AT EF089-4 (SEQ ID NO:340)
SEFNFAVT PTIPENQVDK SKTYFDLKMA PGAKQTVEIQ LRNDTDEDIT TENTVNSATT
NLNGVVEYGQ NGIKPDKTLR FNLKDYVEAP KEIILPKHSQ KTLPLTITMP KDSFDGVMAG
GITLKEKKKE TTTSADQSKG LAINNEYSYV VAIILQQNET KVQPDLKLLG VKPGQVNARN
VINVSLQNPQ AAYLNQLHLI NTVSKGGETL YQSDTEDMQV APNSNFSYPI SLKGER
EF090-1 (SEQ ID NO:341) TAGTCTCTAA GAAATAAACC TAAAATTATT GATATAAAGG
ATGAACAAAT GAAAAAAGAA GAAATGCAAA TGCGTAATAC ACGTCGTCAA AAATCAGGAA
AAAATAATAA AAAGAAAGTA ATTATTACTT CTTTGGTTGG ACTAGCTCTG GTTGCTGGGG
GCAGTTATGT TTATTTTCAA AGTCACTTTT TNCCAACCAC AAAAGTAAAT GGAGTTTCTG
TAGGCTGGTT AAATCTAAAT GCTGCAGAAG AAAAATTAGC GCAAGTTAAT CAAACCGAAG
AAGTTGTGGT TCAAACGGGG ACAAAAGAAG AAAAAATTCA ACTTCCTAAA AAATACCAAT
TGGATCAAAA ATTTTTAAAA GACCATTTAC ACAGTAGCAA GGTGAAGCTA CCGTTAAACG
AGGCATTCAA AAAAGAACTA GAAGCCAAAT TAGCAACTTT GAGTTTTCCA GAGGGGAAAC
CAAGCAAAAA TGCGAGTATC CGTCGAGGCA ATGGCACTTT TGAAATTGTT CCCGAAGAAC
AAGGCACAGT AGTGGACACA CAGCGCTTAA ACCAGCAGAT TATTGCGGAT GTTGAAGCGG
GAAAAGGCAA CTATCAATAT AATGCCAAAG ATTTTTATAA AGCCCCTGAA ATTACAAAAG
AGGATCAAAC GTTAAAGGCA ACATTGACAA CGCTCAATAA CAAGTTAAAT AAAACAATTA
CAGTTGATAT TAATGGTGAA AAAGTAGCCT TTGATAAAAC ACAAATTCAA AACGTGCTGA
ATGATGATGG CACAATCAAC AAAGAAAAAC TAACTACTTG GGTGACACAA TTAGAAACAA
CATATGGTTC TGCTAATCAA CCAGTTTTAT TTACAGATGT TCACGGCACG ACACGTCGTT
TTAAAAACAA CGGAAGTTAT GGCTGGTCGA TTGATGCGGC CAAAACGCAA GAACTACTAG
TAAACGCGCT GAATAGCCAA GAACAAACGA ATGCAATCAC TGCTCCGTTG GTTGGTGATA
CCAAAGAAAA TAGTAAAATT GCCAATAATT ACATTGAAAT TGATTTAAAA GATCAAAAAA
TGTATTGTTT CATTGATGGC AAAAAAATAG TCACCACAGA TGTCATTACT GGCAGATATA
ACAAAGGAAC CGCAACAGTA CCAGGATTCC ATACAATTTT ATATCGGACA ACCGATGTGA
ATTTAGAAGG TCAAATGCTT GATGGTTCTC GATACAGTGT GCCAGTAAAA TATTGGATGC
CGTTATTAAG TCAAGGGGCC GTTCTCACAC AAATCGGGAT TCATCACTCC GACCATAAAT
TCGATAAGTA TGOCGATAAA GAAGCCTTTA AAACCGATGC TGGTAGTAAT GGCTGTATCA
ATACGCCAGC AACAGAAGTT TCAAAAATCT TTGATGTATC CTATGACGGA ATGCCGGTAA
TTATTTATGG ACATATCTAT GATGATGCAC CAGGTGAATT TGATAAACCT GTAGATTACG
GCGAAGAAGT ATAA EF090-2 (SEQ ID NO:342) MRNTRRQK SGKNNKKKVI
TTSLVGLALV AGCSYVYFQS HFXPTTKVNG VSVGWLNVNA AEEKLAQVNQ TEEVVVQTGT
KEEKIQLPKK YQLDQKFLKD HLHSSKVKLP LNEAFKKELE AKLATLSFPE GKPSKNASIR
RGNGTFEIVP EEQGTVVDTQ RLNQQITADV EAGKGNYQYN AKDFYKAPET TKEDQTLKAT
LTTLNNKLNK TITVDINGEK VAFDKTQIQN VLNDDGTINK EKLTTWVTQL ETTYGSANQP
VLFTDVHGTT RRFKNNGSYG WSTDGAKTQE LLVNALNSQE QTNAITAPLV GDTKENSKIA
NNYTEIDLKD QKMYCFIDGK KIVTTDVITG RYNKGTATVP GFHTILYRTT DVNLEGQMLD
GSRYSVPVKY WMPLLSQGGV VTQIGIHDSD HKLDKYGDKE AFKTDAGSNG CINTPGTEVS
KIFDVSYDGM PVIIYGHIYD DAPGEFDKPV DYGEEV EF090-3 (SEQ ID NO:343) CAC
AAAAGTAAAT CGAGTTTCTG TACGCTGGTT AAATGTAAAT GCTGCAGAAC AAAAATTACC
GCAAGTTAAT CAAACCGAAC AAGTTGTGGT TCAAACGGGC ACAAAAGAAG AAAAAATTCA
ACTTCCTAAA AAATACCAAT TGCATCAAAA ATTTTTAAAA GACCATTTAC ACAGTAGCAA
GGTGAAGCTA CCGTTAAACG AGGCATTCAA AAAAGAACTA CAAGCCAAAT TAGCAACTTT
GAGTTTTCCA GAGGGGAAAC CAAGCAAAAA TGCGAGTATC CGTCGAGCCA ATGGCACTTT
TGAAATTCTT CCCGAAGAAC AAGGCACAGT AGTGGACACA CAGCGCTTAA ACCAGCAGAT
TATTGCGGAT GTTGAACCGG GAAAAGCCAA CTATCAATAT AATGCCAAAC ATTTTTATAA
AGCCCCTGAA ATTACAAAAC ACCATCAAAC GTTAAAGGCA ACATTGACAA CGCTCAATAA
CAAGTTAAAT AAAACAATTA CAGTTCATAT TAATGGTGAA AAAGTAGCCT TTGATAAAAC
ACAAATTCAA AACGTGCTGA ATCATGATGG CACAATCAAC AAACAAAAAC TAACTACTTG
GGTGACACAA TTAGAAACAA CATATGGTTC TGCTAATCAA CCAGTTTTAT TTACAGATCT
TCACGCCACG ACACGTCGTT TTAAAAACAA CGCAACTTAT GGCTGGTCGA TTGATGGGGC
CAAAACGCAA GAACTACTAG TAAACGCGCT GAATAGCCAA CAACAAACGA ATGCAATCAC
TGCTCCGTTG GTTGGTGATA CCAAACAAAA TAGTAAAATT GCCAATAATT ACATTGAAAT
TGATTTAAAA GATCAAAAAA TGTATTGTTT CATTGATGGC AAAAAAATAG TCACCACAGA
TGTCATTACT CGCAGATATA ACAAAGGAAC CGCAACAGTA CCAGGATTCC ATACAATTTT
ATATCGGACA ACCGATGTGA ATTTAGAAGG TCAAATGCTT CATGGTTCTC GATACAGTGT
GCCAGTAAAA TATTGGATGC CGTTATTAAG TCAAGGGGGC GTTGTCACAC AAATCGGGAT
TCATGACTCC GACCATAAAT TGGATAACTA TGCCGATAAA GAAGCCTTTA AAACCGATGC
TGGTAGTAAT GGCTGTATCA ATACGCCAGG AACAGAAGTT TCAAAAATCT TTGATGTATC
CTATGACGGA ATGCCGGTAA TTATTTATGC ACATATCTAT GATGATGCAC CACGTGAATT
TGATAAACCT GTAGATTACG GCGAAGAAGT AT EF090-4 (SEQ ID NO:344) TKVNG
VSVGWLNVNA AEEKLAQVNQ TEEVVVQTGT KEEKIQLPKK YQLDQKFLKD HLHSSKVKLP
LNEAFKKELE AKLATLSFPE GKPSKNASIR RCNGTFEIVP EEQGTVVDTQ RLNQQIJADV
EAGKGNYQYN AKDFYKAPEI TKEDQTLKAT LTTLNNKLNK TITVDINGEK VAFDKTQIQN
VLNDDGTINK EKLTTWVTQL ETTYGSANQP VLFTDVHGTT RRFKNNGSYG WSIDGAKTQE
LLVNALNSQE QTNAITAPLV CDTKENSKIA NNYIEIDLKD QKMYCFIDGK KIVTTDVITG
RYNKGTATVP GFHTILYRTT DVNLEGQMLD GSRYSVPVKY WMPLLSQGCV VTQIGIHDSD
HKLDKYGDKE AFKTDAGSNG CINTPGTEVS KIFDVSYDGM PVIIYGHIYD DAPGEFDKPV
DYGEEV EF091-1 (SEQ ID NO:345) TAATTGGNGC AGATTTTTAT GGCTAAAAAA
GGCGGATTTT TCTTAGGNGC AGTAATTGGT GGAACAGCAG CAGCCGTTGC CGCATTATTA
CTTGCACCAA AATCAGGTAA AGAATTACGT GATGATTTAT CAAATCAAAC AGATGATTTA
AAAAACAAAG CGCAAGATTA CACAGATTAT GCTGTTCAAA AAGGAACAGA ATTAACAGAA
ATCGCAAAAC AAAAAGCCGG CGTTTTATCA GATCAAGCCT CTGATTTGGC AGGTTCTGTC
AAAGAAAAAA CAAAAGATTC ATTGGATAAA GCACAAGGTG TTTCTGGCGA CATGCTTGAT
AACTTTAAAA AACAAACAGG TGATTTATCT GATCAATTTA AAAAAGCAGC TGACGATGCT
CAAGATCACG CAGAAGATTT AGGTGAAATT GCCGAAGATG CAGCAGAAGA TATCTATATT
GACGTTAAAG ATTCTGCGGC AGCGGCCAAA GAAACTGTTT CTGCTGGTGT CGATGAAGCA
ANAGAAACCA CCAAAGATGT TCCTGAAAAA GCTGCAGAAG CAAAAGAAGA TGTTAAAGAT
GCAGCGAAAG ACGTAAAAAA AGAATTTAAA GGGTAA EF091-2 (SEQ ID NO:346)
MAKKG GFFLGAVIGG TAAAVAALLL APKSGKELRD DLSNQTDDLK NKAQDYTDYA
VQKGTELTEI AKQKAGVLSD QASDLAGSVK EKTKDSLDKA QGVSGDMLDN FKKQTGDLSD
QFKKAADDAQ DHAEDLGEIA EDAAEDIYID VKDSAAAAKE TVSAGVDEAX ETTKDVPEKA
AEAKEDVKDA AKDVKKEFKG EF091-3 (SEQ ID NO:347) AT CAAATCAAAC
AGATGATTTA AAAAACAAAG CGCAAGATTA CACAGATTAT GCTGTTCAAA AAGGAACAGA
ATTAACAGAA ATCGCAAAAC AAAAAGCCGG CGTTTTATCA GATCAAGCCT CTGATTTGGC
AGGTTCTGTC AAAGAAAAAA CAAAAGATTC ATTGGATAAA GCACAAGGTG TTTCTGGCGA
CATGCTTGAT AACTTTAAAA AACAAACAGG TGATTTATCT GATCAATTTA AAAAAGCAGC
TGACGATGCT CAAGATCACG CAGAAGATTT AGGTGAAATT GCCGAAGATG CAGCAGAAGA
TATCTATATT GACGTTAAAG ATTCTGCGGC AGCGGCCAAA GAAACTGTTT CTGCTGGTGT
CGATGAAGCA ANAGAAACCA CCAAAGATGT TCCTGAAAAA GCTGCAGAAG CAAAAGAAGA
TGTTAAAGAT GCAGCGAAAG ACGTAAAAAA AGAATTTAAA GGGTAA EF091-4 (SEQ ID
NO:348) SNQTDDLK NKAQDYTDYA VQKGTELTEI AKQKAGVLSD QASDLAGSVK
EKTKDSLDKA QGVSGDMLDN FKKQTGDLSD QFKKAADDAQ DHAEDLGEIA EDAAEDIYID
VKDSAAAAKE TVSAGVDEAX ETTKDVPEKA AEAKEDVKDA AKDVKKEFKG EF092-1 (SEQ
ID NO:349) TAAGGGGATG AAGAAAAAAT GGCAAAAAAA ACAATTATGT TAGTTTGTTC
CGCAGGAATG AGCACGAGTT TATTAGTAAC AAAAATGCAA AAAGCAGCAG AAGATCGTGG
CATGGAAGCA GACATCTTTG CAGTATCGGC TTCTGAAGCA GATACAAACT TGGAAAATAA
AGAGGTGAAT GTTTTACTTT TAGGTCCACA AGTTCGTTTC ATGAAAGGGC AATTTGAACA
AAAATTACAA CCAAAAGGGA TTCCTTTAGA TGTAATTAAC ATGGCAGATT ATGGCATGAT
GAATGGCGAA AAAGTTTTAG ATCAAGCAAT CTCATTAATG GGATAA EF092-2 (SEQ ID
NO:350) MAKKT IMLVCSAGMS TSLLVTKNQK AAEDRGMEAD TFAVSASEAD
TNLENKEVNV LLLGPQVRFM KGQFEQKLQP KGIPLDVINM ADYGMMINGEK VLDQAILSLMG
EF092-3 (SEQ ID NO:351) AG AAGATCGTGG CATGGAAGCA GACATCTTTG
CAGTATCGGC TTCTGAAGCA GATACAAACT TGGAAAATAA AGAGGTGAAT GTTTTACTTT
TAGGTCCACA AGTTCGTTTC ATGAAAGGGC AATTTGAACA AAAATTACAA CCAAAAGGGA
TTCCTTTAGA TGTAATTAAC ATGGCAGATT ATGGCATGAT GAATGGCGAA AAAGTTTTAG
ATCAAGCAAT CTCATTAATG GGAT EF092-4 (SEQ ID NO:352) EDRGMEAD
IFAVSASEAD TNLENKEVNV LLLGPQVRFM KGQFEQKLQP KGIPLDVINM ADYGMMNGEK
VLDQAISLMG EF093-1 (SEQ ID NO:353) TAGTTTTTTT CCGATAAAGG GAGAATTTTA
ATGAGGCAAA AATATTCAGG AAACTTATTG TTCACGGCCA TGGCCATTGT TTATTTGATG
ACTTTTCTCG CCCTTCAGTT ACTAGAAGAA CGTCAGTTAA CACAAAAATT TACGCAAGCT
ACCCAGGAAT ACTATGCAGG GAAAAGTATC TTTCATTTAT TTCTTGCAGA TGTTAAACAA
AATAGACGAA AGTTAAAAAC AGAAGAAAGG CTCGTATACG CGCAACTGAC COTOGATTAT
ACATACAAAA ATGAACAATT AAGAATAACT GTTTTATTAA ACAAATCTGG TCGAAAATAC
CAATATCAAG AGAGAGTTTC TCATCAAAAA AAAGCGGAAA CAATACTGGA ATAG EF093-2
(SEQ ID NO:354) M RQKYSGNLLF TAMAIVYLMS FLALQLLEER QLTQKFTQAT
QEYYAGKSIF HLFLADVKQN RRKLKTEERL VYAQVTLDYT YKNEQLRITV LLNKSGRKYQ
YQERVSHQKK AETILE EF093-3 (SEQ ID NO:355) CCTTCAGTT ACTAGAAGAA
CGTCAGTTAA CACAAAAATT TACGCAAGCT ACCCAGGAAT ACTATGCAGG GAAAAGTATC
TTTCATTTAT TTCTTGCAGA TGTTAAACAA AATAGACGAA AGTTAAAAAC AGAAGAAAGG
CTCGTATACG CGCAAGTGAC CCTCGATTAT ACATACAAAA ATGAACAATT AAGAATAACT
GTTTTATTAA ACAAATCTGG TCGAAAATAC CAATATCAAG AGAGAGTTTC TCATCAAAAA
AAAGCCGAAA CAATACTGG EF093-4 (SEQ ID NO:356) LQLLEER QLTQKFTQAT
QEYYAGKSJIF HLFLADVKQN RRKLKTEERL VYAQVTLDYT YKNEQLRITV LLNKSCRKYQ
YQERVSHQKK AETI EF094-1 (SEQ ID NO:357) TAAACATTTG AGACATTCAG
AGGTGAATGT CTCTTTTTTA TTACTCAAAA ACGAAAGGGG ATTAATTATA TGAAAAAAAC
AACATTTAAA AATTGGTCGT TATTTGCGAC TTTGGCTCTA TTAAGTCAAA CAATTGGCGG
AACGATTGGT CCTACGATTG CTTTTGCCGA TGAAATTACT CACCCTCAAG AGGTAACAAT
TCATTATGAC GTAAGTAAAC TGTATGAAGT TGACGGAACT TTTAGCGATG GCAGCACCCT
CTCAGAACGT ACTACGTCAT TATATGCAGA ATACAATGGT GCAAAACAAA CAGTATTTTG
TATTGAACCA GGTGTTAGTA TTCCAACAGA AGTGACGCAC GGTTATCAGA AAAACCCTTT
GCCATCAATG TCTGATAAAG CGAAACTAGT ATCGGTTCTT TGGGAAAAGC CTGGAACAGA
TATTOATACA AATATGGTTG CACAAAAGAT GATTTGGGAA GAAGTGAACG GTTATAAACT
CCATTCCATA AAAAGATTAG GTGGTGCTTC AGTTGATATA AAATCTATTG AAGGAAAAAT
TAATAAGGCA ATTGAGGAGT ATCAXAAAAA ACCAAGTTTT CATAATACCA CTGTAAAAAC
AATTTTAGGT CAATCGACAA CTTTAATAGA TAAAAATGAA TTAAATTTAT CTGAGTTTGA
TAAAGTCGTC CAAAATACGG CGAATATAGA TTACCGTGTA ATTCGGAATC AATTAGTGCT
TACTCCAAAC TCTAATTCCA AATCAGGAAC ATTAACATTG AAAAAATCAG CTGGTACTGG
AACTCCAGTC GCTTATAAAA AAGCAGGACT TCAAACTGTG ATGGCTGGTG CGCTTGATAA
GCCCAATACC TACGCTATTA AAATTAATGT GGAAACTAAG GGTTCTTTAA AGATCAAAAA
AATCGATAAA GAATCAGGTG ATATTGTACC AGAAACGGTT TTCCATTTAG ATTTTGGGAA
AGCTTTACCT TCAAAAGATG TGACAACAGA TAAAGATGGC ATTTCTATTT TGGATGGAAT
TCCCCATGGT ACAAAGGTAA CTATTACTGA AAAATCGGTG CCAGATCCTT ATATGATTCA
TACCACACCC ATGGCTGCCA CCATTAAAGC GGGCGAGACC ATTTCCATGA CTTCGAAAAA
TATGCGACAA AAAGGTCAAA TTCTTTTAGA GAAGACTGGG GTAGAAACAG GTACTGATCT
TTGGAATGAC AATTATTCTC TAGCTGGAAA TACATTTGCC ATTCGTAAAG ACAGCCCAGC
TGGTGAAATT GTCCAAGAAA TAACAACGGA TGAAAAAGGT CGTGCGGAAA CACCAAAAGA
GCTTGCTAAT GCTTTGGAAC TGGGAACCTA TTACGTGACA GAAACTAAAT CTAGTAATGG
TTTCGTGAAT ACCTTCAAAC CAACAAAAGT CGAGTTAAAA TATGCCAATC AAACCGTGGC
TCTTGTTACC AGTAACGTAA AAGGGCAAAA CCAAGAAATT ACTGGGGAAA CCACTTTGAC
AAAAGAAGAC AAAGATACCG GTAATGAGAG TCAACGGAAA GCTCAGTTTA AAGGAGCTGA
ATATACTCTC TTTACTGCAA AAGATGGTCA AGCTGTTAAA TGGAGTGAAG CTTTTAAAAC
AGAATTAGTG AAGGGAACGA AAGCTTCTGA TGAAACAGTG ACTTTGGCTT TAGATGAAAA
GAACCAAGTT GCCGTTAAAC ACCTAGCAAT TAACGAGTAT TTCTGGCAAG AAACCAAAGC
ACCTGAAGGA TATACTTTGG ATGAAACGAA GTATCCTGTA TCCATCAAAA AAGTTGATAA
TAACGAAAAA AATGCCGTAA TTACTCGAGA TGTTACGGCA AAAGAACAAG TTATTCGCTT
TGGCTTTGAT TTCTTTAAAT TTGCTGCATC GGCTGATGGC ACTGCCGAAA CTGGATTTAA
CGACTTATCT TTTAAAGTGT CGCCATTGGA AGGGACCAAN GAAATCACAG GTCCTGAAGA
TAAAGCGACC ACAGCTTGTA ACGAGCAATT AGGTTTTGAT GGCTATGGTA AGTTTGAAAA
TCTTCCTTAT GCGGATTATT TACTTGAAGA AATAGAGGCT CCAGAAGGAT TTCAAAAGAT
TACACCACTA GAAATCCGTT CTACATTTAA GGAAAACAAA GACGACTATG CGAAGAGTGA
GTATGTCTTT ACCATTACCG AAGAAGGACA AAAACAACCA ATTAAGATGG TGACCGTTCC
TTACGAGAAA CTAACTAACA ACGAGTTTTC TGTTAGTCTG AACCGTTTGA TGCTTTATGA
TTTCCCCGAG AAAGAAGATA GTTTGACTTC TCTTGCGACT TGGAAAGACG GAAATAAAAA
ATTGAATACC CTTGATTTTA CCGACCTAGT TGATAAATTG AGATATAACT TGCATGAAAT
CAAAGAAGAC TGGTATGTCG TAGCTCAAGC CATTGATGTG GAAGCCACAA AAGCTGCCCA
AGAAAAAGAC GAAAAAGCCA AACCGGTGGT GATTGCCGAA ACAACCGCAA CGTTGGCGAA
CAAAGAGAAA ACTGGAACTT GGAAAATTCT GCATAAATTA ACCGCTGAAC AAGTTTTGGA
TAAAAGCATC GTCTTGTTCA ATTATGTGTA TGAAAACAAG GTAGCCTTTG AAGCAGGCAA
TGAGCCAGTA GCGAAGGATG CTAGCTTGAA CAATCAAGCA CAAACCGTCA ATTGTACGAT
TGAACGCCAT GTTTCCATCC AAACAAAAGC CCACCTAGAA GATGGTTCGC AAACTTTTAC
TCATGGTGAC GTGATGGATA TGTTTGATGA TGTGTCGGTT ACCCATGATG TACTGGATGG
CTCAAAAGAA GCTTTCGAAA CAATTCTGTA TGCTTTACTA CCAGATGGTA CGAACAAAGA
AATTTGGAAA TCTGGCAAAA TTGAGCATGA AGTGAATGAT AAAGAATTTA CCAAAACCGT
ACTTGCGGAA AAAGTAGATA CCGGAAAGTA TCCAGAAGGA ACTAAGTTTA CTTTTACCGA
AATCAATTAC GAAAAAGATG GAAACGTGAA TGGAAAACAC AATGAAGATT TGAAAGAAAA
ATCTCAAACC TTAACACCAA AAGAAGTGCC AACCATACCG AGTACGCCAA AACAACCGGA
AACACCAGCT GTTCCAAGTA ATTCTCAAGA ATCTAGTCCC ACAGTGAAGA CATTCCCGCA
AACTGGGGAG AAAAATTCCA ACGTTCTACT GTTAGTTGGC TTTATCTTGA TTTTTTCGAC
TGCTGGGTAT TATTTCTGGA ATCGCCGCAA TTAA EF094-2 (SEQ ID NO:358)
MKKTTFKN WSLFATLALL SQTIGGTIGP TIAFADEITH PQEVTIHYDV SKLYEVDGTF
SDGSTLSERT TSLYAEYNGA KQTVFCIEPG VSIPTEVTHG YQKNPLPSMS DKAKLVSVLW
EKAGTDTDTN MVAQKNIWEE VNGYKLHSIK RLGGASVDIK SIEGKINKAI EEYQKKPSFH
NTTVKTILGQ STTLIDKNEL NLSEFDKVVQ NTANIDYRVI GNQLVLTPNS NSKSGTLTLK
KSAGTGTPVA YKKAGLQTVM AGALDKPNTY AIKINVETKG SLKIKKIDKE SGDIVPETVF
HLDFGKALPS KDVTTDKDGI SILDGIPHGT KVTITEKSVP DPYMTDTTPM AATIKAGETT
SMTSKNMRQK GQILLEKTGV ETGTDLWNDN YSLAGNTFAI RKDSPAGEIV QEITTDEKGR
AETPKELANA LELGTYYVTE TKSSNGFVNT FKPTKVELKY ANQTVALVTS NVKGQNQEIT
GETTLTKEDK DTGNESQGKA EFKGAEYTLF TAKDGQAVKW SEAFKTEIVK GTKASDETVT
LALDEKNQVA VKHLAINEYF WQETKAPEGY TLDETKYPVS IKKVDNNEKN AVITRDVTAK
EQVIRFGFDF FKFAGSADGT AETGFNDLSF KVSPLEGTXE TTGAEDKATT ACNEQLGFDG
YGKFENLPYG DYLLEETEAP EGEQKITPLE IRSTFKENKD DYAKSEYVPT ITEEGQKQPI
KNVTVPYEKL TNNEFSVSLN RLMLYDLPEK EDSLTSLATW KDGNKKLNTL DFTELVDKLR
YNLHEIKEDW YVVAQAIDVE ATKAAQEKDE KAKPVVIAET TATLANKEKT GTWKILHKLT
AEQVLDKSIV LFNYVYENKV AFEAGNEPVA KDASLNNQAQ TVNCTIERHV SIQTKAHLED
CSQTFTHGDV MDMFDDVSVT HDVLDGSKEA FETILYALLP DGTNKEIWKS GKIEHEVNDK
EFTKTVLAEK VDTGKYPEGT KFTFTEINYE KDGNVNGKHN EDLKEKSQTL TPKEVPTIPS
TPKQPETPAV PSNSQESSPT VKTFPQTGEK NSNVLLLVGF ILIFSTAGYY FWNRRN
EF094-3 (SEQ ID NO:359) CGA TGAAATTACT CACCCTCAAG AGGTAACAAT
TCATTATCAC GTAAGTAAAC TGTATGAAGT TGACGGAACT TTTAGCGATG GCAGCACGCT
CTCAGAACGT ACTACGTCAT TATATGCAGA ATACAATGGT GCAAAACAAA CAGTATTTTG
TATTGAACCA GGTGTTAGTA TTCCAACAGA AGTGACGCAC GGTTATCAGA AAAACCCTTT
GCCATCAATG TCTGATAAAG CGAAACTAGT
ATCGGTTCTT TGGGAAAAGG CTGGAACAGA TATTGATACA AATATCGTTG CACAAAAGAT
GATTTGGGAA GAAGTGAACG GTTATAAACT CCATTCCATA AAAAGATTAG GTGGTGCTTC
AGTTGATATA AAATCTATTG AAGGAAAAAT TAATAAGGCA ATTGAGGAGT ATCAAAAAAA
ACCAAGTTTT CATAATACCA CTGTAAAAAC AATTTTAGCT CAATCGACAA CTTTAATAGA
TAAAAATGAA TTAAATTTAT CTGAGTTTGA TAAAGTCGTC CAAAATACGG CGAATATAGA
TTACCGTGTA ATTGCGAATC AATTAGTGCT TACTCCAAAC TCTAATTCCA AATCAGGAAC
ATTAACATTG AAAAAATCAG CTGGTACTGG AACTCCAGTC GCTTATAAAA AAGCAGGACT
TCAAACTGTG ATGGCTGGTG CGCTTGATAA GCCCAATACC TACGCTATTA AAATTAATGT
GGAAACTAAG GGTTCTTTAA AGATCAAAAA AATCGATAAA GAATCAGGTG ATATTGTACC
AGAAACGGTT TTCCATTTAG ATTTTGGGAA AGCTTTACCT TCAAAAGATG TGACAACAGA
TAAAGATGGG ATTTCTATTT TGGATGGAAT TCCCCATGGT ACAAAGGTAA CTATTACTGA
AAAATCGCTG CCACATCCTT ATATGATTGA TACCACACCC ATCGCTGCCA CCATTAAAGC
GGGCGAGACC ATTTCCATGA CTTCGAAAAA TATGCGACAA AAACGTCAAA TTCTTTTAGA
GAAGACTGGG GTAGAAACAG GTACTGATCT TTGGAATGAC AATTATTCTC TAGCTGGAAA
TACATTTGCC ATTCGTAAAC ACAGCCCAGC TGGTGAAATT GTCCAAGAAA TAACAACGGA
TGAAAAAGGT CGTGCGGAAA CACCAAAAGA GCTTGCTAAT GCTTTGGAAC TGGGAACCTA
TTACGTGACA GAAACTAAAT CTAGTAATGG TTTCGTGAAT ACCTTCAAAC CAACAAAAGT
CGAGTTAAAA TATGCCAATC AAACCGTGGC TCTTGTTACC AGTAACGTAA AAGGGCAAAA
CCAAGAAATT ACTGGGGAAA CCACTTTGAC AAAAGAAGAC AAAGATACCG GTAATGAGAG
TCAAGGGAAA GCTCAGTTTA AAGGAGCTGA ATATACTCTC TTTACTGCAA AAGATGGTCA
AGCTGTTAAA TGGAGTGAAG CTTTTAAAAC ACAATTAGTG AAGGGAACGA AAGCTTCTGA
TGAAACAG EF094-4 (SEQ ID NO:360) DEITH PQEVTTHYDV SKLYEVDGTF
SDGSTLSERT TSLYAEYNGA KQTVFCTEPG VSIPTEVTHC YQKNPLPSMS DKAKLVSVLW
EKAGTDIDTN MVAQKMIWEE VNGYKLHSIK RLCGASVDIK SIEGKINKAI EEYQKKPSFH
NTTVKTILGQ STTLIDKNEL NLSEEDKVVQ NTANIDYRVI GNQLVLTPNS NSKSGTLTLK
KSAGTGTPVA YKKAGLQTVM AGALDKPNTY AIKINVETKG SLKIKKIDKE SGDIVPETVF
HLDFGKALPS KDVTTDKDGI SILDGIPHGT KVTITEKSVP DPYMTDTTPM AATIKAGETI
SMTSKNMRQK GQILLEKTGV ETGTDLWNDN YSLAGNTFAT RKDSPAGEIV QEITTDEKGR
AETPKELANA LELGTYYVTE TKSSNGFVNT FKPTKVELKY ANQTVALVTS NVKGQNQEIT
GETTLTKEDK DTGNESQGKA EFKGAEYTLF TAKDGQAVKW SEAFKTELVK GTKASDET
EF095-1 (SEQ ID NO:361) TAAGAATTGT TGGATTGTTC TTTACAAAGA AGGGACAATA
TGAAGCGAAG TAAATGGAAA GAATTGATAG TAACGGGCAT CTGCCATATA TTAGTATTCC
CCATACTAAT ACAGACAACT GTTTTTGCAG AAACATTACC AAGTACAAAA CAAGTAACAG
AAGCAACCAA TCATTCATTA ACAGCAGAAA AAGCCGAAAG TGAACAACCA CAGACAAAGG
ATAAACTACA TGATGAAGAA ACACTGGCAT TGTCAAAAAG TGAGTTAATC CATAATGAGG
CTAATGTTAC AAGTCAAACG ATTAGAGAAA GAATTGAGAC GCCTAACCTA ACTTATCGTT
ATGGATTTAT TAATGAAGAG GGGCAGCCAG TAAACGCCAA TGAGATCCTT CTACAGTATC
ATAGTTGGCA ACGCAATTCC CCAGATGGCA TAAATGTGTG GGAAGGTGAA AGTCAACCAG
TGACAGCATC TACAGTGGCT AATTTAAAAG AAGTGGTAAT TCCAAGTGAG AAAGTAGCCG
TCTATTCCGA CATGTCAACG GTGCTTGCAG CGAGTAATCA AACATTTTTT TTACCAAGAT
ATTATACTTC TTTAAGCTTA TACAATAAGA AAGGGGAAAT TGATCCCAAT TATCCGCTGC
CAACTATTTC CGACGCATCA GGAAACCAAT ATCCAACAAC AATTTCGCAA TTTGAATTGG
AAAAAATGTC TGCACAACAA TATAGTCAGA AAACAGGAGT AACGTTTAAC ATTAGOGAGA
GTCAAAAACT AATCGTTCCT TTGTACAACC AAGTGAAGGT TGATTCATCG AATCAATCTC
GGCTATTGAA TTACTTTAAA TTTTCAGGGC CGGTTTATTA TCATGTTACC AATCGCAAAG
TGACAGAACA TTTTGTGGAT ACTCAAGGGA AACCAATCCC TCCACCACCG GGGTTTAGAC
AAGGAAAGCA AACACTTATT GAGCGTGACC CTTACACCTT TAAACAGAAA GATCTTTTGC
CAAGTAGCTA TGAAATTGAC TCAAAAACGT ATCAATTTCA AGGATGGTAT AAAGGGAAAA
CGAAACCTGA AAATTTAGAA AAAAGCGTAA CGCCCAGTTA TGATATTACC TATGACGACA
ATGATGATTT AACTGTTGTC TATAAGGAGA TACCTCAAAA AAATTATACA TTTGAGGATG
TCAATGGTGT TGAAATTGCA CCACCATCTG ATTTTATTCA GGATCACCAA CAACCAATAA
CTACGGATGG CTTTCGCTAT TTAGCTGGAA AAAAACTGCC ACAACAATAC AGCGTTAACG
GTAAAACTTA TTTATATCAA GGTTGGTATC AAGATAAAAC NAAACAAGAG AGCTTAGAAA
AAACGAAGCG ACCCATAAAC TCCCCTGTTT TTAATGAAAT GAACGCTATT ACAGCAGTGT
ATAAGGAAAT AACTGCAAAA GCTGAAATGC AAATAGAAGG ACTAGTCAAA GTCATGCCAA
GTGGTTATAT ACAAATTTGG CAGATTATGC TTACAAATGT GGGAGAAGTA CCGTTAAAAA
AAATAAACTT AAAGCCAGCA AGTGGTTGGT CACCACGTCT AGCTCGGCCA ATCCAAGTCA
CGATTCGTGT TGGATCTGAA CCAAACAAAA TTGTTCCTAT TACTGATGAA AATTGGCGAG
TTGGCATTAC TTTAAATACG GAAGTGCCTA TTGGTCAGAC AGCAACTATT ATGATGACAA
CAATTGCTAC AGGTGAACCA GATCAAGTGT TACAAGCGGC TGTTGAAATG AATGGAAATT
TTTCTGCTGT TCACGCAGCT GATACTGTCA GAATCCAACC TAAAAATCAA GAAATTGTGG
CACCAGATGA GGAAGGTTTT ATCAGCACAC CAACTTTTGA TTTTGGCAAA GTCGCCATTT
CTAGCAACAC GCAGCAACAT GGTTTAAAGC AGGOAGOAGA TTATTATGAA AATGGTCAGG
AAAATCCATA TTTACGTTTG AAAAAATCAC AACCCAATTG GGCACTAACT GCAGAACTAT
CCCCCTTTGA AGGAAGAGTG GATCAACTAT CATCAATGAC AAAGTTATTG TTAGGAACAA
CCAATGTTTC AGGTTTTATT CAGTACAATC AACCAACGGA AACTAAAGTT GCTCTTGGCA
AAACAACCGC TATTCAATTA GTTGCCAACG GTGTAGCTAG CCATATTGTT GCCAATGGTC
AGTTTGACGA AAGTGATGTT TATCAATTTG ATTTTTCTTT TGATCAAATC AAATTAGAAA
TTCCAGCAAA TCAAGGTAGA AAAGATCAAA CTTATCAAGC AATGGTGACT TGGAATTTAG
TGACAGGCCC ATAA EF095-2 (SEQ ID NO:362) MKRSKWKE LIVTGICHIL
VFPILIQTTV FAETLPSTKQ VREGTNHSLT AEKAESEQPQ TKDKLHDEET LALSKSELID
NEANVTSQTI RERIETPNLT YRYGFINEEG QPVNANEILL QYHSWQGNSP DGINVWEGES
QPVTASTVAN LKEVVIPSEK VAVYSDMSTV LAASNQTEFL PRYYTSLSLY NKKCETDPNY
PLPTTSDASG NQYPTTISQF ELEKMSAQQY SQKTGVTFNI SESQKLIVPL YNQVKVDSSN
QSGLLNYFKF SGPVYYHVTN RKVTEHFVDT QGKPIPPPPG FRQGKQTLIE RDPYTFKQKD
LLPSSYEIDS KTYQFQGWYK GKTKPENLEK SVTPSYDITY DDNDDLTVVY KEIPQKNYTF
EDVNGVEIAP PSDFIQDHQQ PITTDGFRYL AGKKLPQQYS VNGKTYLYQG WYQDKTKQES
LEKTKRPTNS PVFNEMNAIT AVYKEITAKA EMQIEGLVKV MPSGYIQIWQ IMLTNVGEVP
LKKINLKPAS CWSPGLARPI QVTIRVGSEP NKIVPITDEN WRVGITLNTE VPEGQTATIM
MTTIATGEPD QVLQAAVEMN GNFSAVHAAD TVRTQPKNQE IVAPDEEGFI STPTFDFGKV
AISSNTQQHG LKQAADYYEN GQENPYLRLK KSQPNWALTA ELSPFEGRVD QLSSMTKLLL
GTTNVSGFIQ YNQPTETKVA LGKTTAIQLV ANGVASHTVA NGQFDESDVY QFDFSFDQIK
LEIPANQORK DQTYQAMVTW NLVTGP EF095-3 (SEQ ID NO:363) AAGTACAAAA
CAAGTAAGAG AAGGAACCAA TCATTCATTA ACAGCAGAAA AAGCCGAAAG TGAACAACCA
CAGACAAAGG ATAAACTACA TGATGAAGAA ACACTGGCAT TGTCAAAAAG TGAGTTAATC
GATAATGACG CTAATCTTAC AAGTCAAACG ATTAGAGAAA GAATTGAGAC GCCTAACCTA
ACTTATCGTT ATCGATTTAT TAATGAAGAG GGGCAGCCAG TAAACGCCAA TGAGATCCTT
CTACAGTATC ATAGTTGGCA AGGCAATTCC CCAGATGGCA TAAATGTGTG GGAAGGTGAA
AGTCAACCAG TGACAGCATC TACAGTGGCT AATTTAAAAG AAGTGGTAAT TCCAAGTGAG
AAAGTAGCCG TCTATTCCGA CATGTCAACG GTGCTTGCAG CGAGTAATCA AACATTTTTT
TTACCAACAT ATTATACTTC TTTAAGCTTA TACAATAAGA AAGGGGAAAT TGATCCCAAT
TATCCGCTGC CAACTATTTC CGACGCATCA GGAAACCAAT ATCCAACAAC AATTTCGCAA
TTTCAATTCG AAAAAATGTC TGCACAACAA TATAGTCAGA AAACAGGAGT AACGTTTAAC
ATTACCGAGA GTCAAAAACT AATCGTTCCT TTCTACAACC AAGTCAAGGT TGATTCATCG
AATCAATCTG GGCTATTGAA TTACTTTAAA TTTTCAGCCC CGGTTTATTA TCATGTTACC
AATCGCAAAG TGACACAACA TTTTGTGGAT ACTCAAGGGA AACCAATCCC TCCACCACCG
GCGTTTAGAC AAGGAAAGCA AACACTTATT GAGCGTGACC CTTACACCTT TAAACAGAAA
GATCTTTTGC CAAGTAGCTA TGAAATTGAC TCAAAAACGT ATCAATTTCA AGGATGGTAT
AXAGGGAAAA CGAAACCTGA AAATTTAGAA AAAAGCGTAA CGCCCAGTTA TGATATTACC
TATGACGACA ATGATGATTT AACTGTTGTC TATAAGCAGA TACCTCAAAA AAATTATACA
TTTGAGGATG TCAATGGTGT TGAAATTGCA CCACCATCTG ATTTTATTCA GGATCACCAA
CAACCAATAA CTACGGATCG CTTTCGCTAT TTAGCTGGAA AAAAACTGCC ACAACAATAC
AGCGTTAACG GTAAAACTTA TTTATATCAA GCTTGGTATC AAGATAAAAC NAAACAAGAG
AGCTTAGAAA AAACGAAGCG ACCCATAAAC TCCCCTGTTT TTAATGAAAT GAACGCTATT
ACAGCAGTCT ATAAGGAAAT AACTGCAAAA GCTGAAATGC AAATAGAAGG ACTAGTCAAA
GTCATGCCAA GTGGTTATAT ACAAATTTGG CAGATTATGC TTACAAATGT GGCAGAAGTA
CCGTTAAAAA AAATAAACTT AAACCCAGCA AGTCGTTGGT CACCACOTOT ACCTCGGCCA
ATCCAACTCA CCATTCGTGT TGGATCTCAA CCAAACAAAA TTGTTCCTAT TACTGATCAA
AATTGGCGAG TTGGCATTAC TTTAAATACG GAAGTGCCTA TTGGTCAGAC ACCAACTATT
ATGATGACAA CAATTGCTAC AGCTGAACCA GATCAACTGT TACAAGCGGC TGTTGAAATG
AATCGAAATT TTTCTGCTGT TCACGCAGCT GATACTGTCA GAATCCAACC TAAAAATCAA
GAAATTGTGG CACCAGATLA GGAAGGTTTT ATCAGCACAC CAACTTTTGA TTTTGGCAAA
GTCGCCATTT CTACCAACAC GCAGCAACAT CGTTTAAACC AGOCACCAGA TTATTATGAA
AATGGTCAGG AAAATCCATA TTTACGTTTG AAAAAATCAC AACCCAATTG GGCACTAACT
GCAGAACTAT CCCCCTTTGA AGGAAGAGTG CATCAACTAT CATCAATGAC AAAGTTATTG
TTAGGAACAA CCAATGTTTC AGGTTTTATT CAGTACAATC AACCAACCGA AACTAAAGTT
GCTCTTCGCA AAACAACCGC TATTCAATTA GTTGCCAACG GTGTAGCTAG CCATATTGTT
GCCAATGGTC AGTTTGACGA AAGTGATGTT TATCAATTTG ATTTTTCTTT TGATCAAATC
AAATTAGAAA TTCCAGCAAA TCAAGGTAGA AAAGATCAAA CTTATCAAGC AATGGTGACT
TGGAATTTAG TGACAGGCCC A 5F095-4 (SEQ ID NO:364) STKQ VREGTNHSLT
AEKAESEQPQ TKDKLHDEET LALSKSELID NEANVTSQTI RERIETPNLT YRYGFINEEG
QPVNANEILL QYHSWQCNSP DGINVWEGES QPVTASTVAN LKEVVIPSEK VAVYSDMSTV
LAASNQTFFL PRYYTSLSLY NKKGETDPNY PLPTISDASG NQYPTTISQF ELEKNSAQQY
SQKTCVTFNT SESQKLIVPL YNQVKVDSSN QSGLLNYFKE SGPVYYHVTN RKVTEHFVDT
QGKPIPPPPG FRQGKQTLIE RDPYTPKQKD LLPSSYEIDS KTYQFQGWYK GKTKPENLEK
SVTPSYDITY DDNDDLTVVY KEIPQKNYTF EDVNGVEIAP PSDETQDHQQ PITTDGFRYL
AGKKLPQQYS VNGKTYLYQG WYQDKTKQES LEKTKRPINS PVFNEMNATT AVYKETTAKA
EMQIEGLVKV MPSGYIQIWQ IMLTNVGEVP LKKINLKPAS GWSPGLARPI QVTIRVGSEP
NKIVPITDEN WRVCTTLNTE VPIGQTATIM MTTIATGEPD QVLQAAVEHN GNFSAVHAAD
TVRIQPKNQE IVAPDEEGFI STPTFDFGKV AISSNTQQHG LKQAADYYEN GQENPYLRLK
KSQPNWALTA ELSPFEGRVD QLSSMTKLLL GTTNVSGFIQ YNQPTETKVA LGKTTATQLV
ANGVASHIVA NGQFDESDVY QFDFSFDQIK LEIPANQGRK DQTYQAIYTVTW NLVTCP
EF096-1 (SEQ ID NO:365) TGAGGTGGCC AAGTTAAAAT GAAAAAATTA CAGTCACTTT
TTATTGGAAT TATCGCTATT ATTGTCATCT TGTTTTTTCG CGTGCGCCAA TTGGAGAAAG
CAAGTGGCAT GGCAGGAGCA GATACCTTGA CCATTTACAA TTGGGGGGAC TATATAGATC
CGGCCTTGAT TAAGAAATTT GAAAAAGAAA CAGCCTATAA AGTCAATTAC GAAACCTTTG
ATTCTAATGA AGCTATGTAT ACAAAAATTC AGCAAGGTGG CACAGCCTAT GATATTGCCA
TTCCTTCTGA ATATATGATT CAAAAAATGA TGAAAGCGAA GATGCTTTTA CCACTTCATC
ACAGCAAATT AAAAGGCTTA GAAAACATTG ATGCACGCTT TTTAGATCAA TCCTTTGATC
CCAAAAATAA GTTTTCCGTT CCGTACTTCT GGGGCACGTT GGGGATTATT TATAATGATA
AATTTATTGA CGGCCGTCAG ATCCAACATT GGCATGATTT ATCGCCCCCG GAATTAAAAA
ATAATGTCAT GCTGATTGAT GGCGCTCGCG AAGTGTTAGG ATTATCTTTG AACAGTTTAG
GCTATTCGTT AAACAGTAAA AACGACCAAC AATTACGTCA GGCTACCGAT AAGTTAAACC
GATTAACGAA CAATGTCAAA GCAATTGTTG CCGATGAAAT CAAAATGTAC ATGGCTAATG
AAGAAAGTGC AGTTGCTGTA ACTTTCTCTG GTGAAGCTGC TGAAATGCTA CAAAACAATG
AACATCTACA TTATGTGATT CCCAGTGAAG GCTCTAATCT CTGGTTTGAT AACATTGTGA
TGCCTAAGAC AGCCAAAAAT AAAGAGCGTG CCTATGCATT TATCAACTTT ATGTTACGAC
CAGAAAATGC GGCACAAAAT GCAGAATATA TTGGTTATTC CACACCAAAT AAAGAAGCTA
AAAAACTATT ACCAAAAGAA GTTGCCGAAG ATAAACAATT TTATCCAGAT GATGAAACTA
TCAAACATTT AGAAGTTTAC CAAGACTTAG GTCAAGAATA CTTACGAATT TATAACGATC
TGTTCTTGCA GTTTAAGATG TATCCGAAAT PA EE096-2 (SEQ ID NO:366) MKKLQ
SLEIGITAII VILFFGVRQL EKASGMAGAD TLTIYNWGDY IDPALIKKFE KETGYKVNYE
TFDSNEAMYT KIQQGGTAYD IAIPSEYMTQ KMMKAKMLLP LDHSKLKGLE NIDARFLDQS
FDPKNKESVP YFWGTLGIIY NDKFIDGRQI QHWDDLWRPE LKNNVMLIDG AREVLGLSLN
SLGYSLNSKN DQQLRQATDK LNRLTNNVKA IVADEIK4YM ANEESAVAVT FSGEAAEMLE
NNEHLHYVTP SEGSNLWFDN IVMPKTAKNK EGAYAFHNFM LRPENAAQNA EYIGYSTPNK
EAKKLLPKEV AEDKQFYPDD ETTKHLEVYQ DLGQEYLGIY NDLFLEFKMY RK EF096-3
(SEQ ID NO:367) AAGTGGCAT GCCAGGAGCA GATACCTTGA CCATTTACAA
TTGGGGGGAC TATATAGATC CGGCCTTGAT TAAGAAATTT GAAAAAGAAA CAGGCTATAA
AGTCAATTAC GAAACCTTTG ATTCTAATGA AGCTATGTAT ACAAAAATTC AGCAAGGTGG
CACAGCCTAT GATATTGCCA TTCCTTCTGA ATATATGATT CAAAAAATGA TGAAAGCGAA
GATGCTTTTA CCACTTGATC ACAGCAAATT AAAAGGCTTA GAAAACATTG ATGCACGCTT
TTTAGATCAA TCCTTTGATC CCAAAAATAA GTTTTCCGTT CCGTACTTCT GGGGCACGTT
GGGGATTATT TATAATGATA AATTTATTGA CGGCCGTCAG ATCCAACATT GGGATGATTT
ATGGCGCCCG GAATTAAAAA ATAATGTCAT GCTGATTGAT GGCGCTCGCG AAGTGTTAGG
ATTATCTTTG AACAGTTTAG GCTATTCGTT AAACAGTAAA AACGACCAAC AATTACGTCA
GGCTACCGAT AAGTTAAACC GATTAACGAA CAATGTCAAA GCAATTGTTG CCGATGAAAT
CAAAATGTAC ATGGCTAATG AAGAAAGTGC AGTTGCTGTA ACTTTCTCTG GTGAAGCTGC
TGAAATGCTA GAAAACAATG AACATCTACA TTATGTGATT CCCAGTGAAG GCTCTAATCT
CTGGTTTGAT AACATTGTGA TGCCTAAGAC AGCCAAAAAT AAAGAGGGTG CCTATGCATT
TATGAACTTT ATGTTACGAC CAGAAAATGC GGCACAAAAT GCAGAATATA TTGGTTATTC
CACACCAAAT AAAGAAGCTA AAAAACTATT ACCAAAAGAA GTTGCCGAAG ATAAACAATT
TTATCCAGAT GATGAAACTA TCAAACATTT AGAAGTTTAC CAAGACTTAG GTCAAGAATA
CTTAGGAATT TATAACGATC TGTTCTTGGA GTTTAAGATG TATCGGAAA EF096-4 (SEQ
ID NO:368) SGMAGAD TLTIYNWGDY IDPALIKKFE KETGYKVNYE TFDSNEAMYT
KIQQGGTAYD IAIPSEYMTQ KMMKAKNLLP LDHSKLKGLE NIDARFLDQS FDPKNKFSVP
YFWGTLGIIY NDKFIDGRQI QHWDDLWRPE LKNNVMLIDG AREVLGLSLN SLGYSLNSKN
DQQLRQATDK LNRLTNNVKA IVADEIKMYM ANEESAVAVT FSGEAAEMLE NNEHLHYVIP
SEGSNLWFDN IVMPKTAKNK EGAYAFMNFM LRPENAAQNA EYTGYSTPNK EAKKLLPKEV
AEDKQFYPDD ETIKHLEVYQ DLGQEYLGIY NDLFLEFKMY RK EF097-1 (SEQ ID
NO:369) TAGAAGTATT CTAATTATCT ACATAGAGAG CGAGGGACAA GGAATATGAA
GGAAAAAGAA ATGCATTCGC TCTTTTTTAA ACATAAGTTT GTGAAAGTAA CTCCCTATTT
ACGTCGTTTT GGTCATCGTT TGAGTGGGAT GATTATGCCA AATTTGAGTA TTTTTATTGC
GTGGAGCTTA TTGTCTTTGG TGGCTGGCTA TACGACTGGG AATCTACGGC TAGCTCTTTC
TGAAGTCGAA ACGATAATGA TTCGAGTTGT TTTACCGATT CTAATTGGTT TTACAGGCGG
AAAAATGTTC GAGGAACAAC GTGGCGGCGT TGTTGCTGCT ATTGCGACAG TGGGCGTGAT
TGTTTCCACA GATGTTCCAC AGTTGTTTGG TGCTATGTTT ATTGGCCCTT TAGCAGGATA
TACTTTCGCC AAAATTGAAC AAATTCTCTT ACCGAAAGTT AAAGAAGGCT ACGAGATGCT
GACTAAAAAC TTTTTAGCAG CAATTGTGGG AGGACTGCTG TGCTGTTTTG GTATTCTGGT
TGTAGCTCCG GCTGTTGAAA GCGCTAGTTT TTGGCTGTAT CAATTTTCTT CTTGGTTAAT
TGAAGCCAAT CTTTTACCAT TGGTTCACGT TTTCTTAGAG CCCTTAAAAG TGTTATTTTT
TAATAATGCG ATTAACCATG GCTTATTAAC GCCTCTAGGT TTAGAAGGTG CTAGTCAAAC
AGGTCAGTCC ATTTTATTTC TATTGGAAAC AAACCCTGGA CCAGGCGTGG GCGTTTTGGT
TGCTTTTCTG CTGTTTGGGC CTGTAGGACA ACGAAAAACA GCAGGAGGTG CCACCATGAT
TCAACTGATT GGGGGCATTC ATGAAATTTA TTTTCCGTTT GTTTTGATGG ACCCGCGCTT
ATTTTTAGCA GTAATTGCTG GAGGAATGAG TGGTACGCTT GTTTTTCAAA TATTTAATGT
GGGTCTAAGT GCTCCAGCTT CGCCAGGTTC ATTGGTTGCG ATTTTAGCCA ATGCCCCGAC
TGATGCGAGG CTGGCGGTTT TTAGCGGAAT TTTTGTTAGC TTTCTGTGCT CTTTTGCAAT
AGCAAGCTTG TTATTAAAAC GTCAACGAGG AATTGAACCA GTTTCAATGA TAAAGATGAA
GGAGGAAGAC CAAGTGGAAA CAGTCACACC TAACTATCAG CAAATTTTAT TTGTTTGTGA
TGCAGGAATG GGCTCAAGTG CCATGGGGGC TAGTTTGCTA AGCCGACAAT TAAAAGCTGT
GAACTTGGAG ATGCCTGTGA CTTACCAGTC CGTTCATCAG ATGAAGTGGC AGCCTAAGAC
ATTAGTGGTC ATTCAAGCAG AATTGAAACA GTTAGCACAA AAGTACGTCC CAGAAAAGGA
TATGGTGAGT GTTCAAAATT TTTTAGAkAT TAAATCCTAT TACCCGCAAG TTTTAGCCAA
ACTGACTGCT TCTTCTCAAG AGCAATCTTC ACTTGGTTCA GAGTCTACTG AAACGAACTC
GACAAAACAA ATACAGAAGC TTGTTTTTTT ATATGCCGAG AATGTTCGAG GATCGCAAAC
AATGGGAATG GAATTATTGC GGCAACAAGC GGCGAAACAA GGAGTCGCGA TTGAAGTATC
TAAAGAGCCA CTGGAAACAG TCTTTTTTAC CAAGGAGACA ACCTACGTAG TGACTCGTGA
ACTGGCGCAA GCCTATCATT TAGATCTAAC GCAACAAAAT TTATACGTAG TTACTAGTTT
TTTGAATAAG AAAGAGTATC AAGAATGGCT GGAAGGAGGA GCTGATAGAT GTTTTTAA
EF097-2 (SEQ ID NO:370) MLTKNF LACIVGGLLC CFGILVVAPA VESASFWLYQ
FSSWLIEANL LPLVHVFLEP LKVLFFNNAI NHGLLTPLGL EGASQTGQSI LFLLETNPGP
GVGVLVAFLL FGPVGQRKTA GGATMIQLIG GIHEIYFPFV LMDPRLFLAV IAGGMSGTLV
FQIFNVGLSA PASPGSLVAI LANAPTDARL AVFSGIFVSF LCSFAIASLL LKRQRGIEPV
SMIKMKEEDQ VETVTPNYQQ ILFVCDAGMG SSAMGASLLS RQLKAVNLEM PVTYQSVHQM
KWQPKTLVVI QAELKQLAQK YVPEKDMVSV QNFLEIKSYY PQVLAKLTAS SQEQSSLGSE
STETNSTKQI QKLVFLYAEN VRGSQTMGME LLRQQAAKQG VAIEVSKEPL
ETVFFTKETT YVVTRELAQA YHLDLTQQNL YVVTSFLNKK EYQEWLEGGA DRCF EF097-3
(SEQ ID NO:371) ACGAGG AATTGAACCA GTTTCAATGA TAAAGATGAA GGAGGAAGAC
CAAGTGGAAA CAGTCACACC TAACTATCAG CAAATTTTAT TTGTTTGTGA TGCAGGAATG
GGCTCAAGTG CCATGGGGGC TAGTTTGCTA AGCCGACAAT TAAAAGCTGT GAACTTGGAG
ATGCCTGTGA CTTACCAGTC CGTTCATCAG ATGAAGTGGC AGCCTAAGAC ATTAGTGGTC
ATTCAAGCAG AATTGAAACA GTTAGCACAA AAGTACGTCC CAGAAAAGGA TATGGTGAGT
GTTCAAAATT TTTTAGAAAT TAAATCCTAT TACCCGCAAG TTTTAGCCAA ACTGACTGCT
TCTTCTCAAG AGCAATCTTC ACTTGGTTCA GAGTCTACTG AAACGAACTC GACAAAACAA
ATACAGAAGC TTGTTTTTTT ATATGCCGAG AATGTTCGAC GATCGCAAAC AATGGGAATG
GAATTATTGC GGCAACAAGC GGCGAAACAA GGAGTCGCGA TTGAAGTATC TAAAGAGCCA
CTGGAAACAG TCTTTTTTAC CAAGGAGACA ACCTACGTAG TGACTCGTGA ACTGGCGCAA
GCCTATCATT TAGATCTAAC GCAACAAAAT TTATACGTAG TTACTAGTTT TTTGAATAAG
AAAGAGTATC AAGAATGGCT GGAAGGAGGA GCTGATAGAT GTTTTT EF097-4 (SEQ ID
NO:372) RGIEPV SMIKNKEEDQ VETVTPNYQQ ILFVCDAGMG SSAMGASLLS
RQLKAVNLEM PVTYQSVHQM KWQPKTLVVI QAELKQLAQK YVPEKDMVSV QNFLEIKSYY
PQVLAKLTAS SQEQSSLGSE STETNSTKQI QKLVFLYAEN VRGSQTMGME LLRQQAAKQG
VAIEVSKEPL ETVFFTKETT YVVTRELAQA YHLDLTQQNL YVVTSFLNKK EYQEWLEGGA
DRCF EF098-1 (SEQ ID NO:373) TAAATGAAAA AGACAAAAGT AATGACATTG
ATGGCAACCA CAACTTTAGG CGCACTGGCA CTTGTACCAA TGAGTGCATT AGCAGTCGAC
GGTGGTGAAT ACCAAACAAA CGGAGCGATT CAATTTGCAC CAAATACGAA CCCAACGAAT
CCAGTTGATC CGACGAATCC AGACCCAGAT AAACCAATTA CACCAGTTGA TCCAACTGAT
CCGACAGGGC CTAAGCCAGG GACAGCAGGT CCGTTATCCA TTGACTATGC ATCTAGCTTA
TCTTTTGGGG AACAAACGAT TACCTCAAAA AATATGACCT ACTATGCAGA AACACAAAAA
TACAAAGATA ACGCTGGTGC CGACCAAGAA GGCCCAAACT TTGTTCAAGT CTCAGATAAT
CGTGGGACTG AGACAGGTTG GACGCTAAAA GTAAAACAAA ATCGTCAATT CAAAACTGAA
GCCAACCAAG AACTAACAGC GGCCAAACTA ACTTTAAGCA ACGGACGCGT GGTTTCAGCT
TCACAATCTG CAAAGCCAAC GACAGCGCCA GCTACGATTG AATTAAACCC AACTGGGGCT
GAATCAGTGG TCATGGCTGC TGGCGATAAA GAAGGTGCGG GTACGTACTT AATGAGCTGG
GGCGATAGTG TAGATACCGC TAAAACAAGT ATTTCATTAG AAGTACCTGG TTCAACCACA
AAATATGCGA AAAAATACAC GACAACTTTT ACTTGGACTT TGACAGATAC ACCTGCTAAC
ACAGGAAACT AA EF098-2 (SEQ ID NO:374) MKKTKVMTLM ATTTLGALAL
VPMSALAVDG GEYQTNGAIQ FAPNTNPTNP VDPTNPDPDK PITPVDPTDP TGPKPGTAGP
LSIDYASSLS FGEQTITSKN MTYYAETQKY KDNAGADQEG PNFVQVSDNR GTETGWTLKV
KQNGQFKTEA NQELTAAKVT LSNGRVVSAS QSAKPTTAPA TIELNPTGAE SVVMAAGDKE
GAGTYLMSWG DSVDTAKTSI SLEVPGSTTK YAKKYTTTFT WTLTDTPANT GN EF098-3
(SEQ ID NO:375) AGTCGAC GCTGGTGAAT ACCAAACAAA CGGAGCGATT CAATTTGCAC
CAAATACGAA CCCAACGAAT CCAGTTGATC CGACGAATCC AGACCCAGAT AAACCAATTA
CACCAGTTGA TCCAACTGAT CCGACAGGGC CTAAGCCAGG GACAGCAGGT CCGTTATCCA
TTGACTATGC ATCTAGCTTA TCTTTTGGGG AACAAACGAT TACCTCAAAA AATATGACCT
ACTATOCAGA AACACAAAAA TACAAAGATA ACGCTGGTGC CGACCAAGAA GGCCCAAACT
TTGTTCAAGT CTCAGATAAT CGTGGCACTG AGACAGGTTG GACGCTAAAA GTAAAACAAA
ATGGTCAATT CAAAACTGAA GCCAACCAAG AACTAACACC GGCCAAAGTA ACTTTAAGCA
ACGGACGCGT GGTTTCAGCT TCACAATCTG CAAAGCCAAC GACAGCGCCA GGTACCATTG
AATTAAACCC AACTGGGGCT GAATCAGTGG TCATGGCTGC TGGCGATAAA GAAGGTGCGG
GTACGTACTT AATGAGCTGG GGCGATAGTG TAGATACCGC TAAAACAAGT ATTTCATTAG
AAGTACCTGG TTCAACCACA AAATATGCGA AAAAATACAC GACAACTTTT ACTTGGACTT
TGACAGATAC ACCTGCTAAC ACAGGAAACT EF098-4 (SEQ ID NO:376) VDG
GEYQTNGAIQ FAPNTNPTNP VDPTNPDPDK PITPVDPTDP TGPKPGTAGP LSIDYASSLS
FGEQTITSKN MTYYAETQKY KDNAGADQEG PNFVQVSDNR GTETGWTLKV KQNGQFKTEA
NQELTAAKVT LSNGRVVSAS QSAKPTTAPA TIELNPTGAE SVVMAAGDKE GAGTYLMSWG
DSVDTAKTSI SLEVPGSTTK YAKKYTTTFT WTLTDTPANT GN EF099-1 (SEQ ID
NO:377) TGATGTTGTA GAGGGCTGAT GAAATGTTTA TCAGTCTTCT TTTTATTGAA
AGGAGAGATC ATGAAGAAAT TAGGCAAGGT TTTAATTGTT AGTTGTTTTA TTTTTATTCT
TCCTTTTTTA TTATTTTTAG GTGTATTTTC TTCTAGTGAA AGCGGAGATT CTTCCCAGTT
TCAGCCCGCT ACACCACAGG AAAAAGTAGC ATTAGAAGTT TCTAACTACG TGACGTCACA
TGGCGGAACG TTGCAGTTTG CTTCCGCTTC GATTGGCAAT ATGGAACATG AAAGTGGATT
AAATCCTGCT AGAATTCAAA GTGATTTATC GTTTAATTCA GCGATAGCTT TTAATCCTTC
GTTAGGCGGT TATGGAATTG GGTTAGGACA ATGGGATTCA GGACGAAGAG TTAATTTATT
AAATTTTGCA AAAAGTCAAA AAAAGGAATG GAAATCAGTA GCTTTACAAA TGGATTTTGC
GTGGAATAAG GATGGTTCTG ATAGTGACTT ACTTAAAAGA ATGTCTAAAT CAAAAGATGT
GAATACACTT GCGGTAGATA TTTTGAAGCT GTGGGAACGA GCTGGAACAA AAGATGATCC
CGCAGAACAA GTAAAAAGAA AGGCTAGTGC TAATAATTGG TATAAACGAC TTTCTACAGG
TTCCATGGGC GGAGGTTCAG CCAATGTTGG TCGAGGAAAA ATTGATGCCT TGGAAAAAGT
GATGGGGCAA ACTATTAATG GTGGTCAATG TTATGGCTTA TCTGCTTTTT TTGTTGAAAA
ACAAGGAGGT CTACAAATGA TGGGTACGGG GCATATGTTT GCGAGTGAAA TTGGTAATGA
TTATCCTTGG AGTTCAATTG GTTGGACAGT CATAAAGAAT CCAAATTATT CAGATATTAA
AGGAGGAGAT GTCATTAATT TTGGTCAAGG TGGTGTGGCT ACTAGTATTT ATGGGCATAC
TGGTGTAGTG GCAAGTGTTG AAGGTAAAAA CAAGTTTACT ACTTATGAGC AAAACGCTGA
ACAAGGTCAA ATTGTTGCTA AGTATTTTCG GACTTGGGGA TTAGATTTTC CACATGTGAC
CAGCATAGTA AGGAAATAG EF099-2 (SEQ ID NO:378) MKCLS VFFLLKGEIM
KKLGKVLIVS CFIFILPFLL FLGVFSSSES GDSSQFQPAT PQEKVALEVS NYVTSHGGTL
QFASAWIGNN EHESGLNPAR IQSDLSFNSA IAFNPSLGGY GIGLGQWDSG RRVNLLNFAK
SQKKEWKSVA LQMDFAWNKD GSDSDLLKRM SKSKDVNTLA VDILKLWERA GTKDDPAEQV
KRKASANMWY KRLSTGSMGG GSANVGGGKI DALEKVMGQT INGGQCYGLS AFFVEKQGGL
QMMGTGHMFA SEIGNDYPWS SIGWTVTKNP NYSDIKAGDV INFGQGGVAT SIYGHTGVVA
SVEGKNKFTT YEQNAEQGQI VAKYFRTWGL DFPHVTSIVR K EF099-3 (SEQ ID
NO:379) TAGTGAA AGCGGAGATT CTTCCCAGTT TCAGCCCGCT ACACCACAGG
AAAAAGTAGC ATTAGAAGTT TCTAACTACG TGACGTCACA TGGCCGAACG TTGCAGTTTG
CTTCCGCTTG GATTGGCAAT ATGGAACATG AAAGTGGATT AAATCCTGCT AGAATTCAAA
GTGATTTATC GTTTAATTCA GCGATAGCTT TTAATCCTTC GTTAGGCGGT TATGGAATTG
GGTTAGGACA ATGGGATTCA GGACGAAGAG TTAATTTATT AAATTTTGCA AAAAGTCAAA
AAAAGGAATG GAAATCAGTA GCTTTACAAA TGGATTTTGC GTGGAATAAG GATGGTTCTG
ATAGTGACTT ACTTAAAAGA ATGTCTAAAT CAAAAGATGT GAATACACTT GCGGTAGATA
TTTTGAAGCT GTGGGAACGA GCTGGAACAA AAGATGATCC CGCAGAACAA GTAAAAAGAA
AGGCTAGTGC TAATAATTGG TATAAACGAC TTTCTACAGG TTCCATGGGC GGAGGTTCAG
CCAATGTTGG TGGAGGAAAA ATTGATGCCT TGGAAAAAGT GATGGGGCAA ACTATTAATG
GTGGTCAATG TTATGGCTTA TCTGCTTTTT TTGTTGAAAA ACAAGGAGGT CTACAAATGA
TGGGTACGGG GCATATGTTT GCGAGTGAAA TTGGTAATGA TTATCCTTGG AGTTCAATTG
GTTGGACAGT CATAAAGAAT CCAAATTATT CAGATATTAA AGCAGGAGAT GTCATTAATT
TTGGTCAAGG TGGTGTGGCT ACTAGTATTT ATGGGCATAC TGGTGTAGTG GCAAGTGTTG
AAGGTAAAAA CAAGTTTACT ACTTATGAGC AAAACGCTGA ACAAGGTCAA ATTGTTGCTA
AGTATTTTCG GACTTGGGGA TTAGATTTTC CACATGTGAC CAGCATAGTA AGGAAAT
EF099-4 (SEQ ID NO:380) SES GDSSQFQPAT PQEKVALEVS NYVTSHGGTL
QFASAWIGNM EHESGLNPAR IQSDLSFNSA IAFNPSLGGY GIGLGQWDSG RRVNLLNFAK
SQKKEWKSVA LQMDFAWNKD GSDSDLLKRM SKSKDVNTLA VDILKLWERA GTKDDPAEQV
KRKASANNWY KRLSTGSMGG GSANVGGGKI DALEKVMGQT INGGQCYGLS AFFVEKQGGL
QMMGTGHMFA SETGNDYPWS SIGWTVIKNP NYSDIKAGDV INFGQGGVAT SIYGHTGVVA
SVEGKNKFTT YEQNAEQGQI VAKYFRTWGL DFPHVTSIVR K EF100-1 (SEQ ID
NO:381) TANTTATGGC AATATGGAAG GAGTTTTATA ATGAAAAAGA AACAAAAATA
CGCAGGGTTT ACATTATTAG AAATGTTGAT TGTCTTATTG ATTATTTCCG TATTGATTTT
ACTTTTTGTC CCTAACTTAG CGAAACATAA AGAAACAGTT GATAAAAAAG GCAATGAAGC
AATCGTAAAA ATTGTAGAAT CACAAATCGA GCTCTACACA CTAGAAAAAA ATAAGACGCC
TTCCTTAAAT GAATTAGTCA ACGAAGGCTA CATTACTAAA GAGCAGTTAG ATAAATATAC
AGCAGAAAAG CAATGA EF100-2 (SEQ ID NO:382) MKKKQKYAGF TLLEMLIVLL
IISVLILLFV PNLAKHKETV DKKGNEAIVK IVESQIELYT LEKNKTPSLN ELVNEGYITK
EQLDKYTAEK Q EF100-3 (SEQ ID NO:383) TAA AGAAACAGTT GATAAAAAAG
GCAATGAAGC AATCGTAAAA ATTGTAGAAT CACAAATCGA GCTCTACACA CTAGAAAAAA
ATAAGACGCC TTCCTTAAAT GAATTAGTCA ACGAAGGCTA CATTACTAAA GAGCAGTTAG
ATAAATATAC AGCAGAAAAG CAAT EF100-4 (SEQ ID NO:384) KETV DKKGNEAIVK
IVESQIELYT LEKNKTPSLN ELVNEGYITK EQLDKYTAEK Q EF100-1 (SEQ ID
NO:385) TANTTATGGC AATATGGAAG GAGTTTTATA ATGAAAAAGA AACAAAAATA
CGCACGGTTT ACATTATTAG AAATGTTGAT TGTCTTATTG ATTATTTCCG TATTGATTTT
ACTTTTTGTC CCTAACTTAG CGAAACATAA AGAAACAGTT GATAAAAAAG GCAATGAAGC
AATCGTAAAA ATTGTAGAAT CACAAATCGA GCTCTACACA CTAGAAAAAA ATAAGACGCC
TTCCTTAAAT GAATTAGTCA ACGAAGGCTA CATTACTAAA GAGCAGTTAG ATAAATATAC
AGCAGAAAAG CAATGA EF100-2 (SEQ ID NO:386) MKKKQKYAGF TLLEMLIVLL
IISVLILLFV PNLAKHKETV DKKGNEAIVK IVESQIELYT LEKNKTPSLN ELVNEGYITK
EQLDKYTAEK Q EF100-3 (SEQ ID NO:387) TAA AGAAACAGTT GATAAAAAAG
GCAATGAAGC AATCGTAAAA ATTGTAGAAT CACAAATCGA GCTCTACACA CTAGAAAAAA
ATAAGACGCC TTCCTTAAAT GAATTAGTCA ACGAAGGCTA CATTACTAAA GAGCAGTTAG
ATAAATATAC AGCAGAAAAG CAAT EF100-4 (SEQ ID NO:388) KETV DKKGNEAIVK
IVESQIELYT LEKNKTPSLN ELVNEGYITK EQLDKYTAEK Q EF101-1 (SEQ ID
NO:389) TGAGGAGATG AAACGAAGAA AATGAAGAAG AAAACGATAA TTATATTGGG
GGCAGTTGCG GTAATTGCGG TTGGGGGCAT CGTAACTGTG AATGCGTTAA ATAAAAATGC
ACAACAAGTA GCTGTCAAGC AAGCGCCTAA AGATGACTGG GGAATTGACT ATTTTGACGT
TCCCGACTTG CAACAAATTT ATATTAACGG TGTCATCCAA CCGGAACAAA TGGAAGCCTT
TGCGCGTGAT CAAAAAATAA CAAAGGATCC AGAGATTAAG GTGAAAAACG GCGATGTCGT
AGATGCAGGC ACAGAATTAT TTACTTATGA AGATGAGGCG GTCACAAAAG AAATTGAGGC
ACAACAAAAT AGCTTAGCCA AATTAGAAAC GAAGCGGGCG AATATCTATA ATAAGTGGAA
TCGGGCCATT GATAAATTTA ATAAAACTAA AGAAGAAGAC CGCACGATGT CTGGTGATGA
TTTAAATGAA CAATATCAAA CAGAAGTCGA TGCAGTAGAT GAAGAGATTA CCTTCACCAA
TGAAACCTTA GCGGATTTAG GAGCGAAGCA ATATATTTCC ACAAAGGCTA ATTTCAAAGG
TCGTGTATCA ATTCCAGAAG TAAAAGATGC CAATTCACCG ATTTTACGGT TAACTTCAGA
AGATCTTTAT TTAGCTGGAA AAGTGAATGA AAAGGACTTG ACTAAAATTA GTGTTGGGCA
AAAAGCTAAA CTAACTTCTG TTTCCAACAA TGTGGTTGTG GATGGCTCAA TTTCTTACAT
CGATGATAAT CCTCCTGAAG GCAACAGCGA TGCCGCGAGT CGCAATCCAG AGGGCGGCAC
AACGATGTCT AGTTATAGCG TCAAAATTGC GTTGGCCAAT TTAGACAAAG TCAAAAATGG
CTACCATATG CAAGCAACCA TTGATTTAGG CGATTTAGGG GCGATTGAGT TACCGAAAAA
AGCGATTCAA AAAGAGGGTG AACAGGCCTA CCTTTTAGTG AATGATTTTG GAACCATCAT
TCGTCGTGAT GTCCAAGTCG GGCAAGAAAA TGGCGACAAA ATGGCGATTG AATCTGGCTT
AGAATCAGCC GACCGAGTGG TTATTTCTTC AAAAAAACCA GTAAAAGTCG GTGATATTGT
TGAATCAGAT GCAGCGATTG CTTCTGATGA ATCAGCAACC AACGAATCAA TGACAGATGC
GTCGAAATAG EF101-2 (SEQ ID NO:390) MKKK TITILGAVAV IAVGGIVTVN
ALNKNAQQVA VKQAPKDDWG IDYFDVPDLQ QIYINGVIQP EQMEAFARDQ KITKDPEIKV
KNGDVVDAGT ELFTYEDEAV TKEIEAQQNS LAKLETKRAN IYNKWNRAID KFNKTKEEDR
TMSGDDLNEQ YQTEVDAVDE ETTETNETLA DLGAKQYIST KANFKGRVSI PEVKDANSPI
LRLTSEDLYL AGKVNEKDLT KISVGQKAKL TSVSNNVVVD GSTSYIDDNP PEGNSDAASG
NPEGGTTMSS YSVKIALANL DKVKNGYHMQ ATIDLCDLGA IELPKKAIQK EGEQAYVLVN
DFGTIIRRDV QVGQENGDKM AIESGLESAD RVVISSKKPV KVGDIVESDA AIASDESATN
ESMTDASK EF101-3 (SEQ ID NO:391) TAAAAATGC ACAACAAGTA GCTGTCAAGC
AAGCGCCTAA AGATGACTGG GGAATTGACT ATTTTGACGT TCCCGACTTG CAACAAATTT
ATATTAACGG TGTCATCCAA CCGGAACAAA TGGAAGCCTT TGCGCGTCAT CAAAAAATAA
CAAAGGATCC AGAGATTAAG GTGAAAAACG GCGATGTCGT AGATGCAGGC ACAGAATTAT
TTACTTATGA AGATGAGGCG GTCACAAAAG AAATTGAGGC ACAACAAAAT AGCTTAGCCA
AATTAGAAAC GAAGCGGGCG AATATCTATA ATAAGTGCAA TCGGGCCATT GATAAATTTA
ATAAAACTAA AGAAGAAGAC CGCACGATGT CTGGTGATGA TTTAAATGAA CAATATCAAA
CAGAAGTCGA TGCAGTAGAT GAAGAGATTA CCTTCACCAA TGAAACCTTA GCGGATTTAG
GAGCGAAGCA ATATATTTCC ACAAAGGCTA ATTTCAAAGG TCGTGTATCA ATTCCAGAAG
TAAAAGATGC CAATTCACCG ATTTTACGGT TAACTTCAGA AGATCTTTAT TTAGCTGGAA
AAGTGAATGA AAAGGACTTG ACTAAAATTA GTGTTGGGCA AAAAGCTAAA CTAACTTCTG
TTTCCAACAA TGTGGTTGTG GATGGCTCAA TTTCTTACAT CGATGATAAT CCTCCTGAAG
GCAACAGCGA TGCCGCGAGT GGCAATCCAG AGGGCGGCAC AACGATGTCT AGTTATAGCG
TCAAAATTGC GTTGGCCAAT TTAGACAAAG TCAAAAATGG CTACCATATG CAAGCAACCA
TTGATTTAGG CGATTTAGGG GCGATTGAGT TACCGAAAAA AGCGATTCAA AAAGAGGGTG
AACAGGCCTA CGTTTTAGTG AATGATTTTG GAACCATCAT TCGTCGTGAT GTCCAAGTCG
GGCAAGAAAA TGGCGACAAA ATGGCGATTG AATCTGGCTT AGAATCAGCC GACCCAGTGG
TTATTTCTTC AAAAAAACCA GTAAAAGTCG GTGATATTGT TGAATCAGAT GCAGCGATTG
CTTCTGATGA ATCAGCAACC AACGAATCAA TGACAGATGC GTCGAAAT EF101-4 (SEQ
ID NO:392) KNAQQVA VKQAPKDDWG IDYFDVPDLQ QIYINGVIQP EQMEAFARDQ
KITKDPEIKV KNGDVVDAGT ELFTYEDEAV TKEIEAQQNS LAKLETKRAN IYNKWNRAID
KFNKTKEEDR TMSGDDLNEQ YQTEVDAVDE EITFTNETLA DLGAKQYIST KANFKGRVSI
PEVKDANSPI LRLTSEDLYL AGKVNEKDLT KISVGQKAKL TSVSNNVVVD GSISYIDDNP
PEGNSDAASG NPEGGTTMSS YSVKIALANL DKVKNGYHMQ ATIDLGDLGA IELPKKAIQK
EGEQAYVLVN DFGTIIRRDV QVGQENGDKM AIESGLESAD RVVISSKKPV KVGDIVESDA
AIASDESATN ESMTDASK EF102-1 (SEQ ID NO:393) TAAACATTTG AGACATTCAG
AGGTGAATGT CTCTTTTTTA TTACTCAAAA ACGAAAGGGG ATTAATTATA TGAAAAAAAC
AACATTTAAA AATTGGTCGT TATTTGCGAC TTTGGCTCTA TTAAGTCAAA CAATTGGCGG
AACGATTGGT CCTACGATTG CTTTTGCCGA TGAAATTACT CACCCTCAAG AGCTAACAAT
TCATTATGAC GTAAGTAAAC TGTATGAAGT TGACGGAACT TTTAGCGATG GCAGCACGCT
CTCAGAACGT ACTACGTCAT TATATGCAGA ATACAATGGT GCAAAACAAA CAGTATTTTG
TATTGAACCA CGTGTTAGTA TTCCAACAGA AGTGACGCAC GGTTATCAGA AAAACCCTTT
GCCATCAATG TCTCATAAAG CGAAACTAGT ATCGGTTCTT TGGGAAAAGG CTGGAACAGA
TATTGATACA AATATGGTTG CACAAAAGAT GATTTGGGAA GAAGTGAACG GTTATAAACT
CCATTCCATA AAAAGATTAG GTGGTGCTTC AGTTGATATA AAATCTATTG AAGGAAAAAT
TAATAAGGCA ATTGAGGAGT ATCAAAAAAA ACCAAGTTTT CATAATACCA CTGTAAAAAC
AATTTTAGGT CAATCGACAA CTTTAATAGA TAAAAATGAA TTAAATTTAT CTGAGTTTGA
TAAAGTCGTC CAAAATACGG CGAATATAGA TTACCGTGTA ATTGGGAATC AATTAGTGCT
TACTCCAAAC TCTAATTCCA AATCAGGAAC ATTAACATTG AAAAAATCAG CTGOTACTGG
AACTCCAGTC GCTTATAAAA AAGCAGGACT TCAAACTGTG ATGGCTGGTG CGCTTGATAA
GCCCAATACC TACGCTATTA AAATTAATGT GGAAACTAAG GGTTCTTTAA AGATCAAAAA
AATCGATAAA GAATCAGCTG ATATTGTACC AGAPACGGTT TTCCATTTAG ATTTTGGGAA
AGCTTTACCT TCAAAAGATG TGACAACAGA TAAAGATGGG ATTTCTATTT TGGATCGAAT
TCCCCATGGT ACAAAGGTAA CTATTACTGA AAAATCGGTG CCAGATCCTT ATATGATTGA
TACCACACCC ATGGCTGCCA CCATTAAAGC GGGCGAGACC ATTTCCATGA CTTCGAAAAA
TATGCGACAA AAAGGTCAAA TTCTTTTAGA GAAGACTGGG GTAGAAACAG GTACTGATCT
TTGGAATGAC AATTATTCTC TAGCTGGAAA TACATTTGCC ATTCGTAAAG ACAGCCCAGC
TGGTCAAATT GTCCAAGAAA TAACAACGGA TGAAAAAGGT CGTGCGGAAA CACCAAAAGA
GCTTGCTAAT GCTTTGGAAC TGGGAACCTA TTACGTGACA GAAACTAAAT CTAGTAATGG
TTTCGTGAAT ACCTTCAAAC CAACAAAAGT CGAGTTAAAA TATGCCAATC AAACCGTGGC
TCTTGTTACC AGTAACGTAA AAGGGCAAAA CCAAGAAATT ACTGGGGAAA CCACTTTGAC
AAAAGAAGAC AAAGATACCG GTAATGAGAG TCAAGGGAAA GCTGAGTTTA AAGGACCTGA
ATATACTCTC TTTACTGCAA AAGATGGTCA AGCTGTTAAA TGGAGTGAAG CTTTTAAAAC
AGAATTAGTG AAGGGAACGA AAGCTTCTGA TGAAACAGTG ACTTTGGCTT TAGATGAAAA
GAACCAAGTT CCCGTTAAAC ACCTAGCAAT TAACGAGTAT TTCTGGCAAG AAACCAAAGC
ACCTGAAGGA TATACTTTGG ATGAAACGAA GTATCCTGTA TCCATCAAAA AAGTTGATAA
TAACGAAAAA AATGCCGTAA TTACTCGAGA TGTTACGGCA AAAGAACAAG TTATTCGCTT
TGGCTTTGAT TTCTTTAAAT TTGCTGGATC GGCTGATGGC ACTGCCGAAA CTGGATTTAA
CGACTTATCT TTTAAAGTGT CGCCATTGGA AGGGACCAAN GAAATCACAG GTGCTGAAGA
TAAAGCGACC ACAGCTTGTA ACGAGCAATT AGGTTTTGAT GGCTATGGTA AGTTTGAAAA
TCTTCCTTAT GGGGATTATT TACTTGAAGA AATAGAGGCT CCAGAAGGAT TTCAAAAGAT
TACACCACTA GAAATCCGTT CTACATTTAA GGAAAACAAA GACGACTATG CGAAGAGTGA
GTATGTCTTT ACCATTACCG AAGAAGCACA AAAACAACCA ATTAAGATGG TGACCGTTCC
TTACGAGAAA CTAACTAACA ACGAGTTTTC TGTTAGTCTG AACCGTTTGA TGCTTTATGA
TTTGCCCGAG AAAGAAGATA GTTTGACTTC TCTTGCGACT TGGAAAGACG GAAATAAAAA
ATTGAATACC CTTGATTTTA CCGAGCTAGT
TGATAAATTG AGATATAACT TGCATGAAAT CAAAGAAGAC TGGTATGTCG TAGCTCAAGC
CATTGATGTG GAAGCCACAA AAGCTGCCCA AGAAAAAGAC GAAAAAGCCA AACCGGTGGT
GATTGCCGAA ACAACCGCAA CGTTGGCGAA CAAAGAGAAA ACTGGAACTT CGAAAATTCT
GCATAAATTA ACCGCTGAAC AAGTTTTGGA TAAAAGCATC GTCTTGTTCA ATTATGTGTA
TGAAAACAAG GTAGCCTTTG AAGCAGGCAA TGAGCCAGTA GCGAAGGATG CTAGCTTGAA
CAATCAAGCA CAAACCGTCA ATTGTACGAT TGAACGCCAT GTTTCCATCC AAACAAAAGC
CCACCTAGAA GATGGTTCGC AAACTTTTAC TCATGGTGAC GTGATGGATA TGTTTGATGA
TGTGTCGGTT ACCCATGATG TACTGGATGG CTCAAAAGAA GCTTTCGAAA CAATTCTGTA
TGCTTTACTA CCAGATGGTA CGAACAAAGA AATTTGGAAA TCTGGCAAAA TTGAGCATGA
AGTGAATGAT AAAGAATTTA CCAAAACCGT ACTTGCGGAA AAAGTAGATA CCGGAAAGTA
TCCAGAAGGA ACTAAGTTTA CTTTTACGGA AATCAATTAC GAAAAAGATG GAAACGTGAA
TGGAAAACAC AATGAAGATT TGAAAGAAAA ATCTCAAACC TTAACACCAA AAGAAGTGCC
AACCATACCG AGTACGCCAA AACAACCGGA AACACCAGCT GTTCCAAGTA ATTCTCAAGA
ATCTAGTCCC ACAGTGAAGA CATTCCCGCA AACTGGGGAG AAAAATTCCA ACGTTCTACT
CTTAGTTGGC TTTATCTTGA TTTTTTCCAC TGCTGGGTAT TATTTCTGGA ATCGCCGCAA
TTAA EF102-2 (SEQ ID NO:394) MKKTTFKN WSLFATLALL SQTIGGTTGP
TIAFADEITH PQEVTIHYDV SKLYEVDGTF SDGSTLSSRT TSLYAEYNGA KQTVFCIEPG
VSIPTEVTHG YQKNPLPSMS DKAKLVSVLW EKAGTDIDTN MVAQKMIWEE VNGYKLHSIK
RLGGASVDTK SIEGKINKAI EEYQKKPSFH NTTVKTILGQ STTLIDKNEL NLSEFDKVVQ
NTANIDYRVI GNQLVLTPNS NSKSGTLTLK KSAGTGTPVA YKKAGLQTVM AGALDKPNTY
AIKTNVSTKG SLKTKKIDKE SGDIVPETVF HLDFGKALPS KDVTTDKDGT SILDGIPHGT
KVTITEKSVP DPYMIDTTPM AATTKAGETI SMTSKNNRQK CQILLEKTGV ETCTDLWNDN
YSLAGNTFAI RKDSPACETV QEITTDEKGR AETPKELANA LELGTYYVTE TKSSNGFVNT
FKPTKVELKY ANQTVALVTS NVKGQNQEIT GETTLTKEDK DTGNESQGKA EFKGAEYTLF
TAKDGQAVKW SEAFKTELVK GTKASDETVT LALDEKNQVA VKHLAINEYF WQETKAPEGY
TLDETKYPVS IKKVDNNEKN AVITRDVTAK EQVIRFGFDF FKFAGSADGT AETGFNDLSF
KVSPLEGTXE ITGAEDKATT ACNEQLGFDG YGKFENLPYG DYLLEEIEAP EGPQKITPLE
IRSTFKENKD DYAKSEYVFT ITEEGQKQPI KMVTVPYEKL TNNEFSVSLN RLMLYDLPEK
EDSLTSLATW KDGNKKLNTL DFTELVDKLR YNLHETKEDW YVVAQAIDVE ATKAAQEKDE
KAKPVVIAET TATLANKEKT GTWKILHKLT AEQVLDKSIV LFNYVYENKV AFEAGNEPVA
KDASLNNQAQ TVNCTIERHV STQTKAHLED GSQTFTHGDV MDMFDDVSVT HDVLDGSKEA
FETILYALLP DGTNKEIWKS GKIEHEVNDK EFTKTVLAEK VDTGKYPEGT KFTFTEINYE
KDGNVNGKHN EDLKEKSQTL TPKEVPTIPS TPKQPETPAV PSNSQESSPT VKTFPQTGEK
NSNVLLLVGF ILIFSTAGYY FWNRRN EF102-3 (SEQ ID NO:395) TT TAGATGAAAA
GAACCAAGTT GCCGTTAAAC ACCTAGCAAT TAACGAGTAT TTCTGGCAAG AAACCAAAGC
ACCTGAACGA TATACTTTGG ATGAAACGAA GTATCCTGTA TCCATCAAAA AAGTTGATAA
TAACGAAAAA AATGCCGTAA TTACTCGACA TCTTACGGCA AAAGAACAAG TTATTCGCTT
TGGCTTTGAT TTCTTTAAAT TTGCTGGATC GGCTGATGGC ACTGCCGAAA CTGGATTTAA
CGACTTATCT TTTAAAGTGT CGCCATTGGA AGGGACCAAN GAAATCACAG GTGCTGAAGA
TAAAGCGACC ACAGCTTGTA ACGAGCAATT AGGTTTTCAT GGCTATGGTA AGTTTGAAAA
TCTTCCTTAT GGGGATTATT TACTTGAACA AATAGAGGCT CCACAAGGAT TTCAAAAGAT
TACACCACTA GAAATCCGTT CTACATTTAA GGAAAACAAA GACGACTATG CGAAGACTGA
CTATGTCTTT ACCATTACCG AAGAAGGACA AAAACAACCA ATTAAGATCG TGACCGTTCC
TTACGACAAA CTAACTAACA ACGAGTTTTC TGTTACTCTG AACCGTTTGA TGCTTTATGA
TTTGCCCGAC AAAGAAGATA GTTTGACTTC TCTTGCGACT TGGAAAGACG GAAATAAAAA
ATTGAATACC CTTGATTTTA CCGAGCTAGT TCATAAATTC AGATATAACT TGCATGAAAT
CAAAGAAGAC TGGTATGTCC TAGCTCAAGC CATTGATGTG GAAGCCACAA AAGCTGCCCA
AGAAAAACAC GAAAAAGCCA AACCGGTGGT GATTCCCCAA ACAACCGCAA CCTTGGCGAA
CAAAGAGAAA ACTGGAACTT GGAAAATTCT GCATAAATTA ACCGCTGAAC AAGTTTTGGA
TAAAAGCATC GTCTTGTTCA ATTATGTGTA TGAAAACAAG CTAGCCTTTC AAGCAGGCAA
TGAGCCACTA GCGAACGATG CTAGCTTGAA CAATCAAGCA CAAACCGTCA ATTGTACGAT
TGAACGCCAT GTTTCCATCC AAACAAAAGC CCACCTACAA GATGGTTCCC AAACTTTTAC
TCATGGTGAC GTGATCGATA TGTTTGATGA TGTGTCCCTT ACCCATGATG TACTGGATGG
CTCAAAACAA GCTTTCGAAA CAATTCTGTA TGCTTTACTA CCAGATGGTA CGAACAAAGA
AATTTGGAAA TCTCGCAAAA TTGAGCATGA AGTGAATGAT AAAGAATTTA CCAAAACCGT
ACTTGCGGAA AAACTACATA CCGCAAAGTA TCCAGAAGGA ACTAAGTTTA CTTTTACGGA
AATCAATTAC GAAAAAGATG GAAACGTGAA TGGAAAACAC AATGAAGATT TGAAAGAAAA
ATCTCAAACC TTAACACCAA AAGAAGTGCC AACCATACCG AGTACCCCAA AACAACCGGA
AACACCAGCT GTTCCAAGTA ATTCTCAAGA ATCTAGTCCC ACAGTGAACA EF102-4 (SEQ
ID NO:396) LDEKNQVA VKHLAINEYF WQETKAPEGY TLDETKYPVS IKKVDNNEKN
AVITRDVTAK EQVIRFGFDF FKFACSADCT AETGFNDLSF KVSPLEGTXE ITGAEDKATT
ACNEQLGFDC YGKFENLPYG DYLLEETEAP ECFQKITPLE IRSTFKENKD DYAKSEYVFT
ITEEGQKQPT KMVTVPYEKL TNNEFSVSLN RLMLYDLPEK EDSLTSLATW KDGNKKLNTL
DFTELVDKLR YNLHEIKEDW YVVAQATDVE ATKAAQEKDE KAKPVVIAET TATLANKEKT
GTWKILHKLT AEQVLDKSIV LFNYVYENKV AFEAGNEPVA KDASLNNQAQ TVNCTIERHV
SIQTKAHLED GSQTFTHGDV MDMFDDVSVT HDVLDCSKEA FETILYALLP DGTNKEIWKS
GKIEHEVNDK EFTKTVLAEK VDTGKYPEGT KFTFTEINYE KDGNVNGKHN EDLKEKSQTL
TPKEVPTIPS TPKQPETPAV PSNSQESSPT VK EF103-1 (SEQ ID NO:397)
TAAGATAGGT TTATCAAAGA AAAGGAGCGA TGCTTTATGA AAAAGAAAGT ATTAAGTTCG
ATTACTTTAG TAACATTAAG TACGTTACTT ATAGCAGGTT ATGCAAGTCC AGCATTTGCA
GATCATGCAG CCAATCCAAA TAGTGCTACA GCAAATTTAG GCAAACATCA AAACAATGGC
CAAACAAGAG GCGACAAGGC GACTAAGATT TTATCTGGCA CGGACTGGCA AGGAACCCGT
GTTTATGATG CTGCTGGTAA TGATTTAACG GCAGAAAATG CTAATTTTAT TGGTTTAGCA
AAATATGATG GTGAAACCGG TTTTTACGAG TTTTTCGACA AAAATACTGG GGAAACCCGT
GGTGACGAAG GAACATTTTT TGTGACAGGT GATGGCACAA AACGAATTTT AATTTCGCGG
ACACAAAATT ATCAAGCCGT ACTGGATTTA ACCGAAGTGA GTAAAGACNA ATTTACTTAC
AAGCGTTTAG GGAAAGATAA ACTGGGGAAT GATGTTGAAG TTTACGTGGA ACACATCCCT
TATCATGGGA AAAAATTAGC TTTTACAAAT GGACGTGAAG CATTAACCAA TCAAACTGGC
AAAATTGTGA CAAATAAATC AGGGGATAAA ATTTTAGGAA CAACCTTGTG GAATGGCACA
AAAGTCGTAG ATAAAAACGG TAATGATGTG ACAGCGGCCA ATCAAAATTT CATTAGTTTA
GCGAAATTTG ATCCAAACAC AAGTAAATAT GAATTTTTCA ATTTACAAAC AGGTGAAACC
CGCGGCGACT TTGGGTACTT CCAAGTGGTG GACAATAACA AGATTCGGGC CCATGTATCT
ATTGGTACGA ATCGTTACGG CGCGGCCCTA GAATTAACGG AACTAAACAA TGATCGATTT
ACGTATACTC GAATGGGTAA AGATAATGCT GGTAATGATA TTCAAGTGTT CGTGGAACAT
GAACCTTACC AAGGCACATA TCATCCAGCC TTTACTTTCT AA EF103-2 (SEQ ID
NO:398) MKKKVLSSI TLVTLSTLLI AGYASPAFAD HAANPNSATA NLGKHQNNGQ
TRGDKATKIL SGTDWQGTRV YDAAGNDLTA ENANFIGLAK YDGETGFYEF FDKNTGETRG
DEGTFFVTGD GTKRILISRT QNYQAVVDLT EVSKDXFTYK RLGKDKLGND VEVYVEHIPY
HGKKLAFTNG REALTNQTGK IVTNKSGDKI LGTTLWNGTK VVDKNGNDVT AANQNFISLA
KFDPNTSKYE FFNLQTGETR GDFGYFQVVD NNKIRAHVSI GTNRYGAALE LTELNNDRFT
YTRMGKDNAG NDIQVFVEHE PYQGTYHPAF TF EF103-3 (SEQ ID NO:399)
TCATGCAG CCAATCCAAA TAGTGCTACA GCAAATTTAG GCAAACATCA AAACAATGGC
CAAACAAGAG GCGACAAGGC GACTAAGATT TTATCTGGCA CGGACTGGCA AGGAACCCGT
GTTTATGATG CTGCTGGTAA TGATTTAACG GCAGAAAATG CTAATTTTAT TGGTTTAGCA
AAATATGATG GTGAAACCGG TTTTTACGAG TTTTTCGACA AAAATACTGG GGAAACCCGT
GGTGACGAAG GAACATTTTT TGTGACAGGT GATGGCACAA AACGAATTTT AATTTCGCGG
ACACAAAATT ATCAAGCCGT AGTGGATTTA ACCGAAGTGA GTAAAGACNA ATTTACTTAC
AAGCGTTTAG GGAAAGATAA ACTGGGGAAT GATGTTGAAG TTTACGTGGA ACACATCCCT
TATCATGGGA AAAAATTAGC TTTTACAAAT GGACGTGAAG CATTAACCAA TCAAACTGGC
AAAATTGTGA CAAATAAATC AGGGGATAAA ATTTTAGGAA CAACCTTGTG GAATGGCACA
AAAGTCGTAG ATAAAAACGG TAATGATGTG ACAGCGGCCA ATCAAAATTT CATTAGTTTA
GCGAAATTTG ATCCAAACAC AAGTAAATAT GAATTTTTCA ATTTACAAAC AGGTGAAACC
CGCGGCGACT TTGGGTACTT CCAAGTGGTG GACAATAACA AGATTCGGGC CCATGTATCT
ATTGGTACGA ATCGTTACGG CGCGGCGCTA GAATTAACGG AACTAAACAA TGATCGATTT
ACGTATACTC GAATGGGTAA AGATAATGCT GGTAATGATA TTCAAGTGTT CGTGGAACAT
GAACCTTACC AAGGCACATA TCATCCAGCC T EF103-4 (SEQ ID NO:400)
HAANPNSATA NLGKHQNNGQ TRGDKATKIL SGTDWQGTRV YDAAGNDLTA ENANFIGLAK
YDGETGFYEF FDKNTGETRG DEGTFFVTGD GTKRILISRT QNYQAVVDLT EVSKDXFTYK
RLGKDKLGND VEVYVEHIPY HGKKLAFTNC REALTNQTGK TVTNKSGDKI LGTTLWNGTK
VVDKNGNDVT AANQNFISLA KFDPNTSKYE FFNLQTGETR CDFGYFQVVD NNKIRAHVSI
GTNRYCAALE LTELNNDRFT YTRMGKDNAG NDIQVFVEHE PYQGTYHPA EF104-1 (SEQ
ID NO:401) TGAAAGGGGA TTAGTATGAA GAAAAAAACT TTTTCTTTTG TGATGTTGAG
TATACTTCTC GCACAAAATT TCGGGTTTGC CGTAAATGCC TATGCTGTAA CAACGACAGA
AGCACAAACA GAGACCACTG ATACAGCAAA AAAAGAGGCA GAGTTATCGA ACTCAACACC
ATCTTTACCT TTAGCAACAA CGACTACTTC AGAAATGAAT CAACCAACTG CAACAACTGA
ATCGCAAACC ACAGAGGCGA GCACAACAGC TTCCAGTGAT GCTGCTACAC CATCTGAACA
ACAAACAACG GAGGACAAGG ACACCTCACT TAATGAAAAA GCCCTGCCAG ATGTTCAAGC
GCCAATTACA GATGAACTAC TTGACAGTAT GAGTCTTGCG CCGATTGGTG GAACAGAATA
CAGCCAAACA GAGGTTCACC GCGAATTAAA TACAACACCG GTAACCGCTA CGTTCCAATT
TGCTGTTGGA AACACAGGTT ATGCACCTGG ATCAGTTTAT ACAGTTCAAT TACCAGAACA
TTTAGGTTAT TCAACTGTCA GCGGAGAAGT GACAGGCATT GGCGCAACTT GGGCAGTCGA
TGCGGCGACC AAAACATTAA GTATTACGTT TAATCAACGA GTTTCAGATA CTTCCTTTAA
AGTAGAACTA AAAAGTTATC TAACAACAGA GGCGGAACCA TTAATCAAAA TTGAAACTCC
AGGAAAAAAT AAAAAAACCT ACTCGTTTGA TTTATATGAA CAAGTGGAAC CAATTCAATA
TAACGAACGA ACCAGAACGA CGGGGTTAGA TGGCGAAATT TTTTATAATT TAGACCGGAC
GTTAACTGGC AATCAAACAT TAGAATTATT AACAACAGAG ACGCCAGGCG CTGTCTTTGG
AAAACAAGAT AACTTGGAAC CTCAAGTTTT CAGTTACGAT GTCGACATTA ATGGTCAAAT
TTTACCAGAA ACGCAAACCT TGTTAACACC TGGCAAAGAT TATACATTAA GCGATAATTC
ACTCGGGCGG ATTGCTGTAA CTGTTCCAAA CATCAATCAA CAAAAAGCCT ATTCCTTATC
GATTAATCGG ACAATTTATT TAGAGAGTGC TTCGGACTAT AACTACTTAT ATTCGCAGCA
GTATCCAACA ACAAAAATTG GCTCAATTTC TTTGAAAAGT ACGACAGGAA CTAAACAAAC
AACCGATTTT ACTGCTAAGA CGAGTCAAAC AAGTAAAGTA ATTGCTGATC GTGAAATGCG
TAGTATGTCC TATATCAGTT TTCAAAGCAA AGGGAAATAT TATGTAACAA TTTATGGCAC
GTTAACAGAA ACAAAAGTGG GTCAACAAAT COTATTAGAG AGTACAAACG GTCAAGAAAT
TAAGAATCCT AAATTTACGG CGTATGGTCC TTTATATGAA AATGTAAAAT TGGAAGACTA
TTTTGATATT AAAACTGAAG GTGGCAAGCT CACTTTAACG GCCACAAAAG ATAGCTATTT
AAGAATAAAT ATTTCTGATT TAACAATGGA TTTTGACAAG AAGGACATTA ATCTATCATT
AAGTACACCT GTAATTGGTC CTAATAAAGC CATTCAATTA GTATCCGATC AATATATTGA
ACCAATTAGT GTTGTTAATC CTTTGAATGC TGAAACTGCT TGGGGTAATT ATGATCAAAA
TGGTGCCTAT TCATCAAGAA CAACTGTCTC AGTTATGGGA AGCAAAGAGA AACCGATTCA
AAATTTAGAA ATTAAAGTAA AGCATCCTAA TTATCTTTCA TTACGAGCTA CAAAAGAAAT
TTATTTTTAT TACAAGTTAG GAACGGATTA TACAGTAACG CCAACGTCAG ATGGTTCAGT
TATTAAGTTC ACTACGCCAA TAACCAACGA AATCCAAATT CCAATTGGTT TTAATTATGT
GCCAGATAGT TTGCCAAAAG ATAAAAGTAT CCCAGTCGAT ACGATACCGA TAACAATGAG
TGCTGAAGGT TTAACTCCAG TTGATACGAC AGTAACTACT AATAGTAAGC GTGGTTCTGA
ACGAACACTT CAAAGTAGTA AAAATCAATT CCTTGTCAAT GCACGAAATG ATTCTTTTGA
CTCACTAAGC GTCCGTACAA AAATTCCAGC TGGCGCCGAT GTTCTTTTTG ACATTTATGA
TGTTTCAAAC GATCAGGTAG ATTCAATTTA TCCACAATAC TGGGACCGCG GTCAATACTT
TGATAAACCA ATGACGCCAA ACAGCCCTGG ATATCCAACG ATTACTTTTG ACGAAAATAC
CAATAGTTAC ACGTTTGATT TTGGAAAAAC CAACAAACGT TACATTATTG AGTATAAAAA
CGCCAATGGC TGGATCGACG TGCCAACTCT TTATATAACA GGGACAGCGA AAGAACCACA
ATCGAATAAT AATGAAGGCT CTGCTTCGGT TTCTGTTCAA AATGAAGCGT TAGACATTTT
GAGTGCAACA CAAGCGGCGA ATCCAACATT AAAAAATGTA ACAAAAACGA CAGTAACAAC
AAAAAATATT GATAATAAAA CACATCGTGT GAAAAATCCA ACGATTGAAT TAACACCAAA
AGGCACAACC AATGCTCAAA TCGATTTGAA TTCTATTACC GTGAAAGGCG TGCCAGAAGA
TGCTTATTCA TTAGAGAAGA CTACAAACGG TGCGAAAGTC ATTTTTAAAG ACTATACATT
GACAGAAAAC ATTACGATTG AATACAATAC GGTCTCTGCA AACGCTGGCC AAATCTATAC
AGAAACAACA ATCGACTCTG AAACATTGAA CCAGATGTCT GCTAGCAAGA AAAAAGTCAC
CACTGCGCCA ATCACATTGA AATTCTCAGA AGGTGATGCG GAAGGTATTG TTTATTTAGC
AACTGCCACA TTCTACACGC ATAACGTAGA GGATGAAAAC CAAGCAATTG CGAAGGTTTC
TTTTCAACTA ATTCATAATG TCACCCATAC AGCAACCGAA TTTACAACAG ATGAAAAAGG
TCAATACTCC TTTGATGCCA TCATGACAGG TOATTATACT TTGCGAGTAA CGAATGTACC
GCAGGAATAT TCCGTGGATG AAGAGTATTT GACAGGAAAA GCCATTAAGC TGGTCAAACG
ACACAACCAA CTAAAAATTC CATTAACGAA AACAATTGAT CACAGTCCTT TACAAGTCAA
AGATTCAACG ATTTATGTCG GCGATTCATG GAAACCAGAA CAGAACTTTG TTTCAGCAAC
AGATAAAACA GGTCAAGACG TTCCCTTCGA AAAAATCACT GTTTCAGGTC AAGTTGATAA
CANCAAAGCA GCCGTTTATC CAATTATTTA CAGTGACGAA GGTAAAGAAG AAACAGCCTA
TGTGACCGTC AAACCCCACC AATCTAAGTT AGAGGTCAAA GATACAACGA TTTATGTTGG
TGATTCGTGG AAACCAGAAG ATAATTTCGT TTCAGCCACA GACAAAACAG GTCAACACGT
NCCGTTTGAA AAAATTGATG TTCAGCGAAC AGTCAATGTT CATAAAATAC GCCATTATGA
AATTGTCTAT AAAAATGGCA NAAAAGAAGC GAAAGCAATC GTTCATGTCC GTGATGACAG
TCAGTTAGAG GTTAAAGATA CAACGATTTA TGTTGGTGAT TCCTGGAAAC CAGAAGATAA
TTTCGTTTCA GCAACAGACA AAACAGGCCA AGACGTTCCG TTTGAAAAAA TCACTGTTTC
AGGTCAAGTT GATACTAGCA AAGCAGGCGT TTATCCAATC GTTTACAGTT ACGAAGGTAA
AGAAGAAACA GCTAATGTCA CTCTCAAACC CGACCAATCT AAGTTAGAGG TTAAAGATAC
AACGATTTAT GTGGGCGATA AATCGGAACC AGAAGATAAT TTCGTTTCAG CAACAGACAA
AACAGGTCAA GATGTCCCGT TTGAAAAAAT TGACGTTCAG GGAACAGTGA ATCTTGATAA
AATAGGCGAT TATGAAATTG TCTATAAAAA TGGCACAAAA GAAGCGAAAG CAATCGTTCA
TGTCCGTGAT CACAGTCAGT TACAGGTCAA AGATACAACA ATTTATGTGG GTGATAAATG
GCAAGCACAA GATAACTTCG TTTCCGCGAC AGACAAAACA GGTCAAGACG TTCCGTTTGA
AAAAATTGAT GTTCAGGGAA CAGTGAATGT TGATAAAATA GGCGATTATC AAATTGTCTA
TAAAAATGGC ACAAAAGAAG CGAAAGCAAT CGTTCATGTC CGTGATCATA GTCGTTTACA
AGTCAAGGAT ACAACGATTT ATGTCGGCGA TTCNTGGANA CCAGAAGNGA ACTTTGTTTC
AGCNACAGAT AAAACAGGTC AAGATCTCCC ATTCGAAAAA ATCACTGTT EF104-2 (SEQ
ID NO:402) MKKKTF SFVMLSILLA QNFCFAVNAY AVTTTEAQTE TTDTAKKEAE
LSNSTPSLPL ATTTTSEMNQ PTATTESQTT EASTTASSDA ATPSEQQTTE DKDTSLNEKA
LPDVQAPITD ELLDSMSLAP IGGTEYSQTE VHRELNTTPV TATFQFAVGN TGYAFGSVYT
VQLPEHLGYS TVSGEVTGIG ATWAVDAATK TLSITFNQRV SDTSFKVELK SYLTTEAEPL
IKIETPGKNK KTYSFDLYEQ VEPIQYNERT RTTGLDCETF YNLDRTLTGN QTLELLTTET
PGAVFGKQDN LEPQVFSYDV DINOQILPET QTLLTPGKDY TLSDNSLGRI AVTVPNNNQQ
KAYSLSINRT IYLESASDYN YLYSQQYPTT KIGSISLKST TGTKQTTDFT AKTSQTSKVI
ADREHRSMSY ISFQSKGKYY VTIYGTLTET KVGQQIVLES TNCQEIKNPK FTAYGPLYEN
VKLEDYFDIK TEGGKLTLTA TKDSYLRINI SDLTMDFDKK DINLSLSTPV IGPNKAIQLV
SDQYIEPISV VNPLNAETAW GNYDQNGAYS SRTTVSVMGS KEKPIQNLEI KVKHPNYLSL
RATKEIYFYY KLGTDYTVTP TSDCSVTKFT TPITNEIQIP ICFNYVPDSL PKDKSIPVDT
IPITMSAEGL TPVDTTVTTN SKRGSERTLQ SSKNQFLVNA ENDSFDSLSV RTKIPAGADV
LFDIYDVSND QVDSIYPQYW DRGQYFDKPM TPNSPGYPTI TFDENTNSYT FDFGKTNKRY
IIEYKNANCW IDVPTLYITG TAKEPQSNNN EGSASVSVQN EALDTLSATQ AANPTLKNVT
KTTVTTKNID NKTHRVKNPT IELTPKGTTN AQIDLNSITV KGVPEDAYSL EKTTNGAKVI
FKDYTLTENI TTEYNTVSAN AGQIYTETTI DSETLNQMSA SKKKVTTAPI TLKFSEGDAE
CIVYLATATE YTHNVEDENQ AIAKVSFELI DNVTHTATEF TTDEKGQYSF DAIMTGDYTL
RVTNVPQEYS VDEEYLTGKA TKLVKCDNQL KTPLTKTIDH SRLQVKDSTI YVGDSWKPEE
NFVSATDKTC QDVPFEKTTV SGQVDNXKAG VYPIIYSDEG KEETAYVTVK PDQSKLEVKD
TTIYVGDSWK PEDNFVSATD KTGQDVPFEK IDVQGrJNVD KIGDYEIVYK NGXKEAKAIV
HVRDDSQLEV KDTTTYVGDS WKPEDNFVSA TDKTGQDVPF EKITVSCQVD TSKAGVYPIV
YSYEGKEETA NVTVKPDQSK LEVKDTTIYV GDKWEPEDNF VSATDKTGQD VPFEKIDVQG
TVNVDKIGDY EIVYKNCTKE AKATVHVRDD SQLEVKDTTI YVGDKWEAED NFVSATDKTG
QDVPFEKTDV QGTVNVDKIG DYEIVYKNGT KEAKAIVHVR DDSRLQVKDT TIYVGDSWXP
EXNFVSATDK TGQDVPFEKI TV EF104-3 (SEQ ID NO:403) TGTAA CAACGACAGA
AGCACAAACA GAGACCACTG ATACAGCAAA AAAAGAGGCA GAGTTATCGA ACTCAACACC
ATCTTTACCT TTAGCAACAA CGACTACTTC AGAAATGAAT CAACCAACTG CAACAACTGA
ATCGCAAACC ACAGAGGCGA GCACAACAGC TTCCAGTGAT GCTGCTACAC CATCTGAACA
ACAAACAACG GAGGACAAGG ACACCTCACT TAATGAAAAA GCCCTGCCAG ATGTTCAAGC
GCCAATTACA GATCAACTAC TTGACAGTAT GAGTCTTGCG CCGATTGGTG GAACAGAATA
CAGCCAAACA GAGGTTCACC GCGAATTAAA TACAACACCG GTAACCGCTA CGTTCCAATT
TCCTGTTGGA AACACAGGTT ATCCACCTGG ATCAGTTTAT ACAGTTCAAT TACCAGAACA
TTTAGGTTAT TCAACTGTCA GCGGAGAAGT CACAGGCATT GGCGCAACTT GGGCAGTCGA
TGCGGCGACC AAAACATTAA
GTATTACGTT TAATCAACGA GTTTCAGATA CTTCCTTTAA AGTAGAACTA AAAAGTTATC
TAACAACAGA GGCGGAACCA TTAATCAAAA TTGAAACTCC AGGAAAAAAT AAAAAAACCT
ACTCGTTTGA TTTATATGAA CAAGTGGAAC CAATTCAATA TAACGAACGA ACCAGAACGA
CGGGGTTAGA TGGCGAAATT TTTTATAATT TAGACCGGAC GTTAACTGGC AATCAAACAT
TAGAATTATT AACAACAGAG ACGCCAGGCG CTGTCTTTGG AAAACAAGAT AACTTGCAAC
CTCAAGTTTT CAGTTACGAT GTCGACATTA ATGGTCAAAT TTTACCAGAA ACGCAAACCT
TCTTAACACC TGGCAAAGAT TATACATTAA GCGATAATTC ACTCGGGCGG ATTGCTGTAA
CTGTTCCAAA CATGAATCAA CAAAAAGCCT ATTCCTTATC GATTAATCGG ACAATTTATT
TAGAGAGTGC TTCGGACTAT AACTACTTAT ATTCGCAGCA GTATCCAACA ACAAAAATTG
GGTCAATTTC TTTCAAAAGT ACGACAGGAA CTAAACAAAC AACCGATTTT ACTCCTAACA
CGACTCAAAC AAGTAAAGTA ATTGCTGATC GTGAAATGCG TAGTATGTCC TATATCAGTT
TTCAAAGCAA AGGGAAATAT TATGTAACAA TTTATGGCAC GTTAACAGAA ACAAAAGTGG
GTCAACAAAT CGTATTAGAG AGTACAAACG GTCAAGAAAT TAAGAATCCT AAATTTACGG
CGTATGGTCC TTTATATGAA AATGTAAAAT TGGAAGACTA TTTTGATATT AAAACTGAAG
GTGGCAAGCT CACTTTAACG GCCACAAAAG ATAGCTATTT AAGAATAAAT ATTTCTGATT
TAACAATGGA TTTTGACAAG AAGGACATTA ATCTATCATT AAGTACACCT GTAATTGGTC
CTAATAAACC CATTCAATTA GTATCCGATC AATATATTGA ACCAATTAGT GTTGTTAATC
CTTTGAATGC TGAAACTGCT TGGGGTAATT ATGATCAAAA TGGTGCCTAT TCATCAAGAA
CAACTGTCTC AGTTATGGGA AGCAAAGAGA AACCGATTCA AAATTTAGAA ATTAAAGTAA
ACCATCCTAA TTATCTTTCA TTACGAGCTA CAAAACAAAT TTATTTTTAT TACAAGTTAG
GAACGGATTA TACAGTAACG CCAACGTCAG ATGGTTCACT TATTAAGTTC ACTACGCCAA
TAACCAACGA AATCCAAATT CCAATTGGTT TTAATTATGT GCCAGATAGT TTGCCAA4AG
ATAAAAGTAT CCCAGTCGAT ACGATACCGA TAACAATGAG TGCTGAAGGT TTAACTCCAG
TTGATACGAC AGTAACTACT AATAGTAAGC GTGGTTCTGA ACGAACACTT CAAAGTAGTA
AAAATCAATT CCTTGTCAAT GCACGAAATG ATTCTTTTGA CTCACTAAGC GTCCGTACAA
AAATTCCAGC TCGCGCCGAT GTTCTTTTTG ACATTTATGA TGTTTCAAAC GATCAGGTAG
ATTCAATTTA TCCACAATAC TCGGACCGCG GTCAATACTT TGATAAACCA ATGACGCCAA
ACAGCCCTGG ATATCCAACG ATTACTTTTG ACGAAAATAC CAATAGTTAC ACGTTTGATT
TTGGAAAAAC CAACAAACGT TACATTATTG AGTATAAAAA CGCCAATGGC TGGATCGACG
TGCCAACTCT TTATATAACA GGGACAGCGA AAGAACCACA ATCGAATAAT AATGAAGGCT
CTGCTTCGGT TTCTGTTCAA AATGAAGCCT TAGACATTTT GAGTGCAACA CAACCGGCGA
ATCCAACATT AAAAAATGTA ACAAAAACGA CAGTAACAAC AAAAAATATT GATAATAAAA
CACATCGTGT GAAAAATCCA ACGATTGAAT TAACACCAAA AGGCACAACC AATCCTCAAA
TCGATTTGAA TTCTATTACC CTGAAAGGCG TCCCAGAACA TGCTTATTCA TTAGAGAAGA
CTACAAACGG TGCGAAAGTC ATTTTTAAAG ACTATACATT GACAGAAAAC ATTACGATTG
AATACAATAC GGTCTCTGCA AACGCTGGCC AAATCTATAC ACAAACAACA ATCGACTCTG
AAACATTGAA CCAGATGTCT GCTAGCAAGA AAAAAGTCAC CACTGCGCCA ATCACATTGA
AATTCTCAGA AGGTGATGCG GAAGGTATTG TTTATTTAGC AACTGCCACA TTCTACACGC
ATAACGTAGA GGATGAAAAC CAACCAATTG CGAAGGTTTC TTTTGAACTA ATTGATAATG
TCACGCATAC AGCAACCGAA TTTACAACAC ATGAAAAAGG TCAATACTCC TTTGATGCCA
TCATCACACG TGATTATACT TTGCGAGTAA CGAATGTACC CCAGGAATAT TCCGTGGATG
AAGAGTATTT CACAGGAAAA GCCATTAAGC TGGTCAAAGG AGACAACCAA CTAAAAATTC
CATTAACCAA AACAATTGAT CACAGTCGTT TACAAGTCAA AGATTCAACG ATTTATGTCG
GCGATTCATG GAAACCAGAA GAGAACTTTG TTTCAGCAAC AGATAAAACA GGTCAAGACG
TTCCCTTCGA AAAAATCACT GTTTCAGGTC AAGTTGATAA CANCAAAGCA CGCGTTTATC
CAATTATTTA CAGTGACCAA GGTAAAGAAG AAACAGCCTA TGTGACCGTC AAACCCGACC
AATCTAAGTT AGAGGTCAAI GATACAACGA TTTATGTTGG TGATTCGTGG AAACCAGAAG
ATAATTTCGT TTCAGCGACA GACAAAACAG GTCAAGACGT NCCGTTTGAA AAAATTGATG
TTCAGGGAAC AGTGAATGTT GATAAAATAG GCGATTATGA AATTGTCTAT AAAAATGGCA
NAAAAGAAGC GAAAGCAATC GTTCATGTCC GTGATGACAG TCAGTTAGAG GTTAAAGATA
CAACGATTTA TGTTGGTGAT TCGTGGAAAC CAGAAGATAA TTTCGTTTCA GCAACAGACA
AAACAGGCCA AGACGTTCCG TTTGAAAAAA TCACTGTTTC AGGTCAAGTT GATACTAGCA
AAGCAGCCGT TTATCCAATC GTTTACAGTT ACGAAGGTAA AGAAGAAACA GCTAATGTGA
CTGTCAAACC CGACCAATCT AAGTTAGAGG TTAAAGATAC AACGATTTAT GTGGCCGATA
AATGGGAACC AGAAGATAAT TTCGTTTCAG CAACAGACAA AACAGGTCAA GATGTCCCGT
TTGAAAAAAT TGACGTTCAG GCAACAGTGA ATGTTGATAA AATAGGCGAT TATGAAATTG
TCTATAAAAA TGGCACAAAA GAAGCGAAAG CAATCGTTCA TGTCCGTGAT GACAGTCAGT
TAGAGGTCAA AGATACAACA ATTTATGTCG GTGATAAATG GGAAGCAGAA GATAACTTCG
TTTCCCCGAC AGACAAAACA GGTCAAGACG TTCCGTTTGA AAAAATTGAT GTTCAGGGAA
CAGTGAATGT TGATAAAATA GGCGATTATG AAATTGTCTA TAAAAATGGC ACAAAAGAAG
CGAAAGCAAT CGTTCATGTC CGTGATGATA GTCGTTTACA AGTCAAGGAT ACAACGATTT
ATGTCGGCGA TTCNTGGANA CCAGAAGNGA ACTTTGTTTC AGCNACACAT AAAACAGGTC
AAGATGTCCC ATTG EF104-4 (SEQ ID NO:404) VTTTEAQTE TTDTAKKEAE
LSNSTPSLPL ATTTTSEMNQ PTATTESQTT EASTTASSDA ATPSEQQTTE DKDTSLNEKA
LPDVQAPITD ELLDSMSLAP IGGTEYSQTE VHRELNTTPV TATFQFAVGN TGYAPGSVYT
VQLPEHLGYS TVSGEVTGIG ATWAVPAATK TLSITFNQRV SDTSFKVELK SYLTTEAEPL
IKIETPGKNK KTYSFDLYEQ VEPIQYNERT RTTGLDGEIF YNLDRTLTGN QTLELLTTET
PGAVFGKQDN LEPQVFSYDV DINGQILPET QTLLTPGKDY TLSDNSLGRI AVTVPNNNQQ
KAYSLSINRT IYLESASDYN YLYSQQYPTT KIGSISLKST TGTKQTTDFT AKTSQTSKVI
ADREMRSMSY ISFQSKGKYY VTIYGTLTET KVGQQIVLES TNGQEIKNPK FTAYGPLYEN
VKLEDYFDIK TEGGKLTLTA TKDSYLRINI SDLTMDFDKK DINLSLSTPV IGPNKAIQLV
SDQYIEPTSV VNPLNAETAW GNYDQNGAYS SRTTVSVMGS KEKPIQNLEI KVKHPNYLSL
RATKEIYFYY KLCTDYTVTP TSDGSVIKFT TPITNEIQIP IGFNYVPDSL PKDKSIPVDT
IPTTMSAEGL TPVDTTVTTN SKRGSERTLQ SSKNQFLVNA RNDSFDSLSV RTKIPAGADV
LFDIYDVSND QVDSIYPQYW DRGQYFDKPM TPNSPGYPTI TFDENTNSYT FDFGKTNKRY
IIEYKNANGW IDVPTLYITG TAKEPQSNNN EGSASVSVQN EALDILSATQ AANPTLKNVT
KTTVTTKNID NKTHRVKNPT IELTPKGTTN AQIDLNSITV KGVPEDAYSL EKTTNGAKVI
FKDYTLTENI TIEYNTVSAN AGQIYTETTI DSETLNQMSA SKKKVTTAPI TLKFSEGDAE
GIVYLATATF YTHNVEDENQ AIAKVSFELI DNVTHTATEF TTDEKGQYSF DAIMTGDYTL
RVTNVPQEYS VDEEYLTGKA IKLVKGDNQL KIPLTKTIDH SRLQVKDSTI YVGDSWKPEE
NFVSATDKTG QDVPFEKITV SGQVDNXKAG VYPIIYSDEG KEETAYVTVK PDQSKLEVKD
TTIYVGDSWK PEDNFVSATD KTGQDVPFEK IDVQGTVNVD KIGDYEIVYK NGXKEAKAIV
HVRDDSQLEV KDTTIYVGDS WKPEDNFVSA TDKTCQDVPF EKITVSCQVD TSKAGVYPIV
YSYEGKEETA NVTVKPDQSK LEVKDTTIYV GDKWEPEDNF VSATDKTGQD VPFEKIDVQG
TVNVDKIGDY EIVYKNGTKE AKAIVHVRDD SQLEVKDTTI YVGDKWEAED NFVSATDKTG
QDVPFEKIDV QGTVNVDKIG DYEIVYKNGT KEAKAIVHVR DDSRLQVKDT TIYVGDSWXP
EXNFVSATDK TGQDVPF EF105-1 (SEQ ID NO:405) TAAATGAAAA AAACAGTCGT
CTACTCCTTG TTATTCGGAA CAATGTTGCT TGGCGCCACT GTTCCTGCTG AAGCGGCGAC
GGTCGTTTTT GATAGCGAAC AGTCGATTGT TTTTACCCCA AGCACAGATG GGACGGATCC
AGTAAATCCA GAAAATCCCG ATCCAGAAAA ACCAGTTCGA CCAGTCGATC CAACGAATCC
TGATGGACCT AATCCAGGTA CCCCTGGTCC ACTTTCCATC GATTATGCCT CAAGTTTGGA
TTTTGGGAGT AATGAGATAT CCAATAAGGA TCAAACGTAT TTTGCCAGAG CGCAAACCTA
TAGAAATCCA GATGGTTCAG CAAGTGAATT GGCAACTGCT AATTATGTAC AAGTAAGTGA
TTTACGGGGA ACCAATGCTG GCTGGGTTTT AAAAGTGAAA CAAAATGGTC AATTTCGTAA
TGCAGAAACA TTACACAAAG AATTAACAGG CGCCACCGTC GCCTTTACTG AGCCCAGTGT
TCGCTCAAAT GCGACGCACG TATTGCCGCC AACTGCTACC GCAAACATTC AATTAGATGC
TGCGGGCGCA CAAACTGTTG TCATGCAAGC CCCAGAAAAG ACCGCCGCCG GAACGTGGAT
CACGCTGTGG GGGCAAGCAG AAAAAGTGAC CGAAAAAAAT CAACAAGCAC AGCAAGTAAA
TGCCACAATC ACACGGGCAA TCTCACTAAC TGTTCCTGGG AAAACCCCTA AGGATGCAGT
ACAATATAAA ACAACATTGA CTTGGCTACT TTCAGATGTA CCAGTAAATA ATGGAGGGAA
ATAA EF105-2 (SEQ ID NO:406) MKKTVVYSLL FGTMLLGATV PAEAATVVFD
SEQSIVFTPS TDGTDPVNPE NPDPEKPVRP VDPTNPDGPN PGTPGPLSID YASSLDFGSN
EISNKDQTYF ARAQTYENPD GSASELATAN YVQVSDLRGT NAGWVLKVKQ NCQFRNAETL
HKELTGATVA FTEPSVRSNA TDVLPPTATA NIQLDAACAE TVVMQAPEKT GAGTWITLWG
QAEKVTEKNQ QGQQVNATIT RAISLTVPCK TPKDAVQYKT TLTWLLSDVP VNNGGK
EF105-3 (SEQ ID NO:407) GGCGAC GGTCGTTTTT GATAGCGAAC AGTCGATTGT
TTTTACCCCA AGCACAGATG GGACGGATCC AGTAAATCCA GAAAATCCCG ATCCAGAAAA
ACCAGTTCGA CCAGTCGATC CAACGAATCC TGATGGACCT AATCCAGGTA CCCCTGGTCC
ACTTTCCATC GATTATGCCT CAAGTTTGGA TTTTGGGACT AATGAGATAT CGAATAAGGA
TCAAACGTAT TTTGCCAGAG CGCAAACCTA TAGAAATCCA GATGGTTCAG CAAGTGAATT
GGCAACTGCT AATTATGTAC AAGTAAGTGA TTTACGGCGA ACCAATGCTG GCTGGGTTTT
AAAAGTGAAA CAAAATGGTC AATTTCGTAA TGCAGAAACA TTACACAAAG AATTAACAGG
CGCCACCGTC GCCTTTACTG AGCCCAGTGT TCGCTCAAAT GCCACGGACG TATTGCCGCC
AACTGCTACC GCAAACATTC AATTAGATGC TGCGGGCGCA GAAACTGTTG TCATGCAACC
CCCAGAAAAG ACCGGCGCCG GAACCTGGAT CACGCTGTGG GGGCAAGCAG AAAAAGTGAC
CGAAAAAAAT CAACAAGGAC AGCAAGTAAA TGCCACAATC ACACGGGCAA TCTCACTAAC
TGTTCCTGGG AAAACCCCTA AGGATGCAGT AC EF105-4 (SEQ ID NO:408) ATVVFD
SEQSIVFTPS TDGTDPVNPE NPDPEKPVRP VDPTNPDGPN PGTPGPLSID YASSLDFGSN
EISNKDQTYF ARAQTYRNPD GSASELATAN YVQVSDLRGT NAGWVLKVKQ NGQFRNAETL
HKELTGATVA FTEPSVRSNA TDVLPPTATA NIQLDAAGAE TVVMQAPEKT GAGTWITLWG
QAEKVTEKNQ QGQQVNATIT PAISLTVPGK TPKDAV EF106-1 (SEQ ID NO:409)
TAGTCGTTTA TGAAGAAAAA AATCGTTGGT ACAATTACGT TGTTGGCTTT AAGTGCGTTA
TTAGTTGGTG GAGCACGAGG CGCTTTGACG CCAGAAGCAT ACGTTCCTCA AAGCGTAGAC
AATCCCAATA ATTTAGGGGA TTTACCTGAG TATTTACGTT CAGTTGGTAT TAGACAAGAT
GAACCATTAT CAGAAAAAGA TTGGGCTGGA ACACGCGTTT ATGATCGAAA TGGGAATGAC
TTAACAGATG AAAATCAAAA CCTATTACAT GCAATCAAAT TTGATGCAAC CACTAGTTTC
TATCAATTTT TTGATAAAGA GACTGGAGAA TCAACAGGAG ATGAACGAAC CTTCTTTATG
ACCGCTCGTA TTACAGATGT TTCCCGTCTT GTAATTATTT CTGAAACCAA AAATTATCAA
CGTGTATACC CACTTAGAAC TTTATACCAA GATACTTTTA CGTATAGACA GATGGGGAAA
GATAAAAACG GAAATGATAT TGAAGTTTTC GTAGAAAACA AAGCAACCTC AGGACCAGTT
TATGGTCGTC CGCAGCCATA CCCCAATAAT CGTCCCAGAA CACTAGAATT CACGAATGGA
CGCCGTGCCA TGACAGAACA AACAGGCCAG ATTGATCTAA ATCGACAAGG CGATGAAATT
ATTGGTAAAA CTTCCTTTGA TGGGACACCG CAACTTCTTT GGAATGGCAC AAAAGTAGTG
GATAAAGATG GCAATGACGT AACTTCGGCC AACCAAAACT TTATCAGCTT AGCGAAATTT
GACCAAGATA GCAGCAAATA TGAATTTTTC AATTTACAAA CTGGTGAAAC TCGTGGCGAC
TATGGCTACT TTAAACTAGG AAATCAAAAT AAATTCCGTG CCCATGTTTC CATTGGAACC
AATCGCTATG GCGCTGTCTT AGAGTTAACA GAATTGAATG ATAATCGTTT TACGTACACA
CGAATGGGTA AAGATAACGA AGGAAACCAT ATCCAAGTCT ATGTGGAACA TCAACCATAC
CAAGGAACTT TTAATCCTGA ATTTACCTTT TAA EF106-2 (SEQ ID NO:410)
MKKKIVGT ITLLALSALL VGGAGGALTA EAYVPQSVDN PNNLGDLPEY LRSVGIRQDE
GLSEKDWAGT RVYDRNGNDL TDENQNLLHA IKFDATTSFY EFFDKETGES TGDEGTFFMT
AGITDVSRLV IISETKNYQG VYPLRTLYQD TFTYRQMCKD KNGNDIEVFV ENKATSGPVY
GRPQPYPNNR PRTLEFTNGR RAMTEQTGQI DVNRQGDETI GKTSFDGTPQ LLWNGTKVVD
KDCNDVTSAN QNFISLAKFD QDSSKYEFFN LQTGETRGDY GYFKVGNQNK FRAHVSIGTN
RYGAVLELTE LNDNRFTYTR MGKDNEGNDI QVYVEHEPYQ GTFNPEFTF EF106-3 (SEQ
ID NO:411) AT ACGTTCCTCA AAGCGTAGAC AATCCCAATA ATTTAGGGGA
TTTACCTGAG TATTTACGTT CAGTTGGTAT TAGACAAGAT GAAGGATTAT CAGAAAAAGA
TTGGGCTGGA ACACGCGTTT ATGATCGAAA TGGGAATGAC TTAACAGATG AAAATCAAAA
CCTATTACAT GCAATCAAAT TTGATGCAAC CACTAGTTTC TATGAATTTT TTGATAAAGA
GACTGGAGAA TCAACAGGAG ATGAAGGAAC CTTCTTTATG ACCGCTGGTA TTACAGATGT
TTCCCGTCTT GTAATTATTT CTGAAACCAA AAATTATCAA GGTGTATACC CACTTACAAC
TTTATACCAA GATACTTTTA CGTATAGACA GATGGGGAAA GATAAAAACG GAAATGATAT
TGAAGTTTTC GTAGAAAACA AAGCAACCTC AGGACCAGTT TATGGTCGTC CGCAGCCATA
CCCCAATAAT CGTCCCAGAA CACTAGAATT CACGAATGGA CGCCGTGCCA TGACAGAACA
AACAGGCCAG ATTGATGTAA ATCGACAAGG GGATGAAATT ATTGGTAAAA CTTCCTTTGA
TGGGACACCG CAACTTCTTT CGAATGGCAC AAAAGTAGTG GATAAAGATG GCAATGACGT
AACTTCGGCC AACCAAAACT TTATCAGCTT AGCGAAATTT GACCAAGATA GCAGCAAATA
TGAATTTTTC AATTTACAAA CTGGTGAAAC TCGTCGCGAC TATGGCTACT TTAAAGTAGG
AAATCAAAAT AAATTCCGTG CCCATGTTTC CATTGGAACC AATCGCTATG GCGCTGTCTT
AGAGTTAACA GAATTGAATG ATAATCGTTT TACGTACACA CGAATGGGTA AAGATAACGA
AGGAAACGAT ATCCAAGTCT ATGTGGAACA TGAACCATAC CAAGGAACTT EF106-4 (SEQ
ID NO:412) YVPQSVDN PNNLGDLPEY LRSVGIRQDE GLSEKDWAGT RVYDRNGNDL
TDENQNLLHA IKFDATTSFY EFFDKETGES TGDEGTFFMT AGITDVSRLV IISETKNYQG
VYPLRTLYQD TFTYRQMGKD KNGNDIEVFV ENKATSGPVY GRPQPYPNNR PRTLEFTNGR
RAMTEQTGQI DVNRQGDEII GKTSFDGTPQ LLWNGTKVVD KDGNDVTSAN QNFISLAKFD
QDSSKYEFFN LQTGETRGDY GYFKVGNQNK FRAHVSIGTN RYGAVLELTE LNDNRFTYTR
MGKDNECNDI QVYVEHEPYQ GT EF107-1 (SEQ ID NO:413) TAAAAAACGG
CACTCAATAT GTCAAAATTT GAAATTTCAA GCTGTGTGTT CTTTGGTAAA ATANATANAA
AAATGCTAGT TATCAGTATC GATAATAACA GGATACTGAT TAAGAAAGGA CTTTATAGAG
ACTATAGATT GAATTTTTAC ATAGAAAGAA GGACCAAGAT GAAGCGAGTA AATTGGAAAA
GATGGCTAGT TGTTGGGTTA AGTTGTTCTT TGTTCATGGA TTCAGTGGTT GGTGTGACTG
TGTTAGCGGA AACGATTACT GGGGCGACGG AGCAAGGAGT AGCAACATCT CAGTCGAGTC
ACGAAGCGAG CCAGACGACG CAAACAACCG AAGAGTCACA GGCAACGGTC GCTAGTGAAG
CGAAAACAGT ACCGCCACAG GAAACGGCAA GAATTGCTTC TCGAGCGATT GGTTATTCTT
CTGTGGAAGG GCGCGAGATT CCCTTTTTCT TTGTGGAGGA AGACGGGACG TTGTTTGATC
CCGACCGAAT TACGATGGCG GTCAATCTTT CCACGTTTTC GTTTTATGAA CACAAATTAC
AACGAACCCC CCTTGAGCCC ACCACTGTGA ATGGCGGAAA GTTACTGTCT ATTCCAACGT
CACCAGCTTT TAAATATGAT ACAAATAACC AGAATCCAAG TAATATTTAT GGCGTTTCTG
AAGTGTCGTT TACTATTCCT AAGGAGTATC AAAGCCTGGA CATTCGACCA AGTACGTTTT
ATACAGGAGA CACTACGCAA TATCCAGTGC CAACGGTTTT TGCGAACGTT GGGGGCAAAG
TGACGAACTA TGTGGGCGCC AATGCGGAGA CGGAATTAGA GTTAACCAAT GAAAAAATGC
CCAATAAGCT GACGTTTGGT CCTAAAAAGA CGTTTAAATA TACGGTACCT ACGGCACCAG
GAGGCGTTAC GTATGCGCTG ACCTATTTTT ATGGAGATGT CGGCGGTCCA ACTAGTTCGC
ACCAAAGACG AGGAACAGCG GGTCCTGTGT ATTATTATTT AACAAAGCGG CGTGTCACGG
AAAAATTTGA GAATCCCGCA GGCGGGGCGA TTCCTGCGCC AGAAGGTTAT ACGCAGGATA
AGAAAACCAT TGTAACAGGG GAGGATTTTA CTTTTACCCA AGAAGGCACC TTGCCTGAAC
GTTACACAGG CAGTGATGGG AAGACGTATT TATTTAAAGG TTGGTACAAA GGGAATGCGA
AACCTAGCAC CTTGGAAACC ACCAAAACGC CTAGTTATGC GGTGACCTAT GATGACAATG
ACGATTTGCA TGTGGTCTAT GAAGAAGCAG TGATGAAAAC CTATACGTTG CCAGCGAGAG
AAGCTTTGTT CGGCTATGTT GATGAGCAAG GAAACTTGAT TAATCCCGCC AAGTTTAAGC
TAAGTGCGAC CATGGGTGAA ACTGACGGAG CCACAGGGGA AATGACGACT TTTCCCACAA
TTGATGGAAT CGATATGCCA GCAAGTCAAT TAAAGAAATT AGCCATCCCG CAAAAAGTCT
ACACACGCCC AGACGATGGG ACAATCGTAA CTTATGGCCC GCAAGAAGTG AGTGTTGAAA
TTCCTAAGTA TTACCAGACG ATTTCGATTT CACCAACTAC TGCGTATACA GGGGATAAAA
CCAAGTATCC AGTACCAAAT GAAGTGCGCC GTGGCATCGA AAACCCCGAC AACATTGTTA
GTAGTTTAGT GGGAANCNCT GCGTATAACT TGACCCAAAA AAGTGCCACA CGCTATACTG
CCCGCCGTTC TTACTGGANG TGGGGCCCCA CGAAGACACT TTACTCAATG AGTATCTATT
CAGGAACTGC TGCGGGCAAC TATAATTTAT CGACCCCTGA TGGCACCATT TATTATTACT
TAGAAAATCG GCGGGTCACT GAACATTTTG TAGACGAAAG TGGCGCAAAA ATCACGCCAC
CAACTGGCTT TACACAAGGA AATCAGCTAG TGGTGGACAG TGAAAACTAT GTCTACACTG
TCGCAAAAGC TTTGCCGAAG ATCTACCAAG CTGGTGAAAA AACCTATATC TTCCAAGGCT
GGTTTAAAGG CAAAACCAAG CCAGCAACAT TAAAGACGAC AACGACCCCA AGTTTTACAC
CAACTTTTAA TGATGAGGAC GACATGACCG CTGTGTACCA AGAAGCGATT CCCACCGCGG
AACTAACGTT AACAGGTGCC GTTGACATAA TCGAAAATGG CGCCACAATG GATTACTGGG
AGGCGCTACT GAAGAACACA GGCGAAGCGC CGTTAACCAC CATTAAAATC AAGCCAACGG
CAACTTGGGC GGCTGGCATC GGCGCACCCA ACACGATATT TGTACAAGGA ACGGGTCAAA
ACACCAAACC TTTTCCTGTC ACCAAAGAAC AATGGACGAC CGGTGCAGGA GTGTCCATCA
CGTTGGATCA GCCTTTACCA GCTGGCGGTC AATTAAAAAT GAACTTATTA GGAACCGCCG
TTACAGCAAA TCCTCGTCAA GTTTTAACCG CTGATGTTGA AGTAACGGGC AACTTTGGCA
GTTTAACTGC CAAAGATACG GTCCGTATTA AAGACTTAGA TCAAGAAATT ACGAGTCCTG
ACGGCGACGG CTTTATTAGT ACCCCGACAT TTGATTTTGG TAAACTAGCA ATTTCAGGAA
GTAAGCAACA ATATGGTTTG AAGAAGGCCG CAGATTACTA CGGCAATGGC ACTCGCAACC
CTTATTTACG CCTGAATACT AGCCAAGCCA ATTGGAGTTT AACGGCCCAG CTATCGCAAC
CAAAATCAGC CACAGACAGC TTGCCAACAA CGACCCGCTT GTTGCTAGGA ACGGCCGCTG
CTGCCAGCTT TACCGATTAC AACCAACCAA CAGAAACCAG GACACCACTT GGCAAGACCA
GCACCGTGAC TTTAACCGCC GACAATACCG CAACAGCGGT GGTCGCAAAC CAACACTTCA
CAGGCAGTGA CGTCTATCAG TTGGACTTCA CGTTTGCTAA CATCAAACTA GAAGTGCCAG
CCAACCAAGG
TATGGCTGGC CAACAATACC AAGCCGCCGT CACGTGGAAT TTAGTGACTG GCCCCTAA
EF107-2 (SEQ ID NO:414) MKRVN WKRWLVVGLS CSLFMDSVVG VTVLAETITG
ATEQGVATSQ SSDEASQTTQ TTEESQATVA SEAKTVPPQE TARIASRAIG YSSVEGREIP
FFFVEEDGTL FDPDRITMAV NLSTFSFYEE KLQRTPLEPT TVNCGKLLSI PTSPAFKYDT
NNQNPSNIYG VSEVSFTIPK EYQSLDIRPS TFYTGDTTQY PVPTVFANVG GKVTNYVGAN
AETELELTNE KMPNKLTFGP KKTFKYTVAT APGGVTYALT YFYGDVGGPT SSHQRRGTAG
PVYYYLTKRR VTEKFENPAG GAIPAPEGYT QDKKTIVTGE DFTFTQEGTL PERYTGSDGK
TYLFKGWYKG NAKPSTLETT KTPSYAVTYD DNDDLHVVYE EAVMKTYTLP AREALFGYVD
EQGNLINPAK FKLSATMGES DGATGEMTTF PTIDGIDMPA SQLKKLAIPQ KVYTRPDDGT
IVTYGPQEVS VEIPKYYQTI SLSPTTAYTG DKTKYPVPNE VRRGIENPDN IVSSLVGXXA
YNLTQKSATR YTARRSYWXW GPTKTLYSMS IYSGTAGGNY NLSTPDGTIY YYLENRRVTE
HFVDESGAKI TPPTGFTQGN QLVVDSENYV YTVAKALPKE YQAGEKTYIF QGWFKGKTKP
ATLKTTTTPS FTPTFNDEDD MTAVYQEAIP TAELTLTGAV DIIENGATMD YWEALLKNTG
EAPLTTIKIK PTATWAAGIG APNTIFVQGT GQNTKAFPVT KEQWTTGAGV SITLDQPLPA
GGQLKMNLLG TAVTGNPGQV LTADVEVTGN FGSLTAKDTV RIKDLDQEIT SPDGDGFIST
PTFDFGKLAI SGSKQQYGLK KAADYYGNGT RNPYLRLNTS QANWSLTAQL SQPKSATDSL
PTTTRLLLGT AAAASFTDYN QPTETRTPLG KTSTVTLTAD NTATAVVANQ QFTGSDVYQL
DFTFANIKLE VPANQGMAGQ QYQAAVTWNL VTGP EF107-3 (SEQ ID NO:415) CC
AGCAAGGAGT AGCAACATCT CAGTCGAGTG ACGAAGCGAG CCAGACGACG CAAACAACCG
AAGAGTCACA GGCAACGGTC GCTAGTGAAG CCAAAACAGT ACCGCCACAG GAAACGGCAA
GAATTGCTTC TCGAGCGATT GGTTATTCTT CTGTGGAAGG GCGCGAGATT CCCTTTTTCT
TTGTGGAGGA AGACGGGACG TTGTTTGATC CCGACCGAAT TACGATGGCG GTCAATCTTT
CCACGTTTTC GTTTTATGAA GAGAAATTAC AACGAACCCC CCTTGAGCCC ACCACTGTGA
ATGGCGGAAA GTTACTGTCT ATTCCAACGT CACCAGCTTT TAAATATGAT ACAAATAACC
AGAATCCAAG TAATATTTAT GGCGTTTCTG AAGTGTCGTT TACTATTCCT AAGGAGTATC
AAAGCCTGCA CATTCGACCA AGTACGTTTT ATACAGGAGA CACTACGCAA TATCCAGTCC
CAACGGTTTT TCCGAACGTT GGGGGCAAAG TGACGAACTA TGTGGGCGCC AATGCGGAGA
CGGAATTAGA GTTAACCAAT GAAAAAATGC CCAATAAGCT GACGTTTGGT CCTAAAAAGA
CGTTTAAATA TACGGTAGCT ACGGCACCAG GAGGCGTTAC GTATGCGCTG ACCTATTTTT
ATGGAGATGT CGGCGGTCCA ACTAGTTCGC ACCAAAGACG AGGAACAGCG GGTCCTGTGT
ATTATTATTT AACAAAGCGG CGTGTCACGG AAAAATTTGA GAATCCCGCA GGCGGGGCGA
TTCCTGCGCC AGAAGGTTAT ACGCAGGATA AGAAAACCAT TGTAACAGGG GAGGATTTTA
CTTTTACCCA AGAAGGCACC TTGCCTGAAC GTTACACAGG CAGTGATCGG AAGACGTATT
TATTTAAAGG TTGGTACAAA GGGAATGCGA AACCTAGCAC GTTGGAAACC ACCAAAACGC
CTAGTTATGC GGTGACCTAT GATGACAATG ACGATTTGCA TGTGGTCTAT GAAGAAGCAG
TGATGAAAAC CTATACGTTG CCAGCGAGAG AAGCTTTGTT CCGCTATGTT GATGAGCAAG
GAAACTTGAT TAATCCCGCC AAGTTTAAGC TAAGTGCGAC CATGGGTGAA AGTGACGGAG
CCACAGGGGA AATGACGACT TTTCCCACAA TTGATGGAAT CGATATGCCA GCAAGTCAAT
TAAAGAAATT AGCCATCCCG CAAAAAGTCT ACACACGCCC AGACGATGGG ACAATCGTAA
CTTATGGCCC GCAAGAAGTG AGTGTTGAAA TTCCTAAGTA TTACCAGACG ATTTCGATTT
CACCAACTAC TGCGTATACA GGGGATAAAA CCAAGTATCC AGTACCAAAT GAAGTGCGCC
GTGGCATCGA AAACCCCGAC AACATTGTTA GTAGTTTAGT GGGAANCNCT GCGTATAACT
TGACCCAAAA AAGTGCCACA CCCTATACTG CCCGCCGTTC TTACTCGANG TGGGGCCCCA
CGAACACACT TTACTCAATG AGTATCTATT CAGGAACTGC TGGGGGCAAC TATAATTTAT
CGACCCCTGA TGGCACCATT TATTATTACT TAGAAAATCG GCGGGTCACT GAACATTTTG
TACACGAAAG TGGCGCAAAA ATCACGCCAC CAACTGGCTT TACACAAGGA AATCAGCTAG
TCGTGCACAG TGAAAACTAT GTCTACACTG TCGCAAAAGC TTTGCCGAAG ATCTACCAAG
CTGGTGAAAA AACCTATATC TTCCAAGGCT GGTTTAAAGG CAAAACCAAG CCAGCAACAT
TAAAGACGAC AACGACCCCA AGTTTTACAC CAACTTTTAA TGATGAGGAC GACATGACCG
CTGTGTACCA AGAAGCGATT CCCACCGCGG AACTAACGTT AACAGGTGCC GTTGACATAA
TCGAAAATGG CGCCACAATG GATTACTGGG AGGCGCTACT GAAGAACACA GGCGAAGCGC
CGTTAACCAC CATTAAAATC AAGCCAACGG CAACTTGGGC GGCTGGCATC GGCGCACCCA
ACACGATATT TGTACAAGGA ACGGGTCAAA ACACCAAAGC TTTTCCTGTC ACCAAAGAAC
AATGGACGAC CGGTGCAGGA GTGTCCATCA CGTTGGATCA GCCTTTACCA GCTGGCGGTC
AATTAAAAAT GAACTTATTA GGAACCGCCG TTACAGGAAA TCCTGGTCAA GTTTTAACCG
CTGATGTTGA AGTAACGGGC AACTTTGGCA GTTTAACTGC CAAAGATACG GTCCGTATTA
AAGACTTAGA TCAAGAAATT ACGAGTCCTG ACGGCGACGG CTTTATTAGT ACCCCGACAT
TTGATTTTGG TAAACTAGCA ATTTCAGGAA GTAAGCAACA ATATGGTTTG AAGAAGGCCG
CAGATTACTA CGGCAATGGC ACTCGCAACC CTTATTTACG CCTGAATACT AGCCAAGCCA
ATTGGAGTTT AACGGCCCAG CTATCGCAAC CAAAATCAGC CACAGACAGC TTGCCAACAA
CGACCCGCTT GTTGCTAGGA ACGGCCGCTG CTCCCAGCTT TACCGATTAC AACCAACCAA
CAGAAACCAG GACACCACTT GGCAAGACCA GCACCGTGAC TTTAACCGCC GACAATACCG
CAACAGCGGT GGTCGCAAAC CAACAGTTCA CAGGCAGTGA CGTCTATCAG TTGGACTTCA
CGTTTGCTAA CATCAAACTA GAAGTGCCAG CCAACCAAGG TATGGCTGGC CAACAATACC
AAGCCGCCGT CACGTGGAAT TTAGTGACTG GCCCCT EF107-4 (SEQ ID NO:416)
EQGVATSQ SSDEASQTTQ TTEESQATVA SEAKTVPPQE TARIASRAIG YSSVEGREIP
FFFVEEDGTL FDPDRITMAV NLSTFSFYEE KLQRTPLEPT TVNGGKLLSI PTSPAFKYDT
NNQNPSNIYG VSEVSFTIPK EYQSLDIRPS TFYTGDTTQY PVPTVFANVG GKVTNYVGAN
AETELELTNE KMPNKLTFGP KKTFKYTVAT APGGVTYALT YFYGDVGGPT SSHQRRGTAG
PVYYYLTKRR VTEKFENPAG GAIPAPEGYT QDKKTIVTGE DFTFTQEGTL PERYTGSDGK
TYLFKGWYKG NAKPSTLETT KTPSYAVTYD DNDDLHVVYE EAVMKTYTLP AREALFGYVD
EQGNLINPAK FKLSATMGES DGATGEMTTF PTIDGIDMPA SQLKKLAIPQ KVYTRPDDGT
IVTYGPQEVS VEIPKYYQTI SISPTTAYTG DKTKYPVPNE VRRGIENPDN IVSSLVGXXA
YNLTQKSATR YTARRSYWXW GPTKTLYSMS IYSGTAGGNY NLSTPDGTIY YYLENRRVTE
HFVDESGAKI TPPTGFTQGN QLVVDSENYV YTVAKALPKI YQAGEKTYIF QGWFKGKTKP
ATLKTTTTPS FTPTFNDEDD MTAVYQEAIP TAELTLTGAV DITENGATMD YWEALLKNTG
EAPLTTIKIK PTATWAAGIG APNTIFWQGT GQNTKAFPVT KEQWTTGAGV SITLDQPLPA
GGQLKNNLLG TAVTGNPGQV LTADVEVTGN FGSLTAKDTV RIKDLDQEIT SPDGDGFIST
PTFDFGKLAI SGSKQQYGLK KAADYYGNGT RNPYLRLNTS QANWSLTAQL SQPKSATDSL
PTTTRLLLGT AAAASFTDYN QPTETRTPLG KTSTVTLTAD NTATAVVANQ QFTGSDVYQL
DFTFANIKLE VPANQGMAGQ QYQAAVTWNL VTGP EF108-1 (SEQ LD NO:417)
TAATCGGTTT GGCGGGAATC GTACATAGAA AGAAGGGACG ACATGAAGCA AACTAAGTGG
CAACGATTAG CAACCATTGG CTTGTGTAGT TCTTTAGTAA TTAACGCCTT TTCTGGTGTG
ACGGCAGTTG CGGAAACCGT GACGATTGAA AGTAGTCCGA CCGCCGAAAG TAGTGCCAAG
GAACAGACGC AAGCAAGTAG CGTGAAGGAA GAAACAACGA AAGCCAGTAC GGAAAATAGT
CAAGTAACAA CTGACACGAG TCAGGAAGAA GCAACGAAAG AAGCGGAGAA AGAAGAACCG
CAAGCAGAAG TGCAACAAGC AGAAACACCA ATCATTCCTA AACCAAAAAA AATCAATATG
AAGGCAACTT ATTCATTTTC TGCAGAAACT TATCAGTTTG GATTTGTGAA TGAATCAGGT
CAATTAATAA ATCCAGATAT TATACCAATT ACGTATAGCT ATGCCAAAGG ATCATGGAAG
ACAGATGGTT ATAATCGAAA GTGGACTAGT ATGGTTCAAG GGAGTGCTTC AACCGTAGGA
AACTTAAAGA ATGTAATAAT GCCAGCAACT TCTGTAGTTA TGCCACCAGG ACCGTCATAT
GAAGGAACTC AAGAGGTGTA CACAAACTTT TCAATTCGCA TACCAAAATA TTATGCATCA
GCGAGTCTCT ACAATAGAGA AGGTAAAATT GATTCTACTT ATCCGTTACC TGCTATTGCA
CTAGCAGGTA CTAGACCGCT ATCTTTGACT CAAAGTAGTG TAATTAGTGC ATTGGCGCTG
ACCAGTAAAG GAGACAATGT TTATACACCA CGGGAAACAT TTTTTGGAGG AGATCCTGCA
GGTGTAAAGT TTACTAATTT TTTGTATCGT ATAAATGACT TTGATGTGAA AGGTAATAAC
ATAGGTTATA AGACTGTGAG TAGCCCAATC TATTACCATC TGACCAACCG CCGTGTCACC
GAAAACTTCG TAGATACAAG TGGCGCCAAA ATCACGCCAC CAAGTAATTT CACCCAAGGG
AAACAAACGG TCATTAACAG TGATCCTTAC ACGTTCCAAC AAAGTGGTTT TTTACCCGAG
ACCTACAAAG TTGGCACGAA ATCTTACCGA TTCAAAGGCT GGTACAAAGG GAAAACCAAA
ACCGAGCCTT TGGCCACCAC TAAAACACCT AGCTATAAAG TCACGTATGA TGACAATGAT
GATTTGACGG TGGTCTATGA GGAGTTTTCA GGGTACGAGC TGCCTGCTTC GACCAATCAA
TTTGGCTTTG TGGATGAAGC GACGAACAAA TTAATTGCCC CCGACCAAGT GCACATGAAG
TATAATCTTA CTTTAAATGA AAATAATAAA AAAACAGTAA TGAGCAGTAA CTTAACGGGG
ACAGATACAG CGACACTGAA AAACTTGTCC GTGCCTGTCA ACTATTTTGA ACAATATCGC
GTCAATACGT TTTATCGCGC GAGTGACATT ACGTTTACAT TGCCCAAACG GTACAAATCA
ATCAATATTA CCAAATCAGA TGGCAAAACC GACCCAGCTT TTCCTCTTCC TAAAATCTAT
AATATAGATC AAGTAGAAAT GTCACACATG CCTGTGACCA CTTATAACAA GTTGAAACAG
CTGTCGGGCC AAACGTTTGG CTTTAATGCT TTAGCCGATC AACCTGAATT TTATACGAAA
ACGTTATTTG GGACAGAGTC TGGCATCGAT GACCCAGTCA ATTATTATAC AATGAGTGGC
CCTGTTTACT ATTATTTAGA AAACCGCAAA GTCACCGAGA ACTTCGTAGA CACCAACGGC
GCTAAAATCA CACCGCCAAC AGGTTTCACC CAAGGTAAAA AAACGGTGAT TACAAGCGAC
GCCTACACTT TCAAACAAGC AGGCACCTTA CCAGACACTT ACACAACAGG CGGTAAGACC
TACAAGTTCA AAGGTTGGTA CAAAGGCAAG TCCATACTCA ACACATTGAC AACTACCAAA
GCGCCAAGTT ATCAAGTGAC CTACGATGAC AATGATGATT TGAATGTGGT GTATGAAGAA
GAAACAGTTA CGACAGTGTA TCCATCAGTC GATATGAACT TTGTGAATGA AAAAGGCGGG
GCTTTCACAC CGGCGTTAAC TTTTAGTGGT AAGTACTATG CGCAAAGTAC GAGTGCGTAC
TTAAGAACCG ATTTATATGA CGTGACCTCA AAAAATAATG GTAATGGGCA ATATACGGTA
AGTATTAATA ATGGTAGTAT GCCATTGTCC CAAGAATTAT TGAAAAAATA TAATAATGGA
CAACCAATCA GTGCTACCAA CAGATTACAG TTTAATGTTG ATAAATTAGC CATCGACCAA
CAACTAAAAT ATCTTGACAG CATTCAATTA GACACAGCTC AAAGTAGCAA TCTCAAATCC
TATAGATATG TGTACACGAA CAATAGCTCA CTGGTTTTCG ACCCAAATGT AGCACCAGCA
GAGGTTGACC TTAGTTCAGA ATCTCTTAAC TTGCTTAATT TTGATTCAGA TGGCACCTAT
TTTTCTAATG CAAATAATAG ACTTTTTTAC ACGCATTTAG GATATAGTGG CACACCAGGA
GTTAACTATC TTCTCGTAAT GTTTCTTTTT AACGCCAAAC CTGCGGATAA GTCAAAACTT
GTCTACAAAG TCACTCGCAA ACAAGTCACC GAAAACTTCG TGGATGTCAA CGGTGCCAAA
ATCACTGCAC CAACAGGCTT CACCCAAGGT AACCAAGTAC CAATGAACAG TAACACCTTC
AAGTACACAG CCGCAAAAGC TTTACCAGCG ACGTATACTA CAGGTGGCAA AGTCTATACC
TTCCAAGGGT GGTATAAAGG GAAAACCAAG CCAAGTACGT TGAACAAAAC AACAACTCCA
ACGTTCAATG CGACCTTTGA TGGCAATGAC GATATGACCG CCATGTATAA GGAAGAAATA
CCAACAGCTA GTGTCACATT AACTCGACCA AAAGAAGTGA TTGATACGAA TACCAATGTA
ATCTGGACAA CAACGATCAC GAATACTAGC AAAGCACCCT TACAAAATCT CACCTTGAAA
AAAGGGCCCA ATTGGTCAGC TGGTCTGACG ATCCCGACCT TTATGGAAGT GACACCAGAA
GGAGAAACGA CAAAATCAAT CCCAGTAAAT AGTACACTTT GGACAGAGGG GGTTCCTTTA
CCAAATGCCC TTCCTATCGG CAAAAAAGTT TCAGTTGCTT TCACAACTCG CGCAACAGGG
AAACCAAACA CTGTTTTGAA AGCAGAAGTT GTAGTATTTG GTGGTATTAA AGATAGTACA
GTGGATAACT TCGTGAGAAT TCGTCCAAAT GATCAAGAAG TAGTCACACC AACGACCGAA
GGCTTCATCA GTGTGCCAAC CTTCGACTTC GGCCAAGTGG GCGTTGCAGG AACTAAGCAA
CAACACAGCT TGAAACAAGC CGCGGATTAC TACGGTAACG GCACACGGAA TCCGTATCTG
CGGATTAAGA AAACGCAACC CAATTGGAGC TTAACAGCGC AACTGTCACA ACCAAAATCA
GCGACAGACA GCTTGCCTAC AGCGACCCGC TTATTATTAG GGGCGGCGCC TGTCTCTAGC
TTTACCAATT ACAATCAACC AACCGAGTTG AAAAATACGG TCGGTACCAC GAGTGCCATT
AGCTTAACAG CCAACAACAC AGCAACGAGT ATTATTGCCA ACAAGCAATT CACAGGTAGT
AATGTTTATC AGTTGGACTT CACCTTCAAT AATGTCAAAC TTGAAGTGCC AGCCAATCAA
GGTGTTAAAG GGCAACAATA CAAGGCCGCA GTTACATGGA ACCTAGTTAC AGGTCCTTAA
EF108-2 (SEQ ID NO:418) MKQTKWQ RLATIGLCSS LVINAFSGVT AVAETVTTES
SPTAESSAKE ETQASSVKEE TTKASTENSQ VTTDTSQEEA TKEAEKEEPQ AEVEQAETPI
IPKPKKINMK ATYSFSAETY QFGFVNESGQ LINPDIIPIT YSYAKGSWKT DGYNRKWTSM
VQGSASTVGN LKNVIMPATS VVMPPGPSYE GTQEVYTNFS TRIPKYYASA SLYNREGKID
STYPLPAIAL AGTRPLSLTQ SSVISALALT SKGDNVYTPR ETFFGGDPAG VKFTNFLYRI
NDFDVKGNNI GYKTVSSPIY YHLTNRRVTE NFVDTSGAKI TPPSNFTQGK QTVLNSDPYT
FQQSGFLPET YKVGTKSYRF KCWYKGKTKT EPLATTKTPS YKVTYDDNDD LTVVYEEFSG
YELPASTNQF GFVDEATNKL IAPDQVQMKY NLTLNENNKK TVMSSNLTGT DTATLKNLSV
PVNYFEQYRV NTFYGASDIT FTLPKRYKSI NITKSDGKTD PAFPLPKIYN IDQVEMSHNP
VTTYNKLKQL SGQTFGFNAL ADQPEFYTKT LFGTESGIDD PVNYYTMSGP VYYYLENRKV
TENFVDTNGA KITPPTGFTQ GKKTVITSDA YTFKQAGTLP DTYTTGGKTY KFKGWYKGKS
TLNTLTTTKA PSYQVTYDDN DDLNVVYEEE TVTTVYPSVD MNFVNEKGGA FTPALTFSGK
YYAQSTSAYL RTDLYDVTSK NNGNGQYTVS INNGSMPLSQ ELLKKYNNGQ PISATNRLQF
NVDKLAIDQQ LKYVDSIQLD TAQSSNLKSY RYVYTNNSSL VFDPNVAPAE VDLSSESLNL
LNFDSDGTYF SNANNRLFYT HLGYSGTPGV NYLLVMFLFN AKPADKSKLV YKVTRKQVTE
NFVDVNGAKI TAPTGFTQGN QVPMNSNTFK YTAAKALPAT YTTGOKVYTF QGWYKGKTKP
STLNKTTTPT FNATFDGNDD MTAMYKEEIP TASVTLTRPK EVIDTNTNVI WTTTITNTSK
APLQNLTLKK GPNWSAGLTI PTFMEVTPEC ETTKSIPVNS TLWTEGVPLP NAVPTGKKVS
VAFTTEATGK PNTVLKAEVV VFGGIKDSTV DNFVRIRPND QEVVTPTTEG FISVPTFDFG
QVGVAGTKQQ HSLKQAADYY GNGTRNPYLR IKKTQPNWSL TAQLSQPKSA TDSLPTATRL
LLGAAPVSSF TNYNQPTELK NTVGTTSAIS LTANNTATSI IANKQFTGSN VYQLDFTFNN
VKLEVPANQG VKGQQYKAAV TWNLVTGP EF108-3 (SEQ ID NO:419) CGT
GACGATTGAA AGTAGTCCGA CCGCCGAAAG TAGTGCCAAG GAAGAGACGC AAGCAACTAG
CGTGAAGGAA GAAACAACGA AAGCCAGTAC GGAAAATAGT CAAGTAACAA CTGACACGAG
TCAGGAAGAA GCAACGAAAG AACCCCACAA AGAAGAACCG CAAGCACAAG TGGAACAAGC
AGAAACACCA ATCATTCCTA AACCAAAAAA AATCAATATG AAGGCAACTT ATTCATTTTC
TGCAGAAACT TATCAGTTTG GATTTGTGAA TGAATCAGGT CAATTAATAA ATCCAGATAT
TATACCAATT ACGTATAGCT ATCCCAAAGG ATCATGGAAG ACAGATGGTT ATAATCGAAA
GTGGACTAGT ATGGTTCAAG GGAGTGCTTC AACCGTAGGA AACTTAAAGA ATCTAATAAT
GCCACCAACT TCTGTACTTA TGCCACCAGG ACCOTCATAT GAAGGAACTC AAGAGGTGTA
CACAAACTTT TCAATTCGCA TACCAAAATA TTATGCATCA CCGAGTCTCT ACAATAGAGA
AGCTAAAATT GATTCTACTT ATCCGTTACC TGCTATTGCA CTAGCAGGTA CTAGACCGCT
ATCTTTGACT CAAAGTAGTG TAATTACTGC ATTGGCGCTG ACCAGTAAAG GAGACAATGT
TTATACACCA CGGGAAACAT TTTTTGGAGG AGATCCTGCA GGTGTAAAGT TTACTAATTT
TTTGTATCGT ATAAATCACT TTGATGTGAA AGGTAATAAC ATAGOTTATA AGACTOTOAG
TAGCCCAATC TATTACCATO TGACCAACCG CCGTGTCACC GAAAACTTCG TAGATACAAG
TGGCGCCAAA ATCACGCCAC CAAGTAATTT CACCCAAGCG AAACAAACGG TCATTAACAG
TGATCCTTAC ACGTTCCAAC AAAGTCCTTT TTTACCCGAG ACCTACAAAG TTGGCACGAA
ATCTTACCCA TTCAAAGGCT GGTACAAAGG GAAAACCAAA ACCGAGCCTT TOOCCACCAC
TAAAACACCT AGCTATAAAG TCACGTATGA TGACAATGAT GATTTCACGG TGGTCTATGA
GGAGTTTTCA GGGTACGAGC TGCCTGCTTC GACCAATCAA TTTGGCTTTG TGGATGAAGC
GACGAACAAA TTAATTGCCC CCGACCAAGT GCAGATGAAG TATAATCTTA CTTTAAATCA
AAATAATAAA AAAACAGTAA TGAGCAGTAA CTTAACGGGG ACAGATACAC CGACACTGAA
AAACTTGTCC GTGCCTGTCA ACTATTTTGA ACAATATCGC CTCAATACGT TTTATGGCGC
GAGTGACATT ACGTTTACAT TGCCCAAACG GTACAAATCA ATCAATATTA CCAAATCAGA
TGGCAAAACC GACCCAGCTT TTCCTCTTCC TAAAATCTAT AATATAGATC AAGTAGAAAT
GTCACACATG CCTGTGACCA CTTATAACAA GTTGAAACAG CTGTCGGGCC AAACGTTTGG
CTTTAATGCT TTAGCCCATC AACCTGAATT TTATACGAAA ACGTTATTTG GGACAGAGTC
TGGCATCGAT GACCCACTCA ATTATTATAC AATGAGTGGC CCTGTTTACT ATTATTTAGA
AAACCGCAAA GTCACCGAGA ACTTCGTAGA CACCAACGGC GCTAAAATCA CACCGCCAAC
AGGTTTCACC CAAGGTAAAA AAACGGTGAT TACAAGCCAC GCCTACACTT TCAAACAAGC
AGGCACCTTA CCAGACACTT ACACAACAGG CGGTAAGACC TACAAGTTCA AAGGTTGGTA
CAAAGGCAAG TCCATACTCA ACACATTGAC AACTACCAAA GCGCCAAGTT ATCAAGTGAC
CTACGATGAC AATGATCATT TGAATGTGGT GTATGAAGAA GAAACAGTTA CGACAGTGTA
TCCATCAGTC GATATGAACT TTGTGAATGA AAAACCCGCC GCTTTCACAC CGGCGTTAAC
TTTTAGTGGT AAGTACTATG CGCAAAGTAC GAGTGCGTAC TTAAGAACCG ATTTATATGA
CGTGACCTCA AAAAATAATG GTAATGGGCA ATATACGGTA AGTATTAATA ATGGTAGTAT
GCCATTGTCC CAAGAATTAT TGAAAAAATA TAATAATGGA CAACCAATCA GTGCTACCAA
CAGATTACAG TTTAATGTTG ATAAATTAGC CATCGACCAA CAACTAAAAT ATGTTGACAG
CATTCAATTA GACACAGCTC AAAGTAGCAA TCTGAAATCC TATAGATATG TGTACACGAA
CAATAGCTCA CTGGTTTTCG ACCCAAATGT AGCACCAGCA GAGGTTGACC TTAGTTCAGA
ATCTCTTAAC TTGCTTAATT TTGATTCAGA TGGCACCTAT TTTTCTAATG CAAATAATAG
ACTTTTTTAC ACGCATTTAG GATATAGTGG CACACCAGGA GTTAACTATC TTCTCGTAAT
GTTTCTTTTT AACGCCAAAC CTGCGGATAA GTCAAAACTT CTCTACAAAG TCACTCGCAA
ACAAGTCACC GAAAACTTCG TGGATGTCAA CGGTGCCAAA ATCACTGCAC CAACAGGCTT
CACCCAAGGT AACCAAGTAC CAATGAACAG TAACACCTTC AAGTACACAG CGGCAAAAGC
TTTACCAGCG ACGTATACTA CAGGTGGCAA AGTCTATACG TTCCAAGGGT GGTATAAAGG
GAAAACCAAG CCAAGTACGT TGAACAAAAC AACAACTCCA ACGTTCAATG CGACCTTTGA
TGGCAATGAC GATATGACCG CCATGTATAA GGAAGAAATA CCAACAGCTA GTGTCACATT
AACTCGACCA AAAGAAGTGA TTGATACGAA TACCAATGTA ATCTGGACAA CAACGATCAC
GAATACTAGC AAAGCACCCT TACAAAATCT CACCTTGAAA AAAGGGCCCA ATTGGTCAGC
TGGTCTGACG ATCCCGACCT TTATGGAAGT GACACCAGAA GGAGAAACGA CAAAATCAAT
CCCAGTAAAT AGTACACTTT GGACAGAGGG GGTTCCTTTA
CCAAATGCCC TTCCTATCGG CAAAAAAGTT TCAGTTGCTT TCACAACTCG CGCAACAGGG
AAACCAAACA CTGTTTTGAA AGCAGAAGTT GTAGTATTTG GTGGTATTAA AGATAGTACA
GTGGATAACT TCGTGAGAAT TCGTCCAAAT GATCAAGAAG TAGTCACACC AACGACCGAA
GGCTTCATCA GTGTGCCAAC CTTCGACTTC GGCCAAGTGG GCGTTGCAGG AACTAAGCAA
CAACACAGCT TGAAACAAGC CGCGGATTAC TACGGTAACG GCACACGGAA TCCGTATCTG
CGGATTAAGA AAACGCAACC CAATTGGAGC TTAACAGCGC AACTGTCACA ACCAAAATCA
GCGACAGACA GCTTGCCTAC AGCGACCCGC TTATTATTAG GGGCGGCGCC TGTCTCTAGC
TTTACCAATT ACAATCAACC AACCGAGTTG AAAAATACGG TCGGTACCAC GAGTGCCATT
AGCTTAACAC CCAACAACAC AGCAACGAGT ATTATTGCCA ACAAGCAATT CACAGGTAGT
AATGTTTATC AGTTGGACTT CACCTTCAAT AATGTCAAAC TTGAAGTGCC AGCCAATCAA
GCTGTTAAAG GGCAACAATA CAAGGCCGCA GTTACATGGA ACCTAGTTAC AG EF108-4
(SEQ lED NO:420) VTIES SPTAESSAKE ETQASSVKEE TTKASTENSQ VTTDTSQEEA
TKEAEKEEPQ AEVEQAETPI IPKPKKINMK ATYSFSAETY QFGFVNESGQ LINPDIIPIT
YSYAKGSWKT DGYNRKWTSM VQGSASTVGN LKNVIMPATS VVMPPGPSYE GTQEVYTNFS
IRTPKYYASA SLYNREGKTD STYPLPAIAL AGTRPLSLTQ SSVISALALT SKGDNVYTPR
ETFFGGDPAG VKFTNFLYRI NDFDVKGNNI GYKTVSSPIY YHLTNRRVTE NFVDTSGAKI
TPPSNFTQGK QTVTNSDPYT FQQSGFLPET YKVGTKSYRF KCWYKGKTKT EPLATTKTPS
YKVTYDDNDD LTVVYEEFSC YELPASTNQF GFVDEATNKL IAPDQVQMKY NLTLNENNKK
TVMSSNLTGT DTATLKNLSV PVNYFEQYRV NTFYGASDIT FTLPKRYKST NTTKSDGKTD
PAFPLPKIYN IDQVEMSHMP VTTYNKLKQL SGQTFGFNAL ADQPEFYTKT LFGTESGIDD
PVNYYTMSGP VYYYLENRKV TENFVDTNGA KITPPTGFTQ GKKTVTTSDA YTFKQAGTLP
DTYTTGGKTY KFKGWYKGKS ILNTLTTTKA PSYQVTYDDN DDLNVVYEEE TVTTVYPSVD
MNFVNEKGGA FTPALTFSGK YYAQSTSAYL RTDLYDVTSK NNGNGQYTVS INNGSMPLSQ
ELLKKYNNGQ PTSATNRLQF NVDKLAIDQQ LKYVDSIQLD TAQSSNLKSY RYVYTNNSSL
VFDPNVAPAE VDLSSESLNL LNFDSDGTYF SNANNRLFYT HLGYSGTPGV NYLLVNFLFN
AKPADKSKLV YKVTRKQVTE NFVDVNGAKI TAPTGFTQGN QVPMNSNTFK YTAAKALPAT
YTTGGKVYTF QGWYKGKTKP STLNKTTTPT FNATFDGNDD MTAMYKEEIP TASVTLTRPK
EVIDTNTNVI WTTTITNTSK APLQNLTLKK GPNWSAGLTI PTFMEVTPEG ETTKSIPVNS
TLWTEGVPLP NAVPIGKKVS VAFTTRATGK PNTVLKAEVV VFGGIKDSTV DNFVRTRPND
QEVVTPTTEG FTSVPTFDFG QVGVAGTKQQ HSLKQAADYY GNGTRNPYLR IKKTQPNWSL
TAQLSQPKSA TDSLPTATRL LLGAAPVSSF TNYNQPTELK NTVGTTSAIS LTANNTATSI
IANKQFTGSN VYQLDFTFNN VKLEVPANQG VKGQQYKAAV TWNLVT EF109-1 (SEQ ID
NO:421) AGGAGTAAAT TAATCAAAAA AAGTGTTATA ACTAGTTCTA TOTTACCOOT
TTTCTTGTCG CCATTTCTCG TTACCCCTAT TTCTGCTTAC GCTTTGGAAC GCTCTAAGGG
AACTACTGAA GAAACGGTGG CTTCAGAAAC ATCTCTAACG GAGCGACAAA TGAGTAGCGG
TGTCACTGAA GAAATCAACC CAACCATCAT AAATTCTCAA GAGGAAACAG AAACAACGTC
CACTTCCTCA ACCTCCGATT CCACCACTGA AGTTTCTACA TCAGAAGTAA CAACTGTTAA
TGATACAGAA NATAGTAGCG ACGTACTGAA ACTACTTTGG NAACATCACN AACTAATCAG
GACACACCTA TAG EF109-2 (SEQ ID NO:422) MKKSVI TSSMLAVLLS GFLVTPISAY
ALERSKGTTE ETVASETSLT ERQMSSGVTE EHNPSIILNSQ EETETTSTSS TSDSTTEVST
SEVTTVNDTE XSSDVLKLLW XHHXVMRTHL EF109-3 (SEQ ID 110:423) GGAAC
GCTCTAAGGG AACTACTGAA GAAACGGTGG CTTCACAAAC ATCTCTAACG GAGCGACAAA
TGAGTACCCG TCTCACTCAA GAAATGAACC CAAGCATCAT AAATTCTCAA GAGGAAACAG
AAACAACGTC CACTTCCTCA ACCTCCGATT CCACCACTGA AGTTTCTACA TCAG EF109-4
(SEQ ID 140:424) ERSKGTTE ETVASETSLT ERQMSSGVTE EMNPSIINSQ
EETETTSTSS TSDSTTEVST S EF110-1 (SEQ ID 140:425) TAAATAAAAA
TGGATAAGGA GTGGCATAAT CTTATGAAAA AGTTCTCCAT ACGAAAAATT AGTGCTGGTT
TTTTGTTTCT GATTTTAGTA ACTTTGATCG CCGGTTTTAG CTTGTCTGCA AATGCAGAAG
AGTATATCGT TCCTGCCGAA AGTCATTCAC GACAAAAAAG ATCGTTACTG GACCCTGAGG
ACAGAAGACA AGAAGTGGCA GATACAACCG AAGCGCCTTT TGCGTCAATC GGAAGAATCA
TTTCCCCTGC CAGTAAACCA GGCTATATTT CTTTAGGAAC AGGCTTTGTT GTTGGAACCA
ATACAATTGT CACCAATAAT CATGTGGCTG AAAGTTTTAA GAATGCCAAA GTATTAAATC
CGAATGCCAA AGATGATGCT TGGTTTTATC CAGGTCGAGA TGGCAGTGCG ACACCATTTG
GCAAATTCAA AGTGATTGAT GTAGCTTTTT CCCCGAATGC GGATATTGCG GTAGTGACTG
TCGGCAAACA AAACGATCGT CCAGATGGCC CAGAGTTGGG AGAAATTTTA ACGCCATTTG
TTTTGAAAAA GTTTGAATCT TCAGATACCC ATGTCACAAT ATCAGGCTAT CCAGGTGAGA
AAAACCACAC ACAATGGTCT CATGAAAATG ATTTGTTTAC ATCTAACTTT ACAGACTTAG
AAAATCCATT ACTATTTTAT GATATCGATA CAACCGGCGG TCAATCTGGT TCACCAATCT
ATAATGATCA GGTTGAAGTA GTTGGTGTTC ATTCCAATGG CGGCATTAAG CAAACAGGAA
ATCATGGTCA AAGACTAAAT GAAGTGAATT ATAACTTTAT TGTTAATCGA GTGAATGAAG
AAGAAAATAA ACGTTTATCC GCTGTGCCAG CAGCGTAA EF110-2 (SEQ ID 140:426)
MKKFSIRKIS AGFLFLILVT LIAGFSLSAN AEEYIVPAES HSRQKRSLIJD PEDRRQEVAD
TTEAPFASIG RIISPASKPG YISLGTGFVV GTNTIVTNNH VAESFKNAKV LNPNAKDDAW
FYPGRDGSAT PFGKFKVIDV AFSPNADIAV VTVGKQNDRP DGPELGEILT PFVLKKFESS
DTHVTISGYP GEKNHTQWSH ENDLFTSNFT DLENPLLFYD IDTTGGQSGS PIYNDQVEVV
GVHSNGGIKQ TGNHGQRLNE VNYNFIVNRV NEEENKRLSA VPAA EF110-3 (SEQ ID
140:427) AG AGTATATCGT TCCTGCCGAA AGTCATTCAC GACAAAAAAG ATCGTTACTG
GACCCTGAGG ACAGAAGACA AGAAGTGGCA GATACAACCG AAGCGCCTTT TGCGTCAATC
GGAAGAATCA TTTCCCCTGC CAGTAAACCA GGCTATATTT CTTTAGGAAC AGGCTTTGTT
GTTGGAACCA ATACAATTGT CACCAATAAT CATGTGGCTG AAAGTTTTAA GAATGCCAAA
GTATTAAATC CGAATGCCAA AGATGATGCT TGGTTTTATC CAGGTCGAGA TGGCAGTGCG
ACACCATTTG CCAAATTCAA AGTGATTGAT GTAGCTTTTT CCCCGAATGC GGATATTGCG
GTAGTGACTG TCGGCAAACA AAACGATCGT CCAGATGGCC CAGAGTTGGG AGAAATTTTA
ACGCCATTTG TTTTGAAAAA GTTTGAATCT TCAGATACCC ATGTCACAAT ATCAGGCTAT
CCAGGTGAGA AAAACCACAC ACAATGGTCT CATGAAAATG ATTTGTTTAC ATCTAACTTT
ACAGACTTAC AAAATCCATT ACTATTTTAT GATATCGATA CAACCGGCGG TCAATCTGGT
TCACCAATCT ATAATGATCA GGTTGAAGTA GTTGGTGTTC ATTCCAATGG CGGCATTAAG
CAAACAGGAA ATCATGGTCA AAGACTAAAT GAAGTGAATT ATAACTTTAT TGTTAATCGA
GTGAATGAAG AAGAAAATAA ACGTTTATCC GCTGTGCCAG CAGCGT EF110-4 (SEQ ID
NO:428) EYIVPAES HSRQKRSLLD PEDRRQEVAD TTEAPFASTG RIISPASKPG
YISLGTGFVV GTNTIVTNNH VAESFKNAKV LNPNAKDDAW FYPGRDGSAT PFGKFKVIDV
AFSPNADIAV VTVGKQNDRP DOPELGEILT PFVLKKFESS DTHVTISGYP GEKNHTQWSH
ENDLFTSNFT DLENPLLFYD IDTTGGQSGS PIYNDQVEVV GVHSNGGIKQ TGNHGQRLNE
VNYNFIVNRV NEEENKRLSA VPAA EF111-1 (SEQ ID NO:429) TGATCAATAC
ACTTCGATAC GGTCGCTTTT TTTCTAGAGA AAGTTGAATC TTTCAATAAT AAAAAGGGAT
ACACTCCATT TGGCATAGTC CTTGCTGATA ATAAATCAGT GTATAAAGCG CTATCATTTT
ATAGGAGGGG TTTTATGAAG GGTTTATCAA AAAAGAAACG GGTGTCTACT TGGTTAGCGT
TAGGAATCAC CGTAGTCAGC TGTTTTGCGT TAAGCAGGGA AGTGCAAGCA AGTGTTGAAA
GAACAAAAGT TGATGAATTT GCAAATGTTT TAGATGTGAG TGCATCACCA ACCGAACGGA
CGAATGGCGT ATACGATACC AATTATTTTA ATAATTTTTC TGATTTAGGT GCATGGCATG
GCTACTATTT ACCTGAAAAA AGCAATAAAG AGCTACTGGG TGGTTTTGCG GGGCCATTGA
TTATTGCCGA AGAATATCCA GTAAACTTGG CCGCAAGTTT AAACAAATTA ACGGTCAAAA
ATAAAAAAAC GGGAGAAACC TATGATTTAA GCCAAAGCAA CCGCATGGAC CTGTCTTATT
ATCCTGGGCG CCTAGAGCAA ACCTATGAAT TAGACGATTT AACGATTCAT TTAGCTTTAA
TTTTTGTCAG CAATCGAACG GCGCTTATCC AAACGACACT TGAAAACACT GGTGAAGAGC
CCTTGTCACT TGGAGCAAGC TGGACAGGTG CGGTCTTTCA CAAAATTCAA GAGGGAACGG
AAACCTTAGA TATTGGCACT CGTTTAACTG CTAAAGACAA TGACATTCAA GTGAATTTTG
GTGAAGTCAG AGAAACGTGG AATTATTTTG CTACGAAAGA CACAAAATAT ACGATTCATC
ATGCGGATAA AGTTTCAACA AAAATTGATA ATCGGAATTA TACAGCAACC GCTGAACCAA
TTGAATTGAA GCCTAAACAA ACGTACAACA CCTATACGAC AGAAAGCTAT ACTTTTACAA
AAGAAGAAGA GGCAAAGGAA CAACAACAAG CACCCGAATA TACCAAAAAT GCGGCGCGCT
ATTTCAAAGA GAACAAGCAA AGATGGCAAG GATATCTAGA TAAAACGTTT GATCAAAAGA
AAACAGCAGA ATTTCCTGAA TATCAAAATG CGCTAGTCAA ATCGATTGAA ACGATTAATA
CCAATTGGCG AAGTGCGGCA GGTGCCTTTA AGCATGACGG GATTGTTCCG TCCATGTCTT
ATAAATGGTT TATTGGTATG TGGGCTTGGG ATTCGTGGAA AGCGGATGTA GCAACGGCTG
ATTTTAATCC TGAGTTAGCT AAAAATAATA TGCGGGCCTT GTTTGATTAT CAAATTCAAA
AAGATGATAC CGTACGTCCA CAAGATGCAG GAGCGATCAT TGATGCTGTC TTTTACAATC
AAGACAGTGC GCGTGGTGGT GAAGGTGGCA ACTGGAATGA ACGAAATTCT AAACCACCAT
TGGCTCCATG GGCAGTTTGG CATATTTATC AAGAAACCAA AGATAAGGAA TTTTTAAAAG
AAATGTATCC CAAACTTGTG GCTTATCATA ATTGGTGGTA TACCAACAGA GACCACAATA
AAAATCGGAT AGCAGAATAT GGAAGCATGG TCAGTGATGC TCACTGGCAA AAAGACGACA
AGGATCAAAT CATTAAAGAT AAAAATGGCC ACCTAAAGTG GATGATGATG CTGTTATTGA
AGCAGCCGCG TGGGAAAGTG GCATGGATAA CGCTACACGG TTTGACAAAG AAGGTGTGGG
CAAAGGCGAC GTTGGAGTTA AAGTTTTTGA AAACAAAAAT AAAGGAAAAG TAG EF111-2
(SEQ TD NO:430) MKG LSKKKRVSTW LALGITVVSC FALSREVQAS VERTKVDEFA
NVLDVSASPT ERTNGVYDTN YFNNFSDLGA WHGYYLPEKS NKELLGGFAG PLIIAEEYPV
NLAASLNKLT VKNKKTGETY DLSQSNRMDL SYYPGRLEQT YELDDLTIHL ALTFVSNRTA
LIQTTLENTG EEPLSLGASW TGAVFDKTQE GTETLDIGTR LTAKDNDIQV NFGEVRETWN
YFATKDTKYT IHHADKVSTK IDNRNYTATA EPIELKPKQT YNTYTTESYT FTKEEEAKEQ
QQAPEYTKNA ARYFKENKQR WQGYLDKTFD QKKTAEFPEY QNALVKSIET INTNWRSAAG
AFKHDGIVPS MSYKWFIGMW AWDSWKADVA TADFNPELAK NNMRALFDYQ TQKDDTVRPQ
DAGATIDAVE YNQDSARGGE GGNWNERNSK PPLAAWAVWH TYQETKDKEF LKEMYPKLVA
YHNWWYTNRD HNKNGIAEYG SMVSDAHWQK DDKDQIIKDK NGHLKWMMML LLKQPRGKVA
WITLHGLTKK VWAKATLELK FLKTKIKEK EF111-3 (SEQ ID NO:431) TGATGAATTT
GCAAATGTTT TAGATGTGAG TGCATCACCA ACCGAACGGA CGAATGGCGT ATACGATACC
AATTATTTTA ATAATTTTTC TGATTTAGGT GCATGGCATG GCTACTATTT ACCTGAAAAA
AGCAATAAAG AGCTACTGGG TGGTTTTGCG GGGCCATTGA TTATTGCGGA AGAATATCCA
GTAAACTTGG CGGCAAGTTT AAACAAATTA ACGGTCAAAA ATAAAAAAAC GGGAGAAACC
TATGATTTAA GCCAAAGCAA CCGCATGGAC CTGTCTTATT ATCCTGGGCG CCTAGAGCAA
ACCTATGAAT TAGACGATTT AACGATTCAT TTAGCTTTAA TTTTTGTCAG CAATCGAACG
GCGCTTATCC AAACGACACT TGAAAACACT GGTGAAGAGC CCTTGTCACT TGGAGCAAGC
TGGACAGGTG CGGTCTTTGA CAAAATTCAA GAGGGAACGG AAACCTTAGA TATTGGCACT
CGTTTAACTG CTAAAGACAA TGACATTCAA GTGAATTTTG CTGAAGTCAG AGAAACCTCG
AATTATTTTG CTACGAAAGA CACAAAATAT ACGATTCATC ATGCGGATAA AGTTTCAACA
AAAATTGATA ATCGGAATTA TACAGCAACC GCTGAACCAA TTGAATTGAA GCCTAAACAA
ACGTACAACA CCTATACGAC AGAAAGCTAT ACTTTTACAA AAGAAGAAGA GGCAAAGGAA
CAACAACAAG CACCCGAATA TACCAAAAAT GCGGCGCGCT ATTTCAAAGA GAACAAGCAA
AGATGGCAAG GATATCTAGA TAAAACGTTT GATCAAAAGA AAACAGCAGA ATTTCCTGAA
TATCAAAATG CGCTAGTCAA ATCGATTGAA ACGATTAATA CCAATTGGCG AAGTGCGGCA
GGTGCCTTTA AGCATGACGG GATTGTTCCG TCCATGTCTT ATAAATGGTT TATTGGTATG
TGGGCTTGGG ATTCGTGGAA AGCGGATGTA GCAACGGCTG ATTTTAATCC TGAGTTAGCT
AAAAATAATA TGCGGGCCTT GTTTGATTAT CAAATTCAAA AAGATGATAC CGTACGTCCA
CAAGATGCAG GAGCGATCAT TGATGCTGTC TTTTACAATC AAGACAGTGC GCGTGGTGGT
GAAGGTGGCA ACTGGAATGA ACGAAATTCT AAACCACCAT TGGCTGCATG GGCAGTTTGG
CATATTTATC AAGAAACCAA AGATAAGGAA TTTTTAAAAG AAATGTATCC CAAACTTGTG
GCTTATCATA ATTGGTGGTA TACCAACAGA GACCACAATA AAAATGGGAT AGCAGAATAT
GGAAGCATGG TCAGTGATGC TCACTGGCAA AAAGACGACA AGGATCAAAT CATTAAAGAT
AAAAATGGCC ACCTAAAGTG GATGATGATG CTGTTATTGA AGCAGCCGCG TGGGAAAGTG
GCATGGATAA CGCTACACGG TTTGACAAAG AAGGTGTGGG CAAAGGCGAC GTTGGAGTTA
AAGTT EF111-4 (SEQ ID NO:432) DEFA NVLDVSASPT ERTNGVYDTN YFNNFSDLGA
WHGYYLPEKS NKELLGGFAG PLIIAEEYPV NLAASLNKLT VKNKKTGETY DLSQSNRMDL
SYYPGRLEQT YELDDLTIHL ALIFVSNRTA LIQTTLENTC EEPLSLGASW TGAVFDKIQE
GTETLDIGTR LTAKDNDIQV NFGEVRETWN YFATKDTKYT IHHADKVSTK IDNRNYTATA
EPIELKPKQT YNTYTTESYT FTKEEEAKEQ QQAPEYTKNA ARYFKENKQR WQGYLDKTFD
QKKTAEFPEY QNALVKSTET INTNWRSAAG AFKHDGIVPS MSYKWFTGMW AWDSWKADVA
TADFNPELAK NNMRALFDYQ IQKDDTVRPQ DAGAIIDAVE YNQDSARGGE GGNWNERNSK
PPLAAWAVWH IYQETKDKEF LKEMYPKLVA YHNWWYTNRD HNKNGIAEYG SMVSDAHWQK
DDKDQIIKDK NGHLKWMMML LLKQPRGKVA WITLHGLTKK VWAKATLELK SF117-1 (SEQ
ID NO:433) TAATTCGATG GAGAAGGTGG TTTAGTGAAA AGATTTTCAT TTTTTTTACT
AATTTTACTT GCTTTAACAG GTTGTAAATC CGGTGAAAAA GAATTTGATG AAGAATCTCT
TCAAAATCTA AAGGAAACGN CACAGTCTTA NTCAGAAACA GAATTACAAA ATGGTGACGT
TCGTTTAAAT GAATATATTT CTTTGAAAGG GGAGATTGTT GAGAGTGACA GTCGTTCCAG
TTTAATAAAA AAAGGTGATC GTTTTATTTT GAAAAGTGGT TCTAGTAAAT ATCAAGTTTN
TAATGAGCAA AAGAAAAAAT TGAAGATTGG TGACGAAGTG ACAGTTTACG GAGAATATTA
CGGCTTTTTG AAAGGCACAT TAATTGAAAG TGAGGAGAAT CATGATTCAG CCACGAATTA C
EF117-2 (SEQ ID NO:434) VKR FSFFLLILLA LTGCKSGEKE FDEESLQNLK
ETXQSXSETE LQNGDVRLNE YISLKGEIVE SDSRSSLIKK GDRFILKSGS SKYQVXNEQK
KKLKIGDEVT VYGEYYGFLK GTLIESEENH DSATN EF117-3 (SEQ ID NO:435) TG
AAGAATCTCT TCAAAATCTA AAGGAAACGN CACAGTCTTA NTCAGAAACA GAATTACAAA
ATGGTGACGT TCGTTTAAAT GAATATATTT CTTTGAAAGC GGAGATTGTT GACAGTGACA
GTCCTTCCAG TTTAATAAAA AAAGGTGATC GTTTTATTTT GAAAAGTGGT TCTAGTAAAT
ATCAAGTTTN TAATGAGCAA AAGAAAAAAT TGAAGATTGG TGACGAAGTG ACAGTTTACG
GACAATATTA CGGCTTTTTG AAAGGGACAT TAATTGAAAG TGAGGAGAAT CATGATTCAG
CCACGAA EF117-4 (SEQ ID NO:436) EESLQNLK ETXQSXSETE LQNGDVRLNE
YISLKGEIVE SDSRSSLIKK GDRFILKSGS SKYQVXNEQK KKLKIGDEVT VYGEYYGFLK
GTLIESEENH DSATN EF118-1 (SEQ ID NO:437) TGAGGGGGAA AAAGTGTGTT
AAAAAGAAAA GTGGGGATTG TCGCAGGCCT TTTCTGTTCA GCTTTGTTAC TGACAGGTTG
TGGCAAAAGT GCGAAAGATG AGTTCATTCA AGGAATCGGC AATCANAACG CACAAGAATC
TGGGGTTTGN GATTTCTCTA TGTCAATTAG TGACATGAAA TTTTCACAAG AAGATGGTGC
ACAAACGAAT CCTATGATTG GGATGCTCAT CACGCAAATC AAAGACGCAT CGCTTTCTGG
GGAAGATTCA AGTAGATGCC AAAAAAGAAA AAGCATTCAA CTTAGACATC AAATTAAAAG
CGATCGCAAT GGATCTACCG ATTTCATTGG TTGGATCGTT AGATAA EF118-2 (SEQ ID
NO:438) VLKRKV GIVAGVFCSA LLLTGCGKSA KDEFIQGIGN XNAQESGVXD
FSMSISDMKF SQEDGAQTNP MIGMLITQIK DASLSGEDSS RCQKRKSIQL RDETKSDGNG
CTDFIGWIVR EF118-3 (SEQ ID NO:439) GAAAGATG AGTTCATTCA AGGAATCGGC
AATCANAACG CACAAGAATC TGCGGTTTGN GATTTCTCTA TGTCAATTAG TGACATGAAA
TTTTCACAAG AAGATGGTGC ACAAACGAAT CCTATGATTG GGATGCTCAT CACGCAAATC
AAAGACGCAT CGCTTTCTGG GGAAGATTCA AGTAGATGCC AAAAAAGAAA AAGCATTCAA
CTTAGAGATG AAATTAAAAG CGATGGGAAT GGATGTACCG ATTTCATTGG TTGGATCGTT
AGAT EF118-4 (SEQ ID NO:440) KDEFIQGIGN XNAQESGVXD FSMSISDMKF
SQEDGAQTNP MIGMLITQIK DASLSGEDSS RCQKRKSIQL RDEIKSDGNG CTDFIGWIVR
EF119-1 (SEQ ID NO:441) TAAAGAATAC CGAGTAAAAT TTTCGGAAGG CTTTTTTTCA
AAAATTGTAT ATGCAAAAGA AGTGCAACGG AAAGGAGCTC GGAAATCGTG AATAAGCTAC
CTTTACTTAT TTTATTGTTA GGCGCAGTGT TGCTTGTTAG TGGCTGTCAA AGCCATAAGG
AAGAAAACAA GTCTAGTAAA GTATCGACAG AAGAAACGAC AGTGATTGAA ACAGTAGCAA
GGGAACAATC GAAGGAATCG TTTACGAGTG AAGCAACTAA AAAACAGACA GAAACAACGA
AATTAGAAGA ACCAGATCAT GTAAAACTTC TAGAAGCTTA TGGAAATGCG TATGCGAACT
TTACAAGTAT TAATGATCGC AATGAAAAGC TAAAGCCCCT CATGACTGAA AAATGTATCA
AAAAAAATGG AATTGATGTT AAAACTGGAG TAGCGTTAGT TTCCGTAGGA AAGGTTACAA
CGATTTATAA AAATGATCAA CATGAATATG CTTTACTTTT GGATTGTGAA CAAAATCGAA
CGCAGACACG AGTCTTACTT TTGGCTAAGG TCAAGAACAA TAAAATTTCT GAAATGACCT
ATAATTCAGT TAAGCAAGAG TATTAG EF119-2 (SEQ ID NO:442) VN KLPLLILLLG
GVLLVSGCQS HKEENKSSKV STEETTVIET VAREQSKESF TSEATKKQTE TTKLEEPDHV
KLLEAYGNAY ANFTSINDRN EKLKPILMTEK CIKKNGIDVK TGVALVSVGK VTTIYKNDQH
EYALLLDCEQ NGTQTRVLLL AKVKNNKISE MTYNSVKQEY EF119-3 (SEQ ID NO:443)
AGAAAACAA GTCTAGTAAA GTATCGACAC AAGAAACGAC AGTGATTGAA ACAGTAGCAA
GGGAACAATC GAAGGAATCG TTTACGAGTG AAGCAACTAA AAAACAGACA GAAACAACGA
AATTAGAAGA ACCAGATCAT GTAAAACTTC TAGAAGCTTA TGGAAATGCG TATGCGAACT
TTACAAGTAT TAATGATCGC AATGAAAAGC TAAAGCCCCT CATGACTGAA AAATGTATCA
AAAAAAATGG AATTGATGTT AAAACTGGAG TAGCGTTAGT TTCCGTAGGA AAGGTTACAA
CGATTTATAA AAATGATCAA CATGAATATG CTTTACTTTT GGATTGTGAA CAAAATGGAA
CGCAGACACG AGTGTTACTT TTGGCTAAGG TGAAGAACAA TAAAATTTCT GAAATGACCT
ATAATTCAGT TAAGCAAGAG TAT EF119-4 (SEQ ID NO:444) ENKSSKV
STEETTVIET VAREQSKESF TSEATKKQTE TTKLEEPDHV KLLEAYGNAY
ANFTS INDRN EKLKPLMTEK CIKKNGIDVK TGVALVSVGK VTTIYKNDQH EYALLLDCEQ
NGTQTRVLLL AKVKNNKI SE MTYNSVKQEY EF120-1 (SEQ ID NO:445)
TGAATAGGCG TGAAAAAGGG AATGTTAGCG TTTTTTGTCG TGCTAGCGGT TTTATCATTA
ACTGCTTGTC GGGAACCAAA AGNAAAGAAA GTAACCGCTT CAACGGAGGC ATCCTCTAAA
GTTGAAGAGA CGAATGAAAA AACGAGTGAA ACAATTGATA AGACAAACGA ACAAGCGAGC
AGCAGTGTCG AGTCTAACGA ATCAGTGAAA AATGAAGAGC CGACAGCTGA TGGAAACAAT
AGTCAGCTAA CTGTAGCTGA TTTAGATACT ACAGCGATTA ATGCTGGCGA TTTTACTACT
TTAGTTGGAA TATGGAAAAA TGGTAAAGGA GAGAGTTTGA TCATTCATCC TGATGGTAGT
ACAAATACCG GAGGAATGAT TACGAAGGAT TCACCTACTG ATGAGTCGCG ACCAATTACA
AGCTTAAGTA TTAGGTGGGG GCCTACTGGT GCTGCGCTAT TATTATATAA AATTGGTGTT
EF120-2 (SEQ ID NO:446) VKKGMLAF FVVLAVLSLT ACREPKXKKV TASTEASSKV
EETNEKTSET IDKTNEQASS SVESNESVKN EEPTADGNNS QLTVADLDTT AINAGDFTTL
VGIWKNGKGE SLIIHPDGST NTGGMITKDS PTDESRPITS LSIRWGPTGA ALLLYKIGV
EF120-3 (SEQ ID NO:447) AAGAAA GTAACCGCTT CAACGGAGGC ATCCTCTAAA
GTTGAAGAGA CGAATGAAAA AACGACTGAA ACAATTGATA AGACAAACGA ACAAGCGAGC
AGCAGTGTCG AGTCTAACGA ATCAGTGAAA AATGAAGAGC CGACAGCTGA TGGAAACAAT
AGTCAGCTAA CTGTACCTGA TTTAGATACT ACAGCGATTA ATGCTGCCGA TTTTACTACT
TTAGTTGGAA TATGGAAAAA TGGTAAAGGA GAGAGTTTGA TCATTCATCC TGATGGTAGT
ACAAATACCG GAGGAATGAT TACGAAGGAT TCACCTACTG ATGAGTCGCG ACCAATTACA
AGCTTAAGTA TTAGGTGGGG GCCTACTGGT GCTGCGCTAT TATTATATAA AATTGGTGTT
EF120-4 (SEQ ID NO:448) KKV TASTEASSKV EETNEKTSET IDKTNEQASS
SVESNESVKN EEPTADGNNS QLTVADLDTT AINAGDFTTL VGIWKNGKGE SLIIHPDGST
NTGGMITKDS PTDESRPITS LSIRWGPTGA ALLLYKIGV EF121-1 (SEQ ID NO:449)
TGAAACACAA GGAGGAAATT TGTGAAAAAG TTGAGCTTTA AAAAAGTGAA GTGGGGCATG
CATTTTTTAA TGGCTGTTGC GTTGATAGCG CCAAGTGTTA CTAGTACGGC ATATGCAGTA
GAAACAACGA GTCAACAAAG TTCAGAAGCA GTAACAAGTA CCACCGATTC AAGTAGAAAA
CAAGAACCAG TCATTACACA GGAAACAACA GACATCAAAC AAGAAGCACC AAATCAGGCT
ACGAGTCACA GTGTCAAGCA GTCACAAGAA ACCACAGCAC CAACAGAGAC GACGAATTTA
GAAACGTCAA TCGCTGAAAA AGAAGAAACG AGCACGCCGC AAAAAATAAC AATTTTAGGT
ACGTCAGATG TTCATGGTCA ATTATGGAAT TGGTCTTATG AAGATGATAA AGAACTACCA
GTTGGTTTGT CCCAAGTAAG TACAGTCGTT AACCAAGTCC GGGCACAAAA CCCAGCAGGC
ACCGTTTTAA TTCATAATGG CGACAATATT CAAGGCACTA TTTTAACAGA TGACTTGTAT
AATAAAGCGC CTTTAGTGAA TGAAAAGACC CATCCAATGA TCACCGCCAT GAATGTGATG
AAGTATGATG CAATGCTTTT GGGAAATCAT GAGTTTAATT TTGGTTTACC GTTAATCAAA
AAAATTCAAC AAGAAGCCAC TTTTCCAATC TTGTCTGCGA ATACCTACAA TAAGGAAGAT
GGTCTTCCTT TTGTTGAAGG GACTACCACG AAGGAACTTG ATTTTAATCA AGATGGGCAG
CCAGATTTAA AAGTTGGGAT TATCGGCTTA ACAATTCCGC ACATTCCTTT GTGGGATGGC
CCTCGTGTTA CTTCGCTTAA TTTTTTACCT TTGAAAGAAG AAGCAGAAAA AGCAGTTACT
GAGTTGAAAG CTAACGATCA GGCTGACATT ATTGTTGCCT CGATTCATGC GGGACAACAA
AATAGTGATC CGGCTGCCAG TGCCGACCAA GTAATTGAAA ATGTCGCGGG GATTGATGCG
TATATTCTGG GTCATGACCA CCTTTCTTTT ACCAAGCAAG GAGCAGCGCC GAATGGAAAA
ACTGTACCGG TAGGGGGACC GAAAGATACG GGGACAGAAG TTGTCAAAAT TGATCTTTCA
GTTGCTAAAA ATGCCGATAA GTGGGAAGTG CAAGAAGGTA CAGCAACGAT TGTACCAACA
ACGAATGTTC CAGCAGATGA AGCAGTTAAG GCAGCGACAA AAGAATACCA TGAAAAAACG
CGAGCGTTTA TTCAGGAGGA GATCGGCACA GCAACAGCTG ATTTTTTACC AAAACAAGAA
ATTAAAGGAA TTCCCGAAGC ACAATTACAA CCAACAGCGA TGATTTCTTT AATTAATAAC
GTTCAAAAAG AAGTAACGGG CGCACAATTA AGTGCGGCAG CGCTGTTTAA ATACGACAGT
AAATTACCTG CGGGGAAGAT TTCCTATGCC ACGATTTTTG ATATCTACAA ATACCCGAAT
ACCTTAGTGA GTGTTCCCAT TAACGGTGAA AACTTACTGA AGTATTTACA AAAACAAGCG
GCGTACTATA ACCAAACACA GCCAGATGAT TTGACCATTA CTTTTAATCC AAACATTCGT
GTATATAACT ATGACATGAT TTCTGGAGTG GACTACAAGA TTGACATTTC AAAACCAGTG
GGTCAACGAA TTGTAGATGC GAAAATTGAC GGCCAACCCC TGGATCCTGC CAAAGAATAT
ACGATTGCTA TGAATAATTA TCGTTACGGC GGTTTAGCTA GCCAAGGGAT TCAAGTAGGG
GAACCTATTA AAAATTCTGA TCCAGAAACC TTACGAGGAA TGATTGTTGA TTATATTAAG
AAAAAAGGAA CTCTTGATCC AGAACAACAA ATCGAACGAA ATTGGTCAAT TATTOGGACA
AATTTTGATG AAAAATGGCG TGCCAAAGCA ATCGAATTAG TGAATGACGG CACTCTTCAA
ATTCCGACTT CTCCTGATGG ACGTACACCA AACGCCGCCG CTATTACGAA ACAAGATGTC
CGTAATGCGG GCTTTGATTT AGATAATGCA TATACCATTA TGCACACAAA TGACGTTCAT
GGCCGACTAG AAGCAGGGAA AGGCGAATTA GGTATGGCGC GTCTAAAAAC CTTTAAAGAC
CAAGAAAACC CAACCTTGAT GGTGGATGCA GGGGATGTTT TCCAAGGATT ACCAATCTCC
AATTTCTCCA AAGGCGCGGA TATGGCCAAA GCAATGAATG AAGTTGGTTA TGATGCCATG
GCGGTGGGAA ATCACGAGTT TGATTTTGGT TTAGAGATTG CACTAGGTTA TAAAGACCAA
CTGAATTTTC CGATTTTATC TAGTAATACG TATTACAAAG ATGGCAGTGG ACGGGTTTTT
GATCCGTATA CAATCGTAGA AAAATCCGGG AAAAAGTTTG CCATTGTAGG TGTGACGACC
CCAGAAACAG CAACGAAAAC ACACCCGAAA AACGTAGAGA AGGTGACATT TAAAGACCCG
ATTCCAGAAG TAGAAGCAGT GATTAAGGAA ATTAAAGAGA AGTACGCGGA TATNCAAGCT
TTCGTGGTTA CTGGGCATTT AGGCGTAGAT GAAACGACCC CGCATATCTG GCGTGGTGAT
ACGCTAGCAG AAACCCTTAG TCAAACATAT CCTGAGTTAG ATATCACTGT GATTGATGGA
CATTCGCATA CAGCCGTCGA AAGTGGCAAA CGTTATGGCA AAGTCATCTA TGCTCAAACA
GGTAATTATT TAAATAATGT TGGGATCGTC ACAGCACCAG AGAGTGAACC AACTAAGAAA
ACAACAAAAT TGATTTCAGC AGCAGAGCTG CTAGAATTGC CAGAAAACCC GGCAGTTAAA
GCCATCGTTG ATGAAGCACG TACGAATTTT AACGCTGAAA ATGAAAAAGT AATTGTCGAT
TATATTCCAT TCACATTGGA TGGACAACGA GAAAATGTGC GCACACGAGA GACCAACTTA
GGGAATTTGA TTGGTGATGC GATTATGTCA TATGGCCAAG ACGCGTTTAG CCAACCTGCT
GATTTTGCAG TAACTAATGG TGGCGGCATT CGCGCTGATA TTAAACAAGG GCCAATTAAA
GTTGGGGATG TCATTGCTGT GTTACCTTTT GGCAATAGCA TTGCGCAAAT TCAAGTAACC
GGCGCCCAAG TTAAAGAAAT GTTTGAAATG TCTGTTCGTT CGATTCCACA AAAAGATGAG
AATGGCACAA TTTTACTAGA TGATGCTGGC CAACCAAAAC TTGGCGCAAA TCGTGGTTTC
CTACATGTTT CAAGCTCCAT TCGTATCCAC TATGATTCCA CAAAACCAGG TACTCGCTTG
GCTAGTGACG AAGGCAATGA AACAGGACAA ACGATTGTCG GTAGTCGCGT ATTAGGAATA
GAAATTAAAA ATCGGCAAAC ACAAAAGTTT GAACCATTGG ATGAGAAGAA ACAATACCGG
ATGGCTACCA ATGATTTCTT AGCTGCTGGT GGTGATGGTT ACGATATGCT AGGTGGTGAA
CGAGAAGAAG GGATTTCACT AGATTCTGTC TTAATTGAAT ACTTGAAAAG TGCAACCAGC
TTGCGGTTGT ATCGTGCAGC AACGACGATT GATTTAGCAC AATATAAAGA ACCATTCCCA
GGCGAACGAA TTGTTTCTAT TTCGGAAGAA GCTTACAAAG AGTTAATCGG TGGAGGAGAG
ACGCCAAAAC CAGATCCAAA ACCAGACCCG AAACCAACAC CAGAAACACC AGTAGCAACC
AATAAACAAA ACCAAGCGGG AGCAAGACAG AGCAATCCAT CCGTAACAGA GAAGAAAAAG
TATGGCGGCT TTTTACCTAA AACGGGTACA GAAACAGAAA CGCTTGCATT ATATGGTTTA
CTGTTCGTTG GACTTTCTTC TTCTGGCTGG TATATTTATA AACGACGTAA CAAAGCTAGT
TAG EF121-2 (SEQ ID NO:450) VKKL SFKKVKWGMH FLMAVALIAP SVTSTAYAVE
TTSQQSSEAV TSTTDSSRKQ EPVITQETTD IKQEAPNQAT SDSVKQSQET TAPTETTNLE
TSIAEKEETS TPQKITILGT SDVHGQLWNW SYEDDKELPV GLSQVSTVVN QVRAQNPAGT
VLIDNGDNIQ GTILTDDLYN KAPLVNEKTH PMITAMNVMK YDAMVLGNHE FNFGLPLIKK
IQQEATFPIL SANTYNKEDG LRFVEGTTTK ELDFNQDGQP DLKVGIIGLT IPHIPLWDGP
RVTSLNFLPL KEEAEKAVTE LKANDQADII VASIHAGQQN SDPAASADQV IENVAGIDAY
ILGHDHLSFT KQGAAPNGKT VPVGGPKDTG TEVVKIDLSV AKNADKWEVQ EGTATIVPTT
NVPADEAVKA ATKEYHEKTR AFIQEEIGTA TADFLPKQEI KGIPEAQLQP TANISLINNV
QKEVTGAQLS AAALFKYDSK LPAGKISYAT IFDIYKYPNT LVSVPINGEN LLKYLEKQGA
YYNQTQPDDL TISFNPNIRV YNYDMISGVD YKIDISKPVG ERIVDAKIDG QPLDPAKEYT
IAMNNYRYGG LASQGIQVGE PIKNSDPETL RGMIVDYIKK KGTLDPEQEI ERNWSIIGTN
FDEKWRAKAT ELVNDGTLQT PTSPDGRTPN AAAITKQDVR NAGFDLDNAY TIMHTNDVHG
RLEAGKGELG MARLKTFKDQ ENPTLMVDAG DVFQGLPISN FSKGADMAKA MNEVGYDANA
VGNHEFDFGL EIALGYKDQL NFPILSSNTY YKDGSGRVFD PYTIVEKSGK KFAIVGVTTP
ETATKTHPKN VEKVTFKDPI PEVEAVIKEI KEKYADXQAF VVTGHLGVDE TTPHIWRGDT
LAETLSQTYP ELDITVIDGH SHTAVESGKR YGKVIYAQTG NYLNNVGIVT APESEPTKKT
TKLISAAELL ELPENPAVKA IVDEARTNFN AENEKVIVDY IPFTLDCQRE NVRTRETNLG
NLIGDAIMSY GQDAFSQPAD FAVTNGGGIR ADIKQGPIKV GDVIAVLPFG NSIAQTQVTG
AQVKEMFEMS VRSIPQKDEN GTILLDDAGQ PKLGANGGFL HVSSSIRIHY DSTKPGTRLA
SDEGNETGQT IVGSRVLGIE TKNRQTQKFE PLDEKKQYRM ATNDFLAACG DGYDMLGGER
EEGISLDSVL IEYLKSATSL RLYRAATTTD LAQYKEPFPG ERIVSISEEA YKELIGGGET
PKPDPKPDPK PTPETPVATN KQNQAGARQS~NPSVTEKKKY GGFLPKTGTE TETLALYGLLJ
FVGLSSSGWY IYKRRNKAS EF121-3 (SEQ ID NO:451) ACAAAG TTCAGAAGCA
GTAACAAGTA CCACCGATTC AAGTAGAAAA CAAGAACCAC TCATTACACA CGAAACAACA
GACATCAAAC AACAAGCACC AAATCACGCT ACGAGTGACA GTGTCAAGCA GTCACAACAA
ACCACAGCAC CAACAGAGAC GACCAATTTA GAAACGTCAA TCGCTGAAAA AGAAGAAACG
AGCACGCCGC AAAAAATAAC AATTTTAGGT ACGTCAGATG TTCATGGTCA ATTATGGAAT
TGGTCTTATG AAGATCATAA AGAACTACCA GTTGGTTTGT CCCAAGTAAG TACAGTCGTT
AACCAAGTCC GGGCACAAAA CCCAGCAGGC ACCGTTTTAA TTCATAATGG CGACAATATT
CAAGGCACTA TTTTAACAGA TGACTTGTAT AATAAAGCGC CTTTAGTGAA TGAAAAGACC
CATCCAATGA TCACCCCCAT GAATGTGATG AAGTATGATG CAATGGTTTT CGGAAATCAT
GAGTTTAATT TTCGTTTACC GTTAATCAAA AAAATTCAAC AAGAAGCCAC TTTTCCAATC
TTGTCTGCGA ATACCTACAA TAAGGAAGAT GGTCTTCGTT TTGTTGAAGG GACTACCACG
AAGGAACTTG ATTTTAATCA AGATGGGCAC CCAGATTTAA AAGTTGGGAT TATCGGCTTA
ACAATTCCGC ACATTCCTTT GTGGGATGGC CCTCGTGTTA CTTCGCTTAA TTTTTTACCT
TTGAAACAAG AAGCAGAAAA AGCAGTTACT GAGTTGAAAC CTAACGATCA CCCTGACATT
ATTGTTGCCT CGATTCATGC GGCACAACAA AATAGTGATC CGGCTGCCAG TCCCGACCAA
CTAATTGAAA ATGTCGCGGG GATTGATGCC TATATTCTGG GTCATGACCA CCTTTCTTTT
ACCAACCAAG GAGCAGCGCC GAATGGAAAA ACTGTACCCG TAGGGGGACC GAAAGATACG
GGGACAGAAG TTGTCAAAAT TGATCTTTCA GTTGCTAAAA ATGCCGATAA GTGGGAAGTG
CAAGAAGGTA CAGCAACGAT TGTACCAACA ACCAATGTTC CAGCAGATGA AGCAGTTAAG
GCAGCGACAA AAGAATACCA TGAAAAAACG CGAGCGTTTA TTCAGCAGCA GATCGGCACA
GCAACAGCTG ATTTTTTACC AAAACAAGAA ATTAAAGGAA TTCCCGAAGC ACAATTACAA
CCAACAGCGA TGATTTCTTT AATTAATAAC GTTCAAAAAG AAGTAACCGG CGCACAATTA
AGTGCGGCAG CGCTOTTTAA ATACGACAGT AAATTACCTG CGCCGAAGAT TTCCTATGCC
ACGATTTTTG ATATCTACAA ATACCCGAAT ACCTTAGTGA GTGTTCCCAT TAACGGTGAA
AACTTACTGA ACTATTTACA AAAACAACCG GCGTACTATA ACCAAACACA GCCAGATGAT
TTGACCATTA GTTTTAATCC AAACATTCGT CTATATAACT ATGACATGAT TTCTGGAGTC
GACTACAACA TTGACATTTC AAAACCAGTG GGTGAACGAA TTGTAGATGC CAAAATTGAC
GGCCAACCGC TGGATCCTGC CAAAGAATAT ACGATTGCTA TGAATAATTA TCGTTACGGC
GGTTTAGCTA GCCAAGGCAT TCAAGTAGGG GAACCTATTA AAAATTCTGA TCCAGAAACC
TTACGAGCAA TGATTGTTGA TTATATTAAG AAAAAAGGAA CTCTTGATCC AGAACAAGAA
ATCGAACGAA ATTGGTCAAT TATTGGGACA AATTTTGATG AAAAATGGCG TGCCAAAGCA
ATCCAATTAG TGAATGACGG CACTCTTCAA ATTCCGACTT CTCCTGATGC ACGTACACCA
AACCCCG EF121-4 (SEQ ID NO:452) QSSEAV TSTTDSSRKQ EPVITQETTD
IKQEAPNQAT SDSVKQSQET TAPTETTNLE TSIAEKEETS TPQKITILGT SDVHGQLWNW
SYEDDKELPV GLSQVSTVVN QVRAQNPAGT VLIDNGDNIQ GTILTDDLYN KAPLVNEKTH
PMITAMNVMK YDAMVLCNHE FNFGLPLIKK IQQEATFPIL SANTYNKEDO LRFVEGTTTK
ELDFNQDGQP DLKVGIIGLT IPHIPLWDGP RVTSLNFLPL KEEAEKAVTE LKANDQADII
VASIHAGQQN SDPAASADQV TENVAGIDAY ILGHDHLSFT KQGAAPNGKT VPVGGPKDTG
TEVVKIDLSV AKNADKWEVQ EGTATIVPTT NVPADEAVKA ATKEYHEKTR AFIQEEIGTA
TADFLPKQET KGIPEAQLQP TAMISLINNV QKEVTGAQLS AAALFKYDSK LPAGKISYAT
IFDIYKYPNT LVSVPTNGEN LLKYLEKQGA YYNQTQPDDL TISFNPNIRV YNYDMISGVD
YKIDISKPVG ERIVDAKIDG QPLDPAKEYT IAMNNYRYGG LASQGIQVGE PIKNSDPETL
RGMTVDYIKK KGTLDPEQEI ERNWSITGTN FDEKWRAKAI ELVNDGTLQI PTSPDGRTPN A
EF122-1 (SEQ ID NO:453) TGAAACACAA GGAGGAAATT TGTGAAAAAG TTGAGCTTTA
AAAAAGTGAA GTGGGGCATG CATTTTTTAA TGGCTGTTGC GTTGATAGCG CCAAGTGTTA
CTAGTACGGC ATATGCAGTA GAAACAACGA GTCAACAAAG TTCAGAAGCA GTAACAAGTA
CCACCGATTC AAGTAGAAAA CAAGAACCAG TCATTACACA GGAAACAACA GACATCAAAC
AAGAAGCACC AAATCAGGCT ACGAGTGACA GTGTCAAGCA GTCACAAGAA ACCACAGCAC
CAACAGAGAC GACCAATTTA GAAACGTCAA TCGCTGAAAA AGAAGAAACG AGCACGCCGC
AAAAAATAAC AATTTTAGGT ACGTCAGATG TTCATGGTCA ATTATGGAAT TGGTCTTATG
AAGATGATAA AGAACTACCA GTTGGTTTCT CCCAAGTAAG TACAGTCGTT AACCAAGTCC
GGGCACAAAA CCCAGCAGGC ACCGTTTTAA TTGATAATGG CGACAATATT CAAGGCACTA
TTTTAACAGA TGACTTGTAT AATAAAGCGC CTTTAGTGAA TGAAAAGACC CATCCAATGA
TCACCGCCAT GAATGTGATG AAGTATGATG CAATGGTTTT GGGAAATCAT GAGTTTAATT
TTGGTTTACC GTTAATCAAA AAAATTCAAC AAGAAGCCAC TTTTCCAATC TTGTCTGCGA
ATACCTACAA TAAGGAAGAT GGTCTTCGTT TTGTTGAAGG GACTACCACG AAGGAACTTG
ATTTTAATCA AGATGGGCAG CCAGATTTAA AAGTTGGGAT TATCGGCTTA ACAATTCCGC
ACATTCCTTT GTGGGATGGC CCTCGTGTTA CTTCGCTTAA TTTTTTACCT TTGAAAGAAG
AAGCAGAAAA AGCAGTTACT GAGTTGAAAG CTAACGATCA GGCTGACATT ATTGTTGCCT
CGATTCATGC CGGACAACAA AATAGTGATC CGGCTGCCAG TGCCGACCAA GTAATTGAAA
ATGTCGCGGG GATTGATGCG TATATTCTGG GTCATGACCA CCTTTCTTTT ACCAAGCAAG
GAGCAGCGCC GAATGGAAAA ACTGTACCGG TAGGGGGACC GAAAGATACG GGGACAGAAG
TTGTCAAAAT TGATCTTTCA GTTGCTAAAA ATGCCGATAA GTGGGAAGTG CAAGAAGGTA
CAGCAACGAT TGTACCAACA ACGAATGTTC CAGCAGATGA AGCAGTTAAG GCAGCGACAA
AAGAATACCA TGAAAAAACG CGAGCGTTTA TTCAGGAGGA GATCGGCACA GCAACAGCTG
ATTTTTTACC AAAACAAGAA ATTAAAGGAA TTCCCGAAGC ACAATTACAA CCAACAGCGA
TGATTTCTTT AATTAATAAC GTTCAAAAAG AAGTAACGGG CGCACAATTA AGTGCGGCAG
CGCTGTTTAA ATACGACAGT AAATTACCTG CGGGGAAGAT TTCCTATGCC ACGATTTTTG
ATATCTACAA ATACCCGAAT ACCTTAGTGA GTGTTCCCAT TAACGGTGAA AACTTACTGA
AGTATTTAGA AAAACAAGGG GCGTACTATA ACCAAACACA GOCAGAIGAT TTGACCATTA
GTTTTAATCC AAACATTCGT GTATATAACT ATGACATGAT TTCTGGAGTG GACTACAAGA
TTGACATTTC AAAACCAGTG GGTGAACGAA TTGTAGATGC GAAAATTGAC GGCCAACCGC
TGGATCCTGC CAAAGAATAT ACGATTGCTA TGAATAATTA TCGTTACGGC GGTTTAGCTA
GCCAAGGGAT TCAAGTAGGG GAACCTATTA AAAATTCTGA TCCAGAAACC TTACGAGGAA
TGATTGTTGA TTATATTAAG AAAAAAGGAA CTCTTGATCC AGAACAAGAA ATCGAACGAA
ATTGGTCAAT TATTGGGACA AATTTTGATG AAAAATGGCG TGCCAAAGCA ATCGAATTAG
TGAATGACGG CACTCTTCAA ATTCCGACTT CTCCTGATGG ACOTACACCA AACGCCGCCG
CTATTACGAA ACAAGATGTC CGTAATGCGG GCTTTGATTT AGATAATGCA TATACCATTA
TGCACACAAA TGACGTTCAT GGCCGACTAG AAGCAGGGAA AGGCGAATTA GGTATGGCGC
GTCTAAAAAC CTTTAAAGAC CAAGAAAACC CAACCTTGAT GGTGGATGCA GGGGATGTTT
TCCAAGGATT ACCAATCTCC AATTTCTCCA AAGGCGCGGA TATGGCCAAA GCAATGAATG
AAGTTGGTTA TGATGCCATG GCGGTGGGAA ATCACGAGTT TGATTTTGGT TTAGAGATTG
CACTAGGTTA TAAAGACCAA CTGAATTTTC CGATTTTATC TAGTAATACG TATTACAAAG
ATGGCAGTGG ACGGGTTTTT GATCCGTATA CAATCGTAGA AAAATCCGGG AAAAAGTTTG
CCATTGTAGG TGTGACGACC CCAGAAACAG CAACGAAAAC ACACCCGAAA AACGTAGAGA
AGGTGACATT TAAAGACCCG ATTCCAGAAG TAGAAGCACT GATTAACCAA ATTAAAGAGA
ACTACGCGGA TATNCAAGGT TTCGTGGTTA CTGGGCATTT AGGCGTAGAT GAAACGACGC
CGCATATCTG GCGTGGTGAT ACGCTACCAG AAACCCTTAG TCAAACATAT CCTCACTTAG
ATATCACTGT GATTOATOGA CATTOCCATA CAGCCGTCGA AAGTGGCAAA CGTTATGGCA
AAGTGATCTA TCCTCAAACA GGTAATTATT TAAATAATGT TGGCATCGTC ACAGCACCAG
AGAGTGAACC AACTAAGAAA ACAACAAAAT TGATTTCAGC AGCAGAGCTG CTACAATTGC
CAGAAAACCC GGCAGTTAAA GCCATCGTTC ATGAAGCACG TACGAATTTT AACGCTGAAA
ATGAAAAAGT AATTGTCGAT TATATTCCAT TCACATTGGA TGGACAACGA GAAAATGTGC
GCACACGAGA GACCAACTTA GGGAATTTGA TTGGTGATGC GATTATGTCA TATGGCCAAG
ACGCGTTTAG CCAACCTGCT GATTTTGCAG TAACTAATGG TGCCCGCATT CGCGCTGATA
TTAAACAAGG GCCAATTAAA GTTGGGGATG TCATTGCTGT GTTACCTTTT GGCAATAGCA
TTGCGCAAAT TCAAGTAACC GGCGCCCAAC TTAAAGAAAT GTTTGAAATG TCTGTTCGTT
CGATTCCACA AAAAGATGAG AATGGCACAA TTTTACTAGA TGATGCTGGC CAACCAAAAC
TTGGCGCAAA TGGTGGTTTC CTACATCTTT CAAGCTCCAT TCGTATCCAC TATGATTCCA
CAAAACCAGG TACTCGCTTG GCTAGTGACG AAGGCAATGA AACAGGACAA ACGATTGTCG
GTAGTCGCGT ATTAGGAATA GAAATTAAAA ATCGGCAAAC ACAAAAGTTT GAACCATTGG
ATGAGAAGAA ACAATACCGC ATGGCTACCA ATGATTTCTT AGCTGCTGGT GGTGATGGTT
ACGATATGCT AGGTGGTGAA CGAGAACAAG GGATTTCACT ACATTCTGTC TTAATTGAAT
ACTTCAAAAG TCCAACCAGC TTGCGGTTGT ATCGTGCAGC AACGACGATT GATTTAGCAC
AATATAAAGA ACCATTCCCA GGCGAACGAA TTGTTTCTAT TTCGGAAGAA GCTTACAAAG
AGTTAATCGG TGGAGGAGAG ACGCCAAAAC CAGATCCAAA ACCAGACCCC AAACCAACAC
CAGAAACACC AGTAGCAACC
AATAAACAAA ACCAAGCGGG AGCAAGACAG AGCAATCCAT CCGTAACAGA GAAGAAAAAG
TATGGCGGCT TTTTACCTAA AACGGGTACA GAAACAGAAA CGCTTGCATT ATATGGTTTA
CTGTTCGTTG GACTTTCTTC TTCTGGCTGG TATATTTATA AACGACGTAA CAAAGCTAGT
TAG EF122-2 (SEQ ID NO:454) VKKL SFKKVKWGMH FLMAVALIAP SXTTSTAYAVE
TTSQQSSEAV TSTTDSSRKQ EPVITQETTD IKQEAPNQAT SDSVKQSQET TAPTETTNLE
TSIAEKEETS TPQKTTTLGT SDVHGQLWNW SYEDDKELPV GLSQVSTVVN QVBAQNPAGT
VLIDNGDNIQ GTILTDDLYN KAPLVNEKTH PMITANNVMK YDAMVLGNHE FNFGLPLIKK
IQQEATFPTL SANTYNKEDG LRFVEGTTTK ELDFNQDGQP DLKVGIIGLT IPHIPLWDGP
RVTSLNFLPL KEEAEKAVTE LKANDQADII VASIHAGQQN SDPAASADQV TENVAGIDAY
ILGHDHLSFT KQGAAPNGKT VPVGGPKDTG TEVVKTDLSV AKNADKWEVQ EGTATIVPTT
NVPADEAVKA ATKEYHEKTR AFIQESIOTA TADFLPKQEI KGIPEAQLQP TAMISLINNV
QKEVTGAQLS AAALFKYDSK LPAGKISYAT IFDIYKYPNT LVSVPINGEN LLKYLEKQGA
YYNQTQPDDL TTSFNPNIRV YNYDMISGVD YKIDISKPVG ERIVDAKIDG QPLDPAKEYT
IAMNNYRYGG LASQGIQVGE PIKNSDPETL RGMIVDYIKK KGTLDPEQET EENWSIIGTN
FDEKWRAKAI ELVNDGTLQT PTSPDGRTPN AAAITKQDVR NAGEDLONAY TIMHTNDVHG
RLEAGKGELG MARLKTFKDQ ENPTLMVDAG DVFQGLPISN FSKGADMAKA MNEVGYDAMA
VGNHEFDFGL EIALGYKDQL NFPILSSNTY YKDGSGRVFD PYTIVEKSGK KFATVGVTTP
ETATKTHPKN VEKVTFKDPI PEVEAVIKET KEKYADXQAF VVTGHLGVDE TTPHIWRGDT
LAETLSQTYP ELDITVIDGH SHTAVESGKR YGKVIYAQTG NYLNNVGIVT APESEPTKKT
TKLISAAELL ELPENPAVKA IVDEARTNFN AENEKVIVDY IPFTLDGQRE NVRTRETNLG
NLIGDAIMSY GQDAFSQPAD FAVTNGGGIR ADIKQGPTKV GDVIAVLPFG NSIAQTQVTG
AQVKEMFEMS VRSIPQKDEN GTILLDDAGQ PKLGANGGFL EVESSIRIHY DSTKPGTRLA
SDEGNETCQT IVGSRVLGIE IKNRQTQKFE PLDEKKQYRM ATNDFLAAGG DGYDMLGGER
EEGTSLDSVL IEYLKSATSL RLYPAATTTD LAQYKEPEPG ERIVSISEEA YKELIGGGET
PKPDPKPDPK PTPETPVATN KQNQAGARQS NPSVTEKKKY GGFLPKTGTE TETLALYGLL
FVGLSSSGWY IYKRPNKAS EF122-3 (SEQ ID NO:455) TG AAAAATGGCG
TGCCAAAGCA ATCGAATTAG TGAATGACGG CACTCTTCAA ATTCCGACTT CTCCTGATGG
ACGTACACCA AACGCCGCCG CTATTACGAA ACAAGATGTC CGTAATGCGG GCTTTGATTT
AGATAATGCA TATACCATTA TGCACACAAA TGACGTTCAT GGCCGACTAG AAGCAGGGAA
AGGCGAATTA GGTATGGCGC GTCTAAAAAC CTTTAAAGAC CAAGAAAACC CAACCTTGAT
GGTGGATCCA GGGGATGTTT TCCAAGGATT ACCAATCTCC AATTTCTCCA AAGGCGCGGA
TATGGCCAAA GCAATGAATG AAGTTGGTTA TGATGCCATG GCGGTGGGAA ATCACGAGTT
TGATTTTGGT TTAGAGATTG CACTACGTTA TAAAGACCAA CTGAATTTTC CGATTTTATC
TAGTAATACG TATTACAAAG ATGGCAGTGG ACGGGTTTTT GATCCGTATA CAATCGTAGA
AAAATCCGGG AAAAAGTTTG CCATTGTAGG TGTGACGACC CCAGAAACAG CAACGAAAAC
ACACCCGAAA AACGTAGAGA AGGTGACATT TAAAGACCCC ATTCCAGAAG TAGAAGCAGT
GATTAAGGAA ATTAAAGAGA AGTACGCGGA TATNCAAGCT TTCGTGGTTA CTGGGCATTT
AGGCGTAGAT GAAACGACGC CGCATATCTG GCGTGGTGAT ACGCTAGCAG AAACCCTTAG
TCAAACATAT CCTGAGTTAG ATATCACTGT GATTGATGGA CATTCGCATA CAGCCGTCGA
AAGTGGCAAA CGTTATGGCA AAGTGATCTA TGCTCAAACA GGTAATTATT TAAATAATGT
TGGGATCGTC ACAGCACCAG AGAGTGAACC AACTAAGAAA ACAACAAAAT TGATTTCAGC
AGCAGAGCTG CTAGAATTGC CAGAAAACCC GGCAGTTAAA GCCATCGTTG ATGAAGCACG
TACGAATTTT AACGCTGAAA ATGAAAAAGT AATTGTCGAT TATATTCCAT TCACATTGGA
TGGACAACGA GAAAATGTGC GCACACGAGA GACCAACTTA GGGAATTTGA TTGGTGATGC
GATTATGTCA TATGGCCAAG ACGCGTTTAG CCAACCTGCT GATTTTGCAG TAACTAATGG
TGGCCGCATT CGCGCTGATA TTAAACAAGG GCCAATTAAA GTTGGGGATG TCATTCCTGT
GTTACCTTTT GGCAATAGCA TTGCGCAAAT TCAAGTAACC GGCGCCCAAG TTAAAGAAAT
GTTTGAAATG TCTGTTCGTT CGATTCCACA AAAAGATGAG AATGGCACAA TTTTACTAGA
TGATGCTGGC CAACCAAAAC TTGGCGCAAA TGGTGGTTTC CTACATGTTT CAAGCTCCAT
TCGTATCCAC TATCATTCCA CAAAACCAGG TACTCGCTTG GCTAGTGACG AAGGCAATGA
AACAGGACAA ACCATTGTCC GTAGTCGCGT ATTAGGAATA GAAATTAAAA ATCGGCAAAC
ACAAAAGTTT GAACCATTGG ATGAGAAGAA ACAATACCGG ATGGCTACCA ATGATTTCTT
AGCTGCTGGT GGTGATGGTT ACGATATGCT AGGTCGTGAA CGAGAAGAAG GGATTTCACT
AGATTCTGTC TTAATTGAAT ACTTGAAAAG TGCAACCAGC TTGCGGTTGT ATCGTGCAGC
AACGACGATT GATTTAGCAC AATATAAAGA ACCATTCCCA GGCGAACGAA TTGTTTCTAT
TTCGGAAGAA GCTTACAAAG AGTTAATCGG TGGAGGAGAG ACGCCAAAAC CAGATCCAAA
ACCAGACCCG AAACCAACAC CAGAAACACC AGTAGCAACC AATAAACAAA ACCAAGCGGG
AGCAAGACAG AGCAATCCAT CCGTAACAGA GAAGAAAAAG TATGGCGGCT TT EF122-4
(SEQ ID NO:456) EKWRAKAI ELVNDGTLQI PTSPDCRTPN AAAITKQDVR
NAGFDLDNAY TIMHTNDVHG RLEAGKGELG MARLKTFKDQ ENPTLMVDAG DVFQGLPISN
FSKGADMAKA MNEVGYDAMA VGNHEFDFGL EIALGYKDQL NFPILSSNTY YKDGSGRVFD
PYTIVEKSGK KFAIVGVTTP ETATKTHPKN VEKVTFKDPI PEVEAVIKEI KEKYADXQAF
VVTGHLGVDE TTPHIWRGDT LAETLSQTYP ELDITVIDGH SHTAVESGKR YGKVTYAQTG
NYLNNVGIVT APESEPTKKT TKLISAAELL ELPENPAVKA IVDEARTNFN AENEKVIVDY
IPFTLDGQRE NVRTRETNLG NLIGDAIMSY GQDAFSQPAD FAVTNGGGIR ADIKQGPIKV
GDVIAVLPFG NSTAQIQVTG AQVKEMFEMS VRSIPQKDEN GTILLDDAGQ PKLGANGGFL
HVSSSIRIHY DSTKPGTRLA SDEGNETGQT IVGSRVLGIE LKNRQTQKFE PLDEKKQYRM
ATNDFLAAGG DGYDMLGGER EEGISLDSVL IEYLKSATSL RLYRAATTID LAQYKEPFPG
ERIVSISEEA YKELIGGGET PKPDPKPDPK PTPETPVATN KQNQAGARQS NPSVTEKKKY
GGF EF123-1 (SEQ ID NO:457) TAAAATAAAA AATTGGTACG AAGTGAACGT
TCTCTTCTAT GTGTCGTTAG TAGAGGAAGG ATGAAAGAAA TGAGAAAGAA TGGTCCAATG
GTAAACCGTT GGCTCTACGG GTTGATGTGT TTGTTACTTG TTCTAAATTA TGGCACACCA
CTCATGGCTT TGGCGGAAGA GGTTAACAGC GATGGCCAGT TAACGTTAGG AGAAGTGAAG
CAAACCAGCC AGCAAGAAAT GACCTTAGCG CTTCAAGGAA AAGCACAACC AGTAACACAA
GAGGTTGTAG TGCATTATAG TGCCAATGTG TCAATCAAAG CTGCACATTG GGCAGCGCCC
AATAATACGC GCAAGATTCA AGTGGATGAC CAGAAGAAAC AGATTCAAAT TGAATTGAAT
CAGCAAGCGT TAGCAGATAC CTTAGTCTTA ACGTTGAACC CTACAGCTAC AGAAGATGTG
ACGTTTTCTT ATCGACAACA GCAACGAGCG TTGACGTTAA AGACTGGTAC TGATCCGACA
GAATCAACGG CAATCACGAG TTCGCCAGCC GCATCAGCGA ATGAAGGTTC AACAGAAGAA
GCATCTACAA ACTCCTCTGT TCCTCGTTCG TCCGAAGAAA CTGTCGCCAG CACGACAAAA
GCGATAGAAA GTAAAACAAC TGAATCGACG ACTGTCAAAC CGCGCGTAGC AGGACCAACA
GATATCAGTG ATTATTTTAC AGGTGATGAA ACAACGATTA TCGATAATTT TGAAGATCCG
ATTTATTTAA ATCCTGATGG AACACCAGCA ACACCGCCGT ATAAAGAAGA TGTGACCATT
CATTGGAACT TTAACTGGTC GATTCCAGAA GATGTGCGAG AACAAATGAA AGCAGGCGAT
TACTTCGAGT TTCAATTACC TGGCAATTTG AAACCTAATA AACCAGGTTC AGGTGATTTA
CTTGATGCAG AAGGCAATGT CTATGGAACC TACACAATTA GTGAAGATGG TACGGTTCGT
TTTACCTTTA ATGAGCGAAT CACGTCTGAA AGTGACATTC ACGGGGACTT TTCTTTAGAT
ACTCATTTGA ATGATTCAGA TGGGCGGGGC CCAGGAGATT GGGTGATTGA TATTCCTACA
CAAGAAGATT TGCCGCCTGT AGTGATTCCA ATTGTCCCAG ATACCGAACA ACAAATTGAT
AAACAAGGCC ATTTTGATCG AACGCCCAAT CCTAGTGCGA TTACTTGGAC GGTAGATATC
AATCAAGCGA TGAAAGATCA AACAAATCCA ACTGTGACGG AAACATGGCC AACAGGGAAT
ACCTTTAAGT CCGTGAAAGT CTATGAGTTA GTGATCAATC TTGATGGAAC AATTAAAGAA
GTGGGTCGCG AACTTAGTCC AGATGAATAT ACCGTTGATA AAAATGGCAA TGTGACGATT
AAAGGTGACA CCAACAAAGC GTATCGTCTT GAGTACCAAA CGACGATTGA CGAGGCGGTT
ATTCCAGATG GCGGCGGCGA TCTGCCTTTT AAAAATCACG CGACGTTAAC AAGTGATAAT
AATCCAAATG GGTTAGATGC TGAAGCAACT GTTACCGCCA CATATGGCAA AATGTTAGAC
AACCGCAATA TAGATTACGA CGAAGCCAAT CAAGAATTCA CTTGGGAAAT TAACTACAAC
TATGGTGAAC AAACCATTCC AAAAGACCAA GCAGTCATTA CAGACACAAT GGGGGATAAT
TTAACGTTTG AACCAGATTC TTTACATTTA TATTCAGTGA CATTTGATGA CAAAGGAAAT
GAAGTCGTTG GAGCAGAACT TGTGGAAGGA AAAGATTACA AAGTGGTAAT CAACGGAGAC
GGTTCCTTTG CAATTGACTT TTTACATGAT GTGACTGGCG CAGTCAAGAT TGATTATAAA
ACCAAAGTTG ATGGAATTGT CGAAGGCGAT GTTGCCGTGA ATAATCGTGT GGATGTTGGC
ACTGGTCAGC ATTCAGAAGA TGATGGCACA GCCAGTCAAC AAAATATTAT TAAAAACACT
GGTGCAGTTG ATTATCAAAA TTCAACGATT GGTTGGACGT TAGCTGTGAA TCAAAATAAT
TATTTGATGG AAAATGCCGT GATTACGGAT ACGTACGAAC CAGTTCCTGG CTTAACTATG
GTACCCAATT CGTTGGTTGT CAAAGATACA ACCACTGGTG CTCAGTTGAC GTTAGGCAAG
GATTTCATGG TAGAAATAAC TCGTAATGCA GATGGTGAAA CAGGCTTTAA GGTAAGTTTT
ATAGGGGCGT ATGCCAAAAC AAGTGATCCC TTCCACATAA CTTATACTAC CTTTTTCGAT
GTTACCGAGT TAGACGCTAA CAATCCTGCG TTGGACCATT ATCGAAATAC CGCTGCCATT
GATTGGACGG ATGAAGCAGG AAACAATCAT CATTCAGAAG ATAGTAAACC GTTTAAACCT
TTACCTGCTT TTGATTTAAA TGCGCAAAAA AGCGGTGTTT ACAATGCCGT CACCAAAGAA
ATCACTTGGA CGATTGCGGT TAATTTAAGT AATAATCGTT TAGTCGACGC CTTTTTGACG
GATCCAATTT TAACCAATCA AACCTATTTG GCTGGGACCT TGAAAGTCTA TGAAGGCAAT
ACAAAGCCAG ATGGTTCGGT TGAAAAAGTG AAACCAACGC AACCGTTGAC GGATATCACA
ATGGAAGAAC CAAGCGAGAA AAACCAAAAT ACTTGGCGTG TTGATTTTCC TAATGATACT
CGTACGTATG TGATTGAATT TAAGACGTCT GTTGATGAAA AAGTTATCGA AGGTTCGGCT
AGTTATGACA ATACCGCATC TTATACAAAC CAAGGTTCTT CACGTCATGT GACAGGAAAA
GTTTCTATTC AACATGGTGG CGAATCAGTG AAAAAAGCTG GCGAATACCA CAAAGATGAT
CCAGATCATG TGTACTGGCA TGTAATGATC AATGGCGCCC AATCGGTTTT AGACGATGTG
GTTATTACTG ATACACCCTC ACCAAACCAA GTGCTAGATC CCGAGTCATT GGTGATTTAC
GGTACCAACG TAACAGAAGA CGGAACTATT ACGCCAGATA AATCTGTTAT TTTAGAAGAA
GGAAAAGATT ACACACTGGA AGTTACCACC GATAATGAAA CAGGACAACA AAAAATTGTC
GTTAAAATGG CCCATATTGA AGCACCTTAT TATATGGAAT ATCGTAGTTT AGTGACTTCT
TCAGCGGCGG GGAGTACAGA CACGGTATCC AACCAAGTGT CAATTACTGG AAATGGTTCA
GAAGTCGTTC ATGGGGATGA CAATGGCGAT GTGGTCGTTG ACATTGATCA CAGTGGCGGG
CATGCCACAG GGACTAAAGG CAAAATTCAG CTGAAGAAAA CAGCCATGGA TGAGACGACT
ATTTTAGCAG GCGCCCATTT CCAAATTTGG GACCAAGCTA AAACACAAGT CCTACGTGAA
GGTACAGTAG ATGCCACCGG GGTTATCACA TTTGGTGGGT TGCCACAAGG GCAATACATT
TTGGTGGAGA CAAAAGCACC AGAACGCTAT ACAGTTTCGG ACGAATTAGC TAAAGGCCGA
GTCATTACTA TTGATGAAGA AACTTCAGCC GAAGGAGCAC AACCAACCAT TATTAAAAAC
GATGTCAATA AAGTATTTTT AGAAAAAATG GATGAGAAGG GTAAAAAGTT AGTCAATGCT
CGCTTTAAAT TAGAGCATGC CGTAACCACG CCGTTTACTC ATTGGGAAGA AGTTCCCCTT
GCGCCGGATC GAACCAACGC GAATGGCCAG TTAGAGGTGG ATAGTTTAAA ACCAGGGCTT
TATCAGTTCA CAGAAATCGA AGCACCGACA GGCTATCTTT TAGACACOAC CCCCAAACGA
TTCATCGTGA CACAAAATAC GAGCGGACAA ATTCGTGATG TTCATGTCAA AATGCTTAAT
TACCAAGGTT CTGCTGAACT AATTAAAAAA GACCAAGCAG GCAATCCATT AGCAGGTGCT
GAATTTTCAG TCCTTGACAC CACAGGACAA GCAGTTCGAG AACACTTAGT TTCGGATGCA
AACGGAAAAG TCACAGTGAC GGATTTAGCC CCAGGAAAAT ATCAATTTGT GGAAACCAAA
GCGCCAGCAG GGTACCTTTT AAACACTGAA CCAAGTGCTT TCACGATTGC AGCAAGCGAT
CGGGGCAAAC CAGCAACAGT TATAGCAACG GCTAACTTTG TTAACTATCA AGGCACGGCT
AAATTAATCA AAAAAGATGT GAATGGACAC TTATTAAGTG GTGCGACATT TAAAGTGCTT
GATGCGAAGG GAGAAACGAT TCAAACAGGC TTGACGACAA ATAATCAAGG GGAAATTGTT
GCAGAGCACT TAGCCCCAGG AAAATATCGC TTTGTAGAAA CCAAAGCGCC AACAGGCTAT
TTATTAAATA CCACGCCAGT CCCATTTGAA ATTGCTGAGA AAAATGCTGG TAAACCAGCG
GTCGTGGTTG CTAGTGACAA CTTTGTGAGT TACAAAGGCG CTTTCCAAAT CGTGAAAACG
AATAGCGCAG ACCAACCATT AGCAGGTGCT GTTTTTGAAT TATATGATCA CAATAAACAA
TCATTAGGGA TTACAGCAAC GAGTGGCAAA GATGGCAAAA TTATCTTTAG AGACTTGGCG
CCAGGTACCT ATTATTACAA AGAAATCAAA GCACCAAAAT TACCAGATGG CGCAGATTAT
ATTATTTATC CTGAATTAGT AAAAGTAGAA ATTCGTGGTG ATTTCAAAGG TGATCCGGAG
ATTTTCCAAT TAGGGGCCTT CGCCAATTTC AAAGGACGCG CCGTCTTTAA GAAAATTGAT
GCCAATGCGA ACCCACTTCC AGGAACGATT TTTAAATTGT ATCGAATCGA AAACGGGGAA
AAAATCTTTG AAAGAGAACT AACTGCTGAA AAAGATGGTT CATTGGCTAT GGAGGATTTA
GGTGCTGGTA GCTATGAATT AGATGAACTG GATGCAACGG ATGGCTATAT CGTCAATAAA
CAACCCATTT ATTTTGTAGT GAAGAAGAAT TCAAATGATA AACAACCACT AGATGAGTTA
GAGTTTGTAA ATTATCAAGC AGAAGTAATG GGACGTAAAG TCAACGAGCA AGGTCAAACC
TTAGCGGGTG CAGTTTTTGC AATTTACAAT GCCGATGAGC AGAATCAGCC CCAAGGTTCA
CCGATAACAT TCTTGAATCG TGCAGGAGAA AAAGTTTCTG AAATAACAAC GGATAAGACT
GGCGAAATTT ACGCTAAAGG GCTAAATGAA GGGCATTACG TTTTAGTGGA AACGAAAGCA
CCAACAGGCT ATCTGTTAGA CACAACGCTA CATCCATTTG ATGTAACCGC CCAATTAGGA
AAAGAGCAGC CAATTGCTTT AGGCGATCTT ATCAATTATC AAGGAACTGC TCAATTAACC
AAAGAAAACG AAACAGGTGA AGCATTGGCA GGTGCGGTGT TTAAGGTCAT TGATGAAACA
GGGCAAACCG TAGATGGACA AACCAATCTG ATGTCTGACA AGCAAGGCAA AGTCATTGCG
AAAAACTTAG CACCGGGAAC GTATCGTTTT GTGGAGACAC AAGCGCCAAC TAGCTATCTT
CTTAATGAAA CGCCAAGCGC AAGCTTTACG ATTGCCAAAG ACAACCAAGG CAAACCAGCC
ACTGTGGTAC TTAAAGCACC TTTTATTAAT TACCAAGGTG CTGCCAAGCT GGTGAAAATT
GATCAGCAAA AGAATGCCTT AGCAGGTGCT GAATTTAAAG TGACAGATGC AGAGACAGGG
CAAACTGTCG CTCGTTCATT ACGTTCTGAC AACCAAGGGT TAGTTCAAGT GAACCACTTA
CAACCAGGAA AATATACCTT TGTGGAAACA AAAGCACCGG ATGGTTACCA ACTGTCTAAG
CAAGCTGTCG CATTOACTAT TGCGGCAACA GCGAAAGACA AACCTGAACT CGTGAATGCG
GGCACGTTTG TTAACGAGAA ACAACCTGTA TCCAAAAAAA CAAAACCAAA TCAGCCAACA
ACGAAACAAG CAGCTAGAGA GACAGGTTGG CTTGGTTTAC CGAAAACCAA CACACAAGTC
AATTACTTCT TTGTCTTTAT CGGCCTCATG TTGGTCGGTT TGGCAAGTTG GCTCTTCTAT
AAAAAGAGCA AGAAATAA EF123-2 (SEQ ID NO:458) MRKNGPMV NRWLYGLMCL
LLVLNYGTPL MALAEEVNSD GQLTLGEVKQ TSQQEMTLAL QGKAQPVTQE VVVHYSANVS
IKAAHWAAPN NTRKIQVDDQ KKQIQIELNQ QALADTLVLT LNPTATEDVT FSYGQQQRAL
TLKTGTDPTE STAITSSPAA SANEGSTEEA STNSSVPRSS EETVASTTKA IESKTTESTT
VKPRVAGPTD ISDYFTGDET TTIDNFEDPT YLNPDGTPAT PPYKEDVTIH WNFNWSIPED
VREQNKACDY FEFQLPGNLK PNKPGSGDLV DAEGNVYGTY TISEDGTVRF TFNERITSES
DIHGDFSLDT HLNDSDGRGP CDWVIDIPTQ EDLPPVVTPI VPDTEQQIDK QGHFDRTPNP
SAITWTVDIN QANKDQTNPT VTETWPTGNT FKSVKVYELV MNLDGTIKEV GRELSPDEYT
VDKNCNVTIK GDTNKAYRLE YQTTIDEAVT PDGGGDVPFK NHATLTSDNN PNCLDAEATV
TATYGKMLDK RNIDYDEANQ EFTWETNYNY GEQTIPKDQA VITDTMGDNL TFEPDSLHLY
SVTFDDKGNE VVGAELVEGK DYKVVINGDG SFAIDFLHDV TGAVKIDYKT KVDGIVEGDV
AVNNRVDVCT GQHSEDDGTA SQQNIIKNTG AVDYQNSTIG WTLAVNQNNY LMENAVITDT
YEPVPGLTMV PNSLVVKDTT TCAQLTLCKD FMVEITRNAD GETGFKVSFI GAYAKTSDAF
HITYTTFFDV TELDANNPAL DHYRNTAAID WTDEACNNHH SEDSKPFKPL PAFDLNAQKS
GVYNAVTKET TWTIAVNLSN NRLVDAFLTD PILTNQTYLA GSLKVYEGNT KPDGSVEKVK
PTQPLTDITM EEPSEKNQNT WRVDFPNDSR TYVIEFKTSV DEKVIEGSAS YDNTASYTNQ
GSSRDVTGKV SIQHGGESVK KGGEYHKDDP DHVYWHVMTN GAQSVLDDVV ITDTPSPNQV
LDPESLVTYG TNVTEDGTIT PDKSVILEEG KDYTLEVTTD NETGQQKIVV KMAHIEAPYY
MEYRSLVTSS AAGSTDTVSN QVSITGNGSE VVHGDDNGDV VVDIDHSGGH ATGTKGKIQL
KKTAMDETTI LAGAHFQIWD QAKTQVLREG TVDATGVITF GGLPQGQYIL VETKAPEGYT
VSDELAKGRV ITIDEETSAE GAQPTIIKND VNKVFLEKMD EKGKKLVNAR FKLEHAVTTP
FTHWEEVPLA PDRTNANGQL EVDSLKPGLY QFTEIEAPTG YLLDTTPKRF IVTQNTSGQI
RDVHVKMLNY QGSAELIKKD QACNPLAGAE FSVLDTTGQA VREHLVSDAN GKVTVTDLAP
GKYQFVETKA PAGYLLNTEP SAFTIAASDR GKPATVTATA NFVNYQCTAK LIKKDVNCHL
LSGATFKVLD AKGETIQTGL TTNNQGEIVA EHLAPCKYRF VETKAPTGYL LNTTPVPFEI
AEKNAGKPAV VVASDNFVSY KGAFQIVKTN SADQPLAGAV FELYDHNKQS LGTTATSGKD
OKITERDLAP GTYYYKEIKA PKLPDGADYT IYPELVKVEI RGDFKCDPET FQLGAFANFK
GRAVFKKIDA NANPLPGTIF KLYRIENGEK IFEREVTAEK DGSLANEDLG AGSYELDELD
ATDGYIVNKQ PIYFVVKKNS NDKQPLDELE FVNYQAEVMG RKVNEQGQTL AGAVFATYNA
DEQNQPQCSP ITELNEAGEK VSEITTDKTG EIYAKGLNEG HYVLVETKAP TGYLLDTTLH
PFDVTAQLGK EQPIALGDLT NYQGTAQLTK ENETGEALAG AVFKVIDETG QTVDGQTNLM
SDKQGKVIAK NLAPGTYRFV ETQAPTSYLL NETPSASFTI AKDNQGKPAT VVLKAPEINY
QGAAKLVKID QQKNALAGAE FKVTDAETGQ TVARSLRSDN QGLVQVNHLQ PGKYTFVETK
APDGYQLSKQ AVAFTIAATA KDKPELVNAG TFVNEKQPVS KKTKPNQPTT KQAARETGWL
CLPKTNTQVN YFFVFIGLML VGLASWLFYK KSKK EF123-3 (SEQ lED NO:459)
CGAAGA GGTTAACAGC GATCCCCAGT TAACGTTACG AGAACTCAAG CAAACCAGCC
AGCAAGAAAT GACCTTAGCG CTTCAAGGAA AAGCACAACC AGTAACACAA CAGGTTCTAG
TOCATTATAG TGCCAATGTC TCAATCAAAG CTGCACATTG GGCAGCGCCC AATAATACGC
GCAAGATTCA AGTCGATGAC CAGAAGAAAC AGATTCAAAT TGAATTGAAT CAGCAACCGT
TACCAGATAC GTTAGTCTTA ACGTTCAACC CTACACCTAC AGAAGATGTG ACGTTTTCTT
ATCGACAACA GCAACCAGCG TTGACGTTAA AGACTGGTAC TGATCCCACA GAATCAACGG
CAATCACGAG TTCGCCAGCC GCATCAGCGA ATGAACGTTC AACAGAAGAA GCATCTACAA
ACTCCTCTGT TCCTCGTTCG TCCGAAGAAA CTGTCGCCAG CACGACAAAA GCGATAGAAA
GTAAAACAAC TGAATCGACG ACTGTCAAAC CGCGCGTAGC AGGACCAACA GATATCACTG
ATTATTTTAC AGGTCATGAA ACAACGATTA TCCATAATTT TGAAGATCCG ATTTATTTAA
ATCCTGATGG AACACCAGCA ACACCGCCGT ATAAACAAGA TGTGACCATT CATTGGAACT
TTAACTGGTC GATTCCACAA
GATGTGCGAG AACAAATGAA AGGAGOCCAT TACTTCCAGT TTCAATTACC TGGCAATTTG
AAACCTAATA AACCAGGTTC AGGTGATTTA GTTGATGCAG AAGGCAATGT CTATGGAACC
TACACAATTA GTGAAGATGG TACGGTTCGT TTTACCTTTA ATGAGCGAAT CACGTCTGAA
AGTGACATTC ACGGGGACTT TTCTTTAGAT ACTCATTTGA ATGATTCAGA TGGGCGGGGC
CCAGGAGATT GGGTGATTGA TATTCCTACA CAAGAAGATT TGCCGCCTGT AGTGATTCCA
ATTGTCCCAG ATACCGAACA ACAAATTGAT AAACAAGGCC ATTTTGATCG AACGCCCAAT
CCTAGTGCGA TTACTTGGAC GGTAGATATC AATCAAGCGA TGAAAGATCA AACAAATCCA
ACTGTGACGG AAACATGGCC AACAGGGAAT ACCTTTAAGT CCGTGAAAGT CTATGAGTTA
GTGATGAATC TTGATGGAAC AATTAAAGAA GTGGGTCGCG AACTTAGTCC AGATGAATAT
ACCGTTGATA AAAATGGCAA TGTGACGATT AAAGGTGACA CCAACAAAGC GTATCGTCTT
GAGTACCAAA CGACGATTGA CGAGGCGGTT ATTCCAGATG GCGGCCGCGA TGTGCCTTTT
AAAAATCACG CGACCTTAAC AAGTCATAAT AATCCAAATG GGTTAGATGC TGAAGCAACT
GTTACCGCCA CATATGGCAA AATGTTAGAC AAGCGCAATA TAGATTACGA CGAAGCCAAT
CAAGAATTCA CTTCGGAAAT TAACTACAAC TATGGTGAAC AAACCATTCC AAAAGACCAA
GCAGTCATTA CAGACACAAT GGGCGATAAT TTAACGTTTG AACCAGATTC TTTACATTTA
TATTCAGTGA CATTTGATGA CAAAGGAAAT GAAGTCGTTG GAGCAGAACT TGTGGAAGGA
AAAGATTACA AAGTGGTAAT CAACGGAGAC GGTTCCTTTG CAATTGACTT TTTACATGAT
GTGACTGGCG CAGTCAAGAT TGATTATAAA ACCAAAGTTG ATGGAATTGT CGAAGGCGAT
GTTGCCGTGA ATAATCGTCT GGATGTTGGC ACTGGTCAGC ATTCAGAAGA TGATGGCACA
GCCAGTCAAC AAAATATTAT TAAAAACACT GGTGCAGTTG ATTATCAAAA TTCAACGATT
GGTTGGACGT TAGCTGTGAA TCAAAATAAT TATTTGATGG AAAATGCCGT GATTACGGAT
ACGTACGAAC CAGTTCCTGG CTTAACTATG GTACCCAATT CGTTGGTTGT CAAAGATACA
ACCACTGGTG CTCAGTTGAC GTTAGGCAAG GATTTCATGG TAGAAATAAC TCGTAATGCA
GATGGTGAAA CAGGCTTTAA GGTAAGTTTT ATAGGGGCGT ATGCCAAAAC AAGTGATCCC
TTCCACATAA CTTATACTAC CTTTTTCGAT GTTACCGAGT TAGACGCTAA CAATCCTGCG
TTGGACCATT ATCGAAATAC CGCTGCCATT GATTGG EF123-4 (SEQ ID NO:460)
EEVNSD GQLTLGEVKQ TSQQEMTLAL QGKAQPVTQE VVVHYSANVS IKAAHWAAPN
NTRKIQVDDQ KKQIQIELNQ QALADTLVLT LNPTATEDVT FSYGQQQRAL TLKTGTDPTE
STAITSSPAA SANEGSTEEA STNSSVPRSS EETVASTTKA IESKTTESTT VKPRVAGPTD
ISDYFTGDET TTIDNFEDPI YLNPDGTPAT PPYKEDVTIH WNFNWSIPED VREQMKACDY
FEFQLPCNLK PNKPGSGDLV DAEGNVYGTY TISEDGTVRF TFNERITSES DIHGDFSLDT
HLNDSDGRGP GDWVTDTPTQ EDLPPVVIPI VPDTEQQIDK QGHFDRTPNP SAITWTVDIN
QAMKDQTNPT VTETWPTGNT FKSVKVYELV MNLDGTIKEV GRELSPDEYT VDKNGNVTIK
GDTNKAYRLE YQTTIDEAVI PDGGGDVPFK NHATLTSDNN PNGLDAEATV TATYGKMLDK
RNIDYDEANQ EFTWETNYNY GEQTTPKDQA VITDTMGDNL TFEPDSLHLY SVTFDDKGNE
VVGAELVEGK DYKVVINGDG SFAIDFLHDV TGAVKIDYKT KVDGIVEGDV AVNNRVDVGT
GQHSEDDGTA SQQNIIKNTG AVDYQNSTIG WTLAVNQNNY LMENAVTTDT YEPVPGLTMV
PNSLVVKDTT TGAQLTLGKD FMVEITRNAD GETGFKVSFI GAYAKTSDAF HITYTTFFDV
TELDANNPAL DHYRNTAAID W EF124-1 (SEQ ID NO:461) TAAAATAAAA
AATTGGTACG AAGTGAACGT TCTCTTCTAT GTGTCGTTAG TAGAGGAAGG ATGAAAGAAA
TGAGAAAGAA TGGTCCAATG GTAAACCGTT GGCTCTACGG GTTGATGTGT TTGTTACTTG
TTCTAAATTA TGGCACACCA CTCATGGCTT TGGCGGAAGA GGTTAACAGC GATGGCCAGT
TAACGTTAGG AGAAGTGAAG CAAACCAGCC AGCAAGAAAT GACCTTAGCG CTTCAAGGAA
AAGCACAACC AGTAACACAA GAGGTTGTAG TGCATTATAG TGCCAATGTG TCAATCAAAG
CTGCACATTG GGCAGCGCCC AATAATACGC GCAAGATTCA AGTGGATGAC CAGAAGAAAC
AGATTCAAAT TGAATTGAAT CAGCAAGCCT TACCAGATAC GTTAGTCTTA ACGTTGAACC
CTACAGCTAC AGAAGATGTG ACGTTTTCTT ATGGACAACA GCAACGAGCG TTGACGTTAA
AGACTGGTAC TGATCCGACA GAATCAACGG CAATCACGAG TTCGCCAGCC GCATCAGCGA
ATGAAGGTTC AACAGAAGAA GCATCTACAA ACTCCTCTGT TCCTCGTTCG TCCGAAGAAA
CTGTCGCCAG CACGACAAAA GCGATAGAAA GTAAAACAAC TGAATCGACG ACTGTCAAAC
CGCGCGTAGC AGGACCAACA GATATCAGTG ATTATTTTAC AGGTGATGAA ACAACGATTA
TCGATAATTT TGAAGATCCG ATTTATTTAA ATCCTGATGG AACACCAGCA ACACCGCCGT
ATAAAGAAGA TGTGACCATT CATTGGAACT TTAACTGGTC GATTCCAGAA GATGTGCGAG
AACAAATGAA AGCAGGCGAT TACTTCGAGT TTCAATTACC TGGCAATTTG AAACCTAATA
AACCAGGTTC AGGTGATTTA GTTGATGCAG AAGGCAATGT CTATGGAACC TACACAATTA
GTGAAGATGG TACGGTTCGT TTTACCTTTA ATGAGCGAAT CACGTCTGAA AGTGACATTC
ACGGGGACTT TTCTTTAGAT ACTCATTTGA ATGATTCAGA TCGGCGGGGC CCAGGAGATT
GGGTGATTGA TATTCCTACA CAAGAAGATT TGCCGCCTGT AGTGATTCCA ATTGTCCCAG
ATACCGAACA ACAAATTGAT AAACAAGGCC ATTTTGATCG AACGCCCAAT CCTAGTGCGA
TTACTTGGAC GGTAGATATC AATCAAGCGA TGAAAGATCA AACAAATCCA ACTGTGACGG
AAACATGGCC AACAGGGAAT ACCTTTAAGT CCGTGAAAGT CTATGAGTTA GTGATGAATC
TTGATGGAAC AATTAAAGAA GTGGGTCGCC AACTTAGTCC AGATGAATAT ACCGTTGATA
AAAATGGCAA TGTGACGATT AAAGGTGACA CCAACAAAGC GTATCGTCTT GAGTACCAAA
CGACGATTGA CGAGGCGGTT ATTCCAGATG GCGGCGGCGA TGTGCCTTTT AAAAATCACG
CGACGTTAAC AAGTGATAAT AATCCAAATG GGTTAGATGC TGAAGCAACT GTTACCGCCA
CATATGGCAA AATGTTAGAC AAGCGCAATA TAGATTACCA CGAAGCCAAT CAAGAATTCA
CTTGGGAAAT TAACTACAAC TATGGTGAAC AAACCATTCC AAAAGACCAA GCAGTCATTA
CAGACACAAT GGGGGATAAT TTAACGTTTG AACCAGATTC TTTACATTTA TATTCAGTGA
CATTTGATGA CAAAGGAAAT GAAGTCGTTG GAGCAGAACT TGTGGAAGGA AAAGATTACA
AAGTGGTAAT CAACGGAGAC GGTTCCTTTG CAATTGACTT TTTACATGAT GTGACTGGCG
CAGTCAAGAT TGATTATAAA ACCAAAGTTG ATGGAATTGT CGAAGGCGAT GTTGCCGTGA
ATAATCGTGT GGATGTTGGC ACTGGTCAGC ATTCAGAAGA TGATGGCACA GCCAGTCAAC
AAAATATTAT TAAAAACACT GGTGCAGTTG ATTATCAAAA TTCAACGATT GGTTGGACGT
TAGCTGTGAA TCAAAATAAT TATTTGATGG AAAATGCCGT GATTACGGAT ACGTACGAAC
CAGTTCCTGG CTTAACTATG GTACCCAATT CGTTGGTTGT CAAAGATACA ACCACTGGTG
CTCAGTTGAC GTTAGGCAAG GATTTCATGG TAGAAATAAC TCGTAATGCA GATGGTGAAA
CACGCTTTAA GGTAAGTTTT ATAGGGGCGT ATGCCAAAAC AAGTGATGCC TTCCACATAA
CTTATACTAC CTTTTTCGAT GTTACCGAGT TAGACGCTAA CAATCCTGCG TTGGACCATT
ATCCAAATAC CGCTGCCATT GATTGGACGG ATCAAGCAGG AAACAATCAT CATTCAGAAG
ATAGTAAACC GTTTAAACCT TTACCTGCTT TTGATTTAAA TGCGCAAAAA AGCGGTGTTT
ACAATGCCGT CACCAAAGAA ATCACTTGGA CGATTGCGGT TAATTTAAGT AATAATCGTT
TAGTCGACGC CTTTTTGACG GATCCAATTT TAACCAATCA AACCTATTTG GCTGGGAGCT
TGAAAGTCTA TGAAGGCAAT ACAAAGCCAG ATGGTTCGGT TGAAAAAGTG AAACCAACGC
AACCGTTGAC GGATATCACA ATGGAAGAAC CAAGCGAGAA AAACCAAAAT ACTTGGCGTG
TTGATTTTCC TAATGATAGT CGTACGTATG TGATTGAATT TAAGACGTCT GTTGATGAAA
AAGTTATCGA AGGTTCGGCT AGTTATGACA ATACCGCATC TTATACAAAC CAAGGTTCTT
CACGTGATGT GACAGGAAAA GTTTCTATTC AACATGGTGG CGAATCAGTG AAAAAAGGTG
GCGAATACCA CAAAGATGAT CCAGATCATG TGTACTGGCA TGTAATGATC AATGGCGCCC
AATCGGTTTT AGACGATGTG GTTATTACTG ATACACCCTC ACCAAACCAA GTGCTAGATC
CCGAGTCATT GGTGATTTAC GGTACCAACG TAACAGAAGA CGGAACTATT ACGCCAGATA
AATCTGTTAT TTTAGAAGAA GGAAAAGATT ACACACTGGA AGTTACCACC GATAATGAAA
CAGGACAACA AAAAATTGTC GTTAAAATGG CCCATATTGA AGCACCTTAT TATATGGAAT
ATCGTAGTTT AGTGACTTCT TCAGCGGCGG GGAGTACAGA CACGGTATCC AACCAAGTGT
CAATTACTGG AAATGGTTCA GAAGTCGTTC ATGGGGATGA CAATGGCGAT GTGGTCGTTG
ACATTGATCA CAGTGGCGGG CATGCCACAG GGACTAAAGG CAAAATTCAG CTGAAGAAAA
CAGCCATGGA TGAGACGACT ATTTTAGCAG GCGCCCATTT CCAAATTTGG GACCAAGCTA
AAACACAAGT CCTACGTGAA GGTACAGTAG ATGCCACCGG GGTTATCACA TTTGGTGGGT
TGCCACAAGG GCAATACATT TTGGTGGAGA CAAAAGCACC AGAAGGCTAT ACAGTTTCGG
ACGAATTAGC TAAAGGCCGA GTCATTACTA TTGATGAAGA AACTTCAGCC GAAGGAGCAC
AACCAACCAT TATTAAAAAC GATGTCAATA AAGTATTTTT AGAAAAAATG GATGAGAAGG
GTAAAAAGTT AGTCAATGCT CGCTTTAAAT TAGAGCATGC CGTAACCACG CCGTTTACTC
ATTGGGAAGA AGTTCCCCTT GCGCCGGATC GAACCAACGC GAATGGCCAG TTAGAGGTGG
ATAGTTTAAA ACCAGGGCTT TATCAGTTCA CAGAAATCGA AGCACCGACA GGCTATCTTT
TAGACACCAC CCCCAAACGA TTCATCGTGA CACAAAATAC GAGCGGACAA ATTCGTGATG
TTCATGTCAA AATGCTTAAT TACCAAGGTT CTGCTGAACT AATTAAAAAA GACCAAGCAG
GCAATCCATT AGCAGGTGCT GAATTTTCAG TCCTTGACAC CACAGGACAA GCAGTTCGAG
AACACTTAGT TTCGGATGCA AACGGAAAAG TCACAGTGAC GGATTTAGCC CCAGGAAAAT
ATCAATTTGT GGAAACCAAA GCGCCAGCAG GGTACCTTTT AAACACTGAA CCAAGTGCTT
TCACGATTGC AGCAAGCGAT CGGGGCAAAC CAGCAACAGT TATAGCAACG GCTAACTTTG
TTAACTATCA AGGCACGGCT AAATTAATCA AAAAAGATGT GAATGGACAC TTATTAAGTG
GTGCGACATT TAAAGTGCTT CATGCGAAGG GAGAAACGAT TCAAACAGGC TTGACGACAA
ATAATCAAGG GGAAATTGTT GCAGAGCACT TAGCCCCAGG AAAATATCGC TTTGTAGAAA
CCAAAGCGCC AACAGGCTAT TTATTAAATA CCACGCCAGT CCCATTTGAA ATTGCTGAGA
AAAATGCTGG TAAACCAGCG GTCGTGGTTG CTAGTGACAA CTTTGTGACT TACAAAGGGG
CTTTCCAAAT CGTGAAAACG AATAGCGCAG ACCAACCATT AGCAGGTGCT GTTTTTGAAT
TATATGATCA CAATAAACAA TCATTAGGGA TTACAGCAAC GAGTGGCAAA GATGGCAAAA
TTATCTTTAG AGACTTGGCG CCAGGTACCT ATTATTACAA AGAAATCAAA GCACCAAAAT
TACCAGATOG CGCAGATTAT ATTATTTATC CTGAATTAGT AAAAGTAGAA ATTCGTGGTG
ATTTCAAAGG TGATCCGGAG ATTTTCCAAT TAGGGGCCTT CGCCAATTTC AAAGGACGCG
CCGTCTTTAA GAAAATTGAT GCCAATGCGA ACCCACTTCC AGGAACGATT TTTAAATTGT
ATCGAATCGA AAACGGGGAA AAAATCTTTG AAAGAGAAGT AACTGCTGAA AAAGATGGTT
CATTGGCTAT GGAGGATTTA GGTGCTGGTA GCTATGAATT AGATGAACTG GATGCAACGG
ATGGCTATAT CGTCAATAAA CAACCCATTT ATTTTGTAGT GAAGAAGAAT TCAAATGATA
AACAACCACT AGATGAGTTA GAGTTTGTAA ATTATCAAGC AGAAGTAATG GGACGTAAAG
TCAACGAGCA AGGTCAAACC TTAGCGGGTG CAGTTTTTGC AATTTACAAT GCCGATGAGC
AGAATCAGCC CCAAGGTTCA CCGATAACAT TCTTGAATCG TGCAGGAGAA AAAGTTTCTG
AAATAACAAC GGATAAGACT GGCGAAATTT ACGCTAAAGG GCTAAATGAA GGGCATTACG
TTTTAGTGGA AACGAAAGCA CCAACAGGCT ATCTGTTAGA CACAACGCTA CATCCATTTG
ATGTAACCGC CCAATTAGGA AAAGAGCAGC CAATTGCTTT AGGCGATCTT ATCAATTATC
AAGGAACTGC TCAATTAACC AAAGAAAACG AAACAGGTGA AGCATTGGCA GGTGCGCTGT
TTAAGGTCAT TGATGAAACA GGGCAAACCG TAGATGGACA AACCAATCTG ATGTCTGACA
AGCAAGGCAA AGTCATTGCG AAAAACTTAG CACCGGGAAC GTATCGTTTT GTGGAGACAC
AAGCGCCAAC TAGCTATCTT CTTAATGAAA CGCCAAGCCC AAGCTTTACG ATTGCCAAAG
ACAACCAAGG CAAACCAGCC ACTGTGGTAC TTAAAGCACC TTTTATTAAT TACCAAGGTG
CTGCCAAGCT GGTGAAAATT GATCAGCAAA AGAATGCCTT AGCAGGTGCT GAATTTAAAG
TGACAGATGC AGAGACAGGG CAAACTGTCG CTCGTTCATT ACGTTCTGAC AACCAAGGGT
TAGTTCAAGT GAACCACTTA CAACCAGGAA AATATACCTT TGTGGAAACA AAAGCACCGG
ATGGTTACCA ACTGTCTAAG CAAGCTGTCG CATTCACTAT TGCGGCAACA GCGAAAGACA
AACCTGAACT CGTGAATGCG GGCACGTTTG TTAACGAGAA ACAACCTGTA TCCAAAAAAA
CAAAACCAAA TCAGCCAACA ACGAAACAAG CAGCTAGAGA GACAGGTTGG CTTGGTTTAC
CGAAAACCAA CACACAAGTC AATTACTTCT TTGTCTTTAT CGGCCTCATG TTGGTCGGTT
TGGCAAGTTG GCTCTTCTAT AAAAAGAGCA AGAAATAA EF124-2 (SEQ ID NO:462)
MRKNGPMV NRWLYGLMCL LLVLNYGTPL MALAEEVNSD GQLTLGEVKQ TSQQEMTLAL
QGKAQPVTQE VVVHYSANVS IKAAHWAAPN NTRKIQVDDQ KKQIQIELNQ QALADTLVLT
LNPTATEDVT FSYGQQQRAL TLKTGTDPTE STAITSSPAA SANEGSTEEA STNSSVPRSS
EETVASTTKA IESKTTESTT VKPRVAGFTD ISDYFTGDET TITDNFEDPI YLNPDGTPAT
PPYKEDVTIH WNFNWSIPED VREQMKAGDY FEFQLPGNLK PNKPGSGDLV DAEGNVYGTY
TISEDGTVRF TFNERITSES DIHGDFSLDT HLNDSDGRGP GDWVIDIPTQ EDLPPVVIPT
VPDTEQQIDK QGHFDRTPNP SAITWTVDIN QAMKDQTNPT VTETWPTGNT FKSVKVYELV
MNLDGTIKEV GRELSPDEYT VDKNGNVTIK GDTNKAYRLE YQTTIDEAVI PDGGGDVPFK
NHATLTSDNN PNGLDAEATV TATYGKMLDK RNIDYDEANQ EFTWEINYNY GEQTTPKDQA
VTTDTMGDNL TFEPDSLHLY SVTFDDKGNE VVGAELVEGK DYKVVINGDG SFAIDFLHDV
TGAVKIDYKT KVDGIVEGDV AVNNRVDVGT GQHSEDDGTA SQQNIIKNTG AVDYQNSTIG
WTLAVNQNNY LMENAVITDT YEPVPGLTMV PNSLVVKDTT TGAQLTLGKD FMVEITRNAD
GETGFKVSFI GAYAKTSDAF HTTYTTFFDV TELDANNPAL DHYRNTAAID WTDEAGNNHH
SEDSKPFKPL PAFDLNAQKS GVYNAVTKEI TWTIAVNLSN NRLVDAFLTD PILTNQTYLA
GSLKVYEGNT KPDGSVEKVK PTQPLTDITM EEPSEKNQNT WRVDFPNDSR TYVIEFKTSV
DEKVIEGSAS YDNTASYTNQ GSSRDVTGKV SIQHGGESVK KGGEYHKDDP DHVYWHVMIN
GAQSVLDDVV ITDTPSPNQV LDPESLVIYG TNVTEDGTIT PDKSVILEEG KDYTLEVTTD
NETGQQKIVV KMAHIEAPYY MEYRSLVTSS AAGSTDTVSN QVSTTGNGSE VVHGDDNGDV
VVDIDHSGGH ATGTKGKIQL KKTAMDETTI LAGAHFQIWD QAKTQVLREG TVDATGVTTF
CGLPQGQYIL VETKAPEGYT VSDELAKGRV ITIDEETSAE GAQPTIIKND VNKVFLEKMD
EKGKKLVNAR FKLEHAVTTP FTHWEEVPLA PDRTNANCQL EVDSLKPGLY QFTEIEAPTG
YLLDTTPKRF IVTQNTSGQI RDVHVKMLNY QGSAELTKKD QAGNPLAGAE FSVLDTTGQA
VREHLVSDAN GKVTVTDLAP GKYQFVETKA PAGYLLNTEP SAFTTAASDR GKPATVIATA
NFVNYQGTAK LIKKDVNGHL LSGATFKVLD AKCETIQTGL TTNKQCEIVA EHLAPGKYRF
VETKAPTGYL LNTTPVPFEI AEKNAGKPAV VVASDNFVSY KGAFQIVKTN SADQPLAGAV
FELYDHNKQS LGITATSGKD GKTIFRDLAP GTYYYKEIKA PKLPDCADYI IYPELVKVEI
RGDFKGDPEI FQLCAFANFK GPAVFKKIDA NANPLPGTIF KLYRIENGEK IFEREVTAEK
DGSLAMEDLG AGSYELDELD ATDGYIVNKQ PIYFVVKKNS NDKQPLDELE FVNYQAEVMC
RKVNEQGQTL ACAVEATYNA DEQNQPQGSP ITFLNRACEK VSETTTDKTG EIYAKGLNEG
HYVLVETKAP TGYLLDTTLH PFDVTAQLGK EQPIALGDLI NYQGTAQLTK ENETGEALAG
AVFKVTDETG QTVDGQTNLM SDKQGKVIAK NLAPGTYRFV ETQAPTSYLL NETPSASFTT
AKDNQGKPAT VVLKAPFINY QCAAKLVKID QQKNALAGAE FKVTDAETGQ TVARSLRSDN
QGLVQVNHLQ PGKYTFVETK APDGYQLSKQ AVAFTIAATA KDKPELVNAG TFVNEKQPVS
KKTKPNQPTT KQAARETGWL GLPKTNTQVN YFFVFIGLML VGLASWLFYK KSKK EF124-3
(SEQ ID NO:463) TGCCTTCCACATAACTTATACTACCTTTTTGACG GATCCAATTT
TAACCAATCA AACCTATTTG GCTGGGAGCT TGAAAGTCTA TGAAGGCAAT ACAAAGCCAG
ATGGTTCGGT TGAAAAAGTG AAACCAACGC AACCCTTGAC GCATATCACA ATGGAAGAAC
CAAGCGAGAA AAACCAAAAT ACTTGCCGTG TTGATTTTCC TAATGATAGT CGTACGTATG
TGATTGAATT TAAGACGTCT GTTGATGAAA AAGTTATCGA AGGTTCGGCT AGTTATGACA
ATACCGCATC TTATACAAAC CAAGGTTCTT CACGTGATGT GACAGGAAAA CTTTCTATTC
AACATGGTCC CGAATCAGTG AAAAAAGGTC GCGAATACCA CAAAGATGAT CCACATCATG
TCTACTGCCA TGTAATGATC AATGGCGCCC AATCGGTTTT AGACGATGTG GTTATTACTG
ATACACCCTC ACCAAACCAA GTGCTAGATC CCGAGTCATT GGTGATTTAC GGTACCAACG
TAACAGAAGA CGGAACTATT ACOCCAGATA AATCTCTTAT TTTAGAAGAA CGAAAAGATT
ACACACTGGA AGTTACCACC GATAATGAAA CAGGACAACA AAAAATTGTC GTTAAAATGG
CCCATATTGA AGCACCTTAT TATATGGAAT ATCGTAGTTT AGTGACTTCT TCAGCCGCGG
OGACTACAGA CACGGTATCC AACCAAGTGT CAATTACTGG AAATGGTTCA GAAGTCGTTC
ATGGGGATGA CAATGGCGAT GTCGTCGTTG ACATTCATCA CAGTGCCGGG CATGCCACAG
GGACTAAAGG CAAAATTCAG CTGAAGAAAA CAGCCATCGA TOAGACCACT ATTTTAGCAG
GCGCCCATTT CCAAATTTGG GACCAAGCTA AAACACAAGT CCTACGTGAA GGTACACTAG
ATGCCACCGG GGTTATCACA TTTGGTGGGT TGCCACAAGG GCAATACATT TTGGTGGAGA
CAAAAGCACC AGAAGGCTAT ACAGTTTCGG ACGAATTAGC TAAAGGCCGA GTCATTACTA
TTGATGAAGA AACTTCAGCC GAAGGAGCAC AACCAACCAT TATTAAAAAC CATGTCAATA
AAGTATTTTT AGAAAAAATG GATGAGAAGG GTAAAAAGTT AGTCAATGCT CGCTTTAAAT
TAGAGCATGC CGTAACCACG CCGTTTACTC ATTCGGAAGA AGTTCCCCTT GCGCCGGATC
GAACCAACGC GAATGGCCAG TTAGAGGTGG ATAGTTTAAA ACCAGCGCTT TATCAGTTCA
CAGAAATCGA AGCACCGACA GGCTATCTTT TAGACACCAC CCCCAAACGA TTCATCGTGA
CACAAAATAC GAGCGGACAA ATTCGTGATG TTCATGTCAA AATGCTTAAT TACCAAGGTT
CTGCTGAACT AATTAAAAAA GACCAAGCAG GCAATCCATT AGCAGGTGCT CAATTTTCAG
TCCTTGACAC CACAGGACAA GCAGTTCGAG AACACTTAGT TTCGGATGCA AACGGAAAAG
TCACAGTGAC GGATTTAGCC CCAGGAAAAT ATCAATTTGT GGAAACCAAA GCGCCAGCAG
GGTACCTTTT AAACACTGAA CCAAGTGCTT TCACGATTGC AGCAAGCGAT CGGGGCAAAC
CAGCAACAGT TATAGCAACG GCTAACTTTG TTAACTATCA AGGCACGGCT AAATTAATCA
AAAAAGATGT GAATGGACAC TTATTAAGTG GTGCGACATT TAAAGTGCTT GATGCGAAGG
GAGAAACGAT TCAAACAGGC TTGACGACAA ATAATCAAGG G EF124-4 (SEQ ID
NO:464) AF HTTYTTFFDV TELDANNPAL DHYRNTAAID WTDEAGNNHH SEDSKPFKPL
PAFDLNAQKS GVYNAVTKEI TWTIAVNLSN NRLVDAFLTD PILTNQTYLA GSLKVYEGNT
KPDGSVEKVK PTQPLTDITM EEPSEKNQNT WRVDFPNDSR TYVIEFKTSV DEKVIEGSAS
YDNTASYTNQ GSSRDVTGKV SIQHGGESVK KGGEYHKDDP DHVYWHVMTN GAQSVLDDVV
ITDTPSPNQV LDPESLVIYG TNVTEDGTIT PDKSVILEEG KDYTLEVTTD NETGQQKIVV
KMAHIEAPYY MEYRSLVTSS AAGSTDTVSN QVSITGNGSE VVHGDDNGDV VVDIDHSGGH
ATGTKGKIQL KKTAMDETTI LAGAHFQIWD QAKTQVLREG TVDATGVITF GGLPQGQYIL
VETKAPEGYT VSDELAKGRV ITIDEETSAE GAQPTIIKND VNKVFLEKMD EKGKKLVNAR
FKLEHAVTTP FTHWEEVPLA PDRTNANGQL EVDSLKPGLY QFTEIEAPTG YLLDTTPKRF
IVTQNTSGQI RDVHVKMLNY QGSAELIKKD QAGNPLAGAE FSVLDTTGQA VREHLVSDAN
GKVTVTDLAP GKYQFVETKA PAGYLLNTEP SAFTIAASDR GKPATVTATA
NFVNYQGTAK
LIKKDVNGHL LSGATFKVLD AKGETTQTGL TTNNQG EF125-1 (SEQ ID NO:465)
TAAAATAAAA AATTGGTACG AAGTGAACGT TCTCTTCTAT CTGTCGTTAG TAGAGGAAGG
ATGAAAGAAA TGAGAAAGAA TGGTCCAATG GTAAACCGTT GGCTCTACGG GTTGATGTGT
TTGTTACTTG TTCTAAATTA TCGCACACCA CTCATGGCTT TGGCGGAAGA GGTTAACAGC
GATGGCCAGT TAACGTTAGG AGAAGTGAAG CAAACCAGCC AGCAAGAAAT GACCTTAGCG
CTTCAAGGAA AAGCACAACC AGTAACACAA GAGGTTGTAG TGCATTATAG TGCCAATGTG
TCAATCAAAG CTGCACATTG GGCAGCGCCC AATAATACGC GCAAGATTCA AGTGGATGAC
CACAAGAAAC AGATTCAAAT TGAATTGAAT CAGCAAGCGT TACCAGATAC GTTAGTCTTA
ACGTTGAACC CTACAGCTAC AGAAGATGTG ACGTTTTCTT ATGGACAACA GCAACGAGCG
TTGACGTTAA AGACTGGTAC TGATCCGACA GAATCAACGG CAATCACGAG TTCGCCAGCC
GCATCAGCGA ATGAAGGTTC AACAGAAGAA GCATCTACAA ACTCCTCTGT TCCTCGTTCG
TCCGAAGAAA CTGTCGCCAG CACGACAAAA GCGATAGAAA GTAAAACAAC TGAATCGACG
ACTGTCAAAC CGCGCGTAGC AGGACCAACA GATATCAGTG ATTATTTTAC AGGTGATGAA
ACAACGATTA TCGATAATTT TGAAGATCCG ATTTATTTAA ATCCTGATGG AACACCAGCA
ACACCGCCCT ATAAAGAAGA TGTGACCATT CATTGGAACT TTAACTGGTC GATTCCAGAA
GATGTGCGAG AACAAATGAA AGCAGGCGAT TACTTCGAGT TTCAATTACC TGGCAATTTG
AAACCTAATA AACCAGGTTC AGGTGATTTA GTTGATGCAG AAGGCAATGT CTATGGAACC
TACACAATTA GTGAAGATGG TACGGTTCGT TTTACCTTTA ATGAGCGAAT CACGTCTGAA
AGTGACATTC ACGGGGACTT TTCTTTAGAT ACTCATTTGA ATGATTCAGA TGGGCGGGGC
CCAGGAGATT GGGTGATTGA TATTCCTACA CAAGAAGATT TGCCGCCTGT AGTGATTCCA
ATTGTCCCAG ATACCGAACA ACAAATTGAT AAACAAGGCC ATTTTGATCG AACGCCCAAT
CCTAGTGCGA TTACTTGGAC GGTAGATATC AATCAAGCGA TGAAAGATCA AACAAATCCA
ACTGTGACGG AAACATGGCC AACAGGGAAT ACCTTTAAGT CCGTGAAAGT CTATGAGTTA
GTGATGAATC TTGATGGAAC AATTAAAGAA GTGGGTCGCG AACTTAGTCC AGATGAATAT
ACCGTTGATA AAAATGGCAA TGTGACGATT AAAGGTGACA CCAACAAAGC GTATCGTCTT
GAGTACCAAA CGACGATTGA CGAGGCGGTT ATTCCAGATG GCGGCGGCGA TGTGCCTTTT
AAAAATCACG CGACGTTAAC AAGTGATAAT AATCCAAATG GGTTAGATGC TGAAGCAACT
GTTACCGCCA CATATGGCAA AATGTTAGAC AAGCGCAATA TAGATTACGA CGAAGCCAAT
CAAGAATTCA CTTGGCAAAT TAACTACAAC TATGGTGAAC AAACCATTCC AAAAGACCAA
GCAGTCATTA CAGACACAAT GGGGGATAAT TTAACGTTTG AACCAGATTC TTTACATTTA
TATTCAGTGA CATTTGATGA CAAAGGAAAT GAAGTCGTTG GAGCAGAACT TGTGGAAGGA
AAAGATTACA AAGTGGTAAT CAACGGAGAC GGTTCCTTTG CAATTGACTT TTTACATGAT
GTGACTGGCG CAGTCAAGAT TGATTATAAA ACCAAAGTTG ATGGAATTGT CGAAGGCGAT
GTTGCCGTGA ATAATCGTGT GGATGTTGGC ACTGGTCAGC ATTCAGAAGA TGATGGCACA
GCCAGTCAAC AAAATATTAT TAAAAACACT GGTGCAGTTG ATTATCAAAA TTCAACCATT
GGTTGGACGT TAGCTGTGAA TCAAAATAAT TATTTGATGG AAAATGCCGT GATTACGGAT
ACGTACGAAC CAGTTCCTGG CTTAACTATG GTACCCAATT CGTTGGTTGT CAAAGATACA
ACCACTGGTG CTCAGTTGAC GTTAGGCAAG GATTTCATGG TAGAAATAAC TCGTAATGCA
GATGGTGAAA CAGCCTTTAA GGTAAGTTTT ATAGGGGCGT ATGCCAAAAC AAGTGATGCC
TTCCACATAA CTTATACTAC CTTTTTCGAT GTTACCGAGT TAGACGCTAA CAATCCTGCG
TTGGACCATT ATCGAAATAC CGCTGCCATT GATTGGACGG ATGAAGCAGG AAACAATCAT
CATTCAGAAG ATAGTAAACC GTTTAAACCT TTACCTGCTT TTGATTTAAA TGCGCAAAAA
AGCGGTGTTT ACAATGCCGT CACCAAAGAA ATCACTTGGA CGATTGCGGT TAATTTAAGT
AATAATCGTT TAGTCGACGC CTTTTTGACG GATCCAATTT TAACCAATCA AACCTATTTG
GCTGGGAGCT TCAAAGTCTA TGAAGGCAAT ACAAAGCCAG ATGGTTCGGT TGAAAAAGTG
AAACCAACGC AACCGTTGAC GGATATCACA ATGGAAGAAC CAAGCGAGAA AAACCAAAAT
ACTTGGCGTG TTGATTTTCC TAATGATAGT CGTACGTATG TGATTGAATT TAAGACGTCT
GTTGATGAAA AAGTTATCGA AGGTTCGGCT AGTTATGACA ATACCGCATC TTATACAAAC
CAAGGTTCTT CACGTGATGT GACAGGAAAA GTTTCTATTC AACATGGTGG CGAATCAGTG
AAAAAAGGTG GCGAATACCA CAAAGATGAT CCAGATCATG TGTACTGGCA TGTAATGATC
AATGGCGCCC AATCGGTTTT AGACGATGTG GTTATTACTG ATACACCCTC ACCAAACCAA
GTGCTAGATC CCGAGTCATT GGTGATTTAC GGTACCAACG TAACAGAAGA CGGAACTATT
ACGCCAGATA AATCTGTTAT TTTAGAAGAA GGAAAAGATT ACACACTGGA AGTTACCACC
GATAATGAAA CAGGACAACA AAAAATTGTC GTTAAAATGG CCCATATTGA AGCACCTTAT
TATATGGAAT ATCGTAGTTT AGTGACTTCT TCAGCGGCGG GGAGTACAGA CACGGTATCC
AACCAAGTGT CAATTACTGG AAATGGTTCA GAAGTCGTTC ATGGGGATGA CAATGGCGAT
GTGGTCGTTC ACATTGATCA CAGTGGCGGG CATGCCACAC GGACTAAAGC CAAAATTCAG
CTGAAGAAAA CAGCCATGGA TGAGACGACT ATTTTAGCAG GCGCCCATTT CCAAATTTGG
GACCAAGCTA AAACACAAGT CCTACGTGAA GGTACAGTAG ATGCCACCGG GGTTATCACA
TTTGGTGGGT TGCCACAAGG GCAATACATT TTGGTGGAGA CAAAAGCACC AGAAGGCTAT
ACAGTTTCCG ACGAATTAGC TAAAGGCCGA GTCATTACTA TTGATCAAGA AACTTCAGCC
GAAGGAGCAC AACCAACCAT TATTAAAAAC GATGTCAATA AAGTATTTTT AGAAAAAATG
GATGACAAGG GTAAAAAGTT AGTCAATGCT CGCTTTAAAT TAGAGCATGC CGTAACCACG
CCGTTTACTC ATTGGGAAGA AGTTCCCCTT GCGCCGGATC GAACCAACGC GAATGGCCAG
TTAGAGGTGG ATAGTTTAAA ACCAGGGCTT TATCAGTTCA CAGAAATCGA AGCACCGACA
GGCTATCTTT TAGACACGAC CCCCAAACGA TTCATCGTGA CACAAAATAC GAGCGGACAA
ATTCGTGATG TTCATGTCAA AATGCTTAAT TACCAAGGTT CTGCTGAACT AATTAAAAAA
GACCAAGCAG GCAATCCATT AGCAGGTGCT GAATTTTCAG TCCTTGACAC CACAGGACAA
GCAGTTCGAG AACACTTAGT TTCGGATGCA AACGGAAAAG TCACAGTGAC GGATTTAGCC
CCAGGAAAAT ATCAATTTGT GGAAACCAAA GCGCCAGCAG GGTACCTTTT AAACACTGAA
CCAAGTGCTT TCACGATTGC AGCAAGCGAT CGGGGCAAAC CAGCAACAGT TATAGCAACG
GCTAACTTTG TTAACTATCA AGGCACGGCT AAATTAATCA AAAAAGATGT GAATGGACAC
TTATTAAGTG GTGCGACATT TAAAGTGCTT GATGCGAAGG GAGAAACGAT TCAAACAGGC
TTGACGACAA ATAATCAAGG GGAAATTGTT GCAGAGCACT TAGCCCCAGG AAAATATCGC
TTTGTAGAAA CCAAAGCGCC AACAGGCTAT TTATTAAATA CCACGCCACT CCCATTTGAA
ATTGCTGAGA AAAATGCTGG TAAACCAGCG GTCGTGGTTG CTAGTGACAA CTTTGTGAGT
TACAAAGGGG CTTTCCAAAT CGTGAAAACG AATAGCGCAG ACCAACCATT AGCAGGTGCT
GTTTTTGAAT TATATGATCA CAATAAACAA TCATTAGGGA TTACAGCAAC GAGTGGCAAA
GATGGCAAAA TTATCTTTAG AGACTTGGCG CCAGCTACCT ATTATTACAA AGAAATCAAA
GCACCAAAAT TACCAGATGG CGCAGATTAT ATTATTTATC CTGAATTAGT AAAAGTAGAA
ATTCGTGGTG ATTTCAAAGG TGATCCGGAG ATTTTCCAAT TAGGGGCCTT CGCCAATTTC
AAAGGACGCG CCGTCTTTAA GAAAATTGAT GCCAATGCGA ACCCACTTCC AGGAACGATT
TTTAAATTGT ATCGAATCGA AAACGGGGAA AAAATCTTTG AAAGAGAAGT AACTGCTGAA
AAAGATGGTT CATTGGCTAT GGAGGATTTA GGTGCTGGTA GCTATGAATT AGATGAACTG
GATGCAACGG ATOOCTATAT CGTCAATAAA CAACCCATTT ATTTTGTAGT GAAGAAGAAT
TCAAATCATA AACAACCACT AGATGAGTTA GAGTTTGTAA ATTATCAAGC AGAAGTAATG
GGACGTAAAG TCAACGAGCA AGGTCAAACC TTAGCGGGTG CAGTTTTTGC AATTTACAAT
GCCGATGAGC AGAATCAGCC CCAAGGTTCA CCCATAACAT TCTTGAATCG TGCAGGAGAA
AAAGTTTCTG AAATAACAAC GGATAAGACT GCCGAAATTT ACGCTAAAGG GCTAAATGAA
GGGCATTACG TTTTAGTGGA AACGAAAGCA CCAACAGGCT ATCTGTTAGA CACAACGCTA
CATCCATTTG ATGTAACCGC CCAATTAGGA AAAGAGCAGC CAATTGCTTT ACGCGATCTT
ATCAATTATC AAGGAACTGC TCAATTAACC AAAGAAAACG AAACAGGTGA AGCATTGGCA
GGTGCGGTGT TTAAGGTCAT TCATGAAACA GGGCAAACCG TAGATGGACA AACCAATCTG
ATCTCTGACA AGCAAGGCAA AGTCATTGCC AAAAACTTAG CACCGGGAAC GTATCGTTTT
GTGGAGACAC AAGCGCCAAC TAGCTATCTT CTTAATGAAA CGCCAAGCGC AAGCTTTACG
ATTGCCAAAG ACAACCAAGG CAAACCAGCC ACTGTGGTAC TTAAAGCACC TTTTATTAAT
TACCAAGGTG CTGCCAAGCT GGTGAAAATT GATCAGCAAA AGAATGCCTT AGCAGGTGCT
GAATTTAAAG TGACAGATGC AGAGACAGGG CAAACTGTCG CTCGTTCATT ACGTTCTGAC
AACCAAGGGT TAGTTCAAGT GAACCACTTA CAACCAGGAA AATATACCTT TGTGGAAACA
AAAGCACCGG ATOOTTACCA ACTGTCTAAC CAACCTGTCG CATTCACTAT TCCGGCAACA
GCGAAAGACA AACCTCAACT CGTGAATGCG GGCACGTTTG TTAACGAGAA ACAACCTGTA
TCCAAAAAAA CAAAACCAAA TCAGCCAACA ACGAAACAAG CAGCTAGAGA GACAGGTTGG
CTTGGTTTAC CGAAAACCAA CACACAAGTC AATTACTTCT TTCTCTTTAT CCGCCTCATG
TTCCTCGGTT TGGCAACTTG CCTCTTCTAT AAAAAGAGCA ACAAATAA EF125-2 (SEQ
ID NO:466) MRKNGPMV NRWLYGLMCL LLVLNYGTPL MALAEEVNSD GQLTLGEVKQ
TSQQEMTLAL QCKAQPVTQE VVVHYSANVS IKAAHWAAPN NTRKIQVDDQ KKQIQIELNQ
QALADTLVLT LNPTATEDVT FSYGQQQRAL TLKTGTDPTE STAITSSPAA SANEGSTEEA
STNSSVPRSS EETVASTTKA IESKTTESTT VKPRVAGPTD ISDYFTGDET TIIDNFEDPI
YLNPDCTPAT PPYKEDVTIH WNFNWSIPED VREQMKAGDY FEFQLPGNLK PNKPGSGDLV
DAECNVYGTY TISEDGTVRF TFNERITSES DIHGDFSLDT HLNDSDGRGP GDWVIDIPTQ
EDLFPVVIPI VPDTEQQTDK QGHFDRTPNP SATTWTVDIN QAMKDQTNPT VTETWPTGNT
FKSVKVYELV HNLDGTIKEV GRELSPDEYT VDKNGNVTIK GDTNKAYRLE YQTTIDEAVI
PDGGGDVPFK NHATLTSDNN PNGLDAEATV TATYGKNLDK RNIDYDEANQ SETWETNYNY
GEQTIPKDQA VITDTMGDNL TFEPDSLHLY SVTFDDKGNE VVGAELVEGK DYKVVINGDC
SFAIDFLHDV TGAVKIDYKT KVDGIVEGDV AVNNRVDVGT GQHSEDDGTA SQQNIIKNTC
AVDYQNSTIG WTLAVNQNNY LMENAVITDT YEPVPGLTMV PNSLVVKDTT TGAQLTLCKD
FMVEITRNAD GETGFKVSFI GAYAKTSDAF HITYTTFFDV TELDANNPAL DHYRNTAAID
WTDEAGNNHH SEDSKPFKPL PAFDLNAQKS GVYNAVTKEI TWTIAVNLSN NRLVDAFLTD
PILTNQTYLA CSLKVYEGNT KPDGSVEKVK PTQPLTDITM EEPSEKNQNT WRVDFPNDSR
TYVIEFKTSV DEKVIECSAS YDNTASYTNQ GSSRDVTGKV SIQHGGESVK KGGEYHKDDP
DHVYWHVMIN GAQSVLDDVV ITDTPSPNQV LDPESLVIYG TNVTEDGTIT PDKSVILEEG
KDYTLEVTTD NETCQQKIVV KMAHIEAPYY MEYRSLVTSS AAGSTDTVSN QVSITGNGSE
VVHGDDNGDV VVDIDHSGCH ATGTKGKIQL KKTANDETTI LAGAHFQTWD QAKTQVLREG
TVDATGVITF GGLPQCQYIL VETKAPEGYT VSDELAKGRV ITIDEETSAE GAQPTIIKND
VNKVFLEKMD EKGKKLVNAR FKLEHAVTTP FTHWEEVPLA PDRTNANGQL EVDSLKPGLY
QFTEIEAPTG YLLDTTPKRF IVTQNTSGQI RDVHVKMLNY QGSAELIKKD QAGNPLAGAE
FSVLDTTGQA VREHLVSDAN GKVTVTDLAP GKYQFVETKA PACYLLNTEP SAFTIAASDR
GKPATVIATA NFVNYQCTAK LIKKDVNGHL LSGATFKVLD AKGETIQTGL TTNNQGEIVA
EHLAPGKYRF VETKAPTGYL LNTTPVFFEI AEKNACKPAV VVASDNFVSY KGAFQIVKTN
SADQPLAGAV FELYDHNKQS LGTTATSGKD OKITERDLAP GTYYYKETKA PKLPDGADYI
IYPELVKVEI RGDFKGDPEI FQLGAFANFK GEAVFKKIDA NANPLPGTIF KLYRIENGEK
IFEREVTAEK DGSLAMEDLG AGSYELDELD ATDGYIVNKQ PIYFVVKKNS NDKQPLDELE
FVNYQAEVMG RKVNEQGQTL AGAVFATYNA DEQNQPQGSP ITFLNPAGEK VSEITTDKTC
EIYAKGLNEG HYVLVETKAP TGYLLDTTLH PFDVTAQLGK EQPIALCDLI NYQCTAQLTK
ENETGEALAG AVFKVIDETG QTVDGQTNLM SDKQGKVTAK NLAPGTYRFV ETQAPTSYLL
NETPSASFTI AKDNQGKPAT VVLKAPFINY QGAAKLVKID QQKNALAGAE FKVTDAETGQ
TVARSLRSDN QGLVQVNHLQ PGKYTFVETK APDGYQLSKQ AVAFTIAATA KDKPELVNAG
TFVNEKQPVS KKTKPNQPTT KQAARETGWL GLPKTNTQVN YFFVFICLML VGLASWLFYK
KSKK EF125-3 (SEQ ID NO:467) TAACTTTG TTAACTATCA AGGCACGGCT
AAATTAATCA AAAAAGATGT GAATGGACAC TTATTAAGTG GTGCGACATT TAAAGTGCTT
GATGCGAAGG CAGAAACGAT TCAAACAGGC TTGACGACAA ATAATCAAGG GGAAATTGTT
GCAGAGCACT TAGCCCCAGG AAAATATCGC TTTGTAGAAA CCAAAGCGCC AACAGGCTAT
TTATTAAATA CCACGCCAGT CCCATTTGAA ATTGCTGAGA AAAATGCTGG TAAACCAGCG
GTCGTGGTTG CTAGTGACAA CTTTGTGAGT TACAAAGGGG CTTTCCAAAT CGTGAAAACG
AATAGCGCAG ACCAACCATT AGCAGGTGCT GTTTTTGAAT TATATGATCA CAATAAACAA
TCATTAGGGA TTACAGCAAC GAGTGGCAAA GATGGCAAAA TTATCTTTAG AGACTTCGCG
CCAGGTACCT ATTATTACAA AGAAATCAAA GCACCAAAAT TACCAGATGG CGCAGATTAT
ATTATTTATC CTGAATTAGT AAAAGTAGAA ATTCGTGGTG ATTTCAAAGG TGATCCGGAG
ATTTTCCAAT TAGGGGCCTT CGCCAATTTC AAAGGACGCG CCGTCTTTAA GAAAATTGAT
GCCAATGCGA ACCCACTTCC AGGAACGATT TTTAAATTGT ATCGAATCGA AAACGGGGAA
AAAATCTTTG AAAGAGAAGT AACTGCTGAA AAAGATGGTT CATTGGCTAT GGAGGATTTA
GGTGCTGCTA GCTATGAATT AGATGAACTG GATGCAACGG ATGGCTATAT CGTCAATAAA
CAACCCATTT ATTTTGTAGT GAAGAAGAAT TCAAATGATA AACAACCACT AGATGAGTTA
GAGTTTGTAA ATTATCAAGC AGAAGTAATG GGACGTAAAG TCAACGAGCA AGGTCAAACC
TTAGCGCGTG CAGTTTTTGC AATTTACAAT GCCGATGAGC AGAATCAGCC CCAAGGTTCA
CCGATAACAT TCTTGAATCG TGCAGGAGAA AAAGTTTCTG AAATAACAAC GGATAAGACT
GGCGAAATTT ACCCTAAAGG GCTAAATGAA GGGCATTACG TTTTAGTGGA AACGAAAGCA
CCAACAGGCT ATCTGTTAGA CACAACGCTA CATCCATTTG ATGTAACCGC CCAATTAGGA
AAAGAGCAGC CAATTGCTTT AGGCGATCTT ATCAATTATC AAGGAACTGC TCAATTAACC
AAAGAAAACG AAACAGGTGA AGCATTGGCA GGTGCGGTCT TTAACGTCAT TGATGAAACA
GGGCAAACCG TAGATGGACA AACCAATCTG ATGTCTCACA AGCAAGGCAA AGTCATTGCG
AAAAACTTAG CACCGGGAAC GTATCGTTTT GTGGAGACAC AAGCGCCAAC TAGCTATCTT
CTTAATGAAA CGCCAAGCGC AACCTTTACG ATTGCCAAAG ACAACCAAGG CAAACCAGCC
ACTGTGGTAC TTAAAGCACC TTTTATTAAT TACCAAGGTG CTGCCAAGCT GGTGAAAATT
GATCAGCAAA AGAATGCCTT AGCAGGTGCT GAATTTAAAG TGACAGATGC AGAGACAGGG
CAAACTGTCG CTCGTTCATT ACGTTCTGAC AACCAAGGGT TAGTTCAAGT GAACCACTTA
CAACCAGGAA AATATACCTT TGTGGAAACA AAAGCACCGG ATGGTTACCA ACTGTCTAAG
CAAGCTGTCG CATTCACTAT TGCGGCAACA GCGAAAGACA AACCTGAACT CGTGAATGCG
GGCACGTTTG TTAACGAGAA ACAACCTGTA TCCAAAAAAA CAAAACCAAA TCAGCCAACA
ACGAAACAAG CAGCTAGAGA GACAGGTTGG CTTGGT EF125-4 (SEQ ID NO:468)
NFVNYQGTAK LIKKDXTNGHL LSGATFKVLD AKGETIQTGL TTNNQGEIVA EHLAPGKYRF
VETKAPTGYL LNTTPVPFEI AEKNAGKPAV VVASDNFVSY KGAFQIVKTN SADQPLAGAV
FELYDHNKQS LGITATSGKD GKIIFRDLAP GTYYYKEIKA PKLPDGADYI IYPELVKVEI
RGDFKGDPET FQLGAFANFK GRAVFKKIDA NANPLPGTIF KLYRIENGEK IFEREVTAEK
DGSLAMEDLG AGSYELDELD ATDGYTVNKQ PIYFVVKKNS NDKQPLDELE FVNYQAEVMG
RKVNEQGQTL AGAVEATYNA DEQNQPQGSP ITFLNRAGEK VSEITTDKTG EIYAKGLNEG
HYVLVETKAP TGYLLDTTLH PFDVTAQLGK EQPIALGDLI NYQGTAQLTK ENETGEALAG
AVFKVIDETG QTVDGQTNLM SDKQGKVIAK NLAPGTYRFV ETQAPTSYLL NETPSASFTI
AKDNQGKPAT VVLKAPFINY QCAAKLVKID QQKNALAGAE FKVTDAETGQ TVARSLRSDN
QGLVQVNHLQ PGKYTFVETK APDGYQLSKQ AVAFTTAATA KDKPELVNAG TFVNEKQPVS
KKTKPNQPTT KQAARETGWLG EF126-1 (SEQ ID NO:469) TAGCGAAAGA
AAATAGCGAG GATTAAAATG TTTAAGAAAG CAACGAAATT ATTATCGACA ATGGTGATTG
TCGCTGGAAC AGTTGTGGGA AATTTCAGTC CCACATTGGC TTTAGCTGAA GAAGCGGTTA
AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC AGTCTGGCTG
ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT GAAACGGAAC
CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT TTTCCACATG
TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC GTGATGCTGG
CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA ATTAATTCAG
CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA AGCCAATTTG
TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA GGATTGAAAC
TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA ACGTATTTCC
TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT AAGACCAATA
CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC TCAGTGGAAT
ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA GAAATTACTA
ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA AGTTCAGTGA
ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA GAGTTGCAAC
AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT TTTACAACCC
AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA GCTTCATTAA
CGATTGCCAA TCAATTTGAT ATTCAATCTG CCACCGCTAC GGACCATGCT GGAAATGATG
TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT TACGTAGGAA
ACATCACGAT TCACTACGAA CTCAAAGAAA ATACAGCGAT TGATGCAGCA ACCCTTGTAA
GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG ACGATTCCTA
AAAATGACAA TGCGCATGCG TGTCACGTGA CGCCAGAAGA TCCAACGATT ACAAAAGATA
TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT TGGCATGTCA
AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG GATGACATTA
ATAAAGTGCT AGATATCATT GATGTGAAAG TCACCGACCA AAATGGTAAA GATGTTACAG
CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC AAACAAGCAG
ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT AAAATTAAAA
CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT CCCAACCAAG
CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA ACCGTAACAC
CACCGCCAGT TGATCCAAAT ATTCCTAAAG ACGTACAAGG ACAACAACAT TTAGATTTAA
CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC GAAACAAGCA
CTTGGACCCA AGCCAGCATG GTAGATGACA TTAATAAAGT GTTAGACATC ACTGATGTAA
AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA CAAGAAAATA
ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA AGTGGTCATA
CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA GAATTAGCAC
CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC AACGAAGGTG
ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA GAAGATCCAA
CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT
AACCAACCGT GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG
CACATGGACC CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT
GAAAGTTNCT GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA
TAACAAAGTA ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA
TACATACACA ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC
GCCTTATATT GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG
TGACGTGTTG CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC
AAAAAAACCT GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA
TCAAGCACCA ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TTCATTTACC
AATGACTAAT ACAACAGTAA ATCCACTTTA CATGATCGCA GGTTTAATTG TCCTTATAGT
GGCTATTAGC TTTGGCATAA CAAAAAATAA AANAAGAAAA AATTAG EF126-2 (SEQ TD
NO:470) MF KKATKLLSTM VIVAGTVVGN FSPTLALAEE AVKAGDTEGM TNTVKVKDDS
LADCKRILEG QATFPVQAGE TEPVDLVVVE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS
VEYSNDYQGA AAEVLALNQE ITNQGYEMTN AYWESVESLS SVNSYFDKYK TEVGPFVKQE
LQQGSSTPED FITSQSIDDF TTQLKQJNKD RLAQSTPATA SLTIANQFDI QSATATDDAG
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAIDAAT LVSSGTMNQG TIAKEEPEAT
IPKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD
DINKVLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYTMTITTK
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT DVKVTDENCK DVTANGKVTQ
ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FCNETSTWTQ
ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM
TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE
PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGTHLPMTNT TVNPLYMIAG LIVLIVAISF
GITKNKKRKN EF126-3 (SEQ ID NO:471) TGAA GAAGCGGTTA AAGCAGGAGA
TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC AGTCTGCCTG ATTGTAAACG
GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT GAAACGCAAC CAGTCGATTT
AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT TTTCCACATG TAAGACAAGC
GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC GTGATGCTGG CTTCATATCG
CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA ATTAATTCAG CTGATTATGA
TATCAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA AGCCAATTTG TCTCTGGTTT
TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA GGATTGAAAC TCGCTTTAGA
TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA ACGTATTTCC TATTAGTGAC
AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT AAGACCAATA CCAATGATTC
AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC TCAGTGCAAT ATAGTAATGA
CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA GAAATTACTA ACCAAGGCTA
TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA AGTTCAGTGA ATTCATACTT
TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA GAGTTGCAAC AAGGGTCTAG
CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT TTTACAACCC AATTAAAACA
AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA CCTTCATTAA CGATTGCCAA
TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT GGAAATGATG TGCCTGTTCA
AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT TACGTAGGAA ACATCACCAT
TCACTACGAA GTCAAAGAAA ATACAGCGAT TGAT EF126-4 (SEQ ID NO:472) EE
AVKAGDTECM TNTVKVKDDS LADCKRILEG QATFPVQAGE TEPVDLVVVE DASGSFSDNF
PHVRQAIDEV VQGLSDQDRV MLASYRGGKQ FMFPDGKTKI NSADYDHNVR VNTQLTYDKS
QFVSGFGDVR TYGGTPTAPG LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK
TNTNDSINEY PDPRYPLQVS VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS
SVNSYFDKYK TEVGPFVKQE LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA
SLTIANQFDI QSATATDDAG NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAID
EF127-1 (SEQ ID NO:473) TAGCGAAAGA AAATAGGGAG GATTAAAATG TTTAAGAAAG
CAACGAAATT ATTATCGACA ATGGTGATTG TCGCTGGAAC AGTTGTGGGA AATTTCAGTC
CCACATTGGC TTTAGCTGAA GAAGCGGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA
CGGTGAAAGT GAAAGACGAC AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA
CTTTCCCAGT TCAAGCGGGT GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA
GTGGTAGTTT TTCAGATAAT TTTCCACATC TAAGACAAGC GATTGATGAA GTGGTTCAAG
GCTTATCTGA TCAAGACCGC GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT
TTCCTCATGG AAAGACAAAA ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA
CGCAATTGAC TTATGATAAA AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG
GTGGTACGCC AACCGCCCCA GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG
GAGATTTAAC GAATCGAAAA ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC
GTTTAGATGG TTACTTGCAT AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC
CAAGACATCC TCTTCAAGTC TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG
AAGTTTTAGC GTTAAACCAA GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT
GGGAAAGTGT TGAATCTTTA AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG
TGGGTCCTTT TGTAAAACAA GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA
CAAGCCAATC TATTGATGAT TTTACAACCC AATTAAAACA AATTCTCAAA GATCGTCTGG
CGCAATCGAC ACCAGCAACA GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG
CGACCGCTAC GGACGATGCT GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT
CAGCAACTAG TACAGAAGGT TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA
ATACAGCGAT TGATGCAGCA ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG
CTAAGCAATT TCCAGAAGCG ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA
CGCCAGAAGA TCCAACGATT ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA
ATCGTGAAGA TAGTTTCGAT TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT
GGACCCAAGC CAGCATGGTG GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG
TCACCGACGA AAATGGTAAA GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA
AAGTAACTTT TGAAATGAAC AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT
ATACAATGAC TATCACCACT AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT
ACATTGAACA AGGCGGGATT CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG
TGTTACATTC CAACAAACCA ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG
ACGTAGAAGG ACAAGAACAT TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG
TCAAAACAGC TTTCGGTAAC GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA
TTAATAAAGT GTTAGACATC ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA
CAGCTAACCG CAAAGTAACA CAAGAAAATA ACAAAGTAAC TTTTCAAATG AACAANCAAG
CNGACAGCTA TGACTATTTA AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA
AAGCTAGCGC AACGGACGAA GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC
AAGCCGACTT GAACTTTGGC AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA
CACCACCTGC ACCAACGCCA GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC
ATTTAGATTT AACCAACCGT GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA
ACGAAACAAG CACATGGACC CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA
TCACAGACCT GAAAGTTNCT GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA
CACAAGAAAA TAACAAAGTA ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT
TAGCTGGTCA TACATACACA ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG
AAGAATTAGC GCCTTATATT GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG
GCAACGAAGG TGACGTGTTG CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC
CAGAAGACCC AAAAAAACCT GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC
CTACAAATCA TCAAGCACCA ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA
TTCATTTACC AATGACTAAT ACAACAGTAA ATCCACTTTA CATGATCGCA GGTTTAATTG
TCCTTATAGT GGCTATTAGC TTTGCCATAA CAAAAAATAA AAAAAGAAAA AATTAG
EF127-2 (SEQ ID NO:474) ME KKATKLLSTM VIVAGTVVGN FSPTLALAEE
AVKAGDTEGM TNTVKVKDDS LADCKRILEG QATFPVQAGE TEPVDLVVVE DASGSFSDNF
PHVRQAIDEV VQGLSDQDRV MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS
QFVSGFGDVR TYGGTPTAPG LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK
TNTNDSINEY PDPRHPLQVS VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS
SVNSYFDKYK TEVGPFVKQE LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA
SLTIANQFDI QSATATDDAG NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAIDAAT
LVSSGTHNQG TIAKEFPEAT IPKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW
HVKTAFGNET STWTQASMVD DINKVLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK
QADSYDYLSG HTYTMTITTK IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT
VTPPPVDPNI AKDVEGQEHL DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT
DVKVTDENGK DVTANGKVTQ ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE
LAPYIEQGGI PNQADLNFGN EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD
QEFKWNVKTA FGNETSTWTQ ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT
FTMNKKDDSY SYLAGHTYTM TITTKIKTDA TDEELAPYIE QGGTPNQADL NFGNEGDVLH
SNKPTVTPPA PTPEDPKKPE PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIHLPMTNT
TVNPLYMIAG LIVLIVAISF GITKNKKRKN EF127-3 (SEQ ID NO:475) GAATCAA
GGAACAATTG CTAAGGAATT TCCAGAAGCG ACGATTCCTA AAAATGACAA TGCGCATGCG
TGTGACGTGA CGCCAGAAGA TCCAACGATT ACAAAAGATA TCGAAAATCA AGAACACTTA
GATTTAACCA ATCGTGAAGA TAGTTTCGAT TGGCATGTCA AAACAGCCTT TGGCAACGAA
ACCAGTACTT GGACCCAAGC CAGCATGGTG GATGACATTA ATAAAGTGCT AGATATCATT
GATGTGAAAG TCACCGACGA AAATGGTAAA GATGTTACAG CTAACGGCAC AGTAACACAA
GAAAATAACA AAGTAACTTT TGAAATGAAC AAACAAGCAG ACAGCTATGA CTATTTAAGT
GGTCATACGT ATACAATGAC TATCACCACT AAAATTAAAA CTGACGCAAC GGACGAAGAA
TTAGCGCCTT ACATTGAACA AGGCGGGATT CCCAACCAAG CCGACTTAAA CTTTGGCAAT
CAAGGTGACC TGTTACATTC CAACAAACCA ACCGTAACAC CACCGCCAGT TGATCCAAAT
ATTGCTAAAG ACGTAGAAGG ACAAGAACAT TTAGATTTAA CCAACCGCGA TCAAGAATTT
AAATGGAACG TCAAAACAGC TTTCGGTAAC GAAACAAGCA CTTGGACCCA AGCCAGCATG
GTAGATGACA TTAAT EF127-4 (SEQ ID NO:476) NQG TIAKEF PEAT IPKNDNAHAC
DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD DINKVLDIID
VKVTDENGKD VTANGTVTQE NNKVTFENNK QADSYDYLSG HTYTMTITTK IKTDATDEEL
APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL DLTNRDQEFK
WNVKTAFGNE TSTWTQASMV DDIN EF12S-1 (SEQ ED NO:477) TAGCGAAAGA
AAATAGGGAG GATTAAAATG TTTAAGAAAG CAACGAAATT ATTATCGACA ATGGTGATTG
TCGCTGGAAC AGTTGTGGGA AATTTCAGTC CCACATTGGC TTTAGCTGAA GAAGCGGTTA
AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC AGTCTGGCTG
ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT GAAACGGAAC
CAGTCGATTT AGTACTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT TTTCCACATG
TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC GTCATGCTGG
CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA ATTAATTCAG
CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA AGCCAATTTG
TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA GGATTGAAAC
TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA ACGTATTTCC
TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT AAGACCAATA
CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC TCAGTGGAAT
ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA GAAATTACTA
ACCAAGGCTA TCAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA AGTTCAGTGA
ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA GAGTTGCAAC
AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT TTTACAACCC
AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA GCTTCATTAA
CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT GGAAATGATG
TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT TACGTAGGAA
ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGATGCAGCA ACCCTTGTAA
GTACTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG ACGATTCCTA
AAAATGACAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT ACAAAAGATA
TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT TGGCATGTCA
AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG GATGACATTA
ATAAAGTGCT AGATATCATT GATGTGAAAG TCACCGACCA AAATGGTAAA GATGTTACAG
CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC AAACAAGCAG
ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT AAAATTAAAA
CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT CCCAACCAAG
CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA ACCGTAACAC
CACCGCCAGT TGATCCAAAT ATTGCTAAAC ACGTAGAACG ACAAGAACAT TTAGATTTAA
CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC GAAACAAGCA
CTTGGACCCA AGCCAGCATG GTAGATGACA TTAATAAAGT GTTAGACATC ACTGATGTAA
AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA CAAGAAAATA
ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA AGTGGTCATA
CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA GAATTAGCAC
CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC AACGAAGGTG
ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA GAAGATCCAA
CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT GACCAAGAAT
TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC CAAGCCAGCA
TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT GANGAAAATG
GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA ACTTTTACTA
TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA ATGACTATTA
CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT GAACAAGGCG
GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG CATTCCAACA
AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT GAACCTAAAC
AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA ACGAACCCAG
TCAATTTTGG AAAATCAGCA AGTAAAGGAA TTCATTTACC AATGACTAAT ACAACAGTAA
ATCCACTTTA CATGATCGCA GGTTTAATTG TCCTTATAGT GGCTATTAGC TTTGGCATAA
CAAAAAATAA AAAAAGAAAA AATTAG EF128-2 (SEQ ID NO:478) MF KKATKLLSTM
VIVAGTVVGN FSPTLALAEE AVKAGDTEGM TNTVKVKDDS LADCKRILEG QATFPVQAGE
TEPVDLVVVE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV MLASYRGGKQ FMFPDGKTKI
NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG LKLALDTYNQ THGDLTNRKT
YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS VEYSNDYQGA AAEVLALNQE
ITNQGYEMTN AYWESVESLS SVNSYFDKYK TEVGPFVKQE LQQGSSTPED FITSQSIDDF
TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG NDVPVQINGQ TISATSTEGY
VGNITIHYEV KENTAIDAAT LVSSGTMNQG TIAKEFPEAT IPKNDNAHAC DVTPEDPTIT
KDTENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD DINKVLDITD VKVTDENGKD
VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYTMTITTK IKTDATDEEL APYTEQGGIP
NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL DLTNRDQEFK WNVKTAFGNE
TSTWTQASMV DDTNKVLDIT DVKVTDENGK DVTANGKVTQ ENNKVTFEMN XQADSYDYLS
GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN EGDVLHSNKP TVTPPAPTPE
DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ ASMVDDINKV LDITDVKVXX
ENGKDVTDNG IVTQENNKVT FTHNKKDDSY SYLAGHTYTM TITTKIKTDA TDEELAPYIE
QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE PKQPLKPKKP LTPTNHQAPT
NPVNFGKSAS KGTHLPMTNT TVNPLYMIAG LIVLIVAISF GITKNKKRKN EF128-3 (SEQ
ID NO:479) AGA TCAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG CCAACGAAGG TGACGTGTTG
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TTCAT EF128-4 (SEQ ID
NO:480) DENGK DVTANGKVTQ ENNKVTFEMN XQADSYDYLS GHTYTMTITT
KIKASATDEE LAPYIEQGGI PNQADLNFGN EGDVLHSNKP TVTPPAPTPE DPTITKDIEG
QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ
ASMVDDJNKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM
TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE
PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIH EF129-1 (SEQ ID NO:481)
TGACAAGTGA AGAAACGTCT ATTTGCATCA GTATTACTAT GTTCATTAAC GCTATCAGCA
ATTGCTACCC CAAGCATCGC TTTGGCGGAC AATGTTGATA AAAAAATTGA AGAAAAAAAT
CAAGAAATTT CATCATTAAA AGCAAAACAA GGGGATTTAG CTTCACAAGT ATCTTCTTTA
GAAGCAGAAG TATCTTCAGT ATTTGATGAA AGCATGGCTT TACGTGAACA AAAGCAAACA
CTAAAAGCAA AATCAGAACA ATTACAACAA GAAATTACAA ACTTGAATCA ACGTATTGAA
AAACGTAACG AAGCAATCAA AAATCAAGCA CGTGATGTTC AAGTTAATGG ACAAAGCACA
ACAATGCTAG ATGCAGTTTT AGATGCGGAC TCAGTTGCAG ATGCAATCAG CCGTGTTCAA
GCTGTTTCAA CAATCGTAAG TGCCAACAAC GACTTAATGC AACAACAAAA AGAAGACAAA
CAAGCCGTTG TTGATAAAAA AGCTGAAAAC GAGAAAAAAG TGAAACAACT TGAAGCAACA
GAAGCTGAAT TAGAAACAAA ACGTCAAGAT TTACTTTCTA AACAATCTGA ATTAAACGTA
ATGAAAGCTT CATTAGCATT AGAACAATCA TCAGCTGAAA GTTCTAAAGC TGGCTTAGAA
AAACAAAAAG CAGCTGCTGA AGCAGAGCAA GCACGCTTAG CTGCTGAACA AAAAGCTGCA
GCTGAAAAAG CCAAACAAGC TGCTGCAAAA CCAGCTAAAG CTGAAGTGAA AGCAGAAGCA
CCAGTTGCCT CTTCATCAAC AACAGAAGCA CAAGCACCAG CAAGCTCAAG CTCAGCAACT
GAATCAAGCA CGCAACAAAC AACTGAAACA ACTACACCAA GTACAGATAA TAGTGCAACA
GAAAATACTG GCTCTTCTTC ATCAGAACAA CCAGTACAAC CTACAACACC AAGCGATAAT
GGAAATAATG GTGGCCAAAC TGGTGGTGGA ACAGTTACAC CAACACCAGA ACCAACACCA
GCGCCTTCTG CTGATCCAAC AATCAATGCA TTGAACGTTC TACGTCAATC ATTAGGTTTA
CGTCCAGTAG TATGGGATGC AGGTTTGGCA GCTTCTGCAA CTGCTCGTGC AGCACAAGTT
GAAGCAGGTG GCATTCCAAA TGATCACTGG TCTCGTGGAG ATGAAGTTAT CGCAATTATG
TGGGCGCCAG GTAACTCAGT AATCATGGCG TGGTACAATG AAACAAACAT GGTAACAGCT
TCAGGAAGCG GTCACCGTGA TTGGGAAATT AACCCAGGTA TTACGCGTGT CGGTTTTGGT
TACTCAGGTA GCACAATCGT AGGACACTCA GCCTAA EF129-2 (SEQ ID NO:482)
VKKRLFASV LLCSLTLSAI ATPSIALADN VDKKTEEKNQ EISSLKAKQG DLASQVSSLE
AEVSSVFDES MALREQKQTL KAKSEQLQQE ITNLNQRTEK RNEAIKNQAR DVQVNGQSTT
MLDAVLDADS VADAISRVQA VSTTVSANND LMQQQKEDKQ AVVDKKAENE KKVKQLEATE
AELETKRQDL LSKQSELNVM KASLALEQSS AESSKAGLEK QKAAAEAEQA RLAAEQKAAA
EKAKQAAAKP AKAEVKAEAP VASSSTTEAQ APASSSSATE SSTQQTTETT TPSTDNSATE
NTGSSSSEQP VQPTTPSDNG NNGGQTGGGT VTPTPEPTPA PSADPTINAL NVLRQSLGLR
PVVWDACLAA SATARAAQVE AGGIPNDHWS RGDEVIAIMW APGNSVIMAW YNETNMVTAS
GSGHRDWEIN PGITRVGFGY SGSTIVGHSA EF129-3 (SEQ ID NO:483) GGAC
AATGTTGATA AAAAAATTGA AGAAAAAAAT CAAGAAATTT CATCATTAAA AGCAAAACAA
GCGGATTTAG CTTCACAAGT ATCTTCTTTA GAAGCAGAAG TATCTTCAGT ATTTGATGAA
AGCATGGCTT TACGTGAACA AAAGCAAACA CTAAAAGCAA AATCAGAACA ATTACAACAA
GAAATTACAA ACTTGAATCA ACGTATTGAA AAACGTAACG AAGCAATCAA AAATCAAGCA
CGTGATGTTC AAGTTAATGG ACAAAGCACA ACAATGCTAG ATGCAGTTTT AGATGCGGAC
TCAGTTGCAG ATGCAATCAG CCGTGTTCAA GCTGTTTCAA CAATCGTAAG TGCCAACAAC
GACTTAATGC AACAACAAAA AGAAGACAAA CAAGCCGTTG TTGATAAAAA AGCTGAAAAC
GAGAAAAAAG TGAAACAACT TGAAGCAACA GAAGCTGAAT TAGAAACAAA ACGTCAAGAT
TTACTTTCTA AACAATCTGA ATTAAACGTA ATGAAAGCTT CATTAGCATT AGAACAATCA
TCAGCTGAAA GTTCTAAAGC TGGCTTAGAA AAACAAAAAG CAGCTGCTGA AGCAGAGCAA
GCACGCTTAG CTGCTGAACA AAAAGCTGCA GCTGAAAAAG CCAAACAAGC TGCTGCAAAA
CCAGCTAAAG CTGAAGTGAA AGCAGAAGCA CCAGTTGCCT CTTCATCAAC AACAGAAGCA
CAAGCACCAG CAAGCTCAAG CTCAGCAACT GAATCAAGCA CGCAACAAAC AACTGAAACA
ACTACACCAA GTACAGATAA TAGTGCAACA GAAAATACTG GCTCTTCTTC ATCAGAACAA
CCAGTACAAC CTACAACACC AAGCGATAAT GGAAATAATG GTGGCCAAAC TGGTGGTGGA
ACAGTTACAC CAACACCAGA ACCAACACCA GCGCCTTCTG CTGATCCAAC AATCAATGCA
TTGAACGTTC TACGTCAATC ATTAGGTTTA CGTCCAGTAG TATGGGATGC AGGTTTGGCA
GCTTCTGCAA CTGCTCGTGC AGCACAAGTT GAAGCAGGTG GCATTCCAAA TGATCACTGG
TCTCGTGGAG ATGAAGTTAT CGCAATTATG TGGGCGCCAG GTAACTCAGT AATCATGGCG
TGGTACAATG AAACAAACAT GGTAACAGCT TCACGAAGCG GTCACCGTGA TTGGGAAATT
AACCCAGGTA TTACGCGTGT CGGTTTTGGT TACTCAGGTA GCACAATCGT AGGACACTCA
GCC EF129-4 (SEQ ID NO:484) DN VDKKIEEKNQ EISSLKAKQG DLASQVSSLE
AEVSSVFDES MALREQKQTL KAKSEQLQQE ITNLNQRIEK RNEAIKNQAR DVQVNGQSTT
MLDAVLDADS VADAISRVQA VSTIVSANND LMQQQKEDKQ AVVDKKAENE KKVKQLEATE
AELETKRQDL LSKQSELNVM KASLALEQSS AESSKAGLEK QKAAAEAEQA RLAAEQKAAA
EKAKQAAAKP AKAEVKAEAP VASSSTTEAQ APASSSSATE SSTQQTTETT TPSTDNSATE
NTGSSSSEQP VQPTTPSDNG NNGGQTGGGT VTPTPEPTPA PSADPTINAL NVLRQSLGLR
PVVWDAGLAA SATARAAQVE AGGIPNDHWS RGDEVIAIMW APGNSVIMAW YNETNMVTAS
GSGHEDWEIN PGITRVGFGY SGSTIVGHSA EF130-1 (SEQ ID NO:485) TGATACATTA
AAAGGAGGGA AAATATGCGC CCAAAAGAGA AAAAAAGAGG AAAAAATTGG TTAATCAACA
GTTTATTAGT TTTACTATTT ATCATTGGCT TAGCCTTAAT TTTTAACAAT CACATACGTA
GTTGGGTGGT TCAACAAAAT AGCCGCTCGT ACGCCGTTAG CAAGTTCAAA CCAGCTGATG
TCAAGAAAAA TATGGCTCGT GAAACAACGT TTGACTTTGA TTCAGTTGAG TCCTTGAGCA
CAGAAGCGGT GATGAAAGCC CAATTTGAAA ACAAAAACTT ACCTGTGATT GGTGCCATTG
CGATACCAAG TGTCGAAATT AATTTGCCCA TTTTTAAAGG ATTGTCCAAT GTCGCTTTAT
TAACTGGTGC CGGGACCATG AAAGAAGATC AAGTCATGGG GAAAAACAAT TATGCCTTCG
CTAGTCATCG AACGGAAGAT GGCGTTTCCT TATTTTCACC TTTAGAAAGA ACCAAAAAAG
ACGAACTCAT TTATATCACT GATTTATCTA CTGTTTATAC ATACAAAATA ACTTCTGTAG
AAAAAATCGA ACCAACCCGT GTTGAGTTAA TTGATGACGT TCCTGGTCAA AATATGATTA
CCTTAATTAC CTGTGGCGAT TTACAAGCAA CGACGCGAAT TGCTGTTCAA GGAACATTAG
CAGCAACGAC GCCTATTAAA GACCCCAACG ACGATATGTT GAAGGCTTTC CAATTGCAGC
AAAAAACTTT ACCCGATTGG GTGGCTTAA EF130-2 (SEQ ID NO:486) YIKRRENMRP
KEKKRGKNWL INSLLVLLFT ICLALIFNNQ IRSWVVQQNS RSYAVSKLKP ADVKKNMARE
TTFDFDSVES LSTEAVMKAQ FENKNLPVIG AIAIPSVEIN LPIFKGLSNV ALLTGAGTMK
EDQVMGKNNY ALASHRTEDG VSLFSPLERT KKDELIYTTD LSTVYTYKIT SVEKIEPTRV
ELIDDVPGQN MITLITCGDL QATTRIAVQG TLAATTPIKD ANDDMLKAFQ LEQKTLADWV A
EF130-3 (SEQ ID NO:487) CGTTAG CAAGTTGAAA CCAGCTGATG TGAAGAAAAA
TATGGCTCGT GAAACAACGT TTGACTTTGA TTCAGTTGAG TCCTTGAGCA CAGAAGCGGT
GATGAAAGCC CAATTTGAAA ACAAAAACTT ACCTGTGATT GGTGCCATTG CGATACCAAG
TGTCGAAATT AATTTGCCCA TTTTTAAAGG ATTGTCCAAT GTCGCTTTAT TAACTGGTGC
CGGGACCATG AAAGAAGATC AAGTCATGGG GAAAAACAAT TATGCCTTGG CTAGTCATCG
AACGGAAGAT GGCGTTTCCT TATTTTCACC TTTAGAAAGA ACCAAAAAAG ACGAACTCAT
TTATATCACT GATTTATCTA CTGTTTATAC ATACAAAATA ACTTCTGTAG AAAAAATCGA
ACCAACCCGT GTTGAGTTAA TTGATGACGT TCCTGGTCAA AATATGATTA CCTTAATTAC
CTGTGGCGAT TTACAAGCAA CGACGCGAAT TGCTGTTCAA GGAACATTAG CAGCAACGAC
GCCTATTAAA GACGCCAACG ACGATATGTT GAAGGCTTTC CAATTGGAGC AAAAAACTTT
AGCCGATTGG GTGGCT EF130-4 (SEQ ID NO:488) VSKLKP ADVKKNMARE
TTFDFDSVES LSTEAVMKAQ FENKNLPVIG AIAIPSVEIN LPILFKGLSNV ALLTGAGTMK
EDQVMGKNNY ALASHRTEDG VSLFSPLERT KKDELIYTTD LSTVYTYKIT SVEKIEPTRV
ELIDDVPGQN MITLITCGDL QATTRIAVQG TLAATTPIKD ANDDMLKAFQ LEQKTLADWV A
EF131-1 (SEQ ID NO:489) TAGGCGGAGG TAAGCGGTAT GCGTAAACGA CATGCAAAGA
AAAGACATGG AGGAGTGAAT TGGCTTTTTA TAGTATGTTT GTTGGTGGTG ATTGGTGGTA
GTGGTTATTT AATAAAAACG TTCTTTTTCA OTAGAGATTO ACAAGTTAGT CAAGAATCGA
AAGTGGTCTT GGAACAAGAT CGCCGAAGTG ATAATTATGC GAATTTAACG AAAGAAATAG
TTGCACCAGA TAGTGGCGAA CTTGATCAAA AAATTCAAGA AACAAATTAT ATTGGTTCGG
CTTTGATCAT TAAAGATGAT CAGGTTTTAG TAAATAAAGG ATATGGCTTT GCCAATTTTG
AAAAGCAACA AGCCAACACG CCAAACACAA GGTTTCAGAT TGGCTCAATT CAAAAATCTT
TTACCACAAC CTTGATCTTA AAAGCAATTG AAGAAGGTAA ACTTACATTA GATACAAAAC
TCGCTACGTT TTATCCGCAA ATTCAAGGTG CTGAGGATAT TACGATTAGC GATATGTTGA
ATATGACAAG TGGTTTAAAG TTATCAGCAA TGCCTAATAA TATCGTTACC GATGAAGAAA
TTATTCAATT TGTTAAACAA AATACCATTC AAGTCAATAA AGGAAAATAC AATTATTCCC
CAGTAAATTT TGTCCTTTTA GCAGGAATGT TAGAGAAAAT GTATCAACGT ACCTATCAAG
AATTATTTAA TAATCTTTAT CACAAAACGG CTGGTTTAAA GAATTTTGGC TTCTATGAAA
CCTTATTGGA ACAGCCCAAT AATTCAACAA GTTATAAATG GACAGAAGAT AATTCATATA
ACCAAGTGCT CTCAATTCCT GCAGCTAGTT TTCCCCATGA ATTTGGGACT GGTAATGTGG
ATATOAGGAC ACGTGATTTC TATTGGTACT TACATCAATT AACGAGTGGA CATTTAGTTT
CCACCGCACT TTTGCAAAAA TTATCCACGT CTTCTCAGCA AAGCTCTTAT CATGGCCGCA
TCTATGTTCA TGATAATTAT TTACGTTTAC ACGGCGTTGA AGCGGGTCAA CAAGCCCTGG
TTTTATTTTC AAAAGATATG AAGACAGGGG TCATATTGCT AACTAACTGT GTGAATCCAG
CCAAATACAA AGAATTAATT GGTTCGTTGT TCCATGATGT AACCAATTTA ACTGTTAAAT
TTTAA EF131-2 (SEQ ID NO:490) MRKRH AKKRHGGVNW LFILVCLLVVI
GGSGYLIKTF FFTRDSQVSQ ESKVVLEEDR RSDNYANLTK EIVAPDSGEL DQKIQETNYI
GSALIIKDDQ VLVNKGYGFA NFEKQQANTP NTRFQIGSIQ KSFTTTLILK ATEEGKLTLD
TKLATFYPQI QGAEDITISD MLNNTSGLKL SAMPNNIVTD EEIIQFVKQN TIQVNKGKYN
YSPVNFVLLA GMLEKMYQRT YQELFNNLYH KTAGLKNFGF YETLLEQPNN STSYKWTEDN
SYNQVLSIPA ASFAHEFGTG NVDNTTGDLY WYLHQLTSGH LVSTALLQKL WTSSQQSSYH
GGIYVHDNYL RLHGVEACQQ ALVLFSKDMK TGVILLTNCV NPAKYKELIG SLFHDVTNLT
VKF EF131-3 (SEQ ID NO:491) TTT AATAAAAACG TTCTTTTTCA CTAGAGATTC
ACAAGTTACT CAAGAATCGA AACTGGTCTT GGAACAAGAT CGCCGAAGTG ATAATTATGC
GAATTTAACC AAAGAAATAC TTGCACCAGA TAGTGGCGAA CTTGATCAAA AAATTCAAGA
AACAAATTAT ATTCGTTCGG CTTTGATCAT TAAAGATGAT CAGGTTTTAG TAAATAAAGG
ATATGGCTTT GCCAATTTTG AAAAGCAACA AGCCAACACG CCAAACACAA GGTTTCAGAT
TGGCTCAATT CAAAAATCTT TTACCACAAC CTTGATCTTA AAAGCAATTG AAGAAGGTAA
ACTTACATTA GATACAAAAC TCGCTACGTT TTATCCGCAA ATTCAAGGTG CTGAGGATAT
TACGATTAGC GATATGTTGA ATATGACAAG TGGTTTAAAG TTATCAGCAA TCCCTAATAA
TATCGTTACC GATGAAGAAA TTATTCAATT TGTTAAACAA AATACCATTC AAGTCAATAA
AGGAAAATAC AATTATTCCC CAGTAAATTT TCTCCTTTTA GCAGGAATCT TAGAGAAAAT
GTATCAACGT ACCTATCAAG AATTATTTAA TAATCTTTAT CACAAAACGG CTGGTTTAAA
GAATTTTGGC TTCTATGAAA CCTTATTGGA ACAGCCCAAT AATTCAACAA GTTATAAATG
GACAGAAGAT AATTCATATA ACCAAGTGCT CTCAATTCCT GCAGCTAGTT TTGCCCATGA
ATTTGGGACT GGTAATGTGG ATATGACGAC AGGTGATTTG TATTGGTACT TACATCAATT
AACGAGTCGA CATTTACTTT CCACCGCACT TTTCCAAAAA TTATGGACGT CTTCTCAGCA
AAGCTCTTAT CATGGCGGCA TCTATGTTCA TGATAATTAT TTACCTTTAC ACGGCGTTGA
AGCGGGTCAA CAAGCCCTGG TTTTATTTTC AAAAGATATG AAGACAGGCG TCATATTGCT
AACTAACTGT CTGAATCCAG CGAAATACAA AGAATTAATT GGTTCGTTGT TCCATGATGT
AACCAATTTA ACTGTTAAAT TT EF131-4 (SEQ ID NO:492) LIKTF FFTRDSQVSQ
ESKVVLEEDR RSDNYANLTK EIVAPDSGEL DQKIQETNYI GSALIIKDDQ VLVNKGYGFA
NFEKQQANTP NTRFQTGSIQ KSFTTTLILK AIEEGKLTLD TKLATFYPQI QGAEDITISD
MLNMTSGLKL SAMPNNIVTD EEIIQFVKQN TIQVNKGKYN YSPVNFVLLA GMLEKMYQRT
YQELFNNLYH KTAGLKNFGF YETLLEQPNN STSYKWTEDN SYNQVLSIPA ASFAHEFGTG
NVDMTTGDLY WYLHQLTSGH LVSTALLQKL WTSSQQSSYH GGIYVHDNYL RLHGVEAGQQ
ALVLFSKDMK TGVILLTNCV NPAKYKELIG SLFHDVTNLT VKF EF132-1 (SEQ ID
NO:493) TAGTTTTCTAATCTCACCAAAACAAAAATTTTTAAGAAAGAAGGAGAGA-
TCGTTATGATGAGAAAATGGAAAGTAGTA GTGGGAAGTCTCCGAATCTTGATTGCTC-
TTTTTATATTCGGGGCATGTTCAACAAATAGTAAAGACAAAGATACAGTG
GCTTCGAACGAAAAATTAAAGGTAGTAGTTACTAATTCGATTTTAGCAGATATTACTGAAAATATAGCAAAAG-
ATAAA ATTGATTTACACAGTATCGTACCTATTGGGAAAGATCCCCACGAATATGAAC-
CtTTGCCTGAAGATGTTCAAAAAACT TCAAAAGCAGATTTGATTTTTTATAACGGTG-
TTAACTTGGAmACTGGAGGAAATGCTTGGTTTACAAAATTAGThAAA
mATGCCAACAAAGAGGAAAACAAAGACTATTTTGCAGCAAGTGATGGCATAGATGTTATTTACTTAGAGGGTC-
AGAGT GAGAAAGGGAAGGAAGATCCCCATGCTTGGTTAAATTTAGAAAACGGTATTA-
TTTACGCTAAAAATATTGAAAAATGG TTAGCGGAAAAAGATCCTGATAATAAAAAAT-
TCTATAAAGAAAATCTAGATAAGTATATTGAAAAGTTGGATTCTCTA
GACAAAGAAGCTAAATCTAAATTTGCTTCAATTCCGAATGATAAAAAAATGATTCTTACAAGTGAAGGATGCT-
TtAAA TATTTCTCGAAAGCGTATAATGTGCCTTCTGCTTACATTTGGGAAAtCAACA-
CTGAAGAAGAAGGAACACCAGATCAA ATAAAACACTTAGTTGAAAAATTACGCACAA-
CAAAAGTTCCCTCCTTATTCGTAGAAAGTAGTGTGGACGATAGACCG
ATGAAAACAGTATCAAAAGATACCAATATTCCTATCTATTCAACGATTTTTACTGATTCAATTGCAGAAAAAG-
GACAA GATGGTGATAGTTACTATGCGATGATGAAATGGAACCTGGATAAAATTGCTG-
AAGGCCTTTCGAAATAA EF132-2 (SEQ ID NO:494)
MNRKWKVVVGSLGMLIALFIFGACSTNSKDKDTVASNEKLKVVVTNSTLADITENIAKDKIDLHSIVPIGKDP-
HEYEP LPEDVQKTSKADLIFYNGVNLXTGGNAWFTKLVKXANKEENKDYFAASDGID-
VIYLEGQSEKGKEDPHAWLNLENGII YAKNIEKWLAEKDPDNKKFYKENLDKYIEKL-
DSLDKEAKSKFASIPNIDKKMVTSEGCFKYFSKAYNVPSAYIWEINT
EEEGTPDQIKHLVEKLRTTKVPSLFVESSVDDRPMKTVSKDTNIPIYSTIFTDSIAEKGQDGDSYYAMMKWNL-
DKIAE GLSK. EF132-3 (SEQ ID NO:495)
ATGTTCAACAAATAGTAAAGACAAAGATACAGTGGCTTCGAACGAAAAATTAAACGTAGTAGTT-
ACTAATTCGATTTT AGCAGATATTACTGAAAATATAGCAAAAGATAAAATTGATTTA-
CACAGTATCGTACCTATTCCGAAAGATCCCCACGA
ATATGAACCtTTGCCTGAAGATGTTCAAAAAACTTCAAAAGCAGATTTGATTTTTTATAACGGTGTTAACTTG-
GAmAC TGGAGGAAATGCTTGGTTTACAAAATTAGTAAAAmATGCGAACAAAGAGGAA-
AACAAAGACTATTTTGCAGCAAGTGA TGGCATAGATGTTATTTACTTAGAGGGTCAG-
AGTGAGAAAGGGAAGGAACATCCCCATGCTTGGTTAAATTTAGAAAA
CGGTATTATTTACGCTAAAAATATTGAAAAATGGTTAGCGGAAAAAGATCCTGATAATAAAAAATTCTATAAA-
GAAAA TCTAGATAAGTATATTGAAAAGTTGGATTCTCTAGACAAAGAAGCTAAATCT-
AAATTTCCTTCAATTCCGAATCATAA AAAAATGATTGTTACAAGTGAAGGATGCTTt-
AAATATTTCTCGAAAGCGTATAATGTGCCTTCTGCTTACATTTGGGA
AAtCAACACTGAAGAAGAAGGAACACCAGATCAAATAAAACACTTAGTTGAAAAATTACGCACAACAAAAGTT-
CCCTC CTTATTCGTAGAAAGTAGTGTGGACGATAGACCGATGAAAACAGTATCAAAA-
GATACCAATATTCCTATCTATTCAAC GATTTTTACTGATTCAATTGCAGAAAAAGGA-
CAAGATGGTGATAGTTACTATCCGATGATGAAATGGAACCTGGATAA
AATTGCTGAAGGCCTTTCGAAA EF132-4 (SEQ ID NO:496)
CSTNSKDKDTVASNEKLKVVVTNSILADITENIAKDKIDLHSIVPIGKDPHEYEPLPEDVQKTSKADL-
IFYNGVNLXT GGNAWFTKLVKXANKEENKDYFAASDGIDVIYLEGQSEKGKEDPHAW-
LNLENGIIYAKNIEKWLAEKDPDNKKFYKEN LDKYIEKLDSLDKEAKSKFASEPNDK-
KNIVTSEGCFKYFSKAYNVPSAYIWEINTEEEGTPDQIKHLVEKLRTTKVPS
LFVESSVDDRPMKTVSKDTNIPIYSTIFTDSIAEKGQDGDSYYANINKWNLDKTAEGLSK
[0268]
2TABLE 2 Closest matching sequences between the polypeptides of the
present invention and sequences in GenBank and Derwent databases.
GenBank BLAST BLAST Query Access. No. GenBank Gene Description
Score P-Value EF002-2 gi.vertline.2338759 (AF018073) periplasmic
sorbitol-binding protein; SmoE 113 3.60E-18 [Rhodobacter EF003-2
gi.vertline.1552773 hypothetical [Escherichia coli]
>gnl.vertline.PID.vertline.d1012634 hypothetical 278 1.20E-53
29.4 EF003-2 gi.vertline.2196996 lipoprotein homolog [Treponema
pallidum] >gi.vertline.2108234 29K 309 3.30E-44 protein EF003-2
gi.vertline.146649 lipoprotein-28 precursor [Escherichia coli]
>gi.vertline.290510 263 9.20E-40 EF003-2 gi.vertline.48838 28
3kDa membrane protein [Haemophilus influenzae] 197 2.10E-39 EF003-2
gi.vertline.1573614 28 kDa membrane protein (hlpA) [Haemophilus
influenzae] 197 7.80E-39 EF003-2 gi.vertline.2314748 (AE000654)
outer membrane protein [Helicobacter pylori] 263 4.60E-37 EF003-2
gi.vertline.349530 lipoprotein [Pasteurella haemolytica]
>gi.vertline.150508 lipoprotein 189 4.10E-29 EF003-2
gnl.vertline.PID.vertline.e118435 similar to hypothetical proteins
[Bacillus subtilis] 158 2.70E-26 EF003-2 gi.vertline.349532
lipoprotein [Pasteurella haemolytica]
>pir.vertline.JN0753.vertline.JN- 0753 outer 200 1.20E-25
EF003-2 gi.vertline.1336657 lipoprotein [Bacillus subtilis] 182
2.70E-25 EF003-2 gnl.vertline.PID.vertlin- e.e233873 hypothetical
protein [Bacillus subtilis] >gnl.vertline.PID.ve-
rtline.e1182900 186 1.30E-23 EF003-2 gi.vertline.294071 lipoprotein
3 [Pasteurella haemolytica] 199 6.60E-23 EF003-2 gi.vertline.349531
lipoprotein [Pasteurella haemolytica]
>pir.vertline.JN07521JN0752 outer 198 1.30E-20 EF003-2
gi.vertline.294070 lipoprotein 2 [Pasteurella haemolytica] 198
1.80E-20 EF005-2 gi.vertline.537235 Kenn Rudd identifies as gpmB
[Escherichia coli] >gi.vertline.1790856 127 6.20E-12 EF006-2
gi.vertline.552773 hypothetical [Escherichia coli]
>gnl.vertline.PID.vertline.d1012634 hypothetical 255 1.40E-60
29.4 EF006-2 gi.vertline.349532 lipoprotein [Pasteurella
haemolytica] >pir.vertline.JN0753.vertline.JN0753 outer 221
6.40E-49 EF006-2 gi.vertline.2314748 (AE000654) outer membrane
protein [Helicobacter pylori] 283 2.70E-48 EF006-2
gi.vertline.2196996 lipoprotein homolog [Treponema pallidum]
>gi.vertline.2108234 29K 267 4.40E-47 protein EF006-2
gnl.vertline.PID.vertline.e118- 435 similar to hypothetical
proteins [Bacillus subtilis] 359 1.80E-44 EF006-2
gi.vertline.349531 lipoprotein [Pasteurella haemolytica]
>pin.vertline.JN0752.vertline.JN0752 outer 218 3.80E-41 EF006-2
gi.vertline.294071 lipoprotein 3 [Pasteurella haemolytica] 220
2.30E-38 EF006-2 gi.vertline.146649 lipoprotein-28 precursor
[Escherichia coli] >gi.vertline.290510 193 2.60E-38 EF006-2
gi.vertline.294070 lipoprotein 2 [Pasteurella haemolytica] 218
1.20E-36 EF006-2 gi.vertline.148838 28 3kDa membrane protein
[Haemophilus influenzae] 112 8.50E-34 EF006-2 gi.vertline.1573614
28 kDa membrane protein (hlpA) [Haemophilus influenzae] 112
1.50E-33 EF006-2 gi.vertline.349530 lipoprotein [Pasteurella
haemolytica] >gi.vertline.150508 lipoprotein 114 4.30E-29
EF006-2 gi.vertline.294069 lipoprotein 1 [Pasteurella haemolytica]
114 1.30E-27 EF006-2 gi.vertline.1336657 lipoprotein [Bacillus
subtilis] 202 2.10E-26 EF006-2 gnl.vertline.PID.vertline.e233873
hypothetical protein [Bacillus subtilis]
>gnl.vertline.PID.vertline.e1182900 200 6.50E-25 EF008-2
gi.vertline.493017 endocarditis specific antigen [Enterococcus
faecalis] 1590 2.70E-211 EF008-2 gi.vertline.393269 adhesion
protein [Streptococcus pneumoniae] 986 1.80E-129 EF008-2
gi.vertline.153834 adhesin specific for salivary pellicle of dental
surfaces 973 1.00E-127 EF008-2 gi.vertline.1575030 surface adhesin
A precursor [Streptococcus pneumoniae] 934 2.90E-126 EF008-2
gi.vertline.153826 adhesin B [Streptococcus sanguis]
>pir.vertline.A43583.vertline.A43583 adhesin 916 3.90E-126 B
EF008-2 gi.vertline.1184932 ScbA [Streptococcus crista] 915
3.40E-125 EF008-2 gi.vertline.1117994 surface antigen A variant
precursor [Streptococcus pneumoniae] 917 5.60E-124 EF008-2
gi.vertline.310633 adhesin [Streptococcus gordonii] 891 6.00E-122
EF008-2 gnl.vertline.PID.vertline.e255529 lipoprotein
[Staphylococcus epidermidis] 476 1.20E-99 EF008-2
gi.vertline.1573330 adhesin B precursor (fimA) [Haemophilus
influenzae] 380 1.60E-68 EF008-2 gi.vertline.1245464 YfeA [Yersinia
pestis] >gi.vertline.1245464 YfeA [Yersinia pestis] 355 1.20E-64
EF008-2 gi.vertline.755075 periplasmic-binding protein
[Synechocystis sp.] 321 1.70E-62
>gnl.vertline.PID.vertline.d1018652 Mn EF008-2
gi.vertline.1335912 EwlA [Erysipelothrix rhusiopathiae] 232
4.40E-42 EF008-2 gnl.vertline.PID.vertline.e118595 similar to ABC
transporter (membrane protein) [Bacillus 204 4.10E-38 EF008-2
gi.vertline.1777933 TroA [Treponema pallidum] 181 2.40E-35 EF009-2
gi.vertline.349531 lipoprotein [Pasteurella haemolytica]
>pir.vertline.JN0752.vertline.JN0752 outer 391 4.00E-64 EF009-2
gi.vertline.1552773 hypothetical [Escherichia coli]
>gnl.vertline.PID.vertline.d1012634 hypothetical 359 1.90E-63
29.4 EF009-2 gi.vertline.294070 lipoprotein 2 [Pasteurella
haemolytica] 391 6.40E-63 EF009-2 gi.vertline.349532 lipoprotein
[Pasteurella haemolytica] >pir.vertline.JN0753.vertline.JN0753
outer 386 1.10E-61 EF009-2 gi.vertline.148838 28 3kDa membrane
protein [Haemophilus influenzae] 286 5.60E-60 EF009-2
gi.vertline.1573614 28 kDa membrane protein (hlpA) [Haemophilus
influenzae] 286 7.60E-60 EF009-2 gi.vertline.294069 lipoprotein 1
[Pasteurella haemolytica] 122 4.70E-59 EF009-2 gi.vertline.146649
lipoprotein-28 precursor [Escherichia coli] >gi.vertline.290510
326 2.20E-58 EF009-2 gi.vertline.349530 lipoprotein [Pasteurella
haemolytica] >gi.vertline.150508 lipoprotein 239 7.80E-57
EF009-2 gi.vertline.294071 lipoprotein 3 [Pasteurella haemolytica]
344 4.90E-56 EF009-2 gi.vertline.2314748 (AE000654) outer membrane
protein [Helicobacter pylori] 319 4.20E-53 EF009-2
gi.vertline.2196996 lipoprotein homolog [Treponema pallidum]
>gi.vertline.2108234 29K 312 2.60E-41 protein EF009-2
gi.vertline.1336657 lipoprotein [Bacillus subtilis] 234 4.00E-32
EF009-2 gnl.vertline.PID.vertline.e233873 hypothetical protein
[Bacillus subtilis] >gnl.vertline.PID.vertline.e1182900 242
1.40E-31 EF009-2 gnl.vertline.PID.vertline.e118435 similar to
hypothetical proteins [Bacillus subtilis] 102 6.80E-22 EF011-2
gnl.vertline.PID.vertline.d100965 ferric anguibactin-binding
protein precusor FatB of V. 579 3.10E-98 EF011-2
gnl.vertline.PID.vertlin- e.d100965 ferric anguibactin-binding
protein precusor FatB of V. 579 3.10E-98 EF011-2
gnl.vertline.PID.vertline.e185374 ceuE gene product [Campylobacter
coli] 284 1.30E-89 EF011-2 gnl.vertline.PID.vertline.e185374 ceuE
gene product [Campylobacter coli] 284 1.30E-89 EF011-2
gi.vertline.150756 40 kDa protein [Plasmid pJM1]
>pir.vertline.A29928.vertline.A29928 222 2.80E-52
membrane-associated EF011-2 gi.vertline.150756 40 kDa protein
[Plasmid pJM1] >pir.vertline.A29928.vertline.A29928 222 2.80E-52
membrane-associated EF012-2 gi.vertline.309662 pheromone binding
protein [Plasmid pCF10] 266 8.70E-116
>pir.vertline.B53309.vertline.B53309 EF012-2 gi.vertline.388269
traC [Plasmid pAD1] >pir.vertline.A53310.vertline.A- 53310
pheromone cAD1 252 1.10E-109 binding EF012-2
gnl.vertline.PID.vertline.d101185 TRAC [Enterococcus faecalis] 281
3.60E-103 EF012-2 gnl.vertline.PID.vertline.d100655 TraC
[Enterococcus faecalis] 277 2.30E-102 EF012-2 gi.vertline.312940
threonine kinase [Streptococcus equisimilis] 227 1.90E-67
>pir.vertline.S28153.vertline.S28153 EF012-2 gi.vertline.48808
dciAE [Bacillus subtilis] 228 1.70E-46 EF012-2
pir.vertline.S16651.vertline.S166 dciAE protein - Bacillus subtilis
228 1.00E-45 EF012-2 gnl.vertline.PID.vertline.e118149 (AJ002571)
DppE [Bacillus subtilis] >gnl.vertline.PID.vertline.e1183316 228
3.80E-45 EF012-2 gi.vertline.40005 OppA gene product [Bacillus
subtilis] 281 3.90E-44 EF012-2 gi.vertline.143603 sporulation
protein [Bacillus subtilis] >gnl.vertline.PID.vertline.e1183163
281 7.70E-44 EF012-2 gnl.vertline.PID.vertline.d101563 Periplasmic
oligopeptide-binding protein precursor. 152 2.20E-43 EF012-2
gi.vertline.1574679 oligopeptide binding protein (oppA)
[Haemophilus influenzae] 178 2.20E-42 EF012-2 gi.vertline.478020
Opp A (AA1-542) [Salmonella typhimurium] >gi.vertline.47808 128
1.00E-37 precursor EF012-2 gi.vertline.882550 ORF_f535 [Escherichia
coli] >gi.vertline.1789397 (AE000384) f535; 228 5.30E-36 This
535 aa EF014-2 pir.vertline.D70070.vertline.D700 transcriptional
regulator homolog ywtF-Bacillus subtilis 101 1.40E-27 EF014-2
gnl.vertline.PID.vertline.e116988 capsular polysaccharide synthesis
protein [Streptococcus 121 9.50E-27 EF014-2 gi.vertline.2804769
(AF030373) putative regulatory protein [Streptococcus 121 9.50E-27
pneumoniae] EF014-2 gnl.vertline.PID.vertline.e289126 unknown
[Streptococcus pneumoniae] 121 1.00E-24 EF014-2 gi.vertline.2267239
ORF1 [Staphylococcus epidermidis] 234 1.50E-24 EF014-2
gi.vertline.485275 putative regulatory protein [Streptococcus
pneumoniae] 121 3.90E-24 EF014-2 gi.vertline.2804735 (AF030367)
putative regulatory protein [Streptococcus 121 3.90E-24 pneumoniae]
EF014-2 gi.vertline.2804747 (AF030369) putative regulatory protein
[Streptococcus 121 3.90E-24 pneumoniae] EF014-2 gi.vertline.1762327
putative transcriptional regulator [Bacillus subtilis] 185 2.80E-22
EF014-2 gi.vertline.143156 membrane bound protein [Bacillus
subtilis] >gnl.vertline.PID.vertline.e1184471 116 1.10E-21
EF014-2 gnl.vertline.PID.vertline.d101895 membrane bound protein
LytR [Synechocystis sp.] 113 6.20E-20 EF014-2 gi.vertline.1276874
EpsA [Streptococcus thermophilus] 103 4.00E-17 EF016-2
gnl.vertline.PID.vertline.e118566 similar to amino acid ABC
transporter (binding protein) 194 3.70E-35 EF016-2
gi.vertline.40934 arginine binding protein [Escherichia coli]
>gi.vertline.769794 artJ 121 1.60E-31 EF016-2
gnl.vertline.PID.vertline.d101527 Arginine-binding periplasmic
protein 2 precursor [Escherichia 121 4.80E-31 EF016-2
gi.vertline.687652 FliY [Escherichia coli]
>gnl.vertline.PID.vertline.d1016464 FliY protein 160 5.70E-31
precursor. EF016-2 gi.vertline.2650410 (AE001090) glutamine ABC
transporter, periplasmic glutamine- 122 3.30E-29 binding EF016-2
gi.vertline.1649035 high-affinity periplasmic glutamine binding
protein [Salmonella 104 1.80E-27 EF016-2 gi.vertline.1574634
glutamine-binding periplasmic protein (glnH) [Haemophilus 174
2.50E-27 EF016-2 gi.vertline.41569 GlnH precursor (AA -22 to 226)
[Escherichia coli] 106 4.70E-27 >gnl.vertline.PID.vertline.d-
1015250 EF016-2 gnl.vertline.PID.vertline.d101527 Arginine-binding
periplasmic protein 1 precursor [Escherichia 109 3.70E-26 EF016-2
gi.vertline.769791 artI [Escherichia coli] >gi.vertline.769791
artI [Escherichia coli] 127 2.30E-25 EF016-2
gnl.vertline.PID.vertline- .d100892 homologous to Gln-binding
periplasmic proteins [Bacillus 117 8.50E-24 EF016-2
gi.vertline.154125 J protein [Salmonella typhimurium]
>gi.vertline.47718 reading frame 118 2.10E-23 hisJ EF016-2
gnl.vertline.PID.vertline.d101688 HISTIDINE-BINDING PERIPLASMIC
PROTEIN 117 4.50E-23 PRECURSOR (HBP). EF016-2 gi.vertline.1166636
histidine-binding periplasmic protein HisJ [Escherichia coli] 117
6.60E-23 EF017-2 gi.vertline.388269 traC [Plasmid pAD1]
>pir.vertline.A53310.vertline.A53310 pheromone cAD1 421
4.50E-128 binding EF017-2 gnl.vertline.PID.vertline.d10- 1185 TRAC
[Enterococcus faecalis] 417 5.10E-124 EF017-2
gnl.vertline.PID.vertline.d100655 TraC [Enterococcus faecalis] 414
4.40E-123 EF017-2 gi.vertline.309662 pheromone binding protein
[Plasmid pCF10] 415 2.40E-119 >pir.vertline.B53309.vertline.-
B53309 EF017-2 gi.vertline.40005 OppA gene product [Bacillus
subtilis] 294 6.20E-82 EF017-2 gi.vertline.143603 sporulation
protein [Bacillus subtilis] >gnl.vertline.PID.vertline.e1183163
290 2.80E-79 EF017-2 gi.vertline.312940 threonine kinase
[Streptococcus equisimilis] 241 2.40E-71
>pir.vertline.S28153.vertline.S28153 EF017-2 gi.vertline.48808
dciAE [Bacillus subtilis] 270 1.10E-61 EF017-2
gnl.vertline.PID.vertline.e118149 (AJ002571) DppE [Bacillus
subtilis] >gnl.vertline.PID.vertline.e1183316 270 1.50E-61
EF017-2 pir.vertline.S16651.vertline.S166 dciAE protein-Bacillus
subtilis 270 3.10E-60 EF017-2 gi.vertline.304925 periplasmic
oligopeptide binding protein [Escherichia coli] 171 2.60E-57
EF017-2 gi.vertline.147014 oligopeptide binding protein precursor
[Escherichia coli] 171 8.70E-56 EF017-2 gi.vertline.47802 Opp A
(AA1-542) [Salmonella typhimurium] >gi.vertline.47808 154
1.30E-52 precursor EF017-2 gi.vertline.882550 ORF_f535 [Escherichia
coli] >gi.vertline.1789397 (AE000384) f535; 135 5.50E-52 This
535 aa EF017-2 gi.vertline.1574679 oligopeptide binding protein
(oppA) [Haemophilus influenzae] 168 2.90E-43 EF019-2
gi.vertline.438458 likely N-terminal signal sequence; mature
protein probably 104 2.30E-17 EF021-2
gnl.vertline.PID.vertline.e311492 unknown [Bacillus subtilis]
>gnl.vertline.PID.vertline.e1184232 similar to ABC 317 2.50E-103
EF021-2 bbs.vertline.173803 CD4+ T cell-stimulating antigen
[Listeria monocytogenes, 476 2.80E-81 85EO-1167, EF021-2
gi.vertline.581809 tmbC gene product [Treponema pallidum]
>pir.vertline.A43595.vertline.A43595 152 3.20E-71 membrane
EF021-2 gi.vertline.2688280 (AE001143) basic membrane protein C
(bmpC) [Borrelia 101 5.50E-27 burgdorferi] EF021-2
gnl.vertline.PID.vertline.e117283 membrane protein A [Borrelia
garinii] 142 6.50E-22 EF021-2 gnl.vertline.PID.vertline- .e117283
membrane protein A [Borrelia burgdorferi] 141 9.20E-22 EF021-2
gnl.vertline.PID.vertline.e117283 membrane protein A [Borrelia
burgdorferi] >gi.vertline.516592 141 9.20E-22 membrane EF021-2
gnl.vertline.PID.vertline.e117283 bmpA(p39,ORF1) [Borrelia
burgdorferi] 141 1.70E-21 EF021-2 gi.vertline.508421 antigen P39
[Borrelia burgdorferi] >gi.vertline.2688281 (AE001143) 141
1.70E-21 basic EF021-2 gi.vertline.1753225 BmpA protein [Borrelia
burgdorferi] 141 2.70E-20 EF021-2 gnl.vertline.PID.vertline.e1172-
82 membrane protein A [Borrelia afzelii] 141 8.60E-20 EF021-2
gnl.vertline.PID.vertline.e117283 membrane protein A [Borrelia
afzelii] 141 8.60E-20 EF021-2 gnl.vertline.PID.vertline.e117283
membrane protein A [Borrelia afzelii] 141 8.60E-20 EF021-2
gnl.vertline.PID.vertline.e117282 bmpA(p39,ORF1) [Borrelia
burgdorferi] 141 1.50E-19 EF022-2 gi.vertline.312940 threonine
kinase [Streptococcus equisimilis] 324 5.90E-66
>pir.vertline.S28153.vertline.S28153 EF022-2 gi.vertline.309662
pheromone binding protein [Plasmid pCF10] 307 5.60E-60
>pir.vertline.B53309.vertline.B53309 EF022-2
gnl.vertline.PID.vertline.d101185 TRAC [Enterococcus faecalis] 301
4.80E-59 EF022-2 gnl.vertline.PID.vertline.e118149 (AJ002571) DppE
[Bacillus subtilis] >gnl.vertline.PID.vertline.e1183316 170
5.10E-59 EF022-2 gi.vertline.48808 dciAE [Bacillus subtilis] 170
5.20E-59 EF022-2 gnl.vertline.PID.vertline.d100655 TraC
[Enterococcus faecalis] 299 2.80E-58 EF022-2
pir.vertline.S16651.vertline.S166 dciAE protein - Bacillus subtilis
170 1.60E-57 EF022-2 gi.vertline.388269 traC [Plasmid pAD1]
>pir.vertline.A53310.vertline.A53310 pheromone cAD1 280 2.70E-53
binding EF022-2 gi.vertline.40005 OppA gene product [Bacillus
subtilis] 154 7.30E-48 EF022-2 gi.vertline.143603 sporulation
protein [Bacillus subtilis] >gnl.vertline.PID.vertline.e1183163
154 3.10E-47 EF022-2 gi.vertline.2688227 (AE001139) oligopeptide
ABC transporter, periplasmic 215 1.00E-36 EF022-2
gi.vertline.2281458 (AF000366) oligopeptide permease homolog AII
[Borrelia 215 1.00E-36
burgdorferi] EF022-2 gi.vertline.304925 periplasmic oligopeptide
binding protein [Escherichia coli] 131 1.30E-35 EF022-2
gi.vertline.147014 oligopeptide binding protein precursor
[Escherichia coli] 131 1.80E-34 EF022-2 gi.vertline.47802 Opp A
(AA1-542) [Salmonella typhimurium] >gi.vertline.47808 138
4.90E-34 precursor EF023-2 gi.vertline.309662 pheromone binding
protein [Plasmid pCF10] 231 4.70E-66 >pir.vertline.B53309.ve-
rtline.B53309 EF023-2 gi.vertline.388269 traC [Plasmid pAD1]
>pir.vertline.A53310.vertline.A53310 pheromone cAD1 223 4.80E-62
binding EF023-2 gnl.vertline.PID.vertline.d101185 TRAC
[Enterococcus faecalis] 226 1.00E-58 EF023-2
gnl.vertline.PID.vertline.d100655 TraC [Enterococcus faecalis] 226
4.40E-58 EF023-2 gi.vertline.48808 dciAE [Bacillus subtilis] 157
1.20E-57 EF023-2 gnl.vertline.PID.vertline.e118149 (AJ002571) DppE
[Bacillus subtilis] >gnl.vertline.PID.vertline.e1183316 157
1.20E-57 EF023-2 pir.vertline.S16651.vertline.S166 dciAE protein -
Bacillus subtilis 157 3.80E-56 EF023-2 gi.vertline.40005 OppA gene
product [Bacillus subtilis] 137 2.30E-53 EF023-2 gi.vertline.143603
sporulation protein [Bacillus subtilis]
>gnl.vertline.PID.vertline.e1183163 133 6.90E-53 EF023-2
gi.vertline.47802 Opp A (AA1-542) [Salmonella typhimurium]
>gi.vertline.47808 135 2.00E-41 precursor EF023-2
gi.vertline.2688227 (AE004139) oligopeptide ABC transportcr,
periplasmic 187 9.40E-41 EF023-2 gi.vertline.2281458 (AF000366)
oligopeptide permease homolog AII [Borrelia 187 1.90E-40
burgdorferi] EF023-2 gi.vertline.882550 ORF_f535 [Escherichia coli]
>gi.vertline.1789397 (AE000384) f535; 155 1.30E-38 This 535 aa
EF023-2 gi.vertline.304925 periplasmic oligopeptide binding protein
[Escherichia coli] 130 9.00E-37 EF023-2 gi.vertline.147014
oligopeptide binding protein precursor [Escherichia coli] 130
3.70E-34 EF026-2 gi.vertline.2352482 (AF005097) unknown
[Lactococcus lactis] 141 1.10E-23 EF027-2 gi.vertline.309662
pheromone binding protein [Plasmid pCF10] 198 6.20E-71
>pir.vertline.B53309.vertline.B53309 EF027-2
gnl.vertline.PID.vertline.d100655 TraC [Enterococcus faecalis] 202
1.50E-68 EF027-2 gnl.vertline.PID.vertline.d101185 TRAC
[Enterococcus faecalis] 202 1.50E-68 EF027-2 gi.vertline.388269
traC [Plasmid pAD1] >pir.vertline.A53310.vertline.A53310
pheromone cAD1 213 8.30E-68 binding EF027-2
gnl.vertline.PID.vertline.e118149 (AJ002571) DppE [Bacillus
subtilis] >gnl.vertline.PID.vertline.e1183316 222 3.70E-41
EF027-2 gi.vertline.48808 dciAE [Bacillus subtilis] 222 4.90E-41
EF027-2 pir.vertline.S16651.vertline.S166 dciAE protein - Bacillus
subtilis 222 1.10E-39 EF027-2 gi.vertline.40005 OppA gene product
[Bacillus subtilis] 251 4.10E-39 EF027-2 gi.vertline.143603
sporulation protein [Bacillus subtilis]>
gnl.vertline.PID.vertline.e1183163 247 5.80E-39 EF027-2
gi.vertline.312940 threonine kinase [Streptococcus equisimilis] 233
8.90E-33 >pir.vertline.S28153.vertline.S28153 EF027-2
gi.vertline.2688227 (AE001139) oligopeptide ABC transporter,
periplasmic 131 2.40E-24 EF027-2 gi.vertline.2281458 (AF000366)
oligopeptide permease homolog AII [Borrelia 131 2.40E-24
burgdorferi] EF027-2 gi.vertline.2281468 (AF000948) OppAIV
[Borrelia burgdorferi] >gi.vertline.2689891 117 3.00E-20
(AE000792) EF027-2 gi.vertline.1574679 oligopeptide binding protein
(oppA) [Haemophilus influenzae] 130 3.50E-20 EF028-2
gnl.vertline.PID.vertline.d10204- 7 B. subtilis alkaline
phosphatase IIIA; P19405 secretory 996 3.60E-131 EF028-2
pir.vertline.B39096.vertline.B390 alkaline phosphatase (EC 3.1.3.1)
III precursor - Bacillus 982 2.90E-129 EF028-2 gi.vertline.470383
alkaline phosphatase A [Bacillus subtilis]
>gnl.vertline.PID.vertline.e1182942 803 4.80E-119 EF028-2
gi.vertline.143324 APase I [Bacillus licheniformis]
>pir.vertline.A44828.vertline.A44828 alkaline 184 3.00E-54
EF028-2 gi.vertline.147243 alkaline phosphatase precursor (BC
3.1.3.1) [Escherichia coli] 183 8.30E-54 EF028-2 gi.vertline.147237
alkaline phosphatase precursor (BC 3.1.3.1) [Escherichia coli] 178
4.40E-53 EF028-2 gi.vertline.147239 alkaline phosphatase precursor
(BC 3.1.3.1) [Escherichia coli] 178 4.40E-53 EF028-2
gi.vertline.147241 alkaline phosphatase precursor (BC 3.1.3.1)
[Escherichia coli] 178 4.40E-53 EF028-2 gi.vertline.1277127 phoA
gene product [Cloning vector pFW_phoA1] > gi.vertline.1277130
174 4.90E-53 phoA gene EF028-2 gi.vertline.147229 alkaline
phosphatase precursor (EC 3.1.3.1) [Escherichia coli] 178 8.40E-53
EF028-2 gi.vertline.818851 alkaline phosphatase [synthetic
construct] 174 1.10E-52 EF028-2 gi.vertline.147245 alkaline
phosphatase (phoA) (EC 3.1.3.1) [Escherichia 177 1.20E-52
fergusonii] EF028-2 gi.vertline.147231 alkaline phosphatase
precursor (EC 3.1.3.1) [Escherichia coli] 174 1.60E-52 EF028-2
gi.vertline.147235 alkaline phosphatase precursor (EC 3.1.3.1)
[Escherichia coli] 174 1.60E-52 EF028-2 gi.vertline.1016010
alkaline phosphatase with N-terminal PelB-leader and C- 174
1.60E-52 terminal EF029-2 gi.vertline.1750126 YncB [Bacillus
subtilis] >gnl.vertline.PID.vertline.e1183421 similar to 257
3.50E-55 micrococcal EF029-2 gnl.vertline.PID.vertline.e118360
similar to hypothetical proteins [Bacillus subtilis] 263 7.80E-53
EF029-2 gi.vertline.673492 nuclease [Staphylococcus aureus]
>pir.vertline.A00790.vertline.NCSAF 320 2.20E-39 micrococcal
EF029-2 gi.vertline.532653 thermonuclease [Staphylococcus hyicus]
155 9.10E-39 EF029-2 gi.vertline.47146 thermonuclease
[Staphylococcus intermedius] 145 4.90E-32
>pir.vertline.526079.vertline.S26079 EF030-2 gi.vertline.48808
dciAE [Bacillus subtilis] 149 1.10E-66 EF030-2
gnl.vertline.PID.vertline.e118149 (AJ002571) DppE [Bacillus
subtilis] >gnl.vertline.PID.vertline.e1183316 149 1.50E-66
EF030-2 pir.vertline.S16651.vertline.S166 dciAE protein - Bacillus
subtilis 149 5.90E-66 EF030-2 gi.vertline.309662 pheromone binding
protein [Plasmid pCF10] 227 7.40E-52
>pir.vertline.B53309.vertline.B- 53309 EF030-2
gnl.vertline.PID.vertline.d101185 TRAC [Enterococeus faecalis] 237
7.40E-52 EF030-2 gnl.vertline.PID.vertline.d100655 TraC
[Enterococcus faecalis] 233 9.70E-51 EF030-2 gi.vertline.388269
traC [Plasmid pAD1] >pir.vertline.A53310.vertline.A- 53310
pheromone cAD1 229 3.00E-48 binding EF030-2 gi.vertline.312940
threonine kinase [Streptococcus equisimilis] 277 3.00E-45
>pir.vertline.S28153.vertline.S28153 EF030-2 gi.vertline.47802
Opp A (AA1-542) [Salmonella typhimurium] >gi.vertline.47808 125
8.50E-34 precursor EF030-2 gi.vertline.2688227 (AE001139)
oligopeptide ABC transporter, periplasmic 211 4.80E-31 EF030-2
gi.vertline.2281458 (AF000366) oligopeptide permease homolog AII
[Borrelia 211 4.80E-31 burgdorferi] EF030-2 gi.vertline.40005 OppA
gene product [Bacillus subtilis] 148 1.20E-30 EF030-2
gi.vertline.143603 sporulation protein [Bacillus subtilis]
>gnl.vertline.PID.vertline.e1183163 144 4.80E-30 EF030-2
gi.vertline.2281468 (AF000948) OppAIV [Borrelia burgdorferi]
>gi.vertline.2689891 210 2.10E-29 (AE000792) EF030-2
gi.vertline.1574679 oligopeptide binding protein (oppA)
[Haemophilus influenzae] 148 6.00E-29 EF033-2
gnl.vertline.PID.vertline.e11843- 9 similar to iron-binding protein
[Bacillus subtilis] 164 2.60E-14 EF033-2
pir.vertline.S54437.vertline.S544 hemin binding protein - Yersinia
enterocolitica 108 1.40E-11 EF033-2 gi.vertline.1619623 hemin
binding protein [Yersinia enterocolitica] 108 2.00E-11 EF036-2
gnl.vertline.PID.vertline.d101022 ORF108 [Bacillus subtilis]
>gnl.vertline.PID.vertline.e1185766 alternate gene 544 1.20E-96
EF036-2 gi.vertline.2622858 (AE000929) phosphate-binding protein
PstS 183 1.40E-45 [Methanobacterium EF036-2 gi.vertline.2622859
(AE000929) phosphate-binding protein PstS homolog 158 2.40E-41
[Methanobacterium EF036-2 gi.vertline.2688115 (AE001132) phosphate
ABC transporter, periplasmic 117 1.10E-12 phosphate-binding EF037-2
gi.vertline.2352482 (AF005097) unknown [Lactococcus lactis] 141
1.10E-23 EF040-2 gi.vertline.1657516 hypothetical protein
[Escherichia coli] >gi.vertline.1786511 (AE000139) 208 1.90E-29
EF040-2 gi.vertline.293265 2-5A-dependent RNase [Mus musculus]
>pir.vertline.B45771.vertline.B45771 105 1.00E-17 EF040-2
gi.vertline.287865 G9a [Homo sapiens]
>pir.vertline.S30385.vertline.S30385 G9a protein - human 143
8.30E-14 EF040-2 gi.vertline.311817 erythroid ankyrin [Mus
musculus] >pir.vertline.S37771.vertline.S37771 ankyrin, 119
4.80E-13 EF040-2 gi.vertline.191940 ankyrin [Mus musculus]
>pir.vertline.I49502- .vertline.I49502 ankyrin - mouse 119
4.90E-13 EF040-2 gi.vertline.747710 alt. ankyrin (variant 2.2)
[Homo sapiens] 120 1.50E-12 EF040-2 gi.vertline.178646 ankyrin
[Homo sapiens] 120 1.80E-12 EF040-2 gi.vertline.1845265 ankyrin
[Homo sapiens] 120 1.80E-12 EF040-2
pir.vertline.A35049.vertline.A350 ankyrin 1, erythrocyte splice
form 2-human 120 1.80E-12 EF040-2 pir.vertline.B35049.vert-
line.B350 ankyrin 1, erythrocyte splice form 3-human 120 1.80E-12
EF040-2 gi.vertline.28702 ankyrin (variant 2.1) [Homo sapiens]
>pir.vertline.S08275.vertline.SJHUK 120 1.80E-12 ankyrin 1,
EF041-2 gi.vertline.388269 traC [Plasmid pAD1]
>pir.vertline.A53310.vertline.A53310 pheromone cAD1 670 1.40E-87
binding EF041-2 gnl.vertline.PID.vertline.d100655 TraC
[Enterococcus faecalis] 662 1.50E-85 EF041-2
gnl.vertline.PID.vertline.d101185 TRAC [Enterococcus faecalis] 662
1.50E-85 EF041-2 gi.vertline.309662 pheromone binding protein
[Plasmid pCF10] 648 1.20E-83 >pir.vertline.B53309.vertline.B-
53309 EF041-2 gi.vertline.48808 dciAE [Bacillus subtilis] 218
1.20E-57 EF041-2 gnl.vertline.PID.vertline.e118149 (AJ002571) DppE
[Bacillus subtilis] >gnl.vertline.PID.vertline.e1183316 218
1.40E-57 EF041-2 pir.vertline.S16651.vertline.S166 dciAE protein -
Bacillus subtilis 218 2.10E-56 EF041-2 gi.vertline.882550 ORF_f535
[Escherichia coli] >gi.vertline.1789397 (AE000384) f535; 146
7.30E-40 This 535 aa EF041-2 gi.vertline.143603 sporulation protein
[Bacillus subtilis] >gnl.vertline.PID.vertline.e11- 83163 278
1.00E-34 EF041-2 gi.vertline.40005 OppA gene product [Bacillus
subtilis] 279 1.00E-34 EF041-2 gi.vertline.47802 Opp A (AA1-542)
[Salmonella typhimurium] >gi.vertline.47808 141 6.60E-30
precursor EF041-2 gi.vertline.304925 periplasmic oligopeptide
binding protein [Escherichia coli] 160 1.90E-29 EF041-2
gi.vertline.1574679 oligopeptide binding protein (oppA)
[Haemophilus influenzae] 163 1.00E-28 EF041-2 gi.vertline.147014
oligopeptide binding protein precursor [Escherichia coli] 160
1.50E-28 EF041-2 gi.vertline.2253286 (AF005657) plasminogen binding
protein [Borrelia burgdorferi] 134 5.00E-27 EF045-2
gi.vertline.308854 oligopeptide binding protein [Lactococcus
lactis] 437 3.20E-125 >pir.vertline.E53290.vertline.E53290
EF045-2 gi.vertline.495181 oligopeptide binding protein
[Lactococcus lactis] 426 9.70E-124 EF045-2 gi.vertline.677945 AppA
[Bacillus subtilis] >gnl.vertline.PID.vertline.e1183158
oligopeptide ABC 154 2.30E-31 EF045-2 gi.vertline.293014
peptide-binding protein [Lactococcus lactis] 158 2.40E-14
>pir.vertline.B47098.vertline.B47098 EF048-2 gi.vertline.1574060
hypothetical [Haemophilus influenzae]
>pir.vertline.I64164.vertline.I64164 250 2.30E-41 EF048-2
dbj.vertline..vertline.AB0014882 (AB001488) SIMILAR TO
C4-DICARBOXYLATE- 208 3.60E-34 BINDING PERIPLASMIC EF048-2
gi.vertline.466717 No definition line found [Escherichia coli]
>gi.vertline.1790004 199 1.30E-30 (AE000435) EF048-2
gi.vertline.46006 periplasmic C4-dicarboxylate binding-protein
[Rhodobacter 162 1.40E-25 capsulatus] EF048-2 gi.vertline.1573102
hypothetical [Haemophilus influenzae]
>pir.vertline.H641431.vertline.H64143 244 3.80E-25 EF048-2
gi.vertline.2182530 (AE000085) Y4mM [Rhizobium sp. NGR234] 114
5.60E-18 EF048-2 gi.vertline.1572999 hypothetical [Haemophilus
influenzae] >pir.vertline.E64141.vertline.E64141 116 5.90E-15
EF049-2 gi.vertline.149581 maturation protein [Lactobacillus
paracasei] 241 2.40E-55 >pir.vertline.A44858.vertline.A44858
EF049-2 gi.vertline.47198 ORF (AA 1 to 299) [Lactococcus lactis
cremoris] 239 1.00E-54 >pir.vertline.S08083.vertline.S08083
EF049-2 gi.vertline.432402 maturation protein [Lactococcus lactis]
>gi.vertline.623055 proteinase 239 6.20E-54 EF049-2
gi.vertline.472835 ORF1 [Lactococcus lactis cremoris] 241 1.50E-53
EF049-2 gi.vertline.39782 33kDa lipoprotein [Bacillus subtilis]
>gnl.vertline.PID.vertline.e325181 33kDa 128 8.90E-40 EF051-2
gnl.vertline.PID.vertline.d101142 molybdate-binding periplasmic
protein [Synechocystis sp.] 173 3.20E-50 EF051-2
gnl.vertline.PID.vertlin- e.e118602 alternate gene name: yvsD;
similar to molybdate-binding 314 5.90E-50 EF051-2
gi.vertline.1574546 lsg locus hypothetical [Haemophilus influenzae]
161 2.20E-43 >pir.vertline.A64175.v- ertline.A64175 EF051-2
gi.vertline.504498 periplasmic molybdate-binding protein
[Escherichia coli] 148 1.40E-30 >gi.vertline.1147817 EF051-2
gi.vertline.148939 ORF 8 [Haemophilus influenzae]
>pir.vertline.S27583.vertline.S27S83 150 8.10E-28 hypothetical
EF054-2 gi.vertline.150556 surface protein [Plasmid pCF10]
>pir.vertline.A41826.vertline.A41826 probable 1490 1.80E-192
EF054-2 gnl.vertline.PID.vertline.e236571 cell wall anchoring
signal [Enterococcus faecalis] 515 8.10E-64 EF054-2
gi.vertline.45738 ORFC [Enterococcus faecalis]
>pir.vertline.JH0204.vertline.JH0204 372 1.60E-58 hypothetical
30.5K EF054-2 gi.vertline.496520 orf iota [Streptococcus pyogenes]
>pir.vertline.S68125.vertline.S45091 362 1.30E-43 hypothetical
EF054-2 gi.vertline.160693 sporozoite surface protein [Plasmodium
yoelii] 286 4.30E-33 >pir.vertline.A45559.vertline.A45559
EF054-2 gi.vertline.1813523 PbTRAP [Plasmodium berghei] 305
1.30E-32 EF054-2 gnl.vertline.PID.vertline.e225687 zinc finger
protein [Mus musculus] >gnl.vertline.PID.vertline.e225688 zinc
246 3.60E-26 EF054-2 gi.vertline.2290394 IgG and IgE immunoreactive
antigen recognized by sera from 242 1.40E-25 patients EF054-2
gi.vertline.2290392 IgG and IgE immunoreactive antigen recognized
by sera from 237 7.80E-25 patients EF054-2 gi.vertline.46523 B
antigen [Streptococcus agalactiae] 232 2.80E-23 EF054-2
pir.vertline.S15330.vertline.FCSO IgA Fc receptor precursor -
Streptococcus agalactiae 228 1.00E-22 EF054-2 gi.vertline.1620100
Pro- and Glu-rich, PENPEV (10.times.); similar to Streptococcus B
210 3.10E-21 EF054-2 gi.vertline.63686 NF-M c-terminus [Gallus
gallus] 222 6.90E-21 EF054-2 gi.vertline.63689 NF-M protein [Gallus
gallus] >pir.vertline.S15762.vertline.S15762 222 8.50E-21
neurofilament triplet EF054-2 gi.vertline.757867 TATA-box like
sequence (Us11) [Human herpesvirus 1] 194 4.10E-19
>gi.vertline.291493 18 EF059-2 gnl.vertline.PID.vertline.e2365-
71 cell wall anchoring signal [Enterococcus faecalis] 418 5.60E-95
EF059-2 gi.vertline.150556 surface protein [Plasmid pCF10] >pir
A41826.vertline.A41826 probable 606 3.70E-87 EF059-2
gi.vertline.45738 ORFC [Enterococcus faecalis]
>pir.vertline.JH0204.ve- rtline.JH0204 366 9.30E-50 hypothetical
30.5K EF059-2 gi.vertline.496520 orf iota [Streptococcus pyogenes]
>pir.vertline.S68125.vertline.S545091 367 5.90E-44 hypothetical
EF059-2 gi.vertline.160693 sporozoite surface protein [Plasmodium
yoelii] 344 1.10E-38 >pir.vertline.A45559.vertline.A45559
EF059-2 gi.vertline.1813523 PbTRAP [Plasmodium berghei] 295
2.50E-32 EF059-2 gi.vertline.2290394 IgG and IgE immunoreactive
antigen recognized by sera from 251 3.00E-29 patients EF059-2
gi.vertline.2290392 IgG and IgE immunoreactive antigen recognized
by sera
from 251 3.40E-29 patients EF059-2 gi.vertline.162100 Pro- and
Glu-rich, PENPEV (10.times.); similar to Streptococcus B 253
6.40E-27 EF059-2 gi.vertline.46521 Fc receptor [Streptococcus
agalactiae] >pir.vertline.A60234.vertline.A60234 IgA 197
2.70E-26 Fc EF059-2 gi.vertline.46523 B antigen [Streptococcus
agalactiae] 232 9.30E-26 EF059-2 pir.vertline.S15330.vertline.FCS-
O IgA Fc receptor precursor - Streptococcus agalactiae 232 9.30E-26
EF059-2 gnl.vertline.PID.vertline.e225687 zinc finger protein [Mus
musculus] >gnl.vertline.PID.vertline.e225688 zinc 234 1.40E-22
EF059-2 gi.vertline.425356 zona pellucida protein
[Pseudopleuronectes americanus] 229 1.00E-21 EF059-2
gi.vertline.457769 Collagen [Bombyx mori]
>pir.vertline.S42886.vertline.S42886 collagen - 209 7.60E-19
silkworm EF061-2 gnl.vertline.PID.vertline.e23- 6571 cell wall
anchoring signal [Enterococcus faecalis] 925 8.10E-118 EF061-2
gi.vertline.150556 surface protein [Plasmid pCF10]
>pir.vertline.A41826.vertline.A41826 probable 350 1.50E-107
EF061-2 gi.vertline.496520 orf iota [Streptococcus pyogenes]
>pir.vertline.S68125.vertline.S45091 308 1.40E-58 hypothetical
EF061-2 gi.vertline.45738 ORFC [Enterococcus faecalis]
>pir.vertline.JH0204.vertline.JH0204 322 6.40E-50 hypothetical
30.5K EF061-2 gi.vertline.1813523 PbTRAP [Plasmodium berghei] 263
1.00E-26 EF061-2 gi.vertline.160693 sporozoite surface protein
[Plasmodium yoelii] 241 9.00E-25
>pir.vertline.A45559.vertline.A45559 EF061-2 gi.vertline.63686
NF-M c-terminus [Gallus gallus] 232 2.10E-22 EF061-2
gi.vertline.63689 NF-M protein [Gallus gallus]
>pir.vertline.S15762.ve- rtline.S15762 232 2.60E-22
neurofilament triplet EF061-2 gi.vertline.2290392 IgG and IgE
immunoreactive antigen recognized by sera from 176 2.40E-21
patients EF061-2 gi.vertline.1620100 Pro- and Glu-rich, PENPEV
(10.times.); similar to Streptococcus B 165 2.70E-20 EF061-2
gnl.vertline.PID.vertline.e225687 zinc finger protein [Mus
musculus] >gnl.vertline.PID.vertline.e225688 zinc 197 7.80E-19
EF061-2 gi.vertline.160355 interspersed repeat antigen [Plasmodium
falciparum] 199 8.20E-18 EF061-2 gi.vertline.410750 interspersed
repeat antigen [Plasmodium falciparum] 199 8.90E-18 EF061-2
gi.vertline.2290388 IgG and IgE immunoreactive antigen recognized
by sera from 182 1.40E-17 patients EF061-2 gi.vertline.2290394 IgG
and IgE immunoreactive antigen recognized by sera from 180 2.80E-17
patients EF062-2 gi.vertline.47049 asal gene product (AA 1-1296)
[Enterococcus faecalis] 3716 0 EF062-2 gi.vertline.43324
aggregation substance (ASP1) [Enterococcus faecalis] 4003 0 EF062-2
gi.vertline.2109266 aggregation substance [Enterococcus faecium]
5523 0 EF062-2 gi.vertline.150555 aggregation substance [Plasmid
pCF10] >pir.vertline.H41662.vertline.H41662 6338 0 150K mating
EF062-2 gi.vertline.1100973 SspB precursor [Streptococcus gordonii]
110 9.90E-39 EF062-2 gi.vertline.47248 PAc protein precursor (AA
-38 to 1527) [Streptococcus 107 1.70E-38 mutans] EF062-2
gnl.vertline.PID.vertline.d101507 surface protein antigen precursor
[Streptococcus sobrinus] 132 5.00E-36 EF062-2 gi.vertline.47267
cell surface antigen I/II [Streptococcus mutans] 107 6.50E-36
>pir.vertline.S06839.vertline.S06839 EF062-2 bbs.vertline.148453
SpaA=endocarditis immunodominant antigen [Streptococcus 132
1.20E-35 sobrinus, EF062-2 gi.vertline.47620 antigen I/II
[Streptococcus sobrinus] >pir.vertline.A60338.vertline.A60338
132 2.90E-35 surface EF062-2 pir.vertline.A35186.vertline.A351
salivary agglutinin receptor precursor - Streptococcus 109 2.10E-34
EF062-2 gi.vertline.1100971 SspA [Streptococcus gordonii] 110
3.80E-32 EF062-2 gi.vertline.1100975 SspA [Streptococcus gordonii]
110 2.30E-21 EF063-2 gi.vertline.47049 asal gene product (AA
1-1296) [Enterococcus faecalis] 3716 0 EF063-2 gi.vertline.43324
aggregation substance (ASP1) [Enterococcus faecalis] 4003 0 EF063-2
gi.vertline.2109266 aggregation substance [Enterococcus faecium]
5523 0 EF063-2 gi.vertline.150555 aggregation substance [Plasmid
pCF10] >pir.vertline.H41662.vertline.H41662 6338 0 150K mating
EF063-2 gi.vertline.1100973 SspB precursor [Streptococcus gordonii]
110 9.90E-39 EF063-2 gi.vertline.47248 PAc protein precursor (AA
-38 to 1527) [Streptococcus 107 1.70E-38 mutans] EF063-2
gnl.vertline.PID.vertline.d101507 surface protein antigen precursor
[Streptococcus sobrinus] 132 5.00E-36 EF063-2 gi.vertline.47267
cell surface antigen I/II [Streptococcus mutans] 107 6.50E-36
>pir.vertline.S06839.vertline.S06839 EF063-2 bbs.vertline.148453
SpaA=endocarditis immunodominant antigen [Streptococcus 132
1.20E-35 sobrinus, EF063-2 gi.vertline.47620 antigen I/II
[Streptococcus sobrinus] >pir.vertline.A60338.vertline.A60338
132 2.90E-35 surface EF063-2 pir.vertline.A35186.vertline.A351
salivary agglutinin receptor precursor - Streptococcus 109 2.10E-34
EF063-2 gi.vertline.1100971 SspA [Streptococcus gordonii] 110
3.80E-32 EF063-2 gi.vertline.1100975 SspA [Streptococcus gordonii]
110 2.30E-21 EF064-2 gi.vertline.47049 asal gene product (AA
1-1296) [Enterococcus faecalis] 3716 0 EF064-2 gi.vertline.43324
aggregation substance (ASP1) [Enterococcus faecalis] 4003 0 EF064-2
gi.vertline.2109266 aggregation substance [Enterococcus faecium]
5523 0 EF064-2 gi.vertline.150555 aggregation substance [Plasmid
pCF10] >pir.vertline.H41662.vertline.H41662 6338 0 150K mating
EF064-2 gi.vertline.1100973 SspB precursor [Streptococcus gordonii]
110 9.90E-39 EF064-2 gi.vertline.47248 PAc protein precursor (AA
-38 to 1527) [Streptococcus 107 1.70E-38 mutans] EF064-2
gnl.vertline.PID.vertline.d101507 surface protein antigen precursor
[Streptococcus sobrinus] 132 5.00E-36 EF064-2 gi.vertline.47267
cell surface antigen I/II [Streptococcus mutans] 107 6.50E-36
>pir.vertline.S06839.vertline.S06839 EF064-2 bbs.vertline.148453
SpaA=endocarditis immunodominant antigen [Streptococcus 132
1.20E-35 sobrinus, EF064-2 gi.vertline.47620 antigen I/II
[Streptococcus sobrinus] >pir.vertline.A60338.vertline.A60338
132 2.90E-35 surface EF064-2 pir.vertline.A35186.vertline.A351
salivary agglutinin receptor precursor - Streptococcus 109 2.10E-34
EF064-2 gi.vertline.1100971 SspA [Streptococcus gordonii] 110
3.80E-32 EF064-2 gi.vertline.1100975 SspA [Streptococcus gordonii]
110 2.30E-21 EF068-2 gi.vertline.790398 T06D8.1 [Caenorhabditis
elegans] 137 8.50E-17 EF068-2 gnl.vertline.PID.vertline.d102084
membrane glycoprotein [Equine herpesvirus 1] 210 5.80E-16 EF068-2
gi.vertline.2286204 (AF011339) unknown [Acinetobacter
calcoaceticus] 121 8.40E-16 EF068-2 gi.vertline.330862 membrane
glycoprotein [Equine herpesvirus 1] 208 1.10E-15
>pir.vertline.H36802.vertline.VG- BEX1 EF068-2
gi.vertline.1707247 partial CDS [Caenorhabditis elegans] 131
3.70E-15 EF068-2 gnl.vertline.PID.vertline.d102084 membrane
glycoprotein [Equine herpesvirus 1] 203 6.20E-15 EF068-2
gi.vertline.213392 antifreeze glycoprotein [Notothenia coriiceps]
102 4.60E-13 >pir.vertline.A38420.vertline.A38420 EF068-2
gnl.vertline.PID.vertline.e125464 (AL022022) PGRS-family protein
[Mycobacterium 145 1.50E-12 tuberculosis] EF068-2
gi.vertline.951460 FIM-C.1 gene product [Xenopus laevis]
>pir.vertline.A45155.vertline.A45155 109 2.70E-12 mucin EF069-2
gi.vertline.790398 T06D8.1 [Caenorhabditis elegans] 137 8.50E-17
EF069-2 gnl.vertline.PID.vertline.d102084 membrane glycoprotein
[Equine herpesvirus 1] 210 5.80E-16 EF069-2 gi.vertline.2286204
(AF011339) unknown [Acinetobacter calcoaceticus] 121 8.40E-16
EF069-2 gi.vertline.330862 membrane glycoprotein [Equine
herpesvirus1] 208 1.10E-15 >pir.vertline.H36802.vertline.VGB-
EX1 EF069-2 gi.vertline.1707247 partial CDS [Caenorhabditis
elegans] 131 3.70E-15 EF069-2 gnl.vertline.PID.vertline.d102084
membrane glycoprotein [Equine herpesvirus 1] 203 6.20E-15 EF069-2
gi.vertline.213392 antifreeze glycoprotein [Notothenia coriiceps]
102 4.60E-13 >pir.vertline.A38420.vertline.A38420 EF069-2
gnl.vertline.PID.vertline.e125464 (AL022022) PGRS-family protein
[Mycobacterium 145 1.50E-12 tuberculosis] EF069-2
gi.vertline.951460 FIM-C.1 gene product [Xenopus laevis]
>pir.vertline.A45155.vertline.A45155 109 2.70E-12 mucin EF070-2
gi.vertline.790398 T06D8.1 [Caenorhabditis elegans] 137 8.50E-17
EF070-2 gnl.vertline.PID.vertline.d102084 membrane glycoprotein
[Equine herpesvirus 1] 210 5.80E-16 EF070-2 gi.vertline.2286204
(AF011339) unknown [Acinetobacter calcoaceticus] 121 8.40E-16
EF070-2 gi.vertline.330862 membrane glycoprotein [Equine
herpesvirus 1] 208 1.10E-15 >pir.vertline.H36802.vertline.VG-
BEX1 EF070-2 gi.vertline.1707247 partial CDS [Caenorhabditis
elegans] 131 3.70E-15 EF070-2 gnl.vertline.PID.vertline.d102084
membrane glycoprotein [Equine herpesvirus 1] 203 6.20E-15 EF070-2
gi.vertline.213392 antifreeze glycoprotein [Notothenia coriiceps]
102 4.60E-13 >pir.vertline.A38420.vertline.A38420 EF070-2
gnl.vertline.PID.vertline.e125464 (AL022022) PGRS-family protein
[Mycobacterium 145 1.50E-12 tuberculosis] EF070-2
gi.vertline.951460 FIM-C.1 gene product [Xenopus laevis]
>pir.vertline.A45155.vertline.A45155 109 2.70E-12 mucin EF071-2
gnl.vertline.PID.vertline.e306428 unnamed protein product
[Bacteriophage rlt] >gi.vertline.1353566 127 2.00E-37 Lysin
EF071-2 gi.vertline.853751 N-acetylmuramoyl-L-alanine amidase
[Bacteriophage A511] 273 2.60E-36 EF073-2 gi.vertline.143830 xpaC
[Bacillus subtilis] > gnl.vertline.PID.vertline.d1005803
hydrolysis of 173 7.10E-16 EF074-2 gi.vertline.1256698 chitinase
[Serratia marcescens] >gi.vertline.1256698 chitinase [Serratia
618 2.60E-104 EF074-2 gi.vertline.1763985 chitinase A [Vibrio
harveyi] 526 2.80E-84 EF075-2 gi.vertline.143156 membrane bound
protein [Bacillus subtilis] >gnl.vertline.PID.vertline.e1184471
593 1.70E-91 EF075-2 pir.vertline.D70070.vertline.D700
transcriptional regulator homolog ywtF-Bacillus subtilis 118
1.90E-59 EF075-2 gi.vertline.1762327 putative transcriptional
regulator [Bacillus subtilis] 148 9.60E-53 EF075-2
gi.vertline.1276874 EpsA [Streptococcus thermophilus] 239 2.20E-33
EF075-2 gnl.vertline.PID.vertline.e289126 unknown [Streptococcus
pneumoniae] 150 1.20E-27 EF075-2 gi.vertline.485275 putative
regulatory protein [Streptococcus pneumoniae] 150 2.50E-27 EF075-2
gi.vertline.2804735 (AF030367) putative regulatory protein
[Streptococcus 150 2.50E-27 pneumoniae] EF075-2 gi.vertline.2804747
(AF030369) putative regulatory protein [Streptococcus 150 2.50E-27
pneumoniae] EF075-2 gnl.vertline.PID.vertline.e116988 capsular
polysaccharide synthesis protein [Streptococcus 148 5.30E-27
EF075-2 gi.vertline.2804769 (AF030373) putative regulatory protein
[Streptococcus 148 5.30E-27 pneumoniae] EF075-2 gi.vertline.1147744
PSR [Enterococcus hirae] 109 2.10E-23 EF075-2 gi.vertline.790435
PSR [Enterococcus faecium] >pir.vertline.S54177.ver-
tline.S54177 PSR protein - 102 4.40E-19 EF075-2 gi.vertline.2267239
ORF1 [Staphylococcus epidermidis] 109 8.50E-19 EF075-2
gnl.vertline.PID.vertline.d101895 membrane bound protein LytR
[Synechocystis sp.] 121 2.80E-16 EF077-2 gnl.vertline.PID.vertlin-
e.d101135 cadmium-transporting ATPase [Synechocystis sp.] 396
2.30E-113 EF077-2 gi.vertline.150719 cadmium resistance protein
[Plasmid pI258] 373 8.60E-112
>pir.vertline.A32561.vertline.A32561 EF077-2 gi.vertline.143753
cadmium-efflux ATPase [Bacillus firmus]
>pir.vertline.D42707.vertline.D42707 361 8.10E-111 probable
EF077-2 gi.vertline.152978 E1-E2 cadmium efflux adenosine
triphosphatase 381 4.30E-110 [Staphylococcus EF077-2
gnl.vertline.PID.vertline.e248808 unknown [Mycobacterium
tuberculosis] 298 3.50E-107 EF077-2 gi.vertline.495646 ATPase
[Transposon Tn5422] 361 2.10E-106 EF077-2
gnl.vertline.PID.vertline.e118497 similar to heavy
metal-transporting ATPase [Bacillus 286 3.50E-104 EF077-2
gi.vertline.1699049 cadmium resistance protein [Lactococcus lactis]
352 3.60E-100 EF077-2 gnl.vertline.PID.vertline.e118603 similar to
heavy metal-transporting ATPase [Bacillus 254 9.90E-100 EF077-2
gnl.vertline.PID.vertline.e306540 unknown [Mycobacterium
tuberculosis] 352 5.20E-88 EF077-2 gnl.vertline.PID.vertline.e263-
525 P-type ATPase [Mycobacterium tuberculosis] 199 5.50E-86
>gnl.vertline.PID.vertline.e249413 EF077-2
gnl.vertline.PID.vertline.e264090 unknown [Mycobacterium
tuberculosis] 250 3.00E-84 EF077-2
gnl.vertline.PID.vertline.d101135 cadmium-transporting ATPase
[Synechocystis sp.] 260 1.00E-81 EF077-2 gi.vertline.1773166
probable copper-transporting atpase [Escherichia coli] 212 4.70E-80
>gi.vertline.1786691 EF077-2 gi.vertline.1354935 probable
copper-transporting atpase [Escherichia coli] 212 8.50E-79 EF078-2
gi.vertline.143331 alkaline phosphatase regulatory protein
[Bacillus subtilis] 257 5.50E-58 EF078-2 gi.vertline.410142 ORFX18
[Bacillus subtilis] >gnl.vertline.PID.vertline.e1185580
two-component 235 8.20E-51 sensor EF078-2
gnl.vertline.PID.vertline.d101196 homologous to sp:PHOR_BACSU
[Bacillus subtilis] 219 4.20E-44 EF078-2 gi.vertline.1575578
histidine protein kinase [Thermotoga maritima] 191 7.10E-44 EF078-2
gi.vertline.2182990 histidine kinase [Lactococcus lactis cremoris]
169 6.40E-40 EF078-2 gi.vertline.2182992 histidine kinase
[Lactococcus lactis cremoris] 152 1.10E-39 EF078-2
gnl.vertline.PID.vertline.d101134 sensory transduction histidine
kinase [Synechocystis sp.] 259 3.90E-38 EF078-2 gi.vertline.149296
phosphate regulatory protein phoR (gtg start codon) [Klebsiella 228
7.60E-33 EF078-2 gi.vertline.581188 phoR gene product (AA 1-431)
[Escherichia coli] >gi.vertline.1657596 226 1.60E-32 EF078-2
gnl.vertline.PID.vertline.d101087 sensory transduction histidine
kinase [Synechocystis sp.] 138 3.70E-32 EF078-2
gnl.vertline.PID.vertline.e266592 unknown [Mycobacterium
tuberculosis] 232 1.10E-31 EF078-2 gi.vertline.2182996 histidine
kinase [Lactococcus lactis cremoris] 206 1.30E-31 EF078-2
gnl.vertline.PID.vertline.d101135 sensory transduction histidine
kinase [Synechocystis sp.] 256 1.30E-31 EF078-2 gi.vertline.294893
phosphate regulatory protein phoR (gtg start codon) [Shigella 225
1.60E-31 EF078-2 gi.vertline.288420 drug sensory protein A
[Synechocystis PCC6803] 106 2.50E-31 >gnl.vertline.PID.vertl-
ine.d1017420 EF079-2 gi.vertline.2098719 putative
fimbrial-associated protein [Actinomyces naeslundii] 183 8.60E-26
EF081-2 gi.vertline.467806 penicillin-binding protein [Enterococcus
faecalis] 1356 2.10E-178 EF081-2 gi.vertline.790429 low affinity
penicillin-binding protein 5 (PBP5) [Enterococcus 607 1.00E-78
EF081-2 gnl.vertline.PID.vertline.e208365 penicillin-binding
protein 5 [Enterococcus faecium] 604 1.10E-78 EF081-2
gi.vertline.790433 low affinity penicillin-binding protein 5 (PBP5)
[Enterococcus 604 2.70E-78 lysin EF084-2 gi.vertline.2293312
(AF008220) YtfP [Bacillus subtilis]
>gnl.vertline.PID.vertline.e1185879 similar 438 1.70E-140 to
EF084-2 gi.vertline.2367234 (AE000425) hypothetical 43.8 kD protein
in rhsB-pit intergenic 167 2.20E-51 EF084-2 gi.vertline.912464 No
definition line found [Escherichia coli] 167 6.00E-51 EF084-2
gnl.vertline.PID.vertline.d101127 hypothetical protein
[Synechocystis sp.] >pir.vertline.S76678.vertline- .S76678 151
6.10E-42 EF084-2 gi.vertline.1573954 hypothetical [Haemophilus
influenzae] >pir.vertline.G64161.vertline.G64161 142 2.90E-40
EF085-2 gi.vertline.1209527 protein histidine kinase [Enterococcus
faecalis] 2023 8.00E-279 EF085-2 gi.vertline.467057 phoR;
B2168_C3_247 [Mycobacterium leprae] 226 8.80E-23
>pir.vertline.S72905.vertline.S72905 EF085-2
gnl.vertline.PID.vertline.e119229 SenX3 [Mycobacterium bovis BCG]
222 3.10E-22 EF085-2 gnl.vertline.PID.vertline.e255152 unknown
[Mycobacterium tuberculosis] >gnl.vertline.PID.vertline.e321546
222
3.10E-22 SenX3 EF085-2 gi.vertline.1778485 PcoS homolog
[Escherichia coli] >gi.vertline.1786783 (AE000162) f480; 111
3.80E-16 This EF085-2 gi.vertline.149296 phosphate regulatory
protein phoR (gtg start codon) [Klebsiella 110 1.40E-14 EF085-2
gi.vertline.581188 phoR gene product (AA 1-431) [Escherichia coli]
>gi.vertline.1657596 103 5.30E-14 EF085-2 gi.vertline.143331
alkaline phosphatase regulatory protein [Bacillus subtilis] 118
4.90E-13 EF085-2 gi.vertline.537239 alternate gene name phoM; CG
Site No. 395 [Escherichia coli] 126 9.50E-13 EF085-2
gi.vertline.147251 phoM [Escherichia coli] >gi.vertline.809670
phoM protein (1 is 3rd 126 9.50E-13 base in EF085-2
gi.vertline.2182992 histidine kinase [Lactococcus lactis cremoris]
109 5.90E-12 EF086-2 gi.vertline.437706 alternative truncated
translation product from E. coli 221 3.00E-54 [Streptococcus
EF086-2 gi.vertline.437705 hyaluronidase [Streptococcus pneumoniae]
221 1.60E-53 EF086-2 gi.vertline.595847 hyaluronate lyase
[Streptococcus agalactiae] 203 3.30E-44
>pir.vertline.A55137.vertline.A55137 EF086-2 gi.vertline.705406
hyaluronate lyase [Staphylococcus aureus] 191 3.40E-42 EF086-2
gi.vertline.562086 hyaluronidase [Propionibacterium acnes] 198
6.00E-27 EF087-2 gi.vertline.437706 alternative truncated
translation product from E. coli 221 3.00E-54 [Streptococcus
EF087-2 gi.vertline.437705 hyaluronidase [Streptococcus pneumoniae]
221 1.60E-53 EF087-2 gi.vertline.595847 hyaluronate lyase
[Streptococcus agalactiae] 203 3.30E-44
>pir.vertline.A55137.vertline.A55137 EF087-2 gi.vertline.705406
hyaluronate lyase [Staphylococcus aureus] 191 3.40E-42 EF087-2
gi.vertline.562056 hyaluronidase [Propionibacterium acnes] 198
6.00E-27 EF088-2 gi.vertline.437706 alternative truncated
translation product from E.coli 221 3.00E-54 [Streptococcus EF088-2
gi.vertline.437705 hyaluronidase [Streptococcus pneumoniae] 221
1.60E-53 EF088-2 gi.vertline.595847 hyaluronate lyase
[Streptococcus agalactiae] 203 3.30E-44
>pir.vertline.A55137.vertline.A55137 EF088-2 gi.vertline.705406
hyaluronate lyase [Staphylococcus aureus] 191 3.40E-42 EF088-2
gi.vertline.562086 hyaluronidase [Propionibacterium acnes] 198
6.00E-27 EF091-2 gi.vertline.556016 similar to plant water stress
proteins; ORF2 [Bacillus subtilis] 198 5.50E-21 EF091-2
gi.vertline.2353333 (AF016513) Ce-LEA [Caenorhabditis elegans] 189
2.40E-17 EF091-2 gnl.vertline.PID.vertline.e353216 seed maturation
protein homolog [Arabidopsis thaliana] 146 3.60E-11 EF091-2
gi.vertline.1161171 late embryogenesis abundant protein [Picea
glauca] 132 5.70E-11 EF091-2 pir.vertline.S04909.vertline.S049
embryonic protein DC8 (clone 8/10) - carrot 127 6.50E-11 EF092-2
gi.vertline.2689898 (AE000792) PTS system, cellobiose-specific IIB
component 145 4.00E-27 (celA) EF092-2
gnl.vertline.PID.vertline.d102048 B. subtilis, cellobiose
phosphotransferase system, celA; 116 1.40E-26 EF096-2
gi.vertline.147329 transport protein [Escherichia coli]
>gnl.vertline.PID.vertline.d1015409 532 2.10E-91 EF096-2
gi.vertline.1573475 spermidine/putrescine-binding periplasmic
protein precursor 527 1.10E-79 (potD) EF096-2 gi.vertline.1574803
spermidine/putrescine-binding periplasmic protein precursor 468
1.60E-75 (potD) EF096-2 gi.vertline.1142681 Lpp38 [Pasteurella
haemolytica] 446 4.40E-72 EF096-2 gnl.vertline.PID.vertline.d101526
Putrescine transport protein PotF [Escherichia coli] 216 1.50E-54
EF096-2 gi.vertline.147334 periplasmic putrescine binding protein
[Escherichia coli] 216 2.10E-53 EF096-2 gi.vertline.2688565
(AB001165) spermidine/putrescine ABC transporter, 240 2.00E-48
EF096-2 gi.vertline.1881733 PotD [Salmonella typhimurium] 253
2.70E-28 EF096-2 gnl.vertline.PID.vertline.d101926
spermidine/putrescine-binding periplasmic protein 243 4.20E-26
EF096-2 gnl.vertline.PID.vertlin- e.e152543 potF gene product
[Clostridium perfringens] 204 3.30E-21 EF097-2 gi.vertline.622991
mannitol transport protein [Bacillus stearothermophilus] 547
4.90E-93 EF097-2 gi.vertline.42034 mannitol permease [Escherichia
coli] >gi.vertline.466737 mannitol- 535 5.50E-85 specific
EF097-2 gi.vertline.633650 enzyme II(mannitol) [Staphylococcus
carnosus] 516 2.10E-82 >pir.vertline.S68193.vertline.S22385
EF097-2 gi.vertline.882462 protein-N(pi)-phosphohistidine-sugar
phosphotransferase 509 3.00E-76 [Escherichia EF097-2
gi.vertline.312763 protein-N(pi)-phosphohistidine-sugar
phosphotransferase 357 7.50E-70 [Escherichia EF097-2
gnl.vertline.PID.vertline.d100966 homologue of mannitol transport
protein of B. 492 3.10E-62 EF097-2
gnl.vertline.PID.vertline.d100792 mannitol-specific
phophotransferase enzyme II [Bacillus 484 5.20E-61 EF097-2
gi.vertline.1673855 (AE000020) Mycoplasma pneumoniae, PTS system
mannitol- 232 3.50E-59 specific EF097-2
gnl.vertline.PID.vertline.d100651 phosphotransferase enzymeII,
mannitol-specific [Mycoplasma 158 8.20E-18 EF097-2
pir.vertline.S77757.vertline.S777 phosphotransferase system enzyme
II (EC 2.7.1.69), 103 2.00E-13 EF100-2 gi.vertline.2058546 Com YC
[Streptococcus gordonii] 193 7.30E-27 EF100-2 gi.vertline.2058546
Com YC [Streptococcus gordonii] 193 7.30E-27 EF100-2
gi.vertline.142708 comG3 gene product [Bacillus subtilis]
>gnl.vertline.PID.vertline.e1185739 150 2.90E-22 comGC EF100-2
gi.vertline.142708 comG3 gene product [Bacillus subtilis]
>gnl.vertline.PID.vertline.e1185739 150 2.90E-22 comGC EF100-2
gi.vertline.148437 secretory component [Erwinia chrysanthemi] 134
4.40E-15 >pir.vertline.E47021.vertline.E47021 pectic EF100-2
gi.vertline.148437 secretory component [Erwinia chrysanthemi] 134
4.40E-15 >pir.vertline.E47021.vertline.E47021 pectic EF100-2
gi.vertline.606262 ORF_o145 [Escherichia coli]
>gi.vertline.693706 HopG [Escherichia 136 9.10E-13 coli] EF100-2
gi.vertline.606262 ORF_o145 [Escherichia coli]
>gi.vertline.693706 HopG [Escherichia 136 9.10E-13 coli] EF100-2
gi.vertline.38828 ExeG gene product [Aeromonas hydrophila] 132
3.50E-12 >pir.vertline.S22910.vertline.149905 protein EF100-2
gi.vertline.38828 ExeG gene product [Aeromonas hydrophila] 132
3.50E-12 >pir.vertline.S22910.vertline.149905 protein EF100-2
gnl.vertline.PID.vertline.e117259 etpG [Escherichia coli] 131
5.10E-12 EF100-2 gnl.vertline.PID.vertline.e117259 etpG
[Escherichia coli] 131 5.10E-12 EF100-2 gi.vertline.42189 outG gene
product [Erwinia carotovora]
>pir.vertline.S32861.vertline.S32861 130 9.90E-12 outG EF100-2
gi.vertline.42189 outG gene product [Erwinia carotovora]
>pir.vertline.S32861.vertline.S32861 130 9.90E-12 outG EF100-2
gi.vertline.609628 putative [Vibrio cholerae] 128 1.60E-11 EF100-2
gi.vertline.609628 putative [Vibrio cholerae] 128 1.60E-11 EF101-2
gnl.vertline.PID.vertline.d102573 bacG [Enterococcus faecalis] 106
3.60E-17 EF101-2 gnl.vertline.PID.vertline.e321943 hypothetical
protein [Enterococcus faecalis]
>gnl.vertline.PID.vertline.e321943 105 1.80E-16 EF101-2
gnl.vertline.PID.vertline.e118502 similar to hypothetical proteins
from B. subtilis [Bacillus 113 1.80E-15 EF110-2 gi.vertline.43338
Staphylococcal serine proteinase homologue [Enterococcus 1462
2.30E-195 faecalis] EF110-2 gnl.vertline.PID.vertline.d100108
glutamic acid specific protease prepropeptide [Staphylococcus 106
3.70E-14 EF110-2 gi.vertline.46687 preproenzyme (AA -68 to 268)
[Staphylococcus aureus] 106 6.70E-14 EF111-2 gi.vertline.606018
ORF_o783 [Escherichia coli] >gi.vertline.1789462 (AE000390) 477
8.10E-80 hypothetical 88.3 EF121-2 gi.vertline.2626826 YfkN
[Bacillus subtilis] >gnl.vertline.PID.vertline.e1182774 similar
to 143 1.30E-96 FE121-2 gi.vertline.2313187 (AE000532)
2',3'-cyclic-nucleotide 2'-phosphodiesterase (cpdB) 413 2.60E-82
EF121-2 gi.vertline.48453 5'-nucleotidase [Vibrio parahaemolyticus]
>gnl.vertline.PID.vertline.d1001218 279 8.50E-47 EF121-2
gi.vertline.757842 UDP-sugar hydrolase [Escherichia coli] 239
1.60E-44 EF121-2 gi.vertline.1773162 UDP-sugar hydrolase precursor
[Escherichia coli] >gi.vertline.1786687 239 1.60E-44 EF121-2
gi.vertline.47950 precursor polypeptide (AA -25 to 525) [Salmonella
229 2.10E-41 typhimurium] EF121-2 gi.vertline.747913
2',3'-cyclic-nucleotide 2'-phosphodiesterase [Yersinia 115 4.70E-36
EF121-2 gi.vertline.62772 5'-nucleotidase [Discopyge ommata]
>pir.vertline.S19564.vertline.S19564 5'- 137 5.80E-35
nucleotidase EF121-2 gi.vertline.1573573 2',3'-cyclic-nucleotide
2'-phosphodiesterase (cpdB) 114 8.90E-34 [Haemophilus EF121-2
gi.vertline.537054 2',3'-cyclic-nucleotide 2'-phosphodiesterase
[Escherichia coli] 110 1.10E-31 EF121-2 bbs.vertline.135915
5'-nucleotidase=glycosylphosphatidylinositol-anchored protein 128
7.70E-29 {EC EF121-2 gi.vertline.1737443 5'-nucleotidase [Boophilus
microplus] 104 1.60E-28 EF121-2 gi.vertline.202551 5'-nucleotidase
precursor (EC 3.1.3.5) [Rattus norvegicus] 138 6.10E-28 EF121-2
gi.vertline.349783 ecto-5'-nucleotidase [Mus musculus]
>pir.vertline.JC2001.vertline.JC20- 01 136 1.10E-27 EF121-2
gi.vertline.23897 5'-nucleotidase [Homo sapiens]
>pir.vertline.S10032.vertline.S11032 5'- 133 1.60E-27
nucleotidase (EC EF122-2 gi.vertline.2626826 YfkN [Bacillus
subtilis] >gnl.vertline.PID.vertline.e1182774 similar to 143
1.30E-96 EF122-2 gi.vertline.2313187 (AE000532)
2',3'-cyclic-nucleotide 2'-phosphodiesterase (cpdB) 413 2.60E-82
EF122-2 gi.vertline.48453 5'-nucleotidase [Vibrio parahaemolyticus]
>gnl.vertline.PID.vertline.d1001218 279 8.50E-47 EF122-2
gi.vertline.757842 UDP-sugar hydrolase [Escherichia coli] 239
1.60E-44 EF122-2 gi.vertline.1773162 UDP-sugar hydrolase precursor
[Escherichia coli] >gi.vertline.1786687 239 1.60E-44 EF122-2
gi.vertline.47950 precursor polypeptide (AA -25 to 525) [Salmonella
229 2.10E-41 typhimurium] EF122-2 gi.vertline.747913
2',3'-cyclic-nucleotide 2'-phosphodiesterase [Yersinia 115 4.70E-36
EF122-2 gi.vertline.62772 5'-nucleotidase [Discopyge ommata]
>pir.vertline.S19564.vertline.S19564 5'- 137 5.80E-35
nucleotidase EF122-2 gi.vertline.1573573 2',3'-cyclic-nucleotide
2'-phosphodiesterase (cpdB) 114 8.90E-34 [Haemophilus EF122-2
gi.vertline.537054 2',3'-cyclic-nucleotide 2'-phosphodiesterase
[Escherichia coli] 110 1.10E-31 EF122-2 bbs.vertline.135915
5'-nucleotidase=glycosylphosphatidylinositol-anchored protein 128
7.70E-29 {EC EF122-2 gi.vertline.1737443 5'-nucleotidase [Boophilus
microplus] 104 1.60E-28 EF122-2 gi.vertline.202551 5'-nucleotidase
precursor (EC 3.1.3.5) [Rattus norvegicus] 138 6.10E-28 EF122-2
gi.vertline.349783 ecto-5'-nucleotidase [Mus musculus]
>pir.vertline.C2001.vertline.JC200- 1 136 1.10E-27 EF122-2
gi.vertline.23897 5'-nucleotidase [Homo sapiens]
>pir.vertline.S11032.vertline.S11032 5'- 133 1.60E-27
nucleotidase (EC EF129-2 gi.vertline.43334 P54 protein
[Enterococcus faecium] >pir.vertline.S05542.vertline.S05542 630
9.40E-79 hypothetical EF129-2 gi.vertline.512521 usp 45 gene
product [Lactococcus lactis]
>pir.vertline.JN0097.vertline.JN0097 374 1.30E-42 secreted
EF129-2 gi.vertline.149525 secreted protein [Lactococcus lactis]
371 3.60E-42 EF129-2 gnl.vertline.PlD.vertline.e313022 hypothetical
protein [Bacillus subtilis] >gnl.vertline.PID.vertline.e1186168
317 2.30E-33 EF130-2 gi.vertline.488339 alpha-amylase [unidentified
cloning vector] 621 6.70E-81 EF130-2 gi.vertline.488336 ORF
[unidentified cloning vector] 242 8.00E-27 EF130-2
bbs.vertline.112518 alpha-amylase {N-terminal region} [Artificial
sequence, Peptide 237 4.80E-26 EF130-2
gnl.vertline.PlD.vertline.e289144 ywpE [Bacillus subtilis]
>gnl.vertline.PID.vertline.e1184540 ywpE [Bacillus 129 5.40E-11
EF131-2 gnl.vertline.PID.vertline.e118528 penicillin-binding
protein [Bacillus subtilis] 277 7.40E-43 EF131-2 gi.vertline.488330
alpha-amylase [unidentified cloning vector] 280 1.30E-31 EF131-2
gi.vertline.509249 No definition line found [Lactobacillus
plantarum] 274 1.10E-30 EF131-2 gnl.vertline.PID.vertline.d102491
(AB009635) Fmt [Staphylococcus aureus] 170 5.60E-20 EF131-2
gi.vertline.515050 DD-peptidase precursor [Streptomyces lividans]
131 2.30E-14 >pir.vertline.S48220.vertline.S48220 EF131-2
gi.vertline.153448 serine DD-peptidase [Streptomyces lividans] 131
1.20E-12 EF132-2 gi.vertline.153826 adhesin B [Streptococcus
sanguis] >pir.vertline.A43583.vertline.A43583 adhesin 1257
2.30E-166 B EF132-2 gi.vertline.1184932 ScbA [Streptococcus crista]
1248 3.70E-165 EF132-2 gi.vertline.310633 adhesin [Streptococcus
gordonii] 1247 5.10E-165 EF132-2 gi.vertline.393269 adhesion
protein [Streptococcus pneumoniae] 1204 3.40E-163 EF132-2
gi.vertline.1575030 surface adhesin A precursor [Streptococcus
pneumoniae] 1220 2.40E-161 EF132-2 gi.vertline.153834 adhesin
specific for salivary pellicle of dental surfaces 1203 4.80E-159
EF132-2 gi.vertline.1117994 surface antigen A variant precursor
[Streptococcus pneumoniae] 1191 2.00E-157 EF132-2
gi.vertline.493017 endocarditis specific antigen [Enterococcus
faecalis] 931 3.70E-122 EF132-2 gnl.vertline.PID.vertline.e255529
lipoprotein [Staphylococcus epidermidis] 453 3.20E-92 EF132-2
gi.vertline.1245464 YfeA [Yersinia pestis] >gi.vertline.1245464
YfeA [Yersinia pestis] 364 3.60E-64 EF132-2 gi.vertline.1573330
adhesin B precursor (fimA) [Haemophilus influenzae] 349 3.50E-63
EF132-2 gi.vertline.755075 periplasmic-binding protein
[Synechocystis sp.] 326 6.80E-62
>gnl.vertline.PID.vertline.d1018652 Mn EF132-2
gnl.vertline.PID.vertline.e118595 similar to ABC transporter
(membrane protein) [Bacillus 174 3.10E-32 EF132-2
gi.vertline.1777933 TroA [Treponema pallidum] 171 3.40E-32 EF132-2
gi.vertline.790546 Tromp1 [Treponema pallidum] 171 5.10E-32 Derwent
BLAST BLAST Query Access. No. Derwent Gene Description Score
P-Value EF003-2 W20909 H. pylori outer membrane protein
14ge10705orf5. 268 4.20E-39 EF003-2 W20166 Helicobacter pylori
outer membrane protein, 16225006.aa. 241 3.00E-27 EF006-2 W20909 H.
pylori outer membrane protein 14ge10705orf5. 283 1.20E-48 EF006-2
W20166 Helicobacter pylori outer membrane protein, 16225006.aa. 266
1.10E-30 EF008-2 R37495 Pneumococcal fimbrial protein A. 967
1.20E-127 EF008-2 W26367 Staphylococcus aureus saliva binding
protein. 467 7.50E-100 EF008-2 R79722 ROM precursor TROMP1. 181
8.00E-36 EF008-2 W22134 Treponema pallidum rare outer membrane
protein (TROMP-1). 181 8.00E-36 EF009-2 W20909 H. pylori outer
membrane protein 14ge10705orf5. 319 1.40E-53 EF009-2 W20166
Helicobacter pylori outer membrane protein, 16225006.aa. 278
2.50E-32 EF012-2 R48035 Hyaluronic acid synthase of Streptococcus
equisimilis. 227 3.20E-69 EF014-2 W14070 S. thermophilus
exopolysaccharide biosynthesis protein EpsR. 103 5.90E-19 EF014-2
W22169 S. thermophilus exopolysaccharide synthesis operon epsA gene
103 7.30E-18 product. EF016-2 W15799 Adherence factor 104R of
Lactobacillus fermentum. 157 9.60E-22 EF016-2 W15793 Adherence
factor consensus sequence. 103 1.00E-11 EF017-2 R48035 Hyaluronic
acid synthase of Streptococcus equisimilis. 241 8.90E-71 EF021-2
R31013 P39-alpha. 141 1.60E-19 EF021-2 R33280 P39-beta. 134
7.00E-14 EF022-2 R48035 Hyaluronic acid synthase of Streptococcus
equisimilis. 324 2.20E-65 EF023-2 R48035 Hyaluronic acid synthase
of Streptococcus equisimilis. 155 9.90E-33 EF023-2 R70152
Streptococcus pneumoniae strain SPRU98 PlpA. 125 5.90E-17 EF027-2
R48035 Hyaluronic acid synthase of Streptococcus equisimilis. 233
2.20E-34 EF028-2 W17830 Thermophilic alkaline phosphatase. 202
7.70E-59 EF028-2 W11568 E. coli alkaline phosphatase mutant
D153H/Q329A. 182 7.90E-56 EF028-2 W11570 E. coli alkaline
phosphatase mutant D153H/K328H/Q329A. 182 7.90E-56 EF028-2 W26300
E. coli alkaline phosphatase mutant 182
1.10E-55 D153H/K328H/Q329A/D330H. EF028-2 W11565 E. coli alkaline
phosphatase mutant D153H/K328H/D330A. 182 3.10E-55 EF028-2 W11557
E. coli alkaline phosphatase mutant D153H/D330N. 182 4.30E-55
EF028-2 W11561 E. coli alkaline phosphatase mutant D153H/D330A. 182
4.30E-55 EF028-2 W11555 E. coli alkaline phosphatase mutant
D153H/K328H/D330N. 182 4.70E-55 EF028-2 W11566 E. coli alkaline
phosphatase mutant D153H/K328H/D330L. 182 1.20E-54 EF028-2 W11569
E. coli alkaline phosphatase mutant K328H/Q329A. 180 1.70E-54
EF028-2 W11562 E. coli alkaline phosphatase mutant D153H/D330L. 182
1.70E-54 EF028-2 R26980 Fv(FRP5)-phoA recombinant antibody. 174
1.90E-54 EF028-2 W11567 E. coli alkaline phosphatase mutant Q329A.
179 2.30E-54 EF028-2 W11558 E. coli alkaline phosphatase mutant
K328H/D330N. 176 6.40E-54 EF028-2 W11563 E. coli alkaline
phosphatase mutant K328H/D330A. 176 6.40E-54 EF029-2 R10044 Plasmid
pOW360 encoded Human Growth Hormone (HGH) - 320 3.50E-40 nuclease A
EF029-2 R10041 Plasmid pOW350 nuclease A product. 320 4.30E-40
EF029-2 R73997 Staphylococcus aureus (Foggi) nuclease signal and
mature 320 5.60E-40 sequences. EF029-2 R10043 Plasmid pOW360
encoding Human Growth Hormone (HGH) - 320 2.90E-38 nuclease EF030-2
R48035 Hyaluronic acid synthase of Streptococcus equisimilis. 277
6.10E-47 EF040-2 R59077 2-5A-dependent RNA-ase. 105 1.90E-18
EF040-2 W12703 Mouse 2-5A-dependent RNase. 105 1.90E-18 EF040-2
R82661 Partial murine 2-5A-dependent RNase. 105 1.90E-18 EF041-2
R48035 Hyaluronic acid synthase of Streptococcus equisimilis. 225
6.30E-26 EF054-2 R26042 P. yoelii SSP2 antigen. 286 8.00E-34
EF054-2 R85782 Group B Streptococcal mutant beta antigen without
IgA binding 232 3.30E-24 domain. EF054-2 R85781 Group B
Streptococcal wild-type beta antigen. 232 5.20E-24 EF054-2 P91941
Sequence of preprospasmolysin. 204 3.10E-19 EF054-2 W32519
Collagen-like polypeptide SEQ ID NO:2. 180 7.50E-18 EF054-2 W12324
Silver halide emulsion protein monomeric repeat unit #2. 180
7.50E-18 EF054-2 W32522 Collagen-like polypeptide SEQ ID NO:5. 192
1.60E-17 EF054-2 W12327 Silver halide emulsion protein monomeric
repeat unit #5. 192 1.60E-17 EF054-2 W32520 Collagen-like
polypeptide SEQ ID NO:3. 189 2.40E-17 EF054-2 W32532 Collagen-like
polypeptide SEQ ID NO:15. 189 2.40E-17 EF054-2 W12325 Silver halide
emulsion protein monomeric repeat unit #3. 189 2.40E-17 EF054-2
W12337 Silver halide emulsion protein monomeric repeat unit #15.
189 2.40E-17 EF054-2 W12341 Silver halide emulsion FLAG(RTM)-tagged
protein #2. 189 2.60E-17 EF054-2 W02098 S. mutans antigen I/II. 161
5.40E-15 EF054-2 W02096 S. mutans antigen I/II fragment
(aa803-1114). 161 1.90E-13 EF059-2 R26042 P. yoelii SSP2 antigen.
344 1.90E-39 EF059-2 R85782 Group B Streptococcal mutant beta
antigen without IgA binding 232 1.10E-26 domain. EF059-2 R85781
Group B Streptococcal wild-type beta antigen. 232 1.70E-26 EF059-2
P91941 Sequence of preprospasmolysin. 200 1.50E-18 EF059-2 P60570
Sequence of the Falciparum Interspersed Repeat Antigen 186 4.60E-18
EF059-2 W02096 S. mutans antigen I/II fragment (aa803-1114). 167
8.20E-16 EF059-2 W02098 S. mutans antigen I/II. 167 4.90E-15
EF059-2 R79625 Endocarditis specific antigen region. 147 4.40E-12
EF059-2 R26049 MSF precursor. 143 1.30E-11 EF059-2 R28150 Sugar
beet chitinase 1. 148 1.70E-11 EF059-2 R26842 Protease from S.
Aureus ATCC12600. 147 2.10E-11 EF059-2 R79643 Immunodominant
antigen of Streptococcus sobrinus. 151 2.10E-11 EF059-2 W07539
Collagen like protein (CLP). 146 3.00E-11 EF061-2 R26042 P. yoelii
SSP2 antigen. 241 1.70E-25 EF061-2 P60570 Sequence of the
Falciparum Interspersed Repeat Antigen 199 1.60E-18 EF061-2 R85782
Group B Streptococcal mutant beta antigen without IgA binding 153
2.40E-14 domain. EF061-2 R85781 Group B Streptococcal wild-type
beta antigen. 153 3.60E-14 EF061-2 P91941 Sequence of
preprospasmolysin. 163 9.70E-14 EF061-2 P83194 Sequence of a
bioadhesive precursor protein encoded by cDNA 156 7.90E-13 clone
EF061-2 R28150 Sugar beet chitinase 1. 156 9.10E-13 EF061-2 W02096
S. mutans antigen I/II fragment (aa803-1114). 148 1.20E-12 EF061-2
P82971 Bioadhesive precursor protein from cDNA 52. 148 9.70E-12
EF061-2 W02098 S. mutans antigen I/II. 148 1.50E-11 EF062-2 W02098
S. mutans antigen I/II. 107 1.20E-36 EF062-2 R79643 Immunodominant
antigen of Streptococcus sobrinus. 132 3.00E-36 EF063-2 W02098 S.
mutans antigen I/II. 107 1.20E-36 EF063-2 R79643 Immunodominant
antigen of Streptococcus sobrinus. 132 3.00E-36 EF064-2 W02098 S.
mutans antigen I/II. 107 1.20E-36 EF064-2 R79643 Immunodominant
antigen of Streptococcus sobrinus. 132 3.00E-36 EF071-2 R85294
Phage R1-t LytR lysin. 127 3.70E-38 EF071-2 R91515 Listeria phage
lysin PLY511. 273 4.70E-37 EF075-2 W14070 S. thermophilus
exopolysaccharide biosynthesis protein EpsR. 239 4.20E-36 EF075-2
W22169 S. thermophilus exopolysaccharide synthesis operon epsA gene
239 4.00E-34 product. EF077-2 R97280 Helicobacter-specific ATPase
439. 258 4.10E-74 EF077-2 R48036 Mycobacterium BCG immunogen. 192
2.20E-67 EF077-2 W06712 Helicobacter-specific ATPase 948 (ORF-4).
220 2.50E-67 EF077-2 R70419 Rat homologue of human Wilson disease
gene ATP7B. 186 9.80E-54 EF077-2 R72343 Wilson disease protein
ATP7B. 176 6.70E-40 EF077-2 R06376 Product of the ssc 1 gene. 166
3.10E-28 EF077-2 R75396 Flea sodium pump alpha subunit. 146
2.40E-25 EF077-2 W20891 H. pylori transporter protein,
14ce20219orf1. 156 8.60E-14 EF078-2 R56667 Bacteroides fragilis
RprX regulatory response protein. 148 8.30E-18 EF078-2 R74630
Tomato TGETR1 ethylene response protein. 130 7.80E-13 EF078-2
R69849 Ethylene response (ETR) gene product. 128 1.70E-11 EF078-2
R69850 Ethylene response (ETR) mutant protein etr1-1. 128 1.70E-11
EF078-2 R69851 Ethylene response (ETR) mutant protein etr1-2. 128
1.70E-11 EF078-2 R69852 Ethylene response (ETR) mutant protein
etr1-3. 128 1.70E-11 EF078-2 R69853 Ethylene response (ETR) mutant
protein etr1-4. 128 1.70E-11 EF078-2 R24296 Regulatory protein VanS
involved in glycopeptide resistance. 142 2.70E-11 EF081-2 R27253
Penicillin binding protein PBP2A-epi. 101 4.70E-16 EF081-2 R27256
Penicillin binding protein PBP2A-27R. 101 6.00E-15 EF081-2 R27257
Penicillin binding protein derivative #1. 101 6.20E-15 EF081-2
R27258 Penicillin binding protein derivative #2. 101 6.20E-15
EF081-2 R27259 Penicillin binding protein derivative #3. 101
6.20E-15 EF081-2 R27260 Penicillin binding protein derivative #4.
101 6.20E-15 EF081-2 R27261 Penicillin binding protein derivative
#5. 101 6.20E-15 EF081-2 R27263 Penicillin binding protein
derivative #7. 101 6.20E-15 EF081-2 R27264 Penicillin binding
protein derivative #8. 101 6.20E-15 EF081-2 R27262 Penicillin
binding protein derivative #6. 101 6.50E-15 EF081-2 R30845 Sequence
encoded by the mec A gene. 101 6.90E-15 EF081-2 R27255 Penicillin
binding protein PBP2A-27R. 101 6.90E-15 EF081-2 R31216 Penicillin
binding protein PBP2A-27R. 101 7.00E-15 EF110-2 R91042 V8 mature
protease (aa1-213). 106 6.60E-16 EF110-2 R91043 V8 mature protease
(aa1-214). 106 7.20E-16 EF110-2 R91044 V8 mature protease
(aa1-215). 106 7.80E-16 EF110-2 R26842 Protease from S. Aureus
ATCC12600. 106 6.70E-15 EF110-2 R29644 Protease from S. Aureus. 106
1.20E-14 EF110-2 W22218 Protein encoded by pV8RPT(-) construct. 106
7.60E-14 EF110-2 R91033 Beta-galactosidase-V8 protease fusion
protein. 106 7.60E-14 EF110-2 R91034 Beta-galactosidase-V8 protease
fusion protein. 106 1.70E-13 EF110-2 W22219 Protein encoded by pV8D
construct. 106 7.60E-13 EF110-2 R91035 Recombinant V8 protease V8D
fusion protein. 106 7.60E-13 EF110-2 W22220 Protein encoded by pV8F
construct. 106 7.90E-13 EF129-2 R14530 Usp45 protein. 374 2.40E-43
EF129-2 R14150 MSP encoded by pUCRS (DSM 5803). 372 4.70E-43
EF131-2 R37495 Pneumococcal fimbrial protein A. 1185 6.80E-163
EF131-2 W26367 Staphylococcus aureus saliva binding protein. 418
3.70E-85 EF131-2 R79722 ROM precursor TROMP1. 171 9.00E-31 EF131-2
W22134 Treponema pallidum rare outer membrane protein (TROMP-1).
171 9.00E-31
[0269]
3TABLE 3 Conservative Amino Acid Substitutions. Aromatic
Phenylalanine Tryptophan Tyrosine Hydrophobic Leucine Isoleucine
Valine Polar Glutamine Asparagine Basic Arginine Lysine Histidine
Acidic Aspartic Acid Glutamic Acid Small Alanine Serine Threonine
Methionine Glycine
[0270]
4TABLE 4 Residues Comprising Antigenic Epitope-Bearing Portion.
EF001-2 from about Asp-150 to about Lys-152, from about Ser-256 to
about Tyr-259, from about Lys-360 to about Lys-363, from about
Asn-406 to about Asp-408. EF002-2 from about Asp-80 to about
Asp-83, from about Asp-281 to about Gly-283. EF003-2 from about
Asn-263 to about Gly-266. EF004-2 from about Asn-23 to about
Asn-26, from about Lys-83 to about Ser-87, from about Tyr-154 to
about Asp-159. EF005-2 from about Lys-249 to about Glu-252. EF006-2
from about Gly-23 to about Asp-28. EF008-2 from about Thr-92 to
about Gly-94, from about Pro-161 to about Asp-165, from about
Gly-287 to about Thr-289. EF010-2 from about Pro-129 to about
Asn-131. EF012-2 from about Asp-77 to about Asp-79, from about
Asp-94 to about Lys-98, from about Asp-256 to about Thr-258, from
about Glu-461 to about Asn-468. EF013-2 from about Thr-30 to about
Asp-32, from about Glu-73 to about Ala-75, from about Gln-164 to
about Asn-166, from about Lys-193 to about Gly-195. EF014-2 from
about Ser-203 to about Asp-206, from about Gln-314 to about Gly-316
EF015-2 from about Pro-66 to about Gly-69. EF016-2 from about
Lys-236 to about Asn-239. EF017-2 from about Ser-90 to about
Gly-93, from about Thr-197 to about Lys-199, from about Lys-230 to
about Asn-233, from about Ser-428 to about Gly-431. EF018-2 from
about Lys-159 to about Tyr-161, from about Asn-165 to about
Ser-167, from about Asn-250 to about Arg-256, from about Asn-392 to
about Gly-395, from about Lys-416 to about Tyr-418, from about
Asn-428 to about Arg-430. EF019-2 from about Arg-209 to about
Ser-211, from about Lys-287 to about Ser-290. EF020-2 from about
Lys-57 to about Asn-62. EF021-2 from about Ser-33 to about Gly-35,
from about Glu-77 to about Gly-81, from about Asp-139 to about
Lys-141, from about Glu-255 to about Ser-258, from about Gln-271 to
about Tyr-277. EF023-2 from about Lys-232 to about Asp-234, from
about Arg-304 to about Gly-306, from about Thr-453 to about
Arg-456, from about Ser-478 to about Thr-480. EF025-2 from about
Arg-183 to about Asp-185. EF026-2 from about Ser-25 to about
Asp-30, from about Asp-90 to about Asp-94, from about Gln-107 to
about Asn-110. EF027-2 from about Gln-72 to about Lys-74, from
about Lys-229 to about Asp-231. EF028-2 from about Asp-186 to about
Gln-188. EF029-2 from about Asp-118 to about Lys-122, from about
Asp-124 to about Tyr-126. EF031-2 from about Glu-30 to about
Gly-33. EF034-2 from about Glu-25 to about Gly-27, from about
Glu-75 to about Thr-77. EF36-2 from about Gln-177 to about Ser-179.
EF037-2 from about Ser-25 to about Asp-30, from about Asp-90 to
about Asp-94, from about Gln-107 to about Asn-110. EF038-2 from
about Asn-77 to about Lys-79, from about Tyr-88 to about Asn-92.
EF040-2 from about Lys-167 to about Gly-172, from about Lys-240 to
about Asn-242. EF044-2 from about Arg-192 to about Gly-194, from
about Asn-200 to about Asn-203. EF045-2 from about Asp-159 to about
Asn-161, from about His-172 to about Gly-174, from about Tyr-261 to
about Gly-264, from about Lys-305 to about Glu-308. EF046-2 from
about Ser-18 to about Gly-23, from about Gln-41 to about Ser-47,
from about Thr-76 to about Asp-78. EF047-2 from about Asn-28 to
about Asp-30, from about Asp-273 to about Asn-277. EF048-2 from
about Asp-138 to about Lys-141, from about Asp-152 to about
Gly-154. EF051-2 from about Asp-73 to about Gly-76. EF053-2 from
about Ser-79 to about Gly-82. EF055-2 from about Asp-26 to about
Gly-28, from about Gln-67 to about Asp-69, from about Arg-71 to
about Gly-74, from about Arg-87 to about Gly-89. EF056-2 from about
Arg-71 to about Gly-74, from about Arg-87 to about Gly-89. EF058-2
from about Lys-129 to about Gly-133, from about Gln-571 to about
Tyr-573, from about Pro-586 to about Gly-591. EF065-2 from about
Ser-236 to about Tyr-239, from about Asp-350 to about Gly-352, from
about Lys-415 to about Asn-418, from about Arg-446 to about
Asp-448, from about Asn-489 to about Lys-491, from about Ser-516 to
about Asp-518, from about Glu-639 to about Lys-642. EF066-2 from
about Ser-236 to about Tyr-239, from about Asp-350 to about
Gly-352, from about Lys-415 to about Asn-418, from about Arg-446 to
about Asp-448, from about Asn-489 to about Lys-491, from about
Ser-516 to about Asp-518, from about Glu-639 to about Lys-642.
EF067-2 from about Ser-236 to about Tyr-239, from about Asp-350 to
about Gly-352, from about Lys-415 to about Asn-418, from about
Arg-446 to about Asp-448, from about Asn-489 to about Lys-491, from
about Ser-516 to about Asp-518, from about Glu-639 to about
Lys-642. EF073-2 from about Met-98 to about Arg-100, from about
Arg-110 to about Asp-112. EF074-2 from about Ser-53 to about
Tyr-59, from about Ser-86 to about Gly-88, from about Pro-97 to
about Gln-100, from about Gln-230 to about Gly-232. EF076-2 from
about Asn-38 to about Tyr-40, from about Asp-48 to about Asn-53,
from about Lys-79 to about Gly-81. EF077-2 from about Arg-411 to
about Gly-413. EF078-2 from about Thr-294 to about Gly-296, from
about Asp-366 to about Gln-368, from about Glu-524 to about
Gly-526. EF080-2 from about Glu-164 to about Gly-166, from about
Ser-206 to about Tyr-208, from about Lys-239 to about Gly-243.
EF081-2 from about Asn-7 to about Ser-11, from about Lys-77 to
about Tyr-80, from about Lys-112 to about Asn-114, from about
Gly-162 to about Asp-164, from about Arg-181 to about Gly-183.
EF083-2 from about Gln-38 to about Arg-40. EF084-2 from about
Lys-140 to about Asp-142, from about Gly-164 to about Arg-166, from
about Arg-262 to about Gly-264. EF085-2 from about Asn-95 to about
Asp-97, from about Arg-112 to about Asp-114, from about Asp-258 to
about Ser-260, from about Arg-401 to about Ser-403. EF086-2 from
about Pro-112 to about Gly-115, from about Ser-222 to about
Ser-224, from about Asn-296 to about Gly-299, from about Thr-346 to
about Lys-348, from about Asp-428 to about Ser-432. EF087-2 from
about Pro-112 to about Gly-115, from about Ser-222 to about
Ser-224, from about Asn-296 to about Gly-299, from about Thr-346 to
about Lys-348, from about Asp-428 to about Ser-432. EF088-2 from
about Pro-112 to about Gly-115, from about Ser-222 to about
Ser-224, from about Asn-296 to about Gly-299, from about Thr-346 to
about Lys-348, from about Asp-428 to about Ser-432. EF090-2 from
about Arg-2 to about Arg-5. EF091-2 from about Gln-40 to about
Asp-43. EF093-2 from about Lys-95 to about Gly-97. EF094-2 from
about Asp-314 to about Asp-316. EF095-2 from about Ser-328 to about
Thr-330, from about Asp-359 to about Asp-363, from about Glu-637 to
about Gly-639, from about Asn-744 to about Gly-746. EF096-2 from
about Pro-128 to about Asn-130, from about Ser-193 to about
Asp-196. EF097-2 from about Val-357 to about Gly-359. EF099-2 from
about Glu-44 to about Asp-47, from about Lys-154 to about Gly-156,
from about Asn-286 to about Asp-289. EF101-2 from about Lys-40 to
about Asp-42, from about Pro-255 to about Asn-258, from about
Lys-288 to about Gly-290. EF102-2 from about Asp-314 to about
Asp-316. EF103-2 from about Asn-46 to about Gly-48. EF104-2 from
about Pro-232 to about Lys-237, from about Ala-362 to about
Asn-366, from about Ser-421 to about Gly-423, from about Lys-488 to
about Ser-490, from about Asp-550 to about Asn-552, from about
Pro-637 to about Lys-640, from about Asp-727 to about Gly-729, from
about Asn-751 to about Ser-754, from about Lys-771 to about
Asn-774, from about Ile-835 to about Asn-837, from about Pro-851 to
about Gly-853. EF105-2 from about Ser-40 to about Gly-43, from
about Asn-94 to about Gln-97, from about Gln-220 to about Gly-222,
from about Asn-263 to about Gly-265. EF106-2 from about Asp-72 to
about Gly-75, from about Thr-274 to about Asp-277, from about
Asn-310 to about Arg-313. EF107-2 from about Thr-155 to about
Asn-157, from about Thr-189 to about Asp-191, from about Arg-270 to
about Gly-272, from about Thr-330 to about Lys-335, from about
Asp-365 to about Asp-368, from about Pro-451 to about Asp-453, from
about Gly-485 to about Thr-488. EF108-2 from about Lys-142 to about
Trp-145, from about Thr-147 to about Tyr-150, from about Arg-212 to
about Gly-214, from about Ser-248 to about Asp-251, from about
Asp-384 to about Asp-387, from about Pro-481 to about Arg-483, from
about Lys-491 to about Gly-494, from about Thr-619 to about
Gly-624, from about Asp-656 to about Asp-659, from about Lys-717 to
about Asn-721, from about Ser-822 to about Gly-824, from about
Tyr-1137 to about Thr-1141. EF110-2 from about Pro-123 to about
Gly-127, from about Thr-223 to about Gly-225. EF111-2 from about
Lys-207 to about Asn-209, from about Asp-245 to about Asn-248, from
about Lys-396 to about Asp-398, from about Glu-429 to about
Ser-432, from about Thr-470 to about His-474. EF119-2 from about
Asp-90 to about Asn-92, from about Gln-142 to about Gly-144.
EF121-2 from about Asn-159 to about Asp-161, from about Asn-351 to
about Lys-353, from about Pro-658 to about Gly-660, from about
Lys-786 to about Ser-789. EF122-2 from about Asn-159 to about
Asp-161, from about Asn-351 to about Lys-353, from about Pro-658 to
about Gly-660, from about Lys-786 to about Ser-789. EF123-2 from
about Asn-331 to about Arg-336, from about Asp-634 to about
Gly-636, from about Glu-780 to about Ser-782, from about Tyr-909 to
about Asn-911, from about Lys-939 to about Glu-942, from about
Asp-1074 to about Gly-1076, from about Asp-1367 to about Gly-1369,
from about Pro-1433 to about Lys-1435, from about Gly-1516 to about
Asp-1518, from about Lys-1656 to about Asp-1660, from about
Lys-1860 to about Gln-1863, from about Ser-1916 to about Gln-1919,
from about Pro-1940 to about Gly-1942. EF124-2 from about Asn-331
to about Arg-336, from about Asp-634 to about Gly-636, from about
Glu-780 to about Ser-782, from about Tyr-909 to about Asn-911, from
about Lys-939 to about Glu-942, from about Asp-1074 to about
Gly-1076, from about Asp-1367 to about Gly-1369, from about
Pro-1433 to about Lys-1435, from about Gly-1516 to about Asp-1518,
from about Lys-1656 to about Asp-1660, from about Lys-1860 to about
Gln-1863, from about Ser-1916 to about Gln-1919, from about
Pro-1940 to about Gly-1942. EF125-2 from about Asn-331 to about
Arg-336, from about Asp-634 to about Gly-636, from about Glu-780 to
about Ser-782, from about Tyr-909 to about Asn-911, from about
Lys-939 to about Glu-942, from about Asp-1074 to about Gly-1076,
from about Asp-1367 to about Gly-1369, from about Pro-1433 to about
Lys-1435, from about Gly-1516 to about Asp-1518, from about
Lys-1656 to about Asp-1660, from about Lys-1860 to about Gln-1863,
from about Ser-1916 to about Gln-1919, from about Pro-1940 to about
Gly-1942. EF126-2 from about Ser-236 to about Tyr-239, from about
Asp-350 to about Gly-352, from about Lys-415 to about Asn-418, from
about Arg-446 to about Asp-448, from about Asn-489 to about
Lys-491, from about Ser-516 to about Asp-518, from about Glu-639 to
about Lys-642. EF127-2 from about Ser-236 to about Tyr-239, from
about Asp-350 to about Gly-352, from about Lys-415 to about
Asn-418, from about Arg-446 to about Asp-448, from about Asn-489 to
about Lys-491, from about Ser-516 to about Asp-518, from about
Glu-639 to about Lys-642. EF128-2 from about Ser-236 to about
Tyr-239, from about Asp-350 to about Gly-352, from about Lys-415 to
about Asn-418, from about Arg-446 to about Asp-448, from about
Asn-489 to about Lys-491, from about Ser-516 to about Asp-518, from
about Glu-639 to about Lys-642. EF129-2 from about Asn-300 to about
Gly-302, from about Ser-316 to about Gly-319, from about Asn-385 to
about His-387 EF131-2 from about Lys-201 to about Tyr-204, from
about Glu-263 to about Ser-266. EF132-2 from about Thr-26 to about
Ser-28.
[0271]
Sequence CWU 0
0
* * * * *