Vaccines Comprising Heat-sensitive Transgenes

Nano; Francis E.

Patent Application Summary

U.S. patent application number 15/414111 was filed with the patent office on 2017-05-18 for vaccines comprising heat-sensitive transgenes. This patent application is currently assigned to UVic Industry Partnerships Inc.. The applicant listed for this patent is UVic Industry Partnerships Inc.. Invention is credited to Francis E. Nano.

Application Number20170136111 15/414111
Document ID /
Family ID43856334
Filed Date2017-05-18

United States Patent Application 20170136111
Kind Code A1
Nano; Francis E. May 18, 2017

VACCINES COMPRISING HEAT-SENSITIVE TRANSGENES

Abstract

The present disclosure provides temperature sensitive essential nucleic acid molecules from a psychrophilic bacterium, proteins encoded by the nucleic acid molecules, as well as recombinant cells into which have been introduced such nucleic acid molecules. The disclosed recombinant cells containing one or more essential nucleic acid molecules from a psychrophilic bacterium are thereby made temperature sensitive, and can be administered to a mammal to induce an immune response in the mammal.


Inventors: Nano; Francis E.; (Victoria, CA)
Applicant:
Name City State Country Type

UVic Industry Partnerships Inc.

Victoria

CA
Assignee: UVic Industry Partnerships Inc.
Victoria
CA

Family ID: 43856334
Appl. No.: 15/414111
Filed: January 24, 2017

Related U.S. Patent Documents

Application Number Filing Date Patent Number
13795445 Mar 12, 2013 9580478
15414111
13496723 Apr 12, 2012 8778683
PCT/CA2010/001561 Oct 7, 2010
13795445
61249385 Oct 7, 2009
61322634 Apr 9, 2010

Current U.S. Class: 1/1
Current CPC Class: A61P 31/06 20180101; C12N 9/1014 20130101; C12N 9/00 20130101; A61K 2039/552 20130101; C07K 14/195 20130101; C12N 15/74 20130101; C12N 9/88 20130101; C12N 9/93 20130101; A61K 2039/523 20130101; A61K 2039/521 20130101; A61K 39/02 20130101; A61K 2039/572 20130101; A61K 35/74 20130101; A61P 31/04 20180101; A61P 37/04 20180101; A61K 2039/57 20130101
International Class: A61K 39/02 20060101 A61K039/02; C07K 14/195 20060101 C07K014/195; C12N 15/74 20060101 C12N015/74

Claims



1. A method of stimulating an immune response in an animal, comprising: administering to the animal a therapeutically effective amount of a recombinant mesophilic bacterium that is temperature-sensitive (TS), wherein the recombinant mesophilic bacterium that is TS is made by a method comprising: introducing into the genome of a mesophilic bacterium a nucleic acid construct comprising a TS essential nucleic acid molecule from a psychrophilic bacterium, and functionally replacing the mesophilic bacterium's homolog of the TS essential nucleic acid molecule, thereby making the recombinant mesophilic bacterium that is TS, wherein a protein encoded by the TS essential nucleic acid molecule is operable at a temperature less than 30.degree. C. and inoperable at a temperature greater than 30.degree. C., and wherein the recombinant mesophilic bacterium that is TS has a restrictive temperature between 33.degree. C. and 44.degree. C., thereby stimulating the immune response in the animal.

2. The method of claim 1, wherein the psychrophilic bacterium is Colwellia sp., Pseudoalteromonas sp., or Shewanella sp.

3. The method of claim 1, wherein the recombinant mesophilic bacterium that is TS is Salmonella sp., Mycobacterium sp. or Escherichia sp.

4. The method of claim 1, wherein the TS essential nucleic acid molecule comprises the nucleotide sequence shown in SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25 or 27, wherein expression of the TS essential nucleic acid molecule confers temperature sensitivity.

5. The method of claim 1, wherein the protein encoded by the TS essential nucleic acid molecule comprises the amino acid sequence shown in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, or 28.

6. The method of claim 1, wherein the nucleic acid construct comprising the TS essential nucleic acid molecule comprises a vector.

7. The method of claim 1, wherein the recombinant mesophilic bacterium that is TS is present in a composition comprising an adjuvant.

8. The method of claim 1, wherein the recombinant mesophilic bacterium that is TS is present in a composition comprising a pharmaceutically acceptable carrier.

9. The method of claim 1, wherein the animal is a bird.

10. The method of claim 1, wherein the animal is a mammal.

11. The method of claim 1, wherein at least 10.sup.5 CFU of the recombinant mesophilic bacterium that is TS are administered to the animal.

12. The method of claim 1, wherein administering comprises intramuscular injection, subcutaneous injection, oral administration, intranasal administration or inhalation.

13. The method of claim 1, wherein the immune response comprises a cytotoxic T lymphocyte (CTL) response.

14. The method of claim 1, wherein the immune response comprises a B cell response.

15. The method of claim 1, wherein the method treats a Francisella novicida, Mycobacterium tuberculosis, Salmonella enteritidis, or Escherichia coli infection in the animal.

16. The method of claim 1, wherein the method prevents a Francisella novicida, Mycobacterium tuberculosis, Salmonella enteritidis, or Escherichia coli infection in the animal.

17. The method of claim 1, wherein more than one type of recombinant mesophilic bacterium that is TS is administered to the animal.

18. The method of claim 1, further comprising: culturing the recombinant mesophilic bacterium that is TS at a temperature wherein a protein encoded by the TS essential nucleic acid molecule is operable, whereby the recombinant mesophilic bacterium that is TS produces a plurality of peptides; increasing the culturing temperature to a temperature at which a protein encoded by the TS nucleic acid molecule is inoperable; maintaining said culturing for a period of time sufficient to kill the recombinant mesophilic bacterium that is TS; and harvesting the killed recombinant mesophilic bacterium that is TS.

19. A method of stimulating an immune response in an animal, comprising: administering to the animal a therapeutically effective amount of a recombinant mesophilic bacterium that is temperature-sensitive (TS), wherein the recombinant mesophilic bacterium that is TS is made by a method comprising: introducing into the genome of a mesophilic bacterium by homologous recombination a nucleic acid construct comprising a TS essential nucleic acid molecule from a psychrophilic bacterium flanked on both sides by a nucleic acid molecule homologous to a region of the mesophilic bacterium genome where the TS essential nucleic acid molecule from the psychrophilic bacterium will be inserted into the mesophilic bacterium genome, and functionally replacing the mesophilic bacterium's homolog of the TS essential nucleic acid molecule, thereby making a recombinant mesophilic bacterium that is TS, wherein a protein encoded by the TS essential nucleic acid molecule is operable at a temperature less than 30.degree. C. and inoperable at a temperature greater than 30.degree. C., and wherein the recombinant mesophilic bacterium that is TS has a restrictive temperature between 33.degree. C. and 44.degree. C., thereby stimulating the immune response in the animal.

20. A method of stimulating an immune response in an animal, comprising: administering to the animal a therapeutically effective amount of a recombinant mesophilic bacterium that is temperature-sensitive (TS), wherein the recombinant mesophilic bacterium that is TS is made by a method comprising: introducing into the genome of a mesophilic bacterium a nucleic acid construct comprising a TS essential nucleic acid molecule from a psychrophilic bacterium flanked on both sides by a nucleic acid molecule homologous to a region of the mesophilic bacterium genome where the TS essential nucleic acid molecule from the psychrophilic bacterium will be inserted into the mesophilic bacterium genome, wherein a protein encoded by the TS essential nucleic acid molecule is operable at a temperature less than 30.degree. C. and inoperable at a temperature greater than 30.degree. C., and wherein the recombinant mesophilic bacterium that is TS is viable at a temperature of 0.degree. C. to 30.degree. C., nonviable at a temperature greater than 30.degree. C., and has a restrictive temperature between 33.degree. C. and 44.degree. C.; culturing the recombinant mesophilic bacterium that is TS at a temperature wherein a protein encoded by the TS essential nucleic acid molecule is operable, whereby the recombinant mesophilic bacterium that is TS produces a plurality of peptides; increasing the culturing temperature to a temperature at which a protein encoded by the TS nucleic acid molecule is inoperable; maintaining said culturing for a period of time sufficient to kill the recombinant mesophilic bacterium that is TS; and harvesting the killed recombinant mesophilic bacterium that is TS, thereby stimulating the immune response in the animal.
Description



CROSS REFERENCE TO RELATED APPLICATIONS

[0001] This is a continuation application of U.S. application Ser. No. 13/795,445 filed Mar. 12, 2013, pending, which is a continuation of U.S. application Ser. No. 13/496,723 filed Apr. 12, 2012, now U.S. Pat. No. 8,778,683, which is the U.S. National Stage of International Application No. PCT/CA2010/001561, filed Oct. 7, 2010, which was published in English under PCT Article 21(2), which in turn claims the benefit of U.S. Provisional Application No. 61/249,385 filed Oct. 7, 2009, and U.S. Provisional Application No. 61/322,634 filed on Apr. 9, 2010, all herein incorporated by reference.

FIELD

[0002] The technology relates to genes derived from psychrophilic bacteria, for use in the development of heat-sensitive vaccines. In one example, the technology relates to recombinant pathogens harboring the heat-sensitive gene ligA from Colwellia psychrerythraea, Pseudoalteromonas haloplanktis, and Shewanella frigidimarina and to genes ligA, pyrG, hemC, ftsZ, cmk, murG, fmt, and dnaK from C. psychrerythraea.

BACKGROUND

[0003] Vaccines against bacterial and viral diseases have played an important role in reducing infectious diseases in humans; however, there is still a need for innovative vaccines to reduce the current global burden of infectious diseases. Cold-adapted viruses have been used for decades as vaccines against human viral diseases. The best known example of such a vaccine is the Sabin polio virus vaccine. An alternate example is a cold adapted influenza vaccine called FluMist.RTM. (Medimmune LLC, Gaithersburg, Md., USA), which was introduced in the U.S. in 2003. FluMist.RTM. has been shown to be considerably more effective in certain demographic groups than influenza vaccines that practice the more common vaccination strategy of using inactivated virus to stimulate an immune response. Typically cold-adapted or "temperature-sensitive" (TS) viral strains have been developed by passing the virus repeatedly in eggs or cell culture at low temperatures and then testing the progeny for their inability to grow above about 37.degree. C., generally thought of as the "normal" human body temperature.

[0004] The concept of a "normal" human body temperature takes into consideration anatomical sites, individual variations, gender, physiological conditions and ambient temperature. Despite the number of variables, the human body can function only in a very narrow temperature range, which is generally about 36.degree. C.-39.degree. C. If the human body core temperature falls to about 35.degree. C., the body must be warmed or death will ensue. The skin temperature is always cooler than the body core regardless of the ambient temperature and clothing worn. At moderate temperatures (e.g., 21.degree. C.), the temperature of the skin is about 32.degree. C.-35.degree. C.

[0005] Those skilled in these arts are of the view that bacteria generally have a set of about 100 to 150 genes, called "essential genes" that are absolutely required for maintenance of bacterial viability. Identifying essential genes is difficult due to their nature, as knockouts of these genes results in death of the organism. Essential genes encode proteins composed of amino acid sequences that are highly conserved among almost all bacterial genera and species. This conservation presumably reflects their common function and structure among the different species. A select number of essential genes have been shown to be competent in substituting for a homologue in another bacterial species and in some cases these substitutions were from distantly related bacterial species. The conservation of amino acid sequences is widespread among bacteria, the deduced amino acid sequences of essential genes from psychrophiles and thermophiles shows high identity with their mesophilic counterparts. Microbiologists have generally used conditional lethal mutations, such as TS mutations, to identify essential genes.

[0006] Many bacterial species play significant roles in the global burden of infectious diseases. However, the causative agent of tuberculosis is probably the most significant contributor to human morbidity and mortality caused by an infectious bacterial disease. Although the Bacille Calmette-Guerin (BCG) vaccine has been used for several decades to protect against tuberculosis, its low efficacy has failed to lower the incidence of tuberculosis to acceptable levels.

SUMMARY

[0007] The present disclosure provides methods for engineering, producing and using heat-sensitive host microbial cells. In one example, recombinant pathogens contain heat-sensitive essential genes, for example inserted using homologous recombination. "Psychrophile" is a term that is applied to organisms that function optimally at cold temperatures e.g., <20.degree. C. Bacteria that live in cold ocean water, especially the Arctic and Antarctic oceans, are examples of psychrophilic bacteria. Enzymes and other proteins in psychrophilic bacteria function better in the cold than their homologous counterparts in mesophilic bacteria. Many of the enzymes from psychrophilic bacteria are also prone to denaturation at temperatures much lower than those that would affect their mesophilic counterpart. Presumably the pattern of temperature-sensitivity of psychrophilic enzymes extends to the products of essential genes.

[0008] Methods of identifying and manipulating psychrophilic essential genes with desired TS properties are provided. in vitro and in vivo recombinant technologies can be used. Francisella tularensis is the etiologic agent of the zoonotic disease, tularaemia. It can infect numerous animals by a variety of routes, and typically infects and grows in monocyte-derived cells in organs of the reticuloendothelial system. A closely related bacterium, Francisella novicida, has many of the properties of F. tularensis, and, in addition, is highly amenable to many genetic manipulations, including gene substitutions. The pathophysiology and genetic properties of F. novicida make it ideal for studying the effects of gene substitutions on a pathogenic bacterium. F. novicida is a mesophile with a maximal growth temperature of about 45.degree. C.

[0009] This disclosure also provides methods to determine maximal growth temperature of both bacterial strains and their growth properties at restrictive temperatures. The recombinant bacterial strains tested grew below the restrictive temperature but not above the restrictive temperature. When a psychrophilic essential allele encoding an essential gene is inserted into an area of a mammalian body that is colder than the human body core, e.g., the skin, the recombinant pathogenic bacteria will have the ability to thrive thereby inducing an immune response. When the pathogenic recombinant bacteria migrate to organs in the human body core where the temperature is higher, they die and are unable to harm the host.

[0010] The present disclosure provides isolated temperature-sensitive essential nucleic acid molecules from a psychrophilic bacterium comprising at least 80%, at least 90%, or at least 95% sequence identity to the nucleotide sequence shown in SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 22, 23, or 24. In some examples, the psychrophilic bacteria are operable at a temperature of about -10.degree. C. to about 30.degree. C., but inoperable at a temperature greater than about 30.degree. C. Vectors and recombinant host cells (such as a recombinant bacterial host cell) that include such temperature-sensitive essential nucleic acid molecules from a psychrophilic bacterium are also provided. Immunogenic compositions that include such recombinant host bacteria (such as live or killed cells) are also disclosed. The disclosure also provides isolated proteins encoded by the disclosed isolated temperature-sensitive essential nucleic acid molecules, such as proteins having at least 80%, at least 90%, or at least 95% sequence identity to the amino acid sequence shown in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 26, 27, or 28.

[0011] Methods of making a temperature-sensitive microbial host cell, such as a recombinant host cell, are provided. In one example the method includes introducing (for example by inserting, substituting or replacing) a nucleic acid construct into the genome of a mesophilic bacterial strain, wherein the nucleic acid construct includes a temperature-sensitive essential nucleic acid molecule from a psychrophilic bacterial strain and one or more control sequences operably linked to the temperature-sensitive essential nucleic acid molecule, wherein the temperature-sensitive essential peptide encoded by the introduced temperature-sensitive essential nucleic acid molecule is operable (e.g., functional) at a temperature less than about 30.degree. C. and inoperable (e.g., non-functional) at a temperature greater than about 30.degree. C. In some examples the method also includes culturing the temperature-sensitive microbial host cell at a temperature wherein the temperature-sensitive peptide is operable, whereby said microbial host cell produces a plurality of peptides; increasing the culturing temperature to a temperature at which the temperature-sensitive peptide is inoperable; maintaining said culturing for a period of time sufficient to kill the temperature-sensitive microbial host cell; and harvesting the killed temperature-sensitive microbial host cells.

[0012] Methods for producing an immune response to a bacterium in a subject using the disclosed nucleic acid molecules, proteins, and recombinant host cells are provided. In one example the method includes administering to the subject a therapeutically effective amount of a temperature-sensitive bacterium, wherein the temperature-sensitive bacterium expresses a temperature-sensitive essential nucleic acid molecule from a psychrophilic bacterial strain, thereby inducing an immune response to the bacterium. Such methods can be used to prevent or treat a bacterial infection (such as a M. tuberculosis, Salmonella or Francisella infection).

[0013] The foregoing and other features of the disclosure will become more apparent from the following detailed description of several embodiments which proceeds with reference to the accompanying figures.

BRIEF DESCRIPTION OF THE DRAWINGS

[0014] FIG. 1a is a flowchart illustrating an exemplary method using polymerase chain reaction (PCR), FIG. 1b is a schematic chart illustrating an exemplary method showing DNA integration-excision events that result in a gene substitution.

[0015] FIG. 2a is a schematic illustrating the sequence of the wild type (wt) F. novicida ligA gene as it exists normally in a chromosome, FIG. 2b is a schematic illustrating ligA.sub.Cp gene substitutions into the F. novicida chromosome according to an exemplary method of the present disclosure, FIG. 2c is a schematic illustrating ligA.sub.Sf gene substitution into the F. novicida chromosome according to an exemplary method of the present disclosure, FIG. 2d is a schematic illustrating ligA.sub.Ph gene substitutions into the F. novicida chromosome according to an exemplary method of the present disclosure, FIG. 2e is a schematic illustrating ligA.sub.Ph2 gene substitutions into the F. novicida chromosome according to an exemplary method of the present disclosure.

[0016] FIG. 3a is a graph illustrating the growth curve of wt F. novicida and F. novicida with the C. psychrerythraea ligA.sub.Cp gene substituted for the F. novicida homologue at 30.degree. C., FIG. 3b is a graph illustrating the growth curve of F. novicida with the C. psychrerythraea ligA.sub.Cp gene substituted for the F. novicida homologue and wt F. novicida with a temperature shift from 30.degree. C. to 33.degree. C. after 2 hours, FIG. 3c is a graph illustrating the growth curve of F. novicida with the C. psychrerythraea ligA.sub.Cp gene substituted for the F. novicida homologue and wt F. novicida with a temperature shift from 30.degree. C. to 34.degree. C. after 3.5 hours, FIG. 3d is a graph illustrating the growth curve of F. novicida with the C. psychrerythraea ligA.sub.Cp gene substituted for the F. novicida homologue and wt F. novicida with a temperature shift from 30.degree. C. to 35.degree. C. after 2 hours, FIG. 3e is a graph illustrating the growth curve of F. novicida with the C. psychrerythraea ligA.sub.Cp gene substituted for the F. novicida homologue and wt F. novicida with a temperature shift from 30.degree. C. to 37.degree. C. after 2 hours.

[0017] FIG. 4a is a graph illustrating the growth curve of wt F. novicida and F. novicida with the S. frigidimarina ligA.sub.Sf gene substituted for the F. novicida homologue at 30.degree. C., FIG. 4b is a graph illustrating the growth curve of F. novicida with the S. frigidimarina ligA.sub.Sf gene substituted for the F. novicida homologue and wt F. novicida with a temperature shift from 30.degree. C. to 33.degree. C. after 2 hours, FIG. 4c is a graph illustrating the growth curve of F. novicida with the S. frigidimarina ligA.sub.Sf gene substituted for the F. novicida homologue and wt F. novicida with a temperature shift from 30.degree. C. to 35.degree. C. after 2 hours, FIG. 4d is a graph illustrating the growth curve of F. novicida with the S. frigidimarina ligA.sub.Sf gene substituted for the F. novicida homologue and wt F. novicida with a temperature shift from 30.degree. C. to 37.degree. C. after 2 hours.

[0018] FIG. 5a is a graph illustrating the growth curve of wt F. novicida and F. novicida with the P. haloplanktis ligA.sub.Ph gene substituted for the F. novicida homologue at 30.degree. C., FIG. 5b is a graph illustrating the growth curve of F. novicida with the P. haloplanktis ligA.sub.Ph gene substituted for the F. novicida homologue and wt F. novicida with a temperature shift from 30.degree. C. to 33.degree. C. after 2 hours, FIG. 5c is a graph illustrating the growth curve of F. novicida with the P. haloplanktis ligA.sub.Ph gene substituted for the F. novicida homologue and wt F. novicida with a temperature shift from 30.degree. C. to 35.degree. C. after 2 hours, FIG. 5d is a graph illustrating the growth curve of F. novicida with the P. haloplanktis ligA.sub.Ph gene substituted for the F. novicida homologue and wt F. novicida with a temperature shift from 30.degree. C. to 37.degree. C. after 2 hours.

[0019] FIG. 6a is a graph illustrating the growth curve of wt F. novicida and F. novicida with the P. haloplanktis ligA.sub.Ph2 gene substituted for the F. novicida homologue at 21.degree. C., FIG. 6b is a graph illustrating the growth curve of F. novicida with the P. haloplanktis ligA.sub.Ph2 gene substituted for the F. novicida homologue and wt F. novicida with a temperature shift from 21.degree. C. to 26.degree. C. after 2 hours, FIG. 6c is a graph illustrating the growth curve of F. novicida with the P. haloplanktis ligA.sub.Ph2 gene substituted for the F. novicida homologue and wt F. novicida with a temperature shift from 21.degree. C. to 28.degree. C. after 2 hours, FIG. 6d is a graph illustrating the growth curve of F. novicida with the P. haloplanktis ligA.sub.Ph2 gene substituted for the F. novicida homologue and wt F. novicida with a temperature shift from 21.degree. C. to 30.degree. C. after 2 hours.

[0020] FIG. 7 is a graph illustrating the decline in viability of wt F. novicida and F. novicida--ligA.sub.CP cultures at 37.degree. C. after being grown to late exponential phase at 33.degree. C.

[0021] FIG. 8 is a digital image illustrating the growth of S. ser. Typhimurium-ligA.sub.CP at 30.degree. C. and the lack of growth at 37.degree. C.

[0022] FIG. 9a is a graph illustrating the growth curve of wt Mycobacterium smegmatis and M. smegmatis-ligA.sub.CP at 30.degree. C., FIG. 9b is a graph illustrating the growth curve of M. smegmatis-ligA.sub.CP and wt M. smegmatis with a temperature shift from 30.degree. C. to 35.degree. C. after 4 hours, FIG. 9c is a graph illustrating the growth curve of M. smegmatis-ligA.sub.CP and wt M. smegmatis with a temperature shift from 30.degree. C. to 37.degree. C. after 4 hours.

[0023] FIGS. 10a-10d are a series of graphs showing the protective immunity induced by TS F. novicida strains.

[0024] FIGS. 11A-11L show sequences disclosed herein, with underlined portions being F. novicida sequence.

SEQUENCE LISTING

[0025] The nucleic and amino acid sequences listed in the accompanying sequence listing are shown using standard letter abbreviations for nucleotide bases, and three letter code for amino acids. Only one strand of each nucleic acid sequence is shown, but the complementary strand is understood to be included by any reference to the displayed strand.

[0026] SEQ ID NO: 1 is a full length nucleic acid coding sequence of the ligA.sub.Cp hybrid gene.

[0027] SEQ ID NO: 2 is the deduced 689 amino acid sequence of LigA.sub.Cp hybrid protein.

[0028] SEQ ID NO: 3 is a full length nucleic acid coding sequence of the ligA.sub.Ph hybrid gene.

[0029] SEQ ID NO: 4 is the deduced 673 amino acid sequence of LigA.sub.Ph hybrid protein.

[0030] SEQ ID NO: 5 is a full length nucleic acid coding sequence of the ligA.sub.Ph2 hybrid gene.

[0031] SEQ ID NO: 6 is the deduced 673 amino acid sequence of LigA.sub.Ph2 hybrid protein.

[0032] SEQ ID NO: 7 is a full length nucleic acid coding sequence of the ligA.sub.Sf hybrid gene.

[0033] SEQ ID NO: 8 is the deduced 670 amino acid sequence of LigA.sub.Sf hybrid protein.

[0034] SEQ ID NO: 9 is a full length nucleic acid coding sequence of the pyrG.sub.Cp hybrid gene.

[0035] SEQ ID NO: 10 is the deduced 545 amino acid sequence of PyrG.sub.Cp hybrid protein.

[0036] SEQ ID NO: 11 is a full length nucleic acid coding sequence of the hemC.sub.Cp hybrid gene.

[0037] SEQ ID NO: 12 is the deduced 317 amino acid sequence of HemC.sub.Cp hybrid protein.

[0038] SEQ ID NO: 13 is a full length nucleic acid coding sequence of the fmt.sub.Cp hybrid gene.

[0039] SEQ ID NO: 14 is the deduced 327 amino acid sequence of Fmt.sub.Cp hybrid protein.

[0040] SEQ ID NO: 15 is a full length nucleic acid coding sequence of the murG.sub.Cp hybrid gene.

[0041] SEQ ID NO: 16 is the deduced 387 amino acid sequence of MurG.sub.Cp hybrid protein.

[0042] SEQ ID NO: 17 is a full length nucleic acid coding sequence of codon optimized ligA.sub.Cp optimized for M. tuberculosis.

[0043] SEQ ID NO: 18 is the deduced 689 amino acid coding sequence of codon optimized LigA.sub.Cp hybrid protein with the first four codons changed to the M. tuberculosis form.

[0044] SEQ ID NO: 19 is a full length nucleic acid coding sequence of the dnaK.sub.Cp hybrid gene.

[0045] SEQ ID NO: 20 is the deduced 638 amino acid coding sequence of DnaK.sub.Cp hybrid protein.

[0046] SEQ ID NOS: 21 and 22 are a full length nucleic acid coding sequence of the essential gene tyrS from Colwellia psychrerythraea (normal font, uppercase), and the corresponding amino acid sequence, respectively.

[0047] SEQ ID NO: 23 and 24 are a full length nucleic acid coding sequence of the essential gene cmk from Colwellia psychrerythraea (normal font, uppercase) and the corresponding amino acid sequence, respectively. As shown in FIG. 11J, F. novicida sequence is underlined. The underlined regions correspond to the F. novicida sequence in both the nucleotide and amino acid sequence. The "non-underlined" is Colwellia psychrerythraea sequence. In the amino acid sequence there is no underlined amino acids at the end since the F. novicida sequence starts at the stop codon.

[0048] SEQ ID NO: 25 and 26 are a full length nucleic acid coding sequence of the essential gene dnaKsf from Shewanella frigidimarina (normal font, uppercase) and the corresponding amino acid sequence, respectively. As shown in FIG. 11K, Francisella novicida sequence is underlined. The underlined regions correspond to the F. novicida sequence in both the nucleotide and amino acid sequence. The "non-underlined" shows the Shewanella frigidimarina sequence. In the amino acid sequence at the beginning (MGK) is identical between Shewanella and Francisella, so it is double underlined. The single underline at the end of the amino acid sequence corresponds to the F. novicida sequence.

[0049] SEQ ID NO: 27 and 28 are a full length nucleic acid coding sequence of the essential gene ftsZ from Colwellia psychrerythraea (normal font, uppercase) and the corresponding amino acid sequence, respectively. As shown in FIG. 11L, Francisella novicida sequence is underlined. The underlined regions correspond to the F. novicida sequence in both the nucleotide and amino acid sequence. The "non-underlined" regions are Colwellia psychrerythraea sequence. There is extensive F. novicida region at the 5'-end (N-terminus).

DETAILED DESCRIPTION

[0050] The following explanations of terms and methods are provided to better describe the present disclosure. The singular forms "a," "an," and "the" refer to one or more than one, unless the context clearly dictates otherwise. For example, the term "comprising a nucleic acid molecule" includes single or plural nucleic acid molecules and is considered equivalent to the phrase "comprising at least one nucleic acid molecule." The term "or" refers to a single element of stated alternative elements or a combination of two or more elements, unless the context clearly indicates otherwise. As used herein, "comprises" means "includes." Thus, "comprising A or B," means "including A, B, or A and B," without excluding additional elements.

[0051] Suitable methods and materials for the practice and/or testing of embodiments of the disclosure are described below. Such methods and materials are illustrative only and are not intended to be limiting. Other methods and materials similar or equivalent to those described herein can be used. For example, conventional methods well known in the art to which a disclosed invention pertains are described in various general and more specific references, including, for example, Sambrook et al., Molecular Cloning: A Laboratory Manual, 2d ed., Cold Spring Harbor Laboratory Press, 1989; Sambrook et al., Molecular Cloning: A Laboratory Manual, 3d ed., Cold Spring Harbor Press, 2001; Ausubel et al., Current Protocols in Molecular Biology, Greene Publishing Associates, 1992 (and Supplements to 2000); Ausubel et al., Short Protocols in Molecular Biology: A Compendium of Methods from Current Protocols in Molecular Biology, 4th ed., Wiley & Sons, 1999; Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, 1990; and Harlow and Lane, Using Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, 1999.

[0052] The references cited herein are incorporated by reference.

[0053] In order to facilitate review of the various embodiments of the disclosure, the following explanations of specific terms are provided. Unless otherwise noted, technical terms are used according to conventional usage by those skilled in the arts

[0054] Adjuvant: A vehicle used to enhance antigenicity, for example antigenicity of a recombinant host bacterium containing a TS essential psychrophilic bacteria sequence disclosed herein. Adjuvants include a suspension of minerals (e.g., alum, aluminum hydroxide, or phosphate) on which antigen is adsorbed; or water-in-oil emulsion in which antigen solution is emulsified in mineral oil (Freund incomplete adjuvant), sometimes with the inclusion of killed mycobacteria (Freund's complete adjuvant) to further enhance antigenicity (inhibits degradation of antigen and/or causes influx of macrophages). Immunostimulatory oligonucleotides (such as those including a CpG motif) can also be used as adjuvants (for example see U.S. Pat. No. 6,194,388; U.S. Pat. No. 6,207,646; U.S. Pat. No. 6,214,806; U.S. Pat. No. 6,218,371; U.S. Pat. No. 6,239,116; U.S. Pat. No. 6,339,068; U.S. Pat. No. 6,406,705; and U.S. Pat. No. 6,429,199). Adjuvants include biological molecules (a "biological adjuvant"), such as costimulatory molecules. Exemplary adjuvants include IL-2, RANTES, GM-CSF, TNF-.alpha., IFN-.gamma., G-CSF, LFA-3, CD72, B7-1, B7-2, OX-40L and 41 BBL.

[0055] Administration: The introduction of a composition (such as an immunogenic composition) into a subject (such as a mammal, for example a human) by a selected route. Exemplary routes of administration include, but are not limited to, topical, injection (such as subcutaneous, intramuscular, intradermal, intraperitoneal, intratumoral, and intravenous), oral, sublingual, rectal, transdermal, intranasal, vaginal and inhalation routes.

[0056] Ameliorate: The improvement of a disease or pathological condition (such as a bacterial infection) with respect to the effect of the treatment. The beneficial effect can be evidenced, for example, by a delayed onset of clinical symptoms of the disease in a susceptible subject, a reduction in severity of some or all clinical symptoms of the disease, a slower progression of the disease, an improvement in the overall health or well-being of the subject, or by other parameters well known to those skilled in the arts specific to the particular disease.

[0057] Animal: Living multi-cellular vertebrate organisms, a category that includes mammals and birds. The term "mammal" includes both human and non-human mammals. Similarly, the term "subject" includes both human and veterinary subjects (such as mice, rats, rabbits, dogs, cats, horses, and cattle).

[0058] Antibody: A polypeptide ligand comprising at least a light chain or heavy chain immunoglobulin variable region which specifically recognizes and binds an epitope of an antigen. Antibodies are composed of a heavy and a light chain, each of which has a variable region, termed the variable heavy (V.sub.H) region and the variable light (V.sub.L) region. Together, the V.sub.H region and the V.sub.L region are responsible for binding the antigen recognized by the antibody.

[0059] Antibodies include intact immunoglobulins and the variants and portions of antibodies well known in the art, such as Fab fragments, Fab' fragments, F(ab)'.sub.2 fragments, single chain Fv proteins ("scFv"), and disulfide stabilized Fv proteins ("dsFv"). A scFv protein is a fusion protein in which a light chain variable region of an immunoglobulin and a heavy chain variable region of an immunoglobulin are bound by a linker, while in dsFvs, the chains have been mutated to introduce a disulfide bond to stabilize the association of the chains.

[0060] Typically, a naturally occurring immunoglobulin has heavy (H) chains and light (L) chains interconnected by disulfide bonds. There are two types of light chain, lambda (.lamda.) and kappa (k). There are five main heavy chain classes (or isotypes) which determine the functional activity of an antibody molecule: IgM, IgD, IgG, IgA and IgE.

[0061] "Specifically binds" refers to the ability of individual antibodies to specifically immunoreact with an antigen, such as a bacterial antigen, relative to binding to unrelated proteins, such as non-bacterial proteins. The binding is a non-random binding reaction between an antibody molecule and an antigenic determinant of the T cell surface molecule. The desired binding specificity is typically determined from the reference point of the ability of the antibody to differentially bind the T cell surface molecule and an unrelated antigen, and therefore distinguish between two different antigens, particularly where the two antigens have unique epitopes. An antibody that specifically binds to a particular epitope is referred to as a "specific antibody".

[0062] In some examples, an antibody specifically binds to a target (such as a bacterial protein) with a binding constant that is at least 10.sup.3 M.sup.-1 greater, 10.sup.4 M.sup.-1 greater or 10.sup.5 M.sup.-1 greater than a binding constant for other molecules in a sample or subject. In some examples, an antibody or fragments thereof, has an equilibrium constant (Kd) of 1 nM or less. For example, an antibody binds to a target, such as a bacterial protein with a binding affinity of at least about 0.1.times.10.sup.-8 M, at least about 0.3.times.10.sup.-8 M, at least about 0.5.times.10.sup.-8 M, at least about 0.75.times.10.sup.-8 M, at least about 1.0.times.10.sup.-8 M, at least about 1.3.times.10.sup.-8 M at least about 1.5.times.10.sup.-8 M, or at least about 2.0.times.10.sup.-8 M. Kd values can, for example, be determined by competitive ELISA (enzyme-linked immunosorbent assay) or using a surface-plasmon resonance device such as the Biacore T100, which is available from Biacore, Inc., Piscataway, N.J.

[0063] Antigen: A compound, composition, or substance that can stimulate the production of antibodies or a T cell response in an animal, including compositions that are injected or absorbed into an animal. An antigen reacts with the products of specific humoral or cellular immunity, including those induced by heterologous immunogens. The term "antigen" includes all related antigenic epitopes. "Epitope" or "antigenic determinant" refers to a site on an antigen to which B and/or T cells respond. In one embodiment, T cells respond to the epitope, when the epitope is presented in conjunction with an MHC molecule. Epitopes can be formed both from contiguous amino acids or noncontiguous amino acids juxtaposed by tertiary folding of a protein. Generally, T cells recognize epitopes of continuous amino acids. Epitopes formed from contiguous amino acids are typically retained on exposure to denaturing solvents whereas epitopes formed by tertiary folding are typically lost on treatment with denaturing solvents. An epitope typically includes at least 3, and more usually, at least 5, about 9, or about 8-10 amino acids in a unique spatial conformation. Methods of determining spatial conformation of epitopes include, for example, x-ray crystallography and 2-dimensional nuclear magnetic resonance.

[0064] Examples of antigens include, but are not limited to, peptides, lipids, polysaccharides, and nucleic acids containing antigenic determinants, such as those recognized by an immune cell. An antigen can be a tissue-specific antigen, or a disease-specific antigen. These terms are not exclusive, as a tissue-specific antigen can also be a disease specific antigen. A tissue-specific antigen is expressed in a limited number of tissues, such as a single tissue. A tissue specific antigen may be expressed by more than one related type of tissue, such as alveolar and bronchial tissue. A disease-specific antigen is expressed coincidentally with a disease process. Specific non-limiting examples of a disease-specific antigen are an antigen whose expression correlates with, or is predictive of, a bacterial infection, such as tuberculosis. A disease-specific antigen can be an antigen recognized by T cells or B cells.

[0065] CD4: Cluster of differentiation factor 4, a T cell surface protein that mediates interaction with the MHC Class II molecule. CD4 also serves as the primary receptor site for HIV on T cells during HIV infection. Cells that express CD4 are often helper T cells.

[0066] CD8: Cluster of differentiation factor 8, a T cell surface protein that mediates interaction with the MHC Class I molecule. Cells that express CD8 are often cytotoxic T cells. "CD8+ T cell mediated immunity" is an immune response implemented by presentation of antigens to CD8+ T cells.

[0067] Contacting: The process of incubating one agent in the presence of another. Thus, when a cell is contacted with an agent (such as an immunogenic composition), the cell is incubated with the agent for a sufficient period of time for the agent and the cell to interact.

[0068] Cool parts of the body: Regions of a human or other mammalian body that generally have a lower temperature than other parts of the body. The concept of natural human (or other mammal) body temperature variation due to anatomical sites, gender, physiological and ambient temperature. Despite the number of variables, the human (or other mammalian) body can function only in a very narrow temperature range, hence, for example the human body core remains at about 36.degree. C.-39.degree. C. Cool parts of the body include skin, mouth and rectum. Skin temperature, for example, is about 32.degree. C.-35.degree. C. Thus, in some examples, cool parts of the body have temperatures that are at least 1.degree. C. less, at least 2.degree. C. less, at least 3.degree. C. less, at least 4.degree. C. less, at least 4.degree. C. less, or at least 6.degree. C. less, such as 1.degree. C. to 8.degree. C. less, 1.degree. C. to 6.degree. C. less, 2.degree. C. to 6.degree. C. less, or 2.degree. C. to 4.degree. C. less, than other parts of the body, such as the core.

[0069] Cytokine: Proteins made by cells that affect the behavior of other cells, such as lymphocytes. In one embodiment, a cytokine is a chemokine, a molecule that affects cellular trafficking. Specific, non-limiting examples of cytokines include the interleukins (IL-2, IL-4, IL-6, IL-10, IL-21, etc.), and IFN-.gamma..

[0070] Degenerate variant: A TS essential psychrophilic bacteria nucleic acid sequence that encodes a TS essential psychrophilic bacteria protein that includes a nucleic acid sequence that is degenerate as a result of the genetic code. There are 20 natural amino acids, most of which are specified by more than one codon. Therefore, all degenerate nucleotide sequences are included in this disclosure as long as the amino acid sequence of the TS essential psychrophilic bacteria peptide encoded by the nucleotide sequence is unchanged.

[0071] Em.sup.R: Erythromycin resistance.

[0072] Essential gene: A gene that is necessary for the growth of the organism (such as a mesophilic bacterium) under all culturing conditions.

[0073] Expression Control Sequences: Nucleic acid sequences that regulate the expression of a heterologous nucleic acid sequence to which it is operatively linked. Expression control sequences are operatively linked to a nucleic acid sequence when the expression control sequences control and regulate the transcription and, as appropriate, translation of the nucleic acid sequence. Thus expression control sequences can include appropriate promoters, enhancers, transcription terminators, a start codon (i.e., ATG) in front of a protein-encoding gene, splicing signal for introns, maintenance of the correct reading frame of that gene to permit proper translation of mRNA, and stop codons. The term "control sequences" is intended to include, at a minimum, components whose presence can influence expression, and can also include additional components whose presence is advantageous, for example, leader sequences and fusion partner sequences. Expression control sequences can include a promoter.

[0074] A promoter is a minimal sequence sufficient to direct transcription. Also included are those promoter elements which are sufficient to render promoter-dependent gene expression controllable for cell-type specific, tissue-specific, or inducible by external signals or agents; such elements may be located in the 5' or 3' regions of the gene. Both constitutive and inducible promoters, are included (see e.g., Bitter et al., Methods in Enzymology 153:516-544, 1987). For example, when cloning in bacterial systems, inducible promoters such as pL of bacteriophage lambda, plac, ptrp, ptac (ptrp-lac hybrid promoter) and the like may be used. In one embodiment, when cloning in mammalian cell systems, promoters derived from the genome of mammalian cells (e.g., metallothionein promoter) or from mammalian viruses (e.g., the retrovirus long terminal repeat; the adenovirus late promoter; the vaccinia virus 7.5K promoter) can be used. Promoters produced by recombinant DNA or synthetic techniques may also be used to provide for transcription of the nucleic acid sequences. In one embodiment, the promoter is a cytomegalovirus promoter.

[0075] Heat-sensitive: An inability to perform an essential biological function at temperatures above about 28.degree. C. Similarly, the term "heat-sensitive protein or polypeptide" refers to a non-functional mature protein resulting from heat-induced deactivation. An enzyme that does not catalyze its known reaction efficiently enough to support growth, development or life of the organism above about 28.degree. C. is an example of such a protein.

[0076] Heat-sensitive allele: An allele comprising a gene encoding a heat-sensitive protein. Similarly the term, "heat-sensitive gene" refers to a gene encoding a heat-sensitive protein.

[0077] Host cells: Cells into which a heterologous nucleic acid molecule has been introduced. For example, such cells may include a nucleic acid vector that is propagated and its DNA expressed. The cell may be prokaryotic or eukaryotic. The cell can be prokaryotic, such as a bacterial cell. The term also includes any progeny of the subject host cell. It is understood that all progeny may not be identical to the parental cell since there may be mutations that occur during replication. However, such progeny are included when the term "host cell" is used.

[0078] Immune response: A response of a cell of the immune system, such as a B cell, T cell, or monocyte, to a stimulus. In one embodiment, the response is specific for a particular antigen or a particular TS recombinant microbial cell, such as mesophilic bacteria containing a psychrophile essential nucleic acid molecule provided herein. In one embodiment, an immune response is a T cell response, such as a CD4+ response or a CD8+ response. In another embodiment, the response is a B cell response, and results in the production of specific antibodies. The development of an immune response following administration of mesophilic bacteria containing a psychrophile TS essential nucleic acid molecule can be measured using routine methods known in the art, for example by measuring cytokine production as an indication of a protective immune response.

[0079] Immunogenic composition: Compositions that include recombinant mesophilic bacteria containing a psychrophile TS essential nucleic acid molecule that induces a measurable CTL response against a recombinant mesophilic bacteria protein, or induces a measurable B cell response (such as production of antibodies that specifically bind a recombinant mesophilic bacteria-specific protein) against a recombinant mesophilic bacteria protein. For example, the immunogenic polypeptide or a nucleic acid encoding the immunogenic polypeptide can be present in a heat-sensitive mesophilic bacteria generated using the methods provided herein, wherein the bacteria is art of an immunogenic composition that can further include pharmaceutically acceptable carriers, and/or other therapeutic agents. An immunogenic composition can optionally include an adjuvant, a PD-1 antagonist, a co-stimulatory molecule, or a nucleic acid encoding a costimulatory molecule. An immunogenic composition can be readily tested for its ability to induce a CTL by art-recognized assays.

[0080] Immunogenic peptide: A peptide which comprises an allele-specific motif or other sequence such that the peptide will bind an MHC molecule and induce a cytotoxic T lymphocyte ("CTL") response, or a B cell response (e.g. antibody production) against the antigen from which the immunogenic peptide is derived. Immunogenic peptides can also be identified by measuring their binding to a specific MHC protein and by their ability to stimulate CD4 and/or CD8 when presented in the context of the MHC protein.

[0081] Generally, immunogenic polypeptides can be used to induce an immune response in a subject, such as a B cell response or a T cell response. In one example, an immunogenic polypeptide, when bound to a MHC Class I molecule, activates cytotoxic T lymphocytes (CTLs) against the polypeptide. Induction of CTLs using synthetic peptides and CTL cytotoxicity assays are known in the art, see U.S. Pat. No. 5,662,907. In one example, an immunogenic peptide includes an allele-specific motif or other sequence such that the peptide will bind an MHC molecule and induce a cytotoxic T lymphocyte ("CTL") response against the antigen from which the immunogenic peptide is derived.

[0082] Immunologically reactive conditions: Conditions that allow an antibody specific for a particular epitope to bind to that epitope to a greater degree than, and/or to the substantial exclusion of, binding to substantially all other epitopes. These conditions are dependent upon the format of the antibody binding reaction and typically are those utilized in immunoassay protocols or those conditions encountered in vivo. The immunologically reactive conditions employed in the disclosed methods are "physiological conditions" which include reference to conditions (e.g., temperature, osmolarity, pH) that are typical inside a living mammal or a mammalian cell. While it is recognized that some organs are subject to extreme conditions, the intra-organ and intracellular environment is generally about pH 7 (e.g., from pH 6.0 to pH 8.0, or pH 6.5 to pH 7.5, such as pH 7.2), contains water as the predominant solvent, and exists at a temperature above 0.degree. C. and below 50.degree. C. Osmolarity is within the range that is supportive of cell viability and proliferation. These conditions are well known to those skilled in these arts.

[0083] Interferon gamma (IFN-.gamma.): IFN-.gamma. is a dimeric protein with subunits of 146 amino acids. The protein is glycosylated at two sites, and the pI is 8.3-8.5. IFN-.gamma. is synthesized as a precursor protein of 166 amino acids including a secretory signal sequence of 23 amino acids. Two molecular forms of the biologically active protein of 20 and 25 kDa have been described. Both of them are glycosylated at position 25. The 25 kDa form is also glycosylated at position 97. The observed differences of natural IFN-.gamma. with respect to molecular mass and charge are due to variable glycosylation patterns. 40-60 kDa forms observed under non-denaturing conditions are dimers and tetramers of IFN-.gamma.. The human gene has a length of approximately 6 kb. It contains four exons and maps to chromosome 12q24.1.

[0084] IFN-.gamma. can be detected by sensitive immunoassays, such as an ELISPOT test that allows detection of individual cells producing IFN-.gamma.. Minute amounts of IFN-.gamma. can be detected indirectly by measuring IFN-induced proteins such as Mx protein. The induction of the synthesis of IP-10 has been used also to measure IFN-.gamma. concentrations. In addition, bioassays can be used to detect IFN-.gamma., such as an assay that employs induction of indoleamine 2,3-dioxygenase activity in 2D9 cells. The production of IFN-.gamma. can be used to assess T cell activation, such as activation of a T cell by bacterial antigen.

[0085] Isolated: A biological component (such as a nucleic acid molecule, protein or organelle) that has been substantially separated or purified away from other biological components in the cell of the organism in which the component naturally occurs, e.g., other chromosomal and extra-chromosomal DNA and RNA, proteins and organelles. Nucleic acid molecules and proteins that have been "isolated" include nucleic acid molecules and proteins purified by standard purification methods. In another embodiment, "isolated" refers to nucleic acid molecules and proteins prepared by recombinant expression in a host cell as well as chemically synthesized nucleic acids.

[0086] ligA: A wt allele of the gene encoding NAD-dependent DNA ligase found in mesophilic bacteria such as F. novicida, M. smegmatis or E. coli. Furthermore, ligA with a subscript, such as ligA.sub.Cp, ligA.sub.Sf, or ligA.sub.Ph, refers to a wt allele of the gene encoding NAD-dependent DNA ligase found in psychrophilic bacteria. For example ligA.sub.Cp refers to the wt allele of ligA found in the Arctic bacterium C. psychrerythraea strain 34H which has a maximal growth temperature below 18.degree. C. The ligA sequences from psychrophilic bacteria can be introduced into mesophilic bacteria, to confer temperature sensitivity to the mesophilic bacteria.

[0087] Mesophile: An organism naturally found in environments at temperatures between about 20.degree. C. and 50.degree. C. A bacterial mesophile refers to a bacterium that is normally associated with a mammal and thus is normally functioning at temperatures between about 32.degree. C. and 45.degree. C.

[0088] Psychrophile: An organism naturally found in environments that are permanently below 20.degree. C., often permanently below 10.degree. C. and sometimes below 0.degree. C. Such permanently cold environments include most ocean environments, permafrost soils, Arctic and Antarctic environments. Those skilled in these arts will understand that "psychrophile" and "psychrotroph" are commonly used to describe bacteria that grow in cold environments.

[0089] Psychrophilic: Features found in psychrophiles. For example, a "psychrophilic enzyme" is an enzyme isolated from a psychrophile.

[0090] Peptide modifications: Analogs (non-peptide organic molecules), derivatives (chemically functionalized peptide molecules obtained starting with the disclosed peptide sequences) and variants (homologs) of proteins that can be used in the methods and compositions provided herein. Peptides are comprised of amino acids, which may be either L- and/or D-amino acids, naturally occurring and otherwise. The peptides can be modified by a variety of chemical techniques to produce derivatives having essentially the same activity as the unmodified peptides, and optionally having other desirable properties. Modifications are well known to those skilled in these arts.

[0091] Pharmaceutically acceptable carriers: The pharmaceutically acceptable carriers (vehicles) useful in this disclosure are conventional. Remington's Pharmaceutical Sciences, by E. W. Martin, Mack Publishing Co., Easton, Pa., 19th Edition (1995), describes compositions and formulations suitable for pharmaceutical delivery of one or more therapeutic composition, such as an immunogenic composition.

[0092] The disclosed purified active compositions can be administered alone or combined with an acceptable carrier. Preparations can contain one type of therapeutic molecule, or can be composed of a combination of several types of therapeutic molecules. The nature of the carrier will depend on the particular mode of administration being utilized.

[0093] In general, the nature of the carrier will depend on the particular mode of administration being employed. For instance, parenteral formulations usually comprise injectable fluids that include pharmaceutically and physiologically acceptable fluids such as water, physiological saline, balanced salt solutions, aqueous dextrose, glycerol or the like as a vehicle. For solid compositions (for example, powder, pill, tablet, or capsule forms), conventional non-toxic solid carriers can include, for example, pharmaceutical grades of mannitol, lactose, starch, or magnesium stearate. In addition to biologically-neutral carriers, pharmaceutical compositions to be administered can contain minor amounts of non-toxic auxiliary substances, such as wetting or emulsifying agents, preservatives, and pH buffering agents and the like, for example sodium acetate or sorbitan monolaurate.

[0094] Preventing or treating a disease: "Preventing" a disease refers to inhibiting the full development of a disease, for example in a person who is known to be at risk of infection with M. tuberculosis, or M. leprae. An example of a person with a known predisposition is someone living with a person diagnosed with tuberculosis, health care professionals, or someone otherwise known to have been exposed to M. tuberculosis. "Treatment" refers to a therapeutic intervention that ameliorates a sign or symptom of a disease or pathological condition, such as tuberculosis, after it has begun to develop.

[0095] Purified: The term purified does not require absolute purity; rather, it is intended as a relative term. Thus, for example, a purified protein preparation is one in which the protein is more pure than the protein in its originating environment within a cell. A preparation of a protein is typically purified such that the protein represents at least 50% of the total protein content of the preparation. However, more highly purified preparations may be required for certain applications. For example, for such applications, preparations in which the protein includes at least 75% or at least 90% of the total protein content may be employed.

[0096] Recombinant: A nucleic acid molecule that has a sequence not naturally occurring or a sequence that is made by an artificial combination of two naturally separated segments of sequence. This artificial combination is often accomplished by chemical synthesis or by the artificial manipulation of isolated segments of nucleic acids, by genetic engineering techniques, for example. Also refers to cells into which a non-native nucleic acid molecule has been introduced.

[0097] Resistant to infection: Animals (e.g., mammals) that demonstrate decreased symptoms of infection compared to non-resistant animals. Evidence of resistance to infection can appear as, for example, lower rates of mortality, increased life spans measured after exposure to the infective agent, fewer or less intense physiological symptoms, such as fewer lesions, or decreased cellular or tissue concentrations of the infective agent. In one embodiment, resistance to infection is demonstrated by a heightened immune response.

[0098] Restrictive temperature: The lowest temperature at which an organism is unable to grow. For example, in Table 1 "restrictive temperature" specifically refers to the lowest temperature at which the F. novicida strain with a psychrophilic gene integrated is unable to form an isolated colony on agar media. Due to the variation in the temperature of incubators, these temperatures are interpreted as being about .+-.1.degree. C.

[0099] sacB cassette: A modular DNA sequence encoding the enzyme levansucrase from Bacillus subtilus. Expression of this gene is lethal in the presence of sucrose to many bacteria and can thus be used as a counter-selection agent to help select for the loss of gene sequences.

[0100] Selective hybridization: Hybridization under moderately or highly stringent conditions that exclude non-related nucleotide sequences, the techniques of hybridization are known to those skilled in these arts.

[0101] Sequence identity: The identity/similarity between two or more nucleic acid sequences, or two or more amino acid sequences, expressed in terms of the identity or similarity between the sequences. Sequence identity can be measured in terms of percentage identity; the higher the percentage, the more identical the sequences are. Sequence similarity can be measured in terms of percentage similarity (which takes into account conservative amino acid substitutions); the higher the percentage, the more similar the sequences are.

[0102] Methods of alignment of sequences for comparison are well known in the art. Various programs and alignment algorithms are described in: Smith & Waterman, Adv. Appl. Math. 2:482, 1981; Needleman & Wunsch, J. Mol. Biol. 48:443, 1970; Pearson & Lipman, Proc. Natl. Acad. Sci. USA 85:2444, 1988; Higgins & Sharp, Gene, 73:237-44, 1988; Higgins & Sharp, CABIOS 5:151-3, 1989; Corpet et al., Nuc. Acids Res. 16:10881-90, 1988; Huang et al. Computer Appls. in the Biosciences 8, 155-65, 1992; and Pearson et al., Meth. Mol. Bio. 24:307-31, 1994. Altschul et al., J. Mol. Biol. 215:403-10, 1990, presents a detailed consideration of sequence alignment methods and homology calculations.

[0103] The NCBI Basic Local Alignment Search Tool (BLAST) (Altschul et al., J. Mol. Biol. 215:403-10, 1990) is available from several sources, including the National Center for Biological Information (NCBI, National Library of Medicine, Building 38A, Room 8N805, Bethesda, Md. 20894) and on the Internet, for use in connection with the sequence analysis programs blastp, blastn, blastx, tblastn and tblastx. Additional information can be found at the NCBI web site.

[0104] BLASTN is used to compare nucleic acid sequences, while BLASTP is used to compare amino acid sequences. If the two compared sequences share homology, then the designated output file will present those regions of homology as aligned sequences. If the two compared sequences do not share homology, then the designated output file will not present aligned sequences.

[0105] Once aligned, the number of matches is determined by counting the number of positions where an identical nucleotide or amino acid residue is presented in both sequences. The percent sequence identity is determined by dividing the number of matches either by the length of the sequence set forth in the identified sequence, or by an articulated length (such as 100 consecutive nucleotides or amino acid residues from a sequence set forth in an identified sequence), followed by multiplying the resulting value by 100. For example, a nucleic acid sequence that has 1166 matches when aligned with a test sequence having 1154 nucleotides is 75.0 percent identical to the test sequence (1166/1554*100=75.0). The percent sequence identity value is rounded to the nearest tenth. For example, 75.11, 75.12, 75.13, and 75.14 are rounded down to 75.1, while 75.15, 75.16, 75.17, 75.18, and 75.19 are rounded up to 75.2. The length value will always be an integer. In another example, a target sequence containing a 15-nucleotide region that aligns with 20 consecutive nucleotides from an identified sequence as follows contains a region that shares 75 percent sequence identity to that identified sequence (that is, 15/20*100=75).

[0106] For comparisons of amino acid sequences of greater than about 30 amino acids, the Blast 2 sequences function is employed using the default BLOSUM62 matrix set to default parameters, (gap existence cost of 11, and a per residue gap cost of 1). Homologs are typically characterized by possession of at least 30% sequence identity or more counted over the full-length alignment with an amino acid sequence using the NCBI Basic Blast 2.0, gapped blastp with databases such as the nr or swissprot database. Queries searched with the blastn program are filtered with DUST (Hancock and Armstrong, 1994, Comput. Appl. Biosci. 10:67-70). Other programs use SEG. In addition, a manual alignment can be performed. Proteins with even greater similarity will show increasing percentage identities when assessed by this method, such as at least about 75%, 80%, 85%, 90%, 95%, 98%, or 99% sequence identity with a protein disclosed herein. Thus in one example, a protein that can be used in the disclosed methods and compositions has at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 98%, or at least 99% sequence identity to SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26 and 28 and retains the ability to confer TS (such as heat-sensitivity) to a mesophilic bacteria.

[0107] One indication that two nucleic acid molecules are closely related is that the two molecules hybridize to each other under stringent conditions, as described above. Nucleic acid sequences that do not show a high degree of identity may nevertheless encode identical or similar (conserved) amino acid sequences, due to the degeneracy of the genetic code. Changes in a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid molecules that all encode substantially the same protein. Such homologous nucleic acid sequences can, for example, possess at least about 60%, 70%, 80%, 90%, 95%, 98%, or 99% sequence identity with a disclosed nucleic acid sequence as determined by this method. Thus in one example, a nucleic acid sequence that can be used in disclosed methods and compositions has at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 98%, or at least 99% sequence identity to SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, and 27 and retains the ability to encode a protein that can confer TS (such as heat-sensitivity) to a mesophilic bacteria. An alternative (and not necessarily cumulative) indication that two nucleic acid sequences are substantially identical is that the peptide which the first nucleic acid encodes is immunologically cross reactive with the peptide encoded by the second nucleic acid.

[0108] Temperature-sensitive (TS)" or "heat-sensitive (HS): A bacterial component (such as a protein) or bacterium that is active up to about 30.degree. C. and inactivated at a temperature that is normally found in the human body, e.g., above about 30.degree. C.

[0109] Tester strain: A mesophilic bacterium that is amenable to gene replacement allowing the substitution of a psychrophilic essential gene for the homologue naturally found in the tester strain.

[0110] Therapeutically effective amount: An amount of a composition that alone, or together with an additional therapeutic agent(s) sufficient to achieve a desired effect in a subject, or in a cell, being treated with the agent. The effective amount of the agent (such as an immunogenic composition provided herein) can be dependent on several factors, including, but not limited to the subject or cells being treated, the particular therapeutic agent, and the manner of administration of the therapeutic composition. In one example, a therapeutically effective amount or concentration is one that is sufficient to prevent advancement, delay progression, or to cause regression of a disease, or which is capable of reducing symptoms caused by the disease, such as a bacterial infection (e.g., tuberculosis).

[0111] In one example, a desired response is to reduce or inhibit one or more symptoms associated with a bacterial infection. The one or more symptoms do not have to be completely eliminated for the composition to be effective. The effective amount of an agent that includes one of the disclosed immunogenic compositions that is administered to a human or veterinary subject will vary depending upon a number of factors associated with that subject, for example the overall health of the subject. An effective amount of an agent can be determined by varying the dosage of the product and measuring the resulting therapeutic response, such as the prevention of bacterial infection. Effective amounts also can be determined through various in vitro, in vivo or in situ immunoassays. The disclosed agents can be administered in a single dose, or in several doses, as needed to obtain the desired response.

[0112] In particular examples, a therapeutically effective dose of an immunogenic composition includes at least 10.sup.2 colony forming units (CFU), such as at least 10.sup.3, at least 10.sup.4, at least 10.sup.5, at least 10.sup.6, at least 10.sup.7, or at least 10.sup.8 CFU, for example 10.sup.2 to 10.sup.8 CFU. In one example, 10.sup.2 to 10.sup.8 CFU of live bacteria are administered intradermally or intranasally. However, one skilled in the art will recognize that higher or lower dosages also could be used, for example depending on the particular immunogenic composition. In particular examples, such daily dosages are administered in one or more divided doses (such as 2, 3, or 4 doses) or in a single formulation. The disclosed immunogenic composition can be administered alone, in the presence of a pharmaceutically acceptable carrier, in the presence of other therapeutic agents.

[0113] Treatment: A therapeutic intervention that ameliorates a sign or symptom of a disease or pathological condition after it has begun to develop. In one example, the immunogenic compositions disclosed herein following administration to a mammal achieves a reduction in one or more signs of a bacterial infection.

[0114] Vector: A nucleic acid molecule as introduced into a host cell, thereby producing a transduced or transformed host cell, referred to herein as a recombinant cell. A vector may include nucleic acid sequences that permit it to replicate in a host cell, such as an origin of replication. A vector may also include one or more selectable marker gene and other genetic elements known in the art. Vectors include plasmid vectors, including plasmids for expression in gram negative and gram positive bacterial cells. Exemplary vectors include those for expression in E. coli and Salmonella. Vectors also include viral vectors, such as, but are not limited to, retrovirus, orthopox, avipox, fowlpox, capripox, suipox, adenoviral, herpes virus, alpha virus, baculovirus, Sindbis virus, vaccinia virus and poliovirus vectors.

[0115] Temperature-Sensitive Essential Genes from Psychrophilic Bacteria

[0116] It is disclosed herein that several nucleic acid molecules, and their corresponding peptides, can be introduced into a bacteria to confer temperature sensitivity (TS), such as heat-sensitivity, to the host bacteria. The resulting bacteria can be used to induce an immune response to the temperature-sensitive bacteria, such as a T cell response. Exemplary psychrophilic essential genes with desired temperature sensitivity, and their corresponding peptides, are provided herein. For example, host mesophilic bacteria can be transformed with one or more psychrophile TS essential nucleic acid molecules, thereby conferring TS to the mesophilic bacteria. The resulting recombinant mesophilic bacteria can be formulated into an immunogenic composition, to treat or prevent infection by the meosophilic bacteria. For example, recombinant mesophilic M. tuberculosis bacterium containing one or more psychrophile TS essential nucleic acid molecules can be used to treat or prevent tuberculosis. The same approach can be used to make TS forms of Bacillus anthracis, Brucella abortus, Burkholderia pseudomallei, Haemophilus influenzae, Mycobacterium bovis, Salmonella typhi, Shigella dysenteriae, Staphylococcus aureus, Streptococcus pneumoniae, and Yersinia pestis which cause anthrax, brucellosis, melioidosis, meningitis, bovine tuberculosis, typhoid fever, dysentery, numerous types of nosocomial infections, pneumonia, and plague. Thus, such TS bacteria can be used to treat or prevent such conditions.

[0117] Temperature-sensitive essential proteins from a psychrophilic bacterium are provided herein, such as those from Colwellia sp., Pseudoalteromonas sp., or Shewanella sp. Exemplarily proteins include ligA, pyrG, hemC, ftsZ, cmk, murG, fmt, and dnaK. Exemplary sequences are provided in the amino acid sequence shown in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, and 28. However, one skilled in the art will appreciate that variant sequences can also be used. For example, a peptide having a sequence that is at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% identical to the amino acid sequence set forth in one of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, and 28 is encompassed by the present disclosure, and can be used in the methods provided herein. Variant sequences retain the biological activity of the native temperature-sensitive essential protein from a psychrophilic bacterium, such as conferring the ability to make a bacterium TS (such as heat sensitivity), for example operable at a temperature of -10.degree. C. to about 30.degree. C. (such as 0.degree. C. to 30.degree. C.), but inoperable at a temperature greater than about 30.degree. C. (for example 4.degree. C. to 30.degree. C.), such as greater than 35.degree. C. Exemplary sequences can be obtained using computer programs that are readily available on the internet and the amino acid sequences set forth herein. In one example, the variant peptide retains a function of the native protein, such as the ability to confer temperature sensitivity to a bacterium.

[0118] A specific, non-limiting example of a variant protein is a conservative variant of the native protein (e.g., SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, and 28). Substitutions of the amino acids sequence shown in SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, and 28 can be made based on this table, as long as the pathogenic mesophilic bacteria are rendered TS and are able to initiate an immune response to its pathogenic antigens. For example, protein sequences can be altered without significantly altering their biological properties, for example by introducing one or more conservative amino acid substitutions. Therefore, any of SEQ ID NOS: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22 24, 26, or 28 can be modified by making 1 to 20, 1 to 15, 1 to 12, 1 to 10, or 1 to 5 conservative amino acid substitutions, such as 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 or 50 conservative amino acid substitutions, while retaining the ability to render a mesophilic bacteria temperature sensitive (TS). Examples of conservative substitutions are shown below:

TABLE-US-00001 Original Conservative Residue Substitutions Ala Ser Arg Lys Asn Gln, His Asp Glu Cys Ser Gln Asn Glu Asp His Asn; Gln Ile Leu, Val Leu Ile; Val Lys Arg; Gln; Glu Met Leu; Ile Phe Met; Leu; Tyr Ser Thr Thr Ser Trp Tyr Tyr Trp; Phe Val Ile; Leu

[0119] Minor modifications to the disclosed protein sequences can result in peptides which have substantially equivalent activity as compared to the unmodified counterpart protein described herein. Such modifications may be deliberate, as by site-directed mutagenesis, or may be spontaneous. All of the proteins produced by these modifications are included herein.

[0120] Temperature-sensitive essential proteins (and nucleic acid molecules) from a psychrophilic bacterium are disclosed herein that can be used to induce temperature sensitivity in a desired bacterial host, wherein the resulting recombinant bacteria can be used to induce an immune response (for example in a mammal). These peptides can include fragments of the full-length native protein, as long as the ability to confer temperature sensitivity in the host cell is retained. In these examples, the peptide does not include the full-length amino acid sequences set forth as 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, and 28. For example no more than 10%, no more than 5%, or no more than 1% of the amino acids can be deleted, such as 1% to 5% of the amino acids.

[0121] The isolated temperature-sensitive essential proteins can be part of a fusion protein. Thus, the fusion protein can include the temperature-sensitive essential protein (see above) and a second heterologous moiety, such as a myc protein, an enzyme or a carrier (such as a hepatitis carrier protein or bovine serum albumin) covalently linked to the temperature-sensitive essential protein. In additional examples, the temperature-sensitive essential protein includes six sequential histidine residues, a .beta.-galactosidase amino acid sequence, or an immunoglobulin amino acid sequence, for example at the C- or N-terminus of the temperature-sensitive essential protein. The temperature-sensitive essential protein can also be covalently linked to a carrier. Suitable carriers include, but are not limited to, a hepatitis B small envelope protein HBsAg.

[0122] The temperature-sensitive essential proteins disclosed herein can be chemically synthesized by standard methods, or can be produced recombinantly. An exemplary process for polypeptide production is described in Lu et al., Federation of European Biochemical Societies Letters. 429:31-35, 1998. Proteins can also be produced using molecular genetic techniques, such as by inserting a nucleic acid encoding a temperature-sensitive essential protein into an expression vector, introducing the expression vector into a host cell. They can also be isolated by methods including preparative chromatography and immunological separations.

[0123] Temperature-sensitive essential nucleic acid molecules from a psychrophilic bacterium are provided herein. Exemplary sequences are provided in the nucleic acid sequence shown in SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, and 27. However, one skilled in the art will appreciate that variant sequences can also be used. For example, a nucleic acid molecule having a sequence that is at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% identical to the nucleic acid sequence set forth in one of SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25 and 27 (such as at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% identical to the nucleic acid sequence set forth in nucleotides 10-2067 of SEQ ID NO: 1, nucleotides 10-2019 of SEQ ID NO: 3, nucleotides 10-2019 of SEQ ID NO: 5, nucleotides 10-2010 of SEQ ID NO: 7) is encompassed by the present disclosure, and can be used in the methods provided herein. In some examples, the codons of a nucleic acid molecule are optimized for the bacterium into which it is introduced. In some examples, such optimization does not alter the amino acid sequence encoded thereby. For example, the psychrophilic bacterium TS essential nucleic acid can be modified to optimize codon usage for the mesophilic bacterium (e.g., M. tuberculosis or F. novicida) into which the psychrophilic bacterium TS essential nucleic acid is introduced. Exemplary sequences can be obtained using computer programs that are readily available on the internet and the nucleic acid sequences set forth herein. In one example, the variant nucleic acid sequence retains the ability to encode a protein having the function of the native protein, such as the ability to confer temperature sensitivity (e.g., heat sensitivity) to a mesophilic bacterium.

[0124] The disclosed temperature-sensitive essential nucleic acid molecules from a psychrophilic bacterium include DNA, cDNA and RNA sequences which encode the temperature-sensitive essential peptide. Silent mutations in the coding sequence result from the degeneracy (i.e., redundancy) of the genetic code, whereby more than one codon can encode the same amino acid residue. Thus, for example, leucine can be encoded by CTT, CTC, CTA, CTG, TTA, or TTG; serine can be encoded by TCT, TCC, TCA, TCG, AGT, or AGC; asparagine can be encoded by AAT or AAC; aspartic acid can be encoded by GAT or GAC; cysteine can be encoded by TGT or TGC; alanine can be encoded by GCT, GCC, GCA, or GCG; glutamine can be encoded by CAA or CAG; tyrosine can be encoded by TAT or TAC; and isoleucine can be encoded by ATT, ATC, or ATA. Tables showing the standard genetic code can be found in various sources (e.g., L. Stryer, 1988, Biochemistry, 3.sup.rd Edition, W.H. 5 Freeman and Co., NY).

[0125] A nucleic acid molecule encoding a temperature-sensitive essential peptide from a psychrophilic bacterium can be cloned or amplified by in vitro methods, such as the polymerase chain reaction (PCR), the ligase chain reaction (LCR), the transcription-based amplification system (TAS), the self-sustained sequence replication system (3SR) and the Q.beta. replicase amplification system (QB). For example, a polynucleotide encoding the protein can be isolated by polymerase chain reaction of cDNA using primers based on the DNA sequence of the molecule. A wide variety of cloning and in vitro amplification methodologies are well known to persons skilled in the art. PCR methods are described in, for example, U.S. Pat. No. 4,683,195; Mullis et al., Cold Spring Harbor Symp. Quant. Biol. 51:263, 1987; and Erlich, ed., PCR Technology, (Stockton Press, N Y, 1989). Polynucleotides also can be isolated by screening genomic or cDNA libraries with probes selected from the sequences of the desired polynucleotide under stringent hybridization conditions.

[0126] The nucleic acid molecules encoding a temperature-sensitive essential peptide from a psychrophilic bacterium include a recombinant DNA which is incorporated into a vector, into an autonomously replicating plasmid or virus, or into the genomic DNA of a prokaryote or eukaryote, or which exists as a separate molecule (such as a cDNA) independent of other sequences. The nucleic acid molecules disclosed herein can be ribonucleotides, deoxyribonucleotides, or modified forms of either nucleotide. The term includes single and double forms of DNA.

[0127] The nucleic acid molecules encoding a temperature-sensitive essential peptide from a psychrophilic bacterium can be part of a vector, such as a plasmid or viral vector. Suitable vectors include retrovirus vectors, orthopox vectors, avipox vectors, fowlpox vectors, capripox vectors, suipox vectors, adenoviral vectors, herpes virus vectors, alpha virus vectors, baculovirus vectors, Sindbis virus vectors, vaccinia virus vectors and poliovirus vectors. Specific exemplary vectors are poxvirus vectors such as vaccinia virus, fowlpox virus and a highly attenuated vaccinia virus (MVA), adenovirus, baculovirus and the like. Other viral vectors that can be used include other DNA viruses such as herpes virus and adenoviruses, and RNA viruses such as retroviruses and polio.

[0128] The nucleic acid molecules encoding a temperature-sensitive essential peptide from a psychrophilic bacterium can be operably linked to at least one expression control element. The expression control elements are inserted in the vector or plasmid to control and regulate the expression of the nucleic acid sequence. For example, an expression control sequence operatively linked to a temperature-sensitive essential peptide coding sequence is ligated such that expression of the coding sequence is achieved under conditions compatible with the expression control sequences. The expression control sequences include, but are not limited to, appropriate promoters, enhancers, transcription terminators, a start codon (i.e., ATG) in front of a protein-encoding gene, splicing signal for introns, maintenance of the correct reading frame of that gene to permit proper translation of mRNA, and stop codons. Specific examples of expression control elements include, but are not limited to, lac system, operator and promoter regions of phage lambda, yeast promoters and promoters derived from polyoma, adenovirus, retrovirus or SV40. Additional operational elements include, but are not limited to, leader sequence, termination codons, polyadenylation signals and any other sequences necessary for the appropriate transcription and subsequent translation of the nucleic acid sequence encoding the temperature-sensitive essential peptide from a psychrophilic bacterium in the host system. The expression vector can contain additional elements necessary for the transfer and subsequent replication of the expression vector containing the nucleic acid sequence in the host system. Examples of such elements include, but are not limited to, origins of replication and selectable markers. It will further be understood by one skilled in the art that such vectors are easily constructed using conventional methods (Ausubel et al., (1987) in "Current Protocols in Molecular Biology," John Wiley and Sons, New York, N.Y.) and are commercially available.

[0129] In one example, vector introduced into a host bacterium includes one or more of the following elements: (i) a prokaryotic origin of replication, so that the vector may be amplified in a prokaryotic host; (ii) a gene encoding a marker which allows selection of prokaryotic host cells that contain the vector (e.g., a gene encoding antibiotic resistance); (iii) at least one DNA sequence encoding one or more temperature-sensitive essential peptides from a psychrophilic bacterium located adjacent to a transcriptional promoter capable of directing the expression of the sequence; and (iv) DNA sequences homologous to the region of the parent virus genome where the foreign gene(s) will be inserted, flanking the construct of element (iii).

[0130] The vector can contain an additional gene that encodes a marker that will allow identification of recombinant cells containing inserted foreign DNA. These include genes that encode antibiotic or chemical resistance (e.g., see Spyropoulos et al., 1988, J. Virol. 62:1046; Falkner and Moss, 1988, J. Virol. 62:1849; Franke et al., 1985, Mol. Cell. Biol. 5:1918), as well as genes such as the E. coli lacZ gene, that permits identification of recombinant plaques by colorimetric assay.

[0131] Methods of introducing nucleic acid molecules, such as those that encode a temperature-sensitive essential peptide from a psychrophilic bacterium, are well known to those skilled in the art. Where the host is prokaryotic, such as, a bacterium, competent cells which are capable of DNA uptake can be prepared from cells harvested after exponential growth phase and subsequently treated by the CaCl.sub.2 method using procedures well known in the art. Alternatively, MgCl.sub.2 or RbCl can be used. Transformation can also be performed after forming a protoplast of the host cell if desired, or by electroporation. Hosts cells can include bacterial cells, such as bacteria that cause disease. Examples of such bacteria that can be used as host cells for temperature-sensitive essential nucleic acids/peptides from a psychrophilic bacterium include without limitation any one or more of (or any combination of) Acinetobacter baumanii, Actinobacillus sp., Actinomycetes, Actinomyces sp. (such as Actinomyces israelii and Actinomyces naeslundii), Aeromonas sp. (such as Aeromonas hydrophila, Aeromonas veronii biovar sobria (Aeromonas sobria), and Aeromonas caviae), Anaplasma phagocytophilum, Alcaligenes xylosoxidans, Acinetobacter baumanii, Actinobacillus actinomycetemcomitans, Bacillus sp. (such as Bacillus anthracis, Bacillus cereus, Bacillus subtilis, Bacillus thuringiensis, and Bacillus stearothermophilus), Bacteroides sp. (such as Bacteroides fragilis), Bartonella sp. (such as Bartonella bacilliformis and Bartonella henselae, Bifidobacterium sp., Bordetella sp. (such as Bordetella pertussis, Bordetella parapertussis, and Bordetella bronchiseptica), Borrelia sp. (such as Borrelia recurrentis, and Borrelia burgdorferi), Brucella sp. (such as Brucella abortus, Brucella canis, Brucella melintensis and Brucella suis), Burkholderia sp. (such as Burkholderia pseudomallei and Burkholderia cepacia), Campylobacter sp. (such as Campylobacter jejuni, Campylobacter coli, Campylobacter lari and Campylobacter fetus), Capnocytophaga sp., Cardiobacterium hominis, Chlamydia trachomatis, Chlamydophila pneumoniae, Chlamydophila psittaci, Citrobacter sp. Coxiella burnetii, Corynebacterium sp. (such as, Corynebacterium diphtheriae, Corynebacterium jeikeum and Corynebacterium), Clostridium sp. (such as Clostridium perfringens, Clostridium difficile, Clostridium botulinum and Clostridium tetani), Eikenella corrodens, Enterobacter sp. (such as Enterobacter aerogenes, Enterobacter agglomerans, Enterobacter cloacae and Escherichia coli, including opportunistic Escherichia coli, such as enterotoxigenic E. coli, enteroinvasive E. coli, enteropathogenic E. coli, enterohemorrhagic E. coli, enteroaggregative E. coli and uropathogenic E. coli) Enterococcus sp. (such as Enterococcus faecalis and Enterococcus faecium) Ehrlichia sp. (such as Ehrlichia chafeensia and Ehrlichia canis), Erysipelothrix rhusiopathiae, Eubacterium sp., Francisella tularensis, Fusobacterium nucleatum, Gardnerella vaginalis, Gemella morbillorum, Haemophilus sp. (such as Haemophilus influenzae, Haemophilus ducreyi, Haemophilus aegyptius, Haemophilus parainfluenzae, Haemophilus haemolyticus and Haemophilus parahaemolyticus, Helicobacter sp. (such as Helicobacter pylori, Helicobacter cinaedi and Helicobacter fennelliae), Kingella kingii, Klebsiella sp. (such as Klebsiella pneumoniae, Klebsiella granulomatis and Klebsiella oxytoca), Lactobacillus sp., Listeria monocytogenes, Leptospira interrogans, Legionella pneumophila, Leptospira interrogans, Peptostreptococcus sp., Moraxella catarrhalis, Morganella sp., Mobiluncus sp., Micrococcus sp., Mycobacterium sp. (such as Mycobacterium leprae, Mycobacterium tuberculosis, Mycobacterium intracellulare, Mycobacterium avium, Mycobacterium bovis, and Mycobacterium marinum), Mycoplasm sp. (such as Mycoplasma pneumoniae, Mycoplasma hominis, and Mycoplasma genitalium), Nocardia sp. (such as Nocardia asteroides, Nocardia cyriacigeorgica and Nocardia brasiliensis), Neisseria sp. (such as Neisseria gonorrhoeae and Neisseria meningitidis), Pasteurella multocida, Plesiomonas shigelloides. Prevotella sp., Porphyromonas sp., Prevotella melaninogenica, Proteus sp. (such as Proteus vulgaris and Proteus mirabilis), Providencia sp. (such as Providencia alcalifaciens, Providencia rettgeri and Providencia stuartii), Pseudomonas aeruginosa, Propionibacterium acnes, Rhodococcus equi, Rickettsia sp. (such as Rickettsia rickettsii, Rickettsia akari and Rickettsia prowazekii, Orientia tsutsugamushi (formerly: Rickettsia tsutsugamushi) and Rickettsia typhi), Rhodococcus sp., Serratia marcescens, Stenotrophomonas maltophilia, Salmonella sp. (such as Salmonella enterica, Salmonella typhi, Salmonella paratyphi, Salmonella enteritidis, Salmonella cholerasuis and Salmonella typhimurium), Serratia sp. (such as Serratia marcesans and Serratia liquifaciens), Shigella sp. (such as Shigella dysenteriae, Shigella flexneri, Shigella boydii and Shigella sonnei), Staphylococcus sp. (such as Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus hemolyticus, Staphylococcus saprophyticus), Streptococcus sp. (such as Streptococcus pneumoniae (for example chloramphenicol-resistant serotype 4 Streptococcus pneumoniae, spectinomycin-resistant serotype 6B Streptococcus pneumoniae, streptomycin-resistant serotype 9V Streptococcus pneumoniae, erythromycin-resistant serotype 14 Streptococcus pneumoniae, optochin-resistant serotype 14 Streptococcus pneumoniae, rifampicin-resistant serotype 18C Streptococcus pneumoniae, tetracycline-resistant serotype 19F Streptococcus pneumoniae, penicillin-resistant serotype 19F Streptococcus pneumoniae, and trimethoprim-resistant serotype 23F Streptococcus pneumoniae, chloramphenicol-resistant serotype 4 Streptococcus pneumoniae, spectinomycin-resistant serotype 6B Streptococcus pneumoniae, streptomycin-resistant serotype 9V Streptococcus pneumoniae, optochin-resistant serotype 14 Streptococcus pneumoniae, rifampicin-resistant serotype 18C Streptococcus pneumoniae, penicillin-resistant serotype 19F Streptococcus pneumoniae, or trimethoprim-resistant serotype 23F Streptococcus pneumoniae), Streptococcus agalactiae, Streptococcus mutans, Streptococcus pyogenes, Group A streptococci, Streptococcus pyogenes, Group B streptococci, Streptococcus agalactiae, Group C streptococci, Streptococcus anginosus, Streptococcus equismilis, Group D streptococci, Streptococcus bovis, Group F streptococci, and Streptococcus anginosus Group G streptococci), Spirillum minus, Streptobacillus moniliformi, Treponema sp. (such as Treponema carateum, Treponema petenue, Treponema pallidum and Treponema endemicum, Tropheryma whippelii, Ureaplasma urealyticum, Veillonella sp., Vibrio sp. (such as Vibrio cholerae, Vibrio parahemolyticus, Vibrio vulnificus, Vibrio parahaemolyticus, Vibrio vulnificus, Vibrio alginolyticus, Vibrio mimicus, Vibrio hollisae, Vibrio fluvialis, Vibrio metchnikovii, Vibrio damsela and Vibrio furnisii), Yersinia sp. (such as Yersinia enterocolitica, Yersinia pestis, and Yersinia pseudotuberculosis) and Xanthomonas maltophilia among others.

[0132] Following transformation of bacterial cells, recombinant host cells can be identified by one of several techniques. For example, expression of a gene encoding a marker or indicator gene with the temperature-sensitive gene, as described above, can be used to identify recombinant progeny. One specific non-limiting example of an indicator gene is the E. coli lacZ gene. Recombinant bacterial cells expressing beta-galactosidase can be selected using a chromogenic substrate for the enzyme. Once a recombinant bacterium has been identified, it can be selected and amplified for use in an immunogenic composition provided herein.

Methods of Making Temperature-Sensitive Bacterial Strains

[0133] The exemplary embodiments relate to methods for generating recombinant TS bacteria for use in stimulating an immune response to the TS bacteria. In one aspect, an exemplary TS immunogenic composition is suitable for immunoprophylaxis to prevent infectious disease or alternatively immunotherapy to treat an infectious disease. Such TS bacteria are generated by the introduction of one or more TS essential genes from psychrophilic bacteria into a target bacteria (such as a mesophilic bacteria that causes a disease that one wants to treat or prevent). Thus, the disclosure provides safe immunogenic compositions based on live genetically altered bacterial microorganisms. This was accomplished by taking advantage of essential genes from psychrophilic bacteria, by creating a fusion of the psychrophilic structural genes with the transcriptional and translational control elements of the "host" genome or by making fusions between the host gene and the psychrophilic gene. The exemplary embodiments provide live vaccines and immunogenic compositions that mimic a number of cold adapted viral vaccines and are unable to grow at the normal body temperature.

[0134] According to another exemplary embodiment it is suitable for mass production purposes, specifically of antigen; due to the TS strain's non-virulent nature the aerosols produced are rendered harmless and therefore, this methods and compositions disclosed herein can significantly reduce or eliminate human risk of infection.

[0135] Another aspect, the methods and compositions provided herein has value as a research diagnostic, or as a research/educational tool because it allows for experimentation to be performed on organisms that are normally highly pathogenic in their viable state without posing threats to the researcher.

[0136] The methods and compositions provided herein can be employed to stimulate the immune system with TS organisms with the intention of prevention or treatment of a disease.

[0137] A large number of psychrophilic bacteria contain TS genes, which can be used to generate TS mesophilic bacteria of the present disclosure. For example, one or more TS essential genes from psychrophilic bacteria can be introduced into a mesophilic bacterium (for example into a chromosome of a mesophilic bacteria), thereby generating a TS strain that can be used to induce an immune response in a subject into whom it is administered. Recombinant methods for introducing a nucleic acid into bacteria are routine in the art. Appropriate TS essential genes from psychrophilic bacteria can be identified using the methods provided herein. As shown in Tables 1 and 2, nine of the twenty one essential genes from the psychrophilic C. psychrerhraea were introduced into F. novicida and substituted for an essential host gene to generate TS strains of F. novicida ("Group I"). Group I genes generated a range of TS phenotypes with the restrictive temperatures of about 33.degree. C. to 44.degree. C. Thus, the genes of Group I can be used to generate TS strains of the present disclosure. Group II in Table 1 consists of the C. psychrerhraea genes that either functioned poorly or not at all in the exemplary bacterial strain F. novicida. F. novicida strains carrying an integrate with the psychrophilic essential gene resolve the integrate under counter selection pressure generated by the presence of sacB and sucrose. However, the resolved strains retain copies of both the psychrophilic gene and the F. novicida homologue and the strains are not TS ("Group III" in Table 1); indicating that these psychrophilic essential genes do not function in the mesophilic host. Alleles of the same gene from different psychrophilic bacteria can be selected to identify those that generate hybrid strains with the same TS properties when substituted into the chromosomes of mesophilic bacteria. The ligA alleles from three different psychrophilic bacteria generated three different TS phenotypes when substituted into the mesophile F. novicida. The pyrG.sub.Cp allele from C. psychrerhraea created a TS strain when substituted into F. novicida but the pyrG.sub.Sf allele from S. frigidimarina (SF) did not. PH refers to P. haloplanktis.

TABLE-US-00002 TABLE 1 Restrictive Gene Temp.(.degree. C.) symbol Source Product Function Group ligA.sub.Ph2 28/PH NAD-dependent DNA ligase I ligA.sub.Sf 33/SF NAD-dependent DNA ligase I ligA.sub.Cp 34/CP NAD-dependent DNA ligase I ligA.sub.Ph 36.8/PH NAD-dependent DNA ligase I hemC.sub.Cp 36.8/CP Porphobilinogen deaminase (Hydroxymethylbilane I synthase) pyrG.sub.Cp 37.2/CP CTP synthetase I dnaK.sub.Cp 38.2/CP Molecular chaperone DnaK I murG.sub.Cp 38.2/CP UDP-N-acetylglucosamine-N-acetylmuramyl- I (pentapeptide) pyrophosphoryl-undecaprenol N- acetylglucosamine transferase fmt.sub.Cp 41/CP Methionyl-tRNA formyltransferase I ftsZ.sub.Cp 42/CP Cell division protein I cmk.sub.Cp 43/CP Cytidylate kinase I tyrS.sub.Cp 44/CP Aminoacyl tRNA synthetases for Tyr I adk.sub.Cp >44/CP Adenylate kinase (proved resolution) II accD.sub.Cp >44/CP AcetylCoA carboxylase. The F. novicida integrate II containing accD.sub.Cp fails to resolve. murI.sub.Cp >44/CP Glutamate racemase. The CP version of MurI II appears to function poorly at all temperatures in F. novicida. pyrG.sub.Sf >44/SF CTP synthetase III trxA.sub.Cp >44/CP Thioredoxin III glmS.sub.Cp >44/CP Glucosamine-fructose-6-phosphate III aminotransferase argS.sub.Cp >44/CP Aminoacyl tRNA synthetases for Arg III cds.sub.Cp >44/CP phosphatidate cytidylyltransferase III mur.sub.CpC >44/CP UDP-N-acetylmuramate-alanine ligase III valS.sub.Cp >44/CP Aminoacyl tRNA synthetases for Val III proS.sub.Cp >44/CP Aminoacyl tRNA synthetases for Pro III metK.sub.Cp .ltoreq.44/CP S-adenosylmethionine synthetase III ftsW.sub.Cp >44/CP Cell division protein III

TABLE-US-00003 TABLE 2 Mutation rate in F. novicida to Restricted Challenge temperature resistance Gene Temp (.degree. C.) Temp (.degree. C.) Trial #1 Trial #2 Trial #3 ligA.sub.Sf 33 37 .sup. 4.0 .times. 10.sup.-6 .sup. 3.3 .times. 10.sup.-7 .sup. 9.7 .times. 10.sup.-7 ligA.sub.Cp 34 37 <1.2 .times. 10.sup.-10 <7.93 .times. 10.sup.-11 <1.1 .times. 10.sup.-10 ligA.sub.Ph 36.8 39 <1.5 .times. 10.sup.-10 <7.8 .times. 10.sup.-11 <6.2 .times. 10.sup.-11 dnaK.sub.Cp 38.2 39.5 <3.2 .times. 10.sup.-10 <1.9 .times. 10.sup.-10 <3.2 .times. 10.sup.-10 hemC.sub.Cp 36.8 43 <2.5 .times. 10.sup.-10 <3.6 .times. 10.sup.-11 <3.7 .times. 10.sup.-11 pyrG.sub.Cp 37.2 40 .sup. 8.5 .times. 10.sup.-8 1.0 .times. 10.sup.-9 .sup. 6.5 .times. 10.sup.-8 murG.sub.Cp 38.2 43 .sup. 2.6 .times. 10.sup.-4 3.0 .times. 10.sup.-5 .sup. 8.5 .times. 10.sup.-5 dnaK.sub.Sf 39 42 3.1 .times. 10.sup.-10 8.5 .times. 10.sup.-10

[0138] To make a TS bacterial pathogen, an essential gene from an Arctic psychrophile bacterium was substituted into the genome of a mesophilic pathogenic bacterium. The Arctic bacterial essential gene ligA.sub.Sf rendered F. novicida unable to grow at a temperature of 33.degree. C. or higher. Table 2 outlines the restrictive temperature properties imposed on F. novicida following the replacement of the mesophilic essential gene for its psychrophilic counterpart. Any of the genes in Table 2 may be introduced into a pathogenic bacteria strain to create live heat-sensitive vaccines. Exemplary pathogenic bacteria include but are not limited to: Mycobacterium sp., Haemophilus sp., Vibrio sp., Escherichia sp., Salmonella sp., Streptococcus sp., Burkholderia sp., Campylobacter sp., Neisseria sp., and Francisella sp.

[0139] The disclosure relates to genes derived from psychrophilic bacteria for use in the development of heat-sensitive immunogenic compositions, and methods of using these compositions to stimulate an immune response in a subject. In a specific example, the disclosure provides recombinant pathogens (such as Mycobacterium sp., Haemophilus sp., Vibrio sp., Escherichia sp., Salmonella sp., Streptococcus sp., Burkholderia sp., Campylobacter sp., Neisseria sp., and Francisella sp.) containing one or more heat-sensitive genes, exemplified by ligA, pyrG, hemC, ftsZ, cmk, dnaK, and fmt, that can be administered to a subject to provide a prophylactic immune response against diseases caused by such bacteria.

[0140] Methods of making a recombinant temperature-sensitive (TS) bacterial cell are provided. In one example the method includes introducing into the genome of a mesophilic bacterial strain a nucleic acid construct that includes a TS essential nucleic acid molecule from a psychrophilic bacteria (such as one that encodes a peptide that is operable at a temperature of about -10.degree. C. to about 30.degree. C., and/or inoperable at a temperature greater than about 30.degree. C., for example Colwellia sp., Psuedoalteromonas sp., or Shewanella sp) and one or more control sequences operably linked to the TS essential nucleic acid molecule. The temperature-sensitive essential polynucleotide renders the mesophilic bacteria operable at a temperature less than about 30.degree. C. and inoperable at a temperature greater than about 30.degree. C. In some examples, the temperature-sensitive essential nucleic acid molecule includes a nucleotide sequence having at least 80%, at least 90%, or at least 95% sequence identity to the nucleotide sequence shown in SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, or 27. In some examples the method also includes isolating the TS essential nucleic acid molecule from the genome of the psychrophilic bacterial strain. The method can also include constructing or generating the nucleic acid construct comprising the TS essential nucleic acid molecule and one or more control sequences operably linked to the TS essential nucleic acid molecule.

[0141] In some examples, the method further includes culturing the recombinant TS bacterial host cell at a temperature wherein the temperature-sensitive peptide is operable, whereby said recombinant TS bacterial host cell produces a plurality of peptides; increasing the culturing temperature to a temperature at which the temperature-sensitive peptide is inoperable; maintaining said culturing for a period of time sufficient to kill the recombinant TS bacterial host cell; and harvesting the killed recombinant TS bacterial host cells.

[0142] Methods of making a recombinant TS bacterial host cell can also include the following. A psychrophilic microbial genome is screened for detection of a TS essential polynucleotide that encodes a peptide that is inactivated at about greater than 30.degree. C.; isolating said TS essential polynucleotide; constructing a nucleic acid construct comprising the TS essential polynucleotide and one or more control sequences operably linked to the TS polynucleotide; inserting the nucleic acid construct into the genome of a selected mesophilic bacterial host cell (such as Francisella novicida) thereby functionally replacing the host cell's homologue of the TS essential polynucleotide whereby the TS peptide (and thus the bacteria in which it is expressed) is operable at a temperature less than about 30.degree. C., and inoperable at a temperature greater than about 30.degree. C. and mimics the temperature sensitivity of the original designated host bacterium. The resulting recombinant mesophilic bacterial host cell comprising the TS polynucleotide is cultured or grown at a temperature less than about 30.degree. C. to confirm the viability of the recombinant mesophilic bacterial host cell; further culturing the recombinant mesophilic bacterial host cell comprising the TS polynucleotide at a temperature greater than about 30.degree. C. to determine if the mesophilic bacterial host cell is killed. If the mesophilic bacterial host cell is killed, the nucleic acid construct is introduced into the genome of a selected destination mesophilic bacterial host cell (such as Salmonella sp. or Mycobacterium sp.) thereby functionally replacing the host cell's homologue of the temperature-sensitive essential polynucleotide whereby the temperature-sensitive peptide (and thus the bacteria in which it is expressed) is operable at a temperature less than about 30.degree. C., and inoperable at a temperature greater than about 30.degree. C. and mimics the temperature sensitivity of the original tester host bacterium.

[0143] In some examples, the mesophilic bacteria is one that is operable at a temperature selected from the range of about 10.degree. C. to about 50.degree. C. prior to introduction of the TS essential nucleic acid molecule from a psychrophilic bacteria. Examples of such mesophilic bacteria include strains of fermentative bacteria or bioremediation bacteria. Other exemplary bacteria are provided above.

[0144] In some examples, the TS essential nucleic acid molecule expresses a peptide during a culturing of the recombinant TS bacteria, such as a peptide having at least 80%, at least 90%, or at least 95% sequence identity to an amino acid sequence shown in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, or 28.

[0145] Recombinant TS bacteria generated by these methods, as well as compositions that include such bacteria, are also provided herein.

Temperature-Sensitive Bacterial Strain Compositions

[0146] Compositions are provided that include recombinant TS bacteria provided herein. In some examples, the compositions include more than one type of recombinant TS bacteria, such as 2, 3, 4 or 5 different recombinant TS bacteria. In some examples, the recombinant TS bacteria contain two or more different TS essential psychrophilic coding sequences (such as two or more of the Group I genes listed in Table 1, such as ligA and another Group I gene). In particular examples, the recombinant TS bacteria is a Francisella sp., Salmonella sp., or Mycobacterium sp. (other particular examples are provided above).

[0147] In some examples, such compositions are immunogenic, in that they can stimulate an immune response in a mammal. The compositions can include other components, such as pharmaceutically acceptable carriers (such as saline), adjuvants, preservatives, combinations thereof, and the like.

Methods of Stimulating an Immune Response Using Temperature-Sensitive Bacterial Strains

[0148] The TS recombinant bacteria disclosed herein can be used to generate an immune response in a subject. In some examples, the subject is infected with a bacterium, or as at risk of being infected with a bacterium (such as a health care worker), such as Mycobacterium tuberculosis. Thus, in several embodiments, the methods include administering to a subject a therapeutically effective amount of one or more of the TS recombinant bacteria disclosed herein in order to generate an immune response, such as, but not limited to, a protective immune response. For example, two or more different TS recombinant bacteria (such as those expressing different TS essential peptides from psychrophilic bacteria) can be used to generate an immune response in a subject. In some examples, the recombinant bacterium used to generate an immune response in a subject expresses two or more different temperature-sensitive essential peptides from a psychrophilic bacterium or the same temperature-sensitive essential peptide from two or more different psychrophilic bacteria.

[0149] The TS recombinant bacterium administered is selected based on the bacterial infection to be prevented or treated. For example, if the bacterial infection to be prevented or treated in the subject is tuberculosis, then the TS recombinant bacteria is Mycobacterium tuberculosis expressing at least one TS essential peptide from a psychrophilic bacterium. In another example, if the bacterial infection to be prevented or treated in the subject is tularemia, then the TS recombinant bacteria is F. tularensis expressing at least one TS essential peptide from a psychrophilic bacterium.

[0150] In exemplary applications, compositions are administered to a subject having in an amount sufficient to produce an immune response to the TS recombinant bacteria. These TS recombinant bacteria are of use to prevent a bacterial infection (such as Mycobacterium tuberculosis) prevent progression to disease in a subject having a latent bacterial infection, or to treat a disease resulting from the bacterial infection (such as tuberculosis). In several examples, administration of a therapeutically effective amount of a composition including the TS recombinant bacteria disclosed herein induces a sufficient immune response to decrease a symptom of a disease due to bacterial infection, to prevent the development of one or more symptoms of the disease associated with the infection, or to prevent infection with the bacteria.

[0151] In some examples, the compositions are of use in preventing a future bacterial infection. Thus, a therapeutically effective amount of the composition is administered to a subject at risk of becoming infected with a bacterium, such as Mycobacterium tuberculosis. For example the disclosed compositions can be used to prevent the development of tuberculosis, such as latent or active tuberculosis in the subject upon subsequent exposure to Mycobacterium tuberculosis. In one example, the compositions are administered to a subject with a latent Mycobacterium tuberculosis infection, and prevent the development of symptoms of tuberculosis. Thus the compositions are of use in treating a subject with latent tuberculosis, such that the subject does not develop active tuberculosis.

[0152] Amounts effective for these uses will depend upon the severity of the disease, the general state of the patient's health, and the robustness of the patient's immune system. In one example, a therapeutically effective amount of the compound is that which provides either subjective relief of a symptom(s) or an objectively identifiable improvement as noted by the clinician or other qualified observer. In other examples, a therapeutically effective amount is an amount sufficient to prevent an infection with the bacterium in a subject upon subsequent exposure of the subject to the bacterium. In additional examples, a therapeutically effective amount is an amount sufficient to prevent development of symptom in a subject infected with a bacterium.

[0153] The TS recombinant bacteria-containing composition can be administered by any means known to one of skill in the art either locally or systemically, such as by intramuscular injection, subcutaneous injection, intraperitoneal infection, intravenous injection, oral administration, nasal administration, transdermal administration or even anal administration. In one embodiment, administration is by oral, subcutaneous injection or intramuscular injection. To extend the time during which the TS recombinant bacteria is available to stimulate a response, the TS recombinant bacteria can be provided as an implant, an oily injection, or as a particulate system. The particulate system can be a microparticle, a microcapsule, a microsphere, a nanocapsule, or similar particle. A particulate carrier based on a synthetic polymer has been shown to act as an adjuvant to enhance the immune response, in addition to providing a controlled release. Aluminum salts can also be used as adjuvants to produce an immune response.

[0154] In one specific, non-limiting example, the TS recombinant bacteria are administered in a manner to direct the immune response to a cellular response (that is, a cytotoxic T lymphocyte (CTL) response), rather than a humoral (antibody) response.

[0155] Optionally, one or more cytokines, such as IL-2, IL-6, IL-12, RANTES, GM-CSF, TNF-.alpha., or IFN-.gamma., one or more growth factors, such as GM-CSF or G-CSF; one or more costimulatory molecules, such as ICAM-1, LFA-3, CD72, B7-1, B7-2, or other B7 related molecules; one or more molecules such as OX-40L or 41 BBL, or combinations of these molecules, can be used as biological adjuvants (see, for example, Salgaller et al., 1998, J. Surg. Oncol. 68(2):122-38; Lotze et al., 2000, Cancer J Sci. Am. 6(Suppl 1):S61-6; Cao et al., 1998, Stem Cells 16(Suppl 1):251-60; Kuiper et al., 2000, Adv. Exp. Med. Biol. 465:381-90). These molecules can be administered systemically (or locally) to the subject. In some examples, IL-2, RANTES, GM-CSF, TNF-.alpha., IFN-.gamma., G-CSF, LFA-3, CD72, B7-1, B7-2, B7-1 B7-2, OX-40L, 41 BBL and ICAM-1 are administered. In various embodiments, the nucleic acid encoding the biological adjuvant can be cloned into same vector as the psychrophilic TS essential peptide coding sequence, or the nucleic acid can be cloned into one or more separate vectors for co-administration into the bacteria.

[0156] A pharmaceutical composition including TS recombinant bacteria is thus provided. These compositions are of use to promote an immune response to a particular bacterium. In one embodiment, TS recombinant bacteria are mixed with an adjuvant containing two or more of a stabilizing detergent, a micelle-forming agent, and an oil. Suitable stabilizing detergents, micelle-forming agents, and oils are detailed in U.S. Pat. No. 5,585,103; U.S. Pat. No. 5,709,860; U.S. Pat. No. 5,270,202; and U.S. Pat. No. 5,695,770, all of which are incorporated by reference. A stabilizing detergent is any detergent that allows the components of the emulsion to remain as a stable emulsion. Such detergents include polysorbate, 80 (TWEEN) (Sorbitan-mono-9-octadecenoate-poly(oxy-1,2-ethanediyl; manufactured by ICI Americas, Wilmington, Del.), TWEEN 40.TM., TWEEN 20.TM., TWEEN 60.TM., ZWITTERGENT.TM. 3-12, TEEPOL HB7.TM., and SPAN 85.TM.. These detergents are usually provided in an amount of approximately 0.05 to 0.5%, such as at about 0.2%. A micelle forming agent is an agent which is able to stabilize the emulsion formed with the other components such that a micelle-like structure is formed. Such agents generally cause some irritation at the site of injection in order to recruit macrophages to enhance the cellular response. Examples of such agents include polymer surfactants described by BASF Wyandotte publications, e.g., Schmolka, J. Am. Oil. Chem. Soc. 54:110, 1977, and Hunter et al., J. Immunol 129:1244, 1981, PLURONIC.TM. L62LF, L101, and L64, PEG1000, and TETRONIC.TM. 1501, 150R1, 701, 901, 1301, and 130R1. The chemical structures of such agents are well known in the art. In one embodiment, the agent is chosen to have a hydrophile-lipophile balance (HLB) of between 0 and 2, as defined by Hunter and Bennett, J. Immun. 133:3167, 1984. The agent can be provided in an effective amount, for example between 0.5 and 10%, or in an amount between 1.25 and 5%.

[0157] In one example oil is included in the composition. Examples of such oils include squalene, Squalane, EICOSANE.TM., tetratetracontane, glycerol, and peanut oil or other vegetable oils. In one specific, non-limiting example, the oil is provided in an amount between 1 and 10%, or between 2.5 and 5%. The oil should be both biodegradable and biocompatible so that the body can break down the oil over time, and so that no adverse affects, such as granulomas, are evident upon use of the oil.

[0158] In one embodiment, the adjuvant in the composition is a mixture of stabilizing detergents, micelle-forming agent, and oil available under the name PROVAX.RTM. (IDEC Pharmaceuticals, San Diego, Calif.). An adjuvant can also be an immunostimulatory nucleic acid, such as a nucleic acid including a CpG motif, or a biological adjuvant (see above).

[0159] Controlled release parenteral formulations can be made as implants, oily injections, or as particulate systems. For a broad overview of protein delivery systems, see Banga, Therapeutic Peptides and Proteins: Formulation, Processing, and Delivery Systems, Technomic Publishing Company, Inc., Lancaster, Pa., 1995. Particulate systems include microspheres, microparticles, microcapsules, nanocapsules, nanospheres, and nanoparticles. Microcapsules contain the therapeutic protein as a central core. In microspheres, the therapeutic agent is dispersed throughout the particle. Particles, microspheres, and microcapsules smaller than about 1 .mu.m are generally referred to as nanoparticles, nanospheres, and nanocapsules, respectively. Capillaries have a diameter of approximately 5 .mu.m so that only nanoparticles are administered intravenously. Microparticles are typically around 100 .mu.m in diameter and are administered subcutaneously or intramuscularly (see Kreuter, Colloidal Drug Delivery Systems, J. Kreuter, ed., Marcel Dekker, Inc., New York, N.Y., pp. 219-342, 1994; Tice & Tabibi, Treatise on Controlled Drug Delivery, A. Kydonieus, ed., Marcel Dekker, Inc. New York, N.Y., pp. 315-339, 1992).

[0160] In particular examples, at least 10.sup.2 CFU of the TS bacteria disclosed herein are administered per dose, such as at least 10.sup.3 CFU, at least 10.sup.4 CFU, at least 10.sup.5 CFU, at least 10.sup.6 CFU, at least 10.sup.7 CFU, at least 10.sup.8 CFU, such as 10.sup.2 to 10.sup.8 CFU or 10.sup.4 to 10.sup.8 CFU. In particular examples, such dosages are administered intradermal or intranasal.

[0161] Single or multiple administrations of the compositions are administered depending on the dosage and frequency as required and tolerated by the subject. In one embodiment, the dosage is administered once as a bolus, but in another embodiment can be applied periodically until a therapeutic result is achieved. In one embodiment, the dose is sufficient to treat or ameliorate symptoms or signs of bacterial infection without producing unacceptable toxicity to the subject. In another embodiment, the dose is sufficient to prevent infection with a bacterium upon subsequent exposure to the bacterium (such as M. tuberculosis). In a further embodiment, the dose is sufficient to prevent a symptom of bacterial infection (e.g., tuberculosis) in a subject with a latent bacterial infection. Systemic or local administration can be utilized.

[0162] Thus the disclosure provides methods for producing an immune response to a bacterium in a subject. The method can include administering to the subject a therapeutically effective amount of a TS bacterium, wherein the temperature-sensitive bacterium expresses a psychrophilic TS essential protein or nucleic acid molecule provided herein (such as a nucleic acid coding sequence in a vector), thereby inducing an immune response to the bacterium. The method can further include administering other agents, such as an adjuvant or antimicrobial agent (such as an antibiotic). In some examples, the immune response is a protective immune response. The subject may have a bacterial infection, be at risk for acquiring a bacterial infection, or have a latent bacterial infection. Exemplary bacterial infections include infections with is M. tuberculosis, Salmonella or Francisella.

[0163] Methods of measuring an immune response following stimulation with a bacterial antigen, such as a cytokine response, are known in the art. In some examples, the method further includes measuring an immune response following administration of the therapeutic compositions provided herein. In one example, a cytokine response is increased following administration of the composition provided herein, such as an increase relative to the absence of administration of the composition. In one example, cytokine production increases by at least 20%, such as at least 40%, at least 50%, at least 75%, at least 90%, or at least 95% following administration of the composition, relative to the cytokine response in the absence of administration of the composition.

[0164] The disclosure is illustrated by the following non-limiting Examples.

Example 1

[0165] This example pertains to an exemplary method to create recombinant psychrophilic genes joined to flanking DNA of a mesophilic host.

[0166] FIG. 1a exemplifies the fusion PCR (also known as, "extension overlap PCR", "overlap PCR" or "splice overlap PCR") strategy used to incorporate the C. psychrerythraea essential gene (C2) into the wt F. novicida genome. The C. psychrerythraea genes were engineered with overlap PCR to contain the ribosome binding site (RBS) and the initial three codons and stop codon of the surrounding F. novicida genes (F1 and F3) to promote translation of the C2 gene at normal levels for F. novicida. The fusion PCR product was ligated to an erythromycin resistant sacB cassette (Em.sup.R-sacB) prior to its transformation into F. novicida. Em.sup.R colonies containing the fusion PCR product were grown in the presence of sucrose and colonies were screened for the loss of Em.sup.R, the F. novicida essential gene (F2), and the presence of C2.

[0167] FIG. 1b illustrates the introduction of the psychrophilic gene fusion construct into the target organism's chromosome via a single crossover event. Furthermore, it illustrates that the excision can be enhanced using the counter-selectable sacB marker. For genes that are not a part of a multi-cistronic operon the upstream pathogen genomic region was fused to the psychrophilic structural gene from codon 4 through to the stop codon. A similar approach was used when substituting a psychrophilic allele into the middle of an operon. However, as one skilled in the art can appreciate, depending on the nature of the operon, some of the codons at the C terminus of the host homologue remained if they were important for translation of the downstream cistron.

Example 2

[0168] This example pertains to an exemplary method to insert the psychrophilic allele into the mesophilic bacteria.

[0169] FIG. 2a illustrates the substitution region for the psychrophilic ligA gene, corresponding to SEQ ID NO: 1. Additionally, it illustrates its incorporation into the wt F. novicida chromosome. FIGS. 2b-e illustrate the integration point for the psychrophilic ligA genes of C. psychrerythraea, S. frigidmarina, P. haloplanktis I, and P. haloplanktis 2 respectively. The first three codons for F. novicida were retained in order to maximize the potential for ligA expression levels. FIG. 2a-e illustrates that in most cases the integration and excision events result in a simple substitution of the psychrophilic gene for the mesophilic host homologue. However, the integration and excision events may also lead to the formation of a hybrid gene as illustrated in FIG. 1b.

Example 3

[0170] This example pertains to an exemplary method to determine the maximal growth temperature of each bacterial strain and to show its growth properties at restrictive temperatures.

[0171] Each bacterial strain was tested on agar plates placed in a highly stable (.+-.1.degree. C.) incubator; the restrictive temperatures were defined as the lowest temperatures that did not permit the formation of isolated colonies on an agar streak plate. The growth properties at different temperatures of four different transgenic strains of F. novicida carrying psychrophilic ligA substitutions and the growth properties of wt F. novicida are shown in FIGS. 3-6. The psychrophilic ligA genes are ligA.sub.Cp, ligA.sub.Sf, ligA.sub.Ph and ligA.sub.Ph2, as represented by SEQ ID NOs: 1, 7, 3, and 5 respectively. In the first panels in FIGS. 3-6, growth is shown at a permissive temperature, i.e., a temperature below the restrictive temperature. In subsequent panels, growth of both the transgenic and wt strains is shown before and after a shift to the restrictive temperature or higher.

[0172] Extended growth curves of both the F. novicida transgenic and wt strains are shown as inserts in select panels of FIG. 3-6. These curves were generated by taking a fully grown culture, diluting it, and monitoring its growth in fresh growth media. More specifically, the wt and TS transgenic F. novicida cultures were grown at restrictive temperatures until they reached stationary phase at which point, they were diluted and re-incubated for growth at the restrictive temperatures again. The additional growth curves demonstrate that the cessation of growth exhibited by the transgenic strain is a real phenomenon, as opposed to a temporary adjustment to the temperature shift.

Example 4

[0173] This pertains to an exemplary method used to determine the frequency of mutations that permit bacterial growth at temperatures higher than the restrictive temperature of TS F. novicida transgenic strains.

[0174] Cultures were grown to late logarithmic phase at the permissive growth temperature, they were then diluted in a series of 10.sup.9-10.sup.5 cells/plate on agar and incubated at temperatures about 3.degree. C. above the restrictive temperature, as well as at temperatures about 3.degree. C. below the restrictive temperature. From this dilution series the rate of mutations that allow for growth at higher temperatures were calculated, Table 2 exemplifies the frequency of mutation to temperature resistance in F. novicida. Remarkably, some of the psychrophilic genes are unable to mutate to forms that will function above their restrictive temperature. One skilled in the art may hypothesize that the millions of years required to adapt to a cold climate renders some of the psychrophilic essential gene products unable to adopt simple changes allowing them to function in temperatures typical to their mesophilic counterparts. These include ligACp, ligA.sub.Ph, hemC.sub.Cp, dnaK.sub.Cp, fmt.sub.Cp, and dnaK.sub.Sf.

Example 5

[0175] This example pertains to an exemplary method to determine the duration of viability of the recombinant TS bacterial strains at the restrictive temperature.

[0176] An exemplary culture of a TS transgenic strain that has a maximal growth temperature of about 33.degree. C., was grown at about 30.degree. C. and a sample of the culture was incubated at about 37.degree. C. to mimic the typical temperature of human body core tissues. Samples were taken at varying time points between 0-24 hours, and the individual samples were re-diluted, plated on to growth media, and then cultured at about 30.degree. C. to determine the death rate above the restrictive temperature. As a control, the same experiment was carried out with the wt bacterium.

[0177] The persistence of F. tularensis strains carrying the psychrophilic essential genes within their macrophages was determined. Transgenic strains were cultured at about 30.degree. C. and used to infect macrophages at about 37.degree. C. in 24 well tissue culture plates using standard methods known to those skilled in these arts. For several days monitoring the infected macrophages a subset of cells were lysed and the bacteria were plated onto agar medium and incubated at about 30.degree. C. The data generated in these experiments showed the lifespan of transgene strains during an infection with macrophages at a restrictive temperature and helped to predict the persistence of TS strains during infections.

[0178] This example can be extrapolated to provide an in vitro correlation for what can occur in a mammal. A TS transgenic strain will grow in a cool part of the body such as the skin. Replication of the strain at and about this cool site will constantly cause the TS transgenic strain progeny to be moved into the draining lymph nodes. Depending on the locations of the lymph nodes and the restrictive temperature of the TS transgenic strain, the TS progeny will die over a period of several hours. The presence of the TS transgenic strain both in its live and dead states will stimulate an immune response.

Example 6

[0179] This pertains to an exemplary method to determine the ability of a TS essential gene from a psychrophile to impart its TS phenotype on a mesophilic bacterium. Specifically, it provides a method for transferring a psychrophilic essential gene encoding a TS product to a variety of bacteria as well as the transfer of the TS essential gene between mesophiles.

[0180] Several psychrophilic essential genes were substituted into the genome of the mesophilic bacterium F. novicida. Multiple approaches can be used to inserting a psychrophilic essential gene into a given bacterium in place of its mesophilic homologue. Furthermore, it can be appreciated that one can substitute a given psychrophilic essential gene into many different bacteria. The following three methods exemplify various ways of substituting ligA.sub.Cp into three different bacteria. A common approach to gene substitution is illustrated in FIG. 1b, and involves the integration of a foreign gene in a bacterium that is in close proximity to the hosts' homologous gene through PCR. Following integration, a counter selective marker, such as sacB, can be used to help identify the results of the integration and excision events. Specifically this approach was used to replace the F. novicida ligA gene with the psychrophilic ligA.sub.Cp gene.

[0181] An alternate approach was used to replace the S. enterica ligA. The strain of S. enterica used had a bacteriophage Mu insertion in the chromosomal copy of ligA (Park et al., 1989. J. Bacteriol. 171: 2173-80.). A wt copy of the bacteriophage T4 DNA ligase was carried on the ampicillin resistant plasmid, pBR313. The ligA.sub.Cp gene was introduced on the compatible chloramphenicol resistant plasmid, pSUP2716, and the recombinant S. enterica strain was cultured in the absence of ampicillin and the presence of chloramphenicol. These growth conditions allow the pBR313:T4 DNA ligase recombinant plasmid to be lost. S. enterica strains that had lost the plasmid encoding the T4 DNA ligase, rendering them ampicillin sensitive, were dependant on the ligA.sub.Cp for viability and were TS.

[0182] Another alternate approach can be employed when introducing a psychrophilic essential gene into Gram-positive bacteria. The method of insertion of ligA.sub.Cp into M. smegmatis described herein exemplifies this method. A version of ligA.sub.Cp (SEQ ID NOS: 17 and 18) designed with optimal codons was cloned into the mycobacterial plasmid, pSM1; this a precautionary step due to the low G+C content in the ligA.sub.Cp gene when compared to that of the M. smegmatis and M. tuberculosis ligA genes. The recombinant pSMT3:ligA.sub.Cp was electroporated into M. smegmatis. Subsequently a large fragment of the M. smegmatis ligA gene was deleted resulting in a strain dependent on ligA.sub.Cp for viability. This strain was TS at about 34.degree. C. This temperature is reflective of the TS nature of the F. novicida transgene strain encoding ligA.sub.Cp.

[0183] This example illustrates the use of a mesophilic tester strain which contains a psychrophilic essential gene to predict the TS phenotype when said psychrophilic essential gene is used to construct a transgene strain of another mesophilic bacterium. In this example, the tester strain was F. novicida. The substitution of ligA.sub.Cp for the F. novicida ligA homologue showed that ligA.sub.Cp functioned in the mesophile and imparted a TS phenotype having a restrictive temperature of about 34.degree. C. The phenotype of the transgenic strain of F. novicida carrying ligA.sub.Cp predicted that substitution of the ligA.sub.Cp gene into other mesophiles (destination hosts) would results in viable bacteria that had a restrictive temperature of 34.degree. C. The phenotype of the Salmonella and Mycobacteria transgene strains carrying ligA.sub.Cp showed that the inter-genus transfer of a TS psychrophilic essential gene could result in a phenotype seen in the tester strain.

Example 7

[0184] This example describes an exemplary method to combine psychrophilic genes or fragments thereof (as represented by SEQ ID NO 1-24) or mutant essential psychrophilic genes to create gene products with desired TS properties.

[0185] Combining about 30%, at the 5'-end, of the novicida pyrG gene with about 2/3 of the 3'-end of the C. psychrerythraea pyrG gene (pyrG.sub.Cp) in the region of codon 157-159 created a recombinant gene that was TS at 37.degree. C. The F. novicida and C. psychrerythraea pyrG genes are identical at codons 157-159 inclusive. Additionally, the single point mutation at amino acid residue 149 in ligA.sub.Ph from an asparagine ("N") residue to a lysine ("K") residue changes the restrictive temperature from 37.degree. C. to 28.degree. C.

[0186] This approach could be applied to different psychrophilic genes by using either in vitro or in vivo recombinant technologies to combine two or more homologues of the same gene.

Example 8

[0187] This example pertains to an exemplary method to determine the distribution of a transgenic strain from a site of infection in a mammal.

[0188] F. novicida (a.k.a. F. tularensis subspecies novicida) carrying a psychrophilic transgene was used. One skilled in the art will appreciate that similar methods can be used to generate and examine TS strains of F. tularensis. F. novicida is highly virulent in mice. The infection of mice by F. novicida serves as a model for the infection of larger mammals with F. tularensis. Most strains of F. tularensis are highly virulent in most mammals.

[0189] The distribution of F. novicida transgenic strains from the site of infection was assessed either by injecting the recombinant strains through the skin, or by introduction via the nose, and measuring the amount of viable F. novicida cells in internal organs such as the lung, liver and spleen about three to ten days after the inoculation. It was found that TS F. novicida transgenic strains did not spread significantly from the site of inoculation. A direct correlation between the inactivation temperature of the psychrophilic essential gene and the level of distribution throughout the system was observed; the dissemination of TS F. novicida strains is Lewis Rats is outlined in Table 3.

TABLE-US-00004 TABLE 3 F. novicida Restrictive CFU/Tail strain Temp. (.degree. C.) injection site CFU/Spleen wt.sup.- 45 9.7 .times. 10.sup.3/7.1 .times. 10.sup.3 3.7 .times. 10.sup.6/2.2 .times. 10.sup.6 ligA.sub.Cp 34 5 .times. 10.sup.2/3 .times. 10.sup.2 0/0 ligA.sub.Ph 36.8 3 .times. 10.sup.2/2 .times. 10.sup.2 0/0 dnaK.sub.Cp 38.2 1.5 .times. 10.sup.4/7.6 .times. 10.sup.4 5.0 .times. 10.sup.2/0 .sup. fmt.sub.Cp 41 5.2 .times. 10.sup.3/2.4 .times. 10.sup.3 3.5 .times. 10.sup.5/2.1 .times. 10.sup.5

[0190] As a further example, one of the psychrophilic essential genes (ligA.sub.Cp) was substituted into the genome of M. tuberculosis to create a transgenic strain. Some psychrophilic essential genes originate in bacteria with DNA with low G+C content. Thus the genes were optimized with codons for M. tuberculosis prior to inserting the psychrophilic genes into the pathogenic bacteria (SEQ ID NOS: 17 and 18 provide the optimized sequences). Codon optimization is a method well known to those skilled in these arts and can be accomplished using freely available bioinformatic tools. The codon optimized psychrophilic essential genes were inserted into M. tuberculosis by methods that are well described in Examples 1 and 2. M. tuberculosis, like M. smegmatis, are Gram-positive bacteria.

[0191] Another exemplary method pertains to an exemplary method the distribution of a Gram-negative pathogenic strain. A psychrophilic essential gene was introduced into S. enterica. Upon introduction of the ligA.sub.Cp psychrophilic essential gene into S. enterica, the result was a transgenic strain that was unable to grow at 37.degree. C., as illustrated in FIG. 8. Furthermore, this strain was unable to disperse from the site of inoculation in infected mice, as evidenced by the inability of the strain to migrate to the lungs, liver or spleen.

Example 9

[0192] This example pertains to an exemplary method to determine the level of protective immune response generated from the inoculation of a mammal with a TS transgenic bacterial strain. Methods of inoculation are known in the art, and can include i.v., i.m., s.c., or i.p injection, as well as inhalation, oral, and transdermal routes of delivery. One skilled in the art will appreciate that methods similar to those described in this example can be used to test any transgenic TS bacterial strain that includes one or more psychrophilic essential nucleic acid sequences.

[0193] Inoculation of mice with a TS F. novicida transgenic strain (Fn-ligA.sub.Ph, Fn-ligA.sub.Cp or Fn-dnaK.sub.Cp) caused the cells of their immune systems to be stimulated (as measured by reduced bacterial organ burdens) resulting in protection against infection with wt F. novicida (FIGS. 10a-d). Mice were initially inoculated with the TS transgenic strain and then challenged with an inoculation three weeks later of the wt F. novicida strain. This resulted in reduced growth in the livers and spleens of mice infected with the wt strains as compared to mice that had not been inoculated with recombinant F. novicida. Furthermore, decreases in the morbidity and mortalities were observed among the inoculated group of mice resulting in the conclusion that immune protection was achieved.

[0194] Similarly, mice vaccinated with M. tuberculosis and S. enterica transgenic strains (ligAPh) were shown to be more resistant to infections with the wt pathogens than were un-vaccinated mice.

Example 10

[0195] This example pertains to an exemplary method of discovering novel psychrophilic essential genes.

[0196] Psychrophilic bacterium can be isolated from a cold environment, for example ocean waters near the Earth's poles. Essential genes can be identified by using degenerate PCR or other standard techniques to find highly conserved genes, such as bacterial essential genes. Once these genes have been identified, they can be substituted into the genome of a mesophile using the methods provided herein or known in the art, displacing the host homologue of the gene. The resulting strain can then be tested for temperature sensitivity as described herein.

Example 11

[0197] This example pertains to an exemplary method of using TS transgene strains in drug discovery research. Although a TS F. tularensis strain is exemplified, one skilled in the art will appreciate that similar methods can be used for other TS strains generated using the methods provided herein.

[0198] A TS transgenic strain of F. tularensis (ligA.sub.Ph) that was inoperable above about 37.degree. C. was used to infect cell line macrophages grown in microtiter plates at 34.degree. C. A library of antimicrobial drug candidates was introduced to individual wells that contained the infected macrophages, and the effect of the drug candidates on the killing of F. tularensis was measured by lysing the macrophages at various time points and determining the number of viable TS transgenic F. tularensis by plating on agar plates. Wt F. tularensis is extremely infectious and causes a deadly disease. The use of the TS transgenic F. tularensis strain allowed one to use greatly relaxed biological containment conditions because the strain is incapable of causing disease in humans.

Example 12

[0199] This example pertains to an exemplary method of generating and using TS strains of Mycobacterium containing temperature-sensitive essential nucleic acid molecules from psychrophilic bacteria to develop an immunogenic composition, which for example can be used to stimulate an immune response in a mammal, to protect or treat an M. tuberculosis infection in the mammal.

[0200] The ligA.sub.Ph and pryG.sub.Cp genes will separately be introduced into M. tuberculosis H37Rv using an integration/excision approach. The counter-selectable marker sacB will be used to enhance the generation of excision events that can be detected. C57BL/6 mice will be vaccinated by introducing 10,000 bacteria subcutaneously at the base of the tail. Negative controls mice injected with PBS and positive control mice injected with the BCG strain will processed at the same time. The mice will be rested for 30 days. Following this period all of the mice will be exposed to an aerosol of M. tuberculosis H37Rv that deposits 150 bacteria into the lungs. At weeks 0, 4, 8 16 and 32 following exposure to M. tuberculosis H37Rv, the mice will be euthanized and the number of M. tuberculosis H37Rv in the lungs and spleens determined. If the transgenes TS M. tuberculosis strains are successful at inducing a protective immune response, the number of bacteria in the mice organs will be less than that of the negative control. Subsequent experiments will be performed in a guinea pig model of tuberculosis.

[0201] In view of the many possible embodiments to which the principles of our invention may be applied, it should be recognized that illustrated embodiments are only examples of the invention and should not be considered a limitation on the scope of the invention. Rather, the scope of the invention is defined by the following claims. We therefore claim as our invention all that comes within the scope and spirit of these claims.

Sequence CWU 1

1

2812070DNAArtificial SequenceligACp hybrid 1atg act cca gtc gaa aag aaa att agc caa ctg caa cag cag ctt aat 48Met Thr Pro Val Glu Lys Lys Ile Ser Gln Leu Gln Gln Gln Leu Asn 1 5 10 15 caa tat aat cat gaa tat tat gta tta gac caa cct agt gtg cct gat 96Gln Tyr Asn His Glu Tyr Tyr Val Leu Asp Gln Pro Ser Val Pro Asp 20 25 30 gca gaa tat gac cga tta atg aca gca tta atc gat tta gaa aag act 144Ala Glu Tyr Asp Arg Leu Met Thr Ala Leu Ile Asp Leu Glu Lys Thr 35 40 45 aat cct gag ctt aag act att gac tca cct agc caa aaa gtt ggc ggt 192Asn Pro Glu Leu Lys Thr Ile Asp Ser Pro Ser Gln Lys Val Gly Gly 50 55 60 cag gca tta aaa tct ttc act caa gta act cat cag ctg ccg atg ctt 240Gln Ala Leu Lys Ser Phe Thr Gln Val Thr His Gln Leu Pro Met Leu 65 70 75 80 tct ctt gat aat gtt ttt tct tta gat gat ttt cac gca ttt gtt aaa 288Ser Leu Asp Asn Val Phe Ser Leu Asp Asp Phe His Ala Phe Val Lys 85 90 95 cgc gta aaa gat agg tta aat gac aat caa gcg ata gtc ttt tgt gcc 336Arg Val Lys Asp Arg Leu Asn Asp Asn Gln Ala Ile Val Phe Cys Ala 100 105 110 gag cct aaa tta gac ggt tta gca gtg agt tta cgt tat gag cac ggg 384Glu Pro Lys Leu Asp Gly Leu Ala Val Ser Leu Arg Tyr Glu His Gly 115 120 125 cag tta ata caa gcg gct aca cgt ggc gat ggt agt gta ggg gag aat 432Gln Leu Ile Gln Ala Ala Thr Arg Gly Asp Gly Ser Val Gly Glu Asn 130 135 140 att acg act aac att cgt aca ata aaa tct att ccg ctt aag tta atg 480Ile Thr Thr Asn Ile Arg Thr Ile Lys Ser Ile Pro Leu Lys Leu Met 145 150 155 160 ggc aca cca ggt aaa gat ttt cct gat atc gtt gaa gtc cgc ggt gaa 528Gly Thr Pro Gly Lys Asp Phe Pro Asp Ile Val Glu Val Arg Gly Glu 165 170 175 gtt ttt atg cct aag gca agt ttt gac gca tta aat aca tkg gct aaa 576Val Phe Met Pro Lys Ala Ser Phe Asp Ala Leu Asn Thr Xaa Ala Lys 180 185 190 aaa cgt ggc gag aaa ggt ttt gca aat cca cgt aat gca gcg gcg gga 624Lys Arg Gly Glu Lys Gly Phe Ala Asn Pro Arg Asn Ala Ala Ala Gly 195 200 205 agt tta cga caa ctt gat tct aaa atc acc gct aaa cgt aat tta gct 672Ser Leu Arg Gln Leu Asp Ser Lys Ile Thr Ala Lys Arg Asn Leu Ala 210 215 220 ttt tac gct tat agc ctt gga ttt gta ggg aaa ctg tct gat gga ggc 720Phe Tyr Ala Tyr Ser Leu Gly Phe Val Gly Lys Leu Ser Asp Gly Gly 225 230 235 240 gct gaa agt acc gat tta acc aat gac ttt ttt gct aac tct cat cat 768Ala Glu Ser Thr Asp Leu Thr Asn Asp Phe Phe Ala Asn Ser His His 245 250 255 gaa aga cta tgt cag ctt aaa agg ttg ggt ttg cct atg tgt cca gaa 816Glu Arg Leu Cys Gln Leu Lys Arg Leu Gly Leu Pro Met Cys Pro Glu 260 265 270 gta cgc tta ctt gaa agt gag caa gcc tgt gat gcg ttt tat caa gat 864Val Arg Leu Leu Glu Ser Glu Gln Ala Cys Asp Ala Phe Tyr Gln Asp 275 280 285 atc tta gca aag cgt agt gcc ttg agt tat gaa att gat ggc act gta 912Ile Leu Ala Lys Arg Ser Ala Leu Ser Tyr Glu Ile Asp Gly Thr Val 290 295 300 tta aaa gtt gat gaa atc tct ttg cag aaa cgt tta ggg ttt gtc gca 960Leu Lys Val Asp Glu Ile Ser Leu Gln Lys Arg Leu Gly Phe Val Ala 305 310 315 320 cgt gcc cca cgt tgg gct att gct tat aaa ttc cct gcg gaa gaa gaa 1008Arg Ala Pro Arg Trp Ala Ile Ala Tyr Lys Phe Pro Ala Glu Glu Glu 325 330 335 tta acc tgt gtt gaa gat gtc gag ttt caa gta ggg cgt acc ggc gcg 1056Leu Thr Cys Val Glu Asp Val Glu Phe Gln Val Gly Arg Thr Gly Ala 340 345 350 att act ccc gta gca cgt ttg aaa ccg gta ttt gtt ggt ggc gta aca 1104Ile Thr Pro Val Ala Arg Leu Lys Pro Val Phe Val Gly Gly Val Thr 355 360 365 gtt tct aat gcc aca tta cat aac caa gat gaa ata acc cga tta ggg 1152Val Ser Asn Ala Thr Leu His Asn Gln Asp Glu Ile Thr Arg Leu Gly 370 375 380 ctg aaa gtg aat gat ttc gtg gtt atc cgc cgt gcc ggt gat gtt att 1200Leu Lys Val Asn Asp Phe Val Val Ile Arg Arg Ala Gly Asp Val Ile 385 390 395 400 cct caa att gtt agc gta gta ctt gat aaa cga cca gat aat gcc gtc 1248Pro Gln Ile Val Ser Val Val Leu Asp Lys Arg Pro Asp Asn Ala Val 405 410 415 gat ata gtc ttt cct acc agt tgc cct gtt tgt gac tct gca gtg gct 1296Asp Ile Val Phe Pro Thr Ser Cys Pro Val Cys Asp Ser Ala Val Ala 420 425 430 aaa cct gaa ggt gaa gcc gta ctg aga tgt acc gcc gga ctt ttc tgt 1344Lys Pro Glu Gly Glu Ala Val Leu Arg Cys Thr Ala Gly Leu Phe Cys 435 440 445 gcg gcg caa aga aaa gaa gct att aaa cat ttt gct tct cga aaa gca 1392Ala Ala Gln Arg Lys Glu Ala Ile Lys His Phe Ala Ser Arg Lys Ala 450 455 460 cat gat gtt gat ggt tta ggt gac aaa cta gta gag caa ctt gta gat 1440His Asp Val Asp Gly Leu Gly Asp Lys Leu Val Glu Gln Leu Val Asp 465 470 475 480 gaa aag tta att aat acg cyc gct gat tta ttc aaa tta acc gaa ata 1488Glu Lys Leu Ile Asn Thr Xaa Ala Asp Leu Phe Lys Leu Thr Glu Ile 485 490 495 caa gtt agt act ata gat cgt atg ggt aaa aaa tca gcg acc aat tta 1536Gln Val Ser Thr Ile Asp Arg Met Gly Lys Lys Ser Ala Thr Asn Leu 500 505 510 att aat gga ctt gag cag gct aaa agt acc aca cta gca aaa ttt att 1584Ile Asn Gly Leu Glu Gln Ala Lys Ser Thr Thr Leu Ala Lys Phe Ile 515 520 525 tat ggt ctg ggc ata cgc gaa gtc ggt gaa gca act gct gct aat cta 1632Tyr Gly Leu Gly Ile Arg Glu Val Gly Glu Ala Thr Ala Ala Asn Leu 530 535 540 gca aat cat ttt tat acc tta gcg gca att gaa agt gct tct ctt gaa 1680Ala Asn His Phe Tyr Thr Leu Ala Ala Ile Glu Ser Ala Ser Leu Glu 545 550 555 560 gac tta caa aat gtt tca gat gtt ggc gaa gtc gtt gcc aaa aat att 1728Asp Leu Gln Asn Val Ser Asp Val Gly Glu Val Val Ala Lys Asn Ile 565 570 575 att aat ttc ttt aaa gaa gag cat aac tta gcg atc gtt tct gga cta 1776Ile Asn Phe Phe Lys Glu Glu His Asn Leu Ala Ile Val Ser Gly Leu 580 585 590 agt gaa gta atg cac tgg cca act att gaa ata aag tca gct gag gag 1824Ser Glu Val Met His Trp Pro Thr Ile Glu Ile Lys Ser Ala Glu Glu 595 600 605 tta ccg ctt gca gag cag ata ttt gtt tta aca ggc aca tta acc caa 1872Leu Pro Leu Ala Glu Gln Ile Phe Val Leu Thr Gly Thr Leu Thr Gln 610 615 620 atg gga aga act gaa gct aaa aca gcc tta cag tcc ttg gga gct aaa 1920Met Gly Arg Thr Glu Ala Lys Thr Ala Leu Gln Ser Leu Gly Ala Lys 625 630 635 640 gta tca ggt agt gtc tcg aag aat aca cac ttc gtt gtt gca ggt gat 1968Val Ser Gly Ser Val Ser Lys Asn Thr His Phe Val Val Ala Gly Asp 645 650 655 aaa gcg gga tct aaa ctg act aag gct cag gat tta ggt atc tca gtg 2016Lys Ala Gly Ser Lys Leu Thr Lys Ala Gln Asp Leu Gly Ile Ser Val 660 665 670 ctt acc gaa gat ggg tta gta gcg tta ctt gcc gaa cat ggc ata act 2064Leu Thr Glu Asp Gly Leu Val Ala Leu Leu Ala Glu His Gly Ile Thr 675 680 685 att tga 2070Ile 2689PRTArtificial Sequencemisc_feature(190)..(190)The 'Xaa' at location 190 stands for Trp, or Leu. 2Met Thr Pro Val Glu Lys Lys Ile Ser Gln Leu Gln Gln Gln Leu Asn 1 5 10 15 Gln Tyr Asn His Glu Tyr Tyr Val Leu Asp Gln Pro Ser Val Pro Asp 20 25 30 Ala Glu Tyr Asp Arg Leu Met Thr Ala Leu Ile Asp Leu Glu Lys Thr 35 40 45 Asn Pro Glu Leu Lys Thr Ile Asp Ser Pro Ser Gln Lys Val Gly Gly 50 55 60 Gln Ala Leu Lys Ser Phe Thr Gln Val Thr His Gln Leu Pro Met Leu 65 70 75 80 Ser Leu Asp Asn Val Phe Ser Leu Asp Asp Phe His Ala Phe Val Lys 85 90 95 Arg Val Lys Asp Arg Leu Asn Asp Asn Gln Ala Ile Val Phe Cys Ala 100 105 110 Glu Pro Lys Leu Asp Gly Leu Ala Val Ser Leu Arg Tyr Glu His Gly 115 120 125 Gln Leu Ile Gln Ala Ala Thr Arg Gly Asp Gly Ser Val Gly Glu Asn 130 135 140 Ile Thr Thr Asn Ile Arg Thr Ile Lys Ser Ile Pro Leu Lys Leu Met 145 150 155 160 Gly Thr Pro Gly Lys Asp Phe Pro Asp Ile Val Glu Val Arg Gly Glu 165 170 175 Val Phe Met Pro Lys Ala Ser Phe Asp Ala Leu Asn Thr Xaa Ala Lys 180 185 190 Lys Arg Gly Glu Lys Gly Phe Ala Asn Pro Arg Asn Ala Ala Ala Gly 195 200 205 Ser Leu Arg Gln Leu Asp Ser Lys Ile Thr Ala Lys Arg Asn Leu Ala 210 215 220 Phe Tyr Ala Tyr Ser Leu Gly Phe Val Gly Lys Leu Ser Asp Gly Gly 225 230 235 240 Ala Glu Ser Thr Asp Leu Thr Asn Asp Phe Phe Ala Asn Ser His His 245 250 255 Glu Arg Leu Cys Gln Leu Lys Arg Leu Gly Leu Pro Met Cys Pro Glu 260 265 270 Val Arg Leu Leu Glu Ser Glu Gln Ala Cys Asp Ala Phe Tyr Gln Asp 275 280 285 Ile Leu Ala Lys Arg Ser Ala Leu Ser Tyr Glu Ile Asp Gly Thr Val 290 295 300 Leu Lys Val Asp Glu Ile Ser Leu Gln Lys Arg Leu Gly Phe Val Ala 305 310 315 320 Arg Ala Pro Arg Trp Ala Ile Ala Tyr Lys Phe Pro Ala Glu Glu Glu 325 330 335 Leu Thr Cys Val Glu Asp Val Glu Phe Gln Val Gly Arg Thr Gly Ala 340 345 350 Ile Thr Pro Val Ala Arg Leu Lys Pro Val Phe Val Gly Gly Val Thr 355 360 365 Val Ser Asn Ala Thr Leu His Asn Gln Asp Glu Ile Thr Arg Leu Gly 370 375 380 Leu Lys Val Asn Asp Phe Val Val Ile Arg Arg Ala Gly Asp Val Ile 385 390 395 400 Pro Gln Ile Val Ser Val Val Leu Asp Lys Arg Pro Asp Asn Ala Val 405 410 415 Asp Ile Val Phe Pro Thr Ser Cys Pro Val Cys Asp Ser Ala Val Ala 420 425 430 Lys Pro Glu Gly Glu Ala Val Leu Arg Cys Thr Ala Gly Leu Phe Cys 435 440 445 Ala Ala Gln Arg Lys Glu Ala Ile Lys His Phe Ala Ser Arg Lys Ala 450 455 460 His Asp Val Asp Gly Leu Gly Asp Lys Leu Val Glu Gln Leu Val Asp 465 470 475 480 Glu Lys Leu Ile Asn Thr Xaa Ala Asp Leu Phe Lys Leu Thr Glu Ile 485 490 495 Gln Val Ser Thr Ile Asp Arg Met Gly Lys Lys Ser Ala Thr Asn Leu 500 505 510 Ile Asn Gly Leu Glu Gln Ala Lys Ser Thr Thr Leu Ala Lys Phe Ile 515 520 525 Tyr Gly Leu Gly Ile Arg Glu Val Gly Glu Ala Thr Ala Ala Asn Leu 530 535 540 Ala Asn His Phe Tyr Thr Leu Ala Ala Ile Glu Ser Ala Ser Leu Glu 545 550 555 560 Asp Leu Gln Asn Val Ser Asp Val Gly Glu Val Val Ala Lys Asn Ile 565 570 575 Ile Asn Phe Phe Lys Glu Glu His Asn Leu Ala Ile Val Ser Gly Leu 580 585 590 Ser Glu Val Met His Trp Pro Thr Ile Glu Ile Lys Ser Ala Glu Glu 595 600 605 Leu Pro Leu Ala Glu Gln Ile Phe Val Leu Thr Gly Thr Leu Thr Gln 610 615 620 Met Gly Arg Thr Glu Ala Lys Thr Ala Leu Gln Ser Leu Gly Ala Lys 625 630 635 640 Val Ser Gly Ser Val Ser Lys Asn Thr His Phe Val Val Ala Gly Asp 645 650 655 Lys Ala Gly Ser Lys Leu Thr Lys Ala Gln Asp Leu Gly Ile Ser Val 660 665 670 Leu Thr Glu Asp Gly Leu Val Ala Leu Leu Ala Glu His Gly Ile Thr 675 680 685 Ile 32022DNAArtificial SequenceligAPh hybrid 3atg act cca agc att agt gag caa ata aac cat ctt cgt agt acg ctt 48Met Thr Pro Ser Ile Ser Glu Gln Ile Asn His Leu Arg Ser Thr Leu 1 5 10 15 gaa cag cac agt tac aat tat tat gta ctt gat acc ccc agt att cct 96Glu Gln His Ser Tyr Asn Tyr Tyr Val Leu Asp Thr Pro Ser Ile Pro 20 25 30 gat gct gaa tac gac cgt tta tta caa caa ctc agc gca cta gaa act 144Asp Ala Glu Tyr Asp Arg Leu Leu Gln Gln Leu Ser Ala Leu Glu Thr 35 40 45 cag cac cca gaa tta ata act gcc gac tca cca acc caa aaa gtg ggc 192Gln His Pro Glu Leu Ile Thr Ala Asp Ser Pro Thr Gln Lys Val Gly 50 55 60 ggt gct gcg cta agt aaa ttt gag caa gta gcg cac caa gtg cct atg 240Gly Ala Ala Leu Ser Lys Phe Glu Gln Val Ala His Gln Val Pro Met 65 70 75 80 tta tcg ctt gat aac gcc ttt agc gaa gat gag ttt att gcc ttt aat 288Leu Ser Leu Asp Asn Ala Phe Ser Glu Asp Glu Phe Ile Ala Phe Asn 85 90 95 cgc cgt ata aaa gag cgt tta atg agt acc gaa gag ctt act ttt tgt 336Arg Arg Ile Lys Glu Arg Leu Met Ser Thr Glu Glu Leu Thr Phe Cys 100 105 110 tgt gag cca aaa cta gat ggc tta gct gtg tcg att att tat cgt gat 384Cys Glu Pro Lys Leu Asp Gly Leu Ala Val Ser Ile Ile Tyr Arg Asp 115 120 125 ggc gta cta gtg caa gcc gcg acc cga ggt gat ggg ttg acg gga gaa 432Gly Val Leu Val Gln Ala Ala Thr Arg Gly Asp Gly Leu Thr Gly Glu 130 135 140 aat gta act caa aac gtt aaa aca att cgt aat gtg cca ctt aaa tta 480Asn Val Thr Gln Asn Val Lys Thr Ile Arg Asn Val Pro Leu Lys Leu 145 150 155 160 cga ggt agc gat tat cct gct gaa cta gaa gtg cgc ggc gaa gtg ttt 528Arg Gly Ser Asp Tyr Pro Ala Glu Leu Glu Val Arg Gly Glu Val Phe 165 170 175 atg gat aat gca ggc ttt gaa aag ttt aac att gaa gct gaa aaa cgt 576Met Asp Asn Ala Gly Phe Glu Lys Phe Asn Ile Glu Ala Glu Lys Arg 180 185 190 ggt gaa aaa gta ttt gta aac cca cgc aac gcc gcc gca ggt agc ctg 624Gly Glu Lys Val Phe Val Asn Pro Arg Asn Ala Ala Ala Gly Ser Leu 195 200 205 cgc cag ctt gac tct aaa att acg gct aaa cgc cca ctg atg ttt tat 672Arg Gln Leu Asp Ser Lys Ile Thr Ala Lys Arg Pro Leu Met Phe Tyr 210 215 220 gcc tac agc aca ggt ctt gta gcc gac ggt agc att gca gag gat cat

720Ala Tyr Ser Thr Gly Leu Val Ala Asp Gly Ser Ile Ala Glu Asp His 225 230 235 240 tat cag caa tta gaa aaa ttg act gat tgg ggg tta cca ctt tgc cct 768Tyr Gln Gln Leu Glu Lys Leu Thr Asp Trp Gly Leu Pro Leu Cys Pro 245 250 255 gaa aca aaa tta gta gaa ggc cca caa gct gca ctg gct tat tat act 816Glu Thr Lys Leu Val Glu Gly Pro Gln Ala Ala Leu Ala Tyr Tyr Thr 260 265 270 gac att tta acg cgc cgt ggc gag ctt aaa tat gaa ata gat ggc gtg 864Asp Ile Leu Thr Arg Arg Gly Glu Leu Lys Tyr Glu Ile Asp Gly Val 275 280 285 gta ata aaa ata aat caa aaa gcc tta caa gag cgt tta ggc ttt gta 912Val Ile Lys Ile Asn Gln Lys Ala Leu Gln Glu Arg Leu Gly Phe Val 290 295 300 gca cgc gct ccg cgt tgg gct att gct tat aag ttc ccg gcc caa gaa 960Ala Arg Ala Pro Arg Trp Ala Ile Ala Tyr Lys Phe Pro Ala Gln Glu 305 310 315 320 gaa ata acc aaa tta ctc gat gta gag ttt cag gtg ggg cgt acg gga 1008Glu Ile Thr Lys Leu Leu Asp Val Glu Phe Gln Val Gly Arg Thr Gly 325 330 335 gca att aca ccg gtt gca cgc tta gag ccg gta ttt gtt ggt ggt gtt 1056Ala Ile Thr Pro Val Ala Arg Leu Glu Pro Val Phe Val Gly Gly Val 340 345 350 act gta tca aac gct acc ttg cac aat ggc gat gaa ata gcg cgc tta 1104Thr Val Ser Asn Ala Thr Leu His Asn Gly Asp Glu Ile Ala Arg Leu 355 360 365 ggc gta aaa gtg ggc gac acg gta att att cgc cgt gca ggg gac gta 1152Gly Val Lys Val Gly Asp Thr Val Ile Ile Arg Arg Ala Gly Asp Val 370 375 380 att cca caa ata acg caa gta gta ctt gag cgc cgc cct gat gat gcc 1200Ile Pro Gln Ile Thr Gln Val Val Leu Glu Arg Arg Pro Asp Asp Ala 385 390 395 400 cgc gat att gag ttt ccg gta act tgc cca att tgt gac tcc cat gta 1248Arg Asp Ile Glu Phe Pro Val Thr Cys Pro Ile Cys Asp Ser His Val 405 410 415 gaa aaa gta gaa ggt gaa gcc gta gcg cgt tgt act ggt ggt tta gtg 1296Glu Lys Val Glu Gly Glu Ala Val Ala Arg Cys Thr Gly Gly Leu Val 420 425 430 tgc ccg gcg caa cgt aaa caa gcg att aaa cac ttt gca tcg cgc aaa 1344Cys Pro Ala Gln Arg Lys Gln Ala Ile Lys His Phe Ala Ser Arg Lys 435 440 445 gca ctc gat att gac ggc ctt ggc gat aaa att gtt gat caa ctc gtc 1392Ala Leu Asp Ile Asp Gly Leu Gly Asp Lys Ile Val Asp Gln Leu Val 450 455 460 gac aga gag ctg att aaa acc cct gca gat ttg ttt att tta aag caa 1440Asp Arg Glu Leu Ile Lys Thr Pro Ala Asp Leu Phe Ile Leu Lys Gln 465 470 475 480 gga cat ttt gaa tcg ctt gag cgt atg ggg cca aag tcg gct aaa aat 1488Gly His Phe Glu Ser Leu Glu Arg Met Gly Pro Lys Ser Ala Lys Asn 485 490 495 ttg gtt act gcg ctt caa gac gct aaa gca aca act ttg gct aag ttt 1536Leu Val Thr Ala Leu Gln Asp Ala Lys Ala Thr Thr Leu Ala Lys Phe 500 505 510 tta tac tca ttg ggt att cgt gaa gcg ggt gag gca acc aca caa aat 1584Leu Tyr Ser Leu Gly Ile Arg Glu Ala Gly Glu Ala Thr Thr Gln Asn 515 520 525 tta gct aat cat ttc tta acc ctt gaa aac gta ata aat gcc agc att 1632Leu Ala Asn His Phe Leu Thr Leu Glu Asn Val Ile Asn Ala Ser Ile 530 535 540 gat agt tta act caa gta agt gat gtg ggc gaa ata gta gca acc cat 1680Asp Ser Leu Thr Gln Val Ser Asp Val Gly Glu Ile Val Ala Thr His 545 550 555 560 gta cgt agc ttt ttt gcc gaa cag cat aat tta gat gtt gta aat gcg 1728Val Arg Ser Phe Phe Ala Glu Gln His Asn Leu Asp Val Val Asn Ala 565 570 575 ctg gta gag caa ggt att aat tgg cct gaa ctt act cca cct tca gcg 1776Leu Val Glu Gln Gly Ile Asn Trp Pro Glu Leu Thr Pro Pro Ser Ala 580 585 590 caa gag cag cca tta gct ggc ctt gtt tat gtg ctt acc ggt acc tta 1824Gln Glu Gln Pro Leu Ala Gly Leu Val Tyr Val Leu Thr Gly Thr Leu 595 600 605 aac aca tta aac cgt aat gac gcc aaa gca cgt ttg caa cag tta ggt 1872Asn Thr Leu Asn Arg Asn Asp Ala Lys Ala Arg Leu Gln Gln Leu Gly 610 615 620 gct aaa gtg tcg ggt agt gtg tcg gct aaa acc gat gcg tta gta gca 1920Ala Lys Val Ser Gly Ser Val Ser Ala Lys Thr Asp Ala Leu Val Ala 625 630 635 640 ggc gaa aag gcc ggc tct aaa cta act aag gca caa gac tta ggt ata 1968Gly Glu Lys Ala Gly Ser Lys Leu Thr Lys Ala Gln Asp Leu Gly Ile 645 650 655 gat gta ctg aca gaa gaa gat tta att aat tta tta gag caa cat aat 2016Asp Val Leu Thr Glu Glu Asp Leu Ile Asn Leu Leu Glu Gln His Asn 660 665 670 ggc tga 2022Gly 4673PRTArtificial SequenceSynthetic Construct 4Met Thr Pro Ser Ile Ser Glu Gln Ile Asn His Leu Arg Ser Thr Leu 1 5 10 15 Glu Gln His Ser Tyr Asn Tyr Tyr Val Leu Asp Thr Pro Ser Ile Pro 20 25 30 Asp Ala Glu Tyr Asp Arg Leu Leu Gln Gln Leu Ser Ala Leu Glu Thr 35 40 45 Gln His Pro Glu Leu Ile Thr Ala Asp Ser Pro Thr Gln Lys Val Gly 50 55 60 Gly Ala Ala Leu Ser Lys Phe Glu Gln Val Ala His Gln Val Pro Met 65 70 75 80 Leu Ser Leu Asp Asn Ala Phe Ser Glu Asp Glu Phe Ile Ala Phe Asn 85 90 95 Arg Arg Ile Lys Glu Arg Leu Met Ser Thr Glu Glu Leu Thr Phe Cys 100 105 110 Cys Glu Pro Lys Leu Asp Gly Leu Ala Val Ser Ile Ile Tyr Arg Asp 115 120 125 Gly Val Leu Val Gln Ala Ala Thr Arg Gly Asp Gly Leu Thr Gly Glu 130 135 140 Asn Val Thr Gln Asn Val Lys Thr Ile Arg Asn Val Pro Leu Lys Leu 145 150 155 160 Arg Gly Ser Asp Tyr Pro Ala Glu Leu Glu Val Arg Gly Glu Val Phe 165 170 175 Met Asp Asn Ala Gly Phe Glu Lys Phe Asn Ile Glu Ala Glu Lys Arg 180 185 190 Gly Glu Lys Val Phe Val Asn Pro Arg Asn Ala Ala Ala Gly Ser Leu 195 200 205 Arg Gln Leu Asp Ser Lys Ile Thr Ala Lys Arg Pro Leu Met Phe Tyr 210 215 220 Ala Tyr Ser Thr Gly Leu Val Ala Asp Gly Ser Ile Ala Glu Asp His 225 230 235 240 Tyr Gln Gln Leu Glu Lys Leu Thr Asp Trp Gly Leu Pro Leu Cys Pro 245 250 255 Glu Thr Lys Leu Val Glu Gly Pro Gln Ala Ala Leu Ala Tyr Tyr Thr 260 265 270 Asp Ile Leu Thr Arg Arg Gly Glu Leu Lys Tyr Glu Ile Asp Gly Val 275 280 285 Val Ile Lys Ile Asn Gln Lys Ala Leu Gln Glu Arg Leu Gly Phe Val 290 295 300 Ala Arg Ala Pro Arg Trp Ala Ile Ala Tyr Lys Phe Pro Ala Gln Glu 305 310 315 320 Glu Ile Thr Lys Leu Leu Asp Val Glu Phe Gln Val Gly Arg Thr Gly 325 330 335 Ala Ile Thr Pro Val Ala Arg Leu Glu Pro Val Phe Val Gly Gly Val 340 345 350 Thr Val Ser Asn Ala Thr Leu His Asn Gly Asp Glu Ile Ala Arg Leu 355 360 365 Gly Val Lys Val Gly Asp Thr Val Ile Ile Arg Arg Ala Gly Asp Val 370 375 380 Ile Pro Gln Ile Thr Gln Val Val Leu Glu Arg Arg Pro Asp Asp Ala 385 390 395 400 Arg Asp Ile Glu Phe Pro Val Thr Cys Pro Ile Cys Asp Ser His Val 405 410 415 Glu Lys Val Glu Gly Glu Ala Val Ala Arg Cys Thr Gly Gly Leu Val 420 425 430 Cys Pro Ala Gln Arg Lys Gln Ala Ile Lys His Phe Ala Ser Arg Lys 435 440 445 Ala Leu Asp Ile Asp Gly Leu Gly Asp Lys Ile Val Asp Gln Leu Val 450 455 460 Asp Arg Glu Leu Ile Lys Thr Pro Ala Asp Leu Phe Ile Leu Lys Gln 465 470 475 480 Gly His Phe Glu Ser Leu Glu Arg Met Gly Pro Lys Ser Ala Lys Asn 485 490 495 Leu Val Thr Ala Leu Gln Asp Ala Lys Ala Thr Thr Leu Ala Lys Phe 500 505 510 Leu Tyr Ser Leu Gly Ile Arg Glu Ala Gly Glu Ala Thr Thr Gln Asn 515 520 525 Leu Ala Asn His Phe Leu Thr Leu Glu Asn Val Ile Asn Ala Ser Ile 530 535 540 Asp Ser Leu Thr Gln Val Ser Asp Val Gly Glu Ile Val Ala Thr His 545 550 555 560 Val Arg Ser Phe Phe Ala Glu Gln His Asn Leu Asp Val Val Asn Ala 565 570 575 Leu Val Glu Gln Gly Ile Asn Trp Pro Glu Leu Thr Pro Pro Ser Ala 580 585 590 Gln Glu Gln Pro Leu Ala Gly Leu Val Tyr Val Leu Thr Gly Thr Leu 595 600 605 Asn Thr Leu Asn Arg Asn Asp Ala Lys Ala Arg Leu Gln Gln Leu Gly 610 615 620 Ala Lys Val Ser Gly Ser Val Ser Ala Lys Thr Asp Ala Leu Val Ala 625 630 635 640 Gly Glu Lys Ala Gly Ser Lys Leu Thr Lys Ala Gln Asp Leu Gly Ile 645 650 655 Asp Val Leu Thr Glu Glu Asp Leu Ile Asn Leu Leu Glu Gln His Asn 660 665 670 Gly 52022DNAArtificial SequenceLigAPh2 hybrid 5atg act cca agc att agt gag caa ata aac cat ctt cgt agt acg ctt 48Met Thr Pro Ser Ile Ser Glu Gln Ile Asn His Leu Arg Ser Thr Leu 1 5 10 15 gaa cag cac agt tac aat tat tat gta ctt gat acc ccc agt att cct 96Glu Gln His Ser Tyr Asn Tyr Tyr Val Leu Asp Thr Pro Ser Ile Pro 20 25 30 gat gct gaa tac gac cgt tta tta caa caa ctc agc gca cta gaa act 144Asp Ala Glu Tyr Asp Arg Leu Leu Gln Gln Leu Ser Ala Leu Glu Thr 35 40 45 cag cac cca gaa tta ata act gcc gac tca cca acc caa aaa gtg ggc 192Gln His Pro Glu Leu Ile Thr Ala Asp Ser Pro Thr Gln Lys Val Gly 50 55 60 ggt gct gcg cta agt aaa ttt gag caa gta gcg cac caa gtg cct atg 240Gly Ala Ala Leu Ser Lys Phe Glu Gln Val Ala His Gln Val Pro Met 65 70 75 80 tta tcg ctt gat aac gcc ttt agc gaa gat gag ttt att gcc ttt aat 288Leu Ser Leu Asp Asn Ala Phe Ser Glu Asp Glu Phe Ile Ala Phe Asn 85 90 95 cgc cgt ata aaa gag cgt tta atg agt acc gaa gag ctt act ttt tgt 336Arg Arg Ile Lys Glu Arg Leu Met Ser Thr Glu Glu Leu Thr Phe Cys 100 105 110 tgt gag cca aaa cta gat ggc tta gct gtg tcg att att tat cgt gat 384Cys Glu Pro Lys Leu Asp Gly Leu Ala Val Ser Ile Ile Tyr Arg Asp 115 120 125 ggc gta cta gtg caa gcc gcg acc cga ggt gat ggg ttg acg gga gaa 432Gly Val Leu Val Gln Ala Ala Thr Arg Gly Asp Gly Leu Thr Gly Glu 130 135 140 aat gta act caa aaa gtt aaa aca att cgt aat gtg cca ctt aaa tta 480Asn Val Thr Gln Lys Val Lys Thr Ile Arg Asn Val Pro Leu Lys Leu 145 150 155 160 cga ggt agc gat tat cct gct gaa cta gaa gtg cgc ggc gaa gtg ttt 528Arg Gly Ser Asp Tyr Pro Ala Glu Leu Glu Val Arg Gly Glu Val Phe 165 170 175 atg gat aat gca ggc ttt gaa aag ttt aac att gaa gct gaa aaa cgt 576Met Asp Asn Ala Gly Phe Glu Lys Phe Asn Ile Glu Ala Glu Lys Arg 180 185 190 ggt gaa aaa gta ttt gta aac cca cgc aac gcc gcc gca ggt agc ctg 624Gly Glu Lys Val Phe Val Asn Pro Arg Asn Ala Ala Ala Gly Ser Leu 195 200 205 cgc cag ctt gac tct aaa att acg gct aaa cgc cca ctg atg ttt tat 672Arg Gln Leu Asp Ser Lys Ile Thr Ala Lys Arg Pro Leu Met Phe Tyr 210 215 220 gcc tac agc aca ggt ctt gta gcc gac ggt agc att gca gag gat cat 720Ala Tyr Ser Thr Gly Leu Val Ala Asp Gly Ser Ile Ala Glu Asp His 225 230 235 240 tat cag caa tta gaa aaa ttg act gat tgg ggg tta cca ctt tgc cct 768Tyr Gln Gln Leu Glu Lys Leu Thr Asp Trp Gly Leu Pro Leu Cys Pro 245 250 255 gaa aca aaa tta gta gaa ggc cca caa gct gca ctg gct tat tat act 816Glu Thr Lys Leu Val Glu Gly Pro Gln Ala Ala Leu Ala Tyr Tyr Thr 260 265 270 gac att tta acg cgc cgt ggc gag ctt aaa tat gaa ata gat ggc gtg 864Asp Ile Leu Thr Arg Arg Gly Glu Leu Lys Tyr Glu Ile Asp Gly Val 275 280 285 gta ata aaa ata aat caa aaa gcc tta caa gag cgt tta ggc ttt gta 912Val Ile Lys Ile Asn Gln Lys Ala Leu Gln Glu Arg Leu Gly Phe Val 290 295 300 gca cgc gct ccg cgt tgg gct att gct tat aag ttc ccg gcc caa gaa 960Ala Arg Ala Pro Arg Trp Ala Ile Ala Tyr Lys Phe Pro Ala Gln Glu 305 310 315 320 gaa ata acc aaa tta ctc gat gta gag ttt cag gtg ggg cgt acg gga 1008Glu Ile Thr Lys Leu Leu Asp Val Glu Phe Gln Val Gly Arg Thr Gly 325 330 335 gca att aca ccg gtt gca cgc tta gag ccg gta ttt gtt ggt ggt gtt 1056Ala Ile Thr Pro Val Ala Arg Leu Glu Pro Val Phe Val Gly Gly Val 340 345 350 act gta tca aac gct acc ttg cac aat ggc gat gaa ata gcg cgc tta 1104Thr Val Ser Asn Ala Thr Leu His Asn Gly Asp Glu Ile Ala Arg Leu 355 360 365 ggc gta aaa gtg ggc gac acg gta att att cgc cgt gca ggg gac gta 1152Gly Val Lys Val Gly Asp Thr Val Ile Ile Arg Arg Ala Gly Asp Val 370 375 380 att cca caa ata acg caa gta gta ctt gag cgc cgc cct gat gat gcc 1200Ile Pro Gln Ile Thr Gln Val Val Leu Glu Arg Arg Pro Asp Asp Ala 385 390 395 400 cgc gat att gag ttt ccg gta act tgc cca att tgt gac tcc cat gta 1248Arg Asp Ile Glu Phe Pro Val Thr Cys Pro Ile Cys Asp Ser His Val 405 410 415 gaa aaa gta gaa ggt gaa gcc gta gcg cgt tgt act ggt ggt tta gtg 1296Glu Lys Val Glu Gly Glu Ala Val Ala Arg Cys Thr Gly Gly Leu Val 420 425 430 tgc ccg gcg caa cgt aaa caa gcg att aaa cac ttt gca tcg cgc aaa 1344Cys Pro Ala Gln Arg Lys Gln Ala Ile Lys His Phe Ala Ser Arg Lys 435 440 445 gca ctc gat att gac ggc ctt ggc gat aaa att gtt gat caa ctc gtc 1392Ala Leu Asp Ile Asp Gly Leu Gly Asp Lys Ile Val Asp Gln Leu Val 450 455 460 gac aga gag ctg att aaa acc cct gca gat ttg ttt att tta aag caa 1440Asp Arg Glu Leu Ile Lys Thr Pro Ala Asp Leu Phe Ile Leu Lys Gln 465 470 475 480 gga cat ttt gaa tcg ctt gag cgt atg ggg cca aag tcg gct aaa aat 1488Gly His Phe Glu Ser Leu Glu Arg Met Gly Pro Lys Ser Ala Lys Asn

485 490 495 ttg gtt act gcg ctt caa gac gct aaa gca aca act ttg gct aag ttt 1536Leu Val Thr Ala Leu Gln Asp Ala Lys Ala Thr Thr Leu Ala Lys Phe 500 505 510 tta tac tca ttg ggt att cgt gaa gcg ggt gag gca acc aca caa aat 1584Leu Tyr Ser Leu Gly Ile Arg Glu Ala Gly Glu Ala Thr Thr Gln Asn 515 520 525 tta gct aat cat ttc tta acc ctt gaa aac gta ata aat gcc agc att 1632Leu Ala Asn His Phe Leu Thr Leu Glu Asn Val Ile Asn Ala Ser Ile 530 535 540 gat agt tta act caa gta agt gat gtg ggc gaa ata gta gca acc cat 1680Asp Ser Leu Thr Gln Val Ser Asp Val Gly Glu Ile Val Ala Thr His 545 550 555 560 gta cgt agc ttt ttt gcc gaa cag cat aat tta gat gtt gta aat gcg 1728Val Arg Ser Phe Phe Ala Glu Gln His Asn Leu Asp Val Val Asn Ala 565 570 575 ctg gta gag caa ggt att aat tgg cct gaa ctt act cca cct tca gcg 1776Leu Val Glu Gln Gly Ile Asn Trp Pro Glu Leu Thr Pro Pro Ser Ala 580 585 590 caa gag cag cca tta gct ggc ctt gtt tat gtg ctt acc ggt acc tta 1824Gln Glu Gln Pro Leu Ala Gly Leu Val Tyr Val Leu Thr Gly Thr Leu 595 600 605 aac aca tta aac cgt aat gac gcc aaa gca cgt ttg caa cag tta ggt 1872Asn Thr Leu Asn Arg Asn Asp Ala Lys Ala Arg Leu Gln Gln Leu Gly 610 615 620 gct aaa gtg tcg ggt agt gtg tcg gct aaa acc gat gcg tta gta gca 1920Ala Lys Val Ser Gly Ser Val Ser Ala Lys Thr Asp Ala Leu Val Ala 625 630 635 640 ggc gaa aag gcc ggc tct aaa cta act aag gca caa gac tta ggt ata 1968Gly Glu Lys Ala Gly Ser Lys Leu Thr Lys Ala Gln Asp Leu Gly Ile 645 650 655 gat gta ctg aca gaa gaa gat tta att aat tta tta gag caa cat aat 2016Asp Val Leu Thr Glu Glu Asp Leu Ile Asn Leu Leu Glu Gln His Asn 660 665 670 ggc tga 2022Gly 6673PRTArtificial SequenceSynthetic Construct 6Met Thr Pro Ser Ile Ser Glu Gln Ile Asn His Leu Arg Ser Thr Leu 1 5 10 15 Glu Gln His Ser Tyr Asn Tyr Tyr Val Leu Asp Thr Pro Ser Ile Pro 20 25 30 Asp Ala Glu Tyr Asp Arg Leu Leu Gln Gln Leu Ser Ala Leu Glu Thr 35 40 45 Gln His Pro Glu Leu Ile Thr Ala Asp Ser Pro Thr Gln Lys Val Gly 50 55 60 Gly Ala Ala Leu Ser Lys Phe Glu Gln Val Ala His Gln Val Pro Met 65 70 75 80 Leu Ser Leu Asp Asn Ala Phe Ser Glu Asp Glu Phe Ile Ala Phe Asn 85 90 95 Arg Arg Ile Lys Glu Arg Leu Met Ser Thr Glu Glu Leu Thr Phe Cys 100 105 110 Cys Glu Pro Lys Leu Asp Gly Leu Ala Val Ser Ile Ile Tyr Arg Asp 115 120 125 Gly Val Leu Val Gln Ala Ala Thr Arg Gly Asp Gly Leu Thr Gly Glu 130 135 140 Asn Val Thr Gln Lys Val Lys Thr Ile Arg Asn Val Pro Leu Lys Leu 145 150 155 160 Arg Gly Ser Asp Tyr Pro Ala Glu Leu Glu Val Arg Gly Glu Val Phe 165 170 175 Met Asp Asn Ala Gly Phe Glu Lys Phe Asn Ile Glu Ala Glu Lys Arg 180 185 190 Gly Glu Lys Val Phe Val Asn Pro Arg Asn Ala Ala Ala Gly Ser Leu 195 200 205 Arg Gln Leu Asp Ser Lys Ile Thr Ala Lys Arg Pro Leu Met Phe Tyr 210 215 220 Ala Tyr Ser Thr Gly Leu Val Ala Asp Gly Ser Ile Ala Glu Asp His 225 230 235 240 Tyr Gln Gln Leu Glu Lys Leu Thr Asp Trp Gly Leu Pro Leu Cys Pro 245 250 255 Glu Thr Lys Leu Val Glu Gly Pro Gln Ala Ala Leu Ala Tyr Tyr Thr 260 265 270 Asp Ile Leu Thr Arg Arg Gly Glu Leu Lys Tyr Glu Ile Asp Gly Val 275 280 285 Val Ile Lys Ile Asn Gln Lys Ala Leu Gln Glu Arg Leu Gly Phe Val 290 295 300 Ala Arg Ala Pro Arg Trp Ala Ile Ala Tyr Lys Phe Pro Ala Gln Glu 305 310 315 320 Glu Ile Thr Lys Leu Leu Asp Val Glu Phe Gln Val Gly Arg Thr Gly 325 330 335 Ala Ile Thr Pro Val Ala Arg Leu Glu Pro Val Phe Val Gly Gly Val 340 345 350 Thr Val Ser Asn Ala Thr Leu His Asn Gly Asp Glu Ile Ala Arg Leu 355 360 365 Gly Val Lys Val Gly Asp Thr Val Ile Ile Arg Arg Ala Gly Asp Val 370 375 380 Ile Pro Gln Ile Thr Gln Val Val Leu Glu Arg Arg Pro Asp Asp Ala 385 390 395 400 Arg Asp Ile Glu Phe Pro Val Thr Cys Pro Ile Cys Asp Ser His Val 405 410 415 Glu Lys Val Glu Gly Glu Ala Val Ala Arg Cys Thr Gly Gly Leu Val 420 425 430 Cys Pro Ala Gln Arg Lys Gln Ala Ile Lys His Phe Ala Ser Arg Lys 435 440 445 Ala Leu Asp Ile Asp Gly Leu Gly Asp Lys Ile Val Asp Gln Leu Val 450 455 460 Asp Arg Glu Leu Ile Lys Thr Pro Ala Asp Leu Phe Ile Leu Lys Gln 465 470 475 480 Gly His Phe Glu Ser Leu Glu Arg Met Gly Pro Lys Ser Ala Lys Asn 485 490 495 Leu Val Thr Ala Leu Gln Asp Ala Lys Ala Thr Thr Leu Ala Lys Phe 500 505 510 Leu Tyr Ser Leu Gly Ile Arg Glu Ala Gly Glu Ala Thr Thr Gln Asn 515 520 525 Leu Ala Asn His Phe Leu Thr Leu Glu Asn Val Ile Asn Ala Ser Ile 530 535 540 Asp Ser Leu Thr Gln Val Ser Asp Val Gly Glu Ile Val Ala Thr His 545 550 555 560 Val Arg Ser Phe Phe Ala Glu Gln His Asn Leu Asp Val Val Asn Ala 565 570 575 Leu Val Glu Gln Gly Ile Asn Trp Pro Glu Leu Thr Pro Pro Ser Ala 580 585 590 Gln Glu Gln Pro Leu Ala Gly Leu Val Tyr Val Leu Thr Gly Thr Leu 595 600 605 Asn Thr Leu Asn Arg Asn Asp Ala Lys Ala Arg Leu Gln Gln Leu Gly 610 615 620 Ala Lys Val Ser Gly Ser Val Ser Ala Lys Thr Asp Ala Leu Val Ala 625 630 635 640 Gly Glu Lys Ala Gly Ser Lys Leu Thr Lys Ala Gln Asp Leu Gly Ile 645 650 655 Asp Val Leu Thr Glu Glu Asp Leu Ile Asn Leu Leu Glu Gln His Asn 660 665 670 Gly 72013DNAArtificial SequenceLigASf hybrid 7atg act cca att caa act gaa atg gat caa ctt act cac acc att aac 48Met Thr Pro Ile Gln Thr Glu Met Asp Gln Leu Thr His Thr Ile Asn 1 5 10 15 caa cat aat att cgt tat tac gtt gat gat gct ccg tca ata ccc gat 96Gln His Asn Ile Arg Tyr Tyr Val Asp Asp Ala Pro Ser Ile Pro Asp 20 25 30 gct gaa tac gac aga tta att aag cgc tta act gag tta gaa cgt gac 144Ala Glu Tyr Asp Arg Leu Ile Lys Arg Leu Thr Glu Leu Glu Arg Asp 35 40 45 tat ccg caa ttt aaa tcg gta gat tca ccg aca caa cgc gtc ggt ggt 192Tyr Pro Gln Phe Lys Ser Val Asp Ser Pro Thr Gln Arg Val Gly Gly 50 55 60 ata gca tta caa aaa ttt gct caa att acc cac ctt aaa ccg atg tta 240Ile Ala Leu Gln Lys Phe Ala Gln Ile Thr His Leu Lys Pro Met Leu 65 70 75 80 agt ctc gac aat gcg ttt gaa caa gcc gat ttt gca gca ttt aat aag 288Ser Leu Asp Asn Ala Phe Glu Gln Ala Asp Phe Ala Ala Phe Asn Lys 85 90 95 cgt ata act gat aaa gtc gat agc gtc gat tat gtt tgc gaa cca aaa 336Arg Ile Thr Asp Lys Val Asp Ser Val Asp Tyr Val Cys Glu Pro Lys 100 105 110 cta gac gga ttg gcc gtg agt att act tat cgt ttt ggc gtt ctt gaa 384Leu Asp Gly Leu Ala Val Ser Ile Thr Tyr Arg Phe Gly Val Leu Glu 115 120 125 cgc gcc gca acg cga ggt gat ggc agt gtc ggc gaa gat att acc gct 432Arg Ala Ala Thr Arg Gly Asp Gly Ser Val Gly Glu Asp Ile Thr Ala 130 135 140 aat gtg cgt act att cgt tca att cct ctt aag tta cgc ggt gaa gga 480Asn Val Arg Thr Ile Arg Ser Ile Pro Leu Lys Leu Arg Gly Glu Gly 145 150 155 160 ttt cca gat tta gtt gaa gta cgt ggc gaa gtg ttt atg cct aaa gcg 528Phe Pro Asp Leu Val Glu Val Arg Gly Glu Val Phe Met Pro Lys Ala 165 170 175 gca ttt gag gca tta aac cag cgt caa atc agc aaa ggt gac aaa gtc 576Ala Phe Glu Ala Leu Asn Gln Arg Gln Ile Ser Lys Gly Asp Lys Val 180 185 190 ttt gtt aat cct cgc aac gca gct gcc ggc agt ttg cgc caa tta gac 624Phe Val Asn Pro Arg Asn Ala Ala Ala Gly Ser Leu Arg Gln Leu Asp 195 200 205 agt aaa att acc gct tca agg gct ctt ggg ttt tat gct tat gca tta 672Ser Lys Ile Thr Ala Ser Arg Ala Leu Gly Phe Tyr Ala Tyr Ala Leu 210 215 220 ggt gta gtc gaa ggc gag tca caa ccg atg caa aca agc cac tat ggc 720Gly Val Val Glu Gly Glu Ser Gln Pro Met Gln Thr Ser His Tyr Gly 225 230 235 240 caa cta aca cag ctg caa caa tgg ggt att ccc gtt agt agt gaa gtg 768Gln Leu Thr Gln Leu Gln Gln Trp Gly Ile Pro Val Ser Ser Glu Val 245 250 255 aaa gtg act gat tta tta gaa aaa gtc tat gca tat tac gcc gat att 816Lys Val Thr Asp Leu Leu Glu Lys Val Tyr Ala Tyr Tyr Ala Asp Ile 260 265 270 atg gcc aga cga agt gcg ctt gaa tat gaa att gac ggc gtc gtc ata 864Met Ala Arg Arg Ser Ala Leu Glu Tyr Glu Ile Asp Gly Val Val Ile 275 280 285 aag gtt aat gac att gcc aag caa caa aca ctt ggt ttt gtg gct aaa 912Lys Val Asn Asp Ile Ala Lys Gln Gln Thr Leu Gly Phe Val Ala Lys 290 295 300 gct cct cga tgg gcc ata gcc tat aaa ttt cca gcc cag gaa gaa atg 960Ala Pro Arg Trp Ala Ile Ala Tyr Lys Phe Pro Ala Gln Glu Glu Met 305 310 315 320 acc ttg tta gag tct gtt gac ttt cag gtt ggc cga acg ggt gct gtt 1008Thr Leu Leu Glu Ser Val Asp Phe Gln Val Gly Arg Thr Gly Ala Val 325 330 335 acc cct gtc gct cgc ctc aaa ccg ata ttt gtc ggt ggc gtg act gtg 1056Thr Pro Val Ala Arg Leu Lys Pro Ile Phe Val Gly Gly Val Thr Val 340 345 350 tcg aat gcg acc ttg cac aat gct gat gaa att gcc cgt ctt ggg gtg 1104Ser Asn Ala Thr Leu His Asn Ala Asp Glu Ile Ala Arg Leu Gly Val 355 360 365 aaa ata ggc gat aca gtg att att cgc cgc gca ggt gac gtt atc ccg 1152Lys Ile Gly Asp Thr Val Ile Ile Arg Arg Ala Gly Asp Val Ile Pro 370 375 380 caa att gtt gct atc gtg cca gaa aag cgc cct gat gat gca caa gat 1200Gln Ile Val Ala Ile Val Pro Glu Lys Arg Pro Asp Asp Ala Gln Asp 385 390 395 400 att atc ttt cca ctg cat tgt cct gtg tgc caa agc att gtt gag cgt 1248Ile Ile Phe Pro Leu His Cys Pro Val Cys Gln Ser Ile Val Glu Arg 405 410 415 tta gaa ggt gaa gct gta gcg cgt tgt agt ggt gga ctt ttt tgt gaa 1296Leu Glu Gly Glu Ala Val Ala Arg Cys Ser Gly Gly Leu Phe Cys Glu 420 425 430 gcg caa cgt aaa gag gcg att aaa cat ttt gca tcc cgt aaa gca tta 1344Ala Gln Arg Lys Glu Ala Ile Lys His Phe Ala Ser Arg Lys Ala Leu 435 440 445 aat att gat ggc atg ggc gat aaa atc gtt gag caa tta att gat aaa 1392Asn Ile Asp Gly Met Gly Asp Lys Ile Val Glu Gln Leu Ile Asp Lys 450 455 460 gaa cta gtc aaa acg cca gca gac ttg ttt tcc ctt acc gct tct agc 1440Glu Leu Val Lys Thr Pro Ala Asp Leu Phe Ser Leu Thr Ala Ser Ser 465 470 475 480 atc acg atg tta gat cgc atg gcg atg aag tca gcc aca aat att gtc 1488Ile Thr Met Leu Asp Arg Met Ala Met Lys Ser Ala Thr Asn Ile Val 485 490 495 gcg gcg att aaa cac gct aaa gcc act aca tta gcg cgt ttt tta tat 1536Ala Ala Ile Lys His Ala Lys Ala Thr Thr Leu Ala Arg Phe Leu Tyr 500 505 510 agt ctt ggg atc cgc gaa gtc ggc gaa gct acc gcc gct aat tta gcc 1584Ser Leu Gly Ile Arg Glu Val Gly Glu Ala Thr Ala Ala Asn Leu Ala 515 520 525 caa cac ttt gcc gaa ttt gag cgt att cga act gct agc gtt gaa caa 1632Gln His Phe Ala Glu Phe Glu Arg Ile Arg Thr Ala Ser Val Glu Gln 530 535 540 ctg ctc gaa gtc gct gat gtt ggt gac att gta gca aaa cac att cga 1680Leu Leu Glu Val Ala Asp Val Gly Asp Ile Val Ala Lys His Ile Arg 545 550 555 560 caa ttt ttt gca cag cca cat aac att gaa gta ata gag caa ttg ctt 1728Gln Phe Phe Ala Gln Pro His Asn Ile Glu Val Ile Glu Gln Leu Leu 565 570 575 gaa gcc ggc att act tgg cct gtt att gaa caa gct gac gaa tcg cag 1776Glu Ala Gly Ile Thr Trp Pro Val Ile Glu Gln Ala Asp Glu Ser Gln 580 585 590 ctt agt ctt aaa ggg caa acg tgg gtg tta act ggt acg cta act caa 1824Leu Ser Leu Lys Gly Gln Thr Trp Val Leu Thr Gly Thr Leu Thr Gln 595 600 605 ctt aat cgt aac gat gcc aaa gcc caa tta cag gct ttg ggc gcc aaa 1872Leu Asn Arg Asn Asp Ala Lys Ala Gln Leu Gln Ala Leu Gly Ala Lys 610 615 620 gtg gct ggc agt gtt tcg aaa aat act gat tgc ctt gtt gct ggt gaa 1920Val Ala Gly Ser Val Ser Lys Asn Thr Asp Cys Leu Val Ala Gly Glu 625 630 635 640 gca gcg ggt tct aaa tta gca aaa gct gaa gaa ttg ggc gtt aag gtg 1968Ala Ala Gly Ser Lys Leu Ala Lys Ala Glu Glu Leu Gly Val Lys Val 645 650 655 ata gat gaa caa gct ctg atg gat tta ttg aat gcg gct aac tga 2013Ile Asp Glu Gln Ala Leu Met Asp Leu Leu Asn Ala Ala Asn 660 665 670 8670PRTArtificial SequenceSynthetic Construct 8Met Thr Pro Ile Gln Thr Glu Met Asp Gln Leu Thr His Thr Ile Asn 1 5 10 15 Gln His Asn Ile Arg Tyr Tyr Val Asp Asp Ala Pro Ser Ile Pro Asp 20 25 30 Ala Glu Tyr Asp Arg Leu Ile Lys Arg Leu Thr Glu Leu Glu Arg Asp 35 40 45 Tyr Pro Gln Phe Lys Ser Val Asp Ser Pro Thr Gln Arg Val Gly Gly 50 55 60 Ile Ala Leu Gln Lys Phe Ala Gln Ile Thr His Leu Lys Pro Met Leu 65 70 75 80 Ser Leu Asp Asn Ala Phe Glu Gln Ala Asp Phe Ala Ala Phe Asn Lys 85 90 95 Arg Ile Thr Asp Lys Val Asp Ser Val Asp Tyr Val Cys Glu Pro Lys 100 105 110 Leu Asp Gly Leu Ala Val Ser Ile Thr Tyr Arg Phe Gly Val Leu Glu 115 120 125 Arg Ala Ala Thr Arg Gly Asp

Gly Ser Val Gly Glu Asp Ile Thr Ala 130 135 140 Asn Val Arg Thr Ile Arg Ser Ile Pro Leu Lys Leu Arg Gly Glu Gly 145 150 155 160 Phe Pro Asp Leu Val Glu Val Arg Gly Glu Val Phe Met Pro Lys Ala 165 170 175 Ala Phe Glu Ala Leu Asn Gln Arg Gln Ile Ser Lys Gly Asp Lys Val 180 185 190 Phe Val Asn Pro Arg Asn Ala Ala Ala Gly Ser Leu Arg Gln Leu Asp 195 200 205 Ser Lys Ile Thr Ala Ser Arg Ala Leu Gly Phe Tyr Ala Tyr Ala Leu 210 215 220 Gly Val Val Glu Gly Glu Ser Gln Pro Met Gln Thr Ser His Tyr Gly 225 230 235 240 Gln Leu Thr Gln Leu Gln Gln Trp Gly Ile Pro Val Ser Ser Glu Val 245 250 255 Lys Val Thr Asp Leu Leu Glu Lys Val Tyr Ala Tyr Tyr Ala Asp Ile 260 265 270 Met Ala Arg Arg Ser Ala Leu Glu Tyr Glu Ile Asp Gly Val Val Ile 275 280 285 Lys Val Asn Asp Ile Ala Lys Gln Gln Thr Leu Gly Phe Val Ala Lys 290 295 300 Ala Pro Arg Trp Ala Ile Ala Tyr Lys Phe Pro Ala Gln Glu Glu Met 305 310 315 320 Thr Leu Leu Glu Ser Val Asp Phe Gln Val Gly Arg Thr Gly Ala Val 325 330 335 Thr Pro Val Ala Arg Leu Lys Pro Ile Phe Val Gly Gly Val Thr Val 340 345 350 Ser Asn Ala Thr Leu His Asn Ala Asp Glu Ile Ala Arg Leu Gly Val 355 360 365 Lys Ile Gly Asp Thr Val Ile Ile Arg Arg Ala Gly Asp Val Ile Pro 370 375 380 Gln Ile Val Ala Ile Val Pro Glu Lys Arg Pro Asp Asp Ala Gln Asp 385 390 395 400 Ile Ile Phe Pro Leu His Cys Pro Val Cys Gln Ser Ile Val Glu Arg 405 410 415 Leu Glu Gly Glu Ala Val Ala Arg Cys Ser Gly Gly Leu Phe Cys Glu 420 425 430 Ala Gln Arg Lys Glu Ala Ile Lys His Phe Ala Ser Arg Lys Ala Leu 435 440 445 Asn Ile Asp Gly Met Gly Asp Lys Ile Val Glu Gln Leu Ile Asp Lys 450 455 460 Glu Leu Val Lys Thr Pro Ala Asp Leu Phe Ser Leu Thr Ala Ser Ser 465 470 475 480 Ile Thr Met Leu Asp Arg Met Ala Met Lys Ser Ala Thr Asn Ile Val 485 490 495 Ala Ala Ile Lys His Ala Lys Ala Thr Thr Leu Ala Arg Phe Leu Tyr 500 505 510 Ser Leu Gly Ile Arg Glu Val Gly Glu Ala Thr Ala Ala Asn Leu Ala 515 520 525 Gln His Phe Ala Glu Phe Glu Arg Ile Arg Thr Ala Ser Val Glu Gln 530 535 540 Leu Leu Glu Val Ala Asp Val Gly Asp Ile Val Ala Lys His Ile Arg 545 550 555 560 Gln Phe Phe Ala Gln Pro His Asn Ile Glu Val Ile Glu Gln Leu Leu 565 570 575 Glu Ala Gly Ile Thr Trp Pro Val Ile Glu Gln Ala Asp Glu Ser Gln 580 585 590 Leu Ser Leu Lys Gly Gln Thr Trp Val Leu Thr Gly Thr Leu Thr Gln 595 600 605 Leu Asn Arg Asn Asp Ala Lys Ala Gln Leu Gln Ala Leu Gly Ala Lys 610 615 620 Val Ala Gly Ser Val Ser Lys Asn Thr Asp Cys Leu Val Ala Gly Glu 625 630 635 640 Ala Ala Gly Ser Lys Leu Ala Lys Ala Glu Glu Leu Gly Val Lys Val 645 650 655 Ile Asp Glu Gln Ala Leu Met Asp Leu Leu Asn Ala Ala Asn 660 665 670 91638DNAArtificial SequencePyrGCp hybrid 9atg aat tct aac act aaa att att ttc gtc aca ggt ggg gta gta tca 48Met Asn Ser Asn Thr Lys Ile Ile Phe Val Thr Gly Gly Val Val Ser 1 5 10 15 tca ctt ggt aag ggt gta act gcg gca tct ttg gct act ctc tta gaa 96Ser Leu Gly Lys Gly Val Thr Ala Ala Ser Leu Ala Thr Leu Leu Glu 20 25 30 agt cgt ggt ctt aat gta aca atg atg aag ctt gat cca tac atc aat 144Ser Arg Gly Leu Asn Val Thr Met Met Lys Leu Asp Pro Tyr Ile Asn 35 40 45 gtt gat cca ggg act atg agt cca ttg caa cat ggt gaa gtt ttt gta 192Val Asp Pro Gly Thr Met Ser Pro Leu Gln His Gly Glu Val Phe Val 50 55 60 acc gaa gat ggc gca gag act gat ctt gat tta ggt cat tat gag cgc 240Thr Glu Asp Gly Ala Glu Thr Asp Leu Asp Leu Gly His Tyr Glu Arg 65 70 75 80 ttt atc cgc aat aag atg act caa gca aat aac ttc aca acc ggt aaa 288Phe Ile Arg Asn Lys Met Thr Gln Ala Asn Asn Phe Thr Thr Gly Lys 85 90 95 gta tac cag agt gtg tta aga aga gag cgt aag ggt gat tat cta ggt 336Val Tyr Gln Ser Val Leu Arg Arg Glu Arg Lys Gly Asp Tyr Leu Gly 100 105 110 gct act atc cag gtg att cca cat atc att gat gag atc aaa agg cgt 384Ala Thr Ile Gln Val Ile Pro His Ile Ile Asp Glu Ile Lys Arg Arg 115 120 125 att tgt agt ggt att gct gat gat gtt gat gtt gcg att gtt gag att 432Ile Cys Ser Gly Ile Ala Asp Asp Val Asp Val Ala Ile Val Glu Ile 130 135 140 ggt ggt act gtt ggt gat atc gag tca caa cca ttt tta gaa gct att 480Gly Gly Thr Val Gly Asp Ile Glu Ser Gln Pro Phe Leu Glu Ala Ile 145 150 155 160 cgt caa ttg gca tta gag gta ggt cgt gat cgt gct atg ttt atg cat 528Arg Gln Leu Ala Leu Glu Val Gly Arg Asp Arg Ala Met Phe Met His 165 170 175 ttg acc tta gtg cca tat tta gca gca gca ggt gaa atc aaa act aaa 576Leu Thr Leu Val Pro Tyr Leu Ala Ala Ala Gly Glu Ile Lys Thr Lys 180 185 190 cca aca cag cac tca gta aaa gat tta cgc tct atc ggt att ttt cct 624Pro Thr Gln His Ser Val Lys Asp Leu Arg Ser Ile Gly Ile Phe Pro 195 200 205 gac att tta gta tgt cgt tca gac cgc gct att cct aac gcc gaa cgc 672Asp Ile Leu Val Cys Arg Ser Asp Arg Ala Ile Pro Asn Ala Glu Arg 210 215 220 gct aaa ata tct ctc ttc act aat gtt gaa gag aaa gcg gtt gta tca 720Ala Lys Ile Ser Leu Phe Thr Asn Val Glu Glu Lys Ala Val Val Ser 225 230 235 240 atg cgt gat gta gac agt att tat aag att cct gct tta tta aaa gct 768Met Arg Asp Val Asp Ser Ile Tyr Lys Ile Pro Ala Leu Leu Lys Ala 245 250 255 caa ggt acc gat gaa ata gtt gtt aag cga ttt ggt tta gat gta cct 816Gln Gly Thr Asp Glu Ile Val Val Lys Arg Phe Gly Leu Asp Val Pro 260 265 270 gaa gcc gac tta act gaa tgg gaa gaa gtg ctt tac cat gaa gca aat 864Glu Ala Asp Leu Thr Glu Trp Glu Glu Val Leu Tyr His Glu Ala Asn 275 280 285 cct atc ggt gaa gtg act att ggt atg gtt ggt aaa tac act gaa tta 912Pro Ile Gly Glu Val Thr Ile Gly Met Val Gly Lys Tyr Thr Glu Leu 290 295 300 cct gat gcg tac aaa tca gta aac gaa gcg tta aaa cat gca ggt ctt 960Pro Asp Ala Tyr Lys Ser Val Asn Glu Ala Leu Lys His Ala Gly Leu 305 310 315 320 aaa aac caa gtc act gta aat att aaa tac att gac tcg caa gat gta 1008Lys Asn Gln Val Thr Val Asn Ile Lys Tyr Ile Asp Ser Gln Asp Val 325 330 335 gaa gtc aaa ggt gtt gaa atc tta gct aac ttg gat gct att tta gtt 1056Glu Val Lys Gly Val Glu Ile Leu Ala Asn Leu Asp Ala Ile Leu Val 340 345 350 cct ggt ggt ttc ggt gaa cgt ggt gtt gaa ggt aaa att tta acg gca 1104Pro Gly Gly Phe Gly Glu Arg Gly Val Glu Gly Lys Ile Leu Thr Ala 355 360 365 caa tat gcg cgt gaa aac aaa gta cct tat tta ggt att tgt tta ggt 1152Gln Tyr Ala Arg Glu Asn Lys Val Pro Tyr Leu Gly Ile Cys Leu Gly 370 375 380 atg caa gta gcc tta att gaa ttt gct cgt aat gtt gcc ggt tta act 1200Met Gln Val Ala Leu Ile Glu Phe Ala Arg Asn Val Ala Gly Leu Thr 385 390 395 400 gat gcg cac agt act gaa ttt aat agc gaa act cca cac cca gtg gtt 1248Asp Ala His Ser Thr Glu Phe Asn Ser Glu Thr Pro His Pro Val Val 405 410 415 ggt tta atc agt gaa tgg tta gac gaa gaa ggc caa gtt gag tac cga 1296Gly Leu Ile Ser Glu Trp Leu Asp Glu Glu Gly Gln Val Glu Tyr Arg 420 425 430 aat gag caa tca gat tta ggt ggt act atg cgt tta ggt tca caa ttg 1344Asn Glu Gln Ser Asp Leu Gly Gly Thr Met Arg Leu Gly Ser Gln Leu 435 440 445 tgc cac ttg gtg aaa ggt acc aag gct tgc gac gta tat ggt agt gaa 1392Cys His Leu Val Lys Gly Thr Lys Ala Cys Asp Val Tyr Gly Ser Glu 450 455 460 aca atc aat gag aga cac cgt cat cgt ttt gag gta aat aat aac tac 1440Thr Ile Asn Glu Arg His Arg His Arg Phe Glu Val Asn Asn Asn Tyr 465 470 475 480 cga gaa caa tta agc aaa gca ggt ttg att ttc tcg ggt tta tcg tca 1488Arg Glu Gln Leu Ser Lys Ala Gly Leu Ile Phe Ser Gly Leu Ser Ser 485 490 495 gat aaa agt tta gtt gag gtg att gaa ata gcg gat cat cca tgg ttt 1536Asp Lys Ser Leu Val Glu Val Ile Glu Ile Ala Asp His Pro Trp Phe 500 505 510 att gcg ggt caa ttc cat cct gag ttt aat tct act cca cgt gat ggt 1584Ile Ala Gly Gln Phe His Pro Glu Phe Asn Ser Thr Pro Arg Asp Gly 515 520 525 cac ccg tta ttc gaa agc ttt gtt gca gcg agt ttt aaa ctg caa aat 1632His Pro Leu Phe Glu Ser Phe Val Ala Ala Ser Phe Lys Leu Gln Asn 530 535 540 aat tag 1638Asn 545 10545PRTArtificial SequenceSynthetic Construct 10Met Asn Ser Asn Thr Lys Ile Ile Phe Val Thr Gly Gly Val Val Ser 1 5 10 15 Ser Leu Gly Lys Gly Val Thr Ala Ala Ser Leu Ala Thr Leu Leu Glu 20 25 30 Ser Arg Gly Leu Asn Val Thr Met Met Lys Leu Asp Pro Tyr Ile Asn 35 40 45 Val Asp Pro Gly Thr Met Ser Pro Leu Gln His Gly Glu Val Phe Val 50 55 60 Thr Glu Asp Gly Ala Glu Thr Asp Leu Asp Leu Gly His Tyr Glu Arg 65 70 75 80 Phe Ile Arg Asn Lys Met Thr Gln Ala Asn Asn Phe Thr Thr Gly Lys 85 90 95 Val Tyr Gln Ser Val Leu Arg Arg Glu Arg Lys Gly Asp Tyr Leu Gly 100 105 110 Ala Thr Ile Gln Val Ile Pro His Ile Ile Asp Glu Ile Lys Arg Arg 115 120 125 Ile Cys Ser Gly Ile Ala Asp Asp Val Asp Val Ala Ile Val Glu Ile 130 135 140 Gly Gly Thr Val Gly Asp Ile Glu Ser Gln Pro Phe Leu Glu Ala Ile 145 150 155 160 Arg Gln Leu Ala Leu Glu Val Gly Arg Asp Arg Ala Met Phe Met His 165 170 175 Leu Thr Leu Val Pro Tyr Leu Ala Ala Ala Gly Glu Ile Lys Thr Lys 180 185 190 Pro Thr Gln His Ser Val Lys Asp Leu Arg Ser Ile Gly Ile Phe Pro 195 200 205 Asp Ile Leu Val Cys Arg Ser Asp Arg Ala Ile Pro Asn Ala Glu Arg 210 215 220 Ala Lys Ile Ser Leu Phe Thr Asn Val Glu Glu Lys Ala Val Val Ser 225 230 235 240 Met Arg Asp Val Asp Ser Ile Tyr Lys Ile Pro Ala Leu Leu Lys Ala 245 250 255 Gln Gly Thr Asp Glu Ile Val Val Lys Arg Phe Gly Leu Asp Val Pro 260 265 270 Glu Ala Asp Leu Thr Glu Trp Glu Glu Val Leu Tyr His Glu Ala Asn 275 280 285 Pro Ile Gly Glu Val Thr Ile Gly Met Val Gly Lys Tyr Thr Glu Leu 290 295 300 Pro Asp Ala Tyr Lys Ser Val Asn Glu Ala Leu Lys His Ala Gly Leu 305 310 315 320 Lys Asn Gln Val Thr Val Asn Ile Lys Tyr Ile Asp Ser Gln Asp Val 325 330 335 Glu Val Lys Gly Val Glu Ile Leu Ala Asn Leu Asp Ala Ile Leu Val 340 345 350 Pro Gly Gly Phe Gly Glu Arg Gly Val Glu Gly Lys Ile Leu Thr Ala 355 360 365 Gln Tyr Ala Arg Glu Asn Lys Val Pro Tyr Leu Gly Ile Cys Leu Gly 370 375 380 Met Gln Val Ala Leu Ile Glu Phe Ala Arg Asn Val Ala Gly Leu Thr 385 390 395 400 Asp Ala His Ser Thr Glu Phe Asn Ser Glu Thr Pro His Pro Val Val 405 410 415 Gly Leu Ile Ser Glu Trp Leu Asp Glu Glu Gly Gln Val Glu Tyr Arg 420 425 430 Asn Glu Gln Ser Asp Leu Gly Gly Thr Met Arg Leu Gly Ser Gln Leu 435 440 445 Cys His Leu Val Lys Gly Thr Lys Ala Cys Asp Val Tyr Gly Ser Glu 450 455 460 Thr Ile Asn Glu Arg His Arg His Arg Phe Glu Val Asn Asn Asn Tyr 465 470 475 480 Arg Glu Gln Leu Ser Lys Ala Gly Leu Ile Phe Ser Gly Leu Ser Ser 485 490 495 Asp Lys Ser Leu Val Glu Val Ile Glu Ile Ala Asp His Pro Trp Phe 500 505 510 Ile Ala Gly Gln Phe His Pro Glu Phe Asn Ser Thr Pro Arg Asp Gly 515 520 525 His Pro Leu Phe Glu Ser Phe Val Ala Ala Ser Phe Lys Leu Gln Asn 530 535 540 Asn 545 11954DNAArtificial SequencehemCCp hybrid 11atg aaa caa act aca gta cga att gcc acg cgt aaa agc gcc ctc gcc 48Met Lys Gln Thr Thr Val Arg Ile Ala Thr Arg Lys Ser Ala Leu Ala 1 5 10 15 tta tgg caa gca gaa tat gtt aaa gcg caa ctt gag cat ttt cat gac 96Leu Trp Gln Ala Glu Tyr Val Lys Ala Gln Leu Glu His Phe His Asp 20 25 30 ggt att aat gtt gaa tta gtg cct atg aca acg aaa ggc gac atc att 144Gly Ile Asn Val Glu Leu Val Pro Met Thr Thr Lys Gly Asp Ile Ile 35 40 45 tta gac acg cct tta gcc aaa gtc ggc ggt aaa ggt tta ttt gtt aaa 192Leu Asp Thr Pro Leu Ala Lys Val Gly Gly Lys Gly Leu Phe Val Lys 50 55 60 gag ctt gaa gta gca atg ctt gaa gac cgt gct gat att gct gtt cat 240Glu Leu Glu Val Ala Met Leu Glu Asp Arg Ala Asp Ile Ala Val His 65 70 75 80 tca atg aaa gat gtt cct gtc gat ttt cca gaa ggc tta gga tta gaa 288Ser Met Lys Asp Val Pro Val Asp Phe Pro Glu Gly Leu Gly Leu Glu 85 90 95 gtc att tgt cct cgt gaa gat ccc cgt gat gct ttt gtt tct aat acc 336Val Ile Cys Pro Arg Glu Asp Pro Arg Asp Ala Phe Val Ser Asn Thr 100 105 110 atc aaa tca tta agt gat tta cca caa ggc tct att gtt ggc acc tca 384Ile Lys Ser Leu Ser Asp Leu Pro Gln Gly Ser Ile Val Gly Thr Ser 115 120 125 agc tta cgc cgt cag tgt caa tta aaa gca agc cgc cct gat tta gat 432Ser Leu Arg Arg Gln Cys Gln Leu Lys Ala Ser Arg Pro Asp Leu Asp 130 135 140

att cgt gat tta cgt ggc aat gta aat acc cgc cta aga aaa tta gat 480Ile Arg Asp Leu Arg Gly Asn Val Asn Thr Arg Leu Arg Lys Leu Asp 145 150 155 160 gaa ggt cag tac gac gct att ata tta gcc gct gca ggc cta att cgc 528Glu Gly Gln Tyr Asp Ala Ile Ile Leu Ala Ala Ala Gly Leu Ile Arg 165 170 175 tta gaa atg agc gag cgt att gca cag ttt atc gaa cca gaa gaa atg 576Leu Glu Met Ser Glu Arg Ile Ala Gln Phe Ile Glu Pro Glu Glu Met 180 185 190 ctt cct gca aat ggc caa ggc gct gtt ggc att gaa tgt cgt aat gat 624Leu Pro Ala Asn Gly Gln Gly Ala Val Gly Ile Glu Cys Arg Asn Asp 195 200 205 gat gcg aca att aaa gcc tta tta gca cca tta gaa tgt gct acc acc 672Asp Ala Thr Ile Lys Ala Leu Leu Ala Pro Leu Glu Cys Ala Thr Thr 210 215 220 cgt att cgt gtt ctt gca gaa cgt gca atg aat aga gca tta caa ggc 720Arg Ile Arg Val Leu Ala Glu Arg Ala Met Asn Arg Ala Leu Gln Gly 225 230 235 240 ggt tgc cag gtt cct atc ggt agc tat ggt gtt att tct gct gat ggt 768Gly Cys Gln Val Pro Ile Gly Ser Tyr Gly Val Ile Ser Ala Asp Gly 245 250 255 aaa aat atc cac tta cgt ggc tta gtt ggc tct gtc gat ggt agt gaa 816Lys Asn Ile His Leu Arg Gly Leu Val Gly Ser Val Asp Gly Ser Glu 260 265 270 atg ata gaa agt gaa atc acc ggc cct gtt gaa gaa ggt gaa gcg ctc 864Met Ile Glu Ser Glu Ile Thr Gly Pro Val Glu Glu Gly Glu Ala Leu 275 280 285 ggc aat aaa ctc gcg caa gag tta cta agc cga ggt gca gat aaa att 912Gly Asn Lys Leu Ala Gln Glu Leu Leu Ser Arg Gly Ala Asp Lys Ile 290 295 300 tta cag caa gtt tat tca gaa aat gat atc aaa gag agt taa 954Leu Gln Gln Val Tyr Ser Glu Asn Asp Ile Lys Glu Ser 305 310 315 12317PRTArtificial SequenceSynthetic Construct 12Met Lys Gln Thr Thr Val Arg Ile Ala Thr Arg Lys Ser Ala Leu Ala 1 5 10 15 Leu Trp Gln Ala Glu Tyr Val Lys Ala Gln Leu Glu His Phe His Asp 20 25 30 Gly Ile Asn Val Glu Leu Val Pro Met Thr Thr Lys Gly Asp Ile Ile 35 40 45 Leu Asp Thr Pro Leu Ala Lys Val Gly Gly Lys Gly Leu Phe Val Lys 50 55 60 Glu Leu Glu Val Ala Met Leu Glu Asp Arg Ala Asp Ile Ala Val His 65 70 75 80 Ser Met Lys Asp Val Pro Val Asp Phe Pro Glu Gly Leu Gly Leu Glu 85 90 95 Val Ile Cys Pro Arg Glu Asp Pro Arg Asp Ala Phe Val Ser Asn Thr 100 105 110 Ile Lys Ser Leu Ser Asp Leu Pro Gln Gly Ser Ile Val Gly Thr Ser 115 120 125 Ser Leu Arg Arg Gln Cys Gln Leu Lys Ala Ser Arg Pro Asp Leu Asp 130 135 140 Ile Arg Asp Leu Arg Gly Asn Val Asn Thr Arg Leu Arg Lys Leu Asp 145 150 155 160 Glu Gly Gln Tyr Asp Ala Ile Ile Leu Ala Ala Ala Gly Leu Ile Arg 165 170 175 Leu Glu Met Ser Glu Arg Ile Ala Gln Phe Ile Glu Pro Glu Glu Met 180 185 190 Leu Pro Ala Asn Gly Gln Gly Ala Val Gly Ile Glu Cys Arg Asn Asp 195 200 205 Asp Ala Thr Ile Lys Ala Leu Leu Ala Pro Leu Glu Cys Ala Thr Thr 210 215 220 Arg Ile Arg Val Leu Ala Glu Arg Ala Met Asn Arg Ala Leu Gln Gly 225 230 235 240 Gly Cys Gln Val Pro Ile Gly Ser Tyr Gly Val Ile Ser Ala Asp Gly 245 250 255 Lys Asn Ile His Leu Arg Gly Leu Val Gly Ser Val Asp Gly Ser Glu 260 265 270 Met Ile Glu Ser Glu Ile Thr Gly Pro Val Glu Glu Gly Glu Ala Leu 275 280 285 Gly Asn Lys Leu Ala Gln Glu Leu Leu Ser Arg Gly Ala Asp Lys Ile 290 295 300 Leu Gln Gln Val Tyr Ser Glu Asn Asp Ile Lys Glu Ser 305 310 315 13984DNAArtificial SequencefmtCp hybrid 13atg aaa aaa cca cta aat atc att ttt gca ggt act cct gaa ttc gct 48Met Lys Lys Pro Leu Asn Ile Ile Phe Ala Gly Thr Pro Glu Phe Ala 1 5 10 15 gcc caa cat tta gca gcg tta att aat tct gaa cat aat att gtc gcc 96Ala Gln His Leu Ala Ala Leu Ile Asn Ser Glu His Asn Ile Val Ala 20 25 30 gtt tat tgt ccc cct gat aaa cca gct ggc cgc ggt aaa aaa cta aca 144Val Tyr Cys Pro Pro Asp Lys Pro Ala Gly Arg Gly Lys Lys Leu Thr 35 40 45 gct tgt gca aca aag tta ctc gca ata gag cac gac att att gtt gag 192Ala Cys Ala Thr Lys Leu Leu Ala Ile Glu His Asp Ile Ile Val Glu 50 55 60 caa cct att aac ttt aaa aat gag gaa gac caa caa caa tta gcg aaa 240Gln Pro Ile Asn Phe Lys Asn Glu Glu Asp Gln Gln Gln Leu Ala Lys 65 70 75 80 tat aac gct gat atc atg gtt gtt gtt gct tat ggt ctg cta tta cct 288Tyr Asn Ala Asp Ile Met Val Val Val Ala Tyr Gly Leu Leu Leu Pro 85 90 95 gaa gtc att tta aac tct cca cgt tta ggc tgc att aac gta cat ggc 336Glu Val Ile Leu Asn Ser Pro Arg Leu Gly Cys Ile Asn Val His Gly 100 105 110 tca att cta cca aaa tgg cgt ggt gca gca cct att caa cgt tct ctt 384Ser Ile Leu Pro Lys Trp Arg Gly Ala Ala Pro Ile Gln Arg Ser Leu 115 120 125 gaa gct gga gat aag aaa acc ggt gtc acc att atg caa atg gat aaa 432Glu Ala Gly Asp Lys Lys Thr Gly Val Thr Ile Met Gln Met Asp Lys 130 135 140 ggg tta gac acg gga gac atg att cta tcc gct gag tgc gaa ata gaa 480Gly Leu Asp Thr Gly Asp Met Ile Leu Ser Ala Glu Cys Glu Ile Glu 145 150 155 160 aat aca gat acc agt gca agt ctt tat gaa aaa ctt gcc aac tta ggg 528Asn Thr Asp Thr Ser Ala Ser Leu Tyr Glu Lys Leu Ala Asn Leu Gly 165 170 175 cca act gcc tta gtt aat aca tta act att atg gct gaa cct gat tat 576Pro Thr Ala Leu Val Asn Thr Leu Thr Ile Met Ala Glu Pro Asp Tyr 180 185 190 caa gcc agt aat cat aat atc gct caa gat gat gaa tta gcg act tat 624Gln Ala Ser Asn His Asn Ile Ala Gln Asp Asp Glu Leu Ala Thr Tyr 195 200 205 gcc aag aaa ctt gat aaa act gaa gca gag ctt aac tgg caa ttc agt 672Ala Lys Lys Leu Asp Lys Thr Glu Ala Glu Leu Asn Trp Gln Phe Ser 210 215 220 gct gat gaa cta cat cga aaa att cgt gct tat att cct tgg cca gtt 720Ala Asp Glu Leu His Arg Lys Ile Arg Ala Tyr Ile Pro Trp Pro Val 225 230 235 240 gct caa ttt acc ttt aca gaa tct gaa ggt aag cag cat agg tta cgc 768Ala Gln Phe Thr Phe Thr Glu Ser Glu Gly Lys Gln His Arg Leu Arg 245 250 255 ata tgg caa gca tcc gtg caa gaa tat cga ggc aat gct gat cca ggc 816Ile Trp Gln Ala Ser Val Gln Glu Tyr Arg Gly Asn Ala Asp Pro Gly 260 265 270 acg ata ata aag gca gac aaa gaa ggg ata gaa gta gca aca acc agt 864Thr Ile Ile Lys Ala Asp Lys Glu Gly Ile Glu Val Ala Thr Thr Ser 275 280 285 ggt tcg tta cga cta gaa gtc att caa ctt cca ggg aaa aaa gca tta 912Gly Ser Leu Arg Leu Glu Val Ile Gln Leu Pro Gly Lys Lys Ala Leu 290 295 300 gcc gta aaa gac atc cta aat ggt cgc agc gat tgg ttc gtt gtt ggc 960Ala Val Lys Asp Ile Leu Asn Gly Arg Ser Asp Trp Phe Val Val Gly 305 310 315 320 agc act att aac aag cta gga taa 984Ser Thr Ile Asn Lys Leu Gly 325 14327PRTArtificial SequenceSynthetic Construct 14Met Lys Lys Pro Leu Asn Ile Ile Phe Ala Gly Thr Pro Glu Phe Ala 1 5 10 15 Ala Gln His Leu Ala Ala Leu Ile Asn Ser Glu His Asn Ile Val Ala 20 25 30 Val Tyr Cys Pro Pro Asp Lys Pro Ala Gly Arg Gly Lys Lys Leu Thr 35 40 45 Ala Cys Ala Thr Lys Leu Leu Ala Ile Glu His Asp Ile Ile Val Glu 50 55 60 Gln Pro Ile Asn Phe Lys Asn Glu Glu Asp Gln Gln Gln Leu Ala Lys 65 70 75 80 Tyr Asn Ala Asp Ile Met Val Val Val Ala Tyr Gly Leu Leu Leu Pro 85 90 95 Glu Val Ile Leu Asn Ser Pro Arg Leu Gly Cys Ile Asn Val His Gly 100 105 110 Ser Ile Leu Pro Lys Trp Arg Gly Ala Ala Pro Ile Gln Arg Ser Leu 115 120 125 Glu Ala Gly Asp Lys Lys Thr Gly Val Thr Ile Met Gln Met Asp Lys 130 135 140 Gly Leu Asp Thr Gly Asp Met Ile Leu Ser Ala Glu Cys Glu Ile Glu 145 150 155 160 Asn Thr Asp Thr Ser Ala Ser Leu Tyr Glu Lys Leu Ala Asn Leu Gly 165 170 175 Pro Thr Ala Leu Val Asn Thr Leu Thr Ile Met Ala Glu Pro Asp Tyr 180 185 190 Gln Ala Ser Asn His Asn Ile Ala Gln Asp Asp Glu Leu Ala Thr Tyr 195 200 205 Ala Lys Lys Leu Asp Lys Thr Glu Ala Glu Leu Asn Trp Gln Phe Ser 210 215 220 Ala Asp Glu Leu His Arg Lys Ile Arg Ala Tyr Ile Pro Trp Pro Val 225 230 235 240 Ala Gln Phe Thr Phe Thr Glu Ser Glu Gly Lys Gln His Arg Leu Arg 245 250 255 Ile Trp Gln Ala Ser Val Gln Glu Tyr Arg Gly Asn Ala Asp Pro Gly 260 265 270 Thr Ile Ile Lys Ala Asp Lys Glu Gly Ile Glu Val Ala Thr Thr Ser 275 280 285 Gly Ser Leu Arg Leu Glu Val Ile Gln Leu Pro Gly Lys Lys Ala Leu 290 295 300 Ala Val Lys Asp Ile Leu Asn Gly Arg Ser Asp Trp Phe Val Val Gly 305 310 315 320 Ser Thr Ile Asn Lys Leu Gly 325 151164DNAArtificial SequencemurGCp hybrid 15atg agt cta aat cat ggc caa ggt aat aaa gat tta gca aaa act ttg 48Met Ser Leu Asn His Gly Gln Gly Asn Lys Asp Leu Ala Lys Thr Leu 1 5 10 15 tta gtc atg gct ggt ggc acc ggt gga cat ata ttc cct ggt att gcg 96Leu Val Met Ala Gly Gly Thr Gly Gly His Ile Phe Pro Gly Ile Ala 20 25 30 gtc gcc gat gag ctg aaa gcg caa gga tgg aaa atc cat tgg ttg gga 144Val Ala Asp Glu Leu Lys Ala Gln Gly Trp Lys Ile His Trp Leu Gly 35 40 45 act gcc gat cgt atg gaa gct caa att gta cct atg cat ggt tat gat 192Thr Ala Asp Arg Met Glu Ala Gln Ile Val Pro Met His Gly Tyr Asp 50 55 60 att tcg ttt atc aat ata agt ggt ctg cgt ggt aaa aat cta tta aca 240Ile Ser Phe Ile Asn Ile Ser Gly Leu Arg Gly Lys Asn Leu Leu Thr 65 70 75 80 acg ctt gtt atg cct ttt aaa ttg tta agg tcg ctt ttt caa gcg aga 288Thr Leu Val Met Pro Phe Lys Leu Leu Arg Ser Leu Phe Gln Ala Arg 85 90 95 cgc gtg att aaa aca gtg aaa cct gat gtt gtt ata ggc atg ggt ggc 336Arg Val Ile Lys Thr Val Lys Pro Asp Val Val Ile Gly Met Gly Gly 100 105 110 tat gca agt gct ccg ggt ggt ttg gcc gct tgg cta agt aaa ata ccg 384Tyr Ala Ser Ala Pro Gly Gly Leu Ala Ala Trp Leu Ser Lys Ile Pro 115 120 125 cta atc gtt cat gaa caa aat gct gct gcc gga tta agt aat cgc ttg 432Leu Ile Val His Glu Gln Asn Ala Ala Ala Gly Leu Ser Asn Arg Leu 130 135 140 tta gcg cgt atc gcc aat aaa gta tgc tgc gcc ttt cct aat gca ttt 480Leu Ala Arg Ile Ala Asn Lys Val Cys Cys Ala Phe Pro Asn Ala Phe 145 150 155 160 gtt agc gga att gat gtt gaa gtg gtt ggt aat cct tta cgc gcg tca 528Val Ser Gly Ile Asp Val Glu Val Val Gly Asn Pro Leu Arg Ala Ser 165 170 175 atc ggt cag caa gca ctg gtt tca gaa aat ata gat caa agc cac gaa 576Ile Gly Gln Gln Ala Leu Val Ser Glu Asn Ile Asp Gln Ser His Glu 180 185 190 ggt agt aaa aat att cta gtg gta ggt ggt agt tta ggc gct caa gtc 624Gly Ser Lys Asn Ile Leu Val Val Gly Gly Ser Leu Gly Ala Gln Val 195 200 205 tta aat aag gtg atg ccg gat agc ttt aag gat tta tca gaa agt gat 672Leu Asn Lys Val Met Pro Asp Ser Phe Lys Asp Leu Ser Glu Ser Asp 210 215 220 gag aaa tat tgt ata tgg cac caa acg ggc gac aat aac caa gca cta 720Glu Lys Tyr Cys Ile Trp His Gln Thr Gly Asp Asn Asn Gln Ala Leu 225 230 235 240 gtc acc gca tct tat aaa cag gaa tat att gat act gga aaa gtg aga 768Val Thr Ala Ser Tyr Lys Gln Glu Tyr Ile Asp Thr Gly Lys Val Arg 245 250 255 gtt acc gaa ttt att act gat att gct gct gca tat cag tgg gct gat 816Val Thr Glu Phe Ile Thr Asp Ile Ala Ala Ala Tyr Gln Trp Ala Asp 260 265 270 ata gtg att tgt cgt gcg gga gcg cta acc gtt tca gaa tta gcc atg 864Ile Val Ile Cys Arg Ala Gly Ala Leu Thr Val Ser Glu Leu Ala Met 275 280 285 gca gca aca cca gcc att ttt gta cca cta ccg cat gca gta gat gat 912Ala Ala Thr Pro Ala Ile Phe Val Pro Leu Pro His Ala Val Asp Asp 290 295 300 cat caa aca aaa aat gcg ttg tac ctc gta aag cga gat gca gca aag 960His Gln Thr Lys Asn Ala Leu Tyr Leu Val Lys Arg Asp Ala Ala Lys 305 310 315 320 tta ttg cca cag gca gaa cta aat aat gag agt atc acg tcg tta ata 1008Leu Leu Pro Gln Ala Glu Leu Asn Asn Glu Ser Ile Thr Ser Leu Ile 325 330 335 atc gag ctg ttt gat cag cct caa act tta gct gac atg gct aaa gct 1056Ile Glu Leu Phe Asp Gln Pro Gln Thr Leu Ala Asp Met Ala Lys Ala 340 345 350 tct ttg agt gct gca act agt gat gca agt cag aaa gta gca aaa ttg 1104Ser Leu Ser Ala Ala Thr Ser Asp Ala Ser Gln Lys Val Ala Lys Leu 355 360 365 tgc caa cag ctt tca ata tcg aat ggc gca aaa ctt aga aat aat gaa 1152Cys Gln Gln Leu Ser Ile Ser Asn Gly Ala Lys Leu Arg Asn Asn Glu 370 375 380 gag aac aaa taa 1164Glu Asn Lys 385 16387PRTArtificial SequenceSynthetic Construct 16Met Ser Leu Asn His Gly Gln Gly Asn Lys Asp Leu Ala Lys Thr Leu 1 5 10 15 Leu Val Met Ala Gly Gly Thr Gly Gly His Ile Phe Pro Gly Ile Ala 20 25 30 Val Ala Asp Glu Leu Lys Ala Gln Gly Trp Lys Ile His Trp Leu Gly 35 40 45 Thr Ala Asp Arg Met Glu Ala Gln Ile Val Pro Met His Gly Tyr Asp 50 55 60 Ile Ser Phe Ile Asn Ile Ser Gly Leu Arg Gly Lys Asn Leu Leu Thr 65 70

75 80 Thr Leu Val Met Pro Phe Lys Leu Leu Arg Ser Leu Phe Gln Ala Arg 85 90 95 Arg Val Ile Lys Thr Val Lys Pro Asp Val Val Ile Gly Met Gly Gly 100 105 110 Tyr Ala Ser Ala Pro Gly Gly Leu Ala Ala Trp Leu Ser Lys Ile Pro 115 120 125 Leu Ile Val His Glu Gln Asn Ala Ala Ala Gly Leu Ser Asn Arg Leu 130 135 140 Leu Ala Arg Ile Ala Asn Lys Val Cys Cys Ala Phe Pro Asn Ala Phe 145 150 155 160 Val Ser Gly Ile Asp Val Glu Val Val Gly Asn Pro Leu Arg Ala Ser 165 170 175 Ile Gly Gln Gln Ala Leu Val Ser Glu Asn Ile Asp Gln Ser His Glu 180 185 190 Gly Ser Lys Asn Ile Leu Val Val Gly Gly Ser Leu Gly Ala Gln Val 195 200 205 Leu Asn Lys Val Met Pro Asp Ser Phe Lys Asp Leu Ser Glu Ser Asp 210 215 220 Glu Lys Tyr Cys Ile Trp His Gln Thr Gly Asp Asn Asn Gln Ala Leu 225 230 235 240 Val Thr Ala Ser Tyr Lys Gln Glu Tyr Ile Asp Thr Gly Lys Val Arg 245 250 255 Val Thr Glu Phe Ile Thr Asp Ile Ala Ala Ala Tyr Gln Trp Ala Asp 260 265 270 Ile Val Ile Cys Arg Ala Gly Ala Leu Thr Val Ser Glu Leu Ala Met 275 280 285 Ala Ala Thr Pro Ala Ile Phe Val Pro Leu Pro His Ala Val Asp Asp 290 295 300 His Gln Thr Lys Asn Ala Leu Tyr Leu Val Lys Arg Asp Ala Ala Lys 305 310 315 320 Leu Leu Pro Gln Ala Glu Leu Asn Asn Glu Ser Ile Thr Ser Leu Ile 325 330 335 Ile Glu Leu Phe Asp Gln Pro Gln Thr Leu Ala Asp Met Ala Lys Ala 340 345 350 Ser Leu Ser Ala Ala Thr Ser Asp Ala Ser Gln Lys Val Ala Lys Leu 355 360 365 Cys Gln Gln Leu Ser Ile Ser Asn Gly Ala Lys Leu Arg Asn Asn Glu 370 375 380 Glu Asn Lys 385 172067DNAArtificial SequenceCodon optimized ligACp optimized for M. tuberculosis 17gtg agc gag aag gag aag aaa ata tcc cag ctg caa cag caa ctg aac 48Val Ser Glu Lys Glu Lys Lys Ile Ser Gln Leu Gln Gln Gln Leu Asn 1 5 10 15 caa tat aac cat gag tac tat gtc ctc gac cag cca tcg gtc ccc gat 96Gln Tyr Asn His Glu Tyr Tyr Val Leu Asp Gln Pro Ser Val Pro Asp 20 25 30 gcg gag tac gat cgc ctg atg acc gcg tta atc gat ctg gaa aag acc 144Ala Glu Tyr Asp Arg Leu Met Thr Ala Leu Ile Asp Leu Glu Lys Thr 35 40 45 aac ccg gag ttg aag acg atc gac agt ccg tcg cag aag gtg ggc ggt 192Asn Pro Glu Leu Lys Thr Ile Asp Ser Pro Ser Gln Lys Val Gly Gly 50 55 60 cag gcc ctg aag agc ttc acc caa gtg acg cat cag ctg ccc atg ctc 240Gln Ala Leu Lys Ser Phe Thr Gln Val Thr His Gln Leu Pro Met Leu 65 70 75 80 tcg ctt gac aac gtc ttt tcg ctg gat gac ttc cac gcc ttc gtc aag 288Ser Leu Asp Asn Val Phe Ser Leu Asp Asp Phe His Ala Phe Val Lys 85 90 95 agg gtc aaa gac cgt ctc aat gac aac cag gcg atc gtg ttc tgt gcc 336Arg Val Lys Asp Arg Leu Asn Asp Asn Gln Ala Ile Val Phe Cys Ala 100 105 110 gag ccg aag ctg gac ggc ctc gcg gta tcg ctc cgc tac gag cat ggc 384Glu Pro Lys Leu Asp Gly Leu Ala Val Ser Leu Arg Tyr Glu His Gly 115 120 125 cag ctc atc cag gcg gcc acg cgg ggc gac ggc tca gtc ggg gag aat 432Gln Leu Ile Gln Ala Ala Thr Arg Gly Asp Gly Ser Val Gly Glu Asn 130 135 140 atc acc acg aac atc cgg acg atc aag tcc atc ccc ctg aag ctc atg 480Ile Thr Thr Asn Ile Arg Thr Ile Lys Ser Ile Pro Leu Lys Leu Met 145 150 155 160 ggc act ccc ggc aaa gac ttt cca gac att gtg gaa gtc cgg ggc gaa 528Gly Thr Pro Gly Lys Asp Phe Pro Asp Ile Val Glu Val Arg Gly Glu 165 170 175 gtg ttc atg ccg aag gcc tcg ttc gac gcg ctg aac acc ctg gct aag 576Val Phe Met Pro Lys Ala Ser Phe Asp Ala Leu Asn Thr Leu Ala Lys 180 185 190 aaa cgg ggg gag aag ggc ttc gct aac ccg cgg aac gcg gca gcc ggc 624Lys Arg Gly Glu Lys Gly Phe Ala Asn Pro Arg Asn Ala Ala Ala Gly 195 200 205 agt ctg cgt cag ctg gac agc aag atc acg gcc aag cgc aac ctg gcg 672Ser Leu Arg Gln Leu Asp Ser Lys Ile Thr Ala Lys Arg Asn Leu Ala 210 215 220 ttc tat gcc tac agc cta ggt ttc gtg ggg aaa ctg agc gac ggg ggc 720Phe Tyr Ala Tyr Ser Leu Gly Phe Val Gly Lys Leu Ser Asp Gly Gly 225 230 235 240 gcg gaa agc acc gac ttg acg aac gac ttt ttc gcg aac tcg cac cat 768Ala Glu Ser Thr Asp Leu Thr Asn Asp Phe Phe Ala Asn Ser His His 245 250 255 gag cga ttg tgt caa ttg aag cga ctg ggt ttg ccg atg tgt ccg gag 816Glu Arg Leu Cys Gln Leu Lys Arg Leu Gly Leu Pro Met Cys Pro Glu 260 265 270 gtg cgg ctg cta gaa tcg gag cag gct tgc gac gcg ttc tac cag gac 864Val Arg Leu Leu Glu Ser Glu Gln Ala Cys Asp Ala Phe Tyr Gln Asp 275 280 285 atc ctc gcg aag cgc tcg gct ctt tca tac gaa atc gac ggt acc gtt 912Ile Leu Ala Lys Arg Ser Ala Leu Ser Tyr Glu Ile Asp Gly Thr Val 290 295 300 ttg aag gtt gac gag atc tcc ctc cag aag cgc ctg ggt ttc gtg gcg 960Leu Lys Val Asp Glu Ile Ser Leu Gln Lys Arg Leu Gly Phe Val Ala 305 310 315 320 cgg gca ccg cgc tgg gcc atc gcc tac aag ttc cca gca gag gaa gag 1008Arg Ala Pro Arg Trp Ala Ile Ala Tyr Lys Phe Pro Ala Glu Glu Glu 325 330 335 ctg acc tgc gtg gag gac gta gaa ttt caa gtg ggc cgc acc ggt gcc 1056Leu Thr Cys Val Glu Asp Val Glu Phe Gln Val Gly Arg Thr Gly Ala 340 345 350 atc acc ccg gtt gcc cgc ctg aag cct gtc ttc gtg ggc ggt gtc acc 1104Ile Thr Pro Val Ala Arg Leu Lys Pro Val Phe Val Gly Gly Val Thr 355 360 365 gtg agc aac gct acc ctt cat aac cag gac gag atc aca cgt ctg ggg 1152Val Ser Asn Ala Thr Leu His Asn Gln Asp Glu Ile Thr Arg Leu Gly 370 375 380 ctg aag gtc aac gat ttc gtc gtg att cgc cgg gca ggc gac gtt att 1200Leu Lys Val Asn Asp Phe Val Val Ile Arg Arg Ala Gly Asp Val Ile 385 390 395 400 ccg cag atc gtg tcg gtg gtc ctg gac aaa agg ccg gat aac gcc gtc 1248Pro Gln Ile Val Ser Val Val Leu Asp Lys Arg Pro Asp Asn Ala Val 405 410 415 gat atc gtc ttc ccc acg tcg tgc ccg gtg tgc gac tcg gcc gtg gcc 1296Asp Ile Val Phe Pro Thr Ser Cys Pro Val Cys Asp Ser Ala Val Ala 420 425 430 aag ccc gaa ggc gag gca gtc ctg cgg tgc aca gcc ggg ctc ttc tgt 1344Lys Pro Glu Gly Glu Ala Val Leu Arg Cys Thr Ala Gly Leu Phe Cys 435 440 445 gcg gcc cag cgc aag gaa gcc atc aag cac ttc gcc tcc cgc aag gcc 1392Ala Ala Gln Arg Lys Glu Ala Ile Lys His Phe Ala Ser Arg Lys Ala 450 455 460 cac gac gtc gac gga ctg ggc gac aag ctc gtc gag cag ctt gta gac 1440His Asp Val Asp Gly Leu Gly Asp Lys Leu Val Glu Gln Leu Val Asp 465 470 475 480 gag aag ctg atc aac acc ccc gcg gat ctg ttc aag ctc acc gaa atc 1488Glu Lys Leu Ile Asn Thr Pro Ala Asp Leu Phe Lys Leu Thr Glu Ile 485 490 495 cag gtg agt acc att gac aga atg gga aag aag tct gcc acc aac ctg 1536Gln Val Ser Thr Ile Asp Arg Met Gly Lys Lys Ser Ala Thr Asn Leu 500 505 510 ata aat ggt ctg gag cag gcg aag agc act acg ctg gcg aag ttc att 1584Ile Asn Gly Leu Glu Gln Ala Lys Ser Thr Thr Leu Ala Lys Phe Ile 515 520 525 tac ggc ctg ggg atc cgg gaa gtg gga gag gcc acg gcc gcg aac ctg 1632Tyr Gly Leu Gly Ile Arg Glu Val Gly Glu Ala Thr Ala Ala Asn Leu 530 535 540 gcc aac cac ttc tac acc ctc gcc gcg atc gag agc gcc agc ttg gag 1680Ala Asn His Phe Tyr Thr Leu Ala Ala Ile Glu Ser Ala Ser Leu Glu 545 550 555 560 gat ctg cag aac gta tcc gac gtg ggt gag gtc gtg gca aag aac atc 1728Asp Leu Gln Asn Val Ser Asp Val Gly Glu Val Val Ala Lys Asn Ile 565 570 575 att aat ttc ttc aag gaa gag cac aac ctg gcg atc gtc agc ggt ttg 1776Ile Asn Phe Phe Lys Glu Glu His Asn Leu Ala Ile Val Ser Gly Leu 580 585 590 agc gaa gtg atg cac tgg ccc acc atc gag atc aag tcg gcc gag gag 1824Ser Glu Val Met His Trp Pro Thr Ile Glu Ile Lys Ser Ala Glu Glu 595 600 605 ctt cct ctg gcg gag cag atc ttc gtc ctc acc gga act ctc acc cag 1872Leu Pro Leu Ala Glu Gln Ile Phe Val Leu Thr Gly Thr Leu Thr Gln 610 615 620 atg ggc cgc acg gag gcg aag acc gcc ttg caa tcc ctg ggc gct aag 1920Met Gly Arg Thr Glu Ala Lys Thr Ala Leu Gln Ser Leu Gly Ala Lys 625 630 635 640 gtc tcg ggc tcc gtc tcc aag aac acc cac ttc gtg gtt gcg ggc gac 1968Val Ser Gly Ser Val Ser Lys Asn Thr His Phe Val Val Ala Gly Asp 645 650 655 aag gct ggc agc aag ctg acg aag gcg cag gac ctc ggc atc tca gtc 2016Lys Ala Gly Ser Lys Leu Thr Lys Ala Gln Asp Leu Gly Ile Ser Val 660 665 670 ctg aca gag gat ggc ctg gtc gcc ctg ctg gca gag cac ggc atc acc 2064Leu Thr Glu Asp Gly Leu Val Ala Leu Leu Ala Glu His Gly Ile Thr 675 680 685 att 2067Ile 18689PRTArtificial SequenceSynthetic Construct 18Val Ser Glu Lys Glu Lys Lys Ile Ser Gln Leu Gln Gln Gln Leu Asn 1 5 10 15 Gln Tyr Asn His Glu Tyr Tyr Val Leu Asp Gln Pro Ser Val Pro Asp 20 25 30 Ala Glu Tyr Asp Arg Leu Met Thr Ala Leu Ile Asp Leu Glu Lys Thr 35 40 45 Asn Pro Glu Leu Lys Thr Ile Asp Ser Pro Ser Gln Lys Val Gly Gly 50 55 60 Gln Ala Leu Lys Ser Phe Thr Gln Val Thr His Gln Leu Pro Met Leu 65 70 75 80 Ser Leu Asp Asn Val Phe Ser Leu Asp Asp Phe His Ala Phe Val Lys 85 90 95 Arg Val Lys Asp Arg Leu Asn Asp Asn Gln Ala Ile Val Phe Cys Ala 100 105 110 Glu Pro Lys Leu Asp Gly Leu Ala Val Ser Leu Arg Tyr Glu His Gly 115 120 125 Gln Leu Ile Gln Ala Ala Thr Arg Gly Asp Gly Ser Val Gly Glu Asn 130 135 140 Ile Thr Thr Asn Ile Arg Thr Ile Lys Ser Ile Pro Leu Lys Leu Met 145 150 155 160 Gly Thr Pro Gly Lys Asp Phe Pro Asp Ile Val Glu Val Arg Gly Glu 165 170 175 Val Phe Met Pro Lys Ala Ser Phe Asp Ala Leu Asn Thr Leu Ala Lys 180 185 190 Lys Arg Gly Glu Lys Gly Phe Ala Asn Pro Arg Asn Ala Ala Ala Gly 195 200 205 Ser Leu Arg Gln Leu Asp Ser Lys Ile Thr Ala Lys Arg Asn Leu Ala 210 215 220 Phe Tyr Ala Tyr Ser Leu Gly Phe Val Gly Lys Leu Ser Asp Gly Gly 225 230 235 240 Ala Glu Ser Thr Asp Leu Thr Asn Asp Phe Phe Ala Asn Ser His His 245 250 255 Glu Arg Leu Cys Gln Leu Lys Arg Leu Gly Leu Pro Met Cys Pro Glu 260 265 270 Val Arg Leu Leu Glu Ser Glu Gln Ala Cys Asp Ala Phe Tyr Gln Asp 275 280 285 Ile Leu Ala Lys Arg Ser Ala Leu Ser Tyr Glu Ile Asp Gly Thr Val 290 295 300 Leu Lys Val Asp Glu Ile Ser Leu Gln Lys Arg Leu Gly Phe Val Ala 305 310 315 320 Arg Ala Pro Arg Trp Ala Ile Ala Tyr Lys Phe Pro Ala Glu Glu Glu 325 330 335 Leu Thr Cys Val Glu Asp Val Glu Phe Gln Val Gly Arg Thr Gly Ala 340 345 350 Ile Thr Pro Val Ala Arg Leu Lys Pro Val Phe Val Gly Gly Val Thr 355 360 365 Val Ser Asn Ala Thr Leu His Asn Gln Asp Glu Ile Thr Arg Leu Gly 370 375 380 Leu Lys Val Asn Asp Phe Val Val Ile Arg Arg Ala Gly Asp Val Ile 385 390 395 400 Pro Gln Ile Val Ser Val Val Leu Asp Lys Arg Pro Asp Asn Ala Val 405 410 415 Asp Ile Val Phe Pro Thr Ser Cys Pro Val Cys Asp Ser Ala Val Ala 420 425 430 Lys Pro Glu Gly Glu Ala Val Leu Arg Cys Thr Ala Gly Leu Phe Cys 435 440 445 Ala Ala Gln Arg Lys Glu Ala Ile Lys His Phe Ala Ser Arg Lys Ala 450 455 460 His Asp Val Asp Gly Leu Gly Asp Lys Leu Val Glu Gln Leu Val Asp 465 470 475 480 Glu Lys Leu Ile Asn Thr Pro Ala Asp Leu Phe Lys Leu Thr Glu Ile 485 490 495 Gln Val Ser Thr Ile Asp Arg Met Gly Lys Lys Ser Ala Thr Asn Leu 500 505 510 Ile Asn Gly Leu Glu Gln Ala Lys Ser Thr Thr Leu Ala Lys Phe Ile 515 520 525 Tyr Gly Leu Gly Ile Arg Glu Val Gly Glu Ala Thr Ala Ala Asn Leu 530 535 540 Ala Asn His Phe Tyr Thr Leu Ala Ala Ile Glu Ser Ala Ser Leu Glu 545 550 555 560 Asp Leu Gln Asn Val Ser Asp Val Gly Glu Val Val Ala Lys Asn Ile 565 570 575 Ile Asn Phe Phe Lys Glu Glu His Asn Leu Ala Ile Val Ser Gly Leu 580 585 590 Ser Glu Val Met His Trp Pro Thr Ile Glu Ile Lys Ser Ala Glu Glu 595 600 605 Leu Pro Leu Ala Glu Gln Ile Phe Val Leu Thr Gly Thr Leu Thr Gln 610 615 620 Met Gly Arg Thr Glu Ala Lys Thr Ala Leu Gln Ser Leu Gly Ala Lys 625 630 635 640 Val Ser Gly Ser Val Ser Lys Asn Thr His Phe Val Val Ala Gly Asp 645 650 655 Lys Ala Gly Ser Lys Leu Thr Lys Ala Gln Asp Leu Gly Ile Ser Val 660 665 670 Leu Thr Glu Asp Gly Leu Val Ala Leu Leu Ala Glu His Gly Ile Thr 675 680 685 Ile 191917DNAColwellia psychroerythrusCDS(1)..(1917)misc_feature(556)..(556)s at location 556 is g or c 19atg gga aaa att att ggt att gac cta gga aca act aac tca tgt gtt 48Met Gly Lys Ile Ile Gly Ile Asp Leu Gly Thr Thr Asn Ser Cys Val 1 5 10 15 gct gtt tta gat ggc gac agt gta cgt gtt att gaa aat gca gaa ggc 96Ala Val Leu Asp Gly Asp Ser Val Arg Val Ile Glu Asn Ala Glu Gly 20 25 30 gat cgt aca act cct tct att att ggt tat aca gcc gaa ggc gaa aca 144Asp Arg Thr Thr Pro Ser Ile Ile Gly Tyr Thr Ala Glu Gly Glu Thr 35 40 45

tta gta ggt caa cct gct aag cgt caa tct gta act aac cca gaa aac 192Leu Val Gly Gln Pro Ala Lys Arg Gln Ser Val Thr Asn Pro Glu Asn 50 55 60 act tta tat gca att aaa cgc tta atc ggt cgt cgt ttc gaa gat aaa 240Thr Leu Tyr Ala Ile Lys Arg Leu Ile Gly Arg Arg Phe Glu Asp Lys 65 70 75 80 gaa aca caa cgt gac atc gat att atg cca ttt ggt att gtt aaa gcg 288Glu Thr Gln Arg Asp Ile Asp Ile Met Pro Phe Gly Ile Val Lys Ala 85 90 95 gat aac ggt gat gct tgg gtt caa gta aaa ggc gag aaa att gct ccg 336Asp Asn Gly Asp Ala Trp Val Gln Val Lys Gly Glu Lys Ile Ala Pro 100 105 110 cca caa gtt tca gct gaa gtt ctt aag aaa atg aaa aag act gct gaa 384Pro Gln Val Ser Ala Glu Val Leu Lys Lys Met Lys Lys Thr Ala Glu 115 120 125 gac ttc tta ggt gaa acc gta act gaa gct gtt att act gta cct gct 432Asp Phe Leu Gly Glu Thr Val Thr Glu Ala Val Ile Thr Val Pro Ala 130 135 140 tac ttt aac gat tca caa cgc caa gca acg aaa gat gct ggt cgt att 480Tyr Phe Asn Asp Ser Gln Arg Gln Ala Thr Lys Asp Ala Gly Arg Ile 145 150 155 160 gct ggt ctt gaa gtc aaa cgt att atc aac gaa cct act gct gct gcc 528Ala Gly Leu Glu Val Lys Arg Ile Ile Asn Glu Pro Thr Ala Ala Ala 165 170 175 ctt gct tac ggc atg gac aaa caa gaa sgt gac aaa gtt gtt gca gtt 576Leu Ala Tyr Gly Met Asp Lys Gln Glu Xaa Asp Lys Val Val Ala Val 180 185 190 tac gat tta ggt ggt ggt aca ttc gat att tca atc att gaa att gat 624Tyr Asp Leu Gly Gly Gly Thr Phe Asp Ile Ser Ile Ile Glu Ile Asp 195 200 205 gaa atg gat ggc gaa cac act ttt gaa gta tta gcg act aac ggt gat 672Glu Met Asp Gly Glu His Thr Phe Glu Val Leu Ala Thr Asn Gly Asp 210 215 220 act cac tta ggt ggt gaa gat ttt gat aac cgt tta atc aac tac ctt 720Thr His Leu Gly Gly Glu Asp Phe Asp Asn Arg Leu Ile Asn Tyr Leu 225 230 235 240 gta gct gaa ttc aaa aaa gac caa ggc atg gac tta acg tct gat cct 768Val Ala Glu Phe Lys Lys Asp Gln Gly Met Asp Leu Thr Ser Asp Pro 245 250 255 tta gca atg cag cgt tta aaa gaa gca gca gaa aaa gct aaa tgt gaa 816Leu Ala Met Gln Arg Leu Lys Glu Ala Ala Glu Lys Ala Lys Cys Glu 260 265 270 ctt tct tca gca caa caa aca gat gta aac tta cct tac atc act gct 864Leu Ser Ser Ala Gln Gln Thr Asp Val Asn Leu Pro Tyr Ile Thr Ala 275 280 285 gat ggt tca ggt cct aag cac atg aac atc aaa gtg act cgt gct aag 912Asp Gly Ser Gly Pro Lys His Met Asn Ile Lys Val Thr Arg Ala Lys 290 295 300 tta gaa tca cta gtt gaa gat atg gtt aaa gca aca tta gag ccg ctt 960Leu Glu Ser Leu Val Glu Asp Met Val Lys Ala Thr Leu Glu Pro Leu 305 310 315 320 aaa caa gcg ctt aaa gat gca gac tta tca gta agc aag att gat gat 1008Lys Gln Ala Leu Lys Asp Ala Asp Leu Ser Val Ser Lys Ile Asp Asp 325 330 335 gtt att tta gtt ggt ggt caa tct cgt atg cca cta gtt caa aaa act 1056Val Ile Leu Val Gly Gly Gln Ser Arg Met Pro Leu Val Gln Lys Thr 340 345 350 gtt act gat ttc ttc ggt aaa gag cca cgt aaa gat gtt aac cct gat 1104Val Thr Asp Phe Phe Gly Lys Glu Pro Arg Lys Asp Val Asn Pro Asp 355 360 365 gaa gca gta gct tct ggt gcg gcg att caa gcg ggt gtt ctt tct ggt 1152Glu Ala Val Ala Ser Gly Ala Ala Ile Gln Ala Gly Val Leu Ser Gly 370 375 380 gat gtg act gac gtt ctt tta tta gac gtt aca cca cta tca tta ggt 1200Asp Val Thr Asp Val Leu Leu Leu Asp Val Thr Pro Leu Ser Leu Gly 385 390 395 400 atc gaa act atg ggc ggt gtg atg act aag gtt atc gac aaa aac act 1248Ile Glu Thr Met Gly Gly Val Met Thr Lys Val Ile Asp Lys Asn Thr 405 410 415 act atc cca act aag caa tca caa act ttc tct aca gct gat gat aac 1296Thr Ile Pro Thr Lys Gln Ser Gln Thr Phe Ser Thr Ala Asp Asp Asn 420 425 430 caa gct gca gta act gtt cat gtt tgt cag ggt gag cgt aag caa gct 1344Gln Ala Ala Val Thr Val His Val Cys Gln Gly Glu Arg Lys Gln Ala 435 440 445 tca gca aac aaa tct tta ggt caa ttt aac ctt gaa ggt att gaa cca 1392Ser Ala Asn Lys Ser Leu Gly Gln Phe Asn Leu Glu Gly Ile Glu Pro 450 455 460 gca caa cgt ggt aca cca caa atc gaa gta act ttt gat att gat gct 1440Ala Gln Arg Gly Thr Pro Gln Ile Glu Val Thr Phe Asp Ile Asp Ala 465 470 475 480 gat ggt atc ttg cac gtt acg gct aaa gat aag aat act ggt aaa gag 1488Asp Gly Ile Leu His Val Thr Ala Lys Asp Lys Asn Thr Gly Lys Glu 485 490 495 caa aaa atc act atc aaa gcc tct tct ggt tta tct gat gaa gaa gta 1536Gln Lys Ile Thr Ile Lys Ala Ser Ser Gly Leu Ser Asp Glu Glu Val 500 505 510 gag cag atg gta cgt gat gca gaa gct aac gct gat gct gat gct aaa 1584Glu Gln Met Val Arg Asp Ala Glu Ala Asn Ala Asp Ala Asp Ala Lys 515 520 525 ttt gaa gag cta gta act gca cgt aat caa gct gat ggc atg att cac 1632Phe Glu Glu Leu Val Thr Ala Arg Asn Gln Ala Asp Gly Met Ile His 530 535 540 gcg act cgc aag caa gtt gaa gaa gca ggc gaa gag tta cca agc gaa 1680Ala Thr Arg Lys Gln Val Glu Glu Ala Gly Glu Glu Leu Pro Ser Glu 545 550 555 560 gat aaa gaa aaa att gaa gca gca tta act gag ctt gaa gaa gca gtt 1728Asp Lys Glu Lys Ile Glu Ala Ala Leu Thr Glu Leu Glu Glu Ala Val 565 570 575 aaa ggt gat gat aaa gaa gta att gaa gct aaa act caa gca ctt atg 1776Lys Gly Asp Asp Lys Glu Val Ile Glu Ala Lys Thr Gln Ala Leu Met 580 585 590 gaa gca tca gct aag tta atg gaa att gct caa gct aaa gaa caa gct 1824Glu Ala Ser Ala Lys Leu Met Glu Ile Ala Gln Ala Lys Glu Gln Ala 595 600 605 caa agc gct cct gaa ggt gct caa gaa gct gac gca gct cct gca gac 1872Gln Ser Ala Pro Glu Gly Ala Gln Glu Ala Asp Ala Ala Pro Ala Asp 610 615 620 gat gtt gtt gat gct gag ttt gaa gaa gtt aaa gac aaa aaa taa 1917Asp Val Val Asp Ala Glu Phe Glu Glu Val Lys Asp Lys Lys 625 630 635 20638PRTColwellia psychroerythrusmisc_feature(186)..(186)The 'Xaa' at location 186 stands for Gly, or Arg. 20Met Gly Lys Ile Ile Gly Ile Asp Leu Gly Thr Thr Asn Ser Cys Val 1 5 10 15 Ala Val Leu Asp Gly Asp Ser Val Arg Val Ile Glu Asn Ala Glu Gly 20 25 30 Asp Arg Thr Thr Pro Ser Ile Ile Gly Tyr Thr Ala Glu Gly Glu Thr 35 40 45 Leu Val Gly Gln Pro Ala Lys Arg Gln Ser Val Thr Asn Pro Glu Asn 50 55 60 Thr Leu Tyr Ala Ile Lys Arg Leu Ile Gly Arg Arg Phe Glu Asp Lys 65 70 75 80 Glu Thr Gln Arg Asp Ile Asp Ile Met Pro Phe Gly Ile Val Lys Ala 85 90 95 Asp Asn Gly Asp Ala Trp Val Gln Val Lys Gly Glu Lys Ile Ala Pro 100 105 110 Pro Gln Val Ser Ala Glu Val Leu Lys Lys Met Lys Lys Thr Ala Glu 115 120 125 Asp Phe Leu Gly Glu Thr Val Thr Glu Ala Val Ile Thr Val Pro Ala 130 135 140 Tyr Phe Asn Asp Ser Gln Arg Gln Ala Thr Lys Asp Ala Gly Arg Ile 145 150 155 160 Ala Gly Leu Glu Val Lys Arg Ile Ile Asn Glu Pro Thr Ala Ala Ala 165 170 175 Leu Ala Tyr Gly Met Asp Lys Gln Glu Xaa Asp Lys Val Val Ala Val 180 185 190 Tyr Asp Leu Gly Gly Gly Thr Phe Asp Ile Ser Ile Ile Glu Ile Asp 195 200 205 Glu Met Asp Gly Glu His Thr Phe Glu Val Leu Ala Thr Asn Gly Asp 210 215 220 Thr His Leu Gly Gly Glu Asp Phe Asp Asn Arg Leu Ile Asn Tyr Leu 225 230 235 240 Val Ala Glu Phe Lys Lys Asp Gln Gly Met Asp Leu Thr Ser Asp Pro 245 250 255 Leu Ala Met Gln Arg Leu Lys Glu Ala Ala Glu Lys Ala Lys Cys Glu 260 265 270 Leu Ser Ser Ala Gln Gln Thr Asp Val Asn Leu Pro Tyr Ile Thr Ala 275 280 285 Asp Gly Ser Gly Pro Lys His Met Asn Ile Lys Val Thr Arg Ala Lys 290 295 300 Leu Glu Ser Leu Val Glu Asp Met Val Lys Ala Thr Leu Glu Pro Leu 305 310 315 320 Lys Gln Ala Leu Lys Asp Ala Asp Leu Ser Val Ser Lys Ile Asp Asp 325 330 335 Val Ile Leu Val Gly Gly Gln Ser Arg Met Pro Leu Val Gln Lys Thr 340 345 350 Val Thr Asp Phe Phe Gly Lys Glu Pro Arg Lys Asp Val Asn Pro Asp 355 360 365 Glu Ala Val Ala Ser Gly Ala Ala Ile Gln Ala Gly Val Leu Ser Gly 370 375 380 Asp Val Thr Asp Val Leu Leu Leu Asp Val Thr Pro Leu Ser Leu Gly 385 390 395 400 Ile Glu Thr Met Gly Gly Val Met Thr Lys Val Ile Asp Lys Asn Thr 405 410 415 Thr Ile Pro Thr Lys Gln Ser Gln Thr Phe Ser Thr Ala Asp Asp Asn 420 425 430 Gln Ala Ala Val Thr Val His Val Cys Gln Gly Glu Arg Lys Gln Ala 435 440 445 Ser Ala Asn Lys Ser Leu Gly Gln Phe Asn Leu Glu Gly Ile Glu Pro 450 455 460 Ala Gln Arg Gly Thr Pro Gln Ile Glu Val Thr Phe Asp Ile Asp Ala 465 470 475 480 Asp Gly Ile Leu His Val Thr Ala Lys Asp Lys Asn Thr Gly Lys Glu 485 490 495 Gln Lys Ile Thr Ile Lys Ala Ser Ser Gly Leu Ser Asp Glu Glu Val 500 505 510 Glu Gln Met Val Arg Asp Ala Glu Ala Asn Ala Asp Ala Asp Ala Lys 515 520 525 Phe Glu Glu Leu Val Thr Ala Arg Asn Gln Ala Asp Gly Met Ile His 530 535 540 Ala Thr Arg Lys Gln Val Glu Glu Ala Gly Glu Glu Leu Pro Ser Glu 545 550 555 560 Asp Lys Glu Lys Ile Glu Ala Ala Leu Thr Glu Leu Glu Glu Ala Val 565 570 575 Lys Gly Asp Asp Lys Glu Val Ile Glu Ala Lys Thr Gln Ala Leu Met 580 585 590 Glu Ala Ser Ala Lys Leu Met Glu Ile Ala Gln Ala Lys Glu Gln Ala 595 600 605 Gln Ser Ala Pro Glu Gly Ala Gln Glu Ala Asp Ala Ala Pro Ala Asp 610 615 620 Asp Val Val Asp Ala Glu Phe Glu Glu Val Lys Asp Lys Lys 625 630 635 211220DNAArtificial SequencetyrS hybrid 21tataaatata atg tcg agc ttt aac caa gca ttc gcc gaa cta aaa cgc 49 Met Ser Ser Phe Asn Gln Ala Phe Ala Glu Leu Lys Arg 1 5 10 gga gca gaa gaa ata tta gta gaa gaa gaa tta tta aca aag ctt aag 97Gly Ala Glu Glu Ile Leu Val Glu Glu Glu Leu Leu Thr Lys Leu Lys 15 20 25 aca ggt aag ccg cta aaa atc aaa gcg ggt ttt gat cct act gcg cct 145Thr Gly Lys Pro Leu Lys Ile Lys Ala Gly Phe Asp Pro Thr Ala Pro 30 35 40 45 gac tta cat tta ggc cac acg gta tta att aac aag ctt cgt caa ttc 193Asp Leu His Leu Gly His Thr Val Leu Ile Asn Lys Leu Arg Gln Phe 50 55 60 caa caa tta ggt cat gaa gtt att ttc ttg att ggt gac ttc acc gga 241Gln Gln Leu Gly His Glu Val Ile Phe Leu Ile Gly Asp Phe Thr Gly 65 70 75 atg att ggt gat cca acg ggt aaa aat gtg acg cgt aag gca ctc act 289Met Ile Gly Asp Pro Thr Gly Lys Asn Val Thr Arg Lys Ala Leu Thr 80 85 90 aaa gaa gac gta tta gcc aat gct gaa acg tat aaa gag caa gtc ttt 337Lys Glu Asp Val Leu Ala Asn Ala Glu Thr Tyr Lys Glu Gln Val Phe 95 100 105 aaa ata tta gat ccc gct aaa aca acc gtt gcc ttt aac tct act tgg 385Lys Ile Leu Asp Pro Ala Lys Thr Thr Val Ala Phe Asn Ser Thr Trp 110 115 120 125 atg gat aaa tta ggc gcg gca ggt atg tta caa ctt gcc tct cgt caa 433Met Asp Lys Leu Gly Ala Ala Gly Met Leu Gln Leu Ala Ser Arg Gln 130 135 140 acg gtt gcc cgt atg atg gag cgt gac gac ttt aaa aaa cgt tat gct 481Thr Val Ala Arg Met Met Glu Arg Asp Asp Phe Lys Lys Arg Tyr Ala 145 150 155 aac ggc cag gcc att gct att cat gag ttt atg tac cct tta gta caa 529Asn Gly Gln Ala Ile Ala Ile His Glu Phe Met Tyr Pro Leu Val Gln 160 165 170 ggt tgg gat tca gtt gcg ctt gag gct gat gtt gag ctg ggt ggt acc 577Gly Trp Asp Ser Val Ala Leu Glu Ala Asp Val Glu Leu Gly Gly Thr 175 180 185 gac caa aag ttt aat tta tta atg ggt cgt gag tta caa aaa tct gaa 625Asp Gln Lys Phe Asn Leu Leu Met Gly Arg Glu Leu Gln Lys Ser Glu 190 195 200 205 ggc cag cgt cca caa aca gta tta atg atg cca tta ctt gaa ggc cta 673Gly Gln Arg Pro Gln Thr Val Leu Met Met Pro Leu Leu Glu Gly Leu 210 215 220 gat ggc gtt cag aaa atg tct aag tca tta ggc aac tac att ggc att 721Asp Gly Val Gln Lys Met Ser Lys Ser Leu Gly Asn Tyr Ile Gly Ile 225 230 235 act gat acg cct acc gac atg ttt ggc aaa ata atg tca att tca gat 769Thr Asp Thr Pro Thr Asp Met Phe Gly Lys Ile Met Ser Ile Ser Asp 240 245 250 gta tta atg tgg cgt tac tac gag tta ctt agc ttt aaa ccg ctt gaa 817Val Leu Met Trp Arg Tyr Tyr Glu Leu Leu Ser Phe Lys Pro Leu Glu 255 260 265 gaa att gaa ggt tat aaa acc gag ata gaa aat ggc aaa aat cct cgt 865Glu Ile Glu Gly Tyr Lys Thr Glu Ile Glu Asn Gly Lys Asn Pro Arg 270 275 280 285 gat gtt aaa att gat tta gcc aaa gaa ttg att gct cgt ttt cat gat 913Asp Val Lys Ile Asp Leu Ala Lys Glu Leu Ile Ala Arg Phe His Asp 290 295 300 gaa gct gct gca caa gct gcc cat gat gaa ttc atc aat cgt ttc caa 961Glu Ala Ala Ala Gln Ala Ala His Asp Glu Phe Ile Asn Arg Phe Gln 305 310 315 aaa ggt gcg tta cct gat gat atg ccg gaa tta acg att acc act gaa 1009Lys Gly Ala Leu Pro Asp Asp Met Pro Glu Leu Thr Ile Thr Thr Glu 320 325 330 aat ggt gaa ata gcc att gct aac ttg ctt aaa gat gca gga tta gtc 1057Asn Gly Glu Ile Ala Ile Ala Asn Leu Leu Lys Asp Ala Gly Leu Val 335 340 345 ggt agt act tct gat gcc ttt aga atg atc aaa caa ggg gcg gct aaa 1105Gly Ser Thr Ser Asp Ala Phe Arg Met Ile Lys Gln Gly Ala Ala Lys 350 355 360 365

att gat agc gaa aaa gta act gac cgt agc tta gtt att agc gct ggc 1153Ile Asp Ser Glu Lys Val Thr Asp Arg Ser Leu Val Ile Ser Ala Gly 370 375 380 acg acg gca gtt tat caa gtc ggc aaa cgt aaa ttt gct cgt att acc 1201Thr Thr Ala Val Tyr Gln Val Gly Lys Arg Lys Phe Ala Arg Ile Thr 385 390 395 ata aaa taaggggttg taa 1220Ile Lys 22399PRTArtificial SequenceSynthetic Construct 22Met Ser Ser Phe Asn Gln Ala Phe Ala Glu Leu Lys Arg Gly Ala Glu 1 5 10 15 Glu Ile Leu Val Glu Glu Glu Leu Leu Thr Lys Leu Lys Thr Gly Lys 20 25 30 Pro Leu Lys Ile Lys Ala Gly Phe Asp Pro Thr Ala Pro Asp Leu His 35 40 45 Leu Gly His Thr Val Leu Ile Asn Lys Leu Arg Gln Phe Gln Gln Leu 50 55 60 Gly His Glu Val Ile Phe Leu Ile Gly Asp Phe Thr Gly Met Ile Gly 65 70 75 80 Asp Pro Thr Gly Lys Asn Val Thr Arg Lys Ala Leu Thr Lys Glu Asp 85 90 95 Val Leu Ala Asn Ala Glu Thr Tyr Lys Glu Gln Val Phe Lys Ile Leu 100 105 110 Asp Pro Ala Lys Thr Thr Val Ala Phe Asn Ser Thr Trp Met Asp Lys 115 120 125 Leu Gly Ala Ala Gly Met Leu Gln Leu Ala Ser Arg Gln Thr Val Ala 130 135 140 Arg Met Met Glu Arg Asp Asp Phe Lys Lys Arg Tyr Ala Asn Gly Gln 145 150 155 160 Ala Ile Ala Ile His Glu Phe Met Tyr Pro Leu Val Gln Gly Trp Asp 165 170 175 Ser Val Ala Leu Glu Ala Asp Val Glu Leu Gly Gly Thr Asp Gln Lys 180 185 190 Phe Asn Leu Leu Met Gly Arg Glu Leu Gln Lys Ser Glu Gly Gln Arg 195 200 205 Pro Gln Thr Val Leu Met Met Pro Leu Leu Glu Gly Leu Asp Gly Val 210 215 220 Gln Lys Met Ser Lys Ser Leu Gly Asn Tyr Ile Gly Ile Thr Asp Thr 225 230 235 240 Pro Thr Asp Met Phe Gly Lys Ile Met Ser Ile Ser Asp Val Leu Met 245 250 255 Trp Arg Tyr Tyr Glu Leu Leu Ser Phe Lys Pro Leu Glu Glu Ile Glu 260 265 270 Gly Tyr Lys Thr Glu Ile Glu Asn Gly Lys Asn Pro Arg Asp Val Lys 275 280 285 Ile Asp Leu Ala Lys Glu Leu Ile Ala Arg Phe His Asp Glu Ala Ala 290 295 300 Ala Gln Ala Ala His Asp Glu Phe Ile Asn Arg Phe Gln Lys Gly Ala 305 310 315 320 Leu Pro Asp Asp Met Pro Glu Leu Thr Ile Thr Thr Glu Asn Gly Glu 325 330 335 Ile Ala Ile Ala Asn Leu Leu Lys Asp Ala Gly Leu Val Gly Ser Thr 340 345 350 Ser Asp Ala Phe Arg Met Ile Lys Gln Gly Ala Ala Lys Ile Asp Ser 355 360 365 Glu Lys Val Thr Asp Arg Ser Leu Val Ile Ser Ala Gly Thr Thr Ala 370 375 380 Val Tyr Gln Val Gly Lys Arg Lys Phe Ala Arg Ile Thr Ile Lys 385 390 395 23741DNAArtificial Sequencecmk hybrid 23aagtctagaa gatgtcgaaa aacttgtaca gtttatgcag gaatttgaga atg aac 56 Met Asn 1 aat agc aca cca gtt ata acc att gat ggc cca agt ggg gct ggt aaa 104Asn Ser Thr Pro Val Ile Thr Ile Asp Gly Pro Ser Gly Ala Gly Lys 5 10 15 gga acc gtt gca agg ata gtt gcg gac caa tta ggt tgg cac ctt ctt 152Gly Thr Val Ala Arg Ile Val Ala Asp Gln Leu Gly Trp His Leu Leu 20 25 30 gac agt ggg gct att tac cgc gtc tta gct gtt gcc att caa cat cac 200Asp Ser Gly Ala Ile Tyr Arg Val Leu Ala Val Ala Ile Gln His His 35 40 45 50 caa ctt tca tta gat gat gaa gag cct ctt atc cct atg gct gca cat 248Gln Leu Ser Leu Asp Asp Glu Glu Pro Leu Ile Pro Met Ala Ala His 55 60 65 tta gat gtt caa ttt gaa att aat agt caa ggt gaa gct aaa gtt att 296Leu Asp Val Gln Phe Glu Ile Asn Ser Gln Gly Glu Ala Lys Val Ile 70 75 80 tta gaa ggt gaa aat gtt act gaa att att cgt act gaa gaa gtt ggc 344Leu Glu Gly Glu Asn Val Thr Glu Ile Ile Arg Thr Glu Glu Val Gly 85 90 95 gga tta gca tcg aaa gta gca gca ttt cca cgt gtt aga gaa gcg cta 392Gly Leu Ala Ser Lys Val Ala Ala Phe Pro Arg Val Arg Glu Ala Leu 100 105 110 tta cga aga caa cgt gca ttt agc gtt agc cct ggc tta att gca gat 440Leu Arg Arg Gln Arg Ala Phe Ser Val Ser Pro Gly Leu Ile Ala Asp 115 120 125 130 ggt cgc gac atg gga acc gtt gtt ttt ccg aaa gct cca gta aaa ata 488Gly Arg Asp Met Gly Thr Val Val Phe Pro Lys Ala Pro Val Lys Ile 135 140 145 ttt tta act gct agt gct gaa gaa cga gct gac cga aga ttt aat cag 536Phe Leu Thr Ala Ser Ala Glu Glu Arg Ala Asp Arg Arg Phe Asn Gln 150 155 160 ttg aaa gaa aaa gga att gat gtt aac atc ggg cgc ctt ttg gat gac 584Leu Lys Glu Lys Gly Ile Asp Val Asn Ile Gly Arg Leu Leu Asp Asp 165 170 175 ata cgt caa cga gat gag cga gat caa aac cgc aag gta gct cct ctt 632Ile Arg Gln Arg Asp Glu Arg Asp Gln Asn Arg Lys Val Ala Pro Leu 180 185 190 atc ccg gca gaa gga gcg tta act att gat tct act gat att tct att 680Ile Pro Ala Glu Gly Ala Leu Thr Ile Asp Ser Thr Asp Ile Ser Ile 195 200 205 210 aca gaa gtc gtc aat aaa atc ctt atg ttt gcc aat ggc aaa tta acg 728Thr Glu Val Val Asn Lys Ile Leu Met Phe Ala Asn Gly Lys Leu Thr 215 220 225 tag atattttagc 74124226PRTArtificial SequenceSynthetic Construct 24Met Asn Asn Ser Thr Pro Val Ile Thr Ile Asp Gly Pro Ser Gly Ala 1 5 10 15 Gly Lys Gly Thr Val Ala Arg Ile Val Ala Asp Gln Leu Gly Trp His 20 25 30 Leu Leu Asp Ser Gly Ala Ile Tyr Arg Val Leu Ala Val Ala Ile Gln 35 40 45 His His Gln Leu Ser Leu Asp Asp Glu Glu Pro Leu Ile Pro Met Ala 50 55 60 Ala His Leu Asp Val Gln Phe Glu Ile Asn Ser Gln Gly Glu Ala Lys 65 70 75 80 Val Ile Leu Glu Gly Glu Asn Val Thr Glu Ile Ile Arg Thr Glu Glu 85 90 95 Val Gly Gly Leu Ala Ser Lys Val Ala Ala Phe Pro Arg Val Arg Glu 100 105 110 Ala Leu Leu Arg Arg Gln Arg Ala Phe Ser Val Ser Pro Gly Leu Ile 115 120 125 Ala Asp Gly Arg Asp Met Gly Thr Val Val Phe Pro Lys Ala Pro Val 130 135 140 Lys Ile Phe Leu Thr Ala Ser Ala Glu Glu Arg Ala Asp Arg Arg Phe 145 150 155 160 Asn Gln Leu Lys Glu Lys Gly Ile Asp Val Asn Ile Gly Arg Leu Leu 165 170 175 Asp Asp Ile Arg Gln Arg Asp Glu Arg Asp Gln Asn Arg Lys Val Ala 180 185 190 Pro Leu Ile Pro Ala Glu Gly Ala Leu Thr Ile Asp Ser Thr Asp Ile 195 200 205 Ser Ile Thr Glu Val Val Asn Lys Ile Leu Met Phe Ala Asn Gly Lys 210 215 220 Leu Thr 225 251937DNAArtificial SequencednaKsf hybrid 25ggagaatcaa atg gga aaa att att ggt atc gat tta ggc aca aca aac 49 Met Gly Lys Ile Ile Gly Ile Asp Leu Gly Thr Thr Asn 1 5 10 tcg tgt gta gca gtc ctt gat ggc ggc aaa gca cgc gta att gaa aac 97Ser Cys Val Ala Val Leu Asp Gly Gly Lys Ala Arg Val Ile Glu Asn 15 20 25 gca gag ggt gat cgc aca acc cca tca att atc gct tat acc gat gat 145Ala Glu Gly Asp Arg Thr Thr Pro Ser Ile Ile Ala Tyr Thr Asp Asp 30 35 40 45 gaa att att gta ggc cag cca gca aag cgt cag gct gta acc aac cca 193Glu Ile Ile Val Gly Gln Pro Ala Lys Arg Gln Ala Val Thr Asn Pro 50 55 60 aca aac aca ttc ttt gcc atc aag cgt tta atc ggt cgt cgt ttt aaa 241Thr Asn Thr Phe Phe Ala Ile Lys Arg Leu Ile Gly Arg Arg Phe Lys 65 70 75 gat gac gaa gtt caa cgt gat gtg aac atc atg cca ttc aaa att atc 289Asp Asp Glu Val Gln Arg Asp Val Asn Ile Met Pro Phe Lys Ile Ile 80 85 90 gca gct gat aat ggt gat gca tgg gtt gag tca cgt ggt aac aaa atg 337Ala Ala Asp Asn Gly Asp Ala Trp Val Glu Ser Arg Gly Asn Lys Met 95 100 105 gca cca cca caa gtt tca gct gaa atc ttg aaa aag atg aaa aag act 385Ala Pro Pro Gln Val Ser Ala Glu Ile Leu Lys Lys Met Lys Lys Thr 110 115 120 125 gct gaa gac ttt tta ggt gaa gaa gtg act gaa gcg gtt att acc gtt 433Ala Glu Asp Phe Leu Gly Glu Glu Val Thr Glu Ala Val Ile Thr Val 130 135 140 cct gct tac ttt aac gat tca caa cgt caa gcc act aaa gat gct ggt 481Pro Ala Tyr Phe Asn Asp Ser Gln Arg Gln Ala Thr Lys Asp Ala Gly 145 150 155 cgt atc gca ggt ctt gat gtt aag cgt att atc aac gaa cct act gct 529Arg Ile Ala Gly Leu Asp Val Lys Arg Ile Ile Asn Glu Pro Thr Ala 160 165 170 gct gca ctt gca tac ggt atc gac aag aaa caa ggc gac aac att gtt 577Ala Ala Leu Ala Tyr Gly Ile Asp Lys Lys Gln Gly Asp Asn Ile Val 175 180 185 gct gta tac gat tta ggt ggt ggt aca ttc gat atc tct atc atc gaa 625Ala Val Tyr Asp Leu Gly Gly Gly Thr Phe Asp Ile Ser Ile Ile Glu 190 195 200 205 att gac agc aac gat ggt gac caa aca ttt gaa gta cta gca acc aat 673Ile Asp Ser Asn Asp Gly Asp Gln Thr Phe Glu Val Leu Ala Thr Asn 210 215 220 ggt gat act cac tta ggt ggt gaa gac ttt gat aac cgt atg att aac 721Gly Asp Thr His Leu Gly Gly Glu Asp Phe Asp Asn Arg Met Ile Asn 225 230 235 tat tta gct gat gaa ttc aaa aaa gac caa ggc tta gat ctt cgt aga 769Tyr Leu Ala Asp Glu Phe Lys Lys Asp Gln Gly Leu Asp Leu Arg Arg 240 245 250 gat cct tta gca atg caa cgt ttg aaa gaa gcc gct gaa aaa gca aaa 817Asp Pro Leu Ala Met Gln Arg Leu Lys Glu Ala Ala Glu Lys Ala Lys 255 260 265 atc gag ctt tca agc act aac cac act gaa gtt aac ttg cct tac atc 865Ile Glu Leu Ser Ser Thr Asn His Thr Glu Val Asn Leu Pro Tyr Ile 270 275 280 285 act gct gat gca tca ggt cct aag cat tta gtg gtt aaa att act cgt 913Thr Ala Asp Ala Ser Gly Pro Lys His Leu Val Val Lys Ile Thr Arg 290 295 300 gct aag tta gag tca tta gtt gaa gat tta att caa cgt act cta gag 961Ala Lys Leu Glu Ser Leu Val Glu Asp Leu Ile Gln Arg Thr Leu Glu 305 310 315 ccg ctt aaa gtt gca cta gct gat gct gat tta tca ata tca gat atc 1009Pro Leu Lys Val Ala Leu Ala Asp Ala Asp Leu Ser Ile Ser Asp Ile 320 325 330 aat gaa gtg att ctt gtg ggt ggt cag act cgt atg cct aaa gta caa 1057Asn Glu Val Ile Leu Val Gly Gly Gln Thr Arg Met Pro Lys Val Gln 335 340 345 gaa gca gtc act aac ttc ttt ggc aaa gag cct cgt aaa gat gtt aac 1105Glu Ala Val Thr Asn Phe Phe Gly Lys Glu Pro Arg Lys Asp Val Asn 350 355 360 365 cct gat gaa gcg gtt gct gtt ggt gcg gcg att cag gct ggc gta ctt 1153Pro Asp Glu Ala Val Ala Val Gly Ala Ala Ile Gln Ala Gly Val Leu 370 375 380 tct ggt gaa gtg aaa gac gta ctt cta ctt gac gtt acc cca cta tct 1201Ser Gly Glu Val Lys Asp Val Leu Leu Leu Asp Val Thr Pro Leu Ser 385 390 395 ctt ggt att gaa acc atg ggc agt gtg atg aca aag ctt atc gag aag 1249Leu Gly Ile Glu Thr Met Gly Ser Val Met Thr Lys Leu Ile Glu Lys 400 405 410 aac acc act atc ccg act aaa gct cag caa gta ttc tca aca gct gac 1297Asn Thr Thr Ile Pro Thr Lys Ala Gln Gln Val Phe Ser Thr Ala Asp 415 420 425 gac aac caa agt gcc gtg act att cac gta ctt caa ggt gaa cgt aag 1345Asp Asn Gln Ser Ala Val Thr Ile His Val Leu Gln Gly Glu Arg Lys 430 435 440 445 caa gcg agt gct aac aag tca tta ggt caa ttt aac ctt gaa ggt att 1393Gln Ala Ser Ala Asn Lys Ser Leu Gly Gln Phe Asn Leu Glu Gly Ile 450 455 460 gag cca gca cca cgt ggc caa cca cag gtt gaa gtg atg ttc gac att 1441Glu Pro Ala Pro Arg Gly Gln Pro Gln Val Glu Val Met Phe Asp Ile 465 470 475 gat gct gat ggt atc tta cat gtg tct gca aca gac aag aaa aca ggt 1489Asp Ala Asp Gly Ile Leu His Val Ser Ala Thr Asp Lys Lys Thr Gly 480 485 490 aag aaa caa aac att act atc aaa gcc tct tca ggt tta tct gat gaa 1537Lys Lys Gln Asn Ile Thr Ile Lys Ala Ser Ser Gly Leu Ser Asp Glu 495 500 505 gaa gtt gaa caa atg gta cgt gat gca gaa gct cat gct gat gaa gat 1585Glu Val Glu Gln Met Val Arg Asp Ala Glu Ala His Ala Asp Glu Asp 510 515 520 525 gct aaa ttt gaa gag tta gtt aaa gcg cgt aat caa gca gat ggt tta 1633Ala Lys Phe Glu Glu Leu Val Lys Ala Arg Asn Gln Ala Asp Gly Leu 530 535 540 gct cat tca act aaa aaa caa gtt gaa gaa gct ggc gat gca cta gct 1681Ala His Ser Thr Lys Lys Gln Val Glu Glu Ala Gly Asp Ala Leu Ala 545 550 555 agt gac gaa aaa gaa aag att gaa gca gca atc gca act tta gaa act 1729Ser Asp Glu Lys Glu Lys Ile Glu Ala Ala Ile Ala Thr Leu Glu Thr 560 565 570 gcc ata aaa ggc aaa gat aaa gaa gcc att gat aca gca act caa gcg 1777Ala Ile Lys Gly Lys Asp Lys Glu Ala Ile Asp Thr Ala Thr Gln Ala 575 580 585 cta atc gaa gcg tct gct aag tta atg gaa att gct caa gct aaa gct 1825Leu Ile Glu Ala Ser Ala Lys Leu Met Glu Ile Ala Gln Ala Lys Ala 590 595 600 605 caa ggt gaa gca gaa ggt caa gcg cac gat gct ggc caa gaa aag cct 1873Gln Gly Glu Ala Glu Gly Gln Ala His Asp Ala Gly Gln Glu Lys Pro 610 615 620 gct gat gat gtt gtt gat gct gag ttc gaa gaa gtt aaa gac gac aaa 1921Ala Asp Asp Val Val Asp Ala Glu Phe Glu Glu Val Lys Asp Asp Lys 625 630 635 aaa taa ataatctttt 1937Lys 26638PRTArtificial SequenceSynthetic Construct 26Met Gly Lys Ile Ile Gly Ile Asp Leu Gly Thr Thr Asn Ser Cys Val 1 5 10 15 Ala Val Leu Asp Gly Gly Lys Ala Arg Val Ile Glu Asn Ala Glu Gly 20 25 30 Asp Arg Thr Thr Pro Ser Ile Ile Ala Tyr Thr Asp Asp Glu Ile

Ile 35 40 45 Val Gly Gln Pro Ala Lys Arg Gln Ala Val Thr Asn Pro Thr Asn Thr 50 55 60 Phe Phe Ala Ile Lys Arg Leu Ile Gly Arg Arg Phe Lys Asp Asp Glu 65 70 75 80 Val Gln Arg Asp Val Asn Ile Met Pro Phe Lys Ile Ile Ala Ala Asp 85 90 95 Asn Gly Asp Ala Trp Val Glu Ser Arg Gly Asn Lys Met Ala Pro Pro 100 105 110 Gln Val Ser Ala Glu Ile Leu Lys Lys Met Lys Lys Thr Ala Glu Asp 115 120 125 Phe Leu Gly Glu Glu Val Thr Glu Ala Val Ile Thr Val Pro Ala Tyr 130 135 140 Phe Asn Asp Ser Gln Arg Gln Ala Thr Lys Asp Ala Gly Arg Ile Ala 145 150 155 160 Gly Leu Asp Val Lys Arg Ile Ile Asn Glu Pro Thr Ala Ala Ala Leu 165 170 175 Ala Tyr Gly Ile Asp Lys Lys Gln Gly Asp Asn Ile Val Ala Val Tyr 180 185 190 Asp Leu Gly Gly Gly Thr Phe Asp Ile Ser Ile Ile Glu Ile Asp Ser 195 200 205 Asn Asp Gly Asp Gln Thr Phe Glu Val Leu Ala Thr Asn Gly Asp Thr 210 215 220 His Leu Gly Gly Glu Asp Phe Asp Asn Arg Met Ile Asn Tyr Leu Ala 225 230 235 240 Asp Glu Phe Lys Lys Asp Gln Gly Leu Asp Leu Arg Arg Asp Pro Leu 245 250 255 Ala Met Gln Arg Leu Lys Glu Ala Ala Glu Lys Ala Lys Ile Glu Leu 260 265 270 Ser Ser Thr Asn His Thr Glu Val Asn Leu Pro Tyr Ile Thr Ala Asp 275 280 285 Ala Ser Gly Pro Lys His Leu Val Val Lys Ile Thr Arg Ala Lys Leu 290 295 300 Glu Ser Leu Val Glu Asp Leu Ile Gln Arg Thr Leu Glu Pro Leu Lys 305 310 315 320 Val Ala Leu Ala Asp Ala Asp Leu Ser Ile Ser Asp Ile Asn Glu Val 325 330 335 Ile Leu Val Gly Gly Gln Thr Arg Met Pro Lys Val Gln Glu Ala Val 340 345 350 Thr Asn Phe Phe Gly Lys Glu Pro Arg Lys Asp Val Asn Pro Asp Glu 355 360 365 Ala Val Ala Val Gly Ala Ala Ile Gln Ala Gly Val Leu Ser Gly Glu 370 375 380 Val Lys Asp Val Leu Leu Leu Asp Val Thr Pro Leu Ser Leu Gly Ile 385 390 395 400 Glu Thr Met Gly Ser Val Met Thr Lys Leu Ile Glu Lys Asn Thr Thr 405 410 415 Ile Pro Thr Lys Ala Gln Gln Val Phe Ser Thr Ala Asp Asp Asn Gln 420 425 430 Ser Ala Val Thr Ile His Val Leu Gln Gly Glu Arg Lys Gln Ala Ser 435 440 445 Ala Asn Lys Ser Leu Gly Gln Phe Asn Leu Glu Gly Ile Glu Pro Ala 450 455 460 Pro Arg Gly Gln Pro Gln Val Glu Val Met Phe Asp Ile Asp Ala Asp 465 470 475 480 Gly Ile Leu His Val Ser Ala Thr Asp Lys Lys Thr Gly Lys Lys Gln 485 490 495 Asn Ile Thr Ile Lys Ala Ser Ser Gly Leu Ser Asp Glu Glu Val Glu 500 505 510 Gln Met Val Arg Asp Ala Glu Ala His Ala Asp Glu Asp Ala Lys Phe 515 520 525 Glu Glu Leu Val Lys Ala Arg Asn Gln Ala Asp Gly Leu Ala His Ser 530 535 540 Thr Lys Lys Gln Val Glu Glu Ala Gly Asp Ala Leu Ala Ser Asp Glu 545 550 555 560 Lys Glu Lys Ile Glu Ala Ala Ile Ala Thr Leu Glu Thr Ala Ile Lys 565 570 575 Gly Lys Asp Lys Glu Ala Ile Asp Thr Ala Thr Gln Ala Leu Ile Glu 580 585 590 Ala Ser Ala Lys Leu Met Glu Ile Ala Gln Ala Lys Ala Gln Gly Glu 595 600 605 Ala Glu Gly Gln Ala His Asp Ala Gly Gln Glu Lys Pro Ala Asp Asp 610 615 620 Val Val Asp Ala Glu Phe Glu Glu Val Lys Asp Asp Lys Lys 625 630 635 271175DNAArtificial SequenceftsZ hybrid 27atg ttt gat ttt aac gat tca atg gtt tca aat gcc ata att aaa gtt 48Met Phe Asp Phe Asn Asp Ser Met Val Ser Asn Ala Ile Ile Lys Val 1 5 10 15 gtc ggt gtt ggt ggc ggt ggc ggt aat gct gta caa cat atg tgt gaa 96Val Gly Val Gly Gly Gly Gly Gly Asn Ala Val Gln His Met Cys Glu 20 25 30 gaa gtt tct gat gtt gag ttt ttt gcc cta aat aca gat ggt caa gca 144Glu Val Ser Asp Val Glu Phe Phe Ala Leu Asn Thr Asp Gly Gln Ala 35 40 45 tta tca aaa tca aaa gtt caa aat ata tta caa att ggt aca aac cta 192Leu Ser Lys Ser Lys Val Gln Asn Ile Leu Gln Ile Gly Thr Asn Leu 50 55 60 aca aaa ggt tta ggt gct ggt gcg aat cct gaa att ggt aag aga gct 240Thr Lys Gly Leu Gly Ala Gly Ala Asn Pro Glu Ile Gly Lys Arg Ala 65 70 75 80 gca act gaa gat aga gcg aaa atc gag caa ctt tta gag ggt gct gat 288Ala Thr Glu Asp Arg Ala Lys Ile Glu Gln Leu Leu Glu Gly Ala Asp 85 90 95 atg gtt ttc atc act gct ggt atg ggt ggt ggt aca ggt aca ggt gga 336Met Val Phe Ile Thr Ala Gly Met Gly Gly Gly Thr Gly Thr Gly Gly 100 105 110 gct cct gta gtt gca gaa gtt gca aaa gag atg ggt ata ctt aca gta 384Ala Pro Val Val Ala Glu Val Ala Lys Glu Met Gly Ile Leu Thr Val 115 120 125 gct gta gtt act aag cct ttc cct ttt gaa gga cca aga aga atg aaa 432Ala Val Val Thr Lys Pro Phe Pro Phe Glu Gly Pro Arg Arg Met Lys 130 135 140 gca gca gag caa ggt att gag ttt tta tct aaa agt gtt gat tca ctg 480Ala Ala Glu Gln Gly Ile Glu Phe Leu Ser Lys Ser Val Asp Ser Leu 145 150 155 160 att act att cct aac gaa aag tta ctg aaa gta ctt ggc cct gga aca 528Ile Thr Ile Pro Asn Glu Lys Leu Leu Lys Val Leu Gly Pro Gly Thr 165 170 175 agc tta tta gat gcc ttt aaa gca gca aat aac gtg cta ctt ggc gcc 576Ser Leu Leu Asp Ala Phe Lys Ala Ala Asn Asn Val Leu Leu Gly Ala 180 185 190 gtt cag ggt att gca gaa tta att act cgt cct ggt ttg ata aat gtc 624Val Gln Gly Ile Ala Glu Leu Ile Thr Arg Pro Gly Leu Ile Asn Val 195 200 205 gat ttt gct gat gta cgt acc gtt atg tct gag atg ggt act gcc atg 672Asp Phe Ala Asp Val Arg Thr Val Met Ser Glu Met Gly Thr Ala Met 210 215 220 atg ggt tct ggt act gct tct ggc gat gat aga gca caa gaa gct gct 720Met Gly Ser Gly Thr Ala Ser Gly Asp Asp Arg Ala Gln Glu Ala Ala 225 230 235 240 gat gct gct att tca agt cct tta tta gag gat gtg gat tta gct ggt 768Asp Ala Ala Ile Ser Ser Pro Leu Leu Glu Asp Val Asp Leu Ala Gly 245 250 255 gca cgc ggg atc tta gtt aat att acc gca ggt atg gat att agt atc 816Ala Arg Gly Ile Leu Val Asn Ile Thr Ala Gly Met Asp Ile Ser Ile 260 265 270 gat gag ttt gaa act gtt ggt aat gcc gtt aaa gct ttc gct tct gaa 864Asp Glu Phe Glu Thr Val Gly Asn Ala Val Lys Ala Phe Ala Ser Glu 275 280 285 aat gcg act gtt gtt gtt ggt gct gtt att gat atg gat atg aca gat 912Asn Ala Thr Val Val Val Gly Ala Val Ile Asp Met Asp Met Thr Asp 290 295 300 gag ctt cgt gtg act gtt gtt gct acg ggt att ggc gct gaa agt aag 960Glu Leu Arg Val Thr Val Val Ala Thr Gly Ile Gly Ala Glu Ser Lys 305 310 315 320 cct gat att acg tta gta aat cct atg cca atg gct gaa gca aaa gtt 1008Pro Asp Ile Thr Leu Val Asn Pro Met Pro Met Ala Glu Ala Lys Val 325 330 335 gtc ggt ggg gat tat aca cca gct gca cca cag gca aat tta gcg act 1056Val Gly Gly Asp Tyr Thr Pro Ala Ala Pro Gln Ala Asn Leu Ala Thr 340 345 350 gaa gca ata gct atg act gat agc aat gcg cag aaa gca gca gca acc 1104Glu Ala Ile Ala Met Thr Asp Ser Asn Ala Gln Lys Ala Ala Ala Thr 355 360 365 gac tta gat act tat tta gat att cct gct ttt tta cgt aag caa gcg 1152Asp Leu Asp Thr Tyr Leu Asp Ile Pro Ala Phe Leu Arg Lys Gln Ala 370 375 380 gat taataaaaac caaaattaag 1175Asp 385 28385PRTArtificial SequenceSynthetic Construct 28Met Phe Asp Phe Asn Asp Ser Met Val Ser Asn Ala Ile Ile Lys Val 1 5 10 15 Val Gly Val Gly Gly Gly Gly Gly Asn Ala Val Gln His Met Cys Glu 20 25 30 Glu Val Ser Asp Val Glu Phe Phe Ala Leu Asn Thr Asp Gly Gln Ala 35 40 45 Leu Ser Lys Ser Lys Val Gln Asn Ile Leu Gln Ile Gly Thr Asn Leu 50 55 60 Thr Lys Gly Leu Gly Ala Gly Ala Asn Pro Glu Ile Gly Lys Arg Ala 65 70 75 80 Ala Thr Glu Asp Arg Ala Lys Ile Glu Gln Leu Leu Glu Gly Ala Asp 85 90 95 Met Val Phe Ile Thr Ala Gly Met Gly Gly Gly Thr Gly Thr Gly Gly 100 105 110 Ala Pro Val Val Ala Glu Val Ala Lys Glu Met Gly Ile Leu Thr Val 115 120 125 Ala Val Val Thr Lys Pro Phe Pro Phe Glu Gly Pro Arg Arg Met Lys 130 135 140 Ala Ala Glu Gln Gly Ile Glu Phe Leu Ser Lys Ser Val Asp Ser Leu 145 150 155 160 Ile Thr Ile Pro Asn Glu Lys Leu Leu Lys Val Leu Gly Pro Gly Thr 165 170 175 Ser Leu Leu Asp Ala Phe Lys Ala Ala Asn Asn Val Leu Leu Gly Ala 180 185 190 Val Gln Gly Ile Ala Glu Leu Ile Thr Arg Pro Gly Leu Ile Asn Val 195 200 205 Asp Phe Ala Asp Val Arg Thr Val Met Ser Glu Met Gly Thr Ala Met 210 215 220 Met Gly Ser Gly Thr Ala Ser Gly Asp Asp Arg Ala Gln Glu Ala Ala 225 230 235 240 Asp Ala Ala Ile Ser Ser Pro Leu Leu Glu Asp Val Asp Leu Ala Gly 245 250 255 Ala Arg Gly Ile Leu Val Asn Ile Thr Ala Gly Met Asp Ile Ser Ile 260 265 270 Asp Glu Phe Glu Thr Val Gly Asn Ala Val Lys Ala Phe Ala Ser Glu 275 280 285 Asn Ala Thr Val Val Val Gly Ala Val Ile Asp Met Asp Met Thr Asp 290 295 300 Glu Leu Arg Val Thr Val Val Ala Thr Gly Ile Gly Ala Glu Ser Lys 305 310 315 320 Pro Asp Ile Thr Leu Val Asn Pro Met Pro Met Ala Glu Ala Lys Val 325 330 335 Val Gly Gly Asp Tyr Thr Pro Ala Ala Pro Gln Ala Asn Leu Ala Thr 340 345 350 Glu Ala Ile Ala Met Thr Asp Ser Asn Ala Gln Lys Ala Ala Ala Thr 355 360 365 Asp Leu Asp Thr Tyr Leu Asp Ile Pro Ala Phe Leu Arg Lys Gln Ala 370 375 380 Asp 385

* * * * *


uspto.report is an independent third-party trademark research tool that is not affiliated, endorsed, or sponsored by the United States Patent and Trademark Office (USPTO) or any other governmental organization. The information provided by uspto.report is based on publicly available data at the time of writing and is intended for informational purposes only.

While we strive to provide accurate and up-to-date information, we do not guarantee the accuracy, completeness, reliability, or suitability of the information displayed on this site. The use of this site is at your own risk. Any reliance you place on such information is therefore strictly at your own risk.

All official trademark data, including owner information, should be verified by visiting the official USPTO website at www.uspto.gov. This site is not intended to replace professional legal advice and should not be used as a substitute for consulting with a legal professional who is knowledgeable about trademark law.

© 2024 USPTO.report | Privacy Policy | Resources | RSS Feed of Trademarks | Trademark Filings Twitter Feed