U.S. patent application number 12/743106 was filed with the patent office on 2012-06-28 for succinic acid production in a eukaryotic cell. Invention is credited to Robbertus Antonius Damveld, Cornelis Maria Jacobus Sagt, Rene Verwaal, Liang Wu.
Application Number | 20120165569 12/743106 |
Document ID | / |
Family ID | 40268299 |
Filed Date | 2012-06-28 |
United States Patent Application | 20120165569 |
Kind Code | A1 |
Verwaal; Rene ; et al. | June 28, 2012 |
The present invention relates to a recombinant eukaryotic cell selected from a yeast of a filamentous fungus comprising a nucleotide sequence encoding a NAD(H)-dependent fumarate reductase that catalyses the conversion of fumaric acid to succinic acid. The invention further relates to a process for the production of succinic acid wherein the eukaryotic cell according to the present invention is used.
Inventors: | Verwaal; Rene; (Nootdorp, NL) ; Wu; Liang; (Delft, NL) ; Damveld; Robbertus Antonius; (Berkel En Rodenrijs, NL) ; Sagt; Cornelis Maria Jacobus; (Utrecht, NL) |
Family ID: | 40268299 |
Appl. No.: | 12/743106 |
Filed: | November 14, 2008 |
PCT Filed: | November 14, 2008 |
PCT NO: | PCT/EP2008/065583 |
371 Date: | May 14, 2010 |
Current U.S. Class: | 562/590 ; 435/145; 435/254.11; 435/254.2; 435/254.21; 435/254.3 |
Current CPC Class: | C12N 15/815 20130101; C12N 15/81 20130101; C12P 7/46 20130101; C12N 9/88 20130101; C12N 9/001 20130101; Y02P 20/52 20151101 |
Class at Publication: | 562/590 ; 435/254.11; 435/254.2; 435/254.3; 435/254.21; 435/145 |
International Class: | C07C 55/10 20060101 C07C055/10; C12N 1/19 20060101 C12N001/19; C12P 7/46 20060101 C12P007/46; C12N 1/15 20060101 C12N001/15 |
Date | Code | Application Number |
---|---|---|
Nov 20, 2007 | EP | 07121113.0 |
Nov 20, 2007 | EP | 07121117.1 |
Nov 20, 2007 | EP | 07121120.5 |
May 27, 2008 | EP | 08156959.2 |
May 27, 2008 | EP | 08156960.0 |
May 27, 2008 | EP | 08156961.8 |
Sequence CWU 1
1
4411232PRTTrypanosoma brucei 1Met Leu Ser Thr Lys Gln Leu Leu Leu
Arg Ala Thr Ser Ala Leu Val1 5 10 15Ala Gly Ser Ser Gly Val Ala Arg
Asp Ser Pro Ser Leu Val Gly Asp 20 25 30Pro Cys Asp Ser Val Ser Pro
Thr Arg Val Val Trp Gly Arg Phe Phe 35 40 45Lys Ser Leu Ala Pro Pro
Ala Pro Ser Val Val Ser Cys Gln Lys Arg 50 55 60Phe Thr Ser His Gly
Ala Asp Gly Ile Ser Ser Ala Ser Ile Val Val65 70 75 80Thr Asp Pro
Glu Ala Ala Ala Lys Lys Arg Asp Arg Met Ala Arg Glu 85 90 95Leu Leu
Ser Ser Asn Ser Gly Leu Cys Gln Glu Asp Glu Pro Thr Ile 100 105
110Ile Asn Leu Lys Gly Leu Glu His Thr Ile Pro Tyr Arg Leu Ala Val
115 120 125Val Leu Cys Asn Ser Arg Ser Thr Gly Glu Phe Glu Ala Lys
Ala Ala 130 135 140Glu Ile Leu Arg Lys Ala Phe His Met Val Asp Tyr
Ser Leu Asn Cys145 150 155 160Phe Asn Pro Glu Ser Glu Leu Ser Arg
Val Asn Ser Leu Pro Val Gly 165 170 175Glu Lys His Gln Met Ser Glu
Asp Leu Arg His Val Met Glu Cys Thr 180 185 190Ile Ser Val His His
Ser Ser Gly Met Gly Phe Asp Pro Ala Ala Gly 195 200 205Pro Ile Ile
Ser Arg Leu Arg Gly Ala Met Arg Asp His Asn Asp Met 210 215 220Ser
Asp Ile Ser Val Thr Glu Ala Glu Val Glu Leu Phe Ser Leu Ala225 230
235 240Gln Ser Phe Asp Val Asp Leu Glu Glu Gly Thr Ile Ala Arg Lys
His 245 250 255Ser Glu Ala Arg Leu Asp Leu Gly Gly Val Asn Lys Gly
Tyr Thr Val 260 265 270Asp Tyr Val Val Asp His Leu Arg Ala Ala Gly
Met Pro Asn Val Leu 275 280 285Phe Glu Trp Gly Gly Asp Ile Arg Ala
Ser Gly Arg Asn Ile Lys Gly 290 295 300Asn Leu Trp Ala Val Ala Ile
Lys Arg Pro Pro Ser Val Glu Glu Val305 310 315 320Ile Arg Arg Ala
Lys Gly Lys Met Leu Lys Met Gly Glu Glu Glu Gln 325 330 335Glu Glu
Lys Asp Asp Asp Ser Pro Ser Leu Leu His Val Val Glu Leu 340 345
350Asp Asp Glu Ala Leu Cys Thr Ser Gly Asp Tyr Glu Asn Val Leu Tyr
355 360 365His Pro Lys His Gly Val Ala Gly Ser Ile Phe Asp Trp Gln
Arg Arg 370 375 380Gly Leu Leu Ser Pro Glu Glu Gly Ala Leu Ala Gln
Val Ser Val Lys385 390 395 400Cys Tyr Ser Ala Met Tyr Ala Asp Ala
Leu Ala Thr Val Cys Leu Val 405 410 415Lys Arg Asp Ala Val Arg Ile
Arg Tyr Leu Leu Glu Gly Trp Arg Tyr 420 425 430Val Arg Ser Arg Val
Thr Asn Tyr Phe Ala Tyr Thr Arg Gln Gly Glu 435 440 445Arg Leu Ala
His Met His Glu Ile Ala Gln Glu Thr Arg Glu Leu Arg 450 455 460Glu
Ile Arg Ile Ala Gly Ser Leu Pro Ser Arg Ile Val Ile Val Gly465 470
475 480Gly Gly Leu Ala Gly Leu Ser Ala Ala Ile Glu Ala Ala Ser Cys
Gly 485 490 495Ala Gln Val Ile Leu Met Glu Lys Glu Gly Arg Ile Gly
Gly Asn Ser 500 505 510Ala Lys Ala Thr Ser Gly Ile Asn Gly Trp Gly
Thr Arg Thr Gln Ala 515 520 525Lys Ser Asp Ile Leu Asp Gly Gly Lys
Tyr Phe Glu Arg Asp Thr Phe 530 535 540Leu Ser Gly Val Gly Gly Thr
Thr Asp Pro Ala Leu Val Lys Val Leu545 550 555 560Ser Val Lys Ser
Gly Asp Ala Ile Gly Trp Leu Thr Ser Leu Gly Val 565 570 575Pro Leu
Ser Val Leu Ser Gln Leu Gly Gly His Ser Phe Lys Arg Thr 580 585
590His Arg Ala Pro Asp Lys Thr Asp Gly Thr Pro Leu Pro Ile Gly His
595 600 605Thr Ile Met Arg Thr Leu Glu Asp His Ile Arg Asn Asn Leu
Ser Glu 610 615 620Arg Val Thr Ile Met Thr His Val Ser Val Thr Glu
Leu Leu His Glu625 630 635 640Thr Asp Thr Thr Pro Asp Gly Ala Ser
Glu Val Arg Val Thr Gly Val 645 650 655Arg Tyr Arg Asp Leu Ser Asp
Val Asp Gly Gln Pro Ser Lys Leu Leu 660 665 670Ala Asp Ala Val Val
Leu Ala Thr Gly Gly Phe Ser Asn Asp Arg Glu 675 680 685Glu Asn Ser
Leu Leu Cys Lys Tyr Ala Pro His Leu Ala Ser Phe Pro 690 695 700Thr
Thr Asn Gly Pro Trp Ala Thr Gly Asp Gly Val Lys Leu Ala Thr705 710
715 720Ser Val Gly Ala Lys Leu Val Asp Met Asp Lys Val Gln Leu His
Pro 725 730 735Thr Gly Leu Ile Asp Pro Lys Asp Pro Ala Asn Thr Thr
Lys Ile Leu 740 745 750Gly Pro Glu Ala Leu Arg Gly Ser Gly Gly Ile
Leu Leu Asn Lys Gln 755 760 765Gly Lys Arg Phe Val Asn Glu Leu Asp
Leu Arg Ser Val Val Ser Lys 770 775 780Ala Ile Asn Thr Gln Gly Asn
Glu Tyr Pro Gly Ser Gly Gly Cys Tyr785 790 795 800Phe Ala Tyr Cys
Val Leu Asn Glu Asp Ala Thr Asn Leu Phe Cys Gly 805 810 815Gly Ala
Leu Gly Phe Tyr Gly Lys Lys Leu Gly Leu Phe Gln Arg Ala 820 825
830Glu Thr Val Glu Glu Leu Ala Lys Leu Ile Gly Cys Asp Glu Gly Glu
835 840 845Leu Arg Asp Thr Leu Glu Lys Tyr Glu Thr Cys Ser Lys Ala
Lys Val 850 855 860Ala Cys Pro Val Thr Gly Lys Val Val Phe Pro Cys
Val Val Gly Thr865 870 875 880Arg Gly Pro Tyr Asn Val Ala Phe Val
Thr Pro Ser Ile His Tyr Thr 885 890 895Met Gly Gly Cys Leu Ile Ser
Pro Ala Ala Glu Val Leu Gln Glu Tyr 900 905 910Lys Gly Leu Asn Ile
Leu Glu Asn His Arg Pro Ile Arg Cys Leu Phe 915 920 925Gly Ala Gly
Glu Val Thr Gly Gly Val His Gly Gly Asn Arg Leu Gly 930 935 940Gly
Asn Ser Leu Leu Glu Cys Val Val Phe Gly Lys Ile Ala Gly Asp945 950
955 960Arg Ala Ala Thr Ile Leu Gln Lys Arg Glu Ile Ala Leu Ser Lys
Thr 965 970 975Ser Trp Thr Ser Val Val Val Arg Glu Ser Arg Ser Gly
Glu Gln Phe 980 985 990Gly Thr Gly Ser Arg Val Leu Arg Phe Asn Leu
Pro Gly Ala Leu Gln 995 1000 1005Arg Thr Gly Leu Asn Leu Gly Glu
Phe Val Ala Ile Arg Gly Glu 1010 1015 1020Trp Asp Gly Gln Gln Leu
Val Gly Tyr Phe Ser Pro Ile Thr Leu 1025 1030 1035Pro Glu Asp Leu
Gly Thr Ile Ser Leu Leu Val Arg Ala Asp Lys 1040 1045 1050Gly Thr
Leu Lys Glu Trp Ile Cys Ala Leu Arg Pro Gly Asp Ser 1055 1060
1065Val Glu Ile Lys Ala Cys Gly Gly Leu Arg Ile Asp Gln Asp Pro
1070 1075 1080Val Lys Lys Cys Leu Leu Phe Arg Asn Arg Pro Ile Thr
Arg Phe 1085 1090 1095Ala Leu Val Ala Ala Gly Thr Gly Val Ala Pro
Met Leu Gln Val 1100 1105 1110Ile Arg Ala Ala Leu Lys Lys Pro Tyr
Val Asp Thr Leu Glu Ser 1115 1120 1125Ile Arg Leu Ile Tyr Ala Ala
Glu Glu Tyr Asp Thr Leu Thr Tyr 1130 1135 1140Arg Ser Ile Leu Gln
Arg Phe Ala Glu Glu Phe Pro Asp Lys Phe 1145 1150 1155Val Cys Asn
Phe Val Leu Asn Asn Pro Pro Glu Gly Trp Thr Gly 1160 1165 1170Gly
Val Gly Phe Val Asn Lys Lys Ser Leu Gln Lys Val Leu Gln 1175 1180
1185Pro Pro Ser Ser Glu Pro Leu Ile Val Val Cys Gly Pro Pro Val
1190 1195 1200Met Gln Arg Asp Val Lys Asn Glu Leu Leu Ser Met Gly
Tyr Asp 1205 1210 1215Lys Glu Leu Val His Thr Val Asp Gly Glu Ser
Gly Thr Leu 1220 1225 123023698DNATrypanosoma brucei 2atgctctcaa
cgaagcaact tctccttcga gccacatctg cattagtggc gggaagctct 60ggagttgcgc
gagacagccc ttcgcttgtc ggcgaccctt gcgactcggt ttcaccaacg
120cgggtcgtat gggggcgctt cttcaaatcc ctagcgccac ccgctccctc
ggttgtttca 180tgtcaaaagc gttttacgtc ccatggcgcc gatggtattt
cctcggcttc gattgttgtc 240actgacccgg aggcggcagc aaagaagcgt
gaccgcatgg cgcgcgagtt gctctcaagt 300aatagtggtc tttgtcaaga
agatgaaccc actatcatta acttaaaggg gttggagcac 360acgattccgt
acaggctcgc cgtggttctt tgtaactcgc gctctacagg tgaattcgaa
420gcaaaggcag ctgagatttt gcgaaaggca tttcacatgg tggactactc
cctcaattgt 480ttcaatcctg aaagcgagtt gtcgcgtgtc aactctctgc
cggtgggtga gaagcatcaa 540atgtcggagg atctccggca cgtgatggag
tgcaccatca gtgtacatca ctccagcgga 600atgggcttcg acccggcggc
aggtccaatt atcagccgac ttcggggggc aatgagggac 660cacaacgaca
tgtccgacat ttccgtaacg gaagccgagg tagagctctt ctccttagcg
720caaagttttg acgtggacct cgaggaggga acaatagctc gcaagcactc
tgaagcgagg 780cttgatcttg gtggtgtgaa caaaggctac acagttgatt
atgtagtgga tcatcttcgt 840gcggccggta tgccaaacgt gctctttgag
tggggcgggg atattcgagc gtcgggtagg 900aacatcaaag gaaacctatg
ggcagttgct atcaaacgac cgccatctgt ggaggaggtg 960attcggcgcg
ccaaagggaa aatgttaaaa atgggggagg aggagcagga agagaaggac
1020gatgattctc catccctgct tcatgtggtg gagcttgatg atgaagccct
ttgcaccagt 1080ggtgactacg aaaacgtttt gtatcatcca aagcatggag
tggcggggag catttttgac 1140tggcagcgaa gggggctact atctcctgag
gaaggggcac tcgctcaagt gtctgtgaaa 1200tgttatagcg caatgtacgc
tgatgctctg gcaacagtgt gccttgtgaa gcgtgatgct 1260gtgaggattc
gctacttatt agagggctgg cgttacgttc gaagtcgtgt gacgaattac
1320tttgcctata cccgtcaggg cgagcggtta gcacatatgc acgagatagc
gcaagaaaca 1380cgggagctac gtgaaatacg gattgccggg agtttgccct
ccagaattgt tattgtgggt 1440ggaggtctag cgggcctttc agcggccatc
gaagccgcaa gttgtggtgc acaagtcata 1500ctcatggaaa aggaaggaag
aatcgggggg aacagcgcaa aggctacatc aggtattaat 1560gggtggggga
cgcgtacgca ggcaaagtca gatattctcg acggtggaaa gtattttgag
1620cgtgacactt ttctctctgg cgttggcggt actaccgatc ctgccctcgt
caaagtgctc 1680tcagttaaga gtggggacgc aattggttgg cttacttctc
ttggtgtgcc actcagtgtc 1740ctctcgcaac ttggtggcca cagtttcaag
cgaacccacc gtgccccgga caaaacggac 1800gggacacccc taccaattgg
tcatacgatc atgagaaccc tcgaggatca catccgtaac 1860aacctctctg
agcgagtaac gattatgaca catgtgtccg tgaccgagtt attgcacgaa
1920accgatacaa cacctgatgg cgcctccgaa gtgcgtgtta cgggtgtaag
atacagggac 1980ctctccgatg tggatggcca gccatcaaaa ttgcttgcgg
atgccgtcgt tcttgcaact 2040ggtggtttct ccaatgaccg tgaagaaaat
tcactgctct gcaagtatgc gcctcacctg 2100gccagttttc caacgacaaa
tggcccctgg gcgaccggtg acggggttaa actcgcaaca 2160tcggttggtg
caaagcttgt ggatatggat aaggttcagc tacaccccac agggcttatc
2220gatccaaagg atcccgcgaa cacaacgaag attctcggcc cggaggcact
ccgaggttca 2280ggtgggatat tactcaacaa gcaaggaaag cgcttcgtga
atgaacttga cctccgctct 2340gttgtatcca aggcaattaa tacgcagggt
aatgaatacc ctggatccgg tggatgttac 2400tttgcgtact gcgtgctcaa
cgaagatgca acaaacctct tctgtggcgg tgcactgggg 2460ttctacggaa
agaagcttgg tttgttccag cgtgctgaga ctgtggaaga gttggccaaa
2520ctgattggct gtgacgaagg tgaattacgg gatacgcttg aaaagtatga
aacttgcagc 2580aaggccaaag ttgcgtgccc tgtgacgggg aaggtagtat
tcccttgtgt ggtgggtaca 2640agggggccgt acaatgttgc ttttgtcacg
ccttccattc attacacaat gggtggctgc 2700ctcatttcac cggctgctga
agttcttcag gagtacaaag gtttaaatat tctggaaaac 2760catagaccga
ttcgatgctt gtttggtgcc ggtgaagtga cgggtggtgt gcacggtggt
2820aaccgccttg gtggtaattc gctcttggaa tgtgtggtat tcgggaaaat
tgcgggtgac 2880cgtgccgcaa caatacttca aaaacgtgag atagccctct
ccaagacgag ttggacttcc 2940gttgttgtac gtgagtcccg ctccggcgaa
cagttcggga ccggctctcg tgttcttcgt 3000tttaacctac ctggggcgct
gcagcgcaca ggtctcaatc tgggcgaatt tgtggccatc 3060cgtggcgagt
gggacggcca acaacttgtt ggttacttca gtccaattac actaccagag
3120gaccttggca ctatctccct tctggttcgt gccgacaagg gcacattgaa
ggaatggatc 3180tgcgccttgc gaccgggcga ctccgtcgaa atcaaagcgt
gtggaggtct tcgtattgat 3240caagacccgg taaagaagtg tctgctgttt
cgtaaccggc ctattacgcg gtttgctctt 3300gtcgcggcag ggactggtgt
cgcgcccatg ttgcaggtta ttcgtgcggc actcaagaag 3360ccttacgtgg
acacgttgga aagcatccgt cttatatacg ccgcagaaga gtacgacaca
3420ttgacgtatc gctcaatttt gcagcggttt gcggaagagt tccccgacaa
gttcgtctgc 3480aacttcgttc ttaacaaccc acccgaaggg tggacaggtg
gagtggggtt tgtcaacaaa 3540aaatccctgc agaaggtgct gcaaccgcca
tcgagtgagc cgctgattgt tgtgtgtgga 3600ccgcccgtga tgcagcgcga
cgtgaagaat gagttactga gcatgggtta tgacaaagag 3660ctcgttcata
cggttgacgg cgagtcggga acgctgta 369831164PRTArtificial sequenceFRDm
Trypanosoma lacking 68 aa targeting signal 3Met Ala Asp Gly Ile Ser
Ser Ala Ser Ile Val Val Thr Asp Pro Glu1 5 10 15Ala Ala Ala Lys Lys
Arg Asp Arg Met Ala Arg Glu Leu Leu Ser Ser 20 25 30Asn Ser Gly Leu
Cys Gln Glu Asp Glu Pro Thr Ile Ile Asn Leu Lys 35 40 45Gly Leu Glu
His Thr Ile Pro Tyr Arg Leu Ala Val Val Leu Cys Asn 50 55 60Ser Arg
Ser Thr Gly Glu Phe Glu Ala Lys Ala Ala Glu Ile Leu Arg65 70 75
80Lys Ala Phe His Met Val Asp Tyr Ser Leu Asn Cys Phe Asn Pro Glu
85 90 95Ser Glu Leu Ser Arg Val Asn Ser Leu Pro Val Gly Glu Lys His
Gln 100 105 110Met Ser Glu Asp Leu Arg His Val Met Glu Cys Thr Ile
Ser Val His 115 120 125His Ser Ser Gly Met Gly Phe Asp Pro Ala Ala
Gly Pro Ile Ile Ser 130 135 140Arg Leu Arg Gly Ala Met Arg Asp His
Asn Asp Met Ser Asp Ile Ser145 150 155 160Val Thr Glu Ala Glu Val
Glu Leu Phe Ser Leu Ala Gln Ser Phe Asp 165 170 175Val Asp Leu Glu
Glu Gly Thr Ile Ala Arg Lys His Ser Glu Ala Arg 180 185 190Leu Asp
Leu Gly Gly Val Asn Lys Gly Tyr Thr Val Asp Tyr Val Val 195 200
205Asp His Leu Arg Ala Ala Gly Met Pro Asn Val Leu Phe Glu Trp Gly
210 215 220Gly Asp Ile Arg Ala Ser Gly Arg Asn Ile Lys Gly Asn Leu
Trp Ala225 230 235 240Val Ala Ile Lys Arg Pro Pro Ser Val Glu Glu
Val Ile Arg Arg Ala 245 250 255Lys Gly Lys Met Leu Lys Met Gly Glu
Glu Glu Gln Glu Glu Lys Asp 260 265 270Asp Asp Ser Pro Ser Leu Leu
His Val Val Glu Leu Asp Asp Glu Ala 275 280 285Leu Cys Thr Ser Gly
Asp Tyr Glu Asn Val Leu Tyr His Pro Lys His 290 295 300Gly Val Ala
Gly Ser Ile Phe Asp Trp Gln Arg Arg Gly Leu Leu Ser305 310 315
320Pro Glu Glu Gly Ala Leu Ala Gln Val Ser Val Lys Cys Tyr Ser Ala
325 330 335Met Tyr Ala Asp Ala Leu Ala Thr Val Cys Leu Val Lys Arg
Asp Ala 340 345 350Val Arg Ile Arg Tyr Leu Leu Glu Gly Trp Arg Tyr
Val Arg Ser Arg 355 360 365Val Thr Asn Tyr Phe Ala Tyr Thr Arg Gln
Gly Glu Arg Leu Ala His 370 375 380Met His Glu Ile Ala Gln Glu Thr
Arg Glu Leu Arg Glu Ile Arg Ile385 390 395 400Ala Gly Ser Leu Pro
Ser Arg Ile Val Ile Val Gly Gly Gly Leu Ala 405 410 415Gly Leu Ser
Ala Ala Ile Glu Ala Ala Ser Cys Gly Ala Gln Val Ile 420 425 430Leu
Met Glu Lys Glu Gly Arg Ile Gly Gly Asn Ser Ala Lys Ala Thr 435 440
445Ser Gly Ile Asn Gly Trp Gly Thr Arg Thr Gln Ala Lys Ser Asp Ile
450 455 460Leu Asp Gly Gly Lys Tyr Phe Glu Arg Asp Thr Phe Leu Ser
Gly Val465 470 475 480Gly Gly Thr Thr Asp Pro Ala Leu Val Lys Val
Leu Ser Val Lys Ser 485 490 495Gly Asp Ala Ile Gly Trp Leu Thr Ser
Leu Gly Val Pro Leu Ser Val 500 505 510Leu Ser Gln Leu Gly Gly His
Ser Phe Lys Arg Thr His Arg Ala Pro 515 520 525Asp Lys Thr Asp Gly
Thr Pro Leu Pro Ile Gly His Thr Ile Met Arg 530 535 540Thr Leu Glu
Asp His Ile Arg Asn Asn Leu Ser Glu Arg Val Thr Ile545 550 555
560Met Thr His Val Ser Val Thr Glu Leu Leu His Glu Thr Asp Thr Thr
565 570 575Pro Asp Gly Ala Ser Glu Val Arg Val Thr Gly Val Arg Tyr
Arg Asp 580 585 590Leu Ser Asp Val Asp Gly Gln Pro Ser Lys Leu Leu
Ala Asp Ala Val 595 600 605Val Leu
Ala Thr Gly Gly Phe Ser Asn Asp Arg Glu Glu Asn Ser Leu 610 615
620Leu Cys Lys Tyr Ala Pro His Leu Ala Ser Phe Pro Thr Thr Asn
Gly625 630 635 640Pro Trp Ala Thr Gly Asp Gly Val Lys Leu Ala Thr
Ser Val Gly Ala 645 650 655Lys Leu Val Asp Met Asp Lys Val Gln Leu
His Pro Thr Gly Leu Ile 660 665 670Asp Pro Lys Asp Pro Ala Asn Thr
Thr Lys Ile Leu Gly Pro Glu Ala 675 680 685Leu Arg Gly Ser Gly Gly
Ile Leu Leu Asn Lys Gln Gly Lys Arg Phe 690 695 700Val Asn Glu Leu
Asp Leu Arg Ser Val Val Ser Lys Ala Ile Asn Thr705 710 715 720Gln
Gly Asn Glu Tyr Pro Gly Ser Gly Gly Cys Tyr Phe Ala Tyr Cys 725 730
735Val Leu Asn Glu Asp Ala Thr Asn Leu Phe Cys Gly Gly Ala Leu Gly
740 745 750Phe Tyr Gly Lys Lys Leu Gly Leu Phe Gln Arg Ala Glu Thr
Val Glu 755 760 765Glu Leu Ala Lys Leu Ile Gly Cys Asp Glu Gly Glu
Leu Arg Asp Thr 770 775 780Leu Glu Lys Tyr Glu Thr Cys Ser Lys Ala
Lys Val Ala Cys Pro Val785 790 795 800Thr Gly Lys Val Val Phe Pro
Cys Val Val Gly Thr Arg Gly Pro Tyr 805 810 815Asn Val Ala Phe Val
Thr Pro Ser Ile His Tyr Thr Met Gly Gly Cys 820 825 830Leu Ile Ser
Pro Ala Ala Glu Val Leu Gln Glu Tyr Lys Gly Leu Asn 835 840 845Ile
Leu Glu Asn His Arg Pro Ile Arg Cys Leu Phe Gly Ala Gly Glu 850 855
860Val Thr Gly Gly Val His Gly Gly Asn Arg Leu Gly Gly Asn Ser
Leu865 870 875 880Leu Glu Cys Val Val Phe Gly Lys Ile Ala Gly Asp
Arg Ala Ala Thr 885 890 895Ile Leu Gln Lys Arg Glu Ile Ala Leu Ser
Lys Thr Ser Trp Thr Ser 900 905 910Val Val Val Arg Glu Ser Arg Ser
Gly Glu Gln Phe Gly Thr Gly Ser 915 920 925Arg Val Leu Arg Phe Asn
Leu Pro Gly Ala Leu Gln Arg Thr Gly Leu 930 935 940Asn Leu Gly Glu
Phe Val Ala Ile Arg Gly Glu Trp Asp Gly Gln Gln945 950 955 960Leu
Val Gly Tyr Phe Ser Pro Ile Thr Leu Pro Glu Asp Leu Gly Thr 965 970
975Ile Ser Leu Leu Val Arg Ala Asp Lys Gly Thr Leu Lys Glu Trp Ile
980 985 990Cys Ala Leu Arg Pro Gly Asp Ser Val Glu Ile Lys Ala Cys
Gly Gly 995 1000 1005Leu Arg Ile Asp Gln Asp Pro Val Lys Lys Cys
Leu Leu Phe Arg 1010 1015 1020Asn Arg Pro Ile Thr Arg Phe Ala Leu
Val Ala Ala Gly Thr Gly 1025 1030 1035Val Ala Pro Met Leu Gln Val
Ile Arg Ala Ala Leu Lys Lys Pro 1040 1045 1050Tyr Val Asp Thr Leu
Glu Ser Ile Arg Leu Ile Tyr Ala Ala Glu 1055 1060 1065Glu Tyr Asp
Thr Leu Thr Tyr Arg Ser Ile Leu Gln Arg Phe Ala 1070 1075 1080Glu
Glu Phe Pro Asp Lys Phe Val Cys Asn Phe Val Leu Asn Asn 1085 1090
1095Pro Pro Glu Gly Trp Thr Gly Gly Val Gly Phe Val Asn Lys Lys
1100 1105 1110Ser Leu Gln Lys Val Leu Gln Pro Pro Ser Ser Glu Pro
Leu Ile 1115 1120 1125Val Val Cys Gly Pro Pro Val Met Gln Arg Asp
Val Lys Asn Glu 1130 1135 1140Leu Leu Ser Met Gly Tyr Asp Lys Glu
Leu Val His Thr Val Asp 1145 1150 1155Gly Glu Ser Gly Thr Leu
116041142PRTTrypanosoma brucei 4Met Val Asp Gly Arg Ser Ser Ala Ser
Ile Val Ala Val Asp Pro Glu1 5 10 15Arg Ala Ala Arg Glu Arg Asp Ala
Ala Ala Arg Ala Leu Leu Gln Asp 20 25 30Ser Pro Leu His Thr Thr Met
Gln Tyr Ala Thr Ser Gly Leu Glu Leu 35 40 45Thr Val Pro Tyr Ala Leu
Lys Val Val Ala Ser Ala Asp Thr Phe Asp 50 55 60Arg Ala Lys Glu Val
Ala Asp Glu Val Leu Arg Cys Ala Trp Gln Leu65 70 75 80Ala Asp Thr
Val Leu Asn Ser Phe Asn Pro Asn Ser Glu Val Ser Leu 85 90 95Val Gly
Arg Leu Pro Val Gly Gln Lys His Gln Met Ser Ala Pro Leu 100 105
110Lys Arg Val Met Ala Cys Cys Gln Arg Val Tyr Asn Ser Ser Ala Gly
115 120 125Cys Phe Asp Pro Ser Thr Ala Pro Val Ala Lys Ala Leu Arg
Glu Ile 130 135 140Ala Leu Gly Lys Glu Arg Asn Asn Ala Cys Leu Glu
Ala Leu Thr Gln145 150 155 160Ala Cys Thr Leu Pro Asn Ser Phe Val
Ile Asp Phe Glu Ala Gly Thr 165 170 175Ile Ser Arg Lys His Glu His
Ala Ser Leu Asp Leu Gly Gly Val Ser 180 185 190Lys Gly Tyr Ile Val
Asp Tyr Val Ile Asp Asn Ile Asn Ala Ala Gly 195 200 205Phe Gln Asn
Val Phe Phe Asp Trp Gly Gly Asp Cys Arg Ala Ser Gly 210 215 220Met
Asn Ala Arg Asn Thr Pro Trp Val Val Gly Ile Thr Arg Pro Pro225 230
235 240Ser Leu Asp Met Leu Pro Asn Pro Pro Lys Glu Ala Ser Tyr Ile
Ser 245 250 255Val Ile Ser Leu Asp Asn Glu Ala Leu Ala Thr Ser Gly
Asp Tyr Glu 260 265 270Asn Leu Ile Tyr Thr Ala Asp Asp Lys Pro Leu
Thr Cys Thr Tyr Asp 275 280 285Trp Lys Gly Lys Glu Leu Met Lys Pro
Ser Gln Ser Asn Ile Ala Gln 290 295 300Val Ser Val Lys Cys Tyr Ser
Ala Met Tyr Ala Asp Ala Leu Ala Thr305 310 315 320Ala Cys Phe Ile
Lys Arg Asp Pro Ala Lys Val Arg Gln Leu Leu Asp 325 330 335Gly Trp
Arg Tyr Val Arg Asp Thr Val Arg Asp Tyr Arg Val Tyr Val 340 345
350Arg Glu Asn Glu Arg Val Ala Lys Met Phe Glu Ile Ala Thr Glu Asp
355 360 365Ala Glu Met Arg Lys Arg Arg Ile Ser Asn Thr Leu Pro Ala
Arg Val 370 375 380Ile Val Val Gly Gly Gly Leu Ala Gly Leu Ser Ala
Ala Ile Glu Ala385 390 395 400Ala Gly Cys Gly Ala Gln Val Val Leu
Met Glu Lys Glu Ala Lys Leu 405 410 415Gly Gly Asn Ser Ala Lys Ala
Thr Ser Gly Ile Asn Gly Trp Gly Thr 420 425 430Arg Ala Gln Ala Lys
Ala Ser Ile Val Asp Gly Gly Lys Tyr Phe Glu 435 440 445Arg Asp Thr
Tyr Lys Ser Gly Ile Gly Gly Asn Thr Asp Pro Ala Leu 450 455 460Val
Lys Thr Leu Ser Met Lys Ser Ala Asp Ala Ile Gly Trp Leu Thr465 470
475 480Ser Leu Gly Val Pro Leu Thr Val Leu Ser Gln Leu Gly Gly His
Ser 485 490 495Arg Lys Arg Thr His Arg Ala Pro Asp Lys Lys Asp Gly
Thr Pro Leu 500 505 510Pro Ile Gly Phe Thr Ile Met Lys Thr Leu Glu
Asp His Val Arg Gly 515 520 525Asn Leu Ser Gly Arg Ile Thr Ile Met
Glu Asn Cys Ser Val Thr Ser 530 535 540Leu Leu Ser Glu Thr Lys Glu
Arg Pro Asp Gly Thr Lys Gln Ile Arg545 550 555 560Val Thr Gly Val
Glu Phe Thr Gln Ala Gly Ser Gly Lys Thr Thr Ile 565 570 575Leu Ala
Asp Ala Val Ile Leu Ala Thr Gly Gly Phe Ser Asn Asp Lys 580 585
590Thr Ala Asp Ser Leu Leu Arg Glu His Ala Pro His Leu Val Asn Phe
595 600 605Pro Thr Thr Asn Gly Pro Trp Ala Thr Gly Asp Gly Val Lys
Leu Ala 610 615 620Gln Arg Leu Gly Ala Gln Leu Val Asp Met Asp Lys
Val Gln Leu His625 630 635 640Pro Thr Gly Leu Ile Asn Pro Lys Asp
Pro Ala Asn Pro Thr Lys Phe 645 650 655Leu Gly Pro Glu Ala Leu Arg
Gly Ser Gly Gly Val Leu Leu Asn Lys 660 665 670Gln Gly Lys Arg Phe
Val Asn Glu Leu Asp Leu Arg Ser Val Val Ser 675 680 685Lys Ala Ile
Met Glu Gln Gly Ala Glu Tyr Pro Gly Ser Gly Gly Ser 690 695 700Met
Phe Ala Tyr Cys Val Leu Asn Ala Ala Ala Gln Lys Leu Phe Gly705 710
715 720Val Ser Ser His Glu Phe Tyr Trp Lys Lys Met Gly Leu Phe Val
Lys 725 730 735Ala Asp Thr Met Arg Asp Leu Ala Ala Leu Ile Gly Cys
Pro Val Glu 740 745 750Ser Val Gln Gln Thr Leu Glu Glu Tyr Glu Arg
Leu Ser Ile Ser Gln 755 760 765Arg Ser Cys Pro Ile Thr Arg Lys Ser
Val Tyr Pro Cys Val Leu Gly 770 775 780Thr Lys Gly Pro Tyr Tyr Val
Ala Phe Val Thr Pro Ser Ile His Tyr785 790 795 800Thr Met Gly Gly
Cys Leu Ile Ser Pro Ser Ala Glu Ile Gln Met Lys 805 810 815Asn Thr
Ser Ser Arg Ala Pro Leu Ser His Ser Asn Pro Ile Leu Gly 820 825
830Leu Phe Gly Ala Gly Glu Val Thr Gly Gly Val His Gly Gly Asn Arg
835 840 845Leu Gly Gly Asn Ser Leu Leu Glu Cys Val Val Phe Gly Arg
Ile Ala 850 855 860Gly Asp Arg Ala Ser Thr Ile Leu Gln Arg Lys Ser
Ser Ala Leu Ser865 870 875 880Phe Lys Val Trp Thr Thr Val Val Leu
Arg Glu Val Arg Glu Gly Gly 885 890 895Val Tyr Gly Ala Gly Ser Arg
Val Leu Arg Phe Asn Leu Pro Gly Ala 900 905 910Leu Gln Arg Ser Gly
Leu Ser Leu Gly Gln Phe Ile Ala Ile Arg Gly 915 920 925Asp Trp Asp
Gly Gln Gln Leu Ile Gly Tyr Tyr Ser Pro Ile Thr Leu 930 935 940Pro
Asp Asp Leu Gly Met Ile Asp Ile Leu Ala Arg Ser Asp Lys Gly945 950
955 960Thr Leu Arg Glu Trp Ile Ser Ala Leu Glu Pro Gly Asp Ala Val
Glu 965 970 975Met Lys Ala Cys Gly Gly Leu Val Ile Glu Arg Arg Leu
Ser Asp Lys 980 985 990His Phe Val Phe Met Gly His Ile Ile Asn Lys
Leu Cys Leu Ile Ala 995 1000 1005Gly Gly Thr Gly Val Ala Pro Met
Leu Gln Ile Ile Lys Ala Ala 1010 1015 1020Phe Met Lys Pro Phe Ile
Asp Thr Leu Glu Ser Val His Leu Ile 1025 1030 1035Tyr Ala Ala Glu
Asp Val Thr Glu Leu Thr Tyr Arg Glu Val Leu 1040 1045 1050Glu Glu
Arg Arg Arg Glu Ser Arg Gly Lys Phe Lys Lys Thr Phe 1055 1060
1065Val Leu Asn Arg Pro Pro Pro Leu Trp Thr Asp Gly Val Gly Phe
1070 1075 1080Ile Asp Arg Gly Ile Leu Thr Asn His Val Gln Pro Pro
Ser Asp 1085 1090 1095Asn Leu Leu Val Ala Ile Cys Gly Pro Pro Val
Met Gln Arg Ile 1100 1105 1110Val Lys Ala Thr Leu Lys Thr Leu Gly
Tyr Asn Met Asn Leu Val 1115 1120 1125Arg Thr Val Asp Glu Thr Glu
Pro Ser Gly Ser Ser Lys Ile 1130 1135 114053429DNATrypanosoma
brucei 5atggtagacg ggcgatcttc tgcatcaatt gttgccgttg atcccgaaag
ggctgcgcgt 60gagcgcgacg cagcagcgcg tgcccttctt caagacagtc cgctacacac
gaccatgcaa 120tatgcaacgt ctggtcttga gcttaccgtt ccctatgcac
ttaaggtggt tgccagtgct 180gacaccttcg atcgcgctaa ggaggttgcc
gatgaggtgc tacgctgcgc atggcaactc 240gccgacaccg tgttgaacag
tttcaacccg aacagtgagg tttcactcgt gggtcgcctg 300cctgtggggc
agaagcacca aatgtctgct ccactcaagc gtgtgatggc atgctgccag
360cgtgtgtata actcatcggc tggatgtttt gatccctcca cagcacccgt
cgcaaaggcg 420ctgcgtgaga ttgcactggg gaaggagcgg aacaatgctt
gtctggaggc acttactcaa 480gcgtgtacgc ttcccaacag ttttgtgatc
gatttcgaag ctggaactat cagccgtaag 540cacgagcatg cgtctctgga
cctaggtggg gttagcaaag gttatatcgt tgattatgtc 600attgataata
tcaatgctgc tggatttcaa aacgtttttt ttgactgggg tggagactgc
660cgtgcgagtg gtatgaatgc gcgcaatacc ccgtgggttg ttggtataac
tcgccctccg 720tcccttgata tgctccctaa cccgccaaag gaggcgtcgt
atatcagcgt tatctctctc 780gacaacgagg cccttgccac gagtggcgat
tatgaaaact taatatacac cgctgatgat 840aaacccctta cctgcactta
tgactggaag gggaaggaac tgatgaaacc ttctcagtcc 900aatatcgcgc
aggtatcggt taaatgttat agcgccatgt acgctgacgc gcttgcgact
960gcgtgtttca taaagcggga tcccgcgaag gttcgacagc tgctggacgg
ttggcgttac 1020gtgcgtgata cagtgagaga ttacagggtc tacgttcgtg
aaaatgagcg agtagcgaag 1080atgtttgaga tcgccacaga ggatgcggaa
atgaggaaga ggcggatcag caacacactt 1140cccgctcgtg tcattgtggt
gggcggtggt cttgcgggtt tgtccgcggc catcgaagct 1200gcaggatgcg
gtgctcaggt tgtgcttatg gagaaggagg cgaagctcgg aggcaacagc
1260gccaaggcga catctggtat caacggatgg ggcacacgtg ctcaggcgaa
ggcaagcatt 1320gtggatggtg ggaaatactt cgagcgtgac acatacaagt
ctggtatcgg gggtaacacc 1380gatcctgccc ttgtgaagac actttctatg
aaaagtgctg acgctattgg gtggctgacc 1440tcgttgggtg taccgctgac
ggtattgtca cagcttgggg gtcacagccg caagcgcaca 1500catcgggcac
cggataagaa agatggtaca cctctaccta tcggatttac aatcatgaaa
1560accctcgagg atcacgtgcg tggtaacctt tctggccgca tcaccataat
ggaaaactgc 1620agtgtaacgt cgttgctcag tgagacgaag gaacggccag
atggcactaa acagatacga 1680gttactggtg tggagttcac gcaggctggc
agtgggaaga cgaccatact tgcagatgct 1740gtcatccttg ccactggtgg
attttctaac gacaaaactg cagactccct gcttcgtgag 1800cacgccccgc
acttggtcaa cttccctacg acgaatggcc cgtgggcgac aggtgatggc
1860gtgaaacttg cacagcgact tggcgctcaa ctggtggata tggacaaggt
ccagttgcat 1920ccgacaggcc tcatcaaccc gaaggatcca gcgaacccta
caaagttcct tggacctgag 1980gcgctacgtg gatccggtgg cgttttgttg
aacaagcaag gcaagcgctt cgttaatgaa 2040cttgacctcc gttctgtggt
atcgaaagcc atcatggaac agggtgcgga atatcctgga 2100tcgggtggta
gcatgttcgc ctactgtgtg ttgaatgctg cggcgcagaa gctctttggt
2160gtcagctcac acgagttcta ctggaagaag atgggtctct tcgtgaaggc
tgacaccatg 2220agggacctcg ctgcactcat tgggtgccca gtggaatctg
tgcagcagac gctggaggag 2280tacgagcggc tctccatatc acagcgttcc
tgccccatca cgcgcaaaag cgtctatccg 2340tgcgtgctcg gcactaaggg
cccctactac gtcgccttcg tgacaccttc gattcactac 2400acaatgggtg
gatgtctcat ctcgccttct gctgaaatac aaatgaagaa cacatcatca
2460cgcgctccac tgagtcacag caacccaatc ctcgggttat ttggtgccgg
tgaggtaacg 2520ggtggtgtgc acggtgggaa ccggttgggc ggcaattcgc
tgcttgagtg cgtcgtgttt 2580gggagaattg cgggtgatcg ggcctcgacc
atccttcaga ggaagtcctc agcactttcc 2640ttcaaggtgt ggacgaccgt
ggtgctgcgt gaagtacgcg aaggtggtgt gtacggtgct 2700gggtcccgcg
tgcttcgctt taatttaccc ggggcgctgc aacggtctgg tctgagcctc
2760ggccaattta tcgcaattcg tggtgattgg gacggtcagc agttgatcgg
ttattacagt 2820cccatcacgc tgccagatga tcttggcatg atcgatatac
tcgcccgcag tgataagggg 2880acgctgaggg agtggatttc cgctctggag
ccgggtgacg ctgtggagat gaaggcatgc 2940ggtggtctgg tgattgagcg
ccgcttaagc gataagcact ttgtgttcat gggacacatt 3000atcaacaagc
tttgtctaat tgctggtgga acgggtgtgg caccgatgct gcaaataatc
3060aaagcagcct ttatgaaacc cttcattgac acattggaga gcgttcatct
catctatgcc 3120gcggaggacg tgacggagtt gacgtatcgc gaggtgctgg
aggagcgccg tcgtgagtca 3180cgtggaaagt tcaagaaaac gtttgtcctc
aaccggcccc cgcccctatg gactgatggt 3240gttggcttca tcgaccgggg
catcctcaca aatcatgtgc agccgccatc tgacaacctg 3300ctggtggcca
tatgcggacc accggtaatg cagcgcattg taaaggcgac cctgaagact
3360ttgggctaca acatgaacct tgtgaggact gtggatgaaa cggagccgag
cggctcatcc 3420aaaatttga 342961139PRTArtificial sequenceFRDg
lacking 3 aa C-terminal targeting signal 6Met Val Asp Gly Arg Ser
Ser Ala Ser Ile Val Ala Val Asp Pro Glu1 5 10 15Arg Ala Ala Arg Glu
Arg Asp Ala Ala Ala Arg Ala Leu Leu Gln Asp 20 25 30Ser Pro Leu His
Thr Thr Met Gln Tyr Ala Thr Ser Gly Leu Glu Leu 35 40 45Thr Val Pro
Tyr Ala Leu Lys Val Val Ala Ser Ala Asp Thr Phe Asp 50 55 60Arg Ala
Lys Glu Val Ala Asp Glu Val Leu Arg Cys Ala Trp Gln Leu65 70 75
80Ala Asp Thr Val Leu Asn Ser Phe Asn Pro Asn Ser Glu Val Ser Leu
85 90 95Val Gly Arg Leu Pro Val Gly Gln Lys His Gln Met Ser Ala Pro
Leu 100 105 110Lys Arg Val Met Ala Cys Cys Gln Arg Val Tyr Asn Ser
Ser Ala Gly 115 120 125Cys Phe Asp Pro Ser Thr Ala Pro Val Ala Lys
Ala Leu Arg Glu Ile 130 135 140Ala Leu Gly Lys Glu Arg Asn Asn Ala
Cys Leu Glu Ala Leu Thr Gln145 150 155 160Ala Cys Thr Leu Pro Asn
Ser Phe Val Ile Asp Phe Glu Ala Gly Thr 165 170 175Ile Ser Arg Lys
His Glu His Ala Ser Leu Asp Leu
Gly Gly Val Ser 180 185 190Lys Gly Tyr Ile Val Asp Tyr Val Ile Asp
Asn Ile Asn Ala Ala Gly 195 200 205Phe Gln Asn Val Phe Phe Asp Trp
Gly Gly Asp Cys Arg Ala Ser Gly 210 215 220Met Asn Ala Arg Asn Thr
Pro Trp Val Val Gly Ile Thr Arg Pro Pro225 230 235 240Ser Leu Asp
Met Leu Pro Asn Pro Pro Lys Glu Ala Ser Tyr Ile Ser 245 250 255Val
Ile Ser Leu Asp Asn Glu Ala Leu Ala Thr Ser Gly Asp Tyr Glu 260 265
270Asn Leu Ile Tyr Thr Ala Asp Asp Lys Pro Leu Thr Cys Thr Tyr Asp
275 280 285Trp Lys Gly Lys Glu Leu Met Lys Pro Ser Gln Ser Asn Ile
Ala Gln 290 295 300Val Ser Val Lys Cys Tyr Ser Ala Met Tyr Ala Asp
Ala Leu Ala Thr305 310 315 320Ala Cys Phe Ile Lys Arg Asp Pro Ala
Lys Val Arg Gln Leu Leu Asp 325 330 335Gly Trp Arg Tyr Val Arg Asp
Thr Val Arg Asp Tyr Arg Val Tyr Val 340 345 350Arg Glu Asn Glu Arg
Val Ala Lys Met Phe Glu Ile Ala Thr Glu Asp 355 360 365Ala Glu Met
Arg Lys Arg Arg Ile Ser Asn Thr Leu Pro Ala Arg Val 370 375 380Ile
Val Val Gly Gly Gly Leu Ala Gly Leu Ser Ala Ala Ile Glu Ala385 390
395 400Ala Gly Cys Gly Ala Gln Val Val Leu Met Glu Lys Glu Ala Lys
Leu 405 410 415Gly Gly Asn Ser Ala Lys Ala Thr Ser Gly Ile Asn Gly
Trp Gly Thr 420 425 430Arg Ala Gln Ala Lys Ala Ser Ile Val Asp Gly
Gly Lys Tyr Phe Glu 435 440 445Arg Asp Thr Tyr Lys Ser Gly Ile Gly
Gly Asn Thr Asp Pro Ala Leu 450 455 460Val Lys Thr Leu Ser Met Lys
Ser Ala Asp Ala Ile Gly Trp Leu Thr465 470 475 480Ser Leu Gly Val
Pro Leu Thr Val Leu Ser Gln Leu Gly Gly His Ser 485 490 495Arg Lys
Arg Thr His Arg Ala Pro Asp Lys Lys Asp Gly Thr Pro Leu 500 505
510Pro Ile Gly Phe Thr Ile Met Lys Thr Leu Glu Asp His Val Arg Gly
515 520 525Asn Leu Ser Gly Arg Ile Thr Ile Met Glu Asn Cys Ser Val
Thr Ser 530 535 540Leu Leu Ser Glu Thr Lys Glu Arg Pro Asp Gly Thr
Lys Gln Ile Arg545 550 555 560Val Thr Gly Val Glu Phe Thr Gln Ala
Gly Ser Gly Lys Thr Thr Ile 565 570 575Leu Ala Asp Ala Val Ile Leu
Ala Thr Gly Gly Phe Ser Asn Asp Lys 580 585 590Thr Ala Asp Ser Leu
Leu Arg Glu His Ala Pro His Leu Val Asn Phe 595 600 605Pro Thr Thr
Asn Gly Pro Trp Ala Thr Gly Asp Gly Val Lys Leu Ala 610 615 620Gln
Arg Leu Gly Ala Gln Leu Val Asp Met Asp Lys Val Gln Leu His625 630
635 640Pro Thr Gly Leu Ile Asn Pro Lys Asp Pro Ala Asn Pro Thr Lys
Phe 645 650 655Leu Gly Pro Glu Ala Leu Arg Gly Ser Gly Gly Val Leu
Leu Asn Lys 660 665 670Gln Gly Lys Arg Phe Val Asn Glu Leu Asp Leu
Arg Ser Val Val Ser 675 680 685Lys Ala Ile Met Glu Gln Gly Ala Glu
Tyr Pro Gly Ser Gly Gly Ser 690 695 700Met Phe Ala Tyr Cys Val Leu
Asn Ala Ala Ala Gln Lys Leu Phe Gly705 710 715 720Val Ser Ser His
Glu Phe Tyr Trp Lys Lys Met Gly Leu Phe Val Lys 725 730 735Ala Asp
Thr Met Arg Asp Leu Ala Ala Leu Ile Gly Cys Pro Val Glu 740 745
750Ser Val Gln Gln Thr Leu Glu Glu Tyr Glu Arg Leu Ser Ile Ser Gln
755 760 765Arg Ser Cys Pro Ile Thr Arg Lys Ser Val Tyr Pro Cys Val
Leu Gly 770 775 780Thr Lys Gly Pro Tyr Tyr Val Ala Phe Val Thr Pro
Ser Ile His Tyr785 790 795 800Thr Met Gly Gly Cys Leu Ile Ser Pro
Ser Ala Glu Ile Gln Met Lys 805 810 815Asn Thr Ser Ser Arg Ala Pro
Leu Ser His Ser Asn Pro Ile Leu Gly 820 825 830Leu Phe Gly Ala Gly
Glu Val Thr Gly Gly Val His Gly Gly Asn Arg 835 840 845Leu Gly Gly
Asn Ser Leu Leu Glu Cys Val Val Phe Gly Arg Ile Ala 850 855 860Gly
Asp Arg Ala Ser Thr Ile Leu Gln Arg Lys Ser Ser Ala Leu Ser865 870
875 880Phe Lys Val Trp Thr Thr Val Val Leu Arg Glu Val Arg Glu Gly
Gly 885 890 895Val Tyr Gly Ala Gly Ser Arg Val Leu Arg Phe Asn Leu
Pro Gly Ala 900 905 910Leu Gln Arg Ser Gly Leu Ser Leu Gly Gln Phe
Ile Ala Ile Arg Gly 915 920 925Asp Trp Asp Gly Gln Gln Leu Ile Gly
Tyr Tyr Ser Pro Ile Thr Leu 930 935 940Pro Asp Asp Leu Gly Met Ile
Asp Ile Leu Ala Arg Ser Asp Lys Gly945 950 955 960Thr Leu Arg Glu
Trp Ile Ser Ala Leu Glu Pro Gly Asp Ala Val Glu 965 970 975Met Lys
Ala Cys Gly Gly Leu Val Ile Glu Arg Arg Leu Ser Asp Lys 980 985
990His Phe Val Phe Met Gly His Ile Ile Asn Lys Leu Cys Leu Ile Ala
995 1000 1005Gly Gly Thr Gly Val Ala Pro Met Leu Gln Ile Ile Lys
Ala Ala 1010 1015 1020Phe Met Lys Pro Phe Ile Asp Thr Leu Glu Ser
Val His Leu Ile 1025 1030 1035Tyr Ala Ala Glu Asp Val Thr Glu Leu
Thr Tyr Arg Glu Val Leu 1040 1045 1050Glu Glu Arg Arg Arg Glu Ser
Arg Gly Lys Phe Lys Lys Thr Phe 1055 1060 1065Val Leu Asn Arg Pro
Pro Pro Leu Trp Thr Asp Gly Val Gly Phe 1070 1075 1080Ile Asp Arg
Gly Ile Leu Thr Asn His Val Gln Pro Pro Ser Asp 1085 1090 1095Asn
Leu Leu Val Ala Ile Cys Gly Pro Pro Val Met Gln Arg Ile 1100 1105
1110Val Lys Ala Thr Leu Lys Thr Leu Gly Tyr Asn Met Asn Leu Val
1115 1120 1125Arg Thr Val Asp Glu Thr Glu Pro Ser Gly Ser 1130
113573498DNAArtificial sequenceFRDm1 codon optimised for A. niger
7atgggtgccg atggtatctc ctctgcctcc attgtcgtca ccgaccccga ggctgctgcc
60aagaagcgtg accgcatggc ccgtgagctc ctctcctcca actccggtct ttgccaggag
120gatgagccca ccatcatcaa cctgaagggt ctggaacaca ccatccccta
ccgtcttgct 180gttgtccttt gcaactctcg cagcactggt gaattcgagg
ccaaggctgc tgagatcctc 240cgcaaggctt tccacatggt tgactactct
ctgaactgct tcaaccccga gtccgagctc 300tcccgtgtca acagcttgcc
tgtcggtgag aagcaccaga tgagcgaaga tctgcgccac 360gtcatggagt
gcaccatctc cgtccaccac tcctctggca tgggtttcga ccctgctgct
420ggtcccatca tctcccgtct gcgtggtgcc atgcgcgacc acaacgacat
gtccgacatc 480tccgtcaccg aggctgaggt tgagctgttc tcgctagcgc
agtcgttcga tgttgacctc 540gaggagggca ccattgctcg caagcactcc
gaggctcgcc tcgaccttgg tggtgtcaac 600aagggctaca ctgttgacta
cgtggtggac cacctccgcg ctgctggcat gcccaacgtc 660ctgttcgaat
ggggtggtga catccgtgcc tccggccgca acatcaaggg caacctctgg
720gctgttgcca tcaagcgccc tccctccgtt gaggaggtca tccgccgtgc
caagggcaag 780atgctcaaga tgggtgaaga agaacaggag gagaaggatg
atgactctcc cagccttctg 840cacgttgttg agctcgatga tgaggccctc
tgcacctccg gtgactacga gaacgtcctc 900taccacccca agcacggtgt
tgctggcagc atcttcgact ggcagcgccg tggtctgctg 960tctcctgagg
agggtgctct tgctcaggtt tccgtcaagt gctactctgc catgtacgcc
1020gatgcccttg ccaccgtctg cctggtcaag cgtgatgccg tccgtatccg
ctacctcctg 1080gaaggctggc gctacgtgcg ctctcgtgtc accaactact
tcgcctacac ccgccagggt 1140gagcgtcttg ctcacatgca cgaaattgcc
caggagactc gtgagctccg tgagatccgc 1200attgctggct ccctcccctc
ccgtatcgtc atcgtcggtg gtggtctggc cggtctgtct 1260gctgccattg
aggctgcctc ctgcggtgct caggtcatcc tgatggagaa ggagggtcgt
1320attggtggca actctgccaa ggccacctcc ggtatcaacg gctggggtac
tcgcactcag 1380gccaagtccg acatcctgga tggcggcaag tacttcgagc
gtgacacctt cctgagcggt 1440gttggtggta ccactgaccc tgctctggtc
aaggtcctct ccgtcaagtc cggtgatgcc 1500attggctggt tgaccagcct
tggtgttcct ctttctgttc tctcccagct gggtggtcac 1560tctttcaagc
gtacccaccg tgctcctgac aagactgatg gcactcctct ccccatcggt
1620cacaccatca tgcgcaccct cgaggaccac atccgcaaca acctgagcga
acgtgtcacc 1680atcatgaccc acgtttccgt cactgagctc ctccacgaga
ctgacaccac tcccgatggt 1740gcctccgagg tccgtgtcac cggtgtccgc
taccgtgacc tctccgatgt tgacggccag 1800cccagcaagc tccttgccga
tgccgttgtc cttgccactg gtggtttctc caacgaccgc 1860gaggagaaca
gcttgctttg caagtacgcc ccccacctgg cctccttccc caccaccaac
1920ggcccttggg ccactggtga tggtgtcaag ctggccacct ccgtcggtgc
caagctcgtc 1980gacatggaca aggtccagct gcaccccact ggcttgattg
accccaagga ccccgccaac 2040accaccaaga tcctgggccc cgaggctctc
cgtggcagcg gtggtatcct gctcaacaag 2100cagggcaagc gcttcgtcaa
cgagcttgac ctccgcagcg ttgtctccaa ggccatcaac 2160actcagggca
acgaataccc cggcagcggt ggctgctact tcgcctactg cgtgttgaac
2220gaagatgcca ccaacctgtt ctgcggtggt gctcttggat tctacggcaa
gaagcttggt 2280ctgttccagc gtgctgagac tgttgaggag cttgccaagt
tgattggctg cgatgagggc 2340gagctccgtg acaccctcga gaagtacgag
acttgctcga aggccaaggt tgcctgcccc 2400gtgaccggca aggtcgtgtt
cccctgcgtt gttggtaccc gtggtcccta caacgtcgct 2460ttcgtcaccc
cctccatcca ctacaccatg ggtggctgct tgatttctcc tgctgctgag
2520gtcctccagg aatacaaggg tctgaacatc ctggagaacc accgtcccat
tcgctgcttg 2580ttcggtgctg gtgaagtcac cggtggtgtc cacggtggca
accgcctggg tggcaactcc 2640ctcctcgagt gcgttgtgtt cggcaagatc
gctggtgacc gtgctgccac cattctccag 2700aagcgcgaaa ttgccctctc
caagaccagc tggacctccg tcgtcgtccg cgagtcccgc 2760tctggcgagc
agttcggtac cggctctcgt gtcctccgct tcaacctgcc cggtgctctc
2820cagcgcactg gtctgaacct gggtgagttc gtcgccatcc gtggtgaatg
ggatggccag 2880cagctggtcg gctacttctc ccccatcacc ctccccgaag
atcttggtac catctccctc 2940ctggtccgtg ccgacaaggg caccctcaag
gaatggatat gtgccctccg ccccggtgac 3000agcgttgaga tcaaggcctg
cggtggtctg cgtatcgacc aggaccctgt caagaagtgc 3060ttgctattcc
gcaaccgccc catcacccgc ttcgctcttg ttgctgctgg tactggtgtt
3120gctcccatgc tccaggtcat ccgtgctgct ctcaagaagc cctacgtgga
tacattggag 3180tccatccgtc tgatctacgc tgctgaagaa tacgacaccc
tgacctaccg ctccatcctc 3240cagcgcttcg ctgaggagtt ccccgacaag
ttcgtctgca acttcgtcct caacaaccct 3300cctgaaggct ggactggtgg
tgttggtttc gtcaacaaga agtccctcca gaaggtcctc 3360cagcctccta
gctctgagcc tctgattgtc gtctgcggtc ctcctgtcat gcagcgtgat
3420gtcaagaacg agctcctcag catgggctac gacaaggagc ttgtccacac
cgttgacggc 3480gagtctggca ccctataa 349883420DNAArtificial
sequenceFRDg gene optimised for A. niger 8atggtcgatg gccgctcctc
cgcctccatt gttgctgttg accccgagcg tgctgctcgt 60gagcgtgatg ctgctgctcg
tgccctcctc caggactctc ccttgcacac caccatgcag 120tacgccacct
ccggtctgga attgactgtt ccctacgccc tcaaggttgt tgcctccgcc
180gacaccttcg accgtgccaa ggaggttgcc gatgaggtcc tccgctgcgc
ctggcagctg 240gccgacaccg tcctcaactc tttcaacccc aacagcgaag
tctctctggt cggccgcctc 300cccgtcggtc agaagcacca gatgagcgct
cctctcaagc gtgtcatggc ctgctgccag 360cgtgtctaca acagctctgc
tggctgcttc gaccccagca ctgctcctgt tgccaaggcc 420ctccgtgaga
tcgctcttgg caaggagcgc aacaacgcct gcttggaggc tcttactcag
480gcctgcaccc tccccaactc gttcgtcatt gacttcgagg ctggcaccat
ctcccgcaag 540cacgaacacg cctccctcga tcttggtggt gtcagcaagg
gctacatcgt cgactacgtc 600attgacaaca tcaacgctgc tggtttccag
aacgttttct tcgactgggg tggtgactgc 660cgtgcctccg gcatgaacgc
ccgcaacacc ccctgggttg ttggtatcac ccgccccccg 720tcattggaca
tgcttcccaa ccctcccaag gaggccagct acatctccgt catctccctc
780gacaacgagg ctcttgccac cagcggtgac tacgagaacc tgatctacac
tgccgatgac 840aagcctctga cctgcaccta cgactggaag ggcaaggagc
tcatgaagcc cagccagtcc 900aacattgccc aggtcagcgt caagtgctac
tctgccatgt acgccgatgc ccttgccact 960gcttgcttca tcaagcgtga
ccccgccaag gtccgccagc tgttggatgg ctggcgctac 1020gtgcgcgaca
ccgtccgtga ctaccgtgtc tacgtgcgcg agaacgagcg tgttgccaag
1080atgttcgaaa ttgccactga ggatgccgag atgcgcaagc gccgtatctc
caacaccctc 1140cctgctcgtg tcattgttgt tggtggtggt ctggctggtc
tttctgctgc cattgaggct 1200gctggctgcg gtgctcaggt tgtcctgatg
gagaaggagg ccaagctcgg tggcaactcc 1260gccaaggcca cctccggtat
caacggctgg ggtactcgtg ctcaggccaa ggcctccatc 1320gtcgatggcg
gcaagtactt cgagcgtgac acctacaagt ccggtatcgg tggcaacacc
1380gaccctgctc tggtcaagac cctgagcatg aagtccgccg atgccattgg
ctggttgacc 1440agccttggtg ttcctcttac tgtcctttct cagctgggtg
gccactctcg caagcgcacc 1500caccgtgctc ctgacaagaa ggacggcacc
cccctcccca tcggtttcac catcatgaaa 1560actctcgagg accacgtccg
tggcaacctg tctggccgta tcaccatcat ggagaactgc 1620tcggtgacct
cgctactctc cgagactaag gagcgccccg atggcaccaa gcagatccgt
1680gtcaccggtg ttgagttcac ccaggctggc tctggcaaga ccaccatcct
ggccgatgcc 1740gtcatcctgg ccactggtgg tttctccaac gacaagactg
ccgactcgct actccgcgaa 1800cacgctcccc acctggtcaa cttccccacc
accaacggcc cctgggcgac tggtgatggt 1860gtcaagctgg cccagcgtct
gggtgctcag ctcgtcgaca tggacaaggt ccagctccac 1920cccactggtc
tgatcaaccc caaggaccct gccaacccca ccaagttcct tggacctgag
1980gctctccgtg gctccggtgg tgtccttctg aacaagcagg gcaagcgctt
cgtcaacgag 2040ctcgatctcc gcagcgttgt ctccaaggcc atcatggagc
agggtgctga ataccccggc 2100agcggtggca gcatgttcgc ctactgcgtt
ctcaacgctg ctgctcagaa gctgttcggt 2160gtctcctccc acgaattcta
ctggaagaag atgggtctgt tcgtcaaggc cgacaccatg 2220cgtgatcttg
ctgctctgat cggttgcccc gttgagagcg tgcagcagac cctggaagaa
2280tacgagcgcc tctccatctc ccagcgctct tgccccatca cccgcaagtc
ggtgtaccct 2340tgcgtgcttg gcaccaaggg tccctactac gtggctttcg
tcaccccctc catccactac 2400accatgggtg gctgcttgat ctctccttct
gctgagatcc agatgaagaa cacctcctcc 2460cgtgctcctc tctcccactc
caaccccatc ctcggtctgt tcggtgctgg tgaagtcact 2520ggtggtgtcc
acggtggcaa ccgtcttggt ggcaactccc tcctcgagtg cgttgtgttc
2580ggccgtatcg ctggtgaccg tgccagcacc atcctccagc gcaagagctc
tgctctctcc 2640ttcaaggtct ggaccactgt tgtcctccgc gaagtccgcg
agggtggtgt ctacggtgct 2700ggctctcgtg tcctccgctt caacctcccc
ggtgctctcc agcgctccgg tctgtctctt 2760ggccagttca ttgccatccg
tggtgactgg gatggccagc agctcattgg ctactactct 2820cccatcaccc
tccccgatga tcttggaatg atcgacatcc tggctcgctc cgacaagggt
2880accctccgcg aatggatctc cgctctggag cccggtgatg ccgttgagat
gaaggcctgc 2940ggtggtctgg tcattgagcg tcgtctgtcc gacaagcact
tcgtgttcat gggtcacatc 3000atcaacaagc tctgcttgat tgccggtggt
actggtgttg ctcccatgct tcagatcatc 3060aaggctgctt tcatgaagcc
cttcattgac accctcgagt ccgtccacct gatctacgct 3120gctgaggatg
tcactgagct gacctaccgt gaggtccttg aggagcgccg ccgcgagtcc
3180cgtggcaagt tcaagaaaac cttcgtcctg aaccgccctc ctcctctctg
gactgatggt 3240gttggtttca ttgaccgtgg tatcctgacc aaccacgtcc
agcctccctc cgacaaccta 3300ttagtggcca tctgcggtcc tcctgtcatg
cagcgcattg tcaaggccac tctcaagacc 3360ctaggataca acatgaacct
ggtccgcact gttgatgaga ctgagccctc cggatcataa 342093498DNAArtificial
sequenceFRDm1 gene optimsied for S. cerevisiae 9atgggtgctg
atggtatttc ttctgcttcc attgttgtta ctgacccaga agctgctgcc 60aagaagcgtg
acagaatggc cagagaattg ttgtcctcca actctggtct atgtcaagaa
120gatgaaccaa ccatcatcaa cttaaagggt ttggaacaca ccattccata
cagattggcc 180gttgttttgt gtaactccag atccactggt gaattcgaag
ccaaggctgc tgaaatcttg 240agaaaggctt tccacatggt tgactactct
ttgaattgtt tcaacccaga atctgaattg 300tcccgtgtca actctttacc
agtcggtgaa aagcaccaaa tgtccgaaga tctaagacat 360gtcatggaat
gtaccatttc tgtccaccac tcctctggta tgggtttcga cccagctgct
420ggtccaatca tctccagatt gagaggtgcc atgagagatc acaacgacat
gtccgatatc 480tccgtcactg aagctgaagt tgaattattc tctttggctc
aatctttcga tgtcgacttg 540gaagaaggta ctattgccag aaagcactct
gaagccagat tggatttggg tggtgtcaac 600aagggttaca ctgttgacta
cgttgttgac catttgagag ctgctggtat gccaaacgtc 660ttgttcgaat
ggggtggtga tatcagagct tctggtagaa acatcaaggg taacttgtgg
720gctgttgcca tcaagcgtcc accatctgtt gaagaagtta tccgtcgtgc
caagggtaag 780atgttaaaga tgggtgaaga agaacaagaa gaaaaggacg
atgactctcc atctttgttg 840cacgttgttg aattggatga cgaagctttg
tgtacctctg gtgactacga aaacgtctta 900taccatccaa agcacggtgt
tgctggttcc attttcgact ggcaacgtcg tggtttattg 960tctccagaag
aaggtgcttt agctcaagtt tccgtcaaat gttactctgc catgtacgct
1020gatgctttgg ccactgtttg tttggtcaag agagatgctg tcagaatcag
atacttgttg 1080gaaggttgga gatacgtcag atctcgtgtc accaactact
tcgcttacac cagacaaggt 1140gaaagattgg ctcacatgca cgaaattgct
caagaaacca gagaattaag agaaatcaga 1200attgctggtt ctttgccatc
cagaattgtt atcgtcggtg gtggtttggc tggtctatcc 1260gctgccattg
aagctgcttc ttgtggtgct caagtcattt tgatggaaaa ggaaggtaga
1320attggtggta actctgccaa ggctacctct ggtatcaacg gttggggtac
cagaacccaa 1380gccaagtctg atatcttgga tggtggtaag tactttgaaa
gagacacttt cttgtccggt 1440gtcggtggta ccactgaccc agctttggtc
aaggtcttgt ccgtcaaatc tggtgacgct 1500atcggttggt taacttcttt
gggtgtccca ttgtccgttt tgtctcaatt gggtggtcac 1560tctttcaaga
gaactcacag agctccagac aagactgatg gtactccatt accaattggt
1620cacaccatca tgagaacttt ggaagatcat atcagaaaca acttgtctga
aagagttacc 1680atcatgaccc acgtttctgt tactgaattg ttgcacgaaa
ctgacaccac tccagatggt 1740gcttctgaag ttcgtgtcac cggtgtccgt
tacagagact tgtctgatgt cgatggtcaa 1800ccttccaaac tattggctga
cgctgttgtt ttggccactg gtggtttctc caacgacaga 1860gaagaaaact
ctttgttgtg taaatacgct cctcatttgg cttctttccc aactaccaac
1920ggtccatggg ctactggtga cggtgtcaaa ttggccacct ccgttggtgc
caagttggtt 1980gacatggaca aggttcaatt gcacccaact ggtttgattg
acccaaagga cccagctaac 2040accactaaga tcttgggtcc agaagctttg
agaggttctg gtggtatttt gttgaacaag
2100caaggtaaga gattcgtcaa cgaattggac ttgagatccg ttgtttccaa
ggccattaac 2160actcaaggta acgaataccc aggttctggt ggttgttact
ttgcttactg tgtcttaaac 2220gaagatgcta ccaacttatt ctgtggtggt
gctttgggtt tctacggtaa gaaattaggt 2280ttgttccaaa gagctgaaac
tgttgaagaa ttggccaaat tgattggttg tgacgaaggt 2340gaattgagag
acactttgga aaaatacgaa acctgttcca aggccaaggt tgcttgtcca
2400gtcactggta aggttgtttt cccatgtgtt gtcggtacca gaggtccata
caatgttgct 2460ttcgtcactc catccatcca ctacaccatg ggtggttgtt
tgatctctcc agctgctgaa 2520gtcttgcaag aatacaaggg tttgaatatc
ttggaaaacc acagaccaat cagatgtttg 2580ttcggtgctg gtgaagtcac
tggtggtgtc cacggtggta acagattagg tggtaactct 2640ctattggaat
gtgttgtctt tggtaagatt gctggtgaca gagctgccac tatcttgcaa
2700aagagagaaa ttgctttgtc caagacctcc tggacctctg ttgttgtcag
agaatccaga 2760tctggtgaac aattcggtac cggttccaga gttttgagat
tcaacttgcc aggtgcttta 2820caaagaaccg gtttgaactt gggtgaattc
gttgccatca gaggtgaatg ggatggtcaa 2880caattagtcg gttacttctc
tccaatcact ttgccagaag atttgggtac catctctttg 2940ttggtcagag
ctgacaaggg tactttgaag gaatggatct gtgctttgcg tccaggtgac
3000tccgttgaaa tcaaggcttg tggtggtcta agaattgacc aagatccagt
caagaaatgt 3060ttgttgttca gaaacagacc aattaccaga tttgctttgg
ttgctgctgg taccggtgtt 3120gctccaatgt tgcaagttat cagagctgct
ttgaagaagc catacgtcga cactttggaa 3180tccatcagat tgatctacgc
tgctgaagaa tatgacactt taacctacag atctatcttg 3240caaagatttg
ctgaagaatt cccagacaaa ttcgtttgta acttcgtctt aaacaaccct
3300ccagaaggtt ggaccggtgg tgttggtttc gtcaacaaga aatctttgca
aaaggttttg 3360caaccacctt cttctgaacc attgattgtt gtttgtggtc
cacctgttat gcaaagagat 3420gtcaaaaatg aattgttgtc catgggttac
gacaaggaat tggttcacac tgtcgatggt 3480gaatctggta ccttgtaa
3498103420DNAArtificial sequenceFRDg gene optimised for S.
cerevisiae 10atggttgatg gtagatcttc tgcttccatt gttgccgttg acccagaaag
agctgccaga 60gaaagagatg ctgctgccag agctttgttg caagactctc cattgcacac
caccatgcaa 120tacgctacct ctggtttgga attgactgtt ccatacgctt
tgaaggttgt tgcttctgct 180gacactttcg acagagccaa ggaagttgct
gatgaagtct tgagatgtgc ctggcaattg 240gctgacaccg ttttgaactc
tttcaaccca aactctgaag tctctttagt cggtagatta 300ccagtcggtc
aaaagcatca aatgtctgct ccattgaaac gtgtcatggc ttgttgtcaa
360agagtctaca actcctctgc tggttgtttc gacccatcca ctgctccagt
tgccaaggct 420ttgagagaaa ttgctttggg taaggaaaga aacaatgctt
gtttggaagc tttgactcaa 480gcttgtacct tgccaaactc tttcgtcatt
gatttcgaag ctggtactat ctccagaaag 540cacgaacacg cttctttgga
tttgggtggt gtttccaagg gttacatcgt cgattacgtc 600attgacaaca
tcaatgctgc tggtttccaa aacgttttct ttgactgggg tggtgactgt
660cgtgcctccg gtatgaacgc cagaaacact ccatgggttg tcggtatcac
tagacctcct 720tccttggaca tgttgccaaa ccctccaaag gaagcttctt
acatctccgt catctctttg 780gacaatgaag ctttggctac ctctggtgat
tacgaaaact tgatctacac tgctgacgat 840aaaccattga cctgtaccta
cgattggaaa ggtaaggaat tgatgaagcc atctcaatcc 900aatatcgctc
aagtttccgt caagtgttac tctgccatgt acgctgacgc tttggctacc
960gcttgtttca tcaagcgtga cccagccaag gtcagacaat tgttggatgg
ttggagatac 1020gttagagaca ccgtcagaga ttaccgtgtc tacgtcagag
aaaacgaaag agttgccaag 1080atgttcgaaa ttgccactga agatgctgaa
atgagaaaga gaagaatttc caacacttta 1140ccagctcgtg tcattgttgt
tggtggtggt ttggctggtt tgtccgctgc cattgaagct 1200gctggttgtg
gtgctcaagt tgttttgatg gaaaaggaag ccaagttggg tggtaactct
1260gccaaggcta cctctggtat caacggttgg ggtactagag ctcaagctaa
ggcttccatt 1320gtcgatggtg gtaagtactt cgaaagagat acctacaagt
ctggtatcgg tggtaacacc 1380gatccagctt tggttaagac tttgtccatg
aaatctgctg acgctatcgg ttggttgact 1440tctctaggtg ttccattgac
tgttttgtcc caattaggtg gtcactccag aaagagaact 1500cacagagctc
cagacaagaa ggatggtact ccattgccaa ttggtttcac catcatgaaa
1560actttagaag atcatgttag aggtaacttg tccggtagaa tcaccatcat
ggaaaactgt 1620tccgttacct ctttgttgtc tgaaaccaag gaaagaccag
acggtaccaa gcaaatcaga 1680gttaccggtg tcgaattcac tcaagctggt
tctggtaaga ccaccatttt ggctgatgct 1740gttatcttgg ccaccggtgg
tttctccaac gacaagactg ctgattcttt gttgagagaa 1800catgccccac
acttggttaa cttcccaacc accaacggtc catgggctac tggtgatggt
1860gtcaagttgg ctcaaagatt aggtgctcaa ttggtcgata tggacaaggt
tcaattgcac 1920ccaactggtt tgatcaaccc aaaggaccca gccaacccaa
ccaaattctt gggtccagaa 1980gctctaagag gttctggtgg tgttttgttg
aacaaacaag gtaagagatt tgtcaacgaa 2040ttggatttga gatctgttgt
ttccaaggcc atcatggaac aaggtgctga atacccaggt 2100tctggtggtt
ccatgtttgc ttactgtgtc ttgaacgctg ctgctcaaaa attgtttggt
2160gtttcctctc acgaattcta ctggaagaag atgggtttgt tcgtcaaggc
tgacaccatg 2220agagacttgg ctgctttgat tggttgtcca gttgaatccg
ttcaacaaac tttagaagaa 2280tacgaaagat tatccatctc tcaaagatct
tgtccaatta ccagaaaatc tgtttaccca 2340tgtgttttgg gtaccaaagg
tccatactat gtcgcctttg tcactccatc tatccactac 2400accatgggtg
gttgtttgat ttctccatct gctgaaatcc aaatgaagaa cacttcttcc
2460agagctccat tgtcccactc caacccaatc ttgggtttat tcggtgctgg
tgaagtcacc 2520ggtggtgtcc acggtggtaa cagattaggt ggtaactctt
tgttggaatg tgttgttttc 2580ggtagaattg ccggtgacag agcttctacc
attttgcaaa gaaagtcctc tgctttgtct 2640ttcaaggtct ggaccactgt
tgttttgaga gaagtcagag aaggtggtgt ctacggtgct 2700ggttcccgtg
tcttgagatt caacttacca ggtgctctac aaagatctgg tctatccttg
2760ggtcaattca ttgccatcag aggtgactgg gacggtcaac aattgattgg
ttactactct 2820ccaatcactt tgccagacga tttgggtatg attgacattt
tggccagatc tgacaagggt 2880actttacgtg aatggatctc tgctttggaa
ccaggtgacg ctgtcgaaat gaaggcttgt 2940ggtggtttgg tcatcgaaag
aagattatct gacaagcact tcgttttcat gggtcacatt 3000atcaacaagc
tatgtttgat tgctggtggt accggtgttg ctccaatgtt gcaaatcatc
3060aaggccgctt tcatgaagcc attcatcgac actttggaat ccgtccactt
gatctacgct 3120gctgaagatg tcactgaatt gacttacaga gaagttttgg
aagaacgtcg tcgtgaatcc 3180agaggtaaat tcaagaaaac tttcgttttg
aacagacctc ctccattatg gactgacggt 3240gtcggtttca tcgaccgtgg
tatcttgacc aaccacgttc aaccaccatc tgacaactta 3300ttggttgcca
tctgtggtcc accagttatg caaagaattg tcaaggccac tttaaagact
3360ttaggttaca acatgaactt ggtcagaacc gttgacgaaa ctgaaccatc
tggaagttaa 342011898DNAArtificial sequenceGPDA promotor
11tcagcgtcca attcgagctc tgtacagtga ccggtgactc tttctggcat gcggagacac
60ggacggtcgc agagaggagg gctgagtaat aagcgcactc atgtcagctc tggcgctctg
120aggtgcagtg gatgattatt aatccgggac cggccgcccc tccgccccga
agtggaaagg 180ctggtgtgcc cctcgttgac caagaatcta ttgcatcatc
ggagaatatg gagcttcatc 240gaatcaccgg cagtaagcga aggagaatgt
gaagccaggg gtgtatagcc gtcggcgaaa 300tagcatgcca ttaacctagg
tacagaagtc caattgcttc cgatctggta aaagattcac 360gagatagtac
cttctccgaa gtaggtagag cgagtacccg gcgcgtaagc tccctaattg
420gcccatccgg catctgtagg gcgtccaaat atcgtgcctc tcctgctttg
cccggtgtat 480gaaaccggaa aggccgctca ggagctggcc agcggcgcag
accgggaaca caagctggca 540gtcgacccat ccggtgctct gcactcgacc
tgctgaggtc cctcagtccc tggtaggcag 600ctttgccccg tctgtccgcc
cggtgtgtcg gcggggttga caaggtcgtt gcgtcagtcc 660aacatttgtt
gccatatttt cctgctctcc ccaccagctg ctcttttctt ttctctttct
720tttcccatct tcagtatatt catcttccca tccaagaacc tttatttccc
ctaagtaagt 780actttgctac atccatactc catccttccc atcccttatt
cctttgaacc tttcagttcg 840agctttccca cttcatcgca gcttgactaa
cagctacccc gcttgagcca ccgtcaaa 898121000DNAArtificial sequenceTDH3
promotor 12ctattttcga ggaccttgtc accttgagcc caagagagcc aagatttaaa
ttttcctatg 60acttgatgca aattcccaaa gctaataaca tgcaagacac gtacggtcaa
gaagacatat 120ttgacctctt aacaggttca gacgcgactg cctcatcagt
aagacccgtt gaaaagaact 180tacctgaaaa aaacgaatat atactagcgt
tgaatgttag cgtcaacaac aagaagttta 240atgacgcgga ggccaaggca
aaaagattcc ttgattacgt aagggagtta gaatcatttt 300gaataaaaaa
cacgcttttt cagttcgagt ttatcattat caatactgcc atttcaaaga
360atacgtaaat aattaatagt agtgattttc ctaactttat ttagtcaaaa
aattagcctt 420ttaattctgc tgtaacccgt acatgcccaa aatagggggc
gggttacaca gaatatataa 480catcgtaggt gtctgggtga acagtttatt
cctggcatcc actaaatata atggagcccg 540ctttttaagc tggcatccag
aaaaaaaaag aatcccagca ccaaaatatt gttttcttca 600ccaaccatca
gttcataggt ccattctctt agcgcaacta cagagaacag gggcacaaac
660aggcaaaaaa cgggcacaac ctcaatggag tgatgcaacc tgcctggagt
aaatgatgac 720acaaggcaat tgacccacgc atgtatctat ctcattttct
tacaccttct attaccttct 780gctctctctg atttggaaaa agctgaaaaa
aaaggttgaa accagttccc tgaaattatt 840cccctacttg actaataagt
atataaagac ggtaggtatt gattgtaatt ctgtaaatct 900atttcttaaa
cttcttaaat tctactttta tagttagtct tttttttagt tttaaaacac
960caagaactta gtttcgaata aacacacata aacaaacaaa
100013500DNAArtificial sequenceTDH3 terminator 13gtgaatttac
tttaaatctt gcatttaaat aaattttctt tttatagctt tatgacttag 60tttcaattta
tatactattt taatgacatt ttcgattcat tgattgaaag ctttgtgttt
120tttcttgatg cgctattgca ttgttcttgt ctttttcgcc acatgtaata
tctgtagtag 180atacctgata cattgtggat gctgagtgaa attttagtta
ataatggagg cgctcttaat 240aattttgggg atattggctt ttttttttaa
agtttacaaa tgaatttttt ccgccaggat 300aacgattctg aagttactct
tagcgttcct atcggtacag ccatcaaatc atgcctataa 360atcatgccta
tatttgcgtg cagtcagtat catctacatg aaaaaaactc ccgcaatttc
420ttatagaata cgttgaaaat taaatgtacg cgccaagata agataacata
tatctagatg 480cagtaatata cacagattcc 50014538PRTArtificial
sequenceA. succinogenes PEP carboxykinase wherein EGY at position
120-122 is replaced by DAF 14Met Thr Asp Leu Asn Lys Leu Val Lys
Glu Leu Asn Asp Leu Gly Leu1 5 10 15Thr Asp Val Lys Glu Ile Val Tyr
Asn Pro Ser Tyr Glu Gln Leu Phe 20 25 30Glu Glu Glu Thr Lys Pro Gly
Leu Glu Gly Phe Asp Lys Gly Thr Leu 35 40 45Thr Thr Leu Gly Ala Val
Ala Val Asp Thr Gly Ile Phe Thr Gly Arg 50 55 60Ser Pro Lys Asp Lys
Tyr Ile Val Cys Asp Glu Thr Thr Lys Asp Thr65 70 75 80Val Trp Trp
Asn Ser Glu Ala Ala Lys Asn Asp Asn Lys Pro Met Thr 85 90 95Gln Glu
Thr Trp Lys Ser Leu Arg Glu Leu Val Ala Lys Gln Leu Ser 100 105
110Gly Lys Arg Leu Phe Val Val Asp Ala Phe Cys Gly Ala Ser Glu Lys
115 120 125His Arg Ile Gly Val Arg Met Val Thr Glu Val Ala Trp Gln
Ala His 130 135 140Phe Val Lys Asn Met Phe Ile Arg Pro Thr Asp Glu
Glu Leu Lys Asn145 150 155 160Phe Lys Ala Asp Phe Thr Val Leu Asn
Gly Ala Lys Cys Thr Asn Pro 165 170 175Asn Trp Lys Glu Gln Gly Leu
Asn Ser Glu Asn Phe Val Ala Phe Asn 180 185 190Ile Thr Glu Gly Ile
Gln Leu Ile Gly Gly Thr Trp Tyr Gly Gly Glu 195 200 205Met Lys Lys
Gly Met Phe Ser Met Met Asn Tyr Phe Leu Pro Leu Lys 210 215 220Gly
Val Ala Ser Met His Cys Ser Ala Asn Val Gly Lys Asp Gly Asp225 230
235 240Val Ala Ile Phe Phe Gly Leu Ser Gly Thr Gly Lys Thr Thr Leu
Ser 245 250 255Thr Asp Pro Lys Arg Gln Leu Ile Gly Asp Asp Glu His
Gly Trp Asp 260 265 270Glu Ser Gly Val Phe Asn Phe Glu Gly Gly Cys
Tyr Ala Lys Thr Ile 275 280 285Asn Leu Ser Gln Glu Asn Glu Pro Asp
Ile Tyr Gly Ala Ile Arg Arg 290 295 300Asp Ala Leu Leu Glu Asn Val
Val Val Arg Ala Asp Gly Ser Val Asp305 310 315 320Phe Asp Asp Gly
Ser Lys Thr Glu Asn Thr Arg Val Ser Tyr Pro Ile 325 330 335Tyr His
Ile Asp Asn Ile Val Arg Pro Val Ser Lys Ala Gly His Ala 340 345
350Thr Lys Val Ile Phe Leu Thr Ala Asp Ala Phe Gly Val Leu Pro Pro
355 360 365Val Ser Lys Leu Thr Pro Glu Gln Thr Glu Tyr Tyr Phe Leu
Ser Gly 370 375 380Phe Thr Ala Lys Leu Ala Gly Thr Glu Arg Gly Val
Thr Glu Pro Thr385 390 395 400Pro Thr Phe Ser Ala Cys Phe Gly Ala
Ala Phe Leu Ser Leu His Pro 405 410 415Ile Gln Tyr Ala Asp Val Leu
Val Glu Arg Met Lys Ala Ser Gly Ala 420 425 430Glu Ala Tyr Leu Val
Asn Thr Gly Trp Asn Gly Thr Gly Lys Arg Ile 435 440 445Ser Ile Lys
Asp Thr Arg Gly Ile Ile Asp Ala Ile Leu Asp Gly Ser 450 455 460Ile
Glu Lys Ala Glu Met Gly Glu Leu Pro Ile Phe Asn Leu Ala Ile465 470
475 480Pro Lys Ala Leu Pro Gly Val Asp Pro Ala Ile Leu Asp Pro Arg
Asp 485 490 495Thr Tyr Ala Asp Lys Ala Gln Trp Gln Val Lys Ala Glu
Asp Leu Ala 500 505 510Asn Arg Phe Val Lys Asn Phe Val Lys Tyr Thr
Ala Asn Pro Glu Ala 515 520 525Ala Lys Leu Val Gly Ala Gly Pro Lys
Ala 530 535151617DNAArtificial sequencent. A. succinogenes PEP
carboxykinase encoding DAF instead of EGY 15atgactgact taaacaaact
cgttaaagaa cttaatgact tagggcttac cgatgttaag 60gaaattgtgt ataacccgag
ttatgaacaa cttttcgagg aagaaaccaa accgggtttg 120gagggtttcg
ataaagggac gttaaccacg cttggcgcgg ttgccgtcga tacggggatt
180tttaccggtc gttcaccgaa agataaatat atcgtttgcg atgaaactac
gaaagacacc 240gtttggtgga acagcgaagc ggcgaaaaac gataacaaac
cgatgacgca agaaacttgg 300aaaagtttga gagaattagt ggcgaaacaa
ctttccggta aacgtttatt cgtggtagac 360gcattctgcg gcgccagtga
aaaacaccgt atcggtgtgc gtatggttac tgaagtggca 420tggcaggcgc
attttgtgaa aaacatgttt atccgaccga ccgatgaaga gttgaaaaat
480ttcaaagcgg attttaccgt gttaaacggt gctaaatgta ctaatccgaa
ctggaaagaa 540caaggtttga acagtgaaaa ctttgtcgct ttcaatatta
ccgaaggtat tcagcttatc 600ggcggtactt ggtacggcgg tgaaatgaaa
aaaggtatgt tctcaatgat gaactacttc 660ctgccgttaa aaggtgtggc
ttccatgcac tgttccgcca acgtaggtaa agacggtgac 720gtggctattt
tcttcggttt atccggtacg ggtaaaacaa cgctttcgac cgatcctaaa
780cgccaattaa tcggtgatga cgaacacggt tgggatgaat ccggcgtatt
taactttgaa 840ggcggttgtt acgcgaaaac cattaactta tctcaagaaa
acgaaccgga tatttacggc 900gcaatccgtc gtgacgcatt attagaaaac
gtcgtggttc gtgcagacgg ttccgttgac 960tttgacgacg gttcaaaaac
agaaaatacc cgtgtttcat atccgattta ccacatcgac 1020aacatcgttc
gtccggtatc gaaagccggt catgcaacca aagtgatttt cttaaccgcg
1080gacgcattcg gcgtattgcc gccggtttca aaactgactc cggaacaaac
cgaatactac 1140ttcttatccg gctttactgc aaaattagcg ggtacggaac
gcggcgtaac cgaaccgact 1200ccgacattct cggcctgttt cggtgcggca
ttcttaagcc tgcatccgat tcaatatgcg 1260gacgtgttgg tcgaacgcat
gaaagcctcc ggtgcggaag cttatttggt gaacaccggt 1320tggaacggca
cgggtaaacg tatttcaatc aaagataccc gcggtattat cgatgcgatt
1380ttggacggtt caatcgaaaa agcggaaatg ggcgaattgc caatctttaa
tttagcgatt 1440cctaaagcat taccgggtgt tgatcctgct attttggatc
cgcgcgatac ttacgcagac 1500aaagcgcaat ggcaagttaa agcggaagat
ttggcaaacc gtttcgtgaa aaactttgtg 1560aaatatacgg cgaatccgga
agcggctaaa ttagttggcg ccggtccaaa agcataa 1617161617DNAArtificial
sequenceCodon pair optimised A. succinogenes PEPCK for S.
cerevisiae 16atgactgatt tgaacaaatt ggtcaaggaa ttgaatgatt tgggtttgac
tgacgtcaag 60gaaattgtct acaacccatc ttacgaacaa ttattcgaag aagaaaccaa
gccaggtttg 120gaaggtttcg acaagggtac tttgaccact ttaggtgctg
ttgctgttga caccggtatt 180ttcaccggtc gttctccaaa ggacaaatac
attgtttgtg atgaaaccac caaggacacc 240gtctggtgga actctgaagc
tgccaagaac gataacaagc caatgactca agaaacctgg 300aaatctttga
gagaattggt tgccaagcaa ttgtctggta agagattatt cgttgttgac
360gctttctgtg gtgcttctga aaagcacaga attggtgtca gaatggtcac
tgaagttgct 420tggcaagctc atttcgtcaa gaacatgttc atcagaccaa
ctgacgaaga attgaagaac 480ttcaaggctg acttcaccgt tttgaatggt
gccaagtgta ccaacccaaa ctggaaggaa 540caaggtttga actctgaaaa
ctttgttgct ttcaacatca ctgaaggtat ccaattgatt 600ggtggtacct
ggtacggtgg tgaaatgaag aagggtatgt tctccatgat gaactatttc
660ttgccattga aaggtgttgc ttccatgcac tgttctgcca atgtcggtaa
ggatggtgac 720gttgccatct tcttcggtct atccggtact ggtaagacca
ctctatccac tgacccaaag 780agacaattga ttggtgatga cgaacacggt
tgggacgaat ctggtgtctt taactttgaa 840ggtggttgtt acgccaagac
catcaactta tctcaagaaa acgaaccaga tatctacggt 900gccatccgtc
gtgatgcttt gttggaaaac gttgttgtca gagctgacgg ttctgttgac
960ttcgacgacg gttccaagac tgaaaacacc agagtttctt acccaatcta
ccacattgac 1020aacattgtca gacctgtttc caaggctggt cacgctacca
aggttatctt cttgactgct 1080gatgctttcg gtgtcttgcc acctgtttcc
aaattgactc cagaacaaac cgaatactac 1140ttcttgtccg gtttcactgc
caaattggct ggtactgaaa gaggtgtcac tgaaccaact 1200ccaactttct
ctgcttgttt cggtgctgct ttcttatctt tgcacccaat ccaatacgct
1260gatgtcttgg ttgaaagaat gaaggcttct ggtgctgaag cttacttggt
caacaccggt 1320tggaacggta ccggtaagag aatctccatc aaggatacca
gaggtatcat tgatgctatc 1380ttggacggtt ccattgaaaa ggctgaaatg
ggtgaattgc caatcttcaa cttggccatt 1440ccaaaggctt tgccaggtgt
tgacccagcc atcttagatc caagagacac ctacgctgac 1500aaggctcaat
ggcaagtcaa ggctgaagat ttggctaaca gattcgtcaa gaactttgtc
1560aaatacactg ctaacccaga agctgccaaa ttggttggtg ctggtccaaa ggcttaa
161717538PRTMannheimia succinicipoducens 17Met Thr Asp Leu Asn Gln
Leu Thr Gln Glu Leu Gly Ala Leu Gly Ile1 5 10 15His Asp Val Gln Glu
Val Val Tyr Asn Pro Ser Tyr Glu Leu Leu Phe 20 25 30Ala Glu Glu Thr
Lys Pro Gly Leu Glu Gly Tyr Glu Lys Gly Thr Val 35 40 45Thr Asn Gln
Gly Ala Val Ala Val Asn Thr Gly Ile Phe Thr Gly Arg 50 55 60Ser Pro
Lys Asp Lys Tyr Ile Val Leu Asp Asp Lys Thr Lys Asp Thr65 70 75
80Val Trp Trp Thr Ser Glu Lys Val Lys Asn Asp Asn Lys Pro Met Ser
85 90 95Gln Asp Thr Trp Asn Ser Leu Lys Gly Leu Val Ala Asp Gln Leu
Ser 100 105 110Gly Lys Arg Leu Phe Val Val Asp Ala Phe Cys Gly Ala
Asn Lys Asp 115
120 125Thr Arg Leu Ala Val Arg Val Val Thr Glu Val Ala Trp Gln Ala
His 130 135 140Phe Val Thr Asn Met Phe Ile Arg Pro Ser Ala Glu Glu
Leu Lys Gly145 150 155 160Phe Lys Pro Asp Phe Val Val Met Asn Gly
Ala Lys Cys Thr Asn Pro 165 170 175Asn Trp Lys Glu Gln Gly Leu Asn
Ser Glu Asn Phe Val Ala Phe Asn 180 185 190Ile Thr Glu Gly Val Gln
Leu Ile Gly Gly Thr Trp Tyr Gly Gly Glu 195 200 205Met Lys Lys Gly
Met Phe Ser Met Met Asn Tyr Phe Leu Pro Leu Arg 210 215 220Gly Ile
Ala Ser Met His Cys Ser Ala Asn Val Gly Lys Asp Gly Asp225 230 235
240Thr Ala Ile Phe Phe Gly Leu Ser Gly Thr Gly Lys Thr Thr Leu Ser
245 250 255Thr Asp Pro Lys Arg Gln Leu Ile Gly Asp Asp Glu His Gly
Trp Asp 260 265 270Asp Glu Gly Val Phe Asn Phe Glu Gly Gly Cys Tyr
Ala Lys Thr Ile 275 280 285Asn Leu Ser Ala Glu Asn Glu Pro Asp Ile
Tyr Gly Ala Ile Lys Arg 290 295 300Asp Ala Leu Leu Glu Asn Val Val
Val Leu Asp Asn Gly Asp Val Asp305 310 315 320Tyr Ala Asp Gly Ser
Lys Thr Glu Asn Thr Arg Val Ser Tyr Pro Ile 325 330 335Tyr His Ile
Gln Asn Ile Val Lys Pro Val Ser Lys Ala Gly Pro Ala 340 345 350Thr
Lys Val Ile Phe Leu Ser Ala Asp Ala Phe Gly Val Leu Pro Pro 355 360
365Val Ser Lys Leu Thr Pro Glu Gln Thr Lys Tyr Tyr Phe Leu Ser Gly
370 375 380Phe Thr Ala Lys Leu Ala Gly Thr Glu Arg Gly Ile Thr Glu
Pro Thr385 390 395 400Pro Thr Phe Ser Ala Cys Phe Gly Ala Ala Phe
Leu Ser Leu His Pro 405 410 415Thr Gln Tyr Ala Glu Val Leu Val Lys
Arg Met Gln Glu Ser Gly Ala 420 425 430Glu Ala Tyr Leu Val Asn Thr
Gly Trp Asn Gly Thr Gly Lys Arg Ile 435 440 445Ser Ile Lys Asp Thr
Arg Gly Ile Ile Asp Ala Ile Leu Asp Gly Ser 450 455 460Ile Asp Lys
Ala Glu Met Gly Ser Leu Pro Ile Phe Asp Phe Ser Ile465 470 475
480Pro Lys Ala Leu Pro Gly Val Asn Pro Ala Ile Leu Asp Pro Arg Asp
485 490 495Thr Tyr Ala Asp Lys Ala Gln Trp Glu Glu Lys Ala Gln Asp
Leu Ala 500 505 510Gly Arg Phe Val Lys Asn Phe Glu Lys Tyr Thr Gly
Thr Ala Glu Gly 515 520 525Gln Ala Leu Val Ala Ala Gly Pro Lys Ala
530 535181617DNAArtificial sequencePEPcarboxykinase M.
succiniciproducens cpo for S. cerevisiae 18atgaccgatt tgaaccaatt
gactcaagaa ttgggtgctt tgggtattca cgatgtccaa 60gaagttgtct acaacccatc
ttacgaattg ttgtttgctg aagaaaccaa gccaggtttg 120gaaggttacg
aaaagggtac tgttaccaac caaggtgctg ttgctgtcaa caccggtatc
180ttcaccggtc gttctccaaa ggacaaatac attgtcttgg atgacaagac
caaggacact 240gtctggtgga cttctgaaaa ggtcaagaac gacaacaaac
caatgtccca agacacttgg 300aactctttaa agggtttagt cgctgaccaa
ttgtctggta agagattatt cgttgtcgat 360gctttctgtg gtgccaacaa
ggacaccaga ttagctgtca gagttgtcac tgaagttgct 420tggcaagctc
acttcgttac caacatgttc atcagaccat ctgctgaaga attgaaaggt
480ttcaagccag atttcgttgt catgaacggt gccaaatgta ccaacccaaa
ctggaaggaa 540caaggtttga actctgaaaa ctttgttgct ttcaacatca
ctgaaggtgt tcaattgatt 600ggtggtacct ggtacggtgg tgaaatgaag
aagggtatgt tctccatgat gaactacttc 660ttgccattga gaggtattgc
ttccatgcac tgttctgcca atgtcggtaa ggacggtgac 720actgccatct
tcttcggtct atccggtacc ggtaagacca ctttgtccac tgacccaaag
780agacaattga ttggtgatga cgaacacggt tgggatgacg aaggtgtttt
caactttgaa 840ggtggttgtt acgccaagac catcaactta tctgctgaaa
atgaaccaga tatctacggt 900gccatcaagc gtgacgctct attggaaaac
gttgttgttt tggacaatgg tgacgtcgat 960tatgctgacg gttccaagac
tgaaaacacc agagtttctt acccaatcta ccatattcaa 1020aacattgtca
agccagtttc caaggctggt ccagctacca aagttatctt cttgtctgct
1080gatgctttcg gtgttttgcc tcctgtttcc aagttgactc cagaacaaac
caagtactac 1140ttcttgtctg gtttcaccgc caagttggct ggtactgaaa
gaggtatcac tgaaccaact 1200ccaactttct ctgcttgttt cggtgctgcc
tttttgtctt tgcacccaac tcaatacgct 1260gaagttttgg tcaagagaat
gcaagaatct ggtgctgaag cttacttggt caacactggt 1320tggaacggta
ccggtaagag aatctccatc aaagatacca gaggtatcat cgatgccatc
1380ttggatggtt ccattgacaa ggctgaaatg ggttctttgc caattttcga
tttctccatt 1440ccaaaggctt tgccaggtgt caacccagcc atcttagacc
caagagacac ctacgctgac 1500aaagctcaat gggaagaaaa ggctcaagac
ttggctggta gattcgtcaa gaacttcgaa 1560aaatacactg gtactgctga
aggtcaagct ttggttgctg ctggtccaaa ggcctaa 161719365PRTArtificial
sequenceMDH2 S. cerevisiae lacking first 12 a.a. 19Met Leu Lys Ile
Ala Ile Leu Gly Ala Ala Gly Gly Ile Gly Gln Ser1 5 10 15Leu Ser Leu
Leu Leu Lys Ala Gln Leu Gln Tyr Gln Leu Lys Glu Ser 20 25 30Asn Arg
Ser Val Thr His Ile His Leu Ala Leu Tyr Asp Val Asn Gln 35 40 45Glu
Ala Ile Asn Gly Val Thr Ala Asp Leu Ser His Ile Asp Thr Pro 50 55
60Ile Ser Val Ser Ser His Ser Pro Ala Gly Gly Ile Glu Asn Cys Leu65
70 75 80His Asn Ala Ser Ile Val Val Ile Pro Ala Gly Val Pro Arg Lys
Pro 85 90 95Gly Met Thr Arg Asp Asp Leu Phe Asn Val Asn Ala Gly Ile
Ile Ser 100 105 110Gln Leu Gly Asp Ser Ile Ala Glu Cys Cys Asp Leu
Ser Lys Val Phe 115 120 125Val Leu Val Ile Ser Asn Pro Val Asn Ser
Leu Val Pro Val Met Val 130 135 140Ser Asn Ile Leu Lys Asn His Pro
Gln Ser Arg Asn Ser Gly Ile Glu145 150 155 160Arg Arg Ile Met Gly
Val Thr Lys Leu Asp Ile Val Arg Ala Ser Thr 165 170 175Phe Leu Arg
Glu Ile Asn Ile Glu Ser Gly Leu Thr Pro Arg Val Asn 180 185 190Ser
Met Pro Asp Val Pro Val Ile Gly Gly His Ser Gly Glu Thr Ile 195 200
205Ile Pro Leu Phe Ser Gln Ser Asn Phe Leu Ser Arg Leu Asn Glu Asp
210 215 220Gln Leu Lys Tyr Leu Ile His Arg Val Gln Tyr Gly Gly Asp
Glu Val225 230 235 240Val Lys Ala Lys Asn Gly Lys Gly Ser Ala Thr
Leu Ser Met Ala His 245 250 255Ala Gly Tyr Lys Cys Val Val Gln Phe
Val Ser Leu Leu Leu Gly Asn 260 265 270Ile Glu Gln Ile His Gly Thr
Tyr Tyr Val Pro Leu Lys Asp Ala Asn 275 280 285Asn Phe Pro Ile Ala
Pro Gly Ala Asp Gln Leu Leu Pro Leu Val Asp 290 295 300Gly Ala Asp
Tyr Phe Ala Ile Pro Leu Thr Ile Thr Thr Lys Gly Val305 310 315
320Ser Tyr Val Asp Tyr Asp Ile Val Asn Arg Met Asn Asp Met Glu Arg
325 330 335Asn Gln Met Leu Pro Ile Cys Val Ser Gln Leu Lys Lys Asn
Ile Asp 340 345 350Lys Gly Leu Glu Phe Val Ala Ser Arg Ser Ala Ser
Ser 355 360 365201099DNAArtificial sequencecpo MDH2 S. cerevisiae
lacking fisrt 12 a.a. 20atgttgaaga ttgccatctt gggtgctgct ggtggtatcg
gtcaatcttt gtctttgttg 60ttgaaggctc aattgcaata ccaattgaag gaatccaaca
gatctgttac ccacattcat 120ttggctttgt acgatgtcaa ccaagaagct
atcaacggtg tcactgctga cttgtctcac 180atcgataccc caatctctgt
ttcctctcac tctccagctg gtggtattga aaactgtttg 240cacaacgctt
ccattgttgt cattccagcc ggtgttccaa gaaagccagg tatgacccgt
300gacgatttgt tcaacgtcaa tgccggtatc atctctcaat taggtgattc
cattgctgaa 360tgttgtgact tgtccaaggt tttcgtcttg gttatctcca
acccagtcaa ctctttggtt 420cctgttatgg tttccaacat cttgaagaac
cacccacaat ccagaaactc tggtattgaa 480agaagaatca tgggtgtcac
caaattggac attgtcagag cttccacttt cttgagagaa 540atcaacattg
aatctggttt gactccaaga gtcaactcca tgccagatgt tccagttatc
600ggtggtcact ctggtgaaac tatcatccca ttattctctc aatctaactt
cttgtccaga 660ttgaatgaag atcaattgaa atacttgatt caccgtgtcc
aatacggtgg tgacgaagtt 720gtcaaggcca agaacggtaa gggttctgct
actctatcca tggctcatgc cggttacaag 780tgtgttgtcc aattcgtttc
tctattatta ggtaacattg aacaaatcca cggtacctac 840tacgttccat
tgaaagatgc taacaacttc ccaattgctc caggtgctga ccaattattg
900ccattagtcg acggtgctga ctactttgcc atcccattga ccatcactac
caagggtgtt 960tcttacgttg actacgatat cgtcaacaga atgaacgaca
tggaaagaaa ccaaatgttg 1020cctatctgtg tttctcaatt gaagaagaac
attgacaagg gtttggaatt cgttgcttcc 1080agatctgctt ccagttaag
109921340PRTArtificial sequenceMDH3 S. cerevisiae lacking
C-terminal SKL 21Met Val Lys Val Ala Ile Leu Gly Ala Ser Gly Gly
Val Gly Gln Pro1 5 10 15Leu Ser Leu Leu Leu Lys Leu Ser Pro Tyr Val
Ser Glu Leu Ala Leu 20 25 30Tyr Asp Ile Arg Ala Ala Glu Gly Ile Gly
Lys Asp Leu Ser His Ile 35 40 45Asn Thr Asn Ser Ser Cys Val Gly Tyr
Asp Lys Asp Ser Ile Glu Asn 50 55 60Thr Leu Ser Asn Ala Gln Val Val
Leu Ile Pro Ala Gly Val Pro Arg65 70 75 80Lys Pro Gly Leu Thr Arg
Asp Asp Leu Phe Lys Met Asn Ala Gly Ile 85 90 95Val Lys Ser Leu Val
Thr Ala Val Gly Lys Phe Ala Pro Asn Ala Arg 100 105 110Ile Leu Val
Ile Ser Asn Pro Val Asn Ser Leu Val Pro Ile Ala Val 115 120 125Glu
Thr Leu Lys Lys Met Gly Lys Phe Lys Pro Gly Asn Val Met Gly 130 135
140Val Thr Asn Leu Asp Leu Val Arg Ala Glu Thr Phe Leu Val Asp
Tyr145 150 155 160Leu Met Leu Lys Asn Pro Lys Ile Gly Gln Glu Gln
Asp Lys Thr Thr 165 170 175Met His Arg Lys Val Thr Val Ile Gly Gly
His Ser Gly Glu Thr Ile 180 185 190Ile Pro Ile Ile Thr Asp Lys Ser
Leu Val Phe Gln Leu Asp Lys Gln 195 200 205Tyr Glu His Phe Ile His
Arg Val Gln Phe Gly Gly Asp Glu Ile Val 210 215 220Lys Ala Lys Gln
Gly Ala Gly Ser Ala Thr Leu Ser Met Ala Phe Ala225 230 235 240Gly
Ala Lys Phe Ala Glu Glu Val Leu Arg Ser Phe His Asn Glu Lys 245 250
255Pro Glu Thr Glu Ser Leu Ser Ala Phe Val Tyr Leu Pro Gly Leu Lys
260 265 270Asn Gly Lys Lys Ala Gln Gln Leu Val Gly Asp Asn Ser Ile
Glu Tyr 275 280 285Phe Ser Leu Pro Ile Val Leu Arg Asn Gly Ser Val
Val Ser Ile Asp 290 295 300Thr Ser Val Leu Glu Lys Leu Ser Pro Arg
Glu Glu Gln Leu Val Asn305 310 315 320Thr Ala Val Lys Glu Leu Arg
Lys Asn Ile Glu Lys Gly Lys Ser Phe 325 330 335Ile Leu Asp Ser
340221024DNAArtificial sequenceMDH3 S. cerevisiae lacking SKL
encoding nt, cpo 22atggttaagg ttgccatctt aggtgcttct ggtggtgtcg
gtcaaccatt atctctatta 60ttgaaattgt ctccatacgt ttctgaattg gctttgtacg
atatcagagc tgctgaaggt 120attggtaagg atttgtccca catcaacacc
aactcctctt gtgttggtta cgacaaggat 180tccatcgaaa acactttgtc
caatgctcaa gttgtcttga ttccagctgg tgttccaaga 240aagccaggtt
tgaccagaga tgatttgttc aagatgaacg ctggtatcgt taagtctttg
300gttactgctg tcggtaaatt tgccccaaac gctcgtatct tagtcatctc
caaccctgtt 360aactctttgg ttccaattgc cgttgaaact ttgaagaaga
tgggtaagtt caagccaggt 420aacgttatgg gtgtcaccaa cttggatttg
gtcagagctg aaactttctt ggttgactac 480ttgatgttga agaacccaaa
gatcggtcaa gaacaagaca agaccaccat gcacagaaag 540gtcaccgtca
tcggtggtca ctctggtgaa accatcattc caatcatcac tgacaaatcc
600ttggttttcc aattggacaa gcaatacgaa catttcatcc acagagtcca
attcggtggt 660gacgaaattg tcaaggccaa gcaaggtgcc ggttctgcta
ccttgtccat ggctttcgct 720ggtgccaaat ttgctgaaga agtcttacgt
tctttccaca acgaaaagcc agaaactgaa 780tctttgtctg ctttcgtcta
cttgccaggt ttgaagaacg gtaagaaggc tcaacaatta 840gtcggtgaca
actccattga atacttctct ttgccaattg ttttgagaaa cggttccgtt
900gtttccattg acacttctgt tttggaaaaa ttgtctccaa gagaagaaca
attggtcaac 960actgctgtca aggaattgag aaagaacatt gaaaagggta
agtctttcat cttggacagt 1020taag 102423472PRTArtificial
sequenceFumarase R. oryzae lacking first 23 aa+ new M 23Met Ser Ser
Ala Ser Ala Ala Leu Gln Lys Phe Arg Ala Glu Arg Asp1 5 10 15Thr Phe
Gly Asp Leu Gln Val Pro Ala Asp Arg Tyr Trp Gly Ala Gln 20 25 30Thr
Gln Arg Ser Leu Gln Asn Phe Asp Ile Gly Gly Pro Thr Glu Arg 35 40
45Met Pro Glu Pro Leu Ile Arg Ala Phe Gly Val Leu Lys Lys Ala Ala
50 55 60Ala Thr Val Asn Met Thr Tyr Gly Leu Asp Pro Lys Val Gly Glu
Ala65 70 75 80Ile Gln Lys Ala Ala Asp Glu Val Ile Asp Gly Ser Leu
Ile Asp His 85 90 95Phe Pro Leu Val Val Trp Gln Thr Gly Ser Gly Thr
Gln Thr Lys Met 100 105 110Asn Val Asn Glu Val Ile Ser Asn Arg Ala
Ile Glu Leu Leu Gly Gly 115 120 125Glu Leu Gly Ser Lys Ala Pro Val
His Pro Asn Asp His Val Asn Met 130 135 140Ser Gln Ser Ser Asn Asp
Thr Phe Pro Thr Ala Met His Val Ala Ala145 150 155 160Val Val Glu
Ile His Gly Arg Leu Ile Pro Ala Leu Thr Thr Leu Arg 165 170 175Asp
Ala Leu Gln Ala Lys Ser Ala Glu Phe Glu His Ile Ile Lys Ile 180 185
190Gly Arg Thr His Leu Gln Asp Ala Thr Pro Leu Thr Leu Gly Gln Glu
195 200 205Phe Ser Gly Tyr Thr Gln Gln Leu Thr Tyr Gly Ile Ala Arg
Val Gln 210 215 220Gly Thr Leu Glu Arg Leu Tyr Asn Leu Ala Gln Gly
Gly Thr Ala Val225 230 235 240Gly Thr Gly Leu Asn Thr Arg Lys Gly
Phe Asp Ala Lys Val Ala Glu 245 250 255Ala Ile Ala Ser Ile Thr Gly
Leu Pro Phe Lys Thr Ala Pro Asn Lys 260 265 270Phe Glu Ala Leu Ala
Ala His Asp Ala Leu Val Glu Ala His Gly Ala 275 280 285Leu Asn Thr
Val Ala Cys Ser Leu Met Lys Ile Ala Asn Asp Ile Arg 290 295 300Tyr
Leu Gly Ser Gly Pro Arg Cys Gly Leu Gly Glu Leu Ser Leu Pro305 310
315 320Glu Asn Glu Pro Gly Ser Ser Ile Met Pro Gly Lys Val Asn Pro
Thr 325 330 335Gln Cys Glu Ala Met Thr Met Val Cys Ala Gln Val Met
Gly Asn Asn 340 345 350Thr Ala Ile Ser Val Ala Gly Ser Asn Gly Gln
Phe Glu Leu Asn Val 355 360 365Phe Lys Pro Val Met Ile Lys Asn Leu
Ile Gln Ser Ile Arg Leu Ile 370 375 380Ser Asp Ala Ser Ile Ser Phe
Thr Lys Asn Cys Val Val Gly Ile Glu385 390 395 400Ala Asn Glu Lys
Lys Ile Ser Ser Ile Met Asn Glu Ser Leu Met Leu 405 410 415Val Thr
Ala Leu Asn Pro His Ile Gly Tyr Asp Lys Ala Ala Lys Cys 420 425
430Ala Lys Lys Ala His Lys Glu Gly Thr Thr Leu Lys Glu Ala Ala Leu
435 440 445Ser Leu Gly Tyr Leu Thr Ser Glu Glu Phe Asp Gln Trp Val
Arg Pro 450 455 460Glu Asp Met Ile Ser Ala Lys Asp465
470241419DNAArtificial sequenceFumarase R. oryzae lacking nt
encoding first aa + M 24atgtcctctg cttctgctgc tttgcaaaaa ttcagagctg
aaagagatac cttcggtgac 60ttgcaagttc cagctgaccg ttactggggt gctcaaactc
aaagatcttt gcaaaacttt 120gacattggtg gtccaactga aagaatgcca
gaaccattaa tcagagcttt cggtgttttg 180aagaaggctg ctgccaccgt
caacatgacc tacggtttgg acccaaaggt tggtgaagcc 240atccaaaagg
ctgctgacga agttatcgat ggttctttga ttgaccattt cccattggtt
300gtctggcaaa ccggttctgg tactcaaacc aagatgaacg tcaatgaagt
catctccaac 360agagccattg aattgttggg tggtgaatta ggttccaagg
ctccagtcca cccaaacgat 420catgtcaaca tgtctcaatc ttccaacgac
actttcccaa ctgccatgca cgttgctgcc 480gttgttgaaa ttcacggtag
attgattcca gctttgacca ctttgagaga tgctttgcaa 540gccaaatctg
ctgaattcga acacatcatc aagattggta gaacccactt gcaagatgct
600accccattga ctttaggtca agaattctcc ggttacactc aacaattgac
ctacggtatt 660gctcgtgttc aaggtacttt ggaaagatta tacaacttgg
ctcaaggtgg tactgctgtc 720ggtactggtt tgaacaccag aaagggtttc
gatgccaagg ttgctgaagc cattgcttcc 780atcactggtt taccattcaa
gaccgctcca aacaaattcg aagctttggc tgctcacgac 840gctttggttg
aagctcacgg tgctttgaac accgttgctt gttctttgat gaagattgcc
900aacgatatcc gttacttggg ttctggtcca agatgtggtt taggtgaatt
gtctctacca 960gaaaacgaac caggttcttc catcatgcca ggtaaggtca
acccaactca atgtgaagct 1020atgaccatgg tttgtgctca agtcatgggt
aacaacactg
ccatctctgt tgctggttcc 1080aacggtcaat tcgaattgaa tgtctttaaa
ccagtcatga tcaagaactt gatccaatcc 1140atcagattaa tctctgacgc
ttccatctct ttcaccaaga actgtgttgt cggtattgaa 1200gctaacgaaa
agaagatctc ctccatcatg aacgaatctt tgatgttggt cactgctttg
1260aaccctcaca ttggttacga caaggctgcc aagtgtgcca agaaggctca
caaggaaggt 1320accactttga aagaagctgc tctatctttg ggttacttga
cctctgaaga attcgaccaa 1380tgggttagac ctgaggacat gatttctgcc
aaggattaa 1419251000DNAArtificial sequenceTDH1 promotor
25cttccctttt acagtgcttc ggaaaagcac agcgttgtcc aagggaacaa tttttcttca
60agttaatgca taagaaatat ctttttttat gtttagctaa gtaaaagcag cttggagtaa
120aaaaaaaaat gagtaaattt ctcgatggat tagtttctca caggtaacat
aacaaaaacc 180aagaaaagcc cgcttctgaa aactacagtt gacttgtatg
ctaaagggcc agactaatgg 240gaggagaaaa agaaacgaat gtatatgctc
atttacactc tatatcacca tatggaggat 300aagttgggct gagcttctga
tccaatttat tctatccatt agttgctgat atgtcccacc 360agccaacact
tgatagtatc tactcgccat tcacttccag cagcgccagt agggttgttg
420agcttagtaa aaatgtgcgc accacaagcc tacatgactc cacgtcacat
gaaaccacac 480cgtggggcct tgttgcgcta ggaataggat atgcgacgaa
gacgcttctg cttagtaacc 540acaccacatt ttcagggggt cgatctgctt
gcttccttta ctgtcacgag cggcccataa 600tcgcgctttt tttttaaaag
gcgcgagaca gcaaacagga agctcgggtt tcaaccttcg 660gagtggtcgc
agatctggag actggatctt tacaatacag taaggcaagc caccatctgc
720ttcttaggtg catgcgacgg tatccacgtg cagaacaaca tagtctgaag
aaggggggga 780ggagcatgtt cattctctgt agcagtaaga gcttggtgat
aatgaccaaa actggagtct 840cgaaatcata taaatagaca atatattttc
acacaatgag atttgtagta cagttctatt 900ctctctcttg cataaataag
aaattcatca agaacttggt ttgatatttc accaacacac 960acaaaaaaca
gtacttcact aaatttacac acaaaacaaa 100026500DNAArtificial
sequenceTDH1 terminator 26ataaagcaat cttgatgagg ataatgattt
ttttttgaat atacataaat actaccgttt 60ttctgctaga ttttgtgaag acgtaaataa
gtacatatta ctttttaagc caagacaaga 120ttaagcatta actttaccct
tttctcttct aagtttcaat actagttatc actgtttaaa 180agttatggcg
agaacgtcgg cggttaaaat atattaccct gaacgtggtg aattgaagtt
240ctaggatggt ttaaagattt ttcctttttg ggaaataagt aaacaatata
ttgctgcctt 300tgcaaaacgc acatacccac aatatgtgac tattggcaaa
gaacgcatta tcctttgaag 360aggtggatac tgatactaag agagtctcta
ttccggctcc acttttagtc cagagattac 420ttgtcttctt acgtatcaga
acaagaaagc atttccaaag taattgcatt tgcccttgag 480cagtatatat
atactaagaa 50027600DNAArtificial sequencesecond TDH3 promotor
27ttagtcaaaa aattagcctt ttaattctgc tgtaacccgt acatgcccaa aatagggggc
60gggttacaca gaatatataa catcgtaggt gtctgggtga acagtttatt cctggcatcc
120actaaatata atggagcccg ctttttaagc tggcatccag aaaaaaaaag
aatcccagca 180ccaaaatatt gttttcttca ccaaccatca gttcataggt
ccattctctt agcgcaacta 240cagagaacag gggcacaaac aggcaaaaaa
cgggcacaac ctcaatggag tgatgcaacc 300tgcctggagt aaatgatgac
acaaggcaat tgacccacgc atgtatctat ctcattttct 360tacaccttct
attaccttct gctctctctg atttggaaaa agctgaaaaa aaaggttgaa
420accagttccc tgaaattatt cccctacttg actaataagt atataaagac
ggtaggtatt 480gattgtaatt ctgtaaatct atttcttaaa cttcttaaat
tctactttta tagttagtct 540tttttttagt tttaaaacac caagaactta
gtttcgaata aacacacata aacaaacaaa 60028300DNAArtificial
sequencesecond TDH3 terminator 28gtgaatttac tttaaatctt gcatttaaat
aaattttctt tttatagctt tatgacttag 60tttcaattta tatactattt taatgacatt
ttcgattcat tgattgaaag ctttgtgttt 120tttcttgatg cgctattgca
ttgttcttgt ctttttcgcc acatgtaata tctgtagtag 180atacctgata
cattgtggat gctgagtgaa attttagtta ataatggagg cgctcttaat
240aattttgggg atattggctt ttttttttaa agtttacaaa tgaatttttt
ccgccaggat 300293148DNAArtificial sequenceTDH1p-PCKm-TDH1t
synthetic construct 29ggatcccttc ccttttacag tgcttcggaa aagcacagcg
ttgtccaagg gaacaatttt 60tcttcaagtt aatgcataag aaatatcttt ttttatgttt
agctaagtaa aagcagcttg 120gagtaaaaaa aaaaatgagt aaatttctcg
atggattagt ttctcacagg taacataaca 180aaaaccaaga aaagcccgct
tctgaaaact acagttgact tgtatgctaa agggccagac 240taatgggagg
agaaaaagaa acgaatgtat atgctcattt acactctata tcaccatatg
300gaggataagt tgggctgagc ttctgatcca atttattcta tccattagtt
gctgatatgt 360cccaccagcc aacacttgat agtatctact cgccattcac
ttccagcagc gccagtaggg 420ttgttgagct tagtaaaaat gtgcgcacca
caagcctaca tgactccacg tcacatgaaa 480ccacaccgtg gggccttgtt
gcgctaggaa taggatatgc gacgaagacg cttctgctta 540gtaaccacac
cacattttca gggggtcgat ctgcttgctt cctttactgt cacgagcggc
600ccataatcgc gctttttttt taaaaggcgc gagacagcaa acaggaagct
cgggtttcaa 660ccttcggagt ggtcgcagat ctggagactg gatctttaca
atacagtaag gcaagccacc 720atctgcttct taggtgcatg cgacggtatc
cacgtgcaga acaacatagt ctgaagaagg 780gggggaggag catgttcatt
ctctgtagca gtaagagctt ggtgataatg accaaaactg 840gagtctcgaa
atcatataaa tagacaatat attttcacac aatgagattt gtagtacagt
900tctattctct ctcttgcata aataagaaat tcatcaagaa cttggtttga
tatttcacca 960acacacacaa aaaacagtac ttcactaaat ttacacacaa
aacaaaatga ctgatttgaa 1020caaattggtc aaggaattga atgatttggg
tttgactgac gtcaaggaaa ttgtctacaa 1080cccatcttac gaacaattat
tcgaagaaga aaccaagcca ggtttggaag gtttcgacaa 1140gggtactttg
accactttag gtgctgttgc tgttgacacc ggtattttca ccggtcgttc
1200tccaaaggac aaatacattg tttgtgatga aaccaccaag gacaccgtct
ggtggaactc 1260tgaagctgcc aagaacgata acaagccaat gactcaagaa
acctggaaat ctttgagaga 1320attggttgcc aagcaattgt ctggtaagag
attattcgtt gttgacgctt tctgtggtgc 1380ttctgaaaag cacagaattg
gtgtcagaat ggtcactgaa gttgcttggc aagctcattt 1440cgtcaagaac
atgttcatca gaccaactga cgaagaattg aagaacttca aggctgactt
1500caccgttttg aatggtgcca agtgtaccaa cccaaactgg aaggaacaag
gtttgaactc 1560tgaaaacttt gttgctttca acatcactga aggtatccaa
ttgattggtg gtacctggta 1620cggtggtgaa atgaagaagg gtatgttctc
catgatgaac tatttcttgc cattgaaagg 1680tgttgcttcc atgcactgtt
ctgccaatgt cggtaaggat ggtgacgttg ccatcttctt 1740cggtctatcc
ggtactggta agaccactct atccactgac ccaaagagac aattgattgg
1800tgatgacgaa cacggttggg acgaatctgg tgtctttaac tttgaaggtg
gttgttacgc 1860caagaccatc aacttatctc aagaaaacga accagatatc
tacggtgcca tccgtcgtga 1920tgctttgttg gaaaacgttg ttgtcagagc
tgacggttct gttgacttcg acgacggttc 1980caagactgaa aacaccagag
tttcttaccc aatctaccac attgacaaca ttgtcagacc 2040tgtttccaag
gctggtcacg ctaccaaggt tatcttcttg actgctgatg ctttcggtgt
2100cttgccacct gtttccaaat tgactccaga acaaaccgaa tactacttct
tgtccggttt 2160cactgccaaa ttggctggta ctgaaagagg tgtcactgaa
ccaactccaa ctttctctgc 2220ttgtttcggt gctgctttct tatctttgca
cccaatccaa tacgctgatg tcttggttga 2280aagaatgaag gcttctggtg
ctgaagctta cttggtcaac accggttgga acggtaccgg 2340taagagaatc
tccatcaagg ataccagagg tatcattgat gctatcttgg acggttccat
2400tgaaaaggct gaaatgggtg aattgccaat cttcaacttg gccattccaa
aggctttgcc 2460aggtgttgac ccagccatct tagatccaag agacacctac
gctgacaagg ctcaatggca 2520agtcaaggct gaagatttgg ctaacagatt
cgtcaagaac tttgtcaaat acactgctaa 2580cccagaagct gccaaattgg
ttggtgctgg tccaaaggct taaggcccgg gcataaagca 2640atcttgatga
ggataatgat ttttttttga atatacataa atactaccgt ttttctgcta
2700gattttgtga agacgtaaat aagtacatat tactttttaa gccaagacaa
gattaagcat 2760taactttacc cttttctctt ctaagtttca atactagtta
tcactgttta aaagttatgg 2820cgagaacgtc ggcggttaaa atatattacc
ctgaacgtgg tgaattgaag ttctaggatg 2880gtttaaagat ttttcctttt
tgggaaataa gtaaacaata tattgctgcc tttgcaaaac 2940gcacataccc
acaatatgtg actattggca aagaacgcat tatcctttga agaggtggat
3000actgatacta agagagtctc tattccggct ccacttttag tccagagatt
acttgtcttc 3060ttacgtatca gaacaagaaa gcatttccaa agtaattgca
tttgcccttg agcagtatat 3120atatactaag aaggcgcgcc gcggccgc
3148303148DNAArtificial sequenceTDH1p-PCK1-TDH1t synthetic
construct 30ggatcccttc ccttttacag tgcttcggaa aagcacagcg ttgtccaagg
gaacaatttt 60tcttcaagtt aatgcataag aaatatcttt ttttatgttt agctaagtaa
aagcagcttg 120gagtaaaaaa aaaaatgagt aaatttctcg atggattagt
ttctcacagg taacataaca 180aaaaccaaga aaagcccgct tctgaaaact
acagttgact tgtatgctaa agggccagac 240taatgggagg agaaaaagaa
acgaatgtat atgctcattt acactctata tcaccatatg 300gaggataagt
tgggctgagc ttctgatcca atttattcta tccattagtt gctgatatgt
360cccaccagcc aacacttgat agtatctact cgccattcac ttccagcagc
gccagtaggg 420ttgttgagct tagtaaaaat gtgcgcacca caagcctaca
tgactccacg tcacatgaaa 480ccacaccgtg gggccttgtt gcgctaggaa
taggatatgc gacgaagacg cttctgctta 540gtaaccacac cacattttca
gggggtcgat ctgcttgctt cctttactgt cacgagcggc 600ccataatcgc
gctttttttt taaaaggcgc gagacagcaa acaggaagct cgggtttcaa
660ccttcggagt ggtcgcagat ctggagactg gatctttaca atacagtaag
gcaagccacc 720atctgcttct taggtgcatg cgacggtatc cacgtgcaga
acaacatagt ctgaagaagg 780gggggaggag catgttcatt ctctgtagca
gtaagagctt ggtgataatg accaaaactg 840gagtctcgaa atcatataaa
tagacaatat attttcacac aatgagattt gtagtacagt 900tctattctct
ctcttgcata aataagaaat tcatcaagaa cttggtttga tatttcacca
960acacacacaa aaaacagtac ttcactaaat ttacacacaa aacaaaatga
ccgatttgaa 1020ccaattgact caagaattgg gtgctttggg tattcacgat
gtccaagaag ttgtctacaa 1080cccatcttac gaattgttgt ttgctgaaga
aaccaagcca ggtttggaag gttacgaaaa 1140gggtactgtt accaaccaag
gtgctgttgc tgtcaacacc ggtatcttca ccggtcgttc 1200tccaaaggac
aaatacattg tcttggatga caagaccaag gacactgtct ggtggacttc
1260tgaaaaggtc aagaacgaca acaaaccaat gtcccaagac acttggaact
ctttaaaggg 1320tttagtcgct gaccaattgt ctggtaagag attattcgtt
gtcgatgctt tctgtggtgc 1380caacaaggac accagattag ctgtcagagt
tgtcactgaa gttgcttggc aagctcactt 1440cgttaccaac atgttcatca
gaccatctgc tgaagaattg aaaggtttca agccagattt 1500cgttgtcatg
aacggtgcca aatgtaccaa cccaaactgg aaggaacaag gtttgaactc
1560tgaaaacttt gttgctttca acatcactga aggtgttcaa ttgattggtg
gtacctggta 1620cggtggtgaa atgaagaagg gtatgttctc catgatgaac
tacttcttgc cattgagagg 1680tattgcttcc atgcactgtt ctgccaatgt
cggtaaggac ggtgacactg ccatcttctt 1740cggtctatcc ggtaccggta
agaccacttt gtccactgac ccaaagagac aattgattgg 1800tgatgacgaa
cacggttggg atgacgaagg tgttttcaac tttgaaggtg gttgttacgc
1860caagaccatc aacttatctg ctgaaaatga accagatatc tacggtgcca
tcaagcgtga 1920cgctctattg gaaaacgttg ttgttttgga caatggtgac
gtcgattatg ctgacggttc 1980caagactgaa aacaccagag tttcttaccc
aatctaccat attcaaaaca ttgtcaagcc 2040agtttccaag gctggtccag
ctaccaaagt tatcttcttg tctgctgatg ctttcggtgt 2100tttgcctcct
gtttccaagt tgactccaga acaaaccaag tactacttct tgtctggttt
2160caccgccaag ttggctggta ctgaaagagg tatcactgaa ccaactccaa
ctttctctgc 2220ttgtttcggt gctgcctttt tgtctttgca cccaactcaa
tacgctgaag ttttggtcaa 2280gagaatgcaa gaatctggtg ctgaagctta
cttggtcaac actggttgga acggtaccgg 2340taagagaatc tccatcaaag
ataccagagg tatcatcgat gccatcttgg atggttccat 2400tgacaaggct
gaaatgggtt ctttgccaat tttcgatttc tccattccaa aggctttgcc
2460aggtgtcaac ccagccatct tagacccaag agacacctac gctgacaaag
ctcaatggga 2520agaaaaggct caagacttgg ctggtagatt cgtcaagaac
ttcgaaaaat acactggtac 2580tgctgaaggt caagctttgg ttgctgctgg
tccaaaggcc taaggcccgg gcataaagca 2640atcttgatga ggataatgat
ttttttttga atatacataa atactaccgt ttttctgcta 2700gattttgtga
agacgtaaat aagtacatat tactttttaa gccaagacaa gattaagcat
2760taactttacc cttttctctt ctaagtttca atactagtta tcactgttta
aaagttatgg 2820cgagaacgtc ggcggttaaa atatattacc ctgaacgtgg
tgaattgaag ttctaggatg 2880gtttaaagat ttttcctttt tgggaaataa
gtaaacaata tattgctgcc tttgcaaaac 2940gcacataccc acaatatgtg
actattggca aagaacgcat tatcctttga agaggtggat 3000actgatacta
agagagtctc tattccggct ccacttttag tccagagatt acttgtcttc
3060ttacgtatca gaacaagaaa gcatttccaa agtaattgca tttgcccttg
agcagtatat 3120atatactaag aaggcgcgcc gcggccgc
3148312637DNAArtificial sequenceTDH3p-delta 12N MDH2-TDH3t
synthetic construct 31ggatccggcg cgccctattt tcgaggacct tgtcaccttg
agcccaagag agccaagatt 60taaattttcc tatgacttga tgcaaattcc caaagctaat
aacatgcaag acacgtacgg 120tcaagaagac atatttgacc tcttaacagg
ttcagacgcg actgcctcat cagtaagacc 180cgttgaaaag aacttacctg
aaaaaaacga atatatacta gcgttgaatg ttagcgtcaa 240caacaagaag
tttaatgacg cggaggccaa ggcaaaaaga ttccttgatt acgtaaggga
300gttagaatca ttttgaataa aaaacacgct ttttcagttc gagtttatca
ttatcaatac 360tgccatttca aagaatacgt aaataattaa tagtagtgat
tttcctaact ttatttagtc 420aaaaaattag ccttttaatt ctgctgtaac
ccgtacatgc ccaaaatagg gggcgggtta 480cacagaatat ataacatcgt
aggtgtctgg gtgaacagtt tattcctggc atccactaaa 540tataatggag
cccgcttttt aagctggcat ccagaaaaaa aaagaatccc agcaccaaaa
600tattgttttc ttcaccaacc atcagttcat aggtccattc tcttagcgca
actacagaga 660acaggggcac aaacaggcaa aaaacgggca caacctcaat
ggagtgatgc aacctgcctg 720gagtaaatga tgacacaagg caattgaccc
acgcatgtat ctatctcatt ttcttacacc 780ttctattacc ttctgctctc
tctgatttgg aaaaagctga aaaaaaaggt tgaaaccagt 840tccctgaaat
tattccccta cttgactaat aagtatataa agacggtagg tattgattgt
900aattctgtaa atctatttct taaacttctt aaattctact tttatagtta
gtcttttttt 960tagttttaaa acaccaagaa cttagtttcg aataaacaca
cataaacaaa caaaatgttg 1020aagattgcca tcttgggtgc tgctggtggt
atcggtcaat ctttgtcttt gttgttgaag 1080gctcaattgc aataccaatt
gaaggaatcc aacagatctg ttacccacat tcatttggct 1140ttgtacgatg
tcaaccaaga agctatcaac ggtgtcactg ctgacttgtc tcacatcgat
1200accccaatct ctgtttcctc tcactctcca gctggtggta ttgaaaactg
tttgcacaac 1260gcttccattg ttgtcattcc agccggtgtt ccaagaaagc
caggtatgac ccgtgacgat 1320ttgttcaacg tcaatgccgg tatcatctct
caattaggtg attccattgc tgaatgttgt 1380gacttgtcca aggttttcgt
cttggttatc tccaacccag tcaactcttt ggttcctgtt 1440atggtttcca
acatcttgaa gaaccaccca caatccagaa actctggtat tgaaagaaga
1500atcatgggtg tcaccaaatt ggacattgtc agagcttcca ctttcttgag
agaaatcaac 1560attgaatctg gtttgactcc aagagtcaac tccatgccag
atgttccagt tatcggtggt 1620cactctggtg aaactatcat cccattattc
tctcaatcta acttcttgtc cagattgaat 1680gaagatcaat tgaaatactt
gattcaccgt gtccaatacg gtggtgacga agttgtcaag 1740gccaagaacg
gtaagggttc tgctactcta tccatggctc atgccggtta caagtgtgtt
1800gtccaattcg tttctctatt attaggtaac attgaacaaa tccacggtac
ctactacgtt 1860ccattgaaag atgctaacaa cttcccaatt gctccaggtg
ctgaccaatt attgccatta 1920gtcgacggtg ctgactactt tgccatccca
ttgaccatca ctaccaaggg tgtttcttac 1980gttgactacg atatcgtcaa
cagaatgaac gacatggaaa gaaaccaaat gttgcctatc 2040tgtgtttctc
aattgaagaa gaacattgac aagggtttgg aattcgttgc ttccagatct
2100gcttccagtt aaggcccggg cgtgaattta ctttaaatct tgcatttaaa
taaattttct 2160ttttatagct ttatgactta gtttcaattt atatactatt
ttaatgacat tttcgattca 2220ttgattgaaa gctttgtgtt ttttcttgat
gcgctattgc attgttcttg tctttttcgc 2280cacatgtaat atctgtagta
gatacctgat acattgtgga tgctgagtga aattttagtt 2340aataatggag
gcgctcttaa taattttggg gatattggct ttttttttta aagtttacaa
2400atgaattttt tccgccagga taacgattct gaagttactc ttagcgttcc
tatcggtaca 2460gccatcaaat catgcctata aatcatgcct atatttgcgt
gcagtcagta tcatctacat 2520gaaaaaaact cccgcaattt cttatagaat
acgttgaaaa ttaaatgtac gcgccaagat 2580aagataacat atatctagat
gcagtaatat acacagattc cggccggccg cggccgc 2637321966DNAArtificial
sequenceTDH3p-MDH3-TDH3t synthetic construct 32ggatccggcg
cgccacgcgt ggccggcctt agtcaaaaaa ttagcctttt aattctgctg 60taacccgtac
atgcccaaaa tagggggcgg gttacacaga atatataaca tcgtaggtgt
120ctgggtgaac agtttattcc tggcatccac taaatataat ggagcccgct
ttttaagctg 180gcatccagaa aaaaaaagaa tcccagcacc aaaatattgt
tttcttcacc aaccatcagt 240tcataggtcc attctcttag cgcaactaca
gagaacaggg gcacaaacag gcaaaaaacg 300ggcacaacct caatggagtg
atgcaacctg cctggagtaa atgatgacac aaggcaattg 360acccacgcat
gtatctatct cattttctta caccttctat taccttctgc tctctctgat
420ttggaaaaag ctgaaaaaaa aggttgaaac cagttccctg aaattattcc
cctacttgac 480taataagtat ataaagacgg taggtattga ttgtaattct
gtaaatctat ttcttaaact 540tcttaaattc tacttttata gttagtcttt
tttttagttt taaaacacca agaacttagt 600ttcgaataaa cacacataaa
caaacaaaat ggttaaggtt gccatcttag gtgcttctgg 660tggtgtcggt
caaccattat ctctattatt gaaattgtct ccatacgttt ctgaattggc
720tttgtacgat atcagagctg ctgaaggtat tggtaaggat ttgtcccaca
tcaacaccaa 780ctcctcttgt gttggttacg acaaggattc catcgaaaac
actttgtcca atgctcaagt 840tgtcttgatt ccagctggtg ttccaagaaa
gccaggtttg accagagatg atttgttcaa 900gatgaacgct ggtatcgtta
agtctttggt tactgctgtc ggtaaatttg ccccaaacgc 960tcgtatctta
gtcatctcca accctgttaa ctctttggtt ccaattgccg ttgaaacttt
1020gaagaagatg ggtaagttca agccaggtaa cgttatgggt gtcaccaact
tggatttggt 1080cagagctgaa actttcttgg ttgactactt gatgttgaag
aacccaaaga tcggtcaaga 1140acaagacaag accaccatgc acagaaaggt
caccgtcatc ggtggtcact ctggtgaaac 1200catcattcca atcatcactg
acaaatcctt ggttttccaa ttggacaagc aatacgaaca 1260tttcatccac
agagtccaat tcggtggtga cgaaattgtc aaggccaagc aaggtgccgg
1320ttctgctacc ttgtccatgg ctttcgctgg tgccaaattt gctgaagaag
tcttacgttc 1380tttccacaac gaaaagccag aaactgaatc tttgtctgct
ttcgtctact tgccaggttt 1440gaagaacggt aagaaggctc aacaattagt
cggtgacaac tccattgaat acttctcttt 1500gccaattgtt ttgagaaacg
gttccgttgt ttccattgac acttctgttt tggaaaaatt 1560gtctccaaga
gaagaacaat tggtcaacac tgctgtcaag gaattgagaa agaacattga
1620aaagggtaag tctttcatct tggacagtta aggtgaattt actttaaatc
ttgcatttaa 1680ataaattttc tttttatagc tttatgactt agtttcaatt
tatatactat tttaatgaca 1740ttttcgattc attgattgaa agctttgtgt
tttttcttga tgcgctattg cattgttctt 1800gtctttttcg ccacatgtaa
tatctgtagt agatacctga tacattgtgg atgctgagtg 1860aaattttagt
taataatgga ggcgctctta ataattttgg ggatattggc tttttttttt
1920aaagtttaca aatgaatttt ttccgccagg atgggcccgc ggccgc
1966332950DNAArtificial sequenceTDH1-FUMR-TDH1t synthetic construct
33ggatcccttc ccttttacag tgcttcggaa aagcacagcg ttgtccaagg gaacaatttt
60tcttcaagtt aatgcataag aaatatcttt ttttatgttt agctaagtaa aagcagcttg
120gagtaaaaaa aaaaatgagt aaatttctcg atggattagt ttctcacagg
taacataaca 180aaaaccaaga aaagcccgct tctgaaaact acagttgact
tgtatgctaa agggccagac 240taatgggagg agaaaaagaa acgaatgtat
atgctcattt acactctata tcaccatatg 300gaggataagt tgggctgagc
ttctgatcca atttattcta tccattagtt gctgatatgt 360cccaccagcc
aacacttgat agtatctact cgccattcac ttccagcagc gccagtaggg
420ttgttgagct tagtaaaaat gtgcgcacca caagcctaca tgactccacg
tcacatgaaa 480ccacaccgtg gggccttgtt gcgctaggaa taggatatgc
gacgaagacg cttctgctta 540gtaaccacac cacattttca gggggtcgat
ctgcttgctt cctttactgt cacgagcggc 600ccataatcgc gctttttttt
taaaaggcgc gagacagcaa acaggaagct cgggtttcaa 660ccttcggagt
ggtcgcagat ctggagactg gatctttaca atacagtaag gcaagccacc
720atctgcttct taggtgcatg cgacggtatc cacgtgcaga
acaacatagt ctgaagaagg 780gggggaggag catgttcatt ctctgtagca
gtaagagctt ggtgataatg accaaaactg 840gagtctcgaa atcatataaa
tagacaatat attttcacac aatgagattt gtagtacagt 900tctattctct
ctcttgcata aataagaaat tcatcaagaa cttggtttga tatttcacca
960acacacacaa aaaacagtac ttcactaaat ttacacacaa aacaaaatgt
cctctgcttc 1020tgctgctttg caaaaattca gagctgaaag agataccttc
ggtgacttgc aagttccagc 1080tgaccgttac tggggtgctc aaactcaaag
atctttgcaa aactttgaca ttggtggtcc 1140aactgaaaga atgccagaac
cattaatcag agctttcggt gttttgaaga aggctgctgc 1200caccgtcaac
atgacctacg gtttggaccc aaaggttggt gaagccatcc aaaaggctgc
1260tgacgaagtt atcgatggtt ctttgattga ccatttccca ttggttgtct
ggcaaaccgg 1320ttctggtact caaaccaaga tgaacgtcaa tgaagtcatc
tccaacagag ccattgaatt 1380gttgggtggt gaattaggtt ccaaggctcc
agtccaccca aacgatcatg tcaacatgtc 1440tcaatcttcc aacgacactt
tcccaactgc catgcacgtt gctgccgttg ttgaaattca 1500cggtagattg
attccagctt tgaccacttt gagagatgct ttgcaagcca aatctgctga
1560attcgaacac atcatcaaga ttggtagaac ccacttgcaa gatgctaccc
cattgacttt 1620aggtcaagaa ttctccggtt acactcaaca attgacctac
ggtattgctc gtgttcaagg 1680tactttggaa agattataca acttggctca
aggtggtact gctgtcggta ctggtttgaa 1740caccagaaag ggtttcgatg
ccaaggttgc tgaagccatt gcttccatca ctggtttacc 1800attcaagacc
gctccaaaca aattcgaagc tttggctgct cacgacgctt tggttgaagc
1860tcacggtgct ttgaacaccg ttgcttgttc tttgatgaag attgccaacg
atatccgtta 1920cttgggttct ggtccaagat gtggtttagg tgaattgtct
ctaccagaaa acgaaccagg 1980ttcttccatc atgccaggta aggtcaaccc
aactcaatgt gaagctatga ccatggtttg 2040tgctcaagtc atgggtaaca
acactgccat ctctgttgct ggttccaacg gtcaattcga 2100attgaatgtc
tttaaaccag tcatgatcaa gaacttgatc caatccatca gattaatctc
2160tgacgcttcc atctctttca ccaagaactg tgttgtcggt attgaagcta
acgaaaagaa 2220gatctcctcc atcatgaacg aatctttgat gttggtcact
gctttgaacc ctcacattgg 2280ttacgacaag gctgccaagt gtgccaagaa
ggctcacaag gaaggtacca ctttgaaaga 2340agctgctcta tctttgggtt
acttgacctc tgaagaattc gaccaatggg ttagacctga 2400ggacatgatt
tctgccaagg attaaggccc gggcataaag caatcttgat gaggataatg
2460attttttttt gaatatacat aaatactacc gtttttctgc tagattttgt
gaagacgtaa 2520ataagtacat attacttttt aagccaagac aagattaagc
attaacttta cccttttctc 2580ttctaagttt caatactagt tatcactgtt
taaaagttat ggcgagaacg tcggcggtta 2640aaatatatta ccctgaacgt
ggtgaattga agttctagga tggtttaaag atttttcctt 2700tttgggaaat
aagtaaacaa tatattgctg cctttgcaaa acgcacatac ccacaatatg
2760tgactattgg caaagaacgc attatccttt gaagaggtgg atactgatac
taagagagtc 2820tctattccgg ctccactttt agtccagaga ttacttgtct
tcttacgtat cagaacaaga 2880aagcatttcc aaagtaattg catttgccct
tgagcagtat atatatacta agaaggcgcg 2940ccgcggccgc
2950345037DNAArtificial sequenceTDH3p-FRDm1-TDH3t synthetic
construct 34ggatccggcg cgccctattt tcgaggacct tgtcaccttg agcccaagag
agccaagatt 60taaattttcc tatgacttga tgcaaattcc caaagctaat aacatgcaag
acacgtacgg 120tcaagaagac atatttgacc tcttaacagg ttcagacgcg
actgcctcat cagtaagacc 180cgttgaaaag aacttacctg aaaaaaacga
atatatacta gcgttgaatg ttagcgtcaa 240caacaagaag tttaatgacg
cggaggccaa ggcaaaaaga ttccttgatt acgtaaggga 300gttagaatca
ttttgaataa aaaacacgct ttttcagttc gagtttatca ttatcaatac
360tgccatttca aagaatacgt aaataattaa tagtagtgat tttcctaact
ttatttagtc 420aaaaaattag ccttttaatt ctgctgtaac ccgtacatgc
ccaaaatagg gggcgggtta 480cacagaatat ataacatcgt aggtgtctgg
gtgaacagtt tattcctggc atccactaaa 540tataatggag cccgcttttt
aagctggcat ccagaaaaaa aaagaatccc agcaccaaaa 600tattgttttc
ttcaccaacc atcagttcat aggtccattc tcttagcgca actacagaga
660acaggggcac aaacaggcaa aaaacgggca caacctcaat ggagtgatgc
aacctgcctg 720gagtaaatga tgacacaagg caattgaccc acgcatgtat
ctatctcatt ttcttacacc 780ttctattacc ttctgctctc tctgatttgg
aaaaagctga aaaaaaaggt tgaaaccagt 840tccctgaaat tattccccta
cttgactaat aagtatataa agacggtagg tattgattgt 900aattctgtaa
atctatttct taaacttctt aaattctact tttatagtta gtcttttttt
960tagttttaaa acaccaagaa cttagtttcg aataaacaca cataaacaaa
caaaatgggt 1020gctgatggta tttcttctgc ttccattgtt gttactgacc
cagaagctgc tgccaagaag 1080cgtgacagaa tggccagaga attgttgtcc
tccaactctg gtctatgtca agaagatgaa 1140ccaaccatca tcaacttaaa
gggtttggaa cacaccattc catacagatt ggccgttgtt 1200ttgtgtaact
ccagatccac tggtgaattc gaagccaagg ctgctgaaat cttgagaaag
1260gctttccaca tggttgacta ctctttgaat tgtttcaacc cagaatctga
attgtcccgt 1320gtcaactctt taccagtcgg tgaaaagcac caaatgtccg
aagatctaag acatgtcatg 1380gaatgtacca tttctgtcca ccactcctct
ggtatgggtt tcgacccagc tgctggtcca 1440atcatctcca gattgagagg
tgccatgaga gatcacaacg acatgtccga tatctccgtc 1500actgaagctg
aagttgaatt attctctttg gctcaatctt tcgatgtcga cttggaagaa
1560ggtactattg ccagaaagca ctctgaagcc agattggatt tgggtggtgt
caacaagggt 1620tacactgttg actacgttgt tgaccatttg agagctgctg
gtatgccaaa cgtcttgttc 1680gaatggggtg gtgatatcag agcttctggt
agaaacatca agggtaactt gtgggctgtt 1740gccatcaagc gtccaccatc
tgttgaagaa gttatccgtc gtgccaaggg taagatgtta 1800aagatgggtg
aagaagaaca agaagaaaag gacgatgact ctccatcttt gttgcacgtt
1860gttgaattgg atgacgaagc tttgtgtacc tctggtgact acgaaaacgt
cttataccat 1920ccaaagcacg gtgttgctgg ttccattttc gactggcaac
gtcgtggttt attgtctcca 1980gaagaaggtg ctttagctca agtttccgtc
aaatgttact ctgccatgta cgctgatgct 2040ttggccactg tttgtttggt
caagagagat gctgtcagaa tcagatactt gttggaaggt 2100tggagatacg
tcagatctcg tgtcaccaac tacttcgctt acaccagaca aggtgaaaga
2160ttggctcaca tgcacgaaat tgctcaagaa accagagaat taagagaaat
cagaattgct 2220ggttctttgc catccagaat tgttatcgtc ggtggtggtt
tggctggtct atccgctgcc 2280attgaagctg cttcttgtgg tgctcaagtc
attttgatgg aaaaggaagg tagaattggt 2340ggtaactctg ccaaggctac
ctctggtatc aacggttggg gtaccagaac ccaagccaag 2400tctgatatct
tggatggtgg taagtacttt gaaagagaca ctttcttgtc cggtgtcggt
2460ggtaccactg acccagcttt ggtcaaggtc ttgtccgtca aatctggtga
cgctatcggt 2520tggttaactt ctttgggtgt cccattgtcc gttttgtctc
aattgggtgg tcactctttc 2580aagagaactc acagagctcc agacaagact
gatggtactc cattaccaat tggtcacacc 2640atcatgagaa ctttggaaga
tcatatcaga aacaacttgt ctgaaagagt taccatcatg 2700acccacgttt
ctgttactga attgttgcac gaaactgaca ccactccaga tggtgcttct
2760gaagttcgtg tcaccggtgt ccgttacaga gacttgtctg atgtcgatgg
tcaaccttcc 2820aaactattgg ctgacgctgt tgttttggcc actggtggtt
tctccaacga cagagaagaa 2880aactctttgt tgtgtaaata cgctcctcat
ttggcttctt tcccaactac caacggtcca 2940tgggctactg gtgacggtgt
caaattggcc acctccgttg gtgccaagtt ggttgacatg 3000gacaaggttc
aattgcaccc aactggtttg attgacccaa aggacccagc taacaccact
3060aagatcttgg gtccagaagc tttgagaggt tctggtggta ttttgttgaa
caagcaaggt 3120aagagattcg tcaacgaatt ggacttgaga tccgttgttt
ccaaggccat taacactcaa 3180ggtaacgaat acccaggttc tggtggttgt
tactttgctt actgtgtctt aaacgaagat 3240gctaccaact tattctgtgg
tggtgctttg ggtttctacg gtaagaaatt aggtttgttc 3300caaagagctg
aaactgttga agaattggcc aaattgattg gttgtgacga aggtgaattg
3360agagacactt tggaaaaata cgaaacctgt tccaaggcca aggttgcttg
tccagtcact 3420ggtaaggttg ttttcccatg tgttgtcggt accagaggtc
catacaatgt tgctttcgtc 3480actccatcca tccactacac catgggtggt
tgtttgatct ctccagctgc tgaagtcttg 3540caagaataca agggtttgaa
tatcttggaa aaccacagac caatcagatg tttgttcggt 3600gctggtgaag
tcactggtgg tgtccacggt ggtaacagat taggtggtaa ctctctattg
3660gaatgtgttg tctttggtaa gattgctggt gacagagctg ccactatctt
gcaaaagaga 3720gaaattgctt tgtccaagac ctcctggacc tctgttgttg
tcagagaatc cagatctggt 3780gaacaattcg gtaccggttc cagagttttg
agattcaact tgccaggtgc tttacaaaga 3840accggtttga acttgggtga
attcgttgcc atcagaggtg aatgggatgg tcaacaatta 3900gtcggttact
tctctccaat cactttgcca gaagatttgg gtaccatctc tttgttggtc
3960agagctgaca agggtacttt gaaggaatgg atctgtgctt tgcgtccagg
tgactccgtt 4020gaaatcaagg cttgtggtgg tctaagaatt gaccaagatc
cagtcaagaa atgtttgttg 4080ttcagaaaca gaccaattac cagatttgct
ttggttgctg ctggtaccgg tgttgctcca 4140atgttgcaag ttatcagagc
tgctttgaag aagccatacg tcgacacttt ggaatccatc 4200agattgatct
acgctgctga agaatatgac actttaacct acagatctat cttgcaaaga
4260tttgctgaag aattcccaga caaattcgtt tgtaacttcg tcttaaacaa
ccctccagaa 4320ggttggaccg gtggtgttgg tttcgtcaac aagaaatctt
tgcaaaaggt tttgcaacca 4380ccttcttctg aaccattgat tgttgtttgt
ggtccacctg ttatgcaaag agatgtcaaa 4440aatgaattgt tgtccatggg
ttacgacaag gaattggttc acactgtcga tggtgaatct 4500ggtaccttgt
aaggcccggg cgtgaattta ctttaaatct tgcatttaaa taaattttct
4560ttttatagct ttatgactta gtttcaattt atatactatt ttaatgacat
tttcgattca 4620ttgattgaaa gctttgtgtt ttttcttgat gcgctattgc
attgttcttg tctttttcgc 4680cacatgtaat atctgtagta gatacctgat
acattgtgga tgctgagtga aattttagtt 4740aataatggag gcgctcttaa
taattttggg gatattggct ttttttttta aagtttacaa 4800atgaattttt
tccgccagga taacgattct gaagttactc ttagcgttcc tatcggtaca
4860gccatcaaat catgcctata aatcatgcct atatttgcgt gcagtcagta
tcatctacat 4920gaaaaaaact cccgcaattt cttatagaat acgttgaaaa
ttaaatgtac gcgccaagat 4980aagataacat atatctagat gcagtaatat
acacagattc cggccggccg cggccgc 5037354959DNAArtificial
SequenceTDH3p-FRDg-TDH3t artificial sequence 35ggatccggcg
cgccctattt tcgaggacct tgtcaccttg agcccaagag agccaagatt 60taaattttcc
tatgacttga tgcaaattcc caaagctaat aacatgcaag acacgtacgg
120tcaagaagac atatttgacc tcttaacagg ttcagacgcg actgcctcat
cagtaagacc 180cgttgaaaag aacttacctg aaaaaaacga atatatacta
gcgttgaatg ttagcgtcaa 240caacaagaag tttaatgacg cggaggccaa
ggcaaaaaga ttccttgatt acgtaaggga 300gttagaatca ttttgaataa
aaaacacgct ttttcagttc gagtttatca ttatcaatac 360tgccatttca
aagaatacgt aaataattaa tagtagtgat tttcctaact ttatttagtc
420aaaaaattag ccttttaatt ctgctgtaac ccgtacatgc ccaaaatagg
gggcgggtta 480cacagaatat ataacatcgt aggtgtctgg gtgaacagtt
tattcctggc atccactaaa 540tataatggag cccgcttttt aagctggcat
ccagaaaaaa aaagaatccc agcaccaaaa 600tattgttttc ttcaccaacc
atcagttcat aggtccattc tcttagcgca actacagaga 660acaggggcac
aaacaggcaa aaaacgggca caacctcaat ggagtgatgc aacctgcctg
720gagtaaatga tgacacaagg caattgaccc acgcatgtat ctatctcatt
ttcttacacc 780ttctattacc ttctgctctc tctgatttgg aaaaagctga
aaaaaaaggt tgaaaccagt 840tccctgaaat tattccccta cttgactaat
aagtatataa agacggtagg tattgattgt 900aattctgtaa atctatttct
taaacttctt aaattctact tttatagtta gtcttttttt 960tagttttaaa
acaccaagaa cttagtttcg aataaacaca cataaacaaa caaaatggtt
1020gatggtagat cttctgcttc cattgttgcc gttgacccag aaagagctgc
cagagaaaga 1080gatgctgctg ccagagcttt gttgcaagac tctccattgc
acaccaccat gcaatacgct 1140acctctggtt tggaattgac tgttccatac
gctttgaagg ttgttgcttc tgctgacact 1200ttcgacagag ccaaggaagt
tgctgatgaa gtcttgagat gtgcctggca attggctgac 1260accgttttga
actctttcaa cccaaactct gaagtctctt tagtcggtag attaccagtc
1320ggtcaaaagc atcaaatgtc tgctccattg aaacgtgtca tggcttgttg
tcaaagagtc 1380tacaactcct ctgctggttg tttcgaccca tccactgctc
cagttgccaa ggctttgaga 1440gaaattgctt tgggtaagga aagaaacaat
gcttgtttgg aagctttgac tcaagcttgt 1500accttgccaa actctttcgt
cattgatttc gaagctggta ctatctccag aaagcacgaa 1560cacgcttctt
tggatttggg tggtgtttcc aagggttaca tcgtcgatta cgtcattgac
1620aacatcaatg ctgctggttt ccaaaacgtt ttctttgact ggggtggtga
ctgtcgtgcc 1680tccggtatga acgccagaaa cactccatgg gttgtcggta
tcactagacc tccttccttg 1740gacatgttgc caaaccctcc aaaggaagct
tcttacatct ccgtcatctc tttggacaat 1800gaagctttgg ctacctctgg
tgattacgaa aacttgatct acactgctga cgataaacca 1860ttgacctgta
cctacgattg gaaaggtaag gaattgatga agccatctca atccaatatc
1920gctcaagttt ccgtcaagtg ttactctgcc atgtacgctg acgctttggc
taccgcttgt 1980ttcatcaagc gtgacccagc caaggtcaga caattgttgg
atggttggag atacgttaga 2040gacaccgtca gagattaccg tgtctacgtc
agagaaaacg aaagagttgc caagatgttc 2100gaaattgcca ctgaagatgc
tgaaatgaga aagagaagaa tttccaacac tttaccagct 2160cgtgtcattg
ttgttggtgg tggtttggct ggtttgtccg ctgccattga agctgctggt
2220tgtggtgctc aagttgtttt gatggaaaag gaagccaagt tgggtggtaa
ctctgccaag 2280gctacctctg gtatcaacgg ttggggtact agagctcaag
ctaaggcttc cattgtcgat 2340ggtggtaagt acttcgaaag agatacctac
aagtctggta tcggtggtaa caccgatcca 2400gctttggtta agactttgtc
catgaaatct gctgacgcta tcggttggtt gacttctcta 2460ggtgttccat
tgactgtttt gtcccaatta ggtggtcact ccagaaagag aactcacaga
2520gctccagaca agaaggatgg tactccattg ccaattggtt tcaccatcat
gaaaacttta 2580gaagatcatg ttagaggtaa cttgtccggt agaatcacca
tcatggaaaa ctgttccgtt 2640acctctttgt tgtctgaaac caaggaaaga
ccagacggta ccaagcaaat cagagttacc 2700ggtgtcgaat tcactcaagc
tggttctggt aagaccacca ttttggctga tgctgttatc 2760ttggccaccg
gtggtttctc caacgacaag actgctgatt ctttgttgag agaacatgcc
2820ccacacttgg ttaacttccc aaccaccaac ggtccatggg ctactggtga
tggtgtcaag 2880ttggctcaaa gattaggtgc tcaattggtc gatatggaca
aggttcaatt gcacccaact 2940ggtttgatca acccaaagga cccagccaac
ccaaccaaat tcttgggtcc agaagctcta 3000agaggttctg gtggtgtttt
gttgaacaaa caaggtaaga gatttgtcaa cgaattggat 3060ttgagatctg
ttgtttccaa ggccatcatg gaacaaggtg ctgaataccc aggttctggt
3120ggttccatgt ttgcttactg tgtcttgaac gctgctgctc aaaaattgtt
tggtgtttcc 3180tctcacgaat tctactggaa gaagatgggt ttgttcgtca
aggctgacac catgagagac 3240ttggctgctt tgattggttg tccagttgaa
tccgttcaac aaactttaga agaatacgaa 3300agattatcca tctctcaaag
atcttgtcca attaccagaa aatctgttta cccatgtgtt 3360ttgggtacca
aaggtccata ctatgtcgcc tttgtcactc catctatcca ctacaccatg
3420ggtggttgtt tgatttctcc atctgctgaa atccaaatga agaacacttc
ttccagagct 3480ccattgtccc actccaaccc aatcttgggt ttattcggtg
ctggtgaagt caccggtggt 3540gtccacggtg gtaacagatt aggtggtaac
tctttgttgg aatgtgttgt tttcggtaga 3600attgccggtg acagagcttc
taccattttg caaagaaagt cctctgcttt gtctttcaag 3660gtctggacca
ctgttgtttt gagagaagtc agagaaggtg gtgtctacgg tgctggttcc
3720cgtgtcttga gattcaactt accaggtgct ctacaaagat ctggtctatc
cttgggtcaa 3780ttcattgcca tcagaggtga ctgggacggt caacaattga
ttggttacta ctctccaatc 3840actttgccag acgatttggg tatgattgac
attttggcca gatctgacaa gggtacttta 3900cgtgaatgga tctctgcttt
ggaaccaggt gacgctgtcg aaatgaaggc ttgtggtggt 3960ttggtcatcg
aaagaagatt atctgacaag cacttcgttt tcatgggtca cattatcaac
4020aagctatgtt tgattgctgg tggtaccggt gttgctccaa tgttgcaaat
catcaaggcc 4080gctttcatga agccattcat cgacactttg gaatccgtcc
acttgatcta cgctgctgaa 4140gatgtcactg aattgactta cagagaagtt
ttggaagaac gtcgtcgtga atccagaggt 4200aaattcaaga aaactttcgt
tttgaacaga cctcctccat tatggactga cggtgtcggt 4260ttcatcgacc
gtggtatctt gaccaaccac gttcaaccac catctgacaa cttattggtt
4320gccatctgtg gtccaccagt tatgcaaaga attgtcaagg ccactttaaa
gactttaggt 4380tacaacatga acttggtcag aaccgttgac gaaactgaac
catctggaag ttaaggcccg 4440ggcgtgaatt tactttaaat cttgcattta
aataaatttt ctttttatag ctttatgact 4500tagtttcaat ttatatacta
ttttaatgac attttcgatt cattgattga aagctttgtg 4560ttttttcttg
atgcgctatt gcattgttct tgtctttttc gccacatgta atatctgtag
4620tagatacctg atacattgtg gatgctgagt gaaattttag ttaataatgg
aggcgctctt 4680aataattttg gggatattgg cttttttttt taaagtttac
aaatgaattt tttccgccag 4740gataacgatt ctgaagttac tcttagcgtt
cctatcggta cagccatcaa atcatgccta 4800taaatcatgc ctatatttgc
gtgcagtcag tatcatctac atgaaaaaaa ctcccgcaat 4860ttcttataga
atacgttgaa aattaaatgt acgcgccaag ataagataac atatatctag
4920atgcagtaat atacacagat tccggccggc cgcggccgc
495936438PRTSchizosaccharomyces pombe 36Met Gly Glu Leu Lys Glu Ile
Leu Lys Gln Arg Tyr His Glu Leu Leu1 5 10 15Asp Trp Asn Val Lys Ala
Pro His Val Pro Leu Ser Gln Arg Leu Lys 20 25 30His Phe Thr Trp Ser
Trp Phe Ala Cys Thr Met Ala Thr Gly Gly Val 35 40 45Gly Leu Ile Ile
Gly Ser Phe Pro Phe Arg Phe Tyr Gly Leu Asn Thr 50 55 60Ile Gly Lys
Ile Val Tyr Ile Leu Gln Ile Phe Leu Phe Ser Leu Phe65 70 75 80Gly
Ser Cys Met Leu Phe Arg Phe Ile Lys Tyr Pro Ser Thr Ile Lys 85 90
95Asp Ser Trp Asn His His Leu Glu Lys Leu Phe Ile Ala Thr Cys Leu
100 105 110Leu Ser Ile Ser Thr Phe Ile Asp Met Leu Ala Ile Tyr Ala
Tyr Pro 115 120 125Asp Thr Gly Glu Trp Met Val Trp Val Ile Arg Ile
Leu Tyr Tyr Ile 130 135 140Tyr Val Ala Val Ser Phe Ile Tyr Cys Val
Met Ala Phe Phe Thr Ile145 150 155 160Phe Asn Asn His Val Tyr Thr
Ile Glu Thr Ala Ser Pro Ala Trp Ile 165 170 175Leu Pro Ile Phe Pro
Pro Met Ile Cys Gly Val Ile Ala Gly Ala Val 180 185 190Asn Ser Thr
Gln Pro Ala His Gln Leu Lys Asn Met Val Ile Phe Gly 195 200 205Ile
Leu Phe Gln Gly Leu Gly Phe Trp Val Tyr Leu Leu Leu Phe Ala 210 215
220Val Asn Val Leu Arg Phe Phe Thr Val Gly Leu Ala Lys Pro Gln
Asp225 230 235 240Arg Pro Gly Met Phe Met Phe Val Gly Pro Pro Ala
Phe Ser Gly Leu 245 250 255Ala Leu Ile Asn Ile Ala Arg Gly Ala Met
Gly Ser Arg Pro Tyr Ile 260 265 270Phe Val Gly Ala Asn Ser Ser Glu
Tyr Leu Gly Phe Val Ser Thr Phe 275 280 285Met Ala Ile Phe Ile Trp
Gly Leu Ala Ala Trp Cys Tyr Cys Leu Ala 290 295 300Met Val Ser Phe
Leu Ala Gly Phe Phe Thr Arg Ala Pro Leu Lys Phe305 310 315 320Ala
Cys Gly Trp Phe Ala Phe Ile Phe Pro Asn Val Gly Phe Val Asn 325 330
335Cys Thr Ile Glu Ile Gly Lys Met Ile Asp Ser Lys Ala Phe Gln Met
340 345 350Phe Gly His Ile Ile Gly Val Ile Leu Cys Ile Gln Trp Ile
Leu Leu 355 360 365Met Tyr Leu Met Val Arg Ala Phe Leu Val Asn Asp
Leu Cys Tyr Pro 370 375 380Gly Lys Asp Glu Asp Ala His Pro Pro Pro
Lys Pro Asn Thr Gly Val385 390 395 400Leu Asn Pro Thr Phe Pro Pro
Glu Lys Ala Pro Ala Ser Leu Glu Lys 405 410 415Val Asp Thr His Val
Thr Ser Thr Gly Gly Glu Ser Asp Pro Pro Ser 420 425 430Ser Glu His
Glu Ser Val
435371317DNAArtificial sequenceS. pombe malae permease cpo for S.
cerevisiae 37atgggtgaat tgaaggaaat cttgaagcaa cgttaccatg aattgttgga
ctggaacgtc 60aaggctccac acgttccatt gtctcaaaga ttgaagcatt tcacctggtc
ctggtttgct 120tgtaccatgg ccactggtgg tgtcggtttg atcattggtt
ctttcccatt cagattctac 180ggtttgaaca ccattggtaa gattgtctac
atcttacaaa tcttcttatt ctctttgttt 240ggttcttgta tgttgttcag
attcatcaaa tacccatcta ccatcaagga ctcctggaac 300caccacttgg
aaaaattatt cattgctacc tgtttgctat ccatctccac tttcattgac
360atgttggcca tctacgctta cccagacact ggtgaatgga tggtctgggt
tatcagaatc 420ttatactaca tctacgttgc tgtctctttc atctactgtg
tcatggcttt cttcaccatt 480ttcaacaacc acgtttacac cattgaaact
gcttctccag cttggatctt accaattttc 540ccaccaatga tctgtggtgt
cattgctggt gctgtcaact ccactcaacc agctcaccaa 600ttgaagaaca
tggttatctt cggtatctta ttccaaggtt tgggtttctg ggtttacttg
660ttgttgtttg ctgtcaacgt tttgagattc ttcaccgttg gtttggccaa
gcctcaagac 720agaccaggta tgttcatgtt tgttggtcca ccagctttct
ccggtttggc tttgatcaac 780attgcccgtg gtgctatggg ttccagacca
tacattttcg tcggtgccaa ttcttctgaa 840tacttgggtt tcgtttccac
tttcatggcc attttcatct ggggtttggc tgcttggtgt 900tactgtttgg
ccatggtttc tttcttggct ggtttcttca ccagagctcc attgaaattt
960gcttgtggtt ggtttgcttt catcttccca aacgtcggtt tcgttaactg
taccattgaa 1020attggtaaga tgattgactc caaggccttc caaatgttcg
gtcacatcat cggtgtcatc 1080ctatgtatcc aatggatctt gttgatgtac
ttgatggtca gagctttctt ggtcaacgat 1140ttgtgttacc caggtaagga
tgaagatgct cacccacctc caaagccaaa cactggtgtt 1200ttgaacccaa
ctttcccacc agaaaaggct ccagcttctt tggaaaaggt tgacacccac
1260gttacttcca ctggtggtga atctgatcct ccatcttctg aacacgaaag cgtttaa
131738600DNAArtificial sequenceENO1 promotor T at position -5 was
changed to A in order to obtain a better Kozak sequence
38ccgcggaacc gccagatatt cattacttga cgcaaaagcg tttgaaataa tgacgaaaaa
60gaaggaagaa aaaaaaagaa aaataccgct tctaggcggg ttatctactg atccgagctt
120ccactaggat agcacccaaa cacctgcata tttggacgac ctttacttac
accaccaaaa 180accactttcg cctctcccgc ccctgataac gtccactaat
tgagcgatta cctgagcggt 240cctcttttgt ttgcagcatg agacttgcat
actgcaaatc gtaagtagca acgtctcaag 300gtcaaaactg tatggaaacc
ttgtcacctc acttaattct agctagccta ccctgcaagt 360caagaggtct
ccgtgattcc tagccacctc aaggtatgcc tctccccgga aactgtggcc
420ttttctggca cacatgatct ccacgatttc aacatataaa tagcttttga
taatggcaat 480attaatcaaa tttattttac ttctttcttg taacatctct
cttgtaatcc cttattcctt 540ctagctattt ttcataaaaa accaagcaac
tgcttatcaa cacacaaaca ctaaaacaaa 60039300DNAArtificial sequenceENO1
terminator 39agcttttgat taagccttct agtccaaaaa acacgttttt ttgtcattta
tttcattttc 60ttagaatagt ttagtttatt cattttatag tcacgaatgt tttatgattc
tatatagggt 120tgcaaacaag catttttcat tttatgttaa aacaatttca
ggtttacctt ttattctgct 180tgtggtgacg cgggtatccg cccgctcttt
tggtcaccca tgtatttaat tgcataaata 240attcttaaaa gtggagctag
tctatttcta tttacatacc tctcatttct catttcctcc 300402240DNAArtificial
sequenceENO1p-SpMAE-ENO1t synthetic construct 40ggatccggcg
cgccccgcgg aaccgccaga tattcattac ttgacgcaaa agcgtttgaa 60ataatgacga
aaaagaagga agaaaaaaaa agaaaaatac cgcttctagg cgggttatct
120actgatccga gcttccacta ggatagcacc caaacacctg catatttgga
cgacctttac 180ttacaccacc aaaaaccact ttcgcctctc ccgcccctga
taacgtccac taattgagcg 240attacctgag cggtcctctt ttgtttgcag
catgagactt gcatactgca aatcgtaagt 300agcaacgtct caaggtcaaa
actgtatgga aaccttgtca cctcacttaa ttctagctag 360cctaccctgc
aagtcaagag gtctccgtga ttcctagcca cctcaaggta tgcctctccc
420cggaaactgt ggccttttct ggcacacatg atctccacga tttcaacata
taaatagctt 480ttgataatgg caatattaat caaatttatt ttacttcttt
cttgtaacat ctctcttgta 540atcccttatt ccttctagct atttttcata
aaaaaccaag caactgctta tcaacacaca 600aacactaaaa caaaatgggt
gaattgaagg aaatcttgaa gcaacgttac catgaattgt 660tggactggaa
cgtcaaggct ccacacgttc cattgtctca aagattgaag catttcacct
720ggtcctggtt tgcttgtacc atggccactg gtggtgtcgg tttgatcatt
ggttctttcc 780cattcagatt ctacggtttg aacaccattg gtaagattgt
ctacatctta caaatcttct 840tattctcttt gtttggttct tgtatgttgt
tcagattcat caaataccca tctaccatca 900aggactcctg gaaccaccac
ttggaaaaat tattcattgc tacctgtttg ctatccatct 960ccactttcat
tgacatgttg gccatctacg cttacccaga cactggtgaa tggatggtct
1020gggttatcag aatcttatac tacatctacg ttgctgtctc tttcatctac
tgtgtcatgg 1080ctttcttcac cattttcaac aaccacgttt acaccattga
aactgcttct ccagcttgga 1140tcttaccaat tttcccacca atgatctgtg
gtgtcattgc tggtgctgtc aactccactc 1200aaccagctca ccaattgaag
aacatggtta tcttcggtat cttattccaa ggtttgggtt 1260tctgggttta
cttgttgttg tttgctgtca acgttttgag attcttcacc gttggtttgg
1320ccaagcctca agacagacca ggtatgttca tgtttgttgg tccaccagct
ttctccggtt 1380tggctttgat caacattgcc cgtggtgcta tgggttccag
accatacatt ttcgtcggtg 1440ccaattcttc tgaatacttg ggtttcgttt
ccactttcat ggccattttc atctggggtt 1500tggctgcttg gtgttactgt
ttggccatgg tttctttctt ggctggtttc ttcaccagag 1560ctccattgaa
atttgcttgt ggttggtttg ctttcatctt cccaaacgtc ggtttcgtta
1620actgtaccat tgaaattggt aagatgattg actccaaggc cttccaaatg
ttcggtcaca 1680tcatcggtgt catcctatgt atccaatgga tcttgttgat
gtacttgatg gtcagagctt 1740tcttggtcaa cgatttgtgt tacccaggta
aggatgaaga tgctcaccca cctccaaagc 1800caaacactgg tgttttgaac
ccaactttcc caccagaaaa ggctccagct tctttggaaa 1860aggttgacac
ccacgttact tccactggtg gtgaatctga tcctccatct tctgaacacg
1920aaagcgttta agagcttttg attaagcctt ctagtccaaa aaacacgttt
ttttgtcatt 1980tatttcattt tcttagaata gtttagttta ttcattttat
agtcacgaat gttttatgat 2040tctatatagg gttgcaaaca agcatttttc
attttatgtt aaaacaattt caggtttacc 2100ttttattctg cttgtggtga
cgcgggtatc cgcccgctct tttggtcacc catgtattta 2160attgcataaa
taattcttaa aagtggagct agtctatttc tatttacata cctctcattt
2220ctcatttcct ccgcggccgc 2240411180PRTSaccharomyces cerevisiae
41Met Ser Ser Ser Lys Lys Leu Ala Gly Leu Arg Asp Asn Phe Ser Leu1
5 10 15Leu Gly Glu Lys Asn Lys Ile Leu Val Ala Asn Arg Gly Glu Ile
Pro 20 25 30Ile Arg Ile Phe Arg Ser Ala His Glu Leu Ser Met Arg Thr
Ile Ala 35 40 45Ile Tyr Ser His Glu Asp Arg Leu Ser Met His Arg Leu
Lys Ala Asp 50 55 60Glu Ala Tyr Val Ile Gly Glu Glu Gly Gln Tyr Thr
Pro Val Gly Ala65 70 75 80Tyr Leu Ala Met Asp Glu Ile Ile Glu Ile
Ala Lys Lys His Lys Val 85 90 95Asp Phe Ile His Pro Gly Tyr Gly Phe
Leu Ser Glu Asn Ser Glu Phe 100 105 110Ala Asp Lys Val Val Lys Ala
Gly Ile Thr Trp Ile Gly Pro Pro Ala 115 120 125Glu Val Ile Asp Ser
Val Gly Asp Lys Val Ser Ala Arg His Leu Ala 130 135 140Ala Arg Ala
Asn Val Pro Thr Val Pro Gly Thr Pro Gly Pro Ile Glu145 150 155
160Thr Val Gln Glu Ala Leu Asp Phe Val Asn Glu Tyr Gly Tyr Pro Val
165 170 175Ile Ile Lys Ala Ala Phe Gly Gly Gly Gly Arg Gly Met Arg
Val Val 180 185 190Arg Glu Gly Asp Asp Val Ala Asp Ala Phe Gln Arg
Ala Thr Ser Glu 195 200 205Ala Arg Thr Ala Phe Gly Asn Gly Thr Cys
Phe Val Glu Arg Phe Leu 210 215 220Asp Lys Pro Lys His Ile Glu Val
Gln Leu Leu Ala Asp Asn His Gly225 230 235 240Asn Val Val His Leu
Phe Glu Arg Asp Cys Ser Val Gln Arg Arg His 245 250 255Gln Lys Val
Val Glu Val Ala Pro Ala Lys Thr Leu Pro Arg Glu Val 260 265 270Arg
Asp Ala Ile Leu Thr Asp Ala Val Lys Leu Ala Lys Val Cys Gly 275 280
285Tyr Arg Asn Ala Gly Thr Ala Glu Phe Leu Val Asp Asn Gln Asn Arg
290 295 300His Tyr Phe Ile Glu Ile Asn Pro Arg Ile Gln Val Glu His
Thr Ile305 310 315 320Thr Glu Glu Ile Thr Gly Ile Asp Ile Val Ser
Ala Gln Ile Gln Ile 325 330 335Ala Ala Gly Ala Thr Leu Thr Gln Leu
Gly Leu Leu Gln Asp Lys Ile 340 345 350Thr Thr Arg Gly Phe Ser Ile
Gln Cys Arg Ile Thr Thr Glu Asp Pro 355 360 365Ser Lys Asn Phe Gln
Pro Asp Thr Gly Arg Leu Glu Val Tyr Arg Ser 370 375 380Ala Gly Gly
Asn Gly Val Arg Leu Asp Gly Gly Asn Ala Tyr Ala Gly385 390 395
400Ala Thr Ile Ser Pro His Tyr Asp Ser Met Leu Val Lys Cys Ser Cys
405 410 415Ser Gly Ser Thr Tyr Glu Ile Val Arg Arg Lys Met Ile Arg
Ala Leu 420 425 430Ile Glu Phe Arg Ile Arg Gly Val Lys Thr Asn Ile
Pro Phe Leu Leu 435 440 445Thr Leu Leu Thr Asn Pro Val Phe Ile Glu
Gly Thr Tyr Trp Thr Thr 450 455 460Phe Ile Asp Asp Thr Pro Gln Leu
Phe Gln Met Val Ser Ser Gln Asn465 470 475 480Arg Ala Gln Lys Leu
Leu His Tyr Leu Ala Asp Leu Ala Val Asn Gly 485 490 495Ser Ser Ile
Lys Gly Gln Ile Gly Leu Pro Lys Leu Lys Ser Asn Pro 500 505 510Ser
Val Pro His Leu His Asp Ala Gln Gly Asn Val Ile Asn Val Thr 515 520
525Lys Ser Ala Pro Pro Ser Gly Trp Arg Gln Val Leu Leu Glu Lys Gly
530 535 540Pro Ser Glu Phe Ala Lys Gln Val Arg Gln Phe Asn Gly Thr
Leu Leu545 550 555 560Met Asp Thr Thr Trp Arg Asp Ala His Gln Ser
Leu Leu Ala Thr Arg 565 570 575Val Arg Thr His Asp Leu Ala Thr Ile
Ala Pro Thr Thr Ala His Ala 580 585 590Leu Ala Gly Ala Phe Ala Leu
Glu Cys Trp Gly Gly Ala Thr Phe Asp 595 600 605Val Ala Met Arg Phe
Leu His Glu Asp Pro Trp Glu Arg Leu Arg Lys 610 615 620Leu Arg Ser
Leu Val Pro Asn Ile Pro Phe Gln Met Leu Leu Arg Gly625 630 635
640Ala Asn Gly Val Ala Tyr Ser Ser Leu Pro Asp Asn Ala Ile Asp His
645 650 655Phe Val Lys Gln Ala Lys Asp Asn Gly Val Asp Ile Phe Arg
Val Phe 660 665 670Asp Ala Leu Asn Asp Leu Glu Gln Leu Lys Val Gly
Val Asn Ala Val 675 680 685Lys Lys Ala Gly Gly Val Val Glu Ala Thr
Val Cys Tyr Ser Gly Asp 690 695 700Met Leu Gln Pro Gly Lys Lys Tyr
Asn Leu Asp Tyr Tyr Leu Glu Val705 710 715 720Val Glu Lys Ile Val
Gln Met Gly Thr His Ile Leu Gly Ile Lys Asp 725 730 735Met Ala Gly
Thr Met Lys Pro Ala Ala Ala Lys Leu Leu Ile Gly Ser 740 745 750Leu
Arg Thr Arg Tyr Pro Asp Leu Pro Ile His Val His Ser His Asp 755 760
765Ser Ala Gly Thr Ala Val Ala Ser Met Thr Ala Cys Ala Leu Ala Gly
770 775 780Ala Asp Val Val Asp Val Ala Ile Asn Ser Met Ser Gly Leu
Thr Ser785 790 795 800Gln Pro Ser Ile Asn Ala Leu Leu Ala Ser Leu
Glu Gly Asn Ile Asp 805 810 815Thr Gly Ile Asn Val Glu His Val Arg
Glu Leu Asp Ala Tyr Trp Ala 820 825 830Glu Met Arg Leu Leu Tyr Ser
Cys Phe Glu Ala Asp Leu Lys Gly Pro 835 840 845Asp Pro Glu Val Tyr
Gln His Glu Ile Pro Gly Gly Gln Leu Thr Asn 850 855 860Leu Leu Phe
Gln Ala Gln Gln Leu Gly Leu Gly Glu Gln Trp Ala Glu865 870 875
880Thr Lys Arg Ala Tyr Arg Glu Ala Asn Tyr Leu Leu Gly Asp Ile Val
885 890 895Lys Val Thr Pro Thr Ser Lys Val Val Gly Asp Leu Ala Gln
Phe Met 900 905 910Val Ser Asn Lys Leu Thr Ser Asp Asp Ile Arg Arg
Leu Ala Asn Ser 915 920 925Leu Asp Phe Pro Asp Ser Val Met Asp Phe
Phe Glu Gly Leu Ile Gly 930 935 940Gln Pro Tyr Gly Gly Phe Pro Glu
Pro Leu Arg Ser Asp Val Leu Arg945 950 955 960Asn Lys Arg Arg Lys
Leu Thr Cys Arg Pro Gly Leu Glu Leu Glu Pro 965 970 975Phe Asp Leu
Glu Lys Ile Arg Glu Asp Leu Gln Asn Arg Phe Gly Asp 980 985 990Ile
Asp Glu Cys Asp Val Ala Ser Tyr Asn Met Tyr Pro Arg Val Tyr 995
1000 1005Glu Asp Phe Gln Lys Ile Arg Glu Thr Tyr Gly Asp Leu Ser
Val 1010 1015 1020Leu Pro Thr Lys Asn Phe Leu Ala Pro Ala Glu Pro
Asp Glu Glu 1025 1030 1035Ile Glu Val Thr Ile Glu Gln Gly Lys Thr
Leu Ile Ile Lys Leu 1040 1045 1050Gln Ala Val Gly Asp Leu Asn Lys
Lys Thr Gly Gln Arg Glu Val 1055 1060 1065Tyr Phe Glu Leu Asn Gly
Glu Leu Arg Lys Ile Arg Val Ala Asp 1070 1075 1080Lys Ser Gln Asn
Ile Gln Ser Val Ala Lys Pro Lys Ala Asp Val 1085 1090 1095His Asp
Thr His Gln Ile Gly Ala Pro Met Ala Gly Val Ile Ile 1100 1105
1110Glu Val Lys Val His Lys Gly Ser Leu Val Lys Lys Gly Glu Ser
1115 1120 1125Ile Ala Val Leu Ser Ala Met Lys Met Glu Met Val Val
Ser Ser 1130 1135 1140Pro Ala Asp Gly Gln Val Lys Asp Val Phe Ile
Lys Asp Gly Glu 1145 1150 1155Ser Val Asp Ala Ser Asp Leu Leu Val
Val Leu Glu Glu Glu Thr 1160 1165 1170Leu Pro Pro Ser Gln Lys Lys
1175 1180423543DNASaccharomyces cerevisiae 42atgagcagta gcaagaaatt
ggccggtctt agggacaatt tcagtttgct cggcgaaaag 60aataagatct tggtcgccaa
tagaggtgaa attccgatta gaatttttag atctgctcat 120gagctgtcta
tgagaaccat cgccatatac tcccatgagg accgtctttc aatgcacagg
180ttgaaggcgg acgaagcgta tgttatcggg gaggagggcc agtatacacc
tgtgggtgct 240tacttggcaa tggacgagat catcgaaatt gcaaagaagc
ataaggtgga tttcatccat 300ccaggttatg ggttcttgtc tgaaaattcg
gaatttgccg acaaagtagt gaaggccggt 360atcacttgga tcggccctcc
agctgaagtt attgactctg tgggtgacaa agtctctgcc 420agacacttgg
cagcaagagc taacgttcct accgttcccg gtactccagg acctatcgaa
480actgtgcaag aggcacttga cttcgttaat gaatacggct acccggtgat
cattaaggcc 540gcctttggtg gtggtggtag aggtatgaga gtcgttagag
aaggtgacga cgtggcagat 600gcctttcaac gtgctacctc cgaagcccgt
actgccttcg gtaatggtac ctgctttgtg 660gaaagattct tggacaagcc
aaagcatatt gaagttcaat tgttggctga taaccacgga 720aacgtggttc
atcttttcga aagagactgt tctgtgcaaa gaagacacca aaaagttgtc
780gaagtcgctc cagcaaagac tttgccccgt gaagttcgtg acgctatttt
gacagatgct 840gttaaattag ctaaggtatg tggttacaga aacgcaggta
ccgccgaatt cttggttgac 900aaccaaaaca gacactattt cattgaaatt
aatccaagaa ttcaagtgga gcataccatc 960actgaagaaa tcaccggtat
tgacattgtt tctgcccaaa tccagattgc cgcaggtgcc 1020actttgactc
aactaggtct attacaggat aaaatcacca cccgtgggtt ttccatccaa
1080tgtcgtatta ccactgaaga tccctctaag aatttccaac cggataccgg
tcgcctggag 1140gtctatcgtt ctgccggtgg taatggtgtg agattggacg
gtggtaacgc ttatgcaggt 1200gctactatct cgcctcacta cgactcaatg
ctggtcaaat gttcatgctc tggttctact 1260tatgaaatcg tccgtaggaa
gatgattcgt gccctgatcg aattcagaat cagaggtgtt 1320aagaccaaca
ttcccttcct attgactctt ttgaccaatc cagtttttat tgagggtaca
1380tactggacga cttttattga cgacacccca caactgttcc aaatggtatc
gtcacaaaac 1440agagcgcaaa aactgttaca ctatttggca gacttggcag
ttaacggttc ttctattaag 1500ggtcaaattg gcttgccaaa actaaaatca
aatccaagtg tcccccattt gcacgatgct 1560cagggcaatg tcatcaacgt
tacaaagtct gcaccaccat ccggatggag acaagtgcta 1620ctggaaaagg
gaccatctga atttgccaag caagtcagac agttcaatgg tactctactg
1680atggacacca cctggagaga cgctcatcaa tctctacttg caacaagagt
cagaacccac 1740gatttggcta caatcgctcc aacaaccgca catgcccttg
caggtgcttt cgctttagaa 1800tgttggggtg gtgctacatt cgacgttgca
atgagattct tgcatgagga tccatgggaa 1860cgtctgagaa aattaagatc
tctggtgcct aatattccat tccaaatgtt attacgtggt 1920gccaacggtg
tggcttactc ttcattacct gacaatgcta ttgaccattt tgtcaagcaa
1980gccaaggata atggtgttga tatatttaga gtttttgatg ccttgaatga
tttagaacaa 2040ttaaaagttg gtgtgaatgc tgtcaagaag gccggtggtg
ttgtcgaagc tactgtttgt 2100tactctggtg acatgcttca gccaggtaag
aaatacaact tagactacta cctagaagtt 2160gttgaaaaaa tagttcaaat
gggtacacat atcttgggta ttaaggatat ggcaggtact 2220atgaaaccgg
ccgctgccaa attattaatt ggctccctaa gaaccagata tccggattta
2280ccaattcatg ttcacagtca tgactccgca ggtactgctg ttgcgtctat
gactgcatgt 2340gccctagcag gtgctgatgt tgtcgatgta gctatcaatt
caatgtcggg cttaacttcc 2400caaccatcaa ttaatgcact gttggcttca
ttagaaggta acattgatac tgggattaac 2460gttgagcatg ttcgtgaatt
agatgcatac tgggccgaaa tgagactgtt gtattcttgt 2520ttcgaggccg
acttgaaggg accagatcca gaagtttacc aacatgaaat cccaggtggt
2580caattgacta acttgttatt ccaagctcaa caactgggtc ttggtgaaca
atgggctgaa 2640actaaaagag cttacagaga agccaattac ctactgggag
atattgttaa agttacccca 2700acttctaagg ttgtcggtga tttagctcaa
ttcatggttt ctaacaaact gacttccgac 2760gatattagac gtttagctaa
ttctttggac tttcctgact ctgttatgga cttttttgaa 2820ggtttaattg
gtcaaccata cggtgggttc ccagaaccat taagatctga tgtattgaga
2880aacaagagaa gaaagttgac gtgccgtcca ggtttagaat tagaaccatt
tgatctcgaa 2940aaaattagag aagacttgca gaacagattc ggtgatattg
atgaatgcga tgttgcttct 3000tacaatatgt
atccaagggt ctatgaagat ttccaaaaga tcagagaaac atacggtgat
3060ttatcagttc taccaaccaa aaatttccta gcaccagcag aacctgatga
agaaatcgaa 3120gtcaccatcg aacaaggtaa gactttgatt atcaaattgc
aagctgttgg tgacttaaat 3180aagaaaactg ggcaaagaga agtgtatttt
gaattgaacg gtgaattaag aaagatcaga 3240gttgcagaca agtcacaaaa
catacaatct gttgctaaac caaaggctga tgtccacgat 3300actcaccaaa
tcggtgcacc aatggctggt gttatcatag aagttaaagt acataaaggg
3360tctttggtga aaaagggcga atcgattgct gttttgagtg ccatgaaaat
ggaaatggtt 3420gtctcttcac cagcagatgg tcaagttaaa gacgttttca
ttaaggatgg tgaaagtgtt 3480gacgcatcag atttgttggt tgtcctagaa
gaagaaaccc tacccccatc ccaaaaaaag 3540taa 35434330DNAArtificial
sequenceP1 primer 43ggactagtat gagcagtagc aagaaattgg
304431DNAArtificial sequenceP2 primer 44ccgctcgagt tacttttttt
gggatggggg t 31
uspto.report is an independent third-party trademark research tool that is not affiliated, endorsed, or sponsored by the United States Patent and Trademark Office (USPTO) or any other governmental organization. The information provided by uspto.report is based on publicly available data at the time of writing and is intended for informational purposes only.
While we strive to provide accurate and up-to-date information, we do not guarantee the accuracy, completeness, reliability, or suitability of the information displayed on this site. The use of this site is at your own risk. Any reliance you place on such information is therefore strictly at your own risk.
All official trademark data, including owner information, should be verified by visiting the official USPTO website at www.uspto.gov. This site is not intended to replace professional legal advice and should not be used as a substitute for consulting with a legal professional who is knowledgeable about trademark law.