U.S. patent application number 12/904738 was filed with the patent office on 2011-07-07 for pufa polyketide synthase systems and uses thereof. This patent application is currently assigned to MARTEK BIOSCIENCES CORPORATION. Invention is credited to WILLIAM R. BARCLAY, JAMES H. FLATT, JAMES G. METZ, CRAIG A. WEAVER.
Application Number | 20110167508 12/904738 |
Document ID | / |
Family ID | 34557910 |
Filed Date | 2011-07-07 |
United States Patent Application | 20110167508 |
Kind Code | A1 |
METZ; JAMES G. ; et al. | July 7, 2011 |
The invention generally relates to polyunsaturated fatty acid (PUFA) polyketide synthase (PKS) systems, to homologues thereof, to isolated nucleic acid molecules and recombinant nucleic acid molecules encoding biologically active domains of such a PUFA PKS system, to genetically modified organisms comprising PUFA PKS systems, to methods of making and using such systems for the production of bioactive molecules of interest, and to novel methods for identifying new bacterial and non-bacterial microorganisms having such a PUFA PKS system.
Inventors: | METZ; JAMES G.; (LONGMONT, CO) ; WEAVER; CRAIG A.; (BOULDER, CO) ; BARCLAY; WILLIAM R.; (BOULDER, CO) ; FLATT; JAMES H.; (COLORADO SPRINGS, CO) |
Assignee: | MARTEK BIOSCIENCES
CORPORATION COLUMBIA MD |
Family ID: | 34557910 |
Appl. No.: | 12/904738 |
Filed: | October 14, 2010 |
Application Number | Filing Date | Patent Number | ||
---|---|---|---|---|
11778620 | Jul 16, 2007 | 7847077 | ||
12904738 | ||||
11676971 | Feb 20, 2007 | 7642074 | ||
11778620 | ||||
10810352 | Mar 26, 2004 | 7211418 | ||
11676971 | ||||
10124800 | Apr 16, 2002 | 7247461 | ||
10810352 | ||||
09231899 | Jan 14, 1999 | 6566583 | ||
10124800 | ||||
60457979 | Mar 26, 2003 | |||
60284066 | Apr 16, 2001 | |||
60298796 | Jun 15, 2001 | |||
60323269 | Sep 18, 2001 | |||
Current U.S. Class: | 800/21 ; 435/134; 435/188; 435/243; 435/320.1; 435/41; 536/23.2; 800/281; 800/295 |
Current CPC Class: | C12P 7/6427 20130101; A23L 33/115 20160801; C07K 14/28 20130101; A23D 7/001 20130101; A23L 33/135 20160801; C12P 7/6472 20130101; C12N 15/52 20130101 |
Class at Publication: | 800/21 ; 536/23.2; 435/320.1; 435/243; 800/295; 435/41; 800/281; 435/134; 435/188 |
International Class: | A01K 67/027 20060101 A01K067/027; C07H 21/00 20060101 C07H021/00; C12N 15/63 20060101 C12N015/63; C12N 1/00 20060101 C12N001/00; A01H 5/00 20060101 A01H005/00; C12P 1/00 20060101 C12P001/00; A01H 1/06 20060101 A01H001/06; C12P 7/64 20060101 C12P007/64; C12N 9/96 20060101 C12N009/96 |
Sequence CWU 1
1
8218730DNASchizochytrium sp.CDS(1)..(8730) 1atg gcg gcc cgt ctg cag
gag caa aag gga ggc gag atg gat acc cgc 48Met Ala Ala Arg Leu Gln
Glu Gln Lys Gly Gly Glu Met Asp Thr Arg1 5 10 15att gcc atc atc ggc
atg tcg gcc atc ctc ccc tgc ggc acg acc gtg 96Ile Ala Ile Ile Gly
Met Ser Ala Ile Leu Pro Cys Gly Thr Thr Val 20 25 30cgc gag tcg tgg
gag acc atc cgc gcc ggc atc gac tgc ctg tcg gat 144Arg Glu Ser Trp
Glu Thr Ile Arg Ala Gly Ile Asp Cys Leu Ser Asp 35 40 45ctc ccc gag
gac cgc gtc gac gtg acg gcg tac ttt gac ccc gtc aag 192Leu Pro Glu
Asp Arg Val Asp Val Thr Ala Tyr Phe Asp Pro Val Lys 50 55 60acc acc
aag gac aag atc tac tgc aag cgc ggt ggc ttc att ccc gag 240Thr Thr
Lys Asp Lys Ile Tyr Cys Lys Arg Gly Gly Phe Ile Pro Glu65 70 75
80tac gac ttt gac gcc cgc gag ttc gga ctc aac atg ttc cag atg gag
288Tyr Asp Phe Asp Ala Arg Glu Phe Gly Leu Asn Met Phe Gln Met Glu
85 90 95gac tcg gac gca aac cag acc atc tcg ctt ctc aag gtc aag gag
gcc 336Asp Ser Asp Ala Asn Gln Thr Ile Ser Leu Leu Lys Val Lys Glu
Ala 100 105 110ctc cag gac gcc ggc atc gac gcc ctc ggc aag gaa aag
aag aac atc 384Leu Gln Asp Ala Gly Ile Asp Ala Leu Gly Lys Glu Lys
Lys Asn Ile 115 120 125ggc tgc gtg ctc ggc att ggc ggc ggc caa aag
tcc agc cac gag ttc 432Gly Cys Val Leu Gly Ile Gly Gly Gly Gln Lys
Ser Ser His Glu Phe 130 135 140tac tcg cgc ctt aat tat gtt gtc gtg
gag aag gtc ctc cgc aag atg 480Tyr Ser Arg Leu Asn Tyr Val Val Val
Glu Lys Val Leu Arg Lys Met145 150 155 160ggc atg ccc gag gag gac
gtc aag gtc gcc gtc gaa aag tac aag gcc 528Gly Met Pro Glu Glu Asp
Val Lys Val Ala Val Glu Lys Tyr Lys Ala 165 170 175aac ttc ccc gag
tgg cgc ctc gac tcc ttc cct ggc ttc ctc ggc aac 576Asn Phe Pro Glu
Trp Arg Leu Asp Ser Phe Pro Gly Phe Leu Gly Asn 180 185 190gtc acc
gcc ggt cgc tgc acc aac acc ttc aac ctc gac ggc atg aac 624Val Thr
Ala Gly Arg Cys Thr Asn Thr Phe Asn Leu Asp Gly Met Asn 195 200
205tgc gtt gtc gac gcc gca tgc gcc tcg tcc ctc atc gcc gtc aag gtc
672Cys Val Val Asp Ala Ala Cys Ala Ser Ser Leu Ile Ala Val Lys Val
210 215 220gcc atc gac gag ctg ctc tac ggt gac tgc gac atg atg gtc
acc ggt 720Ala Ile Asp Glu Leu Leu Tyr Gly Asp Cys Asp Met Met Val
Thr Gly225 230 235 240gcc acc tgc acg gat aac tcc atc ggc atg tac
atg gcc ttc tcc aag 768Ala Thr Cys Thr Asp Asn Ser Ile Gly Met Tyr
Met Ala Phe Ser Lys 245 250 255acc ccc gtg ttc tcc acg gac ccc agc
gtg cgc gcc tac gac gaa aag 816Thr Pro Val Phe Ser Thr Asp Pro Ser
Val Arg Ala Tyr Asp Glu Lys 260 265 270aca aag ggc atg ctc atc ggc
gag ggc tcc gcc atg ctc gtc ctc aag 864Thr Lys Gly Met Leu Ile Gly
Glu Gly Ser Ala Met Leu Val Leu Lys 275 280 285cgc tac gcc gac gcc
gtc cgc gac ggc gat gag atc cac gct gtt att 912Arg Tyr Ala Asp Ala
Val Arg Asp Gly Asp Glu Ile His Ala Val Ile 290 295 300cgc ggc tgc
gcc tcc tcc agt gat ggc aag gcc gcc ggc atc tac acg 960Arg Gly Cys
Ala Ser Ser Ser Asp Gly Lys Ala Ala Gly Ile Tyr Thr305 310 315
320ccc acc att tcg ggc cag gag gag gcc ctc cgc cgc gcc tac aac cgc
1008Pro Thr Ile Ser Gly Gln Glu Glu Ala Leu Arg Arg Ala Tyr Asn Arg
325 330 335gcc tgt gtc gac ccg gcc acc gtc act ctc gtc gag ggt cac
ggc acc 1056Ala Cys Val Asp Pro Ala Thr Val Thr Leu Val Glu Gly His
Gly Thr 340 345 350ggt act ccc gtt ggc gac cgc atc gag ctc acc gcc
ttg cgc aac ctc 1104Gly Thr Pro Val Gly Asp Arg Ile Glu Leu Thr Ala
Leu Arg Asn Leu 355 360 365ttt gac aag gcc tac ggc gag ggc aac acc
gaa aag gtc gct gtg ggc 1152Phe Asp Lys Ala Tyr Gly Glu Gly Asn Thr
Glu Lys Val Ala Val Gly 370 375 380agc atc aag tcc agc atc ggc cat
ctc aag gcc gtc gcc ggt ctc gcc 1200Ser Ile Lys Ser Ser Ile Gly His
Leu Lys Ala Val Ala Gly Leu Ala385 390 395 400ggt atg atc aag gtc
atc atg gcg ctc aag cac aag act ctc ccg ggc 1248Gly Met Ile Lys Val
Ile Met Ala Leu Lys His Lys Thr Leu Pro Gly 405 410 415acc atc aac
gtc gac aac cca ccc aac ctc tac gac aac acg ccc atc 1296Thr Ile Asn
Val Asp Asn Pro Pro Asn Leu Tyr Asp Asn Thr Pro Ile 420 425 430aac
gag tcc tcg ctc tac att aac acc atg aac cgc ccc tgg ttc ccg 1344Asn
Glu Ser Ser Leu Tyr Ile Asn Thr Met Asn Arg Pro Trp Phe Pro 435 440
445ccc cct ggt gtg ccc cgc cgc gcc ggc att tcg agc ttt ggc ttt ggt
1392Pro Pro Gly Val Pro Arg Arg Ala Gly Ile Ser Ser Phe Gly Phe Gly
450 455 460ggc gcc aac tac cac gcc gtc ctc gag gag gcc gag ccc gag
cac acg 1440Gly Ala Asn Tyr His Ala Val Leu Glu Glu Ala Glu Pro Glu
His Thr465 470 475 480acc gcg tac cgc ctc aac aag cgc ccg cag ccc
gtg ctc atg atg gcc 1488Thr Ala Tyr Arg Leu Asn Lys Arg Pro Gln Pro
Val Leu Met Met Ala 485 490 495gcc acg ccc gcg gcc ctc cag tcg ctc
tgc gag gcc cag ctc aag gag 1536Ala Thr Pro Ala Ala Leu Gln Ser Leu
Cys Glu Ala Gln Leu Lys Glu 500 505 510ttc gag gcc gcc atc aag gag
aac gag acc gtc aag aac acc gcc tac 1584Phe Glu Ala Ala Ile Lys Glu
Asn Glu Thr Val Lys Asn Thr Ala Tyr 515 520 525atc aag tgc gtc aag
ttc ggc gag cag ttc aaa ttc cct ggc tcc atc 1632Ile Lys Cys Val Lys
Phe Gly Glu Gln Phe Lys Phe Pro Gly Ser Ile 530 535 540ccg gcc aca
aac gcg cgc ctc ggc ttc ctc gtc aag gat gct gag gat 1680Pro Ala Thr
Asn Ala Arg Leu Gly Phe Leu Val Lys Asp Ala Glu Asp545 550 555
560gcc tgc tcc acc ctc cgt gcc atc tgc gcc caa ttc gcc aag gat gtc
1728Ala Cys Ser Thr Leu Arg Ala Ile Cys Ala Gln Phe Ala Lys Asp Val
565 570 575acc aag gag gcc tgg cgc ctc ccc cgc gag ggc gtc agc ttc
cgc gcc 1776Thr Lys Glu Ala Trp Arg Leu Pro Arg Glu Gly Val Ser Phe
Arg Ala 580 585 590aag ggc atc gcc acc aac ggc gct gtc gcc gcg ctc
ttc tcc ggc cag 1824Lys Gly Ile Ala Thr Asn Gly Ala Val Ala Ala Leu
Phe Ser Gly Gln 595 600 605ggc gcg cag tac acg cac atg ttt agc gag
gtg gcc atg aac tgg ccc 1872Gly Ala Gln Tyr Thr His Met Phe Ser Glu
Val Ala Met Asn Trp Pro 610 615 620cag ttc cgc cag agc att gcc gcc
atg gac gcc gcc cag tcc aag gtc 1920Gln Phe Arg Gln Ser Ile Ala Ala
Met Asp Ala Ala Gln Ser Lys Val625 630 635 640gct gga agc gac aag
gac ttt gag cgc gtc tcc cag gtc ctc tac ccg 1968Ala Gly Ser Asp Lys
Asp Phe Glu Arg Val Ser Gln Val Leu Tyr Pro 645 650 655cgc aag ccg
tac gag cgt gag ccc gag cag aac ccc aag aag atc tcc 2016Arg Lys Pro
Tyr Glu Arg Glu Pro Glu Gln Asn Pro Lys Lys Ile Ser 660 665 670ctc
acc gcc tac tcg cag ccc tcg acc ctg gcc tgc gct ctc ggt gcc 2064Leu
Thr Ala Tyr Ser Gln Pro Ser Thr Leu Ala Cys Ala Leu Gly Ala 675 680
685ttt gag atc ttc aag gag gcc ggc ttc acc ccg gac ttt gcc gcc ggc
2112Phe Glu Ile Phe Lys Glu Ala Gly Phe Thr Pro Asp Phe Ala Ala Gly
690 695 700cat tcg ctc ggt gag ttc gcc gcc ctc tac gcc gcg ggc tgc
gtc gac 2160His Ser Leu Gly Glu Phe Ala Ala Leu Tyr Ala Ala Gly Cys
Val Asp705 710 715 720cgc gac gag ctc ttt gag ctt gtc tgc cgc cgc
gcc cgc atc atg ggc 2208Arg Asp Glu Leu Phe Glu Leu Val Cys Arg Arg
Ala Arg Ile Met Gly 725 730 735ggc aag gac gca ccg gcc acc ccc aag
gga tgc atg gcc gcc gtc att 2256Gly Lys Asp Ala Pro Ala Thr Pro Lys
Gly Cys Met Ala Ala Val Ile 740 745 750ggc ccc aac gcc gag aac atc
aag gtc cag gcc gcc aac gtc tgg ctc 2304Gly Pro Asn Ala Glu Asn Ile
Lys Val Gln Ala Ala Asn Val Trp Leu 755 760 765ggc aac tcc aac tcg
cct tcg cag acc gtc atc acc ggc tcc gtc gaa 2352Gly Asn Ser Asn Ser
Pro Ser Gln Thr Val Ile Thr Gly Ser Val Glu 770 775 780ggt atc cag
gcc gag agc gcc cgc ctc cag aag gag ggc ttc cgc gtc 2400Gly Ile Gln
Ala Glu Ser Ala Arg Leu Gln Lys Glu Gly Phe Arg Val785 790 795
800gtg cct ctt gcc tgc gag agc gcc ttc cac tcg ccc cag atg gag aac
2448Val Pro Leu Ala Cys Glu Ser Ala Phe His Ser Pro Gln Met Glu Asn
805 810 815gcc tcg tcg gcc ttc aag gac gtc atc tcc aag gtc tcc ttc
cgc acc 2496Ala Ser Ser Ala Phe Lys Asp Val Ile Ser Lys Val Ser Phe
Arg Thr 820 825 830ccc aag gcc gag acc aag ctc ttc agc aac gtc tct
ggc gag acc tac 2544Pro Lys Ala Glu Thr Lys Leu Phe Ser Asn Val Ser
Gly Glu Thr Tyr 835 840 845ccc acg gac gcc cgc gag atg ctt acg cag
cac atg acc agc agc gtc 2592Pro Thr Asp Ala Arg Glu Met Leu Thr Gln
His Met Thr Ser Ser Val 850 855 860aag ttc ctc acc cag gtc cgc aac
atg cac cag gcc ggt gcg cgc atc 2640Lys Phe Leu Thr Gln Val Arg Asn
Met His Gln Ala Gly Ala Arg Ile865 870 875 880ttt gtc gag ttc gga
ccc aag cag gtg ctc tcc aag ctt gtc tcc gag 2688Phe Val Glu Phe Gly
Pro Lys Gln Val Leu Ser Lys Leu Val Ser Glu 885 890 895acc ctc aag
gat gac ccc tcg gtt gtc acc gtc tct gtc aac ccg gcc 2736Thr Leu Lys
Asp Asp Pro Ser Val Val Thr Val Ser Val Asn Pro Ala 900 905 910tcg
ggc acg gat tcg gac atc cag ctc cgc gac gcg gcc gtc cag ctc 2784Ser
Gly Thr Asp Ser Asp Ile Gln Leu Arg Asp Ala Ala Val Gln Leu 915 920
925gtt gtc gct ggc gtc aac ctt cag ggc ttt gac aag tgg gac gcc ccc
2832Val Val Ala Gly Val Asn Leu Gln Gly Phe Asp Lys Trp Asp Ala Pro
930 935 940gat gcc acc cgc atg cag gcc atc aag aag aag cgc act acc
ctc cgc 2880Asp Ala Thr Arg Met Gln Ala Ile Lys Lys Lys Arg Thr Thr
Leu Arg945 950 955 960ctt tcg gcc gcc acc tac gtc tcg gac aag acc
aag aag gtc cgc gac 2928Leu Ser Ala Ala Thr Tyr Val Ser Asp Lys Thr
Lys Lys Val Arg Asp 965 970 975gcc gcc atg aac gat ggc cgc tgc gtc
acc tac ctc aag ggc gcc gca 2976Ala Ala Met Asn Asp Gly Arg Cys Val
Thr Tyr Leu Lys Gly Ala Ala 980 985 990ccg ctc atc aag gcc ccg gag
ccc gtt gtc gac gag gcc gcc aag cgc 3024Pro Leu Ile Lys Ala Pro Glu
Pro Val Val Asp Glu Ala Ala Lys Arg 995 1000 1005gag gcc gag cgt
ctc cag aag gag ctt cag gat gcc cag cgc cag 3069Glu Ala Glu Arg Leu
Gln Lys Glu Leu Gln Asp Ala Gln Arg Gln 1010 1015 1020ctc gac gac
gcc aag cgc gcc gcc gcc gag gcc aac tcc aag ctc 3114Leu Asp Asp Ala
Lys Arg Ala Ala Ala Glu Ala Asn Ser Lys Leu 1025 1030 1035gcc gct
gcc aag gag gag gcc aag acc gcc gct gct tcg gcc aag 3159Ala Ala Ala
Lys Glu Glu Ala Lys Thr Ala Ala Ala Ser Ala Lys 1040 1045 1050ccc
gca gtt gac act gct gtt gtc gaa aag cat cgt gcc atc ctc 3204Pro Ala
Val Asp Thr Ala Val Val Glu Lys His Arg Ala Ile Leu 1055 1060
1065aag tcc atg ctc gcg gag ctc gat ggc tac gga tcg gtc gac gct
3249Lys Ser Met Leu Ala Glu Leu Asp Gly Tyr Gly Ser Val Asp Ala
1070 1075 1080tct tcc ctc cag cag cag cag cag cag cag acg gcc ccc
gcc ccg 3294Ser Ser Leu Gln Gln Gln Gln Gln Gln Gln Thr Ala Pro Ala
Pro 1085 1090 1095gtc aag gct gct gcg cct gcc gcc ccc gtt gcc tcg
gcc cct gcc 3339Val Lys Ala Ala Ala Pro Ala Ala Pro Val Ala Ser Ala
Pro Ala 1100 1105 1110ccg gct gtc tcg aac gag ctt ctt gag aag gcc
gag act gtc gtc 3384Pro Ala Val Ser Asn Glu Leu Leu Glu Lys Ala Glu
Thr Val Val 1115 1120 1125atg gag gtc ctc gcc gcc aag acc ggc tac
gag acc gac atg atc 3429Met Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu
Thr Asp Met Ile 1130 1135 1140gag gct gac atg gag ctc gag acc gag
ctc ggc att gac tcc atc 3474Glu Ala Asp Met Glu Leu Glu Thr Glu Leu
Gly Ile Asp Ser Ile 1145 1150 1155aag cgt gtc gag atc ctc tcc gag
gtc cag gcc atg ctc aat gtc 3519Lys Arg Val Glu Ile Leu Ser Glu Val
Gln Ala Met Leu Asn Val 1160 1165 1170gag gcc aag gat gtc gat gcc
ctc agc cgc act cgc act gtt ggt 3564Glu Ala Lys Asp Val Asp Ala Leu
Ser Arg Thr Arg Thr Val Gly 1175 1180 1185gag gtt gtc aac gcc atg
aag gcc gag atc gct ggc agc tct gcc 3609Glu Val Val Asn Ala Met Lys
Ala Glu Ile Ala Gly Ser Ser Ala 1190 1195 1200ccg gcg cct gct gcc
gct gct ccg gct ccg gcc aag gct gcc cct 3654Pro Ala Pro Ala Ala Ala
Ala Pro Ala Pro Ala Lys Ala Ala Pro 1205 1210 1215gcc gcc gct gcg
cct gct gtc tcg aac gag ctt ctc gag aag gcc 3699Ala Ala Ala Ala Pro
Ala Val Ser Asn Glu Leu Leu Glu Lys Ala 1220 1225 1230gag acc gtc
gtc atg gag gtc ctc gcc gcc aag act ggc tac gag 3744Glu Thr Val Val
Met Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu 1235 1240 1245act gac
atg atc gag tcc gac atg gag ctc gag act gag ctc ggc 3789Thr Asp Met
Ile Glu Ser Asp Met Glu Leu Glu Thr Glu Leu Gly 1250 1255 1260att
gac tcc atc aag cgt gtc gag atc ctc tcc gag gtt cag gcc 3834Ile Asp
Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala 1265 1270
1275atg ctc aac gtc gag gcc aag gac gtc gac gct ctc agc cgc act
3879Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg Thr
1280 1285 1290cgc act gtg ggt gag gtc gtc aac gcc atg aag gct gag
atc gct 3924Arg Thr Val Gly Glu Val Val Asn Ala Met Lys Ala Glu Ile
Ala 1295 1300 1305ggt ggc tct gcc ccg gcg cct gcc gcc gct gcc cca
ggt ccg gct 3969Gly Gly Ser Ala Pro Ala Pro Ala Ala Ala Ala Pro Gly
Pro Ala 1310 1315 1320gct gcc gcc cct gcg cct gcc gcc gcc gcc cct
gct gtc tcg aac 4014Ala Ala Ala Pro Ala Pro Ala Ala Ala Ala Pro Ala
Val Ser Asn 1325 1330 1335gag ctt ctt gag aag gcc gag acc gtc gtc
atg gag gtc ctc gcc 4059Glu Leu Leu Glu Lys Ala Glu Thr Val Val Met
Glu Val Leu Ala 1340 1345 1350gcc aag act ggc tac gag act gac atg
atc gag tcc gac atg gag 4104Ala Lys Thr Gly Tyr Glu Thr Asp Met Ile
Glu Ser Asp Met Glu 1355 1360 1365ctc gag acc gag ctc ggc att gac
tcc atc aag cgt gtc gag att 4149Leu Glu Thr Glu Leu Gly Ile Asp Ser
Ile Lys Arg Val Glu Ile 1370 1375 1380ctc tcc gag gtc cag gcc atg
ctc aac gtc gag gcc aag gac gtc 4194Leu Ser Glu Val Gln Ala Met Leu
Asn Val Glu Ala Lys Asp Val 1385 1390 1395gac gct ctc agc cgc acc
cgc act gtt ggc gag gtc gtc gat gcc 4239Asp Ala Leu Ser Arg Thr Arg
Thr Val Gly Glu Val Val Asp Ala 1400 1405 1410atg aag gcc gag atc
gct ggt ggc tct gcc ccg gcg cct gcc gcc 4284Met Lys Ala Glu Ile Ala
Gly Gly Ser Ala Pro Ala Pro Ala Ala 1415 1420 1425gct gct cct gct
ccg gct gct gcc gcc cct gcg cct gcc gcc cct 4329Ala Ala Pro Ala Pro
Ala Ala Ala Ala Pro Ala Pro Ala Ala Pro 1430 1435 1440gcg cct gct
gtc tcg agc gag ctt ctc gag aag gcc gag act gtc 4374Ala Pro Ala Val
Ser Ser Glu Leu Leu Glu Lys Ala Glu Thr Val 1445 1450 1455gtc atg
gag gtc ctc gcc gcc aag act ggc tac gag act gac atg 4419Val Met Glu
Val Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp Met 1460 1465 1470atc
gag tcc gac atg gag ctc gag acc gag ctc ggc att gac tcc 4464Ile Glu
Ser Asp Met Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser 1475 1480
1485atc aag cgt gtc gag att ctc tcc gag gtc cag gcc atg ctc aac
4509Ile Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala Met Leu Asn
1490 1495 1500gtc gag gcc aag gac gtc gac gct ctc agc cgc acc cgc
act gtt 4554Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg Thr Arg Thr
Val 1505 1510
1515ggc gag gtc gtc gat gcc atg aag gcc gag atc gct ggt ggc tct
4599Gly Glu Val Val Asp Ala Met Lys Ala Glu Ile Ala Gly Gly Ser
1520 1525 1530gcc ccg gcg cct gcc gcc gct gct cct gct ccg gct gct
gcc gcc 4644Ala Pro Ala Pro Ala Ala Ala Ala Pro Ala Pro Ala Ala Ala
Ala 1535 1540 1545cct gcg cct gcc gcc cct gcg cct gcc gcc cct gcg
cct gct gtc 4689Pro Ala Pro Ala Ala Pro Ala Pro Ala Ala Pro Ala Pro
Ala Val 1550 1555 1560tcg agc gag ctt ctc gag aag gcc gag act gtc
gtc atg gag gtc 4734Ser Ser Glu Leu Leu Glu Lys Ala Glu Thr Val Val
Met Glu Val 1565 1570 1575ctc gcc gcc aag act ggc tac gag act gac
atg att gag tcc gac 4779Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp Met
Ile Glu Ser Asp 1580 1585 1590atg gag ctc gag acc gag ctc ggc att
gac tcc atc aag cgt gtc 4824Met Glu Leu Glu Thr Glu Leu Gly Ile Asp
Ser Ile Lys Arg Val 1595 1600 1605gag att ctc tcc gag gtt cag gcc
atg ctc aac gtc gag gcc aag 4869Glu Ile Leu Ser Glu Val Gln Ala Met
Leu Asn Val Glu Ala Lys 1610 1615 1620gac gtc gac gct ctc agc cgc
act cgc act gtt ggt gag gtc gtc 4914Asp Val Asp Ala Leu Ser Arg Thr
Arg Thr Val Gly Glu Val Val 1625 1630 1635gat gcc atg aag gct gag
atc gct ggc agc tcc gcc tcg gcg cct 4959Asp Ala Met Lys Ala Glu Ile
Ala Gly Ser Ser Ala Ser Ala Pro 1640 1645 1650gcc gcc gct gct cct
gct ccg gct gct gcc gct cct gcg ccc gct 5004Ala Ala Ala Ala Pro Ala
Pro Ala Ala Ala Ala Pro Ala Pro Ala 1655 1660 1665gcc gcc gcc cct
gct gtc tcg aac gag ctt ctc gag aaa gcc gag 5049Ala Ala Ala Pro Ala
Val Ser Asn Glu Leu Leu Glu Lys Ala Glu 1670 1675 1680act gtc gtc
atg gag gtc ctc gcc gcc aag act ggc tac gag act 5094Thr Val Val Met
Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu Thr 1685 1690 1695gac atg
atc gag tcc gac atg gag ctc gag act gag ctc ggc att 5139Asp Met Ile
Glu Ser Asp Met Glu Leu Glu Thr Glu Leu Gly Ile 1700 1705 1710gac
tcc atc aag cgt gtc gag atc ctc tcc gag gtt cag gcc atg 5184Asp Ser
Ile Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala Met 1715 1720
1725ctc aac gtc gag gcc aag gac gtc gat gcc ctc agc cgc acc cgc
5229Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg Thr Arg
1730 1735 1740act gtt ggc gag gtt gtc gat gcc atg aag gcc gag atc
gct ggt 5274Thr Val Gly Glu Val Val Asp Ala Met Lys Ala Glu Ile Ala
Gly 1745 1750 1755ggc tct gcc ccg gcg cct gcc gcc gct gcc cct gct
ccg gct gcc 5319Gly Ser Ala Pro Ala Pro Ala Ala Ala Ala Pro Ala Pro
Ala Ala 1760 1765 1770gcc gcc cct gct gtc tcg aac gag ctt ctc gag
aag gcc gag act 5364Ala Ala Pro Ala Val Ser Asn Glu Leu Leu Glu Lys
Ala Glu Thr 1775 1780 1785gtc gtc atg gag gtc ctc gcc gcc aag act
ggc tac gag acc gac 5409Val Val Met Glu Val Leu Ala Ala Lys Thr Gly
Tyr Glu Thr Asp 1790 1795 1800atg atc gag tcc gac atg gag ctc gag
acc gag ctc ggc att gac 5454Met Ile Glu Ser Asp Met Glu Leu Glu Thr
Glu Leu Gly Ile Asp 1805 1810 1815tcc atc aag cgt gtc gag att ctc
tcc gag gtt cag gcc atg ctc 5499Ser Ile Lys Arg Val Glu Ile Leu Ser
Glu Val Gln Ala Met Leu 1820 1825 1830aac gtc gag gcc aag gac gtc
gat gct ctc agc cgc act cgc act 5544Asn Val Glu Ala Lys Asp Val Asp
Ala Leu Ser Arg Thr Arg Thr 1835 1840 1845gtt ggc gag gtc gtc gat
gcc atg aag gct gag atc gcc ggc agc 5589Val Gly Glu Val Val Asp Ala
Met Lys Ala Glu Ile Ala Gly Ser 1850 1855 1860tcc gcc ccg gcg cct
gcc gcc gct gct cct gct ccg gct gct gcc 5634Ser Ala Pro Ala Pro Ala
Ala Ala Ala Pro Ala Pro Ala Ala Ala 1865 1870 1875gct cct gcg ccc
gct gcc gct gcc cct gct gtc tcg agc gag ctt 5679Ala Pro Ala Pro Ala
Ala Ala Ala Pro Ala Val Ser Ser Glu Leu 1880 1885 1890ctc gag aag
gcc gag acc gtc gtc atg gag gtc ctc gcc gcc aag 5724Leu Glu Lys Ala
Glu Thr Val Val Met Glu Val Leu Ala Ala Lys 1895 1900 1905act ggc
tac gag act gac atg att gag tcc gac atg gag ctc gag 5769Thr Gly Tyr
Glu Thr Asp Met Ile Glu Ser Asp Met Glu Leu Glu 1910 1915 1920act
gag ctc ggc att gac tcc atc aag cgt gtc gag atc ctc tcc 5814Thr Glu
Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser 1925 1930
1935gag gtt cag gcc atg ctc aac gtc gag gcc aag gac gtc gat gcc
5859Glu Val Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala
1940 1945 1950ctc agc cgc acc cgc act gtt ggc gag gtt gtc gat gcc
atg aag 5904Leu Ser Arg Thr Arg Thr Val Gly Glu Val Val Asp Ala Met
Lys 1955 1960 1965gcc gag atc gct ggt ggc tct gcc ccg gcg cct gcc
gcc gct gcc 5949Ala Glu Ile Ala Gly Gly Ser Ala Pro Ala Pro Ala Ala
Ala Ala 1970 1975 1980cct gct ccg gct gcc gcc gcc cct gct gtc tcg
aac gag ctt ctt 5994Pro Ala Pro Ala Ala Ala Ala Pro Ala Val Ser Asn
Glu Leu Leu 1985 1990 1995gag aag gcc gag acc gtc gtc atg gag gtc
ctc gcc gcc aag act 6039Glu Lys Ala Glu Thr Val Val Met Glu Val Leu
Ala Ala Lys Thr 2000 2005 2010ggc tac gag acc gac atg atc gag tcc
gac atg gag ctc gag acc 6084Gly Tyr Glu Thr Asp Met Ile Glu Ser Asp
Met Glu Leu Glu Thr 2015 2020 2025gag ctc ggc att gac tcc atc aag
cgt gtc gag att ctc tcc gag 6129Glu Leu Gly Ile Asp Ser Ile Lys Arg
Val Glu Ile Leu Ser Glu 2030 2035 2040gtt cag gcc atg ctc aac gtc
gag gcc aag gac gtc gac gct ctc 6174Val Gln Ala Met Leu Asn Val Glu
Ala Lys Asp Val Asp Ala Leu 2045 2050 2055agc cgc act cgc act gtt
ggc gag gtc gtc gat gcc atg aag gct 6219Ser Arg Thr Arg Thr Val Gly
Glu Val Val Asp Ala Met Lys Ala 2060 2065 2070gag atc gct ggt ggc
tct gcc ccg gcg cct gcc gcc gct gct cct 6264Glu Ile Ala Gly Gly Ser
Ala Pro Ala Pro Ala Ala Ala Ala Pro 2075 2080 2085gcc tcg gct ggc
gcc gcg cct gcg gtc aag att gac tcg gtc cac 6309Ala Ser Ala Gly Ala
Ala Pro Ala Val Lys Ile Asp Ser Val His 2090 2095 2100ggc gct gac
tgt gat gat ctt tcc ctg atg cac gcc aag gtg gtt 6354Gly Ala Asp Cys
Asp Asp Leu Ser Leu Met His Ala Lys Val Val 2105 2110 2115gac atc
cgc cgc ccg gac gag ctc atc ctg gag cgc ccc gag aac 6399Asp Ile Arg
Arg Pro Asp Glu Leu Ile Leu Glu Arg Pro Glu Asn 2120 2125 2130cgc
ccc gtt ctc gtt gtc gat gac ggc agc gag ctc acc ctc gcc 6444Arg Pro
Val Leu Val Val Asp Asp Gly Ser Glu Leu Thr Leu Ala 2135 2140
2145ctg gtc cgc gtc ctc ggc gcc tgc gcc gtt gtc ctg acc ttt gag
6489Leu Val Arg Val Leu Gly Ala Cys Ala Val Val Leu Thr Phe Glu
2150 2155 2160ggt ctc cag ctc gct cag cgc gct ggt gcc gct gcc atc
cgc cac 6534Gly Leu Gln Leu Ala Gln Arg Ala Gly Ala Ala Ala Ile Arg
His 2165 2170 2175gtg ctc gcc aag gat ctt tcc gcg gag agc gcc gag
aag gcc atc 6579Val Leu Ala Lys Asp Leu Ser Ala Glu Ser Ala Glu Lys
Ala Ile 2180 2185 2190aag gag gcc gag cag cgc ttt ggc gct ctc ggc
ggc ttc atc tcg 6624Lys Glu Ala Glu Gln Arg Phe Gly Ala Leu Gly Gly
Phe Ile Ser 2195 2200 2205cag cag gcg gag cgc ttc gag ccc gcc gaa
atc ctc ggc ttc acg 6669Gln Gln Ala Glu Arg Phe Glu Pro Ala Glu Ile
Leu Gly Phe Thr 2210 2215 2220ctc atg tgc gcc aag ttc gcc aag gct
tcc ctc tgc acg gct gtg 6714Leu Met Cys Ala Lys Phe Ala Lys Ala Ser
Leu Cys Thr Ala Val 2225 2230 2235gct ggc ggc cgc ccg gcc ttt atc
ggt gtg gcg cgc ctt gac ggc 6759Ala Gly Gly Arg Pro Ala Phe Ile Gly
Val Ala Arg Leu Asp Gly 2240 2245 2250cgc ctc gga ttc act tcg cag
ggc act tct gac gcg ctc aag cgt 6804Arg Leu Gly Phe Thr Ser Gln Gly
Thr Ser Asp Ala Leu Lys Arg 2255 2260 2265gcc cag cgt ggt gcc atc
ttt ggc ctc tgc aag acc atc ggc ctc 6849Ala Gln Arg Gly Ala Ile Phe
Gly Leu Cys Lys Thr Ile Gly Leu 2270 2275 2280gag tgg tcc gag tct
gac gtc ttt tcc cgc ggc gtg gac att gct 6894Glu Trp Ser Glu Ser Asp
Val Phe Ser Arg Gly Val Asp Ile Ala 2285 2290 2295cag ggc atg cac
ccc gag gat gcc gcc gtg gcg att gtg cgc gag 6939Gln Gly Met His Pro
Glu Asp Ala Ala Val Ala Ile Val Arg Glu 2300 2305 2310atg gcg tgc
gct gac att cgc att cgc gag gtc ggc att ggc gca 6984Met Ala Cys Ala
Asp Ile Arg Ile Arg Glu Val Gly Ile Gly Ala 2315 2320 2325aac cag
cag cgc tgc acg atc cgt gcc gcc aag ctc gag acc ggc 7029Asn Gln Gln
Arg Cys Thr Ile Arg Ala Ala Lys Leu Glu Thr Gly 2330 2335 2340aac
ccg cag cgc cag atc gcc aag gac gac gtg ctg ctc gtt tct 7074Asn Pro
Gln Arg Gln Ile Ala Lys Asp Asp Val Leu Leu Val Ser 2345 2350
2355ggc ggc gct cgc ggc atc acg cct ctt tgc atc cgg gag atc acg
7119Gly Gly Ala Arg Gly Ile Thr Pro Leu Cys Ile Arg Glu Ile Thr
2360 2365 2370cgc cag atc gcg ggc ggc aag tac att ctg ctt ggc cgc
agc aag 7164Arg Gln Ile Ala Gly Gly Lys Tyr Ile Leu Leu Gly Arg Ser
Lys 2375 2380 2385gtc tct gcg agc gaa ccg gca tgg tgc gct ggc atc
act gac gag 7209Val Ser Ala Ser Glu Pro Ala Trp Cys Ala Gly Ile Thr
Asp Glu 2390 2395 2400aag gct gtg caa aag gct gct acc cag gag ctc
aag cgc gcc ttt 7254Lys Ala Val Gln Lys Ala Ala Thr Gln Glu Leu Lys
Arg Ala Phe 2405 2410 2415agc gct ggc gag ggc ccc aag ccc acg ccc
cgc gct gtc act aag 7299Ser Ala Gly Glu Gly Pro Lys Pro Thr Pro Arg
Ala Val Thr Lys 2420 2425 2430ctt gtg ggc tct gtt ctt ggc gct cgc
gag gtg cgc agc tct att 7344Leu Val Gly Ser Val Leu Gly Ala Arg Glu
Val Arg Ser Ser Ile 2435 2440 2445gct gcg att gaa gcg ctc ggc ggc
aag gcc atc tac tcg tcg tgc 7389Ala Ala Ile Glu Ala Leu Gly Gly Lys
Ala Ile Tyr Ser Ser Cys 2450 2455 2460gac gtg aac tct gcc gcc gac
gtg gcc aag gcc gtg cgc gat gcc 7434Asp Val Asn Ser Ala Ala Asp Val
Ala Lys Ala Val Arg Asp Ala 2465 2470 2475gag tcc cag ctc ggt gcc
cgc gtc tcg ggc atc gtt cat gcc tcg 7479Glu Ser Gln Leu Gly Ala Arg
Val Ser Gly Ile Val His Ala Ser 2480 2485 2490ggc gtg ctc cgc gac
cgt ctc atc gag aag aag ctc ccc gac gag 7524Gly Val Leu Arg Asp Arg
Leu Ile Glu Lys Lys Leu Pro Asp Glu 2495 2500 2505ttc gac gcc gtc
ttt ggc acc aag gtc acc ggt ctc gag aac ctc 7569Phe Asp Ala Val Phe
Gly Thr Lys Val Thr Gly Leu Glu Asn Leu 2510 2515 2520ctc gcc gcc
gtc gac cgc gcc aac ctc aag cac atg gtc ctc ttc 7614Leu Ala Ala Val
Asp Arg Ala Asn Leu Lys His Met Val Leu Phe 2525 2530 2535agc tcg
ctc gcc ggc ttc cac ggc aac gtc ggc cag tct gac tac 7659Ser Ser Leu
Ala Gly Phe His Gly Asn Val Gly Gln Ser Asp Tyr 2540 2545 2550gcc
atg gcc aac gag gcc ctt aac aag atg ggc ctc gag ctc gcc 7704Ala Met
Ala Asn Glu Ala Leu Asn Lys Met Gly Leu Glu Leu Ala 2555 2560
2565aag gac gtc tcg gtc aag tcg atc tgc ttc ggt ccc tgg gac ggt
7749Lys Asp Val Ser Val Lys Ser Ile Cys Phe Gly Pro Trp Asp Gly
2570 2575 2580ggc atg gtg acg ccg cag ctc aag aag cag ttc cag gag
atg ggc 7794Gly Met Val Thr Pro Gln Leu Lys Lys Gln Phe Gln Glu Met
Gly 2585 2590 2595gtg cag atc atc ccc cgc gag ggc ggc gct gat acc
gtg gcg cgc 7839Val Gln Ile Ile Pro Arg Glu Gly Gly Ala Asp Thr Val
Ala Arg 2600 2605 2610atc gtg ctc ggc tcc tcg ccg gct gag atc ctt
gtc ggc aac tgg 7884Ile Val Leu Gly Ser Ser Pro Ala Glu Ile Leu Val
Gly Asn Trp 2615 2620 2625cgc acc ccg tcc aag aag gtc ggc tcg gac
acc atc acc ctg cac 7929Arg Thr Pro Ser Lys Lys Val Gly Ser Asp Thr
Ile Thr Leu His 2630 2635 2640cgc aag att tcc gcc aag tcc aac ccc
ttc ctc gag gac cac gtc 7974Arg Lys Ile Ser Ala Lys Ser Asn Pro Phe
Leu Glu Asp His Val 2645 2650 2655atc cag ggc cgc cgc gtg ctg ccc
atg acg ctg gcc att ggc tcg 8019Ile Gln Gly Arg Arg Val Leu Pro Met
Thr Leu Ala Ile Gly Ser 2660 2665 2670ctc gcg gag acc tgc ctc ggc
ctc ttc ccc ggc tac tcg ctc tgg 8064Leu Ala Glu Thr Cys Leu Gly Leu
Phe Pro Gly Tyr Ser Leu Trp 2675 2680 2685gcc att gac gac gcc cag
ctc ttc aag ggt gtc act gtc gac ggc 8109Ala Ile Asp Asp Ala Gln Leu
Phe Lys Gly Val Thr Val Asp Gly 2690 2695 2700gac gtc aac tgc gag
gtg acc ctc acc ccg tcg acg gcg ccc tcg 8154Asp Val Asn Cys Glu Val
Thr Leu Thr Pro Ser Thr Ala Pro Ser 2705 2710 2715ggc cgc gtc aac
gtc cag gcc acg ctc aag acc ttt tcc agc ggc 8199Gly Arg Val Asn Val
Gln Ala Thr Leu Lys Thr Phe Ser Ser Gly 2720 2725 2730aag ctg gtc
ccg gcc tac cgc gcc gtc atc gtg ctc tcc aac cag 8244Lys Leu Val Pro
Ala Tyr Arg Ala Val Ile Val Leu Ser Asn Gln 2735 2740 2745ggc gcg
ccc ccg gcc aac gcc acc atg cag ccg ccc tcg ctc gat 8289Gly Ala Pro
Pro Ala Asn Ala Thr Met Gln Pro Pro Ser Leu Asp 2750 2755 2760gcc
gat ccg gcg ctc cag ggc tcc gtc tac gac ggc aag acc ctc 8334Ala Asp
Pro Ala Leu Gln Gly Ser Val Tyr Asp Gly Lys Thr Leu 2765 2770
2775ttc cac ggc ccg gcc ttc cgc ggc atc gat gac gtg ctc tcg tgc
8379Phe His Gly Pro Ala Phe Arg Gly Ile Asp Asp Val Leu Ser Cys
2780 2785 2790acc aag agc cag ctt gtg gcc aag tgc agc gct gtc ccc
ggc tcc 8424Thr Lys Ser Gln Leu Val Ala Lys Cys Ser Ala Val Pro Gly
Ser 2795 2800 2805gac gcc gct cgc ggc gag ttt gcc acg gac act gac
gcc cat gac 8469Asp Ala Ala Arg Gly Glu Phe Ala Thr Asp Thr Asp Ala
His Asp 2810 2815 2820ccc ttc gtg aac gac ctg gcc ttt cag gcc atg
ctc gtc tgg gtg 8514Pro Phe Val Asn Asp Leu Ala Phe Gln Ala Met Leu
Val Trp Val 2825 2830 2835cgc cgc acg ctc ggc cag gct gcg ctc ccc
aac tcg atc cag cgc 8559Arg Arg Thr Leu Gly Gln Ala Ala Leu Pro Asn
Ser Ile Gln Arg 2840 2845 2850atc gtc cag cac cgc ccg gtc ccg cag
gac aag ccc ttc tac att 8604Ile Val Gln His Arg Pro Val Pro Gln Asp
Lys Pro Phe Tyr Ile 2855 2860 2865acc ctc cgc tcc aac cag tcg ggc
ggt cac tcc cag cac aag cac 8649Thr Leu Arg Ser Asn Gln Ser Gly Gly
His Ser Gln His Lys His 2870 2875 2880gcc ctt cag ttc cac aac gag
cag ggc gat ctc ttc att gat gtc 8694Ala Leu Gln Phe His Asn Glu Gln
Gly Asp Leu Phe Ile Asp Val 2885 2890 2895cag gct tcg gtc atc gcc
acg gac agc ctt gcc ttc 8730Gln Ala Ser Val Ile Ala Thr Asp Ser Leu
Ala Phe 2900 2905 291022910PRTSchizochytrium sp. 2Met Ala Ala Arg
Leu Gln Glu Gln Lys Gly Gly Glu Met Asp Thr Arg1 5 10 15Ile Ala Ile
Ile Gly Met Ser Ala Ile Leu Pro Cys Gly Thr Thr Val 20 25 30Arg Glu
Ser Trp Glu Thr Ile Arg Ala Gly Ile Asp Cys Leu Ser Asp 35 40 45Leu
Pro Glu Asp Arg Val Asp Val Thr Ala Tyr Phe Asp Pro Val Lys 50 55
60Thr Thr Lys Asp Lys Ile Tyr Cys Lys Arg Gly Gly Phe Ile Pro Glu65
70 75 80Tyr Asp Phe Asp Ala Arg Glu Phe Gly Leu Asn Met Phe Gln Met
Glu 85 90 95Asp Ser Asp Ala Asn Gln Thr Ile Ser Leu Leu Lys Val Lys
Glu Ala 100 105 110Leu Gln Asp Ala Gly Ile Asp Ala Leu Gly Lys Glu
Lys Lys Asn Ile 115 120 125Gly Cys Val Leu Gly Ile Gly Gly Gly Gln
Lys Ser
Ser His Glu Phe 130 135 140Tyr Ser Arg Leu Asn Tyr Val Val Val Glu
Lys Val Leu Arg Lys Met145 150 155 160Gly Met Pro Glu Glu Asp Val
Lys Val Ala Val Glu Lys Tyr Lys Ala 165 170 175Asn Phe Pro Glu Trp
Arg Leu Asp Ser Phe Pro Gly Phe Leu Gly Asn 180 185 190Val Thr Ala
Gly Arg Cys Thr Asn Thr Phe Asn Leu Asp Gly Met Asn 195 200 205Cys
Val Val Asp Ala Ala Cys Ala Ser Ser Leu Ile Ala Val Lys Val 210 215
220Ala Ile Asp Glu Leu Leu Tyr Gly Asp Cys Asp Met Met Val Thr
Gly225 230 235 240Ala Thr Cys Thr Asp Asn Ser Ile Gly Met Tyr Met
Ala Phe Ser Lys 245 250 255Thr Pro Val Phe Ser Thr Asp Pro Ser Val
Arg Ala Tyr Asp Glu Lys 260 265 270Thr Lys Gly Met Leu Ile Gly Glu
Gly Ser Ala Met Leu Val Leu Lys 275 280 285Arg Tyr Ala Asp Ala Val
Arg Asp Gly Asp Glu Ile His Ala Val Ile 290 295 300Arg Gly Cys Ala
Ser Ser Ser Asp Gly Lys Ala Ala Gly Ile Tyr Thr305 310 315 320Pro
Thr Ile Ser Gly Gln Glu Glu Ala Leu Arg Arg Ala Tyr Asn Arg 325 330
335Ala Cys Val Asp Pro Ala Thr Val Thr Leu Val Glu Gly His Gly Thr
340 345 350Gly Thr Pro Val Gly Asp Arg Ile Glu Leu Thr Ala Leu Arg
Asn Leu 355 360 365Phe Asp Lys Ala Tyr Gly Glu Gly Asn Thr Glu Lys
Val Ala Val Gly 370 375 380Ser Ile Lys Ser Ser Ile Gly His Leu Lys
Ala Val Ala Gly Leu Ala385 390 395 400Gly Met Ile Lys Val Ile Met
Ala Leu Lys His Lys Thr Leu Pro Gly 405 410 415Thr Ile Asn Val Asp
Asn Pro Pro Asn Leu Tyr Asp Asn Thr Pro Ile 420 425 430Asn Glu Ser
Ser Leu Tyr Ile Asn Thr Met Asn Arg Pro Trp Phe Pro 435 440 445Pro
Pro Gly Val Pro Arg Arg Ala Gly Ile Ser Ser Phe Gly Phe Gly 450 455
460Gly Ala Asn Tyr His Ala Val Leu Glu Glu Ala Glu Pro Glu His
Thr465 470 475 480Thr Ala Tyr Arg Leu Asn Lys Arg Pro Gln Pro Val
Leu Met Met Ala 485 490 495Ala Thr Pro Ala Ala Leu Gln Ser Leu Cys
Glu Ala Gln Leu Lys Glu 500 505 510Phe Glu Ala Ala Ile Lys Glu Asn
Glu Thr Val Lys Asn Thr Ala Tyr 515 520 525Ile Lys Cys Val Lys Phe
Gly Glu Gln Phe Lys Phe Pro Gly Ser Ile 530 535 540Pro Ala Thr Asn
Ala Arg Leu Gly Phe Leu Val Lys Asp Ala Glu Asp545 550 555 560Ala
Cys Ser Thr Leu Arg Ala Ile Cys Ala Gln Phe Ala Lys Asp Val 565 570
575Thr Lys Glu Ala Trp Arg Leu Pro Arg Glu Gly Val Ser Phe Arg Ala
580 585 590Lys Gly Ile Ala Thr Asn Gly Ala Val Ala Ala Leu Phe Ser
Gly Gln 595 600 605Gly Ala Gln Tyr Thr His Met Phe Ser Glu Val Ala
Met Asn Trp Pro 610 615 620Gln Phe Arg Gln Ser Ile Ala Ala Met Asp
Ala Ala Gln Ser Lys Val625 630 635 640Ala Gly Ser Asp Lys Asp Phe
Glu Arg Val Ser Gln Val Leu Tyr Pro 645 650 655Arg Lys Pro Tyr Glu
Arg Glu Pro Glu Gln Asn Pro Lys Lys Ile Ser 660 665 670Leu Thr Ala
Tyr Ser Gln Pro Ser Thr Leu Ala Cys Ala Leu Gly Ala 675 680 685Phe
Glu Ile Phe Lys Glu Ala Gly Phe Thr Pro Asp Phe Ala Ala Gly 690 695
700His Ser Leu Gly Glu Phe Ala Ala Leu Tyr Ala Ala Gly Cys Val
Asp705 710 715 720Arg Asp Glu Leu Phe Glu Leu Val Cys Arg Arg Ala
Arg Ile Met Gly 725 730 735Gly Lys Asp Ala Pro Ala Thr Pro Lys Gly
Cys Met Ala Ala Val Ile 740 745 750Gly Pro Asn Ala Glu Asn Ile Lys
Val Gln Ala Ala Asn Val Trp Leu 755 760 765Gly Asn Ser Asn Ser Pro
Ser Gln Thr Val Ile Thr Gly Ser Val Glu 770 775 780Gly Ile Gln Ala
Glu Ser Ala Arg Leu Gln Lys Glu Gly Phe Arg Val785 790 795 800Val
Pro Leu Ala Cys Glu Ser Ala Phe His Ser Pro Gln Met Glu Asn 805 810
815Ala Ser Ser Ala Phe Lys Asp Val Ile Ser Lys Val Ser Phe Arg Thr
820 825 830Pro Lys Ala Glu Thr Lys Leu Phe Ser Asn Val Ser Gly Glu
Thr Tyr 835 840 845Pro Thr Asp Ala Arg Glu Met Leu Thr Gln His Met
Thr Ser Ser Val 850 855 860Lys Phe Leu Thr Gln Val Arg Asn Met His
Gln Ala Gly Ala Arg Ile865 870 875 880Phe Val Glu Phe Gly Pro Lys
Gln Val Leu Ser Lys Leu Val Ser Glu 885 890 895Thr Leu Lys Asp Asp
Pro Ser Val Val Thr Val Ser Val Asn Pro Ala 900 905 910Ser Gly Thr
Asp Ser Asp Ile Gln Leu Arg Asp Ala Ala Val Gln Leu 915 920 925Val
Val Ala Gly Val Asn Leu Gln Gly Phe Asp Lys Trp Asp Ala Pro 930 935
940Asp Ala Thr Arg Met Gln Ala Ile Lys Lys Lys Arg Thr Thr Leu
Arg945 950 955 960Leu Ser Ala Ala Thr Tyr Val Ser Asp Lys Thr Lys
Lys Val Arg Asp 965 970 975Ala Ala Met Asn Asp Gly Arg Cys Val Thr
Tyr Leu Lys Gly Ala Ala 980 985 990Pro Leu Ile Lys Ala Pro Glu Pro
Val Val Asp Glu Ala Ala Lys Arg 995 1000 1005Glu Ala Glu Arg Leu
Gln Lys Glu Leu Gln Asp Ala Gln Arg Gln 1010 1015 1020Leu Asp Asp
Ala Lys Arg Ala Ala Ala Glu Ala Asn Ser Lys Leu 1025 1030 1035Ala
Ala Ala Lys Glu Glu Ala Lys Thr Ala Ala Ala Ser Ala Lys 1040 1045
1050Pro Ala Val Asp Thr Ala Val Val Glu Lys His Arg Ala Ile Leu
1055 1060 1065Lys Ser Met Leu Ala Glu Leu Asp Gly Tyr Gly Ser Val
Asp Ala 1070 1075 1080Ser Ser Leu Gln Gln Gln Gln Gln Gln Gln Thr
Ala Pro Ala Pro 1085 1090 1095Val Lys Ala Ala Ala Pro Ala Ala Pro
Val Ala Ser Ala Pro Ala 1100 1105 1110Pro Ala Val Ser Asn Glu Leu
Leu Glu Lys Ala Glu Thr Val Val 1115 1120 1125Met Glu Val Leu Ala
Ala Lys Thr Gly Tyr Glu Thr Asp Met Ile 1130 1135 1140Glu Ala Asp
Met Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser Ile 1145 1150 1155Lys
Arg Val Glu Ile Leu Ser Glu Val Gln Ala Met Leu Asn Val 1160 1165
1170Glu Ala Lys Asp Val Asp Ala Leu Ser Arg Thr Arg Thr Val Gly
1175 1180 1185Glu Val Val Asn Ala Met Lys Ala Glu Ile Ala Gly Ser
Ser Ala 1190 1195 1200Pro Ala Pro Ala Ala Ala Ala Pro Ala Pro Ala
Lys Ala Ala Pro 1205 1210 1215Ala Ala Ala Ala Pro Ala Val Ser Asn
Glu Leu Leu Glu Lys Ala 1220 1225 1230Glu Thr Val Val Met Glu Val
Leu Ala Ala Lys Thr Gly Tyr Glu 1235 1240 1245Thr Asp Met Ile Glu
Ser Asp Met Glu Leu Glu Thr Glu Leu Gly 1250 1255 1260Ile Asp Ser
Ile Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala 1265 1270 1275Met
Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg Thr 1280 1285
1290Arg Thr Val Gly Glu Val Val Asn Ala Met Lys Ala Glu Ile Ala
1295 1300 1305Gly Gly Ser Ala Pro Ala Pro Ala Ala Ala Ala Pro Gly
Pro Ala 1310 1315 1320Ala Ala Ala Pro Ala Pro Ala Ala Ala Ala Pro
Ala Val Ser Asn 1325 1330 1335Glu Leu Leu Glu Lys Ala Glu Thr Val
Val Met Glu Val Leu Ala 1340 1345 1350Ala Lys Thr Gly Tyr Glu Thr
Asp Met Ile Glu Ser Asp Met Glu 1355 1360 1365Leu Glu Thr Glu Leu
Gly Ile Asp Ser Ile Lys Arg Val Glu Ile 1370 1375 1380Leu Ser Glu
Val Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val 1385 1390 1395Asp
Ala Leu Ser Arg Thr Arg Thr Val Gly Glu Val Val Asp Ala 1400 1405
1410Met Lys Ala Glu Ile Ala Gly Gly Ser Ala Pro Ala Pro Ala Ala
1415 1420 1425Ala Ala Pro Ala Pro Ala Ala Ala Ala Pro Ala Pro Ala
Ala Pro 1430 1435 1440Ala Pro Ala Val Ser Ser Glu Leu Leu Glu Lys
Ala Glu Thr Val 1445 1450 1455Val Met Glu Val Leu Ala Ala Lys Thr
Gly Tyr Glu Thr Asp Met 1460 1465 1470Ile Glu Ser Asp Met Glu Leu
Glu Thr Glu Leu Gly Ile Asp Ser 1475 1480 1485Ile Lys Arg Val Glu
Ile Leu Ser Glu Val Gln Ala Met Leu Asn 1490 1495 1500Val Glu Ala
Lys Asp Val Asp Ala Leu Ser Arg Thr Arg Thr Val 1505 1510 1515Gly
Glu Val Val Asp Ala Met Lys Ala Glu Ile Ala Gly Gly Ser 1520 1525
1530Ala Pro Ala Pro Ala Ala Ala Ala Pro Ala Pro Ala Ala Ala Ala
1535 1540 1545Pro Ala Pro Ala Ala Pro Ala Pro Ala Ala Pro Ala Pro
Ala Val 1550 1555 1560Ser Ser Glu Leu Leu Glu Lys Ala Glu Thr Val
Val Met Glu Val 1565 1570 1575Leu Ala Ala Lys Thr Gly Tyr Glu Thr
Asp Met Ile Glu Ser Asp 1580 1585 1590Met Glu Leu Glu Thr Glu Leu
Gly Ile Asp Ser Ile Lys Arg Val 1595 1600 1605Glu Ile Leu Ser Glu
Val Gln Ala Met Leu Asn Val Glu Ala Lys 1610 1615 1620Asp Val Asp
Ala Leu Ser Arg Thr Arg Thr Val Gly Glu Val Val 1625 1630 1635Asp
Ala Met Lys Ala Glu Ile Ala Gly Ser Ser Ala Ser Ala Pro 1640 1645
1650Ala Ala Ala Ala Pro Ala Pro Ala Ala Ala Ala Pro Ala Pro Ala
1655 1660 1665Ala Ala Ala Pro Ala Val Ser Asn Glu Leu Leu Glu Lys
Ala Glu 1670 1675 1680Thr Val Val Met Glu Val Leu Ala Ala Lys Thr
Gly Tyr Glu Thr 1685 1690 1695Asp Met Ile Glu Ser Asp Met Glu Leu
Glu Thr Glu Leu Gly Ile 1700 1705 1710Asp Ser Ile Lys Arg Val Glu
Ile Leu Ser Glu Val Gln Ala Met 1715 1720 1725Leu Asn Val Glu Ala
Lys Asp Val Asp Ala Leu Ser Arg Thr Arg 1730 1735 1740Thr Val Gly
Glu Val Val Asp Ala Met Lys Ala Glu Ile Ala Gly 1745 1750 1755Gly
Ser Ala Pro Ala Pro Ala Ala Ala Ala Pro Ala Pro Ala Ala 1760 1765
1770Ala Ala Pro Ala Val Ser Asn Glu Leu Leu Glu Lys Ala Glu Thr
1775 1780 1785Val Val Met Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu
Thr Asp 1790 1795 1800Met Ile Glu Ser Asp Met Glu Leu Glu Thr Glu
Leu Gly Ile Asp 1805 1810 1815Ser Ile Lys Arg Val Glu Ile Leu Ser
Glu Val Gln Ala Met Leu 1820 1825 1830Asn Val Glu Ala Lys Asp Val
Asp Ala Leu Ser Arg Thr Arg Thr 1835 1840 1845Val Gly Glu Val Val
Asp Ala Met Lys Ala Glu Ile Ala Gly Ser 1850 1855 1860Ser Ala Pro
Ala Pro Ala Ala Ala Ala Pro Ala Pro Ala Ala Ala 1865 1870 1875Ala
Pro Ala Pro Ala Ala Ala Ala Pro Ala Val Ser Ser Glu Leu 1880 1885
1890Leu Glu Lys Ala Glu Thr Val Val Met Glu Val Leu Ala Ala Lys
1895 1900 1905Thr Gly Tyr Glu Thr Asp Met Ile Glu Ser Asp Met Glu
Leu Glu 1910 1915 1920Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val
Glu Ile Leu Ser 1925 1930 1935Glu Val Gln Ala Met Leu Asn Val Glu
Ala Lys Asp Val Asp Ala 1940 1945 1950Leu Ser Arg Thr Arg Thr Val
Gly Glu Val Val Asp Ala Met Lys 1955 1960 1965Ala Glu Ile Ala Gly
Gly Ser Ala Pro Ala Pro Ala Ala Ala Ala 1970 1975 1980Pro Ala Pro
Ala Ala Ala Ala Pro Ala Val Ser Asn Glu Leu Leu 1985 1990 1995Glu
Lys Ala Glu Thr Val Val Met Glu Val Leu Ala Ala Lys Thr 2000 2005
2010Gly Tyr Glu Thr Asp Met Ile Glu Ser Asp Met Glu Leu Glu Thr
2015 2020 2025Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu
Ser Glu 2030 2035 2040Val Gln Ala Met Leu Asn Val Glu Ala Lys Asp
Val Asp Ala Leu 2045 2050 2055Ser Arg Thr Arg Thr Val Gly Glu Val
Val Asp Ala Met Lys Ala 2060 2065 2070Glu Ile Ala Gly Gly Ser Ala
Pro Ala Pro Ala Ala Ala Ala Pro 2075 2080 2085Ala Ser Ala Gly Ala
Ala Pro Ala Val Lys Ile Asp Ser Val His 2090 2095 2100Gly Ala Asp
Cys Asp Asp Leu Ser Leu Met His Ala Lys Val Val 2105 2110 2115Asp
Ile Arg Arg Pro Asp Glu Leu Ile Leu Glu Arg Pro Glu Asn 2120 2125
2130Arg Pro Val Leu Val Val Asp Asp Gly Ser Glu Leu Thr Leu Ala
2135 2140 2145Leu Val Arg Val Leu Gly Ala Cys Ala Val Val Leu Thr
Phe Glu 2150 2155 2160Gly Leu Gln Leu Ala Gln Arg Ala Gly Ala Ala
Ala Ile Arg His 2165 2170 2175Val Leu Ala Lys Asp Leu Ser Ala Glu
Ser Ala Glu Lys Ala Ile 2180 2185 2190Lys Glu Ala Glu Gln Arg Phe
Gly Ala Leu Gly Gly Phe Ile Ser 2195 2200 2205Gln Gln Ala Glu Arg
Phe Glu Pro Ala Glu Ile Leu Gly Phe Thr 2210 2215 2220Leu Met Cys
Ala Lys Phe Ala Lys Ala Ser Leu Cys Thr Ala Val 2225 2230 2235Ala
Gly Gly Arg Pro Ala Phe Ile Gly Val Ala Arg Leu Asp Gly 2240 2245
2250Arg Leu Gly Phe Thr Ser Gln Gly Thr Ser Asp Ala Leu Lys Arg
2255 2260 2265Ala Gln Arg Gly Ala Ile Phe Gly Leu Cys Lys Thr Ile
Gly Leu 2270 2275 2280Glu Trp Ser Glu Ser Asp Val Phe Ser Arg Gly
Val Asp Ile Ala 2285 2290 2295Gln Gly Met His Pro Glu Asp Ala Ala
Val Ala Ile Val Arg Glu 2300 2305 2310Met Ala Cys Ala Asp Ile Arg
Ile Arg Glu Val Gly Ile Gly Ala 2315 2320 2325Asn Gln Gln Arg Cys
Thr Ile Arg Ala Ala Lys Leu Glu Thr Gly 2330 2335 2340Asn Pro Gln
Arg Gln Ile Ala Lys Asp Asp Val Leu Leu Val Ser 2345 2350 2355Gly
Gly Ala Arg Gly Ile Thr Pro Leu Cys Ile Arg Glu Ile Thr 2360 2365
2370Arg Gln Ile Ala Gly Gly Lys Tyr Ile Leu Leu Gly Arg Ser Lys
2375 2380 2385Val Ser Ala Ser Glu Pro Ala Trp Cys Ala Gly Ile Thr
Asp Glu 2390 2395 2400Lys Ala Val Gln Lys Ala Ala Thr Gln Glu Leu
Lys Arg Ala Phe 2405 2410 2415Ser Ala Gly Glu Gly Pro Lys Pro Thr
Pro Arg Ala Val Thr Lys 2420 2425 2430Leu Val Gly Ser Val Leu Gly
Ala Arg Glu Val Arg Ser Ser Ile 2435 2440 2445Ala Ala Ile Glu Ala
Leu Gly Gly Lys Ala Ile Tyr Ser Ser Cys 2450 2455 2460Asp Val Asn
Ser Ala Ala Asp Val Ala Lys Ala Val Arg Asp Ala 2465 2470 2475Glu
Ser Gln Leu Gly Ala Arg Val Ser Gly Ile Val His Ala Ser 2480 2485
2490Gly Val Leu Arg Asp Arg Leu Ile Glu Lys Lys Leu Pro Asp Glu
2495 2500 2505Phe Asp Ala Val Phe Gly Thr Lys Val Thr Gly Leu Glu
Asn Leu 2510 2515 2520Leu Ala Ala Val Asp Arg Ala Asn Leu Lys His
Met Val Leu Phe 2525 2530 2535Ser Ser Leu Ala Gly Phe His Gly Asn
Val Gly Gln Ser Asp Tyr 2540 2545 2550Ala Met Ala Asn Glu Ala Leu
Asn Lys Met Gly Leu Glu Leu Ala 2555 2560 2565Lys Asp Val Ser Val
Lys Ser Ile Cys Phe Gly Pro Trp Asp Gly 2570 2575
2580Gly Met Val Thr Pro Gln Leu Lys Lys Gln Phe Gln Glu Met Gly
2585 2590 2595Val Gln Ile Ile Pro Arg Glu Gly Gly Ala Asp Thr Val
Ala Arg 2600 2605 2610Ile Val Leu Gly Ser Ser Pro Ala Glu Ile Leu
Val Gly Asn Trp 2615 2620 2625Arg Thr Pro Ser Lys Lys Val Gly Ser
Asp Thr Ile Thr Leu His 2630 2635 2640Arg Lys Ile Ser Ala Lys Ser
Asn Pro Phe Leu Glu Asp His Val 2645 2650 2655Ile Gln Gly Arg Arg
Val Leu Pro Met Thr Leu Ala Ile Gly Ser 2660 2665 2670Leu Ala Glu
Thr Cys Leu Gly Leu Phe Pro Gly Tyr Ser Leu Trp 2675 2680 2685Ala
Ile Asp Asp Ala Gln Leu Phe Lys Gly Val Thr Val Asp Gly 2690 2695
2700Asp Val Asn Cys Glu Val Thr Leu Thr Pro Ser Thr Ala Pro Ser
2705 2710 2715Gly Arg Val Asn Val Gln Ala Thr Leu Lys Thr Phe Ser
Ser Gly 2720 2725 2730Lys Leu Val Pro Ala Tyr Arg Ala Val Ile Val
Leu Ser Asn Gln 2735 2740 2745Gly Ala Pro Pro Ala Asn Ala Thr Met
Gln Pro Pro Ser Leu Asp 2750 2755 2760Ala Asp Pro Ala Leu Gln Gly
Ser Val Tyr Asp Gly Lys Thr Leu 2765 2770 2775Phe His Gly Pro Ala
Phe Arg Gly Ile Asp Asp Val Leu Ser Cys 2780 2785 2790Thr Lys Ser
Gln Leu Val Ala Lys Cys Ser Ala Val Pro Gly Ser 2795 2800 2805Asp
Ala Ala Arg Gly Glu Phe Ala Thr Asp Thr Asp Ala His Asp 2810 2815
2820Pro Phe Val Asn Asp Leu Ala Phe Gln Ala Met Leu Val Trp Val
2825 2830 2835Arg Arg Thr Leu Gly Gln Ala Ala Leu Pro Asn Ser Ile
Gln Arg 2840 2845 2850Ile Val Gln His Arg Pro Val Pro Gln Asp Lys
Pro Phe Tyr Ile 2855 2860 2865Thr Leu Arg Ser Asn Gln Ser Gly Gly
His Ser Gln His Lys His 2870 2875 2880Ala Leu Gln Phe His Asn Glu
Gln Gly Asp Leu Phe Ile Asp Val 2885 2890 2895Gln Ala Ser Val Ile
Ala Thr Asp Ser Leu Ala Phe 2900 2905 291036177DNASchizochytrium
sp.CDS(1)..(6177) 3atg gcc gct cgg aat gtg agc gcc gcg cat gag atg
cac gat gaa aag 48Met Ala Ala Arg Asn Val Ser Ala Ala His Glu Met
His Asp Glu Lys1 5 10 15cgc atc gcc gtc gtc ggc atg gcc gtc cag tac
gcc gga tgc aaa acc 96Arg Ile Ala Val Val Gly Met Ala Val Gln Tyr
Ala Gly Cys Lys Thr 20 25 30aag gac gag ttc tgg gag gtg ctc atg aac
ggc aag gtc gag tcc aag 144Lys Asp Glu Phe Trp Glu Val Leu Met Asn
Gly Lys Val Glu Ser Lys 35 40 45gtg atc agc gac aaa cga ctc ggc tcc
aac tac cgc gcc gag cac tac 192Val Ile Ser Asp Lys Arg Leu Gly Ser
Asn Tyr Arg Ala Glu His Tyr 50 55 60aaa gca gag cgc agc aag tat gcc
gac acc ttt tgc aac gaa acg tac 240Lys Ala Glu Arg Ser Lys Tyr Ala
Asp Thr Phe Cys Asn Glu Thr Tyr65 70 75 80ggc acc ctt gac gag aac
gag atc gac aac gag cac gaa ctc ctc ctc 288Gly Thr Leu Asp Glu Asn
Glu Ile Asp Asn Glu His Glu Leu Leu Leu 85 90 95aac ctc gcc aag cag
gca ctc gca gag aca tcc gtc aaa gac tcg aca 336Asn Leu Ala Lys Gln
Ala Leu Ala Glu Thr Ser Val Lys Asp Ser Thr 100 105 110cgc tgc ggc
atc gtc agc ggc tgc ctc tcg ttc ccc atg gac aac ctc 384Arg Cys Gly
Ile Val Ser Gly Cys Leu Ser Phe Pro Met Asp Asn Leu 115 120 125cag
ggt gaa ctc ctc aac gtg tac caa aac cat gtc gag aaa aag ctc 432Gln
Gly Glu Leu Leu Asn Val Tyr Gln Asn His Val Glu Lys Lys Leu 130 135
140ggg gcc cgc gtc ttc aag gac gcc tcc cat tgg tcc gaa cgc gag cag
480Gly Ala Arg Val Phe Lys Asp Ala Ser His Trp Ser Glu Arg Glu
Gln145 150 155 160tcc aac aaa ccc gag gcc ggt gac cgc cgc atc ttc
atg gac ccg gcc 528Ser Asn Lys Pro Glu Ala Gly Asp Arg Arg Ile Phe
Met Asp Pro Ala 165 170 175tcc ttc gtc gcc gaa gaa ctc aac ctc ggc
gcc ctt cac tac tcc gtc 576Ser Phe Val Ala Glu Glu Leu Asn Leu Gly
Ala Leu His Tyr Ser Val 180 185 190gac gca gca tgc gcc acg gcg ctc
tac gtg ctc cgc ctc gcg cag gat 624Asp Ala Ala Cys Ala Thr Ala Leu
Tyr Val Leu Arg Leu Ala Gln Asp 195 200 205cat ctc gtc tcc ggc gcc
gcc gac gtc atg ctc tgc ggt gcc acc tgc 672His Leu Val Ser Gly Ala
Ala Asp Val Met Leu Cys Gly Ala Thr Cys 210 215 220ctg ccg gag ccc
ttt ttc atc ctt tcg ggc ttt tcc acc ttc cag gcc 720Leu Pro Glu Pro
Phe Phe Ile Leu Ser Gly Phe Ser Thr Phe Gln Ala225 230 235 240atg
ccc gtc ggc acg ggc cag aac gtg tcc atg ccg ctg cac aag gac 768Met
Pro Val Gly Thr Gly Gln Asn Val Ser Met Pro Leu His Lys Asp 245 250
255agc cag ggc ctc acc ccg ggt gag ggc ggc tcc atc atg gtc ctc aag
816Ser Gln Gly Leu Thr Pro Gly Glu Gly Gly Ser Ile Met Val Leu Lys
260 265 270cgt ctc gat gat gcc atc cgc gac ggc gac cac att tac ggc
acc ctt 864Arg Leu Asp Asp Ala Ile Arg Asp Gly Asp His Ile Tyr Gly
Thr Leu 275 280 285ctc ggc gcc aat gtc agc aac tcc ggc aca ggt ctg
ccc ctc aag ccc 912Leu Gly Ala Asn Val Ser Asn Ser Gly Thr Gly Leu
Pro Leu Lys Pro 290 295 300ctt ctc ccc agc gag aaa aag tgc ctc atg
gac acc tac acg cgc att 960Leu Leu Pro Ser Glu Lys Lys Cys Leu Met
Asp Thr Tyr Thr Arg Ile305 310 315 320aac gtg cac ccg cac aag att
cag tac gtc gag tgc cac gcc acc ggc 1008Asn Val His Pro His Lys Ile
Gln Tyr Val Glu Cys His Ala Thr Gly 325 330 335acg ccc cag ggt gat
cgt gtg gaa atc gac gcc gtc aag gcc tgc ttt 1056Thr Pro Gln Gly Asp
Arg Val Glu Ile Asp Ala Val Lys Ala Cys Phe 340 345 350gaa ggc aag
gtc ccc cgt ttc ggt acc aca aag ggc aac ttt gga cac 1104Glu Gly Lys
Val Pro Arg Phe Gly Thr Thr Lys Gly Asn Phe Gly His 355 360 365acc
cts gyc gca gcc ggc ttt gcc ggt atg tgc aag gtc ctc ctc tcc 1152Thr
Xaa Xaa Ala Ala Gly Phe Ala Gly Met Cys Lys Val Leu Leu Ser 370 375
380atg aag cat ggc atc atc ccg ccc acc ccg ggt atc gat gac gag acc
1200Met Lys His Gly Ile Ile Pro Pro Thr Pro Gly Ile Asp Asp Glu
Thr385 390 395 400aag atg gac cct ctc gtc gtc tcc ggt gag gcc atc
cca tgg cca gag 1248Lys Met Asp Pro Leu Val Val Ser Gly Glu Ala Ile
Pro Trp Pro Glu 405 410 415acc aac ggc gag ccc aag cgc gcc ggt ctc
tcg gcc ttt ggc ttt ggt 1296Thr Asn Gly Glu Pro Lys Arg Ala Gly Leu
Ser Ala Phe Gly Phe Gly 420 425 430ggc acc aac gcc cat gcc gtc ttt
gag gag cat gac ccc tcc aac gcc 1344Gly Thr Asn Ala His Ala Val Phe
Glu Glu His Asp Pro Ser Asn Ala 435 440 445gcc tgc acg ggc cac gac
tcc att tct gcg ctc tcg gcc cgc tgc ggc 1392Ala Cys Thr Gly His Asp
Ser Ile Ser Ala Leu Ser Ala Arg Cys Gly 450 455 460ggt gaa agc aac
atg cgc atc gcc atc act ggt atg gac gcc acc ttt 1440Gly Glu Ser Asn
Met Arg Ile Ala Ile Thr Gly Met Asp Ala Thr Phe465 470 475 480ggc
gct ctc aag gga ctc gac gcc ttc gag cgc gcc att tac acc ggc 1488Gly
Ala Leu Lys Gly Leu Asp Ala Phe Glu Arg Ala Ile Tyr Thr Gly 485 490
495gct cac ggt gcc atc cca ctc cca gaa aag cgc tgg cgc ttt ctc ggc
1536Ala His Gly Ala Ile Pro Leu Pro Glu Lys Arg Trp Arg Phe Leu Gly
500 505 510aag gac aag gac ttt ctt gac ctc tgc ggc gtc aag gcc acc
ccg cac 1584Lys Asp Lys Asp Phe Leu Asp Leu Cys Gly Val Lys Ala Thr
Pro His 515 520 525ggc tgc tac att gaa gat gtt gag gtc gac ttc cag
cgc ctc cgc acg 1632Gly Cys Tyr Ile Glu Asp Val Glu Val Asp Phe Gln
Arg Leu Arg Thr 530 535 540ccc atg acc cct gaa gac atg ctc ctc cct
cag cag ctt ctg gcc gtc 1680Pro Met Thr Pro Glu Asp Met Leu Leu Pro
Gln Gln Leu Leu Ala Val545 550 555 560acc acc att gac cgc gcc atc
ctc gac tcg gga atg aaa aag ggt ggc 1728Thr Thr Ile Asp Arg Ala Ile
Leu Asp Ser Gly Met Lys Lys Gly Gly 565 570 575aat gtc gcc gtc ttt
gtc ggc ctc ggc acc gac ctc gag ctc tac cgt 1776Asn Val Ala Val Phe
Val Gly Leu Gly Thr Asp Leu Glu Leu Tyr Arg 580 585 590cac cgt gct
cgc gtc gct ctc aag gag cgc gtc cgc cct gaa gcc tcc 1824His Arg Ala
Arg Val Ala Leu Lys Glu Arg Val Arg Pro Glu Ala Ser 595 600 605aag
aag ctc aat gac atg atg cag tac att aac gac tgc ggc aca tcc 1872Lys
Lys Leu Asn Asp Met Met Gln Tyr Ile Asn Asp Cys Gly Thr Ser 610 615
620aca tcg tac acc tcg tac att ggc aac ctc gtc gcc acg cgc gtc tcg
1920Thr Ser Tyr Thr Ser Tyr Ile Gly Asn Leu Val Ala Thr Arg Val
Ser625 630 635 640tcg cag tgg ggc ttc acg ggc ccc tcc ttt acg atc
acc gag ggc aac 1968Ser Gln Trp Gly Phe Thr Gly Pro Ser Phe Thr Ile
Thr Glu Gly Asn 645 650 655aac tcc gtc tac cgc tgc gcc gag ctc ggc
aag tac ctc ctc gag acc 2016Asn Ser Val Tyr Arg Cys Ala Glu Leu Gly
Lys Tyr Leu Leu Glu Thr 660 665 670ggc gag gtc gat ggc gtc gtc gtt
gcg ggt gtc gat ctc tgc ggc agt 2064Gly Glu Val Asp Gly Val Val Val
Ala Gly Val Asp Leu Cys Gly Ser 675 680 685gcc gaa aac ctt tac gtc
aag tct cgc cgc ttc aag gtg tcc acc tcc 2112Ala Glu Asn Leu Tyr Val
Lys Ser Arg Arg Phe Lys Val Ser Thr Ser 690 695 700gat acc ccg cgc
gcc agc ttt gac gcc gcc gcc gat ggc tac ttt gtc 2160Asp Thr Pro Arg
Ala Ser Phe Asp Ala Ala Ala Asp Gly Tyr Phe Val705 710 715 720ggc
gag ggc tgc ggt gcc ttt gtg ctc aag cgt gag act agc tgc acc 2208Gly
Glu Gly Cys Gly Ala Phe Val Leu Lys Arg Glu Thr Ser Cys Thr 725 730
735aag gac gac cgt atc tac gct tgc atg gat gcc atc gtc cct ggc aac
2256Lys Asp Asp Arg Ile Tyr Ala Cys Met Asp Ala Ile Val Pro Gly Asn
740 745 750gtc cct agc gcc tgc ttg cgc gag gcc ctc gac cag gcg cgc
gtc aag 2304Val Pro Ser Ala Cys Leu Arg Glu Ala Leu Asp Gln Ala Arg
Val Lys 755 760 765ccg ggc gat atc gag atg ctc gag ctc agc gcc gac
tcc gcc cgc cac 2352Pro Gly Asp Ile Glu Met Leu Glu Leu Ser Ala Asp
Ser Ala Arg His 770 775 780ctc aag gac ccg tcc gtc ctg ccc aag gag
ctc act gcc gag gag gaa 2400Leu Lys Asp Pro Ser Val Leu Pro Lys Glu
Leu Thr Ala Glu Glu Glu785 790 795 800atc ggc ggc ctt cag acg atc
ctt cgt gac gat gac aag ctc ccg cgc 2448Ile Gly Gly Leu Gln Thr Ile
Leu Arg Asp Asp Asp Lys Leu Pro Arg 805 810 815aac gtc gca acg ggc
agt gtc aag gcc acc gtc ggt gac acc ggt tat 2496Asn Val Ala Thr Gly
Ser Val Lys Ala Thr Val Gly Asp Thr Gly Tyr 820 825 830gcc tct ggt
gct gcc agc ctc atc aag gct gcg ctt tgc atc tac aac 2544Ala Ser Gly
Ala Ala Ser Leu Ile Lys Ala Ala Leu Cys Ile Tyr Asn 835 840 845cgc
tac ctg ccc agc aac ggc gac gac tgg gat gaa ccc gcc cct gag 2592Arg
Tyr Leu Pro Ser Asn Gly Asp Asp Trp Asp Glu Pro Ala Pro Glu 850 855
860gcg ccc tgg gac agc acc ctc ttt gcg tgc cag acc tcg cgc gct tgg
2640Ala Pro Trp Asp Ser Thr Leu Phe Ala Cys Gln Thr Ser Arg Ala
Trp865 870 875 880ctc aag aac cct ggc gag cgt cgc tat gcg gcc gtc
tcg ggc gtc tcc 2688Leu Lys Asn Pro Gly Glu Arg Arg Tyr Ala Ala Val
Ser Gly Val Ser 885 890 895gag acg cgc tcg tgc tat tcc gtg ctc ctc
tcc gaa gcc gag ggc cac 2736Glu Thr Arg Ser Cys Tyr Ser Val Leu Leu
Ser Glu Ala Glu Gly His 900 905 910tac gag cgc gag aac cgc atc tcg
ctc gac gag gag gcg ccc aag ctc 2784Tyr Glu Arg Glu Asn Arg Ile Ser
Leu Asp Glu Glu Ala Pro Lys Leu 915 920 925att gtg ctt cgc gcc gac
tcc cac gag gag atc ctt ggt cgc ctc gac 2832Ile Val Leu Arg Ala Asp
Ser His Glu Glu Ile Leu Gly Arg Leu Asp 930 935 940aag atc cgc gag
cgc ttc ttg cag ccc acg ggc gcc gcc ccg cgc gag 2880Lys Ile Arg Glu
Arg Phe Leu Gln Pro Thr Gly Ala Ala Pro Arg Glu945 950 955 960tcc
gag ctc aag gcg cag gcc cgc cgc atc ttc ctc gag ctc ctc ggc 2928Ser
Glu Leu Lys Ala Gln Ala Arg Arg Ile Phe Leu Glu Leu Leu Gly 965 970
975gag acc ctt gcc cag gat gcc gct tct tca ggc tcg caa aag ccc ctc
2976Glu Thr Leu Ala Gln Asp Ala Ala Ser Ser Gly Ser Gln Lys Pro Leu
980 985 990gct ctc agc ctc gtc tcc acg ccc tcc aag ctc cag cgc gag
gtc gag 3024Ala Leu Ser Leu Val Ser Thr Pro Ser Lys Leu Gln Arg Glu
Val Glu 995 1000 1005ctc gcg gcc aag ggt atc ccg cgc tgc ctc aag
atg cgc cgc gat 3069Leu Ala Ala Lys Gly Ile Pro Arg Cys Leu Lys Met
Arg Arg Asp 1010 1015 1020tgg agc tcc cct gct ggc agc cgc tac gcg
cct gag ccg ctc gcc 3114Trp Ser Ser Pro Ala Gly Ser Arg Tyr Ala Pro
Glu Pro Leu Ala 1025 1030 1035agc gac cgc gtc gcc ttc atg tac ggc
gaa ggt cgc agc cct tac 3159Ser Asp Arg Val Ala Phe Met Tyr Gly Glu
Gly Arg Ser Pro Tyr 1040 1045 1050tac ggc atc acc caa gac att cac
cgc att tgg ccc gaa ctc cac 3204Tyr Gly Ile Thr Gln Asp Ile His Arg
Ile Trp Pro Glu Leu His 1055 1060 1065gag gtc atc aac gaa aag acg
aac cgt ctc tgg gcc gaa ggc gac 3249Glu Val Ile Asn Glu Lys Thr Asn
Arg Leu Trp Ala Glu Gly Asp 1070 1075 1080cgc tgg gtc atg ccg cgc
gcc agc ttc aag tcg gag ctc gag agc 3294Arg Trp Val Met Pro Arg Ala
Ser Phe Lys Ser Glu Leu Glu Ser 1085 1090 1095cag cag caa gag ttt
gat cgc aac atg att gaa atg ttc cgt ctt 3339Gln Gln Gln Glu Phe Asp
Arg Asn Met Ile Glu Met Phe Arg Leu 1100 1105 1110gga atc ctc acc
tca att gcc ttc acc aat ctg gcg cgc gac gtt 3384Gly Ile Leu Thr Ser
Ile Ala Phe Thr Asn Leu Ala Arg Asp Val 1115 1120 1125ctc aac atc
acg ccc aag gcc gcc ttt ggc ctc agt ctt ggc gag 3429Leu Asn Ile Thr
Pro Lys Ala Ala Phe Gly Leu Ser Leu Gly Glu 1130 1135 1140att tcc
atg att ttt gcc ttt tcc aag aag aac ggt ctc atc tcc 3474Ile Ser Met
Ile Phe Ala Phe Ser Lys Lys Asn Gly Leu Ile Ser 1145 1150 1155gac
cag ctc acc aag gat ctt cgc gag tcc gac gtg tgg aac aag 3519Asp Gln
Leu Thr Lys Asp Leu Arg Glu Ser Asp Val Trp Asn Lys 1160 1165
1170gct ctg gcc gtt gaa ttt aat gcg ctg cgc gag gcc tgg ggc att
3564Ala Leu Ala Val Glu Phe Asn Ala Leu Arg Glu Ala Trp Gly Ile
1175 1180 1185cca cag agt gtc ccc aag gac gag ttc tgg caa ggc tac
att gtg 3609Pro Gln Ser Val Pro Lys Asp Glu Phe Trp Gln Gly Tyr Ile
Val 1190 1195 1200cgc ggc acc aag cag gat atc gag gcg gcc atc gcc
ccg gac agc 3654Arg Gly Thr Lys Gln Asp Ile Glu Ala Ala Ile Ala Pro
Asp Ser 1205 1210 1215aag tac gtg cgc ctc acc atc atc aat gat gcc
aac acc gcc ctc 3699Lys Tyr Val Arg Leu Thr Ile Ile Asn Asp Ala Asn
Thr Ala Leu 1220 1225 1230att agc ggc aag ccc gac gcc tgc aag gct
gcg atc gcg cgt ctc 3744Ile Ser Gly Lys Pro Asp Ala Cys Lys Ala Ala
Ile Ala Arg Leu 1235 1240 1245ggt ggc aac att cct gcg ctt ccc gtg
acc cag ggc atg tgc ggc 3789Gly Gly Asn Ile Pro Ala Leu Pro Val Thr
Gln Gly Met Cys Gly 1250 1255 1260cac tgc ccc gag gtg gga cct tat
acc aag gat atc gcc aag atc 3834His Cys Pro Glu Val Gly Pro Tyr Thr
Lys Asp Ile Ala Lys Ile 1265 1270 1275cat gcc aac ctt gag ttc ccc
gtt gtc gac ggc ctt gac ctc tgg 3879His Ala Asn Leu Glu Phe Pro Val
Val Asp Gly Leu Asp Leu Trp 1280 1285 1290acc aca atc aac cag aag
cgc ctc gtg cca cgc gcc acg ggc gcc 3924Thr Thr Ile Asn Gln Lys Arg
Leu Val Pro Arg Ala Thr Gly Ala 1295 1300 1305aag gac gaa tgg gcc
cct tct tcc ttt ggc gag tac gcc ggc cag 3969Lys Asp Glu Trp Ala Pro
Ser Ser Phe
Gly Glu Tyr Ala Gly Gln 1310 1315 1320ctc tac gag aag cag gct aac
ttc ccc caa atc gtc gag acc att 4014Leu Tyr Glu Lys Gln Ala Asn Phe
Pro Gln Ile Val Glu Thr Ile 1325 1330 1335tac aag caa aac tac gac
gtc ttt gtc gag gtt ggg ccc aac aac 4059Tyr Lys Gln Asn Tyr Asp Val
Phe Val Glu Val Gly Pro Asn Asn 1340 1345 1350cac cgt agc acc gca
gtg cgc acc acg ctt ggt ccc cag cgc aac 4104His Arg Ser Thr Ala Val
Arg Thr Thr Leu Gly Pro Gln Arg Asn 1355 1360 1365cac ctt gct ggc
gcc atc gac aag cag aac gag gat gct tgg acg 4149His Leu Ala Gly Ala
Ile Asp Lys Gln Asn Glu Asp Ala Trp Thr 1370 1375 1380acc atc gtc
aag ctt gtg gct tcg ctc aag gcc cac ctt gtt cct 4194Thr Ile Val Lys
Leu Val Ala Ser Leu Lys Ala His Leu Val Pro 1385 1390 1395ggc gtc
acg atc tcg ccg ctg tac cac tcc aag ctt gtg gcg gag 4239Gly Val Thr
Ile Ser Pro Leu Tyr His Ser Lys Leu Val Ala Glu 1400 1405 1410gct
cag gct tgc tac gct gcg ctc tgc aag ggt gaa aag ccc aag 4284Ala Gln
Ala Cys Tyr Ala Ala Leu Cys Lys Gly Glu Lys Pro Lys 1415 1420
1425aag aac aag ttt gtg cgc aag att cag ctc aac ggt cgc ttc aac
4329Lys Asn Lys Phe Val Arg Lys Ile Gln Leu Asn Gly Arg Phe Asn
1430 1435 1440agc aag gcg gac ccc atc tcc tcg gcc gat ctt gcc agc
ttt ccg 4374Ser Lys Ala Asp Pro Ile Ser Ser Ala Asp Leu Ala Ser Phe
Pro 1445 1450 1455cct gcg gac cct gcc att gaa gcc gcc atc tcg agc
cgc atc atg 4419Pro Ala Asp Pro Ala Ile Glu Ala Ala Ile Ser Ser Arg
Ile Met 1460 1465 1470aag cct gtc gct ccc aag ttc tac gcg cgt ctc
aac att gac gag 4464Lys Pro Val Ala Pro Lys Phe Tyr Ala Arg Leu Asn
Ile Asp Glu 1475 1480 1485cag gac gag acc cga gat ccg atc ctc aac
aag gac aac gcg ccg 4509Gln Asp Glu Thr Arg Asp Pro Ile Leu Asn Lys
Asp Asn Ala Pro 1490 1495 1500tct tct tct tct tct tct tct tct tct
tct tct tct tct tct tct 4554Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
Ser Ser Ser Ser Ser 1505 1510 1515ccg tcg cct gct cct tcg gcc ccc
gtg caa aag aag gct gct ccc 4599Pro Ser Pro Ala Pro Ser Ala Pro Val
Gln Lys Lys Ala Ala Pro 1520 1525 1530gcc gcg gag acc aag gct gtt
gct tcg gct gac gca ctt cgc agt 4644Ala Ala Glu Thr Lys Ala Val Ala
Ser Ala Asp Ala Leu Arg Ser 1535 1540 1545gcc ctg ctc gat ctc gac
agt atg ctt gcg ctg agc tct gcc agt 4689Ala Leu Leu Asp Leu Asp Ser
Met Leu Ala Leu Ser Ser Ala Ser 1550 1555 1560gcc tcc ggc aac ctt
gtt gag act gcg cct agc gac gcc tcg gtc 4734Ala Ser Gly Asn Leu Val
Glu Thr Ala Pro Ser Asp Ala Ser Val 1565 1570 1575att gtg ccg ccc
tgc aac att gcg gat ctc ggc agc cgc gcc ttc 4779Ile Val Pro Pro Cys
Asn Ile Ala Asp Leu Gly Ser Arg Ala Phe 1580 1585 1590atg aaa acg
tac ggt gtt tcg gcg cct ctg tac acg ggc gcc atg 4824Met Lys Thr Tyr
Gly Val Ser Ala Pro Leu Tyr Thr Gly Ala Met 1595 1600 1605gcc aag
ggc att gcc tct gcg gac ctc gtc att gcc gcc ggc cgc 4869Ala Lys Gly
Ile Ala Ser Ala Asp Leu Val Ile Ala Ala Gly Arg 1610 1615 1620cag
ggc atc ctt gcg tcc ttt ggc gcc ggc gga ctt ccc atg cag 4914Gln Gly
Ile Leu Ala Ser Phe Gly Ala Gly Gly Leu Pro Met Gln 1625 1630
1635gtt gtg cgt gag tcc atc gaa aag att cag gcc gcc ctg ccc aat
4959Val Val Arg Glu Ser Ile Glu Lys Ile Gln Ala Ala Leu Pro Asn
1640 1645 1650ggc ccg tac gct gtc aac ctt atc cat tct ccc ttt gac
agc aac 5004Gly Pro Tyr Ala Val Asn Leu Ile His Ser Pro Phe Asp Ser
Asn 1655 1660 1665ctc gaa aag ggc aat gtc gat ctc ttc ctc gag aag
ggt gtc acc 5049Leu Glu Lys Gly Asn Val Asp Leu Phe Leu Glu Lys Gly
Val Thr 1670 1675 1680ttt gtc gag gcc tcg gcc ttt atg acg ctc acc
ccg cag gtc gtg 5094Phe Val Glu Ala Ser Ala Phe Met Thr Leu Thr Pro
Gln Val Val 1685 1690 1695cgg tac cgc gcg gct ggc ctc acg cgc aac
gcc gac ggc tcg gtc 5139Arg Tyr Arg Ala Ala Gly Leu Thr Arg Asn Ala
Asp Gly Ser Val 1700 1705 1710aac atc cgc aac cgt atc att ggc aag
gtc tcg cgc acc gag ctc 5184Asn Ile Arg Asn Arg Ile Ile Gly Lys Val
Ser Arg Thr Glu Leu 1715 1720 1725gcc gag atg ttc atg cgt cct gcg
ccc gag cac ctt ctt cag aag 5229Ala Glu Met Phe Met Arg Pro Ala Pro
Glu His Leu Leu Gln Lys 1730 1735 1740ctc att gct tcc ggc gag atc
aac cag gag cag gcc gag ctc gcc 5274Leu Ile Ala Ser Gly Glu Ile Asn
Gln Glu Gln Ala Glu Leu Ala 1745 1750 1755cgc cgt gtt ccc gtc gct
gac gac atc gcg gtc gaa gct gac tcg 5319Arg Arg Val Pro Val Ala Asp
Asp Ile Ala Val Glu Ala Asp Ser 1760 1765 1770ggt ggc cac acc gac
aac cgc ccc atc cac gtc att ctg ccc ctc 5364Gly Gly His Thr Asp Asn
Arg Pro Ile His Val Ile Leu Pro Leu 1775 1780 1785atc atc aac ctt
cgc gac cgc ctt cac cgc gag tgc ggc tac ccg 5409Ile Ile Asn Leu Arg
Asp Arg Leu His Arg Glu Cys Gly Tyr Pro 1790 1795 1800gcc aac ctt
cgc gtc cgt gtg ggc gcc ggc ggt ggc att ggg tgc 5454Ala Asn Leu Arg
Val Arg Val Gly Ala Gly Gly Gly Ile Gly Cys 1805 1810 1815ccc cag
gcg gcg ctg gcc acc ttc aac atg ggt gcc tcc ttt att 5499Pro Gln Ala
Ala Leu Ala Thr Phe Asn Met Gly Ala Ser Phe Ile 1820 1825 1830gtc
acc ggc acc gtg aac cag gtc gcc aag cag tcg ggc acg tgc 5544Val Thr
Gly Thr Val Asn Gln Val Ala Lys Gln Ser Gly Thr Cys 1835 1840
1845gac aat gtg cgc aag cag ctc gcg aag gcc act tac tcg gac gta
5589Asp Asn Val Arg Lys Gln Leu Ala Lys Ala Thr Tyr Ser Asp Val
1850 1855 1860tgc atg gcc ccg gct gcc gac atg ttc gag gaa ggc gtc
aag ctt 5634Cys Met Ala Pro Ala Ala Asp Met Phe Glu Glu Gly Val Lys
Leu 1865 1870 1875cag gtc ctc aag aag gga acc atg ttt ccc tcg cgc
gcc aac aag 5679Gln Val Leu Lys Lys Gly Thr Met Phe Pro Ser Arg Ala
Asn Lys 1880 1885 1890ctc tac gag ctc ttt tgc aag tac gac tcg ttc
gag tcc atg ccc 5724Leu Tyr Glu Leu Phe Cys Lys Tyr Asp Ser Phe Glu
Ser Met Pro 1895 1900 1905ccc gca gag ctt gcg cgc gtc gag aag cgc
atc ttc agc cgc gcg 5769Pro Ala Glu Leu Ala Arg Val Glu Lys Arg Ile
Phe Ser Arg Ala 1910 1915 1920ctc gaa gag gtc tgg gac gag acc aaa
aac ttt tac att aac cgt 5814Leu Glu Glu Val Trp Asp Glu Thr Lys Asn
Phe Tyr Ile Asn Arg 1925 1930 1935ctt cac aac ccg gag aag atc cag
cgc gcc gag cgc gac ccc aag 5859Leu His Asn Pro Glu Lys Ile Gln Arg
Ala Glu Arg Asp Pro Lys 1940 1945 1950ctc aag atg tcg ctg tgc ttt
cgc tgg tac ctg agc ctg gcg agc 5904Leu Lys Met Ser Leu Cys Phe Arg
Trp Tyr Leu Ser Leu Ala Ser 1955 1960 1965cgc tgg gcc aac act gga
gct tcc gat cgc gtc atg gac tac cag 5949Arg Trp Ala Asn Thr Gly Ala
Ser Asp Arg Val Met Asp Tyr Gln 1970 1975 1980gtc tgg tgc ggt cct
gcc att ggt tcc ttc aac gat ttc atc aag 5994Val Trp Cys Gly Pro Ala
Ile Gly Ser Phe Asn Asp Phe Ile Lys 1985 1990 1995gga act tac ctt
gat ccg gcc gtc gca aac gag tac ccg tgc gtc 6039Gly Thr Tyr Leu Asp
Pro Ala Val Ala Asn Glu Tyr Pro Cys Val 2000 2005 2010gtt cag att
aac aag cag atc ctt cgt gga gcg tgc ttc ttg cgc 6084Val Gln Ile Asn
Lys Gln Ile Leu Arg Gly Ala Cys Phe Leu Arg 2015 2020 2025cgt ctc
gaa att ctg cgc aac gca cgc ctt tcc gat ggc gct gcc 6129Arg Leu Glu
Ile Leu Arg Asn Ala Arg Leu Ser Asp Gly Ala Ala 2030 2035 2040gct
ctt gtg gcc agc atc gat gac aca tac gtc ccg gcc gag aag 6174Ala Leu
Val Ala Ser Ile Asp Asp Thr Tyr Val Pro Ala Glu Lys 2045 2050
2055ctg 6177Leu 42059PRTSchizochytrium
sp.misc_feature(370)..(370)The 'Xaa' at location 370 stands for
Leu. 4Met Ala Ala Arg Asn Val Ser Ala Ala His Glu Met His Asp Glu
Lys1 5 10 15Arg Ile Ala Val Val Gly Met Ala Val Gln Tyr Ala Gly Cys
Lys Thr 20 25 30Lys Asp Glu Phe Trp Glu Val Leu Met Asn Gly Lys Val
Glu Ser Lys 35 40 45Val Ile Ser Asp Lys Arg Leu Gly Ser Asn Tyr Arg
Ala Glu His Tyr 50 55 60Lys Ala Glu Arg Ser Lys Tyr Ala Asp Thr Phe
Cys Asn Glu Thr Tyr65 70 75 80Gly Thr Leu Asp Glu Asn Glu Ile Asp
Asn Glu His Glu Leu Leu Leu 85 90 95Asn Leu Ala Lys Gln Ala Leu Ala
Glu Thr Ser Val Lys Asp Ser Thr 100 105 110Arg Cys Gly Ile Val Ser
Gly Cys Leu Ser Phe Pro Met Asp Asn Leu 115 120 125Gln Gly Glu Leu
Leu Asn Val Tyr Gln Asn His Val Glu Lys Lys Leu 130 135 140Gly Ala
Arg Val Phe Lys Asp Ala Ser His Trp Ser Glu Arg Glu Gln145 150 155
160Ser Asn Lys Pro Glu Ala Gly Asp Arg Arg Ile Phe Met Asp Pro Ala
165 170 175Ser Phe Val Ala Glu Glu Leu Asn Leu Gly Ala Leu His Tyr
Ser Val 180 185 190Asp Ala Ala Cys Ala Thr Ala Leu Tyr Val Leu Arg
Leu Ala Gln Asp 195 200 205His Leu Val Ser Gly Ala Ala Asp Val Met
Leu Cys Gly Ala Thr Cys 210 215 220Leu Pro Glu Pro Phe Phe Ile Leu
Ser Gly Phe Ser Thr Phe Gln Ala225 230 235 240Met Pro Val Gly Thr
Gly Gln Asn Val Ser Met Pro Leu His Lys Asp 245 250 255Ser Gln Gly
Leu Thr Pro Gly Glu Gly Gly Ser Ile Met Val Leu Lys 260 265 270Arg
Leu Asp Asp Ala Ile Arg Asp Gly Asp His Ile Tyr Gly Thr Leu 275 280
285Leu Gly Ala Asn Val Ser Asn Ser Gly Thr Gly Leu Pro Leu Lys Pro
290 295 300Leu Leu Pro Ser Glu Lys Lys Cys Leu Met Asp Thr Tyr Thr
Arg Ile305 310 315 320Asn Val His Pro His Lys Ile Gln Tyr Val Glu
Cys His Ala Thr Gly 325 330 335Thr Pro Gln Gly Asp Arg Val Glu Ile
Asp Ala Val Lys Ala Cys Phe 340 345 350Glu Gly Lys Val Pro Arg Phe
Gly Thr Thr Lys Gly Asn Phe Gly His 355 360 365Thr Xaa Xaa Ala Ala
Gly Phe Ala Gly Met Cys Lys Val Leu Leu Ser 370 375 380Met Lys His
Gly Ile Ile Pro Pro Thr Pro Gly Ile Asp Asp Glu Thr385 390 395
400Lys Met Asp Pro Leu Val Val Ser Gly Glu Ala Ile Pro Trp Pro Glu
405 410 415Thr Asn Gly Glu Pro Lys Arg Ala Gly Leu Ser Ala Phe Gly
Phe Gly 420 425 430Gly Thr Asn Ala His Ala Val Phe Glu Glu His Asp
Pro Ser Asn Ala 435 440 445Ala Cys Thr Gly His Asp Ser Ile Ser Ala
Leu Ser Ala Arg Cys Gly 450 455 460Gly Glu Ser Asn Met Arg Ile Ala
Ile Thr Gly Met Asp Ala Thr Phe465 470 475 480Gly Ala Leu Lys Gly
Leu Asp Ala Phe Glu Arg Ala Ile Tyr Thr Gly 485 490 495Ala His Gly
Ala Ile Pro Leu Pro Glu Lys Arg Trp Arg Phe Leu Gly 500 505 510Lys
Asp Lys Asp Phe Leu Asp Leu Cys Gly Val Lys Ala Thr Pro His 515 520
525Gly Cys Tyr Ile Glu Asp Val Glu Val Asp Phe Gln Arg Leu Arg Thr
530 535 540Pro Met Thr Pro Glu Asp Met Leu Leu Pro Gln Gln Leu Leu
Ala Val545 550 555 560Thr Thr Ile Asp Arg Ala Ile Leu Asp Ser Gly
Met Lys Lys Gly Gly 565 570 575Asn Val Ala Val Phe Val Gly Leu Gly
Thr Asp Leu Glu Leu Tyr Arg 580 585 590His Arg Ala Arg Val Ala Leu
Lys Glu Arg Val Arg Pro Glu Ala Ser 595 600 605Lys Lys Leu Asn Asp
Met Met Gln Tyr Ile Asn Asp Cys Gly Thr Ser 610 615 620Thr Ser Tyr
Thr Ser Tyr Ile Gly Asn Leu Val Ala Thr Arg Val Ser625 630 635
640Ser Gln Trp Gly Phe Thr Gly Pro Ser Phe Thr Ile Thr Glu Gly Asn
645 650 655Asn Ser Val Tyr Arg Cys Ala Glu Leu Gly Lys Tyr Leu Leu
Glu Thr 660 665 670Gly Glu Val Asp Gly Val Val Val Ala Gly Val Asp
Leu Cys Gly Ser 675 680 685Ala Glu Asn Leu Tyr Val Lys Ser Arg Arg
Phe Lys Val Ser Thr Ser 690 695 700Asp Thr Pro Arg Ala Ser Phe Asp
Ala Ala Ala Asp Gly Tyr Phe Val705 710 715 720Gly Glu Gly Cys Gly
Ala Phe Val Leu Lys Arg Glu Thr Ser Cys Thr 725 730 735Lys Asp Asp
Arg Ile Tyr Ala Cys Met Asp Ala Ile Val Pro Gly Asn 740 745 750Val
Pro Ser Ala Cys Leu Arg Glu Ala Leu Asp Gln Ala Arg Val Lys 755 760
765Pro Gly Asp Ile Glu Met Leu Glu Leu Ser Ala Asp Ser Ala Arg His
770 775 780Leu Lys Asp Pro Ser Val Leu Pro Lys Glu Leu Thr Ala Glu
Glu Glu785 790 795 800Ile Gly Gly Leu Gln Thr Ile Leu Arg Asp Asp
Asp Lys Leu Pro Arg 805 810 815Asn Val Ala Thr Gly Ser Val Lys Ala
Thr Val Gly Asp Thr Gly Tyr 820 825 830Ala Ser Gly Ala Ala Ser Leu
Ile Lys Ala Ala Leu Cys Ile Tyr Asn 835 840 845Arg Tyr Leu Pro Ser
Asn Gly Asp Asp Trp Asp Glu Pro Ala Pro Glu 850 855 860Ala Pro Trp
Asp Ser Thr Leu Phe Ala Cys Gln Thr Ser Arg Ala Trp865 870 875
880Leu Lys Asn Pro Gly Glu Arg Arg Tyr Ala Ala Val Ser Gly Val Ser
885 890 895Glu Thr Arg Ser Cys Tyr Ser Val Leu Leu Ser Glu Ala Glu
Gly His 900 905 910Tyr Glu Arg Glu Asn Arg Ile Ser Leu Asp Glu Glu
Ala Pro Lys Leu 915 920 925Ile Val Leu Arg Ala Asp Ser His Glu Glu
Ile Leu Gly Arg Leu Asp 930 935 940Lys Ile Arg Glu Arg Phe Leu Gln
Pro Thr Gly Ala Ala Pro Arg Glu945 950 955 960Ser Glu Leu Lys Ala
Gln Ala Arg Arg Ile Phe Leu Glu Leu Leu Gly 965 970 975Glu Thr Leu
Ala Gln Asp Ala Ala Ser Ser Gly Ser Gln Lys Pro Leu 980 985 990Ala
Leu Ser Leu Val Ser Thr Pro Ser Lys Leu Gln Arg Glu Val Glu 995
1000 1005Leu Ala Ala Lys Gly Ile Pro Arg Cys Leu Lys Met Arg Arg
Asp 1010 1015 1020Trp Ser Ser Pro Ala Gly Ser Arg Tyr Ala Pro Glu
Pro Leu Ala 1025 1030 1035Ser Asp Arg Val Ala Phe Met Tyr Gly Glu
Gly Arg Ser Pro Tyr 1040 1045 1050Tyr Gly Ile Thr Gln Asp Ile His
Arg Ile Trp Pro Glu Leu His 1055 1060 1065Glu Val Ile Asn Glu Lys
Thr Asn Arg Leu Trp Ala Glu Gly Asp 1070 1075 1080Arg Trp Val Met
Pro Arg Ala Ser Phe Lys Ser Glu Leu Glu Ser 1085 1090 1095Gln Gln
Gln Glu Phe Asp Arg Asn Met Ile Glu Met Phe Arg Leu 1100 1105
1110Gly Ile Leu Thr Ser Ile Ala Phe Thr Asn Leu Ala Arg Asp Val
1115 1120 1125Leu Asn Ile Thr Pro Lys Ala Ala Phe Gly Leu Ser Leu
Gly Glu 1130 1135 1140Ile Ser Met Ile Phe Ala Phe Ser Lys Lys Asn
Gly Leu Ile Ser 1145 1150 1155Asp Gln Leu Thr Lys Asp Leu Arg Glu
Ser Asp Val Trp Asn Lys 1160 1165 1170Ala Leu Ala Val Glu Phe Asn
Ala Leu Arg Glu Ala Trp Gly Ile 1175 1180 1185Pro Gln Ser Val Pro
Lys Asp Glu Phe Trp Gln Gly Tyr Ile Val 1190 1195 1200Arg Gly Thr
Lys Gln Asp
Ile Glu Ala Ala Ile Ala Pro Asp Ser 1205 1210 1215Lys Tyr Val Arg
Leu Thr Ile Ile Asn Asp Ala Asn Thr Ala Leu 1220 1225 1230Ile Ser
Gly Lys Pro Asp Ala Cys Lys Ala Ala Ile Ala Arg Leu 1235 1240
1245Gly Gly Asn Ile Pro Ala Leu Pro Val Thr Gln Gly Met Cys Gly
1250 1255 1260His Cys Pro Glu Val Gly Pro Tyr Thr Lys Asp Ile Ala
Lys Ile 1265 1270 1275His Ala Asn Leu Glu Phe Pro Val Val Asp Gly
Leu Asp Leu Trp 1280 1285 1290Thr Thr Ile Asn Gln Lys Arg Leu Val
Pro Arg Ala Thr Gly Ala 1295 1300 1305Lys Asp Glu Trp Ala Pro Ser
Ser Phe Gly Glu Tyr Ala Gly Gln 1310 1315 1320Leu Tyr Glu Lys Gln
Ala Asn Phe Pro Gln Ile Val Glu Thr Ile 1325 1330 1335Tyr Lys Gln
Asn Tyr Asp Val Phe Val Glu Val Gly Pro Asn Asn 1340 1345 1350His
Arg Ser Thr Ala Val Arg Thr Thr Leu Gly Pro Gln Arg Asn 1355 1360
1365His Leu Ala Gly Ala Ile Asp Lys Gln Asn Glu Asp Ala Trp Thr
1370 1375 1380Thr Ile Val Lys Leu Val Ala Ser Leu Lys Ala His Leu
Val Pro 1385 1390 1395Gly Val Thr Ile Ser Pro Leu Tyr His Ser Lys
Leu Val Ala Glu 1400 1405 1410Ala Gln Ala Cys Tyr Ala Ala Leu Cys
Lys Gly Glu Lys Pro Lys 1415 1420 1425Lys Asn Lys Phe Val Arg Lys
Ile Gln Leu Asn Gly Arg Phe Asn 1430 1435 1440Ser Lys Ala Asp Pro
Ile Ser Ser Ala Asp Leu Ala Ser Phe Pro 1445 1450 1455Pro Ala Asp
Pro Ala Ile Glu Ala Ala Ile Ser Ser Arg Ile Met 1460 1465 1470Lys
Pro Val Ala Pro Lys Phe Tyr Ala Arg Leu Asn Ile Asp Glu 1475 1480
1485Gln Asp Glu Thr Arg Asp Pro Ile Leu Asn Lys Asp Asn Ala Pro
1490 1495 1500Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
Ser Ser 1505 1510 1515Pro Ser Pro Ala Pro Ser Ala Pro Val Gln Lys
Lys Ala Ala Pro 1520 1525 1530Ala Ala Glu Thr Lys Ala Val Ala Ser
Ala Asp Ala Leu Arg Ser 1535 1540 1545Ala Leu Leu Asp Leu Asp Ser
Met Leu Ala Leu Ser Ser Ala Ser 1550 1555 1560Ala Ser Gly Asn Leu
Val Glu Thr Ala Pro Ser Asp Ala Ser Val 1565 1570 1575Ile Val Pro
Pro Cys Asn Ile Ala Asp Leu Gly Ser Arg Ala Phe 1580 1585 1590Met
Lys Thr Tyr Gly Val Ser Ala Pro Leu Tyr Thr Gly Ala Met 1595 1600
1605Ala Lys Gly Ile Ala Ser Ala Asp Leu Val Ile Ala Ala Gly Arg
1610 1615 1620Gln Gly Ile Leu Ala Ser Phe Gly Ala Gly Gly Leu Pro
Met Gln 1625 1630 1635Val Val Arg Glu Ser Ile Glu Lys Ile Gln Ala
Ala Leu Pro Asn 1640 1645 1650Gly Pro Tyr Ala Val Asn Leu Ile His
Ser Pro Phe Asp Ser Asn 1655 1660 1665Leu Glu Lys Gly Asn Val Asp
Leu Phe Leu Glu Lys Gly Val Thr 1670 1675 1680Phe Val Glu Ala Ser
Ala Phe Met Thr Leu Thr Pro Gln Val Val 1685 1690 1695Arg Tyr Arg
Ala Ala Gly Leu Thr Arg Asn Ala Asp Gly Ser Val 1700 1705 1710Asn
Ile Arg Asn Arg Ile Ile Gly Lys Val Ser Arg Thr Glu Leu 1715 1720
1725Ala Glu Met Phe Met Arg Pro Ala Pro Glu His Leu Leu Gln Lys
1730 1735 1740Leu Ile Ala Ser Gly Glu Ile Asn Gln Glu Gln Ala Glu
Leu Ala 1745 1750 1755Arg Arg Val Pro Val Ala Asp Asp Ile Ala Val
Glu Ala Asp Ser 1760 1765 1770Gly Gly His Thr Asp Asn Arg Pro Ile
His Val Ile Leu Pro Leu 1775 1780 1785Ile Ile Asn Leu Arg Asp Arg
Leu His Arg Glu Cys Gly Tyr Pro 1790 1795 1800Ala Asn Leu Arg Val
Arg Val Gly Ala Gly Gly Gly Ile Gly Cys 1805 1810 1815Pro Gln Ala
Ala Leu Ala Thr Phe Asn Met Gly Ala Ser Phe Ile 1820 1825 1830Val
Thr Gly Thr Val Asn Gln Val Ala Lys Gln Ser Gly Thr Cys 1835 1840
1845Asp Asn Val Arg Lys Gln Leu Ala Lys Ala Thr Tyr Ser Asp Val
1850 1855 1860Cys Met Ala Pro Ala Ala Asp Met Phe Glu Glu Gly Val
Lys Leu 1865 1870 1875Gln Val Leu Lys Lys Gly Thr Met Phe Pro Ser
Arg Ala Asn Lys 1880 1885 1890Leu Tyr Glu Leu Phe Cys Lys Tyr Asp
Ser Phe Glu Ser Met Pro 1895 1900 1905Pro Ala Glu Leu Ala Arg Val
Glu Lys Arg Ile Phe Ser Arg Ala 1910 1915 1920Leu Glu Glu Val Trp
Asp Glu Thr Lys Asn Phe Tyr Ile Asn Arg 1925 1930 1935Leu His Asn
Pro Glu Lys Ile Gln Arg Ala Glu Arg Asp Pro Lys 1940 1945 1950Leu
Lys Met Ser Leu Cys Phe Arg Trp Tyr Leu Ser Leu Ala Ser 1955 1960
1965Arg Trp Ala Asn Thr Gly Ala Ser Asp Arg Val Met Asp Tyr Gln
1970 1975 1980Val Trp Cys Gly Pro Ala Ile Gly Ser Phe Asn Asp Phe
Ile Lys 1985 1990 1995Gly Thr Tyr Leu Asp Pro Ala Val Ala Asn Glu
Tyr Pro Cys Val 2000 2005 2010Val Gln Ile Asn Lys Gln Ile Leu Arg
Gly Ala Cys Phe Leu Arg 2015 2020 2025Arg Leu Glu Ile Leu Arg Asn
Ala Arg Leu Ser Asp Gly Ala Ala 2030 2035 2040Ala Leu Val Ala Ser
Ile Asp Asp Thr Tyr Val Pro Ala Glu Lys 2045 2050 2055Leu
54509DNASchizochytrium sp.CDS(1)..(4509) 5atg gcg ctc cgt gtc aag
acg aac aag aag cca tgc tgg gag atg acc 48Met Ala Leu Arg Val Lys
Thr Asn Lys Lys Pro Cys Trp Glu Met Thr1 5 10 15aag gag gag ctg acc
agc ggc aag acc gag gtg ttc aac tat gag gaa 96Lys Glu Glu Leu Thr
Ser Gly Lys Thr Glu Val Phe Asn Tyr Glu Glu 20 25 30ctc ctc gag ttc
gca gag ggc gac atc gcc aag gtc ttc gga ccc gag 144Leu Leu Glu Phe
Ala Glu Gly Asp Ile Ala Lys Val Phe Gly Pro Glu 35 40 45ttc gcc gtc
atc gac aag tac ccg cgc cgc gtg cgc ctg ccc gcc cgc 192Phe Ala Val
Ile Asp Lys Tyr Pro Arg Arg Val Arg Leu Pro Ala Arg 50 55 60gag tac
ctg ctc gtg acc cgc gtc acc ctc atg gac gcc gag gtc aac 240Glu Tyr
Leu Leu Val Thr Arg Val Thr Leu Met Asp Ala Glu Val Asn65 70 75
80aac tac cgc gtc ggc gcc cgc atg gtc acc gag tac gat ctc ccc gtc
288Asn Tyr Arg Val Gly Ala Arg Met Val Thr Glu Tyr Asp Leu Pro Val
85 90 95aac gga gag ctc tcc gag ggc gga gac tgc ccc tgg gcc gtc ctg
gtc 336Asn Gly Glu Leu Ser Glu Gly Gly Asp Cys Pro Trp Ala Val Leu
Val 100 105 110gag agt ggc cag tgc gat ctc atg ctc atc tcc tac atg
ggc att gac 384Glu Ser Gly Gln Cys Asp Leu Met Leu Ile Ser Tyr Met
Gly Ile Asp 115 120 125ttc cag aac cag ggc gac cgc gtc tac cgc ctg
ctc aac acc acg ctc 432Phe Gln Asn Gln Gly Asp Arg Val Tyr Arg Leu
Leu Asn Thr Thr Leu 130 135 140acc ttt tac ggc gtg gcc cac gag ggc
gag acc ctc gag tac gac att 480Thr Phe Tyr Gly Val Ala His Glu Gly
Glu Thr Leu Glu Tyr Asp Ile145 150 155 160cgc gtc acc ggc ttc gcc
aag cgt ctc gac ggc ggc atc tcc atg ttc 528Arg Val Thr Gly Phe Ala
Lys Arg Leu Asp Gly Gly Ile Ser Met Phe 165 170 175ttc ttc gag tac
gac tgc tac gtc aac ggc cgc ctc ctc atc gag atg 576Phe Phe Glu Tyr
Asp Cys Tyr Val Asn Gly Arg Leu Leu Ile Glu Met 180 185 190cgc gat
ggc tgc gcc ggc ttc ttc acc aac gag gag ctc gac gcc ggc 624Arg Asp
Gly Cys Ala Gly Phe Phe Thr Asn Glu Glu Leu Asp Ala Gly 195 200
205aag ggc gtc gtc ttc acc cgc ggc gac ctc gcc gcc cgc gcc aag atc
672Lys Gly Val Val Phe Thr Arg Gly Asp Leu Ala Ala Arg Ala Lys Ile
210 215 220cca aag cag gac gtc tcc ccc tac gcc gtc gcc ccc tgc ctc
cac aag 720Pro Lys Gln Asp Val Ser Pro Tyr Ala Val Ala Pro Cys Leu
His Lys225 230 235 240acc aag ctc aac gaa aag gag atg cag acc ctc
gtc gac aag gac tgg 768Thr Lys Leu Asn Glu Lys Glu Met Gln Thr Leu
Val Asp Lys Asp Trp 245 250 255gca tcc gtc ttt ggc tcc aag aac ggc
atg ccg gaa atc aac tac aaa 816Ala Ser Val Phe Gly Ser Lys Asn Gly
Met Pro Glu Ile Asn Tyr Lys 260 265 270ctc tgc gcg cgt aag atg ctc
atg att gac cgc gtc acc agc att gac 864Leu Cys Ala Arg Lys Met Leu
Met Ile Asp Arg Val Thr Ser Ile Asp 275 280 285cac aag ggc ggt gtc
tac ggc ctc ggt cag ctc gtc ggt gaa aag atc 912His Lys Gly Gly Val
Tyr Gly Leu Gly Gln Leu Val Gly Glu Lys Ile 290 295 300ctc gag cgc
gac cac tgg tac ttt ccc tgc cac ttt gtc aag gat cag 960Leu Glu Arg
Asp His Trp Tyr Phe Pro Cys His Phe Val Lys Asp Gln305 310 315
320gtc atg gcc gga tcc ctc gtc tcc gac ggc tgc agc cag atg ctc aag
1008Val Met Ala Gly Ser Leu Val Ser Asp Gly Cys Ser Gln Met Leu Lys
325 330 335atg tac atg atc tgg ctc ggc ctc cac ctc acc acc gga ccc
ttt gac 1056Met Tyr Met Ile Trp Leu Gly Leu His Leu Thr Thr Gly Pro
Phe Asp 340 345 350ttc cgc ccg gtc aac ggc cac ccc aac aag gtc cgc
tgc cgc ggc caa 1104Phe Arg Pro Val Asn Gly His Pro Asn Lys Val Arg
Cys Arg Gly Gln 355 360 365atc tcc ccg cac aag ggc aag ctc gtc tac
gtc atg gag atc aag gag 1152Ile Ser Pro His Lys Gly Lys Leu Val Tyr
Val Met Glu Ile Lys Glu 370 375 380atg ggc ttc gac gag gac aac gac
ccg tac gcc att gcc gac gtc aac 1200Met Gly Phe Asp Glu Asp Asn Asp
Pro Tyr Ala Ile Ala Asp Val Asn385 390 395 400atc att gat gtc gac
ttc gaa aag ggc cag gac ttt agc ctc gac cgc 1248Ile Ile Asp Val Asp
Phe Glu Lys Gly Gln Asp Phe Ser Leu Asp Arg 405 410 415atc agc gac
tac ggc aag ggc gac ctc aac aag aag atc gtc gtc gac 1296Ile Ser Asp
Tyr Gly Lys Gly Asp Leu Asn Lys Lys Ile Val Val Asp 420 425 430ttt
aag ggc atc gct ctc aag atg cag aag cgc tcc acc aac aag aac 1344Phe
Lys Gly Ile Ala Leu Lys Met Gln Lys Arg Ser Thr Asn Lys Asn 435 440
445ccc tcc aag gtt cag ccc gtc ttt gcc aac ggc gcc gcc act gtc ggc
1392Pro Ser Lys Val Gln Pro Val Phe Ala Asn Gly Ala Ala Thr Val Gly
450 455 460ccc gag gcc tcc aag gct tcc tcc ggc gcc agc gcc agc gcc
agc gcc 1440Pro Glu Ala Ser Lys Ala Ser Ser Gly Ala Ser Ala Ser Ala
Ser Ala465 470 475 480gcc ccg gcc aag cct gcc ttc agc gcc gat gtt
ctt gcg ccc aag ccc 1488Ala Pro Ala Lys Pro Ala Phe Ser Ala Asp Val
Leu Ala Pro Lys Pro 485 490 495gtt gcc ctt ccc gag cac atc ctc aag
ggc gac gcc ctc gcc ccc aag 1536Val Ala Leu Pro Glu His Ile Leu Lys
Gly Asp Ala Leu Ala Pro Lys 500 505 510gag atg tcc tgg cac ccc atg
gcc cgc atc ccg ggc aac ccg acg ccc 1584Glu Met Ser Trp His Pro Met
Ala Arg Ile Pro Gly Asn Pro Thr Pro 515 520 525tct ttt gcg ccc tcg
gcc tac aag ccg cgc aac atc gcc ttt acg ccc 1632Ser Phe Ala Pro Ser
Ala Tyr Lys Pro Arg Asn Ile Ala Phe Thr Pro 530 535 540ttc ccc ggc
aac ccc aac gat aac gac cac acc ccg ggc aag atg ccg 1680Phe Pro Gly
Asn Pro Asn Asp Asn Asp His Thr Pro Gly Lys Met Pro545 550 555
560ctc acc tgg ttc aac atg gcc gag ttc atg gcc ggc aag gtc agc atg
1728Leu Thr Trp Phe Asn Met Ala Glu Phe Met Ala Gly Lys Val Ser Met
565 570 575tgc ctc ggc ccc gag ttc gcc aag ttc gac gac tcg aac acc
agc cgc 1776Cys Leu Gly Pro Glu Phe Ala Lys Phe Asp Asp Ser Asn Thr
Ser Arg 580 585 590agc ccc gct tgg gac ctc gct ctc gtc acc cgc gcc
gtg tct gtg tct 1824Ser Pro Ala Trp Asp Leu Ala Leu Val Thr Arg Ala
Val Ser Val Ser 595 600 605gac ctc aag cac gtc aac tac cgc aac atc
gac ctc gac ccc tcc aag 1872Asp Leu Lys His Val Asn Tyr Arg Asn Ile
Asp Leu Asp Pro Ser Lys 610 615 620ggt acc atg gtc ggc gag ttc gac
tgc ccc gcg gac gcc tgg ttc tac 1920Gly Thr Met Val Gly Glu Phe Asp
Cys Pro Ala Asp Ala Trp Phe Tyr625 630 635 640aag ggc gcc tgc aac
gat gcc cac atg ccg tac tcg atc ctc atg gag 1968Lys Gly Ala Cys Asn
Asp Ala His Met Pro Tyr Ser Ile Leu Met Glu 645 650 655atc gcc ctc
cag acc tcg ggt gtg ctc acc tcg gtg ctc aag gcg ccc 2016Ile Ala Leu
Gln Thr Ser Gly Val Leu Thr Ser Val Leu Lys Ala Pro 660 665 670ctg
acc atg gag aag gac gac atc ctc ttc cgc aac ctc gac gcc aac 2064Leu
Thr Met Glu Lys Asp Asp Ile Leu Phe Arg Asn Leu Asp Ala Asn 675 680
685gcc gag ttc gtg cgc gcc gac ctc gac tac cgc ggc aag act atc cgc
2112Ala Glu Phe Val Arg Ala Asp Leu Asp Tyr Arg Gly Lys Thr Ile Arg
690 695 700aac gtc acc aag tgc act ggc tac agc atg ctc ggc gag atg
ggc gtc 2160Asn Val Thr Lys Cys Thr Gly Tyr Ser Met Leu Gly Glu Met
Gly Val705 710 715 720cac cgc ttc acc ttt gag ctc tac gtc gat gat
gtg ctc ttt tac aag 2208His Arg Phe Thr Phe Glu Leu Tyr Val Asp Asp
Val Leu Phe Tyr Lys 725 730 735ggc tcg acc tcg ttc ggc tgg ttc gtg
ccc gag gtc ttt gcc gcc cag 2256Gly Ser Thr Ser Phe Gly Trp Phe Val
Pro Glu Val Phe Ala Ala Gln 740 745 750gcc ggc ctc gac aac ggc cgc
aag tcg gag ccc tgg ttc att gag aac 2304Ala Gly Leu Asp Asn Gly Arg
Lys Ser Glu Pro Trp Phe Ile Glu Asn 755 760 765aag gtt ccg gcc tcg
cag gtc tcc tcc ttt gac gtg cgc ccc aac ggc 2352Lys Val Pro Ala Ser
Gln Val Ser Ser Phe Asp Val Arg Pro Asn Gly 770 775 780agc ggc cgc
acc gcc atc ttc gcc aac gcc ccc agc ggc gcc cag ctc 2400Ser Gly Arg
Thr Ala Ile Phe Ala Asn Ala Pro Ser Gly Ala Gln Leu785 790 795
800aac cgc cgc acg gac cag ggc cag tac ctc gac gcc gtc gac att gtc
2448Asn Arg Arg Thr Asp Gln Gly Gln Tyr Leu Asp Ala Val Asp Ile Val
805 810 815tcc ggc agc ggc aag aag agc ctc ggc tac gcc cac ggt tcc
aag acg 2496Ser Gly Ser Gly Lys Lys Ser Leu Gly Tyr Ala His Gly Ser
Lys Thr 820 825 830gtc aac ccg aac gac tgg ttc ttc tcg tgc cac ttt
tgg ttt gac tcg 2544Val Asn Pro Asn Asp Trp Phe Phe Ser Cys His Phe
Trp Phe Asp Ser 835 840 845gtc atg ccc gga agt ctc ggt gtc gag tcc
atg ttc cag ctc gtc gag 2592Val Met Pro Gly Ser Leu Gly Val Glu Ser
Met Phe Gln Leu Val Glu 850 855 860gcc atc gcc gcc cac gag gat ctc
gct ggc aaa gca cgg cat tgc caa 2640Ala Ile Ala Ala His Glu Asp Leu
Ala Gly Lys Ala Arg His Cys Gln865 870 875 880ccc cac ctt tgt gca
cgc ccc cgg gca aga tca agc tgg aag tac cgc 2688Pro His Leu Cys Ala
Arg Pro Arg Ala Arg Ser Ser Trp Lys Tyr Arg 885 890 895ggc cag ctc
acg ccc aag agc aag aag atg gac tcg gag gtc cac atc 2736Gly Gln Leu
Thr Pro Lys Ser Lys Lys Met Asp Ser Glu Val His Ile 900 905 910gtg
tcc gtg gac gcc cac gac ggc gtt gtc gac ctc gtc gcc gac ggc 2784Val
Ser Val Asp Ala His Asp Gly Val Val Asp Leu Val Ala Asp Gly 915 920
925ttc ctc tgg gcc gac agc ctc cgc gtc tac tcg gtg agc aac att cgc
2832Phe Leu Trp Ala Asp Ser Leu Arg Val Tyr Ser Val Ser Asn Ile Arg
930 935 940gtg cgc atc gcc tcc ggt gag gcc cct gcc gcc gcc tcc tcc
gcc gcc 2880Val Arg Ile Ala Ser Gly Glu Ala Pro Ala Ala Ala Ser Ser
Ala Ala945 950 955 960tct gtg ggc tcc tcg gct tcg tcc gtc gag cgc
acg cgc tcg agc ccc 2928Ser Val Gly Ser Ser Ala Ser Ser Val Glu Arg
Thr Arg Ser Ser Pro 965 970 975gct gtc gcc tcc ggc ccg gcc cag acc
atc gac ctc aag cag ctc aag 2976Ala Val Ala Ser Gly Pro Ala Gln Thr
Ile Asp Leu Lys Gln Leu Lys 980 985 990acc gag ctc ctc gag ctc gat
gcc ccg ctc tac ctc tcg cag gac ccg
3024Thr Glu Leu Leu Glu Leu Asp Ala Pro Leu Tyr Leu Ser Gln Asp Pro
995 1000 1005acc agc ggc cag ctc aag aag cac acc gac gtg gcc tcc
ggc cag 3069Thr Ser Gly Gln Leu Lys Lys His Thr Asp Val Ala Ser Gly
Gln 1010 1015 1020gcc acc atc gtg cag ccc tgc acg ctc ggc gac ctc
ggt gac cgc 3114Ala Thr Ile Val Gln Pro Cys Thr Leu Gly Asp Leu Gly
Asp Arg 1025 1030 1035tcc ttc atg gag acc tac ggc gtc gtc gcc ccg
ctg tac acg ggc 3159Ser Phe Met Glu Thr Tyr Gly Val Val Ala Pro Leu
Tyr Thr Gly 1040 1045 1050gcc atg gcc aag ggc att gcc tcg gcg gac
ctc gtc atc gcc gcc 3204Ala Met Ala Lys Gly Ile Ala Ser Ala Asp Leu
Val Ile Ala Ala 1055 1060 1065ggc aag cgc aag atc ctc ggc tcc ttt
ggc gcc ggc ggc ctc ccc 3249Gly Lys Arg Lys Ile Leu Gly Ser Phe Gly
Ala Gly Gly Leu Pro 1070 1075 1080atg cac cac gtg cgc gcc gcc ctc
gag aag atc cag gcc gcc ctg 3294Met His His Val Arg Ala Ala Leu Glu
Lys Ile Gln Ala Ala Leu 1085 1090 1095cct cag ggc ccc tac gcc gtc
aac ctc atc cac tcg cct ttt gac 3339Pro Gln Gly Pro Tyr Ala Val Asn
Leu Ile His Ser Pro Phe Asp 1100 1105 1110agc aac ctc gag aag ggc
aac gtc gat ctc ttc ctc gag aag ggc 3384Ser Asn Leu Glu Lys Gly Asn
Val Asp Leu Phe Leu Glu Lys Gly 1115 1120 1125gtc act gtg gtg gag
gcc tcg gca ttc atg acc ctc acc ccg cag 3429Val Thr Val Val Glu Ala
Ser Ala Phe Met Thr Leu Thr Pro Gln 1130 1135 1140gtc gtg cgc tac
cgc gcc gcc ggc ctc tcg cgc aac gcc gac ggt 3474Val Val Arg Tyr Arg
Ala Ala Gly Leu Ser Arg Asn Ala Asp Gly 1145 1150 1155tcg gtc aac
atc cgc aac cgc atc atc ggc aag gtc tcg cgc acc 3519Ser Val Asn Ile
Arg Asn Arg Ile Ile Gly Lys Val Ser Arg Thr 1160 1165 1170gag ctc
gcc gag atg ttc atc cgc ccg gcc ccg gag cac ctc ctc 3564Glu Leu Ala
Glu Met Phe Ile Arg Pro Ala Pro Glu His Leu Leu 1175 1180 1185gag
aag ctc atc gcc tcg ggc gag atc acc cag gag cag gcc gag 3609Glu Lys
Leu Ile Ala Ser Gly Glu Ile Thr Gln Glu Gln Ala Glu 1190 1195
1200ctc gcg cgc cgc gtt ccc gtc gcc gac gat atc gct gtc gag gct
3654Leu Ala Arg Arg Val Pro Val Ala Asp Asp Ile Ala Val Glu Ala
1205 1210 1215gac tcg ggc ggc cac acc gac aac cgc ccc atc cac gtc
atc ctc 3699Asp Ser Gly Gly His Thr Asp Asn Arg Pro Ile His Val Ile
Leu 1220 1225 1230ccg ctc atc atc aac ctc cgc aac cgc ctg cac cgc
gag tgc ggc 3744Pro Leu Ile Ile Asn Leu Arg Asn Arg Leu His Arg Glu
Cys Gly 1235 1240 1245tac ccc gcg cac ctc cgc gtc cgc gtt ggc gcc
ggc ggt ggc gtc 3789Tyr Pro Ala His Leu Arg Val Arg Val Gly Ala Gly
Gly Gly Val 1250 1255 1260ggc tgc ccg cag gcc gcc gcc gcc gcg ctc
acc atg ggc gcc gcc 3834Gly Cys Pro Gln Ala Ala Ala Ala Ala Leu Thr
Met Gly Ala Ala 1265 1270 1275ttc atc gtc acc ggc act gtc aac cag
gtc gcc aag cag tcc ggc 3879Phe Ile Val Thr Gly Thr Val Asn Gln Val
Ala Lys Gln Ser Gly 1280 1285 1290acc tgc gac aac gtg cgc aag cag
ctc tcg cag gcc acc tac tcg 3924Thr Cys Asp Asn Val Arg Lys Gln Leu
Ser Gln Ala Thr Tyr Ser 1295 1300 1305gat atc tgc atg gcc ccg gcc
gcc gac atg ttc gag gag ggc gtc 3969Asp Ile Cys Met Ala Pro Ala Ala
Asp Met Phe Glu Glu Gly Val 1310 1315 1320aag ctc cag gtc ctc aag
aag gga acc atg ttc ccc tcg cgc gcc 4014Lys Leu Gln Val Leu Lys Lys
Gly Thr Met Phe Pro Ser Arg Ala 1325 1330 1335aac aag ctc tac gag
ctc ttt tgc aag tac gac tcc ttc gac tcc 4059Asn Lys Leu Tyr Glu Leu
Phe Cys Lys Tyr Asp Ser Phe Asp Ser 1340 1345 1350atg cct cct gcc
gag ctc gag cgc atc gag aag cgt atc ttc aag 4104Met Pro Pro Ala Glu
Leu Glu Arg Ile Glu Lys Arg Ile Phe Lys 1355 1360 1365cgc gca ctc
cag gag gtc tgg gag gag acc aag gac ttt tac att 4149Arg Ala Leu Gln
Glu Val Trp Glu Glu Thr Lys Asp Phe Tyr Ile 1370 1375 1380aac ggt
ctc aag aac ccg gag aag atc cag cgc gcc gag cac gac 4194Asn Gly Leu
Lys Asn Pro Glu Lys Ile Gln Arg Ala Glu His Asp 1385 1390 1395ccc
aag ctc aag atg tcg ctc tgc ttc cgc tgg tac ctt ggt ctt 4239Pro Lys
Leu Lys Met Ser Leu Cys Phe Arg Trp Tyr Leu Gly Leu 1400 1405
1410gcc agc cgc tgg gcc aac atg ggc gcc ccg gac cgc gtc atg gac
4284Ala Ser Arg Trp Ala Asn Met Gly Ala Pro Asp Arg Val Met Asp
1415 1420 1425tac cag gtc tgg tgt ggc ccg gcc att ggc gcc ttc aac
gac ttc 4329Tyr Gln Val Trp Cys Gly Pro Ala Ile Gly Ala Phe Asn Asp
Phe 1430 1435 1440atc aag ggc acc tac ctc gac ccc gct gtc tcc aac
gag tac ccc 4374Ile Lys Gly Thr Tyr Leu Asp Pro Ala Val Ser Asn Glu
Tyr Pro 1445 1450 1455tgt gtc gtc cag atc aac ctg caa atc ctc cgt
ggt gcc tgc tac 4419Cys Val Val Gln Ile Asn Leu Gln Ile Leu Arg Gly
Ala Cys Tyr 1460 1465 1470ctg cgc cgt ctc aac gcc ctg cgc aac gac
ccg cgc att gac ctc 4464Leu Arg Arg Leu Asn Ala Leu Arg Asn Asp Pro
Arg Ile Asp Leu 1475 1480 1485gag acc gag gat gct gcc ttt gtc tac
gag ccc acc aac gcg ctc 4509Glu Thr Glu Asp Ala Ala Phe Val Tyr Glu
Pro Thr Asn Ala Leu 1490 1495 150061503PRTSchizochytrium sp. 6Met
Ala Leu Arg Val Lys Thr Asn Lys Lys Pro Cys Trp Glu Met Thr1 5 10
15Lys Glu Glu Leu Thr Ser Gly Lys Thr Glu Val Phe Asn Tyr Glu Glu
20 25 30Leu Leu Glu Phe Ala Glu Gly Asp Ile Ala Lys Val Phe Gly Pro
Glu 35 40 45Phe Ala Val Ile Asp Lys Tyr Pro Arg Arg Val Arg Leu Pro
Ala Arg 50 55 60Glu Tyr Leu Leu Val Thr Arg Val Thr Leu Met Asp Ala
Glu Val Asn65 70 75 80Asn Tyr Arg Val Gly Ala Arg Met Val Thr Glu
Tyr Asp Leu Pro Val 85 90 95Asn Gly Glu Leu Ser Glu Gly Gly Asp Cys
Pro Trp Ala Val Leu Val 100 105 110Glu Ser Gly Gln Cys Asp Leu Met
Leu Ile Ser Tyr Met Gly Ile Asp 115 120 125Phe Gln Asn Gln Gly Asp
Arg Val Tyr Arg Leu Leu Asn Thr Thr Leu 130 135 140Thr Phe Tyr Gly
Val Ala His Glu Gly Glu Thr Leu Glu Tyr Asp Ile145 150 155 160Arg
Val Thr Gly Phe Ala Lys Arg Leu Asp Gly Gly Ile Ser Met Phe 165 170
175Phe Phe Glu Tyr Asp Cys Tyr Val Asn Gly Arg Leu Leu Ile Glu Met
180 185 190Arg Asp Gly Cys Ala Gly Phe Phe Thr Asn Glu Glu Leu Asp
Ala Gly 195 200 205Lys Gly Val Val Phe Thr Arg Gly Asp Leu Ala Ala
Arg Ala Lys Ile 210 215 220Pro Lys Gln Asp Val Ser Pro Tyr Ala Val
Ala Pro Cys Leu His Lys225 230 235 240Thr Lys Leu Asn Glu Lys Glu
Met Gln Thr Leu Val Asp Lys Asp Trp 245 250 255Ala Ser Val Phe Gly
Ser Lys Asn Gly Met Pro Glu Ile Asn Tyr Lys 260 265 270Leu Cys Ala
Arg Lys Met Leu Met Ile Asp Arg Val Thr Ser Ile Asp 275 280 285His
Lys Gly Gly Val Tyr Gly Leu Gly Gln Leu Val Gly Glu Lys Ile 290 295
300Leu Glu Arg Asp His Trp Tyr Phe Pro Cys His Phe Val Lys Asp
Gln305 310 315 320Val Met Ala Gly Ser Leu Val Ser Asp Gly Cys Ser
Gln Met Leu Lys 325 330 335Met Tyr Met Ile Trp Leu Gly Leu His Leu
Thr Thr Gly Pro Phe Asp 340 345 350Phe Arg Pro Val Asn Gly His Pro
Asn Lys Val Arg Cys Arg Gly Gln 355 360 365Ile Ser Pro His Lys Gly
Lys Leu Val Tyr Val Met Glu Ile Lys Glu 370 375 380Met Gly Phe Asp
Glu Asp Asn Asp Pro Tyr Ala Ile Ala Asp Val Asn385 390 395 400Ile
Ile Asp Val Asp Phe Glu Lys Gly Gln Asp Phe Ser Leu Asp Arg 405 410
415Ile Ser Asp Tyr Gly Lys Gly Asp Leu Asn Lys Lys Ile Val Val Asp
420 425 430Phe Lys Gly Ile Ala Leu Lys Met Gln Lys Arg Ser Thr Asn
Lys Asn 435 440 445Pro Ser Lys Val Gln Pro Val Phe Ala Asn Gly Ala
Ala Thr Val Gly 450 455 460Pro Glu Ala Ser Lys Ala Ser Ser Gly Ala
Ser Ala Ser Ala Ser Ala465 470 475 480Ala Pro Ala Lys Pro Ala Phe
Ser Ala Asp Val Leu Ala Pro Lys Pro 485 490 495Val Ala Leu Pro Glu
His Ile Leu Lys Gly Asp Ala Leu Ala Pro Lys 500 505 510Glu Met Ser
Trp His Pro Met Ala Arg Ile Pro Gly Asn Pro Thr Pro 515 520 525Ser
Phe Ala Pro Ser Ala Tyr Lys Pro Arg Asn Ile Ala Phe Thr Pro 530 535
540Phe Pro Gly Asn Pro Asn Asp Asn Asp His Thr Pro Gly Lys Met
Pro545 550 555 560Leu Thr Trp Phe Asn Met Ala Glu Phe Met Ala Gly
Lys Val Ser Met 565 570 575Cys Leu Gly Pro Glu Phe Ala Lys Phe Asp
Asp Ser Asn Thr Ser Arg 580 585 590Ser Pro Ala Trp Asp Leu Ala Leu
Val Thr Arg Ala Val Ser Val Ser 595 600 605Asp Leu Lys His Val Asn
Tyr Arg Asn Ile Asp Leu Asp Pro Ser Lys 610 615 620Gly Thr Met Val
Gly Glu Phe Asp Cys Pro Ala Asp Ala Trp Phe Tyr625 630 635 640Lys
Gly Ala Cys Asn Asp Ala His Met Pro Tyr Ser Ile Leu Met Glu 645 650
655Ile Ala Leu Gln Thr Ser Gly Val Leu Thr Ser Val Leu Lys Ala Pro
660 665 670Leu Thr Met Glu Lys Asp Asp Ile Leu Phe Arg Asn Leu Asp
Ala Asn 675 680 685Ala Glu Phe Val Arg Ala Asp Leu Asp Tyr Arg Gly
Lys Thr Ile Arg 690 695 700Asn Val Thr Lys Cys Thr Gly Tyr Ser Met
Leu Gly Glu Met Gly Val705 710 715 720His Arg Phe Thr Phe Glu Leu
Tyr Val Asp Asp Val Leu Phe Tyr Lys 725 730 735Gly Ser Thr Ser Phe
Gly Trp Phe Val Pro Glu Val Phe Ala Ala Gln 740 745 750Ala Gly Leu
Asp Asn Gly Arg Lys Ser Glu Pro Trp Phe Ile Glu Asn 755 760 765Lys
Val Pro Ala Ser Gln Val Ser Ser Phe Asp Val Arg Pro Asn Gly 770 775
780Ser Gly Arg Thr Ala Ile Phe Ala Asn Ala Pro Ser Gly Ala Gln
Leu785 790 795 800Asn Arg Arg Thr Asp Gln Gly Gln Tyr Leu Asp Ala
Val Asp Ile Val 805 810 815Ser Gly Ser Gly Lys Lys Ser Leu Gly Tyr
Ala His Gly Ser Lys Thr 820 825 830Val Asn Pro Asn Asp Trp Phe Phe
Ser Cys His Phe Trp Phe Asp Ser 835 840 845Val Met Pro Gly Ser Leu
Gly Val Glu Ser Met Phe Gln Leu Val Glu 850 855 860Ala Ile Ala Ala
His Glu Asp Leu Ala Gly Lys Ala Arg His Cys Gln865 870 875 880Pro
His Leu Cys Ala Arg Pro Arg Ala Arg Ser Ser Trp Lys Tyr Arg 885 890
895Gly Gln Leu Thr Pro Lys Ser Lys Lys Met Asp Ser Glu Val His Ile
900 905 910Val Ser Val Asp Ala His Asp Gly Val Val Asp Leu Val Ala
Asp Gly 915 920 925Phe Leu Trp Ala Asp Ser Leu Arg Val Tyr Ser Val
Ser Asn Ile Arg 930 935 940Val Arg Ile Ala Ser Gly Glu Ala Pro Ala
Ala Ala Ser Ser Ala Ala945 950 955 960Ser Val Gly Ser Ser Ala Ser
Ser Val Glu Arg Thr Arg Ser Ser Pro 965 970 975Ala Val Ala Ser Gly
Pro Ala Gln Thr Ile Asp Leu Lys Gln Leu Lys 980 985 990Thr Glu Leu
Leu Glu Leu Asp Ala Pro Leu Tyr Leu Ser Gln Asp Pro 995 1000
1005Thr Ser Gly Gln Leu Lys Lys His Thr Asp Val Ala Ser Gly Gln
1010 1015 1020Ala Thr Ile Val Gln Pro Cys Thr Leu Gly Asp Leu Gly
Asp Arg 1025 1030 1035Ser Phe Met Glu Thr Tyr Gly Val Val Ala Pro
Leu Tyr Thr Gly 1040 1045 1050Ala Met Ala Lys Gly Ile Ala Ser Ala
Asp Leu Val Ile Ala Ala 1055 1060 1065Gly Lys Arg Lys Ile Leu Gly
Ser Phe Gly Ala Gly Gly Leu Pro 1070 1075 1080Met His His Val Arg
Ala Ala Leu Glu Lys Ile Gln Ala Ala Leu 1085 1090 1095Pro Gln Gly
Pro Tyr Ala Val Asn Leu Ile His Ser Pro Phe Asp 1100 1105 1110Ser
Asn Leu Glu Lys Gly Asn Val Asp Leu Phe Leu Glu Lys Gly 1115 1120
1125Val Thr Val Val Glu Ala Ser Ala Phe Met Thr Leu Thr Pro Gln
1130 1135 1140Val Val Arg Tyr Arg Ala Ala Gly Leu Ser Arg Asn Ala
Asp Gly 1145 1150 1155Ser Val Asn Ile Arg Asn Arg Ile Ile Gly Lys
Val Ser Arg Thr 1160 1165 1170Glu Leu Ala Glu Met Phe Ile Arg Pro
Ala Pro Glu His Leu Leu 1175 1180 1185Glu Lys Leu Ile Ala Ser Gly
Glu Ile Thr Gln Glu Gln Ala Glu 1190 1195 1200Leu Ala Arg Arg Val
Pro Val Ala Asp Asp Ile Ala Val Glu Ala 1205 1210 1215Asp Ser Gly
Gly His Thr Asp Asn Arg Pro Ile His Val Ile Leu 1220 1225 1230Pro
Leu Ile Ile Asn Leu Arg Asn Arg Leu His Arg Glu Cys Gly 1235 1240
1245Tyr Pro Ala His Leu Arg Val Arg Val Gly Ala Gly Gly Gly Val
1250 1255 1260Gly Cys Pro Gln Ala Ala Ala Ala Ala Leu Thr Met Gly
Ala Ala 1265 1270 1275Phe Ile Val Thr Gly Thr Val Asn Gln Val Ala
Lys Gln Ser Gly 1280 1285 1290Thr Cys Asp Asn Val Arg Lys Gln Leu
Ser Gln Ala Thr Tyr Ser 1295 1300 1305Asp Ile Cys Met Ala Pro Ala
Ala Asp Met Phe Glu Glu Gly Val 1310 1315 1320Lys Leu Gln Val Leu
Lys Lys Gly Thr Met Phe Pro Ser Arg Ala 1325 1330 1335Asn Lys Leu
Tyr Glu Leu Phe Cys Lys Tyr Asp Ser Phe Asp Ser 1340 1345 1350Met
Pro Pro Ala Glu Leu Glu Arg Ile Glu Lys Arg Ile Phe Lys 1355 1360
1365Arg Ala Leu Gln Glu Val Trp Glu Glu Thr Lys Asp Phe Tyr Ile
1370 1375 1380Asn Gly Leu Lys Asn Pro Glu Lys Ile Gln Arg Ala Glu
His Asp 1385 1390 1395Pro Lys Leu Lys Met Ser Leu Cys Phe Arg Trp
Tyr Leu Gly Leu 1400 1405 1410Ala Ser Arg Trp Ala Asn Met Gly Ala
Pro Asp Arg Val Met Asp 1415 1420 1425Tyr Gln Val Trp Cys Gly Pro
Ala Ile Gly Ala Phe Asn Asp Phe 1430 1435 1440Ile Lys Gly Thr Tyr
Leu Asp Pro Ala Val Ser Asn Glu Tyr Pro 1445 1450 1455Cys Val Val
Gln Ile Asn Leu Gln Ile Leu Arg Gly Ala Cys Tyr 1460 1465 1470Leu
Arg Arg Leu Asn Ala Leu Arg Asn Asp Pro Arg Ile Asp Leu 1475 1480
1485Glu Thr Glu Asp Ala Ala Phe Val Tyr Glu Pro Thr Asn Ala Leu
1490 1495 15007600DNASchizochytrium sp.CDS(1)..(600) 7atg gcg gcc
cgt ctg cag gag caa aag gga ggc gag atg gat acc cgc 48Met Ala Ala
Arg Leu Gln Glu Gln Lys Gly Gly Glu Met Asp Thr Arg1 5 10 15att gcc
atc atc ggc atg tcg gcc atc ctc ccc tgc ggc acg acc gtg 96Ile Ala
Ile Ile Gly Met Ser Ala Ile Leu Pro Cys Gly Thr Thr Val 20 25 30cgc
gag tcg tgg gag acc atc cgc gcc ggc atc gac tgc ctg tcg gat 144Arg
Glu Ser Trp Glu Thr Ile Arg Ala Gly Ile Asp Cys Leu Ser Asp 35 40
45ctc ccc gag gac cgc gtc gac gtg acg gcg tac ttt gac ccc gtc aag
192Leu Pro Glu Asp Arg Val Asp Val Thr Ala Tyr Phe Asp Pro Val Lys
50 55 60acc acc aag gac aag atc tac tgc aag cgc ggt ggc ttc att ccc
gag
240Thr Thr Lys Asp Lys Ile Tyr Cys Lys Arg Gly Gly Phe Ile Pro
Glu65 70 75 80tac gac ttt gac gcc cgc gag ttc gga ctc aac atg ttc
cag atg gag 288Tyr Asp Phe Asp Ala Arg Glu Phe Gly Leu Asn Met Phe
Gln Met Glu 85 90 95gac tcg gac gca aac cag acc atc tcg ctt ctc aag
gtc aag gag gcc 336Asp Ser Asp Ala Asn Gln Thr Ile Ser Leu Leu Lys
Val Lys Glu Ala 100 105 110ctc cag gac gcc ggc atc gac gcc ctc ggc
aag gaa aag aag aac atc 384Leu Gln Asp Ala Gly Ile Asp Ala Leu Gly
Lys Glu Lys Lys Asn Ile 115 120 125ggc tgc gtg ctc ggc att ggc ggc
ggc caa aag tcc agc cac gag ttc 432Gly Cys Val Leu Gly Ile Gly Gly
Gly Gln Lys Ser Ser His Glu Phe 130 135 140tac tcg cgc ctt aat tat
gtt gtc gtg gag aag gtc ctc cgc aag atg 480Tyr Ser Arg Leu Asn Tyr
Val Val Val Glu Lys Val Leu Arg Lys Met145 150 155 160ggc atg ccc
gag gag gac gtc aag gtc gcc gtc gaa aag tac aag gcc 528Gly Met Pro
Glu Glu Asp Val Lys Val Ala Val Glu Lys Tyr Lys Ala 165 170 175aac
ttc ccc gag tgg cgc ctc gac tcc ttc cct ggc ttc ctc ggc aac 576Asn
Phe Pro Glu Trp Arg Leu Asp Ser Phe Pro Gly Phe Leu Gly Asn 180 185
190gtc acc gcc ggt cgc tgc acc aac 600Val Thr Ala Gly Arg Cys Thr
Asn 195 2008200PRTSchizochytrium sp. 8Met Ala Ala Arg Leu Gln Glu
Gln Lys Gly Gly Glu Met Asp Thr Arg1 5 10 15Ile Ala Ile Ile Gly Met
Ser Ala Ile Leu Pro Cys Gly Thr Thr Val 20 25 30Arg Glu Ser Trp Glu
Thr Ile Arg Ala Gly Ile Asp Cys Leu Ser Asp 35 40 45Leu Pro Glu Asp
Arg Val Asp Val Thr Ala Tyr Phe Asp Pro Val Lys 50 55 60Thr Thr Lys
Asp Lys Ile Tyr Cys Lys Arg Gly Gly Phe Ile Pro Glu65 70 75 80Tyr
Asp Phe Asp Ala Arg Glu Phe Gly Leu Asn Met Phe Gln Met Glu 85 90
95Asp Ser Asp Ala Asn Gln Thr Ile Ser Leu Leu Lys Val Lys Glu Ala
100 105 110Leu Gln Asp Ala Gly Ile Asp Ala Leu Gly Lys Glu Lys Lys
Asn Ile 115 120 125Gly Cys Val Leu Gly Ile Gly Gly Gly Gln Lys Ser
Ser His Glu Phe 130 135 140Tyr Ser Arg Leu Asn Tyr Val Val Val Glu
Lys Val Leu Arg Lys Met145 150 155 160Gly Met Pro Glu Glu Asp Val
Lys Val Ala Val Glu Lys Tyr Lys Ala 165 170 175Asn Phe Pro Glu Trp
Arg Leu Asp Ser Phe Pro Gly Phe Leu Gly Asn 180 185 190Val Thr Ala
Gly Arg Cys Thr Asn 195 20091278DNASchizochytrium sp.CDS(1)..(1278)
9gat gtc acc aag gag gcc tgg cgc ctc ccc cgc gag ggc gtc agc ttc
48Asp Val Thr Lys Glu Ala Trp Arg Leu Pro Arg Glu Gly Val Ser Phe1
5 10 15cgc gcc aag ggc atc gcc acc aac ggc gct gtc gcc gcg ctc ttc
tcc 96Arg Ala Lys Gly Ile Ala Thr Asn Gly Ala Val Ala Ala Leu Phe
Ser 20 25 30ggc cag ggc gcg cag tac acg cac atg ttt agc gag gtg gcc
atg aac 144Gly Gln Gly Ala Gln Tyr Thr His Met Phe Ser Glu Val Ala
Met Asn 35 40 45tgg ccc cag ttc cgc cag agc att gcc gcc atg gac gcc
gcc cag tcc 192Trp Pro Gln Phe Arg Gln Ser Ile Ala Ala Met Asp Ala
Ala Gln Ser 50 55 60aag gtc gct gga agc gac aag gac ttt gag cgc gtc
tcc cag gtc ctc 240Lys Val Ala Gly Ser Asp Lys Asp Phe Glu Arg Val
Ser Gln Val Leu65 70 75 80tac ccg cgc aag ccg tac gag cgt gag ccc
gag cag aac ccc aag aag 288Tyr Pro Arg Lys Pro Tyr Glu Arg Glu Pro
Glu Gln Asn Pro Lys Lys 85 90 95atc tcc ctc acc gcc tac tcg cag ccc
tcg acc ctg gcc tgc gct ctc 336Ile Ser Leu Thr Ala Tyr Ser Gln Pro
Ser Thr Leu Ala Cys Ala Leu 100 105 110ggt gcc ttt gag atc ttc aag
gag gcc ggc ttc acc ccg gac ttt gcc 384Gly Ala Phe Glu Ile Phe Lys
Glu Ala Gly Phe Thr Pro Asp Phe Ala 115 120 125gcc ggc cat tcg ctc
ggt gag ttc gcc gcc ctc tac gcc gcg ggc tgc 432Ala Gly His Ser Leu
Gly Glu Phe Ala Ala Leu Tyr Ala Ala Gly Cys 130 135 140gtc gac cgc
gac gag ctc ttt gag ctt gtc tgc cgc cgc gcc cgc atc 480Val Asp Arg
Asp Glu Leu Phe Glu Leu Val Cys Arg Arg Ala Arg Ile145 150 155
160atg ggc ggc aag gac gca ccg gcc acc ccc aag gga tgc atg gcc gcc
528Met Gly Gly Lys Asp Ala Pro Ala Thr Pro Lys Gly Cys Met Ala Ala
165 170 175gtc att ggc ccc aac gcc gag aac atc aag gtc cag gcc gcc
aac gtc 576Val Ile Gly Pro Asn Ala Glu Asn Ile Lys Val Gln Ala Ala
Asn Val 180 185 190tgg ctc ggc aac tcc aac tcg cct tcg cag acc gtc
atc acc ggc tcc 624Trp Leu Gly Asn Ser Asn Ser Pro Ser Gln Thr Val
Ile Thr Gly Ser 195 200 205gtc gaa ggt atc cag gcc gag agc gcc cgc
ctc cag aag gag ggc ttc 672Val Glu Gly Ile Gln Ala Glu Ser Ala Arg
Leu Gln Lys Glu Gly Phe 210 215 220cgc gtc gtg cct ctt gcc tgc gag
agc gcc ttc cac tcg ccc cag atg 720Arg Val Val Pro Leu Ala Cys Glu
Ser Ala Phe His Ser Pro Gln Met225 230 235 240gag aac gcc tcg tcg
gcc ttc aag gac gtc atc tcc aag gtc tcc ttc 768Glu Asn Ala Ser Ser
Ala Phe Lys Asp Val Ile Ser Lys Val Ser Phe 245 250 255cgc acc ccc
aag gcc gag acc aag ctc ttc agc aac gtc tct ggc gag 816Arg Thr Pro
Lys Ala Glu Thr Lys Leu Phe Ser Asn Val Ser Gly Glu 260 265 270acc
tac ccc acg gac gcc cgc gag atg ctt acg cag cac atg acc agc 864Thr
Tyr Pro Thr Asp Ala Arg Glu Met Leu Thr Gln His Met Thr Ser 275 280
285agc gtc aag ttc ctc acc cag gtc cgc aac atg cac cag gcc ggt gcg
912Ser Val Lys Phe Leu Thr Gln Val Arg Asn Met His Gln Ala Gly Ala
290 295 300cgc atc ttt gtc gag ttc gga ccc aag cag gtg ctc tcc aag
ctt gtc 960Arg Ile Phe Val Glu Phe Gly Pro Lys Gln Val Leu Ser Lys
Leu Val305 310 315 320tcc gag acc ctc aag gat gac ccc tcg gtt gtc
acc gtc tct gtc aac 1008Ser Glu Thr Leu Lys Asp Asp Pro Ser Val Val
Thr Val Ser Val Asn 325 330 335ccg gcc tcg ggc acg gat tcg gac atc
cag ctc cgc gac gcg gcc gtc 1056Pro Ala Ser Gly Thr Asp Ser Asp Ile
Gln Leu Arg Asp Ala Ala Val 340 345 350cag ctc gtt gtc gct ggc gtc
aac ctt cag ggc ttt gac aag tgg gac 1104Gln Leu Val Val Ala Gly Val
Asn Leu Gln Gly Phe Asp Lys Trp Asp 355 360 365gcc ccc gat gcc acc
cgc atg cag gcc atc aag aag aag cgc act acc 1152Ala Pro Asp Ala Thr
Arg Met Gln Ala Ile Lys Lys Lys Arg Thr Thr 370 375 380ctc cgc ctt
tcg gcc gcc acc tac gtc tcg gac aag acc aag aag gtc 1200Leu Arg Leu
Ser Ala Ala Thr Tyr Val Ser Asp Lys Thr Lys Lys Val385 390 395
400cgc gac gcc gcc atg aac gat ggc cgc tgc gtc acc tac ctc aag ggc
1248Arg Asp Ala Ala Met Asn Asp Gly Arg Cys Val Thr Tyr Leu Lys Gly
405 410 415gcc gca ccg ctc atc aag gcc ccg gag ccc 1278Ala Ala Pro
Leu Ile Lys Ala Pro Glu Pro 420 42510426PRTSchizochytrium sp. 10Asp
Val Thr Lys Glu Ala Trp Arg Leu Pro Arg Glu Gly Val Ser Phe1 5 10
15Arg Ala Lys Gly Ile Ala Thr Asn Gly Ala Val Ala Ala Leu Phe Ser
20 25 30Gly Gln Gly Ala Gln Tyr Thr His Met Phe Ser Glu Val Ala Met
Asn 35 40 45Trp Pro Gln Phe Arg Gln Ser Ile Ala Ala Met Asp Ala Ala
Gln Ser 50 55 60Lys Val Ala Gly Ser Asp Lys Asp Phe Glu Arg Val Ser
Gln Val Leu65 70 75 80Tyr Pro Arg Lys Pro Tyr Glu Arg Glu Pro Glu
Gln Asn Pro Lys Lys 85 90 95Ile Ser Leu Thr Ala Tyr Ser Gln Pro Ser
Thr Leu Ala Cys Ala Leu 100 105 110Gly Ala Phe Glu Ile Phe Lys Glu
Ala Gly Phe Thr Pro Asp Phe Ala 115 120 125Ala Gly His Ser Leu Gly
Glu Phe Ala Ala Leu Tyr Ala Ala Gly Cys 130 135 140Val Asp Arg Asp
Glu Leu Phe Glu Leu Val Cys Arg Arg Ala Arg Ile145 150 155 160Met
Gly Gly Lys Asp Ala Pro Ala Thr Pro Lys Gly Cys Met Ala Ala 165 170
175Val Ile Gly Pro Asn Ala Glu Asn Ile Lys Val Gln Ala Ala Asn Val
180 185 190Trp Leu Gly Asn Ser Asn Ser Pro Ser Gln Thr Val Ile Thr
Gly Ser 195 200 205Val Glu Gly Ile Gln Ala Glu Ser Ala Arg Leu Gln
Lys Glu Gly Phe 210 215 220Arg Val Val Pro Leu Ala Cys Glu Ser Ala
Phe His Ser Pro Gln Met225 230 235 240Glu Asn Ala Ser Ser Ala Phe
Lys Asp Val Ile Ser Lys Val Ser Phe 245 250 255Arg Thr Pro Lys Ala
Glu Thr Lys Leu Phe Ser Asn Val Ser Gly Glu 260 265 270Thr Tyr Pro
Thr Asp Ala Arg Glu Met Leu Thr Gln His Met Thr Ser 275 280 285Ser
Val Lys Phe Leu Thr Gln Val Arg Asn Met His Gln Ala Gly Ala 290 295
300Arg Ile Phe Val Glu Phe Gly Pro Lys Gln Val Leu Ser Lys Leu
Val305 310 315 320Ser Glu Thr Leu Lys Asp Asp Pro Ser Val Val Thr
Val Ser Val Asn 325 330 335Pro Ala Ser Gly Thr Asp Ser Asp Ile Gln
Leu Arg Asp Ala Ala Val 340 345 350Gln Leu Val Val Ala Gly Val Asn
Leu Gln Gly Phe Asp Lys Trp Asp 355 360 365Ala Pro Asp Ala Thr Arg
Met Gln Ala Ile Lys Lys Lys Arg Thr Thr 370 375 380Leu Arg Leu Ser
Ala Ala Thr Tyr Val Ser Asp Lys Thr Lys Lys Val385 390 395 400Arg
Asp Ala Ala Met Asn Asp Gly Arg Cys Val Thr Tyr Leu Lys Gly 405 410
415Ala Ala Pro Leu Ile Lys Ala Pro Glu Pro 420
425115PRTSchizochytrium sp.MISC_FEATURE(1)..(5)Xaa = any amino acid
11Gly His Ser Xaa Gly1 512258DNASchizochytrium sp.CDS(1)..(258)
12gct gtc tcg aac gag ctt ctt gag aag gcc gag act gtc gtc atg gag
48Ala Val Ser Asn Glu Leu Leu Glu Lys Ala Glu Thr Val Val Met Glu1
5 10 15gtc ctc gcc gcc aag acc ggc tac gag acc gac atg atc gag gct
gac 96Val Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala
Asp 20 25 30atg gag ctc gag acc gag ctc ggc att gac tcc atc aag cgt
gtc gag 144Met Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg
Val Glu 35 40 45atc ctc tcc gag gtc cag gcc atg ctc aat gtc gag gcc
aag gat gtc 192Ile Leu Ser Glu Val Gln Ala Met Leu Asn Val Glu Ala
Lys Asp Val 50 55 60gat gcc ctc agc cgc act cgc act gtt ggt gag gtt
gtc aac gcc atg 240Asp Ala Leu Ser Arg Thr Arg Thr Val Gly Glu Val
Val Asn Ala Met65 70 75 80aag gcc gag atc gct ggc 258Lys Ala Glu
Ile Ala Gly 851386PRTSchizochytrium sp. 13Ala Val Ser Asn Glu Leu
Leu Glu Lys Ala Glu Thr Val Val Met Glu1 5 10 15Val Leu Ala Ala Lys
Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp 20 25 30Met Glu Leu Glu
Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu 35 40 45Ile Leu Ser
Glu Val Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val 50 55 60Asp Ala
Leu Ser Arg Thr Arg Thr Val Gly Glu Val Val Asn Ala Met65 70 75
80Lys Ala Glu Ile Ala Gly 85145PRTSchizochytrium sp. 14Leu Gly Ile
Asp Ser1 51521PRTSchizochytrium sp. 15Ala Pro Ala Pro Val Lys Ala
Ala Ala Pro Ala Ala Pro Val Ala Ser1 5 10 15Ala Pro Ala Pro Ala
20163006DNASchizochytrium sp. 16gcccccgccc cggtcaaggc tgctgcgcct
gccgcccccg ttgcctcggc ccctgccccg 60gctgtctcga acgagcttct tgagaaggcc
gagactgtcg tcatggaggt cctcgccgcc 120aagaccggct acgagaccga
catgatcgag gctgacatgg agctcgagac cgagctcggc 180attgactcca
tcaagcgtgt cgagatcctc tccgaggtcc aggccatgct caatgtcgag
240gccaaggatg tcgatgccct cagccgcact cgcactgttg gtgaggttgt
caacgccatg 300aaggccgaga tcgctggcag ctctgccccg gcgcctgctg
ccgctgctcc ggctccggcc 360aaggctgccc ctgccgccgc tgcgcctgct
gtctcgaacg agcttctcga gaaggccgag 420accgtcgtca tggaggtcct
cgccgccaag actggctacg agactgacat gatcgagtcc 480gacatggagc
tcgagactga gctcggcatt gactccatca agcgtgtcga gatcctctcc
540gaggttcagg ccatgctcaa cgtcgaggcc aaggacgtcg acgctctcag
ccgcactcgc 600actgtgggtg aggtcgtcaa cgccatgaag gctgagatcg
ctggtggctc tgccccggcg 660cctgccgccg ctgccccagg tccggctgct
gccgcccctg cgcctgccgc cgccgcccct 720gctgtctcga acgagcttct
tgagaaggcc gagaccgtcg tcatggaggt cctcgccgcc 780aagactggct
acgagactga catgatcgag tccgacatgg agctcgagac cgagctcggc
840attgactcca tcaagcgtgt cgagattctc tccgaggtcc aggccatgct
caacgtcgag 900gccaaggacg tcgacgctct cagccgcacc cgcactgttg
gcgaggtcgt cgatgccatg 960aaggccgaga tcgctggtgg ctctgccccg
gcgcctgccg ccgctgctcc tgctccggct 1020gctgccgccc ctgcgcctgc
cgcccctgcg cctgctgtct cgagcgagct tctcgagaag 1080gccgagactg
tcgtcatgga ggtcctcgcc gccaagactg gctacgagac tgacatgatc
1140gagtccgaca tggagctcga gaccgagctc ggcattgact ccatcaagcg
tgtcgagatt 1200ctctccgagg tccaggccat gctcaacgtc gaggccaagg
acgtcgacgc tctcagccgc 1260acccgcactg ttggcgaggt cgtcgatgcc
atgaaggccg agatcgctgg tggctctgcc 1320ccggcgcctg ccgccgctgc
tcctgctccg gctgctgccg cccctgcgcc tgccgcccct 1380gcgcctgccg
cccctgcgcc tgctgtctcg agcgagcttc tcgagaaggc cgagactgtc
1440gtcatggagg tcctcgccgc caagactggc tacgagactg acatgattga
gtccgacatg 1500gagctcgaga ccgagctcgg cattgactcc atcaagcgtg
tcgagattct ctccgaggtt 1560caggccatgc tcaacgtcga ggccaaggac
gtcgacgctc tcagccgcac tcgcactgtt 1620ggtgaggtcg tcgatgccat
gaaggctgag atcgctggca gctccgcctc ggcgcctgcc 1680gccgctgctc
ctgctccggc tgctgccgct cctgcgcccg ctgccgccgc ccctgctgtc
1740tcgaacgagc ttctcgagaa agccgagact gtcgtcatgg aggtcctcgc
cgccaagact 1800ggctacgaga ctgacatgat cgagtccgac atggagctcg
agactgagct cggcattgac 1860tccatcaagc gtgtcgagat cctctccgag
gttcaggcca tgctcaacgt cgaggccaag 1920gacgtcgatg ccctcagccg
cacccgcact gttggcgagg ttgtcgatgc catgaaggcc 1980gagatcgctg
gtggctctgc cccggcgcct gccgccgctg cccctgctcc ggctgccgcc
2040gcccctgctg tctcgaacga gcttctcgag aaggccgaga ctgtcgtcat
ggaggtcctc 2100gccgccaaga ctggctacga gaccgacatg atcgagtccg
acatggagct cgagaccgag 2160ctcggcattg actccatcaa gcgtgtcgag
attctctccg aggttcaggc catgctcaac 2220gtcgaggcca aggacgtcga
tgctctcagc cgcactcgca ctgttggcga ggtcgtcgat 2280gccatgaagg
ctgagatcgc cggcagctcc gccccggcgc ctgccgccgc tgctcctgct
2340ccggctgctg ccgctcctgc gcccgctgcc gctgcccctg ctgtctcgag
cgagcttctc 2400gagaaggccg agaccgtcgt catggaggtc ctcgccgcca
agactggcta cgagactgac 2460atgattgagt ccgacatgga gctcgagact
gagctcggca ttgactccat caagcgtgtc 2520gagatcctct ccgaggttca
ggccatgctc aacgtcgagg ccaaggacgt cgatgccctc 2580agccgcaccc
gcactgttgg cgaggttgtc gatgccatga aggccgagat cgctggtggc
2640tctgccccgg cgcctgccgc cgctgcccct gctccggctg ccgccgcccc
tgctgtctcg 2700aacgagcttc ttgagaaggc cgagaccgtc gtcatggagg
tcctcgccgc caagactggc 2760tacgagaccg acatgatcga gtccgacatg
gagctcgaga ccgagctcgg cattgactcc 2820atcaagcgtg tcgagattct
ctccgaggtt caggccatgc tcaacgtcga ggccaaggac 2880gtcgacgctc
tcagccgcac tcgcactgtt ggcgaggtcg tcgatgccat gaaggctgag
2940atcgctggtg gctctgcccc ggcgcctgcc gccgctgctc ctgcctcggc
tggcgccgcg 3000cctgcg 3006172133DNASchizochytrium sp.CDS(1)..(2133)
17ttt ggc gct ctc ggc ggc ttc atc tcg cag cag gcg gag cgc ttc gag
48Phe Gly Ala Leu Gly Gly Phe Ile Ser Gln Gln Ala Glu Arg Phe Glu1
5 10 15ccc gcc gaa atc ctc ggc ttc acg ctc atg tgc gcc aag ttc gcc
aag 96Pro Ala Glu Ile Leu Gly Phe Thr Leu Met Cys Ala Lys Phe Ala
Lys 20 25 30gct tcc ctc tgc acg gct gtg gct ggc ggc cgc ccg gcc ttt
atc ggt 144Ala Ser Leu Cys Thr Ala Val Ala Gly Gly Arg Pro Ala Phe
Ile Gly 35 40 45gtg gcg cgc ctt gac ggc cgc ctc gga ttc act tcg cag
ggc act tct 192Val Ala Arg Leu Asp Gly Arg Leu Gly Phe Thr Ser Gln
Gly Thr Ser 50 55 60gac gcg ctc aag cgt gcc cag cgt ggt gcc atc ttt
ggc ctc tgc aag 240Asp Ala Leu Lys Arg Ala Gln Arg Gly Ala Ile Phe
Gly Leu Cys Lys65 70 75
80acc atc ggc ctc gag tgg tcc gag tct gac gtc ttt tcc cgc ggc gtg
288Thr Ile Gly Leu Glu Trp Ser Glu Ser Asp Val Phe Ser Arg Gly Val
85 90 95gac att gct cag ggc atg cac ccc gag gat gcc gcc gtg gcg att
gtg 336Asp Ile Ala Gln Gly Met His Pro Glu Asp Ala Ala Val Ala Ile
Val 100 105 110cgc gag atg gcg tgc gct gac att cgc att cgc gag gtc
ggc att ggc 384Arg Glu Met Ala Cys Ala Asp Ile Arg Ile Arg Glu Val
Gly Ile Gly 115 120 125gca aac cag cag cgc tgc acg atc cgt gcc gcc
aag ctc gag acc ggc 432Ala Asn Gln Gln Arg Cys Thr Ile Arg Ala Ala
Lys Leu Glu Thr Gly 130 135 140aac ccg cag cgc cag atc gcc aag gac
gac gtg ctg ctc gtt tct ggc 480Asn Pro Gln Arg Gln Ile Ala Lys Asp
Asp Val Leu Leu Val Ser Gly145 150 155 160ggc gct cgc ggc atc acg
cct ctt tgc atc cgg gag atc acg cgc cag 528Gly Ala Arg Gly Ile Thr
Pro Leu Cys Ile Arg Glu Ile Thr Arg Gln 165 170 175atc gcg ggc ggc
aag tac att ctg ctt ggc cgc agc aag gtc tct gcg 576Ile Ala Gly Gly
Lys Tyr Ile Leu Leu Gly Arg Ser Lys Val Ser Ala 180 185 190agc gaa
ccg gca tgg tgc gct ggc atc act gac gag aag gct gtg caa 624Ser Glu
Pro Ala Trp Cys Ala Gly Ile Thr Asp Glu Lys Ala Val Gln 195 200
205aag gct gct acc cag gag ctc aag cgc gcc ttt agc gct ggc gag ggc
672Lys Ala Ala Thr Gln Glu Leu Lys Arg Ala Phe Ser Ala Gly Glu Gly
210 215 220ccc aag ccc acg ccc cgc gct gtc act aag ctt gtg ggc tct
gtt ctt 720Pro Lys Pro Thr Pro Arg Ala Val Thr Lys Leu Val Gly Ser
Val Leu225 230 235 240ggc gct cgc gag gtg cgc agc tct att gct gcg
att gaa gcg ctc ggc 768Gly Ala Arg Glu Val Arg Ser Ser Ile Ala Ala
Ile Glu Ala Leu Gly 245 250 255ggc aag gcc atc tac tcg tcg tgc gac
gtg aac tct gcc gcc gac gtg 816Gly Lys Ala Ile Tyr Ser Ser Cys Asp
Val Asn Ser Ala Ala Asp Val 260 265 270gcc aag gcc gtg cgc gat gcc
gag tcc cag ctc ggt gcc cgc gtc tcg 864Ala Lys Ala Val Arg Asp Ala
Glu Ser Gln Leu Gly Ala Arg Val Ser 275 280 285ggc atc gtt cat gcc
tcg ggc gtg ctc cgc gac cgt ctc atc gag aag 912Gly Ile Val His Ala
Ser Gly Val Leu Arg Asp Arg Leu Ile Glu Lys 290 295 300aag ctc ccc
gac gag ttc gac gcc gtc ttt ggc acc aag gtc acc ggt 960Lys Leu Pro
Asp Glu Phe Asp Ala Val Phe Gly Thr Lys Val Thr Gly305 310 315
320ctc gag aac ctc ctc gcc gcc gtc gac cgc gcc aac ctc aag cac atg
1008Leu Glu Asn Leu Leu Ala Ala Val Asp Arg Ala Asn Leu Lys His Met
325 330 335gtc ctc ttc agc tcg ctc gcc ggc ttc cac ggc aac gtc ggc
cag tct 1056Val Leu Phe Ser Ser Leu Ala Gly Phe His Gly Asn Val Gly
Gln Ser 340 345 350gac tac gcc atg gcc aac gag gcc ctt aac aag atg
ggc ctc gag ctc 1104Asp Tyr Ala Met Ala Asn Glu Ala Leu Asn Lys Met
Gly Leu Glu Leu 355 360 365gcc aag gac gtc tcg gtc aag tcg atc tgc
ttc ggt ccc tgg gac ggt 1152Ala Lys Asp Val Ser Val Lys Ser Ile Cys
Phe Gly Pro Trp Asp Gly 370 375 380ggc atg gtg acg ccg cag ctc aag
aag cag ttc cag gag atg ggc gtg 1200Gly Met Val Thr Pro Gln Leu Lys
Lys Gln Phe Gln Glu Met Gly Val385 390 395 400cag atc atc ccc cgc
gag ggc ggc gct gat acc gtg gcg cgc atc gtg 1248Gln Ile Ile Pro Arg
Glu Gly Gly Ala Asp Thr Val Ala Arg Ile Val 405 410 415ctc ggc tcc
tcg ccg gct gag atc ctt gtc ggc aac tgg cgc acc ccg 1296Leu Gly Ser
Ser Pro Ala Glu Ile Leu Val Gly Asn Trp Arg Thr Pro 420 425 430tcc
aag aag gtc ggc tcg gac acc atc acc ctg cac cgc aag att tcc 1344Ser
Lys Lys Val Gly Ser Asp Thr Ile Thr Leu His Arg Lys Ile Ser 435 440
445gcc aag tcc aac ccc ttc ctc gag gac cac gtc atc cag ggc cgc cgc
1392Ala Lys Ser Asn Pro Phe Leu Glu Asp His Val Ile Gln Gly Arg Arg
450 455 460gtg ctg ccc atg acg ctg gcc att ggc tcg ctc gcg gag acc
tgc ctc 1440Val Leu Pro Met Thr Leu Ala Ile Gly Ser Leu Ala Glu Thr
Cys Leu465 470 475 480ggc ctc ttc ccc ggc tac tcg ctc tgg gcc att
gac gac gcc cag ctc 1488Gly Leu Phe Pro Gly Tyr Ser Leu Trp Ala Ile
Asp Asp Ala Gln Leu 485 490 495ttc aag ggt gtc act gtc gac ggc gac
gtc aac tgc gag gtg acc ctc 1536Phe Lys Gly Val Thr Val Asp Gly Asp
Val Asn Cys Glu Val Thr Leu 500 505 510acc ccg tcg acg gcg ccc tcg
ggc cgc gtc aac gtc cag gcc acg ctc 1584Thr Pro Ser Thr Ala Pro Ser
Gly Arg Val Asn Val Gln Ala Thr Leu 515 520 525aag acc ttt tcc agc
ggc aag ctg gtc ccg gcc tac cgc gcc gtc atc 1632Lys Thr Phe Ser Ser
Gly Lys Leu Val Pro Ala Tyr Arg Ala Val Ile 530 535 540gtg ctc tcc
aac cag ggc gcg ccc ccg gcc aac gcc acc atg cag ccg 1680Val Leu Ser
Asn Gln Gly Ala Pro Pro Ala Asn Ala Thr Met Gln Pro545 550 555
560ccc tcg ctc gat gcc gat ccg gcg ctc cag ggc tcc gtc tac gac ggc
1728Pro Ser Leu Asp Ala Asp Pro Ala Leu Gln Gly Ser Val Tyr Asp Gly
565 570 575aag acc ctc ttc cac ggc ccg gcc ttc cgc ggc atc gat gac
gtg ctc 1776Lys Thr Leu Phe His Gly Pro Ala Phe Arg Gly Ile Asp Asp
Val Leu 580 585 590tcg tgc acc aag agc cag ctt gtg gcc aag tgc agc
gct gtc ccc ggc 1824Ser Cys Thr Lys Ser Gln Leu Val Ala Lys Cys Ser
Ala Val Pro Gly 595 600 605tcc gac gcc gct cgc ggc gag ttt gcc acg
gac act gac gcc cat gac 1872Ser Asp Ala Ala Arg Gly Glu Phe Ala Thr
Asp Thr Asp Ala His Asp 610 615 620ccc ttc gtg aac gac ctg gcc ttt
cag gcc atg ctc gtc tgg gtg cgc 1920Pro Phe Val Asn Asp Leu Ala Phe
Gln Ala Met Leu Val Trp Val Arg625 630 635 640cgc acg ctc ggc cag
gct gcg ctc ccc aac tcg atc cag cgc atc gtc 1968Arg Thr Leu Gly Gln
Ala Ala Leu Pro Asn Ser Ile Gln Arg Ile Val 645 650 655cag cac cgc
ccg gtc ccg cag gac aag ccc ttc tac att acc ctc cgc 2016Gln His Arg
Pro Val Pro Gln Asp Lys Pro Phe Tyr Ile Thr Leu Arg 660 665 670tcc
aac cag tcg ggc ggt cac tcc cag cac aag cac gcc ctt cag ttc 2064Ser
Asn Gln Ser Gly Gly His Ser Gln His Lys His Ala Leu Gln Phe 675 680
685cac aac gag cag ggc gat ctc ttc att gat gtc cag gct tcg gtc atc
2112His Asn Glu Gln Gly Asp Leu Phe Ile Asp Val Gln Ala Ser Val Ile
690 695 700gcc acg gac agc ctt gcc ttc 2133Ala Thr Asp Ser Leu Ala
Phe705 71018711PRTSchizochytrium sp. 18Phe Gly Ala Leu Gly Gly Phe
Ile Ser Gln Gln Ala Glu Arg Phe Glu1 5 10 15Pro Ala Glu Ile Leu Gly
Phe Thr Leu Met Cys Ala Lys Phe Ala Lys 20 25 30Ala Ser Leu Cys Thr
Ala Val Ala Gly Gly Arg Pro Ala Phe Ile Gly 35 40 45Val Ala Arg Leu
Asp Gly Arg Leu Gly Phe Thr Ser Gln Gly Thr Ser 50 55 60Asp Ala Leu
Lys Arg Ala Gln Arg Gly Ala Ile Phe Gly Leu Cys Lys65 70 75 80Thr
Ile Gly Leu Glu Trp Ser Glu Ser Asp Val Phe Ser Arg Gly Val 85 90
95Asp Ile Ala Gln Gly Met His Pro Glu Asp Ala Ala Val Ala Ile Val
100 105 110Arg Glu Met Ala Cys Ala Asp Ile Arg Ile Arg Glu Val Gly
Ile Gly 115 120 125Ala Asn Gln Gln Arg Cys Thr Ile Arg Ala Ala Lys
Leu Glu Thr Gly 130 135 140Asn Pro Gln Arg Gln Ile Ala Lys Asp Asp
Val Leu Leu Val Ser Gly145 150 155 160Gly Ala Arg Gly Ile Thr Pro
Leu Cys Ile Arg Glu Ile Thr Arg Gln 165 170 175Ile Ala Gly Gly Lys
Tyr Ile Leu Leu Gly Arg Ser Lys Val Ser Ala 180 185 190Ser Glu Pro
Ala Trp Cys Ala Gly Ile Thr Asp Glu Lys Ala Val Gln 195 200 205Lys
Ala Ala Thr Gln Glu Leu Lys Arg Ala Phe Ser Ala Gly Glu Gly 210 215
220Pro Lys Pro Thr Pro Arg Ala Val Thr Lys Leu Val Gly Ser Val
Leu225 230 235 240Gly Ala Arg Glu Val Arg Ser Ser Ile Ala Ala Ile
Glu Ala Leu Gly 245 250 255Gly Lys Ala Ile Tyr Ser Ser Cys Asp Val
Asn Ser Ala Ala Asp Val 260 265 270Ala Lys Ala Val Arg Asp Ala Glu
Ser Gln Leu Gly Ala Arg Val Ser 275 280 285Gly Ile Val His Ala Ser
Gly Val Leu Arg Asp Arg Leu Ile Glu Lys 290 295 300Lys Leu Pro Asp
Glu Phe Asp Ala Val Phe Gly Thr Lys Val Thr Gly305 310 315 320Leu
Glu Asn Leu Leu Ala Ala Val Asp Arg Ala Asn Leu Lys His Met 325 330
335Val Leu Phe Ser Ser Leu Ala Gly Phe His Gly Asn Val Gly Gln Ser
340 345 350Asp Tyr Ala Met Ala Asn Glu Ala Leu Asn Lys Met Gly Leu
Glu Leu 355 360 365Ala Lys Asp Val Ser Val Lys Ser Ile Cys Phe Gly
Pro Trp Asp Gly 370 375 380Gly Met Val Thr Pro Gln Leu Lys Lys Gln
Phe Gln Glu Met Gly Val385 390 395 400Gln Ile Ile Pro Arg Glu Gly
Gly Ala Asp Thr Val Ala Arg Ile Val 405 410 415Leu Gly Ser Ser Pro
Ala Glu Ile Leu Val Gly Asn Trp Arg Thr Pro 420 425 430Ser Lys Lys
Val Gly Ser Asp Thr Ile Thr Leu His Arg Lys Ile Ser 435 440 445Ala
Lys Ser Asn Pro Phe Leu Glu Asp His Val Ile Gln Gly Arg Arg 450 455
460Val Leu Pro Met Thr Leu Ala Ile Gly Ser Leu Ala Glu Thr Cys
Leu465 470 475 480Gly Leu Phe Pro Gly Tyr Ser Leu Trp Ala Ile Asp
Asp Ala Gln Leu 485 490 495Phe Lys Gly Val Thr Val Asp Gly Asp Val
Asn Cys Glu Val Thr Leu 500 505 510Thr Pro Ser Thr Ala Pro Ser Gly
Arg Val Asn Val Gln Ala Thr Leu 515 520 525Lys Thr Phe Ser Ser Gly
Lys Leu Val Pro Ala Tyr Arg Ala Val Ile 530 535 540Val Leu Ser Asn
Gln Gly Ala Pro Pro Ala Asn Ala Thr Met Gln Pro545 550 555 560Pro
Ser Leu Asp Ala Asp Pro Ala Leu Gln Gly Ser Val Tyr Asp Gly 565 570
575Lys Thr Leu Phe His Gly Pro Ala Phe Arg Gly Ile Asp Asp Val Leu
580 585 590Ser Cys Thr Lys Ser Gln Leu Val Ala Lys Cys Ser Ala Val
Pro Gly 595 600 605Ser Asp Ala Ala Arg Gly Glu Phe Ala Thr Asp Thr
Asp Ala His Asp 610 615 620Pro Phe Val Asn Asp Leu Ala Phe Gln Ala
Met Leu Val Trp Val Arg625 630 635 640Arg Thr Leu Gly Gln Ala Ala
Leu Pro Asn Ser Ile Gln Arg Ile Val 645 650 655Gln His Arg Pro Val
Pro Gln Asp Lys Pro Phe Tyr Ile Thr Leu Arg 660 665 670Ser Asn Gln
Ser Gly Gly His Ser Gln His Lys His Ala Leu Gln Phe 675 680 685His
Asn Glu Gln Gly Asp Leu Phe Ile Asp Val Gln Ala Ser Val Ile 690 695
700Ala Thr Asp Ser Leu Ala Phe705 710191350DNASchizochytrium
sp.CDS(1)..(1350) 19atg gcc gct cgg aat gtg agc gcc gcg cat gag atg
cac gat gaa aag 48Met Ala Ala Arg Asn Val Ser Ala Ala His Glu Met
His Asp Glu Lys1 5 10 15cgc atc gcc gtc gtc ggc atg gcc gtc cag tac
gcc gga tgc aaa acc 96Arg Ile Ala Val Val Gly Met Ala Val Gln Tyr
Ala Gly Cys Lys Thr 20 25 30aag gac gag ttc tgg gag gtg ctc atg aac
ggc aag gtc gag tcc aag 144Lys Asp Glu Phe Trp Glu Val Leu Met Asn
Gly Lys Val Glu Ser Lys 35 40 45gtg atc agc gac aaa cga ctc ggc tcc
aac tac cgc gcc gag cac tac 192Val Ile Ser Asp Lys Arg Leu Gly Ser
Asn Tyr Arg Ala Glu His Tyr 50 55 60aaa gca gag cgc agc aag tat gcc
gac acc ttt tgc aac gaa acg tac 240Lys Ala Glu Arg Ser Lys Tyr Ala
Asp Thr Phe Cys Asn Glu Thr Tyr65 70 75 80ggc acc ctt gac gag aac
gag atc gac aac gag cac gaa ctc ctc ctc 288Gly Thr Leu Asp Glu Asn
Glu Ile Asp Asn Glu His Glu Leu Leu Leu 85 90 95aac ctc gcc aag cag
gca ctc gca gag aca tcc gtc aaa gac tcg aca 336Asn Leu Ala Lys Gln
Ala Leu Ala Glu Thr Ser Val Lys Asp Ser Thr 100 105 110cgc tgc ggc
atc gtc agc ggc tgc ctc tcg ttc ccc atg gac aac ctc 384Arg Cys Gly
Ile Val Ser Gly Cys Leu Ser Phe Pro Met Asp Asn Leu 115 120 125cag
ggt gaa ctc ctc aac gtg tac caa aac cat gtc gag aaa aag ctc 432Gln
Gly Glu Leu Leu Asn Val Tyr Gln Asn His Val Glu Lys Lys Leu 130 135
140ggg gcc cgc gtc ttc aag gac gcc tcc cat tgg tcc gaa cgc gag cag
480Gly Ala Arg Val Phe Lys Asp Ala Ser His Trp Ser Glu Arg Glu
Gln145 150 155 160tcc aac aaa ccc gag gcc ggt gac cgc cgc atc ttc
atg gac ccg gcc 528Ser Asn Lys Pro Glu Ala Gly Asp Arg Arg Ile Phe
Met Asp Pro Ala 165 170 175tcc ttc gtc gcc gaa gaa ctc aac ctc ggc
gcc ctt cac tac tcc gtc 576Ser Phe Val Ala Glu Glu Leu Asn Leu Gly
Ala Leu His Tyr Ser Val 180 185 190gac gca gca tgc gcc acg gcg ctc
tac gtg ctc cgc ctc gcg cag gat 624Asp Ala Ala Cys Ala Thr Ala Leu
Tyr Val Leu Arg Leu Ala Gln Asp 195 200 205cat ctc gtc tcc ggc gcc
gcc gac gtc atg ctc tgc ggt gcc acc tgc 672His Leu Val Ser Gly Ala
Ala Asp Val Met Leu Cys Gly Ala Thr Cys 210 215 220ctg ccg gag ccc
ttt ttc atc ctt tcg ggc ttt tcc acc ttc cag gcc 720Leu Pro Glu Pro
Phe Phe Ile Leu Ser Gly Phe Ser Thr Phe Gln Ala225 230 235 240atg
ccc gtc ggc acg ggc cag aac gtg tcc atg ccg ctg cac aag gac 768Met
Pro Val Gly Thr Gly Gln Asn Val Ser Met Pro Leu His Lys Asp 245 250
255agc cag ggc ctc acc ccg ggt gag ggc ggc tcc atc atg gtc ctc aag
816Ser Gln Gly Leu Thr Pro Gly Glu Gly Gly Ser Ile Met Val Leu Lys
260 265 270cgt ctc gat gat gcc atc cgc gac ggc gac cac att tac ggc
acc ctt 864Arg Leu Asp Asp Ala Ile Arg Asp Gly Asp His Ile Tyr Gly
Thr Leu 275 280 285ctc ggc gcc aat gtc agc aac tcc ggc aca ggt ctg
ccc ctc aag ccc 912Leu Gly Ala Asn Val Ser Asn Ser Gly Thr Gly Leu
Pro Leu Lys Pro 290 295 300ctt ctc ccc agc gag aaa aag tgc ctc atg
gac acc tac acg cgc att 960Leu Leu Pro Ser Glu Lys Lys Cys Leu Met
Asp Thr Tyr Thr Arg Ile305 310 315 320aac gtg cac ccg cac aag att
cag tac gtc gag tgc cac gcc acc ggc 1008Asn Val His Pro His Lys Ile
Gln Tyr Val Glu Cys His Ala Thr Gly 325 330 335acg ccc cag ggt gat
cgt gtg gaa atc gac gcc gtc aag gcc tgc ttt 1056Thr Pro Gln Gly Asp
Arg Val Glu Ile Asp Ala Val Lys Ala Cys Phe 340 345 350gaa ggc aag
gtc ccc cgt ttc ggt acc aca aag ggc aac ttt gga cac 1104Glu Gly Lys
Val Pro Arg Phe Gly Thr Thr Lys Gly Asn Phe Gly His 355 360 365acc
cts gyc gca gcc ggc ttt gcc ggt atg tgc aag gtc ctc ctc tcc 1152Thr
Xaa Xaa Ala Ala Gly Phe Ala Gly Met Cys Lys Val Leu Leu Ser 370 375
380atg aag cat ggc atc atc ccg ccc acc ccg ggt atc gat gac gag acc
1200Met Lys His Gly Ile Ile Pro Pro Thr Pro Gly Ile Asp Asp Glu
Thr385 390 395 400aag atg gac cct ctc gtc gtc tcc ggt gag gcc atc
cca tgg cca gag 1248Lys Met Asp Pro Leu Val Val Ser Gly Glu Ala Ile
Pro Trp Pro Glu 405 410 415acc aac ggc gag ccc aag cgc gcc ggt ctc
tcg gcc ttt ggc ttt ggt 1296Thr Asn Gly Glu Pro Lys Arg Ala Gly Leu
Ser Ala Phe Gly Phe Gly 420 425 430ggc acc aac gcc cat gcc gtc ttt
gag gag cat gac ccc tcc aac gcc 1344Gly Thr Asn Ala His Ala Val Phe
Glu Glu His Asp Pro Ser Asn Ala 435 440 445gcc tgc 1350Ala Cys
45020450PRTSchizochytrium sp.misc_feature(370)..(370)The 'Xaa' at
location 370 stands for Leu. 20Met Ala Ala Arg Asn Val Ser Ala Ala
His Glu Met His Asp Glu Lys1 5 10 15Arg Ile Ala Val Val Gly Met Ala
Val Gln Tyr Ala Gly Cys Lys Thr 20 25 30Lys Asp Glu Phe Trp Glu Val
Leu Met Asn Gly Lys Val Glu Ser Lys 35 40 45Val Ile Ser Asp Lys Arg
Leu Gly Ser Asn Tyr Arg Ala Glu His Tyr 50 55 60Lys Ala Glu Arg Ser
Lys Tyr Ala Asp Thr Phe Cys Asn Glu Thr Tyr65 70 75 80Gly Thr Leu
Asp Glu Asn Glu Ile Asp Asn Glu His Glu Leu Leu Leu 85 90 95Asn Leu
Ala Lys Gln Ala Leu Ala Glu Thr Ser Val Lys Asp Ser Thr 100 105
110Arg Cys Gly Ile Val Ser Gly Cys Leu Ser Phe Pro Met Asp Asn Leu
115 120 125Gln Gly Glu Leu Leu Asn Val Tyr Gln Asn His Val Glu Lys
Lys Leu 130 135 140Gly Ala Arg Val Phe Lys Asp Ala Ser His Trp Ser
Glu Arg Glu Gln145 150 155 160Ser Asn Lys Pro Glu Ala Gly Asp Arg
Arg Ile Phe Met Asp Pro Ala 165 170 175Ser Phe Val Ala Glu Glu Leu
Asn Leu Gly Ala Leu His Tyr Ser Val 180 185 190Asp Ala Ala Cys Ala
Thr Ala Leu Tyr Val Leu Arg Leu Ala Gln Asp 195 200 205His Leu Val
Ser Gly Ala Ala Asp Val Met Leu Cys Gly Ala Thr Cys 210 215 220Leu
Pro Glu Pro Phe Phe Ile Leu Ser Gly Phe Ser Thr Phe Gln Ala225 230
235 240Met Pro Val Gly Thr Gly Gln Asn Val Ser Met Pro Leu His Lys
Asp 245 250 255Ser Gln Gly Leu Thr Pro Gly Glu Gly Gly Ser Ile Met
Val Leu Lys 260 265 270Arg Leu Asp Asp Ala Ile Arg Asp Gly Asp His
Ile Tyr Gly Thr Leu 275 280 285Leu Gly Ala Asn Val Ser Asn Ser Gly
Thr Gly Leu Pro Leu Lys Pro 290 295 300Leu Leu Pro Ser Glu Lys Lys
Cys Leu Met Asp Thr Tyr Thr Arg Ile305 310 315 320Asn Val His Pro
His Lys Ile Gln Tyr Val Glu Cys His Ala Thr Gly 325 330 335Thr Pro
Gln Gly Asp Arg Val Glu Ile Asp Ala Val Lys Ala Cys Phe 340 345
350Glu Gly Lys Val Pro Arg Phe Gly Thr Thr Lys Gly Asn Phe Gly His
355 360 365Thr Xaa Xaa Ala Ala Gly Phe Ala Gly Met Cys Lys Val Leu
Leu Ser 370 375 380Met Lys His Gly Ile Ile Pro Pro Thr Pro Gly Ile
Asp Asp Glu Thr385 390 395 400Lys Met Asp Pro Leu Val Val Ser Gly
Glu Ala Ile Pro Trp Pro Glu 405 410 415Thr Asn Gly Glu Pro Lys Arg
Ala Gly Leu Ser Ala Phe Gly Phe Gly 420 425 430Gly Thr Asn Ala His
Ala Val Phe Glu Glu His Asp Pro Ser Asn Ala 435 440 445Ala Cys
450211323DNASchizochytrium sp.CDS(1)..(1323) 21tcg gcc cgc tgc ggc
ggt gaa agc aac atg cgc atc gcc atc act ggt 48Ser Ala Arg Cys Gly
Gly Glu Ser Asn Met Arg Ile Ala Ile Thr Gly1 5 10 15atg gac gcc acc
ttt ggc gct ctc aag gga ctc gac gcc ttc gag cgc 96Met Asp Ala Thr
Phe Gly Ala Leu Lys Gly Leu Asp Ala Phe Glu Arg 20 25 30gcc att tac
acc ggc gct cac ggt gcc atc cca ctc cca gaa aag cgc 144Ala Ile Tyr
Thr Gly Ala His Gly Ala Ile Pro Leu Pro Glu Lys Arg 35 40 45tgg cgc
ttt ctc ggc aag gac aag gac ttt ctt gac ctc tgc ggc gtc 192Trp Arg
Phe Leu Gly Lys Asp Lys Asp Phe Leu Asp Leu Cys Gly Val 50 55 60aag
gcc acc ccg cac ggc tgc tac att gaa gat gtt gag gtc gac ttc 240Lys
Ala Thr Pro His Gly Cys Tyr Ile Glu Asp Val Glu Val Asp Phe65 70 75
80cag cgc ctc cgc acg ccc atg acc cct gaa gac atg ctc ctc cct cag
288Gln Arg Leu Arg Thr Pro Met Thr Pro Glu Asp Met Leu Leu Pro Gln
85 90 95cag ctt ctg gcc gtc acc acc att gac cgc gcc atc ctc gac tcg
gga 336Gln Leu Leu Ala Val Thr Thr Ile Asp Arg Ala Ile Leu Asp Ser
Gly 100 105 110atg aaa aag ggt ggc aat gtc gcc gtc ttt gtc ggc ctc
ggc acc gac 384Met Lys Lys Gly Gly Asn Val Ala Val Phe Val Gly Leu
Gly Thr Asp 115 120 125ctc gag ctc tac cgt cac cgt gct cgc gtc gct
ctc aag gag cgc gtc 432Leu Glu Leu Tyr Arg His Arg Ala Arg Val Ala
Leu Lys Glu Arg Val 130 135 140cgc cct gaa gcc tcc aag aag ctc aat
gac atg atg cag tac att aac 480Arg Pro Glu Ala Ser Lys Lys Leu Asn
Asp Met Met Gln Tyr Ile Asn145 150 155 160gac tgc ggc aca tcc aca
tcg tac acc tcg tac att ggc aac ctc gtc 528Asp Cys Gly Thr Ser Thr
Ser Tyr Thr Ser Tyr Ile Gly Asn Leu Val 165 170 175gcc acg cgc gtc
tcg tcg cag tgg ggc ttc acg ggc ccc tcc ttt acg 576Ala Thr Arg Val
Ser Ser Gln Trp Gly Phe Thr Gly Pro Ser Phe Thr 180 185 190atc acc
gag ggc aac aac tcc gtc tac cgc tgc gcc gag ctc ggc aag 624Ile Thr
Glu Gly Asn Asn Ser Val Tyr Arg Cys Ala Glu Leu Gly Lys 195 200
205tac ctc ctc gag acc ggc gag gtc gat ggc gtc gtc gtt gcg ggt gtc
672Tyr Leu Leu Glu Thr Gly Glu Val Asp Gly Val Val Val Ala Gly Val
210 215 220gat ctc tgc ggc agt gcc gaa aac ctt tac gtc aag tct cgc
cgc ttc 720Asp Leu Cys Gly Ser Ala Glu Asn Leu Tyr Val Lys Ser Arg
Arg Phe225 230 235 240aag gtg tcc acc tcc gat acc ccg cgc gcc agc
ttt gac gcc gcc gcc 768Lys Val Ser Thr Ser Asp Thr Pro Arg Ala Ser
Phe Asp Ala Ala Ala 245 250 255gat ggc tac ttt gtc ggc gag ggc tgc
ggt gcc ttt gtg ctc aag cgt 816Asp Gly Tyr Phe Val Gly Glu Gly Cys
Gly Ala Phe Val Leu Lys Arg 260 265 270gag act agc tgc acc aag gac
gac cgt atc tac gct tgc atg gat gcc 864Glu Thr Ser Cys Thr Lys Asp
Asp Arg Ile Tyr Ala Cys Met Asp Ala 275 280 285atc gtc cct ggc aac
gtc cct agc gcc tgc ttg cgc gag gcc ctc gac 912Ile Val Pro Gly Asn
Val Pro Ser Ala Cys Leu Arg Glu Ala Leu Asp 290 295 300cag gcg cgc
gtc aag ccg ggc gat atc gag atg ctc gag ctc agc gcc 960Gln Ala Arg
Val Lys Pro Gly Asp Ile Glu Met Leu Glu Leu Ser Ala305 310 315
320gac tcc gcc cgc cac ctc aag gac ccg tcc gtc ctg ccc aag gag ctc
1008Asp Ser Ala Arg His Leu Lys Asp Pro Ser Val Leu Pro Lys Glu Leu
325 330 335act gcc gag gag gaa atc ggc ggc ctt cag acg atc ctt cgt
gac gat 1056Thr Ala Glu Glu Glu Ile Gly Gly Leu Gln Thr Ile Leu Arg
Asp Asp 340 345 350gac aag ctc ccg cgc aac gtc gca acg ggc agt gtc
aag gcc acc gtc 1104Asp Lys Leu Pro Arg Asn Val Ala Thr Gly Ser Val
Lys Ala Thr Val 355 360 365ggt gac acc ggt tat gcc tct ggt gct gcc
agc ctc atc aag gct gcg 1152Gly Asp Thr Gly Tyr Ala Ser Gly Ala Ala
Ser Leu Ile Lys Ala Ala 370 375 380ctt tgc atc tac aac cgc tac ctg
ccc agc aac ggc gac gac tgg gat 1200Leu Cys Ile Tyr Asn Arg Tyr Leu
Pro Ser Asn Gly Asp Asp Trp Asp385 390 395 400gaa ccc gcc cct gag
gcg ccc tgg gac agc acc ctc ttt gcg tgc cag 1248Glu Pro Ala Pro Glu
Ala Pro Trp Asp Ser Thr Leu Phe Ala Cys Gln 405 410 415acc tcg cgc
gct tgg ctc aag aac cct ggc gag cgt cgc tat gcg gcc 1296Thr Ser Arg
Ala Trp Leu Lys Asn Pro Gly Glu Arg Arg Tyr Ala Ala 420 425 430gtc
tcg ggc gtc tcc gag acg cgc tcg 1323Val Ser Gly Val Ser Glu Thr Arg
Ser 435 44022441PRTSchizochytrium sp. 22Ser Ala Arg Cys Gly Gly Glu
Ser Asn Met Arg Ile Ala Ile Thr Gly1 5 10 15Met Asp Ala Thr Phe Gly
Ala Leu Lys Gly Leu Asp Ala Phe Glu Arg 20 25 30Ala Ile Tyr Thr Gly
Ala His Gly Ala Ile Pro Leu Pro Glu Lys Arg 35 40 45Trp Arg Phe Leu
Gly Lys Asp Lys Asp Phe Leu Asp Leu Cys Gly Val 50 55 60Lys Ala Thr
Pro His Gly Cys Tyr Ile Glu Asp Val Glu Val Asp Phe65 70 75 80Gln
Arg Leu Arg Thr Pro Met Thr Pro Glu Asp Met Leu Leu Pro Gln 85 90
95Gln Leu Leu Ala Val Thr Thr Ile Asp Arg Ala Ile Leu Asp Ser Gly
100 105 110Met Lys Lys Gly Gly Asn Val Ala Val Phe Val Gly Leu Gly
Thr Asp 115 120 125Leu Glu Leu Tyr Arg His Arg Ala Arg Val Ala Leu
Lys Glu Arg Val 130 135 140Arg Pro Glu Ala Ser Lys Lys Leu Asn Asp
Met Met Gln Tyr Ile Asn145 150 155 160Asp Cys Gly Thr Ser Thr Ser
Tyr Thr Ser Tyr Ile Gly Asn Leu Val 165 170 175Ala Thr Arg Val Ser
Ser Gln Trp Gly Phe Thr Gly Pro Ser Phe Thr 180 185 190Ile Thr Glu
Gly Asn Asn Ser Val Tyr Arg Cys Ala Glu Leu Gly Lys 195 200 205Tyr
Leu Leu Glu Thr Gly Glu Val Asp Gly Val Val Val Ala Gly Val 210 215
220Asp Leu Cys Gly Ser Ala Glu Asn Leu Tyr Val Lys Ser Arg Arg
Phe225 230 235 240Lys Val Ser Thr Ser Asp Thr Pro Arg Ala Ser Phe
Asp Ala Ala Ala 245 250 255Asp Gly Tyr Phe Val Gly Glu Gly Cys Gly
Ala Phe Val Leu Lys Arg 260 265 270Glu Thr Ser Cys Thr Lys Asp Asp
Arg Ile Tyr Ala Cys Met Asp Ala 275 280 285Ile Val Pro Gly Asn Val
Pro Ser Ala Cys Leu Arg Glu Ala Leu Asp 290 295 300Gln Ala Arg Val
Lys Pro Gly Asp Ile Glu Met Leu Glu Leu Ser Ala305 310 315 320Asp
Ser Ala Arg His Leu Lys Asp Pro Ser Val Leu Pro Lys Glu Leu 325 330
335Thr Ala Glu Glu Glu Ile Gly Gly Leu Gln Thr Ile Leu Arg Asp Asp
340 345 350Asp Lys Leu Pro Arg Asn Val Ala Thr Gly Ser Val Lys Ala
Thr Val 355 360 365Gly Asp Thr Gly Tyr Ala Ser Gly Ala Ala Ser Leu
Ile Lys Ala Ala 370 375 380Leu Cys Ile Tyr Asn Arg Tyr Leu Pro Ser
Asn Gly Asp Asp Trp Asp385 390 395 400Glu Pro Ala Pro Glu Ala Pro
Trp Asp Ser Thr Leu Phe Ala Cys Gln 405 410 415Thr Ser Arg Ala Trp
Leu Lys Asn Pro Gly Glu Arg Arg Tyr Ala Ala 420 425 430Val Ser Gly
Val Ser Glu Thr Arg Ser 435 440231500DNASchizochytrium
sp.CDS(1)..(1500) 23tgc tat tcc gtg ctc ctc tcc gaa gcc gag ggc cac
tac gag cgc gag 48Cys Tyr Ser Val Leu Leu Ser Glu Ala Glu Gly His
Tyr Glu Arg Glu1 5 10 15aac cgc atc tcg ctc gac gag gag gcg ccc aag
ctc att gtg ctt cgc 96Asn Arg Ile Ser Leu Asp Glu Glu Ala Pro Lys
Leu Ile Val Leu Arg 20 25 30gcc gac tcc cac gag gag atc ctt ggt cgc
ctc gac aag atc cgc gag 144Ala Asp Ser His Glu Glu Ile Leu Gly Arg
Leu Asp Lys Ile Arg Glu 35 40 45cgc ttc ttg cag ccc acg ggc gcc gcc
ccg cgc gag tcc gag ctc aag 192Arg Phe Leu Gln Pro Thr Gly Ala Ala
Pro Arg Glu Ser Glu Leu Lys 50 55 60gcg cag gcc cgc cgc atc ttc ctc
gag ctc ctc ggc gag acc ctt gcc 240Ala Gln Ala Arg Arg Ile Phe Leu
Glu Leu Leu Gly Glu Thr Leu Ala65 70 75 80cag gat gcc gct tct tca
ggc tcg caa aag ccc ctc gct ctc agc ctc 288Gln Asp Ala Ala Ser Ser
Gly Ser Gln Lys Pro Leu Ala Leu Ser Leu 85 90 95gtc tcc acg ccc tcc
aag ctc cag cgc gag gtc gag ctc gcg gcc aag 336Val Ser Thr Pro Ser
Lys Leu Gln Arg Glu Val Glu Leu Ala Ala Lys 100 105 110ggt atc ccg
cgc tgc ctc aag atg cgc cgc gat tgg agc tcc cct gct 384Gly Ile Pro
Arg Cys Leu Lys Met Arg Arg Asp Trp Ser Ser Pro Ala 115 120 125ggc
agc cgc tac gcg cct gag ccg ctc gcc agc gac cgc gtc gcc ttc 432Gly
Ser Arg Tyr Ala Pro Glu Pro Leu Ala Ser Asp Arg Val Ala Phe 130 135
140atg tac ggc gaa ggt cgc agc cct tac tac ggc atc acc caa gac att
480Met Tyr Gly Glu Gly Arg Ser Pro Tyr Tyr Gly Ile Thr Gln Asp
Ile145 150 155 160cac cgc att tgg ccc gaa ctc cac gag gtc atc aac
gaa aag acg aac 528His Arg Ile Trp Pro Glu Leu His Glu Val Ile Asn
Glu Lys Thr Asn 165 170 175cgt ctc tgg gcc gaa ggc gac cgc tgg gtc
atg ccg cgc gcc agc ttc 576Arg Leu Trp Ala Glu Gly Asp Arg Trp Val
Met Pro Arg Ala Ser Phe 180 185 190aag tcg gag ctc gag agc cag cag
caa gag ttt gat cgc aac atg att 624Lys Ser Glu Leu Glu Ser Gln Gln
Gln Glu Phe Asp Arg Asn Met Ile 195 200 205gaa atg ttc cgt ctt gga
atc ctc acc tca att gcc ttc acc aat ctg 672Glu Met Phe Arg Leu Gly
Ile Leu Thr Ser Ile Ala Phe Thr Asn Leu 210 215 220gcg cgc gac gtt
ctc aac atc acg ccc aag gcc gcc ttt ggc ctc agt 720Ala Arg Asp Val
Leu Asn Ile Thr Pro Lys Ala Ala Phe Gly Leu Ser225 230 235 240ctt
ggc gag att tcc atg att ttt gcc ttt tcc aag aag aac ggt ctc 768Leu
Gly Glu Ile Ser Met Ile Phe Ala Phe Ser Lys Lys Asn Gly Leu 245 250
255atc tcc gac cag ctc acc aag gat ctt cgc gag tcc gac gtg tgg aac
816Ile Ser Asp Gln Leu Thr Lys Asp Leu Arg Glu Ser Asp Val Trp Asn
260 265 270aag gct ctg gcc gtt gaa ttt aat gcg ctg cgc gag gcc tgg
ggc att 864Lys Ala Leu Ala Val Glu Phe Asn Ala Leu Arg Glu Ala Trp
Gly Ile 275 280 285cca cag agt gtc ccc aag gac gag ttc tgg caa ggc
tac att gtg cgc 912Pro Gln Ser Val Pro Lys Asp Glu Phe Trp Gln Gly
Tyr Ile Val Arg 290 295 300ggc acc aag cag gat atc gag gcg gcc atc
gcc ccg gac agc aag tac 960Gly Thr Lys Gln Asp Ile Glu Ala Ala Ile
Ala Pro Asp Ser Lys Tyr305 310 315 320gtg cgc ctc acc atc atc aat
gat gcc aac acc gcc ctc att agc ggc 1008Val Arg Leu Thr Ile Ile Asn
Asp Ala Asn Thr Ala Leu Ile Ser Gly 325 330 335aag ccc gac gcc tgc
aag gct gcg atc gcg cgt ctc ggt ggc aac att 1056Lys Pro Asp Ala Cys
Lys Ala Ala Ile Ala Arg Leu Gly Gly Asn Ile 340 345 350cct gcg ctt
ccc gtg acc cag ggc atg tgc ggc cac tgc ccc gag gtg 1104Pro Ala Leu
Pro Val Thr Gln Gly Met Cys Gly His Cys Pro Glu Val 355 360 365gga
cct tat acc aag gat atc gcc aag atc cat gcc aac ctt gag ttc 1152Gly
Pro Tyr Thr Lys Asp Ile Ala Lys Ile His Ala Asn Leu Glu Phe 370 375
380ccc gtt gtc gac ggc ctt gac ctc tgg acc aca atc aac cag aag cgc
1200Pro Val Val Asp Gly Leu Asp Leu Trp Thr Thr Ile Asn Gln Lys
Arg385 390 395 400ctc gtg cca cgc gcc acg ggc gcc aag gac gaa tgg
gcc cct tct tcc 1248Leu Val Pro Arg Ala Thr Gly Ala Lys Asp Glu Trp
Ala Pro Ser Ser 405 410 415ttt ggc gag tac gcc ggc cag ctc tac gag
aag cag gct aac ttc ccc 1296Phe Gly Glu Tyr Ala Gly Gln Leu Tyr Glu
Lys Gln Ala Asn Phe Pro 420 425 430caa atc gtc gag acc att tac aag
caa aac tac gac gtc ttt gtc gag 1344Gln Ile Val Glu Thr Ile Tyr Lys
Gln Asn Tyr Asp Val Phe Val Glu 435 440 445gtt ggg ccc aac aac cac
cgt agc acc gca gtg cgc acc acg ctt ggt 1392Val Gly Pro Asn Asn His
Arg Ser Thr Ala Val Arg Thr Thr Leu Gly 450 455 460ccc cag cgc aac
cac ctt gct ggc gcc atc gac aag cag aac gag gat 1440Pro Gln Arg Asn
His Leu Ala Gly Ala Ile Asp Lys Gln Asn Glu Asp465 470 475 480gct
tgg acg acc atc gtc aag ctt gtg gct tcg ctc aag gcc cac ctt 1488Ala
Trp Thr Thr Ile Val Lys Leu Val Ala Ser Leu Lys Ala His Leu 485 490
495gtt cct ggc gtc 1500Val Pro Gly Val 50024500PRTSchizochytrium
sp. 24Cys Tyr Ser Val Leu Leu Ser Glu Ala Glu Gly His Tyr Glu Arg
Glu1 5 10 15Asn Arg Ile Ser Leu Asp Glu Glu Ala Pro Lys Leu Ile Val
Leu Arg
20 25 30Ala Asp Ser His Glu Glu Ile Leu Gly Arg Leu Asp Lys Ile Arg
Glu 35 40 45Arg Phe Leu Gln Pro Thr Gly Ala Ala Pro Arg Glu Ser Glu
Leu Lys 50 55 60Ala Gln Ala Arg Arg Ile Phe Leu Glu Leu Leu Gly Glu
Thr Leu Ala65 70 75 80Gln Asp Ala Ala Ser Ser Gly Ser Gln Lys Pro
Leu Ala Leu Ser Leu 85 90 95Val Ser Thr Pro Ser Lys Leu Gln Arg Glu
Val Glu Leu Ala Ala Lys 100 105 110Gly Ile Pro Arg Cys Leu Lys Met
Arg Arg Asp Trp Ser Ser Pro Ala 115 120 125Gly Ser Arg Tyr Ala Pro
Glu Pro Leu Ala Ser Asp Arg Val Ala Phe 130 135 140Met Tyr Gly Glu
Gly Arg Ser Pro Tyr Tyr Gly Ile Thr Gln Asp Ile145 150 155 160His
Arg Ile Trp Pro Glu Leu His Glu Val Ile Asn Glu Lys Thr Asn 165 170
175Arg Leu Trp Ala Glu Gly Asp Arg Trp Val Met Pro Arg Ala Ser Phe
180 185 190Lys Ser Glu Leu Glu Ser Gln Gln Gln Glu Phe Asp Arg Asn
Met Ile 195 200 205Glu Met Phe Arg Leu Gly Ile Leu Thr Ser Ile Ala
Phe Thr Asn Leu 210 215 220Ala Arg Asp Val Leu Asn Ile Thr Pro Lys
Ala Ala Phe Gly Leu Ser225 230 235 240Leu Gly Glu Ile Ser Met Ile
Phe Ala Phe Ser Lys Lys Asn Gly Leu 245 250 255Ile Ser Asp Gln Leu
Thr Lys Asp Leu Arg Glu Ser Asp Val Trp Asn 260 265 270Lys Ala Leu
Ala Val Glu Phe Asn Ala Leu Arg Glu Ala Trp Gly Ile 275 280 285Pro
Gln Ser Val Pro Lys Asp Glu Phe Trp Gln Gly Tyr Ile Val Arg 290 295
300Gly Thr Lys Gln Asp Ile Glu Ala Ala Ile Ala Pro Asp Ser Lys
Tyr305 310 315 320Val Arg Leu Thr Ile Ile Asn Asp Ala Asn Thr Ala
Leu Ile Ser Gly 325 330 335Lys Pro Asp Ala Cys Lys Ala Ala Ile Ala
Arg Leu Gly Gly Asn Ile 340 345 350Pro Ala Leu Pro Val Thr Gln Gly
Met Cys Gly His Cys Pro Glu Val 355 360 365Gly Pro Tyr Thr Lys Asp
Ile Ala Lys Ile His Ala Asn Leu Glu Phe 370 375 380Pro Val Val Asp
Gly Leu Asp Leu Trp Thr Thr Ile Asn Gln Lys Arg385 390 395 400Leu
Val Pro Arg Ala Thr Gly Ala Lys Asp Glu Trp Ala Pro Ser Ser 405 410
415Phe Gly Glu Tyr Ala Gly Gln Leu Tyr Glu Lys Gln Ala Asn Phe Pro
420 425 430Gln Ile Val Glu Thr Ile Tyr Lys Gln Asn Tyr Asp Val Phe
Val Glu 435 440 445Val Gly Pro Asn Asn His Arg Ser Thr Ala Val Arg
Thr Thr Leu Gly 450 455 460Pro Gln Arg Asn His Leu Ala Gly Ala Ile
Asp Lys Gln Asn Glu Asp465 470 475 480Ala Trp Thr Thr Ile Val Lys
Leu Val Ala Ser Leu Lys Ala His Leu 485 490 495Val Pro Gly Val
500251530DNASchizochytrium sp.CDS(1)..(1530) 25ctg ctc gat ctc gac
agt atg ctt gcg ctg agc tct gcc agt gcc tcc 48Leu Leu Asp Leu Asp
Ser Met Leu Ala Leu Ser Ser Ala Ser Ala Ser1 5 10 15ggc aac ctt gtt
gag act gcg cct agc gac gcc tcg gtc att gtg ccg 96Gly Asn Leu Val
Glu Thr Ala Pro Ser Asp Ala Ser Val Ile Val Pro 20 25 30ccc tgc aac
att gcg gat ctc ggc agc cgc gcc ttc atg aaa acg tac 144Pro Cys Asn
Ile Ala Asp Leu Gly Ser Arg Ala Phe Met Lys Thr Tyr 35 40 45ggt gtt
tcg gcg cct ctg tac acg ggc gcc atg gcc aag ggc att gcc 192Gly Val
Ser Ala Pro Leu Tyr Thr Gly Ala Met Ala Lys Gly Ile Ala 50 55 60tct
gcg gac ctc gtc att gcc gcc ggc cgc cag ggc atc ctt gcg tcc 240Ser
Ala Asp Leu Val Ile Ala Ala Gly Arg Gln Gly Ile Leu Ala Ser65 70 75
80ttt ggc gcc ggc gga ctt ccc atg cag gtt gtg cgt gag tcc atc gaa
288Phe Gly Ala Gly Gly Leu Pro Met Gln Val Val Arg Glu Ser Ile Glu
85 90 95aag att cag gcc gcc ctg ccc aat ggc ccg tac gct gtc aac ctt
atc 336Lys Ile Gln Ala Ala Leu Pro Asn Gly Pro Tyr Ala Val Asn Leu
Ile 100 105 110cat tct ccc ttt gac agc aac ctc gaa aag ggc aat gtc
gat ctc ttc 384His Ser Pro Phe Asp Ser Asn Leu Glu Lys Gly Asn Val
Asp Leu Phe 115 120 125ctc gag aag ggt gtc acc ttt gtc gag gcc tcg
gcc ttt atg acg ctc 432Leu Glu Lys Gly Val Thr Phe Val Glu Ala Ser
Ala Phe Met Thr Leu 130 135 140acc ccg cag gtc gtg cgg tac cgc gcg
gct ggc ctc acg cgc aac gcc 480Thr Pro Gln Val Val Arg Tyr Arg Ala
Ala Gly Leu Thr Arg Asn Ala145 150 155 160gac ggc tcg gtc aac atc
cgc aac cgt atc att ggc aag gtc tcg cgc 528Asp Gly Ser Val Asn Ile
Arg Asn Arg Ile Ile Gly Lys Val Ser Arg 165 170 175acc gag ctc gcc
gag atg ttc atg cgt cct gcg ccc gag cac ctt ctt 576Thr Glu Leu Ala
Glu Met Phe Met Arg Pro Ala Pro Glu His Leu Leu 180 185 190cag aag
ctc att gct tcc ggc gag atc aac cag gag cag gcc gag ctc 624Gln Lys
Leu Ile Ala Ser Gly Glu Ile Asn Gln Glu Gln Ala Glu Leu 195 200
205gcc cgc cgt gtt ccc gtc gct gac gac atc gcg gtc gaa gct gac tcg
672Ala Arg Arg Val Pro Val Ala Asp Asp Ile Ala Val Glu Ala Asp Ser
210 215 220ggt ggc cac acc gac aac cgc ccc atc cac gtc att ctg ccc
ctc atc 720Gly Gly His Thr Asp Asn Arg Pro Ile His Val Ile Leu Pro
Leu Ile225 230 235 240atc aac ctt cgc gac cgc ctt cac cgc gag tgc
ggc tac ccg gcc aac 768Ile Asn Leu Arg Asp Arg Leu His Arg Glu Cys
Gly Tyr Pro Ala Asn 245 250 255ctt cgc gtc cgt gtg ggc gcc ggc ggt
ggc att ggg tgc ccc cag gcg 816Leu Arg Val Arg Val Gly Ala Gly Gly
Gly Ile Gly Cys Pro Gln Ala 260 265 270gcg ctg gcc acc ttc aac atg
ggt gcc tcc ttt att gtc acc ggc acc 864Ala Leu Ala Thr Phe Asn Met
Gly Ala Ser Phe Ile Val Thr Gly Thr 275 280 285gtg aac cag gtc gcc
aag cag tcg ggc acg tgc gac aat gtg cgc aag 912Val Asn Gln Val Ala
Lys Gln Ser Gly Thr Cys Asp Asn Val Arg Lys 290 295 300cag ctc gcg
aag gcc act tac tcg gac gta tgc atg gcc ccg gct gcc 960Gln Leu Ala
Lys Ala Thr Tyr Ser Asp Val Cys Met Ala Pro Ala Ala305 310 315
320gac atg ttc gag gaa ggc gtc aag ctt cag gtc ctc aag aag gga acc
1008Asp Met Phe Glu Glu Gly Val Lys Leu Gln Val Leu Lys Lys Gly Thr
325 330 335atg ttt ccc tcg cgc gcc aac aag ctc tac gag ctc ttt tgc
aag tac 1056Met Phe Pro Ser Arg Ala Asn Lys Leu Tyr Glu Leu Phe Cys
Lys Tyr 340 345 350gac tcg ttc gag tcc atg ccc ccc gca gag ctt gcg
cgc gtc gag aag 1104Asp Ser Phe Glu Ser Met Pro Pro Ala Glu Leu Ala
Arg Val Glu Lys 355 360 365cgc atc ttc agc cgc gcg ctc gaa gag gtc
tgg gac gag acc aaa aac 1152Arg Ile Phe Ser Arg Ala Leu Glu Glu Val
Trp Asp Glu Thr Lys Asn 370 375 380ttt tac att aac cgt ctt cac aac
ccg gag aag atc cag cgc gcc gag 1200Phe Tyr Ile Asn Arg Leu His Asn
Pro Glu Lys Ile Gln Arg Ala Glu385 390 395 400cgc gac ccc aag ctc
aag atg tcg ctg tgc ttt cgc tgg tac ctg agc 1248Arg Asp Pro Lys Leu
Lys Met Ser Leu Cys Phe Arg Trp Tyr Leu Ser 405 410 415ctg gcg agc
cgc tgg gcc aac act gga gct tcc gat cgc gtc atg gac 1296Leu Ala Ser
Arg Trp Ala Asn Thr Gly Ala Ser Asp Arg Val Met Asp 420 425 430tac
cag gtc tgg tgc ggt cct gcc att ggt tcc ttc aac gat ttc atc 1344Tyr
Gln Val Trp Cys Gly Pro Ala Ile Gly Ser Phe Asn Asp Phe Ile 435 440
445aag gga act tac ctt gat ccg gcc gtc gca aac gag tac ccg tgc gtc
1392Lys Gly Thr Tyr Leu Asp Pro Ala Val Ala Asn Glu Tyr Pro Cys Val
450 455 460gtt cag att aac aag cag atc ctt cgt gga gcg tgc ttc ttg
cgc cgt 1440Val Gln Ile Asn Lys Gln Ile Leu Arg Gly Ala Cys Phe Leu
Arg Arg465 470 475 480ctc gaa att ctg cgc aac gca cgc ctt tcc gat
ggc gct gcc gct ctt 1488Leu Glu Ile Leu Arg Asn Ala Arg Leu Ser Asp
Gly Ala Ala Ala Leu 485 490 495gtg gcc agc atc gat gac aca tac gtc
ccg gcc gag aag ctg 1530Val Ala Ser Ile Asp Asp Thr Tyr Val Pro Ala
Glu Lys Leu 500 505 51026510PRTSchizochytrium sp. 26Leu Leu Asp Leu
Asp Ser Met Leu Ala Leu Ser Ser Ala Ser Ala Ser1 5 10 15Gly Asn Leu
Val Glu Thr Ala Pro Ser Asp Ala Ser Val Ile Val Pro 20 25 30Pro Cys
Asn Ile Ala Asp Leu Gly Ser Arg Ala Phe Met Lys Thr Tyr 35 40 45Gly
Val Ser Ala Pro Leu Tyr Thr Gly Ala Met Ala Lys Gly Ile Ala 50 55
60Ser Ala Asp Leu Val Ile Ala Ala Gly Arg Gln Gly Ile Leu Ala Ser65
70 75 80Phe Gly Ala Gly Gly Leu Pro Met Gln Val Val Arg Glu Ser Ile
Glu 85 90 95Lys Ile Gln Ala Ala Leu Pro Asn Gly Pro Tyr Ala Val Asn
Leu Ile 100 105 110His Ser Pro Phe Asp Ser Asn Leu Glu Lys Gly Asn
Val Asp Leu Phe 115 120 125Leu Glu Lys Gly Val Thr Phe Val Glu Ala
Ser Ala Phe Met Thr Leu 130 135 140Thr Pro Gln Val Val Arg Tyr Arg
Ala Ala Gly Leu Thr Arg Asn Ala145 150 155 160Asp Gly Ser Val Asn
Ile Arg Asn Arg Ile Ile Gly Lys Val Ser Arg 165 170 175Thr Glu Leu
Ala Glu Met Phe Met Arg Pro Ala Pro Glu His Leu Leu 180 185 190Gln
Lys Leu Ile Ala Ser Gly Glu Ile Asn Gln Glu Gln Ala Glu Leu 195 200
205Ala Arg Arg Val Pro Val Ala Asp Asp Ile Ala Val Glu Ala Asp Ser
210 215 220Gly Gly His Thr Asp Asn Arg Pro Ile His Val Ile Leu Pro
Leu Ile225 230 235 240Ile Asn Leu Arg Asp Arg Leu His Arg Glu Cys
Gly Tyr Pro Ala Asn 245 250 255Leu Arg Val Arg Val Gly Ala Gly Gly
Gly Ile Gly Cys Pro Gln Ala 260 265 270Ala Leu Ala Thr Phe Asn Met
Gly Ala Ser Phe Ile Val Thr Gly Thr 275 280 285Val Asn Gln Val Ala
Lys Gln Ser Gly Thr Cys Asp Asn Val Arg Lys 290 295 300Gln Leu Ala
Lys Ala Thr Tyr Ser Asp Val Cys Met Ala Pro Ala Ala305 310 315
320Asp Met Phe Glu Glu Gly Val Lys Leu Gln Val Leu Lys Lys Gly Thr
325 330 335Met Phe Pro Ser Arg Ala Asn Lys Leu Tyr Glu Leu Phe Cys
Lys Tyr 340 345 350Asp Ser Phe Glu Ser Met Pro Pro Ala Glu Leu Ala
Arg Val Glu Lys 355 360 365Arg Ile Phe Ser Arg Ala Leu Glu Glu Val
Trp Asp Glu Thr Lys Asn 370 375 380Phe Tyr Ile Asn Arg Leu His Asn
Pro Glu Lys Ile Gln Arg Ala Glu385 390 395 400Arg Asp Pro Lys Leu
Lys Met Ser Leu Cys Phe Arg Trp Tyr Leu Ser 405 410 415Leu Ala Ser
Arg Trp Ala Asn Thr Gly Ala Ser Asp Arg Val Met Asp 420 425 430Tyr
Gln Val Trp Cys Gly Pro Ala Ile Gly Ser Phe Asn Asp Phe Ile 435 440
445Lys Gly Thr Tyr Leu Asp Pro Ala Val Ala Asn Glu Tyr Pro Cys Val
450 455 460Val Gln Ile Asn Lys Gln Ile Leu Arg Gly Ala Cys Phe Leu
Arg Arg465 470 475 480Leu Glu Ile Leu Arg Asn Ala Arg Leu Ser Asp
Gly Ala Ala Ala Leu 485 490 495Val Ala Ser Ile Asp Asp Thr Tyr Val
Pro Ala Glu Lys Leu 500 505 510274512DNASchizochytrium
sp.CDS(1)..(4512) 27atg gcg ctc cgt gtc aag acg aac aag aag cca tgc
tgg gag atg acc 48Met Ala Leu Arg Val Lys Thr Asn Lys Lys Pro Cys
Trp Glu Met Thr1 5 10 15aag gag gag ctg acc agc ggc aag acc gag gtg
ttc aac tat gag gaa 96Lys Glu Glu Leu Thr Ser Gly Lys Thr Glu Val
Phe Asn Tyr Glu Glu 20 25 30ctc ctc gag ttc gca gag ggc gac atc gcc
aag gtc ttc gga ccc gag 144Leu Leu Glu Phe Ala Glu Gly Asp Ile Ala
Lys Val Phe Gly Pro Glu 35 40 45ttc gcc gtc atc gac aag tac ccg cgc
cgc gtg cgc ctg ccc gcc cgc 192Phe Ala Val Ile Asp Lys Tyr Pro Arg
Arg Val Arg Leu Pro Ala Arg 50 55 60gag tac ctg ctc gtg acc cgc gtc
acc ctc atg gac gcc gag gtc aac 240Glu Tyr Leu Leu Val Thr Arg Val
Thr Leu Met Asp Ala Glu Val Asn65 70 75 80aac tac cgc gtc ggc gcc
cgc atg gtc acc gag tac gat ctc ccc gtc 288Asn Tyr Arg Val Gly Ala
Arg Met Val Thr Glu Tyr Asp Leu Pro Val 85 90 95aac gga gag ctc tcc
gag ggc gga gac tgc ccc tgg gcc gtc ctg gtc 336Asn Gly Glu Leu Ser
Glu Gly Gly Asp Cys Pro Trp Ala Val Leu Val 100 105 110gag agt ggc
cag tgc gat ctc atg ctc atc tcc tac atg ggc att gac 384Glu Ser Gly
Gln Cys Asp Leu Met Leu Ile Ser Tyr Met Gly Ile Asp 115 120 125ttc
cag aac cag ggc gac cgc gtc tac cgc ctg ctc aac acc acg ctc 432Phe
Gln Asn Gln Gly Asp Arg Val Tyr Arg Leu Leu Asn Thr Thr Leu 130 135
140acc ttt tac ggc gtg gcc cac gag ggc gag acc ctc gag tac gac att
480Thr Phe Tyr Gly Val Ala His Glu Gly Glu Thr Leu Glu Tyr Asp
Ile145 150 155 160cgc gtc acc ggc ttc gcc aag cgt ctc gac ggc ggc
atc tcc atg ttc 528Arg Val Thr Gly Phe Ala Lys Arg Leu Asp Gly Gly
Ile Ser Met Phe 165 170 175ttc ttc gag tac gac tgc tac gtc aac ggc
cgc ctc ctc atc gag atg 576Phe Phe Glu Tyr Asp Cys Tyr Val Asn Gly
Arg Leu Leu Ile Glu Met 180 185 190cgc gat ggc tgc gcc ggc ttc ttc
acc aac gag gag ctc gac gcc ggc 624Arg Asp Gly Cys Ala Gly Phe Phe
Thr Asn Glu Glu Leu Asp Ala Gly 195 200 205aag ggc gtc gtc ttc acc
cgc ggc gac ctc gcc gcc cgc gcc aag atc 672Lys Gly Val Val Phe Thr
Arg Gly Asp Leu Ala Ala Arg Ala Lys Ile 210 215 220cca aag cag gac
gtc tcc ccc tac gcc gtc gcc ccc tgc ctc cac aag 720Pro Lys Gln Asp
Val Ser Pro Tyr Ala Val Ala Pro Cys Leu His Lys225 230 235 240acc
aag ctc aac gaa aag gag atg cag acc ctc gtc gac aag gac tgg 768Thr
Lys Leu Asn Glu Lys Glu Met Gln Thr Leu Val Asp Lys Asp Trp 245 250
255gca tcc gtc ttt ggc tcc aag aac ggc atg ccg gaa atc aac tac aaa
816Ala Ser Val Phe Gly Ser Lys Asn Gly Met Pro Glu Ile Asn Tyr Lys
260 265 270ctc tgc gcg cgt aag atg ctc atg att gac cgc gtc acc agc
att gac 864Leu Cys Ala Arg Lys Met Leu Met Ile Asp Arg Val Thr Ser
Ile Asp 275 280 285cac aag ggc ggt gtc tac ggc ctc ggt cag ctc gtc
ggt gaa aag atc 912His Lys Gly Gly Val Tyr Gly Leu Gly Gln Leu Val
Gly Glu Lys Ile 290 295 300ctc gag cgc gac cac tgg tac ttt ccc tgc
cac ttt gtc aag gat cag 960Leu Glu Arg Asp His Trp Tyr Phe Pro Cys
His Phe Val Lys Asp Gln305 310 315 320gtc atg gcc gga tcc ctc gtc
tcc gac ggc tgc agc cag atg ctc aag 1008Val Met Ala Gly Ser Leu Val
Ser Asp Gly Cys Ser Gln Met Leu Lys 325 330 335atg tac atg atc tgg
ctc ggc ctc cac ctc acc acc gga ccc ttt gac 1056Met Tyr Met Ile Trp
Leu Gly Leu His Leu Thr Thr Gly Pro Phe Asp 340 345 350ttc cgc ccg
gtc aac ggc cac ccc aac aag gtc cgc tgc cgc ggc caa 1104Phe Arg Pro
Val Asn Gly His Pro Asn Lys Val Arg Cys Arg Gly Gln 355 360 365atc
tcc ccg cac aag ggc aag ctc gtc tac gtc atg gag atc aag gag 1152Ile
Ser Pro His Lys Gly Lys Leu Val Tyr Val Met Glu Ile Lys Glu 370 375
380atg ggc ttc gac gag gac aac gac ccg tac gcc att gcc gac gtc aac
1200Met Gly Phe Asp Glu Asp Asn Asp Pro Tyr Ala Ile Ala Asp Val
Asn385 390 395 400atc att gat gtc gac ttc gaa aag ggc cag gac ttt
agc ctc gac cgc
1248Ile Ile Asp Val Asp Phe Glu Lys Gly Gln Asp Phe Ser Leu Asp Arg
405 410 415atc agc gac tac ggc aag ggc gac ctc aac aag aag atc gtc
gtc gac 1296Ile Ser Asp Tyr Gly Lys Gly Asp Leu Asn Lys Lys Ile Val
Val Asp 420 425 430ttt aag ggc atc gct ctc aag atg cag aag cgc tcc
acc aac aag aac 1344Phe Lys Gly Ile Ala Leu Lys Met Gln Lys Arg Ser
Thr Asn Lys Asn 435 440 445ccc tcc aag gtt cag ccc gtc ttt gcc aac
ggc gcc gcc act gtc ggc 1392Pro Ser Lys Val Gln Pro Val Phe Ala Asn
Gly Ala Ala Thr Val Gly 450 455 460ccc gag gcc tcc aag gct tcc tcc
ggc gcc agc gcc agc gcc agc gcc 1440Pro Glu Ala Ser Lys Ala Ser Ser
Gly Ala Ser Ala Ser Ala Ser Ala465 470 475 480gcc ccg gcc aag cct
gcc ttc agc gcc gat gtt ctt gcg ccc aag ccc 1488Ala Pro Ala Lys Pro
Ala Phe Ser Ala Asp Val Leu Ala Pro Lys Pro 485 490 495gtt gcc ctt
ccc gag cac atc ctc aag ggc gac gcc ctc gcc ccc aag 1536Val Ala Leu
Pro Glu His Ile Leu Lys Gly Asp Ala Leu Ala Pro Lys 500 505 510gag
atg tcc tgg cac ccc atg gcc cgc atc ccg ggc aac ccg acg ccc 1584Glu
Met Ser Trp His Pro Met Ala Arg Ile Pro Gly Asn Pro Thr Pro 515 520
525tct ttt gcg ccc tcg gcc tac aag ccg cgc aac atc gcc ttt acg ccc
1632Ser Phe Ala Pro Ser Ala Tyr Lys Pro Arg Asn Ile Ala Phe Thr Pro
530 535 540ttc ccc ggc aac ccc aac gat aac gac cac acc ccg ggc aag
atg ccg 1680Phe Pro Gly Asn Pro Asn Asp Asn Asp His Thr Pro Gly Lys
Met Pro545 550 555 560ctc acc tgg ttc aac atg gcc gag ttc atg gcc
ggc aag gtc agc atg 1728Leu Thr Trp Phe Asn Met Ala Glu Phe Met Ala
Gly Lys Val Ser Met 565 570 575tgc ctc ggc ccc gag ttc gcc aag ttc
gac gac tcg aac acc agc cgc 1776Cys Leu Gly Pro Glu Phe Ala Lys Phe
Asp Asp Ser Asn Thr Ser Arg 580 585 590agc ccc gct tgg gac ctc gct
ctc gtc acc cgc gcc gtg tct gtg tct 1824Ser Pro Ala Trp Asp Leu Ala
Leu Val Thr Arg Ala Val Ser Val Ser 595 600 605gac ctc aag cac gtc
aac tac cgc aac atc gac ctc gac ccc tcc aag 1872Asp Leu Lys His Val
Asn Tyr Arg Asn Ile Asp Leu Asp Pro Ser Lys 610 615 620ggt acc atg
gtc ggc gag ttc gac tgc ccc gcg gac gcc tgg ttc tac 1920Gly Thr Met
Val Gly Glu Phe Asp Cys Pro Ala Asp Ala Trp Phe Tyr625 630 635
640aag ggc gcc tgc aac gat gcc cac atg ccg tac tcg atc ctc atg gag
1968Lys Gly Ala Cys Asn Asp Ala His Met Pro Tyr Ser Ile Leu Met Glu
645 650 655atc gcc ctc cag acc tcg ggt gtg ctc acc tcg gtg ctc aag
gcg ccc 2016Ile Ala Leu Gln Thr Ser Gly Val Leu Thr Ser Val Leu Lys
Ala Pro 660 665 670ctg acc atg gag aag gac gac atc ctc ttc cgc aac
ctc gac gcc aac 2064Leu Thr Met Glu Lys Asp Asp Ile Leu Phe Arg Asn
Leu Asp Ala Asn 675 680 685gcc gag ttc gtg cgc gcc gac ctc gac tac
cgc ggc aag act atc cgc 2112Ala Glu Phe Val Arg Ala Asp Leu Asp Tyr
Arg Gly Lys Thr Ile Arg 690 695 700aac gtc acc aag tgc act ggc tac
agc atg ctc ggc gag atg ggc gtc 2160Asn Val Thr Lys Cys Thr Gly Tyr
Ser Met Leu Gly Glu Met Gly Val705 710 715 720cac cgc ttc acc ttt
gag ctc tac gtc gat gat gtg ctc ttt tac aag 2208His Arg Phe Thr Phe
Glu Leu Tyr Val Asp Asp Val Leu Phe Tyr Lys 725 730 735ggc tcg acc
tcg ttc ggc tgg ttc gtg ccc gag gtc ttt gcc gcc cag 2256Gly Ser Thr
Ser Phe Gly Trp Phe Val Pro Glu Val Phe Ala Ala Gln 740 745 750gcc
ggc ctc gac aac ggc cgc aag tcg gag ccc tgg ttc att gag aac 2304Ala
Gly Leu Asp Asn Gly Arg Lys Ser Glu Pro Trp Phe Ile Glu Asn 755 760
765aag gtt ccg gcc tcg cag gtc tcc tcc ttt gac gtg cgc ccc aac ggc
2352Lys Val Pro Ala Ser Gln Val Ser Ser Phe Asp Val Arg Pro Asn Gly
770 775 780agc ggc cgc acc gcc atc ttc gcc aac gcc ccc agc ggc gcc
cag ctc 2400Ser Gly Arg Thr Ala Ile Phe Ala Asn Ala Pro Ser Gly Ala
Gln Leu785 790 795 800aac cgc cgc acg gac cag ggc cag tac ctc gac
gcc gtc gac att gtc 2448Asn Arg Arg Thr Asp Gln Gly Gln Tyr Leu Asp
Ala Val Asp Ile Val 805 810 815tcc ggc agc ggc aag aag agc ctc ggc
tac gcc cac ggt tcc aag acg 2496Ser Gly Ser Gly Lys Lys Ser Leu Gly
Tyr Ala His Gly Ser Lys Thr 820 825 830gtc aac ccg aac gac tgg ttc
ttc tcg tgc cac ttt tgg ttt gac tcg 2544Val Asn Pro Asn Asp Trp Phe
Phe Ser Cys His Phe Trp Phe Asp Ser 835 840 845gtc atg ccc gga agt
ctc ggt gtc gag tcc atg ttc cag ctc gtc gag 2592Val Met Pro Gly Ser
Leu Gly Val Glu Ser Met Phe Gln Leu Val Glu 850 855 860gcc atc gcc
gcc cac gag gat ctc gct ggc aaa gca cgg cat tgc caa 2640Ala Ile Ala
Ala His Glu Asp Leu Ala Gly Lys Ala Arg His Cys Gln865 870 875
880ccc cac ctt tgt gca cgc ccc cgg gca aga tca agc tgg aag tac cgc
2688Pro His Leu Cys Ala Arg Pro Arg Ala Arg Ser Ser Trp Lys Tyr Arg
885 890 895ggc cag ctc acg ccc aag agc aag aag atg gac tcg gag gtc
cac atc 2736Gly Gln Leu Thr Pro Lys Ser Lys Lys Met Asp Ser Glu Val
His Ile 900 905 910gtg tcc gtg gac gcc cac gac ggc gtt gtc gac ctc
gtc gcc gac ggc 2784Val Ser Val Asp Ala His Asp Gly Val Val Asp Leu
Val Ala Asp Gly 915 920 925ttc ctc tgg gcc gac agc ctc cgc gtc tac
tcg gtg agc aac att cgc 2832Phe Leu Trp Ala Asp Ser Leu Arg Val Tyr
Ser Val Ser Asn Ile Arg 930 935 940gtg cgc atc gcc tcc ggt gag gcc
cct gcc gcc gcc tcc tcc gcc gcc 2880Val Arg Ile Ala Ser Gly Glu Ala
Pro Ala Ala Ala Ser Ser Ala Ala945 950 955 960tct gtg ggc tcc tcg
gct tcg tcc gtc gag cgc acg cgc tcg agc ccc 2928Ser Val Gly Ser Ser
Ala Ser Ser Val Glu Arg Thr Arg Ser Ser Pro 965 970 975gct gtc gcc
tcc ggc ccg gcc cag acc atc gac ctc aag cag ctc aag 2976Ala Val Ala
Ser Gly Pro Ala Gln Thr Ile Asp Leu Lys Gln Leu Lys 980 985 990acc
gag ctc ctc gag ctc gat gcc ccg ctc tac ctc tcg cag gac ccg 3024Thr
Glu Leu Leu Glu Leu Asp Ala Pro Leu Tyr Leu Ser Gln Asp Pro 995
1000 1005acc agc ggc cag ctc aag aag cac acc gac gtg gcc tcc ggc
cag 3069Thr Ser Gly Gln Leu Lys Lys His Thr Asp Val Ala Ser Gly Gln
1010 1015 1020gcc acc atc gtg cag ccc tgc acg ctc ggc gac ctc ggt
gac cgc 3114Ala Thr Ile Val Gln Pro Cys Thr Leu Gly Asp Leu Gly Asp
Arg 1025 1030 1035tcc ttc atg gag acc tac ggc gtc gtc gcc ccg ctg
tac acg ggc 3159Ser Phe Met Glu Thr Tyr Gly Val Val Ala Pro Leu Tyr
Thr Gly 1040 1045 1050gcc atg gcc aag ggc att gcc tcg gcg gac ctc
gtc atc gcc gcc 3204Ala Met Ala Lys Gly Ile Ala Ser Ala Asp Leu Val
Ile Ala Ala 1055 1060 1065ggc aag cgc aag atc ctc ggc tcc ttt ggc
gcc ggc ggc ctc ccc 3249Gly Lys Arg Lys Ile Leu Gly Ser Phe Gly Ala
Gly Gly Leu Pro 1070 1075 1080atg cac cac gtg cgc gcc gcc ctc gag
aag atc cag gcc gcc ctg 3294Met His His Val Arg Ala Ala Leu Glu Lys
Ile Gln Ala Ala Leu 1085 1090 1095cct cag ggc ccc tac gcc gtc aac
ctc atc cac tcg cct ttt gac 3339Pro Gln Gly Pro Tyr Ala Val Asn Leu
Ile His Ser Pro Phe Asp 1100 1105 1110agc aac ctc gag aag ggc aac
gtc gat ctc ttc ctc gag aag ggc 3384Ser Asn Leu Glu Lys Gly Asn Val
Asp Leu Phe Leu Glu Lys Gly 1115 1120 1125gtc act gtg gtg gag gcc
tcg gca ttc atg acc ctc acc ccg cag 3429Val Thr Val Val Glu Ala Ser
Ala Phe Met Thr Leu Thr Pro Gln 1130 1135 1140gtc gtg cgc tac cgc
gcc gcc ggc ctc tcg cgc aac gcc gac ggt 3474Val Val Arg Tyr Arg Ala
Ala Gly Leu Ser Arg Asn Ala Asp Gly 1145 1150 1155tcg gtc aac atc
cgc aac cgc atc atc ggc aag gtc tcg cgc acc 3519Ser Val Asn Ile Arg
Asn Arg Ile Ile Gly Lys Val Ser Arg Thr 1160 1165 1170gag ctc gcc
gag atg ttc atc cgc ccg gcc ccg gag cac ctc ctc 3564Glu Leu Ala Glu
Met Phe Ile Arg Pro Ala Pro Glu His Leu Leu 1175 1180 1185gag aag
ctc atc gcc tcg ggc gag atc acc cag gag cag gcc gag 3609Glu Lys Leu
Ile Ala Ser Gly Glu Ile Thr Gln Glu Gln Ala Glu 1190 1195 1200ctc
gcg cgc cgc gtt ccc gtc gcc gac gat atc gct gtc gag gct 3654Leu Ala
Arg Arg Val Pro Val Ala Asp Asp Ile Ala Val Glu Ala 1205 1210
1215gac tcg ggc ggc cac acc gac aac cgc ccc atc cac gtc atc ctc
3699Asp Ser Gly Gly His Thr Asp Asn Arg Pro Ile His Val Ile Leu
1220 1225 1230ccg ctc atc atc aac ctc cgc aac cgc ctg cac cgc gag
tgc ggc 3744Pro Leu Ile Ile Asn Leu Arg Asn Arg Leu His Arg Glu Cys
Gly 1235 1240 1245tac ccc gcg cac ctc cgc gtc cgc gtt ggc gcc ggc
ggt ggc gtc 3789Tyr Pro Ala His Leu Arg Val Arg Val Gly Ala Gly Gly
Gly Val 1250 1255 1260ggc tgc ccg cag gcc gcc gcc gcc gcg ctc acc
atg ggc gcc gcc 3834Gly Cys Pro Gln Ala Ala Ala Ala Ala Leu Thr Met
Gly Ala Ala 1265 1270 1275ttc atc gtc acc ggc act gtc aac cag gtc
gcc aag cag tcc ggc 3879Phe Ile Val Thr Gly Thr Val Asn Gln Val Ala
Lys Gln Ser Gly 1280 1285 1290acc tgc gac aac gtg cgc aag cag ctc
tcg cag gcc acc tac tcg 3924Thr Cys Asp Asn Val Arg Lys Gln Leu Ser
Gln Ala Thr Tyr Ser 1295 1300 1305gat atc tgc atg gcc ccg gcc gcc
gac atg ttc gag gag ggc gtc 3969Asp Ile Cys Met Ala Pro Ala Ala Asp
Met Phe Glu Glu Gly Val 1310 1315 1320aag ctc cag gtc ctc aag aag
gga acc atg ttc ccc tcg cgc gcc 4014Lys Leu Gln Val Leu Lys Lys Gly
Thr Met Phe Pro Ser Arg Ala 1325 1330 1335aac aag ctc tac gag ctc
ttt tgc aag tac gac tcc ttc gac tcc 4059Asn Lys Leu Tyr Glu Leu Phe
Cys Lys Tyr Asp Ser Phe Asp Ser 1340 1345 1350atg cct cct gcc gag
ctc gag cgc atc gag aag cgt atc ttc aag 4104Met Pro Pro Ala Glu Leu
Glu Arg Ile Glu Lys Arg Ile Phe Lys 1355 1360 1365cgc gca ctc cag
gag gtc tgg gag gag acc aag gac ttt tac att 4149Arg Ala Leu Gln Glu
Val Trp Glu Glu Thr Lys Asp Phe Tyr Ile 1370 1375 1380aac ggt ctc
aag aac ccg gag aag atc cag cgc gcc gag cac gac 4194Asn Gly Leu Lys
Asn Pro Glu Lys Ile Gln Arg Ala Glu His Asp 1385 1390 1395ccc aag
ctc aag atg tcg ctc tgc ttc cgc tgg tac ctt ggt ctt 4239Pro Lys Leu
Lys Met Ser Leu Cys Phe Arg Trp Tyr Leu Gly Leu 1400 1405 1410gcc
agc cgc tgg gcc aac atg ggc gcc ccg gac cgc gtc atg gac 4284Ala Ser
Arg Trp Ala Asn Met Gly Ala Pro Asp Arg Val Met Asp 1415 1420
1425tac cag gtc tgg tgt ggc ccg gcc att ggc gcc ttc aac gac ttc
4329Tyr Gln Val Trp Cys Gly Pro Ala Ile Gly Ala Phe Asn Asp Phe
1430 1435 1440atc aag ggc acc tac ctc gac ccc gct gtc tcc aac gag
tac ccc 4374Ile Lys Gly Thr Tyr Leu Asp Pro Ala Val Ser Asn Glu Tyr
Pro 1445 1450 1455tgt gtc gtc cag atc aac ctg caa atc ctc cgt ggt
gcc tgc tac 4419Cys Val Val Gln Ile Asn Leu Gln Ile Leu Arg Gly Ala
Cys Tyr 1460 1465 1470ctg cgc cgt ctc aac gcc ctg cgc aac gac ccg
cgc att gac ctc 4464Leu Arg Arg Leu Asn Ala Leu Arg Asn Asp Pro Arg
Ile Asp Leu 1475 1480 1485gag acc gag gat gct gcc ttt gtc tac gag
ccc acc aac gcg ctc 4509Glu Thr Glu Asp Ala Ala Phe Val Tyr Glu Pro
Thr Asn Ala Leu 1490 1495 1500taa 4512281503PRTSchizochytrium sp.
28Met Ala Leu Arg Val Lys Thr Asn Lys Lys Pro Cys Trp Glu Met Thr1
5 10 15Lys Glu Glu Leu Thr Ser Gly Lys Thr Glu Val Phe Asn Tyr Glu
Glu 20 25 30Leu Leu Glu Phe Ala Glu Gly Asp Ile Ala Lys Val Phe Gly
Pro Glu 35 40 45Phe Ala Val Ile Asp Lys Tyr Pro Arg Arg Val Arg Leu
Pro Ala Arg 50 55 60Glu Tyr Leu Leu Val Thr Arg Val Thr Leu Met Asp
Ala Glu Val Asn65 70 75 80Asn Tyr Arg Val Gly Ala Arg Met Val Thr
Glu Tyr Asp Leu Pro Val 85 90 95Asn Gly Glu Leu Ser Glu Gly Gly Asp
Cys Pro Trp Ala Val Leu Val 100 105 110Glu Ser Gly Gln Cys Asp Leu
Met Leu Ile Ser Tyr Met Gly Ile Asp 115 120 125Phe Gln Asn Gln Gly
Asp Arg Val Tyr Arg Leu Leu Asn Thr Thr Leu 130 135 140Thr Phe Tyr
Gly Val Ala His Glu Gly Glu Thr Leu Glu Tyr Asp Ile145 150 155
160Arg Val Thr Gly Phe Ala Lys Arg Leu Asp Gly Gly Ile Ser Met Phe
165 170 175Phe Phe Glu Tyr Asp Cys Tyr Val Asn Gly Arg Leu Leu Ile
Glu Met 180 185 190Arg Asp Gly Cys Ala Gly Phe Phe Thr Asn Glu Glu
Leu Asp Ala Gly 195 200 205Lys Gly Val Val Phe Thr Arg Gly Asp Leu
Ala Ala Arg Ala Lys Ile 210 215 220Pro Lys Gln Asp Val Ser Pro Tyr
Ala Val Ala Pro Cys Leu His Lys225 230 235 240Thr Lys Leu Asn Glu
Lys Glu Met Gln Thr Leu Val Asp Lys Asp Trp 245 250 255Ala Ser Val
Phe Gly Ser Lys Asn Gly Met Pro Glu Ile Asn Tyr Lys 260 265 270Leu
Cys Ala Arg Lys Met Leu Met Ile Asp Arg Val Thr Ser Ile Asp 275 280
285His Lys Gly Gly Val Tyr Gly Leu Gly Gln Leu Val Gly Glu Lys Ile
290 295 300Leu Glu Arg Asp His Trp Tyr Phe Pro Cys His Phe Val Lys
Asp Gln305 310 315 320Val Met Ala Gly Ser Leu Val Ser Asp Gly Cys
Ser Gln Met Leu Lys 325 330 335Met Tyr Met Ile Trp Leu Gly Leu His
Leu Thr Thr Gly Pro Phe Asp 340 345 350Phe Arg Pro Val Asn Gly His
Pro Asn Lys Val Arg Cys Arg Gly Gln 355 360 365Ile Ser Pro His Lys
Gly Lys Leu Val Tyr Val Met Glu Ile Lys Glu 370 375 380Met Gly Phe
Asp Glu Asp Asn Asp Pro Tyr Ala Ile Ala Asp Val Asn385 390 395
400Ile Ile Asp Val Asp Phe Glu Lys Gly Gln Asp Phe Ser Leu Asp Arg
405 410 415Ile Ser Asp Tyr Gly Lys Gly Asp Leu Asn Lys Lys Ile Val
Val Asp 420 425 430Phe Lys Gly Ile Ala Leu Lys Met Gln Lys Arg Ser
Thr Asn Lys Asn 435 440 445Pro Ser Lys Val Gln Pro Val Phe Ala Asn
Gly Ala Ala Thr Val Gly 450 455 460Pro Glu Ala Ser Lys Ala Ser Ser
Gly Ala Ser Ala Ser Ala Ser Ala465 470 475 480Ala Pro Ala Lys Pro
Ala Phe Ser Ala Asp Val Leu Ala Pro Lys Pro 485 490 495Val Ala Leu
Pro Glu His Ile Leu Lys Gly Asp Ala Leu Ala Pro Lys 500 505 510Glu
Met Ser Trp His Pro Met Ala Arg Ile Pro Gly Asn Pro Thr Pro 515 520
525Ser Phe Ala Pro Ser Ala Tyr Lys Pro Arg Asn Ile Ala Phe Thr Pro
530 535 540Phe Pro Gly Asn Pro Asn Asp Asn Asp His Thr Pro Gly Lys
Met Pro545 550 555 560Leu Thr Trp Phe Asn Met Ala Glu Phe Met Ala
Gly Lys Val Ser Met 565 570 575Cys Leu Gly Pro Glu Phe Ala Lys Phe
Asp Asp Ser Asn Thr Ser Arg 580 585 590Ser Pro Ala Trp Asp Leu Ala
Leu Val Thr Arg Ala Val Ser Val Ser 595 600 605Asp Leu Lys His Val
Asn Tyr Arg Asn Ile Asp Leu Asp Pro Ser Lys 610 615 620Gly Thr Met
Val Gly Glu Phe Asp Cys Pro Ala Asp Ala Trp Phe Tyr625 630 635
640Lys Gly Ala Cys Asn Asp Ala His Met Pro Tyr Ser Ile Leu Met Glu
645 650 655Ile Ala Leu Gln Thr Ser Gly Val Leu Thr Ser Val Leu Lys
Ala Pro
660 665 670Leu Thr Met Glu Lys Asp Asp Ile Leu Phe Arg Asn Leu Asp
Ala Asn 675 680 685Ala Glu Phe Val Arg Ala Asp Leu Asp Tyr Arg Gly
Lys Thr Ile Arg 690 695 700Asn Val Thr Lys Cys Thr Gly Tyr Ser Met
Leu Gly Glu Met Gly Val705 710 715 720His Arg Phe Thr Phe Glu Leu
Tyr Val Asp Asp Val Leu Phe Tyr Lys 725 730 735Gly Ser Thr Ser Phe
Gly Trp Phe Val Pro Glu Val Phe Ala Ala Gln 740 745 750Ala Gly Leu
Asp Asn Gly Arg Lys Ser Glu Pro Trp Phe Ile Glu Asn 755 760 765Lys
Val Pro Ala Ser Gln Val Ser Ser Phe Asp Val Arg Pro Asn Gly 770 775
780Ser Gly Arg Thr Ala Ile Phe Ala Asn Ala Pro Ser Gly Ala Gln
Leu785 790 795 800Asn Arg Arg Thr Asp Gln Gly Gln Tyr Leu Asp Ala
Val Asp Ile Val 805 810 815Ser Gly Ser Gly Lys Lys Ser Leu Gly Tyr
Ala His Gly Ser Lys Thr 820 825 830Val Asn Pro Asn Asp Trp Phe Phe
Ser Cys His Phe Trp Phe Asp Ser 835 840 845Val Met Pro Gly Ser Leu
Gly Val Glu Ser Met Phe Gln Leu Val Glu 850 855 860Ala Ile Ala Ala
His Glu Asp Leu Ala Gly Lys Ala Arg His Cys Gln865 870 875 880Pro
His Leu Cys Ala Arg Pro Arg Ala Arg Ser Ser Trp Lys Tyr Arg 885 890
895Gly Gln Leu Thr Pro Lys Ser Lys Lys Met Asp Ser Glu Val His Ile
900 905 910Val Ser Val Asp Ala His Asp Gly Val Val Asp Leu Val Ala
Asp Gly 915 920 925Phe Leu Trp Ala Asp Ser Leu Arg Val Tyr Ser Val
Ser Asn Ile Arg 930 935 940Val Arg Ile Ala Ser Gly Glu Ala Pro Ala
Ala Ala Ser Ser Ala Ala945 950 955 960Ser Val Gly Ser Ser Ala Ser
Ser Val Glu Arg Thr Arg Ser Ser Pro 965 970 975Ala Val Ala Ser Gly
Pro Ala Gln Thr Ile Asp Leu Lys Gln Leu Lys 980 985 990Thr Glu Leu
Leu Glu Leu Asp Ala Pro Leu Tyr Leu Ser Gln Asp Pro 995 1000
1005Thr Ser Gly Gln Leu Lys Lys His Thr Asp Val Ala Ser Gly Gln
1010 1015 1020Ala Thr Ile Val Gln Pro Cys Thr Leu Gly Asp Leu Gly
Asp Arg 1025 1030 1035Ser Phe Met Glu Thr Tyr Gly Val Val Ala Pro
Leu Tyr Thr Gly 1040 1045 1050Ala Met Ala Lys Gly Ile Ala Ser Ala
Asp Leu Val Ile Ala Ala 1055 1060 1065Gly Lys Arg Lys Ile Leu Gly
Ser Phe Gly Ala Gly Gly Leu Pro 1070 1075 1080Met His His Val Arg
Ala Ala Leu Glu Lys Ile Gln Ala Ala Leu 1085 1090 1095Pro Gln Gly
Pro Tyr Ala Val Asn Leu Ile His Ser Pro Phe Asp 1100 1105 1110Ser
Asn Leu Glu Lys Gly Asn Val Asp Leu Phe Leu Glu Lys Gly 1115 1120
1125Val Thr Val Val Glu Ala Ser Ala Phe Met Thr Leu Thr Pro Gln
1130 1135 1140Val Val Arg Tyr Arg Ala Ala Gly Leu Ser Arg Asn Ala
Asp Gly 1145 1150 1155Ser Val Asn Ile Arg Asn Arg Ile Ile Gly Lys
Val Ser Arg Thr 1160 1165 1170Glu Leu Ala Glu Met Phe Ile Arg Pro
Ala Pro Glu His Leu Leu 1175 1180 1185Glu Lys Leu Ile Ala Ser Gly
Glu Ile Thr Gln Glu Gln Ala Glu 1190 1195 1200Leu Ala Arg Arg Val
Pro Val Ala Asp Asp Ile Ala Val Glu Ala 1205 1210 1215Asp Ser Gly
Gly His Thr Asp Asn Arg Pro Ile His Val Ile Leu 1220 1225 1230Pro
Leu Ile Ile Asn Leu Arg Asn Arg Leu His Arg Glu Cys Gly 1235 1240
1245Tyr Pro Ala His Leu Arg Val Arg Val Gly Ala Gly Gly Gly Val
1250 1255 1260Gly Cys Pro Gln Ala Ala Ala Ala Ala Leu Thr Met Gly
Ala Ala 1265 1270 1275Phe Ile Val Thr Gly Thr Val Asn Gln Val Ala
Lys Gln Ser Gly 1280 1285 1290Thr Cys Asp Asn Val Arg Lys Gln Leu
Ser Gln Ala Thr Tyr Ser 1295 1300 1305Asp Ile Cys Met Ala Pro Ala
Ala Asp Met Phe Glu Glu Gly Val 1310 1315 1320Lys Leu Gln Val Leu
Lys Lys Gly Thr Met Phe Pro Ser Arg Ala 1325 1330 1335Asn Lys Leu
Tyr Glu Leu Phe Cys Lys Tyr Asp Ser Phe Asp Ser 1340 1345 1350Met
Pro Pro Ala Glu Leu Glu Arg Ile Glu Lys Arg Ile Phe Lys 1355 1360
1365Arg Ala Leu Gln Glu Val Trp Glu Glu Thr Lys Asp Phe Tyr Ile
1370 1375 1380Asn Gly Leu Lys Asn Pro Glu Lys Ile Gln Arg Ala Glu
His Asp 1385 1390 1395Pro Lys Leu Lys Met Ser Leu Cys Phe Arg Trp
Tyr Leu Gly Leu 1400 1405 1410Ala Ser Arg Trp Ala Asn Met Gly Ala
Pro Asp Arg Val Met Asp 1415 1420 1425Tyr Gln Val Trp Cys Gly Pro
Ala Ile Gly Ala Phe Asn Asp Phe 1430 1435 1440Ile Lys Gly Thr Tyr
Leu Asp Pro Ala Val Ser Asn Glu Tyr Pro 1445 1450 1455Cys Val Val
Gln Ile Asn Leu Gln Ile Leu Arg Gly Ala Cys Tyr 1460 1465 1470Leu
Arg Arg Leu Asn Ala Leu Arg Asn Asp Pro Arg Ile Asp Leu 1475 1480
1485Glu Thr Glu Asp Ala Ala Phe Val Tyr Glu Pro Thr Asn Ala Leu
1490 1495 1500291500DNASchizochytrium sp.CDS(1)..(1500) 29aag gtt
cag ccc gtc ttt gcc aac ggc gcc gcc act gtc ggc ccc gag 48Lys Val
Gln Pro Val Phe Ala Asn Gly Ala Ala Thr Val Gly Pro Glu1 5 10 15gcc
tcc aag gct tcc tcc ggc gcc agc gcc agc gcc agc gcc gcc ccg 96Ala
Ser Lys Ala Ser Ser Gly Ala Ser Ala Ser Ala Ser Ala Ala Pro 20 25
30gcc aag cct gcc ttc agc gcc gat gtt ctt gcg ccc aag ccc gtt gcc
144Ala Lys Pro Ala Phe Ser Ala Asp Val Leu Ala Pro Lys Pro Val Ala
35 40 45ctt ccc gag cac atc ctc aag ggc gac gcc ctc gcc ccc aag gag
atg 192Leu Pro Glu His Ile Leu Lys Gly Asp Ala Leu Ala Pro Lys Glu
Met 50 55 60tcc tgg cac ccc atg gcc cgc atc ccg ggc aac ccg acg ccc
tct ttt 240Ser Trp His Pro Met Ala Arg Ile Pro Gly Asn Pro Thr Pro
Ser Phe65 70 75 80gcg ccc tcg gcc tac aag ccg cgc aac atc gcc ttt
acg ccc ttc ccc 288Ala Pro Ser Ala Tyr Lys Pro Arg Asn Ile Ala Phe
Thr Pro Phe Pro 85 90 95ggc aac ccc aac gat aac gac cac acc ccg ggc
aag atg ccg ctc acc 336Gly Asn Pro Asn Asp Asn Asp His Thr Pro Gly
Lys Met Pro Leu Thr 100 105 110tgg ttc aac atg gcc gag ttc atg gcc
ggc aag gtc agc atg tgc ctc 384Trp Phe Asn Met Ala Glu Phe Met Ala
Gly Lys Val Ser Met Cys Leu 115 120 125ggc ccc gag ttc gcc aag ttc
gac gac tcg aac acc agc cgc agc ccc 432Gly Pro Glu Phe Ala Lys Phe
Asp Asp Ser Asn Thr Ser Arg Ser Pro 130 135 140gct tgg gac ctc gct
ctc gtc acc cgc gcc gtg tct gtg tct gac ctc 480Ala Trp Asp Leu Ala
Leu Val Thr Arg Ala Val Ser Val Ser Asp Leu145 150 155 160aag cac
gtc aac tac cgc aac atc gac ctc gac ccc tcc aag ggt acc 528Lys His
Val Asn Tyr Arg Asn Ile Asp Leu Asp Pro Ser Lys Gly Thr 165 170
175atg gtc ggc gag ttc gac tgc ccc gcg gac gcc tgg ttc tac aag ggc
576Met Val Gly Glu Phe Asp Cys Pro Ala Asp Ala Trp Phe Tyr Lys Gly
180 185 190gcc tgc aac gat gcc cac atg ccg tac tcg atc ctc atg gag
atc gcc 624Ala Cys Asn Asp Ala His Met Pro Tyr Ser Ile Leu Met Glu
Ile Ala 195 200 205ctc cag acc tcg ggt gtg ctc acc tcg gtg ctc aag
gcg ccc ctg acc 672Leu Gln Thr Ser Gly Val Leu Thr Ser Val Leu Lys
Ala Pro Leu Thr 210 215 220atg gag aag gac gac atc ctc ttc cgc aac
ctc gac gcc aac gcc gag 720Met Glu Lys Asp Asp Ile Leu Phe Arg Asn
Leu Asp Ala Asn Ala Glu225 230 235 240ttc gtg cgc gcc gac ctc gac
tac cgc ggc aag act atc cgc aac gtc 768Phe Val Arg Ala Asp Leu Asp
Tyr Arg Gly Lys Thr Ile Arg Asn Val 245 250 255acc aag tgc act ggc
tac agc atg ctc ggc gag atg ggc gtc cac cgc 816Thr Lys Cys Thr Gly
Tyr Ser Met Leu Gly Glu Met Gly Val His Arg 260 265 270ttc acc ttt
gag ctc tac gtc gat gat gtg ctc ttt tac aag ggc tcg 864Phe Thr Phe
Glu Leu Tyr Val Asp Asp Val Leu Phe Tyr Lys Gly Ser 275 280 285acc
tcg ttc ggc tgg ttc gtg ccc gag gtc ttt gcc gcc cag gcc ggc 912Thr
Ser Phe Gly Trp Phe Val Pro Glu Val Phe Ala Ala Gln Ala Gly 290 295
300ctc gac aac ggc cgc aag tcg gag ccc tgg ttc att gag aac aag gtt
960Leu Asp Asn Gly Arg Lys Ser Glu Pro Trp Phe Ile Glu Asn Lys
Val305 310 315 320ccg gcc tcg cag gtc tcc tcc ttt gac gtg cgc ccc
aac ggc agc ggc 1008Pro Ala Ser Gln Val Ser Ser Phe Asp Val Arg Pro
Asn Gly Ser Gly 325 330 335cgc acc gcc atc ttc gcc aac gcc ccc agc
ggc gcc cag ctc aac cgc 1056Arg Thr Ala Ile Phe Ala Asn Ala Pro Ser
Gly Ala Gln Leu Asn Arg 340 345 350cgc acg gac cag ggc cag tac ctc
gac gcc gtc gac att gtc tcc ggc 1104Arg Thr Asp Gln Gly Gln Tyr Leu
Asp Ala Val Asp Ile Val Ser Gly 355 360 365agc ggc aag aag agc ctc
ggc tac gcc cac ggt tcc aag acg gtc aac 1152Ser Gly Lys Lys Ser Leu
Gly Tyr Ala His Gly Ser Lys Thr Val Asn 370 375 380ccg aac gac tgg
ttc ttc tcg tgc cac ttt tgg ttt gac tcg gtc atg 1200Pro Asn Asp Trp
Phe Phe Ser Cys His Phe Trp Phe Asp Ser Val Met385 390 395 400ccc
gga agt ctc ggt gtc gag tcc atg ttc cag ctc gtc gag gcc atc 1248Pro
Gly Ser Leu Gly Val Glu Ser Met Phe Gln Leu Val Glu Ala Ile 405 410
415gcc gcc cac gag gat ctc gct ggc aaa gca cgg cat tgc caa ccc cac
1296Ala Ala His Glu Asp Leu Ala Gly Lys Ala Arg His Cys Gln Pro His
420 425 430ctt tgt gca cgc ccc cgg gca aga tca agc tgg aag tac cgc
ggc cag 1344Leu Cys Ala Arg Pro Arg Ala Arg Ser Ser Trp Lys Tyr Arg
Gly Gln 435 440 445ctc acg ccc aag agc aag aag atg gac tcg gag gtc
cac atc gtg tcc 1392Leu Thr Pro Lys Ser Lys Lys Met Asp Ser Glu Val
His Ile Val Ser 450 455 460gtg gac gcc cac gac ggc gtt gtc gac ctc
gtc gcc gac ggc ttc ctc 1440Val Asp Ala His Asp Gly Val Val Asp Leu
Val Ala Asp Gly Phe Leu465 470 475 480tgg gcc gac agc ctc cgc gtc
tac tcg gtg agc aac att cgc gtg cgc 1488Trp Ala Asp Ser Leu Arg Val
Tyr Ser Val Ser Asn Ile Arg Val Arg 485 490 495atc gcc tcc ggt
1500Ile Ala Ser Gly 50030500PRTSchizochytrium sp. 30Lys Val Gln Pro
Val Phe Ala Asn Gly Ala Ala Thr Val Gly Pro Glu1 5 10 15Ala Ser Lys
Ala Ser Ser Gly Ala Ser Ala Ser Ala Ser Ala Ala Pro 20 25 30Ala Lys
Pro Ala Phe Ser Ala Asp Val Leu Ala Pro Lys Pro Val Ala 35 40 45Leu
Pro Glu His Ile Leu Lys Gly Asp Ala Leu Ala Pro Lys Glu Met 50 55
60Ser Trp His Pro Met Ala Arg Ile Pro Gly Asn Pro Thr Pro Ser Phe65
70 75 80Ala Pro Ser Ala Tyr Lys Pro Arg Asn Ile Ala Phe Thr Pro Phe
Pro 85 90 95Gly Asn Pro Asn Asp Asn Asp His Thr Pro Gly Lys Met Pro
Leu Thr 100 105 110Trp Phe Asn Met Ala Glu Phe Met Ala Gly Lys Val
Ser Met Cys Leu 115 120 125Gly Pro Glu Phe Ala Lys Phe Asp Asp Ser
Asn Thr Ser Arg Ser Pro 130 135 140Ala Trp Asp Leu Ala Leu Val Thr
Arg Ala Val Ser Val Ser Asp Leu145 150 155 160Lys His Val Asn Tyr
Arg Asn Ile Asp Leu Asp Pro Ser Lys Gly Thr 165 170 175Met Val Gly
Glu Phe Asp Cys Pro Ala Asp Ala Trp Phe Tyr Lys Gly 180 185 190Ala
Cys Asn Asp Ala His Met Pro Tyr Ser Ile Leu Met Glu Ile Ala 195 200
205Leu Gln Thr Ser Gly Val Leu Thr Ser Val Leu Lys Ala Pro Leu Thr
210 215 220Met Glu Lys Asp Asp Ile Leu Phe Arg Asn Leu Asp Ala Asn
Ala Glu225 230 235 240Phe Val Arg Ala Asp Leu Asp Tyr Arg Gly Lys
Thr Ile Arg Asn Val 245 250 255Thr Lys Cys Thr Gly Tyr Ser Met Leu
Gly Glu Met Gly Val His Arg 260 265 270Phe Thr Phe Glu Leu Tyr Val
Asp Asp Val Leu Phe Tyr Lys Gly Ser 275 280 285Thr Ser Phe Gly Trp
Phe Val Pro Glu Val Phe Ala Ala Gln Ala Gly 290 295 300Leu Asp Asn
Gly Arg Lys Ser Glu Pro Trp Phe Ile Glu Asn Lys Val305 310 315
320Pro Ala Ser Gln Val Ser Ser Phe Asp Val Arg Pro Asn Gly Ser Gly
325 330 335Arg Thr Ala Ile Phe Ala Asn Ala Pro Ser Gly Ala Gln Leu
Asn Arg 340 345 350Arg Thr Asp Gln Gly Gln Tyr Leu Asp Ala Val Asp
Ile Val Ser Gly 355 360 365Ser Gly Lys Lys Ser Leu Gly Tyr Ala His
Gly Ser Lys Thr Val Asn 370 375 380Pro Asn Asp Trp Phe Phe Ser Cys
His Phe Trp Phe Asp Ser Val Met385 390 395 400Pro Gly Ser Leu Gly
Val Glu Ser Met Phe Gln Leu Val Glu Ala Ile 405 410 415Ala Ala His
Glu Asp Leu Ala Gly Lys Ala Arg His Cys Gln Pro His 420 425 430Leu
Cys Ala Arg Pro Arg Ala Arg Ser Ser Trp Lys Tyr Arg Gly Gln 435 440
445Leu Thr Pro Lys Ser Lys Lys Met Asp Ser Glu Val His Ile Val Ser
450 455 460Val Asp Ala His Asp Gly Val Val Asp Leu Val Ala Asp Gly
Phe Leu465 470 475 480Trp Ala Asp Ser Leu Arg Val Tyr Ser Val Ser
Asn Ile Arg Val Arg 485 490 495Ile Ala Ser Gly
500311512DNASchizochytrium sp.CDS(1)..(1512) 31gcc ccg ctc tac ctc
tcg cag gac ccg acc agc ggc cag ctc aag aag 48Ala Pro Leu Tyr Leu
Ser Gln Asp Pro Thr Ser Gly Gln Leu Lys Lys1 5 10 15cac acc gac gtg
gcc tcc ggc cag gcc acc atc gtg cag ccc tgc acg 96His Thr Asp Val
Ala Ser Gly Gln Ala Thr Ile Val Gln Pro Cys Thr 20 25 30ctc ggc gac
ctc ggt gac cgc tcc ttc atg gag acc tac ggc gtc gtc 144Leu Gly Asp
Leu Gly Asp Arg Ser Phe Met Glu Thr Tyr Gly Val Val 35 40 45gcc ccg
ctg tac acg ggc gcc atg gcc aag ggc att gcc tcg gcg gac 192Ala Pro
Leu Tyr Thr Gly Ala Met Ala Lys Gly Ile Ala Ser Ala Asp 50 55 60ctc
gtc atc gcc gcc ggc aag cgc aag atc ctc ggc tcc ttt ggc gcc 240Leu
Val Ile Ala Ala Gly Lys Arg Lys Ile Leu Gly Ser Phe Gly Ala65 70 75
80ggc ggc ctc ccc atg cac cac gtg cgc gcc gcc ctc gag aag atc cag
288Gly Gly Leu Pro Met His His Val Arg Ala Ala Leu Glu Lys Ile Gln
85 90 95gcc gcc ctg cct cag ggc ccc tac gcc gtc aac ctc atc cac tcg
cct 336Ala Ala Leu Pro Gln Gly Pro Tyr Ala Val Asn Leu Ile His Ser
Pro 100 105 110ttt gac agc aac ctc gag aag ggc aac gtc gat ctc ttc
ctc gag aag 384Phe Asp Ser Asn Leu Glu Lys Gly Asn Val Asp Leu Phe
Leu Glu Lys 115 120 125ggc gtc act gtg gtg gag gcc tcg gca ttc atg
acc ctc acc ccg cag 432Gly Val Thr Val Val Glu Ala Ser Ala Phe Met
Thr Leu Thr Pro Gln 130 135 140gtc gtg cgc tac cgc gcc gcc ggc ctc
tcg cgc aac gcc gac ggt tcg 480Val Val Arg Tyr Arg Ala Ala Gly Leu
Ser Arg Asn Ala Asp Gly Ser145 150 155 160gtc aac atc cgc aac cgc
atc atc ggc aag gtc tcg cgc acc gag ctc 528Val Asn Ile Arg Asn Arg
Ile Ile Gly Lys Val Ser Arg Thr Glu Leu 165 170 175gcc gag atg ttc
atc cgc ccg gcc ccg gag cac ctc ctc gag aag ctc 576Ala Glu Met Phe
Ile Arg Pro Ala Pro Glu
His Leu Leu Glu Lys Leu 180 185 190atc gcc tcg ggc gag atc acc cag
gag cag gcc gag ctc gcg cgc cgc 624Ile Ala Ser Gly Glu Ile Thr Gln
Glu Gln Ala Glu Leu Ala Arg Arg 195 200 205gtt ccc gtc gcc gac gat
atc gct gtc gag gct gac tcg ggc ggc cac 672Val Pro Val Ala Asp Asp
Ile Ala Val Glu Ala Asp Ser Gly Gly His 210 215 220acc gac aac cgc
ccc atc cac gtc atc ctc ccg ctc atc atc aac ctc 720Thr Asp Asn Arg
Pro Ile His Val Ile Leu Pro Leu Ile Ile Asn Leu225 230 235 240cgc
aac cgc ctg cac cgc gag tgc ggc tac ccc gcg cac ctc cgc gtc 768Arg
Asn Arg Leu His Arg Glu Cys Gly Tyr Pro Ala His Leu Arg Val 245 250
255cgc gtt ggc gcc ggc ggt ggc gtc ggc tgc ccg cag gcc gcc gcc gcc
816Arg Val Gly Ala Gly Gly Gly Val Gly Cys Pro Gln Ala Ala Ala Ala
260 265 270gcg ctc acc atg ggc gcc gcc ttc atc gtc acc ggc act gtc
aac cag 864Ala Leu Thr Met Gly Ala Ala Phe Ile Val Thr Gly Thr Val
Asn Gln 275 280 285gtc gcc aag cag tcc ggc acc tgc gac aac gtg cgc
aag cag ctc tcg 912Val Ala Lys Gln Ser Gly Thr Cys Asp Asn Val Arg
Lys Gln Leu Ser 290 295 300cag gcc acc tac tcg gat atc tgc atg gcc
ccg gcc gcc gac atg ttc 960Gln Ala Thr Tyr Ser Asp Ile Cys Met Ala
Pro Ala Ala Asp Met Phe305 310 315 320gag gag ggc gtc aag ctc cag
gtc ctc aag aag gga acc atg ttc ccc 1008Glu Glu Gly Val Lys Leu Gln
Val Leu Lys Lys Gly Thr Met Phe Pro 325 330 335tcg cgc gcc aac aag
ctc tac gag ctc ttt tgc aag tac gac tcc ttc 1056Ser Arg Ala Asn Lys
Leu Tyr Glu Leu Phe Cys Lys Tyr Asp Ser Phe 340 345 350gac tcc atg
cct cct gcc gag ctc gag cgc atc gag aag cgt atc ttc 1104Asp Ser Met
Pro Pro Ala Glu Leu Glu Arg Ile Glu Lys Arg Ile Phe 355 360 365aag
cgc gca ctc cag gag gtc tgg gag gag acc aag gac ttt tac att 1152Lys
Arg Ala Leu Gln Glu Val Trp Glu Glu Thr Lys Asp Phe Tyr Ile 370 375
380aac ggt ctc aag aac ccg gag aag atc cag cgc gcc gag cac gac ccc
1200Asn Gly Leu Lys Asn Pro Glu Lys Ile Gln Arg Ala Glu His Asp
Pro385 390 395 400aag ctc aag atg tcg ctc tgc ttc cgc tgg tac ctt
ggt ctt gcc agc 1248Lys Leu Lys Met Ser Leu Cys Phe Arg Trp Tyr Leu
Gly Leu Ala Ser 405 410 415cgc tgg gcc aac atg ggc gcc ccg gac cgc
gtc atg gac tac cag gtc 1296Arg Trp Ala Asn Met Gly Ala Pro Asp Arg
Val Met Asp Tyr Gln Val 420 425 430tgg tgt ggc ccg gcc att ggc gcc
ttc aac gac ttc atc aag ggc acc 1344Trp Cys Gly Pro Ala Ile Gly Ala
Phe Asn Asp Phe Ile Lys Gly Thr 435 440 445tac ctc gac ccc gct gtc
tcc aac gag tac ccc tgt gtc gtc cag atc 1392Tyr Leu Asp Pro Ala Val
Ser Asn Glu Tyr Pro Cys Val Val Gln Ile 450 455 460aac ctg caa atc
ctc cgt ggt gcc tgc tac ctg cgc cgt ctc aac gcc 1440Asn Leu Gln Ile
Leu Arg Gly Ala Cys Tyr Leu Arg Arg Leu Asn Ala465 470 475 480ctg
cgc aac gac ccg cgc att gac ctc gag acc gag gat gct gcc ttt 1488Leu
Arg Asn Asp Pro Arg Ile Asp Leu Glu Thr Glu Asp Ala Ala Phe 485 490
495gtc tac gag ccc acc aac gcg ctc 1512Val Tyr Glu Pro Thr Asn Ala
Leu 50032504PRTSchizochytrium sp. 32Ala Pro Leu Tyr Leu Ser Gln Asp
Pro Thr Ser Gly Gln Leu Lys Lys1 5 10 15His Thr Asp Val Ala Ser Gly
Gln Ala Thr Ile Val Gln Pro Cys Thr 20 25 30Leu Gly Asp Leu Gly Asp
Arg Ser Phe Met Glu Thr Tyr Gly Val Val 35 40 45Ala Pro Leu Tyr Thr
Gly Ala Met Ala Lys Gly Ile Ala Ser Ala Asp 50 55 60Leu Val Ile Ala
Ala Gly Lys Arg Lys Ile Leu Gly Ser Phe Gly Ala65 70 75 80Gly Gly
Leu Pro Met His His Val Arg Ala Ala Leu Glu Lys Ile Gln 85 90 95Ala
Ala Leu Pro Gln Gly Pro Tyr Ala Val Asn Leu Ile His Ser Pro 100 105
110Phe Asp Ser Asn Leu Glu Lys Gly Asn Val Asp Leu Phe Leu Glu Lys
115 120 125Gly Val Thr Val Val Glu Ala Ser Ala Phe Met Thr Leu Thr
Pro Gln 130 135 140Val Val Arg Tyr Arg Ala Ala Gly Leu Ser Arg Asn
Ala Asp Gly Ser145 150 155 160Val Asn Ile Arg Asn Arg Ile Ile Gly
Lys Val Ser Arg Thr Glu Leu 165 170 175Ala Glu Met Phe Ile Arg Pro
Ala Pro Glu His Leu Leu Glu Lys Leu 180 185 190Ile Ala Ser Gly Glu
Ile Thr Gln Glu Gln Ala Glu Leu Ala Arg Arg 195 200 205Val Pro Val
Ala Asp Asp Ile Ala Val Glu Ala Asp Ser Gly Gly His 210 215 220Thr
Asp Asn Arg Pro Ile His Val Ile Leu Pro Leu Ile Ile Asn Leu225 230
235 240Arg Asn Arg Leu His Arg Glu Cys Gly Tyr Pro Ala His Leu Arg
Val 245 250 255Arg Val Gly Ala Gly Gly Gly Val Gly Cys Pro Gln Ala
Ala Ala Ala 260 265 270Ala Leu Thr Met Gly Ala Ala Phe Ile Val Thr
Gly Thr Val Asn Gln 275 280 285Val Ala Lys Gln Ser Gly Thr Cys Asp
Asn Val Arg Lys Gln Leu Ser 290 295 300Gln Ala Thr Tyr Ser Asp Ile
Cys Met Ala Pro Ala Ala Asp Met Phe305 310 315 320Glu Glu Gly Val
Lys Leu Gln Val Leu Lys Lys Gly Thr Met Phe Pro 325 330 335Ser Arg
Ala Asn Lys Leu Tyr Glu Leu Phe Cys Lys Tyr Asp Ser Phe 340 345
350Asp Ser Met Pro Pro Ala Glu Leu Glu Arg Ile Glu Lys Arg Ile Phe
355 360 365Lys Arg Ala Leu Gln Glu Val Trp Glu Glu Thr Lys Asp Phe
Tyr Ile 370 375 380Asn Gly Leu Lys Asn Pro Glu Lys Ile Gln Arg Ala
Glu His Asp Pro385 390 395 400Lys Leu Lys Met Ser Leu Cys Phe Arg
Trp Tyr Leu Gly Leu Ala Ser 405 410 415Arg Trp Ala Asn Met Gly Ala
Pro Asp Arg Val Met Asp Tyr Gln Val 420 425 430Trp Cys Gly Pro Ala
Ile Gly Ala Phe Asn Asp Phe Ile Lys Gly Thr 435 440 445Tyr Leu Asp
Pro Ala Val Ser Asn Glu Tyr Pro Cys Val Val Gln Ile 450 455 460Asn
Leu Gln Ile Leu Arg Gly Ala Cys Tyr Leu Arg Arg Leu Asn Ala465 470
475 480Leu Arg Asn Asp Pro Arg Ile Asp Leu Glu Thr Glu Asp Ala Ala
Phe 485 490 495Val Tyr Glu Pro Thr Asn Ala Leu 500339PRTArtificial
sequencemotif 33Trp Xaa Xaa Lys Glu Xaa Xaa Xaa Lys1
5346PRTArtificial sequencemotif 34Phe Asn Xaa Ser His Ser1
5355PRTArtificial sequencemotif 35Xaa Gly Xaa Asp Xaa1
5364244DNASchizochytrium sp. 36tttctctctc tcgagctgtt gctgctgctg
ctgctgctgc tgcttccttg ctggttctca 60cgtccgttcg atcaagcgct cgctcgctcg
accgatcggt gcgtgcgtgc gtgcgtgagt 120cttgttgcca ggcagccgca
ggctgtctgt ctgtttgtgt agttttaccc tcggggttcg 180gggtctgcct
gcctcccgct cccgcccgcc gccgcccgta tccaccccgc tcgcctccgc
240ccatcgggcc tcgcctcctc gcgccgcacg catcgcgcgc atcgcatgca
tcatgctgcc 300acgcacgggg ggacgcgcgc cccgcgtccc ccgccgccgc
cgtcgtcgtc tggcgatgcc 360gtcgccgccc tccttccttc cctcgcctcc
tcttcctccc gagcccccct gtcttccttc 420gcccccgcag cggcgcgcag
gaagcgagga gagcggggag gagagaagaa aagaaaagaa 480aagaaaagaa
aataacagcg ccgtctcgcg cagacgcgcg cggccgcgtg cgaggcggcg
540tgatggggct tctcgtggcg cggctgcggc ctggcccggc ctcgcctttg
aggtgcaggc 600tttgggagag aagagtggga cgcggagaag ataagatggt
gccatggcgc aggacggaga 660ggttgctgaa acttcttcga gcggcacagg
cgatggcgag agaccgacag ctgccggcgc 720ggaggggatg gatacctccc
gaggctggca tggacgagct ggccgcgcgg atctggctgg 780ccgcgcggcg
gtgggtccgg aggcgcgagg ttggttttct tcatacctga taccatacgg
840tattcattct tcctctccag gaaggaagca agtcacatag agtatcacta
gcctaatgat 900ggactctatg ttttagggca cgtcggagca gaaggcgcga
gcgattcgaa tgcgagcgat 960agatacagca cagagacctt gccggcgacg
cggatgcagg cgagcacgca cgcaccgcac 1020gcacggcagc ggtgcacgcg
ctcctcggca gatgcacggt tctgcgccgc gcctttacat 1080tttttgattt
taggtggtgt gcctgccact ttgaacatca tccacaagtc aacgcagcat
1140caagaggcaa gcaagtacat acatccattc gaattcaagt tcaagagacg
cagcaacagc 1200cgccgctccg ctcaagctgc agctagctgg ctgacagggc
tcgctggctg tagtggaaaa 1260ttccattcac ttttctgcat ccgcggccag
caggcccgta cgcacgttct ctcgtttgtt 1320tgttcgttcg tgcgtgcgtg
cgtgcgtccc agctgcctgt ctaatctgcc gcgcgatcca 1380acgaccctcg
gtcgtcgccg caagcgaaac ccgacgccga cctggccaat gccgcaagaa
1440tgctaagcgc gcagcaatgc tgagagtaat cttcagccca ccaagtcatt
atcgctgccc 1500aagtctccat cgcagccaca ttcaggcttt ctctctctct
ccctccctct ctttctgccg 1560ggagagaagg aaagacccgc cgccgccgcc
tctgcgcctg tgacgggctg tccgttgtaa 1620gccctcttag acagttccta
ggtgccgggc gccgccgcgc ctccgtcgca ggcacacgta 1680ggcggccacg
ggttcccccc gcaccttcca caccttcttc ccccgcagcc ggaccgcgcg
1740ccgtctgctt acgcacttcg cgcggccgcc gcccgcgaac ccgagcgcgt
gctgtgggcg 1800ccgtcttccg gccgcgtcgg aggtcgtccc cgcgccgcgc
tactccgggt cctgtgcggt 1860acgtacttaa tattaacagt gggacctcgc
acaggacctg acggcagcac agacgtcgcc 1920gcctcgcatc gctggggacg
caggcgaggc atcccggcgc ggccccgcac cggggaggct 1980gcggggcggc
ctcttccggc cggcggccgc atcaggcgga tgacgcaaga gccctcgcag
2040tcgctcgctc gcgggagcgc agcgcggcgc cagcgtggcc aagctcccgc
cccttctggc 2100tggctgcatg cctgcctgcc tgcctgcctg cgtgcgtgcg
tgcgtgcgtg ccttcgtgcg 2160tgcctgcctt cgtgcgtgcg tgcgtgagtg
cggcggaaga gggatcatgc gaggatcaat 2220cacccgccgc acctcgactt
ttgaagaagc cgcgatgcga tgcgatgcga tgcgatgcga 2280cgcgataccg
tgcgaggcta cgaagcgagt ctggccggcc gtcatacaac gcacgttttc
2340gagaaggagg gctggcggag gcgtgcatgc cggcgaccat tgcgaacgcg
gcgtctcgtg 2400gctggcgaag gtgcctggag gatctaacga tcgctgctat
gatgctatag ctgtgctgat 2460ccccggtcca ttccaccacg tctgtgcctg
ccgcctgacc tgcgcttggc tttccttcaa 2520gttctcctcc gccgggcctt
caggaccgag acgagacctg cagctgcagc tagactcgcg 2580ctcgctcgcg
gaggattcgc cggccgccgg gccggacggg actcgcgagg tcacacggcc
2640gccggcgatc gcgatggctg tgctgacgta ctcgtgcgtg gcagccgtac
gtcagcgacg 2700ccgcctccgt attgtggatt cgttagttgg ttgttggttg
atttgttgat taattttttt 2760gttcgtaggc ttggttatag ctaatagttt
agtttatact ggtgctcttc ggtgctgatt 2820tagctcgact tgggtccaca
ccactgcccc tctactgtga atggatcaat ggacgcacga 2880cgggccgacg
aaagtgcgcg agtgaggtaa cctaagcaac ggcggtcttc agaggggacg
2940cacgccctcc gtcgcagtca gtccagacag gcagaaaagc gtcttaggga
ccacgcacgc 3000acgcacgcac gcacgcacgc ccgcacgcac gctccctccc
tcgcgtgcct atttttttag 3060gcttccttcc gcacgggcct acctctcgct
ccctcgcctc gccgcaccag gcggcagcag 3120cgatacctgc cggtgccgcc
tccgtcacgc gctcagccgc agctcagccc agccgcgagc 3180tagggtttgt
tcgtcctgaa ttgtttgatt tgatttgatt tgatttgatc cgatccgatc
3240cgatctgatc tgatttgctt tgctttgctt tgtctccctc ccggcgcgga
ccaagcgtcc 3300gtctgcgcgc cgcagcttcc cttcttctcc cagccctcct
tctgctcccg cctctcgcgc 3360aagcacgcag cttcgccgcc gcatccggtc
ggtcggtcgg tcgatcgacc cgcctgccgc 3420tgctgctgtg gccgggcttt
tctccatcgg cgactctttc ttctccatac gtcctactac 3480gtacatacat
actgccggct tcctcctctt ccagcgcggc gacggcggca ggctgcgacg
3540tcgtcgccgc cgcgggcgcc gcgcgcgccg ccgccgccgc ccgcgtcgca
gggcctcgtc 3600gccgccgccg ctccgctccg ctccgaggcc gcgagagggc
cgcggcggcg cgatggatgg 3660atggatggat ggatggatgg atggattttg
ttgatcgatg gcggcgcatg ggcggagatg 3720agcgaggacg agcgcgcgag
cgcggcagcc ggattcgcag ggcctcgctc gcctcgcgcc 3780cgctgccgcg
cccgccttgc gagcctgcgc cgcgagcgag cgagcgagcg agcggggctt
3840tctttgtctc gcgcgccgct tggcctcgtg tgtcttgtgc ttgcgtagcg
ggcgccgcgg 3900tggaagatgg ctcattcaat cgacccattc acgcacgcac
tccggcgcgc agagaaggcc 3960gaggaggagc agcaagcaaa ccaaaagctc
tcgcgctcgc ggtctcgggc tcgagcggtc 4020tcggagagag agtcttgcgg
cgaccaccgg cagcagcagc agcagcagca gcgctgtcga 4080gcacgagcac
gagcacgagc acgagcacga gcattcgagc aagaggacag acacggttgt
4140cagcgcctag ctcgctcgat acagaaagag gcgggttggg cgtaaaaaaa
aaggagcacg 4200caagccgcca gccagccagc tagctagcca gcctgcctgc caaa
4244373886DNASchizochytrium sp.misc_feature(1)..(3886)n = a, c, g,
or t 37gatcttgatt gccaagctct ggattgtcga ttccgatgaa tcgagctctt
tgttgtcgag 60ctctggcttg ccgagctttc agaaatagac aaaattgccg agttcctgat
tgcggggctc 120tcgattgcca aggtctggtg gattctcgaa ctctcgattg
tcaaaatctt ggtcgtctcg 180tcggattctt tcctgatttg ttttgtcaag
accttgagat tgtgcaaaac cttgatcgtt 240gacaaaccct tgatcgacag
cagcctttca tcacgctcag ctcttgtcat tgattatatt 300ccccctgaca
gccaacacct tgatgcaggg tctcaacctt gatttttgga ggccatcatc
360agcatcacgc cccggcactc accctcaaca ttcgacagcc aacgcttttt
tttcttcgac 420taggatctga gaataaaagc aggtcaccac gaccgtaggc
caacgcgaca accatggaaa 480taaagtgaca acgaacgact tgcaagttta
aatgtaaaga gcagcaattg cccgcccaca 540gacaaatgaa agcaggcgcc
gagtcttatt tgaggaggtg ggcctgtggc aatgggcgaa 600agaaaatcaa
ggacaaggag agcaggttac gtaccggtat actggtatac gtacatggat
660ggttcttggc aagttgacgg gatgtgtgcg agtgaccgtg gtagttaacg
aaagagccgc 720aagggcaagg aaagcaagag aatgcagact tttccacagg
atggatgggt ccgcagcttg 780ccgcatgatg aaacgctgta tttcacctgg
cacgtggtgg cgcacgcgcc cacatatgat 840cgcggcggcg ggtgtattat
acattttccc cctcaggtct actgccatcc ctccatgcgt 900cgctcgtgcg
aacgacgcaa gcctttcgca tcgtgcagcc tctttctggt aaggcaagag
960ctaaacccaa acctaaacga aagaacattt ttacctctct ctctctccca
ttggtcgcgt 1020gcgctccgcc gctcgctcct cctcctgcca gtgtcgcgcc
ctaacttccc ccctccctcc 1080ctccctccct ccctccctct ctcctgccac
cgcccctctc tccgcgctgc gtgcggtgct 1140gccctggacc aatggcatgc
tgctgcacgc tcggcggatg acgcaagccg cttcgcaatt 1200tccggatcag
atctcggcgg ggcgtgcgcc gcggggtcac tgcggacctg ccgcggcccc
1260tgcttctttc acatccatca tgtcctccaa acctccgcct cctccacgca
cgtacgcacg 1320cccgctcgca cgcgcgcact gccgctgcga aagcaagcgc
ccgcccgccg cccggcgacg 1380ggaaggcggc cgcggtctcc ctccgcggtt
gcctcgctcc cgcgcggggc tgggcgggca 1440gcagaaggcg ggtggcggcg
gcggcttccg tcttcgtcag cggcctacgt cggcggcggc 1500gcgcgagact
acgcatgccc ttgcgtcatg cgctcgcagg tagccgccgc gggcctagcg
1560tttccgctgg cgccgcgcct aagcccccgg cgcgcacggt attgccgcga
taccgtacgg 1620ccaagaccgc cgcagacgtc ggccctctcg cggccagcca
gccagcagcg cagcggagga 1680agagcgcgca ggcgcggcgg gagggcggcc
gcggagcagc gcagagcggg gcggagcagc 1740gcggagcaga acgggcagac
tcggagcggg cagggcgggc agagctttgg ggtttaagga 1800ccgggttacc
ggcgaagtga gcggctgcgg ggagcggctg tgggaggggt gagtacgcaa
1860gcacgatgcg agcgagagag agacgctgcc gcgaatcaag aaggtaggcg
cgctgcgagg 1920cgcggcggcg gagcggagcg agggagaggg agagggagag
agagggaggg agacgtcgcc 1980gcggcggggc ctggcctggc ctggtttggc
ttggtcagcg cggccttgtc cgagcgtgca 2040gctggagttg ggtggattca
tttggatttt cttttgtttt tgtttttctc tctttcccgg 2100aaagtgttgg
ccggncggtg ttctttgttt tgatttcttc aaaagttttg gtggttggtt
2160ctctctcttg gctctctgtc aggcggtccg gtccacgccc cggcctctcc
tctcctctcc 2220tctcctctcc tctccgtgcg tatacgtacg tacgtttgta
tacgtacata catcccgccc 2280gccgtgccgg cgagggtttg ctcagcctgg
agcaatgcga tgcgatgcga tgcgatgcga 2340cgcgacgcga cgcgagtcac
tggttcgcgc tgtggctgtg gcttgcttgc ttacttgctt 2400tcgagctctc
ccgctttctt ctttccttct cacgccacca ccaacgaaag aagatcggcc
2460ccggcacgcc gctgagaagg gctggcggcg atgacggcac gcgcgcccgc
tgccacgttg 2520gcgctcgctg ctgctgctgc tgctgctgct gctgctgctg
ctgctgctgc tgctgcttct 2580gcgcgcaggc tttgccacga ggccggcgtg
ctggccgctg ccgcttccag tccgcgtgga 2640gagatcgaat gagagataaa
ctggatggat tcatcgaggg atgaatgaac gatggttgga 2700tgcctttttc
ctttttcagg tccacagcgg gaagcaggag cgcgtgaatc tgccgccatc
2760cgcatacgtc tgcatcgcat cgcatcgcat gcacgcatcg ctcgccggga
gccacagacg 2820ggcgacaggg cggccagcca gccaggcagc cagccaggca
ggcaccagag ggccagagag 2880cgcgcctcac gcacgcgccg cagtgcgcgc
atcgctcgca gtgcagacct tgattccccg 2940cgcggatctc cgcgagcccg
aaacgaagag cgccgtacgg gcccatccta gcgtcgcctc 3000gcaccgcatc
gcatcgcatc gcgttcccta gagagtagta ctcgacgaag gcaccatttc
3060cgcgctcctc ttcggcgcga tcgaggcccc cggcgccgcg acgatcgcgg
cggccgcggc 3120gctggcggcg gccctggcgc tcgcgctggc ggccgccgcg
ggcgtctggc cctggcgcgc 3180gcgggcgccg caggaggagc ggcagcggct
gctcgccgcc agagaagagc gcgccgggcc 3240cggggaggga cggggaggag
aaggagaagg cgcgcaaggc ggccccgaaa gagaagaccc 3300tggacttgaa
cgcgaagaag aagaagaagg agaagaagtt gaagaagaag aagaagaagg
3360agaggaagtt gaagaagacg aggagcaggc gcgttccaag gcgcgttctc
ttccggaggc 3420gcgttccagc tgcggcggcg gggcgggctg cggggcgggc
gcgggcgcgg gtgcgggcag 3480aggggacgcg cgcgcggagg cggagggggc
cgagcgggag cccctgctgc tgcggggcgc 3540ccgggccgca ggtgtggcgc
gcgcgacgac ggaggcgacg acgccagcgg ccgcgacgac 3600aaggccggcg
gcgtcggcgg gcggaaggcc ccgcgcggag caggggcggg agcaggacaa
3660ggcgcaggag caggagcagg gccgggagcg ggagcgggag cgggcggcgg
agcccgaggc 3720agaacccaat cgagatccag agcgagcaga ggccggccgc
gagcccgagc ccgcgccgca 3780gatcactagt accgctgcgg aatcacagca
gcagcagcag cagcagcagc agcagcagca 3840gcagcagcag ccacgagagg
gagataaaga aaaagcggca gagacg 3886388436DNAThraustochytrium
sp.CDS(1)..(8433) 38atg aag gac atg gaa gat aga cgg gtc gct att gtg
ggc atg tca gct 48Met Lys Asp Met Glu Asp Arg Arg Val Ala Ile Val
Gly Met Ser Ala1 5 10 15cac ttg cct tgt ggg aca gat gtg aag gaa tca
tgg cag gct att cgc 96His Leu Pro Cys Gly Thr Asp Val Lys Glu Ser
Trp Gln Ala Ile Arg 20 25 30gat
gga atc gac tgt cta agt gac cta ccc gcg gat cgt ctc gac gtt 144Asp
Gly Ile Asp Cys Leu Ser Asp Leu Pro Ala Asp Arg Leu Asp Val 35 40
45aca gct tac tac aat ccc aac aaa gcc acg aaa gac aag atc tac tgc
192Thr Ala Tyr Tyr Asn Pro Asn Lys Ala Thr Lys Asp Lys Ile Tyr Cys
50 55 60aaa cgg ggt ggc ttc atc ccg aac tat gac ttc gac ccc cgc gaa
ttt 240Lys Arg Gly Gly Phe Ile Pro Asn Tyr Asp Phe Asp Pro Arg Glu
Phe65 70 75 80ggg ctc aac atg ttt caa atg gaa gac tct gat gcg aat
cag aca ctt 288Gly Leu Asn Met Phe Gln Met Glu Asp Ser Asp Ala Asn
Gln Thr Leu 85 90 95acc ttg ctc aaa gtc aaa caa gct ctc gaa gat gca
agc ata gag cct 336Thr Leu Leu Lys Val Lys Gln Ala Leu Glu Asp Ala
Ser Ile Glu Pro 100 105 110ttc acc aag gag aag aag aac att gga tgt
gtt tta ggt att ggt ggg 384Phe Thr Lys Glu Lys Lys Asn Ile Gly Cys
Val Leu Gly Ile Gly Gly 115 120 125ggc caa aag gcg agt cat gag ttc
tac tct cgt ctc aac tac gtt gtc 432Gly Gln Lys Ala Ser His Glu Phe
Tyr Ser Arg Leu Asn Tyr Val Val 130 135 140gtt gaa aag gta ctt cgg
aaa atg ggt tta cca gat gct gat gtt gaa 480Val Glu Lys Val Leu Arg
Lys Met Gly Leu Pro Asp Ala Asp Val Glu145 150 155 160gaa gct gtg
gag aaa tac aag gca aat ttt ccc gag tgg cgc cta gac 528Glu Ala Val
Glu Lys Tyr Lys Ala Asn Phe Pro Glu Trp Arg Leu Asp 165 170 175tct
ttc cct ggg ttt ctt ggg aat gta acg gct ggt cgg tgc agt aac 576Ser
Phe Pro Gly Phe Leu Gly Asn Val Thr Ala Gly Arg Cys Ser Asn 180 185
190acc ttc aac atg gaa ggt atg aac tgc gtt gtg gat gct gca tgt gcc
624Thr Phe Asn Met Glu Gly Met Asn Cys Val Val Asp Ala Ala Cys Ala
195 200 205agt tct cta att gca atc aag gtt gca gtt gaa gag cta ctc
ttt ggt 672Ser Ser Leu Ile Ala Ile Lys Val Ala Val Glu Glu Leu Leu
Phe Gly 210 215 220gac tgt gac acc atg att gca ggt gcc acc tgc acg
gac aat tca ctt 720Asp Cys Asp Thr Met Ile Ala Gly Ala Thr Cys Thr
Asp Asn Ser Leu225 230 235 240ggc atg tac atg gcc ttc tct aaa acg
cca gtt ttt tct act gac cca 768Gly Met Tyr Met Ala Phe Ser Lys Thr
Pro Val Phe Ser Thr Asp Pro 245 250 255agt gtc cgc gcg tat gat gag
aaa aca aaa ggg atg cta att gga gaa 816Ser Val Arg Ala Tyr Asp Glu
Lys Thr Lys Gly Met Leu Ile Gly Glu 260 265 270ggt tca gca atg ttc
gtt ctt aaa cgc tat gcg gat gcc gta cgt gat 864Gly Ser Ala Met Phe
Val Leu Lys Arg Tyr Ala Asp Ala Val Arg Asp 275 280 285ggc gac aca
att cac gcg gtt ctg cgt tct tgc tct tcg tct agt gat 912Gly Asp Thr
Ile His Ala Val Leu Arg Ser Cys Ser Ser Ser Ser Asp 290 295 300gga
aaa gcg gca gga att tat act cct act ata tct gga caa gaa gaa 960Gly
Lys Ala Ala Gly Ile Tyr Thr Pro Thr Ile Ser Gly Gln Glu Glu305 310
315 320gct ttg cgt cga gcg tat gcc cgt gcg ggg gta tgt cca tct acg
atc 1008Ala Leu Arg Arg Ala Tyr Ala Arg Ala Gly Val Cys Pro Ser Thr
Ile 325 330 335ggg ctt gtt gag ggt cac ggg aca ggg acc cct gtt gga
gat cgc att 1056Gly Leu Val Glu Gly His Gly Thr Gly Thr Pro Val Gly
Asp Arg Ile 340 345 350gag tta aca gct ctg cgg aac ttg ttt gac aaa
gct ttt ggt agc aag 1104Glu Leu Thr Ala Leu Arg Asn Leu Phe Asp Lys
Ala Phe Gly Ser Lys 355 360 365aag gaa caa ata gca gtt ggc agc ata
aag tct cag ata ggt cac ctg 1152Lys Glu Gln Ile Ala Val Gly Ser Ile
Lys Ser Gln Ile Gly His Leu 370 375 380aaa tct gtt gcc ggc ttt gcc
ggc ttg gtc aaa gct gtg ctt gcg ctt 1200Lys Ser Val Ala Gly Phe Ala
Gly Leu Val Lys Ala Val Leu Ala Leu385 390 395 400aaa cac aaa acg
ctc cca ggt tcg att aat gtc gac cag cca cct ttg 1248Lys His Lys Thr
Leu Pro Gly Ser Ile Asn Val Asp Gln Pro Pro Leu 405 410 415ttg tat
gac ggt act caa att caa gac tct tct tta tat atc aac aag 1296Leu Tyr
Asp Gly Thr Gln Ile Gln Asp Ser Ser Leu Tyr Ile Asn Lys 420 425
430aca aat aga cca tgg ttt acg caa aac aag ctt ccg cgt cgg gct ggt
1344Thr Asn Arg Pro Trp Phe Thr Gln Asn Lys Leu Pro Arg Arg Ala Gly
435 440 445gtc tca agt ttt gga ttt gga ggt gca aac tac cac gcg gtt
ctg gaa 1392Val Ser Ser Phe Gly Phe Gly Gly Ala Asn Tyr His Ala Val
Leu Glu 450 455 460gaa ttc gag ccc gag cat gaa aaa cca tac cgc ctc
aat act gtt gga 1440Glu Phe Glu Pro Glu His Glu Lys Pro Tyr Arg Leu
Asn Thr Val Gly465 470 475 480cat cct gtc ctc ttg tac gct ccg tct
gtg gaa gcc ctc aaa gta ctt 1488His Pro Val Leu Leu Tyr Ala Pro Ser
Val Glu Ala Leu Lys Val Leu 485 490 495tgc aac gac cag ctt gcg gag
ctc aca att gca ttg gaa gag gca aaa 1536Cys Asn Asp Gln Leu Ala Glu
Leu Thr Ile Ala Leu Glu Glu Ala Lys 500 505 510aca cat aaa aat gtt
gac aaa gtt tgt ggc tac aag ttt att gac gaa 1584Thr His Lys Asn Val
Asp Lys Val Cys Gly Tyr Lys Phe Ile Asp Glu 515 520 525ttt cag ctc
caa gga agc tgt cct cca gaa aat ccg aga gta gga ttt 1632Phe Gln Leu
Gln Gly Ser Cys Pro Pro Glu Asn Pro Arg Val Gly Phe 530 535 540tta
gca aca ctg cct act tca aat atc att gtc gcg ctt aag gca att 1680Leu
Ala Thr Leu Pro Thr Ser Asn Ile Ile Val Ala Leu Lys Ala Ile545 550
555 560ctc gcg cag ctt gat gca aaa cca gat gcg aag aaa tgg gat ttg
cct 1728Leu Ala Gln Leu Asp Ala Lys Pro Asp Ala Lys Lys Trp Asp Leu
Pro 565 570 575cat aaa aag gct ttt ggg gct acc ttc gca tcg tct tca
gtg aaa ggc 1776His Lys Lys Ala Phe Gly Ala Thr Phe Ala Ser Ser Ser
Val Lys Gly 580 585 590tct gtt gct gcg ctc ttc gca gga cag ggt acc
cag tac tta aac atg 1824Ser Val Ala Ala Leu Phe Ala Gly Gln Gly Thr
Gln Tyr Leu Asn Met 595 600 605ttc tct gat gtg gca atg aac tgg cca
ccg ttc cgt gac agc att gtc 1872Phe Ser Asp Val Ala Met Asn Trp Pro
Pro Phe Arg Asp Ser Ile Val 610 615 620gca atg gaa gaa gct caa act
gag gta ttt gag ggc caa gtt gaa cca 1920Ala Met Glu Glu Ala Gln Thr
Glu Val Phe Glu Gly Gln Val Glu Pro625 630 635 640att agc aaa gtt
ctg ttt cca cga gag cgc tat gca tcc gaa agt gaa 1968Ile Ser Lys Val
Leu Phe Pro Arg Glu Arg Tyr Ala Ser Glu Ser Glu 645 650 655cag ggg
aat gaa ctt ctt tgc tta aca gag tac tct cag cca act acg 2016Gln Gly
Asn Glu Leu Leu Cys Leu Thr Glu Tyr Ser Gln Pro Thr Thr 660 665
670ata gca gcc gca gta ggg gcc ttc gat att ttc aaa gcg gct ggc ttt
2064Ile Ala Ala Ala Val Gly Ala Phe Asp Ile Phe Lys Ala Ala Gly Phe
675 680 685aag cca gac atg gtt gga ggg cat tca ctt ggc gaa ttt gct
gct ttg 2112Lys Pro Asp Met Val Gly Gly His Ser Leu Gly Glu Phe Ala
Ala Leu 690 695 700tac gcg gct ggg tcc att tcg cgt gac gac ctg tac
aag ctt gtg tgc 2160Tyr Ala Ala Gly Ser Ile Ser Arg Asp Asp Leu Tyr
Lys Leu Val Cys705 710 715 720aaa cgg gca aag gca atg gcg aac gct
agt gac gga gct atg gca gca 2208Lys Arg Ala Lys Ala Met Ala Asn Ala
Ser Asp Gly Ala Met Ala Ala 725 730 735gtg att ggc cca gat gca cgt
cta gtt acg cca caa aat agt gac gtt 2256Val Ile Gly Pro Asp Ala Arg
Leu Val Thr Pro Gln Asn Ser Asp Val 740 745 750tat gtc gca aac ttc
aac tcc gca act caa gta gtc atc agt ggc act 2304Tyr Val Ala Asn Phe
Asn Ser Ala Thr Gln Val Val Ile Ser Gly Thr 755 760 765gtt caa ggt
gtg aaa gaa gag tcg aaa ttg ctc att tca aag ggg ttc 2352Val Gln Gly
Val Lys Glu Glu Ser Lys Leu Leu Ile Ser Lys Gly Phe 770 775 780cgc
gta ctg cca ctt aaa tgc cag ggc gcc ttc cat tct cct ttg atg 2400Arg
Val Leu Pro Leu Lys Cys Gln Gly Ala Phe His Ser Pro Leu Met785 790
795 800ggg cct tct gag gat agt ttc aaa tca ctt gtg gag act tgt acc
atc 2448Gly Pro Ser Glu Asp Ser Phe Lys Ser Leu Val Glu Thr Cys Thr
Ile 805 810 815tcg ccg cca aaa aat gtg aaa ttc ttt tgc aat gtt agt
ggc aag gaa 2496Ser Pro Pro Lys Asn Val Lys Phe Phe Cys Asn Val Ser
Gly Lys Glu 820 825 830agc cca aac cca aaa cag acc ctc aag tca cac
atg acg tct agc gtt 2544Ser Pro Asn Pro Lys Gln Thr Leu Lys Ser His
Met Thr Ser Ser Val 835 840 845cag ttc gag gag cag att cgt aac atg
tac gat gcc gga gca cgt gtt 2592Gln Phe Glu Glu Gln Ile Arg Asn Met
Tyr Asp Ala Gly Ala Arg Val 850 855 860ttt ctg gag ttt gga ccc cgc
caa gtc ctt gca aag ctt atc gcg gaa 2640Phe Leu Glu Phe Gly Pro Arg
Gln Val Leu Ala Lys Leu Ile Ala Glu865 870 875 880atg ttt ccc tcg
tgt aca gct atc agc gtt aac ccc gcg agc agt ggt 2688Met Phe Pro Ser
Cys Thr Ala Ile Ser Val Asn Pro Ala Ser Ser Gly 885 890 895gac agt
gac gtg caa ctc cgc ctc gcc gcc gta aaa ttc gcg gtc tcg 2736Asp Ser
Asp Val Gln Leu Arg Leu Ala Ala Val Lys Phe Ala Val Ser 900 905
910ggt gca gcc ctt agc acc ttt gat cca tgg gag tat cgc aag cca caa
2784Gly Ala Ala Leu Ser Thr Phe Asp Pro Trp Glu Tyr Arg Lys Pro Gln
915 920 925gat ctt ctt att cga aaa cca cga aaa act gcc ctt gtt cta
tca gca 2832Asp Leu Leu Ile Arg Lys Pro Arg Lys Thr Ala Leu Val Leu
Ser Ala 930 935 940gca aca tat gtt tcc cca aag act ctt gca gaa cgt
aaa aag gct atg 2880Ala Thr Tyr Val Ser Pro Lys Thr Leu Ala Glu Arg
Lys Lys Ala Met945 950 955 960gaa gat atc aag cta gta tcc att aca
cca aga gat agt atg gta tca 2928Glu Asp Ile Lys Leu Val Ser Ile Thr
Pro Arg Asp Ser Met Val Ser 965 970 975att gga aaa atc gcg caa gaa
gta cgg aca gct aaa cag cct tta gaa 2976Ile Gly Lys Ile Ala Gln Glu
Val Arg Thr Ala Lys Gln Pro Leu Glu 980 985 990acc gaa att cga aga
ctc aac aaa gaa tta gaa cat ctc aag aga gag 3024Thr Glu Ile Arg Arg
Leu Asn Lys Glu Leu Glu His Leu Lys Arg Glu 995 1000 1005cta gca
gca gcc aaa gcg agt gtc aag tct gca tca aaa agc tct 3069Leu Ala Ala
Ala Lys Ala Ser Val Lys Ser Ala Ser Lys Ser Ser 1010 1015 1020aaa
gag cga tct gtc cta tca aag cac cgc gct ttg ctt caa aac 3114Lys Glu
Arg Ser Val Leu Ser Lys His Arg Ala Leu Leu Gln Asn 1025 1030
1035att ttg caa gac tac gat gat ctt cgt gtg gtg cca ttc gct gtt
3159Ile Leu Gln Asp Tyr Asp Asp Leu Arg Val Val Pro Phe Ala Val
1040 1045 1050cgt tct gtt gca gtg gac aac acc gcg ccg tat gct gac
caa gtt 3204Arg Ser Val Ala Val Asp Asn Thr Ala Pro Tyr Ala Asp Gln
Val 1055 1060 1065tcg acc cca gcg tca gag cgg tcg gct tca ccg ctt
ttc gag aaa 3249Ser Thr Pro Ala Ser Glu Arg Ser Ala Ser Pro Leu Phe
Glu Lys 1070 1075 1080cgc agt tcg gtt tcg tca gca cgc ctc gct gaa
gct gaa gcc gcg 3294Arg Ser Ser Val Ser Ser Ala Arg Leu Ala Glu Ala
Glu Ala Ala 1085 1090 1095gta ctg agc gtt ctc gca gac aag aca ggc
tac gac agc tca atg 3339Val Leu Ser Val Leu Ala Asp Lys Thr Gly Tyr
Asp Ser Ser Met 1100 1105 1110atc gag atg gac atg gac ctg gag agt
gag ctt ggc gtt gat agc 3384Ile Glu Met Asp Met Asp Leu Glu Ser Glu
Leu Gly Val Asp Ser 1115 1120 1125atc aaa cgc gtg gag atc atg agc
gag gtt caa acg ctg ctc agc 3429Ile Lys Arg Val Glu Ile Met Ser Glu
Val Gln Thr Leu Leu Ser 1130 1135 1140gtg gaa gtc tcc gac gtt gac
gct ctg tca aga acc aag act gtt 3474Val Glu Val Ser Asp Val Asp Ala
Leu Ser Arg Thr Lys Thr Val 1145 1150 1155ggc gac gtc atc gag gcg
atg aag ctg gaa ctc ggt gga ccc caa 3519Gly Asp Val Ile Glu Ala Met
Lys Leu Glu Leu Gly Gly Pro Gln 1160 1165 1170ggc cag act ttg acc
gcg gaa tcg atc cgt cag cca ccg gtg tcc 3564Gly Gln Thr Leu Thr Ala
Glu Ser Ile Arg Gln Pro Pro Val Ser 1175 1180 1185gag cct gct gta
ccg acc tca tcg tca agc agt att gct aat gtt 3609Glu Pro Ala Val Pro
Thr Ser Ser Ser Ser Ser Ile Ala Asn Val 1190 1195 1200tcg tca gca
cgc ctc gct gaa gct gaa gct gcg gta ctg agc gtt 3654Ser Ser Ala Arg
Leu Ala Glu Ala Glu Ala Ala Val Leu Ser Val 1205 1210 1215ctc gca
gac aag aca ggc tac gac agc tca atg atc gag atg gac 3699Leu Ala Asp
Lys Thr Gly Tyr Asp Ser Ser Met Ile Glu Met Asp 1220 1225 1230atg
gac ctg gag agc gag ctt ggc gtt gat agc atc aaa cgc gtg 3744Met Asp
Leu Glu Ser Glu Leu Gly Val Asp Ser Ile Lys Arg Val 1235 1240
1245gag atc atg agc gag gtt caa acg ctg ctc agc gtg gaa gtc tcc
3789Glu Ile Met Ser Glu Val Gln Thr Leu Leu Ser Val Glu Val Ser
1250 1255 1260gac gtt gac gct ctg tca aga act aag act gtt ggc gac
gtc atc 3834Asp Val Asp Ala Leu Ser Arg Thr Lys Thr Val Gly Asp Val
Ile 1265 1270 1275gag gcg atg aag ctg gaa ctc ggt gga ccc caa ggc
cag act ttg 3879Glu Ala Met Lys Leu Glu Leu Gly Gly Pro Gln Gly Gln
Thr Leu 1280 1285 1290acc gcg gaa tcg atc cgt cag cca ccg gtg tct
gag cct gct gta 3924Thr Ala Glu Ser Ile Arg Gln Pro Pro Val Ser Glu
Pro Ala Val 1295 1300 1305ccg acc tca tcg tca agc agt att gct aat
gtt tcg tca gca cgc 3969Pro Thr Ser Ser Ser Ser Ser Ile Ala Asn Val
Ser Ser Ala Arg 1310 1315 1320ctc gct gaa gct gaa gcg gcg gta ctg
agc gtt ctc gca gac aag 4014Leu Ala Glu Ala Glu Ala Ala Val Leu Ser
Val Leu Ala Asp Lys 1325 1330 1335aca ggc tac gac agc tca atg atc
gag atg gac atg gac ctg gag 4059Thr Gly Tyr Asp Ser Ser Met Ile Glu
Met Asp Met Asp Leu Glu 1340 1345 1350agc gag ctt ggc gtc gac agc
atc aaa cgc gtg gag atc atg agc 4104Ser Glu Leu Gly Val Asp Ser Ile
Lys Arg Val Glu Ile Met Ser 1355 1360 1365gag gtt caa acg ctg ctc
agc gtg gaa gtc tcc gac gtt gac gct 4149Glu Val Gln Thr Leu Leu Ser
Val Glu Val Ser Asp Val Asp Ala 1370 1375 1380ctg tca aga acc aag
act gtt ggc gac gtc atc gag gcg atg aag 4194Leu Ser Arg Thr Lys Thr
Val Gly Asp Val Ile Glu Ala Met Lys 1385 1390 1395ctg gaa ctc ggt
gga ccc caa ggc cag act ttg acc gcg gaa tcg 4239Leu Glu Leu Gly Gly
Pro Gln Gly Gln Thr Leu Thr Ala Glu Ser 1400 1405 1410atc cgt cag
cca ccg gtg tcc gag cct gct gta ccg acc tca tcg 4284Ile Arg Gln Pro
Pro Val Ser Glu Pro Ala Val Pro Thr Ser Ser 1415 1420 1425tca agc
agt att gct aat gtt ttg tca gca cgc ctc gct gaa gct 4329Ser Ser Ser
Ile Ala Asn Val Leu Ser Ala Arg Leu Ala Glu Ala 1430 1435 1440gaa
gcc gcg gta ctg agc gtt ctc gca gac aag aca ggc tac gac 4374Glu Ala
Ala Val Leu Ser Val Leu Ala Asp Lys Thr Gly Tyr Asp 1445 1450
1455agc tca atg atc gag atg gac atg gac ctg gag agc gag ctt ggc
4419Ser Ser Met Ile Glu Met Asp Met Asp Leu Glu Ser Glu Leu Gly
1460 1465 1470gtt gat agc atc aaa cgc gtg gag atc atg agc gag gtt
caa acg 4464Val Asp Ser Ile Lys Arg Val Glu Ile Met Ser Glu Val Gln
Thr 1475 1480 1485ttg ctc agc gtg gaa gtc tcc gac gtt gac gct ctg
tca aga acc 4509Leu Leu Ser Val Glu Val Ser Asp Val Asp Ala Leu Ser
Arg Thr 1490 1495 1500aag act gtt ggc gac gtc atc gag gcg atg aag
ctg gaa ctc ggt 4554Lys Thr Val Gly Asp Val Ile Glu Ala Met Lys Leu
Glu Leu Gly 1505 1510 1515gga ccc caa ggc cag act ttg acc gcg gaa
tcg atc cgt cag cca 4599Gly Pro Gln Gly Gln Thr Leu Thr Ala Glu Ser
Ile Arg Gln Pro 1520 1525 1530ccg gtg tct gag cct gct gta ccg acc
tca tcg tca agc agt att 4644Pro Val Ser Glu Pro Ala Val Pro Thr Ser
Ser Ser Ser Ser Ile 1535 1540 1545gct aat gtt tcg tca gca cgc ctc
gct gaa gct gaa gcc
gcg gta 4689Ala Asn Val Ser Ser Ala Arg Leu Ala Glu Ala Glu Ala Ala
Val 1550 1555 1560ctg agc gtt ctc gca gac aag aca ggc tac gac agc
tca atg atc 4734Leu Ser Val Leu Ala Asp Lys Thr Gly Tyr Asp Ser Ser
Met Ile 1565 1570 1575gag atg gac atg gac ctg gag agt gag ctt ggc
gtc gac agc atc 4779Glu Met Asp Met Asp Leu Glu Ser Glu Leu Gly Val
Asp Ser Ile 1580 1585 1590aaa cgc gtg gag atc atg agc gag gtt caa
acg ctg ctc agc gtg 4824Lys Arg Val Glu Ile Met Ser Glu Val Gln Thr
Leu Leu Ser Val 1595 1600 1605gaa gtc tcc gac gtt gac gct ctg tca
aga acc aag act gtt ggc 4869Glu Val Ser Asp Val Asp Ala Leu Ser Arg
Thr Lys Thr Val Gly 1610 1615 1620gac gtc atc gag gcg atg aag ctg
gaa ctc ggt gga ccc caa ggc 4914Asp Val Ile Glu Ala Met Lys Leu Glu
Leu Gly Gly Pro Gln Gly 1625 1630 1635cag act ttg acc tct gaa ccg
atc cat cag cca cca gtg tcc gag 4959Gln Thr Leu Thr Ser Glu Pro Ile
His Gln Pro Pro Val Ser Glu 1640 1645 1650cct gct gta ccg acc tca
tcg tca agc agt att gct aat gtt tct 5004Pro Ala Val Pro Thr Ser Ser
Ser Ser Ser Ile Ala Asn Val Ser 1655 1660 1665tca gca cgc ctc gct
gaa gct gaa gcc gcg gta ctg agc gtt ctc 5049Ser Ala Arg Leu Ala Glu
Ala Glu Ala Ala Val Leu Ser Val Leu 1670 1675 1680gca gac aag aca
ggc tac gac agc tca atg atc gag atg gac atg 5094Ala Asp Lys Thr Gly
Tyr Asp Ser Ser Met Ile Glu Met Asp Met 1685 1690 1695gac ctg gag
agc gag ctt ggc gtt gat agc atc aaa cgc gtg gaa 5139Asp Leu Glu Ser
Glu Leu Gly Val Asp Ser Ile Lys Arg Val Glu 1700 1705 1710atc atg
agc gag gtt caa acg ctg ctc agc gtg gaa gtc tcc gac 5184Ile Met Ser
Glu Val Gln Thr Leu Leu Ser Val Glu Val Ser Asp 1715 1720 1725gtt
gac gct ctg tca aga acc aag act gtt ggc gac gtc atc gag 5229Val Asp
Ala Leu Ser Arg Thr Lys Thr Val Gly Asp Val Ile Glu 1730 1735
1740gcg atg aag atg gaa ctc ggt gga ccc caa ggc cag act ttg acc
5274Ala Met Lys Met Glu Leu Gly Gly Pro Gln Gly Gln Thr Leu Thr
1745 1750 1755gcg gaa tcg atc cgt cag cca ccg gtg tct gag cct gct
gta ccg 5319Ala Glu Ser Ile Arg Gln Pro Pro Val Ser Glu Pro Ala Val
Pro 1760 1765 1770acc tca tcg tca agc agt att gct aat gtt tcg tca
gca cgc ctc 5364Thr Ser Ser Ser Ser Ser Ile Ala Asn Val Ser Ser Ala
Arg Leu 1775 1780 1785gct gaa gct gaa gcg gcg gta ctg agc gtt ctc
gca gac aag aca 5409Ala Glu Ala Glu Ala Ala Val Leu Ser Val Leu Ala
Asp Lys Thr 1790 1795 1800ggc tac gac agc tca atg atc gag atg gac
atg gac ctg gag agc 5454Gly Tyr Asp Ser Ser Met Ile Glu Met Asp Met
Asp Leu Glu Ser 1805 1810 1815gag ctt ggc gtt gat agc atc aaa cgc
gtg gag atc atg agc gag 5499Glu Leu Gly Val Asp Ser Ile Lys Arg Val
Glu Ile Met Ser Glu 1820 1825 1830gtt caa gcg ctg ctc agc gtg gaa
gtc tcc gac gtt gac gct ctg 5544Val Gln Ala Leu Leu Ser Val Glu Val
Ser Asp Val Asp Ala Leu 1835 1840 1845tca aga acc aag act gtt ggc
gac gtc atc gag gcg atg aag atg 5589Ser Arg Thr Lys Thr Val Gly Asp
Val Ile Glu Ala Met Lys Met 1850 1855 1860gaa ctc ggt gga ccc caa
ggc cag act ttg acc gca gaa tcg atc 5634Glu Leu Gly Gly Pro Gln Gly
Gln Thr Leu Thr Ala Glu Ser Ile 1865 1870 1875cgt gag cca ccg gtg
tct gag cct gct gta ccg acc tca tcg tca 5679Arg Glu Pro Pro Val Ser
Glu Pro Ala Val Pro Thr Ser Ser Ser 1880 1885 1890agt agt atc gct
aat gtt tct tca gct cgc ctc gct gaa gct gaa 5724Ser Ser Ile Ala Asn
Val Ser Ser Ala Arg Leu Ala Glu Ala Glu 1895 1900 1905gcc gcg gta
ctg agc gtt ctc gca gac aag aca ggc tac gac agc 5769Ala Ala Val Leu
Ser Val Leu Ala Asp Lys Thr Gly Tyr Asp Ser 1910 1915 1920tca atg
atc gag atg gac atg gac ctg gag agt gag ctt ggc gtc 5814Ser Met Ile
Glu Met Asp Met Asp Leu Glu Ser Glu Leu Gly Val 1925 1930 1935gac
agc atc aaa cgc gtg gag atc atg agc gag gtt caa acg ttg 5859Asp Ser
Ile Lys Arg Val Glu Ile Met Ser Glu Val Gln Thr Leu 1940 1945
1950ctc agc gtg gaa gtc tcc gac gtt gac gct ctg tca aga acc aag
5904Leu Ser Val Glu Val Ser Asp Val Asp Ala Leu Ser Arg Thr Lys
1955 1960 1965act gtt ggc gac gtc atc gag gcg atg aag ctg gaa ctt
ggg gaa 5949Thr Val Gly Asp Val Ile Glu Ala Met Lys Leu Glu Leu Gly
Glu 1970 1975 1980tca tca agt att gag act ctc aat tgt acc gag gtt
gag cac acg 5994Ser Ser Ser Ile Glu Thr Leu Asn Cys Thr Glu Val Glu
His Thr 1985 1990 1995agc tac aaa agt gtc aag gct tca ggg tgt gag
aat gta gat acc 6039Ser Tyr Lys Ser Val Lys Ala Ser Gly Cys Glu Asn
Val Asp Thr 2000 2005 2010cgt ttc gct aag gtt gta caa atc tcg ctt
cct agc aag ctg aaa 6084Arg Phe Ala Lys Val Val Gln Ile Ser Leu Pro
Ser Lys Leu Lys 2015 2020 2025tcc act gtg tcg cac gat cga cct gta
att gtt gta gat gat gga 6129Ser Thr Val Ser His Asp Arg Pro Val Ile
Val Val Asp Asp Gly 2030 2035 2040acg ccc tta acc acg gag ctt tgt
aaa att ctt ggg ggt aat att 6174Thr Pro Leu Thr Thr Glu Leu Cys Lys
Ile Leu Gly Gly Asn Ile 2045 2050 2055gtg gtt ctc tct tat caa ggg
aag ccc gct ggt cca cgg gga gtc 6219Val Val Leu Ser Tyr Gln Gly Lys
Pro Ala Gly Pro Arg Gly Val 2060 2065 2070gag gtg cca gat ctt tcc
gag gaa gcc cta att caa gct ctt gca 6264Glu Val Pro Asp Leu Ser Glu
Glu Ala Leu Ile Gln Ala Leu Ala 2075 2080 2085ttg att cgg tct aca
tat gga gtt cca att ggt ttt att tgt cag 6309Leu Ile Arg Ser Thr Tyr
Gly Val Pro Ile Gly Phe Ile Cys Gln 2090 2095 2100caa gtg tct aat
gtg agc acc aag gca cag ctt tgt tgg gca ctc 6354Gln Val Ser Asn Val
Ser Thr Lys Ala Gln Leu Cys Trp Ala Leu 2105 2110 2115ctc gca gcg
aag cat ctc aag aag gat ttg aat gct gtc tta ccc 6399Leu Ala Ala Lys
His Leu Lys Lys Asp Leu Asn Ala Val Leu Pro 2120 2125 2130gat tca
aga tcc ttc ttc gtc gga gtt gta cgc ttg aac ggg aaa 6444Asp Ser Arg
Ser Phe Phe Val Gly Val Val Arg Leu Asn Gly Lys 2135 2140 2145ctt
gga act ttc gaa aac atc agc gac ttc tct aaa ttt gat ttg 6489Leu Gly
Thr Phe Glu Asn Ile Ser Asp Phe Ser Lys Phe Asp Leu 2150 2155
2160acg aaa gcc cta gat tac gga cag cgt ggt tct ctc tta ggc ctg
6534Thr Lys Ala Leu Asp Tyr Gly Gln Arg Gly Ser Leu Leu Gly Leu
2165 2170 2175tgc aag tca cta gac tta gaa tgg gaa cag gtg ttt tgc
cgt gga 6579Cys Lys Ser Leu Asp Leu Glu Trp Glu Gln Val Phe Cys Arg
Gly 2180 2185 2190ata gat ctt gcg tgt gat ctt atg cca ctc cag gcc
gca agg ata 6624Ile Asp Leu Ala Cys Asp Leu Met Pro Leu Gln Ala Ala
Arg Ile 2195 2200 2205ctc aga aat gag ctt cag tgt ccc aat atg cgc
ctt cgc gag gtt 6669Leu Arg Asn Glu Leu Gln Cys Pro Asn Met Arg Leu
Arg Glu Val 2210 2215 2220ggg tac gat att tct ggc gcc agg tac acc
att tca acc gat gac 6714Gly Tyr Asp Ile Ser Gly Ala Arg Tyr Thr Ile
Ser Thr Asp Asp 2225 2230 2235ctg cta tgt gga ccc tcg aag gct aaa
gta gag gcc gca gac ttg 6759Leu Leu Cys Gly Pro Ser Lys Ala Lys Val
Glu Ala Ala Asp Leu 2240 2245 2250ttt ctt gtg aca ggt ggc gca cga
ggt att aca cct cat tgt gtt 6804Phe Leu Val Thr Gly Gly Ala Arg Gly
Ile Thr Pro His Cys Val 2255 2260 2265cgt gag att gca agt cga tcc
ccc gga acc aca ttt gtg ctg gtt 6849Arg Glu Ile Ala Ser Arg Ser Pro
Gly Thr Thr Phe Val Leu Val 2270 2275 2280gga aga agc gaa atg tcc
gac gag cct gac tgg gct gtt ggc cac 6894Gly Arg Ser Glu Met Ser Asp
Glu Pro Asp Trp Ala Val Gly His 2285 2290 2295tac aat aaa gac ctg
gac caa agc aca atg aaa cac ttg aaa gca 6939Tyr Asn Lys Asp Leu Asp
Gln Ser Thr Met Lys His Leu Lys Ala 2300 2305 2310acg cat gct gct
gga ggg gta aaa cct acg cct aaa gca cat cgt 6984Thr His Ala Ala Gly
Gly Val Lys Pro Thr Pro Lys Ala His Arg 2315 2320 2325gca ctt gtg
aac agg gtc act ggc tca cgg gag gta cga gaa tct 7029Ala Leu Val Asn
Arg Val Thr Gly Ser Arg Glu Val Arg Glu Ser 2330 2335 2340ctt aga
gca atc cag gag gca ggg gca aat gtc gaa tat atc gcc 7074Leu Arg Ala
Ile Gln Glu Ala Gly Ala Asn Val Glu Tyr Ile Ala 2345 2350 2355tgt
gat gtt tcg gat gaa aac aag gtc cgc caa ctt gtg caa aga 7119Cys Asp
Val Ser Asp Glu Asn Lys Val Arg Gln Leu Val Gln Arg 2360 2365
2370gtg gag caa aag tat ggc tgt gaa ata act ggg att tgg cat gca
7164Val Glu Gln Lys Tyr Gly Cys Glu Ile Thr Gly Ile Trp His Ala
2375 2380 2385agc ggg gtt ctt cgt gac aaa ctt gtc gag caa aag act
aca gac 7209Ser Gly Val Leu Arg Asp Lys Leu Val Glu Gln Lys Thr Thr
Asp 2390 2395 2400gac ttt gag gca gtt ttt ggg acc aag gtg act ggc
ctt gta aac 7254Asp Phe Glu Ala Val Phe Gly Thr Lys Val Thr Gly Leu
Val Asn 2405 2410 2415atc gtg tca caa gtc aat atg tct aag cta cga
cac ttc atc ctc 7299Ile Val Ser Gln Val Asn Met Ser Lys Leu Arg His
Phe Ile Leu 2420 2425 2430ttc agt tct ttg gct gga ttt cat ggg aac
aag ggc caa acg gat 7344Phe Ser Ser Leu Ala Gly Phe His Gly Asn Lys
Gly Gln Thr Asp 2435 2440 2445tat gca att gct aat gaa gcc ttg aac
aaa atc gcg cat act ctc 7389Tyr Ala Ile Ala Asn Glu Ala Leu Asn Lys
Ile Ala His Thr Leu 2450 2455 2460tca gcg ttt ttg ccc aaa ctg aat
gca aag gtg cta gac ttc ggt 7434Ser Ala Phe Leu Pro Lys Leu Asn Ala
Lys Val Leu Asp Phe Gly 2465 2470 2475ccg tgg gta ggt tca gga atg
gta acc gaa aca ctt gag aag cat 7479Pro Trp Val Gly Ser Gly Met Val
Thr Glu Thr Leu Glu Lys His 2480 2485 2490ttt aaa gct atg ggg gtt
cag act att cct ctc gag cca gga gca 7524Phe Lys Ala Met Gly Val Gln
Thr Ile Pro Leu Glu Pro Gly Ala 2495 2500 2505cgg act gtt gcg caa
atc att ttg gca agt tcg cca ccg caa tcg 7569Arg Thr Val Ala Gln Ile
Ile Leu Ala Ser Ser Pro Pro Gln Ser 2510 2515 2520ctt ttg ggg aac
tgg ggc ttt cca gcc acc aaa ccg cta caa cgc 7614Leu Leu Gly Asn Trp
Gly Phe Pro Ala Thr Lys Pro Leu Gln Arg 2525 2530 2535tct aat gta
gtc acg ggc aca ctc tct ccg gaa gag ata gaa ttc 7659Ser Asn Val Val
Thr Gly Thr Leu Ser Pro Glu Glu Ile Glu Phe 2540 2545 2550atc gca
gac cac aaa att caa ggc cgc aag gtg ctt ccc atg atg 7704Ile Ala Asp
His Lys Ile Gln Gly Arg Lys Val Leu Pro Met Met 2555 2560 2565gct
gca atc ggg ttc atg gcc tct att gcg gaa gga ctc tac ccg 7749Ala Ala
Ile Gly Phe Met Ala Ser Ile Ala Glu Gly Leu Tyr Pro 2570 2575
2580ggg tac aat ctg caa ggc gtg gaa aat gct cag ctc ttt caa ggc
7794Gly Tyr Asn Leu Gln Gly Val Glu Asn Ala Gln Leu Phe Gln Gly
2585 2590 2595ttg act atc aac caa gag aca aaa ttt caa atc act ctc
att gag 7839Leu Thr Ile Asn Gln Glu Thr Lys Phe Gln Ile Thr Leu Ile
Glu 2600 2605 2610gag cac aac tct gag gaa aac ctg gat gtc ctg aca
tcc ctt ggt 7884Glu His Asn Ser Glu Glu Asn Leu Asp Val Leu Thr Ser
Leu Gly 2615 2620 2625gta atg ttg gaa agc ggg aag gtg ctt ccc gct
tac cga tgt gtt 7929Val Met Leu Glu Ser Gly Lys Val Leu Pro Ala Tyr
Arg Cys Val 2630 2635 2640gta tgc ttg aat aca acc cag cag cag ccc
aag cta tct cca aaa 7974Val Cys Leu Asn Thr Thr Gln Gln Gln Pro Lys
Leu Ser Pro Lys 2645 2650 2655att ctt aac ttg gaa gtt gac cct gca
tgc gag gtt aac ccc tat 8019Ile Leu Asn Leu Glu Val Asp Pro Ala Cys
Glu Val Asn Pro Tyr 2660 2665 2670gat gga aag tcg ttg ttc cac ggt
ccg ctt ttg caa ttc gtt caa 8064Asp Gly Lys Ser Leu Phe His Gly Pro
Leu Leu Gln Phe Val Gln 2675 2680 2685caa gtg ttg cac tca agt acc
aaa ggc ctc gtt gcc aag tgc cgc 8109Gln Val Leu His Ser Ser Thr Lys
Gly Leu Val Ala Lys Cys Arg 2690 2695 2700gcg ctt cca atc aaa gaa
gcc atc cga ggg cca ttt atc aag caa 8154Ala Leu Pro Ile Lys Glu Ala
Ile Arg Gly Pro Phe Ile Lys Gln 2705 2710 2715aca ctc cat gat cca
att cta gac gac gtc att ttt cag cta atg 8199Thr Leu His Asp Pro Ile
Leu Asp Asp Val Ile Phe Gln Leu Met 2720 2725 2730ctc gtg tgg tgt
cgt aat gct cta gga agt gca tcg cta ccc aac 8244Leu Val Trp Cys Arg
Asn Ala Leu Gly Ser Ala Ser Leu Pro Asn 2735 2740 2745aga att gaa
aag atg tca tac ttt ggg aat gtc tca gaa ggt agc 8289Arg Ile Glu Lys
Met Ser Tyr Phe Gly Asn Val Ser Glu Gly Ser 2750 2755 2760act ttc
ttt gcc tca gtt aca cct gtg gga cca aga gta cca aag 8334Thr Phe Phe
Ala Ser Val Thr Pro Val Gly Pro Arg Val Pro Lys 2765 2770 2775gat
ccc gtg atc aaa atg cag ttt ctt ctc caa gat gaa tcc ggc 8379Asp Pro
Val Ile Lys Met Gln Phe Leu Leu Gln Asp Glu Ser Gly 2780 2785
2790aac aca ttt tca tcg ggg gag ggc tcg gtt gtg ctt agt gac gaa
8424Asn Thr Phe Ser Ser Gly Glu Gly Ser Val Val Leu Ser Asp Glu
2795 2800 2805ctc gtc ttt tga 8436Leu Val Phe
2810392811PRTThraustochytrium sp. 39Met Lys Asp Met Glu Asp Arg Arg
Val Ala Ile Val Gly Met Ser Ala1 5 10 15His Leu Pro Cys Gly Thr Asp
Val Lys Glu Ser Trp Gln Ala Ile Arg 20 25 30Asp Gly Ile Asp Cys Leu
Ser Asp Leu Pro Ala Asp Arg Leu Asp Val 35 40 45Thr Ala Tyr Tyr Asn
Pro Asn Lys Ala Thr Lys Asp Lys Ile Tyr Cys 50 55 60Lys Arg Gly Gly
Phe Ile Pro Asn Tyr Asp Phe Asp Pro Arg Glu Phe65 70 75 80Gly Leu
Asn Met Phe Gln Met Glu Asp Ser Asp Ala Asn Gln Thr Leu 85 90 95Thr
Leu Leu Lys Val Lys Gln Ala Leu Glu Asp Ala Ser Ile Glu Pro 100 105
110Phe Thr Lys Glu Lys Lys Asn Ile Gly Cys Val Leu Gly Ile Gly Gly
115 120 125Gly Gln Lys Ala Ser His Glu Phe Tyr Ser Arg Leu Asn Tyr
Val Val 130 135 140Val Glu Lys Val Leu Arg Lys Met Gly Leu Pro Asp
Ala Asp Val Glu145 150 155 160Glu Ala Val Glu Lys Tyr Lys Ala Asn
Phe Pro Glu Trp Arg Leu Asp 165 170 175Ser Phe Pro Gly Phe Leu Gly
Asn Val Thr Ala Gly Arg Cys Ser Asn 180 185 190Thr Phe Asn Met Glu
Gly Met Asn Cys Val Val Asp Ala Ala Cys Ala 195 200 205Ser Ser Leu
Ile Ala Ile Lys Val Ala Val Glu Glu Leu Leu Phe Gly 210 215 220Asp
Cys Asp Thr Met Ile Ala Gly Ala Thr Cys Thr Asp Asn Ser Leu225 230
235 240Gly Met Tyr Met Ala Phe Ser Lys Thr Pro Val Phe Ser Thr Asp
Pro 245 250 255Ser Val Arg Ala Tyr Asp Glu Lys Thr Lys Gly Met Leu
Ile Gly Glu 260 265 270Gly Ser Ala Met Phe Val Leu Lys Arg Tyr Ala
Asp Ala Val Arg Asp 275 280 285Gly Asp Thr Ile His Ala Val Leu Arg
Ser Cys Ser Ser Ser Ser Asp 290 295 300Gly Lys Ala Ala Gly Ile Tyr
Thr Pro Thr Ile Ser Gly Gln Glu Glu305 310 315 320Ala Leu Arg Arg
Ala Tyr Ala Arg Ala Gly Val Cys Pro Ser Thr Ile 325 330 335Gly Leu
Val Glu Gly His Gly Thr Gly Thr Pro Val Gly Asp Arg Ile 340 345
350Glu Leu Thr Ala Leu Arg Asn Leu Phe Asp Lys
Ala Phe Gly Ser Lys 355 360 365Lys Glu Gln Ile Ala Val Gly Ser Ile
Lys Ser Gln Ile Gly His Leu 370 375 380Lys Ser Val Ala Gly Phe Ala
Gly Leu Val Lys Ala Val Leu Ala Leu385 390 395 400Lys His Lys Thr
Leu Pro Gly Ser Ile Asn Val Asp Gln Pro Pro Leu 405 410 415Leu Tyr
Asp Gly Thr Gln Ile Gln Asp Ser Ser Leu Tyr Ile Asn Lys 420 425
430Thr Asn Arg Pro Trp Phe Thr Gln Asn Lys Leu Pro Arg Arg Ala Gly
435 440 445Val Ser Ser Phe Gly Phe Gly Gly Ala Asn Tyr His Ala Val
Leu Glu 450 455 460Glu Phe Glu Pro Glu His Glu Lys Pro Tyr Arg Leu
Asn Thr Val Gly465 470 475 480His Pro Val Leu Leu Tyr Ala Pro Ser
Val Glu Ala Leu Lys Val Leu 485 490 495Cys Asn Asp Gln Leu Ala Glu
Leu Thr Ile Ala Leu Glu Glu Ala Lys 500 505 510Thr His Lys Asn Val
Asp Lys Val Cys Gly Tyr Lys Phe Ile Asp Glu 515 520 525Phe Gln Leu
Gln Gly Ser Cys Pro Pro Glu Asn Pro Arg Val Gly Phe 530 535 540Leu
Ala Thr Leu Pro Thr Ser Asn Ile Ile Val Ala Leu Lys Ala Ile545 550
555 560Leu Ala Gln Leu Asp Ala Lys Pro Asp Ala Lys Lys Trp Asp Leu
Pro 565 570 575His Lys Lys Ala Phe Gly Ala Thr Phe Ala Ser Ser Ser
Val Lys Gly 580 585 590Ser Val Ala Ala Leu Phe Ala Gly Gln Gly Thr
Gln Tyr Leu Asn Met 595 600 605Phe Ser Asp Val Ala Met Asn Trp Pro
Pro Phe Arg Asp Ser Ile Val 610 615 620Ala Met Glu Glu Ala Gln Thr
Glu Val Phe Glu Gly Gln Val Glu Pro625 630 635 640Ile Ser Lys Val
Leu Phe Pro Arg Glu Arg Tyr Ala Ser Glu Ser Glu 645 650 655Gln Gly
Asn Glu Leu Leu Cys Leu Thr Glu Tyr Ser Gln Pro Thr Thr 660 665
670Ile Ala Ala Ala Val Gly Ala Phe Asp Ile Phe Lys Ala Ala Gly Phe
675 680 685Lys Pro Asp Met Val Gly Gly His Ser Leu Gly Glu Phe Ala
Ala Leu 690 695 700Tyr Ala Ala Gly Ser Ile Ser Arg Asp Asp Leu Tyr
Lys Leu Val Cys705 710 715 720Lys Arg Ala Lys Ala Met Ala Asn Ala
Ser Asp Gly Ala Met Ala Ala 725 730 735Val Ile Gly Pro Asp Ala Arg
Leu Val Thr Pro Gln Asn Ser Asp Val 740 745 750Tyr Val Ala Asn Phe
Asn Ser Ala Thr Gln Val Val Ile Ser Gly Thr 755 760 765Val Gln Gly
Val Lys Glu Glu Ser Lys Leu Leu Ile Ser Lys Gly Phe 770 775 780Arg
Val Leu Pro Leu Lys Cys Gln Gly Ala Phe His Ser Pro Leu Met785 790
795 800Gly Pro Ser Glu Asp Ser Phe Lys Ser Leu Val Glu Thr Cys Thr
Ile 805 810 815Ser Pro Pro Lys Asn Val Lys Phe Phe Cys Asn Val Ser
Gly Lys Glu 820 825 830Ser Pro Asn Pro Lys Gln Thr Leu Lys Ser His
Met Thr Ser Ser Val 835 840 845Gln Phe Glu Glu Gln Ile Arg Asn Met
Tyr Asp Ala Gly Ala Arg Val 850 855 860Phe Leu Glu Phe Gly Pro Arg
Gln Val Leu Ala Lys Leu Ile Ala Glu865 870 875 880Met Phe Pro Ser
Cys Thr Ala Ile Ser Val Asn Pro Ala Ser Ser Gly 885 890 895Asp Ser
Asp Val Gln Leu Arg Leu Ala Ala Val Lys Phe Ala Val Ser 900 905
910Gly Ala Ala Leu Ser Thr Phe Asp Pro Trp Glu Tyr Arg Lys Pro Gln
915 920 925Asp Leu Leu Ile Arg Lys Pro Arg Lys Thr Ala Leu Val Leu
Ser Ala 930 935 940Ala Thr Tyr Val Ser Pro Lys Thr Leu Ala Glu Arg
Lys Lys Ala Met945 950 955 960Glu Asp Ile Lys Leu Val Ser Ile Thr
Pro Arg Asp Ser Met Val Ser 965 970 975Ile Gly Lys Ile Ala Gln Glu
Val Arg Thr Ala Lys Gln Pro Leu Glu 980 985 990Thr Glu Ile Arg Arg
Leu Asn Lys Glu Leu Glu His Leu Lys Arg Glu 995 1000 1005Leu Ala
Ala Ala Lys Ala Ser Val Lys Ser Ala Ser Lys Ser Ser 1010 1015
1020Lys Glu Arg Ser Val Leu Ser Lys His Arg Ala Leu Leu Gln Asn
1025 1030 1035Ile Leu Gln Asp Tyr Asp Asp Leu Arg Val Val Pro Phe
Ala Val 1040 1045 1050Arg Ser Val Ala Val Asp Asn Thr Ala Pro Tyr
Ala Asp Gln Val 1055 1060 1065Ser Thr Pro Ala Ser Glu Arg Ser Ala
Ser Pro Leu Phe Glu Lys 1070 1075 1080Arg Ser Ser Val Ser Ser Ala
Arg Leu Ala Glu Ala Glu Ala Ala 1085 1090 1095Val Leu Ser Val Leu
Ala Asp Lys Thr Gly Tyr Asp Ser Ser Met 1100 1105 1110Ile Glu Met
Asp Met Asp Leu Glu Ser Glu Leu Gly Val Asp Ser 1115 1120 1125Ile
Lys Arg Val Glu Ile Met Ser Glu Val Gln Thr Leu Leu Ser 1130 1135
1140Val Glu Val Ser Asp Val Asp Ala Leu Ser Arg Thr Lys Thr Val
1145 1150 1155Gly Asp Val Ile Glu Ala Met Lys Leu Glu Leu Gly Gly
Pro Gln 1160 1165 1170Gly Gln Thr Leu Thr Ala Glu Ser Ile Arg Gln
Pro Pro Val Ser 1175 1180 1185Glu Pro Ala Val Pro Thr Ser Ser Ser
Ser Ser Ile Ala Asn Val 1190 1195 1200Ser Ser Ala Arg Leu Ala Glu
Ala Glu Ala Ala Val Leu Ser Val 1205 1210 1215Leu Ala Asp Lys Thr
Gly Tyr Asp Ser Ser Met Ile Glu Met Asp 1220 1225 1230Met Asp Leu
Glu Ser Glu Leu Gly Val Asp Ser Ile Lys Arg Val 1235 1240 1245Glu
Ile Met Ser Glu Val Gln Thr Leu Leu Ser Val Glu Val Ser 1250 1255
1260Asp Val Asp Ala Leu Ser Arg Thr Lys Thr Val Gly Asp Val Ile
1265 1270 1275Glu Ala Met Lys Leu Glu Leu Gly Gly Pro Gln Gly Gln
Thr Leu 1280 1285 1290Thr Ala Glu Ser Ile Arg Gln Pro Pro Val Ser
Glu Pro Ala Val 1295 1300 1305Pro Thr Ser Ser Ser Ser Ser Ile Ala
Asn Val Ser Ser Ala Arg 1310 1315 1320Leu Ala Glu Ala Glu Ala Ala
Val Leu Ser Val Leu Ala Asp Lys 1325 1330 1335Thr Gly Tyr Asp Ser
Ser Met Ile Glu Met Asp Met Asp Leu Glu 1340 1345 1350Ser Glu Leu
Gly Val Asp Ser Ile Lys Arg Val Glu Ile Met Ser 1355 1360 1365Glu
Val Gln Thr Leu Leu Ser Val Glu Val Ser Asp Val Asp Ala 1370 1375
1380Leu Ser Arg Thr Lys Thr Val Gly Asp Val Ile Glu Ala Met Lys
1385 1390 1395Leu Glu Leu Gly Gly Pro Gln Gly Gln Thr Leu Thr Ala
Glu Ser 1400 1405 1410Ile Arg Gln Pro Pro Val Ser Glu Pro Ala Val
Pro Thr Ser Ser 1415 1420 1425Ser Ser Ser Ile Ala Asn Val Leu Ser
Ala Arg Leu Ala Glu Ala 1430 1435 1440Glu Ala Ala Val Leu Ser Val
Leu Ala Asp Lys Thr Gly Tyr Asp 1445 1450 1455Ser Ser Met Ile Glu
Met Asp Met Asp Leu Glu Ser Glu Leu Gly 1460 1465 1470Val Asp Ser
Ile Lys Arg Val Glu Ile Met Ser Glu Val Gln Thr 1475 1480 1485Leu
Leu Ser Val Glu Val Ser Asp Val Asp Ala Leu Ser Arg Thr 1490 1495
1500Lys Thr Val Gly Asp Val Ile Glu Ala Met Lys Leu Glu Leu Gly
1505 1510 1515Gly Pro Gln Gly Gln Thr Leu Thr Ala Glu Ser Ile Arg
Gln Pro 1520 1525 1530Pro Val Ser Glu Pro Ala Val Pro Thr Ser Ser
Ser Ser Ser Ile 1535 1540 1545Ala Asn Val Ser Ser Ala Arg Leu Ala
Glu Ala Glu Ala Ala Val 1550 1555 1560Leu Ser Val Leu Ala Asp Lys
Thr Gly Tyr Asp Ser Ser Met Ile 1565 1570 1575Glu Met Asp Met Asp
Leu Glu Ser Glu Leu Gly Val Asp Ser Ile 1580 1585 1590Lys Arg Val
Glu Ile Met Ser Glu Val Gln Thr Leu Leu Ser Val 1595 1600 1605Glu
Val Ser Asp Val Asp Ala Leu Ser Arg Thr Lys Thr Val Gly 1610 1615
1620Asp Val Ile Glu Ala Met Lys Leu Glu Leu Gly Gly Pro Gln Gly
1625 1630 1635Gln Thr Leu Thr Ser Glu Pro Ile His Gln Pro Pro Val
Ser Glu 1640 1645 1650Pro Ala Val Pro Thr Ser Ser Ser Ser Ser Ile
Ala Asn Val Ser 1655 1660 1665Ser Ala Arg Leu Ala Glu Ala Glu Ala
Ala Val Leu Ser Val Leu 1670 1675 1680Ala Asp Lys Thr Gly Tyr Asp
Ser Ser Met Ile Glu Met Asp Met 1685 1690 1695Asp Leu Glu Ser Glu
Leu Gly Val Asp Ser Ile Lys Arg Val Glu 1700 1705 1710Ile Met Ser
Glu Val Gln Thr Leu Leu Ser Val Glu Val Ser Asp 1715 1720 1725Val
Asp Ala Leu Ser Arg Thr Lys Thr Val Gly Asp Val Ile Glu 1730 1735
1740Ala Met Lys Met Glu Leu Gly Gly Pro Gln Gly Gln Thr Leu Thr
1745 1750 1755Ala Glu Ser Ile Arg Gln Pro Pro Val Ser Glu Pro Ala
Val Pro 1760 1765 1770Thr Ser Ser Ser Ser Ser Ile Ala Asn Val Ser
Ser Ala Arg Leu 1775 1780 1785Ala Glu Ala Glu Ala Ala Val Leu Ser
Val Leu Ala Asp Lys Thr 1790 1795 1800Gly Tyr Asp Ser Ser Met Ile
Glu Met Asp Met Asp Leu Glu Ser 1805 1810 1815Glu Leu Gly Val Asp
Ser Ile Lys Arg Val Glu Ile Met Ser Glu 1820 1825 1830Val Gln Ala
Leu Leu Ser Val Glu Val Ser Asp Val Asp Ala Leu 1835 1840 1845Ser
Arg Thr Lys Thr Val Gly Asp Val Ile Glu Ala Met Lys Met 1850 1855
1860Glu Leu Gly Gly Pro Gln Gly Gln Thr Leu Thr Ala Glu Ser Ile
1865 1870 1875Arg Glu Pro Pro Val Ser Glu Pro Ala Val Pro Thr Ser
Ser Ser 1880 1885 1890Ser Ser Ile Ala Asn Val Ser Ser Ala Arg Leu
Ala Glu Ala Glu 1895 1900 1905Ala Ala Val Leu Ser Val Leu Ala Asp
Lys Thr Gly Tyr Asp Ser 1910 1915 1920Ser Met Ile Glu Met Asp Met
Asp Leu Glu Ser Glu Leu Gly Val 1925 1930 1935Asp Ser Ile Lys Arg
Val Glu Ile Met Ser Glu Val Gln Thr Leu 1940 1945 1950Leu Ser Val
Glu Val Ser Asp Val Asp Ala Leu Ser Arg Thr Lys 1955 1960 1965Thr
Val Gly Asp Val Ile Glu Ala Met Lys Leu Glu Leu Gly Glu 1970 1975
1980Ser Ser Ser Ile Glu Thr Leu Asn Cys Thr Glu Val Glu His Thr
1985 1990 1995Ser Tyr Lys Ser Val Lys Ala Ser Gly Cys Glu Asn Val
Asp Thr 2000 2005 2010Arg Phe Ala Lys Val Val Gln Ile Ser Leu Pro
Ser Lys Leu Lys 2015 2020 2025Ser Thr Val Ser His Asp Arg Pro Val
Ile Val Val Asp Asp Gly 2030 2035 2040Thr Pro Leu Thr Thr Glu Leu
Cys Lys Ile Leu Gly Gly Asn Ile 2045 2050 2055Val Val Leu Ser Tyr
Gln Gly Lys Pro Ala Gly Pro Arg Gly Val 2060 2065 2070Glu Val Pro
Asp Leu Ser Glu Glu Ala Leu Ile Gln Ala Leu Ala 2075 2080 2085Leu
Ile Arg Ser Thr Tyr Gly Val Pro Ile Gly Phe Ile Cys Gln 2090 2095
2100Gln Val Ser Asn Val Ser Thr Lys Ala Gln Leu Cys Trp Ala Leu
2105 2110 2115Leu Ala Ala Lys His Leu Lys Lys Asp Leu Asn Ala Val
Leu Pro 2120 2125 2130Asp Ser Arg Ser Phe Phe Val Gly Val Val Arg
Leu Asn Gly Lys 2135 2140 2145Leu Gly Thr Phe Glu Asn Ile Ser Asp
Phe Ser Lys Phe Asp Leu 2150 2155 2160Thr Lys Ala Leu Asp Tyr Gly
Gln Arg Gly Ser Leu Leu Gly Leu 2165 2170 2175Cys Lys Ser Leu Asp
Leu Glu Trp Glu Gln Val Phe Cys Arg Gly 2180 2185 2190Ile Asp Leu
Ala Cys Asp Leu Met Pro Leu Gln Ala Ala Arg Ile 2195 2200 2205Leu
Arg Asn Glu Leu Gln Cys Pro Asn Met Arg Leu Arg Glu Val 2210 2215
2220Gly Tyr Asp Ile Ser Gly Ala Arg Tyr Thr Ile Ser Thr Asp Asp
2225 2230 2235Leu Leu Cys Gly Pro Ser Lys Ala Lys Val Glu Ala Ala
Asp Leu 2240 2245 2250Phe Leu Val Thr Gly Gly Ala Arg Gly Ile Thr
Pro His Cys Val 2255 2260 2265Arg Glu Ile Ala Ser Arg Ser Pro Gly
Thr Thr Phe Val Leu Val 2270 2275 2280Gly Arg Ser Glu Met Ser Asp
Glu Pro Asp Trp Ala Val Gly His 2285 2290 2295Tyr Asn Lys Asp Leu
Asp Gln Ser Thr Met Lys His Leu Lys Ala 2300 2305 2310Thr His Ala
Ala Gly Gly Val Lys Pro Thr Pro Lys Ala His Arg 2315 2320 2325Ala
Leu Val Asn Arg Val Thr Gly Ser Arg Glu Val Arg Glu Ser 2330 2335
2340Leu Arg Ala Ile Gln Glu Ala Gly Ala Asn Val Glu Tyr Ile Ala
2345 2350 2355Cys Asp Val Ser Asp Glu Asn Lys Val Arg Gln Leu Val
Gln Arg 2360 2365 2370Val Glu Gln Lys Tyr Gly Cys Glu Ile Thr Gly
Ile Trp His Ala 2375 2380 2385Ser Gly Val Leu Arg Asp Lys Leu Val
Glu Gln Lys Thr Thr Asp 2390 2395 2400Asp Phe Glu Ala Val Phe Gly
Thr Lys Val Thr Gly Leu Val Asn 2405 2410 2415Ile Val Ser Gln Val
Asn Met Ser Lys Leu Arg His Phe Ile Leu 2420 2425 2430Phe Ser Ser
Leu Ala Gly Phe His Gly Asn Lys Gly Gln Thr Asp 2435 2440 2445Tyr
Ala Ile Ala Asn Glu Ala Leu Asn Lys Ile Ala His Thr Leu 2450 2455
2460Ser Ala Phe Leu Pro Lys Leu Asn Ala Lys Val Leu Asp Phe Gly
2465 2470 2475Pro Trp Val Gly Ser Gly Met Val Thr Glu Thr Leu Glu
Lys His 2480 2485 2490Phe Lys Ala Met Gly Val Gln Thr Ile Pro Leu
Glu Pro Gly Ala 2495 2500 2505Arg Thr Val Ala Gln Ile Ile Leu Ala
Ser Ser Pro Pro Gln Ser 2510 2515 2520Leu Leu Gly Asn Trp Gly Phe
Pro Ala Thr Lys Pro Leu Gln Arg 2525 2530 2535Ser Asn Val Val Thr
Gly Thr Leu Ser Pro Glu Glu Ile Glu Phe 2540 2545 2550Ile Ala Asp
His Lys Ile Gln Gly Arg Lys Val Leu Pro Met Met 2555 2560 2565Ala
Ala Ile Gly Phe Met Ala Ser Ile Ala Glu Gly Leu Tyr Pro 2570 2575
2580Gly Tyr Asn Leu Gln Gly Val Glu Asn Ala Gln Leu Phe Gln Gly
2585 2590 2595Leu Thr Ile Asn Gln Glu Thr Lys Phe Gln Ile Thr Leu
Ile Glu 2600 2605 2610Glu His Asn Ser Glu Glu Asn Leu Asp Val Leu
Thr Ser Leu Gly 2615 2620 2625Val Met Leu Glu Ser Gly Lys Val Leu
Pro Ala Tyr Arg Cys Val 2630 2635 2640Val Cys Leu Asn Thr Thr Gln
Gln Gln Pro Lys Leu Ser Pro Lys 2645 2650 2655Ile Leu Asn Leu Glu
Val Asp Pro Ala Cys Glu Val Asn Pro Tyr 2660 2665 2670Asp Gly Lys
Ser Leu Phe His Gly Pro Leu Leu Gln Phe Val Gln 2675 2680 2685Gln
Val Leu His Ser Ser Thr Lys Gly Leu Val Ala Lys Cys Arg 2690 2695
2700Ala Leu Pro Ile Lys Glu Ala Ile Arg Gly Pro Phe Ile Lys Gln
2705 2710 2715Thr Leu His Asp Pro Ile Leu Asp Asp Val Ile Phe Gln
Leu Met 2720 2725 2730Leu Val Trp Cys Arg Asn Ala Leu Gly Ser Ala
Ser Leu Pro Asn 2735 2740 2745Arg Ile Glu Lys Met Ser Tyr Phe Gly
Asn Val Ser Glu Gly Ser 2750 2755 2760Thr Phe Phe Ala Ser Val Thr
Pro Val Gly Pro Arg Val Pro Lys 2765 2770 2775Asp Pro Val Ile Lys
Met Gln Phe Leu Leu Gln Asp Glu Ser Gly 2780 2785 2790Asn Thr Phe
Ser Ser Gly Glu Gly Ser Val Val Leu Ser Asp Glu 2795
2800 2805Leu Val Phe 2810401500DNAThraustochytrium
sp.CDS(1)..(1500) 40atg aag gac atg gaa gat aga cgg gtc gct att gtg
ggc atg tca gct 48Met Lys Asp Met Glu Asp Arg Arg Val Ala Ile Val
Gly Met Ser Ala1 5 10 15cac ttg cct tgt ggg aca gat gtg aag gaa tca
tgg cag gct att cgc 96His Leu Pro Cys Gly Thr Asp Val Lys Glu Ser
Trp Gln Ala Ile Arg 20 25 30gat gga atc gac tgt cta agt gac cta ccc
gcg gat cgt ctc gac gtt 144Asp Gly Ile Asp Cys Leu Ser Asp Leu Pro
Ala Asp Arg Leu Asp Val 35 40 45aca gct tac tac aat ccc aac aaa gcc
acg aaa gac aag atc tac tgc 192Thr Ala Tyr Tyr Asn Pro Asn Lys Ala
Thr Lys Asp Lys Ile Tyr Cys 50 55 60aaa cgg ggt ggc ttc atc ccg aac
tat gac ttc gac ccc cgc gaa ttt 240Lys Arg Gly Gly Phe Ile Pro Asn
Tyr Asp Phe Asp Pro Arg Glu Phe65 70 75 80ggg ctc aac atg ttt caa
atg gaa gac tct gat gcg aat cag aca ctt 288Gly Leu Asn Met Phe Gln
Met Glu Asp Ser Asp Ala Asn Gln Thr Leu 85 90 95acc ttg ctc aaa gtc
aaa caa gct ctc gaa gat gca agc ata gag cct 336Thr Leu Leu Lys Val
Lys Gln Ala Leu Glu Asp Ala Ser Ile Glu Pro 100 105 110ttc acc aag
gag aag aag aac att gga tgt gtt tta ggt att ggt ggg 384Phe Thr Lys
Glu Lys Lys Asn Ile Gly Cys Val Leu Gly Ile Gly Gly 115 120 125ggc
caa aag gcg agt cat gag ttc tac tct cgt ctc aac tac gtt gtc 432Gly
Gln Lys Ala Ser His Glu Phe Tyr Ser Arg Leu Asn Tyr Val Val 130 135
140gtt gaa aag gta ctt cgg aaa atg ggt tta cca gat gct gat gtt gaa
480Val Glu Lys Val Leu Arg Lys Met Gly Leu Pro Asp Ala Asp Val
Glu145 150 155 160gaa gct gtg gag aaa tac aag gca aat ttt ccc gag
tgg cgc cta gac 528Glu Ala Val Glu Lys Tyr Lys Ala Asn Phe Pro Glu
Trp Arg Leu Asp 165 170 175tct ttc cct ggg ttt ctt ggg aat gta acg
gct ggt cgg tgc agt aac 576Ser Phe Pro Gly Phe Leu Gly Asn Val Thr
Ala Gly Arg Cys Ser Asn 180 185 190acc ttc aac atg gaa ggt atg aac
tgc gtt gtg gat gct gca tgt gcc 624Thr Phe Asn Met Glu Gly Met Asn
Cys Val Val Asp Ala Ala Cys Ala 195 200 205agt tct cta att gca atc
aag gtt gca gtt gaa gag cta ctc ttt ggt 672Ser Ser Leu Ile Ala Ile
Lys Val Ala Val Glu Glu Leu Leu Phe Gly 210 215 220gac tgt gac acc
atg att gca ggt gcc acc tgc acg gac aat tca ctt 720Asp Cys Asp Thr
Met Ile Ala Gly Ala Thr Cys Thr Asp Asn Ser Leu225 230 235 240ggc
atg tac atg gcc ttc tct aaa acg cca gtt ttt tct act gac cca 768Gly
Met Tyr Met Ala Phe Ser Lys Thr Pro Val Phe Ser Thr Asp Pro 245 250
255agt gtc cgc gcg tat gat gag aaa aca aaa ggg atg cta att gga gaa
816Ser Val Arg Ala Tyr Asp Glu Lys Thr Lys Gly Met Leu Ile Gly Glu
260 265 270ggt tca gca atg ttc gtt ctt aaa cgc tat gcg gat gcc gta
cgt gat 864Gly Ser Ala Met Phe Val Leu Lys Arg Tyr Ala Asp Ala Val
Arg Asp 275 280 285ggc gac aca att cac gcg gtt ctg cgt tct tgc tct
tcg tct agt gat 912Gly Asp Thr Ile His Ala Val Leu Arg Ser Cys Ser
Ser Ser Ser Asp 290 295 300gga aaa gcg gca gga att tat act cct act
ata tct gga caa gaa gaa 960Gly Lys Ala Ala Gly Ile Tyr Thr Pro Thr
Ile Ser Gly Gln Glu Glu305 310 315 320gct ttg cgt cga gcg tat gcc
cgt gcg ggg gta tgt cca tct acg atc 1008Ala Leu Arg Arg Ala Tyr Ala
Arg Ala Gly Val Cys Pro Ser Thr Ile 325 330 335ggg ctt gtt gag ggt
cac ggg aca ggg acc cct gtt gga gat cgc att 1056Gly Leu Val Glu Gly
His Gly Thr Gly Thr Pro Val Gly Asp Arg Ile 340 345 350gag tta aca
gct ctg cgg aac ttg ttt gac aaa gct ttt ggt agc aag 1104Glu Leu Thr
Ala Leu Arg Asn Leu Phe Asp Lys Ala Phe Gly Ser Lys 355 360 365aag
gaa caa ata gca gtt ggc agc ata aag tct cag ata ggt cac ctg 1152Lys
Glu Gln Ile Ala Val Gly Ser Ile Lys Ser Gln Ile Gly His Leu 370 375
380aaa tct gtt gcc ggc ttt gcc ggc ttg gtc aaa gct gtg ctt gcg ctt
1200Lys Ser Val Ala Gly Phe Ala Gly Leu Val Lys Ala Val Leu Ala
Leu385 390 395 400aaa cac aaa acg ctc cca ggt tcg att aat gtc gac
cag cca cct ttg 1248Lys His Lys Thr Leu Pro Gly Ser Ile Asn Val Asp
Gln Pro Pro Leu 405 410 415ttg tat gac ggt act caa att caa gac tct
tct tta tat atc aac aag 1296Leu Tyr Asp Gly Thr Gln Ile Gln Asp Ser
Ser Leu Tyr Ile Asn Lys 420 425 430aca aat aga cca tgg ttt acg caa
aac aag ctt ccg cgt cgg gct ggt 1344Thr Asn Arg Pro Trp Phe Thr Gln
Asn Lys Leu Pro Arg Arg Ala Gly 435 440 445gtc tca agt ttt gga ttt
gga ggt gca aac tac cac gcg gtt ctg gaa 1392Val Ser Ser Phe Gly Phe
Gly Gly Ala Asn Tyr His Ala Val Leu Glu 450 455 460gaa ttc gag ccc
gag cat gaa aaa cca tac cgc ctc aat act gtt gga 1440Glu Phe Glu Pro
Glu His Glu Lys Pro Tyr Arg Leu Asn Thr Val Gly465 470 475 480cat
cct gtc ctc ttg tac gct ccg tct gtg gaa gcc ctc aaa gta ctt 1488His
Pro Val Leu Leu Tyr Ala Pro Ser Val Glu Ala Leu Lys Val Leu 485 490
495tgc aac gac cag 1500Cys Asn Asp Gln 50041500PRTThraustochytrium
sp. 41Met Lys Asp Met Glu Asp Arg Arg Val Ala Ile Val Gly Met Ser
Ala1 5 10 15His Leu Pro Cys Gly Thr Asp Val Lys Glu Ser Trp Gln Ala
Ile Arg 20 25 30Asp Gly Ile Asp Cys Leu Ser Asp Leu Pro Ala Asp Arg
Leu Asp Val 35 40 45Thr Ala Tyr Tyr Asn Pro Asn Lys Ala Thr Lys Asp
Lys Ile Tyr Cys 50 55 60Lys Arg Gly Gly Phe Ile Pro Asn Tyr Asp Phe
Asp Pro Arg Glu Phe65 70 75 80Gly Leu Asn Met Phe Gln Met Glu Asp
Ser Asp Ala Asn Gln Thr Leu 85 90 95Thr Leu Leu Lys Val Lys Gln Ala
Leu Glu Asp Ala Ser Ile Glu Pro 100 105 110Phe Thr Lys Glu Lys Lys
Asn Ile Gly Cys Val Leu Gly Ile Gly Gly 115 120 125Gly Gln Lys Ala
Ser His Glu Phe Tyr Ser Arg Leu Asn Tyr Val Val 130 135 140Val Glu
Lys Val Leu Arg Lys Met Gly Leu Pro Asp Ala Asp Val Glu145 150 155
160Glu Ala Val Glu Lys Tyr Lys Ala Asn Phe Pro Glu Trp Arg Leu Asp
165 170 175Ser Phe Pro Gly Phe Leu Gly Asn Val Thr Ala Gly Arg Cys
Ser Asn 180 185 190Thr Phe Asn Met Glu Gly Met Asn Cys Val Val Asp
Ala Ala Cys Ala 195 200 205Ser Ser Leu Ile Ala Ile Lys Val Ala Val
Glu Glu Leu Leu Phe Gly 210 215 220Asp Cys Asp Thr Met Ile Ala Gly
Ala Thr Cys Thr Asp Asn Ser Leu225 230 235 240Gly Met Tyr Met Ala
Phe Ser Lys Thr Pro Val Phe Ser Thr Asp Pro 245 250 255Ser Val Arg
Ala Tyr Asp Glu Lys Thr Lys Gly Met Leu Ile Gly Glu 260 265 270Gly
Ser Ala Met Phe Val Leu Lys Arg Tyr Ala Asp Ala Val Arg Asp 275 280
285Gly Asp Thr Ile His Ala Val Leu Arg Ser Cys Ser Ser Ser Ser Asp
290 295 300Gly Lys Ala Ala Gly Ile Tyr Thr Pro Thr Ile Ser Gly Gln
Glu Glu305 310 315 320Ala Leu Arg Arg Ala Tyr Ala Arg Ala Gly Val
Cys Pro Ser Thr Ile 325 330 335Gly Leu Val Glu Gly His Gly Thr Gly
Thr Pro Val Gly Asp Arg Ile 340 345 350Glu Leu Thr Ala Leu Arg Asn
Leu Phe Asp Lys Ala Phe Gly Ser Lys 355 360 365Lys Glu Gln Ile Ala
Val Gly Ser Ile Lys Ser Gln Ile Gly His Leu 370 375 380Lys Ser Val
Ala Gly Phe Ala Gly Leu Val Lys Ala Val Leu Ala Leu385 390 395
400Lys His Lys Thr Leu Pro Gly Ser Ile Asn Val Asp Gln Pro Pro Leu
405 410 415Leu Tyr Asp Gly Thr Gln Ile Gln Asp Ser Ser Leu Tyr Ile
Asn Lys 420 425 430Thr Asn Arg Pro Trp Phe Thr Gln Asn Lys Leu Pro
Arg Arg Ala Gly 435 440 445Val Ser Ser Phe Gly Phe Gly Gly Ala Asn
Tyr His Ala Val Leu Glu 450 455 460Glu Phe Glu Pro Glu His Glu Lys
Pro Tyr Arg Leu Asn Thr Val Gly465 470 475 480His Pro Val Leu Leu
Tyr Ala Pro Ser Val Glu Ala Leu Lys Val Leu 485 490 495Cys Asn Asp
Gln 500421500DNAThraustochytrium sp.CDS(1)..(1500) 42ctt gcg gag
ctc aca att gca ttg gaa gag gca aaa aca cat aaa aat 48Leu Ala Glu
Leu Thr Ile Ala Leu Glu Glu Ala Lys Thr His Lys Asn1 5 10 15gtt gac
aaa gtt tgt ggc tac aag ttt att gac gaa ttt cag ctc caa 96Val Asp
Lys Val Cys Gly Tyr Lys Phe Ile Asp Glu Phe Gln Leu Gln 20 25 30gga
agc tgt cct cca gaa aat ccg aga gta gga ttt tta gca aca ctg 144Gly
Ser Cys Pro Pro Glu Asn Pro Arg Val Gly Phe Leu Ala Thr Leu 35 40
45cct act tca aat atc att gtc gcg ctt aag gca att ctc gcg cag ctt
192Pro Thr Ser Asn Ile Ile Val Ala Leu Lys Ala Ile Leu Ala Gln Leu
50 55 60gat gca aaa cca gat gcg aag aaa tgg gat ttg cct cat aaa aag
gct 240Asp Ala Lys Pro Asp Ala Lys Lys Trp Asp Leu Pro His Lys Lys
Ala65 70 75 80ttt ggg gct acc ttc gca tcg tct tca gtg aaa ggc tct
gtt gct gcg 288Phe Gly Ala Thr Phe Ala Ser Ser Ser Val Lys Gly Ser
Val Ala Ala 85 90 95ctc ttc gca gga cag ggt acc cag tac tta aac atg
ttc tct gat gtg 336Leu Phe Ala Gly Gln Gly Thr Gln Tyr Leu Asn Met
Phe Ser Asp Val 100 105 110gca atg aac tgg cca ccg ttc cgt gac agc
att gtc gca atg gaa gaa 384Ala Met Asn Trp Pro Pro Phe Arg Asp Ser
Ile Val Ala Met Glu Glu 115 120 125gct caa act gag gta ttt gag ggc
caa gtt gaa cca att agc aaa gtt 432Ala Gln Thr Glu Val Phe Glu Gly
Gln Val Glu Pro Ile Ser Lys Val 130 135 140ctg ttt cca cga gag cgc
tat gca tcc gaa agt gaa cag ggg aat gaa 480Leu Phe Pro Arg Glu Arg
Tyr Ala Ser Glu Ser Glu Gln Gly Asn Glu145 150 155 160ctt ctt tgc
tta aca gag tac tct cag cca act acg ata gca gcc gca 528Leu Leu Cys
Leu Thr Glu Tyr Ser Gln Pro Thr Thr Ile Ala Ala Ala 165 170 175gta
ggg gcc ttc gat att ttc aaa gcg gct ggc ttt aag cca gac atg 576Val
Gly Ala Phe Asp Ile Phe Lys Ala Ala Gly Phe Lys Pro Asp Met 180 185
190gtt gga ggg cat tca ctt ggc gaa ttt gct gct ttg tac gcg gct ggg
624Val Gly Gly His Ser Leu Gly Glu Phe Ala Ala Leu Tyr Ala Ala Gly
195 200 205tcc att tcg cgt gac gac ctg tac aag ctt gtg tgc aaa cgg
gca aag 672Ser Ile Ser Arg Asp Asp Leu Tyr Lys Leu Val Cys Lys Arg
Ala Lys 210 215 220gca atg gcg aac gct agt gac gga gct atg gca gca
gtg att ggc cca 720Ala Met Ala Asn Ala Ser Asp Gly Ala Met Ala Ala
Val Ile Gly Pro225 230 235 240gat gca cgt cta gtt acg cca caa aat
agt gac gtt tat gtc gca aac 768Asp Ala Arg Leu Val Thr Pro Gln Asn
Ser Asp Val Tyr Val Ala Asn 245 250 255ttc aac tcc gca act caa gta
gtc atc agt ggc act gtt caa ggt gtg 816Phe Asn Ser Ala Thr Gln Val
Val Ile Ser Gly Thr Val Gln Gly Val 260 265 270aaa gaa gag tcg aaa
ttg ctc att tca aag ggg ttc cgc gta ctg cca 864Lys Glu Glu Ser Lys
Leu Leu Ile Ser Lys Gly Phe Arg Val Leu Pro 275 280 285ctt aaa tgc
cag ggc gcc ttc cat tct cct ttg atg ggg cct tct gag 912Leu Lys Cys
Gln Gly Ala Phe His Ser Pro Leu Met Gly Pro Ser Glu 290 295 300gat
agt ttc aaa tca ctt gtg gag act tgt acc atc tcg ccg cca aaa 960Asp
Ser Phe Lys Ser Leu Val Glu Thr Cys Thr Ile Ser Pro Pro Lys305 310
315 320aat gtg aaa ttc ttt tgc aat gtt agt ggc aag gaa agc cca aac
cca 1008Asn Val Lys Phe Phe Cys Asn Val Ser Gly Lys Glu Ser Pro Asn
Pro 325 330 335aaa cag acc ctc aag tca cac atg acg tct agc gtt cag
ttc gag gag 1056Lys Gln Thr Leu Lys Ser His Met Thr Ser Ser Val Gln
Phe Glu Glu 340 345 350cag att cgt aac atg tac gat gcc gga gca cgt
gtt ttt ctg gag ttt 1104Gln Ile Arg Asn Met Tyr Asp Ala Gly Ala Arg
Val Phe Leu Glu Phe 355 360 365gga ccc cgc caa gtc ctt gca aag ctt
atc gcg gaa atg ttt ccc tcg 1152Gly Pro Arg Gln Val Leu Ala Lys Leu
Ile Ala Glu Met Phe Pro Ser 370 375 380tgt aca gct atc agc gtt aac
ccc gcg agc agt ggt gac agt gac gtg 1200Cys Thr Ala Ile Ser Val Asn
Pro Ala Ser Ser Gly Asp Ser Asp Val385 390 395 400caa ctc cgc ctc
gcc gcc gta aaa ttc gcg gtc tcg ggt gca gcc ctt 1248Gln Leu Arg Leu
Ala Ala Val Lys Phe Ala Val Ser Gly Ala Ala Leu 405 410 415agc acc
ttt gat cca tgg gag tat cgc aag cca caa gat ctt ctt att 1296Ser Thr
Phe Asp Pro Trp Glu Tyr Arg Lys Pro Gln Asp Leu Leu Ile 420 425
430cga aaa cca cga aaa act gcc ctt gtt cta tca gca gca aca tat gtt
1344Arg Lys Pro Arg Lys Thr Ala Leu Val Leu Ser Ala Ala Thr Tyr Val
435 440 445tcc cca aag act ctt gca gaa cgt aaa aag gct atg gaa gat
atc aag 1392Ser Pro Lys Thr Leu Ala Glu Arg Lys Lys Ala Met Glu Asp
Ile Lys 450 455 460cta gta tcc att aca cca aga gat agt atg gta tca
att gga aaa atc 1440Leu Val Ser Ile Thr Pro Arg Asp Ser Met Val Ser
Ile Gly Lys Ile465 470 475 480gcg caa gaa gta cgg aca gct aaa cag
cct tta gaa acc gaa att cga 1488Ala Gln Glu Val Arg Thr Ala Lys Gln
Pro Leu Glu Thr Glu Ile Arg 485 490 495aga ctc aac aaa 1500Arg Leu
Asn Lys 50043500PRTThraustochytrium sp. 43Leu Ala Glu Leu Thr Ile
Ala Leu Glu Glu Ala Lys Thr His Lys Asn1 5 10 15Val Asp Lys Val Cys
Gly Tyr Lys Phe Ile Asp Glu Phe Gln Leu Gln 20 25 30Gly Ser Cys Pro
Pro Glu Asn Pro Arg Val Gly Phe Leu Ala Thr Leu 35 40 45Pro Thr Ser
Asn Ile Ile Val Ala Leu Lys Ala Ile Leu Ala Gln Leu 50 55 60Asp Ala
Lys Pro Asp Ala Lys Lys Trp Asp Leu Pro His Lys Lys Ala65 70 75
80Phe Gly Ala Thr Phe Ala Ser Ser Ser Val Lys Gly Ser Val Ala Ala
85 90 95Leu Phe Ala Gly Gln Gly Thr Gln Tyr Leu Asn Met Phe Ser Asp
Val 100 105 110Ala Met Asn Trp Pro Pro Phe Arg Asp Ser Ile Val Ala
Met Glu Glu 115 120 125Ala Gln Thr Glu Val Phe Glu Gly Gln Val Glu
Pro Ile Ser Lys Val 130 135 140Leu Phe Pro Arg Glu Arg Tyr Ala Ser
Glu Ser Glu Gln Gly Asn Glu145 150 155 160Leu Leu Cys Leu Thr Glu
Tyr Ser Gln Pro Thr Thr Ile Ala Ala Ala 165 170 175Val Gly Ala Phe
Asp Ile Phe Lys Ala Ala Gly Phe Lys Pro Asp Met 180 185 190Val Gly
Gly His Ser Leu Gly Glu Phe Ala Ala Leu Tyr Ala Ala Gly 195 200
205Ser Ile Ser Arg Asp Asp Leu Tyr Lys Leu Val Cys Lys Arg Ala Lys
210 215 220Ala Met Ala Asn Ala Ser Asp Gly Ala Met Ala Ala Val Ile
Gly Pro225 230 235 240Asp Ala Arg Leu Val Thr Pro Gln Asn Ser Asp
Val Tyr Val Ala Asn 245 250 255Phe Asn Ser Ala Thr Gln Val Val Ile
Ser Gly Thr Val Gln Gly Val 260 265 270Lys Glu Glu Ser Lys Leu Leu
Ile Ser Lys Gly Phe Arg Val Leu Pro 275 280 285Leu Lys Cys Gln Gly
Ala Phe His Ser Pro Leu Met Gly Pro Ser Glu 290 295 300Asp Ser Phe
Lys Ser Leu Val Glu Thr Cys Thr Ile Ser Pro Pro Lys305 310 315
320Asn Val
Lys Phe Phe Cys Asn Val Ser Gly Lys Glu Ser Pro Asn Pro 325 330
335Lys Gln Thr Leu Lys Ser His Met Thr Ser Ser Val Gln Phe Glu Glu
340 345 350Gln Ile Arg Asn Met Tyr Asp Ala Gly Ala Arg Val Phe Leu
Glu Phe 355 360 365Gly Pro Arg Gln Val Leu Ala Lys Leu Ile Ala Glu
Met Phe Pro Ser 370 375 380Cys Thr Ala Ile Ser Val Asn Pro Ala Ser
Ser Gly Asp Ser Asp Val385 390 395 400Gln Leu Arg Leu Ala Ala Val
Lys Phe Ala Val Ser Gly Ala Ala Leu 405 410 415Ser Thr Phe Asp Pro
Trp Glu Tyr Arg Lys Pro Gln Asp Leu Leu Ile 420 425 430Arg Lys Pro
Arg Lys Thr Ala Leu Val Leu Ser Ala Ala Thr Tyr Val 435 440 445Ser
Pro Lys Thr Leu Ala Glu Arg Lys Lys Ala Met Glu Asp Ile Lys 450 455
460Leu Val Ser Ile Thr Pro Arg Asp Ser Met Val Ser Ile Gly Lys
Ile465 470 475 480Ala Gln Glu Val Arg Thr Ala Lys Gln Pro Leu Glu
Thr Glu Ile Arg 485 490 495Arg Leu Asn Lys
50044351DNAThraustochytrium sp.CDS(1)..(351) 44tcg acc cca gcg tca
gag cgg tcg gct tca ccg ctt ttc gag aaa cgc 48Ser Thr Pro Ala Ser
Glu Arg Ser Ala Ser Pro Leu Phe Glu Lys Arg1 5 10 15agt tcg gtt tcg
tca gca cgc ctc gct gaa gct gaa gcc gcg gta ctg 96Ser Ser Val Ser
Ser Ala Arg Leu Ala Glu Ala Glu Ala Ala Val Leu 20 25 30agc gtt ctc
gca gac aag aca ggc tac gac agc tca atg atc gag atg 144Ser Val Leu
Ala Asp Lys Thr Gly Tyr Asp Ser Ser Met Ile Glu Met 35 40 45gac atg
gac ctg gag agt gag ctt ggc gtt gat agc atc aaa cgc gtg 192Asp Met
Asp Leu Glu Ser Glu Leu Gly Val Asp Ser Ile Lys Arg Val 50 55 60gag
atc atg agc gag gtt caa acg ctg ctc agc gtg gaa gtc tcc gac 240Glu
Ile Met Ser Glu Val Gln Thr Leu Leu Ser Val Glu Val Ser Asp65 70 75
80gtt gac gct ctg tca aga acc aag act gtt ggc gac gtc atc gag gcg
288Val Asp Ala Leu Ser Arg Thr Lys Thr Val Gly Asp Val Ile Glu Ala
85 90 95atg aag ctg gaa ctc ggt gga ccc caa ggc cag act ttg acc gcg
gaa 336Met Lys Leu Glu Leu Gly Gly Pro Gln Gly Gln Thr Leu Thr Ala
Glu 100 105 110tcg atc cgt cag cca 351Ser Ile Arg Gln Pro
11545117PRTThraustochytrium sp. 45Ser Thr Pro Ala Ser Glu Arg Ser
Ala Ser Pro Leu Phe Glu Lys Arg1 5 10 15Ser Ser Val Ser Ser Ala Arg
Leu Ala Glu Ala Glu Ala Ala Val Leu 20 25 30Ser Val Leu Ala Asp Lys
Thr Gly Tyr Asp Ser Ser Met Ile Glu Met 35 40 45Asp Met Asp Leu Glu
Ser Glu Leu Gly Val Asp Ser Ile Lys Arg Val 50 55 60Glu Ile Met Ser
Glu Val Gln Thr Leu Leu Ser Val Glu Val Ser Asp65 70 75 80Val Asp
Ala Leu Ser Arg Thr Lys Thr Val Gly Asp Val Ile Glu Ala 85 90 95Met
Lys Leu Glu Leu Gly Gly Pro Gln Gly Gln Thr Leu Thr Ala Glu 100 105
110Ser Ile Arg Gln Pro 115465PRTThraustochytrium
sp.MISC_FEATURE(1)..(5)Xaa = any amino acid 46Leu Gly Xaa Asp Ser1
5472790DNAThraustochytrium sp.CDS(1)..(2790) 47tcg acc cca gcg tca
gag cgg tcg gct tca ccg ctt ttc gag aaa cgc 48Ser Thr Pro Ala Ser
Glu Arg Ser Ala Ser Pro Leu Phe Glu Lys Arg1 5 10 15agt tcg gtt tcg
tca gca cgc ctc gct gaa gct gaa gcc gcg gta ctg 96Ser Ser Val Ser
Ser Ala Arg Leu Ala Glu Ala Glu Ala Ala Val Leu 20 25 30agc gtt ctc
gca gac aag aca ggc tac gac agc tca atg atc gag atg 144Ser Val Leu
Ala Asp Lys Thr Gly Tyr Asp Ser Ser Met Ile Glu Met 35 40 45gac atg
gac ctg gag agt gag ctt ggc gtt gat agc atc aaa cgc gtg 192Asp Met
Asp Leu Glu Ser Glu Leu Gly Val Asp Ser Ile Lys Arg Val 50 55 60gag
atc atg agc gag gtt caa acg ctg ctc agc gtg gaa gtc tcc gac 240Glu
Ile Met Ser Glu Val Gln Thr Leu Leu Ser Val Glu Val Ser Asp65 70 75
80gtt gac gct ctg tca aga acc aag act gtt ggc gac gtc atc gag gcg
288Val Asp Ala Leu Ser Arg Thr Lys Thr Val Gly Asp Val Ile Glu Ala
85 90 95atg aag ctg gaa ctc ggt gga ccc caa ggc cag act ttg acc gcg
gaa 336Met Lys Leu Glu Leu Gly Gly Pro Gln Gly Gln Thr Leu Thr Ala
Glu 100 105 110tcg atc cgt cag cca ccg gtg tcc gag cct gct gta ccg
acc tca tcg 384Ser Ile Arg Gln Pro Pro Val Ser Glu Pro Ala Val Pro
Thr Ser Ser 115 120 125tca agc agt att gct aat gtt tcg tca gca cgc
ctc gct gaa gct gaa 432Ser Ser Ser Ile Ala Asn Val Ser Ser Ala Arg
Leu Ala Glu Ala Glu 130 135 140gct gcg gta ctg agc gtt ctc gca gac
aag aca ggc tac gac agc tca 480Ala Ala Val Leu Ser Val Leu Ala Asp
Lys Thr Gly Tyr Asp Ser Ser145 150 155 160atg atc gag atg gac atg
gac ctg gag agc gag ctt ggc gtt gat agc 528Met Ile Glu Met Asp Met
Asp Leu Glu Ser Glu Leu Gly Val Asp Ser 165 170 175atc aaa cgc gtg
gag atc atg agc gag gtt caa acg ctg ctc agc gtg 576Ile Lys Arg Val
Glu Ile Met Ser Glu Val Gln Thr Leu Leu Ser Val 180 185 190gaa gtc
tcc gac gtt gac gct ctg tca aga act aag act gtt ggc gac 624Glu Val
Ser Asp Val Asp Ala Leu Ser Arg Thr Lys Thr Val Gly Asp 195 200
205gtc atc gag gcg atg aag ctg gaa ctc ggt gga ccc caa ggc cag act
672Val Ile Glu Ala Met Lys Leu Glu Leu Gly Gly Pro Gln Gly Gln Thr
210 215 220ttg acc gcg gaa tcg atc cgt cag cca ccg gtg tct gag cct
gct gta 720Leu Thr Ala Glu Ser Ile Arg Gln Pro Pro Val Ser Glu Pro
Ala Val225 230 235 240ccg acc tca tcg tca agc agt att gct aat gtt
tcg tca gca cgc ctc 768Pro Thr Ser Ser Ser Ser Ser Ile Ala Asn Val
Ser Ser Ala Arg Leu 245 250 255gct gaa gct gaa gcg gcg gta ctg agc
gtt ctc gca gac aag aca ggc 816Ala Glu Ala Glu Ala Ala Val Leu Ser
Val Leu Ala Asp Lys Thr Gly 260 265 270tac gac agc tca atg atc gag
atg gac atg gac ctg gag agc gag ctt 864Tyr Asp Ser Ser Met Ile Glu
Met Asp Met Asp Leu Glu Ser Glu Leu 275 280 285ggc gtc gac agc atc
aaa cgc gtg gag atc atg agc gag gtt caa acg 912Gly Val Asp Ser Ile
Lys Arg Val Glu Ile Met Ser Glu Val Gln Thr 290 295 300ctg ctc agc
gtg gaa gtc tcc gac gtt gac gct ctg tca aga acc aag 960Leu Leu Ser
Val Glu Val Ser Asp Val Asp Ala Leu Ser Arg Thr Lys305 310 315
320act gtt ggc gac gtc atc gag gcg atg aag ctg gaa ctc ggt gga ccc
1008Thr Val Gly Asp Val Ile Glu Ala Met Lys Leu Glu Leu Gly Gly Pro
325 330 335caa ggc cag act ttg acc gcg gaa tcg atc cgt cag cca ccg
gtg tcc 1056Gln Gly Gln Thr Leu Thr Ala Glu Ser Ile Arg Gln Pro Pro
Val Ser 340 345 350gag cct gct gta ccg acc tca tcg tca agc agt att
gct aat gtt ttg 1104Glu Pro Ala Val Pro Thr Ser Ser Ser Ser Ser Ile
Ala Asn Val Leu 355 360 365tca gca cgc ctc gct gaa gct gaa gcc gcg
gta ctg agc gtt ctc gca 1152Ser Ala Arg Leu Ala Glu Ala Glu Ala Ala
Val Leu Ser Val Leu Ala 370 375 380gac aag aca ggc tac gac agc tca
atg atc gag atg gac atg gac ctg 1200Asp Lys Thr Gly Tyr Asp Ser Ser
Met Ile Glu Met Asp Met Asp Leu385 390 395 400gag agc gag ctt ggc
gtt gat agc atc aaa cgc gtg gag atc atg agc 1248Glu Ser Glu Leu Gly
Val Asp Ser Ile Lys Arg Val Glu Ile Met Ser 405 410 415gag gtt caa
acg ttg ctc agc gtg gaa gtc tcc gac gtt gac gct ctg 1296Glu Val Gln
Thr Leu Leu Ser Val Glu Val Ser Asp Val Asp Ala Leu 420 425 430tca
aga acc aag act gtt ggc gac gtc atc gag gcg atg aag ctg gaa 1344Ser
Arg Thr Lys Thr Val Gly Asp Val Ile Glu Ala Met Lys Leu Glu 435 440
445ctc ggt gga ccc caa ggc cag act ttg acc gcg gaa tcg atc cgt cag
1392Leu Gly Gly Pro Gln Gly Gln Thr Leu Thr Ala Glu Ser Ile Arg Gln
450 455 460cca ccg gtg tct gag cct gct gta ccg acc tca tcg tca agc
agt att 1440Pro Pro Val Ser Glu Pro Ala Val Pro Thr Ser Ser Ser Ser
Ser Ile465 470 475 480gct aat gtt tcg tca gca cgc ctc gct gaa gct
gaa gcc gcg gta ctg 1488Ala Asn Val Ser Ser Ala Arg Leu Ala Glu Ala
Glu Ala Ala Val Leu 485 490 495agc gtt ctc gca gac aag aca ggc tac
gac agc tca atg atc gag atg 1536Ser Val Leu Ala Asp Lys Thr Gly Tyr
Asp Ser Ser Met Ile Glu Met 500 505 510gac atg gac ctg gag agt gag
ctt ggc gtc gac agc atc aaa cgc gtg 1584Asp Met Asp Leu Glu Ser Glu
Leu Gly Val Asp Ser Ile Lys Arg Val 515 520 525gag atc atg agc gag
gtt caa acg ctg ctc agc gtg gaa gtc tcc gac 1632Glu Ile Met Ser Glu
Val Gln Thr Leu Leu Ser Val Glu Val Ser Asp 530 535 540gtt gac gct
ctg tca aga acc aag act gtt ggc gac gtc atc gag gcg 1680Val Asp Ala
Leu Ser Arg Thr Lys Thr Val Gly Asp Val Ile Glu Ala545 550 555
560atg aag ctg gaa ctc ggt gga ccc caa ggc cag act ttg acc tct gaa
1728Met Lys Leu Glu Leu Gly Gly Pro Gln Gly Gln Thr Leu Thr Ser Glu
565 570 575ccg atc cat cag cca cca gtg tcc gag cct gct gta ccg acc
tca tcg 1776Pro Ile His Gln Pro Pro Val Ser Glu Pro Ala Val Pro Thr
Ser Ser 580 585 590tca agc agt att gct aat gtt tct tca gca cgc ctc
gct gaa gct gaa 1824Ser Ser Ser Ile Ala Asn Val Ser Ser Ala Arg Leu
Ala Glu Ala Glu 595 600 605gcc gcg gta ctg agc gtt ctc gca gac aag
aca ggc tac gac agc tca 1872Ala Ala Val Leu Ser Val Leu Ala Asp Lys
Thr Gly Tyr Asp Ser Ser 610 615 620atg atc gag atg gac atg gac ctg
gag agc gag ctt ggc gtt gat agc 1920Met Ile Glu Met Asp Met Asp Leu
Glu Ser Glu Leu Gly Val Asp Ser625 630 635 640atc aaa cgc gtg gaa
atc atg agc gag gtt caa acg ctg ctc agc gtg 1968Ile Lys Arg Val Glu
Ile Met Ser Glu Val Gln Thr Leu Leu Ser Val 645 650 655gaa gtc tcc
gac gtt gac gct ctg tca aga acc aag act gtt ggc gac 2016Glu Val Ser
Asp Val Asp Ala Leu Ser Arg Thr Lys Thr Val Gly Asp 660 665 670gtc
atc gag gcg atg aag atg gaa ctc ggt gga ccc caa ggc cag act 2064Val
Ile Glu Ala Met Lys Met Glu Leu Gly Gly Pro Gln Gly Gln Thr 675 680
685ttg acc gcg gaa tcg atc cgt cag cca ccg gtg tct gag cct gct gta
2112Leu Thr Ala Glu Ser Ile Arg Gln Pro Pro Val Ser Glu Pro Ala Val
690 695 700ccg acc tca tcg tca agc agt att gct aat gtt tcg tca gca
cgc ctc 2160Pro Thr Ser Ser Ser Ser Ser Ile Ala Asn Val Ser Ser Ala
Arg Leu705 710 715 720gct gaa gct gaa gcg gcg gta ctg agc gtt ctc
gca gac aag aca ggc 2208Ala Glu Ala Glu Ala Ala Val Leu Ser Val Leu
Ala Asp Lys Thr Gly 725 730 735tac gac agc tca atg atc gag atg gac
atg gac ctg gag agc gag ctt 2256Tyr Asp Ser Ser Met Ile Glu Met Asp
Met Asp Leu Glu Ser Glu Leu 740 745 750ggc gtt gat agc atc aaa cgc
gtg gag atc atg agc gag gtt caa gcg 2304Gly Val Asp Ser Ile Lys Arg
Val Glu Ile Met Ser Glu Val Gln Ala 755 760 765ctg ctc agc gtg gaa
gtc tcc gac gtt gac gct ctg tca aga acc aag 2352Leu Leu Ser Val Glu
Val Ser Asp Val Asp Ala Leu Ser Arg Thr Lys 770 775 780act gtt ggc
gac gtc atc gag gcg atg aag atg gaa ctc ggt gga ccc 2400Thr Val Gly
Asp Val Ile Glu Ala Met Lys Met Glu Leu Gly Gly Pro785 790 795
800caa ggc cag act ttg acc gca gaa tcg atc cgt gag cca ccg gtg tct
2448Gln Gly Gln Thr Leu Thr Ala Glu Ser Ile Arg Glu Pro Pro Val Ser
805 810 815gag cct gct gta ccg acc tca tcg tca agt agt atc gct aat
gtt tct 2496Glu Pro Ala Val Pro Thr Ser Ser Ser Ser Ser Ile Ala Asn
Val Ser 820 825 830tca gct cgc ctc gct gaa gct gaa gcc gcg gta ctg
agc gtt ctc gca 2544Ser Ala Arg Leu Ala Glu Ala Glu Ala Ala Val Leu
Ser Val Leu Ala 835 840 845gac aag aca ggc tac gac agc tca atg atc
gag atg gac atg gac ctg 2592Asp Lys Thr Gly Tyr Asp Ser Ser Met Ile
Glu Met Asp Met Asp Leu 850 855 860gag agt gag ctt ggc gtc gac agc
atc aaa cgc gtg gag atc atg agc 2640Glu Ser Glu Leu Gly Val Asp Ser
Ile Lys Arg Val Glu Ile Met Ser865 870 875 880gag gtt caa acg ttg
ctc agc gtg gaa gtc tcc gac gtt gac gct ctg 2688Glu Val Gln Thr Leu
Leu Ser Val Glu Val Ser Asp Val Asp Ala Leu 885 890 895tca aga acc
aag act gtt ggc gac gtc atc gag gcg atg aag ctg gaa 2736Ser Arg Thr
Lys Thr Val Gly Asp Val Ile Glu Ala Met Lys Leu Glu 900 905 910ctt
ggg gaa tca tca agt att gag act ctc aat tgt acc gag gtt gag 2784Leu
Gly Glu Ser Ser Ser Ile Glu Thr Leu Asn Cys Thr Glu Val Glu 915 920
925cac acg 2790His Thr 93048930PRTThraustochytrium sp. 48Ser Thr
Pro Ala Ser Glu Arg Ser Ala Ser Pro Leu Phe Glu Lys Arg1 5 10 15Ser
Ser Val Ser Ser Ala Arg Leu Ala Glu Ala Glu Ala Ala Val Leu 20 25
30Ser Val Leu Ala Asp Lys Thr Gly Tyr Asp Ser Ser Met Ile Glu Met
35 40 45Asp Met Asp Leu Glu Ser Glu Leu Gly Val Asp Ser Ile Lys Arg
Val 50 55 60Glu Ile Met Ser Glu Val Gln Thr Leu Leu Ser Val Glu Val
Ser Asp65 70 75 80Val Asp Ala Leu Ser Arg Thr Lys Thr Val Gly Asp
Val Ile Glu Ala 85 90 95Met Lys Leu Glu Leu Gly Gly Pro Gln Gly Gln
Thr Leu Thr Ala Glu 100 105 110Ser Ile Arg Gln Pro Pro Val Ser Glu
Pro Ala Val Pro Thr Ser Ser 115 120 125Ser Ser Ser Ile Ala Asn Val
Ser Ser Ala Arg Leu Ala Glu Ala Glu 130 135 140Ala Ala Val Leu Ser
Val Leu Ala Asp Lys Thr Gly Tyr Asp Ser Ser145 150 155 160Met Ile
Glu Met Asp Met Asp Leu Glu Ser Glu Leu Gly Val Asp Ser 165 170
175Ile Lys Arg Val Glu Ile Met Ser Glu Val Gln Thr Leu Leu Ser Val
180 185 190Glu Val Ser Asp Val Asp Ala Leu Ser Arg Thr Lys Thr Val
Gly Asp 195 200 205Val Ile Glu Ala Met Lys Leu Glu Leu Gly Gly Pro
Gln Gly Gln Thr 210 215 220Leu Thr Ala Glu Ser Ile Arg Gln Pro Pro
Val Ser Glu Pro Ala Val225 230 235 240Pro Thr Ser Ser Ser Ser Ser
Ile Ala Asn Val Ser Ser Ala Arg Leu 245 250 255Ala Glu Ala Glu Ala
Ala Val Leu Ser Val Leu Ala Asp Lys Thr Gly 260 265 270Tyr Asp Ser
Ser Met Ile Glu Met Asp Met Asp Leu Glu Ser Glu Leu 275 280 285Gly
Val Asp Ser Ile Lys Arg Val Glu Ile Met Ser Glu Val Gln Thr 290 295
300Leu Leu Ser Val Glu Val Ser Asp Val Asp Ala Leu Ser Arg Thr
Lys305 310 315 320Thr Val Gly Asp Val Ile Glu Ala Met Lys Leu Glu
Leu Gly Gly Pro 325 330 335Gln Gly Gln Thr Leu Thr Ala Glu Ser Ile
Arg Gln Pro Pro Val Ser 340 345 350Glu Pro Ala Val Pro Thr Ser Ser
Ser Ser Ser Ile Ala Asn Val Leu 355 360 365Ser Ala Arg Leu Ala Glu
Ala Glu Ala Ala Val Leu Ser Val Leu Ala 370 375 380Asp Lys Thr Gly
Tyr Asp Ser Ser Met Ile Glu Met Asp Met Asp Leu385 390 395 400Glu
Ser Glu Leu Gly Val Asp Ser Ile Lys Arg Val Glu Ile Met Ser 405 410
415Glu Val Gln Thr Leu Leu Ser Val Glu Val Ser Asp Val Asp Ala Leu
420 425 430Ser Arg Thr Lys Thr Val Gly Asp Val Ile Glu Ala Met Lys
Leu Glu 435 440 445Leu Gly
Gly Pro Gln Gly Gln Thr Leu Thr Ala Glu Ser Ile Arg Gln 450 455
460Pro Pro Val Ser Glu Pro Ala Val Pro Thr Ser Ser Ser Ser Ser
Ile465 470 475 480Ala Asn Val Ser Ser Ala Arg Leu Ala Glu Ala Glu
Ala Ala Val Leu 485 490 495Ser Val Leu Ala Asp Lys Thr Gly Tyr Asp
Ser Ser Met Ile Glu Met 500 505 510Asp Met Asp Leu Glu Ser Glu Leu
Gly Val Asp Ser Ile Lys Arg Val 515 520 525Glu Ile Met Ser Glu Val
Gln Thr Leu Leu Ser Val Glu Val Ser Asp 530 535 540Val Asp Ala Leu
Ser Arg Thr Lys Thr Val Gly Asp Val Ile Glu Ala545 550 555 560Met
Lys Leu Glu Leu Gly Gly Pro Gln Gly Gln Thr Leu Thr Ser Glu 565 570
575Pro Ile His Gln Pro Pro Val Ser Glu Pro Ala Val Pro Thr Ser Ser
580 585 590Ser Ser Ser Ile Ala Asn Val Ser Ser Ala Arg Leu Ala Glu
Ala Glu 595 600 605Ala Ala Val Leu Ser Val Leu Ala Asp Lys Thr Gly
Tyr Asp Ser Ser 610 615 620Met Ile Glu Met Asp Met Asp Leu Glu Ser
Glu Leu Gly Val Asp Ser625 630 635 640Ile Lys Arg Val Glu Ile Met
Ser Glu Val Gln Thr Leu Leu Ser Val 645 650 655Glu Val Ser Asp Val
Asp Ala Leu Ser Arg Thr Lys Thr Val Gly Asp 660 665 670Val Ile Glu
Ala Met Lys Met Glu Leu Gly Gly Pro Gln Gly Gln Thr 675 680 685Leu
Thr Ala Glu Ser Ile Arg Gln Pro Pro Val Ser Glu Pro Ala Val 690 695
700Pro Thr Ser Ser Ser Ser Ser Ile Ala Asn Val Ser Ser Ala Arg
Leu705 710 715 720Ala Glu Ala Glu Ala Ala Val Leu Ser Val Leu Ala
Asp Lys Thr Gly 725 730 735Tyr Asp Ser Ser Met Ile Glu Met Asp Met
Asp Leu Glu Ser Glu Leu 740 745 750Gly Val Asp Ser Ile Lys Arg Val
Glu Ile Met Ser Glu Val Gln Ala 755 760 765Leu Leu Ser Val Glu Val
Ser Asp Val Asp Ala Leu Ser Arg Thr Lys 770 775 780Thr Val Gly Asp
Val Ile Glu Ala Met Lys Met Glu Leu Gly Gly Pro785 790 795 800Gln
Gly Gln Thr Leu Thr Ala Glu Ser Ile Arg Glu Pro Pro Val Ser 805 810
815Glu Pro Ala Val Pro Thr Ser Ser Ser Ser Ser Ile Ala Asn Val Ser
820 825 830Ser Ala Arg Leu Ala Glu Ala Glu Ala Ala Val Leu Ser Val
Leu Ala 835 840 845Asp Lys Thr Gly Tyr Asp Ser Ser Met Ile Glu Met
Asp Met Asp Leu 850 855 860Glu Ser Glu Leu Gly Val Asp Ser Ile Lys
Arg Val Glu Ile Met Ser865 870 875 880Glu Val Gln Thr Leu Leu Ser
Val Glu Val Ser Asp Val Asp Ala Leu 885 890 895Ser Arg Thr Lys Thr
Val Gly Asp Val Ile Glu Ala Met Lys Leu Glu 900 905 910Leu Gly Glu
Ser Ser Ser Ile Glu Thr Leu Asn Cys Thr Glu Val Glu 915 920 925His
Thr 930492433DNAThraustochytrium sp.CDS(1)..(2433) 49aaa agt gtc
aag gct tca ggg tgt gag aat gta gat acc cgt ttc gct 48Lys Ser Val
Lys Ala Ser Gly Cys Glu Asn Val Asp Thr Arg Phe Ala1 5 10 15aag gtt
gta caa atc tcg ctt cct agc aag ctg aaa tcc act gtg tcg 96Lys Val
Val Gln Ile Ser Leu Pro Ser Lys Leu Lys Ser Thr Val Ser 20 25 30cac
gat cga cct gta att gtt gta gat gat gga acg ccc tta acc acg 144His
Asp Arg Pro Val Ile Val Val Asp Asp Gly Thr Pro Leu Thr Thr 35 40
45gag ctt tgt aaa att ctt ggg ggt aat att gtg gtt ctc tct tat caa
192Glu Leu Cys Lys Ile Leu Gly Gly Asn Ile Val Val Leu Ser Tyr Gln
50 55 60ggg aag ccc gct ggt cca cgg gga gtc gag gtg cca gat ctt tcc
gag 240Gly Lys Pro Ala Gly Pro Arg Gly Val Glu Val Pro Asp Leu Ser
Glu65 70 75 80gaa gcc cta att caa gct ctt gca ttg att cgg tct aca
tat gga gtt 288Glu Ala Leu Ile Gln Ala Leu Ala Leu Ile Arg Ser Thr
Tyr Gly Val 85 90 95cca att ggt ttt att tgt cag caa gtg tct aat gtg
agc acc aag gca 336Pro Ile Gly Phe Ile Cys Gln Gln Val Ser Asn Val
Ser Thr Lys Ala 100 105 110cag ctt tgt tgg gca ctc ctc gca gcg aag
cat ctc aag aag gat ttg 384Gln Leu Cys Trp Ala Leu Leu Ala Ala Lys
His Leu Lys Lys Asp Leu 115 120 125aat gct gtc tta ccc gat tca aga
tcc ttc ttc gtc gga gtt gta cgc 432Asn Ala Val Leu Pro Asp Ser Arg
Ser Phe Phe Val Gly Val Val Arg 130 135 140ttg aac ggg aaa ctt gga
act ttc gaa aac atc agc gac ttc tct aaa 480Leu Asn Gly Lys Leu Gly
Thr Phe Glu Asn Ile Ser Asp Phe Ser Lys145 150 155 160ttt gat ttg
acg aaa gcc cta gat tac gga cag cgt ggt tct ctc tta 528Phe Asp Leu
Thr Lys Ala Leu Asp Tyr Gly Gln Arg Gly Ser Leu Leu 165 170 175ggc
ctg tgc aag tca cta gac tta gaa tgg gaa cag gtg ttt tgc cgt 576Gly
Leu Cys Lys Ser Leu Asp Leu Glu Trp Glu Gln Val Phe Cys Arg 180 185
190gga ata gat ctt gcg tgt gat ctt atg cca ctc cag gcc gca agg ata
624Gly Ile Asp Leu Ala Cys Asp Leu Met Pro Leu Gln Ala Ala Arg Ile
195 200 205ctc aga aat gag ctt cag tgt ccc aat atg cgc ctt cgc gag
gtt ggg 672Leu Arg Asn Glu Leu Gln Cys Pro Asn Met Arg Leu Arg Glu
Val Gly 210 215 220tac gat att tct ggc gcc agg tac acc att tca acc
gat gac ctg cta 720Tyr Asp Ile Ser Gly Ala Arg Tyr Thr Ile Ser Thr
Asp Asp Leu Leu225 230 235 240tgt gga ccc tcg aag gct aaa gta gag
gcc gca gac ttg ttt ctt gtg 768Cys Gly Pro Ser Lys Ala Lys Val Glu
Ala Ala Asp Leu Phe Leu Val 245 250 255aca ggt ggc gca cga ggt att
aca cct cat tgt gtt cgt gag att gca 816Thr Gly Gly Ala Arg Gly Ile
Thr Pro His Cys Val Arg Glu Ile Ala 260 265 270agt cga tcc ccc gga
acc aca ttt gtg ctg gtt gga aga agc gaa atg 864Ser Arg Ser Pro Gly
Thr Thr Phe Val Leu Val Gly Arg Ser Glu Met 275 280 285tcc gac gag
cct gac tgg gct gtt ggc cac tac aat aaa gac ctg gac 912Ser Asp Glu
Pro Asp Trp Ala Val Gly His Tyr Asn Lys Asp Leu Asp 290 295 300caa
agc aca atg aaa cac ttg aaa gca acg cat gct gct gga ggg gta 960Gln
Ser Thr Met Lys His Leu Lys Ala Thr His Ala Ala Gly Gly Val305 310
315 320aaa cct acg cct aaa gca cat cgt gca ctt gtg aac agg gtc act
ggc 1008Lys Pro Thr Pro Lys Ala His Arg Ala Leu Val Asn Arg Val Thr
Gly 325 330 335tca cgg gag gta cga gaa tct ctt aga gca atc cag gag
gca ggg gca 1056Ser Arg Glu Val Arg Glu Ser Leu Arg Ala Ile Gln Glu
Ala Gly Ala 340 345 350aat gtc gaa tat atc gcc tgt gat gtt tcg gat
gaa aac aag gtc cgc 1104Asn Val Glu Tyr Ile Ala Cys Asp Val Ser Asp
Glu Asn Lys Val Arg 355 360 365caa ctt gtg caa aga gtg gag caa aag
tat ggc tgt gaa ata act ggg 1152Gln Leu Val Gln Arg Val Glu Gln Lys
Tyr Gly Cys Glu Ile Thr Gly 370 375 380att tgg cat gca agc ggg gtt
ctt cgt gac aaa ctt gtc gag caa aag 1200Ile Trp His Ala Ser Gly Val
Leu Arg Asp Lys Leu Val Glu Gln Lys385 390 395 400act aca gac gac
ttt gag gca gtt ttt ggg acc aag gtg act ggc ctt 1248Thr Thr Asp Asp
Phe Glu Ala Val Phe Gly Thr Lys Val Thr Gly Leu 405 410 415gta aac
atc gtg tca caa gtc aat atg tct aag cta cga cac ttc atc 1296Val Asn
Ile Val Ser Gln Val Asn Met Ser Lys Leu Arg His Phe Ile 420 425
430ctc ttc agt tct ttg gct gga ttt cat ggg aac aag ggc caa acg gat
1344Leu Phe Ser Ser Leu Ala Gly Phe His Gly Asn Lys Gly Gln Thr Asp
435 440 445tat gca att gct aat gaa gcc ttg aac aaa atc gcg cat act
ctc tca 1392Tyr Ala Ile Ala Asn Glu Ala Leu Asn Lys Ile Ala His Thr
Leu Ser 450 455 460gcg ttt ttg ccc aaa ctg aat gca aag gtg cta gac
ttc ggt ccg tgg 1440Ala Phe Leu Pro Lys Leu Asn Ala Lys Val Leu Asp
Phe Gly Pro Trp465 470 475 480gta ggt tca gga atg gta acc gaa aca
ctt gag aag cat ttt aaa gct 1488Val Gly Ser Gly Met Val Thr Glu Thr
Leu Glu Lys His Phe Lys Ala 485 490 495atg ggg gtt cag act att cct
ctc gag cca gga gca cgg act gtt gcg 1536Met Gly Val Gln Thr Ile Pro
Leu Glu Pro Gly Ala Arg Thr Val Ala 500 505 510caa atc att ttg gca
agt tcg cca ccg caa tcg ctt ttg ggg aac tgg 1584Gln Ile Ile Leu Ala
Ser Ser Pro Pro Gln Ser Leu Leu Gly Asn Trp 515 520 525ggc ttt cca
gcc acc aaa ccg cta caa cgc tct aat gta gtc acg ggc 1632Gly Phe Pro
Ala Thr Lys Pro Leu Gln Arg Ser Asn Val Val Thr Gly 530 535 540aca
ctc tct ccg gaa gag ata gaa ttc atc gca gac cac aaa att caa 1680Thr
Leu Ser Pro Glu Glu Ile Glu Phe Ile Ala Asp His Lys Ile Gln545 550
555 560ggc cgc aag gtg ctt ccc atg atg gct gca atc ggg ttc atg gcc
tct 1728Gly Arg Lys Val Leu Pro Met Met Ala Ala Ile Gly Phe Met Ala
Ser 565 570 575att gcg gaa gga ctc tac ccg ggg tac aat ctg caa ggc
gtg gaa aat 1776Ile Ala Glu Gly Leu Tyr Pro Gly Tyr Asn Leu Gln Gly
Val Glu Asn 580 585 590gct cag ctc ttt caa ggc ttg act atc aac caa
gag aca aaa ttt caa 1824Ala Gln Leu Phe Gln Gly Leu Thr Ile Asn Gln
Glu Thr Lys Phe Gln 595 600 605atc act ctc att gag gag cac aac tct
gag gaa aac ctg gat gtc ctg 1872Ile Thr Leu Ile Glu Glu His Asn Ser
Glu Glu Asn Leu Asp Val Leu 610 615 620aca tcc ctt ggt gta atg ttg
gaa agc ggg aag gtg ctt ccc gct tac 1920Thr Ser Leu Gly Val Met Leu
Glu Ser Gly Lys Val Leu Pro Ala Tyr625 630 635 640cga tgt gtt gta
tgc ttg aat aca acc cag cag cag ccc aag cta tct 1968Arg Cys Val Val
Cys Leu Asn Thr Thr Gln Gln Gln Pro Lys Leu Ser 645 650 655cca aaa
att ctt aac ttg gaa gtt gac cct gca tgc gag gtt aac ccc 2016Pro Lys
Ile Leu Asn Leu Glu Val Asp Pro Ala Cys Glu Val Asn Pro 660 665
670tat gat gga aag tcg ttg ttc cac ggt ccg ctt ttg caa ttc gtt caa
2064Tyr Asp Gly Lys Ser Leu Phe His Gly Pro Leu Leu Gln Phe Val Gln
675 680 685caa gtg ttg cac tca agt acc aaa ggc ctc gtt gcc aag tgc
cgc gcg 2112Gln Val Leu His Ser Ser Thr Lys Gly Leu Val Ala Lys Cys
Arg Ala 690 695 700ctt cca atc aaa gaa gcc atc cga ggg cca ttt atc
aag caa aca ctc 2160Leu Pro Ile Lys Glu Ala Ile Arg Gly Pro Phe Ile
Lys Gln Thr Leu705 710 715 720cat gat cca att cta gac gac gtc att
ttt cag cta atg ctc gtg tgg 2208His Asp Pro Ile Leu Asp Asp Val Ile
Phe Gln Leu Met Leu Val Trp 725 730 735tgt cgt aat gct cta gga agt
gca tcg cta ccc aac aga att gaa aag 2256Cys Arg Asn Ala Leu Gly Ser
Ala Ser Leu Pro Asn Arg Ile Glu Lys 740 745 750atg tca tac ttt ggg
aat gtc tca gaa ggt agc act ttc ttt gcc tca 2304Met Ser Tyr Phe Gly
Asn Val Ser Glu Gly Ser Thr Phe Phe Ala Ser 755 760 765gtt aca cct
gtg gga cca aga gta cca aag gat ccc gtg atc aaa atg 2352Val Thr Pro
Val Gly Pro Arg Val Pro Lys Asp Pro Val Ile Lys Met 770 775 780cag
ttt ctt ctc caa gat gaa tcc ggc aac aca ttt tca tcg ggg gag 2400Gln
Phe Leu Leu Gln Asp Glu Ser Gly Asn Thr Phe Ser Ser Gly Glu785 790
795 800ggc tcg gtt gtg ctt agt gac gaa ctc gtc ttt 2433Gly Ser Val
Val Leu Ser Asp Glu Leu Val Phe 805 81050811PRTThraustochytrium sp.
50Lys Ser Val Lys Ala Ser Gly Cys Glu Asn Val Asp Thr Arg Phe Ala1
5 10 15Lys Val Val Gln Ile Ser Leu Pro Ser Lys Leu Lys Ser Thr Val
Ser 20 25 30His Asp Arg Pro Val Ile Val Val Asp Asp Gly Thr Pro Leu
Thr Thr 35 40 45Glu Leu Cys Lys Ile Leu Gly Gly Asn Ile Val Val Leu
Ser Tyr Gln 50 55 60Gly Lys Pro Ala Gly Pro Arg Gly Val Glu Val Pro
Asp Leu Ser Glu65 70 75 80Glu Ala Leu Ile Gln Ala Leu Ala Leu Ile
Arg Ser Thr Tyr Gly Val 85 90 95Pro Ile Gly Phe Ile Cys Gln Gln Val
Ser Asn Val Ser Thr Lys Ala 100 105 110Gln Leu Cys Trp Ala Leu Leu
Ala Ala Lys His Leu Lys Lys Asp Leu 115 120 125Asn Ala Val Leu Pro
Asp Ser Arg Ser Phe Phe Val Gly Val Val Arg 130 135 140Leu Asn Gly
Lys Leu Gly Thr Phe Glu Asn Ile Ser Asp Phe Ser Lys145 150 155
160Phe Asp Leu Thr Lys Ala Leu Asp Tyr Gly Gln Arg Gly Ser Leu Leu
165 170 175Gly Leu Cys Lys Ser Leu Asp Leu Glu Trp Glu Gln Val Phe
Cys Arg 180 185 190Gly Ile Asp Leu Ala Cys Asp Leu Met Pro Leu Gln
Ala Ala Arg Ile 195 200 205Leu Arg Asn Glu Leu Gln Cys Pro Asn Met
Arg Leu Arg Glu Val Gly 210 215 220Tyr Asp Ile Ser Gly Ala Arg Tyr
Thr Ile Ser Thr Asp Asp Leu Leu225 230 235 240Cys Gly Pro Ser Lys
Ala Lys Val Glu Ala Ala Asp Leu Phe Leu Val 245 250 255Thr Gly Gly
Ala Arg Gly Ile Thr Pro His Cys Val Arg Glu Ile Ala 260 265 270Ser
Arg Ser Pro Gly Thr Thr Phe Val Leu Val Gly Arg Ser Glu Met 275 280
285Ser Asp Glu Pro Asp Trp Ala Val Gly His Tyr Asn Lys Asp Leu Asp
290 295 300Gln Ser Thr Met Lys His Leu Lys Ala Thr His Ala Ala Gly
Gly Val305 310 315 320Lys Pro Thr Pro Lys Ala His Arg Ala Leu Val
Asn Arg Val Thr Gly 325 330 335Ser Arg Glu Val Arg Glu Ser Leu Arg
Ala Ile Gln Glu Ala Gly Ala 340 345 350Asn Val Glu Tyr Ile Ala Cys
Asp Val Ser Asp Glu Asn Lys Val Arg 355 360 365Gln Leu Val Gln Arg
Val Glu Gln Lys Tyr Gly Cys Glu Ile Thr Gly 370 375 380Ile Trp His
Ala Ser Gly Val Leu Arg Asp Lys Leu Val Glu Gln Lys385 390 395
400Thr Thr Asp Asp Phe Glu Ala Val Phe Gly Thr Lys Val Thr Gly Leu
405 410 415Val Asn Ile Val Ser Gln Val Asn Met Ser Lys Leu Arg His
Phe Ile 420 425 430Leu Phe Ser Ser Leu Ala Gly Phe His Gly Asn Lys
Gly Gln Thr Asp 435 440 445Tyr Ala Ile Ala Asn Glu Ala Leu Asn Lys
Ile Ala His Thr Leu Ser 450 455 460Ala Phe Leu Pro Lys Leu Asn Ala
Lys Val Leu Asp Phe Gly Pro Trp465 470 475 480Val Gly Ser Gly Met
Val Thr Glu Thr Leu Glu Lys His Phe Lys Ala 485 490 495Met Gly Val
Gln Thr Ile Pro Leu Glu Pro Gly Ala Arg Thr Val Ala 500 505 510Gln
Ile Ile Leu Ala Ser Ser Pro Pro Gln Ser Leu Leu Gly Asn Trp 515 520
525Gly Phe Pro Ala Thr Lys Pro Leu Gln Arg Ser Asn Val Val Thr Gly
530 535 540Thr Leu Ser Pro Glu Glu Ile Glu Phe Ile Ala Asp His Lys
Ile Gln545 550 555 560Gly Arg Lys Val Leu Pro Met Met Ala Ala Ile
Gly Phe Met Ala Ser 565 570 575Ile Ala Glu Gly Leu Tyr Pro Gly Tyr
Asn Leu Gln Gly Val Glu Asn 580 585 590Ala Gln Leu Phe Gln Gly Leu
Thr Ile Asn Gln Glu Thr Lys Phe Gln 595 600 605Ile Thr Leu Ile Glu
Glu His Asn Ser Glu Glu Asn Leu Asp Val Leu 610 615 620Thr Ser Leu
Gly Val Met Leu Glu Ser Gly Lys Val Leu Pro Ala Tyr625 630 635
640Arg Cys Val Val Cys Leu Asn Thr Thr Gln Gln Gln Pro Lys Leu Ser
645 650 655Pro Lys Ile Leu Asn Leu Glu Val Asp Pro Ala Cys Glu Val
Asn Pro 660 665 670Tyr Asp Gly Lys Ser Leu
Phe His Gly Pro Leu Leu Gln Phe Val Gln 675 680 685Gln Val Leu His
Ser Ser Thr Lys Gly Leu Val Ala Lys Cys Arg Ala 690 695 700Leu Pro
Ile Lys Glu Ala Ile Arg Gly Pro Phe Ile Lys Gln Thr Leu705 710 715
720His Asp Pro Ile Leu Asp Asp Val Ile Phe Gln Leu Met Leu Val Trp
725 730 735Cys Arg Asn Ala Leu Gly Ser Ala Ser Leu Pro Asn Arg Ile
Glu Lys 740 745 750Met Ser Tyr Phe Gly Asn Val Ser Glu Gly Ser Thr
Phe Phe Ala Ser 755 760 765Val Thr Pro Val Gly Pro Arg Val Pro Lys
Asp Pro Val Ile Lys Met 770 775 780Gln Phe Leu Leu Gln Asp Glu Ser
Gly Asn Thr Phe Ser Ser Gly Glu785 790 795 800Gly Ser Val Val Leu
Ser Asp Glu Leu Val Phe 805 810515808DNAThraustochytrium
sp.CDS(1)..(5805)misc_feature(1)..(5808)n = a c t or g 51atg caa
ctt cct cca gcg cat tct gcc gat gag aat cgc atc gcg gtc 48Met Gln
Leu Pro Pro Ala His Ser Ala Asp Glu Asn Arg Ile Ala Val1 5 10 15gtg
ggc atg gcc gtc aaa tat gcg ggc tgt gac aat aaa gaa gag ttt 96Val
Gly Met Ala Val Lys Tyr Ala Gly Cys Asp Asn Lys Glu Glu Phe 20 25
30tgg aag act ttg atg aat ggt agt atc aat acc aag tcg att tcg gca
144Trp Lys Thr Leu Met Asn Gly Ser Ile Asn Thr Lys Ser Ile Ser Ala
35 40 45gca agg ttg ggc agc aat aag cgt gac gaa cac tat gtt cct gaa
cga 192Ala Arg Leu Gly Ser Asn Lys Arg Asp Glu His Tyr Val Pro Glu
Arg 50 55 60tcg aaa tat gca gat acg ttc tgt aac gaa agg tac ggt tgt
atc cag 240Ser Lys Tyr Ala Asp Thr Phe Cys Asn Glu Arg Tyr Gly Cys
Ile Gln65 70 75 80caa ggt acg gat aat gag cat gac ctc ctc cta ggt
ctt gct caa gaa 288Gln Gly Thr Asp Asn Glu His Asp Leu Leu Leu Gly
Leu Ala Gln Glu 85 90 95gct ctc gct gac gct gcc ggg cgg atg gag aaa
caa cct tcg gag gcg 336Ala Leu Ala Asp Ala Ala Gly Arg Met Glu Lys
Gln Pro Ser Glu Ala 100 105 110ttc gat ctg gaa aat act ggc atc gtg
agt ggg tgc tta tct ttt cca 384Phe Asp Leu Glu Asn Thr Gly Ile Val
Ser Gly Cys Leu Ser Phe Pro 115 120 125atg gat aac ctg caa gga gag
ttg ttg aac ttg tat caa agc cat gtg 432Met Asp Asn Leu Gln Gly Glu
Leu Leu Asn Leu Tyr Gln Ser His Val 130 135 140gag aaa caa ctt cca
cct agt gcc ttg gta gaa gcc gtg aag ctt tgg 480Glu Lys Gln Leu Pro
Pro Ser Ala Leu Val Glu Ala Val Lys Leu Trp145 150 155 160tct gag
cga cag aaa tct acg aaa gca cat gca ggg gac aag cgc cgg 528Ser Glu
Arg Gln Lys Ser Thr Lys Ala His Ala Gly Asp Lys Arg Arg 165 170
175ttc att gac cca gct tct ttt gta gct gat aaa ctg aac cta ggc cca
576Phe Ile Asp Pro Ala Ser Phe Val Ala Asp Lys Leu Asn Leu Gly Pro
180 185 190cta cat tat gcg atc gat gca gca tgc gct tct gca ttg tac
gtg tta 624Leu His Tyr Ala Ile Asp Ala Ala Cys Ala Ser Ala Leu Tyr
Val Leu 195 200 205aaa tta gct caa gac cac ctt gtt tca ggt gcc gtt
gat atg atg tta 672Lys Leu Ala Gln Asp His Leu Val Ser Gly Ala Val
Asp Met Met Leu 210 215 220tgt gga gcg acg tgc ttc cca gaa cca ttc
ttc atc ttg tct ggg ttc 720Cys Gly Ala Thr Cys Phe Pro Glu Pro Phe
Phe Ile Leu Ser Gly Phe225 230 235 240tcg act ttt caa gcg atg cct
gnt ggg gca gat gga gtc tca cta cct 768Ser Thr Phe Gln Ala Met Pro
Xaa Gly Ala Asp Gly Val Ser Leu Pro 245 250 255ctc cat aaa acg agt
gct ggg ctc act cca ggt gaa ggg ggg tcc att 816Leu His Lys Thr Ser
Ala Gly Leu Thr Pro Gly Glu Gly Gly Ser Ile 260 265 270atg gtg ctc
aag cga ctg aaa gac gct atc aga gat gga aat cac att 864Met Val Leu
Lys Arg Leu Lys Asp Ala Ile Arg Asp Gly Asn His Ile 275 280 285tat
ggt gtg ctc ctt gaa gca aat tta agt aac gca ggt tgt ggg ctt 912Tyr
Gly Val Leu Leu Glu Ala Asn Leu Ser Asn Ala Gly Cys Gly Leu 290 295
300cca ctc agc ccg cac tta ccg agc gaa gaa tca tgt att cgt gat acc
960Pro Leu Ser Pro His Leu Pro Ser Glu Glu Ser Cys Ile Arg Asp
Thr305 310 315 320tac cgc cgt gct gga gtt gct gca gat caa agt att
cag tat att gag 1008Tyr Arg Arg Ala Gly Val Ala Ala Asp Gln Ser Ile
Gln Tyr Ile Glu 325 330 335tgc cac gct acg gga acc cct cga ggg gat
gtc gtg gaa att gag gcg 1056Cys His Ala Thr Gly Thr Pro Arg Gly Asp
Val Val Glu Ile Glu Ala 340 345 350gtt gaa aga gtt ttc aag aaa aac
gtt cca cgc tta ggc tcg acg aaa 1104Val Glu Arg Val Phe Lys Lys Asn
Val Pro Arg Leu Gly Ser Thr Lys 355 360 365gga aat ttt ggt cac tcg
tta gtt gcg gct ggt ttc gca ggt atg gca 1152Gly Asn Phe Gly His Ser
Leu Val Ala Ala Gly Phe Ala Gly Met Ala 370 375 380aag ctt ctt ctt
gca atg gaa cat gga gtg att cct ccc aca cca ggt 1200Lys Leu Leu Leu
Ala Met Glu His Gly Val Ile Pro Pro Thr Pro Gly385 390 395 400ctt
gat gct tcg aac cag gca agt gag cac gtt gtg aca aag gct atc 1248Leu
Asp Ala Ser Asn Gln Ala Ser Glu His Val Val Thr Lys Ala Ile 405 410
415act tgg cct gag aca cat ggg gct cca aaa cga gct ggc ctt tca gca
1296Thr Trp Pro Glu Thr His Gly Ala Pro Lys Arg Ala Gly Leu Ser Ala
420 425 430ttt gga ttt ggt ggg act aat gcg cat gca ctc ttc gaa gag
ttt aat 1344Phe Gly Phe Gly Gly Thr Asn Ala His Ala Leu Phe Glu Glu
Phe Asn 435 440 445gcc gag ggc ata agt tat cgc cct gga aag cct cca
gtc gaa tcg aat 1392Ala Glu Gly Ile Ser Tyr Arg Pro Gly Lys Pro Pro
Val Glu Ser Asn 450 455 460acc cgt cct tcc gtc gta ata act ggg atg
gac tgt acc ttt ggg agc 1440Thr Arg Pro Ser Val Val Ile Thr Gly Met
Asp Cys Thr Phe Gly Ser465 470 475 480ctt gaa ggg att gat gcg ttc
gag act gcc ctg tac gag ggg cgt gac 1488Leu Glu Gly Ile Asp Ala Phe
Glu Thr Ala Leu Tyr Glu Gly Arg Asp 485 490 495gca gct cgt gac tta
ccc gcc aaa cgt tgg agg ttc cta ggt gag gac 1536Ala Ala Arg Asp Leu
Pro Ala Lys Arg Trp Arg Phe Leu Gly Glu Asp 500 505 510ttg gag ttt
ctc cga gcc atc agg ctc aag gaa aag cct agg ggt tgt 1584Leu Glu Phe
Leu Arg Ala Ile Arg Leu Lys Glu Lys Pro Arg Gly Cys 515 520 525ttt
gtg gag agt gtt gac gtt aac ttt aga cgg ctg aaa acg ccc ttg 1632Phe
Val Glu Ser Val Asp Val Asn Phe Arg Arg Leu Lys Thr Pro Leu 530 535
540aca cca gaa gat atg ttg cgg ccc caa caa ctc ttg gcg gtt tct acg
1680Thr Pro Glu Asp Met Leu Arg Pro Gln Gln Leu Leu Ala Val Ser
Thr545 550 555 560atg gac cga gca att atc gat gca ggt cta aag aag
ggc caa cat gta 1728Met Asp Arg Ala Ile Ile Asp Ala Gly Leu Lys Lys
Gly Gln His Val 565 570 575gca gtt ctt gtt ggc cta gga act gac ctg
gaa ctt tac cgt cat cga 1776Ala Val Leu Val Gly Leu Gly Thr Asp Leu
Glu Leu Tyr Arg His Arg 580 585 590gca aga gtc gcg ctt aaa gag gtt
ttg cac ccg agc tta aag tca gac 1824Ala Arg Val Ala Leu Lys Glu Val
Leu His Pro Ser Leu Lys Ser Asp 595 600 605act gca att ctc cag aaa
ata atg caa tat gtg aat gat gca gga act 1872Thr Ala Ile Leu Gln Lys
Ile Met Gln Tyr Val Asn Asp Ala Gly Thr 610 615 620tcg act tca tac
aca tct tac att gga aac ctc gtt gcc acg cgt att 1920Ser Thr Ser Tyr
Thr Ser Tyr Ile Gly Asn Leu Val Ala Thr Arg Ile625 630 635 640tcg
tct cag tgg gga ttc aca ggg ccg tcc ttt act gtc aca gaa gga 1968Ser
Ser Gln Trp Gly Phe Thr Gly Pro Ser Phe Thr Val Thr Glu Gly 645 650
655aat aat tcc gtg tac aga tgt gca caa cta gcc aaa gat atg ctt cag
2016Asn Asn Ser Val Tyr Arg Cys Ala Gln Leu Ala Lys Asp Met Leu Gln
660 665 670gtt aac cga gtt gat gct gtc gtc atc gca ggc gtt gat ctc
aac gga 2064Val Asn Arg Val Asp Ala Val Val Ile Ala Gly Val Asp Leu
Asn Gly 675 680 685agc gcc gaa agt ttt ttt gtc cga gca aat cgt caa
aag ata tcc aag 2112Ser Ala Glu Ser Phe Phe Val Arg Ala Asn Arg Gln
Lys Ile Ser Lys 690 695 700cta agt cat cca tgt gca agc ttc gac aga
gat gca gat gga ttt ttc 2160Leu Ser His Pro Cys Ala Ser Phe Asp Arg
Asp Ala Asp Gly Phe Phe705 710 715 720gca ggt gag ggc tgt ggt gcc
cta gtt ttc aag agg tta gaa gac tgt 2208Ala Gly Glu Gly Cys Gly Ala
Leu Val Phe Lys Arg Leu Glu Asp Cys 725 730 735gct cct cag gaa aaa
att tat gct agt ata gac tct atc gca ata gat 2256Ala Pro Gln Glu Lys
Ile Tyr Ala Ser Ile Asp Ser Ile Ala Ile Asp 740 745 750aaa gag cct
act agc tca gct gtg aaa gct gtc tac caa agt gat tcg 2304Lys Glu Pro
Thr Ser Ser Ala Val Lys Ala Val Tyr Gln Ser Asp Ser 755 760 765agt
ctc tcc gat att gag ctg tta gaa atc agt gga gac tcc aaa cgg 2352Ser
Leu Ser Asp Ile Glu Leu Leu Glu Ile Ser Gly Asp Ser Lys Arg 770 775
780ttt gca gca ttc gaa ggc gct gtg gaa att caa tca agt gtg gaa gcc
2400Phe Ala Ala Phe Glu Gly Ala Val Glu Ile Gln Ser Ser Val Glu
Ala785 790 795 800cag cta aaa gga ctt tcc aaa gtc ctt gaa cct gca
aaa ggc caa ggc 2448Gln Leu Lys Gly Leu Ser Lys Val Leu Glu Pro Ala
Lys Gly Gln Gly 805 810 815gta gcg gtg gga agt act cga gca acc gtt
ggg gat ata ggg tat gct 2496Val Ala Val Gly Ser Thr Arg Ala Thr Val
Gly Asp Ile Gly Tyr Ala 820 825 830aca gga gcg gca agc ctg att aaa
act gca ctc tgc tta tat aat cgc 2544Thr Gly Ala Ala Ser Leu Ile Lys
Thr Ala Leu Cys Leu Tyr Asn Arg 835 840 845tac ctt ccg gca tta gca
aac tgg agt ggc cca tgt gaa cag tcc gcc 2592Tyr Leu Pro Ala Leu Ala
Asn Trp Ser Gly Pro Cys Glu Gln Ser Ala 850 855 860tgg ggc tca aac
atg ttc gtt tgc cat gaa aca cgg ccg tgg atg aaa 2640Trp Gly Ser Asn
Met Phe Val Cys His Glu Thr Arg Pro Trp Met Lys865 870 875 880aac
cag aat gaa aag aga tgt gcc ctc att tct gga aca gat cca tct 2688Asn
Gln Asn Glu Lys Arg Cys Ala Leu Ile Ser Gly Thr Asp Pro Ser 885 890
895cat aca tgc ttt tcc ctc gta cta tcg gat act ggg tgt tat gaa gag
2736His Thr Cys Phe Ser Leu Val Leu Ser Asp Thr Gly Cys Tyr Glu Glu
900 905 910cac aat cga acg tgc ttt gat gtg caa gcg cca cag cta gtt
ctg ata 2784His Asn Arg Thr Cys Phe Asp Val Gln Ala Pro Gln Leu Val
Leu Ile 915 920 925cac gga ttc gat gga aaa act att gtg cgg cga ctt
gaa gga tat ctc 2832His Gly Phe Asp Gly Lys Thr Ile Val Arg Arg Leu
Glu Gly Tyr Leu 930 935 940ctt gaa ctt gtt gaa ggg cat gca agc cct
tca gag tat ttc cac aaa 2880Leu Glu Leu Val Glu Gly His Ala Ser Pro
Ser Glu Tyr Phe His Lys945 950 955 960ctg att gga caa agt cta ctt
gag aac tcg aaa gaa agt aaa ctc aca 2928Leu Ile Gly Gln Ser Leu Leu
Glu Asn Ser Lys Glu Ser Lys Leu Thr 965 970 975ctt tcg ctt gtg tgc
aat ccg aac cag ctc caa aag gag ctc atg ctt 2976Leu Ser Leu Val Cys
Asn Pro Asn Gln Leu Gln Lys Glu Leu Met Leu 980 985 990gct atc aaa
gga gta caa cga agc atg tta aca ggg aag gat tgg gtc 3024Ala Ile Lys
Gly Val Gln Arg Ser Met Leu Thr Gly Lys Asp Trp Val 995 1000
1005agt cca tca gga agt tgt ttt gcc cca aat ccg tta tca agc gca
3069Ser Pro Ser Gly Ser Cys Phe Ala Pro Asn Pro Leu Ser Ser Ala
1010 1015 1020aaa gtg gca ttc atg tac gga gaa ggc cga agc ccg tac
tgt ggt 3114Lys Val Ala Phe Met Tyr Gly Glu Gly Arg Ser Pro Tyr Cys
Gly 1025 1030 1035gta ggc ttg ggt cta cat cgt ttg tgg ccc ggt ctc
cat gaa aat 3159Val Gly Leu Gly Leu His Arg Leu Trp Pro Gly Leu His
Glu Asn 1040 1045 1050gtg aac aat aag aca gtc gat tta tgg acg gaa
gga gat ggt tgg 3204Val Asn Asn Lys Thr Val Asp Leu Trp Thr Glu Gly
Asp Gly Trp 1055 1060 1065tta tat cct cga acg ttg aca cga gaa gag
cat aca aaa gcc atc 3249Leu Tyr Pro Arg Thr Leu Thr Arg Glu Glu His
Thr Lys Ala Ile 1070 1075 1080gaa tct ttc aac gca aat caa att gaa
atg ttt cgc gct ggg att 3294Glu Ser Phe Asn Ala Asn Gln Ile Glu Met
Phe Arg Ala Gly Ile 1085 1090 1095ttc atc tca atg tgt cag aca gac
tat gtc atg aat gtt ctc ggt 3339Phe Ile Ser Met Cys Gln Thr Asp Tyr
Val Met Asn Val Leu Gly 1100 1105 1110gtc cag cct aag gcc gga ttt
ggg ctg agc ttg gga gaa att tca 3384Val Gln Pro Lys Ala Gly Phe Gly
Leu Ser Leu Gly Glu Ile Ser 1115 1120 1125atg ctc ttt gcg atg tca
aag gag aac tgc agg cag tca cag gaa 3429Met Leu Phe Ala Met Ser Lys
Glu Asn Cys Arg Gln Ser Gln Glu 1130 1135 1140atg acc aat cgt ttg
cgc ggt tct cca gtg tgg tct aac gag ctt 3474Met Thr Asn Arg Leu Arg
Gly Ser Pro Val Trp Ser Asn Glu Leu 1145 1150 1155gct atc aac ttc
aat gca att cgc aag tta tgg aaa atc ccc cga 3519Ala Ile Asn Phe Asn
Ala Ile Arg Lys Leu Trp Lys Ile Pro Arg 1160 1165 1170gga gct ccc
tta gaa tcc ttt tgg caa gga tac ttg gtt cac ggc 3564Gly Ala Pro Leu
Glu Ser Phe Trp Gln Gly Tyr Leu Val His Gly 1175 1180 1185aca aga
gaa gaa gta gag cat gct att ggt ctt tct gag cct tat 3609Thr Arg Glu
Glu Val Glu His Ala Ile Gly Leu Ser Glu Pro Tyr 1190 1195 1200gta
cgt ctg ctt att gtg aac gat tca agg agt gcc ttg att gct 3654Val Arg
Leu Leu Ile Val Asn Asp Ser Arg Ser Ala Leu Ile Ala 1205 1210
1215gga aaa cca gac gcc tgt cag gca gta atc agt aga cta aac tcc
3699Gly Lys Pro Asp Ala Cys Gln Ala Val Ile Ser Arg Leu Asn Ser
1220 1225 1230aag ttc cct tct ctg ccg gta aag caa gga atg att ggt
cat tgc 3744Lys Phe Pro Ser Leu Pro Val Lys Gln Gly Met Ile Gly His
Cys 1235 1240 1245cca gaa gtt cgt gcg ttc atc aaa gat att ggg tac
atc cat gaa 3789Pro Glu Val Arg Ala Phe Ile Lys Asp Ile Gly Tyr Ile
His Glu 1250 1255 1260aca ctc cga att tcc aat gac tat tcg gat tgt
cag ctt ttc tca 3834Thr Leu Arg Ile Ser Asn Asp Tyr Ser Asp Cys Gln
Leu Phe Ser 1265 1270 1275gcg gta acc aag ggc gca ctt gac agc tcc
aca atg gaa atc aaa 3879Ala Val Thr Lys Gly Ala Leu Asp Ser Ser Thr
Met Glu Ile Lys 1280 1285 1290cac ttt gtg gga gag gtc tac tcc cgg
atc gca gac ttt cct caa 3924His Phe Val Gly Glu Val Tyr Ser Arg Ile
Ala Asp Phe Pro Gln 1295 1300 1305atc gtc aac acg gtg cat tcg gct
ggt tat gac gta ttt ctt gag 3969Ile Val Asn Thr Val His Ser Ala Gly
Tyr Asp Val Phe Leu Glu 1310 1315 1320ctt ggc tgt gat gct tct aga
tct gca gca gtt caa aac att ctt 4014Leu Gly Cys Asp Ala Ser Arg Ser
Ala Ala Val Gln Asn Ile Leu 1325 1330 1335ggt ggt caa gga aag ttc
ttg tct aca gct att gac aaa aaa gga 4059Gly Gly Gln Gly Lys Phe Leu
Ser Thr Ala Ile Asp Lys Lys Gly 1340 1345 1350cac tcc gcc tgg tca
caa gta ctt cgg gct acc gca tca tta gct 4104His Ser Ala Trp Ser Gln
Val Leu Arg Ala Thr Ala Ser Leu Ala 1355 1360 1365gca cat cga gta
ccg gga atc tca att ttg gat ttg ttt cac cca 4149Ala His Arg Val Pro
Gly Ile Ser Ile Leu Asp Leu Phe His Pro 1370 1375 1380aat ttc cga
gaa atg tgc tgt aca atg gca acc aca cct aaa gtg 4194Asn Phe Arg Glu
Met Cys Cys Thr Met Ala Thr Thr Pro Lys Val 1385 1390 1395gaa gat
aag ttc ctg cgc acg att caa atc aat ggt cgg ttt gaa 4239Glu Asp Lys
Phe Leu Arg Thr Ile Gln Ile Asn Gly Arg Phe Glu 1400 1405 1410aaa
gaa atg att cac cta gaa gat aca aca tta agt tgc tta ccc 4284Lys Glu
Met Ile His Leu Glu Asp Thr Thr Leu Ser Cys Leu Pro 1415 1420
1425gct cca agt gaa gca aat atc gca gct
att caa tct cgg tca att 4329Ala Pro Ser Glu Ala Asn Ile Ala Ala Ile
Gln Ser Arg Ser Ile 1430 1435 1440cga tct gct gcg gcg cgt tct gga
caa tcc cat gat tgt gca tcc 4374Arg Ser Ala Ala Ala Arg Ser Gly Gln
Ser His Asp Cys Ala Ser 1445 1450 1455cat agc cat gaa gaa aat aag
gat tca tgc cct gaa aag ctg aag 4419His Ser His Glu Glu Asn Lys Asp
Ser Cys Pro Glu Lys Leu Lys 1460 1465 1470ctt gat tct gtg tcc gtc
gcc ata aat ttc gac aat gat gac cgc 4464Leu Asp Ser Val Ser Val Ala
Ile Asn Phe Asp Asn Asp Asp Arg 1475 1480 1485att cag ctt ggg cac
gcg ggt ttt cgg gag atg tac aat aca aga 4509Ile Gln Leu Gly His Ala
Gly Phe Arg Glu Met Tyr Asn Thr Arg 1490 1495 1500tat agc ttg tac
aca ggg gcg atg gca aag gga att gca tct gca 4554Tyr Ser Leu Tyr Thr
Gly Ala Met Ala Lys Gly Ile Ala Ser Ala 1505 1510 1515gat ctt gtc
att gcc gct ggg aaa gag ggc atc cta gct tcc tat 4599Asp Leu Val Ile
Ala Ala Gly Lys Glu Gly Ile Leu Ala Ser Tyr 1520 1525 1530gga gct
gga gga cta cct ctt gct act gtt cga aag gga ata gac 4644Gly Ala Gly
Gly Leu Pro Leu Ala Thr Val Arg Lys Gly Ile Asp 1535 1540 1545aaa
att caa caa gcc ttg cca agt ggc cca tat gct gta aat ctt 4689Lys Ile
Gln Gln Ala Leu Pro Ser Gly Pro Tyr Ala Val Asn Leu 1550 1555
1560att cac tct ccc ttt gac ggc aac ttg gag cag gga aac gtc gat
4734Ile His Ser Pro Phe Asp Gly Asn Leu Glu Gln Gly Asn Val Asp
1565 1570 1575ttg ttc ttg gaa aag aac gtc cgc gtg gcg gaa tgt tcc
gcg ttt 4779Leu Phe Leu Glu Lys Asn Val Arg Val Ala Glu Cys Ser Ala
Phe 1580 1585 1590aca acg cta aca gtg cca gta gta cac tat cgt gct
gca ggg ctt 4824Thr Thr Leu Thr Val Pro Val Val His Tyr Arg Ala Ala
Gly Leu 1595 1600 1605gtt cgg cgc caa gat gga agc att ttg atc aag
aac cga atc att 4869Val Arg Arg Gln Asp Gly Ser Ile Leu Ile Lys Asn
Arg Ile Ile 1610 1615 1620gct aaa gta tct agg aca gaa ctc gct gag
atg ttc ctt cgt ccg 4914Ala Lys Val Ser Arg Thr Glu Leu Ala Glu Met
Phe Leu Arg Pro 1625 1630 1635gca cct caa atc atc ctc gaa aaa ctg
gta gca gca gaa atc att 4959Ala Pro Gln Ile Ile Leu Glu Lys Leu Val
Ala Ala Glu Ile Ile 1640 1645 1650tca tct gac caa gcg cgt atg gca
gcc aaa gtt ccc atg gcg gac 5004Ser Ser Asp Gln Ala Arg Met Ala Ala
Lys Val Pro Met Ala Asp 1655 1660 1665gac atc gca gtc gaa gcc gac
tct ggt ggg cac acg gat aat cgg 5049Asp Ile Ala Val Glu Ala Asp Ser
Gly Gly His Thr Asp Asn Arg 1670 1675 1680cct atg cac gtc att ttg
ccc ctg ata att caa ctc cgc aat act 5094Pro Met His Val Ile Leu Pro
Leu Ile Ile Gln Leu Arg Asn Thr 1685 1690 1695ata ctt gca gag tat
ggc tgt gcc acg gct ttt cgt acc cgt ata 5139Ile Leu Ala Glu Tyr Gly
Cys Ala Thr Ala Phe Arg Thr Arg Ile 1700 1705 1710ggc gct gga gga
ggc att ggt tgt cct tca gcg gcc ctc gca gcc 5184Gly Ala Gly Gly Gly
Ile Gly Cys Pro Ser Ala Ala Leu Ala Ala 1715 1720 1725ttt gat atg
ggt gcg agt ttt gtc gtg act gga agc ata aat caa 5229Phe Asp Met Gly
Ala Ser Phe Val Val Thr Gly Ser Ile Asn Gln 1730 1735 1740att tgc
cgc gag gca ggg act tgc gat act gtt cgg gag cta ctt 5274Ile Cys Arg
Glu Ala Gly Thr Cys Asp Thr Val Arg Glu Leu Leu 1745 1750 1755gcc
aac tca agc tac tcg gac gtg acg atg gcg cca gca gca gac 5319Ala Asn
Ser Ser Tyr Ser Asp Val Thr Met Ala Pro Ala Ala Asp 1760 1765
1770atg ttt gac caa ggt gtg aaa ctc caa gtc tta aaa cga gga acg
5364Met Phe Asp Gln Gly Val Lys Leu Gln Val Leu Lys Arg Gly Thr
1775 1780 1785atg ttt cca agc aga gca aat aaa ctc cgg aag ctc ttt
gtg aac 5409Met Phe Pro Ser Arg Ala Asn Lys Leu Arg Lys Leu Phe Val
Asn 1790 1795 1800tac gaa tct cta gaa aca ctc ccg tcg aaa gag ttg
aaa tac ctg 5454Tyr Glu Ser Leu Glu Thr Leu Pro Ser Lys Glu Leu Lys
Tyr Leu 1805 1810 1815gaa aac atc ata ttc aag caa gca gta gac cag
gtg tgg gag gaa 5499Glu Asn Ile Ile Phe Lys Gln Ala Val Asp Gln Val
Trp Glu Glu 1820 1825 1830aca aag cgc ttt tac tgt gaa aaa ctg aac
aat cca gat aaa att 5544Thr Lys Arg Phe Tyr Cys Glu Lys Leu Asn Asn
Pro Asp Lys Ile 1835 1840 1845gca agg gcc atg aaa gat cct aaa ttg
aag atg tcg ctt tgc ttt 5589Ala Arg Ala Met Lys Asp Pro Lys Leu Lys
Met Ser Leu Cys Phe 1850 1855 1860cgg tgg tat ctc tcc aag agc tct
ggg tgg gcc aac gca gga att 5634Arg Trp Tyr Leu Ser Lys Ser Ser Gly
Trp Ala Asn Ala Gly Ile 1865 1870 1875aaa tct cgt gca ctc gac tac
cag atc tgg tgt ggc ccg gca atg 5679Lys Ser Arg Ala Leu Asp Tyr Gln
Ile Trp Cys Gly Pro Ala Met 1880 1885 1890ggc tcg ttc aac aat ttc
gcc agc ggc aca tcc ctc gat tgg aaa 5724Gly Ser Phe Asn Asn Phe Ala
Ser Gly Thr Ser Leu Asp Trp Lys 1895 1900 1905gtg act ggg gtt ttc
cct ggc gtt gcg gaa gta aac atg gcc att 5769Val Thr Gly Val Phe Pro
Gly Val Ala Glu Val Asn Met Ala Ile 1910 1915 1920tta gat ggc gcg
cga gaa cta gct gct aaa cga aat taa 5808Leu Asp Gly Ala Arg Glu Leu
Ala Ala Lys Arg Asn 1925 1930 1935521935PRTThraustochytrium
sp.misc_feature(248)..(248)The 'Xaa' at location 248 stands for
Asp, Gly, Ala, or Val. 52Met Gln Leu Pro Pro Ala His Ser Ala Asp
Glu Asn Arg Ile Ala Val1 5 10 15Val Gly Met Ala Val Lys Tyr Ala Gly
Cys Asp Asn Lys Glu Glu Phe 20 25 30Trp Lys Thr Leu Met Asn Gly Ser
Ile Asn Thr Lys Ser Ile Ser Ala 35 40 45Ala Arg Leu Gly Ser Asn Lys
Arg Asp Glu His Tyr Val Pro Glu Arg 50 55 60Ser Lys Tyr Ala Asp Thr
Phe Cys Asn Glu Arg Tyr Gly Cys Ile Gln65 70 75 80Gln Gly Thr Asp
Asn Glu His Asp Leu Leu Leu Gly Leu Ala Gln Glu 85 90 95Ala Leu Ala
Asp Ala Ala Gly Arg Met Glu Lys Gln Pro Ser Glu Ala 100 105 110Phe
Asp Leu Glu Asn Thr Gly Ile Val Ser Gly Cys Leu Ser Phe Pro 115 120
125Met Asp Asn Leu Gln Gly Glu Leu Leu Asn Leu Tyr Gln Ser His Val
130 135 140Glu Lys Gln Leu Pro Pro Ser Ala Leu Val Glu Ala Val Lys
Leu Trp145 150 155 160Ser Glu Arg Gln Lys Ser Thr Lys Ala His Ala
Gly Asp Lys Arg Arg 165 170 175Phe Ile Asp Pro Ala Ser Phe Val Ala
Asp Lys Leu Asn Leu Gly Pro 180 185 190Leu His Tyr Ala Ile Asp Ala
Ala Cys Ala Ser Ala Leu Tyr Val Leu 195 200 205Lys Leu Ala Gln Asp
His Leu Val Ser Gly Ala Val Asp Met Met Leu 210 215 220Cys Gly Ala
Thr Cys Phe Pro Glu Pro Phe Phe Ile Leu Ser Gly Phe225 230 235
240Ser Thr Phe Gln Ala Met Pro Xaa Gly Ala Asp Gly Val Ser Leu Pro
245 250 255Leu His Lys Thr Ser Ala Gly Leu Thr Pro Gly Glu Gly Gly
Ser Ile 260 265 270Met Val Leu Lys Arg Leu Lys Asp Ala Ile Arg Asp
Gly Asn His Ile 275 280 285Tyr Gly Val Leu Leu Glu Ala Asn Leu Ser
Asn Ala Gly Cys Gly Leu 290 295 300Pro Leu Ser Pro His Leu Pro Ser
Glu Glu Ser Cys Ile Arg Asp Thr305 310 315 320Tyr Arg Arg Ala Gly
Val Ala Ala Asp Gln Ser Ile Gln Tyr Ile Glu 325 330 335Cys His Ala
Thr Gly Thr Pro Arg Gly Asp Val Val Glu Ile Glu Ala 340 345 350Val
Glu Arg Val Phe Lys Lys Asn Val Pro Arg Leu Gly Ser Thr Lys 355 360
365Gly Asn Phe Gly His Ser Leu Val Ala Ala Gly Phe Ala Gly Met Ala
370 375 380Lys Leu Leu Leu Ala Met Glu His Gly Val Ile Pro Pro Thr
Pro Gly385 390 395 400Leu Asp Ala Ser Asn Gln Ala Ser Glu His Val
Val Thr Lys Ala Ile 405 410 415Thr Trp Pro Glu Thr His Gly Ala Pro
Lys Arg Ala Gly Leu Ser Ala 420 425 430Phe Gly Phe Gly Gly Thr Asn
Ala His Ala Leu Phe Glu Glu Phe Asn 435 440 445Ala Glu Gly Ile Ser
Tyr Arg Pro Gly Lys Pro Pro Val Glu Ser Asn 450 455 460Thr Arg Pro
Ser Val Val Ile Thr Gly Met Asp Cys Thr Phe Gly Ser465 470 475
480Leu Glu Gly Ile Asp Ala Phe Glu Thr Ala Leu Tyr Glu Gly Arg Asp
485 490 495Ala Ala Arg Asp Leu Pro Ala Lys Arg Trp Arg Phe Leu Gly
Glu Asp 500 505 510Leu Glu Phe Leu Arg Ala Ile Arg Leu Lys Glu Lys
Pro Arg Gly Cys 515 520 525Phe Val Glu Ser Val Asp Val Asn Phe Arg
Arg Leu Lys Thr Pro Leu 530 535 540Thr Pro Glu Asp Met Leu Arg Pro
Gln Gln Leu Leu Ala Val Ser Thr545 550 555 560Met Asp Arg Ala Ile
Ile Asp Ala Gly Leu Lys Lys Gly Gln His Val 565 570 575Ala Val Leu
Val Gly Leu Gly Thr Asp Leu Glu Leu Tyr Arg His Arg 580 585 590Ala
Arg Val Ala Leu Lys Glu Val Leu His Pro Ser Leu Lys Ser Asp 595 600
605Thr Ala Ile Leu Gln Lys Ile Met Gln Tyr Val Asn Asp Ala Gly Thr
610 615 620Ser Thr Ser Tyr Thr Ser Tyr Ile Gly Asn Leu Val Ala Thr
Arg Ile625 630 635 640Ser Ser Gln Trp Gly Phe Thr Gly Pro Ser Phe
Thr Val Thr Glu Gly 645 650 655Asn Asn Ser Val Tyr Arg Cys Ala Gln
Leu Ala Lys Asp Met Leu Gln 660 665 670Val Asn Arg Val Asp Ala Val
Val Ile Ala Gly Val Asp Leu Asn Gly 675 680 685Ser Ala Glu Ser Phe
Phe Val Arg Ala Asn Arg Gln Lys Ile Ser Lys 690 695 700Leu Ser His
Pro Cys Ala Ser Phe Asp Arg Asp Ala Asp Gly Phe Phe705 710 715
720Ala Gly Glu Gly Cys Gly Ala Leu Val Phe Lys Arg Leu Glu Asp Cys
725 730 735Ala Pro Gln Glu Lys Ile Tyr Ala Ser Ile Asp Ser Ile Ala
Ile Asp 740 745 750Lys Glu Pro Thr Ser Ser Ala Val Lys Ala Val Tyr
Gln Ser Asp Ser 755 760 765Ser Leu Ser Asp Ile Glu Leu Leu Glu Ile
Ser Gly Asp Ser Lys Arg 770 775 780Phe Ala Ala Phe Glu Gly Ala Val
Glu Ile Gln Ser Ser Val Glu Ala785 790 795 800Gln Leu Lys Gly Leu
Ser Lys Val Leu Glu Pro Ala Lys Gly Gln Gly 805 810 815Val Ala Val
Gly Ser Thr Arg Ala Thr Val Gly Asp Ile Gly Tyr Ala 820 825 830Thr
Gly Ala Ala Ser Leu Ile Lys Thr Ala Leu Cys Leu Tyr Asn Arg 835 840
845Tyr Leu Pro Ala Leu Ala Asn Trp Ser Gly Pro Cys Glu Gln Ser Ala
850 855 860Trp Gly Ser Asn Met Phe Val Cys His Glu Thr Arg Pro Trp
Met Lys865 870 875 880Asn Gln Asn Glu Lys Arg Cys Ala Leu Ile Ser
Gly Thr Asp Pro Ser 885 890 895His Thr Cys Phe Ser Leu Val Leu Ser
Asp Thr Gly Cys Tyr Glu Glu 900 905 910His Asn Arg Thr Cys Phe Asp
Val Gln Ala Pro Gln Leu Val Leu Ile 915 920 925His Gly Phe Asp Gly
Lys Thr Ile Val Arg Arg Leu Glu Gly Tyr Leu 930 935 940Leu Glu Leu
Val Glu Gly His Ala Ser Pro Ser Glu Tyr Phe His Lys945 950 955
960Leu Ile Gly Gln Ser Leu Leu Glu Asn Ser Lys Glu Ser Lys Leu Thr
965 970 975Leu Ser Leu Val Cys Asn Pro Asn Gln Leu Gln Lys Glu Leu
Met Leu 980 985 990Ala Ile Lys Gly Val Gln Arg Ser Met Leu Thr Gly
Lys Asp Trp Val 995 1000 1005Ser Pro Ser Gly Ser Cys Phe Ala Pro
Asn Pro Leu Ser Ser Ala 1010 1015 1020Lys Val Ala Phe Met Tyr Gly
Glu Gly Arg Ser Pro Tyr Cys Gly 1025 1030 1035Val Gly Leu Gly Leu
His Arg Leu Trp Pro Gly Leu His Glu Asn 1040 1045 1050Val Asn Asn
Lys Thr Val Asp Leu Trp Thr Glu Gly Asp Gly Trp 1055 1060 1065Leu
Tyr Pro Arg Thr Leu Thr Arg Glu Glu His Thr Lys Ala Ile 1070 1075
1080Glu Ser Phe Asn Ala Asn Gln Ile Glu Met Phe Arg Ala Gly Ile
1085 1090 1095Phe Ile Ser Met Cys Gln Thr Asp Tyr Val Met Asn Val
Leu Gly 1100 1105 1110Val Gln Pro Lys Ala Gly Phe Gly Leu Ser Leu
Gly Glu Ile Ser 1115 1120 1125Met Leu Phe Ala Met Ser Lys Glu Asn
Cys Arg Gln Ser Gln Glu 1130 1135 1140Met Thr Asn Arg Leu Arg Gly
Ser Pro Val Trp Ser Asn Glu Leu 1145 1150 1155Ala Ile Asn Phe Asn
Ala Ile Arg Lys Leu Trp Lys Ile Pro Arg 1160 1165 1170Gly Ala Pro
Leu Glu Ser Phe Trp Gln Gly Tyr Leu Val His Gly 1175 1180 1185Thr
Arg Glu Glu Val Glu His Ala Ile Gly Leu Ser Glu Pro Tyr 1190 1195
1200Val Arg Leu Leu Ile Val Asn Asp Ser Arg Ser Ala Leu Ile Ala
1205 1210 1215Gly Lys Pro Asp Ala Cys Gln Ala Val Ile Ser Arg Leu
Asn Ser 1220 1225 1230Lys Phe Pro Ser Leu Pro Val Lys Gln Gly Met
Ile Gly His Cys 1235 1240 1245Pro Glu Val Arg Ala Phe Ile Lys Asp
Ile Gly Tyr Ile His Glu 1250 1255 1260Thr Leu Arg Ile Ser Asn Asp
Tyr Ser Asp Cys Gln Leu Phe Ser 1265 1270 1275Ala Val Thr Lys Gly
Ala Leu Asp Ser Ser Thr Met Glu Ile Lys 1280 1285 1290His Phe Val
Gly Glu Val Tyr Ser Arg Ile Ala Asp Phe Pro Gln 1295 1300 1305Ile
Val Asn Thr Val His Ser Ala Gly Tyr Asp Val Phe Leu Glu 1310 1315
1320Leu Gly Cys Asp Ala Ser Arg Ser Ala Ala Val Gln Asn Ile Leu
1325 1330 1335Gly Gly Gln Gly Lys Phe Leu Ser Thr Ala Ile Asp Lys
Lys Gly 1340 1345 1350His Ser Ala Trp Ser Gln Val Leu Arg Ala Thr
Ala Ser Leu Ala 1355 1360 1365Ala His Arg Val Pro Gly Ile Ser Ile
Leu Asp Leu Phe His Pro 1370 1375 1380Asn Phe Arg Glu Met Cys Cys
Thr Met Ala Thr Thr Pro Lys Val 1385 1390 1395Glu Asp Lys Phe Leu
Arg Thr Ile Gln Ile Asn Gly Arg Phe Glu 1400 1405 1410Lys Glu Met
Ile His Leu Glu Asp Thr Thr Leu Ser Cys Leu Pro 1415 1420 1425Ala
Pro Ser Glu Ala Asn Ile Ala Ala Ile Gln Ser Arg Ser Ile 1430 1435
1440Arg Ser Ala Ala Ala Arg Ser Gly Gln Ser His Asp Cys Ala Ser
1445 1450 1455His Ser His Glu Glu Asn Lys Asp Ser Cys Pro Glu Lys
Leu Lys 1460 1465 1470Leu Asp Ser Val Ser Val Ala Ile Asn Phe Asp
Asn Asp Asp Arg 1475 1480 1485Ile Gln Leu Gly His Ala Gly Phe Arg
Glu Met Tyr Asn Thr Arg 1490 1495 1500Tyr Ser Leu Tyr Thr Gly Ala
Met Ala Lys Gly Ile Ala Ser Ala 1505 1510 1515Asp Leu Val Ile Ala
Ala Gly Lys Glu Gly Ile Leu Ala Ser Tyr 1520 1525 1530Gly Ala Gly
Gly Leu Pro Leu Ala Thr Val Arg Lys Gly Ile Asp 1535 1540 1545Lys
Ile Gln Gln Ala Leu Pro Ser Gly Pro Tyr Ala Val Asn Leu 1550 1555
1560Ile His Ser Pro Phe Asp Gly Asn Leu Glu Gln Gly Asn Val Asp
1565 1570 1575Leu Phe Leu Glu Lys Asn Val Arg Val Ala Glu Cys Ser
Ala Phe 1580 1585 1590Thr Thr Leu Thr Val Pro Val Val His Tyr
Arg
Ala Ala Gly Leu 1595 1600 1605Val Arg Arg Gln Asp Gly Ser Ile Leu
Ile Lys Asn Arg Ile Ile 1610 1615 1620Ala Lys Val Ser Arg Thr Glu
Leu Ala Glu Met Phe Leu Arg Pro 1625 1630 1635Ala Pro Gln Ile Ile
Leu Glu Lys Leu Val Ala Ala Glu Ile Ile 1640 1645 1650Ser Ser Asp
Gln Ala Arg Met Ala Ala Lys Val Pro Met Ala Asp 1655 1660 1665Asp
Ile Ala Val Glu Ala Asp Ser Gly Gly His Thr Asp Asn Arg 1670 1675
1680Pro Met His Val Ile Leu Pro Leu Ile Ile Gln Leu Arg Asn Thr
1685 1690 1695Ile Leu Ala Glu Tyr Gly Cys Ala Thr Ala Phe Arg Thr
Arg Ile 1700 1705 1710Gly Ala Gly Gly Gly Ile Gly Cys Pro Ser Ala
Ala Leu Ala Ala 1715 1720 1725Phe Asp Met Gly Ala Ser Phe Val Val
Thr Gly Ser Ile Asn Gln 1730 1735 1740Ile Cys Arg Glu Ala Gly Thr
Cys Asp Thr Val Arg Glu Leu Leu 1745 1750 1755Ala Asn Ser Ser Tyr
Ser Asp Val Thr Met Ala Pro Ala Ala Asp 1760 1765 1770Met Phe Asp
Gln Gly Val Lys Leu Gln Val Leu Lys Arg Gly Thr 1775 1780 1785Met
Phe Pro Ser Arg Ala Asn Lys Leu Arg Lys Leu Phe Val Asn 1790 1795
1800Tyr Glu Ser Leu Glu Thr Leu Pro Ser Lys Glu Leu Lys Tyr Leu
1805 1810 1815Glu Asn Ile Ile Phe Lys Gln Ala Val Asp Gln Val Trp
Glu Glu 1820 1825 1830Thr Lys Arg Phe Tyr Cys Glu Lys Leu Asn Asn
Pro Asp Lys Ile 1835 1840 1845Ala Arg Ala Met Lys Asp Pro Lys Leu
Lys Met Ser Leu Cys Phe 1850 1855 1860Arg Trp Tyr Leu Ser Lys Ser
Ser Gly Trp Ala Asn Ala Gly Ile 1865 1870 1875Lys Ser Arg Ala Leu
Asp Tyr Gln Ile Trp Cys Gly Pro Ala Met 1880 1885 1890Gly Ser Phe
Asn Asn Phe Ala Ser Gly Thr Ser Leu Asp Trp Lys 1895 1900 1905Val
Thr Gly Val Phe Pro Gly Val Ala Glu Val Asn Met Ala Ile 1910 1915
1920Leu Asp Gly Ala Arg Glu Leu Ala Ala Lys Arg Asn 1925 1930
1935531500DNAThraustochytrium
sp.CDS(1)..(1500)misc_feature(1)..(1500)n = a c t or g 53atg caa
ctt cct cca gcg cat tct gcc gat gag aat cgc atc gcg gtc 48Met Gln
Leu Pro Pro Ala His Ser Ala Asp Glu Asn Arg Ile Ala Val1 5 10 15gtg
ggc atg gcc gtc aaa tat gcg ggc tgt gac aat aaa gaa gag ttt 96Val
Gly Met Ala Val Lys Tyr Ala Gly Cys Asp Asn Lys Glu Glu Phe 20 25
30tgg aag act ttg atg aat ggt agt atc aat acc aag tcg att tcg gca
144Trp Lys Thr Leu Met Asn Gly Ser Ile Asn Thr Lys Ser Ile Ser Ala
35 40 45gca agg ttg ggc agc aat aag cgt gac gaa cac tat gtt cct gaa
cga 192Ala Arg Leu Gly Ser Asn Lys Arg Asp Glu His Tyr Val Pro Glu
Arg 50 55 60tcg aaa tat gca gat acg ttc tgt aac gaa agg tac ggt tgt
atc cag 240Ser Lys Tyr Ala Asp Thr Phe Cys Asn Glu Arg Tyr Gly Cys
Ile Gln65 70 75 80caa ggt acg gat aat gag cat gac ctc ctc cta ggt
ctt gct caa gaa 288Gln Gly Thr Asp Asn Glu His Asp Leu Leu Leu Gly
Leu Ala Gln Glu 85 90 95gct ctc gct gac gct gcc ggg cgg atg gag aaa
caa cct tcg gag gcg 336Ala Leu Ala Asp Ala Ala Gly Arg Met Glu Lys
Gln Pro Ser Glu Ala 100 105 110ttc gat ctg gaa aat act ggc atc gtg
agt ggg tgc tta tct ttt cca 384Phe Asp Leu Glu Asn Thr Gly Ile Val
Ser Gly Cys Leu Ser Phe Pro 115 120 125atg gat aac ctg caa gga gag
ttg ttg aac ttg tat caa agc cat gtg 432Met Asp Asn Leu Gln Gly Glu
Leu Leu Asn Leu Tyr Gln Ser His Val 130 135 140gag aaa caa ctt cca
cct agt gcc ttg gta gaa gcc gtg aag ctt tgg 480Glu Lys Gln Leu Pro
Pro Ser Ala Leu Val Glu Ala Val Lys Leu Trp145 150 155 160tct gag
cga cag aaa tct acg aaa gca cat gca ggg gac aag cgc cgg 528Ser Glu
Arg Gln Lys Ser Thr Lys Ala His Ala Gly Asp Lys Arg Arg 165 170
175ttc att gac cca gct tct ttt gta gct gat aaa ctg aac cta ggc cca
576Phe Ile Asp Pro Ala Ser Phe Val Ala Asp Lys Leu Asn Leu Gly Pro
180 185 190cta cat tat gcg atc gat gca gca tgc gct tct gca ttg tac
gtg tta 624Leu His Tyr Ala Ile Asp Ala Ala Cys Ala Ser Ala Leu Tyr
Val Leu 195 200 205aaa tta gct caa gac cac ctt gtt tca ggt gcc gtt
gat atg atg tta 672Lys Leu Ala Gln Asp His Leu Val Ser Gly Ala Val
Asp Met Met Leu 210 215 220tgt gga gcg acg tgc ttc cca gaa cca ttc
ttc atc ttg tct ggg ttc 720Cys Gly Ala Thr Cys Phe Pro Glu Pro Phe
Phe Ile Leu Ser Gly Phe225 230 235 240tcg act ttt caa gcg atg cct
gnt ggg gca gat gga gtc tca cta cct 768Ser Thr Phe Gln Ala Met Pro
Xaa Gly Ala Asp Gly Val Ser Leu Pro 245 250 255ctc cat aaa acg agt
gct ggg ctc act cca ggt gaa ggg ggg tcc att 816Leu His Lys Thr Ser
Ala Gly Leu Thr Pro Gly Glu Gly Gly Ser Ile 260 265 270atg gtg ctc
aag cga ctg aaa gac gct atc aga gat gga aat cac att 864Met Val Leu
Lys Arg Leu Lys Asp Ala Ile Arg Asp Gly Asn His Ile 275 280 285tat
ggt gtg ctc ctt gaa gca aat tta agt aac gca ggt tgt ggg ctt 912Tyr
Gly Val Leu Leu Glu Ala Asn Leu Ser Asn Ala Gly Cys Gly Leu 290 295
300cca ctc agc ccg cac tta ccg agc gaa gaa tca tgt att cgt gat acc
960Pro Leu Ser Pro His Leu Pro Ser Glu Glu Ser Cys Ile Arg Asp
Thr305 310 315 320tac cgc cgt gct gga gtt gct gca gat caa agt att
cag tat att gag 1008Tyr Arg Arg Ala Gly Val Ala Ala Asp Gln Ser Ile
Gln Tyr Ile Glu 325 330 335tgc cac gct acg gga acc cct cga ggg gat
gtc gtg gaa att gag gcg 1056Cys His Ala Thr Gly Thr Pro Arg Gly Asp
Val Val Glu Ile Glu Ala 340 345 350gtt gaa aga gtt ttc aag aaa aac
gtt cca cgc tta ggc tcg acg aaa 1104Val Glu Arg Val Phe Lys Lys Asn
Val Pro Arg Leu Gly Ser Thr Lys 355 360 365gga aat ttt ggt cac tcg
tta gtt gcg gct ggt ttc gca ggt atg gca 1152Gly Asn Phe Gly His Ser
Leu Val Ala Ala Gly Phe Ala Gly Met Ala 370 375 380aag ctt ctt ctt
gca atg gaa cat gga gtg att cct ccc aca cca ggt 1200Lys Leu Leu Leu
Ala Met Glu His Gly Val Ile Pro Pro Thr Pro Gly385 390 395 400ctt
gat gct tcg aac cag gca agt gag cac gtt gtg aca aag gct atc 1248Leu
Asp Ala Ser Asn Gln Ala Ser Glu His Val Val Thr Lys Ala Ile 405 410
415act tgg cct gag aca cat ggg gct cca aaa cga gct ggc ctt tca gca
1296Thr Trp Pro Glu Thr His Gly Ala Pro Lys Arg Ala Gly Leu Ser Ala
420 425 430ttt gga ttt ggt ggg act aat gcg cat gca ctc ttc gaa gag
ttt aat 1344Phe Gly Phe Gly Gly Thr Asn Ala His Ala Leu Phe Glu Glu
Phe Asn 435 440 445gcc gag ggc ata agt tat cgc cct gga aag cct cca
gtc gaa tcg aat 1392Ala Glu Gly Ile Ser Tyr Arg Pro Gly Lys Pro Pro
Val Glu Ser Asn 450 455 460acc cgt cct tcc gtc gta ata act ggg atg
gac tgt acc ttt ggg agc 1440Thr Arg Pro Ser Val Val Ile Thr Gly Met
Asp Cys Thr Phe Gly Ser465 470 475 480ctt gaa ggg att gat gcg ttc
gag act gcc ctg tac gag ggg cgt gac 1488Leu Glu Gly Ile Asp Ala Phe
Glu Thr Ala Leu Tyr Glu Gly Arg Asp 485 490 495gca gct cgt gac
1500Ala Ala Arg Asp 50054500PRTThraustochytrium
sp.misc_feature(248)..(248)The 'Xaa' at location 248 stands for
Asp, Gly, Ala, or Val. 54Met Gln Leu Pro Pro Ala His Ser Ala Asp
Glu Asn Arg Ile Ala Val1 5 10 15Val Gly Met Ala Val Lys Tyr Ala Gly
Cys Asp Asn Lys Glu Glu Phe 20 25 30Trp Lys Thr Leu Met Asn Gly Ser
Ile Asn Thr Lys Ser Ile Ser Ala 35 40 45Ala Arg Leu Gly Ser Asn Lys
Arg Asp Glu His Tyr Val Pro Glu Arg 50 55 60Ser Lys Tyr Ala Asp Thr
Phe Cys Asn Glu Arg Tyr Gly Cys Ile Gln65 70 75 80Gln Gly Thr Asp
Asn Glu His Asp Leu Leu Leu Gly Leu Ala Gln Glu 85 90 95Ala Leu Ala
Asp Ala Ala Gly Arg Met Glu Lys Gln Pro Ser Glu Ala 100 105 110Phe
Asp Leu Glu Asn Thr Gly Ile Val Ser Gly Cys Leu Ser Phe Pro 115 120
125Met Asp Asn Leu Gln Gly Glu Leu Leu Asn Leu Tyr Gln Ser His Val
130 135 140Glu Lys Gln Leu Pro Pro Ser Ala Leu Val Glu Ala Val Lys
Leu Trp145 150 155 160Ser Glu Arg Gln Lys Ser Thr Lys Ala His Ala
Gly Asp Lys Arg Arg 165 170 175Phe Ile Asp Pro Ala Ser Phe Val Ala
Asp Lys Leu Asn Leu Gly Pro 180 185 190Leu His Tyr Ala Ile Asp Ala
Ala Cys Ala Ser Ala Leu Tyr Val Leu 195 200 205Lys Leu Ala Gln Asp
His Leu Val Ser Gly Ala Val Asp Met Met Leu 210 215 220Cys Gly Ala
Thr Cys Phe Pro Glu Pro Phe Phe Ile Leu Ser Gly Phe225 230 235
240Ser Thr Phe Gln Ala Met Pro Xaa Gly Ala Asp Gly Val Ser Leu Pro
245 250 255Leu His Lys Thr Ser Ala Gly Leu Thr Pro Gly Glu Gly Gly
Ser Ile 260 265 270Met Val Leu Lys Arg Leu Lys Asp Ala Ile Arg Asp
Gly Asn His Ile 275 280 285Tyr Gly Val Leu Leu Glu Ala Asn Leu Ser
Asn Ala Gly Cys Gly Leu 290 295 300Pro Leu Ser Pro His Leu Pro Ser
Glu Glu Ser Cys Ile Arg Asp Thr305 310 315 320Tyr Arg Arg Ala Gly
Val Ala Ala Asp Gln Ser Ile Gln Tyr Ile Glu 325 330 335Cys His Ala
Thr Gly Thr Pro Arg Gly Asp Val Val Glu Ile Glu Ala 340 345 350Val
Glu Arg Val Phe Lys Lys Asn Val Pro Arg Leu Gly Ser Thr Lys 355 360
365Gly Asn Phe Gly His Ser Leu Val Ala Ala Gly Phe Ala Gly Met Ala
370 375 380Lys Leu Leu Leu Ala Met Glu His Gly Val Ile Pro Pro Thr
Pro Gly385 390 395 400Leu Asp Ala Ser Asn Gln Ala Ser Glu His Val
Val Thr Lys Ala Ile 405 410 415Thr Trp Pro Glu Thr His Gly Ala Pro
Lys Arg Ala Gly Leu Ser Ala 420 425 430Phe Gly Phe Gly Gly Thr Asn
Ala His Ala Leu Phe Glu Glu Phe Asn 435 440 445Ala Glu Gly Ile Ser
Tyr Arg Pro Gly Lys Pro Pro Val Glu Ser Asn 450 455 460Thr Arg Pro
Ser Val Val Ile Thr Gly Met Asp Cys Thr Phe Gly Ser465 470 475
480Leu Glu Gly Ile Asp Ala Phe Glu Thr Ala Leu Tyr Glu Gly Arg Asp
485 490 495Ala Ala Arg Asp 500551500DNAThraustochytrium
sp.CDS(1)..(1500) 55tta ccc gcc aaa cgt tgg agg ttc cta ggt gag gac
ttg gag ttt ctc 48Leu Pro Ala Lys Arg Trp Arg Phe Leu Gly Glu Asp
Leu Glu Phe Leu1 5 10 15cga gcc atc agg ctc aag gaa aag cct agg ggt
tgt ttt gtg gag agt 96Arg Ala Ile Arg Leu Lys Glu Lys Pro Arg Gly
Cys Phe Val Glu Ser 20 25 30gtt gac gtt aac ttt aga cgg ctg aaa acg
ccc ttg aca cca gaa gat 144Val Asp Val Asn Phe Arg Arg Leu Lys Thr
Pro Leu Thr Pro Glu Asp 35 40 45atg ttg cgg ccc caa caa ctc ttg gcg
gtt tct acg atg gac cga gca 192Met Leu Arg Pro Gln Gln Leu Leu Ala
Val Ser Thr Met Asp Arg Ala 50 55 60att atc gat gca ggt cta aag aag
ggc caa cat gta gca gtt ctt gtt 240Ile Ile Asp Ala Gly Leu Lys Lys
Gly Gln His Val Ala Val Leu Val65 70 75 80ggc cta gga act gac ctg
gaa ctt tac cgt cat cga gca aga gtc gcg 288Gly Leu Gly Thr Asp Leu
Glu Leu Tyr Arg His Arg Ala Arg Val Ala 85 90 95ctt aaa gag gtt ttg
cac ccg agc tta aag tca gac act gca att ctc 336Leu Lys Glu Val Leu
His Pro Ser Leu Lys Ser Asp Thr Ala Ile Leu 100 105 110cag aaa ata
atg caa tat gtg aat gat gca gga act tcg act tca tac 384Gln Lys Ile
Met Gln Tyr Val Asn Asp Ala Gly Thr Ser Thr Ser Tyr 115 120 125aca
tct tac att gga aac ctc gtt gcc acg cgt att tcg tct cag tgg 432Thr
Ser Tyr Ile Gly Asn Leu Val Ala Thr Arg Ile Ser Ser Gln Trp 130 135
140gga ttc aca ggg ccg tcc ttt act gtc aca gaa gga aat aat tcc gtg
480Gly Phe Thr Gly Pro Ser Phe Thr Val Thr Glu Gly Asn Asn Ser
Val145 150 155 160tac aga tgt gca caa cta gcc aaa gat atg ctt cag
gtt aac cga gtt 528Tyr Arg Cys Ala Gln Leu Ala Lys Asp Met Leu Gln
Val Asn Arg Val 165 170 175gat gct gtc gtc atc gca ggc gtt gat ctc
aac gga agc gcc gaa agt 576Asp Ala Val Val Ile Ala Gly Val Asp Leu
Asn Gly Ser Ala Glu Ser 180 185 190ttt ttt gtc cga gca aat cgt caa
aag ata tcc aag cta agt cat cca 624Phe Phe Val Arg Ala Asn Arg Gln
Lys Ile Ser Lys Leu Ser His Pro 195 200 205tgt gca agc ttc gac aga
gat gca gat gga ttt ttc gca ggt gag ggc 672Cys Ala Ser Phe Asp Arg
Asp Ala Asp Gly Phe Phe Ala Gly Glu Gly 210 215 220tgt ggt gcc cta
gtt ttc aag agg tta gaa gac tgt gct cct cag gaa 720Cys Gly Ala Leu
Val Phe Lys Arg Leu Glu Asp Cys Ala Pro Gln Glu225 230 235 240aaa
att tat gct agt ata gac tct atc gca ata gat aaa gag cct act 768Lys
Ile Tyr Ala Ser Ile Asp Ser Ile Ala Ile Asp Lys Glu Pro Thr 245 250
255agc tca gct gtg aaa gct gtc tac caa agt gat tcg agt ctc tcc gat
816Ser Ser Ala Val Lys Ala Val Tyr Gln Ser Asp Ser Ser Leu Ser Asp
260 265 270att gag ctg tta gaa atc agt gga gac tcc aaa cgg ttt gca
gca ttc 864Ile Glu Leu Leu Glu Ile Ser Gly Asp Ser Lys Arg Phe Ala
Ala Phe 275 280 285gaa ggc gct gtg gaa att caa tca agt gtg gaa gcc
cag cta aaa gga 912Glu Gly Ala Val Glu Ile Gln Ser Ser Val Glu Ala
Gln Leu Lys Gly 290 295 300ctt tcc aaa gtc ctt gaa cct gca aaa ggc
caa ggc gta gcg gtg gga 960Leu Ser Lys Val Leu Glu Pro Ala Lys Gly
Gln Gly Val Ala Val Gly305 310 315 320agt act cga gca acc gtt ggg
gat ata ggg tat gct aca gga gcg gca 1008Ser Thr Arg Ala Thr Val Gly
Asp Ile Gly Tyr Ala Thr Gly Ala Ala 325 330 335agc ctg att aaa act
gca ctc tgc tta tat aat cgc tac ctt ccg gca 1056Ser Leu Ile Lys Thr
Ala Leu Cys Leu Tyr Asn Arg Tyr Leu Pro Ala 340 345 350tta gca aac
tgg agt ggc cca tgt gaa cag tcc gcc tgg ggc tca aac 1104Leu Ala Asn
Trp Ser Gly Pro Cys Glu Gln Ser Ala Trp Gly Ser Asn 355 360 365atg
ttc gtt tgc cat gaa aca cgg ccg tgg atg aaa aac cag aat gaa 1152Met
Phe Val Cys His Glu Thr Arg Pro Trp Met Lys Asn Gln Asn Glu 370 375
380aag aga tgt gcc ctc att tct gga aca gat cca tct cat aca tgc ttt
1200Lys Arg Cys Ala Leu Ile Ser Gly Thr Asp Pro Ser His Thr Cys
Phe385 390 395 400tcc ctc gta cta tcg gat act ggg tgt tat gaa gag
cac aat cga acg 1248Ser Leu Val Leu Ser Asp Thr Gly Cys Tyr Glu Glu
His Asn Arg Thr 405 410 415tgc ttt gat gtg caa gcg cca cag cta gtt
ctg ata cac gga ttc gat 1296Cys Phe Asp Val Gln Ala Pro Gln Leu Val
Leu Ile His Gly Phe Asp 420 425 430gga aaa act att gtg cgg cga ctt
gaa gga tat ctc ctt gaa ctt gtt 1344Gly Lys Thr Ile Val Arg Arg Leu
Glu Gly Tyr Leu Leu Glu Leu Val 435 440 445gaa ggg cat gca agc cct
tca gag tat ttc cac aaa ctg att gga caa 1392Glu Gly His Ala Ser Pro
Ser Glu Tyr Phe His Lys Leu Ile Gly Gln 450 455 460agt cta ctt gag
aac tcg aaa gaa agt aaa ctc aca ctt tcg ctt gtg 1440Ser Leu Leu Glu
Asn Ser Lys Glu Ser Lys Leu Thr Leu Ser Leu Val465 470 475 480tgc
aat ccg aac cag ctc caa aag gag ctc atg ctt gct atc aaa gga 1488Cys
Asn Pro Asn
Gln Leu Gln Lys Glu Leu Met Leu Ala Ile Lys Gly 485 490 495gta caa
cga agc 1500Val Gln Arg Ser 50056500PRTThraustochytrium sp. 56Leu
Pro Ala Lys Arg Trp Arg Phe Leu Gly Glu Asp Leu Glu Phe Leu1 5 10
15Arg Ala Ile Arg Leu Lys Glu Lys Pro Arg Gly Cys Phe Val Glu Ser
20 25 30Val Asp Val Asn Phe Arg Arg Leu Lys Thr Pro Leu Thr Pro Glu
Asp 35 40 45Met Leu Arg Pro Gln Gln Leu Leu Ala Val Ser Thr Met Asp
Arg Ala 50 55 60Ile Ile Asp Ala Gly Leu Lys Lys Gly Gln His Val Ala
Val Leu Val65 70 75 80Gly Leu Gly Thr Asp Leu Glu Leu Tyr Arg His
Arg Ala Arg Val Ala 85 90 95Leu Lys Glu Val Leu His Pro Ser Leu Lys
Ser Asp Thr Ala Ile Leu 100 105 110Gln Lys Ile Met Gln Tyr Val Asn
Asp Ala Gly Thr Ser Thr Ser Tyr 115 120 125Thr Ser Tyr Ile Gly Asn
Leu Val Ala Thr Arg Ile Ser Ser Gln Trp 130 135 140Gly Phe Thr Gly
Pro Ser Phe Thr Val Thr Glu Gly Asn Asn Ser Val145 150 155 160Tyr
Arg Cys Ala Gln Leu Ala Lys Asp Met Leu Gln Val Asn Arg Val 165 170
175Asp Ala Val Val Ile Ala Gly Val Asp Leu Asn Gly Ser Ala Glu Ser
180 185 190Phe Phe Val Arg Ala Asn Arg Gln Lys Ile Ser Lys Leu Ser
His Pro 195 200 205Cys Ala Ser Phe Asp Arg Asp Ala Asp Gly Phe Phe
Ala Gly Glu Gly 210 215 220Cys Gly Ala Leu Val Phe Lys Arg Leu Glu
Asp Cys Ala Pro Gln Glu225 230 235 240Lys Ile Tyr Ala Ser Ile Asp
Ser Ile Ala Ile Asp Lys Glu Pro Thr 245 250 255Ser Ser Ala Val Lys
Ala Val Tyr Gln Ser Asp Ser Ser Leu Ser Asp 260 265 270Ile Glu Leu
Leu Glu Ile Ser Gly Asp Ser Lys Arg Phe Ala Ala Phe 275 280 285Glu
Gly Ala Val Glu Ile Gln Ser Ser Val Glu Ala Gln Leu Lys Gly 290 295
300Leu Ser Lys Val Leu Glu Pro Ala Lys Gly Gln Gly Val Ala Val
Gly305 310 315 320Ser Thr Arg Ala Thr Val Gly Asp Ile Gly Tyr Ala
Thr Gly Ala Ala 325 330 335Ser Leu Ile Lys Thr Ala Leu Cys Leu Tyr
Asn Arg Tyr Leu Pro Ala 340 345 350Leu Ala Asn Trp Ser Gly Pro Cys
Glu Gln Ser Ala Trp Gly Ser Asn 355 360 365Met Phe Val Cys His Glu
Thr Arg Pro Trp Met Lys Asn Gln Asn Glu 370 375 380Lys Arg Cys Ala
Leu Ile Ser Gly Thr Asp Pro Ser His Thr Cys Phe385 390 395 400Ser
Leu Val Leu Ser Asp Thr Gly Cys Tyr Glu Glu His Asn Arg Thr 405 410
415Cys Phe Asp Val Gln Ala Pro Gln Leu Val Leu Ile His Gly Phe Asp
420 425 430Gly Lys Thr Ile Val Arg Arg Leu Glu Gly Tyr Leu Leu Glu
Leu Val 435 440 445Glu Gly His Ala Ser Pro Ser Glu Tyr Phe His Lys
Leu Ile Gly Gln 450 455 460Ser Leu Leu Glu Asn Ser Lys Glu Ser Lys
Leu Thr Leu Ser Leu Val465 470 475 480Cys Asn Pro Asn Gln Leu Gln
Lys Glu Leu Met Leu Ala Ile Lys Gly 485 490 495Val Gln Arg Ser
500571500DNAThraustochytrium sp.CDS(1)..(1500) 57atg tta aca ggg
aag gat tgg gtc agt cca tca gga agt tgt ttt gcc 48Met Leu Thr Gly
Lys Asp Trp Val Ser Pro Ser Gly Ser Cys Phe Ala1 5 10 15cca aat ccg
tta tca agc gca aaa gtg gca ttc atg tac gga gaa ggc 96Pro Asn Pro
Leu Ser Ser Ala Lys Val Ala Phe Met Tyr Gly Glu Gly 20 25 30cga agc
ccg tac tgt ggt gta ggc ttg ggt cta cat cgt ttg tgg ccc 144Arg Ser
Pro Tyr Cys Gly Val Gly Leu Gly Leu His Arg Leu Trp Pro 35 40 45ggt
ctc cat gaa aat gtg aac aat aag aca gtc gat tta tgg acg gaa 192Gly
Leu His Glu Asn Val Asn Asn Lys Thr Val Asp Leu Trp Thr Glu 50 55
60gga gat ggt tgg tta tat cct cga acg ttg aca cga gaa gag cat aca
240Gly Asp Gly Trp Leu Tyr Pro Arg Thr Leu Thr Arg Glu Glu His
Thr65 70 75 80aaa gcc atc gaa tct ttc aac gca aat caa att gaa atg
ttt cgc gct 288Lys Ala Ile Glu Ser Phe Asn Ala Asn Gln Ile Glu Met
Phe Arg Ala 85 90 95ggg att ttc atc tca atg tgt cag aca gac tat gtc
atg aat gtt ctc 336Gly Ile Phe Ile Ser Met Cys Gln Thr Asp Tyr Val
Met Asn Val Leu 100 105 110ggt gtc cag cct aag gcc gga ttt ggg ctg
agc ttg gga gaa att tca 384Gly Val Gln Pro Lys Ala Gly Phe Gly Leu
Ser Leu Gly Glu Ile Ser 115 120 125atg ctc ttt gcg atg tca aag gag
aac tgc agg cag tca cag gaa atg 432Met Leu Phe Ala Met Ser Lys Glu
Asn Cys Arg Gln Ser Gln Glu Met 130 135 140acc aat cgt ttg cgc ggt
tct cca gtg tgg tct aac gag ctt gct atc 480Thr Asn Arg Leu Arg Gly
Ser Pro Val Trp Ser Asn Glu Leu Ala Ile145 150 155 160aac ttc aat
gca att cgc aag tta tgg aaa atc ccc cga gga gct ccc 528Asn Phe Asn
Ala Ile Arg Lys Leu Trp Lys Ile Pro Arg Gly Ala Pro 165 170 175tta
gaa tcc ttt tgg caa gga tac ttg gtt cac ggc aca aga gaa gaa 576Leu
Glu Ser Phe Trp Gln Gly Tyr Leu Val His Gly Thr Arg Glu Glu 180 185
190gta gag cat gct att ggt ctt tct gag cct tat gta cgt ctg ctt att
624Val Glu His Ala Ile Gly Leu Ser Glu Pro Tyr Val Arg Leu Leu Ile
195 200 205gtg aac gat tca agg agt gcc ttg att gct gga aaa cca gac
gcc tgt 672Val Asn Asp Ser Arg Ser Ala Leu Ile Ala Gly Lys Pro Asp
Ala Cys 210 215 220cag gca gta atc agt aga cta aac tcc aag ttc cct
tct ctg ccg gta 720Gln Ala Val Ile Ser Arg Leu Asn Ser Lys Phe Pro
Ser Leu Pro Val225 230 235 240aag caa gga atg att ggt cat tgc cca
gaa gtt cgt gcg ttc atc aaa 768Lys Gln Gly Met Ile Gly His Cys Pro
Glu Val Arg Ala Phe Ile Lys 245 250 255gat att ggg tac atc cat gaa
aca ctc cga att tcc aat gac tat tcg 816Asp Ile Gly Tyr Ile His Glu
Thr Leu Arg Ile Ser Asn Asp Tyr Ser 260 265 270gat tgt cag ctt ttc
tca gcg gta acc aag ggc gca ctt gac agc tcc 864Asp Cys Gln Leu Phe
Ser Ala Val Thr Lys Gly Ala Leu Asp Ser Ser 275 280 285aca atg gaa
atc aaa cac ttt gtg gga gag gtc tac tcc cgg atc gca 912Thr Met Glu
Ile Lys His Phe Val Gly Glu Val Tyr Ser Arg Ile Ala 290 295 300gac
ttt cct caa atc gtc aac acg gtg cat tcg gct ggt tat gac gta 960Asp
Phe Pro Gln Ile Val Asn Thr Val His Ser Ala Gly Tyr Asp Val305 310
315 320ttt ctt gag ctt ggc tgt gat gct tct aga tct gca gca gtt caa
aac 1008Phe Leu Glu Leu Gly Cys Asp Ala Ser Arg Ser Ala Ala Val Gln
Asn 325 330 335att ctt ggt ggt caa gga aag ttc ttg tct aca gct att
gac aaa aaa 1056Ile Leu Gly Gly Gln Gly Lys Phe Leu Ser Thr Ala Ile
Asp Lys Lys 340 345 350gga cac tcc gcc tgg tca caa gta ctt cgg gct
acc gca tca tta gct 1104Gly His Ser Ala Trp Ser Gln Val Leu Arg Ala
Thr Ala Ser Leu Ala 355 360 365gca cat cga gta ccg gga atc tca att
ttg gat ttg ttt cac cca aat 1152Ala His Arg Val Pro Gly Ile Ser Ile
Leu Asp Leu Phe His Pro Asn 370 375 380ttc cga gaa atg tgc tgt aca
atg gca acc aca cct aaa gtg gaa gat 1200Phe Arg Glu Met Cys Cys Thr
Met Ala Thr Thr Pro Lys Val Glu Asp385 390 395 400aag ttc ctg cgc
acg att caa atc aat ggt cgg ttt gaa aaa gaa atg 1248Lys Phe Leu Arg
Thr Ile Gln Ile Asn Gly Arg Phe Glu Lys Glu Met 405 410 415att cac
cta gaa gat aca aca tta agt tgc tta ccc gct cca agt gaa 1296Ile His
Leu Glu Asp Thr Thr Leu Ser Cys Leu Pro Ala Pro Ser Glu 420 425
430gca aat atc gca gct att caa tct cgg tca att cga tct gct gcg gcg
1344Ala Asn Ile Ala Ala Ile Gln Ser Arg Ser Ile Arg Ser Ala Ala Ala
435 440 445cgt tct gga caa tcc cat gat tgt gca tcc cat agc cat gaa
gaa aat 1392Arg Ser Gly Gln Ser His Asp Cys Ala Ser His Ser His Glu
Glu Asn 450 455 460aag gat tca tgc cct gaa aag ctg aag ctt gat tct
gtg tcc gtc gcc 1440Lys Asp Ser Cys Pro Glu Lys Leu Lys Leu Asp Ser
Val Ser Val Ala465 470 475 480ata aat ttc gac aat gat gac cgc att
cag ctt ggg cac gcg ggt ttt 1488Ile Asn Phe Asp Asn Asp Asp Arg Ile
Gln Leu Gly His Ala Gly Phe 485 490 495cgg gag atg tac 1500Arg Glu
Met Tyr 50058500PRTThraustochytrium sp. 58Met Leu Thr Gly Lys Asp
Trp Val Ser Pro Ser Gly Ser Cys Phe Ala1 5 10 15Pro Asn Pro Leu Ser
Ser Ala Lys Val Ala Phe Met Tyr Gly Glu Gly 20 25 30Arg Ser Pro Tyr
Cys Gly Val Gly Leu Gly Leu His Arg Leu Trp Pro 35 40 45Gly Leu His
Glu Asn Val Asn Asn Lys Thr Val Asp Leu Trp Thr Glu 50 55 60Gly Asp
Gly Trp Leu Tyr Pro Arg Thr Leu Thr Arg Glu Glu His Thr65 70 75
80Lys Ala Ile Glu Ser Phe Asn Ala Asn Gln Ile Glu Met Phe Arg Ala
85 90 95Gly Ile Phe Ile Ser Met Cys Gln Thr Asp Tyr Val Met Asn Val
Leu 100 105 110Gly Val Gln Pro Lys Ala Gly Phe Gly Leu Ser Leu Gly
Glu Ile Ser 115 120 125Met Leu Phe Ala Met Ser Lys Glu Asn Cys Arg
Gln Ser Gln Glu Met 130 135 140Thr Asn Arg Leu Arg Gly Ser Pro Val
Trp Ser Asn Glu Leu Ala Ile145 150 155 160Asn Phe Asn Ala Ile Arg
Lys Leu Trp Lys Ile Pro Arg Gly Ala Pro 165 170 175Leu Glu Ser Phe
Trp Gln Gly Tyr Leu Val His Gly Thr Arg Glu Glu 180 185 190Val Glu
His Ala Ile Gly Leu Ser Glu Pro Tyr Val Arg Leu Leu Ile 195 200
205Val Asn Asp Ser Arg Ser Ala Leu Ile Ala Gly Lys Pro Asp Ala Cys
210 215 220Gln Ala Val Ile Ser Arg Leu Asn Ser Lys Phe Pro Ser Leu
Pro Val225 230 235 240Lys Gln Gly Met Ile Gly His Cys Pro Glu Val
Arg Ala Phe Ile Lys 245 250 255Asp Ile Gly Tyr Ile His Glu Thr Leu
Arg Ile Ser Asn Asp Tyr Ser 260 265 270Asp Cys Gln Leu Phe Ser Ala
Val Thr Lys Gly Ala Leu Asp Ser Ser 275 280 285Thr Met Glu Ile Lys
His Phe Val Gly Glu Val Tyr Ser Arg Ile Ala 290 295 300Asp Phe Pro
Gln Ile Val Asn Thr Val His Ser Ala Gly Tyr Asp Val305 310 315
320Phe Leu Glu Leu Gly Cys Asp Ala Ser Arg Ser Ala Ala Val Gln Asn
325 330 335Ile Leu Gly Gly Gln Gly Lys Phe Leu Ser Thr Ala Ile Asp
Lys Lys 340 345 350Gly His Ser Ala Trp Ser Gln Val Leu Arg Ala Thr
Ala Ser Leu Ala 355 360 365Ala His Arg Val Pro Gly Ile Ser Ile Leu
Asp Leu Phe His Pro Asn 370 375 380Phe Arg Glu Met Cys Cys Thr Met
Ala Thr Thr Pro Lys Val Glu Asp385 390 395 400Lys Phe Leu Arg Thr
Ile Gln Ile Asn Gly Arg Phe Glu Lys Glu Met 405 410 415Ile His Leu
Glu Asp Thr Thr Leu Ser Cys Leu Pro Ala Pro Ser Glu 420 425 430Ala
Asn Ile Ala Ala Ile Gln Ser Arg Ser Ile Arg Ser Ala Ala Ala 435 440
445Arg Ser Gly Gln Ser His Asp Cys Ala Ser His Ser His Glu Glu Asn
450 455 460Lys Asp Ser Cys Pro Glu Lys Leu Lys Leu Asp Ser Val Ser
Val Ala465 470 475 480Ile Asn Phe Asp Asn Asp Asp Arg Ile Gln Leu
Gly His Ala Gly Phe 485 490 495Arg Glu Met Tyr
500591305DNAThraustochytrium sp.CDS(1)..(1305) 59aat aca aga tat
agc ttg tac aca ggg gcg atg gca aag gga att gca 48Asn Thr Arg Tyr
Ser Leu Tyr Thr Gly Ala Met Ala Lys Gly Ile Ala1 5 10 15tct gca gat
ctt gtc att gcc gct ggg aaa gag ggc atc cta gct tcc 96Ser Ala Asp
Leu Val Ile Ala Ala Gly Lys Glu Gly Ile Leu Ala Ser 20 25 30tat gga
gct gga gga cta cct ctt gct act gtt cga aag gga ata gac 144Tyr Gly
Ala Gly Gly Leu Pro Leu Ala Thr Val Arg Lys Gly Ile Asp 35 40 45aaa
att caa caa gcc ttg cca agt ggc cca tat gct gta aat ctt att 192Lys
Ile Gln Gln Ala Leu Pro Ser Gly Pro Tyr Ala Val Asn Leu Ile 50 55
60cac tct ccc ttt gac ggc aac ttg gag cag gga aac gtc gat ttg ttc
240His Ser Pro Phe Asp Gly Asn Leu Glu Gln Gly Asn Val Asp Leu
Phe65 70 75 80ttg gaa aag aac gtc cgc gtg gcg gaa tgt tcc gcg ttt
aca acg cta 288Leu Glu Lys Asn Val Arg Val Ala Glu Cys Ser Ala Phe
Thr Thr Leu 85 90 95aca gtg cca gta gta cac tat cgt gct gca ggg ctt
gtt cgg cgc caa 336Thr Val Pro Val Val His Tyr Arg Ala Ala Gly Leu
Val Arg Arg Gln 100 105 110gat gga agc att ttg atc aag aac cga atc
att gct aaa gta tct agg 384Asp Gly Ser Ile Leu Ile Lys Asn Arg Ile
Ile Ala Lys Val Ser Arg 115 120 125aca gaa ctc gct gag atg ttc ctt
cgt ccg gca cct caa atc atc ctc 432Thr Glu Leu Ala Glu Met Phe Leu
Arg Pro Ala Pro Gln Ile Ile Leu 130 135 140gaa aaa ctg gta gca gca
gaa atc att tca tct gac caa gcg cgt atg 480Glu Lys Leu Val Ala Ala
Glu Ile Ile Ser Ser Asp Gln Ala Arg Met145 150 155 160gca gcc aaa
gtt ccc atg gcg gac gac atc gca gtc gaa gcc gac tct 528Ala Ala Lys
Val Pro Met Ala Asp Asp Ile Ala Val Glu Ala Asp Ser 165 170 175ggt
ggg cac acg gat aat cgg cct atg cac gtc att ttg ccc ctg ata 576Gly
Gly His Thr Asp Asn Arg Pro Met His Val Ile Leu Pro Leu Ile 180 185
190att caa ctc cgc aat act ata ctt gca gag tat ggc tgt gcc acg gct
624Ile Gln Leu Arg Asn Thr Ile Leu Ala Glu Tyr Gly Cys Ala Thr Ala
195 200 205ttt cgt acc cgt ata ggc gct gga gga ggc att ggt tgt cct
tca gcg 672Phe Arg Thr Arg Ile Gly Ala Gly Gly Gly Ile Gly Cys Pro
Ser Ala 210 215 220gcc ctc gca gcc ttt gat atg ggt gcg agt ttt gtc
gtg act gga agc 720Ala Leu Ala Ala Phe Asp Met Gly Ala Ser Phe Val
Val Thr Gly Ser225 230 235 240ata aat caa att tgc cgc gag gca ggg
act tgc gat act gtt cgg gag 768Ile Asn Gln Ile Cys Arg Glu Ala Gly
Thr Cys Asp Thr Val Arg Glu 245 250 255cta ctt gcc aac tca agc tac
tcg gac gtg acg atg gcg cca gca gca 816Leu Leu Ala Asn Ser Ser Tyr
Ser Asp Val Thr Met Ala Pro Ala Ala 260 265 270gac atg ttt gac caa
ggt gtg aaa ctc caa gtc tta aaa cga gga acg 864Asp Met Phe Asp Gln
Gly Val Lys Leu Gln Val Leu Lys Arg Gly Thr 275 280 285atg ttt cca
agc aga gca aat aaa ctc cgg aag ctc ttt gtg aac tac 912Met Phe Pro
Ser Arg Ala Asn Lys Leu Arg Lys Leu Phe Val Asn Tyr 290 295 300gaa
tct cta gaa aca ctc ccg tcg aaa gag ttg aaa tac ctg gaa aac 960Glu
Ser Leu Glu Thr Leu Pro Ser Lys Glu Leu Lys Tyr Leu Glu Asn305 310
315 320atc ata ttc aag caa gca gta gac cag gtg tgg gag gaa aca aag
cgc 1008Ile Ile Phe Lys Gln Ala Val Asp Gln Val Trp Glu Glu Thr Lys
Arg 325 330 335ttt tac tgt gaa aaa ctg aac aat cca gat aaa att gca
agg gcc atg 1056Phe Tyr Cys Glu Lys Leu Asn Asn Pro Asp Lys Ile Ala
Arg Ala Met 340 345 350aaa gat cct aaa ttg aag atg tcg ctt tgc ttt
cgg tgg tat ctc tcc 1104Lys Asp Pro Lys Leu Lys Met Ser Leu Cys Phe
Arg Trp Tyr Leu Ser 355 360 365aag agc tct ggg tgg gcc aac gca gga
att aaa tct cgt gca ctc gac 1152Lys Ser Ser Gly Trp Ala Asn Ala Gly
Ile Lys Ser Arg Ala Leu Asp 370 375
380tac cag atc tgg tgt ggc ccg gca atg ggc tcg ttc aac aat ttc gcc
1200Tyr Gln Ile Trp Cys Gly Pro Ala Met Gly Ser Phe Asn Asn Phe
Ala385 390 395 400agc ggc aca tcc ctc gat tgg aaa gtg act ggg gtt
ttc cct ggc gtt 1248Ser Gly Thr Ser Leu Asp Trp Lys Val Thr Gly Val
Phe Pro Gly Val 405 410 415gcg gaa gta aac atg gcc att tta gat ggc
gcg cga gaa cta gct gct 1296Ala Glu Val Asn Met Ala Ile Leu Asp Gly
Ala Arg Glu Leu Ala Ala 420 425 430aaa cga aat 1305Lys Arg Asn
43560435PRTThraustochytrium sp. 60Asn Thr Arg Tyr Ser Leu Tyr Thr
Gly Ala Met Ala Lys Gly Ile Ala1 5 10 15Ser Ala Asp Leu Val Ile Ala
Ala Gly Lys Glu Gly Ile Leu Ala Ser 20 25 30Tyr Gly Ala Gly Gly Leu
Pro Leu Ala Thr Val Arg Lys Gly Ile Asp 35 40 45Lys Ile Gln Gln Ala
Leu Pro Ser Gly Pro Tyr Ala Val Asn Leu Ile 50 55 60His Ser Pro Phe
Asp Gly Asn Leu Glu Gln Gly Asn Val Asp Leu Phe65 70 75 80Leu Glu
Lys Asn Val Arg Val Ala Glu Cys Ser Ala Phe Thr Thr Leu 85 90 95Thr
Val Pro Val Val His Tyr Arg Ala Ala Gly Leu Val Arg Arg Gln 100 105
110Asp Gly Ser Ile Leu Ile Lys Asn Arg Ile Ile Ala Lys Val Ser Arg
115 120 125Thr Glu Leu Ala Glu Met Phe Leu Arg Pro Ala Pro Gln Ile
Ile Leu 130 135 140Glu Lys Leu Val Ala Ala Glu Ile Ile Ser Ser Asp
Gln Ala Arg Met145 150 155 160Ala Ala Lys Val Pro Met Ala Asp Asp
Ile Ala Val Glu Ala Asp Ser 165 170 175Gly Gly His Thr Asp Asn Arg
Pro Met His Val Ile Leu Pro Leu Ile 180 185 190Ile Gln Leu Arg Asn
Thr Ile Leu Ala Glu Tyr Gly Cys Ala Thr Ala 195 200 205Phe Arg Thr
Arg Ile Gly Ala Gly Gly Gly Ile Gly Cys Pro Ser Ala 210 215 220Ala
Leu Ala Ala Phe Asp Met Gly Ala Ser Phe Val Val Thr Gly Ser225 230
235 240Ile Asn Gln Ile Cys Arg Glu Ala Gly Thr Cys Asp Thr Val Arg
Glu 245 250 255Leu Leu Ala Asn Ser Ser Tyr Ser Asp Val Thr Met Ala
Pro Ala Ala 260 265 270Asp Met Phe Asp Gln Gly Val Lys Leu Gln Val
Leu Lys Arg Gly Thr 275 280 285Met Phe Pro Ser Arg Ala Asn Lys Leu
Arg Lys Leu Phe Val Asn Tyr 290 295 300Glu Ser Leu Glu Thr Leu Pro
Ser Lys Glu Leu Lys Tyr Leu Glu Asn305 310 315 320Ile Ile Phe Lys
Gln Ala Val Asp Gln Val Trp Glu Glu Thr Lys Arg 325 330 335Phe Tyr
Cys Glu Lys Leu Asn Asn Pro Asp Lys Ile Ala Arg Ala Met 340 345
350Lys Asp Pro Lys Leu Lys Met Ser Leu Cys Phe Arg Trp Tyr Leu Ser
355 360 365Lys Ser Ser Gly Trp Ala Asn Ala Gly Ile Lys Ser Arg Ala
Leu Asp 370 375 380Tyr Gln Ile Trp Cys Gly Pro Ala Met Gly Ser Phe
Asn Asn Phe Ala385 390 395 400Ser Gly Thr Ser Leu Asp Trp Lys Val
Thr Gly Val Phe Pro Gly Val 405 410 415Ala Glu Val Asn Met Ala Ile
Leu Asp Gly Ala Arg Glu Leu Ala Ala 420 425 430Lys Arg Asn
435614410DNAThraustochytrium sp.CDS(1)..(4410) 61atg ggc ccg cga
gtg gcg tca ggc aag gtg ccg gct tgg gag atg agc 48Met Gly Pro Arg
Val Ala Ser Gly Lys Val Pro Ala Trp Glu Met Ser1 5 10 15aag tcc gag
ctg tgt gat gac cgc acg gta gtc ttt gac tat gag gag 96Lys Ser Glu
Leu Cys Asp Asp Arg Thr Val Val Phe Asp Tyr Glu Glu 20 25 30ctg ctg
gag ttc gct gag ggc gat atc agt aag gtt ttt ggg ccg gag 144Leu Leu
Glu Phe Ala Glu Gly Asp Ile Ser Lys Val Phe Gly Pro Glu 35 40 45ttc
aaa gtg gtg gac ggg ttt agg cgc agg gtg agg ttg ccc gct cga 192Phe
Lys Val Val Asp Gly Phe Arg Arg Arg Val Arg Leu Pro Ala Arg 50 55
60gag tac ctg ctg gtg acc cgg gtt acg ctg atg gat gcc gag gtg ggc
240Glu Tyr Leu Leu Val Thr Arg Val Thr Leu Met Asp Ala Glu Val
Gly65 70 75 80aac ttt cga gtg gga gca cgt atg gtg aca gag tat gac
gta cct gtg 288Asn Phe Arg Val Gly Ala Arg Met Val Thr Glu Tyr Asp
Val Pro Val 85 90 95aac gga gag ctc tcg gaa ggg gga gat gtg ccg tgg
gct gtg ttg gtg 336Asn Gly Glu Leu Ser Glu Gly Gly Asp Val Pro Trp
Ala Val Leu Val 100 105 110gaa gcc ggg cag tgc gac ttg ctg cta att
tct tac atg ggc atc gat 384Glu Ala Gly Gln Cys Asp Leu Leu Leu Ile
Ser Tyr Met Gly Ile Asp 115 120 125ttc cag tgc aaa gga gag cgg gtc
tac cgg ctg ctg aac acc acc ttg 432Phe Gln Cys Lys Gly Glu Arg Val
Tyr Arg Leu Leu Asn Thr Thr Leu 130 135 140acg ttt ttt ggc gtc gcg
aaa gaa ggg gaa acg ctt gtg tac gat att 480Thr Phe Phe Gly Val Ala
Lys Glu Gly Glu Thr Leu Val Tyr Asp Ile145 150 155 160cgc gtc acg
ggt ttc gcc aag agg ccg gac gga gat atc tcc atg ttc 528Arg Val Thr
Gly Phe Ala Lys Arg Pro Asp Gly Asp Ile Ser Met Phe 165 170 175ttt
ttc gaa tat gat tgc tac tgc aat ggc aag ctt ctc atc gaa atg 576Phe
Phe Glu Tyr Asp Cys Tyr Cys Asn Gly Lys Leu Leu Ile Glu Met 180 185
190cga gat ggc tct gca ggc ttc ttc acg gac gaa gag ctc gct gcc ggc
624Arg Asp Gly Ser Ala Gly Phe Phe Thr Asp Glu Glu Leu Ala Ala Gly
195 200 205aaa gga gtg gtc gtc act cgt gca cag caa aac atg cgg gac
aaa att 672Lys Gly Val Val Val Thr Arg Ala Gln Gln Asn Met Arg Asp
Lys Ile 210 215 220gta cgg cag tcc att gag cct ttt gca ctg gcg gct
tgc acg cac aaa 720Val Arg Gln Ser Ile Glu Pro Phe Ala Leu Ala Ala
Cys Thr His Lys225 230 235 240acg act ctg aac gag agt gac atg cag
tcc ctt gtg gag cga aac tgg 768Thr Thr Leu Asn Glu Ser Asp Met Gln
Ser Leu Val Glu Arg Asn Trp 245 250 255gca aac gtt ttt ggc acc agt
aac aag atg gcg gag ctc aac tat aaa 816Ala Asn Val Phe Gly Thr Ser
Asn Lys Met Ala Glu Leu Asn Tyr Lys 260 265 270att tgc gcc agg aaa
atg ctc atg atc gac agg gtt acc cac att gac 864Ile Cys Ala Arg Lys
Met Leu Met Ile Asp Arg Val Thr His Ile Asp 275 280 285cac cac ggt
ggg gcg tat ggc ctc gga cta ctt gtt gga gag aag atc 912His His Gly
Gly Ala Tyr Gly Leu Gly Leu Leu Val Gly Glu Lys Ile 290 295 300ttg
gat cga aac cat tgg tac ttt cct tgt cac ttt gtc aat gat caa 960Leu
Asp Arg Asn His Trp Tyr Phe Pro Cys His Phe Val Asn Asp Gln305 310
315 320gtc atg gca ggg tca ctg gtc agc gat ggt tgc agc cag ctc tta
aaa 1008Val Met Ala Gly Ser Leu Val Ser Asp Gly Cys Ser Gln Leu Leu
Lys 325 330 335ctc tat atg atc tgg ctt ggc ctc cac ctg aaa atg gag
gaa ttt gat 1056Leu Tyr Met Ile Trp Leu Gly Leu His Leu Lys Met Glu
Glu Phe Asp 340 345 350ttt ctc cca gtt agc ggc cac aaa aac aag gtg
cga tgc agg gga caa 1104Phe Leu Pro Val Ser Gly His Lys Asn Lys Val
Arg Cys Arg Gly Gln 355 360 365att tca ccg cat aaa ggc aag ctt gtc
tac gtc atg gaa atc aaa aag 1152Ile Ser Pro His Lys Gly Lys Leu Val
Tyr Val Met Glu Ile Lys Lys 370 375 380atg ggt tac gat caa gca tct
gga agc cca tac gcc atc gcg gac gtt 1200Met Gly Tyr Asp Gln Ala Ser
Gly Ser Pro Tyr Ala Ile Ala Asp Val385 390 395 400gat atc att gac
gtc aac gaa gag ctg ggt caa agt ttt gac atc aac 1248Asp Ile Ile Asp
Val Asn Glu Glu Leu Gly Gln Ser Phe Asp Ile Asn 405 410 415gac ctt
gcg agc tac gga aaa ggt gac ctg agc aaa aaa atc gtg gtt 1296Asp Leu
Ala Ser Tyr Gly Lys Gly Asp Leu Ser Lys Lys Ile Val Val 420 425
430gac ttc aaa gga att gct ttg cag ctc aaa ggc cgc gct ttt tca cgc
1344Asp Phe Lys Gly Ile Ala Leu Gln Leu Lys Gly Arg Ala Phe Ser Arg
435 440 445atg agt tcc agc tcg tcc ttg aac gaa gga tgg caa tgt gtt
cca aaa 1392Met Ser Ser Ser Ser Ser Leu Asn Glu Gly Trp Gln Cys Val
Pro Lys 450 455 460cca agc cag aga atg gaa cac gaa cag ccc cct gct
cac tgc ctt gca 1440Pro Ser Gln Arg Met Glu His Glu Gln Pro Pro Ala
His Cys Leu Ala465 470 475 480agc gac ccc gaa gcc cct tca act gtg
acc tgg cac cca atg tca aag 1488Ser Asp Pro Glu Ala Pro Ser Thr Val
Thr Trp His Pro Met Ser Lys 485 490 495ctt cct ggc aac cct acg ccg
ttc ttc tcc cct tca tct tac cct ccg 1536Leu Pro Gly Asn Pro Thr Pro
Phe Phe Ser Pro Ser Ser Tyr Pro Pro 500 505 510agg gca att tgc ttc
atc cct ttc ccg ggc aat ccc ctt gac aac aac 1584Arg Ala Ile Cys Phe
Ile Pro Phe Pro Gly Asn Pro Leu Asp Asn Asn 515 520 525tgc aag gct
gga gaa atg ccc ctg aac tgg tac aac atg tca gag ttc 1632Cys Lys Ala
Gly Glu Met Pro Leu Asn Trp Tyr Asn Met Ser Glu Phe 530 535 540atg
tgt ggc aag gtt tct aac tgc ttg ggc cca gaa ttc gca cgc ttt 1680Met
Cys Gly Lys Val Ser Asn Cys Leu Gly Pro Glu Phe Ala Arg Phe545 550
555 560gac aag tcg aac acc agc cgg agc cct gct ttt gac ttg gct ctg
gtg 1728Asp Lys Ser Asn Thr Ser Arg Ser Pro Ala Phe Asp Leu Ala Leu
Val 565 570 575acc cga gtt gtt gaa gtc aca aac atg gaa cac ggc aag
ttt cta aac 1776Thr Arg Val Val Glu Val Thr Asn Met Glu His Gly Lys
Phe Leu Asn 580 585 590gtt gat tgc aat cca agc aaa ggc aca atg gtg
ggg gag ttt gac tgt 1824Val Asp Cys Asn Pro Ser Lys Gly Thr Met Val
Gly Glu Phe Asp Cys 595 600 605ccc caa gac gcg tgg ttc ttt gat ggt
tcg tgc aac gac ggc cat atg 1872Pro Gln Asp Ala Trp Phe Phe Asp Gly
Ser Cys Asn Asp Gly His Met 610 615 620ccg tat tcc att atc atg gaa
atc gga ctg caa acc tca ggt gtt ctc 1920Pro Tyr Ser Ile Ile Met Glu
Ile Gly Leu Gln Thr Ser Gly Val Leu625 630 635 640acc tcg gtg ttg
aag gca ccg ctg act atg gac aag gat gac att ctc 1968Thr Ser Val Leu
Lys Ala Pro Leu Thr Met Asp Lys Asp Asp Ile Leu 645 650 655ttt cga
aac ctc gat gca agt gct gaa atg gtg cgt cca gac gtg gat 2016Phe Arg
Asn Leu Asp Ala Ser Ala Glu Met Val Arg Pro Asp Val Asp 660 665
670gtt cgc ggc aaa acg att cga aac gtg acc aag tgt acc ggc tat gca
2064Val Arg Gly Lys Thr Ile Arg Asn Val Thr Lys Cys Thr Gly Tyr Ala
675 680 685atg ttg gga aag atg ggg att cac cgg ttc acg ttt gag ttg
agc gtt 2112Met Leu Gly Lys Met Gly Ile His Arg Phe Thr Phe Glu Leu
Ser Val 690 695 700gac ggc gtg gta ttt tat aaa gga tcc act tcc ttt
gga tgg ttc act 2160Asp Gly Val Val Phe Tyr Lys Gly Ser Thr Ser Phe
Gly Trp Phe Thr705 710 715 720ccc gag gtg ttt gct cag caa gct gga
ctc gac aac ggg aaa aag acg 2208Pro Glu Val Phe Ala Gln Gln Ala Gly
Leu Asp Asn Gly Lys Lys Thr 725 730 735gag ccc tgg tgc aag act aac
aac acc tcg gtt cga aga gtt gaa atc 2256Glu Pro Trp Cys Lys Thr Asn
Asn Thr Ser Val Arg Arg Val Glu Ile 740 745 750gca tcc gcc aaa gga
aaa gag cag ctg act gag aag ctt ccc gac gca 2304Ala Ser Ala Lys Gly
Lys Glu Gln Leu Thr Glu Lys Leu Pro Asp Ala 755 760 765act aat gct
caa gtt ctt cgg cgt tca gag cag tgt gaa tac ctc gat 2352Thr Asn Ala
Gln Val Leu Arg Arg Ser Glu Gln Cys Glu Tyr Leu Asp 770 775 780tac
ctc aat att gcc cct gac tct ggg ctg cat ggg aag ggc tac gcc 2400Tyr
Leu Asn Ile Ala Pro Asp Ser Gly Leu His Gly Lys Gly Tyr Ala785 790
795 800cac gga cac aaa gac gtt aac ccg caa gac tgg ttc ttc tct tgc
cac 2448His Gly His Lys Asp Val Asn Pro Gln Asp Trp Phe Phe Ser Cys
His 805 810 815ttt tgg ttc gat cct gta atg cca gga tct tta gga att
gaa tca atg 2496Phe Trp Phe Asp Pro Val Met Pro Gly Ser Leu Gly Ile
Glu Ser Met 820 825 830ttc cag ctt atc gag gcc ttt gcg gtg gac caa
aac att cct gga gag 2544Phe Gln Leu Ile Glu Ala Phe Ala Val Asp Gln
Asn Ile Pro Gly Glu 835 840 845tac aac gta tcc aat ccg acc ttt gcc
cat gca cca ggc aaa acg gcg 2592Tyr Asn Val Ser Asn Pro Thr Phe Ala
His Ala Pro Gly Lys Thr Ala 850 855 860tgg aaa tac cga ggc cag ctc
aca cca aag aac cgt gcg atg gac tgc 2640Trp Lys Tyr Arg Gly Gln Leu
Thr Pro Lys Asn Arg Ala Met Asp Cys865 870 875 880gag gtg cat atc
gtt tca att acc gcc tcc ccc gag aac ggg ggc tac 2688Glu Val His Ile
Val Ser Ile Thr Ala Ser Pro Glu Asn Gly Gly Tyr 885 890 895gtt gac
atc gtg gcc gat gga gcg ctt tgg gta gat gga ctt cgc gtg 2736Val Asp
Ile Val Ala Asp Gly Ala Leu Trp Val Asp Gly Leu Arg Val 900 905
910tac gaa gcc aaa gag ctt cga gtt cgt gtc gtt tcg gca aaa cct caa
2784Tyr Glu Ala Lys Glu Leu Arg Val Arg Val Val Ser Ala Lys Pro Gln
915 920 925gca att ccg gat gta caa caa cag cca cct agc gca aag gcg
gac ccg 2832Ala Ile Pro Asp Val Gln Gln Gln Pro Pro Ser Ala Lys Ala
Asp Pro 930 935 940ggg aaa aca gga gtt gca ctt tcg ccc act cag cta
cgc gac gtc ctg 2880Gly Lys Thr Gly Val Ala Leu Ser Pro Thr Gln Leu
Arg Asp Val Leu945 950 955 960ctt gaa gtg gac aat cca ttg tat ctt
ggt gta gag aac tcc aat ttg 2928Leu Glu Val Asp Asn Pro Leu Tyr Leu
Gly Val Glu Asn Ser Asn Leu 965 970 975gtg cag ttt gag tcg aaa cct
gca act tct tca cgt atc gtt tcg atc 2976Val Gln Phe Glu Ser Lys Pro
Ala Thr Ser Ser Arg Ile Val Ser Ile 980 985 990aaa ccg tgc tcg att
agt gac ctt ggc gat aag tct ttt atg gaa acg 3024Lys Pro Cys Ser Ile
Ser Asp Leu Gly Asp Lys Ser Phe Met Glu Thr 995 1000 1005tac aac
gtg tca gca cct ctg tat act gga gca atg gcc aag ggc 3069Tyr Asn Val
Ser Ala Pro Leu Tyr Thr Gly Ala Met Ala Lys Gly 1010 1015 1020att
gca tcc gcc gac ttg gtc att gct gct ggg aaa cgc aag ata 3114Ile Ala
Ser Ala Asp Leu Val Ile Ala Ala Gly Lys Arg Lys Ile 1025 1030
1035ctt gga tcg ttt ggt gcg gga ggg ctg cct att tcc ata gtc cgt
3159Leu Gly Ser Phe Gly Ala Gly Gly Leu Pro Ile Ser Ile Val Arg
1040 1045 1050gaa gca ctg gag aaa att caa caa cac ctg ccc cac ggc
ccc tac 3204Glu Ala Leu Glu Lys Ile Gln Gln His Leu Pro His Gly Pro
Tyr 1055 1060 1065gct gtt aac ctc att cac tcg cct ttc gac agc aac
ttg gaa aag 3249Ala Val Asn Leu Ile His Ser Pro Phe Asp Ser Asn Leu
Glu Lys 1070 1075 1080ggc aac gtt gac ctc ttt ctc gag atg ggc gtg
aca gtg gta gaa 3294Gly Asn Val Asp Leu Phe Leu Glu Met Gly Val Thr
Val Val Glu 1085 1090 1095tgc agc gcg ttc atg gaa ctc acg gcc cag
gtt gtc cgg tac cgc 3339Cys Ser Ala Phe Met Glu Leu Thr Ala Gln Val
Val Arg Tyr Arg 1100 1105 1110gcg tct ggt cta agc aaa agt gcg gac
ggt tcg att cgc att gct 3384Ala Ser Gly Leu Ser Lys Ser Ala Asp Gly
Ser Ile Arg Ile Ala 1115 1120 1125cac cgt att att ggc aag gtt tcc
aga acc gag ctg gca gaa atg 3429His Arg Ile Ile Gly Lys Val Ser Arg
Thr Glu Leu Ala Glu Met 1130 1135 1140ttt att cgt cca gca cca cag
cac ctc ctc caa aaa ctc gta gcc 3474Phe Ile Arg Pro Ala Pro Gln His
Leu Leu Gln Lys Leu Val Ala 1145 1150 1155tcc ggc gag ctg aca gct
gag caa gcc gag ctt gca aca cag gtt 3519Ser Gly Glu Leu Thr Ala Glu
Gln Ala Glu Leu Ala Thr Gln Val 1160 1165 1170ccg gtg gcg gat gac
att gcg gtc gaa gcc gac tcg ggg ggg cat 3564Pro Val Ala Asp Asp Ile
Ala Val Glu Ala Asp Ser Gly Gly His 1175 1180 1185acc gac aac agg
cct att cac gtc att ctt cct cta atc atc aac 3609Thr Asp Asn Arg Pro
Ile His Val Ile Leu Pro Leu Ile Ile
Asn 1190 1195 1200cta cgc aac cgt ttg cat aaa gag ctt gac tac cct
tcg cat ctc 3654Leu Arg Asn Arg Leu His Lys Glu Leu Asp Tyr Pro Ser
His Leu 1205 1210 1215cgg gta cgt gtg ggt gct ggt ggt ggt att gga
tgt cct caa gcc 3699Arg Val Arg Val Gly Ala Gly Gly Gly Ile Gly Cys
Pro Gln Ala 1220 1225 1230gct ctt gca gca ttt caa atg ggg gca gcg
ttt tta atc act gga 3744Ala Leu Ala Ala Phe Gln Met Gly Ala Ala Phe
Leu Ile Thr Gly 1235 1240 1245acg gtg aac cag ctt gct cgt gaa agt
ggc act tgt gac aac gtc 3789Thr Val Asn Gln Leu Ala Arg Glu Ser Gly
Thr Cys Asp Asn Val 1250 1255 1260cgg tta cag ctc tca aag gcc acg
tat agc gac gtg tgt atg gct 3834Arg Leu Gln Leu Ser Lys Ala Thr Tyr
Ser Asp Val Cys Met Ala 1265 1270 1275cct gct gcc gat atg ttt gac
caa ggc gtg gag ctg caa gta ttg 3879Pro Ala Ala Asp Met Phe Asp Gln
Gly Val Glu Leu Gln Val Leu 1280 1285 1290aag aaa ggc acg ctg ttc
cca agt cgt gct aag aag ctg tac gag 3924Lys Lys Gly Thr Leu Phe Pro
Ser Arg Ala Lys Lys Leu Tyr Glu 1295 1300 1305ctg ttc tgc aag tat
gac tcg ttt gag gca atg ccg gct gaa gaa 3969Leu Phe Cys Lys Tyr Asp
Ser Phe Glu Ala Met Pro Ala Glu Glu 1310 1315 1320ttg caa cgg gtt
gaa aag cgg att ttt caa aag tcg ctt gct gaa 4014Leu Gln Arg Val Glu
Lys Arg Ile Phe Gln Lys Ser Leu Ala Glu 1325 1330 1335gtt tgg cag
gag acc agt gac ttt tac att cat cgt atc aag aac 4059Val Trp Gln Glu
Thr Ser Asp Phe Tyr Ile His Arg Ile Lys Asn 1340 1345 1350cct gag
aaa atc aat cgt gct gca agc gat ggc aaa ctg aaa atg 4104Pro Glu Lys
Ile Asn Arg Ala Ala Ser Asp Gly Lys Leu Lys Met 1355 1360 1365tcg
ctt tgc ttt cgc tgg tac ctt ggg ctt tcc tca ttt tgg gcc 4149Ser Leu
Cys Phe Arg Trp Tyr Leu Gly Leu Ser Ser Phe Trp Ala 1370 1375
1380aac tct ggg gca caa gat cgc gtc atg gac tat caa att tgg tgt
4194Asn Ser Gly Ala Gln Asp Arg Val Met Asp Tyr Gln Ile Trp Cys
1385 1390 1395ggc cct gct att ggc gct ttc aat gat ttt acc aag ggc
acg tac 4239Gly Pro Ala Ile Gly Ala Phe Asn Asp Phe Thr Lys Gly Thr
Tyr 1400 1405 1410ctt gac gtg act gtt gca aag agt tac cct tgt gtg
gca cag atc 4284Leu Asp Val Thr Val Ala Lys Ser Tyr Pro Cys Val Ala
Gln Ile 1415 1420 1425aat ttg caa att ttg caa gga gct gcg tat ctg
aaa cgc ctt ggt 4329Asn Leu Gln Ile Leu Gln Gly Ala Ala Tyr Leu Lys
Arg Leu Gly 1430 1435 1440gtc att cgt ttt gac cgc atg ctg ctg cag
gcc gtc gat atc gac 4374Val Ile Arg Phe Asp Arg Met Leu Leu Gln Ala
Val Asp Ile Asp 1445 1450 1455gat cct gta ttt act tac gtg ccg acc
cag cca ctt 4410Asp Pro Val Phe Thr Tyr Val Pro Thr Gln Pro Leu
1460 1465 1470621470PRTThraustochytrium sp. 62Met Gly Pro Arg Val
Ala Ser Gly Lys Val Pro Ala Trp Glu Met Ser1 5 10 15Lys Ser Glu Leu
Cys Asp Asp Arg Thr Val Val Phe Asp Tyr Glu Glu 20 25 30Leu Leu Glu
Phe Ala Glu Gly Asp Ile Ser Lys Val Phe Gly Pro Glu 35 40 45Phe Lys
Val Val Asp Gly Phe Arg Arg Arg Val Arg Leu Pro Ala Arg 50 55 60Glu
Tyr Leu Leu Val Thr Arg Val Thr Leu Met Asp Ala Glu Val Gly65 70 75
80Asn Phe Arg Val Gly Ala Arg Met Val Thr Glu Tyr Asp Val Pro Val
85 90 95Asn Gly Glu Leu Ser Glu Gly Gly Asp Val Pro Trp Ala Val Leu
Val 100 105 110Glu Ala Gly Gln Cys Asp Leu Leu Leu Ile Ser Tyr Met
Gly Ile Asp 115 120 125Phe Gln Cys Lys Gly Glu Arg Val Tyr Arg Leu
Leu Asn Thr Thr Leu 130 135 140Thr Phe Phe Gly Val Ala Lys Glu Gly
Glu Thr Leu Val Tyr Asp Ile145 150 155 160Arg Val Thr Gly Phe Ala
Lys Arg Pro Asp Gly Asp Ile Ser Met Phe 165 170 175Phe Phe Glu Tyr
Asp Cys Tyr Cys Asn Gly Lys Leu Leu Ile Glu Met 180 185 190Arg Asp
Gly Ser Ala Gly Phe Phe Thr Asp Glu Glu Leu Ala Ala Gly 195 200
205Lys Gly Val Val Val Thr Arg Ala Gln Gln Asn Met Arg Asp Lys Ile
210 215 220Val Arg Gln Ser Ile Glu Pro Phe Ala Leu Ala Ala Cys Thr
His Lys225 230 235 240Thr Thr Leu Asn Glu Ser Asp Met Gln Ser Leu
Val Glu Arg Asn Trp 245 250 255Ala Asn Val Phe Gly Thr Ser Asn Lys
Met Ala Glu Leu Asn Tyr Lys 260 265 270Ile Cys Ala Arg Lys Met Leu
Met Ile Asp Arg Val Thr His Ile Asp 275 280 285His His Gly Gly Ala
Tyr Gly Leu Gly Leu Leu Val Gly Glu Lys Ile 290 295 300Leu Asp Arg
Asn His Trp Tyr Phe Pro Cys His Phe Val Asn Asp Gln305 310 315
320Val Met Ala Gly Ser Leu Val Ser Asp Gly Cys Ser Gln Leu Leu Lys
325 330 335Leu Tyr Met Ile Trp Leu Gly Leu His Leu Lys Met Glu Glu
Phe Asp 340 345 350Phe Leu Pro Val Ser Gly His Lys Asn Lys Val Arg
Cys Arg Gly Gln 355 360 365Ile Ser Pro His Lys Gly Lys Leu Val Tyr
Val Met Glu Ile Lys Lys 370 375 380Met Gly Tyr Asp Gln Ala Ser Gly
Ser Pro Tyr Ala Ile Ala Asp Val385 390 395 400Asp Ile Ile Asp Val
Asn Glu Glu Leu Gly Gln Ser Phe Asp Ile Asn 405 410 415Asp Leu Ala
Ser Tyr Gly Lys Gly Asp Leu Ser Lys Lys Ile Val Val 420 425 430Asp
Phe Lys Gly Ile Ala Leu Gln Leu Lys Gly Arg Ala Phe Ser Arg 435 440
445Met Ser Ser Ser Ser Ser Leu Asn Glu Gly Trp Gln Cys Val Pro Lys
450 455 460Pro Ser Gln Arg Met Glu His Glu Gln Pro Pro Ala His Cys
Leu Ala465 470 475 480Ser Asp Pro Glu Ala Pro Ser Thr Val Thr Trp
His Pro Met Ser Lys 485 490 495Leu Pro Gly Asn Pro Thr Pro Phe Phe
Ser Pro Ser Ser Tyr Pro Pro 500 505 510Arg Ala Ile Cys Phe Ile Pro
Phe Pro Gly Asn Pro Leu Asp Asn Asn 515 520 525Cys Lys Ala Gly Glu
Met Pro Leu Asn Trp Tyr Asn Met Ser Glu Phe 530 535 540Met Cys Gly
Lys Val Ser Asn Cys Leu Gly Pro Glu Phe Ala Arg Phe545 550 555
560Asp Lys Ser Asn Thr Ser Arg Ser Pro Ala Phe Asp Leu Ala Leu Val
565 570 575Thr Arg Val Val Glu Val Thr Asn Met Glu His Gly Lys Phe
Leu Asn 580 585 590Val Asp Cys Asn Pro Ser Lys Gly Thr Met Val Gly
Glu Phe Asp Cys 595 600 605Pro Gln Asp Ala Trp Phe Phe Asp Gly Ser
Cys Asn Asp Gly His Met 610 615 620Pro Tyr Ser Ile Ile Met Glu Ile
Gly Leu Gln Thr Ser Gly Val Leu625 630 635 640Thr Ser Val Leu Lys
Ala Pro Leu Thr Met Asp Lys Asp Asp Ile Leu 645 650 655Phe Arg Asn
Leu Asp Ala Ser Ala Glu Met Val Arg Pro Asp Val Asp 660 665 670Val
Arg Gly Lys Thr Ile Arg Asn Val Thr Lys Cys Thr Gly Tyr Ala 675 680
685Met Leu Gly Lys Met Gly Ile His Arg Phe Thr Phe Glu Leu Ser Val
690 695 700Asp Gly Val Val Phe Tyr Lys Gly Ser Thr Ser Phe Gly Trp
Phe Thr705 710 715 720Pro Glu Val Phe Ala Gln Gln Ala Gly Leu Asp
Asn Gly Lys Lys Thr 725 730 735Glu Pro Trp Cys Lys Thr Asn Asn Thr
Ser Val Arg Arg Val Glu Ile 740 745 750Ala Ser Ala Lys Gly Lys Glu
Gln Leu Thr Glu Lys Leu Pro Asp Ala 755 760 765Thr Asn Ala Gln Val
Leu Arg Arg Ser Glu Gln Cys Glu Tyr Leu Asp 770 775 780Tyr Leu Asn
Ile Ala Pro Asp Ser Gly Leu His Gly Lys Gly Tyr Ala785 790 795
800His Gly His Lys Asp Val Asn Pro Gln Asp Trp Phe Phe Ser Cys His
805 810 815Phe Trp Phe Asp Pro Val Met Pro Gly Ser Leu Gly Ile Glu
Ser Met 820 825 830Phe Gln Leu Ile Glu Ala Phe Ala Val Asp Gln Asn
Ile Pro Gly Glu 835 840 845Tyr Asn Val Ser Asn Pro Thr Phe Ala His
Ala Pro Gly Lys Thr Ala 850 855 860Trp Lys Tyr Arg Gly Gln Leu Thr
Pro Lys Asn Arg Ala Met Asp Cys865 870 875 880Glu Val His Ile Val
Ser Ile Thr Ala Ser Pro Glu Asn Gly Gly Tyr 885 890 895Val Asp Ile
Val Ala Asp Gly Ala Leu Trp Val Asp Gly Leu Arg Val 900 905 910Tyr
Glu Ala Lys Glu Leu Arg Val Arg Val Val Ser Ala Lys Pro Gln 915 920
925Ala Ile Pro Asp Val Gln Gln Gln Pro Pro Ser Ala Lys Ala Asp Pro
930 935 940Gly Lys Thr Gly Val Ala Leu Ser Pro Thr Gln Leu Arg Asp
Val Leu945 950 955 960Leu Glu Val Asp Asn Pro Leu Tyr Leu Gly Val
Glu Asn Ser Asn Leu 965 970 975Val Gln Phe Glu Ser Lys Pro Ala Thr
Ser Ser Arg Ile Val Ser Ile 980 985 990Lys Pro Cys Ser Ile Ser Asp
Leu Gly Asp Lys Ser Phe Met Glu Thr 995 1000 1005Tyr Asn Val Ser
Ala Pro Leu Tyr Thr Gly Ala Met Ala Lys Gly 1010 1015 1020Ile Ala
Ser Ala Asp Leu Val Ile Ala Ala Gly Lys Arg Lys Ile 1025 1030
1035Leu Gly Ser Phe Gly Ala Gly Gly Leu Pro Ile Ser Ile Val Arg
1040 1045 1050Glu Ala Leu Glu Lys Ile Gln Gln His Leu Pro His Gly
Pro Tyr 1055 1060 1065Ala Val Asn Leu Ile His Ser Pro Phe Asp Ser
Asn Leu Glu Lys 1070 1075 1080Gly Asn Val Asp Leu Phe Leu Glu Met
Gly Val Thr Val Val Glu 1085 1090 1095Cys Ser Ala Phe Met Glu Leu
Thr Ala Gln Val Val Arg Tyr Arg 1100 1105 1110Ala Ser Gly Leu Ser
Lys Ser Ala Asp Gly Ser Ile Arg Ile Ala 1115 1120 1125His Arg Ile
Ile Gly Lys Val Ser Arg Thr Glu Leu Ala Glu Met 1130 1135 1140Phe
Ile Arg Pro Ala Pro Gln His Leu Leu Gln Lys Leu Val Ala 1145 1150
1155Ser Gly Glu Leu Thr Ala Glu Gln Ala Glu Leu Ala Thr Gln Val
1160 1165 1170Pro Val Ala Asp Asp Ile Ala Val Glu Ala Asp Ser Gly
Gly His 1175 1180 1185Thr Asp Asn Arg Pro Ile His Val Ile Leu Pro
Leu Ile Ile Asn 1190 1195 1200Leu Arg Asn Arg Leu His Lys Glu Leu
Asp Tyr Pro Ser His Leu 1205 1210 1215Arg Val Arg Val Gly Ala Gly
Gly Gly Ile Gly Cys Pro Gln Ala 1220 1225 1230Ala Leu Ala Ala Phe
Gln Met Gly Ala Ala Phe Leu Ile Thr Gly 1235 1240 1245Thr Val Asn
Gln Leu Ala Arg Glu Ser Gly Thr Cys Asp Asn Val 1250 1255 1260Arg
Leu Gln Leu Ser Lys Ala Thr Tyr Ser Asp Val Cys Met Ala 1265 1270
1275Pro Ala Ala Asp Met Phe Asp Gln Gly Val Glu Leu Gln Val Leu
1280 1285 1290Lys Lys Gly Thr Leu Phe Pro Ser Arg Ala Lys Lys Leu
Tyr Glu 1295 1300 1305Leu Phe Cys Lys Tyr Asp Ser Phe Glu Ala Met
Pro Ala Glu Glu 1310 1315 1320Leu Gln Arg Val Glu Lys Arg Ile Phe
Gln Lys Ser Leu Ala Glu 1325 1330 1335Val Trp Gln Glu Thr Ser Asp
Phe Tyr Ile His Arg Ile Lys Asn 1340 1345 1350Pro Glu Lys Ile Asn
Arg Ala Ala Ser Asp Gly Lys Leu Lys Met 1355 1360 1365Ser Leu Cys
Phe Arg Trp Tyr Leu Gly Leu Ser Ser Phe Trp Ala 1370 1375 1380Asn
Ser Gly Ala Gln Asp Arg Val Met Asp Tyr Gln Ile Trp Cys 1385 1390
1395Gly Pro Ala Ile Gly Ala Phe Asn Asp Phe Thr Lys Gly Thr Tyr
1400 1405 1410Leu Asp Val Thr Val Ala Lys Ser Tyr Pro Cys Val Ala
Gln Ile 1415 1420 1425Asn Leu Gln Ile Leu Gln Gly Ala Ala Tyr Leu
Lys Arg Leu Gly 1430 1435 1440Val Ile Arg Phe Asp Arg Met Leu Leu
Gln Ala Val Asp Ile Asp 1445 1450 1455Asp Pro Val Phe Thr Tyr Val
Pro Thr Gln Pro Leu 1460 1465 1470631500DNAThraustochytrium
sp.CDS(1)..(1500) 63atg ggc ccg cga gtg gcg tca ggc aag gtg ccg gct
tgg gag atg agc 48Met Gly Pro Arg Val Ala Ser Gly Lys Val Pro Ala
Trp Glu Met Ser1 5 10 15aag tcc gag ctg tgt gat gac cgc acg gta gtc
ttt gac tat gag gag 96Lys Ser Glu Leu Cys Asp Asp Arg Thr Val Val
Phe Asp Tyr Glu Glu 20 25 30ctg ctg gag ttc gct gag ggc gat atc agt
aag gtt ttt ggg ccg gag 144Leu Leu Glu Phe Ala Glu Gly Asp Ile Ser
Lys Val Phe Gly Pro Glu 35 40 45ttc aaa gtg gtg gac ggg ttt agg cgc
agg gtg agg ttg ccc gct cga 192Phe Lys Val Val Asp Gly Phe Arg Arg
Arg Val Arg Leu Pro Ala Arg 50 55 60gag tac ctg ctg gtg acc cgg gtt
acg ctg atg gat gcc gag gtg ggc 240Glu Tyr Leu Leu Val Thr Arg Val
Thr Leu Met Asp Ala Glu Val Gly65 70 75 80aac ttt cga gtg gga gca
cgt atg gtg aca gag tat gac gta cct gtg 288Asn Phe Arg Val Gly Ala
Arg Met Val Thr Glu Tyr Asp Val Pro Val 85 90 95aac gga gag ctc tcg
gaa ggg gga gat gtg ccg tgg gct gtg ttg gtg 336Asn Gly Glu Leu Ser
Glu Gly Gly Asp Val Pro Trp Ala Val Leu Val 100 105 110gaa gcc ggg
cag tgc gac ttg ctg cta att tct tac atg ggc atc gat 384Glu Ala Gly
Gln Cys Asp Leu Leu Leu Ile Ser Tyr Met Gly Ile Asp 115 120 125ttc
cag tgc aaa gga gag cgg gtc tac cgg ctg ctg aac acc acc ttg 432Phe
Gln Cys Lys Gly Glu Arg Val Tyr Arg Leu Leu Asn Thr Thr Leu 130 135
140acg ttt ttt ggc gtc gcg aaa gaa ggg gaa acg ctt gtg tac gat att
480Thr Phe Phe Gly Val Ala Lys Glu Gly Glu Thr Leu Val Tyr Asp
Ile145 150 155 160cgc gtc acg ggt ttc gcc aag agg ccg gac gga gat
atc tcc atg ttc 528Arg Val Thr Gly Phe Ala Lys Arg Pro Asp Gly Asp
Ile Ser Met Phe 165 170 175ttt ttc gaa tat gat tgc tac tgc aat ggc
aag ctt ctc atc gaa atg 576Phe Phe Glu Tyr Asp Cys Tyr Cys Asn Gly
Lys Leu Leu Ile Glu Met 180 185 190cga gat ggc tct gca ggc ttc ttc
acg gac gaa gag ctc gct gcc ggc 624Arg Asp Gly Ser Ala Gly Phe Phe
Thr Asp Glu Glu Leu Ala Ala Gly 195 200 205aaa gga gtg gtc gtc act
cgt gca cag caa aac atg cgg gac aaa att 672Lys Gly Val Val Val Thr
Arg Ala Gln Gln Asn Met Arg Asp Lys Ile 210 215 220gta cgg cag tcc
att gag cct ttt gca ctg gcg gct tgc acg cac aaa 720Val Arg Gln Ser
Ile Glu Pro Phe Ala Leu Ala Ala Cys Thr His Lys225 230 235 240acg
act ctg aac gag agt gac atg cag tcc ctt gtg gag cga aac tgg 768Thr
Thr Leu Asn Glu Ser Asp Met Gln Ser Leu Val Glu Arg Asn Trp 245 250
255gca aac gtt ttt ggc acc agt aac aag atg gcg gag ctc aac tat aaa
816Ala Asn Val Phe Gly Thr Ser Asn Lys Met Ala Glu Leu Asn Tyr Lys
260 265 270att tgc gcc agg aaa atg ctc atg atc gac agg gtt acc cac
att gac 864Ile Cys Ala Arg Lys Met Leu Met Ile Asp Arg Val Thr His
Ile Asp 275 280 285cac cac ggt ggg gcg tat ggc ctc gga cta ctt gtt
gga gag aag atc 912His His Gly Gly Ala Tyr Gly Leu Gly Leu Leu Val
Gly Glu Lys Ile 290 295 300ttg gat cga aac cat tgg tac ttt cct tgt
cac ttt gtc aat gat caa 960Leu Asp Arg Asn His Trp Tyr Phe Pro Cys
His Phe Val Asn Asp Gln305 310 315 320gtc atg gca ggg tca ctg gtc
agc gat ggt tgc agc cag ctc tta aaa 1008Val Met Ala Gly
Ser Leu Val Ser Asp Gly Cys Ser Gln Leu Leu Lys 325 330 335ctc tat
atg atc tgg ctt ggc ctc cac ctg aaa atg gag gaa ttt gat 1056Leu Tyr
Met Ile Trp Leu Gly Leu His Leu Lys Met Glu Glu Phe Asp 340 345
350ttt ctc cca gtt agc ggc cac aaa aac aag gtg cga tgc agg gga caa
1104Phe Leu Pro Val Ser Gly His Lys Asn Lys Val Arg Cys Arg Gly Gln
355 360 365att tca ccg cat aaa ggc aag ctt gtc tac gtc atg gaa atc
aaa aag 1152Ile Ser Pro His Lys Gly Lys Leu Val Tyr Val Met Glu Ile
Lys Lys 370 375 380atg ggt tac gat caa gca tct gga agc cca tac gcc
atc gcg gac gtt 1200Met Gly Tyr Asp Gln Ala Ser Gly Ser Pro Tyr Ala
Ile Ala Asp Val385 390 395 400gat atc att gac gtc aac gaa gag ctg
ggt caa agt ttt gac atc aac 1248Asp Ile Ile Asp Val Asn Glu Glu Leu
Gly Gln Ser Phe Asp Ile Asn 405 410 415gac ctt gcg agc tac gga aaa
ggt gac ctg agc aaa aaa atc gtg gtt 1296Asp Leu Ala Ser Tyr Gly Lys
Gly Asp Leu Ser Lys Lys Ile Val Val 420 425 430gac ttc aaa gga att
gct ttg cag ctc aaa ggc cgc gct ttt tca cgc 1344Asp Phe Lys Gly Ile
Ala Leu Gln Leu Lys Gly Arg Ala Phe Ser Arg 435 440 445atg agt tcc
agc tcg tcc ttg aac gaa gga tgg caa tgt gtt cca aaa 1392Met Ser Ser
Ser Ser Ser Leu Asn Glu Gly Trp Gln Cys Val Pro Lys 450 455 460cca
agc cag aga atg gaa cac gaa cag ccc cct gct cac tgc ctt gca 1440Pro
Ser Gln Arg Met Glu His Glu Gln Pro Pro Ala His Cys Leu Ala465 470
475 480agc gac ccc gaa gcc cct tca act gtg acc tgg cac cca atg tca
aag 1488Ser Asp Pro Glu Ala Pro Ser Thr Val Thr Trp His Pro Met Ser
Lys 485 490 495ctt cct ggc aac 1500Leu Pro Gly Asn
50064500PRTThraustochytrium sp. 64Met Gly Pro Arg Val Ala Ser Gly
Lys Val Pro Ala Trp Glu Met Ser1 5 10 15Lys Ser Glu Leu Cys Asp Asp
Arg Thr Val Val Phe Asp Tyr Glu Glu 20 25 30Leu Leu Glu Phe Ala Glu
Gly Asp Ile Ser Lys Val Phe Gly Pro Glu 35 40 45Phe Lys Val Val Asp
Gly Phe Arg Arg Arg Val Arg Leu Pro Ala Arg 50 55 60Glu Tyr Leu Leu
Val Thr Arg Val Thr Leu Met Asp Ala Glu Val Gly65 70 75 80Asn Phe
Arg Val Gly Ala Arg Met Val Thr Glu Tyr Asp Val Pro Val 85 90 95Asn
Gly Glu Leu Ser Glu Gly Gly Asp Val Pro Trp Ala Val Leu Val 100 105
110Glu Ala Gly Gln Cys Asp Leu Leu Leu Ile Ser Tyr Met Gly Ile Asp
115 120 125Phe Gln Cys Lys Gly Glu Arg Val Tyr Arg Leu Leu Asn Thr
Thr Leu 130 135 140Thr Phe Phe Gly Val Ala Lys Glu Gly Glu Thr Leu
Val Tyr Asp Ile145 150 155 160Arg Val Thr Gly Phe Ala Lys Arg Pro
Asp Gly Asp Ile Ser Met Phe 165 170 175Phe Phe Glu Tyr Asp Cys Tyr
Cys Asn Gly Lys Leu Leu Ile Glu Met 180 185 190Arg Asp Gly Ser Ala
Gly Phe Phe Thr Asp Glu Glu Leu Ala Ala Gly 195 200 205Lys Gly Val
Val Val Thr Arg Ala Gln Gln Asn Met Arg Asp Lys Ile 210 215 220Val
Arg Gln Ser Ile Glu Pro Phe Ala Leu Ala Ala Cys Thr His Lys225 230
235 240Thr Thr Leu Asn Glu Ser Asp Met Gln Ser Leu Val Glu Arg Asn
Trp 245 250 255Ala Asn Val Phe Gly Thr Ser Asn Lys Met Ala Glu Leu
Asn Tyr Lys 260 265 270Ile Cys Ala Arg Lys Met Leu Met Ile Asp Arg
Val Thr His Ile Asp 275 280 285His His Gly Gly Ala Tyr Gly Leu Gly
Leu Leu Val Gly Glu Lys Ile 290 295 300Leu Asp Arg Asn His Trp Tyr
Phe Pro Cys His Phe Val Asn Asp Gln305 310 315 320Val Met Ala Gly
Ser Leu Val Ser Asp Gly Cys Ser Gln Leu Leu Lys 325 330 335Leu Tyr
Met Ile Trp Leu Gly Leu His Leu Lys Met Glu Glu Phe Asp 340 345
350Phe Leu Pro Val Ser Gly His Lys Asn Lys Val Arg Cys Arg Gly Gln
355 360 365Ile Ser Pro His Lys Gly Lys Leu Val Tyr Val Met Glu Ile
Lys Lys 370 375 380Met Gly Tyr Asp Gln Ala Ser Gly Ser Pro Tyr Ala
Ile Ala Asp Val385 390 395 400Asp Ile Ile Asp Val Asn Glu Glu Leu
Gly Gln Ser Phe Asp Ile Asn 405 410 415Asp Leu Ala Ser Tyr Gly Lys
Gly Asp Leu Ser Lys Lys Ile Val Val 420 425 430Asp Phe Lys Gly Ile
Ala Leu Gln Leu Lys Gly Arg Ala Phe Ser Arg 435 440 445Met Ser Ser
Ser Ser Ser Leu Asn Glu Gly Trp Gln Cys Val Pro Lys 450 455 460Pro
Ser Gln Arg Met Glu His Glu Gln Pro Pro Ala His Cys Leu Ala465 470
475 480Ser Asp Pro Glu Ala Pro Ser Thr Val Thr Trp His Pro Met Ser
Lys 485 490 495Leu Pro Gly Asn 500651500DNAThraustochytrium
sp.CDS(1)..(1500) 65cct acg ccg ttc ttc tcc cct tca tct tac cct ccg
agg gca att tgc 48Pro Thr Pro Phe Phe Ser Pro Ser Ser Tyr Pro Pro
Arg Ala Ile Cys1 5 10 15ttc atc cct ttc ccg ggc aat ccc ctt gac aac
aac tgc aag gct gga 96Phe Ile Pro Phe Pro Gly Asn Pro Leu Asp Asn
Asn Cys Lys Ala Gly 20 25 30gaa atg ccc ctg aac tgg tac aac atg tca
gag ttc atg tgt ggc aag 144Glu Met Pro Leu Asn Trp Tyr Asn Met Ser
Glu Phe Met Cys Gly Lys 35 40 45gtt tct aac tgc ttg ggc cca gaa ttc
gca cgc ttt gac aag tcg aac 192Val Ser Asn Cys Leu Gly Pro Glu Phe
Ala Arg Phe Asp Lys Ser Asn 50 55 60acc agc cgg agc cct gct ttt gac
ttg gct ctg gtg acc cga gtt gtt 240Thr Ser Arg Ser Pro Ala Phe Asp
Leu Ala Leu Val Thr Arg Val Val65 70 75 80gaa gtc aca aac atg gaa
cac ggc aag ttt cta aac gtt gat tgc aat 288Glu Val Thr Asn Met Glu
His Gly Lys Phe Leu Asn Val Asp Cys Asn 85 90 95cca agc aaa ggc aca
atg gtg ggg gag ttt gac tgt ccc caa gac gcg 336Pro Ser Lys Gly Thr
Met Val Gly Glu Phe Asp Cys Pro Gln Asp Ala 100 105 110tgg ttc ttt
gat ggt tcg tgc aac gac ggc cat atg ccg tat tcc att 384Trp Phe Phe
Asp Gly Ser Cys Asn Asp Gly His Met Pro Tyr Ser Ile 115 120 125atc
atg gaa atc gga ctg caa acc tca ggt gtt ctc acc tcg gtg ttg 432Ile
Met Glu Ile Gly Leu Gln Thr Ser Gly Val Leu Thr Ser Val Leu 130 135
140aag gca ccg ctg act atg gac aag gat gac att ctc ttt cga aac ctc
480Lys Ala Pro Leu Thr Met Asp Lys Asp Asp Ile Leu Phe Arg Asn
Leu145 150 155 160gat gca agt gct gaa atg gtg cgt cca gac gtg gat
gtt cgc ggc aaa 528Asp Ala Ser Ala Glu Met Val Arg Pro Asp Val Asp
Val Arg Gly Lys 165 170 175acg att cga aac gtg acc aag tgt acc ggc
tat gca atg ttg gga aag 576Thr Ile Arg Asn Val Thr Lys Cys Thr Gly
Tyr Ala Met Leu Gly Lys 180 185 190atg ggg att cac cgg ttc acg ttt
gag ttg agc gtt gac ggc gtg gta 624Met Gly Ile His Arg Phe Thr Phe
Glu Leu Ser Val Asp Gly Val Val 195 200 205ttt tat aaa gga tcc act
tcc ttt gga tgg ttc act ccc gag gtg ttt 672Phe Tyr Lys Gly Ser Thr
Ser Phe Gly Trp Phe Thr Pro Glu Val Phe 210 215 220gct cag caa gct
gga ctc gac aac ggg aaa aag acg gag ccc tgg tgc 720Ala Gln Gln Ala
Gly Leu Asp Asn Gly Lys Lys Thr Glu Pro Trp Cys225 230 235 240aag
act aac aac acc tcg gtt cga aga gtt gaa atc gca tcc gcc aaa 768Lys
Thr Asn Asn Thr Ser Val Arg Arg Val Glu Ile Ala Ser Ala Lys 245 250
255gga aaa gag cag ctg act gag aag ctt ccc gac gca act aat gct caa
816Gly Lys Glu Gln Leu Thr Glu Lys Leu Pro Asp Ala Thr Asn Ala Gln
260 265 270gtt ctt cgg cgt tca gag cag tgt gaa tac ctc gat tac ctc
aat att 864Val Leu Arg Arg Ser Glu Gln Cys Glu Tyr Leu Asp Tyr Leu
Asn Ile 275 280 285gcc cct gac tct ggg ctg cat ggg aag ggc tac gcc
cac gga cac aaa 912Ala Pro Asp Ser Gly Leu His Gly Lys Gly Tyr Ala
His Gly His Lys 290 295 300gac gtt aac ccg caa gac tgg ttc ttc tct
tgc cac ttt tgg ttc gat 960Asp Val Asn Pro Gln Asp Trp Phe Phe Ser
Cys His Phe Trp Phe Asp305 310 315 320cct gta atg cca gga tct tta
gga att gaa tca atg ttc cag ctt atc 1008Pro Val Met Pro Gly Ser Leu
Gly Ile Glu Ser Met Phe Gln Leu Ile 325 330 335gag gcc ttt gcg gtg
gac caa aac att cct gga gag tac aac gta tcc 1056Glu Ala Phe Ala Val
Asp Gln Asn Ile Pro Gly Glu Tyr Asn Val Ser 340 345 350aat ccg acc
ttt gcc cat gca cca ggc aaa acg gcg tgg aaa tac cga 1104Asn Pro Thr
Phe Ala His Ala Pro Gly Lys Thr Ala Trp Lys Tyr Arg 355 360 365ggc
cag ctc aca cca aag aac cgt gcg atg gac tgc gag gtg cat atc 1152Gly
Gln Leu Thr Pro Lys Asn Arg Ala Met Asp Cys Glu Val His Ile 370 375
380gtt tca att acc gcc tcc ccc gag aac ggg ggc tac gtt gac atc gtg
1200Val Ser Ile Thr Ala Ser Pro Glu Asn Gly Gly Tyr Val Asp Ile
Val385 390 395 400gcc gat gga gcg ctt tgg gta gat gga ctt cgc gtg
tac gaa gcc aaa 1248Ala Asp Gly Ala Leu Trp Val Asp Gly Leu Arg Val
Tyr Glu Ala Lys 405 410 415gag ctt cga gtt cgt gtc gtt tcg gca aaa
cct caa gca att ccg gat 1296Glu Leu Arg Val Arg Val Val Ser Ala Lys
Pro Gln Ala Ile Pro Asp 420 425 430gta caa caa cag cca cct agc gca
aag gcg gac ccg ggg aaa aca gga 1344Val Gln Gln Gln Pro Pro Ser Ala
Lys Ala Asp Pro Gly Lys Thr Gly 435 440 445gtt gca ctt tcg ccc act
cag cta cgc gac gtc ctg ctt gaa gtg gac 1392Val Ala Leu Ser Pro Thr
Gln Leu Arg Asp Val Leu Leu Glu Val Asp 450 455 460aat cca ttg tat
ctt ggt gta gag aac tcc aat ttg gtg cag ttt gag 1440Asn Pro Leu Tyr
Leu Gly Val Glu Asn Ser Asn Leu Val Gln Phe Glu465 470 475 480tcg
aaa cct gca act tct tca cgt atc gtt tcg atc aaa ccg tgc tcg 1488Ser
Lys Pro Ala Thr Ser Ser Arg Ile Val Ser Ile Lys Pro Cys Ser 485 490
495att agt gac ctt 1500Ile Ser Asp Leu 50066500PRTThraustochytrium
sp. 66Pro Thr Pro Phe Phe Ser Pro Ser Ser Tyr Pro Pro Arg Ala Ile
Cys1 5 10 15Phe Ile Pro Phe Pro Gly Asn Pro Leu Asp Asn Asn Cys Lys
Ala Gly 20 25 30Glu Met Pro Leu Asn Trp Tyr Asn Met Ser Glu Phe Met
Cys Gly Lys 35 40 45Val Ser Asn Cys Leu Gly Pro Glu Phe Ala Arg Phe
Asp Lys Ser Asn 50 55 60Thr Ser Arg Ser Pro Ala Phe Asp Leu Ala Leu
Val Thr Arg Val Val65 70 75 80Glu Val Thr Asn Met Glu His Gly Lys
Phe Leu Asn Val Asp Cys Asn 85 90 95Pro Ser Lys Gly Thr Met Val Gly
Glu Phe Asp Cys Pro Gln Asp Ala 100 105 110Trp Phe Phe Asp Gly Ser
Cys Asn Asp Gly His Met Pro Tyr Ser Ile 115 120 125Ile Met Glu Ile
Gly Leu Gln Thr Ser Gly Val Leu Thr Ser Val Leu 130 135 140Lys Ala
Pro Leu Thr Met Asp Lys Asp Asp Ile Leu Phe Arg Asn Leu145 150 155
160Asp Ala Ser Ala Glu Met Val Arg Pro Asp Val Asp Val Arg Gly Lys
165 170 175Thr Ile Arg Asn Val Thr Lys Cys Thr Gly Tyr Ala Met Leu
Gly Lys 180 185 190Met Gly Ile His Arg Phe Thr Phe Glu Leu Ser Val
Asp Gly Val Val 195 200 205Phe Tyr Lys Gly Ser Thr Ser Phe Gly Trp
Phe Thr Pro Glu Val Phe 210 215 220Ala Gln Gln Ala Gly Leu Asp Asn
Gly Lys Lys Thr Glu Pro Trp Cys225 230 235 240Lys Thr Asn Asn Thr
Ser Val Arg Arg Val Glu Ile Ala Ser Ala Lys 245 250 255Gly Lys Glu
Gln Leu Thr Glu Lys Leu Pro Asp Ala Thr Asn Ala Gln 260 265 270Val
Leu Arg Arg Ser Glu Gln Cys Glu Tyr Leu Asp Tyr Leu Asn Ile 275 280
285Ala Pro Asp Ser Gly Leu His Gly Lys Gly Tyr Ala His Gly His Lys
290 295 300Asp Val Asn Pro Gln Asp Trp Phe Phe Ser Cys His Phe Trp
Phe Asp305 310 315 320Pro Val Met Pro Gly Ser Leu Gly Ile Glu Ser
Met Phe Gln Leu Ile 325 330 335Glu Ala Phe Ala Val Asp Gln Asn Ile
Pro Gly Glu Tyr Asn Val Ser 340 345 350Asn Pro Thr Phe Ala His Ala
Pro Gly Lys Thr Ala Trp Lys Tyr Arg 355 360 365Gly Gln Leu Thr Pro
Lys Asn Arg Ala Met Asp Cys Glu Val His Ile 370 375 380Val Ser Ile
Thr Ala Ser Pro Glu Asn Gly Gly Tyr Val Asp Ile Val385 390 395
400Ala Asp Gly Ala Leu Trp Val Asp Gly Leu Arg Val Tyr Glu Ala Lys
405 410 415Glu Leu Arg Val Arg Val Val Ser Ala Lys Pro Gln Ala Ile
Pro Asp 420 425 430Val Gln Gln Gln Pro Pro Ser Ala Lys Ala Asp Pro
Gly Lys Thr Gly 435 440 445Val Ala Leu Ser Pro Thr Gln Leu Arg Asp
Val Leu Leu Glu Val Asp 450 455 460Asn Pro Leu Tyr Leu Gly Val Glu
Asn Ser Asn Leu Val Gln Phe Glu465 470 475 480Ser Lys Pro Ala Thr
Ser Ser Arg Ile Val Ser Ile Lys Pro Cys Ser 485 490 495Ile Ser Asp
Leu 500671410DNAThraustochytrium sp.CDS(1)..(1410) 67ggc gat aag
tct ttt atg gaa acg tac aac gtg tca gca cct ctg tat 48Gly Asp Lys
Ser Phe Met Glu Thr Tyr Asn Val Ser Ala Pro Leu Tyr1 5 10 15act gga
gca atg gcc aag ggc att gca tcc gcc gac ttg gtc att gct 96Thr Gly
Ala Met Ala Lys Gly Ile Ala Ser Ala Asp Leu Val Ile Ala 20 25 30gct
ggg aaa cgc aag ata ctt gga tcg ttt ggt gcg gga ggg ctg cct 144Ala
Gly Lys Arg Lys Ile Leu Gly Ser Phe Gly Ala Gly Gly Leu Pro 35 40
45att tcc ata gtc cgt gaa gca ctg gag aaa att caa caa cac ctg ccc
192Ile Ser Ile Val Arg Glu Ala Leu Glu Lys Ile Gln Gln His Leu Pro
50 55 60cac ggc ccc tac gct gtt aac ctc att cac tcg cct ttc gac agc
aac 240His Gly Pro Tyr Ala Val Asn Leu Ile His Ser Pro Phe Asp Ser
Asn65 70 75 80ttg gaa aag ggc aac gtt gac ctc ttt ctc gag atg ggc
gtg aca gtg 288Leu Glu Lys Gly Asn Val Asp Leu Phe Leu Glu Met Gly
Val Thr Val 85 90 95gta gaa tgc agc gcg ttc atg gaa ctc acg gcc cag
gtt gtc cgg tac 336Val Glu Cys Ser Ala Phe Met Glu Leu Thr Ala Gln
Val Val Arg Tyr 100 105 110cgc gcg tct ggt cta agc aaa agt gcg gac
ggt tcg att cgc att gct 384Arg Ala Ser Gly Leu Ser Lys Ser Ala Asp
Gly Ser Ile Arg Ile Ala 115 120 125cac cgt att att ggc aag gtt tcc
aga acc gag ctg gca gaa atg ttt 432His Arg Ile Ile Gly Lys Val Ser
Arg Thr Glu Leu Ala Glu Met Phe 130 135 140att cgt cca gca cca cag
cac ctc ctc caa aaa ctc gta gcc tcc ggc 480Ile Arg Pro Ala Pro Gln
His Leu Leu Gln Lys Leu Val Ala Ser Gly145 150 155 160gag ctg aca
gct gag caa gcc gag ctt gca aca cag gtt ccg gtg gcg 528Glu Leu Thr
Ala Glu Gln Ala Glu Leu Ala Thr Gln Val Pro Val Ala 165 170 175gat
gac att gcg gtc gaa gcc gac tcg ggg ggg cat acc gac aac agg 576Asp
Asp Ile Ala Val Glu Ala Asp Ser Gly Gly His Thr Asp Asn Arg 180 185
190cct att cac gtc att ctt cct cta atc atc aac cta cgc aac cgt ttg
624Pro Ile His Val Ile Leu Pro Leu Ile Ile Asn Leu Arg Asn Arg Leu
195 200 205cat aaa gag ctt gac tac cct tcg cat ctc cgg gta cgt gtg
ggt gct 672His Lys Glu Leu Asp Tyr Pro Ser His Leu Arg Val Arg Val
Gly Ala 210 215
220ggt ggt ggt att gga tgt cct caa gcc gct ctt gca gca ttt caa atg
720Gly Gly Gly Ile Gly Cys Pro Gln Ala Ala Leu Ala Ala Phe Gln
Met225 230 235 240ggg gca gcg ttt tta atc act gga acg gtg aac cag
ctt gct cgt gaa 768Gly Ala Ala Phe Leu Ile Thr Gly Thr Val Asn Gln
Leu Ala Arg Glu 245 250 255agt ggc act tgt gac aac gtc cgg tta cag
ctc tca aag gcc acg tat 816Ser Gly Thr Cys Asp Asn Val Arg Leu Gln
Leu Ser Lys Ala Thr Tyr 260 265 270agc gac gtg tgt atg gct cct gct
gcc gat atg ttt gac caa ggc gtg 864Ser Asp Val Cys Met Ala Pro Ala
Ala Asp Met Phe Asp Gln Gly Val 275 280 285gag ctg caa gta ttg aag
aaa ggc acg ctg ttc cca agt cgt gct aag 912Glu Leu Gln Val Leu Lys
Lys Gly Thr Leu Phe Pro Ser Arg Ala Lys 290 295 300aag ctg tac gag
ctg ttc tgc aag tat gac tcg ttt gag gca atg ccg 960Lys Leu Tyr Glu
Leu Phe Cys Lys Tyr Asp Ser Phe Glu Ala Met Pro305 310 315 320gct
gaa gaa ttg caa cgg gtt gaa aag cgg att ttt caa aag tcg ctt 1008Ala
Glu Glu Leu Gln Arg Val Glu Lys Arg Ile Phe Gln Lys Ser Leu 325 330
335gct gaa gtt tgg cag gag acc agt gac ttt tac att cat cgt atc aag
1056Ala Glu Val Trp Gln Glu Thr Ser Asp Phe Tyr Ile His Arg Ile Lys
340 345 350aac cct gag aaa atc aat cgt gct gca agc gat ggc aaa ctg
aaa atg 1104Asn Pro Glu Lys Ile Asn Arg Ala Ala Ser Asp Gly Lys Leu
Lys Met 355 360 365tcg ctt tgc ttt cgc tgg tac ctt ggg ctt tcc tca
ttt tgg gcc aac 1152Ser Leu Cys Phe Arg Trp Tyr Leu Gly Leu Ser Ser
Phe Trp Ala Asn 370 375 380tct ggg gca caa gat cgc gtc atg gac tat
caa att tgg tgt ggc cct 1200Ser Gly Ala Gln Asp Arg Val Met Asp Tyr
Gln Ile Trp Cys Gly Pro385 390 395 400gct att ggc gct ttc aat gat
ttt acc aag ggc acg tac ctt gac gtg 1248Ala Ile Gly Ala Phe Asn Asp
Phe Thr Lys Gly Thr Tyr Leu Asp Val 405 410 415act gtt gca aag agt
tac cct tgt gtg gca cag atc aat ttg caa att 1296Thr Val Ala Lys Ser
Tyr Pro Cys Val Ala Gln Ile Asn Leu Gln Ile 420 425 430ttg caa gga
gct gcg tat ctg aaa cgc ctt ggt gtc att cgt ttt gac 1344Leu Gln Gly
Ala Ala Tyr Leu Lys Arg Leu Gly Val Ile Arg Phe Asp 435 440 445cgc
atg ctg ctg cag gcc gtc gat atc gac gat cct gta ttt act tac 1392Arg
Met Leu Leu Gln Ala Val Asp Ile Asp Asp Pro Val Phe Thr Tyr 450 455
460gtg ccg acc cag cca ctt 1410Val Pro Thr Gln Pro Leu465
47068470PRTThraustochytrium sp. 68Gly Asp Lys Ser Phe Met Glu Thr
Tyr Asn Val Ser Ala Pro Leu Tyr1 5 10 15Thr Gly Ala Met Ala Lys Gly
Ile Ala Ser Ala Asp Leu Val Ile Ala 20 25 30Ala Gly Lys Arg Lys Ile
Leu Gly Ser Phe Gly Ala Gly Gly Leu Pro 35 40 45Ile Ser Ile Val Arg
Glu Ala Leu Glu Lys Ile Gln Gln His Leu Pro 50 55 60His Gly Pro Tyr
Ala Val Asn Leu Ile His Ser Pro Phe Asp Ser Asn65 70 75 80Leu Glu
Lys Gly Asn Val Asp Leu Phe Leu Glu Met Gly Val Thr Val 85 90 95Val
Glu Cys Ser Ala Phe Met Glu Leu Thr Ala Gln Val Val Arg Tyr 100 105
110Arg Ala Ser Gly Leu Ser Lys Ser Ala Asp Gly Ser Ile Arg Ile Ala
115 120 125His Arg Ile Ile Gly Lys Val Ser Arg Thr Glu Leu Ala Glu
Met Phe 130 135 140Ile Arg Pro Ala Pro Gln His Leu Leu Gln Lys Leu
Val Ala Ser Gly145 150 155 160Glu Leu Thr Ala Glu Gln Ala Glu Leu
Ala Thr Gln Val Pro Val Ala 165 170 175Asp Asp Ile Ala Val Glu Ala
Asp Ser Gly Gly His Thr Asp Asn Arg 180 185 190Pro Ile His Val Ile
Leu Pro Leu Ile Ile Asn Leu Arg Asn Arg Leu 195 200 205His Lys Glu
Leu Asp Tyr Pro Ser His Leu Arg Val Arg Val Gly Ala 210 215 220Gly
Gly Gly Ile Gly Cys Pro Gln Ala Ala Leu Ala Ala Phe Gln Met225 230
235 240Gly Ala Ala Phe Leu Ile Thr Gly Thr Val Asn Gln Leu Ala Arg
Glu 245 250 255Ser Gly Thr Cys Asp Asn Val Arg Leu Gln Leu Ser Lys
Ala Thr Tyr 260 265 270Ser Asp Val Cys Met Ala Pro Ala Ala Asp Met
Phe Asp Gln Gly Val 275 280 285Glu Leu Gln Val Leu Lys Lys Gly Thr
Leu Phe Pro Ser Arg Ala Lys 290 295 300Lys Leu Tyr Glu Leu Phe Cys
Lys Tyr Asp Ser Phe Glu Ala Met Pro305 310 315 320Ala Glu Glu Leu
Gln Arg Val Glu Lys Arg Ile Phe Gln Lys Ser Leu 325 330 335Ala Glu
Val Trp Gln Glu Thr Ser Asp Phe Tyr Ile His Arg Ile Lys 340 345
350Asn Pro Glu Lys Ile Asn Arg Ala Ala Ser Asp Gly Lys Leu Lys Met
355 360 365Ser Leu Cys Phe Arg Trp Tyr Leu Gly Leu Ser Ser Phe Trp
Ala Asn 370 375 380Ser Gly Ala Gln Asp Arg Val Met Asp Tyr Gln Ile
Trp Cys Gly Pro385 390 395 400Ala Ile Gly Ala Phe Asn Asp Phe Thr
Lys Gly Thr Tyr Leu Asp Val 405 410 415Thr Val Ala Lys Ser Tyr Pro
Cys Val Ala Gln Ile Asn Leu Gln Ile 420 425 430Leu Gln Gly Ala Ala
Tyr Leu Lys Arg Leu Gly Val Ile Arg Phe Asp 435 440 445Arg Met Leu
Leu Gln Ala Val Asp Ile Asp Asp Pro Val Phe Thr Tyr 450 455 460Val
Pro Thr Gln Pro Leu465 4706920DNAArtificial sequenceprimer
69ggyatgmtgr ttggtgaagg 207022DNAArtificial sequenceprimer
70trttsasrta ytgygaacct tg 227120DNAArtificial sequenceprimer
71atgkcngaag gttgtggcca 207223DNAArtificial sequenceprimer
72ccwgaratra agccrttdgg ttg 237330DNAArtificial Sequenceprimer
73cggggtaccc gggagccgcc ttggctttgt 307435DNAArtificial
Sequenceprimer 74aaactgcagc ccgggtccag ctggcaggca ccctg
3575798DNAShewanella olleyana 75atggcagaag gttgtggcca attattgcag
ttcttcatgc tgcacattgg tatgcacacc 60ttagttgaaa acggacgttt ccagccttta
gaaaatgctt cacaaaaagt acgttgtcgt 120ggccaagtac tgccacaaca
tggtgaactg acgtaccgca tggaagtcac agaaattggt 180actcaccctc
gcccatacgc caaagccaat attgaaatat tgctcaatgg taaagcggtc
240gtggacttcc aaaatcttgg ggtgatgatt aaagaagaag gtgaatgtac
tcgttacact 300gccgactcta ctgaaacaca tacaacctca ggcacagtcc
aaaaaaacaa cagccacaac 360acaccagcat cattaaatgc accgttaatg
gcacaagtgc cagacttaag tgaaccagcc 420aataaaggcg ttatcccgct
gcaacatgtt gaagcgccta tgctgccaga ctacccaaat 480cgaacccctg
atacgctgcc gttcaccgcg taccatatgt ttgagtttgc aacaggtgac
540atcgaaaact gttttggacc tgactttagt atttaccggg gctttattcc
gccgcgcacg 600ccatgtggtg acttacagct aacaacccgt gttgttgata
ttcaaggtaa acgtggcgag 660cttaaaaaac cgtcatcgtg tatcgctgaa
tatgaagtgc caaccgatgc gtggtatttt 720gctaaaaaca gtcacgcttc
agtgatgcct tactcggtat taatggaaat atcactgcaa 780cccaacggct tcatctca
798761042DNAShewanella olleyana 76ggcatgatgg ttggtgaagg tatcggcatg
gtagcactta agcgcctaga agatgctgag 60cgtgatggcg accgtattta ttctgtgatt
aaaggtgtcg gcgcttcatc agacggtaaa 120tttaagagta tttatgcacc
gcgccctgaa ggccaagcaa aagcattaaa acgagcttat 180gatgacgctg
gttttgcccc tgaaacagtt ggcttaatcg aagctcacgg tacgggtact
240gctgcaggtg atgtagccga atttaacggc cttaaatctg tatttggtga
aaacgatcca 300actaagcaac acatcgcttt aggttcagtg aaatcacaag
tgggtcacac gaaatcaacc 360gctggtactg ctggcgtgat taaagctgcc
cttgccctgc accataaagt attgccaccg 420accattaacg tctctaagcc
aaaccctaag cttaatgttg aggattcacc gtttttcgtt 480aataccgaaa
cacgcccatg gatgcctcgc cctgacggca ctcctcgccg tgctggtatt
540agctcgttcg gttttggtgg aactaacttc cacttagtat tagaagaata
cacccctgag 600cacagccatg atgagaaata ccgtcaacgc caagtggctc
aaagcttatt aatgagtgct 660gataataaag cagccttgat tgcagaagtg
aataagctaa ctgcagacat cagcgcgctt 720aaaggcacag ataacagcag
cattgaacaa gctgaacttg ctcgcattgc taaactatat 780gctgttcgca
ccatagatac ttcagcagcc cgtttaggtc ttgtggtatc aagccttaat
840gaattaacca ctcagcttgg tttagcgtta aagcagctta ataatgatgt
tgatgcatgg 900caactgccat cagggactag ctaccgctct tcagcactca
tcacgattaa tgcaaaccaa 960aaggcgacta aaggtaaaaa agcgactaac
gcaccgaaag ttgcagcatt gtttgcaggt 1020caaggttcac aatacctcaa ca
104277806DNAShewanella japonica 77atggcggaag gttgtggcca attactgcaa
ttctttatgc tgcacattgg tatgcacacg 60ctcgttgaaa atggccgttt ccaaccactt
gaaaatgctt cacaaaaagt gcgttgtcgt 120ggtcaagttc tgccgcagca
cggtgaactg acttaccgga tggaaatcac tgaaattggc 180attcaccctc
gcccatatgc caaagcgaat attgatattt tgcttaacgg taaagcggtt
240gtcgacttcc aaaacttagg tgtcatgatc aaagaagaaa gcgaatgtac
gcgctacctt 300aatgatacgc ccgctgtcga tgcctcagct gatcgaatta
attcagcaac caataatatt 360ctatacccag cggcttcaac caatgcgcca
ctcatggctc aactgcctga tttgaatgcc 420ccaacgaata aaggcgttat
cccactgcaa catgttgaag cgccgataat tccagattat 480ccaaatcgta
ctcctgatac cctgccattc acggcgtatc acatgttcga atttgccact
540ggcaatattg aaaactgctt tggaccggac tttagtattt accgtggttt
cattccaccg 600cgcacaccat gtggcgactt acagctaacg actcgtattg
ttgatattca aggtaaacgt 660ggcgaattga aaaagccatc atcgtgtatc
gcagaatatg aagtgccaac tgatgcatgg 720tatttcgcta aaaacagcca
cgcctcggtc ataccttatt cagtgttgat ggaaatttca 780ctgcaaccca
atggcttcat ctcagg 806781042DNAShewanella japonica 78ggtatgctgg
ttggtgaagg cattggcatg gtggcattaa aacgtcttga agatgctgag 60cgtgacggtg
accgtattta ctcagtcatt aaaggggtcg gcgcttcatc tgatggtaag
120ttcaaatcaa tttatgcacc tcgacctgaa ggccaagcta aagcgctgaa
gcgtgcttat 180gatgacgccg gctttgcacc tgaaaccgtt ggcttaattg
aagctcacgg aacaggcact 240gcagcgggtg atgtggcaga atttaatggt
cttaaatctg tatttggtga gaatgactca 300acaaagcaac acattgcttt
aggttcagtt aagtcacaag tgggccatac taaatcaact 360gcgggaaccg
cgggtgtgat taaagcggcg ttagcactgc atcataaagt gctgccgcca
420accatcaacg tctctaagcc taaccctaag cttaatgttg aggattcacc
gtttttcatt 480aacactgaaa ctcgcccttg gatgcctcgc cctgatggca
caccacgccg agctggtata 540agttcgttcg gttttggtgg cacaaacttc
cacttagtac tagaagaata cagcccagag 600cacagccgtg atgagaaata
tcgtcagcgc caagtagcac aaagcttatt gattagcgct 660gacaataaag
ctgagctcat tgcagaaatc aacaagctta acgctgacat cagcgcgctt
720aaaggcacag ataacagcag catcgaacaa gctgaacttg cccgcattgc
taaactatat 780gctgttcgca ctttagatac ttcagcagtc cgtttgggtc
ttgtggtctc aagccttaat 840gaattaacca ctcaacttgg tttagcgtta
aagcagctaa gtaacgacgc tgaagcatgg 900caattaccat caggtacgag
ctatcgctca tctgcgctca tcacgattaa tgccaaccaa 960aagacgacta
aaggtaaaaa agcagctaac acaccgaaag tagcagcatt atttgcaggt
1020caaggttcgc aatatctcaa ca 1042796DNAArtificial
Sequencerestriction enzyme site 79catatg 680192DNASchizochytrium
sp.misc_feature(1)..(7)BspHI restriction site 80tcatgaagcc
ggttgctccg aagttctacg cgcgtctcaa cattgacgag caggacgaga 60cccgtgatcc
gatcctcaac aaggacaacg cgccgtcttc cagctctagc tcctcttcca
120gctcttccag ctcttccagc ccgtcgccag ctccgtccgc cccagtgcaa
aagaaggctg 180ctccggccgc gg 19281717DNANostoc sp.CDS(4)..(714)
81taa ttg ttg cag cat act tgg cta cca aaa ccc cca aat tta acc tta
48 Leu Leu Gln His Thr Trp Leu Pro Lys Pro Pro Asn Leu Thr Leu 1 5
10 15ttg tca gat gaa gtt cat ctc tgg cgc att ccc ctt gac caa cca
gaa 96Leu Ser Asp Glu Val His Leu Trp Arg Ile Pro Leu Asp Gln Pro
Glu 20 25 30tca cag cta cag gat tta gcc gct acc tta tct agt gac gaa
tta gcc 144Ser Gln Leu Gln Asp Leu Ala Ala Thr Leu Ser Ser Asp Glu
Leu Ala 35 40 45cgt gca aac aga ttt tat ttt ccc gaa cat cgc cgg cgt
ttt act gct 192Arg Ala Asn Arg Phe Tyr Phe Pro Glu His Arg Arg Arg
Phe Thr Ala 50 55 60ggt cgt ggt att ctc cgc agt atc ttg ggg ggc tat
ttg ggt gtg gaa 240Gly Arg Gly Ile Leu Arg Ser Ile Leu Gly Gly Tyr
Leu Gly Val Glu 65 70 75cca ggg caa gtt aaa ttt gat tat gaa tcc cgt
ggt aaa cca ata tta 288Pro Gly Gln Val Lys Phe Asp Tyr Glu Ser Arg
Gly Lys Pro Ile Leu80 85 90 95ggc gat cgc ttt gcc gag agt ggt tta
tta ttt aac ttg tca cac tcc 336Gly Asp Arg Phe Ala Glu Ser Gly Leu
Leu Phe Asn Leu Ser His Ser 100 105 110cag aac ttg gcc ttg tgt gca
gtc aat tac acg cgc caa atc ggc atc 384Gln Asn Leu Ala Leu Cys Ala
Val Asn Tyr Thr Arg Gln Ile Gly Ile 115 120 125gat tta gaa tat ctc
cgc ccc aca tct gat tta gaa tcc ctt gcc aaa 432Asp Leu Glu Tyr Leu
Arg Pro Thr Ser Asp Leu Glu Ser Leu Ala Lys 130 135 140agg ttc ttt
tta ccg cga gaa tat gaa tta ttg cga tcg cta ccc gat 480Arg Phe Phe
Leu Pro Arg Glu Tyr Glu Leu Leu Arg Ser Leu Pro Asp 145 150 155gag
caa aaa caa aaa att ttc ttt cgt tac tgg act tgt aaa gag gct 528Glu
Gln Lys Gln Lys Ile Phe Phe Arg Tyr Trp Thr Cys Lys Glu Ala160 165
170 175tat ctt aaa gca acg ggt gac ggc atc gct aaa tta gag gaa att
gaa 576Tyr Leu Lys Ala Thr Gly Asp Gly Ile Ala Lys Leu Glu Glu Ile
Glu 180 185 190ata gca cta act ccc aca gaa cca gct aag tta cag aca
gct cca gcg 624Ile Ala Leu Thr Pro Thr Glu Pro Ala Lys Leu Gln Thr
Ala Pro Ala 195 200 205tgg agt ctc cta gag cta gtg cca gat gat aat
tgt gtt gct gct gtt 672Trp Ser Leu Leu Glu Leu Val Pro Asp Asp Asn
Cys Val Ala Ala Val 210 215 220gcc gtg gcg ggt ttt ggc tgg cag cca
aaa ttc tgg cat tat tga 717Ala Val Ala Gly Phe Gly Trp Gln Pro Lys
Phe Trp His Tyr 225 230 23582237PRTNostoc sp. 82Leu Leu Gln His Thr
Trp Leu Pro Lys Pro Pro Asn Leu Thr Leu Leu1 5 10 15Ser Asp Glu Val
His Leu Trp Arg Ile Pro Leu Asp Gln Pro Glu Ser 20 25 30Gln Leu Gln
Asp Leu Ala Ala Thr Leu Ser Ser Asp Glu Leu Ala Arg 35 40 45Ala Asn
Arg Phe Tyr Phe Pro Glu His Arg Arg Arg Phe Thr Ala Gly 50 55 60Arg
Gly Ile Leu Arg Ser Ile Leu Gly Gly Tyr Leu Gly Val Glu Pro65 70 75
80Gly Gln Val Lys Phe Asp Tyr Glu Ser Arg Gly Lys Pro Ile Leu Gly
85 90 95Asp Arg Phe Ala Glu Ser Gly Leu Leu Phe Asn Leu Ser His Ser
Gln 100 105 110Asn Leu Ala Leu Cys Ala Val Asn Tyr Thr Arg Gln Ile
Gly Ile Asp 115 120 125Leu Glu Tyr Leu Arg Pro Thr Ser Asp Leu Glu
Ser Leu Ala Lys Arg 130 135 140Phe Phe Leu Pro Arg Glu Tyr Glu Leu
Leu Arg Ser Leu Pro Asp Glu145 150 155 160Gln Lys Gln Lys Ile Phe
Phe Arg Tyr Trp Thr Cys Lys Glu Ala Tyr 165 170 175Leu Lys Ala Thr
Gly Asp Gly Ile Ala Lys Leu Glu Glu Ile Glu Ile 180 185 190Ala Leu
Thr Pro Thr Glu Pro Ala Lys Leu Gln Thr Ala Pro Ala Trp 195 200
205Ser Leu Leu Glu Leu Val Pro Asp Asp Asn Cys Val Ala Ala Val Ala
210 215 220Val Ala Gly Phe Gly Trp Gln Pro Lys Phe Trp His Tyr225
230 235
uspto.report is an independent third-party trademark research tool that is not affiliated, endorsed, or sponsored by the United States Patent and Trademark Office (USPTO) or any other governmental organization. The information provided by uspto.report is based on publicly available data at the time of writing and is intended for informational purposes only.
While we strive to provide accurate and up-to-date information, we do not guarantee the accuracy, completeness, reliability, or suitability of the information displayed on this site. The use of this site is at your own risk. Any reliance you place on such information is therefore strictly at your own risk.
All official trademark data, including owner information, should be verified by visiting the official USPTO website at www.uspto.gov. This site is not intended to replace professional legal advice and should not be used as a substitute for consulting with a legal professional who is knowledgeable about trademark law.