U.S. patent application number 17/739418 was filed with the patent office on 2022-09-01 for base editors with improved precision and specificity. The applicant listed for this patent is The General Hospital Corporation. Invention is credited to Jason Michael Gehrke, J. Keith Joung.
Application Number | 20220275356 17/739418 |
Document ID | / |
Family ID | 1000006333483 |
Filed Date | 2022-09-01 |
United States Patent Application | 20220275356 |
Kind Code | A1 |
Joung; J. Keith ; et al. | September 1, 2022 |
Methods and compositions for improving the genome-wide specificities of targeted base editing technologies.
Inventors: | Joung; J. Keith; (Winchester, MA) ; Gehrke; Jason Michael; (Cambridge, MA) | ||||||||||
Applicant: |
|
||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Family ID: | 1000006333483 | ||||||||||
Appl. No.: | 17/739418 | ||||||||||
Filed: | May 9, 2022 |
Application Number | Filing Date | Patent Number | ||
---|---|---|---|---|
16615559 | Nov 21, 2019 | 11326157 | ||
PCT/US2018/034719 | May 25, 2018 | |||
17739418 | ||||
62511296 | May 25, 2017 | |||
62541544 | Aug 4, 2017 | |||
62622676 | Jan 26, 2018 | |||
Current U.S. Class: | 1/1 |
Current CPC Class: | C07K 2319/81 20130101; C07K 14/4703 20130101; C12N 9/78 20130101; C12Y 305/04001 20130101; C12N 15/102 20130101; C12N 5/0647 20130101; C12N 15/90 20130101; C07K 14/435 20130101; C07K 2319/80 20130101; C12N 15/63 20130101; C12N 9/22 20130101; C12N 2310/20 20170501; C07K 2319/70 20130101 |
International Class: | C12N 9/78 20060101 C12N009/78; C12N 9/22 20060101 C12N009/22; C12N 15/10 20060101 C12N015/10; C12N 15/63 20060101 C12N015/63; C12N 15/90 20060101 C12N015/90; C07K 14/435 20060101 C07K014/435; C07K 14/47 20060101 C07K014/47; C12N 5/0789 20060101 C12N005/0789 |
Sequence CWU 1 SEQUENCE LISTING <160> NUMBER OF SEQ ID
NOS: 67 <210> SEQ ID NO 1 <211> LENGTH: 12 <212>
TYPE: PRT <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: fragment of hAID <400>
SEQUENCE: 1 Gln Phe Lys Asn Val Arg Trp Ala Lys Gly Arg Arg 1 5 10
<210> SEQ ID NO 2 <211> LENGTH: 9 <212> TYPE: PRT
<213> ORGANISM: Artificial <220> FEATURE: <223>
OTHER INFORMATION: fragment of hAID solubility variant (hAIDv)
<400> SEQUENCE: 2 Asn Phe Asn Asn Gly Ile Gly Arg His 1 5
<210> SEQ ID NO 3 <211> LENGTH: 9 <212> TYPE: PRT
<213> ORGANISM: Artificial <220> FEATURE: <223>
OTHER INFORMATION: fragment of hAPOBEC3A <400> SEQUENCE: 3
Asn Phe Asn Asn Gly Ile Gly Arg His 1 5 <210> SEQ ID NO 4
<211> LENGTH: 12 <212> TYPE: PRT <213> ORGANISM:
Artificial <220> FEATURE: <223> OTHER INFORMATION:
fragment of hAPOBEC3C <400> SEQUENCE: 4 Gln Phe Lys Asn Leu
Trp Glu Ala Asn Asp Arg Asn 1 5 10 <210> SEQ ID NO 5
<211> LENGTH: 12 <212> TYPE: PRT <213> ORGANISM:
Artificial <220> FEATURE: <223> OTHER INFORMATION:
fragment of hAPOBEC3F - catalytic domain <400> SEQUENCE: 5
His Phe Lys Asn Leu Arg Lys Ala Tyr Gly Arg Asn 1 5 10 <210>
SEQ ID NO 6 <211> LENGTH: 12 <212> TYPE: PRT
<213> ORGANISM: Artificial <220> FEATURE: <223>
OTHER INFORMATION: fragment of hAPOBEC3G - catalytic domain
<400> SEQUENCE: 6 Asn Phe Asn Asn Glu Pro Trp Val Arg Gly Arg
His 1 5 10 <210> SEQ ID NO 7 <211> LENGTH: 12
<212> TYPE: PRT <213> ORGANISM: Artificial <220>
FEATURE: <223> OTHER INFORMATION: fragment of mAPOBEC3 -
catalytic domain <400> SEQUENCE: 7 His Phe Lys Asn Leu Gly
Tyr Ala Lys Gly Arg Lys 1 5 10 <210> SEQ ID NO 8 <211>
LENGTH: 15 <212> TYPE: PRT <213> ORGANISM: Artificial
<220> FEATURE: <223> OTHER INFORMATION: fragment of
hAPOBEC3H <400> SEQUENCE: 8 Gln Phe Asn Asn Lys Arg Arg Leu
Arg Arg Pro Tyr Tyr Pro Arg 1 5 10 15 <210> SEQ ID NO 9
<211> LENGTH: 9 <212> TYPE: PRT <213> ORGANISM:
Artificial <220> FEATURE: <223> OTHER INFORMATION:
fragment of rAPOBEC1 <400> SEQUENCE: 9 Phe Phe Asp Pro Arg
Glu Leu Arg Lys 1 5 <210> SEQ ID NO 10 <211> LENGTH: 17
<212> TYPE: PRT <213> ORGANISM: Artificial <220>
FEATURE: <223> OTHER INFORMATION: fragment of hAID
<400> SEQUENCE: 10 Phe Thr Ala Arg Leu Tyr Phe Cys Glu Asp
Arg Lys Ala Glu Pro Glu 1 5 10 15 Gly <210> SEQ ID NO 11
<211> LENGTH: 17 <212> TYPE: PRT <213> ORGANISM:
Artificial <220> FEATURE: <223> OTHER INFORMATION:
fragment of hAID solubility variant (hAIDv) <400> SEQUENCE:
11 Phe Thr Ala Arg Leu Tyr Phe Cys Glu Asp Arg Lys Ala Glu Pro Glu
1 5 10 15 Gly <210> SEQ ID NO 12 <211> LENGTH: 15
<212> TYPE: PRT <213> ORGANISM: Artificial <220>
FEATURE: <223> OTHER INFORMATION: fragment of hAPOBEC3A
<400> SEQUENCE: 12 Phe Ala Ala Arg Ile Tyr Asp Tyr Asp Pro
Leu Tyr Lys Glu Ala 1 5 10 15 <210> SEQ ID NO 13 <211>
LENGTH: 16 <212> TYPE: PRT <213> ORGANISM: Artificial
<220> FEATURE: <223> OTHER INFORMATION: fragment of
hAPOBEC3C <400> SEQUENCE: 13 Phe Thr Ala Arg Leu Tyr Tyr Phe
Gln Tyr Pro Cys Tyr Gln Glu Gly 1 5 10 15 <210> SEQ ID NO 14
<211> LENGTH: 16 <212> TYPE: PRT <213> ORGANISM:
Artificial <220> FEATURE: <223> OTHER INFORMATION:
fragment of hAPOBEC3F - catalytic domain <400> SEQUENCE: 14
Phe Thr Ala Arg Leu Tyr Tyr Phe Trp Asp Thr Asp Tyr Gln Glu Gly 1 5
10 15 <210> SEQ ID NO 15 <211> LENGTH: 15 <212>
TYPE: PRT <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: fragment of hAPOBEC3G - catalytic
domain <400> SEQUENCE: 15 Phe Thr Ala Arg Ile Tyr Asp Asp Gln
Gly Arg Cys Gln Glu Gly 1 5 10 15 <210> SEQ ID NO 16
<211> LENGTH: 16 <212> TYPE: PRT <213> ORGANISM:
Artificial <220> FEATURE: <223> OTHER INFORMATION:
fragment of mAPOBEC3 - catalytic domain <400> SEQUENCE: 16
Phe Ser Ser Arg Leu Tyr Asn Val Gln Asp Pro Glu Thr Gln Gln Asn 1 5
10 15 <210> SEQ ID NO 17 <211> LENGTH: 16 <212>
TYPE: PRT <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: fragment of hAPOBEC3H <400>
SEQUENCE: 17 Phe Ala Ser Arg Leu Tyr Tyr His Trp Cys Lys Pro Gln
Gln Lys Gly 1 5 10 15 <210> SEQ ID NO 18 <211> LENGTH:
16 <212> TYPE: PRT <213> ORGANISM: Artificial
<220> FEATURE: <223> OTHER INFORMATION: fragment of
rAPOBEC1 <400> SEQUENCE: 18 Tyr Ile Ala Arg Leu Tyr His His
Ala Asp Pro Arg Asn Arg Gln Gly 1 5 10 15 <210> SEQ ID NO 19
<211> LENGTH: 21 <212> TYPE: DNA <213> ORGANISM:
Artificial <220> FEATURE: <223> OTHER INFORMATION: EGFP
gRNA target sequence <400> SEQUENCE: 19 tcagctcgat gcggttcacc
a 21 <210> SEQ ID NO 20 <211> LENGTH: 21 <212>
TYPE: DNA <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: EGFP gRNA target sequence
<400> SEQUENCE: 20 gcagaacacc cccatcggcg a 21 <210> SEQ
ID NO 21 <400> SEQUENCE: 21 000 <210> SEQ ID NO 22
<211> LENGTH: 24 <212> TYPE: DNA <213> ORGANISM:
Artificial <220> FEATURE: <223> OTHER INFORMATION: EGFP
gRNA target sequence <400> SEQUENCE: 22 tcagctcgat gcggttcacc
aggg 24 <210> SEQ ID NO 23 <211> LENGTH: 20 <212>
TYPE: DNA <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: AAVS1 target site 1 reference
sequence <400> SEQUENCE: 23 gactcaccca ggagtgcgtt 20
<210> SEQ ID NO 24 <211> LENGTH: 20 <212> TYPE:
DNA <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: AAVS1 target site 2 reference
sequence <400> SEQUENCE: 24 gtccgactcg gccaggtcca 20
<210> SEQ ID NO 25 <211> LENGTH: 20 <212> TYPE:
DNA <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: AAVS1 target site 3 reference
sequence <400> SEQUENCE: 25 gaccctcagc cgtgctgctc 20
<210> SEQ ID NO 26 <211> LENGTH: 20 <212> TYPE:
DNA <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: AAVS1 target site 4 reference
sequence <400> SEQUENCE: 26 gctctcagcc tggagaccac 20
<210> SEQ ID NO 27 <211> LENGTH: 20 <212> TYPE:
DNA <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: AAVS1 target site 5 reference
sequence <400> SEQUENCE: 27 gctgactcag agaccctgag 20
<210> SEQ ID NO 28 <211> LENGTH: 20 <212> TYPE:
DNA <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: AAVS1 target site 6 reference
sequence <400> SEQUENCE: 28 ggggctcaac atcggaagag 20
<210> SEQ ID NO 29 <211> LENGTH: 20 <212> TYPE:
DNA <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: AAVS1 target site 7 reference
sequence <400> SEQUENCE: 29 ggcactcggg ggcgagagga 20
<210> SEQ ID NO 30 <211> LENGTH: 20 <212> TYPE:
DNA <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: EMX1 target site 2 reference
sequence <400> SEQUENCE: 30 gtattcacct gaaagtgtgc 20
<210> SEQ ID NO 31 <211> LENGTH: 20 <212> TYPE:
DNA <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: AAVS1 target site 8 reference
sequence <400> SEQUENCE: 31 gagctcactg aacgctggca 20
<210> SEQ ID NO 32 <211> LENGTH: 20 <212> TYPE:
DNA <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: AAVS1 target site 9 reference
sequence <400> SEQUENCE: 32 gctggctcag gttcaggaga 20
<210> SEQ ID NO 33 <211> LENGTH: 20 <212> TYPE:
DNA <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: FANCF target site 1 reference
sequence <400> SEQUENCE: 33 ggaatccctt ctgcagcacc 20
<210> SEQ ID NO 34 <211> LENGTH: 20 <212> TYPE:
DNA <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: EMX1 target site 1 reference
sequence <400> SEQUENCE: 34 gagtccgagc agaagaagaa 20
<210> SEQ ID NO 35 <211> LENGTH: 20 <212> TYPE:
DNA <213> ORGANISM: Artificial Sequence <220> FEATURE:
<223> OTHER INFORMATION: AAVS1 target site 6 reference
sequence <400> SEQUENCE: 35 ggggctcaac atcggaagag 20
<210> SEQ ID NO 36 <211> LENGTH: 20 <212> TYPE:
DNA <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: AAVS1 target site 3 reference
sequence <400> SEQUENCE: 36 gaccctcagc cgtgctgctc 20
<210> SEQ ID NO 37 <211> LENGTH: 20 <212> TYPE:
DNA <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: AAVS1 target site 7 reference
sequence <400> SEQUENCE: 37 ggcactcggg ggcgagagga 20
<210> SEQ ID NO 38 <211> LENGTH: 20 <212> TYPE:
DNA <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: potential HBB allele products
<400> SEQUENCE: 38 ctgaatttta tgcccagccc 20 <210> SEQ
ID NO 39 <211> LENGTH: 20 <212> TYPE: DNA <213>
ORGANISM: Artificial <220> FEATURE: <223> OTHER
INFORMATION: potential HBB allele products <400> SEQUENCE: 39
ctgagtttta tgcccagccc 20 <210> SEQ ID NO 40 <211>
LENGTH: 20 <212> TYPE: DNA <213> ORGANISM: Artificial
<220> FEATURE: <223> OTHER INFORMATION: potential HBB
allele products <400> SEQUENCE: 40 ctgattttta tgcccagccc 20
<210> SEQ ID NO 41 <211> LENGTH: 20 <212> TYPE:
DNA <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: potential HBB allele products
<400> SEQUENCE: 41 ctgacttgta tgcccagccc 20 <210> SEQ
ID NO 42 <211> LENGTH: 20 <212> TYPE: DNA <213>
ORGANISM: Artificial <220> FEATURE: <223> OTHER
INFORMATION: potential HBB allele products <400> SEQUENCE: 42
ctgactttta tgcccagccc 20 <210> SEQ ID NO 43 <211>
LENGTH: 20 <212> TYPE: DNA <213> ORGANISM: Artificial
<220> FEATURE: <223> OTHER INFORMATION: HBB target site
reference sequence <400> SEQUENCE: 43 ctgacttcta tgcccagccc
20 <210> SEQ ID NO 44 <211> LENGTH: 23 <212>
TYPE: DNA <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: HBB target site reference sequence
<220> FEATURE: <221> NAME/KEY: misc_feature <222>
LOCATION: (21)..(21) <223> OTHER INFORMATION: n is a, c, g,
or t <400> SEQUENCE: 44 ctgacttcta tgcccagccc ngg 23
<210> SEQ ID NO 45 <211> LENGTH: 83 <212> TYPE:
PRT <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: Uracil glycosylase inhibitor (UGI)
<400> SEQUENCE: 45 Thr Asn Leu Ser Asp Ile Ile Glu Lys Glu
Thr Gly Lys Gln Leu Val 1 5 10 15 Ile Gln Glu Ser Ile Leu Met Leu
Pro Glu Glu Val Glu Glu Val Ile 20 25 30 Gly Asn Lys Pro Glu Ser
Asp Ile Leu Val His Thr Ala Tyr Asp Glu 35 40 45 Ser Thr Asp Glu
Asn Val Met Leu Leu Thr Ser Asp Ala Pro Glu Tyr 50 55 60 Lys Pro
Trp Ala Leu Val Ile Gln Asp Ser Asn Gly Glu Asn Lys Ile 65 70 75 80
Lys Met Leu <210> SEQ ID NO 46 <211> LENGTH: 4
<212> TYPE: PRT <213> ORGANISM: Artificial <220>
FEATURE: <223> OTHER INFORMATION: linker sequence <400>
SEQUENCE: 46 Gly Gly Gly Ser 1 <210> SEQ ID NO 47 <211>
LENGTH: 5 <212> TYPE: PRT <213> ORGANISM: Artificial
<220> FEATURE: <223> OTHER INFORMATION: linker sequence
<400> SEQUENCE: 47 Gly Gly Gly Gly Ser 1 5 <210> SEQ ID
NO 48 <211> LENGTH: 7 <212> TYPE: PRT <213>
ORGANISM: Artificial <220> FEATURE: <223> OTHER
INFORMATION: SV40 large T antigen NLS <400> SEQUENCE: 48 Pro
Lys Lys Lys Arg Arg Val 1 5 <210> SEQ ID NO 49 <211>
LENGTH: 15 <212> TYPE: PRT <213> ORGANISM: Artificial
<220> FEATURE: <223> OTHER INFORMATION: nucleoplasmin
NLS <400> SEQUENCE: 49 Lys Arg Pro Ala Ala Thr Lys Lys Ala
Gly Gln Ala Lys Lys Lys 1 5 10 15 <210> SEQ ID NO 50
<211> LENGTH: 1710 <212> TYPE: PRT <213>
ORGANISM: Artificial <220> FEATURE: <223> OTHER
INFORMATION: rAPOBEC1-XTEN L8-nCas9-UGI-SV40 NLS <400>
SEQUENCE: 50 Met Ser Ser Glu Thr Gly Pro Val Ala Val Asp Pro Thr
Leu Arg Arg 1 5 10 15 Arg Ile Glu Pro His Glu Phe Glu Val Phe Phe
Asp Pro Arg Glu Leu 20 25 30 Arg Lys Glu Thr Cys Leu Leu Tyr Glu
Ile Asn Trp Gly Gly Arg His 35 40 45 Ser Ile Trp Arg His Thr Ser
Gln Asn Thr Asn Lys His Val Glu Val 50 55 60 Asn Phe Ile Glu Lys
Phe Thr Thr Glu Arg Tyr Phe Cys Pro Asn Thr 65 70 75 80 Arg Cys Ser
Ile Thr Trp Phe Leu Ser Trp Ser Pro Cys Gly Glu Cys 85 90 95 Ser
Arg Ala Ile Thr Glu Phe Leu Ser Arg Tyr Pro His Val Thr Leu 100 105
110 Phe Ile Tyr Ile Ala Arg Leu Tyr His His Ala Asp Pro Arg Asn Arg
115 120 125 Gln Gly Leu Arg Asp Leu Ile Ser Ser Gly Val Thr Ile Gln
Ile Met 130 135 140 Thr Glu Gln Glu Ser Gly Tyr Cys Trp Arg Asn Phe
Val Asn Tyr Ser 145 150 155 160 Pro Ser Asn Glu Ala His Trp Pro Arg
Tyr Pro His Leu Trp Val Arg 165 170 175 Leu Tyr Val Leu Glu Leu Tyr
Cys Ile Ile Leu Gly Leu Pro Pro Cys 180 185 190 Leu Asn Ile Leu Arg
Arg Lys Gln Pro Gln Leu Thr Phe Phe Thr Ile 195 200 205 Ala Leu Gln
Ser Cys His Tyr Gln Arg Leu Pro Pro His Ile Leu Trp 210 215 220 Ala
Thr Gly Leu Lys Ser Gly Ser Glu Thr Pro Gly Thr Ser Glu Ser 225 230
235 240 Ala Thr Pro Glu Ser Asp Lys Lys Tyr Ser Ile Gly Leu Ala Ile
Gly 245 250 255 Thr Asn Ser Val Gly Trp Ala Val Ile Thr Asp Glu Tyr
Lys Val Pro 260 265 270 Ser Lys Lys Phe Lys Val Leu Gly Asn Thr Asp
Arg His Ser Ile Lys 275 280 285 Lys Asn Leu Ile Gly Ala Leu Leu Phe
Asp Ser Gly Glu Thr Ala Glu 290 295 300 Ala Thr Arg Leu Lys Arg Thr
Ala Arg Arg Arg Tyr Thr Arg Arg Lys 305 310 315 320 Asn Arg Ile Cys
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys 325 330 335 Val Asp
Asp Ser Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu 340 345 350
Glu Asp Lys Lys His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp 355
360 365 Glu Val Ala Tyr His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg
Lys 370 375 380 Lys Leu Val Asp Ser Thr Asp Lys Ala Asp Leu Arg Leu
Ile Tyr Leu 385 390 395 400 Ala Leu Ala His Met Ile Lys Phe Arg Gly
His Phe Leu Ile Glu Gly 405 410 415 Asp Leu Asn Pro Asp Asn Ser Asp
Val Asp Lys Leu Phe Ile Gln Leu 420 425 430 Val Gln Thr Tyr Asn Gln
Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser 435 440 445 Gly Val Asp Ala
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg 450 455 460 Arg Leu
Glu Asn Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly 465 470 475
480 Leu Phe Gly Asn Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe
485 490 495 Lys Ser Asn Phe Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu
Ser Lys 500 505 510 Asp Thr Tyr Asp Asp Asp Leu Asp Asn Leu Leu Ala
Gln Ile Gly Asp 515 520 525 Gln Tyr Ala Asp Leu Phe Leu Ala Ala Lys
Asn Leu Ser Asp Ala Ile 530 535 540 Leu Leu Ser Asp Ile Leu Arg Val
Asn Thr Glu Ile Thr Lys Ala Pro 545 550 555 560 Leu Ser Ala Ser Met
Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu 565 570 575 Thr Leu Leu
Lys Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys 580 585 590 Glu
Ile Phe Phe Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp 595 600
605 Gly Gly Ala Ser Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu
610 615 620 Glu Lys Met Asp Gly Thr Glu Glu Leu Leu Val Lys Leu Asn
Arg Glu 625 630 635 640 Asp Leu Leu Arg Lys Gln Arg Thr Phe Asp Asn
Gly Ser Ile Pro His 645 650 655 Gln Ile His Leu Gly Glu Leu His Ala
Ile Leu Arg Arg Gln Glu Asp 660 665 670 Phe Tyr Pro Phe Leu Lys Asp
Asn Arg Glu Lys Ile Glu Lys Ile Leu 675 680 685 Thr Phe Arg Ile Pro
Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser 690 695 700 Arg Phe Ala
Trp Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp 705 710 715 720
Asn Phe Glu Glu Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile 725
730 735 Glu Arg Met Thr Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val
Leu 740 745 750 Pro Lys His Ser Leu Leu Tyr Glu Tyr Phe Thr Val Tyr
Asn Glu Leu 755 760 765 Thr Lys Val Lys Tyr Val Thr Glu Gly Met Arg
Lys Pro Ala Phe Leu 770 775 780 Ser Gly Glu Gln Lys Lys Ala Ile Val
Asp Leu Leu Phe Lys Thr Asn 785 790 795 800 Arg Lys Val Thr Val Lys
Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile 805 810 815 Glu Cys Phe Asp
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn 820 825 830 Ala Ser
Leu Gly Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys 835 840 845
Asp Phe Leu Asp Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val 850
855 860 Leu Thr Leu Thr Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg
Leu 865 870 875 880 Lys Thr Tyr Ala His Leu Phe Asp Asp Lys Val Met
Lys Gln Leu Lys 885 890 895 Arg Arg Arg Tyr Thr Gly Trp Gly Arg Leu
Ser Arg Lys Leu Ile Asn 900 905 910 Gly Ile Arg Asp Lys Gln Ser Gly
Lys Thr Ile Leu Asp Phe Leu Lys 915 920 925 Ser Asp Gly Phe Ala Asn
Arg Asn Phe Met Gln Leu Ile His Asp Asp 930 935 940 Ser Leu Thr Phe
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln 945 950 955 960 Gly
Asp Ser Leu His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala 965 970
975 Ile Lys Lys Gly Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val
980 985 990 Lys Val Met Gly Arg His Lys Pro Glu Asn Ile Val Ile Glu
Met Ala 995 1000 1005 Arg Glu Asn Gln Thr Thr Gln Lys Gly Gln Lys
Asn Ser Arg Glu 1010 1015 1020 Arg Met Lys Arg Ile Glu Glu Gly Ile
Lys Glu Leu Gly Ser Gln 1025 1030 1035 Ile Leu Lys Glu His Pro Val
Glu Asn Thr Gln Leu Gln Asn Glu 1040 1045 1050 Lys Leu Tyr Leu Tyr
Tyr Leu Gln Asn Gly Arg Asp Met Tyr Val 1055 1060 1065 Asp Gln Glu
Leu Asp Ile Asn Arg Leu Ser Asp Tyr Asp Val Asp 1070 1075 1080 His
Ile Val Pro Gln Ser Phe Leu Lys Asp Asp Ser Ile Asp Asn 1085 1090
1095 Lys Val Leu Thr Arg Ser Asp Lys Asn Arg Gly Lys Ser Asp Asn
1100 1105 1110 Val Pro Ser Glu Glu Val Val Lys Lys Met Lys Asn Tyr
Trp Arg 1115 1120 1125 Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg
Lys Phe Asp Asn 1130 1135 1140 Leu Thr Lys Ala Glu Arg Gly Gly Leu
Ser Glu Leu Asp Lys Ala 1145 1150 1155 Gly Phe Ile Lys Arg Gln Leu
Val Glu Thr Arg Gln Ile Thr Lys 1160 1165 1170 His Val Ala Gln Ile
Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp 1175 1180 1185 Glu Asn Asp
Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys 1190 1195 1200 Ser
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys 1205 1210
1215 Val Arg Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu
1220 1225 1230 Asn Ala Val Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro
Lys Leu 1235 1240 1245 Glu Ser Glu Phe Val Tyr Gly Asp Tyr Lys Val
Tyr Asp Val Arg 1250 1255 1260 Lys Met Ile Ala Lys Ser Glu Gln Glu
Ile Gly Lys Ala Thr Ala 1265 1270 1275 Lys Tyr Phe Phe Tyr Ser Asn
Ile Met Asn Phe Phe Lys Thr Glu 1280 1285 1290 Ile Thr Leu Ala Asn
Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu 1295 1300 1305 Thr Asn Gly
Glu Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp 1310 1315 1320 Phe
Ala Thr Val Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile 1325 1330
1335 Val Lys Lys Thr Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser
1340 1345 1350 Ile Leu Pro Lys Arg Asn Ser Asp Lys Leu Ile Ala Arg
Lys Lys 1355 1360 1365 Asp Trp Asp Pro Lys Lys Tyr Gly Gly Phe Asp
Ser Pro Thr Val 1370 1375 1380 Ala Tyr Ser Val Leu Val Val Ala Lys
Val Glu Lys Gly Lys Ser 1385 1390 1395 Lys Lys Leu Lys Ser Val Lys
Glu Leu Leu Gly Ile Thr Ile Met 1400 1405 1410 Glu Arg Ser Ser Phe
Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala 1415 1420 1425 Lys Gly Tyr
Lys Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro 1430 1435 1440 Lys
Tyr Ser Leu Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu 1445 1450
1455 Ala Ser Ala Gly Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro
1460 1465 1470 Ser Lys Tyr Val Asn Phe Leu Tyr Leu Ala Ser His Tyr
Glu Lys 1475 1480 1485 Leu Lys Gly Ser Pro Glu Asp Asn Glu Gln Lys
Gln Leu Phe Val 1490 1495 1500 Glu Gln His Lys His Tyr Leu Asp Glu
Ile Ile Glu Gln Ile Ser 1505 1510 1515 Glu Phe Ser Lys Arg Val Ile
Leu Ala Asp Ala Asn Leu Asp Lys 1520 1525 1530 Val Leu Ser Ala Tyr
Asn Lys His Arg Asp Lys Pro Ile Arg Glu 1535 1540 1545 Gln Ala Glu
Asn Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly 1550 1555 1560 Ala
Pro Ala Ala Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys 1565 1570
1575 Arg Tyr Thr Ser Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His
1580 1585 1590 Gln Ser Ile Thr Gly Leu Tyr Glu Thr Arg Ile Asp Leu
Ser Gln 1595 1600 1605 Leu Gly Gly Asp Ser Gly Gly Ser Thr Asn Leu
Ser Asp Ile Ile 1610 1615 1620 Glu Lys Glu Thr Gly Lys Gln Leu Val
Ile Gln Glu Ser Ile Leu 1625 1630 1635 Met Leu Pro Glu Glu Val Glu
Glu Val Ile Gly Asn Lys Pro Glu 1640 1645 1650 Ser Asp Ile Leu Val
His Thr Ala Tyr Asp Glu Ser Thr Asp Glu 1655 1660 1665 Asn Val Met
Leu Leu Thr Ser Asp Ala Pro Glu Tyr Lys Pro Trp 1670 1675 1680 Ala
Leu Val Ile Gln Asp Ser Asn Gly Glu Asn Lys Ile Lys Met 1685 1690
1695 Leu Ser Gly Gly Ser Pro Lys Lys Lys Arg Lys Val 1700 1705 1710
<210> SEQ ID NO 51 <211> LENGTH: 198 <212> TYPE:
PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 51 Met
Asp Ser Leu Leu Met Asn Arg Arg Lys Phe Leu Tyr Gln Phe Lys 1 5 10
15 Asn Val Arg Trp Ala Lys Gly Arg Arg Glu Thr Tyr Leu Cys Tyr Val
20 25 30 Val Lys Arg Arg Asp Ser Ala Thr Ser Phe Ser Leu Asp Phe
Gly Tyr 35 40 45 Leu Arg Asn Lys Asn Gly Cys His Val Glu Leu Leu
Phe Leu Arg Tyr 50 55 60 Ile Ser Asp Trp Asp Leu Asp Pro Gly Arg
Cys Tyr Arg Val Thr Trp 65 70 75 80 Phe Thr Ser Trp Ser Pro Cys Tyr
Asp Cys Ala Arg His Val Ala Asp 85 90 95 Phe Leu Arg Gly Asn Pro
Asn Leu Ser Leu Arg Ile Phe Thr Ala Arg 100 105 110 Leu Tyr Phe Cys
Glu Asp Arg Lys Ala Glu Pro Glu Gly Leu Arg Arg 115 120 125 Leu His
Arg Ala Gly Val Gln Ile Ala Ile Met Thr Phe Lys Asp Tyr 130 135 140
Phe Tyr Cys Trp Asn Thr Phe Val Glu Asn His Glu Arg Thr Phe Lys 145
150 155 160 Ala Trp Glu Gly Leu His Glu Asn Ser Val Arg Leu Ser Arg
Gln Leu 165 170 175 Arg Arg Ile Leu Leu Pro Leu Tyr Glu Val Asp Asp
Leu Arg Asp Ala 180 185 190 Phe Arg Thr Leu Gly Leu 195 <210>
SEQ ID NO 52 <211> LENGTH: 190 <212> TYPE: PRT
<213> ORGANISM: Artificial <220> FEATURE: <223>
OTHER INFORMATION: hAIDv solubility variant lacking N-terminal
RNA-binding region <400> SEQUENCE: 52 Met Asp Pro His Ile Phe
Thr Ser Asn Phe Asn Asn Gly Ile Gly Arg 1 5 10 15 His Lys Thr Tyr
Leu Cys Tyr Glu Val Glu Arg Leu Asp Ser Ala Thr 20 25 30 Ser Phe
Ser Leu Asp Phe Gly Tyr Leu Arg Asn Lys Asn Gly Cys His 35 40 45
Val Glu Leu Leu Phe Leu Arg Tyr Ile Ser Asp Trp Asp Leu Asp Pro 50
55 60 Gly Arg Cys Tyr Arg Val Thr Trp Phe Thr Ser Trp Ser Pro Cys
Tyr 65 70 75 80 Asp Cys Ala Arg His Val Ala Asp Phe Leu Arg Gly Asn
Pro Asn Leu 85 90 95 Ser Leu Arg Ile Phe Thr Ala Arg Leu Tyr Phe
Cys Glu Asp Arg Lys 100 105 110 Ala Glu Pro Glu Gly Leu Arg Arg Leu
His Arg Ala Gly Val Gln Ile 115 120 125 Ala Ile Met Thr Phe Lys Asp
Tyr Phe Tyr Cys Trp Asn Thr Phe Val 130 135 140 Glu Asn His Glu Arg
Thr Phe Lys Ala Trp Glu Gly Leu His Glu Asn 145 150 155 160 Ser Val
Arg Leu Ser Arg Gln Leu Arg Arg Ile Leu Leu Pro Leu Tyr 165 170 175
Glu Val Asp Asp Leu Arg Asp Ala Phe Arg Thr Leu Gly Leu 180 185 190
<210> SEQ ID NO 53 <211> LENGTH: 175 <212> TYPE:
PRT <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: hAIDv solubility variant lacking
N-terminal RNA-binding region and the C-terminal poorly structured
region <400> SEQUENCE: 53 Met Asp Pro His Ile Phe Thr Ser Asn
Phe Asn Asn Gly Ile Gly Arg 1 5 10 15 His Lys Thr Tyr Leu Cys Tyr
Glu Val Glu Arg Leu Asp Ser Ala Thr 20 25 30 Ser Phe Ser Leu Asp
Phe Gly Tyr Leu Arg Asn Lys Asn Gly Cys His 35 40 45 Val Glu Leu
Leu Phe Leu Arg Tyr Ile Ser Asp Trp Asp Leu Asp Pro 50 55 60 Gly
Arg Cys Tyr Arg Val Thr Trp Phe Thr Ser Trp Ser Pro Cys Tyr 65 70
75 80 Asp Cys Ala Arg His Val Ala Asp Phe Leu Arg Gly Asn Pro Asn
Leu 85 90 95 Ser Leu Arg Ile Phe Thr Ala Arg Leu Tyr Phe Cys Glu
Asp Arg Lys 100 105 110 Ala Glu Pro Glu Gly Leu Arg Arg Leu His Arg
Ala Gly Val Gln Ile 115 120 125 Ala Ile Met Thr Phe Lys Asp Tyr Phe
Tyr Cys Trp Asn Thr Phe Val 130 135 140 Glu Asn His Glu Arg Thr Phe
Lys Ala Trp Glu Gly Leu His Glu Asn 145 150 155 160 Ser Val Arg Leu
Ser Arg Gln Leu Arg Arg Ile Leu Leu Pro Leu 165 170 175 <210>
SEQ ID NO 54 <211> LENGTH: 229 <212> TYPE: PRT
<213> ORGANISM: Rattus norvegicus <400> SEQUENCE: 54
Met Ser Ser Glu Thr Gly Pro Val Ala Val Asp Pro Thr Leu Arg Arg 1 5
10 15 Arg Ile Glu Pro His Glu Phe Glu Val Phe Phe Asp Pro Arg Glu
Leu 20 25 30 Arg Lys Glu Thr Cys Leu Leu Tyr Glu Ile Asn Trp Gly
Gly Arg His 35 40 45 Ser Ile Trp Arg His Thr Ser Gln Asn Thr Asn
Lys His Val Glu Val 50 55 60 Asn Phe Ile Glu Lys Phe Thr Thr Glu
Arg Tyr Phe Cys Pro Asn Thr 65 70 75 80 Arg Cys Ser Ile Thr Trp Phe
Leu Ser Trp Ser Pro Cys Gly Glu Cys 85 90 95 Ser Arg Ala Ile Thr
Glu Phe Leu Ser Arg Tyr Pro His Val Thr Leu 100 105 110 Phe Ile Tyr
Ile Ala Arg Leu Tyr His His Ala Asp Pro Arg Asn Arg 115 120 125 Gln
Gly Leu Arg Asp Leu Ile Ser Ser Gly Val Thr Ile Gln Ile Met 130 135
140 Thr Glu Gln Glu Ser Gly Tyr Cys Trp Arg Asn Phe Val Asn Tyr Ser
145 150 155 160 Pro Ser Asn Glu Ala His Trp Pro Arg Tyr Pro His Leu
Trp Val Arg 165 170 175 Leu Tyr Val Leu Glu Leu Tyr Cys Ile Ile Leu
Gly Leu Pro Pro Cys 180 185 190 Leu Asn Ile Leu Arg Arg Lys Gln Pro
Gln Leu Thr Phe Phe Thr Ile 195 200 205 Ala Leu Gln Ser Cys His Tyr
Gln Arg Leu Pro Pro His Ile Leu Trp 210 215 220 Ala Thr Gly Leu Lys
225 <210> SEQ ID NO 55 <211> LENGTH: 397 <212>
TYPE: PRT <213> ORGANISM: Mus musculus <400> SEQUENCE:
55 Met Gly Pro Phe Cys Leu Gly Cys Ser His Arg Lys Cys Tyr Ser Pro
1 5 10 15 Ile Arg Asn Leu Ile Ser Gln Glu Thr Phe Lys Phe His Phe
Lys Asn 20 25 30 Leu Gly Tyr Ala Lys Gly Arg Lys Asp Thr Phe Leu
Cys Tyr Glu Val 35 40 45 Thr Arg Lys Asp Cys Asp Ser Pro Val Ser
Leu His His Gly Val Phe 50 55 60 Lys Asn Lys Asp Asn Ile His Ala
Glu Ile Cys Phe Leu Tyr Trp Phe 65 70 75 80 His Asp Lys Val Leu Lys
Val Leu Ser Pro Arg Glu Glu Phe Lys Ile 85 90 95 Thr Trp Tyr Met
Ser Trp Ser Pro Cys Phe Glu Cys Ala Glu Gln Ile 100 105 110 Val Arg
Phe Leu Ala Thr His His Asn Leu Ser Leu Asp Ile Phe Ser 115 120 125
Ser Arg Leu Tyr Asn Val Gln Asp Pro Glu Thr Gln Gln Asn Leu Cys 130
135 140 Arg Leu Val Gln Glu Gly Ala Gln Val Ala Ala Met Asp Leu Tyr
Glu 145 150 155 160 Phe Lys Lys Cys Trp Lys Lys Phe Val Asp Asn Gly
Gly Arg Arg Phe 165 170 175 Arg Pro Trp Lys Arg Leu Leu Thr Asn Phe
Arg Tyr Gln Asp Ser Lys 180 185 190 Leu Gln Glu Ile Leu Arg Arg Met
Asp Pro Leu Ser Glu Glu Glu Phe 195 200 205 Tyr Ser Gln Phe Tyr Asn
Gln Arg Val Lys His Leu Cys Tyr Tyr His 210 215 220 Arg Met Lys Pro
Tyr Leu Cys Tyr Gln Leu Glu Gln Phe Asn Gly Gln 225 230 235 240 Ala
Pro Leu Lys Gly Cys Leu Leu Ser Glu Lys Gly Lys Gln His Ala 245 250
255 Glu Ile Leu Phe Leu Asp Lys Ile Arg Ser Met Glu Leu Ser Gln Val
260 265 270 Thr Ile Thr Cys Tyr Leu Thr Trp Ser Pro Cys Pro Asn Cys
Ala Trp 275 280 285 Gln Leu Ala Ala Phe Lys Arg Asp Arg Pro Asp Leu
Ile Leu His Ile 290 295 300 Tyr Thr Ser Arg Leu Tyr Phe His Trp Lys
Arg Pro Phe Gln Lys Gly 305 310 315 320 Leu Cys Ser Leu Trp Gln Ser
Gly Ile Leu Val Asp Val Met Asp Leu 325 330 335 Pro Gln Phe Thr Asp
Cys Trp Thr Asn Phe Val Asn Pro Lys Arg Pro 340 345 350 Phe Arg Pro
Trp Lys Gly Leu Glu Ile Ile Ser Arg Arg Thr Gln Arg 355 360 365 Arg
Leu Arg Arg Ile Lys Glu Ser Trp Gly Leu Gln Asp Leu Val Asn 370 375
380 Asp Phe Gly Asn Leu Gln Leu Gly Pro Pro Met Ser Asn 385 390 395
<210> SEQ ID NO 56 <211> LENGTH: 199 <212> TYPE:
PRT <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: mAPOBEC3 catalytic domain
<400> SEQUENCE: 56 Met Gly Pro Phe Cys Leu Gly Cys Ser His
Arg Lys Cys Tyr Ser Pro 1 5 10 15 Ile Arg Asn Leu Ile Ser Gln Glu
Thr Phe Lys Phe His Phe Lys Asn 20 25 30 Leu Gly Tyr Ala Lys Gly
Arg Lys Asp Thr Phe Leu Cys Tyr Glu Val 35 40 45 Thr Arg Lys Asp
Cys Asp Ser Pro Val Ser Leu His His Gly Val Phe 50 55 60 Lys Asn
Lys Asp Asn Ile His Ala Glu Ile Cys Phe Leu Tyr Trp Phe 65 70 75 80
His Asp Lys Val Leu Lys Val Leu Ser Pro Arg Glu Glu Phe Lys Ile 85
90 95 Thr Trp Tyr Met Ser Trp Ser Pro Cys Phe Glu Cys Ala Glu Gln
Ile 100 105 110 Val Arg Phe Leu Ala Thr His His Asn Leu Ser Leu Asp
Ile Phe Ser 115 120 125 Ser Arg Leu Tyr Asn Val Gln Asp Pro Glu Thr
Gln Gln Asn Leu Cys 130 135 140 Arg Leu Val Gln Glu Gly Ala Gln Val
Ala Ala Met Asp Leu Tyr Glu 145 150 155 160 Phe Lys Lys Cys Trp Lys
Lys Phe Val Asp Asn Gly Gly Arg Arg Phe 165 170 175 Arg Pro Trp Lys
Arg Leu Leu Thr Asn Phe Arg Tyr Gln Asp Ser Lys 180 185 190 Leu Gln
Glu Ile Leu Arg Arg 195 <210> SEQ ID NO 57 <211>
LENGTH: 199 <212> TYPE: PRT <213> ORGANISM: Homo
sapiens <400> SEQUENCE: 57 Met Glu Ala Ser Pro Ala Ser Gly
Pro Arg His Leu Met Asp Pro His 1 5 10 15 Ile Phe Thr Ser Asn Phe
Asn Asn Gly Ile Gly Arg His Lys Thr Tyr 20 25 30 Leu Cys Tyr Glu
Val Glu Arg Leu Asp Asn Gly Thr Ser Val Lys Met 35 40 45 Asp Gln
His Arg Gly Phe Leu His Asn Gln Ala Lys Asn Leu Leu Cys 50 55 60
Gly Phe Tyr Gly Arg His Ala Glu Leu Arg Phe Leu Asp Leu Val Pro 65
70 75 80 Ser Leu Gln Leu Asp Pro Ala Gln Ile Tyr Arg Val Thr Trp
Phe Ile 85 90 95 Ser Trp Ser Pro Cys Phe Ser Trp Gly Cys Ala Gly
Glu Val Arg Ala 100 105 110 Phe Leu Gln Glu Asn Thr His Val Arg Leu
Arg Ile Phe Ala Ala Arg 115 120 125 Ile Tyr Asp Tyr Asp Pro Leu Tyr
Lys Glu Ala Leu Gln Met Leu Arg 130 135 140 Asp Ala Gly Ala Gln Val
Ser Ile Met Thr Tyr Asp Glu Phe Lys His 145 150 155 160 Cys Trp Asp
Thr Phe Val Asp His Gln Gly Cys Pro Phe Gln Pro Trp 165 170 175 Asp
Gly Leu Asp Glu His Ser Gln Ala Leu Ser Gly Arg Leu Arg Ala 180 185
190 Ile Leu Gln Asn Gln Gly Asn 195 <210> SEQ ID NO 58
<211> LENGTH: 384 <212> TYPE: PRT <213> ORGANISM:
Homo sapiens <400> SEQUENCE: 58 Met Lys Pro His Phe Arg Asn
Thr Val Glu Arg Met Tyr Arg Asp Thr 1 5 10 15 Phe Ser Tyr Asn Phe
Tyr Asn Arg Pro Ile Leu Ser Arg Arg Asn Thr 20 25 30 Val Trp Leu
Cys Tyr Glu Val Lys Thr Lys Gly Pro Ser Arg Pro Pro 35 40 45 Leu
Asp Ala Lys Ile Phe Arg Gly Gln Val Tyr Ser Glu Leu Lys Tyr 50 55
60 His Pro Glu Met Arg Phe Phe His Trp Phe Ser Lys Trp Arg Lys Leu
65 70 75 80 His Arg Asp Gln Glu Tyr Glu Val Thr Trp Tyr Ile Ser Trp
Ser Pro 85 90 95 Cys Thr Lys Cys Thr Arg Asp Met Ala Thr Phe Leu
Ala Glu Asp Pro 100 105 110 Lys Val Thr Leu Thr Ile Phe Val Ala Arg
Leu Tyr Tyr Phe Trp Asp 115 120 125 Pro Asp Tyr Gln Glu Ala Leu Arg
Ser Leu Cys Gln Lys Arg Asp Gly 130 135 140 Pro Arg Ala Thr Met Lys
Ile Met Asn Tyr Asp Glu Phe Gln His Cys 145 150 155 160 Trp Ser Lys
Phe Val Tyr Ser Gln Arg Glu Leu Phe Glu Pro Trp Asn 165 170 175 Asn
Leu Pro Lys Tyr Tyr Ile Leu Leu His Ile Met Leu Gly Glu Ile 180 185
190 Leu Arg His Ser Met Asp Pro Pro Thr Phe Thr Phe Asn Phe Asn Asn
195 200 205 Glu Pro Trp Val Arg Gly Arg His Glu Thr Tyr Leu Cys Tyr
Glu Val 210 215 220 Glu Arg Met His Asn Asp Thr Trp Val Leu Leu Asn
Gln Arg Arg Gly 225 230 235 240 Phe Leu Cys Asn Gln Ala Pro His Lys
His Gly Phe Leu Glu Gly Arg 245 250 255 His Ala Glu Leu Cys Phe Leu
Asp Val Ile Pro Phe Trp Lys Leu Asp 260 265 270 Leu Asp Gln Asp Tyr
Arg Val Thr Cys Phe Thr Ser Trp Ser Pro Cys 275 280 285 Phe Ser Cys
Ala Gln Glu Met Ala Lys Phe Ile Ser Lys Asn Lys His 290 295 300 Val
Ser Leu Cys Ile Phe Thr Ala Arg Ile Tyr Asp Asp Gln Gly Arg 305 310
315 320 Cys Gln Glu Gly Leu Arg Thr Leu Ala Glu Ala Gly Ala Lys Ile
Ser 325 330 335 Ile Met Thr Tyr Ser Glu Phe Lys His Cys Trp Asp Thr
Phe Val Asp 340 345 350 His Gln Gly Cys Pro Phe Gln Pro Trp Asp Gly
Leu Asp Glu His Ser 355 360 365 Gln Asp Leu Ser Gly Arg Leu Arg Ala
Ile Leu Gln Asn Gln Glu Asn 370 375 380 <210> SEQ ID NO 59
<211> LENGTH: 186 <212> TYPE: PRT <213> ORGANISM:
Artificial <220> FEATURE: <223> OTHER INFORMATION:
hAPOBEC3G catalytic domain <400> SEQUENCE: 59 Pro Pro Thr Phe
Thr Phe Asn Phe Asn Asn Glu Pro Trp Val Arg Gly 1 5 10 15 Arg His
Glu Thr Tyr Leu Cys Tyr Glu Val Glu Arg Met His Asn Asp 20 25 30
Thr Trp Val Leu Leu Asn Gln Arg Arg Gly Phe Leu Cys Asn Gln Ala 35
40 45 Pro His Lys His Gly Phe Leu Glu Gly Arg His Ala Glu Leu Cys
Phe 50 55 60 Leu Asp Val Ile Pro Phe Trp Lys Leu Asp Leu Asp Gln
Asp Tyr Arg 65 70 75 80 Val Thr Cys Phe Thr Ser Trp Ser Pro Cys Phe
Ser Cys Ala Gln Glu 85 90 95 Met Ala Lys Phe Ile Ser Lys Asn Lys
His Val Ser Leu Cys Ile Phe 100 105 110 Thr Ala Arg Ile Tyr Asp Asp
Gln Gly Arg Cys Gln Glu Gly Leu Arg 115 120 125 Thr Leu Ala Glu Ala
Gly Ala Lys Ile Ser Ile Met Thr Tyr Ser Glu 130 135 140 Phe Lys His
Cys Trp Asp Thr Phe Val Asp His Gln Gly Cys Pro Phe 145 150 155 160
Gln Pro Trp Asp Gly Leu Asp Glu His Ser Gln Asp Leu Ser Gly Arg 165
170 175 Leu Arg Ala Ile Leu Gln Asn Gln Glu Asn 180 185 <210>
SEQ ID NO 60 <211> LENGTH: 200 <212> TYPE: PRT
<213> ORGANISM: Homo sapiens <400> SEQUENCE: 60 Met Ala
Leu Leu Thr Ala Glu Thr Phe Arg Leu Gln Phe Asn Asn Lys 1 5 10 15
Arg Arg Leu Arg Arg Pro Tyr Tyr Pro Arg Lys Ala Leu Leu Cys Tyr 20
25 30 Gln Leu Thr Pro Gln Asn Gly Ser Thr Pro Thr Arg Gly Tyr Phe
Glu 35 40 45 Asn Lys Lys Lys Cys His Ala Glu Ile Cys Phe Ile Asn
Glu Ile Lys 50 55 60 Ser Met Gly Leu Asp Glu Thr Gln Cys Tyr Gln
Val Thr Cys Tyr Leu 65 70 75 80 Thr Trp Ser Pro Cys Ser Ser Cys Ala
Trp Glu Leu Val Asp Phe Ile 85 90 95 Lys Ala His Asp His Leu Asn
Leu Gly Ile Phe Ala Ser Arg Leu Tyr 100 105 110 Tyr His Trp Cys Lys
Pro Gln Gln Lys Gly Leu Arg Leu Leu Cys Gly 115 120 125 Ser Gln Val
Pro Val Glu Val Met Gly Phe Pro Lys Phe Ala Asp Cys 130 135 140 Trp
Glu Asn Phe Val Asp His Glu Lys Pro Leu Ser Phe Asn Pro Tyr 145 150
155 160 Lys Met Leu Glu Glu Leu Asp Lys Asn Ser Arg Ala Ile Lys Arg
Arg 165 170 175 Leu Glu Arg Ile Lys Ile Pro Gly Val Arg Ala Gln Gly
Arg Tyr Met 180 185 190 Asp Ile Leu Cys Asp Ala Glu Val 195 200
<210> SEQ ID NO 61 <211> LENGTH: 373 <212> TYPE:
PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 61 Met
Lys Pro His Phe Arg Asn Thr Val Glu Arg Met Tyr Arg Asp Thr 1 5 10
15 Phe Ser Tyr Asn Phe Tyr Asn Arg Pro Ile Leu Ser Arg Arg Asn Thr
20 25 30 Val Trp Leu Cys Tyr Glu Val Lys Thr Lys Gly Pro Ser Arg
Pro Arg 35 40 45 Leu Asp Ala Lys Ile Phe Arg Gly Gln Val Tyr Ser
Gln Pro Glu His 50 55 60 His Ala Glu Met Cys Phe Leu Ser Trp Phe
Cys Gly Asn Gln Leu Pro 65 70 75 80 Ala Tyr Lys Cys Phe Gln Ile Thr
Trp Phe Val Ser Trp Thr Pro Cys 85 90 95 Pro Asp Cys Val Ala Lys
Leu Ala Glu Phe Leu Ala Glu His Pro Asn 100 105 110 Val Thr Leu Thr
Ile Ser Ala Ala Arg Leu Tyr Tyr Tyr Trp Glu Arg 115 120 125 Asp Tyr
Arg Arg Ala Leu Cys Arg Leu Ser Gln Ala Gly Ala Arg Val 130 135 140
Lys Ile Met Asp Asp Glu Glu Phe Ala Tyr Cys Trp Glu Asn Phe Val 145
150 155 160 Tyr Ser Glu Gly Gln Pro Phe Met Pro Trp Tyr Lys Phe Asp
Asp Asn 165 170 175 Tyr Ala Phe Leu His Arg Thr Leu Lys Glu Ile Leu
Arg Asn Pro Met 180 185 190 Glu Ala Met Tyr Pro His Ile Phe Tyr Phe
His Phe Lys Asn Leu Arg 195 200 205 Lys Ala Tyr Gly Arg Asn Glu Ser
Trp Leu Cys Phe Thr Met Glu Val 210 215 220 Val Lys His His Ser Pro
Val Ser Trp Lys Arg Gly Val Phe Arg Asn 225 230 235 240 Gln Val Asp
Pro Glu Thr His Cys His Ala Glu Arg Cys Phe Leu Ser 245 250 255 Trp
Phe Cys Asp Asp Ile Leu Ser Pro Asn Thr Asn Tyr Glu Val Thr 260 265
270 Trp Tyr Thr Ser Trp Ser Pro Cys Pro Glu Cys Ala Gly Glu Val Ala
275 280 285 Glu Phe Leu Ala Arg His Ser Asn Val Asn Leu Thr Ile Phe
Thr Ala 290 295 300 Arg Leu Tyr Tyr Phe Trp Asp Thr Asp Tyr Gln Glu
Gly Leu Arg Ser 305 310 315 320 Leu Ser Gln Glu Gly Ala Ser Val Glu
Ile Met Gly Tyr Lys Asp Phe 325 330 335 Lys Tyr Cys Trp Glu Asn Phe
Val Tyr Asn Asp Asp Glu Pro Phe Lys 340 345 350 Pro Trp Lys Gly Leu
Lys Tyr Asn Phe Leu Phe Leu Asp Ser Lys Leu 355 360 365 Gln Glu Ile
Leu Glu 370 <210> SEQ ID NO 62 <211> LENGTH: 189
<212> TYPE: PRT <213> ORGANISM: Artificial <220>
FEATURE: <223> OTHER INFORMATION: hAPOBEC3F catalytic domain
<400> SEQUENCE: 62 Lys Glu Ile Leu Arg Asn Pro Met Glu Ala
Met Tyr Pro His Ile Phe 1 5 10 15 Tyr Phe His Phe Lys Asn Leu Arg
Lys Ala Tyr Gly Arg Asn Glu Ser 20 25 30 Trp Leu Cys Phe Thr Met
Glu Val Val Lys His His Ser Pro Val Ser 35 40 45 Trp Lys Arg Gly
Val Phe Arg Asn Gln Val Asp Pro Glu Thr His Cys 50 55 60 His Ala
Glu Arg Cys Phe Leu Ser Trp Phe Cys Asp Asp Ile Leu Ser 65 70 75 80
Pro Asn Thr Asn Tyr Glu Val Thr Trp Tyr Thr Ser Trp Ser Pro Cys 85
90 95 Pro Glu Cys Ala Gly Glu Val Ala Glu Phe Leu Ala Arg His Ser
Asn 100 105 110 Val Asn Leu Thr Ile Phe Thr Ala Arg Leu Tyr Tyr Phe
Trp Asp Thr 115 120 125 Asp Tyr Gln Glu Gly Leu Arg Ser Leu Ser Gln
Glu Gly Ala Ser Val 130 135 140 Glu Ile Met Gly Tyr Lys Asp Phe Lys
Tyr Cys Trp Glu Asn Phe Val 145 150 155 160 Tyr Asn Asp Asp Glu Pro
Phe Lys Pro Trp Lys Gly Leu Lys Tyr Asn 165 170 175 Phe Leu Phe Leu
Asp Ser Lys Leu Gln Glu Ile Leu Glu 180 185 <210> SEQ ID NO
63 <211> LENGTH: 1053 <212> TYPE: PRT <213>
ORGANISM: Staphylococcus aureus <400> SEQUENCE: 63 Met Lys
Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val 1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly 20
25 30 Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg
Arg 35 40 45 Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg
His Arg Ile 50 55 60 Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn
Leu Leu Thr Asp His 65 70 75 80 Ser Glu Leu Ser Gly Ile Asn Pro Tyr
Glu Ala Arg Val Lys Gly Leu 85 90 95 Ser Gln Lys Leu Ser Glu Glu
Glu Phe Ser Ala Ala Leu Leu His Leu 100 105 110 Ala Lys Arg Arg Gly
Val His Asn Val Asn Glu Val Glu Glu Asp Thr 115 120 125 Gly Asn Glu
Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala 130 135 140 Leu
Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys 145 150
155 160 Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp
Tyr 165 170 175 Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala
Tyr His Gln 180 185 190 Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp
Leu Leu Glu Thr Arg 195 200 205 Arg Thr Tyr Tyr Glu Gly Pro Gly Glu
Gly Ser Pro Phe Gly Trp Lys 210 215 220 Asp Ile Lys Glu Trp Tyr Glu
Met Leu Met Gly His Cys Thr Tyr Phe 225 230 235 240 Pro Glu Glu Leu
Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr 245 250 255 Asn Ala
Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn 260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe 275
280 285 Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile
Leu 290 295 300 Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser
Thr Gly Lys 305 310 315 320 Pro Glu Phe Thr Asn Leu Lys Val Tyr His
Asp Ile Lys Asp Ile Thr 325 330 335 Ala Arg Lys Glu Ile Ile Glu Asn
Ala Glu Leu Leu Asp Gln Ile Ala 340 345 350 Lys Ile Leu Thr Ile Tyr
Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu 355 360 365 Thr Asn Leu Asn
Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser 370 375 380 Asn Leu
Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile 385 390 395
400 Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415 Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu
Ser Gln 420 425 430 Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe
Ile Leu Ser Pro 435 440 445 Val Val Lys Arg Ser Phe Ile Gln Ser Ile
Lys Val Ile Asn Ala Ile 450 455 460 Ile Lys Lys Tyr Gly Leu Pro Asn
Asp Ile Ile Ile Glu Leu Ala Arg 465 470 475 480 Glu Lys Asn Ser Lys
Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys 485 490 495 Arg Asn Arg
Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr 500 505 510 Gly
Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp 515 520
525 Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540 Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile
Ile Pro 545 550 555 560 Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn
Lys Val Leu Val Lys 565 570 575 Gln Glu Glu Asn Ser Lys Lys Gly Asn
Arg Thr Pro Phe Gln Tyr Leu 580 585 590 Ser Ser Ser Asp Ser Lys Ile
Ser Tyr Glu Thr Phe Lys Lys His Ile 595 600 605 Leu Asn Leu Ala Lys
Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu 610 615 620 Tyr Leu Leu
Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp 625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu 645
650 655 Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val
Lys 660 665 670 Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg
Arg Lys Trp 675 680 685 Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys
His His Ala Glu Asp 690 695 700 Ala Leu Ile Ile Ala Asn Ala Asp Phe
Ile Phe Lys Glu Trp Lys Lys 705 710 715 720 Leu Asp Lys Ala Lys Lys
Val Met Glu Asn Gln Met Phe Glu Glu Lys 725 730 735 Gln Ala Glu Ser
Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu 740 745 750 Ile Phe
Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp 755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Glu Leu Ile 770
775 780 Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr
Leu 785 790 795 800 Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp
Asn Asp Lys Leu 805 810 815 Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys
Leu Leu Met Tyr His His 820 825 830 Asp Pro Gln Thr Tyr Gln Lys Leu
Lys Leu Ile Met Glu Gln Tyr Gly 835 840 845 Asp Glu Lys Asn Pro Leu
Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr 850 855 860 Leu Thr Lys Tyr
Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile 865 870 875 880 Lys
Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp 885 890
895 Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910 Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val
Thr Val 915 920 925 Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr
Glu Val Asn Ser 930 935 940 Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys
Lys Ile Ser Asn Gln Ala 945 950 955 960 Glu Phe Ile Ala Ser Phe Tyr
Asn Asn Asp Leu Ile Lys Ile Asn Gly 965 970 975 Glu Leu Tyr Arg Val
Ile Gly Val Asn Asn Asp Leu Leu Asn Arg Ile 980 985 990 Glu Val Asn
Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met 995 1000 1005
Asn Asp Lys Arg Pro Pro Arg Ile Ile Lys Thr Ile Ala Ser Lys 1010
1015 1020 Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn
Leu 1025 1030 1035 Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile
Lys Lys Gly 1040 1045 1050 <210> SEQ ID NO 64 <211>
LENGTH: 984 <212> TYPE: PRT <213> ORGANISM:
Campylobacter jejuni <400> SEQUENCE: 64 Met Ala Arg Ile Leu
Ala Phe Asp Ile Gly Ile Ser Ser Ile Gly Trp 1 5 10 15 Ala Phe Ser
Glu Asn Asp Glu Leu Lys Asp Cys Gly Val Arg Ile Phe 20 25 30 Thr
Lys Val Glu Asn Pro Lys Thr Gly Glu Ser Leu Ala Leu Pro Arg 35 40
45 Arg Leu Ala Arg Ser Ala Arg Lys Arg Leu Ala Arg Arg Lys Ala Arg
50 55 60 Leu Asn His Leu Lys His Leu Ile Ala Asn Glu Phe Lys Leu
Asn Tyr 65 70 75 80 Glu Asp Tyr Gln Ser Phe Asp Glu Ser Leu Ala Lys
Ala Tyr Lys Gly 85 90 95 Ser Leu Ile Ser Pro Tyr Glu Leu Arg Phe
Arg Ala Leu Asn Glu Leu 100 105 110 Leu Ser Lys Gln Asp Phe Ala Arg
Val Ile Leu His Ile Ala Lys Arg 115 120 125 Arg Gly Tyr Asp Asp Ile
Lys Asn Ser Asp Asp Lys Glu Lys Gly Ala 130 135 140 Ile Leu Lys Ala
Ile Lys Gln Asn Glu Glu Lys Leu Ala Asn Tyr Gln 145 150 155 160 Ser
Val Gly Glu Tyr Leu Tyr Lys Glu Tyr Phe Gln Lys Phe Lys Glu 165 170
175 Asn Ser Lys Glu Phe Thr Asn Val Arg Asn Lys Lys Glu Ser Tyr Glu
180 185 190 Arg Cys Ile Ala Gln Ser Phe Leu Lys Asp Glu Leu Lys Leu
Ile Phe 195 200 205 Lys Lys Gln Arg Glu Phe Gly Phe Ser Phe Ser Lys
Lys Phe Glu Glu 210 215 220 Glu Val Leu Ser Val Ala Phe Tyr Lys Arg
Ala Leu Lys Asp Phe Ser 225 230 235 240 His Leu Val Gly Asn Cys Ser
Phe Phe Thr Asp Glu Lys Arg Ala Pro 245 250 255 Lys Asn Ser Pro Leu
Ala Phe Met Phe Val Ala Leu Thr Arg Ile Ile 260 265 270 Asn Leu Leu
Asn Asn Leu Lys Asn Thr Glu Gly Ile Leu Tyr Thr Lys 275 280 285 Asp
Asp Leu Asn Ala Leu Leu Asn Glu Val Leu Lys Asn Gly Thr Leu 290 295
300 Thr Tyr Lys Gln Thr Lys Lys Leu Leu Gly Leu Ser Asp Asp Tyr Glu
305 310 315 320 Phe Lys Gly Glu Lys Gly Thr Tyr Phe Ile Glu Phe Lys
Lys Tyr Lys 325 330 335 Glu Phe Ile Lys Ala Leu Gly Glu His Asn Leu
Ser Gln Asp Asp Leu 340 345 350 Asn Glu Ile Ala Lys Asp Ile Thr Leu
Ile Lys Asp Glu Ile Lys Leu 355 360 365 Lys Lys Ala Leu Ala Lys Tyr
Asp Leu Asn Gln Asn Gln Ile Asp Ser 370 375 380 Leu Ser Lys Leu Glu
Phe Lys Asp His Leu Asn Ile Ser Phe Lys Ala 385 390 395 400 Leu Lys
Leu Val Thr Pro Leu Met Leu Glu Gly Lys Lys Tyr Asp Glu 405 410 415
Ala Cys Asn Glu Leu Asn Leu Lys Val Ala Ile Asn Glu Asp Lys Lys 420
425 430 Asp Phe Leu Pro Ala Phe Asn Glu Thr Tyr Tyr Lys Asp Glu Val
Thr 435 440 445 Asn Pro Val Val Leu Arg Ala Ile Lys Glu Tyr Arg Lys
Val Leu Asn 450 455 460 Ala Leu Leu Lys Lys Tyr Gly Lys Val His Lys
Ile Asn Ile Glu Leu 465 470 475 480 Ala Arg Glu Val Gly Lys Asn His
Ser Gln Arg Ala Lys Ile Glu Lys 485 490 495 Glu Gln Asn Glu Asn Tyr
Lys Ala Lys Lys Asp Ala Glu Leu Glu Cys 500 505 510 Glu Lys Leu Gly
Leu Lys Ile Asn Ser Lys Asn Ile Leu Lys Leu Arg 515 520 525 Leu Phe
Lys Glu Gln Lys Glu Phe Cys Ala Tyr Ser Gly Glu Lys Ile 530 535 540
Lys Ile Ser Asp Leu Gln Asp Glu Lys Met Leu Glu Ile Asp His Ile 545
550 555 560 Tyr Pro Tyr Ser Arg Ser Phe Asp Asp Ser Tyr Met Asn Lys
Val Leu 565 570 575 Val Phe Thr Lys Gln Asn Gln Glu Lys Leu Asn Gln
Thr Pro Phe Glu 580 585 590 Ala Phe Gly Asn Asp Ser Ala Lys Trp Gln
Lys Ile Glu Val Leu Ala 595 600 605 Lys Asn Leu Pro Thr Lys Lys Gln
Lys Arg Ile Leu Asp Lys Asn Tyr 610 615 620 Lys Asp Lys Glu Gln Lys
Asn Phe Lys Asp Arg Asn Leu Asn Asp Thr 625 630 635 640 Arg Tyr Ile
Ala Arg Leu Val Leu Asn Tyr Thr Lys Asp Tyr Leu Asp 645 650 655 Phe
Leu Pro Leu Ser Asp Asp Glu Asn Thr Lys Leu Asn Asp Thr Gln 660 665
670 Lys Gly Ser Lys Val His Val Glu Ala Lys Ser Gly Met Leu Thr Ser
675 680 685 Ala Leu Arg His Thr Trp Gly Phe Ser Ala Lys Asp Arg Asn
Asn His 690 695 700 Leu His His Ala Ile Asp Ala Val Ile Ile Ala Tyr
Ala Asn Asn Ser 705 710 715 720 Ile Val Lys Ala Phe Ser Asp Phe Lys
Lys Glu Gln Glu Ser Asn Ser 725 730 735 Ala Glu Leu Tyr Ala Lys Lys
Ile Ser Glu Leu Asp Tyr Lys Asn Lys 740 745 750 Arg Lys Phe Phe Glu
Pro Phe Ser Gly Phe Arg Gln Lys Val Leu Asp 755 760 765 Lys Ile Asp
Glu Ile Phe Val Ser Lys Pro Glu Arg Lys Lys Pro Ser 770 775 780 Gly
Ala Leu His Glu Glu Thr Phe Arg Lys Glu Glu Glu Phe Tyr Gln 785 790
795 800 Ser Tyr Gly Gly Lys Glu Gly Val Leu Lys Ala Leu Glu Leu Gly
Lys 805 810 815 Ile Arg Lys Val Asn Gly Lys Ile Val Lys Asn Gly Asp
Met Phe Arg 820 825 830 Val Asp Ile Phe Lys His Lys Lys Thr Asn Lys
Phe Tyr Ala Val Pro 835 840 845 Ile Tyr Thr Met Asp Phe Ala Leu Lys
Val Leu Pro Asn Lys Ala Val 850 855 860 Ala Arg Ser Lys Lys Gly Glu
Ile Lys Asp Trp Ile Leu Met Asp Glu 865 870 875 880 Asn Tyr Glu Phe
Cys Phe Ser Leu Tyr Lys Asp Ser Leu Ile Leu Ile 885 890 895 Gln Thr
Lys Asp Met Gln Glu Pro Glu Phe Val Tyr Tyr Asn Ala Phe 900 905 910
Thr Ser Ser Thr Val Ser Leu Ile Val Ser Lys His Asp Asn Lys Phe 915
920 925 Glu Thr Leu Ser Lys Asn Gln Lys Ile Leu Phe Lys Asn Ala Asn
Glu 930 935 940 Lys Glu Val Ile Ala Lys Ser Ile Gly Ile Gln Asn Leu
Lys Val Phe 945 950 955 960 Glu Lys Tyr Ile Val Ser Ala Leu Gly Glu
Val Thr Lys Ala Glu Phe 965 970 975 Arg Gln Arg Glu Asp Phe Lys Lys
980 <210> SEQ ID NO 65 <211> LENGTH: 1037 <212>
TYPE: PRT <213> ORGANISM: Parvibaculum lavamentivorans
<400> SEQUENCE: 65 Met Glu Arg Ile Phe Gly Phe Asp Ile Gly
Thr Thr Ser Ile Gly Phe 1 5 10 15 Ser Val Ile Asp Tyr Ser Ser Thr
Gln Ser Ala Gly Asn Ile Gln Arg 20 25 30 Leu Gly Val Arg Ile Phe
Pro Glu Ala Arg Asp Pro Asp Gly Thr Pro 35 40 45 Leu Asn Gln Gln
Arg Arg Gln Lys Arg Met Met Arg Arg Gln Leu Arg 50 55 60 Arg Arg
Arg Ile Arg Arg Lys Ala Leu Asn Glu Thr Leu His Glu Ala 65 70 75 80
Gly Phe Leu Pro Ala Tyr Gly Ser Ala Asp Trp Pro Val Val Met Ala 85
90 95 Asp Glu Pro Tyr Glu Leu Arg Arg Arg Gly Leu Glu Glu Gly Leu
Ser 100 105 110 Ala Tyr Glu Phe Gly Arg Ala Ile Tyr His Leu Ala Gln
His Arg His 115 120 125 Phe Lys Gly Arg Glu Leu Glu Glu Ser Asp Thr
Pro Asp Pro Asp Val 130 135 140 Asp Asp Glu Lys Glu Ala Ala Asn Glu
Arg Ala Ala Thr Leu Lys Ala 145 150 155 160 Leu Lys Asn Glu Gln Thr
Thr Leu Gly Ala Trp Leu Ala Arg Arg Pro 165 170 175 Pro Ser Asp Arg
Lys Arg Gly Ile His Ala His Arg Asn Val Val Ala 180 185 190 Glu Glu
Phe Glu Arg Leu Trp Glu Val Gln Ser Lys Phe His Pro Ala 195 200 205
Leu Lys Ser Glu Glu Met Arg Ala Arg Ile Ser Asp Thr Ile Phe Ala 210
215 220 Gln Arg Pro Val Phe Trp Arg Lys Asn Thr Leu Gly Glu Cys Arg
Phe 225 230 235 240 Met Pro Gly Glu Pro Leu Cys Pro Lys Gly Ser Trp
Leu Ser Gln Gln 245 250 255 Arg Arg Met Leu Glu Lys Leu Asn Asn Leu
Ala Ile Ala Gly Gly Asn 260 265 270 Ala Arg Pro Leu Asp Ala Glu Glu
Arg Asp Ala Ile Leu Ser Lys Leu 275 280 285 Gln Gln Gln Ala Ser Met
Ser Trp Pro Gly Val Arg Ser Ala Leu Lys 290 295 300 Ala Leu Tyr Lys
Gln Arg Gly Glu Pro Gly Ala Glu Lys Ser Leu Lys 305 310 315 320 Phe
Asn Leu Glu Leu Gly Gly Glu Ser Lys Leu Leu Gly Asn Ala Leu 325 330
335 Glu Ala Lys Leu Ala Asp Met Phe Gly Pro Asp Trp Pro Ala His Pro
340 345 350 Arg Lys Gln Glu Ile Arg His Ala Val His Glu Arg Leu Trp
Ala Ala 355 360 365 Asp Tyr Gly Glu Thr Pro Asp Lys Lys Arg Val Ile
Ile Leu Ser Glu 370 375 380 Lys Asp Arg Lys Ala His Arg Glu Ala Ala
Ala Asn Ser Phe Val Ala 385 390 395 400 Asp Phe Gly Ile Thr Gly Glu
Gln Ala Ala Gln Leu Gln Ala Leu Lys 405 410 415 Leu Pro Thr Gly Trp
Glu Pro Tyr Ser Ile Pro Ala Leu Asn Leu Phe 420 425 430 Leu Ala Glu
Leu Glu Lys Gly Glu Arg Phe Gly Ala Leu Val Asn Gly 435 440 445 Pro
Asp Trp Glu Gly Trp Arg Arg Thr Asn Phe Pro His Arg Asn Gln 450 455
460 Pro Thr Gly Glu Ile Leu Asp Lys Leu Pro Ser Pro Ala Ser Lys Glu
465 470 475 480 Glu Arg Glu Arg Ile Ser Gln Leu Arg Asn Pro Thr Val
Val Arg Thr 485 490 495 Gln Asn Glu Leu Arg Lys Val Val Asn Asn Leu
Ile Gly Leu Tyr Gly 500 505 510 Lys Pro Asp Arg Ile Arg Ile Glu Val
Gly Arg Asp Val Gly Lys Ser 515 520 525 Lys Arg Glu Arg Glu Glu Ile
Gln Ser Gly Ile Arg Arg Asn Glu Lys 530 535 540 Gln Arg Lys Lys Ala
Thr Glu Asp Leu Ile Lys Asn Gly Ile Ala Asn 545 550 555 560 Pro Ser
Arg Asp Asp Val Glu Lys Trp Ile Leu Trp Lys Glu Gly Gln 565 570 575
Glu Arg Cys Pro Tyr Thr Gly Asp Gln Ile Gly Phe Asn Ala Leu Phe 580
585 590 Arg Glu Gly Arg Tyr Glu Val Glu His Ile Trp Pro Arg Ser Arg
Ser 595 600 605 Phe Asp Asn Ser Pro Arg Asn Lys Thr Leu Cys Arg Lys
Asp Val Asn 610 615 620 Ile Glu Lys Gly Asn Arg Met Pro Phe Glu Ala
Phe Gly His Asp Glu 625 630 635 640 Asp Arg Trp Ser Ala Ile Gln Ile
Arg Leu Gln Gly Met Val Ser Ala 645 650 655 Lys Gly Gly Thr Gly Met
Ser Pro Gly Lys Val Lys Arg Phe Leu Ala 660 665 670 Lys Thr Met Pro
Glu Asp Phe Ala Ala Arg Gln Leu Asn Asp Thr Arg 675 680 685 Tyr Ala
Ala Lys Gln Ile Leu Ala Gln Leu Lys Arg Leu Trp Pro Asp 690 695 700
Met Gly Pro Glu Ala Pro Val Lys Val Glu Ala Val Thr Gly Gln Val 705
710 715 720 Thr Ala Gln Leu Arg Lys Leu Trp Thr Leu Asn Asn Ile Leu
Ala Asp 725 730 735 Asp Gly Glu Lys Thr Arg Ala Asp His Arg His His
Ala Ile Asp Ala 740 745 750 Leu Thr Val Ala Cys Thr His Pro Gly Met
Thr Asn Lys Leu Ser Arg 755 760 765 Tyr Trp Gln Leu Arg Asp Asp Pro
Arg Ala Glu Lys Pro Ala Leu Thr 770 775 780 Pro Pro Trp Asp Thr Ile
Arg Ala Asp Ala Glu Lys Ala Val Ser Glu 785 790 795 800 Ile Val Val
Ser His Arg Val Arg Lys Lys Val Ser Gly Pro Leu His 805 810 815 Lys
Glu Thr Thr Tyr Gly Asp Thr Gly Thr Asp Ile Lys Thr Lys Ser 820 825
830 Gly Thr Tyr Arg Gln Phe Val Thr Arg Lys Lys Ile Glu Ser Leu Ser
835 840 845 Lys Gly Glu Leu Asp Glu Ile Arg Asp Pro Arg Ile Lys Glu
Ile Val 850 855 860 Ala Ala His Val Ala Gly Arg Gly Gly Asp Pro Lys
Lys Ala Phe Pro 865 870 875 880 Pro Tyr Pro Cys Val Ser Pro Gly Gly
Pro Glu Ile Arg Lys Val Arg 885 890 895 Leu Thr Ser Lys Gln Gln Leu
Asn Leu Met Ala Gln Thr Gly Asn Gly 900 905 910 Tyr Ala Asp Leu Gly
Ser Asn His His Ile Ala Ile Tyr Arg Leu Pro 915 920 925 Asp Gly Lys
Ala Asp Phe Glu Ile Val Ser Leu Phe Asp Ala Ser Arg 930 935 940 Arg
Leu Ala Gln Arg Asn Pro Ile Val Gln Arg Thr Arg Ala Asp Gly 945 950
955 960 Ala Ser Phe Val Met Ser Leu Ala Ala Gly Glu Ala Ile Met Ile
Pro 965 970 975 Glu Gly Ser Lys Lys Gly Ile Trp Ile Val Gln Gly Val
Trp Ala Ser 980 985 990 Gly Gln Val Val Leu Glu Arg Asp Thr Asp Ala
Asp His Ser Thr Thr 995 1000 1005 Thr Arg Pro Met Pro Asn Pro Ile
Leu Lys Asp Asp Ala Lys Lys 1010 1015 1020 Val Ser Ile Asp Pro Ile
Gly Arg Val Arg Pro Ser Asn Asp 1025 1030 1035 <210> SEQ ID
NO 66 <211> LENGTH: 1082 <212> TYPE: PRT <213>
ORGANISM: Neisseria cinerea <400> SEQUENCE: 66 Met Ala Ala
Phe Lys Pro Asn Pro Met Asn Tyr Ile Leu Gly Leu Asp 1 5 10 15 Ile
Gly Ile Ala Ser Val Gly Trp Ala Ile Val Glu Ile Asp Glu Glu 20 25
30 Glu Asn Pro Ile Arg Leu Ile Asp Leu Gly Val Arg Val Phe Glu Arg
35 40 45 Ala Glu Val Pro Lys Thr Gly Asp Ser Leu Ala Ala Ala Arg
Arg Leu 50 55 60 Ala Arg Ser Val Arg Arg Leu Thr Arg Arg Arg Ala
His Arg Leu Leu 65 70 75 80 Arg Ala Arg Arg Leu Leu Lys Arg Glu Gly
Val Leu Gln Ala Ala Asp 85 90 95 Phe Asp Glu Asn Gly Leu Ile Lys
Ser Leu Pro Asn Thr Pro Trp Gln 100 105 110 Leu Arg Ala Ala Ala Leu
Asp Arg Lys Leu Thr Pro Leu Glu Trp Ser 115 120 125 Ala Val Leu Leu
His Leu Ile Lys His Arg Gly Tyr Leu Ser Gln Arg 130 135 140 Lys Asn
Glu Gly Glu Thr Ala Asp Lys Glu Leu Gly Ala Leu Leu Lys 145 150 155
160 Gly Val Ala Asp Asn Thr His Ala Leu Gln Thr Gly Asp Phe Arg Thr
165 170 175 Pro Ala Glu Leu Ala Leu Asn Lys Phe Glu Lys Glu Ser Gly
His Ile 180 185 190 Arg Asn Gln Arg Gly Asp Tyr Ser His Thr Phe Asn
Arg Lys Asp Leu 195 200 205 Gln Ala Glu Leu Asn Leu Leu Phe Glu Lys
Gln Lys Glu Phe Gly Asn 210 215 220 Pro His Val Ser Asp Gly Leu Lys
Glu Gly Ile Glu Thr Leu Leu Met 225 230 235 240 Thr Gln Arg Pro Ala
Leu Ser Gly Asp Ala Val Gln Lys Met Leu Gly 245 250 255 His Cys Thr
Phe Glu Pro Thr Glu Pro Lys Ala Ala Lys Asn Thr Tyr 260 265 270 Thr
Ala Glu Arg Phe Val Trp Leu Thr Lys Leu Asn Asn Leu Arg Ile 275 280
285 Leu Glu Gln Gly Ser Glu Arg Pro Leu Thr Asp Thr Glu Arg Ala Thr
290 295 300 Leu Met Asp Glu Pro Tyr Arg Lys Ser Lys Leu Thr Tyr Ala
Gln Ala 305 310 315 320 Arg Lys Leu Leu Asp Leu Asp Asp Thr Ala Phe
Phe Lys Gly Leu Arg 325 330 335 Tyr Gly Lys Asp Asn Ala Glu Ala Ser
Thr Leu Met Glu Met Lys Ala 340 345 350 Tyr His Ala Ile Ser Arg Ala
Leu Glu Lys Glu Gly Leu Lys Asp Lys 355 360 365 Lys Ser Pro Leu Asn
Leu Ser Pro Glu Leu Gln Asp Glu Ile Gly Thr 370 375 380 Ala Phe Ser
Leu Phe Lys Thr Asp Glu Asp Ile Thr Gly Arg Leu Lys 385 390 395 400
Asp Arg Val Gln Pro Glu Ile Leu Glu Ala Leu Leu Lys His Ile Ser 405
410 415 Phe Asp Lys Phe Val Gln Ile Ser Leu Lys Ala Leu Arg Arg Ile
Val 420 425 430 Pro Leu Met Glu Gln Gly Asn Arg Tyr Asp Glu Ala Cys
Thr Glu Ile 435 440 445 Tyr Gly Asp His Tyr Gly Lys Lys Asn Thr Glu
Glu Lys Ile Tyr Leu 450 455 460 Pro Pro Ile Pro Ala Asp Glu Ile Arg
Asn Pro Val Val Leu Arg Ala 465 470 475 480 Leu Ser Gln Ala Arg Lys
Val Ile Asn Gly Val Val Arg Arg Tyr Gly 485 490 495 Ser Pro Ala Arg
Ile His Ile Glu Thr Ala Arg Glu Val Gly Lys Ser 500 505 510 Phe Lys
Asp Arg Lys Glu Ile Glu Lys Arg Gln Glu Glu Asn Arg Lys 515 520 525
Asp Arg Glu Lys Ser Ala Ala Lys Phe Arg Glu Tyr Phe Pro Asn Phe 530
535 540 Val Gly Glu Pro Lys Ser Lys Asp Ile Leu Lys Leu Arg Leu Tyr
Glu 545 550 555 560 Gln Gln His Gly Lys Cys Leu Tyr Ser Gly Lys Glu
Ile Asn Leu Gly 565 570 575 Arg Leu Asn Glu Lys Gly Tyr Val Glu Ile
Asp His Ala Leu Pro Phe 580 585 590 Ser Arg Thr Trp Asp Asp Ser Phe
Asn Asn Lys Val Leu Ala Leu Gly 595 600 605 Ser Glu Asn Gln Asn Lys
Gly Asn Gln Thr Pro Tyr Glu Tyr Phe Asn 610 615 620 Gly Lys Asp Asn
Ser Arg Glu Trp Gln Glu Phe Lys Ala Arg Val Glu 625 630 635 640 Thr
Ser Arg Phe Pro Arg Ser Lys Lys Gln Arg Ile Leu Leu Gln Lys 645 650
655 Phe Asp Glu Asp Gly Phe Lys Glu Arg Asn Leu Asn Asp Thr Arg Tyr
660 665 670 Ile Asn Arg Phe Leu Cys Gln Phe Val Ala Asp His Met Leu
Leu Thr 675 680 685 Gly Lys Gly Lys Arg Arg Val Phe Ala Ser Asn Gly
Gln Ile Thr Asn 690 695 700 Leu Leu Arg Gly Phe Trp Gly Leu Arg Lys
Val Arg Ala Glu Asn Asp 705 710 715 720 Arg His His Ala Leu Asp Ala
Val Val Val Ala Cys Ser Thr Ile Ala 725 730 735 Met Gln Gln Lys Ile
Thr Arg Phe Val Arg Tyr Lys Glu Met Asn Ala 740 745 750 Phe Asp Gly
Lys Thr Ile Asp Lys Glu Thr Gly Glu Val Leu His Gln 755 760 765 Lys
Ala His Phe Pro Gln Pro Trp Glu Phe Phe Ala Gln Glu Val Met 770 775
780 Ile Arg Val Phe Gly Lys Pro Asp Gly Lys Pro Glu Phe Glu Glu Ala
785 790 795 800 Asp Thr Pro Glu Lys Leu Arg Thr Leu Leu Ala Glu Lys
Leu Ser Ser 805 810 815 Arg Pro Glu Ala Val His Lys Tyr Val Thr Pro
Leu Phe Ile Ser Arg 820 825 830 Ala Pro Asn Arg Lys Met Ser Gly Gln
Gly His Met Glu Thr Val Lys 835 840 845 Ser Ala Lys Arg Leu Asp Glu
Gly Ile Ser Val Leu Arg Val Pro Leu 850 855 860 Thr Gln Leu Lys Leu
Lys Asp Leu Glu Lys Met Val Asn Arg Glu Arg 865 870 875 880 Glu Pro
Lys Leu Tyr Glu Ala Leu Lys Ala Arg Leu Glu Ala His Lys 885 890 895
Asp Asp Pro Ala Lys Ala Phe Ala Glu Pro Phe Tyr Lys Tyr Asp Lys 900
905 910 Ala Gly Asn Arg Thr Gln Gln Val Lys Ala Val Arg Val Glu Gln
Val 915 920 925 Gln Lys Thr Gly Val Trp Val His Asn His Asn Gly Ile
Ala Asp Asn 930 935 940 Ala Thr Ile Val Arg Val Asp Val Phe Glu Lys
Gly Gly Lys Tyr Tyr 945 950 955 960 Leu Val Pro Ile Tyr Ser Trp Gln
Val Ala Lys Gly Ile Leu Pro Asp 965 970 975 Arg Ala Val Val Gln Gly
Lys Asp Glu Glu Asp Trp Thr Val Met Asp 980 985 990 Asp Ser Phe Glu
Phe Lys Phe Val Leu Tyr Ala Asn Asp Leu Ile Lys 995 1000 1005 Leu
Thr Ala Lys Lys Asn Glu Phe Leu Gly Tyr Phe Val Ser Leu 1010 1015
1020 Asn Arg Ala Thr Gly Ala Ile Asp Ile Arg Thr His Asp Thr Asp
1025 1030 1035 Ser Thr Lys Gly Lys Asn Gly Ile Phe Gln Ser Val Gly
Val Lys 1040 1045 1050 Thr Ala Leu Ser Phe Gln Lys Tyr Gln Ile Asp
Glu Leu Gly Lys 1055 1060 1065 Glu Ile Arg Pro Cys Arg Leu Lys Lys
Arg Pro Pro Val Arg 1070 1075 1080 <210> SEQ ID NO 67
<211> LENGTH: 1003 <212> TYPE: PRT <213>
ORGANISM: Campylobacter lari <400> SEQUENCE: 67 Met Arg Ile
Leu Gly Phe Asp Ile Gly Ile Asn Ser Ile Gly Trp Ala 1 5 10 15 Phe
Val Glu Asn Asp Glu Leu Lys Asp Cys Gly Val Arg Ile Phe Thr 20 25
30 Lys Ala Glu Asn Pro Lys Asn Lys Glu Ser Leu Ala Leu Pro Arg Arg
35 40 45 Asn Ala Arg Ser Ser Arg Arg Arg Leu Lys Arg Arg Lys Ala
Arg Leu 50 55 60 Ile Ala Ile Lys Arg Ile Leu Ala Lys Glu Leu Lys
Leu Asn Tyr Lys 65 70 75 80 Asp Tyr Val Ala Ala Asp Gly Glu Leu Pro
Lys Ala Tyr Glu Gly Ser 85 90 95 Leu Ala Ser Val Tyr Glu Leu Arg
Tyr Lys Ala Leu Thr Gln Asn Leu 100 105 110 Glu Thr Lys Asp Leu Ala
Arg Val Ile Leu His Ile Ala Lys His Arg 115 120 125 Gly Tyr Met Asn
Lys Asn Glu Lys Lys Ser Asn Asp Ala Lys Lys Gly 130 135 140 Lys Ile
Leu Ser Ala Leu Lys Asn Asn Ala Leu Lys Leu Glu Asn Tyr 145 150 155
160 Gln Ser Val Gly Glu Tyr Phe Tyr Lys Glu Phe Phe Gln Lys Tyr Lys
165 170 175 Lys Asn Thr Lys Asn Phe Ile Lys Ile Arg Asn Thr Lys Asp
Asn Tyr 180 185 190 Asn Asn Cys Val Leu Ser Ser Asp Leu Glu Lys Glu
Leu Lys Leu Ile 195 200 205 Leu Glu Lys Gln Lys Glu Phe Gly Tyr Asn
Tyr Ser Glu Asp Phe Ile 210 215 220 Asn Glu Ile Leu Lys Val Ala Phe
Phe Gln Arg Pro Leu Lys Asp Phe 225 230 235 240 Ser His Leu Val Gly
Ala Cys Thr Phe Phe Glu Glu Glu Lys Arg Ala 245 250 255 Cys Lys Asn
Ser Tyr Ser Ala Trp Glu Phe Val Ala Leu Thr Lys Ile 260 265 270 Ile
Asn Glu Ile Lys Ser Leu Glu Lys Ile Ser Gly Glu Ile Val Pro 275 280
285 Thr Gln Thr Ile Asn Glu Val Leu Asn Leu Ile Leu Asp Lys Gly Ser
290 295 300 Ile Thr Tyr Lys Lys Phe Arg Ser Cys Ile Asn Leu His Glu
Ser Ile 305 310 315 320 Ser Phe Lys Ser Leu Lys Tyr Asp Lys Glu Asn
Ala Glu Asn Ala Lys 325 330 335 Leu Ile Asp Phe Arg Lys Leu Val Glu
Phe Lys Lys Ala Leu Gly Val 340 345 350 His Ser Leu Ser Arg Gln Glu
Leu Asp Gln Ile Ser Thr His Ile Thr 355 360 365 Leu Ile Lys Asp Asn
Val Lys Leu Lys Thr Val Leu Glu Lys Tyr Asn 370 375 380 Leu Ser Asn
Glu Gln Ile Asn Asn Leu Leu Glu Ile Glu Phe Asn Asp 385 390 395 400
Tyr Ile Asn Leu Ser Phe Lys Ala Leu Gly Met Ile Leu Pro Leu Met 405
410 415 Arg Glu Gly Lys Arg Tyr Asp Glu Ala Cys Glu Ile Ala Asn Leu
Lys 420 425 430 Pro Lys Thr Val Asp Glu Lys Lys Asp Phe Leu Pro Ala
Phe Cys Asp 435 440 445 Ser Ile Phe Ala His Glu Leu Ser Asn Pro Val
Val Asn Arg Ala Ile 450 455 460 Ser Glu Tyr Arg Lys Val Leu Asn Ala
Leu Leu Lys Lys Tyr Gly Lys 465 470 475 480 Val His Lys Ile His Leu
Glu Leu Ala Arg Asp Val Gly Leu Ser Lys 485 490 495 Lys Ala Arg Glu
Lys Ile Glu Lys Glu Gln Lys Glu Asn Gln Ala Val 500 505 510 Asn Ala
Trp Ala Leu Lys Glu Cys Glu Asn Ile Gly Leu Lys Ala Ser 515 520 525
Ala Lys Asn Ile Leu Lys Leu Lys Leu Trp Lys Glu Gln Lys Glu Ile 530
535 540 Cys Ile Tyr Ser Gly Asn Lys Ile Ser Ile Glu His Leu Lys Asp
Glu 545 550 555 560 Lys Ala Leu Glu Val Asp His Ile Tyr Pro Tyr Ser
Arg Ser Phe Asp 565 570 575 Asp Ser Phe Ile Asn Lys Val Leu Val Phe
Thr Lys Glu Asn Gln Glu 580 585 590 Lys Leu Asn Lys Thr Pro Phe Glu
Ala Phe Gly Lys Asn Ile Glu Lys 595 600 605 Trp Ser Lys Ile Gln Thr
Leu Ala Gln Asn Leu Pro Tyr Lys Lys Lys 610 615 620 Asn Lys Ile Leu
Asp Glu Asn Phe Lys Asp Lys Gln Gln Glu Asp Phe 625 630 635 640 Ile
Ser Arg Asn Leu Asn Asp Thr Arg Tyr Ile Ala Thr Leu Ile Ala 645 650
655 Lys Tyr Thr Lys Glu Tyr Leu Asn Phe Leu Leu Leu Ser Glu Asn Glu
660 665 670 Asn Ala Asn Leu Lys Ser Gly Glu Lys Gly Ser Lys Ile His
Val Gln 675 680 685 Thr Ile Ser Gly Met Leu Thr Ser Val Leu Arg His
Thr Trp Gly Phe 690 695 700 Asp Lys Lys Asp Arg Asn Asn His Leu His
His Ala Leu Asp Ala Ile 705 710 715 720 Ile Val Ala Tyr Ser Thr Asn
Ser Ile Ile Lys Ala Phe Ser Asp Phe 725 730 735 Arg Lys Asn Gln Glu
Leu Leu Lys Ala Arg Phe Tyr Ala Lys Glu Leu 740 745 750 Thr Ser Asp
Asn Tyr Lys His Gln Val Lys Phe Phe Glu Pro Phe Lys 755 760 765 Ser
Phe Arg Glu Lys Ile Leu Ser Lys Ile Asp Glu Ile Phe Val Ser 770 775
780 Lys Pro Pro Arg Lys Arg Ala Arg Arg Ala Leu His Lys Asp Thr Phe
785 790 795 800 His Ser Glu Asn Lys Ile Ile Asp Lys Cys Ser Tyr Asn
Ser Lys Glu 805 810 815 Gly Leu Gln Ile Ala Leu Ser Cys Gly Arg Val
Arg Lys Ile Gly Thr 820 825 830 Lys Tyr Val Glu Asn Asp Thr Ile Val
Arg Val Asp Ile Phe Lys Lys 835 840 845 Gln Asn Lys Phe Tyr Ala Ile
Pro Ile Tyr Ala Met Asp Phe Ala Leu 850 855 860 Gly Ile Leu Pro Asn
Lys Ile Val Ile Thr Gly Lys Asp Lys Asn Asn 865 870 875 880 Asn Pro
Lys Gln Trp Gln Thr Ile Asp Glu Ser Tyr Glu Phe Cys Phe 885 890 895
Ser Leu Tyr Lys Asn Asp Leu Ile Leu Leu Gln Lys Lys Asn Met Gln 900
905 910 Glu Pro Glu Phe Ala Tyr Tyr Asn Asp Phe Ser Ile Ser Thr Ser
Ser 915 920 925 Ile Cys Val Glu Lys His Asp Asn Lys Phe Glu Asn Leu
Thr Ser Asn 930 935 940 Gln Lys Leu Leu Phe Ser Asn Ala Lys Glu Gly
Ser Val Lys Val Glu 945 950 955 960 Ser Leu Gly Ile Gln Asn Leu Lys
Val Phe Glu Lys Tyr Ile Ile Thr 965 970 975 Pro Leu Gly Asp Lys Ile
Lys Ala Asp Phe Gln Pro Arg Glu Asn Ile 980 985 990 Ser Leu Lys Thr
Ser Lys Lys Tyr Gly Leu Arg 995 1000
1 SEQUENCE LISTING <160> NUMBER OF SEQ ID NOS: 67 <210>
SEQ ID NO 1 <211> LENGTH: 12 <212> TYPE: PRT
<213> ORGANISM: Artificial <220> FEATURE: <223>
OTHER INFORMATION: fragment of hAID <400> SEQUENCE: 1 Gln Phe
Lys Asn Val Arg Trp Ala Lys Gly Arg Arg 1 5 10 <210> SEQ ID
NO 2 <211> LENGTH: 9 <212> TYPE: PRT <213>
ORGANISM: Artificial <220> FEATURE: <223> OTHER
INFORMATION: fragment of hAID solubility variant (hAIDv)
<400> SEQUENCE: 2 Asn Phe Asn Asn Gly Ile Gly Arg His 1 5
<210> SEQ ID NO 3 <211> LENGTH: 9 <212> TYPE: PRT
<213> ORGANISM: Artificial <220> FEATURE: <223>
OTHER INFORMATION: fragment of hAPOBEC3A <400> SEQUENCE: 3
Asn Phe Asn Asn Gly Ile Gly Arg His 1 5 <210> SEQ ID NO 4
<211> LENGTH: 12 <212> TYPE: PRT <213> ORGANISM:
Artificial <220> FEATURE: <223> OTHER INFORMATION:
fragment of hAPOBEC3C <400> SEQUENCE: 4 Gln Phe Lys Asn Leu
Trp Glu Ala Asn Asp Arg Asn 1 5 10 <210> SEQ ID NO 5
<211> LENGTH: 12 <212> TYPE: PRT <213> ORGANISM:
Artificial <220> FEATURE: <223> OTHER INFORMATION:
fragment of hAPOBEC3F - catalytic domain <400> SEQUENCE: 5
His Phe Lys Asn Leu Arg Lys Ala Tyr Gly Arg Asn 1 5 10 <210>
SEQ ID NO 6 <211> LENGTH: 12 <212> TYPE: PRT
<213> ORGANISM: Artificial <220> FEATURE: <223>
OTHER INFORMATION: fragment of hAPOBEC3G - catalytic domain
<400> SEQUENCE: 6 Asn Phe Asn Asn Glu Pro Trp Val Arg Gly Arg
His 1 5 10 <210> SEQ ID NO 7 <211> LENGTH: 12
<212> TYPE: PRT <213> ORGANISM: Artificial <220>
FEATURE: <223> OTHER INFORMATION: fragment of mAPOBEC3 -
catalytic domain <400> SEQUENCE: 7 His Phe Lys Asn Leu Gly
Tyr Ala Lys Gly Arg Lys 1 5 10 <210> SEQ ID NO 8 <211>
LENGTH: 15 <212> TYPE: PRT <213> ORGANISM: Artificial
<220> FEATURE: <223> OTHER INFORMATION: fragment of
hAPOBEC3H <400> SEQUENCE: 8 Gln Phe Asn Asn Lys Arg Arg Leu
Arg Arg Pro Tyr Tyr Pro Arg 1 5 10 15 <210> SEQ ID NO 9
<211> LENGTH: 9 <212> TYPE: PRT <213> ORGANISM:
Artificial <220> FEATURE: <223> OTHER INFORMATION:
fragment of rAPOBEC1 <400> SEQUENCE: 9 Phe Phe Asp Pro Arg
Glu Leu Arg Lys 1 5 <210> SEQ ID NO 10 <211> LENGTH: 17
<212> TYPE: PRT <213> ORGANISM: Artificial <220>
FEATURE: <223> OTHER INFORMATION: fragment of hAID
<400> SEQUENCE: 10 Phe Thr Ala Arg Leu Tyr Phe Cys Glu Asp
Arg Lys Ala Glu Pro Glu 1 5 10 15 Gly <210> SEQ ID NO 11
<211> LENGTH: 17 <212> TYPE: PRT <213> ORGANISM:
Artificial <220> FEATURE: <223> OTHER INFORMATION:
fragment of hAID solubility variant (hAIDv) <400> SEQUENCE:
11 Phe Thr Ala Arg Leu Tyr Phe Cys Glu Asp Arg Lys Ala Glu Pro Glu
1 5 10 15 Gly <210> SEQ ID NO 12 <211> LENGTH: 15
<212> TYPE: PRT <213> ORGANISM: Artificial <220>
FEATURE: <223> OTHER INFORMATION: fragment of hAPOBEC3A
<400> SEQUENCE: 12 Phe Ala Ala Arg Ile Tyr Asp Tyr Asp Pro
Leu Tyr Lys Glu Ala 1 5 10 15 <210> SEQ ID NO 13 <211>
LENGTH: 16 <212> TYPE: PRT <213> ORGANISM: Artificial
<220> FEATURE: <223> OTHER INFORMATION: fragment of
hAPOBEC3C <400> SEQUENCE: 13 Phe Thr Ala Arg Leu Tyr Tyr Phe
Gln Tyr Pro Cys Tyr Gln Glu Gly 1 5 10 15 <210> SEQ ID NO 14
<211> LENGTH: 16 <212> TYPE: PRT <213> ORGANISM:
Artificial <220> FEATURE: <223> OTHER INFORMATION:
fragment of hAPOBEC3F - catalytic domain <400> SEQUENCE: 14
Phe Thr Ala Arg Leu Tyr Tyr Phe Trp Asp Thr Asp Tyr Gln Glu Gly 1 5
10 15 <210> SEQ ID NO 15 <211> LENGTH: 15 <212>
TYPE: PRT <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: fragment of hAPOBEC3G - catalytic
domain <400> SEQUENCE: 15 Phe Thr Ala Arg Ile Tyr Asp Asp Gln
Gly Arg Cys Gln Glu Gly 1 5 10 15 <210> SEQ ID NO 16
<211> LENGTH: 16 <212> TYPE: PRT <213> ORGANISM:
Artificial <220> FEATURE: <223> OTHER INFORMATION:
fragment of mAPOBEC3 - catalytic domain <400> SEQUENCE: 16
Phe Ser Ser Arg Leu Tyr Asn Val Gln Asp Pro Glu Thr Gln Gln Asn 1 5
10 15 <210> SEQ ID NO 17 <211> LENGTH: 16 <212>
TYPE: PRT <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: fragment of hAPOBEC3H <400>
SEQUENCE: 17 Phe Ala Ser Arg Leu Tyr Tyr His Trp Cys Lys Pro Gln
Gln Lys Gly 1 5 10 15 <210> SEQ ID NO 18 <211> LENGTH:
16 <212> TYPE: PRT <213> ORGANISM: Artificial
<220> FEATURE: <223> OTHER INFORMATION: fragment of
rAPOBEC1 <400> SEQUENCE: 18 Tyr Ile Ala Arg Leu Tyr His His
Ala Asp Pro Arg Asn Arg Gln Gly 1 5 10 15 <210> SEQ ID NO 19
<211> LENGTH: 21 <212> TYPE: DNA <213> ORGANISM:
Artificial
<220> FEATURE: <223> OTHER INFORMATION: EGFP gRNA
target sequence <400> SEQUENCE: 19 tcagctcgat gcggttcacc a 21
<210> SEQ ID NO 20 <211> LENGTH: 21 <212> TYPE:
DNA <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: EGFP gRNA target sequence
<400> SEQUENCE: 20 gcagaacacc cccatcggcg a 21 <210> SEQ
ID NO 21 <400> SEQUENCE: 21 000 <210> SEQ ID NO 22
<211> LENGTH: 24 <212> TYPE: DNA <213> ORGANISM:
Artificial <220> FEATURE: <223> OTHER INFORMATION: EGFP
gRNA target sequence <400> SEQUENCE: 22 tcagctcgat gcggttcacc
aggg 24 <210> SEQ ID NO 23 <211> LENGTH: 20 <212>
TYPE: DNA <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: AAVS1 target site 1 reference
sequence <400> SEQUENCE: 23 gactcaccca ggagtgcgtt 20
<210> SEQ ID NO 24 <211> LENGTH: 20 <212> TYPE:
DNA <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: AAVS1 target site 2 reference
sequence <400> SEQUENCE: 24 gtccgactcg gccaggtcca 20
<210> SEQ ID NO 25 <211> LENGTH: 20 <212> TYPE:
DNA <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: AAVS1 target site 3 reference
sequence <400> SEQUENCE: 25 gaccctcagc cgtgctgctc 20
<210> SEQ ID NO 26 <211> LENGTH: 20 <212> TYPE:
DNA <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: AAVS1 target site 4 reference
sequence <400> SEQUENCE: 26 gctctcagcc tggagaccac 20
<210> SEQ ID NO 27 <211> LENGTH: 20 <212> TYPE:
DNA <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: AAVS1 target site 5 reference
sequence <400> SEQUENCE: 27 gctgactcag agaccctgag 20
<210> SEQ ID NO 28 <211> LENGTH: 20 <212> TYPE:
DNA <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: AAVS1 target site 6 reference
sequence <400> SEQUENCE: 28 ggggctcaac atcggaagag 20
<210> SEQ ID NO 29 <211> LENGTH: 20 <212> TYPE:
DNA <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: AAVS1 target site 7 reference
sequence <400> SEQUENCE: 29 ggcactcggg ggcgagagga 20
<210> SEQ ID NO 30 <211> LENGTH: 20 <212> TYPE:
DNA <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: EMX1 target site 2 reference
sequence <400> SEQUENCE: 30 gtattcacct gaaagtgtgc 20
<210> SEQ ID NO 31 <211> LENGTH: 20 <212> TYPE:
DNA <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: AAVS1 target site 8 reference
sequence <400> SEQUENCE: 31 gagctcactg aacgctggca 20
<210> SEQ ID NO 32 <211> LENGTH: 20 <212> TYPE:
DNA <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: AAVS1 target site 9 reference
sequence <400> SEQUENCE: 32 gctggctcag gttcaggaga 20
<210> SEQ ID NO 33 <211> LENGTH: 20 <212> TYPE:
DNA <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: FANCF target site 1 reference
sequence <400> SEQUENCE: 33 ggaatccctt ctgcagcacc 20
<210> SEQ ID NO 34 <211> LENGTH: 20 <212> TYPE:
DNA <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: EMX1 target site 1 reference
sequence <400> SEQUENCE: 34 gagtccgagc agaagaagaa 20
<210> SEQ ID NO 35 <211> LENGTH: 20 <212> TYPE:
DNA <213> ORGANISM: Artificial Sequence <220> FEATURE:
<223> OTHER INFORMATION: AAVS1 target site 6 reference
sequence <400> SEQUENCE: 35 ggggctcaac atcggaagag 20
<210> SEQ ID NO 36 <211> LENGTH: 20 <212> TYPE:
DNA <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: AAVS1 target site 3 reference
sequence <400> SEQUENCE: 36 gaccctcagc cgtgctgctc 20
<210> SEQ ID NO 37 <211> LENGTH: 20 <212> TYPE:
DNA <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: AAVS1 target site 7 reference
sequence <400> SEQUENCE: 37 ggcactcggg ggcgagagga 20
<210> SEQ ID NO 38 <211> LENGTH: 20 <212> TYPE:
DNA <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: potential HBB allele products
<400> SEQUENCE: 38 ctgaatttta tgcccagccc 20 <210> SEQ
ID NO 39 <211> LENGTH: 20 <212> TYPE: DNA <213>
ORGANISM: Artificial <220> FEATURE: <223> OTHER
INFORMATION: potential HBB allele products <400> SEQUENCE: 39
ctgagtttta tgcccagccc 20 <210> SEQ ID NO 40 <211>
LENGTH: 20 <212> TYPE: DNA <213> ORGANISM: Artificial
<220> FEATURE: <223> OTHER INFORMATION: potential HBB
allele products <400> SEQUENCE: 40
ctgattttta tgcccagccc 20 <210> SEQ ID NO 41 <211>
LENGTH: 20 <212> TYPE: DNA <213> ORGANISM: Artificial
<220> FEATURE: <223> OTHER INFORMATION: potential HBB
allele products <400> SEQUENCE: 41 ctgacttgta tgcccagccc 20
<210> SEQ ID NO 42 <211> LENGTH: 20 <212> TYPE:
DNA <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: potential HBB allele products
<400> SEQUENCE: 42 ctgactttta tgcccagccc 20 <210> SEQ
ID NO 43 <211> LENGTH: 20 <212> TYPE: DNA <213>
ORGANISM: Artificial <220> FEATURE: <223> OTHER
INFORMATION: HBB target site reference sequence <400>
SEQUENCE: 43 ctgacttcta tgcccagccc 20 <210> SEQ ID NO 44
<211> LENGTH: 23 <212> TYPE: DNA <213> ORGANISM:
Artificial <220> FEATURE: <223> OTHER INFORMATION: HBB
target site reference sequence <220> FEATURE: <221>
NAME/KEY: misc_feature <222> LOCATION: (21)..(21) <223>
OTHER INFORMATION: n is a, c, g, or t <400> SEQUENCE: 44
ctgacttcta tgcccagccc ngg 23 <210> SEQ ID NO 45 <211>
LENGTH: 83 <212> TYPE: PRT <213> ORGANISM: Artificial
<220> FEATURE: <223> OTHER INFORMATION: Uracil
glycosylase inhibitor (UGI) <400> SEQUENCE: 45 Thr Asn Leu
Ser Asp Ile Ile Glu Lys Glu Thr Gly Lys Gln Leu Val 1 5 10 15 Ile
Gln Glu Ser Ile Leu Met Leu Pro Glu Glu Val Glu Glu Val Ile 20 25
30 Gly Asn Lys Pro Glu Ser Asp Ile Leu Val His Thr Ala Tyr Asp Glu
35 40 45 Ser Thr Asp Glu Asn Val Met Leu Leu Thr Ser Asp Ala Pro
Glu Tyr 50 55 60 Lys Pro Trp Ala Leu Val Ile Gln Asp Ser Asn Gly
Glu Asn Lys Ile 65 70 75 80 Lys Met Leu <210> SEQ ID NO 46
<211> LENGTH: 4 <212> TYPE: PRT <213> ORGANISM:
Artificial <220> FEATURE: <223> OTHER INFORMATION:
linker sequence <400> SEQUENCE: 46 Gly Gly Gly Ser 1
<210> SEQ ID NO 47 <211> LENGTH: 5 <212> TYPE:
PRT <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: linker sequence <400>
SEQUENCE: 47 Gly Gly Gly Gly Ser 1 5 <210> SEQ ID NO 48
<211> LENGTH: 7 <212> TYPE: PRT <213> ORGANISM:
Artificial <220> FEATURE: <223> OTHER INFORMATION: SV40
large T antigen NLS <400> SEQUENCE: 48 Pro Lys Lys Lys Arg
Arg Val 1 5 <210> SEQ ID NO 49 <211> LENGTH: 15
<212> TYPE: PRT <213> ORGANISM: Artificial <220>
FEATURE: <223> OTHER INFORMATION: nucleoplasmin NLS
<400> SEQUENCE: 49 Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly
Gln Ala Lys Lys Lys 1 5 10 15 <210> SEQ ID NO 50 <211>
LENGTH: 1710 <212> TYPE: PRT <213> ORGANISM: Artificial
<220> FEATURE: <223> OTHER INFORMATION: rAPOBEC1-XTEN
L8-nCas9-UGI-SV40 NLS <400> SEQUENCE: 50 Met Ser Ser Glu Thr
Gly Pro Val Ala Val Asp Pro Thr Leu Arg Arg 1 5 10 15 Arg Ile Glu
Pro His Glu Phe Glu Val Phe Phe Asp Pro Arg Glu Leu 20 25 30 Arg
Lys Glu Thr Cys Leu Leu Tyr Glu Ile Asn Trp Gly Gly Arg His 35 40
45 Ser Ile Trp Arg His Thr Ser Gln Asn Thr Asn Lys His Val Glu Val
50 55 60 Asn Phe Ile Glu Lys Phe Thr Thr Glu Arg Tyr Phe Cys Pro
Asn Thr 65 70 75 80 Arg Cys Ser Ile Thr Trp Phe Leu Ser Trp Ser Pro
Cys Gly Glu Cys 85 90 95 Ser Arg Ala Ile Thr Glu Phe Leu Ser Arg
Tyr Pro His Val Thr Leu 100 105 110 Phe Ile Tyr Ile Ala Arg Leu Tyr
His His Ala Asp Pro Arg Asn Arg 115 120 125 Gln Gly Leu Arg Asp Leu
Ile Ser Ser Gly Val Thr Ile Gln Ile Met 130 135 140 Thr Glu Gln Glu
Ser Gly Tyr Cys Trp Arg Asn Phe Val Asn Tyr Ser 145 150 155 160 Pro
Ser Asn Glu Ala His Trp Pro Arg Tyr Pro His Leu Trp Val Arg 165 170
175 Leu Tyr Val Leu Glu Leu Tyr Cys Ile Ile Leu Gly Leu Pro Pro Cys
180 185 190 Leu Asn Ile Leu Arg Arg Lys Gln Pro Gln Leu Thr Phe Phe
Thr Ile 195 200 205 Ala Leu Gln Ser Cys His Tyr Gln Arg Leu Pro Pro
His Ile Leu Trp 210 215 220 Ala Thr Gly Leu Lys Ser Gly Ser Glu Thr
Pro Gly Thr Ser Glu Ser 225 230 235 240 Ala Thr Pro Glu Ser Asp Lys
Lys Tyr Ser Ile Gly Leu Ala Ile Gly 245 250 255 Thr Asn Ser Val Gly
Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro 260 265 270 Ser Lys Lys
Phe Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys 275 280 285 Lys
Asn Leu Ile Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu 290 295
300 Ala Thr Arg Leu Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys
305 310 315 320 Asn Arg Ile Cys Tyr Leu Gln Glu Ile Phe Ser Asn Glu
Met Ala Lys 325 330 335 Val Asp Asp Ser Phe Phe His Arg Leu Glu Glu
Ser Phe Leu Val Glu 340 345 350 Glu Asp Lys Lys His Glu Arg His Pro
Ile Phe Gly Asn Ile Val Asp 355 360 365 Glu Val Ala Tyr His Glu Lys
Tyr Pro Thr Ile Tyr His Leu Arg Lys 370 375 380 Lys Leu Val Asp Ser
Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu 385 390 395 400 Ala Leu
Ala His Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly 405 410 415
Asp Leu Asn Pro Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu 420
425 430 Val Gln Thr Tyr Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala
Ser 435 440 445 Gly Val Asp Ala Lys Ala Ile Leu Ser Ala Arg Leu Ser
Lys Ser Arg 450 455 460 Arg Leu Glu Asn Leu Ile Ala Gln Leu Pro Gly
Glu Lys Lys Asn Gly 465 470 475 480 Leu Phe Gly Asn Leu Ile Ala Leu
Ser Leu Gly Leu Thr Pro Asn Phe 485 490 495 Lys Ser Asn Phe Asp Leu
Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys 500 505 510 Asp Thr Tyr Asp
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp 515 520 525 Gln Tyr
Ala Asp Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile 530 535 540
Leu Leu Ser Asp Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro 545
550 555 560 Leu Ser Ala Ser Met Ile Lys Arg Tyr Asp Glu His His Gln
Asp Leu 565 570 575
Thr Leu Leu Lys Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys 580
585 590 Glu Ile Phe Phe Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile
Asp 595 600 605 Gly Gly Ala Ser Gln Glu Glu Phe Tyr Lys Phe Ile Lys
Pro Ile Leu 610 615 620 Glu Lys Met Asp Gly Thr Glu Glu Leu Leu Val
Lys Leu Asn Arg Glu 625 630 635 640 Asp Leu Leu Arg Lys Gln Arg Thr
Phe Asp Asn Gly Ser Ile Pro His 645 650 655 Gln Ile His Leu Gly Glu
Leu His Ala Ile Leu Arg Arg Gln Glu Asp 660 665 670 Phe Tyr Pro Phe
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu 675 680 685 Thr Phe
Arg Ile Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser 690 695 700
Arg Phe Ala Trp Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp 705
710 715 720 Asn Phe Glu Glu Val Val Asp Lys Gly Ala Ser Ala Gln Ser
Phe Ile 725 730 735 Glu Arg Met Thr Asn Phe Asp Lys Asn Leu Pro Asn
Glu Lys Val Leu 740 745 750 Pro Lys His Ser Leu Leu Tyr Glu Tyr Phe
Thr Val Tyr Asn Glu Leu 755 760 765 Thr Lys Val Lys Tyr Val Thr Glu
Gly Met Arg Lys Pro Ala Phe Leu 770 775 780 Ser Gly Glu Gln Lys Lys
Ala Ile Val Asp Leu Leu Phe Lys Thr Asn 785 790 795 800 Arg Lys Val
Thr Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile 805 810 815 Glu
Cys Phe Asp Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn 820 825
830 Ala Ser Leu Gly Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys
835 840 845 Asp Phe Leu Asp Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp
Ile Val 850 855 860 Leu Thr Leu Thr Leu Phe Glu Asp Arg Glu Met Ile
Glu Glu Arg Leu 865 870 875 880 Lys Thr Tyr Ala His Leu Phe Asp Asp
Lys Val Met Lys Gln Leu Lys 885 890 895 Arg Arg Arg Tyr Thr Gly Trp
Gly Arg Leu Ser Arg Lys Leu Ile Asn 900 905 910 Gly Ile Arg Asp Lys
Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys 915 920 925 Ser Asp Gly
Phe Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp 930 935 940 Ser
Leu Thr Phe Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln 945 950
955 960 Gly Asp Ser Leu His Glu His Ile Ala Asn Leu Ala Gly Ser Pro
Ala 965 970 975 Ile Lys Lys Gly Ile Leu Gln Thr Val Lys Val Val Asp
Glu Leu Val 980 985 990 Lys Val Met Gly Arg His Lys Pro Glu Asn Ile
Val Ile Glu Met Ala 995 1000 1005 Arg Glu Asn Gln Thr Thr Gln Lys
Gly Gln Lys Asn Ser Arg Glu 1010 1015 1020 Arg Met Lys Arg Ile Glu
Glu Gly Ile Lys Glu Leu Gly Ser Gln 1025 1030 1035 Ile Leu Lys Glu
His Pro Val Glu Asn Thr Gln Leu Gln Asn Glu 1040 1045 1050 Lys Leu
Tyr Leu Tyr Tyr Leu Gln Asn Gly Arg Asp Met Tyr Val 1055 1060 1065
Asp Gln Glu Leu Asp Ile Asn Arg Leu Ser Asp Tyr Asp Val Asp 1070
1075 1080 His Ile Val Pro Gln Ser Phe Leu Lys Asp Asp Ser Ile Asp
Asn 1085 1090 1095 Lys Val Leu Thr Arg Ser Asp Lys Asn Arg Gly Lys
Ser Asp Asn 1100 1105 1110 Val Pro Ser Glu Glu Val Val Lys Lys Met
Lys Asn Tyr Trp Arg 1115 1120 1125 Gln Leu Leu Asn Ala Lys Leu Ile
Thr Gln Arg Lys Phe Asp Asn 1130 1135 1140 Leu Thr Lys Ala Glu Arg
Gly Gly Leu Ser Glu Leu Asp Lys Ala 1145 1150 1155 Gly Phe Ile Lys
Arg Gln Leu Val Glu Thr Arg Gln Ile Thr Lys 1160 1165 1170 His Val
Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp 1175 1180 1185
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys 1190
1195 1200 Ser Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr
Lys 1205 1210 1215 Val Arg Glu Ile Asn Asn Tyr His His Ala His Asp
Ala Tyr Leu 1220 1225 1230 Asn Ala Val Val Gly Thr Ala Leu Ile Lys
Lys Tyr Pro Lys Leu 1235 1240 1245 Glu Ser Glu Phe Val Tyr Gly Asp
Tyr Lys Val Tyr Asp Val Arg 1250 1255 1260 Lys Met Ile Ala Lys Ser
Glu Gln Glu Ile Gly Lys Ala Thr Ala 1265 1270 1275 Lys Tyr Phe Phe
Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu 1280 1285 1290 Ile Thr
Leu Ala Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu 1295 1300 1305
Thr Asn Gly Glu Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp 1310
1315 1320 Phe Ala Thr Val Arg Lys Val Leu Ser Met Pro Gln Val Asn
Ile 1325 1330 1335 Val Lys Lys Thr Glu Val Gln Thr Gly Gly Phe Ser
Lys Glu Ser 1340 1345 1350 Ile Leu Pro Lys Arg Asn Ser Asp Lys Leu
Ile Ala Arg Lys Lys 1355 1360 1365 Asp Trp Asp Pro Lys Lys Tyr Gly
Gly Phe Asp Ser Pro Thr Val 1370 1375 1380 Ala Tyr Ser Val Leu Val
Val Ala Lys Val Glu Lys Gly Lys Ser 1385 1390 1395 Lys Lys Leu Lys
Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met 1400 1405 1410 Glu Arg
Ser Ser Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala 1415 1420 1425
Lys Gly Tyr Lys Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro 1430
1435 1440 Lys Tyr Ser Leu Phe Glu Leu Glu Asn Gly Arg Lys Arg Met
Leu 1445 1450 1455 Ala Ser Ala Gly Glu Leu Gln Lys Gly Asn Glu Leu
Ala Leu Pro 1460 1465 1470 Ser Lys Tyr Val Asn Phe Leu Tyr Leu Ala
Ser His Tyr Glu Lys 1475 1480 1485 Leu Lys Gly Ser Pro Glu Asp Asn
Glu Gln Lys Gln Leu Phe Val 1490 1495 1500 Glu Gln His Lys His Tyr
Leu Asp Glu Ile Ile Glu Gln Ile Ser 1505 1510 1515 Glu Phe Ser Lys
Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys 1520 1525 1530 Val Leu
Ser Ala Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu 1535 1540 1545
Gln Ala Glu Asn Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly 1550
1555 1560 Ala Pro Ala Ala Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg
Lys 1565 1570 1575 Arg Tyr Thr Ser Thr Lys Glu Val Leu Asp Ala Thr
Leu Ile His 1580 1585 1590 Gln Ser Ile Thr Gly Leu Tyr Glu Thr Arg
Ile Asp Leu Ser Gln 1595 1600 1605 Leu Gly Gly Asp Ser Gly Gly Ser
Thr Asn Leu Ser Asp Ile Ile 1610 1615 1620 Glu Lys Glu Thr Gly Lys
Gln Leu Val Ile Gln Glu Ser Ile Leu 1625 1630 1635 Met Leu Pro Glu
Glu Val Glu Glu Val Ile Gly Asn Lys Pro Glu 1640 1645 1650 Ser Asp
Ile Leu Val His Thr Ala Tyr Asp Glu Ser Thr Asp Glu 1655 1660 1665
Asn Val Met Leu Leu Thr Ser Asp Ala Pro Glu Tyr Lys Pro Trp 1670
1675 1680 Ala Leu Val Ile Gln Asp Ser Asn Gly Glu Asn Lys Ile Lys
Met 1685 1690 1695 Leu Ser Gly Gly Ser Pro Lys Lys Lys Arg Lys Val
1700 1705 1710 <210> SEQ ID NO 51 <211> LENGTH: 198
<212> TYPE: PRT <213> ORGANISM: Homo sapiens
<400> SEQUENCE: 51 Met Asp Ser Leu Leu Met Asn Arg Arg Lys
Phe Leu Tyr Gln Phe Lys 1 5 10 15 Asn Val Arg Trp Ala Lys Gly Arg
Arg Glu Thr Tyr Leu Cys Tyr Val 20 25 30 Val Lys Arg Arg Asp Ser
Ala Thr Ser Phe Ser Leu Asp Phe Gly Tyr 35 40 45 Leu Arg Asn Lys
Asn Gly Cys His Val Glu Leu Leu Phe Leu Arg Tyr 50 55 60 Ile Ser
Asp Trp Asp Leu Asp Pro Gly Arg Cys Tyr Arg Val Thr Trp 65 70 75 80
Phe Thr Ser Trp Ser Pro Cys Tyr Asp Cys Ala Arg His Val Ala Asp 85
90 95 Phe Leu Arg Gly Asn Pro Asn Leu Ser Leu Arg Ile Phe Thr Ala
Arg 100 105 110
Leu Tyr Phe Cys Glu Asp Arg Lys Ala Glu Pro Glu Gly Leu Arg Arg 115
120 125 Leu His Arg Ala Gly Val Gln Ile Ala Ile Met Thr Phe Lys Asp
Tyr 130 135 140 Phe Tyr Cys Trp Asn Thr Phe Val Glu Asn His Glu Arg
Thr Phe Lys 145 150 155 160 Ala Trp Glu Gly Leu His Glu Asn Ser Val
Arg Leu Ser Arg Gln Leu 165 170 175 Arg Arg Ile Leu Leu Pro Leu Tyr
Glu Val Asp Asp Leu Arg Asp Ala 180 185 190 Phe Arg Thr Leu Gly Leu
195 <210> SEQ ID NO 52 <211> LENGTH: 190 <212>
TYPE: PRT <213> ORGANISM: Artificial <220> FEATURE:
<223> OTHER INFORMATION: hAIDv solubility variant lacking
N-terminal RNA-binding region <400> SEQUENCE: 52 Met Asp Pro
His Ile Phe Thr Ser Asn Phe Asn Asn Gly Ile Gly Arg 1 5 10 15 His
Lys Thr Tyr Leu Cys Tyr Glu Val Glu Arg Leu Asp Ser Ala Thr 20 25
30 Ser Phe Ser Leu Asp Phe Gly Tyr Leu Arg Asn Lys Asn Gly Cys His
35 40 45 Val Glu Leu Leu Phe Leu Arg Tyr Ile Ser Asp Trp Asp Leu
Asp Pro 50 55 60 Gly Arg Cys Tyr Arg Val Thr Trp Phe Thr Ser Trp
Ser Pro Cys Tyr 65 70 75 80 Asp Cys Ala Arg His Val Ala Asp Phe Leu
Arg Gly Asn Pro Asn Leu 85 90 95 Ser Leu Arg Ile Phe Thr Ala Arg
Leu Tyr Phe Cys Glu Asp Arg Lys 100 105 110 Ala Glu Pro Glu Gly Leu
Arg Arg Leu His Arg Ala Gly Val Gln Ile 115 120 125 Ala Ile Met Thr
Phe Lys Asp Tyr Phe Tyr Cys Trp Asn Thr Phe Val 130 135 140 Glu Asn
His Glu Arg Thr Phe Lys Ala Trp Glu Gly Leu His Glu Asn 145 150 155
160 Ser Val Arg Leu Ser Arg Gln Leu Arg Arg Ile Leu Leu Pro Leu Tyr
165 170 175 Glu Val Asp Asp Leu Arg Asp Ala Phe Arg Thr Leu Gly Leu
180 185 190 <210> SEQ ID NO 53 <211> LENGTH: 175
<212> TYPE: PRT <213> ORGANISM: Artificial <220>
FEATURE: <223> OTHER INFORMATION: hAIDv solubility variant
lacking N-terminal RNA-binding region and the C-terminal poorly
structured region <400> SEQUENCE: 53 Met Asp Pro His Ile Phe
Thr Ser Asn Phe Asn Asn Gly Ile Gly Arg 1 5 10 15 His Lys Thr Tyr
Leu Cys Tyr Glu Val Glu Arg Leu Asp Ser Ala Thr 20 25 30 Ser Phe
Ser Leu Asp Phe Gly Tyr Leu Arg Asn Lys Asn Gly Cys His 35 40 45
Val Glu Leu Leu Phe Leu Arg Tyr Ile Ser Asp Trp Asp Leu Asp Pro 50
55 60 Gly Arg Cys Tyr Arg Val Thr Trp Phe Thr Ser Trp Ser Pro Cys
Tyr 65 70 75 80 Asp Cys Ala Arg His Val Ala Asp Phe Leu Arg Gly Asn
Pro Asn Leu 85 90 95 Ser Leu Arg Ile Phe Thr Ala Arg Leu Tyr Phe
Cys Glu Asp Arg Lys 100 105 110 Ala Glu Pro Glu Gly Leu Arg Arg Leu
His Arg Ala Gly Val Gln Ile 115 120 125 Ala Ile Met Thr Phe Lys Asp
Tyr Phe Tyr Cys Trp Asn Thr Phe Val 130 135 140 Glu Asn His Glu Arg
Thr Phe Lys Ala Trp Glu Gly Leu His Glu Asn 145 150 155 160 Ser Val
Arg Leu Ser Arg Gln Leu Arg Arg Ile Leu Leu Pro Leu 165 170 175
<210> SEQ ID NO 54 <211> LENGTH: 229 <212> TYPE:
PRT <213> ORGANISM: Rattus norvegicus <400> SEQUENCE:
54 Met Ser Ser Glu Thr Gly Pro Val Ala Val Asp Pro Thr Leu Arg Arg
1 5 10 15 Arg Ile Glu Pro His Glu Phe Glu Val Phe Phe Asp Pro Arg
Glu Leu 20 25 30 Arg Lys Glu Thr Cys Leu Leu Tyr Glu Ile Asn Trp
Gly Gly Arg His 35 40 45 Ser Ile Trp Arg His Thr Ser Gln Asn Thr
Asn Lys His Val Glu Val 50 55 60 Asn Phe Ile Glu Lys Phe Thr Thr
Glu Arg Tyr Phe Cys Pro Asn Thr 65 70 75 80 Arg Cys Ser Ile Thr Trp
Phe Leu Ser Trp Ser Pro Cys Gly Glu Cys 85 90 95 Ser Arg Ala Ile
Thr Glu Phe Leu Ser Arg Tyr Pro His Val Thr Leu 100 105 110 Phe Ile
Tyr Ile Ala Arg Leu Tyr His His Ala Asp Pro Arg Asn Arg 115 120 125
Gln Gly Leu Arg Asp Leu Ile Ser Ser Gly Val Thr Ile Gln Ile Met 130
135 140 Thr Glu Gln Glu Ser Gly Tyr Cys Trp Arg Asn Phe Val Asn Tyr
Ser 145 150 155 160 Pro Ser Asn Glu Ala His Trp Pro Arg Tyr Pro His
Leu Trp Val Arg 165 170 175 Leu Tyr Val Leu Glu Leu Tyr Cys Ile Ile
Leu Gly Leu Pro Pro Cys 180 185 190 Leu Asn Ile Leu Arg Arg Lys Gln
Pro Gln Leu Thr Phe Phe Thr Ile 195 200 205 Ala Leu Gln Ser Cys His
Tyr Gln Arg Leu Pro Pro His Ile Leu Trp 210 215 220 Ala Thr Gly Leu
Lys 225 <210> SEQ ID NO 55 <211> LENGTH: 397
<212> TYPE: PRT <213> ORGANISM: Mus musculus
<400> SEQUENCE: 55 Met Gly Pro Phe Cys Leu Gly Cys Ser His
Arg Lys Cys Tyr Ser Pro 1 5 10 15 Ile Arg Asn Leu Ile Ser Gln Glu
Thr Phe Lys Phe His Phe Lys Asn 20 25 30 Leu Gly Tyr Ala Lys Gly
Arg Lys Asp Thr Phe Leu Cys Tyr Glu Val 35 40 45 Thr Arg Lys Asp
Cys Asp Ser Pro Val Ser Leu His His Gly Val Phe 50 55 60 Lys Asn
Lys Asp Asn Ile His Ala Glu Ile Cys Phe Leu Tyr Trp Phe 65 70 75 80
His Asp Lys Val Leu Lys Val Leu Ser Pro Arg Glu Glu Phe Lys Ile 85
90 95 Thr Trp Tyr Met Ser Trp Ser Pro Cys Phe Glu Cys Ala Glu Gln
Ile 100 105 110 Val Arg Phe Leu Ala Thr His His Asn Leu Ser Leu Asp
Ile Phe Ser 115 120 125 Ser Arg Leu Tyr Asn Val Gln Asp Pro Glu Thr
Gln Gln Asn Leu Cys 130 135 140 Arg Leu Val Gln Glu Gly Ala Gln Val
Ala Ala Met Asp Leu Tyr Glu 145 150 155 160 Phe Lys Lys Cys Trp Lys
Lys Phe Val Asp Asn Gly Gly Arg Arg Phe 165 170 175 Arg Pro Trp Lys
Arg Leu Leu Thr Asn Phe Arg Tyr Gln Asp Ser Lys 180 185 190 Leu Gln
Glu Ile Leu Arg Arg Met Asp Pro Leu Ser Glu Glu Glu Phe 195 200 205
Tyr Ser Gln Phe Tyr Asn Gln Arg Val Lys His Leu Cys Tyr Tyr His 210
215 220 Arg Met Lys Pro Tyr Leu Cys Tyr Gln Leu Glu Gln Phe Asn Gly
Gln 225 230 235 240 Ala Pro Leu Lys Gly Cys Leu Leu Ser Glu Lys Gly
Lys Gln His Ala 245 250 255 Glu Ile Leu Phe Leu Asp Lys Ile Arg Ser
Met Glu Leu Ser Gln Val 260 265 270 Thr Ile Thr Cys Tyr Leu Thr Trp
Ser Pro Cys Pro Asn Cys Ala Trp 275 280 285 Gln Leu Ala Ala Phe Lys
Arg Asp Arg Pro Asp Leu Ile Leu His Ile 290 295 300 Tyr Thr Ser Arg
Leu Tyr Phe His Trp Lys Arg Pro Phe Gln Lys Gly 305 310 315 320 Leu
Cys Ser Leu Trp Gln Ser Gly Ile Leu Val Asp Val Met Asp Leu 325 330
335 Pro Gln Phe Thr Asp Cys Trp Thr Asn Phe Val Asn Pro Lys Arg Pro
340 345 350 Phe Arg Pro Trp Lys Gly Leu Glu Ile Ile Ser Arg Arg Thr
Gln Arg 355 360 365 Arg Leu Arg Arg Ile Lys Glu Ser Trp Gly Leu Gln
Asp Leu Val Asn 370 375 380 Asp Phe Gly Asn Leu Gln Leu Gly Pro Pro
Met Ser Asn 385 390 395 <210> SEQ ID NO 56 <211>
LENGTH: 199
<212> TYPE: PRT <213> ORGANISM: Artificial <220>
FEATURE: <223> OTHER INFORMATION: mAPOBEC3 catalytic domain
<400> SEQUENCE: 56 Met Gly Pro Phe Cys Leu Gly Cys Ser His
Arg Lys Cys Tyr Ser Pro 1 5 10 15 Ile Arg Asn Leu Ile Ser Gln Glu
Thr Phe Lys Phe His Phe Lys Asn 20 25 30 Leu Gly Tyr Ala Lys Gly
Arg Lys Asp Thr Phe Leu Cys Tyr Glu Val 35 40 45 Thr Arg Lys Asp
Cys Asp Ser Pro Val Ser Leu His His Gly Val Phe 50 55 60 Lys Asn
Lys Asp Asn Ile His Ala Glu Ile Cys Phe Leu Tyr Trp Phe 65 70 75 80
His Asp Lys Val Leu Lys Val Leu Ser Pro Arg Glu Glu Phe Lys Ile 85
90 95 Thr Trp Tyr Met Ser Trp Ser Pro Cys Phe Glu Cys Ala Glu Gln
Ile 100 105 110 Val Arg Phe Leu Ala Thr His His Asn Leu Ser Leu Asp
Ile Phe Ser 115 120 125 Ser Arg Leu Tyr Asn Val Gln Asp Pro Glu Thr
Gln Gln Asn Leu Cys 130 135 140 Arg Leu Val Gln Glu Gly Ala Gln Val
Ala Ala Met Asp Leu Tyr Glu 145 150 155 160 Phe Lys Lys Cys Trp Lys
Lys Phe Val Asp Asn Gly Gly Arg Arg Phe 165 170 175 Arg Pro Trp Lys
Arg Leu Leu Thr Asn Phe Arg Tyr Gln Asp Ser Lys 180 185 190 Leu Gln
Glu Ile Leu Arg Arg 195 <210> SEQ ID NO 57 <211>
LENGTH: 199 <212> TYPE: PRT <213> ORGANISM: Homo
sapiens <400> SEQUENCE: 57 Met Glu Ala Ser Pro Ala Ser Gly
Pro Arg His Leu Met Asp Pro His 1 5 10 15 Ile Phe Thr Ser Asn Phe
Asn Asn Gly Ile Gly Arg His Lys Thr Tyr 20 25 30 Leu Cys Tyr Glu
Val Glu Arg Leu Asp Asn Gly Thr Ser Val Lys Met 35 40 45 Asp Gln
His Arg Gly Phe Leu His Asn Gln Ala Lys Asn Leu Leu Cys 50 55 60
Gly Phe Tyr Gly Arg His Ala Glu Leu Arg Phe Leu Asp Leu Val Pro 65
70 75 80 Ser Leu Gln Leu Asp Pro Ala Gln Ile Tyr Arg Val Thr Trp
Phe Ile 85 90 95 Ser Trp Ser Pro Cys Phe Ser Trp Gly Cys Ala Gly
Glu Val Arg Ala 100 105 110 Phe Leu Gln Glu Asn Thr His Val Arg Leu
Arg Ile Phe Ala Ala Arg 115 120 125 Ile Tyr Asp Tyr Asp Pro Leu Tyr
Lys Glu Ala Leu Gln Met Leu Arg 130 135 140 Asp Ala Gly Ala Gln Val
Ser Ile Met Thr Tyr Asp Glu Phe Lys His 145 150 155 160 Cys Trp Asp
Thr Phe Val Asp His Gln Gly Cys Pro Phe Gln Pro Trp 165 170 175 Asp
Gly Leu Asp Glu His Ser Gln Ala Leu Ser Gly Arg Leu Arg Ala 180 185
190 Ile Leu Gln Asn Gln Gly Asn 195 <210> SEQ ID NO 58
<211> LENGTH: 384 <212> TYPE: PRT <213> ORGANISM:
Homo sapiens <400> SEQUENCE: 58 Met Lys Pro His Phe Arg Asn
Thr Val Glu Arg Met Tyr Arg Asp Thr 1 5 10 15 Phe Ser Tyr Asn Phe
Tyr Asn Arg Pro Ile Leu Ser Arg Arg Asn Thr 20 25 30 Val Trp Leu
Cys Tyr Glu Val Lys Thr Lys Gly Pro Ser Arg Pro Pro 35 40 45 Leu
Asp Ala Lys Ile Phe Arg Gly Gln Val Tyr Ser Glu Leu Lys Tyr 50 55
60 His Pro Glu Met Arg Phe Phe His Trp Phe Ser Lys Trp Arg Lys Leu
65 70 75 80 His Arg Asp Gln Glu Tyr Glu Val Thr Trp Tyr Ile Ser Trp
Ser Pro 85 90 95 Cys Thr Lys Cys Thr Arg Asp Met Ala Thr Phe Leu
Ala Glu Asp Pro 100 105 110 Lys Val Thr Leu Thr Ile Phe Val Ala Arg
Leu Tyr Tyr Phe Trp Asp 115 120 125 Pro Asp Tyr Gln Glu Ala Leu Arg
Ser Leu Cys Gln Lys Arg Asp Gly 130 135 140 Pro Arg Ala Thr Met Lys
Ile Met Asn Tyr Asp Glu Phe Gln His Cys 145 150 155 160 Trp Ser Lys
Phe Val Tyr Ser Gln Arg Glu Leu Phe Glu Pro Trp Asn 165 170 175 Asn
Leu Pro Lys Tyr Tyr Ile Leu Leu His Ile Met Leu Gly Glu Ile 180 185
190 Leu Arg His Ser Met Asp Pro Pro Thr Phe Thr Phe Asn Phe Asn Asn
195 200 205 Glu Pro Trp Val Arg Gly Arg His Glu Thr Tyr Leu Cys Tyr
Glu Val 210 215 220 Glu Arg Met His Asn Asp Thr Trp Val Leu Leu Asn
Gln Arg Arg Gly 225 230 235 240 Phe Leu Cys Asn Gln Ala Pro His Lys
His Gly Phe Leu Glu Gly Arg 245 250 255 His Ala Glu Leu Cys Phe Leu
Asp Val Ile Pro Phe Trp Lys Leu Asp 260 265 270 Leu Asp Gln Asp Tyr
Arg Val Thr Cys Phe Thr Ser Trp Ser Pro Cys 275 280 285 Phe Ser Cys
Ala Gln Glu Met Ala Lys Phe Ile Ser Lys Asn Lys His 290 295 300 Val
Ser Leu Cys Ile Phe Thr Ala Arg Ile Tyr Asp Asp Gln Gly Arg 305 310
315 320 Cys Gln Glu Gly Leu Arg Thr Leu Ala Glu Ala Gly Ala Lys Ile
Ser 325 330 335 Ile Met Thr Tyr Ser Glu Phe Lys His Cys Trp Asp Thr
Phe Val Asp 340 345 350 His Gln Gly Cys Pro Phe Gln Pro Trp Asp Gly
Leu Asp Glu His Ser 355 360 365 Gln Asp Leu Ser Gly Arg Leu Arg Ala
Ile Leu Gln Asn Gln Glu Asn 370 375 380 <210> SEQ ID NO 59
<211> LENGTH: 186 <212> TYPE: PRT <213> ORGANISM:
Artificial <220> FEATURE: <223> OTHER INFORMATION:
hAPOBEC3G catalytic domain <400> SEQUENCE: 59 Pro Pro Thr Phe
Thr Phe Asn Phe Asn Asn Glu Pro Trp Val Arg Gly 1 5 10 15 Arg His
Glu Thr Tyr Leu Cys Tyr Glu Val Glu Arg Met His Asn Asp 20 25 30
Thr Trp Val Leu Leu Asn Gln Arg Arg Gly Phe Leu Cys Asn Gln Ala 35
40 45 Pro His Lys His Gly Phe Leu Glu Gly Arg His Ala Glu Leu Cys
Phe 50 55 60 Leu Asp Val Ile Pro Phe Trp Lys Leu Asp Leu Asp Gln
Asp Tyr Arg 65 70 75 80 Val Thr Cys Phe Thr Ser Trp Ser Pro Cys Phe
Ser Cys Ala Gln Glu 85 90 95 Met Ala Lys Phe Ile Ser Lys Asn Lys
His Val Ser Leu Cys Ile Phe 100 105 110 Thr Ala Arg Ile Tyr Asp Asp
Gln Gly Arg Cys Gln Glu Gly Leu Arg 115 120 125 Thr Leu Ala Glu Ala
Gly Ala Lys Ile Ser Ile Met Thr Tyr Ser Glu 130 135 140 Phe Lys His
Cys Trp Asp Thr Phe Val Asp His Gln Gly Cys Pro Phe 145 150 155 160
Gln Pro Trp Asp Gly Leu Asp Glu His Ser Gln Asp Leu Ser Gly Arg 165
170 175 Leu Arg Ala Ile Leu Gln Asn Gln Glu Asn 180 185 <210>
SEQ ID NO 60 <211> LENGTH: 200 <212> TYPE: PRT
<213> ORGANISM: Homo sapiens <400> SEQUENCE: 60 Met Ala
Leu Leu Thr Ala Glu Thr Phe Arg Leu Gln Phe Asn Asn Lys 1 5 10 15
Arg Arg Leu Arg Arg Pro Tyr Tyr Pro Arg Lys Ala Leu Leu Cys Tyr 20
25 30 Gln Leu Thr Pro Gln Asn Gly Ser Thr Pro Thr Arg Gly Tyr Phe
Glu 35 40 45 Asn Lys Lys Lys Cys His Ala Glu Ile Cys Phe Ile Asn
Glu Ile Lys 50 55 60 Ser Met Gly Leu Asp Glu Thr Gln Cys Tyr Gln
Val Thr Cys Tyr Leu 65 70 75 80 Thr Trp Ser Pro Cys Ser Ser Cys Ala
Trp Glu Leu Val Asp Phe Ile 85 90 95 Lys Ala His Asp His Leu Asn
Leu Gly Ile Phe Ala Ser Arg Leu Tyr 100 105 110 Tyr His Trp Cys Lys
Pro Gln Gln Lys Gly Leu Arg Leu Leu Cys Gly 115 120 125
Ser Gln Val Pro Val Glu Val Met Gly Phe Pro Lys Phe Ala Asp Cys 130
135 140 Trp Glu Asn Phe Val Asp His Glu Lys Pro Leu Ser Phe Asn Pro
Tyr 145 150 155 160 Lys Met Leu Glu Glu Leu Asp Lys Asn Ser Arg Ala
Ile Lys Arg Arg 165 170 175 Leu Glu Arg Ile Lys Ile Pro Gly Val Arg
Ala Gln Gly Arg Tyr Met 180 185 190 Asp Ile Leu Cys Asp Ala Glu Val
195 200 <210> SEQ ID NO 61 <211> LENGTH: 373
<212> TYPE: PRT <213> ORGANISM: Homo sapiens
<400> SEQUENCE: 61 Met Lys Pro His Phe Arg Asn Thr Val Glu
Arg Met Tyr Arg Asp Thr 1 5 10 15 Phe Ser Tyr Asn Phe Tyr Asn Arg
Pro Ile Leu Ser Arg Arg Asn Thr 20 25 30 Val Trp Leu Cys Tyr Glu
Val Lys Thr Lys Gly Pro Ser Arg Pro Arg 35 40 45 Leu Asp Ala Lys
Ile Phe Arg Gly Gln Val Tyr Ser Gln Pro Glu His 50 55 60 His Ala
Glu Met Cys Phe Leu Ser Trp Phe Cys Gly Asn Gln Leu Pro 65 70 75 80
Ala Tyr Lys Cys Phe Gln Ile Thr Trp Phe Val Ser Trp Thr Pro Cys 85
90 95 Pro Asp Cys Val Ala Lys Leu Ala Glu Phe Leu Ala Glu His Pro
Asn 100 105 110 Val Thr Leu Thr Ile Ser Ala Ala Arg Leu Tyr Tyr Tyr
Trp Glu Arg 115 120 125 Asp Tyr Arg Arg Ala Leu Cys Arg Leu Ser Gln
Ala Gly Ala Arg Val 130 135 140 Lys Ile Met Asp Asp Glu Glu Phe Ala
Tyr Cys Trp Glu Asn Phe Val 145 150 155 160 Tyr Ser Glu Gly Gln Pro
Phe Met Pro Trp Tyr Lys Phe Asp Asp Asn 165 170 175 Tyr Ala Phe Leu
His Arg Thr Leu Lys Glu Ile Leu Arg Asn Pro Met 180 185 190 Glu Ala
Met Tyr Pro His Ile Phe Tyr Phe His Phe Lys Asn Leu Arg 195 200 205
Lys Ala Tyr Gly Arg Asn Glu Ser Trp Leu Cys Phe Thr Met Glu Val 210
215 220 Val Lys His His Ser Pro Val Ser Trp Lys Arg Gly Val Phe Arg
Asn 225 230 235 240 Gln Val Asp Pro Glu Thr His Cys His Ala Glu Arg
Cys Phe Leu Ser 245 250 255 Trp Phe Cys Asp Asp Ile Leu Ser Pro Asn
Thr Asn Tyr Glu Val Thr 260 265 270 Trp Tyr Thr Ser Trp Ser Pro Cys
Pro Glu Cys Ala Gly Glu Val Ala 275 280 285 Glu Phe Leu Ala Arg His
Ser Asn Val Asn Leu Thr Ile Phe Thr Ala 290 295 300 Arg Leu Tyr Tyr
Phe Trp Asp Thr Asp Tyr Gln Glu Gly Leu Arg Ser 305 310 315 320 Leu
Ser Gln Glu Gly Ala Ser Val Glu Ile Met Gly Tyr Lys Asp Phe 325 330
335 Lys Tyr Cys Trp Glu Asn Phe Val Tyr Asn Asp Asp Glu Pro Phe Lys
340 345 350 Pro Trp Lys Gly Leu Lys Tyr Asn Phe Leu Phe Leu Asp Ser
Lys Leu 355 360 365 Gln Glu Ile Leu Glu 370 <210> SEQ ID NO
62 <211> LENGTH: 189 <212> TYPE: PRT <213>
ORGANISM: Artificial <220> FEATURE: <223> OTHER
INFORMATION: hAPOBEC3F catalytic domain <400> SEQUENCE: 62
Lys Glu Ile Leu Arg Asn Pro Met Glu Ala Met Tyr Pro His Ile Phe 1 5
10 15 Tyr Phe His Phe Lys Asn Leu Arg Lys Ala Tyr Gly Arg Asn Glu
Ser 20 25 30 Trp Leu Cys Phe Thr Met Glu Val Val Lys His His Ser
Pro Val Ser 35 40 45 Trp Lys Arg Gly Val Phe Arg Asn Gln Val Asp
Pro Glu Thr His Cys 50 55 60 His Ala Glu Arg Cys Phe Leu Ser Trp
Phe Cys Asp Asp Ile Leu Ser 65 70 75 80 Pro Asn Thr Asn Tyr Glu Val
Thr Trp Tyr Thr Ser Trp Ser Pro Cys 85 90 95 Pro Glu Cys Ala Gly
Glu Val Ala Glu Phe Leu Ala Arg His Ser Asn 100 105 110 Val Asn Leu
Thr Ile Phe Thr Ala Arg Leu Tyr Tyr Phe Trp Asp Thr 115 120 125 Asp
Tyr Gln Glu Gly Leu Arg Ser Leu Ser Gln Glu Gly Ala Ser Val 130 135
140 Glu Ile Met Gly Tyr Lys Asp Phe Lys Tyr Cys Trp Glu Asn Phe Val
145 150 155 160 Tyr Asn Asp Asp Glu Pro Phe Lys Pro Trp Lys Gly Leu
Lys Tyr Asn 165 170 175 Phe Leu Phe Leu Asp Ser Lys Leu Gln Glu Ile
Leu Glu 180 185 <210> SEQ ID NO 63 <211> LENGTH: 1053
<212> TYPE: PRT <213> ORGANISM: Staphylococcus aureus
<400> SEQUENCE: 63 Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp
Ile Gly Ile Thr Ser Val 1 5 10 15 Gly Tyr Gly Ile Ile Asp Tyr Glu
Thr Arg Asp Val Ile Asp Ala Gly 20 25 30 Val Arg Leu Phe Lys Glu
Ala Asn Val Glu Asn Asn Glu Gly Arg Arg 35 40 45 Ser Lys Arg Gly
Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile 50 55 60 Gln Arg
Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His 65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu 85
90 95 Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His
Leu 100 105 110 Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu
Glu Asp Thr 115 120 125 Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser
Arg Asn Ser Lys Ala 130 135 140 Leu Glu Glu Lys Tyr Val Ala Glu Leu
Gln Leu Glu Arg Leu Lys Lys 145 150 155 160 Asp Gly Glu Val Arg Gly
Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr 165 170 175 Val Lys Glu Ala
Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln 180 185 190 Leu Asp
Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg 195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys 210
215 220 Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr
Phe 225 230 235 240 Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn
Ala Asp Leu Tyr 245 250 255 Asn Ala Leu Asn Asp Leu Asn Asn Leu Val
Ile Thr Arg Asp Glu Asn 260 265 270 Glu Lys Leu Glu Tyr Tyr Glu Lys
Phe Gln Ile Ile Glu Asn Val Phe 275 280 285 Lys Gln Lys Lys Lys Pro
Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu 290 295 300 Val Asn Glu Glu
Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys 305 310 315 320 Pro
Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr 325 330
335 Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350 Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu
Glu Leu 355 360 365 Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile
Glu Gln Ile Ser 370 375 380 Asn Leu Lys Gly Tyr Thr Gly Thr His Asn
Leu Ser Leu Lys Ala Ile 385 390 395 400 Asn Leu Ile Leu Asp Glu Leu
Trp His Thr Asn Asp Asn Gln Ile Ala 405 410 415 Ile Phe Asn Arg Leu
Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln 420 425 430 Gln Lys Glu
Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro 435 440 445 Val
Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile 450 455
460 Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480 Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu
Met Gln Lys 485 490 495 Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu
Ile Ile Arg Thr Thr 500 505 510 Gly Lys Glu Asn Ala Lys Tyr Leu Ile
Glu Lys Ile Lys Leu His Asp 515 520 525 Met Gln Glu Gly Lys Cys Leu
Tyr Ser Leu Glu Ala Ile Pro Leu Glu 530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro 545
550 555 560 Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu
Val Lys 565 570 575 Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro
Phe Gln Tyr Leu 580 585 590 Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu
Thr Phe Lys Lys His Ile 595 600 605 Leu Asn Leu Ala Lys Gly Lys Gly
Arg Ile Ser Lys Thr Lys Lys Glu 610 615 620 Tyr Leu Leu Glu Glu Arg
Asp Ile Asn Arg Phe Ser Val Gln Lys Asp 625 630 635 640 Phe Ile Asn
Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu 645 650 655 Met
Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys 660 665
670 Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685 Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala
Glu Asp 690 695 700 Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys
Glu Trp Lys Lys 705 710 715 720 Leu Asp Lys Ala Lys Lys Val Met Glu
Asn Gln Met Phe Glu Glu Lys 725 730 735 Gln Ala Glu Ser Met Pro Glu
Ile Glu Thr Glu Gln Glu Tyr Lys Glu 740 745 750 Ile Phe Ile Thr Pro
His Gln Ile Lys His Ile Lys Asp Phe Lys Asp 755 760 765 Tyr Lys Tyr
Ser His Arg Val Asp Lys Lys Pro Asn Arg Glu Leu Ile 770 775 780 Asn
Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu 785 790
795 800 Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys
Leu 805 810 815 Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met
Tyr His His 820 825 830 Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile
Met Glu Gln Tyr Gly 835 840 845 Asp Glu Lys Asn Pro Leu Tyr Lys Tyr
Tyr Glu Glu Thr Gly Asn Tyr 850 855 860 Leu Thr Lys Tyr Ser Lys Lys
Asp Asn Gly Pro Val Ile Lys Lys Ile 865 870 875 880 Lys Tyr Tyr Gly
Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp 885 890 895 Tyr Pro
Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr 900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val 915
920 925 Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn
Ser 930 935 940 Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser
Asn Gln Ala 945 950 955 960 Glu Phe Ile Ala Ser Phe Tyr Asn Asn Asp
Leu Ile Lys Ile Asn Gly 965 970 975 Glu Leu Tyr Arg Val Ile Gly Val
Asn Asn Asp Leu Leu Asn Arg Ile 980 985 990 Glu Val Asn Met Ile Asp
Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met 995 1000 1005 Asn Asp Lys
Arg Pro Pro Arg Ile Ile Lys Thr Ile Ala Ser Lys 1010 1015 1020 Thr
Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu 1025 1030
1035 Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050 <210> SEQ ID NO 64 <211> LENGTH: 984
<212> TYPE: PRT <213> ORGANISM: Campylobacter jejuni
<400> SEQUENCE: 64 Met Ala Arg Ile Leu Ala Phe Asp Ile Gly
Ile Ser Ser Ile Gly Trp 1 5 10 15 Ala Phe Ser Glu Asn Asp Glu Leu
Lys Asp Cys Gly Val Arg Ile Phe 20 25 30 Thr Lys Val Glu Asn Pro
Lys Thr Gly Glu Ser Leu Ala Leu Pro Arg 35 40 45 Arg Leu Ala Arg
Ser Ala Arg Lys Arg Leu Ala Arg Arg Lys Ala Arg 50 55 60 Leu Asn
His Leu Lys His Leu Ile Ala Asn Glu Phe Lys Leu Asn Tyr 65 70 75 80
Glu Asp Tyr Gln Ser Phe Asp Glu Ser Leu Ala Lys Ala Tyr Lys Gly 85
90 95 Ser Leu Ile Ser Pro Tyr Glu Leu Arg Phe Arg Ala Leu Asn Glu
Leu 100 105 110 Leu Ser Lys Gln Asp Phe Ala Arg Val Ile Leu His Ile
Ala Lys Arg 115 120 125 Arg Gly Tyr Asp Asp Ile Lys Asn Ser Asp Asp
Lys Glu Lys Gly Ala 130 135 140 Ile Leu Lys Ala Ile Lys Gln Asn Glu
Glu Lys Leu Ala Asn Tyr Gln 145 150 155 160 Ser Val Gly Glu Tyr Leu
Tyr Lys Glu Tyr Phe Gln Lys Phe Lys Glu 165 170 175 Asn Ser Lys Glu
Phe Thr Asn Val Arg Asn Lys Lys Glu Ser Tyr Glu 180 185 190 Arg Cys
Ile Ala Gln Ser Phe Leu Lys Asp Glu Leu Lys Leu Ile Phe 195 200 205
Lys Lys Gln Arg Glu Phe Gly Phe Ser Phe Ser Lys Lys Phe Glu Glu 210
215 220 Glu Val Leu Ser Val Ala Phe Tyr Lys Arg Ala Leu Lys Asp Phe
Ser 225 230 235 240 His Leu Val Gly Asn Cys Ser Phe Phe Thr Asp Glu
Lys Arg Ala Pro 245 250 255 Lys Asn Ser Pro Leu Ala Phe Met Phe Val
Ala Leu Thr Arg Ile Ile 260 265 270 Asn Leu Leu Asn Asn Leu Lys Asn
Thr Glu Gly Ile Leu Tyr Thr Lys 275 280 285 Asp Asp Leu Asn Ala Leu
Leu Asn Glu Val Leu Lys Asn Gly Thr Leu 290 295 300 Thr Tyr Lys Gln
Thr Lys Lys Leu Leu Gly Leu Ser Asp Asp Tyr Glu 305 310 315 320 Phe
Lys Gly Glu Lys Gly Thr Tyr Phe Ile Glu Phe Lys Lys Tyr Lys 325 330
335 Glu Phe Ile Lys Ala Leu Gly Glu His Asn Leu Ser Gln Asp Asp Leu
340 345 350 Asn Glu Ile Ala Lys Asp Ile Thr Leu Ile Lys Asp Glu Ile
Lys Leu 355 360 365 Lys Lys Ala Leu Ala Lys Tyr Asp Leu Asn Gln Asn
Gln Ile Asp Ser 370 375 380 Leu Ser Lys Leu Glu Phe Lys Asp His Leu
Asn Ile Ser Phe Lys Ala 385 390 395 400 Leu Lys Leu Val Thr Pro Leu
Met Leu Glu Gly Lys Lys Tyr Asp Glu 405 410 415 Ala Cys Asn Glu Leu
Asn Leu Lys Val Ala Ile Asn Glu Asp Lys Lys 420 425 430 Asp Phe Leu
Pro Ala Phe Asn Glu Thr Tyr Tyr Lys Asp Glu Val Thr 435 440 445 Asn
Pro Val Val Leu Arg Ala Ile Lys Glu Tyr Arg Lys Val Leu Asn 450 455
460 Ala Leu Leu Lys Lys Tyr Gly Lys Val His Lys Ile Asn Ile Glu Leu
465 470 475 480 Ala Arg Glu Val Gly Lys Asn His Ser Gln Arg Ala Lys
Ile Glu Lys 485 490 495 Glu Gln Asn Glu Asn Tyr Lys Ala Lys Lys Asp
Ala Glu Leu Glu Cys 500 505 510 Glu Lys Leu Gly Leu Lys Ile Asn Ser
Lys Asn Ile Leu Lys Leu Arg 515 520 525 Leu Phe Lys Glu Gln Lys Glu
Phe Cys Ala Tyr Ser Gly Glu Lys Ile 530 535 540 Lys Ile Ser Asp Leu
Gln Asp Glu Lys Met Leu Glu Ile Asp His Ile 545 550 555 560 Tyr Pro
Tyr Ser Arg Ser Phe Asp Asp Ser Tyr Met Asn Lys Val Leu 565 570 575
Val Phe Thr Lys Gln Asn Gln Glu Lys Leu Asn Gln Thr Pro Phe Glu 580
585 590 Ala Phe Gly Asn Asp Ser Ala Lys Trp Gln Lys Ile Glu Val Leu
Ala 595 600 605 Lys Asn Leu Pro Thr Lys Lys Gln Lys Arg Ile Leu Asp
Lys Asn Tyr 610 615 620 Lys Asp Lys Glu Gln Lys Asn Phe Lys Asp Arg
Asn Leu Asn Asp Thr 625 630 635 640 Arg Tyr Ile Ala Arg Leu Val Leu
Asn Tyr Thr Lys Asp Tyr Leu Asp 645 650 655 Phe Leu Pro Leu Ser Asp
Asp Glu Asn Thr Lys Leu Asn Asp Thr Gln 660 665 670 Lys Gly Ser Lys
Val His Val Glu Ala Lys Ser Gly Met Leu Thr Ser 675 680 685 Ala Leu
Arg His Thr Trp Gly Phe Ser Ala Lys Asp Arg Asn Asn His 690 695 700
Leu His His Ala Ile Asp Ala Val Ile Ile Ala Tyr Ala Asn Asn Ser 705
710 715 720 Ile Val Lys Ala Phe Ser Asp Phe Lys Lys Glu Gln Glu Ser
Asn Ser 725 730 735 Ala Glu Leu Tyr Ala Lys Lys Ile Ser Glu Leu Asp
Tyr Lys Asn Lys 740 745 750 Arg Lys Phe Phe Glu Pro Phe Ser Gly Phe
Arg Gln Lys Val Leu Asp 755 760 765 Lys Ile Asp Glu Ile Phe Val Ser
Lys Pro Glu Arg Lys Lys Pro Ser 770 775 780
Gly Ala Leu His Glu Glu Thr Phe Arg Lys Glu Glu Glu Phe Tyr Gln 785
790 795 800 Ser Tyr Gly Gly Lys Glu Gly Val Leu Lys Ala Leu Glu Leu
Gly Lys 805 810 815 Ile Arg Lys Val Asn Gly Lys Ile Val Lys Asn Gly
Asp Met Phe Arg 820 825 830 Val Asp Ile Phe Lys His Lys Lys Thr Asn
Lys Phe Tyr Ala Val Pro 835 840 845 Ile Tyr Thr Met Asp Phe Ala Leu
Lys Val Leu Pro Asn Lys Ala Val 850 855 860 Ala Arg Ser Lys Lys Gly
Glu Ile Lys Asp Trp Ile Leu Met Asp Glu 865 870 875 880 Asn Tyr Glu
Phe Cys Phe Ser Leu Tyr Lys Asp Ser Leu Ile Leu Ile 885 890 895 Gln
Thr Lys Asp Met Gln Glu Pro Glu Phe Val Tyr Tyr Asn Ala Phe 900 905
910 Thr Ser Ser Thr Val Ser Leu Ile Val Ser Lys His Asp Asn Lys Phe
915 920 925 Glu Thr Leu Ser Lys Asn Gln Lys Ile Leu Phe Lys Asn Ala
Asn Glu 930 935 940 Lys Glu Val Ile Ala Lys Ser Ile Gly Ile Gln Asn
Leu Lys Val Phe 945 950 955 960 Glu Lys Tyr Ile Val Ser Ala Leu Gly
Glu Val Thr Lys Ala Glu Phe 965 970 975 Arg Gln Arg Glu Asp Phe Lys
Lys 980 <210> SEQ ID NO 65 <211> LENGTH: 1037
<212> TYPE: PRT <213> ORGANISM: Parvibaculum
lavamentivorans <400> SEQUENCE: 65 Met Glu Arg Ile Phe Gly
Phe Asp Ile Gly Thr Thr Ser Ile Gly Phe 1 5 10 15 Ser Val Ile Asp
Tyr Ser Ser Thr Gln Ser Ala Gly Asn Ile Gln Arg 20 25 30 Leu Gly
Val Arg Ile Phe Pro Glu Ala Arg Asp Pro Asp Gly Thr Pro 35 40 45
Leu Asn Gln Gln Arg Arg Gln Lys Arg Met Met Arg Arg Gln Leu Arg 50
55 60 Arg Arg Arg Ile Arg Arg Lys Ala Leu Asn Glu Thr Leu His Glu
Ala 65 70 75 80 Gly Phe Leu Pro Ala Tyr Gly Ser Ala Asp Trp Pro Val
Val Met Ala 85 90 95 Asp Glu Pro Tyr Glu Leu Arg Arg Arg Gly Leu
Glu Glu Gly Leu Ser 100 105 110 Ala Tyr Glu Phe Gly Arg Ala Ile Tyr
His Leu Ala Gln His Arg His 115 120 125 Phe Lys Gly Arg Glu Leu Glu
Glu Ser Asp Thr Pro Asp Pro Asp Val 130 135 140 Asp Asp Glu Lys Glu
Ala Ala Asn Glu Arg Ala Ala Thr Leu Lys Ala 145 150 155 160 Leu Lys
Asn Glu Gln Thr Thr Leu Gly Ala Trp Leu Ala Arg Arg Pro 165 170 175
Pro Ser Asp Arg Lys Arg Gly Ile His Ala His Arg Asn Val Val Ala 180
185 190 Glu Glu Phe Glu Arg Leu Trp Glu Val Gln Ser Lys Phe His Pro
Ala 195 200 205 Leu Lys Ser Glu Glu Met Arg Ala Arg Ile Ser Asp Thr
Ile Phe Ala 210 215 220 Gln Arg Pro Val Phe Trp Arg Lys Asn Thr Leu
Gly Glu Cys Arg Phe 225 230 235 240 Met Pro Gly Glu Pro Leu Cys Pro
Lys Gly Ser Trp Leu Ser Gln Gln 245 250 255 Arg Arg Met Leu Glu Lys
Leu Asn Asn Leu Ala Ile Ala Gly Gly Asn 260 265 270 Ala Arg Pro Leu
Asp Ala Glu Glu Arg Asp Ala Ile Leu Ser Lys Leu 275 280 285 Gln Gln
Gln Ala Ser Met Ser Trp Pro Gly Val Arg Ser Ala Leu Lys 290 295 300
Ala Leu Tyr Lys Gln Arg Gly Glu Pro Gly Ala Glu Lys Ser Leu Lys 305
310 315 320 Phe Asn Leu Glu Leu Gly Gly Glu Ser Lys Leu Leu Gly Asn
Ala Leu 325 330 335 Glu Ala Lys Leu Ala Asp Met Phe Gly Pro Asp Trp
Pro Ala His Pro 340 345 350 Arg Lys Gln Glu Ile Arg His Ala Val His
Glu Arg Leu Trp Ala Ala 355 360 365 Asp Tyr Gly Glu Thr Pro Asp Lys
Lys Arg Val Ile Ile Leu Ser Glu 370 375 380 Lys Asp Arg Lys Ala His
Arg Glu Ala Ala Ala Asn Ser Phe Val Ala 385 390 395 400 Asp Phe Gly
Ile Thr Gly Glu Gln Ala Ala Gln Leu Gln Ala Leu Lys 405 410 415 Leu
Pro Thr Gly Trp Glu Pro Tyr Ser Ile Pro Ala Leu Asn Leu Phe 420 425
430 Leu Ala Glu Leu Glu Lys Gly Glu Arg Phe Gly Ala Leu Val Asn Gly
435 440 445 Pro Asp Trp Glu Gly Trp Arg Arg Thr Asn Phe Pro His Arg
Asn Gln 450 455 460 Pro Thr Gly Glu Ile Leu Asp Lys Leu Pro Ser Pro
Ala Ser Lys Glu 465 470 475 480 Glu Arg Glu Arg Ile Ser Gln Leu Arg
Asn Pro Thr Val Val Arg Thr 485 490 495 Gln Asn Glu Leu Arg Lys Val
Val Asn Asn Leu Ile Gly Leu Tyr Gly 500 505 510 Lys Pro Asp Arg Ile
Arg Ile Glu Val Gly Arg Asp Val Gly Lys Ser 515 520 525 Lys Arg Glu
Arg Glu Glu Ile Gln Ser Gly Ile Arg Arg Asn Glu Lys 530 535 540 Gln
Arg Lys Lys Ala Thr Glu Asp Leu Ile Lys Asn Gly Ile Ala Asn 545 550
555 560 Pro Ser Arg Asp Asp Val Glu Lys Trp Ile Leu Trp Lys Glu Gly
Gln 565 570 575 Glu Arg Cys Pro Tyr Thr Gly Asp Gln Ile Gly Phe Asn
Ala Leu Phe 580 585 590 Arg Glu Gly Arg Tyr Glu Val Glu His Ile Trp
Pro Arg Ser Arg Ser 595 600 605 Phe Asp Asn Ser Pro Arg Asn Lys Thr
Leu Cys Arg Lys Asp Val Asn 610 615 620 Ile Glu Lys Gly Asn Arg Met
Pro Phe Glu Ala Phe Gly His Asp Glu 625 630 635 640 Asp Arg Trp Ser
Ala Ile Gln Ile Arg Leu Gln Gly Met Val Ser Ala 645 650 655 Lys Gly
Gly Thr Gly Met Ser Pro Gly Lys Val Lys Arg Phe Leu Ala 660 665 670
Lys Thr Met Pro Glu Asp Phe Ala Ala Arg Gln Leu Asn Asp Thr Arg 675
680 685 Tyr Ala Ala Lys Gln Ile Leu Ala Gln Leu Lys Arg Leu Trp Pro
Asp 690 695 700 Met Gly Pro Glu Ala Pro Val Lys Val Glu Ala Val Thr
Gly Gln Val 705 710 715 720 Thr Ala Gln Leu Arg Lys Leu Trp Thr Leu
Asn Asn Ile Leu Ala Asp 725 730 735 Asp Gly Glu Lys Thr Arg Ala Asp
His Arg His His Ala Ile Asp Ala 740 745 750 Leu Thr Val Ala Cys Thr
His Pro Gly Met Thr Asn Lys Leu Ser Arg 755 760 765 Tyr Trp Gln Leu
Arg Asp Asp Pro Arg Ala Glu Lys Pro Ala Leu Thr 770 775 780 Pro Pro
Trp Asp Thr Ile Arg Ala Asp Ala Glu Lys Ala Val Ser Glu 785 790 795
800 Ile Val Val Ser His Arg Val Arg Lys Lys Val Ser Gly Pro Leu His
805 810 815 Lys Glu Thr Thr Tyr Gly Asp Thr Gly Thr Asp Ile Lys Thr
Lys Ser 820 825 830 Gly Thr Tyr Arg Gln Phe Val Thr Arg Lys Lys Ile
Glu Ser Leu Ser 835 840 845 Lys Gly Glu Leu Asp Glu Ile Arg Asp Pro
Arg Ile Lys Glu Ile Val 850 855 860 Ala Ala His Val Ala Gly Arg Gly
Gly Asp Pro Lys Lys Ala Phe Pro 865 870 875 880 Pro Tyr Pro Cys Val
Ser Pro Gly Gly Pro Glu Ile Arg Lys Val Arg 885 890 895 Leu Thr Ser
Lys Gln Gln Leu Asn Leu Met Ala Gln Thr Gly Asn Gly 900 905 910 Tyr
Ala Asp Leu Gly Ser Asn His His Ile Ala Ile Tyr Arg Leu Pro 915 920
925 Asp Gly Lys Ala Asp Phe Glu Ile Val Ser Leu Phe Asp Ala Ser Arg
930 935 940 Arg Leu Ala Gln Arg Asn Pro Ile Val Gln Arg Thr Arg Ala
Asp Gly 945 950 955 960 Ala Ser Phe Val Met Ser Leu Ala Ala Gly Glu
Ala Ile Met Ile Pro 965 970 975 Glu Gly Ser Lys Lys Gly Ile Trp Ile
Val Gln Gly Val Trp Ala Ser 980 985 990 Gly Gln Val Val Leu Glu Arg
Asp Thr Asp Ala Asp His Ser Thr Thr 995 1000 1005 Thr Arg Pro Met
Pro Asn Pro Ile Leu Lys Asp Asp Ala Lys Lys 1010 1015 1020 Val Ser
Ile Asp Pro Ile Gly Arg Val Arg Pro Ser Asn Asp 1025 1030 1035
<210> SEQ ID NO 66 <211> LENGTH: 1082 <212> TYPE:
PRT <213> ORGANISM: Neisseria cinerea <400> SEQUENCE:
66 Met Ala Ala Phe Lys Pro Asn Pro Met Asn Tyr Ile Leu Gly Leu
Asp
1 5 10 15 Ile Gly Ile Ala Ser Val Gly Trp Ala Ile Val Glu Ile Asp
Glu Glu 20 25 30 Glu Asn Pro Ile Arg Leu Ile Asp Leu Gly Val Arg
Val Phe Glu Arg 35 40 45 Ala Glu Val Pro Lys Thr Gly Asp Ser Leu
Ala Ala Ala Arg Arg Leu 50 55 60 Ala Arg Ser Val Arg Arg Leu Thr
Arg Arg Arg Ala His Arg Leu Leu 65 70 75 80 Arg Ala Arg Arg Leu Leu
Lys Arg Glu Gly Val Leu Gln Ala Ala Asp 85 90 95 Phe Asp Glu Asn
Gly Leu Ile Lys Ser Leu Pro Asn Thr Pro Trp Gln 100 105 110 Leu Arg
Ala Ala Ala Leu Asp Arg Lys Leu Thr Pro Leu Glu Trp Ser 115 120 125
Ala Val Leu Leu His Leu Ile Lys His Arg Gly Tyr Leu Ser Gln Arg 130
135 140 Lys Asn Glu Gly Glu Thr Ala Asp Lys Glu Leu Gly Ala Leu Leu
Lys 145 150 155 160 Gly Val Ala Asp Asn Thr His Ala Leu Gln Thr Gly
Asp Phe Arg Thr 165 170 175 Pro Ala Glu Leu Ala Leu Asn Lys Phe Glu
Lys Glu Ser Gly His Ile 180 185 190 Arg Asn Gln Arg Gly Asp Tyr Ser
His Thr Phe Asn Arg Lys Asp Leu 195 200 205 Gln Ala Glu Leu Asn Leu
Leu Phe Glu Lys Gln Lys Glu Phe Gly Asn 210 215 220 Pro His Val Ser
Asp Gly Leu Lys Glu Gly Ile Glu Thr Leu Leu Met 225 230 235 240 Thr
Gln Arg Pro Ala Leu Ser Gly Asp Ala Val Gln Lys Met Leu Gly 245 250
255 His Cys Thr Phe Glu Pro Thr Glu Pro Lys Ala Ala Lys Asn Thr Tyr
260 265 270 Thr Ala Glu Arg Phe Val Trp Leu Thr Lys Leu Asn Asn Leu
Arg Ile 275 280 285 Leu Glu Gln Gly Ser Glu Arg Pro Leu Thr Asp Thr
Glu Arg Ala Thr 290 295 300 Leu Met Asp Glu Pro Tyr Arg Lys Ser Lys
Leu Thr Tyr Ala Gln Ala 305 310 315 320 Arg Lys Leu Leu Asp Leu Asp
Asp Thr Ala Phe Phe Lys Gly Leu Arg 325 330 335 Tyr Gly Lys Asp Asn
Ala Glu Ala Ser Thr Leu Met Glu Met Lys Ala 340 345 350 Tyr His Ala
Ile Ser Arg Ala Leu Glu Lys Glu Gly Leu Lys Asp Lys 355 360 365 Lys
Ser Pro Leu Asn Leu Ser Pro Glu Leu Gln Asp Glu Ile Gly Thr 370 375
380 Ala Phe Ser Leu Phe Lys Thr Asp Glu Asp Ile Thr Gly Arg Leu Lys
385 390 395 400 Asp Arg Val Gln Pro Glu Ile Leu Glu Ala Leu Leu Lys
His Ile Ser 405 410 415 Phe Asp Lys Phe Val Gln Ile Ser Leu Lys Ala
Leu Arg Arg Ile Val 420 425 430 Pro Leu Met Glu Gln Gly Asn Arg Tyr
Asp Glu Ala Cys Thr Glu Ile 435 440 445 Tyr Gly Asp His Tyr Gly Lys
Lys Asn Thr Glu Glu Lys Ile Tyr Leu 450 455 460 Pro Pro Ile Pro Ala
Asp Glu Ile Arg Asn Pro Val Val Leu Arg Ala 465 470 475 480 Leu Ser
Gln Ala Arg Lys Val Ile Asn Gly Val Val Arg Arg Tyr Gly 485 490 495
Ser Pro Ala Arg Ile His Ile Glu Thr Ala Arg Glu Val Gly Lys Ser 500
505 510 Phe Lys Asp Arg Lys Glu Ile Glu Lys Arg Gln Glu Glu Asn Arg
Lys 515 520 525 Asp Arg Glu Lys Ser Ala Ala Lys Phe Arg Glu Tyr Phe
Pro Asn Phe 530 535 540 Val Gly Glu Pro Lys Ser Lys Asp Ile Leu Lys
Leu Arg Leu Tyr Glu 545 550 555 560 Gln Gln His Gly Lys Cys Leu Tyr
Ser Gly Lys Glu Ile Asn Leu Gly 565 570 575 Arg Leu Asn Glu Lys Gly
Tyr Val Glu Ile Asp His Ala Leu Pro Phe 580 585 590 Ser Arg Thr Trp
Asp Asp Ser Phe Asn Asn Lys Val Leu Ala Leu Gly 595 600 605 Ser Glu
Asn Gln Asn Lys Gly Asn Gln Thr Pro Tyr Glu Tyr Phe Asn 610 615 620
Gly Lys Asp Asn Ser Arg Glu Trp Gln Glu Phe Lys Ala Arg Val Glu 625
630 635 640 Thr Ser Arg Phe Pro Arg Ser Lys Lys Gln Arg Ile Leu Leu
Gln Lys 645 650 655 Phe Asp Glu Asp Gly Phe Lys Glu Arg Asn Leu Asn
Asp Thr Arg Tyr 660 665 670 Ile Asn Arg Phe Leu Cys Gln Phe Val Ala
Asp His Met Leu Leu Thr 675 680 685 Gly Lys Gly Lys Arg Arg Val Phe
Ala Ser Asn Gly Gln Ile Thr Asn 690 695 700 Leu Leu Arg Gly Phe Trp
Gly Leu Arg Lys Val Arg Ala Glu Asn Asp 705 710 715 720 Arg His His
Ala Leu Asp Ala Val Val Val Ala Cys Ser Thr Ile Ala 725 730 735 Met
Gln Gln Lys Ile Thr Arg Phe Val Arg Tyr Lys Glu Met Asn Ala 740 745
750 Phe Asp Gly Lys Thr Ile Asp Lys Glu Thr Gly Glu Val Leu His Gln
755 760 765 Lys Ala His Phe Pro Gln Pro Trp Glu Phe Phe Ala Gln Glu
Val Met 770 775 780 Ile Arg Val Phe Gly Lys Pro Asp Gly Lys Pro Glu
Phe Glu Glu Ala 785 790 795 800 Asp Thr Pro Glu Lys Leu Arg Thr Leu
Leu Ala Glu Lys Leu Ser Ser 805 810 815 Arg Pro Glu Ala Val His Lys
Tyr Val Thr Pro Leu Phe Ile Ser Arg 820 825 830 Ala Pro Asn Arg Lys
Met Ser Gly Gln Gly His Met Glu Thr Val Lys 835 840 845 Ser Ala Lys
Arg Leu Asp Glu Gly Ile Ser Val Leu Arg Val Pro Leu 850 855 860 Thr
Gln Leu Lys Leu Lys Asp Leu Glu Lys Met Val Asn Arg Glu Arg 865 870
875 880 Glu Pro Lys Leu Tyr Glu Ala Leu Lys Ala Arg Leu Glu Ala His
Lys 885 890 895 Asp Asp Pro Ala Lys Ala Phe Ala Glu Pro Phe Tyr Lys
Tyr Asp Lys 900 905 910 Ala Gly Asn Arg Thr Gln Gln Val Lys Ala Val
Arg Val Glu Gln Val 915 920 925 Gln Lys Thr Gly Val Trp Val His Asn
His Asn Gly Ile Ala Asp Asn 930 935 940 Ala Thr Ile Val Arg Val Asp
Val Phe Glu Lys Gly Gly Lys Tyr Tyr 945 950 955 960 Leu Val Pro Ile
Tyr Ser Trp Gln Val Ala Lys Gly Ile Leu Pro Asp 965 970 975 Arg Ala
Val Val Gln Gly Lys Asp Glu Glu Asp Trp Thr Val Met Asp 980 985 990
Asp Ser Phe Glu Phe Lys Phe Val Leu Tyr Ala Asn Asp Leu Ile Lys 995
1000 1005 Leu Thr Ala Lys Lys Asn Glu Phe Leu Gly Tyr Phe Val Ser
Leu 1010 1015 1020 Asn Arg Ala Thr Gly Ala Ile Asp Ile Arg Thr His
Asp Thr Asp 1025 1030 1035 Ser Thr Lys Gly Lys Asn Gly Ile Phe Gln
Ser Val Gly Val Lys 1040 1045 1050 Thr Ala Leu Ser Phe Gln Lys Tyr
Gln Ile Asp Glu Leu Gly Lys 1055 1060 1065 Glu Ile Arg Pro Cys Arg
Leu Lys Lys Arg Pro Pro Val Arg 1070 1075 1080 <210> SEQ ID
NO 67 <211> LENGTH: 1003 <212> TYPE: PRT <213>
ORGANISM: Campylobacter lari <400> SEQUENCE: 67 Met Arg Ile
Leu Gly Phe Asp Ile Gly Ile Asn Ser Ile Gly Trp Ala 1 5 10 15 Phe
Val Glu Asn Asp Glu Leu Lys Asp Cys Gly Val Arg Ile Phe Thr 20 25
30 Lys Ala Glu Asn Pro Lys Asn Lys Glu Ser Leu Ala Leu Pro Arg Arg
35 40 45 Asn Ala Arg Ser Ser Arg Arg Arg Leu Lys Arg Arg Lys Ala
Arg Leu 50 55 60 Ile Ala Ile Lys Arg Ile Leu Ala Lys Glu Leu Lys
Leu Asn Tyr Lys 65 70 75 80 Asp Tyr Val Ala Ala Asp Gly Glu Leu Pro
Lys Ala Tyr Glu Gly Ser 85 90 95 Leu Ala Ser Val Tyr Glu Leu Arg
Tyr Lys Ala Leu Thr Gln Asn Leu 100 105 110 Glu Thr Lys Asp Leu Ala
Arg Val Ile Leu His Ile Ala Lys His Arg 115 120 125 Gly Tyr Met Asn
Lys Asn Glu Lys Lys Ser Asn Asp Ala Lys Lys Gly 130 135 140 Lys Ile
Leu Ser Ala Leu Lys Asn Asn Ala Leu Lys Leu Glu Asn Tyr 145 150 155
160 Gln Ser Val Gly Glu Tyr Phe Tyr Lys Glu Phe Phe Gln Lys Tyr Lys
165 170 175 Lys Asn Thr Lys Asn Phe Ile Lys Ile Arg Asn Thr Lys Asp
Asn Tyr 180 185 190 Asn Asn Cys Val Leu Ser Ser Asp Leu Glu Lys Glu
Leu Lys Leu Ile 195 200 205 Leu Glu Lys Gln Lys Glu Phe Gly Tyr Asn
Tyr Ser Glu Asp Phe Ile
210 215 220 Asn Glu Ile Leu Lys Val Ala Phe Phe Gln Arg Pro Leu Lys
Asp Phe 225 230 235 240 Ser His Leu Val Gly Ala Cys Thr Phe Phe Glu
Glu Glu Lys Arg Ala 245 250 255 Cys Lys Asn Ser Tyr Ser Ala Trp Glu
Phe Val Ala Leu Thr Lys Ile 260 265 270 Ile Asn Glu Ile Lys Ser Leu
Glu Lys Ile Ser Gly Glu Ile Val Pro 275 280 285 Thr Gln Thr Ile Asn
Glu Val Leu Asn Leu Ile Leu Asp Lys Gly Ser 290 295 300 Ile Thr Tyr
Lys Lys Phe Arg Ser Cys Ile Asn Leu His Glu Ser Ile 305 310 315 320
Ser Phe Lys Ser Leu Lys Tyr Asp Lys Glu Asn Ala Glu Asn Ala Lys 325
330 335 Leu Ile Asp Phe Arg Lys Leu Val Glu Phe Lys Lys Ala Leu Gly
Val 340 345 350 His Ser Leu Ser Arg Gln Glu Leu Asp Gln Ile Ser Thr
His Ile Thr 355 360 365 Leu Ile Lys Asp Asn Val Lys Leu Lys Thr Val
Leu Glu Lys Tyr Asn 370 375 380 Leu Ser Asn Glu Gln Ile Asn Asn Leu
Leu Glu Ile Glu Phe Asn Asp 385 390 395 400 Tyr Ile Asn Leu Ser Phe
Lys Ala Leu Gly Met Ile Leu Pro Leu Met 405 410 415 Arg Glu Gly Lys
Arg Tyr Asp Glu Ala Cys Glu Ile Ala Asn Leu Lys 420 425 430 Pro Lys
Thr Val Asp Glu Lys Lys Asp Phe Leu Pro Ala Phe Cys Asp 435 440 445
Ser Ile Phe Ala His Glu Leu Ser Asn Pro Val Val Asn Arg Ala Ile 450
455 460 Ser Glu Tyr Arg Lys Val Leu Asn Ala Leu Leu Lys Lys Tyr Gly
Lys 465 470 475 480 Val His Lys Ile His Leu Glu Leu Ala Arg Asp Val
Gly Leu Ser Lys 485 490 495 Lys Ala Arg Glu Lys Ile Glu Lys Glu Gln
Lys Glu Asn Gln Ala Val 500 505 510 Asn Ala Trp Ala Leu Lys Glu Cys
Glu Asn Ile Gly Leu Lys Ala Ser 515 520 525 Ala Lys Asn Ile Leu Lys
Leu Lys Leu Trp Lys Glu Gln Lys Glu Ile 530 535 540 Cys Ile Tyr Ser
Gly Asn Lys Ile Ser Ile Glu His Leu Lys Asp Glu 545 550 555 560 Lys
Ala Leu Glu Val Asp His Ile Tyr Pro Tyr Ser Arg Ser Phe Asp 565 570
575 Asp Ser Phe Ile Asn Lys Val Leu Val Phe Thr Lys Glu Asn Gln Glu
580 585 590 Lys Leu Asn Lys Thr Pro Phe Glu Ala Phe Gly Lys Asn Ile
Glu Lys 595 600 605 Trp Ser Lys Ile Gln Thr Leu Ala Gln Asn Leu Pro
Tyr Lys Lys Lys 610 615 620 Asn Lys Ile Leu Asp Glu Asn Phe Lys Asp
Lys Gln Gln Glu Asp Phe 625 630 635 640 Ile Ser Arg Asn Leu Asn Asp
Thr Arg Tyr Ile Ala Thr Leu Ile Ala 645 650 655 Lys Tyr Thr Lys Glu
Tyr Leu Asn Phe Leu Leu Leu Ser Glu Asn Glu 660 665 670 Asn Ala Asn
Leu Lys Ser Gly Glu Lys Gly Ser Lys Ile His Val Gln 675 680 685 Thr
Ile Ser Gly Met Leu Thr Ser Val Leu Arg His Thr Trp Gly Phe 690 695
700 Asp Lys Lys Asp Arg Asn Asn His Leu His His Ala Leu Asp Ala Ile
705 710 715 720 Ile Val Ala Tyr Ser Thr Asn Ser Ile Ile Lys Ala Phe
Ser Asp Phe 725 730 735 Arg Lys Asn Gln Glu Leu Leu Lys Ala Arg Phe
Tyr Ala Lys Glu Leu 740 745 750 Thr Ser Asp Asn Tyr Lys His Gln Val
Lys Phe Phe Glu Pro Phe Lys 755 760 765 Ser Phe Arg Glu Lys Ile Leu
Ser Lys Ile Asp Glu Ile Phe Val Ser 770 775 780 Lys Pro Pro Arg Lys
Arg Ala Arg Arg Ala Leu His Lys Asp Thr Phe 785 790 795 800 His Ser
Glu Asn Lys Ile Ile Asp Lys Cys Ser Tyr Asn Ser Lys Glu 805 810 815
Gly Leu Gln Ile Ala Leu Ser Cys Gly Arg Val Arg Lys Ile Gly Thr 820
825 830 Lys Tyr Val Glu Asn Asp Thr Ile Val Arg Val Asp Ile Phe Lys
Lys 835 840 845 Gln Asn Lys Phe Tyr Ala Ile Pro Ile Tyr Ala Met Asp
Phe Ala Leu 850 855 860 Gly Ile Leu Pro Asn Lys Ile Val Ile Thr Gly
Lys Asp Lys Asn Asn 865 870 875 880 Asn Pro Lys Gln Trp Gln Thr Ile
Asp Glu Ser Tyr Glu Phe Cys Phe 885 890 895 Ser Leu Tyr Lys Asn Asp
Leu Ile Leu Leu Gln Lys Lys Asn Met Gln 900 905 910 Glu Pro Glu Phe
Ala Tyr Tyr Asn Asp Phe Ser Ile Ser Thr Ser Ser 915 920 925 Ile Cys
Val Glu Lys His Asp Asn Lys Phe Glu Asn Leu Thr Ser Asn 930 935 940
Gln Lys Leu Leu Phe Ser Asn Ala Lys Glu Gly Ser Val Lys Val Glu 945
950 955 960 Ser Leu Gly Ile Gln Asn Leu Lys Val Phe Glu Lys Tyr Ile
Ile Thr 965 970 975 Pro Leu Gly Asp Lys Ile Lys Ala Asp Phe Gln Pro
Arg Glu Asn Ile 980 985 990 Ser Leu Lys Thr Ser Lys Lys Tyr Gly Leu
Arg 995 1000
uspto.report is an independent third-party trademark research tool that is not affiliated, endorsed, or sponsored by the United States Patent and Trademark Office (USPTO) or any other governmental organization. The information provided by uspto.report is based on publicly available data at the time of writing and is intended for informational purposes only.
While we strive to provide accurate and up-to-date information, we do not guarantee the accuracy, completeness, reliability, or suitability of the information displayed on this site. The use of this site is at your own risk. Any reliance you place on such information is therefore strictly at your own risk.
All official trademark data, including owner information, should be verified by visiting the official USPTO website at www.uspto.gov. This site is not intended to replace professional legal advice and should not be used as a substitute for consulting with a legal professional who is knowledgeable about trademark law.