U.S. patent application number 16/752887, for peptide-mediated delivery of RNA-guided endonuclease into cells, was filed with the patent office on 2020-01-27 and published on 2020-05-21 as publication number 20200157516. The applicant listed for this patent is DUPONT US HOLDING, LLC. Invention is credited to XIAOCHUN FAN, RYAN L. FRISCH, SEUNG-PYO HONG, ETHEL NOLAND JACKSON.
Publication Number | 20200157516 |
Application Number | 16/752887 |
Family ID | 54884364 |
Filed Date | 2020-01-27 |
United States Patent Application | 20200157516 |
Kind Code | A1 |
FRISCH; RYAN L. ; et al. | May 21, 2020 |
A composition is disclosed that comprises at least one protein component of an RNA-guided endonuclease (RGEN) and at least one cell-penetrating peptide (CPP), wherein the RGEN protein component and CPP are covalently or non-covalently linked to each other in an RGEN protein-CPP complex. The RGEN protein-CPP complex can traverse (i) a cell membrane, or (ii) a cell wall and cell membrane, of a cell. The RGEN protein component of an RGEN protein-CPP complex in certain embodiments can be associated with a suitable RNA component to provide an RGEN capable of specific DNA targeting. Further disclosed are compositions comprising at least one protein component of a guide polynucleotide/Cas endonuclease complex and at least one CPP, methods of delivering RGEN proteins into microbial cells, and methods of targeting DNA with RGENs.
Inventors: | FRISCH; RYAN L. (PALO ALTO, CA); FAN; XIAOCHUN (WEST CHESTER, PA); HONG; SEUNG-PYO (HOCKESSIN, DE); JACKSON; ETHEL NOLAND (GREENVILLE, DE) |
Applicant: | DUPONT US HOLDING, LLC |
Family ID: | 54884364 |
Appl. No.: | 16/752887 |
Filed: | January 27, 2020 |
Application Number | Filing Date | Patent Number |
---|---|---|
16218808 | Dec 13, 2018 | 10584322 |
16752887 | | |
15523741 | May 2, 2017 | 10208298 |
PCT/US15/58760 | Nov 3, 2015 | |
16218808 | | |
62075999 | Nov 6, 2014 | |
Current U.S. Class: | 1/1 |
Current CPC Class: | C12N 15/102 20130101; C12N 2750/14141 20130101; C12N 15/907 20130101; C12N 9/96 20130101; C12N 15/62 20130101; C12N 15/86 20130101; C07K 2319/10 20130101; C12N 9/22 20130101; C07K 2319/09 20130101; C12N 2310/3513 20130101 |
International Class: | C12N 9/22 20060101 C12N009/22; C12N 15/90 20060101 C12N015/90; C12N 15/86 20060101 C12N015/86; C12N 15/62 20060101 C12N015/62; C12N 15/10 20060101 C12N015/10; C12N 9/96 20060101 C12N009/96 |
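The sequence listing that follows states a length for each entry, and those lengths can be cross-checked against one another. The sketch below is an illustration only and is not part of the application. It translates the last 39 nucleotides of SEQ ID NO: 2 (copied from the listing) with a partial codon table and compares the result with the final residues of SEQ ID NO: 3. It then checks that the stated lengths agree with each other. The names `translate`, `SEQ2_TAIL`, `SEQ3_TAIL`, and `SEQ_LENGTHS` are hypothetical.

```python
# Partial standard genetic code: only the codons that occur in the tail below.
CODON_TABLE = {
    "gac": "Asp", "tcc": "Ser", "aga": "Arg", "gcc": "Ala", "gat": "Asp",
    "ccc": "Pro", "aag": "Lys", "aaa": "Lys", "cga": "Arg", "gtc": "Val",
    "taa": "Stop",
}

# Last 39 nt of SEQ ID NO: 2 (positions 4102-4140), copied from the listing.
SEQ2_TAIL = "gactccagagccgatcccaagaaaaagcgaaaggtctaa"

# Last 12 residues of SEQ ID NO: 3 (positions 1368-1379), copied from the listing.
SEQ3_TAIL = ["Asp", "Ser", "Arg", "Ala", "Asp", "Pro",
             "Lys", "Lys", "Lys", "Arg", "Lys", "Val"]

# Lengths as stated in the listing, keyed by SEQ ID NO.
SEQ_LENGTHS = {1: 4107, 2: 4140, 3: 1379, 4: 543, 5: 4683}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "Stop":
            break
        residues.append(amino_acid)
    return residues


if __name__ == "__main__":
    # The 3' end of the Cas9-NLS open reading frame should encode the
    # C-terminal residues listed for the Cas9-NLS protein.
    print("tail matches:", translate(SEQ2_TAIL) == SEQ3_TAIL)

    # 4140 nt = 1380 codons = 1379 residues plus one stop codon.
    print("ORF vs protein:", SEQ_LENGTHS[2] // 3 - 1 == SEQ_LENGTHS[3])

    # The expression cassette is the promoter followed by the ORF.
    print("cassette:", SEQ_LENGTHS[4] + SEQ_LENGTHS[2] == SEQ_LENGTHS[5])

    # Relative to SEQ ID NO: 1, SEQ ID NO: 2 is 33 nt (11 codons) longer.
    print("extra codons:", (SEQ_LENGTHS[2] - SEQ_LENGTHS[1]) // 3)
```

The codon table is deliberately partial so that the check stays short. Translating a full entry would need the complete table and a step that strips the position counters embedded in the flattened listing.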
Sequence Listing (144 sequences)

SEQ ID NO: 1 (4107 nt, DNA, artificial sequence; S. pyogenes Cas9)
atggacaaga
aatactccat cggcctggac attggaacca actctgtcgg ctgggctgtc 60atcaccgacg
agtacaaggt gccctccaag aaattcaagg tcctcggaaa caccgatcga
120cactccatca agaaaaacct cattggtgcc ctgttgttcg attctggcga
gactgccgaa 180gctaccagac tcaagcgaac tgctcggcga cgttacaccc
gacggaagaa ccgaatctgc 240tacctgcagg agatcttttc caacgagatg
gccaaggtgg acgattcgtt ctttcatcga 300ctggaggaat ccttcctcgt
cgaggaagac aagaaacacg agcgtcatcc catctttggc 360aacattgtgg
acgaggttgc ttaccacgag aagtatccta ccatctacca cctgcgaaag
420aaactcgtcg attccaccga caaggcggat ctcagactta tctacctcgc
tctggcacac 480atgatcaagt ttcgaggtca tttcctcatc gagggcgatc
tcaatcccga caacagcgat 540gtggacaagc tgttcattca gctcgttcag
acctacaacc agctgttcga ggaaaacccc 600atcaatgcct ccggagtcga
tgcaaaggcc atcttgtctg ctcgactctc gaagagcaga 660cgactggaga
acctcattgc ccaacttcct ggcgagaaaa agaacggact gtttggcaac
720ctcattgccc tttctcttgg tctcacaccc aacttcaagt ccaacttcga
tctggcggag 780gacgccaagc tccagctgtc caaggacacc tacgacgatg
acctcgacaa cctgcttgca 840cagattggcg atcagtacgc cgacctgttt
ctcgctgcca agaacctttc ggatgctatt 900ctcttgtctg acattctgcg
agtcaacacc gagatcacaa aggctcccct ttctgcctcc 960atgatcaagc
gatacgacga gcaccatcag gatctcacac tgctcaaggc tcttgtccga
1020cagcaactgc ccgagaagta caaggagatc tttttcgatc agtcgaagaa
cggctacgct 1080ggatacatcg acggcggagc ctctcaggaa gagttctaca
agttcatcaa gccaattctc 1140gagaagatgg acggaaccga ggaactgctt
gtcaagctca atcgagagga tctgcttcgg 1200aagcaacgaa ccttcgacaa
cggcagcatt cctcatcaga tccacctcgg tgagctgcac 1260gccattcttc
gacgtcagga agacttctac ccctttctca aggacaaccg agagaagatc
1320gagaagattc ttacctttcg aatcccctac tatgttggtc ctcttgccag
aggaaactct 1380cgatttgctt ggatgactcg aaagtccgag gaaaccatca
ctccctggaa cttcgaggaa 1440gtcgtggaca agggtgcctc tgcacagtcc
ttcatcgagc gaatgaccaa cttcgacaag 1500aatctgccca acgagaaggt
tcttcccaag cattcgctgc tctacgagta ctttacagtc 1560tacaacgaac
tcaccaaagt caagtacgtt accgagggaa tgcgaaagcc tgccttcttg
1620tctggcgaac agaagaaagc cattgtcgat ctcctgttca agaccaaccg
aaaggtcact 1680gttaagcagc tcaaggagga ctacttcaag aaaatcgagt
gtttcgacag cgtcgagatt 1740tccggagttg aggaccgatt caacgcctct
ttgggcacct atcacgatct gctcaagatt 1800atcaaggaca aggattttct
cgacaacgag gaaaacgagg acattctgga ggacatcgtg 1860ctcactctta
ccctgttcga agatcgggag atgatcgagg aacgactcaa gacatacgct
1920cacctgttcg acgacaaggt catgaaacaa ctcaagcgac gtagatacac
cggctgggga 1980agactttcgc gaaagctcat caacggcatc agagacaagc
agtccggaaa gaccattctg 2040gactttctca agtccgatgg ctttgccaac
cgaaacttca tgcagctcat tcacgacgat 2100tctcttacct tcaaggagga
catccagaag gcacaagtgt ccggtcaggg cgacagcttg 2160cacgaacata
ttgccaacct ggctggttcg ccagccatca agaaaggcat tctccagact
2220gtcaaggttg tcgacgagct ggtgaaggtc atgggacgtc acaagcccga
gaacattgtg 2280atcgagatgg ccagagagaa ccagacaact caaaagggtc
agaaaaactc gcgagagcgg 2340atgaagcgaa tcgaggaagg catcaaggag
ctgggatccc agattctcaa ggagcatccc 2400gtcgagaaca ctcaactgca
gaacgagaag ctgtatctct actatctgca gaatggtcga 2460gacatgtacg
tggatcagga actggacatc aatcgtctca gcgactacga tgtggaccac
2520attgtccctc aatcctttct caaggacgat tctatcgaca acaaggtcct
tacacgatcc 2580gacaagaaca gaggcaagtc ggacaacgtt cccagcgaag
aggtggtcaa aaagatgaag 2640aactactggc gacagctgct caacgccaag
ctcattaccc agcgaaagtt cgacaatctt 2700accaaggccg agcgaggcgg
tctgtccgag ctcgacaagg ctggcttcat caagcgtcaa 2760ctcgtcgaga
ccagacagat cacaaagcac gtcgcacaga ttctcgattc tcggatgaac
2820accaagtacg acgagaacga caagctcatc cgagaggtca aggtgattac
tctcaagtcc 2880aaactggtct ccgatttccg aaaggacttt cagttctaca
aggtgcgaga gatcaacaat 2940taccaccatg cccacgatgc ttacctcaac
gccgtcgttg gcactgcgct catcaagaaa 3000taccccaagc tcgaaagcga
gttcgtttac ggcgattaca aggtctacga cgttcgaaag 3060atgattgcca
agtccgaaca ggagattggc aaggctactg ccaagtactt cttttactcc
3120aacatcatga actttttcaa gaccgagatc accttggcca acggagagat
tcgaaagaga 3180ccacttatcg agaccaacgg cgaaactgga gagatcgtgt
gggacaaggg tcgagacttt 3240gcaaccgtgc gaaaggttct gtcgatgcct
caggtcaaca tcgtcaagaa aaccgaggtt 3300cagactggcg gattctccaa
ggagtcgatt ctgcccaagc gaaactccga caagctcatc 3360gctcgaaaga
aagactggga tcccaagaaa tacggtggct tcgattctcc taccgtcgcc
3420tattccgtgc ttgtcgttgc gaaggtcgag aagggcaagt ccaaaaagct
caagtccgtc 3480aaggagctgc tcggaattac catcatggag cgatcgagct
tcgagaagaa tcccatcgac 3540ttcttggaag ccaagggtta caaggaggtc
aagaaagacc tcattatcaa gctgcccaag 3600tactctctgt tcgaactgga
gaacggtcga aagcgtatgc tcgcctccgc tggcgagctg 3660cagaagggaa
acgagcttgc cttgccttcg aagtacgtca actttctcta tctggcttct
3720cactacgaga agctcaaggg ttctcccgag gacaacgaac agaagcaact
cttcgttgag 3780cagcacaaac attacctcga cgagattatc gagcagattt
ccgagttttc gaagcgagtc 3840atcctggctg atgccaactt ggacaaggtg
ctctctgcct acaacaagca tcgggacaaa 3900cccattcgag aacaggcgga
gaacatcatt cacctgttta ctcttaccaa cctgggtgct 3960cctgcagctt
tcaagtactt cgataccact atcgaccgaa agcggtacac atccaccaag
4020gaggttctcg atgccaccct gattcaccag tccatcactg gcctgtacga
gacccgaatc 4080gacctgtctc agcttggtgg cgactaa 4107

SEQ ID NO: 2 (4140 nt, DNA, artificial sequence; S. pyogenes Cas9 with NLS)
atggacaaga aatactccat cggcctggac
attggaacca actctgtcgg ctgggctgtc 60atcaccgacg agtacaaggt gccctccaag
aaattcaagg tcctcggaaa caccgatcga 120cactccatca agaaaaacct
cattggtgcc ctgttgttcg attctggcga gactgccgaa 180gctaccagac
tcaagcgaac tgctcggcga cgttacaccc gacggaagaa ccgaatctgc
240tacctgcagg agatcttttc caacgagatg gccaaggtgg acgattcgtt
ctttcatcga 300ctggaggaat ccttcctcgt cgaggaagac aagaaacacg
agcgtcatcc catctttggc 360aacattgtgg acgaggttgc ttaccacgag
aagtatccta ccatctacca cctgcgaaag 420aaactcgtcg attccaccga
caaggcggat ctcagactta tctacctcgc tctggcacac 480atgatcaagt
ttcgaggtca tttcctcatc gagggcgatc tcaatcccga caacagcgat
540gtggacaagc tgttcattca gctcgttcag acctacaacc agctgttcga
ggaaaacccc 600atcaatgcct ccggagtcga tgcaaaggcc atcttgtctg
ctcgactctc gaagagcaga 660cgactggaga acctcattgc ccaacttcct
ggcgagaaaa agaacggact gtttggcaac 720ctcattgccc tttctcttgg
tctcacaccc aacttcaagt ccaacttcga tctggcggag 780gacgccaagc
tccagctgtc caaggacacc tacgacgatg acctcgacaa cctgcttgca
840cagattggcg atcagtacgc cgacctgttt ctcgctgcca agaacctttc
ggatgctatt 900ctcttgtctg acattctgcg agtcaacacc gagatcacaa
aggctcccct ttctgcctcc 960atgatcaagc gatacgacga gcaccatcag
gatctcacac tgctcaaggc tcttgtccga 1020cagcaactgc ccgagaagta
caaggagatc tttttcgatc agtcgaagaa cggctacgct 1080ggatacatcg
acggcggagc ctctcaggaa gagttctaca agttcatcaa gccaattctc
1140gagaagatgg acggaaccga ggaactgctt gtcaagctca atcgagagga
tctgcttcgg 1200aagcaacgaa ccttcgacaa cggcagcatt cctcatcaga
tccacctcgg tgagctgcac 1260gccattcttc gacgtcagga agacttctac
ccctttctca aggacaaccg agagaagatc 1320gagaagattc ttacctttcg
aatcccctac tatgttggtc ctcttgccag aggaaactct 1380cgatttgctt
ggatgactcg aaagtccgag gaaaccatca ctccctggaa cttcgaggaa
1440gtcgtggaca agggtgcctc tgcacagtcc ttcatcgagc gaatgaccaa
cttcgacaag 1500aatctgccca acgagaaggt tcttcccaag cattcgctgc
tctacgagta ctttacagtc 1560tacaacgaac tcaccaaagt caagtacgtt
accgagggaa tgcgaaagcc tgccttcttg 1620tctggcgaac agaagaaagc
cattgtcgat ctcctgttca agaccaaccg aaaggtcact 1680gttaagcagc
tcaaggagga ctacttcaag aaaatcgagt gtttcgacag cgtcgagatt
1740tccggagttg aggaccgatt caacgcctct ttgggcacct atcacgatct
gctcaagatt 1800atcaaggaca aggattttct cgacaacgag gaaaacgagg
acattctgga ggacatcgtg 1860ctcactctta ccctgttcga agatcgggag
atgatcgagg aacgactcaa gacatacgct 1920cacctgttcg acgacaaggt
catgaaacaa ctcaagcgac gtagatacac cggctgggga 1980agactttcgc
gaaagctcat caacggcatc agagacaagc agtccggaaa gaccattctg
2040gactttctca agtccgatgg ctttgccaac cgaaacttca tgcagctcat
tcacgacgat 2100tctcttacct tcaaggagga catccagaag gcacaagtgt
ccggtcaggg cgacagcttg 2160cacgaacata ttgccaacct ggctggttcg
ccagccatca agaaaggcat tctccagact 2220gtcaaggttg tcgacgagct
ggtgaaggtc atgggacgtc acaagcccga gaacattgtg 2280atcgagatgg
ccagagagaa ccagacaact caaaagggtc agaaaaactc gcgagagcgg
2340atgaagcgaa tcgaggaagg catcaaggag ctgggatccc agattctcaa
ggagcatccc 2400gtcgagaaca ctcaactgca gaacgagaag ctgtatctct
actatctgca gaatggtcga 2460gacatgtacg tggatcagga actggacatc
aatcgtctca gcgactacga tgtggaccac 2520attgtccctc aatcctttct
caaggacgat tctatcgaca acaaggtcct tacacgatcc 2580gacaagaaca
gaggcaagtc ggacaacgtt cccagcgaag aggtggtcaa aaagatgaag
2640aactactggc gacagctgct caacgccaag ctcattaccc agcgaaagtt
cgacaatctt 2700accaaggccg agcgaggcgg tctgtccgag ctcgacaagg
ctggcttcat caagcgtcaa 2760ctcgtcgaga ccagacagat cacaaagcac
gtcgcacaga ttctcgattc tcggatgaac 2820accaagtacg acgagaacga
caagctcatc cgagaggtca aggtgattac tctcaagtcc 2880aaactggtct
ccgatttccg aaaggacttt cagttctaca aggtgcgaga gatcaacaat
2940taccaccatg cccacgatgc ttacctcaac gccgtcgttg gcactgcgct
catcaagaaa 3000taccccaagc tcgaaagcga gttcgtttac ggcgattaca
aggtctacga cgttcgaaag 3060atgattgcca agtccgaaca ggagattggc
aaggctactg ccaagtactt cttttactcc 3120aacatcatga actttttcaa
gaccgagatc accttggcca acggagagat tcgaaagaga 3180ccacttatcg
agaccaacgg cgaaactgga gagatcgtgt gggacaaggg tcgagacttt
3240gcaaccgtgc gaaaggttct gtcgatgcct caggtcaaca tcgtcaagaa
aaccgaggtt 3300cagactggcg gattctccaa ggagtcgatt ctgcccaagc
gaaactccga caagctcatc 3360gctcgaaaga aagactggga tcccaagaaa
tacggtggct tcgattctcc taccgtcgcc 3420tattccgtgc ttgtcgttgc
gaaggtcgag aagggcaagt ccaaaaagct caagtccgtc 3480aaggagctgc
tcggaattac catcatggag cgatcgagct tcgagaagaa tcccatcgac
3540ttcttggaag ccaagggtta caaggaggtc aagaaagacc tcattatcaa
gctgcccaag 3600tactctctgt tcgaactgga gaacggtcga aagcgtatgc
tcgcctccgc tggcgagctg 3660cagaagggaa acgagcttgc cttgccttcg
aagtacgtca actttctcta tctggcttct 3720cactacgaga agctcaaggg
ttctcccgag gacaacgaac agaagcaact cttcgttgag 3780cagcacaaac
attacctcga cgagattatc gagcagattt ccgagttttc gaagcgagtc
3840atcctggctg atgccaactt ggacaaggtg ctctctgcct acaacaagca
tcgggacaaa 3900cccattcgag aacaggcgga gaacatcatt cacctgttta
ctcttaccaa cctgggtgct 3960cctgcagctt tcaagtactt cgataccact
atcgaccgaa agcggtacac atccaccaag 4020gaggttctcg atgccaccct
gattcaccag tccatcactg gcctgtacga gacccgaatc 4080gacctgtctc
agcttggtgg cgactccaga gccgatccca agaaaaagcg aaaggtctaa
4140

SEQ ID NO: 3 (1379 aa, PRT, artificial sequence; S. pyogenes Cas9 with NLS)
Met Asp
Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr Asn Ser Val1 5 10 15Gly
Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe 20 25
30Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
35 40 45Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg
Leu 50 55 60Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg
Ile Cys65 70 75 80Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys
Val Asp Asp Ser 85 90 95Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val
Glu Glu Asp Lys Lys 100 105 110His Glu Arg His Pro Ile Phe Gly Asn
Ile Val Asp Glu Val Ala Tyr 115 120 125His Glu Lys Tyr Pro Thr Ile
Tyr His Leu Arg Lys Lys Leu Val Asp 130 135 140Ser Thr Asp Lys Ala
Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His145 150 155 160Met Ile
Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro 165 170
175Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
180 185 190Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val
Asp Ala 195 200 205Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg
Arg Leu Glu Asn 210 215 220Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys
Asn Gly Leu Phe Gly Asn225 230 235 240Leu Ile Ala Leu Ser Leu Gly
Leu Thr Pro Asn Phe Lys Ser Asn Phe 245 250 255Asp Leu Ala Glu Asp
Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp 260 265 270Asp Asp Leu
Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp 275 280 285Leu
Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp 290 295
300Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala
Ser305 310 315 320Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu
Thr Leu Leu Lys 325 330 335Ala Leu Val Arg Gln Gln Leu Pro Glu Lys
Tyr Lys Glu Ile Phe Phe 340 345 350Asp Gln Ser Lys Asn Gly Tyr Ala
Gly Tyr Ile Asp Gly Gly Ala Ser 355 360 365Gln Glu Glu Phe Tyr Lys
Phe Ile Lys Pro Ile Leu Glu Lys Met Asp 370 375 380Gly Thr Glu Glu
Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg385 390 395 400Lys
Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu 405 410
415Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
420 425 430Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe
Arg Ile 435 440 445Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser
Arg Phe Ala Trp 450 455 460Met Thr Arg Lys Ser Glu Glu Thr Ile Thr
Pro Trp Asn Phe Glu Glu465 470 475 480Val Val Asp Lys Gly Ala Ser
Ala Gln Ser Phe Ile Glu Arg Met Thr 485 490 495Asn Phe Asp Lys Asn
Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser 500 505 510Leu Leu Tyr
Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys 515 520 525Tyr
Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln 530 535
540Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val
Thr545 550 555 560Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile
Glu Cys Phe Asp 565 570 575Ser Val Glu Ile Ser Gly Val Glu Asp Arg
Phe Asn Ala Ser Leu Gly 580 585 590Thr Tyr His Asp Leu Leu Lys Ile
Ile Lys Asp Lys Asp Phe Leu Asp 595 600 605Asn Glu Glu Asn Glu Asp
Ile Leu Glu Asp Ile Val Leu Thr Leu Thr 610 615 620Leu Phe Glu Asp
Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala625 630 635 640His
Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr 645 650
655Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
660 665 670Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp
Gly Phe 675 680 685Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp
Ser Leu Thr Phe 690 695 700Lys Glu Asp Ile Gln Lys Ala Gln Val Ser
Gly Gln Gly Asp Ser Leu705 710 715 720His Glu His Ile Ala Asn Leu
Ala Gly Ser Pro Ala Ile Lys Lys Gly 725 730 735Ile Leu Gln Thr Val
Lys Val Val Asp Glu Leu Val Lys Val Met Gly 740 745 750Arg His Lys
Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln 755 760 765Thr
Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile 770 775
780Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His
Pro785 790 795 800Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr
Leu Tyr Tyr Leu 805 810 815Gln Asn Gly Arg Asp Met Tyr Val Asp Gln
Glu Leu Asp Ile Asn Arg 820 825 830Leu Ser Asp Tyr Asp Val Asp His
Ile Val Pro Gln Ser Phe Leu Lys 835 840 845Asp Asp Ser Ile Asp Asn
Lys Val Leu Thr Arg Ser Asp Lys Asn Arg 850 855 860Gly Lys Ser Asp
Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys865 870 875 880Asn
Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys 885 890
895Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
900 905 910Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln
Ile Thr 915 920 925Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn
Thr Lys Tyr Asp 930 935 940Glu Asn Asp Lys Leu Ile Arg Glu Val Lys
Val Ile Thr Leu Lys Ser945 950 955 960Lys Leu Val Ser Asp Phe Arg
Lys Asp Phe Gln Phe Tyr Lys Val Arg 965 970 975Glu Ile Asn Asn Tyr
His His Ala His Asp Ala Tyr Leu Asn Ala Val 980 985 990Val Gly Thr
Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu Phe 995 1000
1005Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile Ala
1010 1015 1020Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr
Phe Phe 1025 1030 1035Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu
Ile Thr Leu Ala 1040 1045 1050Asn Gly Glu Ile Arg Lys Arg Pro Leu
Ile Glu Thr Asn Gly Glu 1055 1060 1065Thr Gly Glu Ile Val Trp Asp
Lys Gly Arg Asp Phe Ala Thr Val 1070 1075 1080Arg Lys Val Leu Ser
Met Pro Gln Val Asn Ile
Val Lys Lys Thr 1085 1090 1095Glu Val Gln Thr Gly Gly Phe Ser Lys
Glu Ser Ile Leu Pro Lys 1100 1105 1110Arg Asn Ser Asp Lys Leu Ile
Ala Arg Lys Lys Asp Trp Asp Pro 1115 1120 1125Lys Lys Tyr Gly Gly
Phe Asp Ser Pro Thr Val Ala Tyr Ser Val 1130 1135 1140Leu Val Val
Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu Lys 1145 1150 1155Ser
Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser Ser 1160 1165
1170Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr Lys
1175 1180 1185Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr
Ser Leu 1190 1195 1200Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu
Ala Ser Ala Gly 1205 1210 1215Glu Leu Gln Lys Gly Asn Glu Leu Ala
Leu Pro Ser Lys Tyr Val 1220 1225 1230Asn Phe Leu Tyr Leu Ala Ser
His Tyr Glu Lys Leu Lys Gly Ser 1235 1240 1245Pro Glu Asp Asn Glu
Gln Lys Gln Leu Phe Val Glu Gln His Lys 1250 1255 1260His Tyr Leu
Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser Lys 1265 1270 1275Arg
Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser Ala 1280 1285
1290Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu Asn
1295 1300 1305Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro
Ala Ala 1310 1315 1320Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys
Arg Tyr Thr Ser 1325 1330 1335Thr Lys Glu Val Leu Asp Ala Thr Leu
Ile His Gln Ser Ile Thr 1340 1345 1350Gly Leu Tyr Glu Thr Arg Ile
Asp Leu Ser Gln Leu Gly Gly Asp 1355 1360 1365Ser Arg Ala Asp Pro
Lys Lys Lys Arg Lys Val 1370 1375

SEQ ID NO: 4 (543 nt, DNA, Yarrowia lipolytica)
tcgacgttta aaccatcatc taagggcctc aaaactacct cggaactgct gcgctgatct
60ggacaccaca gaggttccga gcactttagg ttgcaccaaa tgtcccacca ggtgcaggca
120gaaaacgctg gaacagcgtg tacagtttgt cttaacaaaa agtgagggcg
ctgaggtcga 180gcagggtggt gtgacttgtt atagccttta gagctgcgaa
agcgcgtatg gatttggctc 240atcaggccag attgagggtc tgtggacaca
tgtcatgtta gtgtacttca atcgccccct 300ggatatagcc ccgacaatag
gccgtggcct catttttttg ccttccgcac atttccattg 360ctcggtaccc
acaccttgct tctcctgcac ttgccaacct taatactggt ttacattgac
420caacatctta caagcggggg gcttgtctag ggtatatata aacagtggct
ctcccaatcg 480gttgccagtc tcttttttcc tttctttccc cacagattcg
aaatctaaac tacacatcac 540acc 543

SEQ ID NO: 5 (4683 nt, DNA, artificial sequence; Cas9-NLS expression cassette (FBA1 promoter and Cas9-NLS open reading frame))
tcgacgttta aaccatcatc taagggcctc aaaactacct cggaactgct gcgctgatct
60ggacaccaca gaggttccga gcactttagg ttgcaccaaa tgtcccacca ggtgcaggca
120gaaaacgctg gaacagcgtg tacagtttgt cttaacaaaa agtgagggcg
ctgaggtcga 180gcagggtggt gtgacttgtt atagccttta gagctgcgaa
agcgcgtatg gatttggctc 240atcaggccag attgagggtc tgtggacaca
tgtcatgtta gtgtacttca atcgccccct 300ggatatagcc ccgacaatag
gccgtggcct catttttttg ccttccgcac atttccattg 360ctcggtaccc
acaccttgct tctcctgcac ttgccaacct taatactggt ttacattgac
420caacatctta caagcggggg gcttgtctag ggtatatata aacagtggct
ctcccaatcg 480gttgccagtc tcttttttcc tttctttccc cacagattcg
aaatctaaac tacacatcac 540accatggaca agaaatactc catcggcctg
gacattggaa ccaactctgt cggctgggct 600gtcatcaccg acgagtacaa
ggtgccctcc aagaaattca aggtcctcgg aaacaccgat 660cgacactcca
tcaagaaaaa cctcattggt gccctgttgt tcgattctgg cgagactgcc
720gaagctacca gactcaagcg aactgctcgg cgacgttaca cccgacggaa
gaaccgaatc 780tgctacctgc aggagatctt ttccaacgag atggccaagg
tggacgattc gttctttcat 840cgactggagg aatccttcct cgtcgaggaa
gacaagaaac acgagcgtca tcccatcttt 900ggcaacattg tggacgaggt
tgcttaccac gagaagtatc ctaccatcta ccacctgcga 960aagaaactcg
tcgattccac cgacaaggcg gatctcagac ttatctacct cgctctggca
1020cacatgatca agtttcgagg tcatttcctc atcgagggcg atctcaatcc
cgacaacagc 1080gatgtggaca agctgttcat tcagctcgtt cagacctaca
accagctgtt cgaggaaaac 1140cccatcaatg cctccggagt cgatgcaaag
gccatcttgt ctgctcgact ctcgaagagc 1200agacgactgg agaacctcat
tgcccaactt cctggcgaga aaaagaacgg actgtttggc 1260aacctcattg
ccctttctct tggtctcaca cccaacttca agtccaactt cgatctggcg
1320gaggacgcca agctccagct gtccaaggac acctacgacg atgacctcga
caacctgctt 1380gcacagattg gcgatcagta cgccgacctg tttctcgctg
ccaagaacct ttcggatgct 1440attctcttgt ctgacattct gcgagtcaac
accgagatca caaaggctcc cctttctgcc 1500tccatgatca agcgatacga
cgagcaccat caggatctca cactgctcaa ggctcttgtc 1560cgacagcaac
tgcccgagaa gtacaaggag atctttttcg atcagtcgaa gaacggctac
1620gctggataca tcgacggcgg agcctctcag gaagagttct acaagttcat
caagccaatt 1680ctcgagaaga tggacggaac cgaggaactg cttgtcaagc
tcaatcgaga ggatctgctt 1740cggaagcaac gaaccttcga caacggcagc
attcctcatc agatccacct cggtgagctg 1800cacgccattc ttcgacgtca
ggaagacttc tacccctttc tcaaggacaa ccgagagaag 1860atcgagaaga
ttcttacctt tcgaatcccc tactatgttg gtcctcttgc cagaggaaac
1920tctcgatttg cttggatgac tcgaaagtcc gaggaaacca tcactccctg
gaacttcgag 1980gaagtcgtgg acaagggtgc ctctgcacag tccttcatcg
agcgaatgac caacttcgac 2040aagaatctgc ccaacgagaa ggttcttccc
aagcattcgc tgctctacga gtactttaca 2100gtctacaacg aactcaccaa
agtcaagtac gttaccgagg gaatgcgaaa gcctgccttc 2160ttgtctggcg
aacagaagaa agccattgtc gatctcctgt tcaagaccaa ccgaaaggtc
2220actgttaagc agctcaagga ggactacttc aagaaaatcg agtgtttcga
cagcgtcgag 2280atttccggag ttgaggaccg attcaacgcc tctttgggca
cctatcacga tctgctcaag 2340attatcaagg acaaggattt tctcgacaac
gaggaaaacg aggacattct ggaggacatc 2400gtgctcactc ttaccctgtt
cgaagatcgg gagatgatcg aggaacgact caagacatac 2460gctcacctgt
tcgacgacaa ggtcatgaaa caactcaagc gacgtagata caccggctgg
2520ggaagacttt cgcgaaagct catcaacggc atcagagaca agcagtccgg
aaagaccatt 2580ctggactttc tcaagtccga tggctttgcc aaccgaaact
tcatgcagct cattcacgac 2640gattctctta ccttcaagga ggacatccag
aaggcacaag tgtccggtca gggcgacagc 2700ttgcacgaac atattgccaa
cctggctggt tcgccagcca tcaagaaagg cattctccag 2760actgtcaagg
ttgtcgacga gctggtgaag gtcatgggac gtcacaagcc cgagaacatt
2820gtgatcgaga tggccagaga gaaccagaca actcaaaagg gtcagaaaaa
ctcgcgagag 2880cggatgaagc gaatcgagga aggcatcaag gagctgggat
cccagattct caaggagcat 2940cccgtcgaga acactcaact gcagaacgag
aagctgtatc tctactatct gcagaatggt 3000cgagacatgt acgtggatca
ggaactggac atcaatcgtc tcagcgacta cgatgtggac 3060cacattgtcc
ctcaatcctt tctcaaggac gattctatcg acaacaaggt ccttacacga
3120tccgacaaga acagaggcaa gtcggacaac gttcccagcg aagaggtggt
caaaaagatg 3180aagaactact ggcgacagct gctcaacgcc aagctcatta
cccagcgaaa gttcgacaat 3240cttaccaagg ccgagcgagg cggtctgtcc
gagctcgaca aggctggctt catcaagcgt 3300caactcgtcg agaccagaca
gatcacaaag cacgtcgcac agattctcga ttctcggatg 3360aacaccaagt
acgacgagaa cgacaagctc atccgagagg tcaaggtgat tactctcaag
3420tccaaactgg tctccgattt ccgaaaggac tttcagttct acaaggtgcg
agagatcaac 3480aattaccacc atgcccacga tgcttacctc aacgccgtcg
ttggcactgc gctcatcaag 3540aaatacccca agctcgaaag cgagttcgtt
tacggcgatt acaaggtcta cgacgttcga 3600aagatgattg ccaagtccga
acaggagatt ggcaaggcta ctgccaagta cttcttttac 3660tccaacatca
tgaacttttt caagaccgag atcaccttgg ccaacggaga gattcgaaag
3720agaccactta tcgagaccaa cggcgaaact ggagagatcg tgtgggacaa
gggtcgagac 3780tttgcaaccg tgcgaaaggt tctgtcgatg cctcaggtca
acatcgtcaa gaaaaccgag 3840gttcagactg gcggattctc caaggagtcg
attctgccca agcgaaactc cgacaagctc 3900atcgctcgaa agaaagactg
ggatcccaag aaatacggtg gcttcgattc tcctaccgtc 3960gcctattccg
tgcttgtcgt tgcgaaggtc gagaagggca agtccaaaaa gctcaagtcc
4020gtcaaggagc tgctcggaat taccatcatg gagcgatcga gcttcgagaa
gaatcccatc 4080gacttcttgg aagccaaggg ttacaaggag gtcaagaaag
acctcattat caagctgccc 4140aagtactctc tgttcgaact ggagaacggt
cgaaagcgta tgctcgcctc cgctggcgag 4200ctgcagaagg gaaacgagct
tgccttgcct tcgaagtacg tcaactttct ctatctggct 4260tctcactacg
agaagctcaa gggttctccc gaggacaacg aacagaagca actcttcgtt
4320gagcagcaca aacattacct cgacgagatt atcgagcaga tttccgagtt
ttcgaagcga 4380gtcatcctgg ctgatgccaa cttggacaag gtgctctctg
cctacaacaa gcatcgggac 4440aaacccattc gagaacaggc ggagaacatc
attcacctgt ttactcttac caacctgggt 4500gctcctgcag ctttcaagta
cttcgatacc actatcgacc gaaagcggta cacatccacc 4560aaggaggttc
tcgatgccac cctgattcac cagtccatca ctggcctgta cgagacccga
4620atcgacctgt ctcagcttgg tggcgactcc agagccgatc ccaagaaaaa
gcgaaaggtc 4680taa 4683

SEQ ID NO: 6 (10706 nt, DNA, artificial sequence; pZUFCas9 plasmid)
catggacaag aaatactcca tcggcctgga cattggaacc aactctgtcg gctgggctgt
60catcaccgac gagtacaagg tgccctccaa gaaattcaag gtcctcggaa acaccgatcg
120acactccatc aagaaaaacc tcattggtgc cctgttgttc gattctggcg
agactgccga 180agctaccaga ctcaagcgaa ctgctcggcg acgttacacc
cgacggaaga accgaatctg 240ctacctgcag gagatctttt ccaacgagat
ggccaaggtg gacgattcgt tctttcatcg 300actggaggaa tccttcctcg
tcgaggaaga caagaaacac gagcgtcatc ccatctttgg 360caacattgtg
gacgaggttg cttaccacga gaagtatcct accatctacc acctgcgaaa
420gaaactcgtc gattccaccg acaaggcgga tctcagactt atctacctcg
ctctggcaca 480catgatcaag tttcgaggtc atttcctcat cgagggcgat
ctcaatcccg acaacagcga 540tgtggacaag ctgttcattc agctcgttca
gacctacaac cagctgttcg aggaaaaccc 600catcaatgcc tccggagtcg
atgcaaaggc catcttgtct gctcgactct cgaagagcag 660acgactggag
aacctcattg cccaacttcc tggcgagaaa aagaacggac tgtttggcaa
720cctcattgcc ctttctcttg gtctcacacc caacttcaag tccaacttcg
atctggcgga 780ggacgccaag ctccagctgt ccaaggacac ctacgacgat
gacctcgaca acctgcttgc 840acagattggc gatcagtacg ccgacctgtt
tctcgctgcc aagaaccttt cggatgctat 900tctcttgtct gacattctgc
gagtcaacac cgagatcaca aaggctcccc tttctgcctc 960catgatcaag
cgatacgacg agcaccatca ggatctcaca ctgctcaagg ctcttgtccg
1020acagcaactg cccgagaagt acaaggagat ctttttcgat cagtcgaaga
acggctacgc 1080tggatacatc gacggcggag cctctcagga agagttctac
aagttcatca agccaattct 1140cgagaagatg gacggaaccg aggaactgct
tgtcaagctc aatcgagagg atctgcttcg 1200gaagcaacga accttcgaca
acggcagcat tcctcatcag atccacctcg gtgagctgca 1260cgccattctt
cgacgtcagg aagacttcta cccctttctc aaggacaacc gagagaagat
1320cgagaagatt cttacctttc gaatccccta ctatgttggt cctcttgcca
gaggaaactc 1380tcgatttgct tggatgactc gaaagtccga ggaaaccatc
actccctgga acttcgagga 1440agtcgtggac aagggtgcct ctgcacagtc
cttcatcgag cgaatgacca acttcgacaa 1500gaatctgccc aacgagaagg
ttcttcccaa gcattcgctg ctctacgagt actttacagt 1560ctacaacgaa
ctcaccaaag tcaagtacgt taccgaggga atgcgaaagc ctgccttctt
1620gtctggcgaa cagaagaaag ccattgtcga tctcctgttc aagaccaacc
gaaaggtcac 1680tgttaagcag ctcaaggagg actacttcaa gaaaatcgag
tgtttcgaca gcgtcgagat 1740ttccggagtt gaggaccgat tcaacgcctc
tttgggcacc tatcacgatc tgctcaagat 1800tatcaaggac aaggattttc
tcgacaacga ggaaaacgag gacattctgg aggacatcgt 1860gctcactctt
accctgttcg aagatcggga gatgatcgag gaacgactca agacatacgc
1920tcacctgttc gacgacaagg tcatgaaaca actcaagcga cgtagataca
ccggctgggg 1980aagactttcg cgaaagctca tcaacggcat cagagacaag
cagtccggaa agaccattct 2040ggactttctc aagtccgatg gctttgccaa
ccgaaacttc atgcagctca ttcacgacga 2100ttctcttacc ttcaaggagg
acatccagaa ggcacaagtg tccggtcagg gcgacagctt 2160gcacgaacat
attgccaacc tggctggttc gccagccatc aagaaaggca ttctccagac
2220tgtcaaggtt gtcgacgagc tggtgaaggt catgggacgt cacaagcccg
agaacattgt 2280gatcgagatg gccagagaga accagacaac tcaaaagggt
cagaaaaact cgcgagagcg 2340gatgaagcga atcgaggaag gcatcaagga
gctgggatcc cagattctca aggagcatcc 2400cgtcgagaac actcaactgc
agaacgagaa gctgtatctc tactatctgc agaatggtcg 2460agacatgtac
gtggatcagg aactggacat caatcgtctc agcgactacg atgtggacca
2520cattgtccct caatcctttc tcaaggacga ttctatcgac aacaaggtcc
ttacacgatc 2580cgacaagaac agaggcaagt cggacaacgt tcccagcgaa
gaggtggtca aaaagatgaa 2640gaactactgg cgacagctgc tcaacgccaa
gctcattacc cagcgaaagt tcgacaatct 2700taccaaggcc gagcgaggcg
gtctgtccga gctcgacaag gctggcttca tcaagcgtca 2760actcgtcgag
accagacaga tcacaaagca cgtcgcacag attctcgatt ctcggatgaa
2820caccaagtac gacgagaacg acaagctcat ccgagaggtc aaggtgatta
ctctcaagtc 2880caaactggtc tccgatttcc gaaaggactt tcagttctac
aaggtgcgag agatcaacaa 2940ttaccaccat gcccacgatg cttacctcaa
cgccgtcgtt ggcactgcgc tcatcaagaa 3000ataccccaag ctcgaaagcg
agttcgttta cggcgattac aaggtctacg acgttcgaaa 3060gatgattgcc
aagtccgaac aggagattgg caaggctact gccaagtact tcttttactc
3120caacatcatg aactttttca agaccgagat caccttggcc aacggagaga
ttcgaaagag 3180accacttatc gagaccaacg gcgaaactgg agagatcgtg
tgggacaagg gtcgagactt 3240tgcaaccgtg cgaaaggttc tgtcgatgcc
tcaggtcaac atcgtcaaga aaaccgaggt 3300tcagactggc ggattctcca
aggagtcgat tctgcccaag cgaaactccg acaagctcat 3360cgctcgaaag
aaagactggg atcccaagaa atacggtggc ttcgattctc ctaccgtcgc
3420ctattccgtg cttgtcgttg cgaaggtcga gaagggcaag tccaaaaagc
tcaagtccgt 3480caaggagctg ctcggaatta ccatcatgga gcgatcgagc
ttcgagaaga atcccatcga 3540cttcttggaa gccaagggtt acaaggaggt
caagaaagac ctcattatca agctgcccaa 3600gtactctctg ttcgaactgg
agaacggtcg aaagcgtatg ctcgcctccg ctggcgagct 3660gcagaaggga
aacgagcttg ccttgccttc gaagtacgtc aactttctct atctggcttc
3720tcactacgag aagctcaagg gttctcccga ggacaacgaa cagaagcaac
tcttcgttga 3780gcagcacaaa cattacctcg acgagattat cgagcagatt
tccgagtttt cgaagcgagt 3840catcctggct gatgccaact tggacaaggt
gctctctgcc tacaacaagc atcgggacaa 3900acccattcga gaacaggcgg
agaacatcat tcacctgttt actcttacca acctgggtgc 3960tcctgcagct
ttcaagtact tcgataccac tatcgaccga aagcggtaca catccaccaa
4020ggaggttctc gatgccaccc tgattcacca gtccatcact ggcctgtacg
agacccgaat 4080cgacctgtct cagcttggtg gcgactccag agccgatccc
aagaaaaagc gaaaggtcta 4140agcggccgca agtgtggatg gggaagtgag
tgcccggttc tgtgtgcaca attggcaatc 4200caagatggat ggattcaaca
cagggatata gcgagctacg tggtggtgcg aggatatagc 4260aacggatatt
tatgtttgac acttgagaat gtacgataca agcactgtcc aagtacaata
4320ctaaacatac tgtacatact catactcgta cccgggcaac ggtttcactt
gagtgcagtg 4380gctagtgctc ttactcgtac agtgtgcaat actgcgtatc
atagtctttg atgtatatcg 4440tattcattca tgttagttgc gtacgagccg
gaagcataaa gtgtaaagcc tggggtgcct 4500aatgagtgag ctaactcaca
ttaattgcgt tgcgctcact gcccgctttc cagtcgggaa 4560acctgtcgtg
ccagctgcat taatgaatcg gccaacgcgc ggggagaggc ggtttgcgta
4620ttgggcgctc ttccgcttcc tcgctcactg actcgctgcg ctcggtcgtt
cggctgcggc 4680gagcggtatc agctcactca aaggcggtaa tacggttatc
cacagaatca ggggataacg 4740caggaaagaa catgtgagca aaaggccagc
aaaaggccag gaaccgtaaa aaggccgcgt 4800tgctggcgtt tttccatagg
ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa 4860gtcagaggtg
gcgaaacccg acaggactat aaagatacca ggcgtttccc cctggaagct
4920ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc
gcctttctcc 4980cttcgggaag cgtggcgctt tctcatagct cacgctgtag
gtatctcagt tcggtgtagg 5040tcgttcgctc caagctgggc tgtgtgcacg
aaccccccgt tcagcccgac cgctgcgcct 5100tatccggtaa ctatcgtctt
gagtccaacc cggtaagaca cgacttatcg ccactggcag 5160cagccactgg
taacaggatt agcagagcga ggtatgtagg cggtgctaca gagttcttga
5220agtggtggcc taactacggc tacactagaa ggacagtatt tggtatctgc
gctctgctga 5280agccagttac cttcggaaaa agagttggta gctcttgatc
cggcaaacaa accaccgctg 5340gtagcggtgg tttttttgtt tgcaagcagc
agattacgcg cagaaaaaaa ggatctcaag 5400aagatccttt gatcttttct
acggggtctg acgctcagtg gaacgaaaac tcacgttaag 5460ggattttggt
catgagatta tcaaaaagga tcttcaccta gatcctttta aattaaaaat
5520gaagttttaa atcaatctaa agtatatatg agtaaacttg gtctgacagt
taccaatgct 5580taatcagtga ggcacctatc tcagcgatct gtctatttcg
ttcatccata gttgcctgac 5640tccccgtcgt gtagataact acgatacggg
agggcttacc atctggcccc agtgctgcaa 5700tgataccgcg agacccacgc
tcaccggctc cagatttatc agcaataaac cagccagccg 5760gaagggccga
gcgcagaagt ggtcctgcaa ctttatccgc ctccatccag tctattaatt
5820gttgccggga agctagagta agtagttcgc cagttaatag tttgcgcaac
gttgttgcca 5880ttgctacagg catcgtggtg tcacgctcgt cgtttggtat
ggcttcattc agctccggtt 5940cccaacgatc aaggcgagtt acatgatccc
ccatgttgtg caaaaaagcg gttagctcct 6000tcggtcctcc gatcgttgtc
agaagtaagt tggccgcagt gttatcactc atggttatgg 6060cagcactgca
taattctctt actgtcatgc catccgtaag atgcttttct gtgactggtg
6120agtactcaac caagtcattc tgagaatagt gtatgcggcg accgagttgc
tcttgcccgg 6180cgtcaatacg ggataatacc gcgccacata gcagaacttt
aaaagtgctc atcattggaa 6240aacgttcttc ggggcgaaaa ctctcaagga
tcttaccgct gttgagatcc agttcgatgt 6300aacccactcg tgcacccaac
tgatcttcag catcttttac tttcaccagc gtttctgggt 6360gagcaaaaac
aggaaggcaa aatgccgcaa aaaagggaat aagggcgaca cggaaatgtt
6420gaatactcat actcttcctt tttcaatatt attgaagcat ttatcagggt
tattgtctca 6480tgagcggata catatttgaa tgtatttaga aaaataaaca
aataggggtt ccgcgcacat 6540ttccccgaaa agtgccacct gacgcgccct
gtagcggcgc attaagcgcg gcgggtgtgg 6600tggttacgcg cagcgtgacc
gctacacttg ccagcgccct agcgcccgct cctttcgctt 6660tcttcccttc
ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc
6720tccctttagg gttccgattt agtgctttac ggcacctcga ccccaaaaaa
cttgattagg 6780gtgatggttc acgtagtggg ccatcgccct gatagacggt
ttttcgccct ttgacgttgg 6840agtccacgtt ctttaatagt ggactcttgt
tccaaactgg aacaacactc aaccctatct 6900cggtctattc ttttgattta
taagggattt tgccgatttc ggcctattgg ttaaaaaatg 6960agctgattta
acaaaaattt aacgcgaatt ttaacaaaat attaacgctt acaatttcca
7020ttcgccattc aggctgcgca actgttggga agggcgatcg gtgcgggcct
cttcgctatt 7080acgccagctg gcgaaagggg gatgtgctgc aaggcgatta
agttgggtaa cgccagggtt 7140ttcccagtca cgacgttgta aaacgacggc
cagtgaattg taatacgact cactataggg 7200cgaattgggt accgggcccc
ccctcgaggt cgatggtgtc gataagcttg atatcgaatt 7260catgtcacac
aaaccgatct tcgcctcaag gaaacctaat tctacatccg agagactgcc
7320gagatccagt ctacactgat taattttcgg gccaataatt taaaaaaatc
gtgttatata 7380atattatatg tattatatat atacatcatg atgatactga
cagtcatgtc ccattgctaa 7440atagacagac tccatctgcc gcctccaact
gatgttctca atatttaagg ggtcatctcg 7500cattgtttaa taataaacag
actccatcta ccgcctccaa atgatgttct caaaatatat 7560tgtatgaact
tatttttatt acttagtatt attagacaac ttacttgctt tatgaaaaac
7620acttcctatt taggaaacaa tttataatgg cagttcgttc atttaacaat
ttatgtagaa 7680taaatgttat aaatgcgtat gggaaatctt aaatatggat
agcataaatg atatctgcat 7740tgcctaattc gaaatcaaca
gcaacgaaaa aaatcccttg tacaacataa atagtcatcg 7800agaaatatca
actatcaaag aacagctatt cacacgttac tattgagatt attattggac
7860gagaatcaca cactcaactg tctttctctc ttctagaaat acaggtacaa
gtatgtacta 7920ttctcattgt tcatacttct agtcatttca tcccacatat
tccttggatt tctctccaat 7980gaatgacatt ctatcttgca aattcaacaa
ttataataag atataccaaa gtagcggtat 8040agtggcaatc aaaaagcttc
tctggtgtgc ttctcgtatt tatttttatt ctaatgatcc 8100attaaaggta
tatatttatt tcttgttata taatcctttt gtttattaca tgggctggat
8160acataaaggt attttgattt aattttttgc ttaaattcaa tcccccctcg
ttcagtgtca 8220actgtaatgg taggaaatta ccatactttt gaagaagcaa
aaaaaatgaa agaaaaaaaa 8280aatcgtattt ccaggttaga cgttccgcag
aatctagaat gcggtatgcg gtacattgtt 8340cttcgaacgt aaaagttgcg
ctccctgaga tattgtacat ttttgctttt acaagtacaa 8400gtacatcgta
caactatgta ctactgttga tgcatccaca acagtttgtt ttgttttttt
8460ttgttttttt tttttctaat gattcattac cgctatgtat acctacttgt
acttgtagta 8520agccgggtta ttggcgttca attaatcata gacttatgaa
tctgcacggt gtgcgctgcg 8580agttactttt agcttatgca tgctacttgg
gtgtaatatt gggatctgtt cggaaatcaa 8640cggatgctca atcgatttcg
acagtaatta attaagtcat acacaagtca gctttcttcg 8700agcctcatat
aagtataagt agttcaacgt attagcactg tacccagcat ctccgtatcg
8760agaaacacaa caacatgccc cattggacag atcatgcgga tacacaggtt
gtgcagtatc 8820atacatactc gatcagacag gtcgtctgac catcatacaa
gctgaacaag cgctccatac 8880ttgcacgctc tctatataca cagttaaatt
acatatccat agtctaacct ctaacagtta 8940atcttctggt aagcctccca
gccagccttc tggtatcgct tggcctcctc aataggatct 9000cggttctggc
cgtacagacc tcggccgaca attatgatat ccgttccggt agacatgaca
9060tcctcaacag ttcggtactg ctgtccgaga gcgtctccct tgtcgtcaag
acccaccccg 9120ggggtcagaa taagccagtc ctcagagtcg cccttaggtc
ggttctgggc aatgaagcca 9180accacaaact cggggtcgga tcgggcaagc
tcaatggtct gcttggagta ctcgccagtg 9240gccagagagc ccttgcaaga
cagctcggcc agcatgagca gacctctggc cagcttctcg 9300ttgggagagg
ggactaggaa ctccttgtac tgggagttct cgtagtcaga gacgtcctcc
9360ttcttctgtt cagagacagt ttcctcggca ccagctcgca ggccagcaat
gattccggtt 9420ccgggtacac cgtgggcgtt ggtgatatcg gaccactcgg
cgattcggtg acaccggtac 9480tggtgcttga cagtgttgcc aatatctgcg
aactttctgt cctcgaacag gaagaaaccg 9540tgcttaagag caagttcctt
gagggggagc acagtgccgg cgtaggtgaa gtcgtcaatg 9600atgtcgatat
gggttttgat catgcacaca taaggtccga ccttatcggc aagctcaatg
9660agctccttgg tggtggtaac atccagagaa gcacacaggt tggttttctt
ggctgccacg 9720agcttgagca ctcgagcggc aaaggcggac ttgtggacgt
tagctcgagc ttcgtaggag 9780ggcattttgg tggtgaagag gagactgaaa
taaatttagt ctgcagaact ttttatcgga 9840accttatctg gggcagtgaa
gtatatgtta tggtaatagt tacgagttag ttgaacttat 9900agatagactg
gactatacgg ctatcggtcc aaattagaaa gaacgtcaat ggctctctgg
9960gcgtcgcctt tgccgacaaa aatgtgatca tgatgaaagc cagcaatgac
gttgcagctg 10020atattgttgt cggccaaccg cgccgaaaac gcagctgtca
gacccacagc ctccaacgaa 10080gaatgtatcg tcaaagtgat ccaagcacac
tcatagttgg agtcgtactc caaaggcggc 10140aatgacgagt cagacagata
ctcgtcgacg tttaaaccat catctaaggg cctcaaaact 10200acctcggaac
tgctgcgctg atctggacac cacagaggtt ccgagcactt taggttgcac
10260caaatgtccc accaggtgca ggcagaaaac gctggaacag cgtgtacagt
ttgtcttaac 10320aaaaagtgag ggcgctgagg tcgagcaggg tggtgtgact
tgttatagcc tttagagctg 10380cgaaagcgcg tatggatttg gctcatcagg
ccagattgag ggtctgtgga cacatgtcat 10440gttagtgtac ttcaatcgcc
ccctggatat agccccgaca ataggccgtg gcctcatttt 10500tttgccttcc
gcacatttcc attgctcggt acccacacct tgcttctcct gcacttgcca
10560accttaatac tggtttacat tgaccaacat cttacaagcg gggggcttgt
ctagggtata 10620tataaacagt ggctctccca atcggttgcc agtctctttt
ttcctttctt tccccacaga 10680ttcgaaatct aaactacaca tcacac
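10706

SEQ ID NO: 7; length: 35; type: DNA; organism: artificial sequence; description: Cas9-NLS forward PCR primer
gggggaattc gacaagaaat actccatcgg cctgg 35

SEQ ID NO: 8; length: 31; type: DNA; organism: artificial sequence; description: Cas9-NLS reverse PCR primer
ccccaagctt agcggccgct tagacctttc g 31

SEQ ID NO: 9; length: 4166; type: DNA; organism: artificial sequence; description: EcoRI-Cas9-NLS-HinDIII PCR product
gggggaattc gacaagaaat actccatcgg cctggacatt ggaaccaact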
ctgtcggctg 60ggctgtcatc accgacgagt acaaggtgcc ctccaagaaa ttcaaggtcc
tcggaaacac 120cgatcgacac tccatcaaga aaaacctcat tggtgccctg
ttgttcgatt ctggcgagac 180tgccgaagct accagactca agcgaactgc
tcggcgacgt tacacccgac ggaagaaccg 240aatctgctac ctgcaggaga
tcttttccaa cgagatggcc aaggtggacg attcgttctt 300tcatcgactg
gaggaatcct tcctcgtcga ggaagacaag aaacacgagc gtcatcccat
360ctttggcaac attgtggacg aggttgctta ccacgagaag tatcctacca
tctaccacct 420gcgaaagaaa ctcgtcgatt ccaccgacaa ggcggatctc
agacttatct acctcgctct 480ggcacacatg atcaagtttc gaggtcattt
cctcatcgag ggcgatctca atcccgacaa 540cagcgatgtg gacaagctgt
tcattcagct cgttcagacc tacaaccagc tgttcgagga 600aaaccccatc
aatgcctccg gagtcgatgc aaaggccatc ttgtctgctc gactctcgaa
660gagcagacga ctggagaacc tcattgccca acttcctggc gagaaaaaga
acggactgtt 720tggcaacctc attgcccttt ctcttggtct cacacccaac
ttcaagtcca acttcgatct 780ggcggaggac gccaagctcc agctgtccaa
ggacacctac gacgatgacc tcgacaacct 840gcttgcacag attggcgatc
agtacgccga cctgtttctc gctgccaaga acctttcgga 900tgctattctc
ttgtctgaca ttctgcgagt caacaccgag atcacaaagg ctcccctttc
960tgcctccatg atcaagcgat acgacgagca ccatcaggat ctcacactgc
tcaaggctct 1020tgtccgacag caactgcccg agaagtacaa ggagatcttt
ttcgatcagt cgaagaacgg 1080ctacgctgga tacatcgacg gcggagcctc
tcaggaagag ttctacaagt tcatcaagcc 1140aattctcgag aagatggacg
gaaccgagga actgcttgtc aagctcaatc gagaggatct 1200gcttcggaag
caacgaacct tcgacaacgg cagcattcct catcagatcc acctcggtga
1260gctgcacgcc attcttcgac gtcaggaaga cttctacccc tttctcaagg
acaaccgaga 1320gaagatcgag aagattctta cctttcgaat cccctactat
gttggtcctc ttgccagagg 1380aaactctcga tttgcttgga tgactcgaaa
gtccgaggaa accatcactc cctggaactt 1440cgaggaagtc gtggacaagg
gtgcctctgc acagtccttc atcgagcgaa tgaccaactt 1500cgacaagaat
ctgcccaacg agaaggttct tcccaagcat tcgctgctct acgagtactt
1560tacagtctac aacgaactca ccaaagtcaa gtacgttacc gagggaatgc
gaaagcctgc 1620cttcttgtct ggcgaacaga agaaagccat tgtcgatctc
ctgttcaaga ccaaccgaaa 1680ggtcactgtt aagcagctca aggaggacta
cttcaagaaa atcgagtgtt tcgacagcgt 1740cgagatttcc ggagttgagg
accgattcaa cgcctctttg ggcacctatc acgatctgct 1800caagattatc
aaggacaagg attttctcga caacgaggaa aacgaggaca ttctggagga
1860catcgtgctc actcttaccc tgttcgaaga tcgggagatg atcgaggaac
gactcaagac 1920atacgctcac ctgttcgacg acaaggtcat gaaacaactc
aagcgacgta gatacaccgg 1980ctggggaaga ctttcgcgaa agctcatcaa
cggcatcaga gacaagcagt ccggaaagac 2040cattctggac tttctcaagt
ccgatggctt tgccaaccga aacttcatgc agctcattca 2100cgacgattct
cttaccttca aggaggacat ccagaaggca caagtgtccg gtcagggcga
2160cagcttgcac gaacatattg ccaacctggc tggttcgcca gccatcaaga
aaggcattct 2220ccagactgtc aaggttgtcg acgagctggt gaaggtcatg
ggacgtcaca agcccgagaa 2280cattgtgatc gagatggcca gagagaacca
gacaactcaa aagggtcaga aaaactcgcg 2340agagcggatg aagcgaatcg
aggaaggcat caaggagctg ggatcccaga ttctcaagga 2400gcatcccgtc
gagaacactc aactgcagaa cgagaagctg tatctctact atctgcagaa
2460tggtcgagac atgtacgtgg atcaggaact ggacatcaat cgtctcagcg
actacgatgt 2520ggaccacatt gtccctcaat cctttctcaa ggacgattct
atcgacaaca aggtccttac 2580acgatccgac aagaacagag gcaagtcgga
caacgttccc agcgaagagg tggtcaaaaa 2640gatgaagaac tactggcgac
agctgctcaa cgccaagctc attacccagc gaaagttcga 2700caatcttacc
aaggccgagc gaggcggtct gtccgagctc gacaaggctg gcttcatcaa
2760gcgtcaactc gtcgagacca gacagatcac aaagcacgtc gcacagattc
tcgattctcg 2820gatgaacacc aagtacgacg agaacgacaa gctcatccga
gaggtcaagg tgattactct 2880caagtccaaa ctggtctccg atttccgaaa
ggactttcag ttctacaagg tgcgagagat 2940caacaattac caccatgccc
acgatgctta cctcaacgcc gtcgttggca ctgcgctcat 3000caagaaatac
cccaagctcg aaagcgagtt cgtttacggc gattacaagg tctacgacgt
3060tcgaaagatg attgccaagt ccgaacagga gattggcaag gctactgcca
agtacttctt 3120ttactccaac atcatgaact ttttcaagac cgagatcacc
ttggccaacg gagagattcg 3180aaagagacca cttatcgaga ccaacggcga
aactggagag atcgtgtggg acaagggtcg 3240agactttgca accgtgcgaa
aggttctgtc gatgcctcag gtcaacatcg tcaagaaaac 3300cgaggttcag
actggcggat tctccaagga gtcgattctg cccaagcgaa actccgacaa
3360gctcatcgct cgaaagaaag actgggatcc caagaaatac ggtggcttcg
attctcctac 3420cgtcgcctat tccgtgcttg tcgttgcgaa ggtcgagaag
ggcaagtcca aaaagctcaa 3480gtccgtcaag gagctgctcg gaattaccat
catggagcga tcgagcttcg agaagaatcc 3540catcgacttc ttggaagcca
agggttacaa ggaggtcaag aaagacctca ttatcaagct 3600gcccaagtac
tctctgttcg aactggagaa cggtcgaaag cgtatgctcg cctccgctgg
3660cgagctgcag aagggaaacg agcttgcctt gccttcgaag tacgtcaact
ttctctatct 3720ggcttctcac tacgagaagc tcaagggttc tcccgaggac
aacgaacaga agcaactctt 3780cgttgagcag cacaaacatt acctcgacga
gattatcgag cagatttccg agttttcgaa 3840gcgagtcatc ctggctgatg
ccaacttgga caaggtgctc tctgcctaca acaagcatcg 3900ggacaaaccc
attcgagaac aggcggagaa catcattcac ctgtttactc ttaccaacct
3960gggtgctcct gcagctttca agtacttcga taccactatc gaccgaaagc
ggtacacatc 4020caccaaggag gttctcgatg ccaccctgat tcaccagtcc
atcactggcc tgtacgagac 4080ccgaatcgac ctgtctcagc ttggtggcga
ctccagagcc gatcccaaga aaaagcgaaa 4140ggtctaagcg gccgctaagc ttgggg
4166

SEQ ID NO: 10; length: 4092; type: DNA; organism: artificial sequence; description: pBAD/HisB plasmid
aagaaaccaa
ttgtccatat tgcatcagac attgccgtca ctgcgtcttt tactggctct 60tctcgctaac
caaaccggta accccgctta ttaaaagcat tctgtaacaa agcgggacca
120aagccatgac aaaaacgcgt aacaaaagtg tctataatca cggcagaaaa
gtccacattg 180attatttgca cggcgtcaca ctttgctatg ccatagcatt
tttatccata agattagcgg 240atcctacctg acgcttttta tcgcaactct
ctactgtttc tccatacccg ttttttgggc 300taacaggagg aattaaccat
ggggggttct catcatcatc atcatcatgg tatggctagc 360atgactggtg
gacagcaaat gggtcgggat ctgtacgacg atgacgataa ggatccgagc
420tcgagatctg cagctggtac catatgggaa ttcgaagctt ggctgttttg
gcggatgaga 480gaagattttc agcctgatac agattaaatc agaacgcaga
agcggtctga taaaacagaa 540tttgcctggc ggcagtagcg cggtggtccc
acctgacccc atgccgaact cagaagtgaa 600acgccgtagc gccgatggta
gtgtggggtc tccccatgcg agagtaggga actgccaggc 660atcaaataaa
acgaaaggct cagtcgaaag actgggcctt tcgttttatc tgttgtttgt
720cggtgaacgc tctcctgagt aggacaaatc cgccgggagc ggatttgaac
gttgcgaagc 780aacggcccgg agggtggcgg gcaggacgcc cgccataaac
tgccaggcat caaattaagc 840agaaggccat cctgacggat ggcctttttg
cgtttctaca aactcttttg tttatttttc 900taaatacatt caaatatgta
tccgctcatg agacaataac cctgataaat gcttcaataa 960tattgaaaaa
ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt
1020gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt
aaaagatgct 1080gaagatcagt tgggtgcacg agtgggttac atcgaactgg
atctcaacag cggtaagatc 1140cttgagagtt ttcgccccga agaacgtttt
ccaatgatga gcacttttaa agttctgcta 1200tgtggcgcgg tattatcccg
tgttgacgcc gggcaagagc aactcggtcg ccgcatacac 1260tattctcaga
atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc
1320atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac
tgcggccaac 1380ttacttctga caacgatcgg aggaccgaag gagctaaccg
cttttttgca caacatgggg 1440gatcatgtaa ctcgccttga tcgttgggaa
ccggagctga atgaagccat accaaacgac 1500gagcgtgaca ccacgatgcc
tgtagcaatg gcaacaacgt tgcgcaaact attaactggc 1560gaactactta
ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt
1620gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga
taaatctgga 1680gccggtgagc gtgggtctcg cggtatcatt gcagcactgg
ggccagatgg taagccctcc 1740cgtatcgtag ttatctacac gacggggagt
caggcaacta tggatgaacg aaatagacag 1800atcgctgaga taggtgcctc
actgattaag cattggtaac tgtcagacca agtttactca 1860tatatacttt
agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc
1920ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca
ctgagcgtca 1980gaccccgtag aaaagatcaa aggatcttct tgagatcctt
tttttctgcg cgtaatctgc 2040tgcttgcaaa caaaaaaacc accgctacca
gcggtggttt gtttgccgga tcaagagcta 2100ccaactcttt ttccgaaggt
aactggcttc agcagagcgc agataccaaa tactgtcctt 2160ctagtgtagc
cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc
2220gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg
tcttaccggg 2280ttggactcaa gacgatagtt accggataag gcgcagcggt
cgggctgaac ggggggttcg 2340tgcacacagc ccagcttgga gcgaacgacc
tacaccgaac tgagatacct acagcgtgag 2400ctatgagaaa gcgccacgct
tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc 2460agggtcggaa
caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat
2520agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg
ctcgtcaggg 2580gggcggagcc tatggaaaaa cgccagcaac gcggcctttt
tacggttcct ggccttttgc 2640tggccttttg ctcacatgtt ctttcctgcg
ttatcccctg attctgtgga taaccgtatt 2700accgcctttg agtgagctga
taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca 2760gtgagcgagg
aagcggaaga gcgcctgatg cggtattttc tccttacgca tctgtgcggt
2820atttcacacc gcatatggtg cactctcagt acaatctgct ctgatgccgc
atagttaagc 2880cagtatacac tccgctatcg ctacgtgact gggtcatggc
tgcgccccga cacccgccaa 2940cacccgctga cgcgccctga cgggcttgtc
tgctcccggc atccgcttac agacaagctg 3000tgaccgtctc cgggagctgc
atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga 3060ggcagcagat
caattcgcgc gcgaaggcga agcggcatgc ataatgtgcc tgtcaaatgg
3120acgaagcagg gattctgcaa accctatgct actccgtcaa gccgtcaatt
gtctgattcg 3180ttaccaatta tgacaacttg acggctacat cattcacttt
ttcttcacaa ccggcacgga 3240actcgctcgg gctggccccg gtgcattttt
taaatacccg cgagaaatag agttgatcgt 3300caaaaccaac attgcgaccg
acggtggcga taggcatccg ggtggtgctc aaaagcagct 3360tcgcctggct
gatacgttgg tcctcgcgcc agcttaagac gctaatccct aactgctggc
3420ggaaaagatg tgacagacgc gacggcgaca agcaaacatg ctgtgcgacg
ctggcgatat 3480caaaattgct gtctgccagg tgatcgctga tgtactgaca
agcctcgcgt acccgattat 3540ccatcggtgg atggagcgac tcgttaatcg
cttccatgcg ccgcagtaac aattgctcaa 3600gcagatttat cgccagcagc
tccgaatagc gcccttcccc ttgcccggcg ttaatgattt 3660gcccaaacag
gtcgctgaaa tgcggctggt gcgcttcatc cgggcgaaag aaccccgtat
3720tggcaaatat tgacggccag ttaagccatt catgccagta ggcgcgcgga
cgaaagtaaa 3780cccactggtg ataccattcg cgagcctccg gatgacgacc
gtagtgatga atctctcctg 3840gcgggaacag caaaatatca cccggtcggc
aaacaaattc tcgtccctga tttttcacca 3900ccccctgacc gcgaatggtg
agattgagaa tataaccttt cattcccagc ggtcggtcga 3960taaaaaaatc
gagataaccg ttggcctcaa tcggcgttaa acccgccacc agatgggcat
4020taaacgagta tcccggcagc aggggatcat tttgcgcttc agccatactt
ttcatactcc 4080cgccattcag ag 4092

SEQ ID NO: 11; length: 8237; type: DNA; organism: artificial sequence; description: pRF48 plasmid
aattcgacaa gaaatactcc atcggcctgg acattggaac caactctgtc
ggctgggctg 60tcatcaccga cgagtacaag gtgccctcca agaaattcaa ggtcctcgga
aacaccgatc 120gacactccat caagaaaaac ctcattggtg ccctgttgtt
cgattctggc gagactgccg 180aagctaccag actcaagcga actgctcggc
gacgttacac ccgacggaag aaccgaatct 240gctacctgca ggagatcttt
tccaacgaga tggccaaggt ggacgattcg ttctttcatc 300gactggagga
atccttcctc gtcgaggaag acaagaaaca cgagcgtcat cccatctttg
360gcaacattgt ggacgaggtt gcttaccacg agaagtatcc taccatctac
cacctgcgaa 420agaaactcgt cgattccacc gacaaggcgg atctcagact
tatctacctc gctctggcac 480acatgatcaa gtttcgaggt catttcctca
tcgagggcga tctcaatccc gacaacagcg 540atgtggacaa gctgttcatt
cagctcgttc agacctacaa ccagctgttc gaggaaaacc 600ccatcaatgc
ctccggagtc gatgcaaagg ccatcttgtc tgctcgactc tcgaagagca
660gacgactgga gaacctcatt gcccaacttc ctggcgagaa aaagaacgga
ctgtttggca 720acctcattgc cctttctctt ggtctcacac ccaacttcaa
gtccaacttc gatctggcgg 780aggacgccaa gctccagctg tccaaggaca
cctacgacga tgacctcgac aacctgcttg 840cacagattgg cgatcagtac
gccgacctgt ttctcgctgc caagaacctt tcggatgcta 900ttctcttgtc
tgacattctg cgagtcaaca ccgagatcac aaaggctccc ctttctgcct
960ccatgatcaa gcgatacgac gagcaccatc aggatctcac actgctcaag
gctcttgtcc 1020gacagcaact gcccgagaag tacaaggaga tctttttcga
tcagtcgaag aacggctacg 1080ctggatacat cgacggcgga gcctctcagg
aagagttcta caagttcatc aagccaattc 1140tcgagaagat ggacggaacc
gaggaactgc ttgtcaagct caatcgagag gatctgcttc 1200ggaagcaacg
aaccttcgac aacggcagca ttcctcatca gatccacctc ggtgagctgc
1260acgccattct tcgacgtcag gaagacttct acccctttct caaggacaac
cgagagaaga 1320tcgagaagat tcttaccttt cgaatcccct actatgttgg
tcctcttgcc agaggaaact 1380ctcgatttgc ttggatgact cgaaagtccg
aggaaaccat cactccctgg aacttcgagg 1440aagtcgtgga caagggtgcc
tctgcacagt ccttcatcga gcgaatgacc aacttcgaca 1500agaatctgcc
caacgagaag gttcttccca agcattcgct gctctacgag tactttacag
1560tctacaacga actcaccaaa gtcaagtacg ttaccgaggg aatgcgaaag
cctgccttct 1620tgtctggcga acagaagaaa gccattgtcg atctcctgtt
caagaccaac cgaaaggtca 1680ctgttaagca gctcaaggag gactacttca
agaaaatcga gtgtttcgac agcgtcgaga 1740tttccggagt tgaggaccga
ttcaacgcct ctttgggcac ctatcacgat ctgctcaaga 1800ttatcaagga
caaggatttt ctcgacaacg aggaaaacga ggacattctg gaggacatcg
1860tgctcactct taccctgttc gaagatcggg agatgatcga ggaacgactc
aagacatacg 1920ctcacctgtt cgacgacaag gtcatgaaac aactcaagcg
acgtagatac accggctggg 1980gaagactttc gcgaaagctc atcaacggca
tcagagacaa gcagtccgga aagaccattc 2040tggactttct caagtccgat
ggctttgcca accgaaactt catgcagctc attcacgacg 2100attctcttac
cttcaaggag gacatccaga aggcacaagt gtccggtcag ggcgacagct
2160tgcacgaaca tattgccaac ctggctggtt cgccagccat caagaaaggc
attctccaga 2220ctgtcaaggt tgtcgacgag ctggtgaagg tcatgggacg
tcacaagccc gagaacattg 2280tgatcgagat ggccagagag aaccagacaa
ctcaaaaggg tcagaaaaac tcgcgagagc 2340ggatgaagcg aatcgaggaa
ggcatcaagg agctgggatc ccagattctc aaggagcatc 2400ccgtcgagaa
cactcaactg cagaacgaga agctgtatct ctactatctg cagaatggtc
2460gagacatgta cgtggatcag gaactggaca tcaatcgtct cagcgactac
gatgtggacc 2520acattgtccc tcaatccttt ctcaaggacg attctatcga
caacaaggtc cttacacgat 2580ccgacaagaa cagaggcaag tcggacaacg
ttcccagcga agaggtggtc aaaaagatga 2640agaactactg gcgacagctg
ctcaacgcca agctcattac ccagcgaaag ttcgacaatc 2700ttaccaaggc
cgagcgaggc ggtctgtccg agctcgacaa ggctggcttc atcaagcgtc
2760aactcgtcga gaccagacag atcacaaagc acgtcgcaca gattctcgat
tctcggatga 2820acaccaagta cgacgagaac gacaagctca tccgagaggt
caaggtgatt actctcaagt 2880ccaaactggt ctccgatttc cgaaaggact
ttcagttcta caaggtgcga gagatcaaca 2940attaccacca tgcccacgat
gcttacctca acgccgtcgt tggcactgcg ctcatcaaga 3000aataccccaa
gctcgaaagc gagttcgttt acggcgatta caaggtctac gacgttcgaa
3060agatgattgc caagtccgaa caggagattg gcaaggctac tgccaagtac
ttcttttact 3120ccaacatcat gaactttttc aagaccgaga tcaccttggc
caacggagag attcgaaaga 3180gaccacttat cgagaccaac ggcgaaactg
gagagatcgt gtgggacaag ggtcgagact 3240ttgcaaccgt gcgaaaggtt
ctgtcgatgc ctcaggtcaa catcgtcaag aaaaccgagg 3300ttcagactgg
cggattctcc aaggagtcga ttctgcccaa gcgaaactcc gacaagctca
3360tcgctcgaaa gaaagactgg
gatcccaaga aatacggtgg cttcgattct cctaccgtcg 3420cctattccgt
gcttgtcgtt gcgaaggtcg agaagggcaa gtccaaaaag ctcaagtccg
3480tcaaggagct gctcggaatt accatcatgg agcgatcgag cttcgagaag
aatcccatcg 3540acttcttgga agccaagggt tacaaggagg tcaagaaaga
cctcattatc aagctgccca 3600agtactctct gttcgaactg gagaacggtc
gaaagcgtat gctcgcctcc gctggcgagc 3660tgcagaaggg aaacgagctt
gccttgcctt cgaagtacgt caactttctc tatctggctt 3720ctcactacga
gaagctcaag ggttctcccg aggacaacga acagaagcaa ctcttcgttg
3780agcagcacaa acattacctc gacgagatta tcgagcagat ttccgagttt
tcgaagcgag 3840tcatcctggc tgatgccaac ttggacaagg tgctctctgc
ctacaacaag catcgggaca 3900aacccattcg agaacaggcg gagaacatca
ttcacctgtt tactcttacc aacctgggtg 3960ctcctgcagc tttcaagtac
ttcgatacca ctatcgaccg aaagcggtac acatccacca 4020aggaggttct
cgatgccacc ctgattcacc agtccatcac tggcctgtac gagacccgaa
4080tcgacctgtc tcagcttggt ggcgactcca gagccgatcc caagaaaaag
cgaaaggtct 4140aagcggccgc taagcttggc tgttttggcg gatgagagaa
gattttcagc ctgatacaga 4200ttaaatcaga acgcagaagc ggtctgataa
aacagaattt gcctggcggc agtagcgcgg 4260tggtcccacc tgaccccatg
ccgaactcag aagtgaaacg ccgtagcgcc gatggtagtg 4320tggggtctcc
ccatgcgaga gtagggaact gccaggcatc aaataaaacg aaaggctcag
4380tcgaaagact gggcctttcg ttttatctgt tgtttgtcgg tgaacgctct
cctgagtagg 4440acaaatccgc cgggagcgga tttgaacgtt gcgaagcaac
ggcccggagg gtggcgggca 4500ggacgcccgc cataaactgc caggcatcaa
attaagcaga aggccatcct gacggatggc 4560ctttttgcgt ttctacaaac
tcttttgttt atttttctaa atacattcaa atatgtatcc 4620gctcatgaga
caataaccct gataaatgct tcaataatat tgaaaaagga agagtatgag
4680tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc
ttcctgtttt 4740tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa
gatcagttgg gtgcacgagt 4800gggttacatc gaactggatc tcaacagcgg
taagatcctt gagagttttc gccccgaaga 4860acgttttcca atgatgagca
cttttaaagt tctgctatgt ggcgcggtat tatcccgtgt 4920tgacgccggg
caagagcaac tcggtcgccg catacactat tctcagaatg acttggttga
4980gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag
aattatgcag 5040tgctgccata accatgagtg ataacactgc ggccaactta
cttctgacaa cgatcggagg 5100accgaaggag ctaaccgctt ttttgcacaa
catgggggat catgtaactc gccttgatcg 5160ttgggaaccg gagctgaatg
aagccatacc aaacgacgag cgtgacacca cgatgcctgt 5220agcaatggca
acaacgttgc gcaaactatt aactggcgaa ctacttactc tagcttcccg
5280gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc
tgcgctcggc 5340ccttccggct ggctggttta ttgctgataa atctggagcc
ggtgagcgtg ggtctcgcgg 5400tatcattgca gcactggggc cagatggtaa
gccctcccgt atcgtagtta tctacacgac 5460ggggagtcag gcaactatgg
atgaacgaaa tagacagatc gctgagatag gtgcctcact 5520gattaagcat
tggtaactgt cagaccaagt ttactcatat atactttaga ttgatttaaa
5580acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc
tcatgaccaa 5640aatcccttaa cgtgagtttt cgttccactg agcgtcagac
cccgtagaaa agatcaaagg 5700atcttcttga gatccttttt ttctgcgcgt
aatctgctgc ttgcaaacaa aaaaaccacc 5760gctaccagcg gtggtttgtt
tgccggatca agagctacca actctttttc cgaaggtaac 5820tggcttcagc
agagcgcaga taccaaatac tgtccttcta gtgtagccgt agttaggcca
5880ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc
tgttaccagt 5940ggctgctgcc agtggcgata agtcgtgtct taccgggttg
gactcaagac gatagttacc 6000ggataaggcg cagcggtcgg gctgaacggg
gggttcgtgc acacagccca gcttggagcg 6060aacgacctac accgaactga
gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc 6120cgaagggaga
aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac
6180gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt
ttcgccacct 6240ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg
cggagcctat ggaaaaacgc 6300cagcaacgcg gcctttttac ggttcctggc
cttttgctgg ccttttgctc acatgttctt 6360tcctgcgtta tcccctgatt
ctgtggataa ccgtattacc gcctttgagt gagctgatac 6420cgctcgccgc
agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg
6480cctgatgcgg tattttctcc ttacgcatct gtgcggtatt tcacaccgca
tatggtgcac 6540tctcagtaca atctgctctg atgccgcata gttaagccag
tatacactcc gctatcgcta 6600cgtgactggg tcatggctgc gccccgacac
ccgccaacac ccgctgacgc gccctgacgg 6660gcttgtctgc tcccggcatc
cgcttacaga caagctgtga ccgtctccgg gagctgcatg 6720tgtcagaggt
tttcaccgtc atcaccgaaa cgcgcgaggc agcagatcaa ttcgcgcgcg
6780aaggcgaagc ggcatgcata atgtgcctgt caaatggacg aagcagggat
tctgcaaacc 6840ctatgctact ccgtcaagcc gtcaattgtc tgattcgtta
ccaattatga caacttgacg 6900gctacatcat tcactttttc ttcacaaccg
gcacggaact cgctcgggct ggccccggtg 6960cattttttaa atacccgcga
gaaatagagt tgatcgtcaa aaccaacatt gcgaccgacg 7020gtggcgatag
gcatccgggt ggtgctcaaa agcagcttcg cctggctgat acgttggtcc
7080tcgcgccagc ttaagacgct aatccctaac tgctggcgga aaagatgtga
cagacgcgac 7140ggcgacaagc aaacatgctg tgcgacgctg gcgatatcaa
aattgctgtc tgccaggtga 7200tcgctgatgt actgacaagc ctcgcgtacc
cgattatcca tcggtggatg gagcgactcg 7260ttaatcgctt ccatgcgccg
cagtaacaat tgctcaagca gatttatcgc cagcagctcc 7320gaatagcgcc
cttccccttg cccggcgtta atgatttgcc caaacaggtc gctgaaatgc
7380ggctggtgcg cttcatccgg gcgaaagaac cccgtattgg caaatattga
cggccagtta 7440agccattcat gccagtaggc gcgcggacga aagtaaaccc
actggtgata ccattcgcga 7500gcctccggat gacgaccgta gtgatgaatc
tctcctggcg ggaacagcaa aatatcaccc 7560ggtcggcaaa caaattctcg
tccctgattt ttcaccaccc cctgaccgcg aatggtgaga 7620ttgagaatat
aacctttcat tcccagcggt cggtcgataa aaaaatcgag ataaccgttg
7680gcctcaatcg gcgttaaacc cgccaccaga tgggcattaa acgagtatcc
cggcagcagg 7740ggatcatttt gcgcttcagc catacttttc atactcccgc
cattcagaga agaaaccaat 7800tgtccatatt gcatcagaca ttgccgtcac
tgcgtctttt actggctctt ctcgctaacc 7860aaaccggtaa ccccgcttat
taaaagcatt ctgtaacaaa gcgggaccaa agccatgaca 7920aaaacgcgta
acaaaagtgt ctataatcac ggcagaaaag tccacattga ttatttgcac
7980ggcgtcacac tttgctatgc catagcattt ttatccataa gattagcgga
tcctacctga 8040cgctttttat cgcaactctc tactgtttct ccatacccgt
tttttgggct aacaggagga 8100attaaccatg gggggttctc atcatcatca
tcatcatggt atggctagca tgactggtgg 8160acagcaaatg ggtcgggatc
tgtacgacga tgacgataag gatccgagct cgagatctgc 8220agctggtacc atatggg
8237

SEQ ID NO: 12; length: 54; type: PRT; organism: Epstein-Barr virus
Glu Cys Asp Ser Glu Leu Glu Ile Lys Arg Tyr Lys Arg Val Arg Val
1               5                   10                  15
Ala Ser Arg Lys Cys Arg Ala Lys Phe Lys Gln Leu Leu Gln His Tyr
            20                  25                  30
Arg Glu Val Ala Ala Ala Lys Ser Ser Glu Asn Asp Arg Leu Arg Leu
        35                  40                  45
Leu Leu Lys Gln Met Cys
    50

SEQ ID NO: 13; length: 18; type: PRT; organism: mus musculus
Leu Leu Ile Ile Leu Arg Arg Arg Ile Arg Lys Gln Ala His Ala His
1               5                   10                  15
Ser Lys

SEQ ID NO: 14; length: 21; type: PRT; organism: unknown; description: TP10 CPP
Ala Gly Tyr Leu Leu Gly Lys Ile Asn Leu Lys Ala Cys Ala Ala Cys
1               5                   10                  15
Ala Lys Lys Ile Leu
            20

SEQ ID NO: 15; length: 17; type: PRT; organism: artificial sequence; description: PolyR CPP
Gly Gly Gly Gly Arg Arg Arg Arg Arg Arg Arg Arg Arg Leu Leu Leu
1               5                   10                  15
Leu

SEQ ID NO: 16; length: 194; type: DNA; organism: artificial sequence; description: NcoI-6xHis-ZEBRA CPP-EcoRI
ccatggggca tcaccaccat caccacgaat gcgactcaga actggaaatc aaacgctata 60
aacgtgtgcg tgtggcatcc cgtaaatgtc gcgcaaagtt taaacagctg ctgcaacatt 120
atcgtgaagt agcggctgcg aaaagctccg aaaacgaccg tttacgcctc ctcctgaagc 180
aaatgtgcga attc 194

SEQ ID NO: 17; length: 86; type: DNA; organism: artificial sequence; description: NcoI-6xHis-pVEC CPP-EcoRI
ccatggggca tcaccaccat caccacttat tgattatctt gcgtcgtcgc atccgcaaac 60
aggcgcacgc acatagcaag gaattc 86

SEQ ID NO: 18; length: 95; type: DNA; organism: artificial sequence; description: NcoI-6xHis-TP10 CPP-EcoRI
ccatggggca tcaccaccat caccacgcgg gttacctgct gggcaagatt aatcttaaag 60
cctgcgccgc gtgtgctaag aaaattttgg aattc 95

SEQ ID NO: 19; length: 83; type: DNA; organism: artificial sequence; description: NcoI-6xHis-PolyR CPP-EcoRI
ccatggggca tcaccaccat caccacggcg ggggtggtcg tcgtcgccgt cgccgccgtc 60
gtcgcctcct gctgctggaa ttc 83

SEQ ID NO: 20; length: 8294; type: DNA; organism: artificial sequence; description: pRF144 plasmid
ccatggggca
tcaccaccat caccacgaat gcgactcaga actggaaatc aaacgctata 60aacgtgtgcg
tgtggcatcc cgtaaatgtc gcgcaaagtt taaacagctg ctgcaacatt
120atcgtgaagt agcggctgcg aaaagctccg aaaacgaccg tttacgcctc
ctcctgaagc 180aaatgtgcga attcgacaag aaatactcca tcggcctgga
cattggaacc aactctgtcg 240gctgggctgt catcaccgac gagtacaagg
tgccctccaa gaaattcaag gtcctcggaa 300acaccgatcg acactccatc
aagaaaaacc tcattggtgc cctgttgttc gattctggcg 360agactgccga
agctaccaga ctcaagcgaa ctgctcggcg acgttacacc cgacggaaga
420accgaatctg ctacctgcag gagatctttt ccaacgagat ggccaaggtg
gacgattcgt 480tctttcatcg actggaggaa tccttcctcg tcgaggaaga
caagaaacac gagcgtcatc 540ccatctttgg caacattgtg gacgaggttg
cttaccacga gaagtatcct accatctacc 600acctgcgaaa gaaactcgtc
gattccaccg acaaggcgga tctcagactt atctacctcg 660ctctggcaca
catgatcaag tttcgaggtc atttcctcat cgagggcgat ctcaatcccg
720acaacagcga tgtggacaag ctgttcattc agctcgttca gacctacaac
cagctgttcg 780aggaaaaccc catcaatgcc tccggagtcg atgcaaaggc
catcttgtct gctcgactct 840cgaagagcag acgactggag aacctcattg
cccaacttcc tggcgagaaa aagaacggac 900tgtttggcaa cctcattgcc
ctttctcttg gtctcacacc caacttcaag tccaacttcg 960atctggcgga
ggacgccaag ctccagctgt ccaaggacac ctacgacgat gacctcgaca
1020acctgcttgc acagattggc gatcagtacg ccgacctgtt tctcgctgcc
aagaaccttt 1080cggatgctat tctcttgtct gacattctgc gagtcaacac
cgagatcaca aaggctcccc 1140tttctgcctc catgatcaag cgatacgacg
agcaccatca ggatctcaca ctgctcaagg 1200ctcttgtccg acagcaactg
cccgagaagt acaaggagat ctttttcgat cagtcgaaga 1260acggctacgc
tggatacatc gacggcggag cctctcagga agagttctac aagttcatca
1320agccaattct cgagaagatg gacggaaccg aggaactgct tgtcaagctc
aatcgagagg 1380atctgcttcg gaagcaacga accttcgaca acggcagcat
tcctcatcag atccacctcg 1440gtgagctgca cgccattctt cgacgtcagg
aagacttcta cccctttctc aaggacaacc 1500gagagaagat cgagaagatt
cttacctttc gaatccccta ctatgttggt cctcttgcca 1560gaggaaactc
tcgatttgct tggatgactc gaaagtccga ggaaaccatc actccctgga
1620acttcgagga agtcgtggac aagggtgcct ctgcacagtc cttcatcgag
cgaatgacca 1680acttcgacaa gaatctgccc aacgagaagg ttcttcccaa
gcattcgctg ctctacgagt 1740actttacagt ctacaacgaa ctcaccaaag
tcaagtacgt taccgaggga atgcgaaagc 1800ctgccttctt gtctggcgaa
cagaagaaag ccattgtcga tctcctgttc aagaccaacc 1860gaaaggtcac
tgttaagcag ctcaaggagg actacttcaa gaaaatcgag tgtttcgaca
1920gcgtcgagat ttccggagtt gaggaccgat tcaacgcctc tttgggcacc
tatcacgatc 1980tgctcaagat tatcaaggac aaggattttc tcgacaacga
ggaaaacgag gacattctgg 2040aggacatcgt gctcactctt accctgttcg
aagatcggga gatgatcgag gaacgactca 2100agacatacgc tcacctgttc
gacgacaagg tcatgaaaca actcaagcga cgtagataca 2160ccggctgggg
aagactttcg cgaaagctca tcaacggcat cagagacaag cagtccggaa
2220agaccattct ggactttctc aagtccgatg gctttgccaa ccgaaacttc
atgcagctca 2280ttcacgacga ttctcttacc ttcaaggagg acatccagaa
ggcacaagtg tccggtcagg 2340gcgacagctt gcacgaacat attgccaacc
tggctggttc gccagccatc aagaaaggca 2400ttctccagac tgtcaaggtt
gtcgacgagc tggtgaaggt catgggacgt cacaagcccg 2460agaacattgt
gatcgagatg gccagagaga accagacaac tcaaaagggt cagaaaaact
2520cgcgagagcg gatgaagcga atcgaggaag gcatcaagga gctgggatcc
cagattctca 2580aggagcatcc cgtcgagaac actcaactgc agaacgagaa
gctgtatctc tactatctgc 2640agaatggtcg agacatgtac gtggatcagg
aactggacat caatcgtctc agcgactacg 2700atgtggacca cattgtccct
caatcctttc tcaaggacga ttctatcgac aacaaggtcc 2760ttacacgatc
cgacaagaac agaggcaagt cggacaacgt tcccagcgaa gaggtggtca
2820aaaagatgaa gaactactgg cgacagctgc tcaacgccaa gctcattacc
cagcgaaagt 2880tcgacaatct taccaaggcc gagcgaggcg gtctgtccga
gctcgacaag gctggcttca 2940tcaagcgtca actcgtcgag accagacaga
tcacaaagca cgtcgcacag attctcgatt 3000ctcggatgaa caccaagtac
gacgagaacg acaagctcat ccgagaggtc aaggtgatta 3060ctctcaagtc
caaactggtc tccgatttcc gaaaggactt tcagttctac aaggtgcgag
3120agatcaacaa ttaccaccat gcccacgatg cttacctcaa cgccgtcgtt
ggcactgcgc 3180tcatcaagaa ataccccaag ctcgaaagcg agttcgttta
cggcgattac aaggtctacg 3240acgttcgaaa gatgattgcc aagtccgaac
aggagattgg caaggctact gccaagtact 3300tcttttactc caacatcatg
aactttttca agaccgagat caccttggcc aacggagaga 3360ttcgaaagag
accacttatc gagaccaacg gcgaaactgg agagatcgtg tgggacaagg
3420gtcgagactt tgcaaccgtg cgaaaggttc tgtcgatgcc tcaggtcaac
atcgtcaaga 3480aaaccgaggt tcagactggc ggattctcca aggagtcgat
tctgcccaag cgaaactccg 3540acaagctcat cgctcgaaag aaagactggg
atcccaagaa atacggtggc ttcgattctc 3600ctaccgtcgc ctattccgtg
cttgtcgttg cgaaggtcga gaagggcaag tccaaaaagc 3660tcaagtccgt
caaggagctg ctcggaatta ccatcatgga gcgatcgagc ttcgagaaga
3720atcccatcga cttcttggaa gccaagggtt acaaggaggt caagaaagac
ctcattatca 3780agctgcccaa gtactctctg ttcgaactgg agaacggtcg
aaagcgtatg ctcgcctccg 3840ctggcgagct gcagaaggga aacgagcttg
ccttgccttc gaagtacgtc aactttctct 3900atctggcttc tcactacgag
aagctcaagg gttctcccga ggacaacgaa cagaagcaac 3960tcttcgttga
gcagcacaaa cattacctcg acgagattat cgagcagatt tccgagtttt
4020cgaagcgagt catcctggct gatgccaact tggacaaggt gctctctgcc
tacaacaagc 4080atcgggacaa acccattcga gaacaggcgg agaacatcat
tcacctgttt actcttacca 4140acctgggtgc tcctgcagct ttcaagtact
tcgataccac tatcgaccga aagcggtaca 4200catccaccaa ggaggttctc
gatgccaccc tgattcacca gtccatcact ggcctgtacg 4260agacccgaat
cgacctgtct cagcttggtg gcgactccag agccgatccc aagaaaaagc
4320gaaaggtcta agcggccgct aagcttggct gttttggcgg atgagagaag
attttcagcc 4380tgatacagat taaatcagaa cgcagaagcg gtctgataaa
acagaatttg cctggcggca 4440gtagcgcggt ggtcccacct gaccccatgc
cgaactcaga agtgaaacgc cgtagcgccg 4500atggtagtgt ggggtctccc
catgcgagag tagggaactg ccaggcatca aataaaacga 4560aaggctcagt
cgaaagactg ggcctttcgt tttatctgtt gtttgtcggt gaacgctctc
4620ctgagtagga caaatccgcc gggagcggat ttgaacgttg cgaagcaacg
gcccggaggg 4680tggcgggcag gacgcccgcc ataaactgcc aggcatcaaa
ttaagcagaa ggccatcctg 4740acggatggcc tttttgcgtt tctacaaact
cttttgttta tttttctaaa tacattcaaa 4800tatgtatccg ctcatgagac
aataaccctg ataaatgctt caataatatt gaaaaaggaa 4860gagtatgagt
attcaacatt tccgtgtcgc ccttattccc ttttttgcgg cattttgcct
4920tcctgttttt gctcacccag aaacgctggt gaaagtaaaa gatgctgaag
atcagttggg 4980tgcacgagtg ggttacatcg aactggatct caacagcggt
aagatccttg agagttttcg 5040ccccgaagaa cgttttccaa tgatgagcac
ttttaaagtt ctgctatgtg gcgcggtatt 5100atcccgtgtt gacgccgggc
aagagcaact cggtcgccgc atacactatt ctcagaatga 5160cttggttgag
tactcaccag tcacagaaaa gcatcttacg gatggcatga cagtaagaga
5220attatgcagt gctgccataa ccatgagtga taacactgcg gccaacttac
ttctgacaac 5280gatcggagga ccgaaggagc taaccgcttt tttgcacaac
atgggggatc atgtaactcg 5340ccttgatcgt tgggaaccgg agctgaatga
agccatacca aacgacgagc gtgacaccac 5400gatgcctgta gcaatggcaa
caacgttgcg caaactatta actggcgaac tacttactct 5460agcttcccgg
caacaattaa tagactggat ggaggcggat aaagttgcag gaccacttct
5520gcgctcggcc cttccggctg gctggtttat tgctgataaa tctggagccg
gtgagcgtgg 5580gtctcgcggt atcattgcag cactggggcc agatggtaag
ccctcccgta tcgtagttat 5640ctacacgacg gggagtcagg caactatgga
tgaacgaaat agacagatcg ctgagatagg 5700tgcctcactg attaagcatt
ggtaactgtc agaccaagtt tactcatata tactttagat 5760tgatttaaaa
cttcattttt aatttaaaag gatctaggtg aagatccttt ttgataatct
5820catgaccaaa atcccttaac gtgagttttc gttccactga gcgtcagacc
ccgtagaaaa 5880gatcaaagga tcttcttgag atcctttttt tctgcgcgta
atctgctgct tgcaaacaaa 5940aaaaccaccg ctaccagcgg tggtttgttt
gccggatcaa gagctaccaa ctctttttcc 6000gaaggtaact ggcttcagca
gagcgcagat accaaatact gtccttctag tgtagccgta 6060gttaggccac
cacttcaaga actctgtagc accgcctaca tacctcgctc tgctaatcct
6120gttaccagtg gctgctgcca gtggcgataa gtcgtgtctt accgggttgg
actcaagacg 6180atagttaccg gataaggcgc agcggtcggg ctgaacgggg
ggttcgtgca cacagcccag 6240cttggagcga acgacctaca ccgaactgag
atacctacag cgtgagctat gagaaagcgc 6300cacgcttccc gaagggagaa
aggcggacag gtatccggta agcggcaggg tcggaacagg 6360agagcgcacg
agggagcttc cagggggaaa cgcctggtat ctttatagtc ctgtcgggtt
6420tcgccacctc tgacttgagc gtcgattttt gtgatgctcg tcaggggggc
ggagcctatg 6480gaaaaacgcc agcaacgcgg cctttttacg gttcctggcc
ttttgctggc cttttgctca 6540catgttcttt cctgcgttat cccctgattc
tgtggataac cgtattaccg cctttgagtg 6600agctgatacc gctcgccgca
gccgaacgac cgagcgcagc gagtcagtga gcgaggaagc 6660ggaagagcgc
ctgatgcggt attttctcct tacgcatctg tgcggtattt cacaccgcat
6720atggtgcact ctcagtacaa tctgctctga tgccgcatag ttaagccagt
atacactccg 6780ctatcgctac gtgactgggt catggctgcg ccccgacacc
cgccaacacc cgctgacgcg 6840ccctgacggg cttgtctgct cccggcatcc
gcttacagac aagctgtgac cgtctccggg 6900agctgcatgt gtcagaggtt
ttcaccgtca tcaccgaaac gcgcgaggca gcagatcaat 6960tcgcgcgcga
aggcgaagcg gcatgcataa tgtgcctgtc aaatggacga agcagggatt
7020ctgcaaaccc tatgctactc cgtcaagccg tcaattgtct gattcgttac
caattatgac 7080aacttgacgg ctacatcatt cactttttct tcacaaccgg
cacggaactc gctcgggctg 7140gccccggtgc attttttaaa tacccgcgag
aaatagagtt gatcgtcaaa accaacattg 7200cgaccgacgg tggcgatagg
catccgggtg gtgctcaaaa gcagcttcgc ctggctgata 7260cgttggtcct
cgcgccagct taagacgcta atccctaact gctggcggaa aagatgtgac
7320agacgcgacg gcgacaagca aacatgctgt gcgacgctgg cgatatcaaa
attgctgtct 7380gccaggtgat cgctgatgta ctgacaagcc tcgcgtaccc
gattatccat cggtggatgg 7440agcgactcgt taatcgcttc catgcgccgc
agtaacaatt gctcaagcag atttatcgcc 7500agcagctccg aatagcgccc
ttccccttgc ccggcgttaa tgatttgccc aaacaggtcg 7560ctgaaatgcg
gctggtgcgc ttcatccggg cgaaagaacc ccgtattggc aaatattgac
7620ggccagttaa gccattcatg ccagtaggcg cgcggacgaa agtaaaccca
ctggtgatac 7680cattcgcgag cctccggatg acgaccgtag tgatgaatct
ctcctggcgg gaacagcaaa 7740atatcacccg gtcggcaaac aaattctcgt
ccctgatttt tcaccacccc ctgaccgcga 7800atggtgagat tgagaatata
acctttcatt cccagcggtc ggtcgataaa aaaatcgaga 7860taaccgttgg
cctcaatcgg cgttaaaccc gccaccagat gggcattaaa cgagtatccc
7920ggcagcaggg gatcattttg cgcttcagcc atacttttca tactcccgcc
attcagagaa 7980gaaaccaatt gtccatattg catcagacat tgccgtcact
gcgtctttta ctggctcttc 8040tcgctaacca aaccggtaac cccgcttatt
aaaagcattc tgtaacaaag cgggaccaaa 8100gccatgacaa aaacgcgtaa
caaaagtgtc tataatcacg gcagaaaagt ccacattgat 8160tatttgcacg
gcgtcacact ttgctatgcc atagcatttt tatccataag attagcggat
8220cctacctgac gctttttatc gcaactctct actgtttctc catacccgtt
ttttgggcta 8280acaggaggaa ttaa 8294

SEQ ID NO: 21 | Length: 8183 | Type: DNA | Organism: artificial sequence | Other information: pRF145 plasmid
aattcgacaa gaaatactcc atcggcctgg
acattggaac caactctgtc ggctgggctg 60tcatcaccga cgagtacaag gtgccctcca
agaaattcaa ggtcctcgga aacaccgatc 120gacactccat caagaaaaac
ctcattggtg ccctgttgtt
cgattctggc gagactgccg 180aagctaccag actcaagcga actgctcggc
gacgttacac ccgacggaag aaccgaatct 240gctacctgca ggagatcttt
tccaacgaga tggccaaggt ggacgattcg ttctttcatc 300gactggagga
atccttcctc gtcgaggaag acaagaaaca cgagcgtcat cccatctttg
360gcaacattgt ggacgaggtt gcttaccacg agaagtatcc taccatctac
cacctgcgaa 420agaaactcgt cgattccacc gacaaggcgg atctcagact
tatctacctc gctctggcac 480acatgatcaa gtttcgaggt catttcctca
tcgagggcga tctcaatccc gacaacagcg 540atgtggacaa gctgttcatt
cagctcgttc agacctacaa ccagctgttc gaggaaaacc 600ccatcaatgc
ctccggagtc gatgcaaagg ccatcttgtc tgctcgactc tcgaagagca
660gacgactgga gaacctcatt gcccaacttc ctggcgagaa aaagaacgga
ctgtttggca 720acctcattgc cctttctctt ggtctcacac ccaacttcaa
gtccaacttc gatctggcgg 780aggacgccaa gctccagctg tccaaggaca
cctacgacga tgacctcgac aacctgcttg 840cacagattgg cgatcagtac
gccgacctgt ttctcgctgc caagaacctt tcggatgcta 900ttctcttgtc
tgacattctg cgagtcaaca ccgagatcac aaaggctccc ctttctgcct
960ccatgatcaa gcgatacgac gagcaccatc aggatctcac actgctcaag
gctcttgtcc 1020gacagcaact gcccgagaag tacaaggaga tctttttcga
tcagtcgaag aacggctacg 1080ctggatacat cgacggcgga gcctctcagg
aagagttcta caagttcatc aagccaattc 1140tcgagaagat ggacggaacc
gaggaactgc ttgtcaagct caatcgagag gatctgcttc 1200ggaagcaacg
aaccttcgac aacggcagca ttcctcatca gatccacctc ggtgagctgc
1260acgccattct tcgacgtcag gaagacttct acccctttct caaggacaac
cgagagaaga 1320tcgagaagat tcttaccttt cgaatcccct actatgttgg
tcctcttgcc agaggaaact 1380ctcgatttgc ttggatgact cgaaagtccg
aggaaaccat cactccctgg aacttcgagg 1440aagtcgtgga caagggtgcc
tctgcacagt ccttcatcga gcgaatgacc aacttcgaca 1500agaatctgcc
caacgagaag gttcttccca agcattcgct gctctacgag tactttacag
1560tctacaacga actcaccaaa gtcaagtacg ttaccgaggg aatgcgaaag
cctgccttct 1620tgtctggcga acagaagaaa gccattgtcg atctcctgtt
caagaccaac cgaaaggtca 1680ctgttaagca gctcaaggag gactacttca
agaaaatcga gtgtttcgac agcgtcgaga 1740tttccggagt tgaggaccga
ttcaacgcct ctttgggcac ctatcacgat ctgctcaaga 1800ttatcaagga
caaggatttt ctcgacaacg aggaaaacga ggacattctg gaggacatcg
1860tgctcactct taccctgttc gaagatcggg agatgatcga ggaacgactc
aagacatacg 1920ctcacctgtt cgacgacaag gtcatgaaac aactcaagcg
acgtagatac accggctggg 1980gaagactttc gcgaaagctc atcaacggca
tcagagacaa gcagtccgga aagaccattc 2040tggactttct caagtccgat
ggctttgcca accgaaactt catgcagctc attcacgacg 2100attctcttac
cttcaaggag gacatccaga aggcacaagt gtccggtcag ggcgacagct
2160tgcacgaaca tattgccaac ctggctggtt cgccagccat caagaaaggc
attctccaga 2220ctgtcaaggt tgtcgacgag ctggtgaagg tcatgggacg
tcacaagccc gagaacattg 2280tgatcgagat ggccagagag aaccagacaa
ctcaaaaggg tcagaaaaac tcgcgagagc 2340ggatgaagcg aatcgaggaa
ggcatcaagg agctgggatc ccagattctc aaggagcatc 2400ccgtcgagaa
cactcaactg cagaacgaga agctgtatct ctactatctg cagaatggtc
2460gagacatgta cgtggatcag gaactggaca tcaatcgtct cagcgactac
gatgtggacc 2520acattgtccc tcaatccttt ctcaaggacg attctatcga
caacaaggtc cttacacgat 2580ccgacaagaa cagaggcaag tcggacaacg
ttcccagcga agaggtggtc aaaaagatga 2640agaactactg gcgacagctg
ctcaacgcca agctcattac ccagcgaaag ttcgacaatc 2700ttaccaaggc
cgagcgaggc ggtctgtccg agctcgacaa ggctggcttc atcaagcgtc
2760aactcgtcga gaccagacag atcacaaagc acgtcgcaca gattctcgat
tctcggatga 2820acaccaagta cgacgagaac gacaagctca tccgagaggt
caaggtgatt actctcaagt 2880ccaaactggt ctccgatttc cgaaaggact
ttcagttcta caaggtgcga gagatcaaca 2940attaccacca tgcccacgat
gcttacctca acgccgtcgt tggcactgcg ctcatcaaga 3000aataccccaa
gctcgaaagc gagttcgttt acggcgatta caaggtctac gacgttcgaa
3060agatgattgc caagtccgaa caggagattg gcaaggctac tgccaagtac
ttcttttact 3120ccaacatcat gaactttttc aagaccgaga tcaccttggc
caacggagag attcgaaaga 3180gaccacttat cgagaccaac ggcgaaactg
gagagatcgt gtgggacaag ggtcgagact 3240ttgcaaccgt gcgaaaggtt
ctgtcgatgc ctcaggtcaa catcgtcaag aaaaccgagg 3300ttcagactgg
cggattctcc aaggagtcga ttctgcccaa gcgaaactcc gacaagctca
3360tcgctcgaaa gaaagactgg gatcccaaga aatacggtgg cttcgattct
cctaccgtcg 3420cctattccgt gcttgtcgtt gcgaaggtcg agaagggcaa
gtccaaaaag ctcaagtccg 3480tcaaggagct gctcggaatt accatcatgg
agcgatcgag cttcgagaag aatcccatcg 3540acttcttgga agccaagggt
tacaaggagg tcaagaaaga cctcattatc aagctgccca 3600agtactctct
gttcgaactg gagaacggtc gaaagcgtat gctcgcctcc gctggcgagc
3660tgcagaaggg aaacgagctt gccttgcctt cgaagtacgt caactttctc
tatctggctt 3720ctcactacga gaagctcaag ggttctcccg aggacaacga
acagaagcaa ctcttcgttg 3780agcagcacaa acattacctc gacgagatta
tcgagcagat ttccgagttt tcgaagcgag 3840tcatcctggc tgatgccaac
ttggacaagg tgctctctgc ctacaacaag catcgggaca 3900aacccattcg
agaacaggcg gagaacatca ttcacctgtt tactcttacc aacctgggtg
3960ctcctgcagc tttcaagtac ttcgatacca ctatcgaccg aaagcggtac
acatccacca 4020aggaggttct cgatgccacc ctgattcacc agtccatcac
tggcctgtac gagacccgaa 4080tcgacctgtc tcagcttggt ggcgactcca
gagccgatcc caagaaaaag cgaaaggtct 4140aagcggccgc taagcttggc
tgttttggcg gatgagagaa gattttcagc ctgatacaga 4200ttaaatcaga
acgcagaagc ggtctgataa aacagaattt gcctggcggc agtagcgcgg
4260tggtcccacc tgaccccatg ccgaactcag aagtgaaacg ccgtagcgcc
gatggtagtg 4320tggggtctcc ccatgcgaga gtagggaact gccaggcatc
aaataaaacg aaaggctcag 4380tcgaaagact gggcctttcg ttttatctgt
tgtttgtcgg tgaacgctct cctgagtagg 4440acaaatccgc cgggagcgga
tttgaacgtt gcgaagcaac ggcccggagg gtggcgggca 4500ggacgcccgc
cataaactgc caggcatcaa attaagcaga aggccatcct gacggatggc
4560ctttttgcgt ttctacaaac tcttttgttt atttttctaa atacattcaa
atatgtatcc 4620gctcatgaga caataaccct gataaatgct tcaataatat
tgaaaaagga agagtatgag 4680tattcaacat ttccgtgtcg cccttattcc
cttttttgcg gcattttgcc ttcctgtttt 4740tgctcaccca gaaacgctgg
tgaaagtaaa agatgctgaa gatcagttgg gtgcacgagt 4800gggttacatc
gaactggatc tcaacagcgg taagatcctt gagagttttc gccccgaaga
4860acgttttcca atgatgagca cttttaaagt tctgctatgt ggcgcggtat
tatcccgtgt 4920tgacgccggg caagagcaac tcggtcgccg catacactat
tctcagaatg acttggttga 4980gtactcacca gtcacagaaa agcatcttac
ggatggcatg acagtaagag aattatgcag 5040tgctgccata accatgagtg
ataacactgc ggccaactta cttctgacaa cgatcggagg 5100accgaaggag
ctaaccgctt ttttgcacaa catgggggat catgtaactc gccttgatcg
5160ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca
cgatgcctgt 5220agcaatggca acaacgttgc gcaaactatt aactggcgaa
ctacttactc tagcttcccg 5280gcaacaatta atagactgga tggaggcgga
taaagttgca ggaccacttc tgcgctcggc 5340ccttccggct ggctggttta
ttgctgataa atctggagcc ggtgagcgtg ggtctcgcgg 5400tatcattgca
gcactggggc cagatggtaa gccctcccgt atcgtagtta tctacacgac
5460ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag
gtgcctcact 5520gattaagcat tggtaactgt cagaccaagt ttactcatat
atactttaga ttgatttaaa 5580acttcatttt taatttaaaa ggatctaggt
gaagatcctt tttgataatc tcatgaccaa 5640aatcccttaa cgtgagtttt
cgttccactg agcgtcagac cccgtagaaa agatcaaagg 5700atcttcttga
gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc
5760gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc
cgaaggtaac 5820tggcttcagc agagcgcaga taccaaatac tgtccttcta
gtgtagccgt agttaggcca 5880ccacttcaag aactctgtag caccgcctac
atacctcgct ctgctaatcc tgttaccagt 5940ggctgctgcc agtggcgata
agtcgtgtct taccgggttg gactcaagac gatagttacc 6000ggataaggcg
cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg
6060aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg
ccacgcttcc 6120cgaagggaga aaggcggaca ggtatccggt aagcggcagg
gtcggaacag gagagcgcac 6180gagggagctt ccagggggaa acgcctggta
tctttatagt cctgtcgggt ttcgccacct 6240ctgacttgag cgtcgatttt
tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc 6300cagcaacgcg
gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt
6360tcctgcgtta tcccctgatt ctgtggataa ccgtattacc gcctttgagt
gagctgatac 6420cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg
agcgaggaag cggaagagcg 6480cctgatgcgg tattttctcc ttacgcatct
gtgcggtatt tcacaccgca tatggtgcac 6540tctcagtaca atctgctctg
atgccgcata gttaagccag tatacactcc gctatcgcta 6600cgtgactggg
tcatggctgc gccccgacac ccgccaacac ccgctgacgc gccctgacgg
6660gcttgtctgc tcccggcatc cgcttacaga caagctgtga ccgtctccgg
gagctgcatg 6720tgtcagaggt tttcaccgtc atcaccgaaa cgcgcgaggc
agcagatcaa ttcgcgcgcg 6780aaggcgaagc ggcatgcata atgtgcctgt
caaatggacg aagcagggat tctgcaaacc 6840ctatgctact ccgtcaagcc
gtcaattgtc tgattcgtta ccaattatga caacttgacg 6900gctacatcat
tcactttttc ttcacaaccg gcacggaact cgctcgggct ggccccggtg
6960cattttttaa atacccgcga gaaatagagt tgatcgtcaa aaccaacatt
gcgaccgacg 7020gtggcgatag gcatccgggt ggtgctcaaa agcagcttcg
cctggctgat acgttggtcc 7080tcgcgccagc ttaagacgct aatccctaac
tgctggcgga aaagatgtga cagacgcgac 7140ggcgacaagc aaacatgctg
tgcgacgctg gcgatatcaa aattgctgtc tgccaggtga 7200tcgctgatgt
actgacaagc ctcgcgtacc cgattatcca tcggtggatg gagcgactcg
7260ttaatcgctt ccatgcgccg cagtaacaat tgctcaagca gatttatcgc
cagcagctcc 7320gaatagcgcc cttccccttg cccggcgtta atgatttgcc
caaacaggtc gctgaaatgc 7380ggctggtgcg cttcatccgg gcgaaagaac
cccgtattgg caaatattga cggccagtta 7440agccattcat gccagtaggc
gcgcggacga aagtaaaccc actggtgata ccattcgcga 7500gcctccggat
gacgaccgta gtgatgaatc tctcctggcg ggaacagcaa aatatcaccc
7560ggtcggcaaa caaattctcg tccctgattt ttcaccaccc cctgaccgcg
aatggtgaga 7620ttgagaatat aacctttcat tcccagcggt cggtcgataa
aaaaatcgag ataaccgttg 7680gcctcaatcg gcgttaaacc cgccaccaga
tgggcattaa acgagtatcc cggcagcagg 7740ggatcatttt gcgcttcagc
catacttttc atactcccgc cattcagaga agaaaccaat 7800tgtccatatt
gcatcagaca ttgccgtcac tgcgtctttt actggctctt ctcgctaacc
7860aaaccggtaa ccccgcttat taaaagcatt ctgtaacaaa gcgggaccaa
agccatgaca 7920aaaacgcgta acaaaagtgt ctataatcac ggcagaaaag
tccacattga ttatttgcac 7980ggcgtcacac tttgctatgc catagcattt
ttatccataa gattagcgga tcctacctga 8040cgctttttat cgcaactctc
tactgtttct ccatacccgt tttttgggct aacaggagga 8100attaaccatg
gggcatcacc accatcacca cggcgggggt ggtcgtcgtc gccgtcgccg
8160ccgtcgtcgc ctcctgctgc tgg 8183

SEQ ID NO: 22 | Length: 8195 | Type: DNA | Organism: artificial sequence | Other information: pRF146 plasmid
aattcgacaa gaaatactcc atcggcctgg
acattggaac caactctgtc ggctgggctg 60tcatcaccga cgagtacaag gtgccctcca
agaaattcaa ggtcctcgga aacaccgatc 120gacactccat caagaaaaac
ctcattggtg ccctgttgtt cgattctggc gagactgccg 180aagctaccag
actcaagcga actgctcggc gacgttacac ccgacggaag aaccgaatct
240gctacctgca ggagatcttt tccaacgaga tggccaaggt ggacgattcg
ttctttcatc 300gactggagga atccttcctc gtcgaggaag acaagaaaca
cgagcgtcat cccatctttg 360gcaacattgt ggacgaggtt gcttaccacg
agaagtatcc taccatctac cacctgcgaa 420agaaactcgt cgattccacc
gacaaggcgg atctcagact tatctacctc gctctggcac 480acatgatcaa
gtttcgaggt catttcctca tcgagggcga tctcaatccc gacaacagcg
540atgtggacaa gctgttcatt cagctcgttc agacctacaa ccagctgttc
gaggaaaacc 600ccatcaatgc ctccggagtc gatgcaaagg ccatcttgtc
tgctcgactc tcgaagagca 660gacgactgga gaacctcatt gcccaacttc
ctggcgagaa aaagaacgga ctgtttggca 720acctcattgc cctttctctt
ggtctcacac ccaacttcaa gtccaacttc gatctggcgg 780aggacgccaa
gctccagctg tccaaggaca cctacgacga tgacctcgac aacctgcttg
840cacagattgg cgatcagtac gccgacctgt ttctcgctgc caagaacctt
tcggatgcta 900ttctcttgtc tgacattctg cgagtcaaca ccgagatcac
aaaggctccc ctttctgcct 960ccatgatcaa gcgatacgac gagcaccatc
aggatctcac actgctcaag gctcttgtcc 1020gacagcaact gcccgagaag
tacaaggaga tctttttcga tcagtcgaag aacggctacg 1080ctggatacat
cgacggcgga gcctctcagg aagagttcta caagttcatc aagccaattc
1140tcgagaagat ggacggaacc gaggaactgc ttgtcaagct caatcgagag
gatctgcttc 1200ggaagcaacg aaccttcgac aacggcagca ttcctcatca
gatccacctc ggtgagctgc 1260acgccattct tcgacgtcag gaagacttct
acccctttct caaggacaac cgagagaaga 1320tcgagaagat tcttaccttt
cgaatcccct actatgttgg tcctcttgcc agaggaaact 1380ctcgatttgc
ttggatgact cgaaagtccg aggaaaccat cactccctgg aacttcgagg
1440aagtcgtgga caagggtgcc tctgcacagt ccttcatcga gcgaatgacc
aacttcgaca 1500agaatctgcc caacgagaag gttcttccca agcattcgct
gctctacgag tactttacag 1560tctacaacga actcaccaaa gtcaagtacg
ttaccgaggg aatgcgaaag cctgccttct 1620tgtctggcga acagaagaaa
gccattgtcg atctcctgtt caagaccaac cgaaaggtca 1680ctgttaagca
gctcaaggag gactacttca agaaaatcga gtgtttcgac agcgtcgaga
1740tttccggagt tgaggaccga ttcaacgcct ctttgggcac ctatcacgat
ctgctcaaga 1800ttatcaagga caaggatttt ctcgacaacg aggaaaacga
ggacattctg gaggacatcg 1860tgctcactct taccctgttc gaagatcggg
agatgatcga ggaacgactc aagacatacg 1920ctcacctgtt cgacgacaag
gtcatgaaac aactcaagcg acgtagatac accggctggg 1980gaagactttc
gcgaaagctc atcaacggca tcagagacaa gcagtccgga aagaccattc
2040tggactttct caagtccgat ggctttgcca accgaaactt catgcagctc
attcacgacg 2100attctcttac cttcaaggag gacatccaga aggcacaagt
gtccggtcag ggcgacagct 2160tgcacgaaca tattgccaac ctggctggtt
cgccagccat caagaaaggc attctccaga 2220ctgtcaaggt tgtcgacgag
ctggtgaagg tcatgggacg tcacaagccc gagaacattg 2280tgatcgagat
ggccagagag aaccagacaa ctcaaaaggg tcagaaaaac tcgcgagagc
2340ggatgaagcg aatcgaggaa ggcatcaagg agctgggatc ccagattctc
aaggagcatc 2400ccgtcgagaa cactcaactg cagaacgaga agctgtatct
ctactatctg cagaatggtc 2460gagacatgta cgtggatcag gaactggaca
tcaatcgtct cagcgactac gatgtggacc 2520acattgtccc tcaatccttt
ctcaaggacg attctatcga caacaaggtc cttacacgat 2580ccgacaagaa
cagaggcaag tcggacaacg ttcccagcga agaggtggtc aaaaagatga
2640agaactactg gcgacagctg ctcaacgcca agctcattac ccagcgaaag
ttcgacaatc 2700ttaccaaggc cgagcgaggc ggtctgtccg agctcgacaa
ggctggcttc atcaagcgtc 2760aactcgtcga gaccagacag atcacaaagc
acgtcgcaca gattctcgat tctcggatga 2820acaccaagta cgacgagaac
gacaagctca tccgagaggt caaggtgatt actctcaagt 2880ccaaactggt
ctccgatttc cgaaaggact ttcagttcta caaggtgcga gagatcaaca
2940attaccacca tgcccacgat gcttacctca acgccgtcgt tggcactgcg
ctcatcaaga 3000aataccccaa gctcgaaagc gagttcgttt acggcgatta
caaggtctac gacgttcgaa 3060agatgattgc caagtccgaa caggagattg
gcaaggctac tgccaagtac ttcttttact 3120ccaacatcat gaactttttc
aagaccgaga tcaccttggc caacggagag attcgaaaga 3180gaccacttat
cgagaccaac ggcgaaactg gagagatcgt gtgggacaag ggtcgagact
3240ttgcaaccgt gcgaaaggtt ctgtcgatgc ctcaggtcaa catcgtcaag
aaaaccgagg 3300ttcagactgg cggattctcc aaggagtcga ttctgcccaa
gcgaaactcc gacaagctca 3360tcgctcgaaa gaaagactgg gatcccaaga
aatacggtgg cttcgattct cctaccgtcg 3420cctattccgt gcttgtcgtt
gcgaaggtcg agaagggcaa gtccaaaaag ctcaagtccg 3480tcaaggagct
gctcggaatt accatcatgg agcgatcgag cttcgagaag aatcccatcg
3540acttcttgga agccaagggt tacaaggagg tcaagaaaga cctcattatc
aagctgccca 3600agtactctct gttcgaactg gagaacggtc gaaagcgtat
gctcgcctcc gctggcgagc 3660tgcagaaggg aaacgagctt gccttgcctt
cgaagtacgt caactttctc tatctggctt 3720ctcactacga gaagctcaag
ggttctcccg aggacaacga acagaagcaa ctcttcgttg 3780agcagcacaa
acattacctc gacgagatta tcgagcagat ttccgagttt tcgaagcgag
3840tcatcctggc tgatgccaac ttggacaagg tgctctctgc ctacaacaag
catcgggaca 3900aacccattcg agaacaggcg gagaacatca ttcacctgtt
tactcttacc aacctgggtg 3960ctcctgcagc tttcaagtac ttcgatacca
ctatcgaccg aaagcggtac acatccacca 4020aggaggttct cgatgccacc
ctgattcacc agtccatcac tggcctgtac gagacccgaa 4080tcgacctgtc
tcagcttggt ggcgactcca gagccgatcc caagaaaaag cgaaaggtct
4140aagcggccgc taagcttggc tgttttggcg gatgagagaa gattttcagc
ctgatacaga 4200ttaaatcaga acgcagaagc ggtctgataa aacagaattt
gcctggcggc agtagcgcgg 4260tggtcccacc tgaccccatg ccgaactcag
aagtgaaacg ccgtagcgcc gatggtagtg 4320tggggtctcc ccatgcgaga
gtagggaact gccaggcatc aaataaaacg aaaggctcag 4380tcgaaagact
gggcctttcg ttttatctgt tgtttgtcgg tgaacgctct cctgagtagg
4440acaaatccgc cgggagcgga tttgaacgtt gcgaagcaac ggcccggagg
gtggcgggca 4500ggacgcccgc cataaactgc caggcatcaa attaagcaga
aggccatcct gacggatggc 4560ctttttgcgt ttctacaaac tcttttgttt
atttttctaa atacattcaa atatgtatcc 4620gctcatgaga caataaccct
gataaatgct tcaataatat tgaaaaagga agagtatgag 4680tattcaacat
ttccgtgtcg cccttattcc cttttttgcg gcattttgcc ttcctgtttt
4740tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg
gtgcacgagt 4800gggttacatc gaactggatc tcaacagcgg taagatcctt
gagagttttc gccccgaaga 4860acgttttcca atgatgagca cttttaaagt
tctgctatgt ggcgcggtat tatcccgtgt 4920tgacgccggg caagagcaac
tcggtcgccg catacactat tctcagaatg acttggttga 4980gtactcacca
gtcacagaaa agcatcttac ggatggcatg acagtaagag aattatgcag
5040tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa
cgatcggagg 5100accgaaggag ctaaccgctt ttttgcacaa catgggggat
catgtaactc gccttgatcg 5160ttgggaaccg gagctgaatg aagccatacc
aaacgacgag cgtgacacca cgatgcctgt 5220agcaatggca acaacgttgc
gcaaactatt aactggcgaa ctacttactc tagcttcccg 5280gcaacaatta
atagactgga tggaggcgga taaagttgca ggaccacttc tgcgctcggc
5340ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg
ggtctcgcgg 5400tatcattgca gcactggggc cagatggtaa gccctcccgt
atcgtagtta tctacacgac 5460ggggagtcag gcaactatgg atgaacgaaa
tagacagatc gctgagatag gtgcctcact 5520gattaagcat tggtaactgt
cagaccaagt ttactcatat atactttaga ttgatttaaa 5580acttcatttt
taatttaaaa ggatctaggt gaagatcctt tttgataatc tcatgaccaa
5640aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa
agatcaaagg 5700atcttcttga gatccttttt ttctgcgcgt aatctgctgc
ttgcaaacaa aaaaaccacc 5760gctaccagcg gtggtttgtt tgccggatca
agagctacca actctttttc cgaaggtaac 5820tggcttcagc agagcgcaga
taccaaatac tgtccttcta gtgtagccgt agttaggcca 5880ccacttcaag
aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt
5940ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac
gatagttacc 6000ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc
acacagccca gcttggagcg 6060aacgacctac accgaactga gatacctaca
gcgtgagcta tgagaaagcg ccacgcttcc 6120cgaagggaga aaggcggaca
ggtatccggt aagcggcagg gtcggaacag gagagcgcac 6180gagggagctt
ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct
6240ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat
ggaaaaacgc 6300cagcaacgcg gcctttttac ggttcctggc cttttgctgg
ccttttgctc acatgttctt 6360tcctgcgtta tcccctgatt ctgtggataa
ccgtattacc gcctttgagt gagctgatac 6420cgctcgccgc agccgaacga
ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg 6480cctgatgcgg
tattttctcc ttacgcatct gtgcggtatt tcacaccgca tatggtgcac
6540tctcagtaca atctgctctg atgccgcata gttaagccag tatacactcc
gctatcgcta 6600cgtgactggg tcatggctgc gccccgacac ccgccaacac
ccgctgacgc gccctgacgg 6660gcttgtctgc tcccggcatc cgcttacaga
caagctgtga ccgtctccgg gagctgcatg 6720tgtcagaggt tttcaccgtc
atcaccgaaa cgcgcgaggc agcagatcaa ttcgcgcgcg 6780aaggcgaagc
ggcatgcata atgtgcctgt caaatggacg aagcagggat tctgcaaacc
6840ctatgctact ccgtcaagcc gtcaattgtc tgattcgtta ccaattatga
caacttgacg 6900gctacatcat tcactttttc ttcacaaccg gcacggaact
cgctcgggct ggccccggtg
6960cattttttaa atacccgcga gaaatagagt tgatcgtcaa aaccaacatt
gcgaccgacg 7020gtggcgatag gcatccgggt ggtgctcaaa agcagcttcg
cctggctgat acgttggtcc 7080tcgcgccagc ttaagacgct aatccctaac
tgctggcgga aaagatgtga cagacgcgac 7140ggcgacaagc aaacatgctg
tgcgacgctg gcgatatcaa aattgctgtc tgccaggtga 7200tcgctgatgt
actgacaagc ctcgcgtacc cgattatcca tcggtggatg gagcgactcg
7260ttaatcgctt ccatgcgccg cagtaacaat tgctcaagca gatttatcgc
cagcagctcc 7320gaatagcgcc cttccccttg cccggcgtta atgatttgcc
caaacaggtc gctgaaatgc 7380ggctggtgcg cttcatccgg gcgaaagaac
cccgtattgg caaatattga cggccagtta 7440agccattcat gccagtaggc
gcgcggacga aagtaaaccc actggtgata ccattcgcga 7500gcctccggat
gacgaccgta gtgatgaatc tctcctggcg ggaacagcaa aatatcaccc
7560ggtcggcaaa caaattctcg tccctgattt ttcaccaccc cctgaccgcg
aatggtgaga 7620ttgagaatat aacctttcat tcccagcggt cggtcgataa
aaaaatcgag ataaccgttg 7680gcctcaatcg gcgttaaacc cgccaccaga
tgggcattaa acgagtatcc cggcagcagg 7740ggatcatttt gcgcttcagc
catacttttc atactcccgc cattcagaga agaaaccaat 7800tgtccatatt
gcatcagaca ttgccgtcac tgcgtctttt actggctctt ctcgctaacc
7860aaaccggtaa ccccgcttat taaaagcatt ctgtaacaaa gcgggaccaa
agccatgaca 7920aaaacgcgta acaaaagtgt ctataatcac ggcagaaaag
tccacattga ttatttgcac 7980ggcgtcacac tttgctatgc catagcattt
ttatccataa gattagcgga tcctacctga 8040cgctttttat cgcaactctc
tactgtttct ccatacccgt tttttgggct aacaggagga 8100attaaccatg
gggcatcacc accatcacca cgcgggttac ctgctgggca agattaatct
8160taaagcctgc gccgcgtgtg ctaagaaaat tttgg 8195

SEQ ID NO: 23 | Length: 8186 | Type: DNA | Organism: artificial sequence | Other information: pRF162 plasmid
aattcgacaa gaaatactcc atcggcctgg
acattggaac caactctgtc ggctgggctg 60tcatcaccga cgagtacaag gtgccctcca
agaaattcaa ggtcctcgga aacaccgatc 120gacactccat caagaaaaac
ctcattggtg ccctgttgtt cgattctggc gagactgccg 180aagctaccag
actcaagcga actgctcggc gacgttacac ccgacggaag aaccgaatct
240gctacctgca ggagatcttt tccaacgaga tggccaaggt ggacgattcg
ttctttcatc 300gactggagga atccttcctc gtcgaggaag acaagaaaca
cgagcgtcat cccatctttg 360gcaacattgt ggacgaggtt gcttaccacg
agaagtatcc taccatctac cacctgcgaa 420agaaactcgt cgattccacc
gacaaggcgg atctcagact tatctacctc gctctggcac 480acatgatcaa
gtttcgaggt catttcctca tcgagggcga tctcaatccc gacaacagcg
540atgtggacaa gctgttcatt cagctcgttc agacctacaa ccagctgttc
gaggaaaacc 600ccatcaatgc ctccggagtc gatgcaaagg ccatcttgtc
tgctcgactc tcgaagagca 660gacgactgga gaacctcatt gcccaacttc
ctggcgagaa aaagaacgga ctgtttggca 720acctcattgc cctttctctt
ggtctcacac ccaacttcaa gtccaacttc gatctggcgg 780aggacgccaa
gctccagctg tccaaggaca cctacgacga tgacctcgac aacctgcttg
840cacagattgg cgatcagtac gccgacctgt ttctcgctgc caagaacctt
tcggatgcta 900ttctcttgtc tgacattctg cgagtcaaca ccgagatcac
aaaggctccc ctttctgcct 960ccatgatcaa gcgatacgac gagcaccatc
aggatctcac actgctcaag gctcttgtcc 1020gacagcaact gcccgagaag
tacaaggaga tctttttcga tcagtcgaag aacggctacg 1080ctggatacat
cgacggcgga gcctctcagg aagagttcta caagttcatc aagccaattc
1140tcgagaagat ggacggaacc gaggaactgc ttgtcaagct caatcgagag
gatctgcttc 1200ggaagcaacg aaccttcgac aacggcagca ttcctcatca
gatccacctc ggtgagctgc 1260acgccattct tcgacgtcag gaagacttct
acccctttct caaggacaac cgagagaaga 1320tcgagaagat tcttaccttt
cgaatcccct actatgttgg tcctcttgcc agaggaaact 1380ctcgatttgc
ttggatgact cgaaagtccg aggaaaccat cactccctgg aacttcgagg
1440aagtcgtgga caagggtgcc tctgcacagt ccttcatcga gcgaatgacc
aacttcgaca 1500agaatctgcc caacgagaag gttcttccca agcattcgct
gctctacgag tactttacag 1560tctacaacga actcaccaaa gtcaagtacg
ttaccgaggg aatgcgaaag cctgccttct 1620tgtctggcga acagaagaaa
gccattgtcg atctcctgtt caagaccaac cgaaaggtca 1680ctgttaagca
gctcaaggag gactacttca agaaaatcga gtgtttcgac agcgtcgaga
1740tttccggagt tgaggaccga ttcaacgcct ctttgggcac ctatcacgat
ctgctcaaga 1800ttatcaagga caaggatttt ctcgacaacg aggaaaacga
ggacattctg gaggacatcg 1860tgctcactct taccctgttc gaagatcggg
agatgatcga ggaacgactc aagacatacg 1920ctcacctgtt cgacgacaag
gtcatgaaac aactcaagcg acgtagatac accggctggg 1980gaagactttc
gcgaaagctc atcaacggca tcagagacaa gcagtccgga aagaccattc
2040tggactttct caagtccgat ggctttgcca accgaaactt catgcagctc
attcacgacg 2100attctcttac cttcaaggag gacatccaga aggcacaagt
gtccggtcag ggcgacagct 2160tgcacgaaca tattgccaac ctggctggtt
cgccagccat caagaaaggc attctccaga 2220ctgtcaaggt tgtcgacgag
ctggtgaagg tcatgggacg tcacaagccc gagaacattg 2280tgatcgagat
ggccagagag aaccagacaa ctcaaaaggg tcagaaaaac tcgcgagagc
2340ggatgaagcg aatcgaggaa ggcatcaagg agctgggatc ccagattctc
aaggagcatc 2400ccgtcgagaa cactcaactg cagaacgaga agctgtatct
ctactatctg cagaatggtc 2460gagacatgta cgtggatcag gaactggaca
tcaatcgtct cagcgactac gatgtggacc 2520acattgtccc tcaatccttt
ctcaaggacg attctatcga caacaaggtc cttacacgat 2580ccgacaagaa
cagaggcaag tcggacaacg ttcccagcga agaggtggtc aaaaagatga
2640agaactactg gcgacagctg ctcaacgcca agctcattac ccagcgaaag
ttcgacaatc 2700ttaccaaggc cgagcgaggc ggtctgtccg agctcgacaa
ggctggcttc atcaagcgtc 2760aactcgtcga gaccagacag atcacaaagc
acgtcgcaca gattctcgat tctcggatga 2820acaccaagta cgacgagaac
gacaagctca tccgagaggt caaggtgatt actctcaagt 2880ccaaactggt
ctccgatttc cgaaaggact ttcagttcta caaggtgcga gagatcaaca
2940attaccacca tgcccacgat gcttacctca acgccgtcgt tggcactgcg
ctcatcaaga 3000aataccccaa gctcgaaagc gagttcgttt acggcgatta
caaggtctac gacgttcgaa 3060agatgattgc caagtccgaa caggagattg
gcaaggctac tgccaagtac ttcttttact 3120ccaacatcat gaactttttc
aagaccgaga tcaccttggc caacggagag attcgaaaga 3180gaccacttat
cgagaccaac ggcgaaactg gagagatcgt gtgggacaag ggtcgagact
3240ttgcaaccgt gcgaaaggtt ctgtcgatgc ctcaggtcaa catcgtcaag
aaaaccgagg 3300ttcagactgg cggattctcc aaggagtcga ttctgcccaa
gcgaaactcc gacaagctca 3360tcgctcgaaa gaaagactgg gatcccaaga
aatacggtgg cttcgattct cctaccgtcg 3420cctattccgt gcttgtcgtt
gcgaaggtcg agaagggcaa gtccaaaaag ctcaagtccg 3480tcaaggagct
gctcggaatt accatcatgg agcgatcgag cttcgagaag aatcccatcg
3540acttcttgga agccaagggt tacaaggagg tcaagaaaga cctcattatc
aagctgccca 3600agtactctct gttcgaactg gagaacggtc gaaagcgtat
gctcgcctcc gctggcgagc 3660tgcagaaggg aaacgagctt gccttgcctt
cgaagtacgt caactttctc tatctggctt 3720ctcactacga gaagctcaag
ggttctcccg aggacaacga acagaagcaa ctcttcgttg 3780agcagcacaa
acattacctc gacgagatta tcgagcagat ttccgagttt tcgaagcgag
3840tcatcctggc tgatgccaac ttggacaagg tgctctctgc ctacaacaag
catcgggaca 3900aacccattcg agaacaggcg gagaacatca ttcacctgtt
tactcttacc aacctgggtg 3960ctcctgcagc tttcaagtac ttcgatacca
ctatcgaccg aaagcggtac acatccacca 4020aggaggttct cgatgccacc
ctgattcacc agtccatcac tggcctgtac gagacccgaa 4080tcgacctgtc
tcagcttggt ggcgactcca gagccgatcc caagaaaaag cgaaaggtct
4140aagcggccgc taagcttggc tgttttggcg gatgagagaa gattttcagc
ctgatacaga 4200ttaaatcaga acgcagaagc ggtctgataa aacagaattt
gcctggcggc agtagcgcgg 4260tggtcccacc tgaccccatg ccgaactcag
aagtgaaacg ccgtagcgcc gatggtagtg 4320tggggtctcc ccatgcgaga
gtagggaact gccaggcatc aaataaaacg aaaggctcag 4380tcgaaagact
gggcctttcg ttttatctgt tgtttgtcgg tgaacgctct cctgagtagg
4440acaaatccgc cgggagcgga tttgaacgtt gcgaagcaac ggcccggagg
gtggcgggca 4500ggacgcccgc cataaactgc caggcatcaa attaagcaga
aggccatcct gacggatggc 4560ctttttgcgt ttctacaaac tcttttgttt
atttttctaa atacattcaa atatgtatcc 4620gctcatgaga caataaccct
gataaatgct tcaataatat tgaaaaagga agagtatgag 4680tattcaacat
ttccgtgtcg cccttattcc cttttttgcg gcattttgcc ttcctgtttt
4740tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg
gtgcacgagt 4800gggttacatc gaactggatc tcaacagcgg taagatcctt
gagagttttc gccccgaaga 4860acgttttcca atgatgagca cttttaaagt
tctgctatgt ggcgcggtat tatcccgtgt 4920tgacgccggg caagagcaac
tcggtcgccg catacactat tctcagaatg acttggttga 4980gtactcacca
gtcacagaaa agcatcttac ggatggcatg acagtaagag aattatgcag
5040tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa
cgatcggagg 5100accgaaggag ctaaccgctt ttttgcacaa catgggggat
catgtaactc gccttgatcg 5160ttgggaaccg gagctgaatg aagccatacc
aaacgacgag cgtgacacca cgatgcctgt 5220agcaatggca acaacgttgc
gcaaactatt aactggcgaa ctacttactc tagcttcccg 5280gcaacaatta
atagactgga tggaggcgga taaagttgca ggaccacttc tgcgctcggc
5340ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg
ggtctcgcgg 5400tatcattgca gcactggggc cagatggtaa gccctcccgt
atcgtagtta tctacacgac 5460ggggagtcag gcaactatgg atgaacgaaa
tagacagatc gctgagatag gtgcctcact 5520gattaagcat tggtaactgt
cagaccaagt ttactcatat atactttaga ttgatttaaa 5580acttcatttt
taatttaaaa ggatctaggt gaagatcctt tttgataatc tcatgaccaa
5640aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa
agatcaaagg 5700atcttcttga gatccttttt ttctgcgcgt aatctgctgc
ttgcaaacaa aaaaaccacc 5760gctaccagcg gtggtttgtt tgccggatca
agagctacca actctttttc cgaaggtaac 5820tggcttcagc agagcgcaga
taccaaatac tgtccttcta gtgtagccgt agttaggcca 5880ccacttcaag
aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt
5940ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac
gatagttacc 6000ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc
acacagccca gcttggagcg 6060aacgacctac accgaactga gatacctaca
gcgtgagcta tgagaaagcg ccacgcttcc 6120cgaagggaga aaggcggaca
ggtatccggt aagcggcagg gtcggaacag gagagcgcac 6180gagggagctt
ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct
6240ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat
ggaaaaacgc 6300cagcaacgcg gcctttttac ggttcctggc cttttgctgg
ccttttgctc acatgttctt 6360tcctgcgtta tcccctgatt ctgtggataa
ccgtattacc gcctttgagt gagctgatac 6420cgctcgccgc agccgaacga
ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg 6480cctgatgcgg
tattttctcc ttacgcatct gtgcggtatt tcacaccgca tatggtgcac
6540tctcagtaca atctgctctg atgccgcata gttaagccag tatacactcc
gctatcgcta 6600cgtgactggg tcatggctgc gccccgacac ccgccaacac
ccgctgacgc gccctgacgg 6660gcttgtctgc tcccggcatc cgcttacaga
caagctgtga ccgtctccgg gagctgcatg 6720tgtcagaggt tttcaccgtc
atcaccgaaa cgcgcgaggc agcagatcaa ttcgcgcgcg 6780aaggcgaagc
ggcatgcata atgtgcctgt caaatggacg aagcagggat tctgcaaacc
6840ctatgctact ccgtcaagcc gtcaattgtc tgattcgtta ccaattatga
caacttgacg 6900gctacatcat tcactttttc ttcacaaccg gcacggaact
cgctcgggct ggccccggtg 6960cattttttaa atacccgcga gaaatagagt
tgatcgtcaa aaccaacatt gcgaccgacg 7020gtggcgatag gcatccgggt
ggtgctcaaa agcagcttcg cctggctgat acgttggtcc 7080tcgcgccagc
ttaagacgct aatccctaac tgctggcgga aaagatgtga cagacgcgac
7140ggcgacaagc aaacatgctg tgcgacgctg gcgatatcaa aattgctgtc
tgccaggtga 7200tcgctgatgt actgacaagc ctcgcgtacc cgattatcca
tcggtggatg gagcgactcg 7260ttaatcgctt ccatgcgccg cagtaacaat
tgctcaagca gatttatcgc cagcagctcc 7320gaatagcgcc cttccccttg
cccggcgtta atgatttgcc caaacaggtc gctgaaatgc 7380ggctggtgcg
cttcatccgg gcgaaagaac cccgtattgg caaatattga cggccagtta
7440agccattcat gccagtaggc gcgcggacga aagtaaaccc actggtgata
ccattcgcga 7500gcctccggat gacgaccgta gtgatgaatc tctcctggcg
ggaacagcaa aatatcaccc 7560ggtcggcaaa caaattctcg tccctgattt
ttcaccaccc cctgaccgcg aatggtgaga 7620ttgagaatat aacctttcat
tcccagcggt cggtcgataa aaaaatcgag ataaccgttg 7680gcctcaatcg
gcgttaaacc cgccaccaga tgggcattaa acgagtatcc cggcagcagg
7740ggatcatttt gcgcttcagc catacttttc atactcccgc cattcagaga
agaaaccaat 7800tgtccatatt gcatcagaca ttgccgtcac tgcgtctttt
actggctctt ctcgctaacc 7860aaaccggtaa ccccgcttat taaaagcatt
ctgtaacaaa gcgggaccaa agccatgaca 7920aaaacgcgta acaaaagtgt
ctataatcac ggcagaaaag tccacattga ttatttgcac 7980ggcgtcacac
tttgctatgc catagcattt ttatccataa gattagcgga tcctacctga
8040cgctttttat cgcaactctc tactgtttct ccatacccgt tttttgggct
aacaggagga 8100attaaccatg gggcatcacc accatcacca cttattgatt
atcttgcgtc gtcgcatccg 8160caaacaggcg cacgcacata gcaagg
8186
SEQ ID NO: 24 | Length: 80 | Type: RNA | Organism: artificial sequence | Description: Cas9 endonuclease recognition (CER) domain
guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc cguuaucaac uugaaaaagu 60
ggcaccgagu cggugcuuuu 80
SEQ ID NO: 25 | Length: 20 | Type: DNA | Organism: Yarrowia lipolytica
tcaaacgatt acccaccctc 20
SEQ ID NO: 26 | Length: 43 | Type: RNA | Organism: artificial sequence | Description: Hammerhead (HH) ribozyme | Feature: misc_feature (1)..(6), n is a, c, g, or u
nnnnnncuga ugaguccgug aggacgaaac gaguaagcuc guc 43
SEQ ID NO: 27 | Length: 68 | Type: RNA | Organism: hepatitis delta virus
ggccggcaug gucccagccu ccucgcuggc gccggcuggg caacaugcuu cggcauggcg 60
aaugggac 68
SEQ ID NO: 28 | Length: 211 | Type: DNA | Organism: artificial sequence | Description: HH-sgRNA-HDV (RGR) pre-sgRNA expression cassette
gtttgactga tgagtccgtg aggacgaaac gagtaagctc gtctcaaacg attacccacc 60
ctcgttttag agctagaaat agcaagttaa aataaggcta gtccgttatc aacttgaaaa 120
agtggcaccg agtcggtgct tttggccggc atggtcccag cctcctcgct ggcgccggct 180
gggcaacatg cttcggcatg gcgaatggga c 211
SEQ ID NO: 29 | Length: 20 | Type: DNA | Organism: Bacteriophage T7
taatacgact cactataggg 20
SEQ ID NO: 30 | Length: 2875 | Type: DNA | Organism: artificial sequence | Description: pRF46 plasmid
agcttgtccc
attcgccatg ccgaagcatg ttgcccagcc ggcgccagcg aggaggctgg 60gaccatgccg
gccaaaagca ccaccgactc ggtgccactt tttcaagttg ataacggact
120agccttattt taacttgcta tttctagctc taaaacgagg gtgggtaatc
gtttgagacg 180agcttactcg tttcgtcctc acggactcat cagtcaaacc
cctatagtga gtcgtattag 240aattcgtaat catggtcata gctgtttcct
gtgtgaaatt gttatccgct cacaattcca 300cacaacatac gagccggaag
cataaagtgt aaagcctggg gtgcctaatg agtgagctaa 360ctcacattaa
ttgcgttgcg ctcactgccc gctttccagt cgggaaacct gtcgtgccag
420ctgcattaat gaatcggcca acgcgcgggg agaggcggtt tgcgtattgg
gcgctcttcc 480gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc
tgcggcgagc ggtatcagct 540cactcaaagg cggtaatacg gttatccaca
gaatcagggg ataacgcagg aaagaacatg 600tgagcaaaag gccagcaaaa
ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc 660cataggctcc
gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga
720aacccgacag gactataaag ataccaggcg tttccccctg gaagctccct
cgtgcgctct 780cctgttccga ccctgccgct taccggatac ctgtccgcct
ttctcccttc gggaagcgtg 840gcgctttctc atagctcacg ctgtaggtat
ctcagttcgg tgtaggtcgt tcgctccaag 900ctgggctgtg tgcacgaacc
ccccgttcag cccgaccgct gcgccttatc cggtaactat 960cgtcttgagt
ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac
1020aggattagca gagcgaggta tgtaggcggt gctacagagt tcttgaagtg
gtggcctaac 1080tacggctaca ctagaaggac agtatttggt atctgcgctc
tgctgaagcc agttaccttc 1140ggaaaaagag ttggtagctc ttgatccggc
aaacaaacca ccgctggtag cggtggtttt 1200tttgtttgca agcagcagat
tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc 1260ttttctacgg
ggtctgacgc tcagtggaac gaaaactcac gttaagggat tttggtcatg
1320agattatcaa aaaggatctt cacctagatc cttttaaatt aaaaatgaag
ttttaaatca 1380atctaaagta tatatgagta aacttggtct gacagttacc
aatgcttaat cagtgaggca 1440cctatctcag cgatctgtct atttcgttca
tccatagttg cctgactccc cgtcgtgtag 1500ataactacga tacgggaggg
cttaccatct ggccccagtg ctgcaatgat accgcgagac 1560ccacgctcac
cggctccaga tttatcagca ataaaccagc cagccggaag ggccgagcgc
1620agaagtggtc ctgcaacttt atccgcctcc atccagtcta ttaattgttg
ccgggaagct 1680agagtaagta gttcgccagt taatagtttg cgcaacgttg
ttgccattgc tacaggcatc 1740gtggtgtcac gctcgtcgtt tggtatggct
tcattcagct ccggttccca acgatcaagg 1800cgagttacat gatcccccat
gttgtgcaaa aaagcggtta gctccttcgg tcctccgatc 1860gttgtcagaa
gtaagttggc cgcagtgtta tcactcatgg ttatggcagc actgcataat
1920tctcttactg tcatgccatc cgtaagatgc ttttctgtga ctggtgagta
ctcaaccaag 1980tcattctgag aatagtgtat gcggcgaccg agttgctctt
gcccggcgtc aatacgggat 2040aataccgcgc cacatagcag aactttaaaa
gtgctcatca ttggaaaacg ttcttcgggg 2100cgaaaactct caaggatctt
accgctgttg agatccagtt cgatgtaacc cactcgtgca 2160cccaactgat
cttcagcatc ttttactttc accagcgttt ctgggtgagc aaaaacagga
2220aggcaaaatg ccgcaaaaaa gggaataagg gcgacacgga aatgttgaat
actcatactc 2280ttcctttttc aatattattg aagcatttat cagggttatt
gtctcatgag cggatacata 2340tttgaatgta tttagaaaaa taaacaaata
ggggttccgc gcacatttcc ccgaaaagtg 2400ccacctgacg tctaagaaac
cattattatc atgacattaa cctataaaaa taggcgtatc 2460acgaggccct
ttcgtctcgc gcgtttcggt gatgacggtg aaaacctctg acacatgcag
2520ctcccggaga cggtcacagc ttgtctgtaa gcggatgccg ggagcagaca
agcccgtcag 2580ggcgcgtcag cgggtgttgg cgggtgtcgg ggctggctta
actatgcggc atcagagcag 2640attgtactga gagtgcacca tatgcggtgt
gaaataccgc acagatgcgt aaggagaaaa 2700taccgcatca ggcgccattc
gccattcagg ctgcgcaact gttgggaagg gcgatcggtg 2760cgggcctctt
cgctattacg ccagctggcg aaagggggat gtgctgcaag gcgattaagt
2820tgggtaacgc cagggttttc ccagtcacga cgttgtaaaa cgacggccag tgcca
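2875
SEQ ID NO: 31 | Length: 20 | Type: DNA | Organism: artificial sequence | Description: T7 forward PCR primer
ccggctcgta tgttgtgtgg 20
SEQ ID NO: 32 | Length: 20 | Type: DNA | Organism: artificial sequence | Description: gRNArev1 reverse primer
aaaagcaccg actcggtgcc 20
SEQ ID NO: 33 | Length: 21 | Type: DNA | Organism: artificial sequence | Description: IV-up forward primer
ccacgaaacg acgtttcgac c 21
SEQ ID NO: 34 | Length: 20 | Type: DNA | Organism: artificial sequence | Description: IV-down reverse primer
gcaaagactc ggttgatggc 20
SEQ ID NO: 35 | Length: 982 | Type: DNA | Organism: Yarrowia lipolytica
ccacgaaacg acgtttcgac cttaacgacc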
ctgccgtctc catccatccg accacaatgg 60aaaagacatt ttcaaacgat tacccaccct
ccgggactga ggcccacatc cacatcaacc 120acacggccca ctcggatgac
tcagaggagg tgccctcgca caaggaaaat tacaacacca 180gtggccacga
cctggaggag tccgacccgg ataaccatgt cggtgagacc ctcgaggtca
240agcgaggtct caagatgcga cacatctcca tgatctcgct tggaggaacc
attggtaccg 300gtctcttcat tggtaccgga ggagctctcc agcaggccgg
tccctgtggc gccctcgtcg 360cctacgtgtt catggccacc attgtctact
ctgttgccga gtctcttgga gaactggcta 420cgtacattcc catcaccggc
tcctttgccg tctttactac ccgatatctg tcacagtcgt 480ttggtgcctc
catgggctgg ctatactggt tctcgtgggc gatcaccttc gccatcgagc
540tcaacaccat tggtcccgtg attgagtact ggactgacgc cgttcctact
gctgcctgga 600ttgccatctt cttcgtcatc ctcactacca tcaacttctt
ccccgtgggc ttctatggcg 660aagtcgagtt ctgggtggcc tccgtgaagg
tcattgccat cattggatgg ctcatctacg 720cgctctgcat gacgtgtgga
gcaggtgtaa caggtcctgt gggattcaga tactggaacc 780accccggacc
catgggagac ggaatctgga ccgacggcgt gcccattgtg cgaaacgcgc
840ccggtcgacg attcatggga tggctcaatt cgctcgttaa cgccgccttc
acctaccagg 900gctgtgagct ggtcggagtc actgccggtg aggcccagaa
ccccagaaag tccgtccctc 960gagccatcaa ccgagtcttt gc
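982
SEQ ID NO: 36 | Length: 4 | Type: RNA | Organism: unknown | Description: RNA loop-forming sequence (GAAA)
gaaa 4
SEQ ID NO: 37 | Length: 4 | Type: RNA | Organism: unknown | Description: RNA loop-forming sequence (CAAA)
caaa 4
SEQ ID NO: 38 | Length: 4 | Type: RNA | Organism: unknown | Description: RNA loop-forming sequence (AAAG)
aaag 4
SEQ ID NO: 39 | Length: 1434 | Type: PRT | Organism: artificial sequence | Description: Zebra CPP-Cas9-NLS fusion protein
Glu Cys Asp Ser Glu Leu Glu Ile Lys Arg Tyr Lys Arg Val Arg Val1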
5 10 15Ala Ser Arg Lys Cys Arg Ala Lys Phe Lys Gln Leu Leu Gln His
Tyr 20 25 30Arg Glu Val Ala Ala Ala Lys Ser Ser Glu Asn Asp Arg Leu
Arg Leu 35 40 45Leu Leu Lys Gln Met Cys Glu Phe Asp Lys Lys Tyr Ser
Ile Gly Leu 50 55 60Asp Ile Gly Thr Asn Ser Val Gly Trp Ala Val Ile
Thr Asp Glu Tyr65 70 75 80Lys Val Pro Ser Lys Lys Phe Lys Val Leu
Gly Asn Thr Asp Arg His 85 90 95Ser Ile Lys Lys Asn Leu Ile Gly Ala
Leu Leu Phe Asp Ser Gly Glu 100 105 110Thr Ala Glu Ala Thr Arg Leu
Lys Arg Thr Ala Arg Arg Arg Tyr Thr 115 120 125Arg Arg Lys Asn Arg
Ile Cys Tyr Leu Gln Glu Ile Phe Ser Asn Glu 130 135 140Met Ala Lys
Val Asp Asp Ser Phe Phe His Arg Leu Glu Glu Ser Phe145 150 155
160Leu Val Glu Glu Asp Lys Lys His Glu Arg His Pro Ile Phe Gly Asn
165 170 175Ile Val Asp Glu Val Ala Tyr His Glu Lys Tyr Pro Thr Ile
Tyr His 180 185 190Leu Arg Lys Lys Leu Val Asp Ser Thr Asp Lys Ala
Asp Leu Arg Leu 195 200 205Ile Tyr Leu Ala Leu Ala His Met Ile Lys
Phe Arg Gly His Phe Leu 210 215 220Ile Glu Gly Asp Leu Asn Pro Asp
Asn Ser Asp Val Asp Lys Leu Phe225 230 235 240Ile Gln Leu Val Gln
Thr Tyr Asn Gln Leu Phe Glu Glu Asn Pro Ile 245 250 255Asn Ala Ser
Gly Val Asp Ala Lys Ala Ile Leu Ser Ala Arg Leu Ser 260 265 270Lys
Ser Arg Arg Leu Glu Asn Leu Ile Ala Gln Leu Pro Gly Glu Lys 275 280
285Lys Asn Gly Leu Phe Gly Asn Leu Ile Ala Leu Ser Leu Gly Leu Thr
290 295 300Pro Asn Phe Lys Ser Asn Phe Asp Leu Ala Glu Asp Ala Lys
Leu Gln305 310 315 320Leu Ser Lys Asp Thr Tyr Asp Asp Asp Leu Asp
Asn Leu Leu Ala Gln 325 330 335Ile Gly Asp Gln Tyr Ala Asp Leu Phe
Leu Ala Ala Lys Asn Leu Ser 340 345 350Asp Ala Ile Leu Leu Ser Asp
Ile Leu Arg Val Asn Thr Glu Ile Thr 355 360 365Lys Ala Pro Leu Ser
Ala Ser Met Ile Lys Arg Tyr Asp Glu His His 370 375 380Gln Asp Leu
Thr Leu Leu Lys Ala Leu Val Arg Gln Gln Leu Pro Glu385 390 395
400Lys Tyr Lys Glu Ile Phe Phe Asp Gln Ser Lys Asn Gly Tyr Ala Gly
405 410 415Tyr Ile Asp Gly Gly Ala Ser Gln Glu Glu Phe Tyr Lys Phe
Ile Lys 420 425 430Pro Ile Leu Glu Lys Met Asp Gly Thr Glu Glu Leu
Leu Val Lys Leu 435 440 445Asn Arg Glu Asp Leu Leu Arg Lys Gln Arg
Thr Phe Asp Asn Gly Ser 450 455 460Ile Pro His Gln Ile His Leu Gly
Glu Leu His Ala Ile Leu Arg Arg465 470 475 480Gln Glu Asp Phe Tyr
Pro Phe Leu Lys Asp Asn Arg Glu Lys Ile Glu 485 490 495Lys Ile Leu
Thr Phe Arg Ile Pro Tyr Tyr Val Gly Pro Leu Ala Arg 500 505 510Gly
Asn Ser Arg Phe Ala Trp Met Thr Arg Lys Ser Glu Glu Thr Ile 515 520
525Thr Pro Trp Asn Phe Glu Glu Val Val Asp Lys Gly Ala Ser Ala Gln
530 535 540Ser Phe Ile Glu Arg Met Thr Asn Phe Asp Lys Asn Leu Pro
Asn Glu545 550 555 560Lys Val Leu Pro Lys His Ser Leu Leu Tyr Glu
Tyr Phe Thr Val Tyr 565 570 575Asn Glu Leu Thr Lys Val Lys Tyr Val
Thr Glu Gly Met Arg Lys Pro 580 585 590Ala Phe Leu Ser Gly Glu Gln
Lys Lys Ala Ile Val Asp Leu Leu Phe 595 600 605Lys Thr Asn Arg Lys
Val Thr Val Lys Gln Leu Lys Glu Asp Tyr Phe 610 615 620Lys Lys Ile
Glu Cys Phe Asp Ser Val Glu Ile Ser Gly Val Glu Asp625 630 635
640Arg Phe Asn Ala Ser Leu Gly Thr Tyr His Asp Leu Leu Lys Ile Ile
645 650 655Lys Asp Lys Asp Phe Leu Asp Asn Glu Glu Asn Glu Asp Ile
Leu Glu 660 665 670Asp Ile Val Leu Thr Leu Thr Leu Phe Glu Asp Arg
Glu Met Ile Glu 675 680 685Glu Arg Leu Lys Thr Tyr Ala His Leu Phe
Asp Asp Lys Val Met Lys 690 695 700Gln Leu Lys Arg Arg Arg Tyr Thr
Gly Trp Gly Arg Leu Ser Arg Lys705 710 715 720Leu Ile Asn Gly Ile
Arg Asp Lys Gln Ser Gly Lys Thr Ile Leu Asp 725 730 735Phe Leu Lys
Ser Asp Gly Phe Ala Asn Arg Asn Phe Met Gln Leu Ile 740 745 750His
Asp Asp Ser Leu Thr Phe Lys Glu Asp Ile Gln Lys Ala Gln Val 755 760
765Ser Gly Gln Gly Asp Ser Leu His Glu His Ile Ala Asn Leu Ala Gly
770 775 780Ser Pro Ala Ile Lys Lys Gly Ile Leu Gln Thr Val Lys Val
Val Asp785 790 795 800Glu Leu Val Lys Val Met Gly Arg His Lys Pro
Glu Asn Ile Val Ile 805 810 815Glu Met Ala Arg Glu Asn Gln Thr Thr
Gln Lys Gly Gln Lys Asn Ser 820 825 830Arg Glu Arg Met Lys Arg Ile
Glu Glu Gly Ile Lys Glu Leu Gly Ser 835 840 845Gln Ile Leu Lys Glu
His Pro Val Glu Asn Thr Gln Leu Gln Asn Glu 850 855 860Lys Leu Tyr
Leu Tyr Tyr Leu Gln Asn Gly Arg Asp Met Tyr Val Asp865 870 875
880Gln Glu Leu Asp Ile Asn Arg Leu Ser Asp Tyr Asp Val Asp His Ile
885 890 895Val Pro Gln Ser Phe Leu Lys Asp Asp Ser Ile Asp Asn Lys
Val Leu 900 905 910Thr Arg Ser Asp Lys Asn Arg Gly Lys Ser Asp Asn
Val Pro Ser Glu 915 920 925Glu Val Val Lys Lys Met Lys Asn Tyr Trp
Arg Gln Leu Leu Asn Ala 930 935 940Lys Leu Ile Thr Gln Arg Lys Phe
Asp Asn Leu Thr Lys Ala Glu Arg945 950 955 960Gly Gly Leu Ser Glu
Leu Asp Lys Ala Gly Phe Ile Lys Arg Gln Leu 965 970 975Val Glu Thr
Arg Gln Ile Thr Lys His Val Ala Gln Ile Leu Asp Ser 980 985 990Arg
Met Asn Thr Lys Tyr Asp Glu Asn Asp Lys Leu Ile Arg Glu Val 995
1000 1005Lys Val Ile Thr Leu Lys Ser Lys Leu Val Ser Asp Phe Arg
Lys 1010 1015 1020Asp Phe Gln Phe Tyr Lys Val Arg Glu Ile Asn Asn
Tyr His His 1025 1030 1035Ala His Asp Ala Tyr Leu Asn Ala Val Val
Gly Thr Ala Leu Ile 1040 1045 1050Lys Lys Tyr Pro Lys Leu Glu Ser
Glu Phe Val Tyr Gly Asp Tyr 1055 1060 1065Lys Val Tyr Asp Val Arg
Lys Met Ile Ala Lys Ser Glu Gln Glu 1070 1075 1080Ile Gly Lys Ala
Thr Ala Lys Tyr Phe Phe Tyr Ser Asn Ile Met 1085 1090 1095Asn Phe
Phe Lys Thr Glu Ile Thr Leu Ala Asn Gly Glu Ile Arg 1100 1105
1110Lys Arg Pro Leu Ile Glu Thr Asn Gly Glu Thr Gly Glu Ile Val
1115 1120 1125Trp Asp Lys Gly Arg Asp Phe Ala Thr Val Arg Lys Val
Leu Ser 1130 1135 1140Met Pro Gln Val Asn Ile Val Lys Lys Thr Glu
Val Gln Thr Gly 1145 1150 1155Gly Phe Ser Lys Glu Ser Ile Leu Pro
Lys Arg Asn Ser Asp Lys 1160 1165 1170Leu Ile Ala Arg Lys Lys Asp
Trp Asp Pro Lys Lys Tyr Gly Gly 1175 1180 1185Phe Asp Ser Pro Thr
Val Ala Tyr Ser Val Leu Val Val Ala Lys 1190 1195 1200Val Glu Lys
Gly Lys Ser Lys Lys Leu Lys Ser Val Lys Glu Leu 1205 1210 1215Leu
Gly Ile Thr Ile Met Glu Arg Ser Ser Phe Glu Lys Asn Pro 1220 1225
1230Ile Asp Phe Leu Glu Ala Lys Gly Tyr Lys Glu Val Lys Lys Asp
1235 1240 1245Leu Ile Ile Lys Leu Pro Lys Tyr Ser Leu Phe Glu Leu
Glu Asn 1250 1255 1260Gly Arg Lys Arg Met Leu Ala Ser Ala Gly Glu
Leu Gln Lys Gly 1265 1270 1275Asn Glu Leu Ala Leu Pro Ser Lys Tyr
Val Asn Phe Leu Tyr Leu 1280 1285 1290Ala Ser His Tyr Glu Lys Leu
Lys Gly Ser Pro Glu Asp Asn Glu 1295 1300 1305Gln Lys Gln Leu Phe
Val Glu Gln His Lys His Tyr Leu Asp Glu 1310 1315 1320Ile Ile Glu
Gln Ile Ser Glu Phe Ser Lys Arg Val Ile Leu Ala 1325 1330 1335Asp
Ala Asn Leu Asp Lys Val Leu Ser Ala Tyr Asn Lys His Arg 1340 1345
1350Asp Lys Pro Ile Arg Glu Gln Ala Glu Asn Ile Ile His Leu Phe
1355 1360 1365Thr Leu Thr Asn Leu Gly Ala Pro Ala Ala Phe Lys Tyr
Phe Asp 1370 1375 1380Thr Thr Ile Asp Arg Lys Arg Tyr Thr Ser Thr
Lys Glu Val Leu 1385 1390 1395Asp Ala Thr Leu Ile His Gln Ser Ile
Thr Gly Leu Tyr Glu Thr 1400 1405 1410Arg Ile Asp Leu Ser Gln Leu
Gly Gly Asp Ser Arg Ala Asp Pro 1415 1420 1425Lys Lys Lys Arg Lys
Val 1430
SEQ ID NO: 40 | Length: 1397 | Type: PRT | Organism: artificial sequence | Description: PolyR CPP-Cas9-NLS fusion protein
Gly Gly Gly Gly Arg Arg Arg Arg Arg Arg Arg Arg Arg Leu
Leu Leu1 5 10 15Leu Glu Phe Asp Lys Lys Tyr Ser Ile Gly Leu Asp Ile
Gly Thr Asn 20 25 30Ser Val Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys
Val Pro Ser Lys 35 40 45Lys Phe Lys Val Leu Gly Asn Thr Asp Arg His
Ser Ile Lys Lys Asn 50 55 60Leu Ile Gly Ala Leu Leu Phe Asp Ser Gly
Glu Thr Ala Glu Ala Thr65 70 75 80Arg Leu Lys Arg Thr Ala Arg Arg
Arg Tyr Thr Arg Arg Lys Asn Arg 85 90 95Ile Cys Tyr Leu Gln Glu Ile
Phe Ser Asn Glu Met Ala Lys Val Asp 100 105 110Asp Ser Phe Phe His
Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp 115 120 125Lys Lys His
Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val 130 135 140Ala
Tyr His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu145 150
155 160Val Asp Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala
Leu 165 170 175Ala His Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu
Gly Asp Leu 180 185 190Asn Pro Asp Asn Ser Asp Val Asp Lys Leu Phe
Ile Gln Leu Val Gln 195 200 205Thr Tyr Asn Gln Leu Phe Glu Glu Asn
Pro Ile Asn Ala Ser Gly Val 210 215 220Asp Ala Lys Ala Ile Leu Ser
Ala Arg Leu Ser Lys Ser Arg Arg Leu225 230 235 240Glu Asn Leu Ile
Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe 245 250 255Gly Asn
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser 260 265
270Asn Phe Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr
275 280 285Tyr Asp Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp
Gln Tyr 290 295 300Ala Asp Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp
Ala Ile Leu Leu305 310 315 320Ser Asp Ile Leu Arg Val Asn Thr Glu
Ile Thr Lys Ala Pro Leu Ser 325 330 335Ala Ser Met Ile Lys Arg Tyr
Asp Glu His His Gln Asp Leu Thr Leu 340 345 350Leu Lys Ala Leu Val
Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile 355 360 365Phe Phe Asp
Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly 370 375 380Ala
Ser Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys385 390
395 400Met Asp Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp
Leu 405 410 415Leu Arg Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro
His Gln Ile 420 425 430His Leu Gly Glu Leu His Ala Ile Leu Arg Arg
Gln Glu Asp Phe Tyr 435 440 445Pro Phe Leu Lys Asp Asn Arg Glu Lys
Ile Glu Lys Ile Leu Thr Phe 450 455 460Arg Ile Pro Tyr Tyr Val Gly
Pro Leu Ala Arg Gly Asn Ser Arg Phe465 470 475 480Ala Trp Met Thr
Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe 485 490 495Glu Glu
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg 500 505
510Met Thr Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys
515 520 525His Ser Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu
Thr Lys 530 535 540Val Lys Tyr Val Thr Glu Gly Met Arg Lys Pro Ala
Phe Leu Ser Gly545 550 555 560Glu Gln Lys Lys Ala Ile Val Asp Leu
Leu Phe Lys Thr Asn Arg Lys 565 570 575Val Thr Val Lys Gln Leu Lys
Glu Asp Tyr Phe Lys Lys Ile Glu Cys 580 585 590Phe Asp Ser Val Glu
Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser 595 600 605Leu Gly Thr
Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe 610 615 620Leu
Asp Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr625 630
635 640Leu Thr Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys
Thr 645 650 655Tyr Ala His Leu Phe Asp Asp Lys Val Met Lys Gln Leu
Lys Arg Arg 660 665 670Arg Tyr Thr Gly Trp Gly Arg Leu Ser Arg Lys
Leu Ile Asn Gly Ile 675 680 685Arg Asp Lys Gln Ser Gly Lys Thr Ile
Leu Asp Phe Leu Lys Ser Asp 690 695 700Gly Phe Ala Asn Arg Asn Phe
Met Gln Leu Ile His Asp Asp Ser Leu705 710 715 720Thr Phe Lys Glu
Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp 725 730 735Ser Leu
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys 740 745
750Lys Gly Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val
755 760 765Met Gly Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala
Arg Glu 770 775 780Asn Gln Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg
Glu Arg Met Lys785 790 795 800Arg Ile Glu Glu Gly Ile Lys Glu Leu
Gly Ser Gln Ile Leu Lys Glu 805 810 815His Pro Val Glu Asn Thr Gln
Leu Gln Asn Glu Lys Leu Tyr Leu Tyr 820 825 830Tyr Leu Gln Asn Gly
Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile 835 840 845Asn Arg Leu
Ser Asp Tyr Asp Val Asp His Ile Val Pro Gln Ser Phe 850 855 860Leu
Lys Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys865 870
875 880Asn Arg Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys
Lys 885 890 895Met Lys Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu
Ile Thr Gln 900 905 910Arg Lys Phe Asp Asn Leu Thr Lys Ala Glu Arg
Gly Gly Leu Ser Glu 915 920 925Leu Asp Lys Ala Gly Phe Ile Lys Arg
Gln Leu Val Glu Thr Arg Gln 930 935 940Ile Thr Lys His Val Ala Gln
Ile Leu Asp Ser Arg Met Asn Thr Lys945 950 955 960Tyr Asp Glu Asn
Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu 965 970 975Lys Ser
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys 980 985
990Val Arg Glu Ile Asn Asn
Tyr His His Ala His Asp Ala Tyr Leu Asn 995 1000 1005Ala Val Val
Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu 1010 1015 1020Ser
Glu Phe Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys 1025 1030
1035Met Ile Ala Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys
1040 1045 1050Tyr Phe Phe Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr
Glu Ile 1055 1060 1065Thr Leu Ala Asn Gly Glu Ile Arg Lys Arg Pro
Leu Ile Glu Thr 1070 1075 1080Asn Gly Glu Thr Gly Glu Ile Val Trp
Asp Lys Gly Arg Asp Phe 1085 1090 1095Ala Thr Val Arg Lys Val Leu
Ser Met Pro Gln Val Asn Ile Val 1100 1105 1110Lys Lys Thr Glu Val
Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile 1115 1120 1125Leu Pro Lys
Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp 1130 1135 1140Trp
Asp Pro Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala 1145 1150
1155Tyr Ser Val Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys
1160 1165 1170Lys Leu Lys Ser Val Lys Glu Leu Leu Gly Ile Thr Ile
Met Glu 1175 1180 1185Arg Ser Ser Phe Glu Lys Asn Pro Ile Asp Phe
Leu Glu Ala Lys 1190 1195 1200Gly Tyr Lys Glu Val Lys Lys Asp Leu
Ile Ile Lys Leu Pro Lys 1205 1210 1215Tyr Ser Leu Phe Glu Leu Glu
Asn Gly Arg Lys Arg Met Leu Ala 1220 1225 1230Ser Ala Gly Glu Leu
Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser 1235 1240 1245Lys Tyr Val
Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu 1250 1255 1260Lys
Gly Ser Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu 1265 1270
1275Gln His Lys His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu
1280 1285 1290Phe Ser Lys Arg Val Ile Leu Ala Asp Ala Asn Leu Asp
Lys Val 1295 1300 1305Leu Ser Ala Tyr Asn Lys His Arg Asp Lys Pro
Ile Arg Glu Gln 1310 1315 1320Ala Glu Asn Ile Ile His Leu Phe Thr
Leu Thr Asn Leu Gly Ala 1325 1330 1335Pro Ala Ala Phe Lys Tyr Phe
Asp Thr Thr Ile Asp Arg Lys Arg 1340 1345 1350Tyr Thr Ser Thr Lys
Glu Val Leu Asp Ala Thr Leu Ile His Gln 1355 1360 1365Ser Ile Thr
Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu 1370 1375 1380Gly
Gly Asp Ser Arg Ala Asp Pro Lys Lys Lys Arg Lys Val 1385 1390
1395
SEQ ID NO: 41 | Length: 1401 | Type: PRT | Organism: artificial sequence | Description: TP10 CPP-Cas9-NLS fusion protein
Ala Gly Tyr Leu Leu Gly Lys Ile Asn Leu Lys Ala Cys Ala Ala Cys1
5 10 15Ala Lys Lys Ile Leu Glu Phe Asp Lys Lys Tyr Ser Ile Gly Leu
Asp 20 25 30Ile Gly Thr Asn Ser Val Gly Trp Ala Val Ile Thr Asp Glu
Tyr Lys 35 40 45Val Pro Ser Lys Lys Phe Lys Val Leu Gly Asn Thr Asp
Arg His Ser 50 55 60Ile Lys Lys Asn Leu Ile Gly Ala Leu Leu Phe Asp
Ser Gly Glu Thr65 70 75 80Ala Glu Ala Thr Arg Leu Lys Arg Thr Ala
Arg Arg Arg Tyr Thr Arg 85 90 95Arg Lys Asn Arg Ile Cys Tyr Leu Gln
Glu Ile Phe Ser Asn Glu Met 100 105 110Ala Lys Val Asp Asp Ser Phe
Phe His Arg Leu Glu Glu Ser Phe Leu 115 120 125Val Glu Glu Asp Lys
Lys His Glu Arg His Pro Ile Phe Gly Asn Ile 130 135 140Val Asp Glu
Val Ala Tyr His Glu Lys Tyr Pro Thr Ile Tyr His Leu145 150 155
160Arg Lys Lys Leu Val Asp Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile
165 170 175Tyr Leu Ala Leu Ala His Met Ile Lys Phe Arg Gly His Phe
Leu Ile 180 185 190Glu Gly Asp Leu Asn Pro Asp Asn Ser Asp Val Asp
Lys Leu Phe Ile 195 200 205Gln Leu Val Gln Thr Tyr Asn Gln Leu Phe
Glu Glu Asn Pro Ile Asn 210 215 220Ala Ser Gly Val Asp Ala Lys Ala
Ile Leu Ser Ala Arg Leu Ser Lys225 230 235 240Ser Arg Arg Leu Glu
Asn Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys 245 250 255Asn Gly Leu
Phe Gly Asn Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro 260 265 270Asn
Phe Lys Ser Asn Phe Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu 275 280
285Ser Lys Asp Thr Tyr Asp Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile
290 295 300Gly Asp Gln Tyr Ala Asp Leu Phe Leu Ala Ala Lys Asn Leu
Ser Asp305 310 315 320Ala Ile Leu Leu Ser Asp Ile Leu Arg Val Asn
Thr Glu Ile Thr Lys 325 330 335Ala Pro Leu Ser Ala Ser Met Ile Lys
Arg Tyr Asp Glu His His Gln 340 345 350Asp Leu Thr Leu Leu Lys Ala
Leu Val Arg Gln Gln Leu Pro Glu Lys 355 360 365Tyr Lys Glu Ile Phe
Phe Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr 370 375 380Ile Asp Gly
Gly Ala Ser Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro385 390 395
400Ile Leu Glu Lys Met Asp Gly Thr Glu Glu Leu Leu Val Lys Leu Asn
405 410 415Arg Glu Asp Leu Leu Arg Lys Gln Arg Thr Phe Asp Asn Gly
Ser Ile 420 425 430Pro His Gln Ile His Leu Gly Glu Leu His Ala Ile
Leu Arg Arg Gln 435 440 445Glu Asp Phe Tyr Pro Phe Leu Lys Asp Asn
Arg Glu Lys Ile Glu Lys 450 455 460Ile Leu Thr Phe Arg Ile Pro Tyr
Tyr Val Gly Pro Leu Ala Arg Gly465 470 475 480Asn Ser Arg Phe Ala
Trp Met Thr Arg Lys Ser Glu Glu Thr Ile Thr 485 490 495Pro Trp Asn
Phe Glu Glu Val Val Asp Lys Gly Ala Ser Ala Gln Ser 500 505 510Phe
Ile Glu Arg Met Thr Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys 515 520
525Val Leu Pro Lys His Ser Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn
530 535 540Glu Leu Thr Lys Val Lys Tyr Val Thr Glu Gly Met Arg Lys
Pro Ala545 550 555 560Phe Leu Ser Gly Glu Gln Lys Lys Ala Ile Val
Asp Leu Leu Phe Lys 565 570 575Thr Asn Arg Lys Val Thr Val Lys Gln
Leu Lys Glu Asp Tyr Phe Lys 580 585 590Lys Ile Glu Cys Phe Asp Ser
Val Glu Ile Ser Gly Val Glu Asp Arg 595 600 605Phe Asn Ala Ser Leu
Gly Thr Tyr His Asp Leu Leu Lys Ile Ile Lys 610 615 620Asp Lys Asp
Phe Leu Asp Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp625 630 635
640Ile Val Leu Thr Leu Thr Leu Phe Glu Asp Arg Glu Met Ile Glu Glu
645 650 655Arg Leu Lys Thr Tyr Ala His Leu Phe Asp Asp Lys Val Met
Lys Gln 660 665 670Leu Lys Arg Arg Arg Tyr Thr Gly Trp Gly Arg Leu
Ser Arg Lys Leu 675 680 685Ile Asn Gly Ile Arg Asp Lys Gln Ser Gly
Lys Thr Ile Leu Asp Phe 690 695 700Leu Lys Ser Asp Gly Phe Ala Asn
Arg Asn Phe Met Gln Leu Ile His705 710 715 720Asp Asp Ser Leu Thr
Phe Lys Glu Asp Ile Gln Lys Ala Gln Val Ser 725 730 735Gly Gln Gly
Asp Ser Leu His Glu His Ile Ala Asn Leu Ala Gly Ser 740 745 750Pro
Ala Ile Lys Lys Gly Ile Leu Gln Thr Val Lys Val Val Asp Glu 755 760
765Leu Val Lys Val Met Gly Arg His Lys Pro Glu Asn Ile Val Ile Glu
770 775 780Met Ala Arg Glu Asn Gln Thr Thr Gln Lys Gly Gln Lys Asn
Ser Arg785 790 795 800Glu Arg Met Lys Arg Ile Glu Glu Gly Ile Lys
Glu Leu Gly Ser Gln 805 810 815Ile Leu Lys Glu His Pro Val Glu Asn
Thr Gln Leu Gln Asn Glu Lys 820 825 830Leu Tyr Leu Tyr Tyr Leu Gln
Asn Gly Arg Asp Met Tyr Val Asp Gln 835 840 845Glu Leu Asp Ile Asn
Arg Leu Ser Asp Tyr Asp Val Asp His Ile Val 850 855 860Pro Gln Ser
Phe Leu Lys Asp Asp Ser Ile Asp Asn Lys Val Leu Thr865 870 875
880Arg Ser Asp Lys Asn Arg Gly Lys Ser Asp Asn Val Pro Ser Glu Glu
885 890 895Val Val Lys Lys Met Lys Asn Tyr Trp Arg Gln Leu Leu Asn
Ala Lys 900 905 910Leu Ile Thr Gln Arg Lys Phe Asp Asn Leu Thr Lys
Ala Glu Arg Gly 915 920 925Gly Leu Ser Glu Leu Asp Lys Ala Gly Phe
Ile Lys Arg Gln Leu Val 930 935 940Glu Thr Arg Gln Ile Thr Lys His
Val Ala Gln Ile Leu Asp Ser Arg945 950 955 960Met Asn Thr Lys Tyr
Asp Glu Asn Asp Lys Leu Ile Arg Glu Val Lys 965 970 975Val Ile Thr
Leu Lys Ser Lys Leu Val Ser Asp Phe Arg Lys Asp Phe 980 985 990Gln
Phe Tyr Lys Val Arg Glu Ile Asn Asn Tyr His His Ala His Asp 995
1000 1005Ala Tyr Leu Asn Ala Val Val Gly Thr Ala Leu Ile Lys Lys
Tyr 1010 1015 1020Pro Lys Leu Glu Ser Glu Phe Val Tyr Gly Asp Tyr
Lys Val Tyr 1025 1030 1035Asp Val Arg Lys Met Ile Ala Lys Ser Glu
Gln Glu Ile Gly Lys 1040 1045 1050Ala Thr Ala Lys Tyr Phe Phe Tyr
Ser Asn Ile Met Asn Phe Phe 1055 1060 1065Lys Thr Glu Ile Thr Leu
Ala Asn Gly Glu Ile Arg Lys Arg Pro 1070 1075 1080Leu Ile Glu Thr
Asn Gly Glu Thr Gly Glu Ile Val Trp Asp Lys 1085 1090 1095Gly Arg
Asp Phe Ala Thr Val Arg Lys Val Leu Ser Met Pro Gln 1100 1105
1110Val Asn Ile Val Lys Lys Thr Glu Val Gln Thr Gly Gly Phe Ser
1115 1120 1125Lys Glu Ser Ile Leu Pro Lys Arg Asn Ser Asp Lys Leu
Ile Ala 1130 1135 1140Arg Lys Lys Asp Trp Asp Pro Lys Lys Tyr Gly
Gly Phe Asp Ser 1145 1150 1155Pro Thr Val Ala Tyr Ser Val Leu Val
Val Ala Lys Val Glu Lys 1160 1165 1170Gly Lys Ser Lys Lys Leu Lys
Ser Val Lys Glu Leu Leu Gly Ile 1175 1180 1185Thr Ile Met Glu Arg
Ser Ser Phe Glu Lys Asn Pro Ile Asp Phe 1190 1195 1200Leu Glu Ala
Lys Gly Tyr Lys Glu Val Lys Lys Asp Leu Ile Ile 1205 1210 1215Lys
Leu Pro Lys Tyr Ser Leu Phe Glu Leu Glu Asn Gly Arg Lys 1220 1225
1230Arg Met Leu Ala Ser Ala Gly Glu Leu Gln Lys Gly Asn Glu Leu
1235 1240 1245Ala Leu Pro Ser Lys Tyr Val Asn Phe Leu Tyr Leu Ala
Ser His 1250 1255 1260Tyr Glu Lys Leu Lys Gly Ser Pro Glu Asp Asn
Glu Gln Lys Gln 1265 1270 1275Leu Phe Val Glu Gln His Lys His Tyr
Leu Asp Glu Ile Ile Glu 1280 1285 1290Gln Ile Ser Glu Phe Ser Lys
Arg Val Ile Leu Ala Asp Ala Asn 1295 1300 1305Leu Asp Lys Val Leu
Ser Ala Tyr Asn Lys His Arg Asp Lys Pro 1310 1315 1320Ile Arg Glu
Gln Ala Glu Asn Ile Ile His Leu Phe Thr Leu Thr 1325 1330 1335Asn
Leu Gly Ala Pro Ala Ala Phe Lys Tyr Phe Asp Thr Thr Ile 1340 1345
1350Asp Arg Lys Arg Tyr Thr Ser Thr Lys Glu Val Leu Asp Ala Thr
1355 1360 1365Leu Ile His Gln Ser Ile Thr Gly Leu Tyr Glu Thr Arg
Ile Asp 1370 1375 1380Leu Ser Gln Leu Gly Gly Asp Ser Arg Ala Asp
Pro Lys Lys Lys 1385 1390 1395Arg Lys Val 1400
SEQ ID NO: 42 | Length: 1398 | Type: PRT | Organism: artificial sequence | Description: pVEC CPP-Cas9-NLS fusion protein
Leu Leu Ile Ile Leu Arg
Arg Arg Ile Arg Lys Gln Ala His Ala His1 5 10 15Ser Lys Glu Phe Asp
Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr 20 25 30Asn Ser Val Gly
Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser 35 40 45Lys Lys Phe
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys 50 55 60Asn Leu
Ile Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala65 70 75
80Thr Arg Leu Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn
85 90 95Arg Ile Cys Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys
Val 100 105 110Asp Asp Ser Phe Phe His Arg Leu Glu Glu Ser Phe Leu
Val Glu Glu 115 120 125Asp Lys Lys His Glu Arg His Pro Ile Phe Gly
Asn Ile Val Asp Glu 130 135 140Val Ala Tyr His Glu Lys Tyr Pro Thr
Ile Tyr His Leu Arg Lys Lys145 150 155 160Leu Val Asp Ser Thr Asp
Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala 165 170 175Leu Ala His Met
Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp 180 185 190Leu Asn
Pro Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val 195 200
205Gln Thr Tyr Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly
210 215 220Val Asp Ala Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser
Arg Arg225 230 235 240Leu Glu Asn Leu Ile Ala Gln Leu Pro Gly Glu
Lys Lys Asn Gly Leu 245 250 255Phe Gly Asn Leu Ile Ala Leu Ser Leu
Gly Leu Thr Pro Asn Phe Lys 260 265 270Ser Asn Phe Asp Leu Ala Glu
Asp Ala Lys Leu Gln Leu Ser Lys Asp 275 280 285Thr Tyr Asp Asp Asp
Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln 290 295 300Tyr Ala Asp
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu305 310 315
320Leu Ser Asp Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu
325 330 335Ser Ala Ser Met Ile Lys Arg Tyr Asp Glu His His Gln Asp
Leu Thr 340 345 350Leu Leu Lys Ala Leu Val Arg Gln Gln Leu Pro Glu
Lys Tyr Lys Glu 355 360 365Ile Phe Phe Asp Gln Ser Lys Asn Gly Tyr
Ala Gly Tyr Ile Asp Gly 370 375 380Gly Ala Ser Gln Glu Glu Phe Tyr
Lys Phe Ile Lys Pro Ile Leu Glu385 390 395 400Lys Met Asp Gly Thr
Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp 405 410 415Leu Leu Arg
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln 420 425 430Ile
His Leu Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe 435 440
445Tyr Pro Phe Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr
450 455 460Phe Arg Ile Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn
Ser Arg465 470 475 480Phe Ala Trp Met Thr Arg Lys Ser Glu Glu Thr
Ile Thr Pro Trp Asn 485 490 495Phe Glu Glu Val Val Asp Lys Gly Ala
Ser Ala Gln Ser Phe Ile Glu 500 505 510Arg Met Thr Asn Phe Asp Lys
Asn Leu Pro Asn Glu Lys Val Leu Pro 515 520 525Lys His Ser Leu Leu
Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr 530 535 540Lys Val Lys
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser545 550 555
560Gly Glu Gln Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg
565 570 575Lys Val Thr Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys
Ile Glu 580 585 590Cys Phe Asp Ser Val Glu Ile Ser Gly Val Glu Asp
Arg Phe Asn Ala 595 600 605Ser Leu Gly Thr Tyr His Asp Leu Leu Lys
Ile Ile Lys Asp Lys Asp 610 615 620Phe Leu Asp Asn Glu Glu Asn Glu
Asp Ile Leu Glu Asp Ile Val Leu625 630 635 640Thr Leu
Thr Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys 645 650
655Thr Tyr Ala His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg
660 665 670Arg Arg Tyr Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile
Asn Gly 675 680 685Ile Arg Asp Lys Gln Ser Gly Lys Thr Ile Leu Asp
Phe Leu Lys Ser 690 695 700Asp Gly Phe Ala Asn Arg Asn Phe Met Gln
Leu Ile His Asp Asp Ser705 710 715 720Leu Thr Phe Lys Glu Asp Ile
Gln Lys Ala Gln Val Ser Gly Gln Gly 725 730 735Asp Ser Leu His Glu
His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile 740 745 750Lys Lys Gly
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys 755 760 765Val
Met Gly Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg 770 775
780Glu Asn Gln Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg
Met785 790 795 800Lys Arg Ile Glu Glu Gly Ile Lys Glu Leu Gly Ser
Gln Ile Leu Lys 805 810 815Glu His Pro Val Glu Asn Thr Gln Leu Gln
Asn Glu Lys Leu Tyr Leu 820 825 830Tyr Tyr Leu Gln Asn Gly Arg Asp
Met Tyr Val Asp Gln Glu Leu Asp 835 840 845Ile Asn Arg Leu Ser Asp
Tyr Asp Val Asp His Ile Val Pro Gln Ser 850 855 860Phe Leu Lys Asp
Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp865 870 875 880Lys
Asn Arg Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys 885 890
895Lys Met Lys Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr
900 905 910Gln Arg Lys Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly
Leu Ser 915 920 925Glu Leu Asp Lys Ala Gly Phe Ile Lys Arg Gln Leu
Val Glu Thr Arg 930 935 940Gln Ile Thr Lys His Val Ala Gln Ile Leu
Asp Ser Arg Met Asn Thr945 950 955 960Lys Tyr Asp Glu Asn Asp Lys
Leu Ile Arg Glu Val Lys Val Ile Thr 965 970 975Leu Lys Ser Lys Leu
Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr 980 985 990Lys Val Arg
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu 995 1000
1005Asn Ala Val Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu
1010 1015 1020Glu Ser Glu Phe Val Tyr Gly Asp Tyr Lys Val Tyr Asp
Val Arg 1025 1030 1035Lys Met Ile Ala Lys Ser Glu Gln Glu Ile Gly
Lys Ala Thr Ala 1040 1045 1050Lys Tyr Phe Phe Tyr Ser Asn Ile Met
Asn Phe Phe Lys Thr Glu 1055 1060 1065Ile Thr Leu Ala Asn Gly Glu
Ile Arg Lys Arg Pro Leu Ile Glu 1070 1075 1080Thr Asn Gly Glu Thr
Gly Glu Ile Val Trp Asp Lys Gly Arg Asp 1085 1090 1095Phe Ala Thr
Val Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile 1100 1105 1110Val
Lys Lys Thr Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser 1115 1120
1125Ile Leu Pro Lys Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys
1130 1135 1140Asp Trp Asp Pro Lys Lys Tyr Gly Gly Phe Asp Ser Pro
Thr Val 1145 1150 1155Ala Tyr Ser Val Leu Val Val Ala Lys Val Glu
Lys Gly Lys Ser 1160 1165 1170Lys Lys Leu Lys Ser Val Lys Glu Leu
Leu Gly Ile Thr Ile Met 1175 1180 1185Glu Arg Ser Ser Phe Glu Lys
Asn Pro Ile Asp Phe Leu Glu Ala 1190 1195 1200Lys Gly Tyr Lys Glu
Val Lys Lys Asp Leu Ile Ile Lys Leu Pro 1205 1210 1215Lys Tyr Ser
Leu Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu 1220 1225 1230Ala
Ser Ala Gly Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro 1235 1240
1245Ser Lys Tyr Val Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys
1250 1255 1260Leu Lys Gly Ser Pro Glu Asp Asn Glu Gln Lys Gln Leu
Phe Val 1265 1270 1275Glu Gln His Lys His Tyr Leu Asp Glu Ile Ile
Glu Gln Ile Ser 1280 1285 1290Glu Phe Ser Lys Arg Val Ile Leu Ala
Asp Ala Asn Leu Asp Lys 1295 1300 1305Val Leu Ser Ala Tyr Asn Lys
His Arg Asp Lys Pro Ile Arg Glu 1310 1315 1320Gln Ala Glu Asn Ile
Ile His Leu Phe Thr Leu Thr Asn Leu Gly 1325 1330 1335Ala Pro Ala
Ala Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys 1340 1345 1350Arg
Tyr Thr Ser Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His 1355 1360
1365Gln Ser Ile Thr Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln
1370 1375 1380Leu Gly Gly Asp Ser Arg Ala Asp Pro Lys Lys Lys Arg
Lys Val 1385 1390 1395
43 23 DNA unknown Example of a Cas9 target site PAM sequence
misc_feature (1)..(20) n = A, C, T, or G
misc_feature (21)..(21) n = A, C, T, or G (indicated as an "X" in Specification)
43 nnnnnnnnnn nnnnnnnnnn ngg 23
44 3 DNA unknown PAM sequence NGG
misc_feature (1)..(1) n = A, C, T, or G
44 ngg 3
45 6 DNA unknown PAM sequence NNAGAA
misc_feature (1)..(2) n = A, C, T, or G
45 nnagaa 6
46 7 DNA unknown PAM sequence NNAGAAW
misc_feature (1)..(2) n = A, C, T, or G
misc_feature (7)..(7) w = A or T
46 nnagaaw 7
47 5 DNA unknown PAM sequence NGGNG
misc_feature (1)..(1) n = A, C, T, or G
misc_feature (4)..(4) n = A, C, T, or G
47 nggng 5
48 8 DNA unknown PAM sequence NNNNGATT
misc_feature (1)..(4) n = A, C, T, or G
48 nnnngatt 8
49 6 DNA unknown PAM sequence NAAAAC
misc_feature (1)..(1) n = A, C, T, or G
49 naaaac 6
50 2 DNA unknown PAM sequence NG
misc_feature (1)..(1) n = A, C, T, or G
50 ng 2
51 22 RNA unknown TracrRNA mate sequence example 1
51 guuuuuguac ucucaagauu ua 22
52 15 RNA unknown TracrRNA mate sequence example 2
52 guuuuuguac ucuca 15
53 12 RNA unknown TracrRNA mate sequence example 3
53 guuuuagagc ua 12
54 13 RNA unknown TracrRNA mate sequence example 4
54 guuuuagagc uag 13
55 60 RNA Streptococcus pyogenes
55 uagcaaguua aaauaaggcu aguccguuau caacuugaaa aaguggcacc gagucggugc 60
56 45 RNA Streptococcus pyogenes
56 uagcaaguua aaauaaggcu aguccguuau caacuugaaa aagug 45
57 32 RNA Streptococcus pyogenes
57 uagcaaguua aaauaaggcu aguccguuau ca 32
58 85 RNA Streptococcus thermophilus
58 uaaaucuugc agaagcuaca aagauaaggc uucaugccga aaucaacacc cugucauuuu 60
auggcagggu guuuucguua uuuaa 85
59 77 RNA Streptococcus thermophilus
59 ugcagaagcu acaaagauaa ggcuucaugc cgaaaucaac acccugucau uuuauggcag 60
gguguuuucg uuauuua 77
60 65 RNA Streptococcus thermophilus
60 ugcagaagcu acaaagauaa ggcuucaugc cgaaaucaac acccugucau uuuauggcag 60
ggugu 65
61 131 RNA artificial sequence gRNA example 1
misc_feature (1)..(20) n = A, C, U, or G
61 nnnnnnnnnn nnnnnnnnnn guuuuuguac ucucaagauu uagaaauaaa ucuugcagaa 60
gcuacaaaga uaaggcuuca ugccgaaauc aacacccugu cauuuuaugg caggguguuu 120
ucguuauuua a 131
62 117 RNA artificial sequence gRNA example 2
misc_feature (1)..(20) n = A, C, U, or G
62 nnnnnnnnnn nnnnnnnnnn guuuuuguac ucucagaaau gcagaagcua caaagauaag 60
gcuucaugcc gaaaucaaca cccugucauu uuauggcagg guguuuucgu uauuuaa 117
63 104 RNA artificial sequence gRNA example 3
misc_feature (1)..(20) n = A, C, U, or G
63 nnnnnnnnnn nnnnnnnnnn guuuuuguac ucucagaaau gcagaagcua caaagauaag 60
gcuucaugcc gaaaucaaca cccugucauu uuauggcagg gugu 104
64 99 RNA artificial sequence gRNA example 4
misc_feature (1)..(20) n = A, C, U, or G
64 nnnnnnnnnn nnnnnnnnnn guuuuuguac ucucagaaau agcaaguuaa aauaaggcua 60
guccguuauc aacuugaaaa aguggcaccg agucggugc 99
65 81 RNA artificial sequence gRNA example 5
misc_feature (1)..(20) n = A, C, U, or G
65 nnnnnnnnnn nnnnnnnnnn guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc 60
cguuaucaac uugaaaaagu g 81
66 68 RNA artificial sequence gRNA example 6
misc_feature (1)..(20) n = A, C, U, or G
66 nnnnnnnnnn nnnnnnnnnn guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc 60
cguuauca 68
67 100 RNA artificial sequence gRNA example 7
misc_feature (1)..(20) n = A, C, U, or G
67 nnnnnnnnnn nnnnnnnnnn guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc 60
cguuaucaac uugaaaaagu ggcaccgagu cggugcuuuu
100
68 10 PRT Human immunodeficiency virus
68 Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg
1 5 10
69 9 PRT Human immunodeficiency virus
69 Arg Lys Lys Arg Arg Gln Arg Arg Arg
1 5
70 8 PRT Human immunodeficiency virus
70 Arg Lys Lys Arg Arg Gln Arg Arg
1 5
71 16 PRT Drosophila melanogaster
71 Arg Gln Ile Lys Ile Trp Phe Gln Asn Arg Arg Met Lys Trp Lys Lys
1 5 10 15
72 11 PRT artificial sequence polyarginine CPP example 2
72 Thr His Arg Leu Pro Arg Arg Arg Arg Arg Arg
1 5 10
73 11 PRT artificial sequence polyarginine CPP example 3
73 Gly Gly Arg Arg Ala Arg Arg Arg Arg Arg Arg
1 5 10
74 17 PRT mus musculus
74 Leu Ile Ile Leu Arg Arg Arg Ile Arg Lys Gln Ala His Ala His Ser
1 5 10 15
Lys
75 10 PRT artificial sequence (KFF)3K CPP
75 Lys Phe Phe Lys Phe Phe Lys Phe Phe Lys
1 5 10
76 18 PRT artificial sequence MAP peptide CPP
76 Lys Leu Ala Leu Lys Leu Ala Leu Lys Ala Leu Lys Ala Ala Leu Lys
1 5 10 15
Leu Ala
77 12 PRT artificial sequence CPP (RRQRRTSKLMKR)
77 Arg Arg Gln Arg Arg Thr Ser Lys Leu Met Lys Arg
1 5 10
78 33 PRT artificial sequence CPP (KALAWEAKLAKALAKALAKHLAKALAKALKCEA)
78 Lys Ala Leu Ala Trp Glu Ala Lys Leu Ala Lys Ala Leu Ala Lys Ala
1 5 10 15
Leu Ala Lys His Leu Ala Lys Ala Leu Ala Lys Ala Leu Lys Cys Glu
20 25 30
Ala
79 6 PRT artificial sequence Proline-rich CPP repeat example 1
79 Val His Leu Pro Pro Pro
1 5
80 6 PRT artificial sequence Proline-rich CPP repeat example 2
80 Val His Arg Pro Pro Pro
1 5
81 27 PRT artificial sequence MPG peptide CPP
81 Gly Ala Leu Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly
1 5 10 15
Ala Trp Ser Gln Pro Lys Ser Lys Arg Lys Val
20 25
82 21 PRT artificial sequence Pep-1 peptide CPP
82 Lys Glu Thr Trp Trp Glu Thr Trp Trp Thr Glu Trp Ser Gln Pro Lys
1 5 10 15
Lys Lys Arg Lys Val
20
83 24 PRT Homo sapiens
83 Leu Gly Thr Tyr Thr Gln Asp Phe Asn Lys Phe His Thr Phe Pro Gln
1 5 10 15
Thr Ala Ile Gly Val Gly Ala Pro
20
84 18 PRT Homo sapiens
84 Cys Gly Asn Leu Ser Thr Cys Met Leu Gly Thr Tyr Thr Gln Asp Phe
1 5 10 15
Asn Lys
85 240 PRT Artificial sequence his tagged dsRED
85 Met Gly Ser Ser His His His His His His
Glu Phe Gly Gly Gly Gly1 5 10 15Ala Ser Ser Glu Asp Val Ile Lys Glu
Phe Met Arg Phe Lys Val Arg 20 25 30Met Glu Gly Ser Val Asn Gly His
Glu Phe Glu Ile Glu Gly Glu Gly 35 40 45Glu Gly Arg Pro Tyr Glu Gly
Thr Gln Thr Ala Lys Leu Lys Val Thr 50 55 60Lys Gly Gly Pro Leu Pro
Phe Ala Trp Asp Ile Leu Ser Pro Gln Phe65 70 75 80Gln Tyr Gly Ser
Lys Val Tyr Val Lys His Pro Ala Asp Ile Pro Asp 85 90 95Tyr Lys Lys
Leu Ser Phe Pro Glu Gly Phe Lys Trp Glu Arg Val Met 100 105 110Asn
Phe Glu Asp Gly Gly Val Val Thr Val Thr Gln Asp Ser Ser Leu 115 120
125Gln Asp Gly Ser Phe Ile Tyr Lys Val Lys Phe Ile Gly Val Asn Phe
130 135 140Pro Ser Asp Gly Pro Val Met Gln Lys Lys Thr Met Gly Trp
Glu Ala145 150 155 160Ser Thr Glu Arg Leu Tyr Pro Arg Asp Gly Val
Leu Lys Gly Glu Ile 165 170 175His Lys Ala Leu Lys Leu Lys Asp Gly
Gly His Tyr Leu Val Glu Phe 180 185 190Lys Ser Ile Tyr Met Ala Lys
Lys Pro Val Gln Leu Pro Gly Tyr Tyr 195 200 205Tyr Val Asp Ser Lys
Leu Asp Ile Thr Ser His Asn Glu Asp Tyr Thr 210 215 220Ile Val Glu
Gln Tyr Glu Arg Ala Glu Gly Arg His His Leu Phe Leu225 230 235
240
86 731 DNA Artificial sequence E. coli codon optimized dsRED
86ccatgggctc cagccatcat catcaccatc atgaattcgg aggtggcggt gcatcctcgg
60aggatgtgat taaagaattt atgcggttta aagtacgtat ggaaggatcg gtgaatggcc
120atgaatttga gattgagggt gaaggcgaag gccgcccgta cgaaggaact
caaacagcga 180aattaaaagt tacaaaagga ggtcctctgc cgtttgcctg
ggacatcttg agcccgcaat 240tccagtacgg ttccaaagtg tatgtaaaac
accctgcgga tattccggat tataaaaaac 300tgagttttcc cgaggggttt
aaatgggaac gggtgatgaa ttttgaggat ggtggagttg 360tcaccgtgac
ccaggactct agcttacaag acggtagttt catctacaaa gtaaaattta
420tcggcgtaaa cttcccatcg gacggccccg tcatgcagaa aaagacgatg
ggctgggaag 480ccagcaccga acgtttgtac ccacgggacg gcgttttgaa
aggggaaatc cataaggccc 540ttaaactgaa agacggtggt cactatctcg
tggagtttaa atcgatttat atggctaaaa 600aaccagtaca gcttccgggt
tattattacg ttgactccaa attggacatc acatcgcata 660atgaagatta
cacgattgtt gaacagtacg agcgcgccga gggccggcac catctgtttc
720tgtaaaagct t 731
87 4092 DNA Artificial sequence pBAD/HisB
87aagaaaccaa ttgtccatat tgcatcagac attgccgtca ctgcgtcttt tactggctct
60tctcgctaac caaaccggta accccgctta ttaaaagcat tctgtaacaa agcgggacca
120aagccatgac aaaaacgcgt aacaaaagtg tctataatca cggcagaaaa
gtccacattg 180attatttgca cggcgtcaca ctttgctatg ccatagcatt
tttatccata agattagcgg 240atcctacctg acgcttttta tcgcaactct
ctactgtttc tccatacccg ttttttgggc 300taacaggagg aattaaccat
ggggggttct catcatcatc atcatcatgg tatggctagc 360atgactggtg
gacagcaaat gggtcgggat ctgtacgacg atgacgataa ggatccgagc
420tcgagatctg cagctggtac catatgggaa ttcgaagctt ggctgttttg
gcggatgaga 480gaagattttc agcctgatac agattaaatc agaacgcaga
agcggtctga taaaacagaa 540tttgcctggc ggcagtagcg cggtggtccc
acctgacccc atgccgaact cagaagtgaa 600acgccgtagc gccgatggta
gtgtggggtc tccccatgcg agagtaggga actgccaggc 660atcaaataaa
acgaaaggct cagtcgaaag actgggcctt tcgttttatc tgttgtttgt
720cggtgaacgc tctcctgagt aggacaaatc cgccgggagc ggatttgaac
gttgcgaagc 780aacggcccgg agggtggcgg gcaggacgcc cgccataaac
tgccaggcat caaattaagc 840agaaggccat cctgacggat ggcctttttg
cgtttctaca aactcttttg tttatttttc 900taaatacatt caaatatgta
tccgctcatg agacaataac cctgataaat gcttcaataa 960tattgaaaaa
ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt
1020gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt
aaaagatgct 1080gaagatcagt tgggtgcacg agtgggttac atcgaactgg
atctcaacag cggtaagatc 1140cttgagagtt ttcgccccga agaacgtttt
ccaatgatga gcacttttaa agttctgcta 1200tgtggcgcgg tattatcccg
tgttgacgcc gggcaagagc aactcggtcg ccgcatacac 1260tattctcaga
atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc
1320atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac
tgcggccaac 1380ttacttctga caacgatcgg aggaccgaag gagctaaccg
cttttttgca caacatgggg 1440gatcatgtaa ctcgccttga tcgttgggaa
ccggagctga atgaagccat accaaacgac 1500gagcgtgaca ccacgatgcc
tgtagcaatg gcaacaacgt tgcgcaaact attaactggc 1560gaactactta
ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt
1620gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga
taaatctgga 1680gccggtgagc gtgggtctcg cggtatcatt gcagcactgg
ggccagatgg taagccctcc 1740cgtatcgtag ttatctacac gacggggagt
caggcaacta tggatgaacg aaatagacag 1800atcgctgaga taggtgcctc
actgattaag cattggtaac tgtcagacca agtttactca 1860tatatacttt
agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc
1920ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca
ctgagcgtca 1980gaccccgtag aaaagatcaa aggatcttct tgagatcctt
tttttctgcg cgtaatctgc 2040tgcttgcaaa caaaaaaacc accgctacca
gcggtggttt gtttgccgga tcaagagcta 2100ccaactcttt ttccgaaggt
aactggcttc agcagagcgc agataccaaa tactgtcctt 2160ctagtgtagc
cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc
2220gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg
tcttaccggg 2280ttggactcaa gacgatagtt accggataag gcgcagcggt
cgggctgaac ggggggttcg 2340tgcacacagc ccagcttgga gcgaacgacc
tacaccgaac tgagatacct acagcgtgag 2400ctatgagaaa gcgccacgct
tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc 2460agggtcggaa
caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat
2520agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg
ctcgtcaggg 2580gggcggagcc tatggaaaaa cgccagcaac gcggcctttt
tacggttcct ggccttttgc 2640tggccttttg ctcacatgtt ctttcctgcg
ttatcccctg attctgtgga taaccgtatt 2700accgcctttg agtgagctga
taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca 2760gtgagcgagg
aagcggaaga gcgcctgatg cggtattttc tccttacgca tctgtgcggt
2820atttcacacc gcatatggtg cactctcagt acaatctgct ctgatgccgc
atagttaagc 2880cagtatacac tccgctatcg ctacgtgact gggtcatggc
tgcgccccga cacccgccaa 2940cacccgctga cgcgccctga cgggcttgtc
tgctcccggc atccgcttac agacaagctg 3000tgaccgtctc cgggagctgc
atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga 3060ggcagcagat
caattcgcgc gcgaaggcga agcggcatgc ataatgtgcc tgtcaaatgg
3120acgaagcagg gattctgcaa accctatgct actccgtcaa gccgtcaatt
gtctgattcg 3180ttaccaatta tgacaacttg acggctacat cattcacttt
ttcttcacaa ccggcacgga 3240actcgctcgg gctggccccg gtgcattttt
taaatacccg cgagaaatag agttgatcgt 3300caaaaccaac attgcgaccg
acggtggcga taggcatccg ggtggtgctc aaaagcagct 3360tcgcctggct
gatacgttgg tcctcgcgcc agcttaagac gctaatccct aactgctggc
3420ggaaaagatg tgacagacgc gacggcgaca agcaaacatg ctgtgcgacg
ctggcgatat 3480caaaattgct gtctgccagg tgatcgctga tgtactgaca
agcctcgcgt acccgattat 3540ccatcggtgg atggagcgac tcgttaatcg
cttccatgcg ccgcagtaac aattgctcaa 3600gcagatttat cgccagcagc
tccgaatagc gcccttcccc ttgcccggcg ttaatgattt 3660gcccaaacag
gtcgctgaaa tgcggctggt gcgcttcatc cgggcgaaag aaccccgtat
3720tggcaaatat tgacggccag ttaagccatt catgccagta ggcgcgcgga
cgaaagtaaa 3780cccactggtg ataccattcg cgagcctccg gatgacgacc
gtagtgatga atctctcctg 3840gcgggaacag caaaatatca cccggtcggc
aaacaaattc tcgtccctga tttttcacca 3900ccccctgacc gcgaatggtg
agattgagaa tataaccttt cattcccagc ggtcggtcga 3960taaaaaaatc
gagataaccg ttggcctcaa tcggcgttaa acccgccacc agatgggcat
4020taaacgagta tcccggcagc aggggatcat tttgcgcttc agccatactt
ttcatactcc 4080cgccattcag ag 4092
88 4679 DNA Artificial sequence pRF161
88catgggctcc agccatcatc atcaccatca tgaattcgga ggtggcggtg catcctcgga
60ggatgtgatt aaagaattta tgcggtttaa agtacgtatg gaaggatcgg tgaatggcca
120tgaatttgag attgagggtg aaggcgaagg ccgcccgtac gaaggaactc
aaacagcgaa 180attaaaagtt acaaaaggag gtcctctgcc gtttgcctgg
gacatcttga gcccgcaatt 240ccagtacggt tccaaagtgt atgtaaaaca
ccctgcggat attccggatt ataaaaaact 300gagttttccc gaggggttta
aatgggaacg ggtgatgaat tttgaggatg gtggagttgt 360caccgtgacc
caggactcta gcttacaaga cggtagtttc atctacaaag taaaatttat
420cggcgtaaac ttcccatcgg acggccccgt catgcagaaa aagacgatgg
gctgggaagc 480cagcaccgaa cgtttgtacc cacgggacgg cgttttgaaa
ggggaaatcc ataaggccct 540taaactgaaa gacggtggtc actatctcgt
ggagtttaaa tcgatttata tggctaaaaa 600accagtacag cttccgggtt
attattacgt tgactccaaa ttggacatca catcgcataa 660tgaagattac
acgattgttg aacagtacga gcgcgccgag ggccggcacc atctgtttct
720gtaaaagctt ggctgttttg gcggatgaga gaagattttc agcctgatac
agattaaatc 780agaacgcaga agcggtctga taaaacagaa tttgcctggc
ggcagtagcg cggtggtccc 840acctgacccc atgccgaact cagaagtgaa
acgccgtagc gccgatggta gtgtggggtc 900tccccatgcg agagtaggga
actgccaggc atcaaataaa acgaaaggct cagtcgaaag 960actgggcctt
tcgttttatc tgttgtttgt cggtgaacgc tctcctgagt aggacaaatc
1020cgccgggagc ggatttgaac gttgcgaagc aacggcccgg agggtggcgg
gcaggacgcc 1080cgccataaac tgccaggcat caaattaagc agaaggccat
cctgacggat ggcctttttg 1140cgtttctaca aactcttttg tttatttttc
taaatacatt caaatatgta tccgctcatg 1200agacaataac cctgataaat
gcttcaataa tattgaaaaa ggaagagtat gagtattcaa 1260catttccgtg
tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac
1320ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg
agtgggttac 1380atcgaactgg atctcaacag cggtaagatc cttgagagtt
ttcgccccga agaacgtttt 1440ccaatgatga gcacttttaa agttctgcta
tgtggcgcgg tattatcccg tgttgacgcc 1500gggcaagagc aactcggtcg
ccgcatacac tattctcaga atgacttggt tgagtactca 1560ccagtcacag
aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc
1620ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg
aggaccgaag 1680gagctaaccg cttttttgca caacatgggg gatcatgtaa
ctcgccttga tcgttgggaa 1740ccggagctga atgaagccat accaaacgac
gagcgtgaca ccacgatgcc tgtagcaatg 1800gcaacaacgt tgcgcaaact
attaactggc gaactactta ctctagcttc ccggcaacaa 1860ttaatagact
ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg
1920gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg
cggtatcatt 1980gcagcactgg ggccagatgg taagccctcc cgtatcgtag
ttatctacac gacggggagt 2040caggcaacta tggatgaacg aaatagacag
atcgctgaga taggtgcctc actgattaag 2100cattggtaac tgtcagacca
agtttactca tatatacttt agattgattt aaaacttcat 2160ttttaattta
aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct
2220taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa
aggatcttct 2280tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa
caaaaaaacc accgctacca 2340gcggtggttt gtttgccgga tcaagagcta
ccaactcttt ttccgaaggt aactggcttc 2400agcagagcgc agataccaaa
tactgtcctt ctagtgtagc cgtagttagg ccaccacttc 2460aagaactctg
tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct
2520gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt
accggataag 2580gcgcagcggt cgggctgaac ggggggttcg tgcacacagc
ccagcttgga gcgaacgacc 2640tacaccgaac tgagatacct acagcgtgag
ctatgagaaa gcgccacgct tcccgaaggg 2700agaaaggcgg acaggtatcc
ggtaagcggc agggtcggaa caggagagcg cacgagggag 2760cttccagggg
gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt
2820gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa
cgccagcaac 2880gcggcctttt tacggttcct ggccttttgc tggccttttg
ctcacatgtt ctttcctgcg 2940ttatcccctg attctgtgga taaccgtatt
accgcctttg agtgagctga taccgctcgc 3000cgcagccgaa cgaccgagcg
cagcgagtca gtgagcgagg aagcggaaga gcgcctgatg 3060cggtattttc
tccttacgca tctgtgcggt atttcacacc gcatatggtg cactctcagt
3120acaatctgct ctgatgccgc atagttaagc cagtatacac tccgctatcg
ctacgtgact 3180gggtcatggc tgcgccccga cacccgccaa cacccgctga
cgcgccctga cgggcttgtc 3240tgctcccggc atccgcttac agacaagctg
tgaccgtctc cgggagctgc atgtgtcaga 3300ggttttcacc gtcatcaccg
aaacgcgcga ggcagcagat caattcgcgc gcgaaggcga 3360agcggcatgc
ataatgtgcc tgtcaaatgg acgaagcagg gattctgcaa accctatgct
3420actccgtcaa gccgtcaatt gtctgattcg ttaccaatta tgacaacttg
acggctacat 3480cattcacttt ttcttcacaa ccggcacgga actcgctcgg
gctggccccg gtgcattttt 3540taaatacccg cgagaaatag agttgatcgt
caaaaccaac attgcgaccg acggtggcga 3600taggcatccg ggtggtgctc
aaaagcagct tcgcctggct gatacgttgg tcctcgcgcc 3660agcttaagac
gctaatccct aactgctggc ggaaaagatg tgacagacgc gacggcgaca
3720agcaaacatg ctgtgcgacg ctggcgatat caaaattgct gtctgccagg
tgatcgctga 3780tgtactgaca agcctcgcgt acccgattat ccatcggtgg
atggagcgac tcgttaatcg 3840cttccatgcg ccgcagtaac aattgctcaa
gcagatttat cgccagcagc tccgaatagc 3900gcccttcccc ttgcccggcg
ttaatgattt gcccaaacag gtcgctgaaa tgcggctggt 3960gcgcttcatc
cgggcgaaag aaccccgtat tggcaaatat tgacggccag ttaagccatt
4020catgccagta ggcgcgcgga cgaaagtaaa cccactggtg ataccattcg
cgagcctccg 4080gatgacgacc gtagtgatga atctctcctg gcgggaacag
caaaatatca cccggtcggc 4140aaacaaattc tcgtccctga tttttcacca
ccccctgacc gcgaatggtg agattgagaa 4200tataaccttt cattcccagc
ggtcggtcga taaaaaaatc gagataaccg ttggcctcaa 4260tcggcgttaa
acccgccacc agatgggcat taaacgagta tcccggcagc aggggatcat
4320tttgcgcttc agccatactt ttcatactcc cgccattcag agaagaaacc
aattgtccat 4380attgcatcag acattgccgt cactgcgtct tttactggct
cttctcgcta accaaaccgg 4440taaccccgct tattaaaagc attctgtaac
aaagcgggac caaagccatg acaaaaacgc 4500gtaacaaaag tgtctataat
cacggcagaa aagtccacat tgattatttg cacggcgtca 4560cactttgcta
tgccatagca tttttatcca taagattagc ggatcctacc tgacgctttt
4620tatcgcaact ctctactgtt tctccatacc cgttttttgg gctaacagga
ggaattaac 4679
89 20 PRT Artificial sequence TAT
89 Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg Pro Pro Gln Pro Lys Lys
1 5 10 15
Lys Arg Lys Val
20
90 18 PRT Artificial sequence TLM
90 Pro Leu Ser Ser Ile Phe Ser Arg Ile Gly Asp Pro Lys Lys Lys Arg
1 5 10 15
Lys Val
91 23 PRT Artificial sequence MPG1
91 Ala Leu Phe Leu Gly Gln Leu Gly Ala Ala Gly Ser Thr Met Gly Ala
1 5 10 15
Pro Lys Lys Lys Arg Lys Val
20
92 21 PRT Artificial sequence pep1
92 Lys Glu Thr Trp Trp Glu Thr Trp Trp Thr Glu Trp Ser Gln Pro Lys
1 5 10 15
Lys Lys Arg Lys Val
20
93 7 PRT Artificial sequence CFFKDEL
93 Cys Phe Phe Lys Asp Glu Leu
1 5
94 98 DNA Artificial sequence his-TAT E. coli optimized
94 ccatggggca tcaccatcat catcacggcc gcaaaaaacg tcgtcagcgc cggcgtccgc 60
cccagccgaa aaaacggaaa gtgggcggcg gcgaattc 98
95 89 DNA Artificial sequence his-TLM E. coli optimized
95 ccatggggca tcaccatcat catcatccgt taagctcgat cttttctcgt atcggtgatc 60
cgccaaaaaa gaaacgcaaa gtagaattc 89
96 104 DNA Artificial sequence his-MPG1 E. coli optimized
96 ccatggggca tcatcatcac catcacggcg ccctgttctt aggccagctg ggcgccgcgg 60
gatccacgat gggtgcgccg aagaaaaagc gcaaagttga attc 104
97 95 DNA Artificial sequence his-pep1 E. coli optimized
97 ccatggggca ccatcaccat caccataaag aaacttggtg ggagacttgg tggaccgaat 60
ggtcccagcc gaagaaaaaa cgcaaggttg aattc 95
98 51 DNA Artificial sequence his-CFFKDEL E. coli optimized
98 ccatggggca tcaccatcac caccattgtt ttttcaaaga cgaactggaa t 51
99 4739 DNA Artificial sequence pRF224
99 catggggcat caccatcatc
atcacggccg caaaaaacgt cgtcagcgcc ggcgtccgcc 60ccagccgaaa aaacggaaag
tgggcggcgg cgaattcgga ggtggcggtg catcctcgga 120ggatgtgatt
aaagaattta tgcggtttaa agtacgtatg gaaggatcgg tgaatggcca
180tgaatttgag attgagggtg aaggcgaagg ccgcccgtac gaaggaactc
aaacagcgaa 240attaaaagtt acaaaaggag gtcctctgcc gtttgcctgg
gacatcttga gcccgcaatt 300ccagtacggt tccaaagtgt atgtaaaaca
ccctgcggat attccggatt ataaaaaact 360gagttttccc gaggggttta
aatgggaacg ggtgatgaat tttgaggatg gtggagttgt 420caccgtgacc
caggactcta gcttacaaga cggtagtttc atctacaaag taaaatttat
480cggcgtaaac ttcccatcgg acggccccgt catgcagaaa aagacgatgg
gctgggaagc 540cagcaccgaa cgtttgtacc cacgggacgg cgttttgaaa
ggggaaatcc ataaggccct 600taaactgaaa gacggtggtc actatctcgt
ggagtttaaa tcgatttata tggctaaaaa 660accagtacag cttccgggtt
attattacgt tgactccaaa ttggacatca catcgcataa 720tgaagattac
acgattgttg aacagtacga gcgcgccgag ggccggcacc atctgtttct
780gtaaaagctt ggctgttttg gcggatgaga gaagattttc agcctgatac
agattaaatc 840agaacgcaga agcggtctga taaaacagaa tttgcctggc
ggcagtagcg cggtggtccc 900acctgacccc atgccgaact cagaagtgaa
acgccgtagc gccgatggta gtgtggggtc 960tccccatgcg agagtaggga
actgccaggc atcaaataaa acgaaaggct cagtcgaaag 1020actgggcctt
tcgttttatc tgttgtttgt cggtgaacgc tctcctgagt aggacaaatc
1080cgccgggagc ggatttgaac gttgcgaagc aacggcccgg agggtggcgg
gcaggacgcc 1140cgccataaac tgccaggcat caaattaagc agaaggccat
cctgacggat ggcctttttg 1200cgtttctaca aactcttttg tttatttttc
taaatacatt caaatatgta tccgctcatg 1260agacaataac cctgataaat
gcttcaataa tattgaaaaa ggaagagtat gagtattcaa 1320catttccgtg
tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac
1380ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg
agtgggttac 1440atcgaactgg atctcaacag cggtaagatc cttgagagtt
ttcgccccga agaacgtttt 1500ccaatgatga gcacttttaa agttctgcta
tgtggcgcgg tattatcccg tgttgacgcc 1560gggcaagagc aactcggtcg
ccgcatacac tattctcaga atgacttggt tgagtactca 1620ccagtcacag
aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc
1680ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg
aggaccgaag 1740gagctaaccg cttttttgca caacatgggg gatcatgtaa
ctcgccttga tcgttgggaa 1800ccggagctga atgaagccat accaaacgac
gagcgtgaca ccacgatgcc tgtagcaatg 1860gcaacaacgt tgcgcaaact
attaactggc gaactactta ctctagcttc ccggcaacaa 1920ttaatagact
ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg
1980gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg
cggtatcatt 2040gcagcactgg ggccagatgg taagccctcc cgtatcgtag
ttatctacac gacggggagt 2100caggcaacta tggatgaacg aaatagacag
atcgctgaga taggtgcctc actgattaag 2160cattggtaac tgtcagacca
agtttactca tatatacttt agattgattt aaaacttcat 2220ttttaattta
aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct
2280taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa
aggatcttct 2340tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa
caaaaaaacc accgctacca 2400gcggtggttt gtttgccgga tcaagagcta
ccaactcttt ttccgaaggt aactggcttc 2460agcagagcgc agataccaaa
tactgtcctt ctagtgtagc cgtagttagg ccaccacttc 2520aagaactctg
tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct
2580gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt
accggataag 2640gcgcagcggt cgggctgaac ggggggttcg tgcacacagc
ccagcttgga gcgaacgacc 2700tacaccgaac tgagatacct acagcgtgag
ctatgagaaa gcgccacgct tcccgaaggg 2760agaaaggcgg acaggtatcc
ggtaagcggc agggtcggaa caggagagcg cacgagggag 2820cttccagggg
gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt
2880gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa
cgccagcaac 2940gcggcctttt tacggttcct ggccttttgc tggccttttg
ctcacatgtt ctttcctgcg 3000ttatcccctg attctgtgga taaccgtatt
accgcctttg agtgagctga taccgctcgc 3060cgcagccgaa cgaccgagcg
cagcgagtca gtgagcgagg aagcggaaga gcgcctgatg 3120cggtattttc
tccttacgca tctgtgcggt atttcacacc gcatatggtg cactctcagt
3180acaatctgct ctgatgccgc atagttaagc cagtatacac tccgctatcg
ctacgtgact 3240gggtcatggc tgcgccccga cacccgccaa cacccgctga
cgcgccctga cgggcttgtc 3300tgctcccggc atccgcttac agacaagctg
tgaccgtctc cgggagctgc atgtgtcaga 3360ggttttcacc gtcatcaccg
aaacgcgcga ggcagcagat caattcgcgc gcgaaggcga 3420agcggcatgc
ataatgtgcc tgtcaaatgg acgaagcagg gattctgcaa accctatgct
3480actccgtcaa gccgtcaatt gtctgattcg ttaccaatta tgacaacttg
acggctacat 3540cattcacttt ttcttcacaa ccggcacgga actcgctcgg
gctggccccg gtgcattttt 3600taaatacccg cgagaaatag agttgatcgt
caaaaccaac attgcgaccg acggtggcga 3660taggcatccg ggtggtgctc
aaaagcagct tcgcctggct gatacgttgg tcctcgcgcc 3720agcttaagac
gctaatccct aactgctggc ggaaaagatg tgacagacgc gacggcgaca
3780agcaaacatg ctgtgcgacg ctggcgatat caaaattgct gtctgccagg
tgatcgctga 3840tgtactgaca agcctcgcgt acccgattat ccatcggtgg
atggagcgac tcgttaatcg 3900cttccatgcg ccgcagtaac aattgctcaa
gcagatttat cgccagcagc tccgaatagc 3960gcccttcccc ttgcccggcg
ttaatgattt gcccaaacag gtcgctgaaa tgcggctggt 4020gcgcttcatc
cgggcgaaag aaccccgtat tggcaaatat tgacggccag ttaagccatt
4080catgccagta ggcgcgcgga cgaaagtaaa cccactggtg ataccattcg
cgagcctccg 4140gatgacgacc gtagtgatga atctctcctg gcgggaacag
caaaatatca cccggtcggc 4200aaacaaattc tcgtccctga tttttcacca
ccccctgacc gcgaatggtg agattgagaa 4260tataaccttt cattcccagc
ggtcggtcga taaaaaaatc gagataaccg ttggcctcaa 4320tcggcgttaa
acccgccacc agatgggcat taaacgagta tcccggcagc aggggatcat
4380tttgcgcttc agccatactt ttcatactcc cgccattcag agaagaaacc
aattgtccat 4440attgcatcag acattgccgt cactgcgtct tttactggct
cttctcgcta accaaaccgg 4500taaccccgct tattaaaagc attctgtaac
aaagcgggac caaagccatg acaaaaacgc 4560gtaacaaaag tgtctataat
cacggcagaa aagtccacat tgattatttg cacggcgtca 4620cactttgcta
tgccatagca tttttatcca taagattagc ggatcctacc tgacgctttt
4680tatcgcaact ctctactgtt tctccatacc cgttttttgg gctaacagga
ggaattaac 4739
100 4730 DNA Artificial sequence pRF214
100 catggggcat
caccatcatc atcatccgtt aagctcgatc ttttctcgta tcggtgatcc 60gccaaaaaag
aaacgcaaag tagaattcgg aggtggcggt gcatcctcgg aggatgtgat
120taaagaattt atgcggttta aagtacgtat ggaaggatcg gtgaatggcc
atgaatttga 180gattgagggt gaaggcgaag gccgcccgta cgaaggaact
caaacagcga aattaaaagt 240tacaaaagga ggtcctctgc cgtttgcctg
ggacatcttg agcccgcaat tccagtacgg 300ttccaaagtg tatgtaaaac
accctgcgga tattccggat tataaaaaac tgagttttcc 360cgaggggttt
aaatgggaac gggtgatgaa ttttgaggat ggtggagttg tcaccgtgac
420ccaggactct agcttacaag acggtagttt catctacaaa gtaaaattta
tcggcgtaaa 480cttcccatcg gacggccccg tcatgcagaa aaagacgatg
ggctgggaag ccagcaccga 540acgtttgtac ccacgggacg gcgttttgaa
aggggaaatc cataaggccc ttaaactgaa 600agacggtggt cactatctcg
tggagtttaa atcgatttat atggctaaaa aaccagtaca 660gcttccgggt
tattattacg ttgactccaa attggacatc acatcgcata atgaagatta
720cacgattgtt gaacagtacg agcgcgccga gggccggcac catctgtttc
tgtaaaagct 780tggctgtttt ggcggatgag agaagatttt cagcctgata
cagattaaat cagaacgcag 840aagcggtctg ataaaacaga atttgcctgg
cggcagtagc gcggtggtcc cacctgaccc 900catgccgaac tcagaagtga
aacgccgtag cgccgatggt agtgtggggt ctccccatgc 960gagagtaggg
aactgccagg catcaaataa aacgaaaggc tcagtcgaaa gactgggcct
1020ttcgttttat ctgttgtttg tcggtgaacg ctctcctgag taggacaaat
ccgccgggag 1080cggatttgaa cgttgcgaag caacggcccg gagggtggcg
ggcaggacgc ccgccataaa 1140ctgccaggca tcaaattaag cagaaggcca
tcctgacgga tggccttttt gcgtttctac 1200aaactctttt gtttattttt
ctaaatacat tcaaatatgt atccgctcat gagacaataa 1260ccctgataaa
tgcttcaata atattgaaaa aggaagagta tgagtattca acatttccgt
1320gtcgccctta ttcccttttt tgcggcattt tgccttcctg tttttgctca
cccagaaacg 1380ctggtgaaag taaaagatgc tgaagatcag ttgggtgcac
gagtgggtta catcgaactg 1440gatctcaaca gcggtaagat ccttgagagt
tttcgccccg aagaacgttt tccaatgatg 1500agcactttta aagttctgct
atgtggcgcg gtattatccc gtgttgacgc cgggcaagag 1560caactcggtc
gccgcataca ctattctcag aatgacttgg ttgagtactc accagtcaca
1620gaaaagcatc ttacggatgg catgacagta agagaattat gcagtgctgc
cataaccatg 1680agtgataaca ctgcggccaa cttacttctg acaacgatcg
gaggaccgaa ggagctaacc 1740gcttttttgc acaacatggg ggatcatgta
actcgccttg atcgttggga accggagctg 1800aatgaagcca taccaaacga
cgagcgtgac accacgatgc ctgtagcaat ggcaacaacg 1860ttgcgcaaac
tattaactgg cgaactactt actctagctt cccggcaaca attaatagac
1920tggatggagg cggataaagt tgcaggacca cttctgcgct cggcccttcc
ggctggctgg 1980tttattgctg ataaatctgg agccggtgag cgtgggtctc
gcggtatcat tgcagcactg 2040gggccagatg gtaagccctc ccgtatcgta
gttatctaca cgacggggag
tcaggcaact 2100atggatgaac gaaatagaca gatcgctgag ataggtgcct
cactgattaa gcattggtaa 2160ctgtcagacc aagtttactc atatatactt
tagattgatt taaaacttca tttttaattt 2220aaaaggatct aggtgaagat
cctttttgat aatctcatga ccaaaatccc ttaacgtgag 2280ttttcgttcc
actgagcgtc agaccccgta gaaaagatca aaggatcttc ttgagatcct
2340ttttttctgc gcgtaatctg ctgcttgcaa acaaaaaaac caccgctacc
agcggtggtt 2400tgtttgccgg atcaagagct accaactctt tttccgaagg
taactggctt cagcagagcg 2460cagataccaa atactgtcct tctagtgtag
ccgtagttag gccaccactt caagaactct 2520gtagcaccgc ctacatacct
cgctctgcta atcctgttac cagtggctgc tgccagtggc 2580gataagtcgt
gtcttaccgg gttggactca agacgatagt taccggataa ggcgcagcgg
2640tcgggctgaa cggggggttc gtgcacacag cccagcttgg agcgaacgac
ctacaccgaa 2700ctgagatacc tacagcgtga gctatgagaa agcgccacgc
ttcccgaagg gagaaaggcg 2760gacaggtatc cggtaagcgg cagggtcgga
acaggagagc gcacgaggga gcttccaggg 2820ggaaacgcct ggtatcttta
tagtcctgtc gggtttcgcc acctctgact tgagcgtcga 2880tttttgtgat
gctcgtcagg ggggcggagc ctatggaaaa acgccagcaa cgcggccttt
2940ttacggttcc tggccttttg ctggcctttt gctcacatgt tctttcctgc
gttatcccct 3000gattctgtgg ataaccgtat taccgccttt gagtgagctg
ataccgctcg ccgcagccga 3060acgaccgagc gcagcgagtc agtgagcgag
gaagcggaag agcgcctgat gcggtatttt 3120ctccttacgc atctgtgcgg
tatttcacac cgcatatggt gcactctcag tacaatctgc 3180tctgatgccg
catagttaag ccagtataca ctccgctatc gctacgtgac tgggtcatgg
3240ctgcgccccg acacccgcca acacccgctg acgcgccctg acgggcttgt
ctgctcccgg 3300catccgctta cagacaagct gtgaccgtct ccgggagctg
catgtgtcag aggttttcac 3360cgtcatcacc gaaacgcgcg aggcagcaga
tcaattcgcg cgcgaaggcg aagcggcatg 3420cataatgtgc ctgtcaaatg
gacgaagcag ggattctgca aaccctatgc tactccgtca 3480agccgtcaat
tgtctgattc gttaccaatt atgacaactt gacggctaca tcattcactt
3540tttcttcaca accggcacgg aactcgctcg ggctggcccc ggtgcatttt
ttaaataccc 3600gcgagaaata gagttgatcg tcaaaaccaa cattgcgacc
gacggtggcg ataggcatcc 3660gggtggtgct caaaagcagc ttcgcctggc
tgatacgttg gtcctcgcgc cagcttaaga 3720cgctaatccc taactgctgg
cggaaaagat gtgacagacg cgacggcgac aagcaaacat 3780gctgtgcgac
gctggcgata tcaaaattgc tgtctgccag gtgatcgctg atgtactgac
3840aagcctcgcg tacccgatta tccatcggtg gatggagcga ctcgttaatc
gcttccatgc 3900gccgcagtaa caattgctca agcagattta tcgccagcag
ctccgaatag cgcccttccc 3960cttgcccggc gttaatgatt tgcccaaaca
ggtcgctgaa atgcggctgg tgcgcttcat 4020ccgggcgaaa gaaccccgta
ttggcaaata ttgacggcca gttaagccat tcatgccagt 4080aggcgcgcgg
acgaaagtaa acccactggt gataccattc gcgagcctcc ggatgacgac
4140cgtagtgatg aatctctcct ggcgggaaca gcaaaatatc acccggtcgg
caaacaaatt 4200ctcgtccctg atttttcacc accccctgac cgcgaatggt
gagattgaga atataacctt 4260tcattcccag cggtcggtcg ataaaaaaat
cgagataacc gttggcctca atcggcgtta 4320aacccgccac cagatgggca
ttaaacgagt atcccggcag caggggatca ttttgcgctt 4380cagccatact
tttcatactc ccgccattca gagaagaaac caattgtcca tattgcatca
4440gacattgccg tcactgcgtc ttttactggc tcttctcgct aaccaaaccg
gtaaccccgc 4500ttattaaaag cattctgtaa caaagcggga ccaaagccat
gacaaaaacg cgtaacaaaa 4560gtgtctataa tcacggcaga aaagtccaca
ttgattattt gcacggcgtc acactttgct 4620atgccatagc atttttatcc
ataagattag cggatcctac ctgacgcttt ttatcgcaac 4680tctctactgt
ttctccatac ccgttttttg ggctaacagg aggaattaac
4730
101 4745 DNA Artificial sequence pRF213
101 catggggcat catcatcacc
atcacggcgc cctgttctta ggccagctgg gcgccgcggg 60atccacgatg ggtgcgccga
agaaaaagcg caaagttgaa ttcggaggtg gcggtgcatc 120ctcggaggat
gtgattaaag aatttatgcg gtttaaagta cgtatggaag gatcggtgaa
180tggccatgaa tttgagattg agggtgaagg cgaaggccgc ccgtacgaag
gaactcaaac 240agcgaaatta aaagttacaa aaggaggtcc tctgccgttt
gcctgggaca tcttgagccc 300gcaattccag tacggttcca aagtgtatgt
aaaacaccct gcggatattc cggattataa 360aaaactgagt tttcccgagg
ggtttaaatg ggaacgggtg atgaattttg aggatggtgg 420agttgtcacc
gtgacccagg actctagctt acaagacggt agtttcatct acaaagtaaa
480atttatcggc gtaaacttcc catcggacgg ccccgtcatg cagaaaaaga
cgatgggctg 540ggaagccagc accgaacgtt tgtacccacg ggacggcgtt
ttgaaagggg aaatccataa 600ggcccttaaa ctgaaagacg gtggtcacta
tctcgtggag tttaaatcga tttatatggc 660taaaaaacca gtacagcttc
cgggttatta ttacgttgac tccaaattgg acatcacatc 720gcataatgaa
gattacacga ttgttgaaca gtacgagcgc gccgagggcc ggcaccatct
780gtttctgtaa aagcttggct gttttggcgg atgagagaag attttcagcc
tgatacagat 840taaatcagaa cgcagaagcg gtctgataaa acagaatttg
cctggcggca gtagcgcggt 900ggtcccacct gaccccatgc cgaactcaga
agtgaaacgc cgtagcgccg atggtagtgt 960ggggtctccc catgcgagag
tagggaactg ccaggcatca aataaaacga aaggctcagt 1020cgaaagactg
ggcctttcgt tttatctgtt gtttgtcggt gaacgctctc ctgagtagga
1080caaatccgcc gggagcggat ttgaacgttg cgaagcaacg gcccggaggg
tggcgggcag 1140gacgcccgcc ataaactgcc aggcatcaaa ttaagcagaa
ggccatcctg acggatggcc 1200tttttgcgtt tctacaaact cttttgttta
tttttctaaa tacattcaaa tatgtatccg 1260ctcatgagac aataaccctg
ataaatgctt caataatatt gaaaaaggaa gagtatgagt 1320attcaacatt
tccgtgtcgc ccttattccc ttttttgcgg cattttgcct tcctgttttt
1380gctcacccag aaacgctggt gaaagtaaaa gatgctgaag atcagttggg
tgcacgagtg 1440ggttacatcg aactggatct caacagcggt aagatccttg
agagttttcg ccccgaagaa 1500cgttttccaa tgatgagcac ttttaaagtt
ctgctatgtg gcgcggtatt atcccgtgtt 1560gacgccgggc aagagcaact
cggtcgccgc atacactatt ctcagaatga cttggttgag 1620tactcaccag
tcacagaaaa gcatcttacg gatggcatga cagtaagaga attatgcagt
1680gctgccataa ccatgagtga taacactgcg gccaacttac ttctgacaac
gatcggagga 1740ccgaaggagc taaccgcttt tttgcacaac atgggggatc
atgtaactcg ccttgatcgt 1800tgggaaccgg agctgaatga agccatacca
aacgacgagc gtgacaccac gatgcctgta 1860gcaatggcaa caacgttgcg
caaactatta actggcgaac tacttactct agcttcccgg 1920caacaattaa
tagactggat ggaggcggat aaagttgcag gaccacttct gcgctcggcc
1980cttccggctg gctggtttat tgctgataaa tctggagccg gtgagcgtgg
gtctcgcggt 2040atcattgcag cactggggcc agatggtaag ccctcccgta
tcgtagttat ctacacgacg 2100gggagtcagg caactatgga tgaacgaaat
agacagatcg ctgagatagg tgcctcactg 2160attaagcatt ggtaactgtc
agaccaagtt tactcatata tactttagat tgatttaaaa 2220cttcattttt
aatttaaaag gatctaggtg aagatccttt ttgataatct catgaccaaa
2280atcccttaac gtgagttttc gttccactga gcgtcagacc ccgtagaaaa
gatcaaagga 2340tcttcttgag atcctttttt tctgcgcgta atctgctgct
tgcaaacaaa aaaaccaccg 2400ctaccagcgg tggtttgttt gccggatcaa
gagctaccaa ctctttttcc gaaggtaact 2460ggcttcagca gagcgcagat
accaaatact gtccttctag tgtagccgta gttaggccac 2520cacttcaaga
actctgtagc accgcctaca tacctcgctc tgctaatcct gttaccagtg
2580gctgctgcca gtggcgataa gtcgtgtctt accgggttgg actcaagacg
atagttaccg 2640gataaggcgc agcggtcggg ctgaacgggg ggttcgtgca
cacagcccag cttggagcga 2700acgacctaca ccgaactgag atacctacag
cgtgagctat gagaaagcgc cacgcttccc 2760gaagggagaa aggcggacag
gtatccggta agcggcaggg tcggaacagg agagcgcacg 2820agggagcttc
cagggggaaa cgcctggtat ctttatagtc ctgtcgggtt tcgccacctc
2880tgacttgagc gtcgattttt gtgatgctcg tcaggggggc ggagcctatg
gaaaaacgcc 2940agcaacgcgg cctttttacg gttcctggcc ttttgctggc
cttttgctca catgttcttt 3000cctgcgttat cccctgattc tgtggataac
cgtattaccg cctttgagtg agctgatacc 3060gctcgccgca gccgaacgac
cgagcgcagc gagtcagtga gcgaggaagc ggaagagcgc 3120ctgatgcggt
attttctcct tacgcatctg tgcggtattt cacaccgcat atggtgcact
3180ctcagtacaa tctgctctga tgccgcatag ttaagccagt atacactccg
ctatcgctac 3240gtgactgggt catggctgcg ccccgacacc cgccaacacc
cgctgacgcg ccctgacggg 3300cttgtctgct cccggcatcc gcttacagac
aagctgtgac cgtctccggg agctgcatgt 3360gtcagaggtt ttcaccgtca
tcaccgaaac gcgcgaggca gcagatcaat tcgcgcgcga 3420aggcgaagcg
gcatgcataa tgtgcctgtc aaatggacga agcagggatt ctgcaaaccc
3480tatgctactc cgtcaagccg tcaattgtct gattcgttac caattatgac
aacttgacgg 3540ctacatcatt cactttttct tcacaaccgg cacggaactc
gctcgggctg gccccggtgc 3600attttttaaa tacccgcgag aaatagagtt
gatcgtcaaa accaacattg cgaccgacgg 3660tggcgatagg catccgggtg
gtgctcaaaa gcagcttcgc ctggctgata cgttggtcct 3720cgcgccagct
taagacgcta atccctaact gctggcggaa aagatgtgac agacgcgacg
3780gcgacaagca aacatgctgt gcgacgctgg cgatatcaaa attgctgtct
gccaggtgat 3840cgctgatgta ctgacaagcc tcgcgtaccc gattatccat
cggtggatgg agcgactcgt 3900taatcgcttc catgcgccgc agtaacaatt
gctcaagcag atttatcgcc agcagctccg 3960aatagcgccc ttccccttgc
ccggcgttaa tgatttgccc aaacaggtcg ctgaaatgcg 4020gctggtgcgc
ttcatccggg cgaaagaacc ccgtattggc aaatattgac ggccagttaa
4080gccattcatg ccagtaggcg cgcggacgaa agtaaaccca ctggtgatac
cattcgcgag 4140cctccggatg acgaccgtag tgatgaatct ctcctggcgg
gaacagcaaa atatcacccg 4200gtcggcaaac aaattctcgt ccctgatttt
tcaccacccc ctgaccgcga atggtgagat 4260tgagaatata acctttcatt
cccagcggtc ggtcgataaa aaaatcgaga taaccgttgg 4320cctcaatcgg
cgttaaaccc gccaccagat gggcattaaa cgagtatccc ggcagcaggg
4380gatcattttg cgcttcagcc atacttttca tactcccgcc attcagagaa
gaaaccaatt 4440gtccatattg catcagacat tgccgtcact gcgtctttta
ctggctcttc tcgctaacca 4500aaccggtaac cccgcttatt aaaagcattc
tgtaacaaag cgggaccaaa gccatgacaa 4560aaacgcgtaa caaaagtgtc
tataatcacg gcagaaaagt ccacattgat tatttgcacg 4620gcgtcacact
ttgctatgcc atagcatttt tatccataag attagcggat cctacctgac
4680gctttttatc gcaactctct actgtttctc catacccgtt ttttgggcta
acaggaggaa 4740ttaac 4745
SEQ ID NO: 102 | Length: 4736 | Type: DNA | Organism: Artificial sequence | Other information: pRF217
catggggcac catcaccatc accataaaga aacttggtgg gagacttggt
ggaccgaatg 60gtcccagccg aagaaaaaac gcaaggttga attcggaggt ggcggtgcat
cctcggagga 120tgtgattaaa gaatttatgc ggtttaaagt acgtatggaa
ggatcggtga atggccatga 180atttgagatt gagggtgaag gcgaaggccg
cccgtacgaa ggaactcaaa cagcgaaatt 240aaaagttaca aaaggaggtc
ctctgccgtt tgcctgggac atcttgagcc cgcaattcca 300gtacggttcc
aaagtgtatg taaaacaccc tgcggatatt ccggattata aaaaactgag
360ttttcccgag gggtttaaat gggaacgggt gatgaatttt gaggatggtg
gagttgtcac 420cgtgacccag gactctagct tacaagacgg tagtttcatc
tacaaagtaa aatttatcgg 480cgtaaacttc ccatcggacg gccccgtcat
gcagaaaaag acgatgggct gggaagccag 540caccgaacgt ttgtacccac
gggacggcgt tttgaaaggg gaaatccata aggcccttaa 600actgaaagac
ggtggtcact atctcgtgga gtttaaatcg atttatatgg ctaaaaaacc
660agtacagctt ccgggttatt attacgttga ctccaaattg gacatcacat
cgcataatga 720agattacacg attgttgaac agtacgagcg cgccgagggc
cggcaccatc tgtttctgta 780aaagcttggc tgttttggcg gatgagagaa
gattttcagc ctgatacaga ttaaatcaga 840acgcagaagc ggtctgataa
aacagaattt gcctggcggc agtagcgcgg tggtcccacc 900tgaccccatg
ccgaactcag aagtgaaacg ccgtagcgcc gatggtagtg tggggtctcc
960ccatgcgaga gtagggaact gccaggcatc aaataaaacg aaaggctcag
tcgaaagact 1020gggcctttcg ttttatctgt tgtttgtcgg tgaacgctct
cctgagtagg acaaatccgc 1080cgggagcgga tttgaacgtt gcgaagcaac
ggcccggagg gtggcgggca ggacgcccgc 1140cataaactgc caggcatcaa
attaagcaga aggccatcct gacggatggc ctttttgcgt 1200ttctacaaac
tcttttgttt atttttctaa atacattcaa atatgtatcc gctcatgaga
1260caataaccct gataaatgct tcaataatat tgaaaaagga agagtatgag
tattcaacat 1320ttccgtgtcg cccttattcc cttttttgcg gcattttgcc
ttcctgtttt tgctcaccca 1380gaaacgctgg tgaaagtaaa agatgctgaa
gatcagttgg gtgcacgagt gggttacatc 1440gaactggatc tcaacagcgg
taagatcctt gagagttttc gccccgaaga acgttttcca 1500atgatgagca
cttttaaagt tctgctatgt ggcgcggtat tatcccgtgt tgacgccggg
1560caagagcaac tcggtcgccg catacactat tctcagaatg acttggttga
gtactcacca 1620gtcacagaaa agcatcttac ggatggcatg acagtaagag
aattatgcag tgctgccata 1680accatgagtg ataacactgc ggccaactta
cttctgacaa cgatcggagg accgaaggag 1740ctaaccgctt ttttgcacaa
catgggggat catgtaactc gccttgatcg ttgggaaccg 1800gagctgaatg
aagccatacc aaacgacgag cgtgacacca cgatgcctgt agcaatggca
1860acaacgttgc gcaaactatt aactggcgaa ctacttactc tagcttcccg
gcaacaatta 1920atagactgga tggaggcgga taaagttgca ggaccacttc
tgcgctcggc ccttccggct 1980ggctggttta ttgctgataa atctggagcc
ggtgagcgtg ggtctcgcgg tatcattgca 2040gcactggggc cagatggtaa
gccctcccgt atcgtagtta tctacacgac ggggagtcag 2100gcaactatgg
atgaacgaaa tagacagatc gctgagatag gtgcctcact gattaagcat
2160tggtaactgt cagaccaagt ttactcatat atactttaga ttgatttaaa
acttcatttt 2220taatttaaaa ggatctaggt gaagatcctt tttgataatc
tcatgaccaa aatcccttaa 2280cgtgagtttt cgttccactg agcgtcagac
cccgtagaaa agatcaaagg atcttcttga 2340gatccttttt ttctgcgcgt
aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg 2400gtggtttgtt
tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc
2460agagcgcaga taccaaatac tgtccttcta gtgtagccgt agttaggcca
ccacttcaag 2520aactctgtag caccgcctac atacctcgct ctgctaatcc
tgttaccagt ggctgctgcc 2580agtggcgata agtcgtgtct taccgggttg
gactcaagac gatagttacc ggataaggcg 2640cagcggtcgg gctgaacggg
gggttcgtgc acacagccca gcttggagcg aacgacctac 2700accgaactga
gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga
2760aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac
gagggagctt 2820ccagggggaa acgcctggta tctttatagt cctgtcgggt
ttcgccacct ctgacttgag 2880cgtcgatttt tgtgatgctc gtcagggggg
cggagcctat ggaaaaacgc cagcaacgcg 2940gcctttttac ggttcctggc
cttttgctgg ccttttgctc acatgttctt tcctgcgtta 3000tcccctgatt
ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc
3060agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg
cctgatgcgg 3120tattttctcc ttacgcatct gtgcggtatt tcacaccgca
tatggtgcac tctcagtaca 3180atctgctctg atgccgcata gttaagccag
tatacactcc gctatcgcta cgtgactggg 3240tcatggctgc gccccgacac
ccgccaacac ccgctgacgc gccctgacgg gcttgtctgc 3300tcccggcatc
cgcttacaga caagctgtga ccgtctccgg gagctgcatg tgtcagaggt
3360tttcaccgtc atcaccgaaa cgcgcgaggc agcagatcaa ttcgcgcgcg
aaggcgaagc 3420ggcatgcata atgtgcctgt caaatggacg aagcagggat
tctgcaaacc ctatgctact 3480ccgtcaagcc gtcaattgtc tgattcgtta
ccaattatga caacttgacg gctacatcat 3540tcactttttc ttcacaaccg
gcacggaact cgctcgggct ggccccggtg cattttttaa 3600atacccgcga
gaaatagagt tgatcgtcaa aaccaacatt gcgaccgacg gtggcgatag
3660gcatccgggt ggtgctcaaa agcagcttcg cctggctgat acgttggtcc
tcgcgccagc 3720ttaagacgct aatccctaac tgctggcgga aaagatgtga
cagacgcgac ggcgacaagc 3780aaacatgctg tgcgacgctg gcgatatcaa
aattgctgtc tgccaggtga tcgctgatgt 3840actgacaagc ctcgcgtacc
cgattatcca tcggtggatg gagcgactcg ttaatcgctt 3900ccatgcgccg
cagtaacaat tgctcaagca gatttatcgc cagcagctcc gaatagcgcc
3960cttccccttg cccggcgtta atgatttgcc caaacaggtc gctgaaatgc
ggctggtgcg 4020cttcatccgg gcgaaagaac cccgtattgg caaatattga
cggccagtta agccattcat 4080gccagtaggc gcgcggacga aagtaaaccc
actggtgata ccattcgcga gcctccggat 4140gacgaccgta gtgatgaatc
tctcctggcg ggaacagcaa aatatcaccc ggtcggcaaa 4200caaattctcg
tccctgattt ttcaccaccc cctgaccgcg aatggtgaga ttgagaatat
4260aacctttcat tcccagcggt cggtcgataa aaaaatcgag ataaccgttg
gcctcaatcg 4320gcgttaaacc cgccaccaga tgggcattaa acgagtatcc
cggcagcagg ggatcatttt 4380gcgcttcagc catacttttc atactcccgc
cattcagaga agaaaccaat tgtccatatt 4440gcatcagaca ttgccgtcac
tgcgtctttt actggctctt ctcgctaacc aaaccggtaa 4500ccccgcttat
taaaagcatt ctgtaacaaa gcgggaccaa agccatgaca aaaacgcgta
4560acaaaagtgt ctataatcac ggcagaaaag tccacattga ttatttgcac
ggcgtcacac 4620tttgctatgc catagcattt ttatccataa gattagcgga
tcctacctga cgctttttat 4680cgcaactctc tactgtttct ccatacccgt
tttttgggct aacaggagga attaac 4736
SEQ ID NO: 103 | Length: 4694 | Type: DNA | Organism: Artificial sequence | Other information: pRF216
catggggcat caccatcacc accattgttt tttcaaagac
gaactggaat tcggaggtgg 60cggtgcatcc tcggaggatg tgattaaaga atttatgcgg
tttaaagtac gtatggaagg 120atcggtgaat ggccatgaat ttgagattga
gggtgaaggc gaaggccgcc cgtacgaagg 180aactcaaaca gcgaaattaa
aagttacaaa aggaggtcct ctgccgtttg cctgggacat 240cttgagcccg
caattccagt acggttccaa agtgtatgta aaacaccctg cggatattcc
300ggattataaa aaactgagtt ttcccgaggg gtttaaatgg gaacgggtga
tgaattttga 360ggatggtgga gttgtcaccg tgacccagga ctctagctta
caagacggta gtttcatcta 420caaagtaaaa tttatcggcg taaacttccc
atcggacggc cccgtcatgc agaaaaagac 480gatgggctgg gaagccagca
ccgaacgttt gtacccacgg gacggcgttt tgaaagggga 540aatccataag
gcccttaaac tgaaagacgg tggtcactat ctcgtggagt ttaaatcgat
600ttatatggct aaaaaaccag tacagcttcc gggttattat tacgttgact
ccaaattgga 660catcacatcg cataatgaag attacacgat tgttgaacag
tacgagcgcg ccgagggccg 720gcaccatctg tttctgtaaa agcttggctg
ttttggcgga tgagagaaga ttttcagcct 780gatacagatt aaatcagaac
gcagaagcgg tctgataaaa cagaatttgc ctggcggcag 840tagcgcggtg
gtcccacctg accccatgcc gaactcagaa gtgaaacgcc gtagcgccga
900tggtagtgtg gggtctcccc atgcgagagt agggaactgc caggcatcaa
ataaaacgaa 960aggctcagtc gaaagactgg gcctttcgtt ttatctgttg
tttgtcggtg aacgctctcc 1020tgagtaggac aaatccgccg ggagcggatt
tgaacgttgc gaagcaacgg cccggagggt 1080ggcgggcagg acgcccgcca
taaactgcca ggcatcaaat taagcagaag gccatcctga 1140cggatggcct
ttttgcgttt ctacaaactc ttttgtttat ttttctaaat acattcaaat
1200atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatattg
aaaaaggaag 1260agtatgagta ttcaacattt ccgtgtcgcc cttattccct
tttttgcggc attttgcctt 1320cctgtttttg ctcacccaga aacgctggtg
aaagtaaaag atgctgaaga tcagttgggt 1380gcacgagtgg gttacatcga
actggatctc aacagcggta agatccttga gagttttcgc 1440cccgaagaac
gttttccaat gatgagcact tttaaagttc tgctatgtgg cgcggtatta
1500tcccgtgttg acgccgggca agagcaactc ggtcgccgca tacactattc
tcagaatgac 1560ttggttgagt actcaccagt cacagaaaag catcttacgg
atggcatgac agtaagagaa 1620ttatgcagtg ctgccataac catgagtgat
aacactgcgg ccaacttact tctgacaacg 1680atcggaggac cgaaggagct
aaccgctttt ttgcacaaca tgggggatca tgtaactcgc 1740cttgatcgtt
gggaaccgga gctgaatgaa gccataccaa acgacgagcg tgacaccacg
1800atgcctgtag caatggcaac aacgttgcgc aaactattaa ctggcgaact
acttactcta 1860gcttcccggc aacaattaat agactggatg gaggcggata
aagttgcagg accacttctg 1920cgctcggccc ttccggctgg ctggtttatt
gctgataaat ctggagccgg tgagcgtggg 1980tctcgcggta tcattgcagc
actggggcca gatggtaagc cctcccgtat cgtagttatc 2040tacacgacgg
ggagtcaggc aactatggat gaacgaaata gacagatcgc tgagataggt
2100gcctcactga ttaagcattg gtaactgtca gaccaagttt actcatatat
actttagatt 2160gatttaaaac ttcattttta atttaaaagg atctaggtga
agatcctttt tgataatctc 2220atgaccaaaa tcccttaacg tgagttttcg
ttccactgag cgtcagaccc cgtagaaaag 2280atcaaaggat cttcttgaga
tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa 2340aaaccaccgc
taccagcggt ggtttgtttg ccggatcaag agctaccaac tctttttccg
2400aaggtaactg gcttcagcag agcgcagata ccaaatactg tccttctagt
gtagccgtag 2460ttaggccacc acttcaagaa ctctgtagca ccgcctacat
acctcgctct gctaatcctg 2520ttaccagtgg ctgctgccag tggcgataag
tcgtgtctta ccgggttgga ctcaagacga 2580tagttaccgg ataaggcgca
gcggtcgggc tgaacggggg gttcgtgcac acagcccagc 2640ttggagcgaa
cgacctacac cgaactgaga tacctacagc gtgagctatg agaaagcgcc
2700acgcttcccg aagggagaaa ggcggacagg tatccggtaa gcggcagggt
cggaacagga 2760gagcgcacga gggagcttcc agggggaaac gcctggtatc
tttatagtcc tgtcgggttt 2820cgccacctct gacttgagcg tcgatttttg
tgatgctcgt caggggggcg gagcctatgg 2880aaaaacgcca gcaacgcggc
ctttttacgg ttcctggcct tttgctggcc ttttgctcac 2940atgttctttc
ctgcgttatc ccctgattct gtggataacc gtattaccgc ctttgagtga
3000gctgataccg ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag
cgaggaagcg 3060gaagagcgcc tgatgcggta ttttctcctt acgcatctgt
gcggtatttc acaccgcata 3120tggtgcactc tcagtacaat ctgctctgat
gccgcatagt taagccagta tacactccgc 3180tatcgctacg tgactgggtc
atggctgcgc cccgacaccc gccaacaccc gctgacgcgc 3240cctgacgggc
ttgtctgctc ccggcatccg cttacagaca agctgtgacc gtctccggga
3300gctgcatgtg tcagaggttt tcaccgtcat caccgaaacg cgcgaggcag
cagatcaatt 3360cgcgcgcgaa ggcgaagcgg catgcataat gtgcctgtca
aatggacgaa gcagggattc 3420tgcaaaccct atgctactcc gtcaagccgt
caattgtctg attcgttacc aattatgaca 3480acttgacggc tacatcattc
actttttctt cacaaccggc acggaactcg ctcgggctgg 3540ccccggtgca
ttttttaaat acccgcgaga aatagagttg atcgtcaaaa ccaacattgc
3600gaccgacggt ggcgataggc atccgggtgg tgctcaaaag cagcttcgcc
tggctgatac 3660gttggtcctc gcgccagctt aagacgctaa tccctaactg
ctggcggaaa agatgtgaca 3720gacgcgacgg cgacaagcaa acatgctgtg
cgacgctggc gatatcaaaa ttgctgtctg 3780ccaggtgatc gctgatgtac
tgacaagcct cgcgtacccg attatccatc ggtggatgga 3840gcgactcgtt
aatcgcttcc atgcgccgca gtaacaattg ctcaagcaga tttatcgcca
3900gcagctccga atagcgccct tccccttgcc cggcgttaat gatttgccca
aacaggtcgc 3960tgaaatgcgg ctggtgcgct tcatccgggc gaaagaaccc
cgtattggca aatattgacg 4020gccagttaag ccattcatgc cagtaggcgc
gcggacgaaa gtaaacccac tggtgatacc 4080attcgcgagc ctccggatga
cgaccgtagt gatgaatctc tcctggcggg aacagcaaaa 4140tatcacccgg
tcggcaaaca aattctcgtc cctgattttt caccaccccc tgaccgcgaa
4200tggtgagatt gagaatataa cctttcattc ccagcggtcg gtcgataaaa
aaatcgagat 4260aaccgttggc ctcaatcggc gttaaacccg ccaccagatg
ggcattaaac gagtatcccg 4320gcagcagggg atcattttgc gcttcagcca
tacttttcat actcccgcca ttcagagaag 4380aaaccaattg tccatattgc
atcagacatt gccgtcactg cgtcttttac tggctcttct 4440cgctaaccaa
accggtaacc ccgcttatta aaagcattct gtaacaaagc gggaccaaag
4500ccatgacaaa aacgcgtaac aaaagtgtct ataatcacgg cagaaaagtc
cacattgatt 4560atttgcacgg cgtcacactt tgctatgcca tagcattttt
atccataaga ttagcggatc 4620ctacctgacg ctttttatcg caactctcta
ctgtttctcc atacccgttt tttgggctaa 4680caggaggaat taac
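4694
SEQ ID NO: 104 | Length: 23 | Type: DNA | Organism: Artificial sequence | Other information: oligo 36
ccataagatt agcggatcct acc 23
SEQ ID NO: 105 | Length: 331 | Type: DNA | Organism: Artificial sequence | Other information: His-Zebra PCR
ccataagatt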
agcggatcct acctgacgct ttttatcgca actctctact gtttctccat 60acccgttttt
tgggctaaca ggaggaatta accatggggc atcaccacca tcaccacgaa
120tgcgactcag aactggaaat caaacgctat aaacgtgtgc gtgtggcatc
ccgtaaatgt 180cgcgcaaagt ttaaacagct gctgcaacat tatcgtgaag
tagcggctgc gaaaagctcc 240gaaaacgacc gtttacgcct cctcctgaag
caaatgtgcg aattcgacaa gaaatactcc 300atcggcctgg acattggaac
caactctgtc g 331
SEQ ID NO: 106 | Length: 232 | Type: DNA | Organism: Artificial sequence | Other information: His-tp10 PCR
ccataagatt agcggatcct acctgacgct ttttatcgca actctctact
gtttctccat 60acccgttttt tgggctaaca ggaggaatta accatggggc atcaccacca
tcaccacgcg 120ggttacctgc tgggcaagat taatcttaaa gcctgcgccg
cgtgtgctaa gaaaattttg 180gaattcgaca agaaatactc catcggcctg
gacattggaa ccaactctgt cg 232
SEQ ID NO: 107 | Length: 223 | Type: DNA | Organism: artificial sequence | Other information: His-pVEC PCR
ccataagatt agcggatcct acctgacgct ttttatcgca actctctact
gtttctccat 60acccgttttt tgggctaaca ggaggaatta accatggggc atcaccacca
tcaccactta 120ttgattatct tgcgtcgtcg catccgcaaa caggcgcacg
cacatagcaa ggaattcgac 180aagaaatact ccatcggcct ggacattgga
accaactctg tcg 223
SEQ ID NO: 108 | Length: 8294 | Type: DNA | Organism: Artificial sequence | Other information: pRF144
ccatggggca
tcaccaccat caccacgaat gcgactcaga actggaaatc aaacgctata 60aacgtgtgcg
tgtggcatcc cgtaaatgtc gcgcaaagtt taaacagctg ctgcaacatt
120atcgtgaagt agcggctgcg aaaagctccg aaaacgaccg tttacgcctc
ctcctgaagc 180aaatgtgcga attcgacaag aaatactcca tcggcctgga
cattggaacc aactctgtcg 240gctgggctgt catcaccgac gagtacaagg
tgccctccaa gaaattcaag gtcctcggaa 300acaccgatcg acactccatc
aagaaaaacc tcattggtgc cctgttgttc gattctggcg 360agactgccga
agctaccaga ctcaagcgaa ctgctcggcg acgttacacc cgacggaaga
420accgaatctg ctacctgcag gagatctttt ccaacgagat ggccaaggtg
gacgattcgt 480tctttcatcg actggaggaa tccttcctcg tcgaggaaga
caagaaacac gagcgtcatc 540ccatctttgg caacattgtg gacgaggttg
cttaccacga gaagtatcct accatctacc 600acctgcgaaa gaaactcgtc
gattccaccg acaaggcgga tctcagactt atctacctcg 660ctctggcaca
catgatcaag tttcgaggtc atttcctcat cgagggcgat ctcaatcccg
720acaacagcga tgtggacaag ctgttcattc agctcgttca gacctacaac
cagctgttcg 780aggaaaaccc catcaatgcc tccggagtcg atgcaaaggc
catcttgtct gctcgactct 840cgaagagcag acgactggag aacctcattg
cccaacttcc tggcgagaaa aagaacggac 900tgtttggcaa cctcattgcc
ctttctcttg gtctcacacc caacttcaag tccaacttcg 960atctggcgga
ggacgccaag ctccagctgt ccaaggacac ctacgacgat gacctcgaca
1020acctgcttgc acagattggc gatcagtacg ccgacctgtt tctcgctgcc
aagaaccttt 1080cggatgctat tctcttgtct gacattctgc gagtcaacac
cgagatcaca aaggctcccc 1140tttctgcctc catgatcaag cgatacgacg
agcaccatca ggatctcaca ctgctcaagg 1200ctcttgtccg acagcaactg
cccgagaagt acaaggagat ctttttcgat cagtcgaaga 1260acggctacgc
tggatacatc gacggcggag cctctcagga agagttctac aagttcatca
1320agccaattct cgagaagatg gacggaaccg aggaactgct tgtcaagctc
aatcgagagg 1380atctgcttcg gaagcaacga accttcgaca acggcagcat
tcctcatcag atccacctcg 1440gtgagctgca cgccattctt cgacgtcagg
aagacttcta cccctttctc aaggacaacc 1500gagagaagat cgagaagatt
cttacctttc gaatccccta ctatgttggt cctcttgcca 1560gaggaaactc
tcgatttgct tggatgactc gaaagtccga ggaaaccatc actccctgga
1620acttcgagga agtcgtggac aagggtgcct ctgcacagtc cttcatcgag
cgaatgacca 1680acttcgacaa gaatctgccc aacgagaagg ttcttcccaa
gcattcgctg ctctacgagt 1740actttacagt ctacaacgaa ctcaccaaag
tcaagtacgt taccgaggga atgcgaaagc 1800ctgccttctt gtctggcgaa
cagaagaaag ccattgtcga tctcctgttc aagaccaacc 1860gaaaggtcac
tgttaagcag ctcaaggagg actacttcaa gaaaatcgag tgtttcgaca
1920gcgtcgagat ttccggagtt gaggaccgat tcaacgcctc tttgggcacc
tatcacgatc 1980tgctcaagat tatcaaggac aaggattttc tcgacaacga
ggaaaacgag gacattctgg 2040aggacatcgt gctcactctt accctgttcg
aagatcggga gatgatcgag gaacgactca 2100agacatacgc tcacctgttc
gacgacaagg tcatgaaaca actcaagcga cgtagataca 2160ccggctgggg
aagactttcg cgaaagctca tcaacggcat cagagacaag cagtccggaa
2220agaccattct ggactttctc aagtccgatg gctttgccaa ccgaaacttc
atgcagctca 2280ttcacgacga ttctcttacc ttcaaggagg acatccagaa
ggcacaagtg tccggtcagg 2340gcgacagctt gcacgaacat attgccaacc
tggctggttc gccagccatc aagaaaggca 2400ttctccagac tgtcaaggtt
gtcgacgagc tggtgaaggt catgggacgt cacaagcccg 2460agaacattgt
gatcgagatg gccagagaga accagacaac tcaaaagggt cagaaaaact
2520cgcgagagcg gatgaagcga atcgaggaag gcatcaagga gctgggatcc
cagattctca 2580aggagcatcc cgtcgagaac actcaactgc agaacgagaa
gctgtatctc tactatctgc 2640agaatggtcg agacatgtac gtggatcagg
aactggacat caatcgtctc agcgactacg 2700atgtggacca cattgtccct
caatcctttc tcaaggacga ttctatcgac aacaaggtcc 2760ttacacgatc
cgacaagaac agaggcaagt cggacaacgt tcccagcgaa gaggtggtca
2820aaaagatgaa gaactactgg cgacagctgc tcaacgccaa gctcattacc
cagcgaaagt 2880tcgacaatct taccaaggcc gagcgaggcg gtctgtccga
gctcgacaag gctggcttca 2940tcaagcgtca actcgtcgag accagacaga
tcacaaagca cgtcgcacag attctcgatt 3000ctcggatgaa caccaagtac
gacgagaacg acaagctcat ccgagaggtc aaggtgatta 3060ctctcaagtc
caaactggtc tccgatttcc gaaaggactt tcagttctac aaggtgcgag
3120agatcaacaa ttaccaccat gcccacgatg cttacctcaa cgccgtcgtt
ggcactgcgc 3180tcatcaagaa ataccccaag ctcgaaagcg agttcgttta
cggcgattac aaggtctacg 3240acgttcgaaa gatgattgcc aagtccgaac
aggagattgg caaggctact gccaagtact 3300tcttttactc caacatcatg
aactttttca agaccgagat caccttggcc aacggagaga 3360ttcgaaagag
accacttatc gagaccaacg gcgaaactgg agagatcgtg tgggacaagg
3420gtcgagactt tgcaaccgtg cgaaaggttc tgtcgatgcc tcaggtcaac
atcgtcaaga 3480aaaccgaggt tcagactggc ggattctcca aggagtcgat
tctgcccaag cgaaactccg 3540acaagctcat cgctcgaaag aaagactggg
atcccaagaa atacggtggc ttcgattctc 3600ctaccgtcgc ctattccgtg
cttgtcgttg cgaaggtcga gaagggcaag tccaaaaagc 3660tcaagtccgt
caaggagctg ctcggaatta ccatcatgga gcgatcgagc ttcgagaaga
3720atcccatcga cttcttggaa gccaagggtt acaaggaggt caagaaagac
ctcattatca 3780agctgcccaa gtactctctg ttcgaactgg agaacggtcg
aaagcgtatg ctcgcctccg 3840ctggcgagct gcagaaggga aacgagcttg
ccttgccttc gaagtacgtc aactttctct 3900atctggcttc tcactacgag
aagctcaagg gttctcccga ggacaacgaa cagaagcaac 3960tcttcgttga
gcagcacaaa cattacctcg acgagattat cgagcagatt tccgagtttt
4020cgaagcgagt catcctggct gatgccaact tggacaaggt gctctctgcc
tacaacaagc 4080atcgggacaa acccattcga gaacaggcgg agaacatcat
tcacctgttt actcttacca 4140acctgggtgc tcctgcagct ttcaagtact
tcgataccac tatcgaccga aagcggtaca 4200catccaccaa ggaggttctc
gatgccaccc tgattcacca gtccatcact ggcctgtacg 4260agacccgaat
cgacctgtct cagcttggtg gcgactccag agccgatccc aagaaaaagc
4320gaaaggtcta agcggccgct aagcttggct gttttggcgg atgagagaag
attttcagcc 4380tgatacagat taaatcagaa cgcagaagcg gtctgataaa
acagaatttg cctggcggca 4440gtagcgcggt ggtcccacct gaccccatgc
cgaactcaga agtgaaacgc cgtagcgccg 4500atggtagtgt ggggtctccc
catgcgagag tagggaactg ccaggcatca aataaaacga 4560aaggctcagt
cgaaagactg ggcctttcgt tttatctgtt gtttgtcggt gaacgctctc
4620ctgagtagga caaatccgcc gggagcggat ttgaacgttg cgaagcaacg
gcccggaggg 4680tggcgggcag gacgcccgcc ataaactgcc aggcatcaaa
ttaagcagaa ggccatcctg 4740acggatggcc tttttgcgtt tctacaaact
cttttgttta tttttctaaa tacattcaaa 4800tatgtatccg ctcatgagac
aataaccctg ataaatgctt caataatatt gaaaaaggaa 4860gagtatgagt
attcaacatt tccgtgtcgc ccttattccc ttttttgcgg cattttgcct
4920tcctgttttt gctcacccag aaacgctggt gaaagtaaaa gatgctgaag
atcagttggg 4980tgcacgagtg ggttacatcg aactggatct caacagcggt
aagatccttg agagttttcg 5040ccccgaagaa cgttttccaa tgatgagcac
ttttaaagtt ctgctatgtg gcgcggtatt 5100atcccgtgtt gacgccgggc
aagagcaact cggtcgccgc atacactatt ctcagaatga 5160cttggttgag
tactcaccag tcacagaaaa gcatcttacg gatggcatga cagtaagaga
5220attatgcagt gctgccataa ccatgagtga taacactgcg gccaacttac
ttctgacaac 5280gatcggagga ccgaaggagc taaccgcttt tttgcacaac
atgggggatc atgtaactcg 5340ccttgatcgt tgggaaccgg agctgaatga
agccatacca aacgacgagc gtgacaccac 5400gatgcctgta gcaatggcaa
caacgttgcg caaactatta actggcgaac tacttactct 5460agcttcccgg
caacaattaa tagactggat ggaggcggat aaagttgcag gaccacttct
5520gcgctcggcc cttccggctg gctggtttat tgctgataaa tctggagccg
gtgagcgtgg 5580gtctcgcggt atcattgcag cactggggcc agatggtaag
ccctcccgta tcgtagttat 5640ctacacgacg gggagtcagg caactatgga
tgaacgaaat agacagatcg ctgagatagg 5700tgcctcactg attaagcatt
ggtaactgtc agaccaagtt tactcatata tactttagat 5760tgatttaaaa
cttcattttt aatttaaaag gatctaggtg aagatccttt ttgataatct
5820catgaccaaa atcccttaac gtgagttttc gttccactga gcgtcagacc
ccgtagaaaa 5880gatcaaagga tcttcttgag atcctttttt tctgcgcgta
atctgctgct tgcaaacaaa 5940aaaaccaccg ctaccagcgg tggtttgttt
gccggatcaa gagctaccaa ctctttttcc 6000gaaggtaact ggcttcagca
gagcgcagat accaaatact gtccttctag tgtagccgta 6060gttaggccac
cacttcaaga actctgtagc accgcctaca tacctcgctc tgctaatcct
6120gttaccagtg gctgctgcca gtggcgataa gtcgtgtctt accgggttgg
actcaagacg 6180atagttaccg gataaggcgc agcggtcggg ctgaacgggg
ggttcgtgca cacagcccag 6240cttggagcga acgacctaca ccgaactgag
atacctacag cgtgagctat gagaaagcgc 6300cacgcttccc gaagggagaa
aggcggacag gtatccggta agcggcaggg tcggaacagg 6360agagcgcacg
agggagcttc cagggggaaa cgcctggtat ctttatagtc ctgtcgggtt
6420tcgccacctc tgacttgagc gtcgattttt gtgatgctcg tcaggggggc
ggagcctatg 6480gaaaaacgcc agcaacgcgg cctttttacg gttcctggcc
ttttgctggc cttttgctca 6540catgttcttt cctgcgttat cccctgattc
tgtggataac cgtattaccg cctttgagtg 6600agctgatacc gctcgccgca
gccgaacgac cgagcgcagc gagtcagtga gcgaggaagc 6660ggaagagcgc
ctgatgcggt attttctcct tacgcatctg tgcggtattt cacaccgcat
6720atggtgcact ctcagtacaa tctgctctga tgccgcatag ttaagccagt
atacactccg 6780ctatcgctac gtgactgggt catggctgcg ccccgacacc
cgccaacacc cgctgacgcg 6840ccctgacggg cttgtctgct cccggcatcc
gcttacagac aagctgtgac cgtctccggg 6900agctgcatgt gtcagaggtt
ttcaccgtca tcaccgaaac gcgcgaggca gcagatcaat 6960tcgcgcgcga
aggcgaagcg gcatgcataa tgtgcctgtc aaatggacga agcagggatt
7020ctgcaaaccc tatgctactc cgtcaagccg tcaattgtct gattcgttac
caattatgac 7080aacttgacgg ctacatcatt cactttttct tcacaaccgg
cacggaactc gctcgggctg 7140gccccggtgc attttttaaa tacccgcgag
aaatagagtt gatcgtcaaa accaacattg 7200cgaccgacgg tggcgatagg
catccgggtg gtgctcaaaa gcagcttcgc ctggctgata 7260cgttggtcct
cgcgccagct taagacgcta atccctaact gctggcggaa aagatgtgac
7320agacgcgacg gcgacaagca aacatgctgt gcgacgctgg cgatatcaaa
attgctgtct 7380gccaggtgat cgctgatgta ctgacaagcc tcgcgtaccc
gattatccat cggtggatgg 7440agcgactcgt taatcgcttc catgcgccgc
agtaacaatt gctcaagcag atttatcgcc 7500agcagctccg aatagcgccc
ttccccttgc ccggcgttaa tgatttgccc aaacaggtcg 7560ctgaaatgcg
gctggtgcgc ttcatccggg cgaaagaacc ccgtattggc aaatattgac
7620ggccagttaa gccattcatg ccagtaggcg cgcggacgaa agtaaaccca
ctggtgatac 7680cattcgcgag cctccggatg acgaccgtag tgatgaatct
ctcctggcgg gaacagcaaa 7740atatcacccg gtcggcaaac aaattctcgt
ccctgatttt tcaccacccc ctgaccgcga 7800atggtgagat tgagaatata
acctttcatt cccagcggtc ggtcgataaa aaaatcgaga 7860taaccgttgg
cctcaatcgg cgttaaaccc gccaccagat gggcattaaa cgagtatccc
7920ggcagcaggg gatcattttg cgcttcagcc atacttttca tactcccgcc
attcagagaa 7980gaaaccaatt gtccatattg catcagacat tgccgtcact
gcgtctttta ctggctcttc 8040tcgctaacca aaccggtaac cccgcttatt
aaaagcattc tgtaacaaag cgggaccaaa 8100gccatgacaa aaacgcgtaa
caaaagtgtc tataatcacg gcagaaaagt ccacattgat 8160tatttgcacg
gcgtcacact ttgctatgcc atagcatttt tatccataag attagcggat
8220cctacctgac gctttttatc gcaactctct actgtttctc catacccgtt
ttttgggcta 8280acaggaggaa ttaa 8294
SEQ ID NO: 109 | Length: 8195 | Type: DNA | Organism: Artificial sequence | Other information: pRF162
aattcgacaa gaaatactcc atcggcctgg acattggaac
caactctgtc ggctgggctg 60tcatcaccga cgagtacaag gtgccctcca agaaattcaa
ggtcctcgga aacaccgatc 120gacactccat caagaaaaac ctcattggtg
ccctgttgtt cgattctggc gagactgccg 180aagctaccag actcaagcga
actgctcggc gacgttacac ccgacggaag aaccgaatct 240gctacctgca
ggagatcttt tccaacgaga tggccaaggt ggacgattcg ttctttcatc
300gactggagga atccttcctc gtcgaggaag acaagaaaca cgagcgtcat
cccatctttg 360gcaacattgt ggacgaggtt gcttaccacg agaagtatcc
taccatctac cacctgcgaa 420agaaactcgt cgattccacc gacaaggcgg
atctcagact tatctacctc gctctggcac 480acatgatcaa gtttcgaggt
catttcctca tcgagggcga tctcaatccc gacaacagcg 540atgtggacaa
gctgttcatt cagctcgttc agacctacaa ccagctgttc gaggaaaacc
600ccatcaatgc ctccggagtc gatgcaaagg ccatcttgtc tgctcgactc
tcgaagagca 660gacgactgga gaacctcatt gcccaacttc ctggcgagaa
aaagaacgga ctgtttggca 720acctcattgc cctttctctt ggtctcacac
ccaacttcaa gtccaacttc gatctggcgg 780aggacgccaa gctccagctg
tccaaggaca cctacgacga tgacctcgac aacctgcttg 840cacagattgg
cgatcagtac gccgacctgt ttctcgctgc caagaacctt tcggatgcta
900ttctcttgtc tgacattctg cgagtcaaca ccgagatcac aaaggctccc
ctttctgcct 960ccatgatcaa gcgatacgac gagcaccatc aggatctcac
actgctcaag gctcttgtcc 1020gacagcaact gcccgagaag tacaaggaga
tctttttcga tcagtcgaag aacggctacg 1080ctggatacat cgacggcgga
gcctctcagg aagagttcta caagttcatc aagccaattc 1140tcgagaagat
ggacggaacc gaggaactgc ttgtcaagct caatcgagag gatctgcttc
1200ggaagcaacg aaccttcgac aacggcagca ttcctcatca gatccacctc
ggtgagctgc 1260acgccattct tcgacgtcag gaagacttct acccctttct
caaggacaac cgagagaaga 1320tcgagaagat tcttaccttt cgaatcccct
actatgttgg tcctcttgcc agaggaaact 1380ctcgatttgc ttggatgact
cgaaagtccg aggaaaccat cactccctgg aacttcgagg 1440aagtcgtgga
caagggtgcc tctgcacagt ccttcatcga gcgaatgacc aacttcgaca
1500agaatctgcc caacgagaag gttcttccca agcattcgct gctctacgag
tactttacag 1560tctacaacga actcaccaaa gtcaagtacg ttaccgaggg
aatgcgaaag cctgccttct 1620tgtctggcga acagaagaaa gccattgtcg
atctcctgtt caagaccaac cgaaaggtca 1680ctgttaagca gctcaaggag
gactacttca agaaaatcga gtgtttcgac agcgtcgaga 1740tttccggagt
tgaggaccga ttcaacgcct ctttgggcac ctatcacgat ctgctcaaga
1800ttatcaagga caaggatttt ctcgacaacg aggaaaacga ggacattctg
gaggacatcg 1860tgctcactct taccctgttc gaagatcggg agatgatcga
ggaacgactc aagacatacg 1920ctcacctgtt cgacgacaag gtcatgaaac
aactcaagcg acgtagatac accggctggg 1980gaagactttc gcgaaagctc
atcaacggca tcagagacaa gcagtccgga aagaccattc 2040tggactttct
caagtccgat ggctttgcca accgaaactt catgcagctc attcacgacg
2100attctcttac cttcaaggag gacatccaga aggcacaagt gtccggtcag
ggcgacagct 2160tgcacgaaca tattgccaac ctggctggtt cgccagccat
caagaaaggc attctccaga 2220ctgtcaaggt tgtcgacgag ctggtgaagg
tcatgggacg tcacaagccc gagaacattg 2280tgatcgagat ggccagagag
aaccagacaa ctcaaaaggg tcagaaaaac tcgcgagagc 2340ggatgaagcg
aatcgaggaa ggcatcaagg agctgggatc ccagattctc aaggagcatc
2400ccgtcgagaa cactcaactg cagaacgaga agctgtatct ctactatctg
cagaatggtc 2460gagacatgta cgtggatcag gaactggaca tcaatcgtct
cagcgactac gatgtggacc 2520acattgtccc tcaatccttt ctcaaggacg
attctatcga caacaaggtc cttacacgat 2580ccgacaagaa cagaggcaag
tcggacaacg ttcccagcga agaggtggtc aaaaagatga 2640agaactactg
gcgacagctg ctcaacgcca agctcattac ccagcgaaag ttcgacaatc
2700ttaccaaggc cgagcgaggc ggtctgtccg agctcgacaa ggctggcttc
atcaagcgtc 2760aactcgtcga gaccagacag atcacaaagc acgtcgcaca
gattctcgat tctcggatga 2820acaccaagta cgacgagaac gacaagctca
tccgagaggt caaggtgatt actctcaagt 2880ccaaactggt ctccgatttc
cgaaaggact ttcagttcta caaggtgcga gagatcaaca 2940attaccacca
tgcccacgat gcttacctca acgccgtcgt tggcactgcg ctcatcaaga
3000aataccccaa gctcgaaagc gagttcgttt acggcgatta caaggtctac
gacgttcgaa 3060agatgattgc caagtccgaa caggagattg gcaaggctac
tgccaagtac ttcttttact 3120ccaacatcat gaactttttc aagaccgaga
tcaccttggc caacggagag attcgaaaga 3180gaccacttat cgagaccaac
ggcgaaactg gagagatcgt gtgggacaag ggtcgagact 3240ttgcaaccgt
gcgaaaggtt ctgtcgatgc ctcaggtcaa catcgtcaag aaaaccgagg
3300ttcagactgg cggattctcc aaggagtcga ttctgcccaa gcgaaactcc
gacaagctca 3360tcgctcgaaa gaaagactgg gatcccaaga aatacggtgg
cttcgattct cctaccgtcg 3420cctattccgt gcttgtcgtt gcgaaggtcg
agaagggcaa gtccaaaaag ctcaagtccg 3480tcaaggagct gctcggaatt
accatcatgg agcgatcgag cttcgagaag aatcccatcg 3540acttcttgga
agccaagggt tacaaggagg tcaagaaaga cctcattatc
aagctgccca 3600agtactctct gttcgaactg gagaacggtc gaaagcgtat
gctcgcctcc gctggcgagc 3660tgcagaaggg aaacgagctt gccttgcctt
cgaagtacgt caactttctc tatctggctt 3720ctcactacga gaagctcaag
ggttctcccg aggacaacga acagaagcaa ctcttcgttg 3780agcagcacaa
acattacctc gacgagatta tcgagcagat ttccgagttt tcgaagcgag
3840tcatcctggc tgatgccaac ttggacaagg tgctctctgc ctacaacaag
catcgggaca 3900aacccattcg agaacaggcg gagaacatca ttcacctgtt
tactcttacc aacctgggtg 3960ctcctgcagc tttcaagtac ttcgatacca
ctatcgaccg aaagcggtac acatccacca 4020aggaggttct cgatgccacc
ctgattcacc agtccatcac tggcctgtac gagacccgaa 4080tcgacctgtc
tcagcttggt ggcgactcca gagccgatcc caagaaaaag cgaaaggtct
4140aagcggccgc taagcttggc tgttttggcg gatgagagaa gattttcagc
ctgatacaga 4200ttaaatcaga acgcagaagc ggtctgataa aacagaattt
gcctggcggc agtagcgcgg 4260tggtcccacc tgaccccatg ccgaactcag
aagtgaaacg ccgtagcgcc gatggtagtg 4320tggggtctcc ccatgcgaga
gtagggaact gccaggcatc aaataaaacg aaaggctcag 4380tcgaaagact
gggcctttcg ttttatctgt tgtttgtcgg tgaacgctct cctgagtagg
4440acaaatccgc cgggagcgga tttgaacgtt gcgaagcaac ggcccggagg
gtggcgggca 4500ggacgcccgc cataaactgc caggcatcaa attaagcaga
aggccatcct gacggatggc 4560ctttttgcgt ttctacaaac tcttttgttt
atttttctaa atacattcaa atatgtatcc 4620gctcatgaga caataaccct
gataaatgct tcaataatat tgaaaaagga agagtatgag 4680tattcaacat
ttccgtgtcg cccttattcc cttttttgcg gcattttgcc ttcctgtttt
4740tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg
gtgcacgagt 4800gggttacatc gaactggatc tcaacagcgg taagatcctt
gagagttttc gccccgaaga 4860acgttttcca atgatgagca cttttaaagt
tctgctatgt ggcgcggtat tatcccgtgt 4920tgacgccggg caagagcaac
tcggtcgccg catacactat tctcagaatg acttggttga 4980gtactcacca
gtcacagaaa agcatcttac ggatggcatg acagtaagag aattatgcag
5040tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa
cgatcggagg 5100accgaaggag ctaaccgctt ttttgcacaa catgggggat
catgtaactc gccttgatcg 5160ttgggaaccg gagctgaatg aagccatacc
aaacgacgag cgtgacacca cgatgcctgt 5220agcaatggca acaacgttgc
gcaaactatt aactggcgaa ctacttactc tagcttcccg 5280gcaacaatta
atagactgga tggaggcgga taaagttgca ggaccacttc tgcgctcggc
5340ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg
ggtctcgcgg 5400tatcattgca gcactggggc cagatggtaa gccctcccgt
atcgtagtta tctacacgac 5460ggggagtcag gcaactatgg atgaacgaaa
tagacagatc gctgagatag gtgcctcact 5520gattaagcat tggtaactgt
cagaccaagt ttactcatat atactttaga ttgatttaaa 5580acttcatttt
taatttaaaa ggatctaggt gaagatcctt tttgataatc tcatgaccaa
5640aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa
agatcaaagg 5700atcttcttga gatccttttt ttctgcgcgt aatctgctgc
ttgcaaacaa aaaaaccacc 5760gctaccagcg gtggtttgtt tgccggatca
agagctacca actctttttc cgaaggtaac 5820tggcttcagc agagcgcaga
taccaaatac tgtccttcta gtgtagccgt agttaggcca 5880ccacttcaag
aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt
5940ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac
gatagttacc 6000ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc
acacagccca gcttggagcg 6060aacgacctac accgaactga gatacctaca
gcgtgagcta tgagaaagcg ccacgcttcc 6120cgaagggaga aaggcggaca
ggtatccggt aagcggcagg gtcggaacag gagagcgcac 6180gagggagctt
ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct
6240ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat
ggaaaaacgc 6300cagcaacgcg gcctttttac ggttcctggc cttttgctgg
ccttttgctc acatgttctt 6360tcctgcgtta tcccctgatt ctgtggataa
ccgtattacc gcctttgagt gagctgatac 6420cgctcgccgc agccgaacga
ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg 6480cctgatgcgg
tattttctcc ttacgcatct gtgcggtatt tcacaccgca tatggtgcac
6540tctcagtaca atctgctctg atgccgcata gttaagccag tatacactcc
gctatcgcta 6600cgtgactggg tcatggctgc gccccgacac ccgccaacac
ccgctgacgc gccctgacgg 6660gcttgtctgc tcccggcatc cgcttacaga
caagctgtga ccgtctccgg gagctgcatg 6720tgtcagaggt tttcaccgtc
atcaccgaaa cgcgcgaggc agcagatcaa ttcgcgcgcg 6780aaggcgaagc
ggcatgcata atgtgcctgt caaatggacg aagcagggat tctgcaaacc
6840ctatgctact ccgtcaagcc gtcaattgtc tgattcgtta ccaattatga
caacttgacg 6900gctacatcat tcactttttc ttcacaaccg gcacggaact
cgctcgggct ggccccggtg 6960cattttttaa atacccgcga gaaatagagt
tgatcgtcaa aaccaacatt gcgaccgacg 7020gtggcgatag gcatccgggt
ggtgctcaaa agcagcttcg cctggctgat acgttggtcc 7080tcgcgccagc
ttaagacgct aatccctaac tgctggcgga aaagatgtga cagacgcgac
7140ggcgacaagc aaacatgctg tgcgacgctg gcgatatcaa aattgctgtc
tgccaggtga 7200tcgctgatgt actgacaagc ctcgcgtacc cgattatcca
tcggtggatg gagcgactcg 7260ttaatcgctt ccatgcgccg cagtaacaat
tgctcaagca gatttatcgc cagcagctcc 7320gaatagcgcc cttccccttg
cccggcgtta atgatttgcc caaacaggtc gctgaaatgc 7380ggctggtgcg
cttcatccgg gcgaaagaac cccgtattgg caaatattga cggccagtta
7440agccattcat gccagtaggc gcgcggacga aagtaaaccc actggtgata
ccattcgcga 7500gcctccggat gacgaccgta gtgatgaatc tctcctggcg
ggaacagcaa aatatcaccc 7560ggtcggcaaa caaattctcg tccctgattt
ttcaccaccc cctgaccgcg aatggtgaga 7620ttgagaatat aacctttcat
tcccagcggt cggtcgataa aaaaatcgag ataaccgttg 7680gcctcaatcg
gcgttaaacc cgccaccaga tgggcattaa acgagtatcc cggcagcagg
7740ggatcatttt gcgcttcagc catacttttc atactcccgc cattcagaga
agaaaccaat 7800tgtccatatt gcatcagaca ttgccgtcac tgcgtctttt
actggctctt ctcgctaacc 7860aaaccggtaa ccccgcttat taaaagcatt
ctgtaacaaa gcgggaccaa agccatgaca 7920aaaacgcgta acaaaagtgt
ctataatcac ggcagaaaag tccacattga ttatttgcac 7980ggcgtcacac
tttgctatgc catagcattt ttatccataa gattagcgga tcctacctga
8040cgctttttat cgcaactctc tactgtttct ccatacccgt tttttgggct
aacaggagga 8100attaaccatg gggcatcacc accatcacca cgcgggttac
ctgctgggca agattaatct 8160taaagcctgc gccgcgtgtg ctaagaaaat tttgg
8195

SEQ ID NO: 110 | Length: 8186 | Type: DNA | Organism: Artificial Sequence | Description: pRF146
110 aattcgacaa gaaatactcc
atcggcctgg acattggaac caactctgtc ggctgggctg 60tcatcaccga cgagtacaag
gtgccctcca agaaattcaa ggtcctcgga aacaccgatc 120gacactccat
caagaaaaac ctcattggtg ccctgttgtt cgattctggc gagactgccg
180aagctaccag actcaagcga actgctcggc gacgttacac ccgacggaag
aaccgaatct 240gctacctgca ggagatcttt tccaacgaga tggccaaggt
ggacgattcg ttctttcatc 300gactggagga atccttcctc gtcgaggaag
acaagaaaca cgagcgtcat cccatctttg 360gcaacattgt ggacgaggtt
gcttaccacg agaagtatcc taccatctac cacctgcgaa 420agaaactcgt
cgattccacc gacaaggcgg atctcagact tatctacctc gctctggcac
480acatgatcaa gtttcgaggt catttcctca tcgagggcga tctcaatccc
gacaacagcg 540atgtggacaa gctgttcatt cagctcgttc agacctacaa
ccagctgttc gaggaaaacc 600ccatcaatgc ctccggagtc gatgcaaagg
ccatcttgtc tgctcgactc tcgaagagca 660gacgactgga gaacctcatt
gcccaacttc ctggcgagaa aaagaacgga ctgtttggca 720acctcattgc
cctttctctt ggtctcacac ccaacttcaa gtccaacttc gatctggcgg
780aggacgccaa gctccagctg tccaaggaca cctacgacga tgacctcgac
aacctgcttg 840cacagattgg cgatcagtac gccgacctgt ttctcgctgc
caagaacctt tcggatgcta 900ttctcttgtc tgacattctg cgagtcaaca
ccgagatcac aaaggctccc ctttctgcct 960ccatgatcaa gcgatacgac
gagcaccatc aggatctcac actgctcaag gctcttgtcc 1020gacagcaact
gcccgagaag tacaaggaga tctttttcga tcagtcgaag aacggctacg
1080ctggatacat cgacggcgga gcctctcagg aagagttcta caagttcatc
aagccaattc 1140tcgagaagat ggacggaacc gaggaactgc ttgtcaagct
caatcgagag gatctgcttc 1200ggaagcaacg aaccttcgac aacggcagca
ttcctcatca gatccacctc ggtgagctgc 1260acgccattct tcgacgtcag
gaagacttct acccctttct caaggacaac cgagagaaga 1320tcgagaagat
tcttaccttt cgaatcccct actatgttgg tcctcttgcc agaggaaact
1380ctcgatttgc ttggatgact cgaaagtccg aggaaaccat cactccctgg
aacttcgagg 1440aagtcgtgga caagggtgcc tctgcacagt ccttcatcga
gcgaatgacc aacttcgaca 1500agaatctgcc caacgagaag gttcttccca
agcattcgct gctctacgag tactttacag 1560tctacaacga actcaccaaa
gtcaagtacg ttaccgaggg aatgcgaaag cctgccttct 1620tgtctggcga
acagaagaaa gccattgtcg atctcctgtt caagaccaac cgaaaggtca
1680ctgttaagca gctcaaggag gactacttca agaaaatcga gtgtttcgac
agcgtcgaga 1740tttccggagt tgaggaccga ttcaacgcct ctttgggcac
ctatcacgat ctgctcaaga 1800ttatcaagga caaggatttt ctcgacaacg
aggaaaacga ggacattctg gaggacatcg 1860tgctcactct taccctgttc
gaagatcggg agatgatcga ggaacgactc aagacatacg 1920ctcacctgtt
cgacgacaag gtcatgaaac aactcaagcg acgtagatac accggctggg
1980gaagactttc gcgaaagctc atcaacggca tcagagacaa gcagtccgga
aagaccattc 2040tggactttct caagtccgat ggctttgcca accgaaactt
catgcagctc attcacgacg 2100attctcttac cttcaaggag gacatccaga
aggcacaagt gtccggtcag ggcgacagct 2160tgcacgaaca tattgccaac
ctggctggtt cgccagccat caagaaaggc attctccaga 2220ctgtcaaggt
tgtcgacgag ctggtgaagg tcatgggacg tcacaagccc gagaacattg
2280tgatcgagat ggccagagag aaccagacaa ctcaaaaggg tcagaaaaac
tcgcgagagc 2340ggatgaagcg aatcgaggaa ggcatcaagg agctgggatc
ccagattctc aaggagcatc 2400ccgtcgagaa cactcaactg cagaacgaga
agctgtatct ctactatctg cagaatggtc 2460gagacatgta cgtggatcag
gaactggaca tcaatcgtct cagcgactac gatgtggacc 2520acattgtccc
tcaatccttt ctcaaggacg attctatcga caacaaggtc cttacacgat
2580ccgacaagaa cagaggcaag tcggacaacg ttcccagcga agaggtggtc
aaaaagatga 2640agaactactg gcgacagctg ctcaacgcca agctcattac
ccagcgaaag ttcgacaatc 2700ttaccaaggc cgagcgaggc ggtctgtccg
agctcgacaa ggctggcttc atcaagcgtc 2760aactcgtcga gaccagacag
atcacaaagc acgtcgcaca gattctcgat tctcggatga 2820acaccaagta
cgacgagaac gacaagctca tccgagaggt caaggtgatt actctcaagt
2880ccaaactggt ctccgatttc cgaaaggact ttcagttcta caaggtgcga
gagatcaaca 2940attaccacca tgcccacgat gcttacctca acgccgtcgt
tggcactgcg ctcatcaaga 3000aataccccaa gctcgaaagc gagttcgttt
acggcgatta caaggtctac gacgttcgaa 3060agatgattgc caagtccgaa
caggagattg gcaaggctac tgccaagtac ttcttttact 3120ccaacatcat
gaactttttc aagaccgaga tcaccttggc caacggagag attcgaaaga
3180gaccacttat cgagaccaac ggcgaaactg gagagatcgt gtgggacaag
ggtcgagact 3240ttgcaaccgt gcgaaaggtt ctgtcgatgc ctcaggtcaa
catcgtcaag aaaaccgagg 3300ttcagactgg cggattctcc aaggagtcga
ttctgcccaa gcgaaactcc gacaagctca 3360tcgctcgaaa gaaagactgg
gatcccaaga aatacggtgg cttcgattct cctaccgtcg 3420cctattccgt
gcttgtcgtt gcgaaggtcg agaagggcaa gtccaaaaag ctcaagtccg
3480tcaaggagct gctcggaatt accatcatgg agcgatcgag cttcgagaag
aatcccatcg 3540acttcttgga agccaagggt tacaaggagg tcaagaaaga
cctcattatc aagctgccca 3600agtactctct gttcgaactg gagaacggtc
gaaagcgtat gctcgcctcc gctggcgagc 3660tgcagaaggg aaacgagctt
gccttgcctt cgaagtacgt caactttctc tatctggctt 3720ctcactacga
gaagctcaag ggttctcccg aggacaacga acagaagcaa ctcttcgttg
3780agcagcacaa acattacctc gacgagatta tcgagcagat ttccgagttt
tcgaagcgag 3840tcatcctggc tgatgccaac ttggacaagg tgctctctgc
ctacaacaag catcgggaca 3900aacccattcg agaacaggcg gagaacatca
ttcacctgtt tactcttacc aacctgggtg 3960ctcctgcagc tttcaagtac
ttcgatacca ctatcgaccg aaagcggtac acatccacca 4020aggaggttct
cgatgccacc ctgattcacc agtccatcac tggcctgtac gagacccgaa
4080tcgacctgtc tcagcttggt ggcgactcca gagccgatcc caagaaaaag
cgaaaggtct 4140aagcggccgc taagcttggc tgttttggcg gatgagagaa
gattttcagc ctgatacaga 4200ttaaatcaga acgcagaagc ggtctgataa
aacagaattt gcctggcggc agtagcgcgg 4260tggtcccacc tgaccccatg
ccgaactcag aagtgaaacg ccgtagcgcc gatggtagtg 4320tggggtctcc
ccatgcgaga gtagggaact gccaggcatc aaataaaacg aaaggctcag
4380tcgaaagact gggcctttcg ttttatctgt tgtttgtcgg tgaacgctct
cctgagtagg 4440acaaatccgc cgggagcgga tttgaacgtt gcgaagcaac
ggcccggagg gtggcgggca 4500ggacgcccgc cataaactgc caggcatcaa
attaagcaga aggccatcct gacggatggc 4560ctttttgcgt ttctacaaac
tcttttgttt atttttctaa atacattcaa atatgtatcc 4620gctcatgaga
caataaccct gataaatgct tcaataatat tgaaaaagga agagtatgag
4680tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc
ttcctgtttt 4740tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa
gatcagttgg gtgcacgagt 4800gggttacatc gaactggatc tcaacagcgg
taagatcctt gagagttttc gccccgaaga 4860acgttttcca atgatgagca
cttttaaagt tctgctatgt ggcgcggtat tatcccgtgt 4920tgacgccggg
caagagcaac tcggtcgccg catacactat tctcagaatg acttggttga
4980gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag
aattatgcag 5040tgctgccata accatgagtg ataacactgc ggccaactta
cttctgacaa cgatcggagg 5100accgaaggag ctaaccgctt ttttgcacaa
catgggggat catgtaactc gccttgatcg 5160ttgggaaccg gagctgaatg
aagccatacc aaacgacgag cgtgacacca cgatgcctgt 5220agcaatggca
acaacgttgc gcaaactatt aactggcgaa ctacttactc tagcttcccg
5280gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc
tgcgctcggc 5340ccttccggct ggctggttta ttgctgataa atctggagcc
ggtgagcgtg ggtctcgcgg 5400tatcattgca gcactggggc cagatggtaa
gccctcccgt atcgtagtta tctacacgac 5460ggggagtcag gcaactatgg
atgaacgaaa tagacagatc gctgagatag gtgcctcact 5520gattaagcat
tggtaactgt cagaccaagt ttactcatat atactttaga ttgatttaaa
5580acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc
tcatgaccaa 5640aatcccttaa cgtgagtttt cgttccactg agcgtcagac
cccgtagaaa agatcaaagg 5700atcttcttga gatccttttt ttctgcgcgt
aatctgctgc ttgcaaacaa aaaaaccacc 5760gctaccagcg gtggtttgtt
tgccggatca agagctacca actctttttc cgaaggtaac 5820tggcttcagc
agagcgcaga taccaaatac tgtccttcta gtgtagccgt agttaggcca
5880ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc
tgttaccagt 5940ggctgctgcc agtggcgata agtcgtgtct taccgggttg
gactcaagac gatagttacc 6000ggataaggcg cagcggtcgg gctgaacggg
gggttcgtgc acacagccca gcttggagcg 6060aacgacctac accgaactga
gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc 6120cgaagggaga
aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac
6180gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt
ttcgccacct 6240ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg
cggagcctat ggaaaaacgc 6300cagcaacgcg gcctttttac ggttcctggc
cttttgctgg ccttttgctc acatgttctt 6360tcctgcgtta tcccctgatt
ctgtggataa ccgtattacc gcctttgagt gagctgatac 6420cgctcgccgc
agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg
6480cctgatgcgg tattttctcc ttacgcatct gtgcggtatt tcacaccgca
tatggtgcac 6540tctcagtaca atctgctctg atgccgcata gttaagccag
tatacactcc gctatcgcta 6600cgtgactggg tcatggctgc gccccgacac
ccgccaacac ccgctgacgc gccctgacgg 6660gcttgtctgc tcccggcatc
cgcttacaga caagctgtga ccgtctccgg gagctgcatg 6720tgtcagaggt
tttcaccgtc atcaccgaaa cgcgcgaggc agcagatcaa ttcgcgcgcg
6780aaggcgaagc ggcatgcata atgtgcctgt caaatggacg aagcagggat
tctgcaaacc 6840ctatgctact ccgtcaagcc gtcaattgtc tgattcgtta
ccaattatga caacttgacg 6900gctacatcat tcactttttc ttcacaaccg
gcacggaact cgctcgggct ggccccggtg 6960cattttttaa atacccgcga
gaaatagagt tgatcgtcaa aaccaacatt gcgaccgacg 7020gtggcgatag
gcatccgggt ggtgctcaaa agcagcttcg cctggctgat acgttggtcc
7080tcgcgccagc ttaagacgct aatccctaac tgctggcgga aaagatgtga
cagacgcgac 7140ggcgacaagc aaacatgctg tgcgacgctg gcgatatcaa
aattgctgtc tgccaggtga 7200tcgctgatgt actgacaagc ctcgcgtacc
cgattatcca tcggtggatg gagcgactcg 7260ttaatcgctt ccatgcgccg
cagtaacaat tgctcaagca gatttatcgc cagcagctcc 7320gaatagcgcc
cttccccttg cccggcgtta atgatttgcc caaacaggtc gctgaaatgc
7380ggctggtgcg cttcatccgg gcgaaagaac cccgtattgg caaatattga
cggccagtta 7440agccattcat gccagtaggc gcgcggacga aagtaaaccc
actggtgata ccattcgcga 7500gcctccggat gacgaccgta gtgatgaatc
tctcctggcg ggaacagcaa aatatcaccc 7560ggtcggcaaa caaattctcg
tccctgattt ttcaccaccc cctgaccgcg aatggtgaga 7620ttgagaatat
aacctttcat tcccagcggt cggtcgataa aaaaatcgag ataaccgttg
7680gcctcaatcg gcgttaaacc cgccaccaga tgggcattaa acgagtatcc
cggcagcagg 7740ggatcatttt gcgcttcagc catacttttc atactcccgc
cattcagaga agaaaccaat 7800tgtccatatt gcatcagaca ttgccgtcac
tgcgtctttt actggctctt ctcgctaacc 7860aaaccggtaa ccccgcttat
taaaagcatt ctgtaacaaa gcgggaccaa agccatgaca 7920aaaacgcgta
acaaaagtgt ctataatcac ggcagaaaag tccacattga ttatttgcac
7980ggcgtcacac tttgctatgc catagcattt ttatccataa gattagcgga
tcctacctga 8040cgctttttat cgcaactctc tactgtttct ccatacccgt
tttttgggct aacaggagga 8100attaaccatg gggcatcacc accatcacca
cttattgatt atcttgcgtc gtcgcatccg 8160caaacaggcg cacgcacata gcaagg
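8186

SEQ ID NO: 111 | Length: 20 | Type: DNA | Organism: Artificial sequence | Description: oligo 153
111 cgacagagtt ggttccaatg
20

SEQ ID NO: 112 | Length: 4835 | Type: DNA | Organism: Artificial sequence | Description: pRF186
112 catggggcat caccaccatc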
accacgaatg cgactcagaa ctggaaatca aacgctataa 60acgtgtgcgt gtggcatccc
gtaaatgtcg cgcaaagttt aaacagctgc tgcaacatta 120tcgtgaagta
gcggctgcga aaagctccga aaacgaccgt ttacgcctcc tcctgaagca
180aatgtgcgaa ttcggaggtg gcggtgcatc ctcggaggat gtgattaaag
aatttatgcg 240gtttaaagta cgtatggaag gatcggtgaa tggccatgaa
tttgagattg agggtgaagg 300cgaaggccgc ccgtacgaag gaactcaaac
agcgaaatta aaagttacaa aaggaggtcc 360tctgccgttt gcctgggaca
tcttgagccc gcaattccag tacggttcca aagtgtatgt 420aaaacaccct
gcggatattc cggattataa aaaactgagt tttcccgagg ggtttaaatg
480ggaacgggtg atgaattttg aggatggtgg agttgtcacc gtgacccagg
actctagctt 540acaagacggt agtttcatct acaaagtaaa atttatcggc
gtaaacttcc catcggacgg 600ccccgtcatg cagaaaaaga cgatgggctg
ggaagccagc accgaacgtt tgtacccacg 660ggacggcgtt ttgaaagggg
aaatccataa ggcccttaaa ctgaaagacg gtggtcacta 720tctcgtggag
tttaaatcga tttatatggc taaaaaacca gtacagcttc cgggttatta
780ttacgttgac tccaaattgg acatcacatc gcataatgaa gattacacga
ttgttgaaca 840gtacgagcgc gccgagggcc ggcaccatct gtttctgtaa
aagcttggct gttttggcgg 900atgagagaag attttcagcc tgatacagat
taaatcagaa cgcagaagcg gtctgataaa 960acagaatttg cctggcggca
gtagcgcggt ggtcccacct gaccccatgc cgaactcaga 1020agtgaaacgc
cgtagcgccg atggtagtgt ggggtctccc catgcgagag tagggaactg
1080ccaggcatca aataaaacga aaggctcagt cgaaagactg ggcctttcgt
tttatctgtt 1140gtttgtcggt gaacgctctc ctgagtagga caaatccgcc
gggagcggat ttgaacgttg 1200cgaagcaacg gcccggaggg tggcgggcag
gacgcccgcc ataaactgcc aggcatcaaa 1260ttaagcagaa ggccatcctg
acggatggcc tttttgcgtt tctacaaact cttttgttta 1320tttttctaaa
tacattcaaa tatgtatccg ctcatgagac aataaccctg ataaatgctt
1380caataatatt gaaaaaggaa gagtatgagt attcaacatt tccgtgtcgc
ccttattccc 1440ttttttgcgg cattttgcct tcctgttttt gctcacccag
aaacgctggt gaaagtaaaa 1500gatgctgaag atcagttggg tgcacgagtg
ggttacatcg aactggatct caacagcggt 1560aagatccttg agagttttcg
ccccgaagaa cgttttccaa tgatgagcac ttttaaagtt 1620ctgctatgtg
gcgcggtatt atcccgtgtt gacgccgggc aagagcaact cggtcgccgc
1680atacactatt ctcagaatga cttggttgag tactcaccag tcacagaaaa
gcatcttacg 1740gatggcatga cagtaagaga attatgcagt gctgccataa
ccatgagtga taacactgcg 1800gccaacttac ttctgacaac gatcggagga
ccgaaggagc taaccgcttt tttgcacaac 1860atgggggatc atgtaactcg
ccttgatcgt tgggaaccgg agctgaatga agccatacca 1920aacgacgagc
gtgacaccac gatgcctgta gcaatggcaa caacgttgcg caaactatta
1980actggcgaac tacttactct agcttcccgg caacaattaa tagactggat
ggaggcggat 2040aaagttgcag gaccacttct gcgctcggcc cttccggctg
gctggtttat tgctgataaa 2100tctggagccg gtgagcgtgg gtctcgcggt
atcattgcag cactggggcc agatggtaag 2160ccctcccgta tcgtagttat
ctacacgacg gggagtcagg caactatgga tgaacgaaat 2220agacagatcg
ctgagatagg tgcctcactg attaagcatt ggtaactgtc agaccaagtt
2280tactcatata tactttagat tgatttaaaa cttcattttt aatttaaaag
gatctaggtg 2340aagatccttt ttgataatct catgaccaaa atcccttaac
gtgagttttc gttccactga 2400gcgtcagacc ccgtagaaaa gatcaaagga
tcttcttgag atcctttttt tctgcgcgta 2460atctgctgct tgcaaacaaa
aaaaccaccg ctaccagcgg tggtttgttt gccggatcaa 2520gagctaccaa
ctctttttcc gaaggtaact ggcttcagca gagcgcagat accaaatact
2580gtccttctag tgtagccgta gttaggccac cacttcaaga actctgtagc
accgcctaca 2640tacctcgctc tgctaatcct gttaccagtg gctgctgcca
gtggcgataa gtcgtgtctt 2700accgggttgg actcaagacg atagttaccg
gataaggcgc agcggtcggg ctgaacgggg 2760ggttcgtgca cacagcccag
cttggagcga acgacctaca ccgaactgag atacctacag 2820cgtgagctat
gagaaagcgc cacgcttccc gaagggagaa aggcggacag gtatccggta
2880agcggcaggg tcggaacagg agagcgcacg agggagcttc cagggggaaa
cgcctggtat 2940ctttatagtc ctgtcgggtt tcgccacctc tgacttgagc
gtcgattttt gtgatgctcg 3000tcaggggggc ggagcctatg gaaaaacgcc
agcaacgcgg cctttttacg gttcctggcc 3060ttttgctggc cttttgctca
catgttcttt cctgcgttat cccctgattc tgtggataac 3120cgtattaccg
cctttgagtg agctgatacc gctcgccgca gccgaacgac cgagcgcagc
3180gagtcagtga gcgaggaagc ggaagagcgc ctgatgcggt attttctcct
tacgcatctg 3240tgcggtattt cacaccgcat atggtgcact ctcagtacaa
tctgctctga tgccgcatag 3300ttaagccagt atacactccg ctatcgctac
gtgactgggt catggctgcg ccccgacacc 3360cgccaacacc cgctgacgcg
ccctgacggg cttgtctgct cccggcatcc gcttacagac 3420aagctgtgac
cgtctccggg agctgcatgt gtcagaggtt ttcaccgtca tcaccgaaac
3480gcgcgaggca gcagatcaat tcgcgcgcga aggcgaagcg gcatgcataa
tgtgcctgtc 3540aaatggacga agcagggatt ctgcaaaccc tatgctactc
cgtcaagccg tcaattgtct 3600gattcgttac caattatgac aacttgacgg
ctacatcatt cactttttct tcacaaccgg 3660cacggaactc gctcgggctg
gccccggtgc attttttaaa tacccgcgag aaatagagtt 3720gatcgtcaaa
accaacattg cgaccgacgg tggcgatagg catccgggtg gtgctcaaaa
3780gcagcttcgc ctggctgata cgttggtcct cgcgccagct taagacgcta
atccctaact 3840gctggcggaa aagatgtgac agacgcgacg gcgacaagca
aacatgctgt gcgacgctgg 3900cgatatcaaa attgctgtct gccaggtgat
cgctgatgta ctgacaagcc tcgcgtaccc 3960gattatccat cggtggatgg
agcgactcgt taatcgcttc catgcgccgc agtaacaatt 4020gctcaagcag
atttatcgcc agcagctccg aatagcgccc ttccccttgc ccggcgttaa
4080tgatttgccc aaacaggtcg ctgaaatgcg gctggtgcgc ttcatccggg
cgaaagaacc 4140ccgtattggc aaatattgac ggccagttaa gccattcatg
ccagtaggcg cgcggacgaa 4200agtaaaccca ctggtgatac cattcgcgag
cctccggatg acgaccgtag tgatgaatct 4260ctcctggcgg gaacagcaaa
atatcacccg gtcggcaaac aaattctcgt ccctgatttt 4320tcaccacccc
ctgaccgcga atggtgagat tgagaatata acctttcatt cccagcggtc
4380ggtcgataaa aaaatcgaga taaccgttgg cctcaatcgg cgttaaaccc
gccaccagat 4440gggcattaaa cgagtatccc ggcagcaggg gatcattttg
cgcttcagcc atacttttca 4500tactcccgcc attcagagaa gaaaccaatt
gtccatattg catcagacat tgccgtcact 4560gcgtctttta ctggctcttc
tcgctaacca aaccggtaac cccgcttatt aaaagcattc 4620tgtaacaaag
cgggaccaaa gccatgacaa aaacgcgtaa caaaagtgtc tataatcacg
4680gcagaaaagt ccacattgat tatttgcacg gcgtcacact ttgctatgcc
atagcatttt 4740tatccataag attagcggat cctacctgac gctttttatc
gcaactctct actgtttctc 4800catacccgtt ttttgggcta acaggaggaa ttaac
4835

SEQ ID NO: 113 | Length: 4736 | Type: DNA | Organism: Artificial sequence | Description: pRF192
113 aattcggagg tggcggtgca
tcctcggagg atgtgattaa agaatttatg cggtttaaag 60tacgtatgga aggatcggtg
aatggccatg aatttgagat tgagggtgaa ggcgaaggcc 120gcccgtacga
aggaactcaa acagcgaaat taaaagttac aaaaggaggt cctctgccgt
180ttgcctggga catcttgagc ccgcaattcc agtacggttc caaagtgtat
gtaaaacacc 240ctgcggatat tccggattat aaaaaactga gttttcccga
ggggtttaaa tgggaacggg 300tgatgaattt tgaggatggt ggagttgtca
ccgtgaccca ggactctagc ttacaagacg 360gtagtttcat ctacaaagta
aaatttatcg gcgtaaactt cccatcggac ggccccgtca 420tgcagaaaaa
gacgatgggc tgggaagcca gcaccgaacg tttgtaccca cgggacggcg
480ttttgaaagg ggaaatccat aaggccctta aactgaaaga cggtggtcac
tatctcgtgg 540agtttaaatc gatttatatg gctaaaaaac cagtacagct
tccgggttat tattacgttg 600actccaaatt ggacatcaca tcgcataatg
aagattacac gattgttgaa cagtacgagc 660gcgccgaggg ccggcaccat
ctgtttctgt aaaagcttgg ctgttttggc ggatgagaga 720agattttcag
cctgatacag attaaatcag aacgcagaag cggtctgata aaacagaatt
780tgcctggcgg cagtagcgcg gtggtcccac ctgaccccat gccgaactca
gaagtgaaac 840gccgtagcgc cgatggtagt gtggggtctc cccatgcgag
agtagggaac tgccaggcat 900caaataaaac gaaaggctca gtcgaaagac
tgggcctttc gttttatctg ttgtttgtcg 960gtgaacgctc tcctgagtag
gacaaatccg ccgggagcgg atttgaacgt tgcgaagcaa 1020cggcccggag
ggtggcgggc aggacgcccg ccataaactg ccaggcatca aattaagcag
1080aaggccatcc tgacggatgg cctttttgcg tttctacaaa ctcttttgtt
tatttttcta 1140aatacattca aatatgtatc cgctcatgag acaataaccc
tgataaatgc ttcaataata 1200ttgaaaaagg aagagtatga gtattcaaca
tttccgtgtc gcccttattc ccttttttgc 1260ggcattttgc cttcctgttt
ttgctcaccc agaaacgctg gtgaaagtaa aagatgctga 1320agatcagttg
ggtgcacgag tgggttacat cgaactggat ctcaacagcg gtaagatcct
1380tgagagtttt cgccccgaag aacgttttcc aatgatgagc acttttaaag
ttctgctatg 1440tggcgcggta ttatcccgtg ttgacgccgg gcaagagcaa
ctcggtcgcc gcatacacta 1500ttctcagaat gacttggttg agtactcacc
agtcacagaa aagcatctta cggatggcat 1560gacagtaaga gaattatgca
gtgctgccat aaccatgagt gataacactg cggccaactt 1620acttctgaca
acgatcggag gaccgaagga gctaaccgct tttttgcaca acatggggga
1680tcatgtaact cgccttgatc gttgggaacc ggagctgaat gaagccatac
caaacgacga 1740gcgtgacacc acgatgcctg tagcaatggc aacaacgttg
cgcaaactat taactggcga 1800actacttact ctagcttccc ggcaacaatt
aatagactgg atggaggcgg ataaagttgc 1860aggaccactt ctgcgctcgg
cccttccggc tggctggttt attgctgata aatctggagc 1920cggtgagcgt
gggtctcgcg gtatcattgc agcactgggg ccagatggta agccctcccg
1980tatcgtagtt atctacacga cggggagtca ggcaactatg gatgaacgaa
atagacagat 2040cgctgagata ggtgcctcac tgattaagca ttggtaactg
tcagaccaag tttactcata 2100tatactttag attgatttaa aacttcattt
ttaatttaaa aggatctagg tgaagatcct 2160ttttgataat ctcatgacca
aaatccctta acgtgagttt tcgttccact gagcgtcaga 2220ccccgtagaa
aagatcaaag gatcttcttg agatcctttt tttctgcgcg taatctgctg
2280cttgcaaaca aaaaaaccac cgctaccagc ggtggtttgt ttgccggatc
aagagctacc 2340aactcttttt ccgaaggtaa ctggcttcag cagagcgcag
ataccaaata ctgtccttct 2400agtgtagccg tagttaggcc accacttcaa
gaactctgta gcaccgccta catacctcgc 2460tctgctaatc ctgttaccag
tggctgctgc cagtggcgat aagtcgtgtc ttaccgggtt 2520ggactcaaga
cgatagttac cggataaggc gcagcggtcg ggctgaacgg ggggttcgtg
2580cacacagccc agcttggagc gaacgaccta caccgaactg agatacctac
agcgtgagct 2640atgagaaagc gccacgcttc ccgaagggag aaaggcggac
aggtatccgg taagcggcag 2700ggtcggaaca ggagagcgca cgagggagct
tccaggggga aacgcctggt atctttatag 2760tcctgtcggg tttcgccacc
tctgacttga gcgtcgattt ttgtgatgct cgtcaggggg 2820gcggagccta
tggaaaaacg ccagcaacgc ggccttttta cggttcctgg ccttttgctg
2880gccttttgct cacatgttct ttcctgcgtt atcccctgat tctgtggata
accgtattac 2940cgcctttgag tgagctgata ccgctcgccg cagccgaacg
accgagcgca gcgagtcagt 3000gagcgaggaa gcggaagagc gcctgatgcg
gtattttctc cttacgcatc tgtgcggtat 3060ttcacaccgc atatggtgca
ctctcagtac aatctgctct gatgccgcat agttaagcca 3120gtatacactc
cgctatcgct acgtgactgg gtcatggctg cgccccgaca cccgccaaca
3180cccgctgacg cgccctgacg ggcttgtctg ctcccggcat ccgcttacag
acaagctgtg 3240accgtctccg ggagctgcat gtgtcagagg ttttcaccgt
catcaccgaa acgcgcgagg 3300cagcagatca attcgcgcgc gaaggcgaag
cggcatgcat aatgtgcctg tcaaatggac 3360gaagcaggga ttctgcaaac
cctatgctac tccgtcaagc cgtcaattgt ctgattcgtt 3420accaattatg
acaacttgac ggctacatca ttcacttttt cttcacaacc ggcacggaac
3480tcgctcgggc tggccccggt gcatttttta aatacccgcg agaaatagag
ttgatcgtca 3540aaaccaacat tgcgaccgac ggtggcgata ggcatccggg
tggtgctcaa aagcagcttc 3600gcctggctga tacgttggtc ctcgcgccag
cttaagacgc taatccctaa ctgctggcgg 3660aaaagatgtg acagacgcga
cggcgacaag caaacatgct gtgcgacgct ggcgatatca 3720aaattgctgt
ctgccaggtg atcgctgatg tactgacaag cctcgcgtac ccgattatcc
3780atcggtggat ggagcgactc gttaatcgct tccatgcgcc gcagtaacaa
ttgctcaagc 3840agatttatcg ccagcagctc cgaatagcgc ccttcccctt
gcccggcgtt aatgatttgc 3900ccaaacaggt cgctgaaatg cggctggtgc
gcttcatccg ggcgaaagaa ccccgtattg 3960gcaaatattg acggccagtt
aagccattca tgccagtagg cgcgcggacg aaagtaaacc 4020cactggtgat
accattcgcg agcctccgga tgacgaccgt agtgatgaat ctctcctggc
4080gggaacagca aaatatcacc cggtcggcaa acaaattctc gtccctgatt
tttcaccacc 4140ccctgaccgc gaatggtgag attgagaata taacctttca
ttcccagcgg tcggtcgata 4200aaaaaatcga gataaccgtt ggcctcaatc
ggcgttaaac ccgccaccag atgggcatta 4260aacgagtatc ccggcagcag
gggatcattt tgcgcttcag ccatactttt catactcccg 4320ccattcagag
aagaaaccaa ttgtccatat tgcatcagac attgccgtca ctgcgtcttt
4380tactggctct tctcgctaac caaaccggta accccgctta ttaaaagcat
tctgtaacaa 4440agcgggacca aagccatgac aaaaacgcgt aacaaaagtg
tctataatca cggcagaaaa 4500gtccacattg attatttgca cggcgtcaca
ctttgctatg ccatagcatt tttatccata 4560agattagcgg atcctacctg
acgcttttta tcgcaactct ctactgtttc tccatacccg 4620ttttttgggc
taacaggagg aattaaccat ggggcatcac caccatcacc acgcgggtta
4680cctgctgggc aagattaatc ttaaagcctg cgccgcgtgt gctaagaaaa ttttgg
4736

SEQ ID NO: 114 | Length: 4727 | Type: DNA | Organism: Artificial sequence | Description: pRF190
114 aattcggagg tggcggtgca
tcctcggagg atgtgattaa agaatttatg cggtttaaag 60tacgtatgga aggatcggtg
aatggccatg aatttgagat tgagggtgaa ggcgaaggcc 120gcccgtacga
aggaactcaa acagcgaaat taaaagttac aaaaggaggt cctctgccgt
180ttgcctggga catcttgagc ccgcaattcc agtacggttc caaagtgtat
gtaaaacacc 240ctgcggatat tccggattat aaaaaactga gttttcccga
ggggtttaaa tgggaacggg 300tgatgaattt tgaggatggt ggagttgtca
ccgtgaccca ggactctagc ttacaagacg 360gtagtttcat ctacaaagta
aaatttatcg gcgtaaactt cccatcggac ggccccgtca 420tgcagaaaaa
gacgatgggc tgggaagcca gcaccgaacg tttgtaccca cgggacggcg
480ttttgaaagg ggaaatccat aaggccctta aactgaaaga cggtggtcac
tatctcgtgg 540agtttaaatc gatttatatg gctaaaaaac cagtacagct
tccgggttat tattacgttg 600actccaaatt ggacatcaca tcgcataatg
aagattacac gattgttgaa cagtacgagc 660gcgccgaggg ccggcaccat
ctgtttctgt aaaagcttgg ctgttttggc ggatgagaga 720agattttcag
cctgatacag attaaatcag aacgcagaag cggtctgata aaacagaatt
780tgcctggcgg cagtagcgcg gtggtcccac ctgaccccat gccgaactca
gaagtgaaac 840gccgtagcgc cgatggtagt gtggggtctc cccatgcgag
agtagggaac tgccaggcat 900caaataaaac gaaaggctca gtcgaaagac
tgggcctttc gttttatctg ttgtttgtcg 960gtgaacgctc tcctgagtag
gacaaatccg ccgggagcgg atttgaacgt tgcgaagcaa 1020cggcccggag
ggtggcgggc aggacgcccg ccataaactg ccaggcatca aattaagcag
1080aaggccatcc tgacggatgg cctttttgcg tttctacaaa ctcttttgtt
tatttttcta 1140aatacattca aatatgtatc cgctcatgag acaataaccc
tgataaatgc ttcaataata 1200ttgaaaaagg aagagtatga gtattcaaca
tttccgtgtc gcccttattc ccttttttgc 1260ggcattttgc cttcctgttt
ttgctcaccc agaaacgctg gtgaaagtaa aagatgctga 1320agatcagttg
ggtgcacgag tgggttacat cgaactggat ctcaacagcg gtaagatcct
1380tgagagtttt cgccccgaag aacgttttcc aatgatgagc acttttaaag
ttctgctatg 1440tggcgcggta ttatcccgtg ttgacgccgg gcaagagcaa
ctcggtcgcc gcatacacta 1500ttctcagaat gacttggttg agtactcacc
agtcacagaa aagcatctta cggatggcat 1560gacagtaaga gaattatgca
gtgctgccat aaccatgagt gataacactg cggccaactt 1620acttctgaca
acgatcggag gaccgaagga gctaaccgct tttttgcaca acatggggga
1680tcatgtaact cgccttgatc gttgggaacc ggagctgaat gaagccatac
caaacgacga 1740gcgtgacacc acgatgcctg tagcaatggc aacaacgttg
cgcaaactat taactggcga 1800actacttact ctagcttccc ggcaacaatt
aatagactgg atggaggcgg ataaagttgc 1860aggaccactt ctgcgctcgg
cccttccggc tggctggttt attgctgata aatctggagc 1920cggtgagcgt
gggtctcgcg gtatcattgc agcactgggg ccagatggta agccctcccg
1980tatcgtagtt atctacacga cggggagtca ggcaactatg gatgaacgaa
atagacagat 2040cgctgagata ggtgcctcac tgattaagca ttggtaactg
tcagaccaag tttactcata 2100tatactttag attgatttaa aacttcattt
ttaatttaaa aggatctagg tgaagatcct 2160ttttgataat ctcatgacca
aaatccctta acgtgagttt tcgttccact gagcgtcaga 2220ccccgtagaa
aagatcaaag gatcttcttg agatcctttt tttctgcgcg taatctgctg
2280cttgcaaaca aaaaaaccac cgctaccagc ggtggtttgt ttgccggatc
aagagctacc 2340aactcttttt ccgaaggtaa ctggcttcag cagagcgcag
ataccaaata ctgtccttct 2400agtgtagccg tagttaggcc accacttcaa
gaactctgta gcaccgccta catacctcgc 2460tctgctaatc ctgttaccag
tggctgctgc cagtggcgat aagtcgtgtc ttaccgggtt 2520ggactcaaga
cgatagttac cggataaggc gcagcggtcg ggctgaacgg ggggttcgtg
2580cacacagccc agcttggagc gaacgaccta caccgaactg agatacctac
agcgtgagct 2640atgagaaagc gccacgcttc ccgaagggag aaaggcggac
aggtatccgg taagcggcag 2700ggtcggaaca ggagagcgca cgagggagct
tccaggggga aacgcctggt atctttatag 2760tcctgtcggg tttcgccacc
tctgacttga gcgtcgattt ttgtgatgct cgtcaggggg 2820gcggagccta
tggaaaaacg ccagcaacgc ggccttttta cggttcctgg ccttttgctg
2880gccttttgct cacatgttct ttcctgcgtt atcccctgat tctgtggata
accgtattac 2940cgcctttgag tgagctgata ccgctcgccg cagccgaacg
accgagcgca gcgagtcagt 3000gagcgaggaa gcggaagagc gcctgatgcg
gtattttctc cttacgcatc tgtgcggtat 3060ttcacaccgc atatggtgca
ctctcagtac aatctgctct gatgccgcat agttaagcca 3120gtatacactc
cgctatcgct acgtgactgg gtcatggctg cgccccgaca cccgccaaca
3180cccgctgacg cgccctgacg ggcttgtctg ctcccggcat ccgcttacag
acaagctgtg 3240accgtctccg ggagctgcat gtgtcagagg ttttcaccgt
catcaccgaa acgcgcgagg 3300cagcagatca attcgcgcgc gaaggcgaag
cggcatgcat aatgtgcctg tcaaatggac 3360gaagcaggga ttctgcaaac
cctatgctac tccgtcaagc cgtcaattgt ctgattcgtt 3420accaattatg
acaacttgac ggctacatca ttcacttttt cttcacaacc ggcacggaac
3480tcgctcgggc tggccccggt gcatttttta aatacccgcg agaaatagag
ttgatcgtca 3540aaaccaacat tgcgaccgac ggtggcgata ggcatccggg
tggtgctcaa aagcagcttc 3600gcctggctga tacgttggtc ctcgcgccag
cttaagacgc taatccctaa ctgctggcgg 3660aaaagatgtg acagacgcga
cggcgacaag caaacatgct gtgcgacgct ggcgatatca 3720aaattgctgt
ctgccaggtg atcgctgatg tactgacaag cctcgcgtac ccgattatcc
3780atcggtggat ggagcgactc gttaatcgct tccatgcgcc gcagtaacaa
ttgctcaagc 3840agatttatcg ccagcagctc cgaatagcgc ccttcccctt
gcccggcgtt aatgatttgc 3900ccaaacaggt cgctgaaatg cggctggtgc
gcttcatccg ggcgaaagaa ccccgtattg 3960gcaaatattg acggccagtt
aagccattca tgccagtagg cgcgcggacg aaagtaaacc 4020cactggtgat
accattcgcg agcctccgga tgacgaccgt agtgatgaat ctctcctggc
4080gggaacagca aaatatcacc cggtcggcaa acaaattctc gtccctgatt
tttcaccacc 4140ccctgaccgc gaatggtgag attgagaata taacctttca
ttcccagcgg tcggtcgata 4200aaaaaatcga gataaccgtt ggcctcaatc
ggcgttaaac ccgccaccag atgggcatta 4260aacgagtatc ccggcagcag
gggatcattt tgcgcttcag ccatactttt catactcccg 4320ccattcagag
aagaaaccaa ttgtccatat tgcatcagac attgccgtca ctgcgtcttt
4380tactggctct tctcgctaac caaaccggta accccgctta ttaaaagcat
tctgtaacaa 4440agcgggacca aagccatgac aaaaacgcgt aacaaaagtg
tctataatca cggcagaaaa 4500gtccacattg attatttgca cggcgtcaca
ctttgctatg ccatagcatt tttatccata 4560agattagcgg atcctacctg
acgcttttta tcgcaactct ctactgtttc tccatacccg 4620ttttttgggc
taacaggagg aattaaccat ggggcatcac caccatcacc acttattgat
4680tatcttgcgt cgtcgcatcc gcaaacaggc gcacgcacat agcaagg
4727

SEQ ID NO: 115 | Length: 1395 | Type: PRT | Organism: artificial sequence | Description: his-CFFKDEL-Cas9
115 Met Gly His
His His His His His Cys Phe Phe Lys Asp Glu Leu Glu1 5 10 15Phe Asp
Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr Asn Ser Val 20 25 30Gly
Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe 35 40
45Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
50 55 60Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg
Leu65 70 75 80Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn
Arg Ile Cys 85 90 95Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys
Val Asp Asp Ser 100 105 110Phe Phe His Arg Leu Glu Glu Ser Phe Leu
Val Glu Glu Asp Lys Lys 115 120 125His Glu Arg His Pro Ile Phe Gly
Asn Ile Val Asp Glu Val Ala Tyr 130 135 140His Glu Lys Tyr Pro Thr
Ile Tyr His Leu Arg Lys Lys Leu Val Asp145 150 155 160Ser Thr Asp
Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His 165 170 175Met
Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro 180 185
190Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
195 200 205Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val
Asp Ala 210 215 220Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg
Arg Leu Glu Asn225 230 235 240Leu Ile Ala Gln Leu Pro Gly Glu Lys
Lys Asn Gly Leu Phe Gly Asn 245 250 255Leu Ile Ala Leu Ser Leu Gly
Leu Thr Pro Asn Phe Lys Ser Asn Phe 260 265 270Asp Leu Ala Glu Asp
Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp 275 280 285Asp Asp Leu
Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp 290 295 300Leu
Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp305 310
315 320Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala
Ser 325 330 335Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr
Leu Leu Lys 340 345 350Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr
Lys Glu Ile Phe Phe 355 360 365Asp Gln Ser Lys Asn Gly Tyr Ala Gly
Tyr Ile Asp Gly Gly Ala Ser 370 375 380Gln Glu Glu Phe Tyr Lys Phe
Ile Lys Pro Ile Leu Glu Lys Met Asp385 390 395 400Gly Thr Glu Glu
Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg 405 410 415Lys Gln
Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu 420 425
430Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
435 440 445Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe
Arg Ile 450 455 460Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser
Arg Phe Ala Trp465 470 475 480Met Thr Arg Lys Ser Glu Glu Thr Ile
Thr Pro Trp Asn Phe Glu Glu 485 490 495Val Val Asp Lys Gly Ala Ser
Ala Gln Ser Phe Ile Glu Arg Met Thr 500 505 510Asn Phe Asp Lys Asn
Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser 515 520 525Leu Leu Tyr
Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys 530 535 540Tyr
Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln545 550
555 560Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val
Thr 565 570 575Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu
Cys Phe Asp 580 585 590Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe
Asn Ala Ser Leu Gly 595 600 605Thr Tyr His Asp Leu Leu Lys Ile Ile
Lys Asp Lys Asp Phe Leu Asp 610 615 620Asn Glu Glu Asn Glu Asp Ile
Leu Glu Asp Ile Val Leu Thr Leu Thr625 630 635 640Leu Phe Glu Asp
Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala 645 650 655His Leu
Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr 660 665
670Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
675 680 685Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp
Gly Phe 690 695 700Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp
Ser Leu Thr Phe705 710 715 720Lys Glu Asp Ile Gln Lys Ala Gln Val
Ser Gly Gln Gly Asp Ser Leu 725 730 735His Glu His Ile Ala Asn Leu
Ala Gly Ser Pro Ala Ile Lys Lys Gly 740 745 750Ile Leu Gln Thr Val
Lys Val Val Asp Glu Leu Val Lys Val Met Gly 755 760 765Arg His Lys
Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln 770 775 780Thr
Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile785 790
795 800Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His
Pro 805 810 815Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu
Tyr Tyr Leu 820 825 830Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu
Leu Asp Ile Asn Arg 835 840 845Leu Ser Asp Tyr Asp Val Asp His Ile
Val Pro Gln Ser Phe Leu Lys 850 855 860Asp Asp Ser Ile Asp Asn Lys
Val Leu Thr Arg Ser Asp Lys Asn Arg865 870 875 880Gly Lys Ser Asp
Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys 885 890 895Asn Tyr
Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys 900 905
910Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
915 920 925Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln
Ile Thr 930 935 940Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn
Thr Lys Tyr Asp945 950 955 960Glu Asn Asp Lys Leu Ile Arg Glu Val
Lys Val Ile Thr Leu Lys Ser 965 970 975Lys Leu Val Ser Asp Phe Arg
Lys Asp Phe Gln Phe Tyr Lys Val Arg 980 985 990Glu Ile Asn Asn Tyr
His His Ala His Asp Ala Tyr Leu Asn Ala Val 995 1000 1005Val Gly
Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu 1010 1015
1020Phe Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile
1025 1030 1035Ala Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys
Tyr Phe 1040 1045 1050Phe Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr
Glu Ile Thr Leu 1055 1060 1065Ala Asn Gly Glu Ile Arg Lys Arg Pro
Leu Ile Glu Thr Asn Gly 1070 1075 1080Glu Thr Gly Glu Ile Val Trp
Asp Lys Gly Arg Asp Phe Ala Thr 1085 1090 1095Val Arg Lys Val Leu
Ser Met Pro Gln Val Asn Ile Val Lys Lys 1100 1105 1110Thr Glu Val
Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Leu Pro 1115 1120 1125Lys
Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp 1130 1135
1140Pro Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser
1145 1150 1155Val Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys
Lys Leu 1160 1165 1170Lys Ser Val Lys Glu Leu Leu Gly Ile Thr Ile
Met Glu Arg Ser 1175 1180 1185Ser Phe Glu Lys Asn Pro Ile Asp Phe
Leu Glu Ala Lys Gly Tyr 1190 1195 1200Lys Glu Val Lys Lys Asp Leu
Ile Ile Lys Leu Pro Lys Tyr Ser 1205 1210 1215Leu Phe Glu Leu Glu
Asn Gly Arg Lys Arg Met Leu Ala Ser Ala 1220 1225 1230Gly Glu Leu
Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr 1235 1240 1245Val
Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly 1250 1255
1260Ser Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His
1265 1270 1275Lys His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu
Phe Ser 1280 1285 1290Lys Arg Val Ile Leu Ala Asp Ala Asn Leu Asp
Lys Val Leu Ser 1295 1300 1305Ala Tyr Asn Lys His Arg Asp Lys Pro
Ile Arg Glu Gln Ala Glu 1310 1315 1320Asn Ile Ile His Leu Phe Thr
Leu Thr Asn Leu Gly Ala Pro Ala 1325 1330 1335Ala Phe Lys Tyr Phe
Asp Thr Thr Ile Asp Arg Lys Arg Tyr Thr 1340 1345 1350Ser Thr Lys
Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile 1355 1360 1365Thr
Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly 1370 1375
1380Asp Ser Arg Ala Asp Pro Lys Lys Lys Arg Lys Val 1385 1390
1395
116 1412 PRT Artificial sequence his-MPG1-Cas9
116 Met Gly His His
His His His His Gly Ala Leu Phe Leu Gly Gln Leu1 5 10 15Gly Ala Ala
Gly Ser Thr Met Gly Ala Pro Lys Lys Lys Arg Lys Val 20 25 30Glu Phe
Asp Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr Asn Ser 35 40 45Val
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys 50 55
60Phe Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu65
70 75 80Ile Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr
Arg 85 90 95Leu Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn
Arg Ile 100 105 110Cys Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala
Lys Val Asp Asp 115 120 125Ser Phe Phe His Arg Leu Glu Glu Ser Phe
Leu Val Glu Glu Asp Lys 130 135 140Lys His Glu Arg His Pro Ile Phe
Gly Asn Ile Val Asp Glu Val Ala145 150 155 160Tyr His Glu Lys Tyr
Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val 165 170 175Asp Ser Thr
Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala 180 185 190His
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn 195 200
205Pro Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr
210 215 220Tyr Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly
Val Asp225 230 235 240Ala Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys
Ser Arg Arg Leu Glu 245 250 255Asn Leu Ile Ala Gln Leu Pro Gly Glu
Lys Lys Asn Gly Leu Phe Gly 260 265 270Asn Leu Ile Ala Leu Ser Leu
Gly Leu Thr Pro Asn Phe Lys Ser Asn 275 280 285Phe Asp Leu Ala Glu
Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr 290 295 300Asp Asp Asp
Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala305 310 315
320Asp Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser
325 330 335Asp Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu
Ser Ala 340 345 350Ser Met Ile Lys Arg Tyr Asp Glu His His Gln Asp
Leu Thr Leu Leu 355 360 365Lys Ala Leu Val Arg Gln Gln Leu Pro Glu
Lys Tyr Lys Glu Ile Phe 370 375 380Phe Asp Gln Ser Lys Asn Gly Tyr
Ala Gly Tyr Ile Asp Gly Gly Ala385 390 395 400Ser Gln Glu Glu Phe
Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met 405 410 415Asp Gly Thr
Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu 420 425 430Arg
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His 435 440
445Leu Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro
450 455 460Phe Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr
Phe Arg465 470 475 480Ile Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly
Asn Ser Arg Phe Ala 485 490 495Trp Met Thr Arg Lys Ser Glu Glu Thr
Ile Thr Pro Trp Asn Phe Glu 500 505 510Glu Val Val Asp Lys Gly Ala
Ser Ala Gln Ser Phe Ile Glu Arg Met 515 520 525Thr Asn Phe Asp Lys
Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His 530 535 540Ser Leu Leu
Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val545 550 555
560Lys Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu
565 570 575Gln Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg
Lys Val 580 585 590Thr Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys
Ile Glu Cys Phe 595 600 605Asp Ser Val Glu Ile Ser Gly Val Glu Asp
Arg Phe Asn Ala Ser Leu 610 615 620Gly Thr Tyr His Asp Leu Leu Lys
Ile Ile Lys Asp Lys Asp Phe Leu625 630 635 640Asp Asn Glu Glu Asn
Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu 645 650 655Thr Leu Phe
Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr 660 665 670Ala
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg 675 680
685Tyr Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg
690 695 700Asp Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser
Asp Gly705 710 715 720Phe Ala Asn Arg Asn Phe Met Gln Leu Ile His
Asp Asp Ser Leu Thr 725 730 735Phe Lys Glu Asp Ile Gln Lys Ala Gln
Val Ser Gly Gln Gly Asp Ser 740 745 750Leu His Glu His Ile Ala Asn
Leu Ala Gly Ser Pro Ala Ile Lys Lys 755 760 765Gly Ile Leu Gln Thr
Val Lys Val Val Asp Glu Leu Val Lys Val Met 770 775 780Gly Arg His
Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn785 790 795
800Gln Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg
805 810 815Ile Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys
Glu His 820 825 830Pro Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu
Tyr Leu Tyr Tyr 835 840 845Leu Gln Asn Gly Arg Asp Met Tyr Val Asp
Gln Glu Leu Asp Ile Asn 850 855 860Arg Leu Ser Asp Tyr Asp Val Asp
His Ile Val Pro Gln Ser Phe Leu865 870 875 880Lys Asp Asp Ser Ile
Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn 885 890 895Arg Gly Lys
Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met 900 905 910Lys
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg 915 920
925Lys Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu
930 935 940Asp Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg
Gln Ile945 950 955 960Thr Lys His Val Ala Gln Ile Leu Asp Ser Arg
Met Asn Thr Lys Tyr 965 970 975Asp Glu Asn Asp Lys Leu Ile Arg Glu
Val Lys Val Ile Thr Leu Lys 980 985 990Ser Lys Leu Val Ser Asp Phe
Arg Lys Asp Phe Gln Phe Tyr Lys Val 995 1000 1005Arg Glu Ile Asn
Asn Tyr His His Ala His Asp Ala Tyr Leu Asn 1010 1015 1020Ala Val
Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu 1025 1030
1035Ser Glu Phe Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys
1040 1045 1050Met Ile Ala Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr
Ala Lys 1055 1060 1065Tyr Phe Phe Tyr Ser Asn Ile Met Asn Phe Phe
Lys Thr Glu Ile 1070 1075 1080Thr Leu Ala Asn Gly Glu Ile Arg Lys
Arg Pro Leu Ile Glu Thr 1085 1090 1095Asn Gly Glu Thr Gly Glu Ile
Val Trp Asp Lys Gly Arg Asp Phe 1100 1105 1110Ala Thr Val Arg Lys
Val Leu Ser Met Pro Gln Val Asn Ile Val 1115 1120 1125Lys Lys Thr
Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile 1130 1135 1140Leu
Pro Lys Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp 1145 1150
1155Trp Asp Pro Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala
1160 1165 1170Tyr Ser Val Leu Val Val Ala Lys Val Glu Lys Gly Lys
Ser Lys 1175 1180 1185Lys Leu Lys Ser Val Lys Glu Leu Leu Gly Ile
Thr Ile Met Glu 1190 1195 1200Arg Ser Ser Phe Glu Lys Asn Pro Ile
Asp Phe Leu Glu Ala Lys 1205 1210 1215Gly Tyr Lys Glu Val Lys Lys
Asp Leu Ile Ile Lys Leu Pro Lys 1220 1225 1230Tyr Ser Leu Phe Glu
Leu Glu Asn Gly Arg Lys Arg Met Leu Ala 1235 1240 1245Ser Ala Gly
Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser 1250 1255 1260Lys
Tyr Val Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu 1265 1270
1275Lys Gly Ser Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu
1280 1285 1290Gln His Lys His Tyr Leu Asp Glu Ile Ile Glu Gln Ile
Ser Glu 1295 1300 1305Phe Ser Lys Arg Val Ile Leu Ala Asp Ala Asn
Leu Asp Lys Val 1310 1315 1320Leu Ser Ala Tyr Asn Lys His Arg Asp
Lys Pro Ile Arg Glu Gln 1325 1330 1335Ala Glu Asn Ile Ile His Leu
Phe Thr Leu Thr Asn Leu Gly Ala 1340 1345 1350Pro Ala Ala Phe Lys
Tyr Phe Asp Thr Thr Ile Asp Arg Lys Arg 1355 1360 1365Tyr Thr Ser
Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln 1370 1375 1380Ser
Ile Thr Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu 1385 1390
1395Gly Gly Asp Ser Arg Ala Asp Pro Lys Lys Lys Arg Lys Val 1400
1405 1410
117 8237 DNA Artificial sequence pRF48
117 aattcgacaa
gaaatactcc atcggcctgg acattggaac caactctgtc ggctgggctg 60tcatcaccga
cgagtacaag gtgccctcca agaaattcaa ggtcctcgga aacaccgatc
120gacactccat caagaaaaac ctcattggtg ccctgttgtt cgattctggc
gagactgccg 180aagctaccag actcaagcga actgctcggc gacgttacac
ccgacggaag aaccgaatct 240gctacctgca ggagatcttt tccaacgaga
tggccaaggt ggacgattcg ttctttcatc 300gactggagga atccttcctc
gtcgaggaag acaagaaaca cgagcgtcat cccatctttg 360gcaacattgt
ggacgaggtt gcttaccacg agaagtatcc taccatctac cacctgcgaa
420agaaactcgt cgattccacc gacaaggcgg atctcagact tatctacctc gctctggcac
480acatgatcaa gtttcgaggt catttcctca tcgagggcga tctcaatccc
gacaacagcg 540atgtggacaa gctgttcatt cagctcgttc agacctacaa
ccagctgttc gaggaaaacc 600ccatcaatgc ctccggagtc gatgcaaagg
ccatcttgtc tgctcgactc tcgaagagca 660gacgactgga gaacctcatt
gcccaacttc ctggcgagaa aaagaacgga ctgtttggca 720acctcattgc
cctttctctt ggtctcacac ccaacttcaa gtccaacttc gatctggcgg
780aggacgccaa gctccagctg tccaaggaca cctacgacga tgacctcgac
aacctgcttg 840cacagattgg cgatcagtac gccgacctgt ttctcgctgc
caagaacctt tcggatgcta 900ttctcttgtc tgacattctg cgagtcaaca
ccgagatcac aaaggctccc ctttctgcct 960ccatgatcaa gcgatacgac
gagcaccatc aggatctcac actgctcaag gctcttgtcc 1020gacagcaact
gcccgagaag tacaaggaga tctttttcga tcagtcgaag aacggctacg
1080ctggatacat cgacggcgga gcctctcagg aagagttcta caagttcatc
aagccaattc 1140tcgagaagat ggacggaacc gaggaactgc ttgtcaagct
caatcgagag gatctgcttc 1200ggaagcaacg aaccttcgac aacggcagca
ttcctcatca gatccacctc ggtgagctgc 1260acgccattct tcgacgtcag
gaagacttct acccctttct caaggacaac cgagagaaga 1320tcgagaagat
tcttaccttt cgaatcccct actatgttgg tcctcttgcc agaggaaact
1380ctcgatttgc ttggatgact cgaaagtccg aggaaaccat cactccctgg
aacttcgagg 1440aagtcgtgga caagggtgcc tctgcacagt ccttcatcga
gcgaatgacc aacttcgaca 1500agaatctgcc caacgagaag gttcttccca
agcattcgct gctctacgag tactttacag 1560tctacaacga actcaccaaa
gtcaagtacg ttaccgaggg aatgcgaaag cctgccttct 1620tgtctggcga
acagaagaaa gccattgtcg atctcctgtt caagaccaac cgaaaggtca
1680ctgttaagca gctcaaggag gactacttca agaaaatcga gtgtttcgac
agcgtcgaga 1740tttccggagt tgaggaccga ttcaacgcct ctttgggcac
ctatcacgat ctgctcaaga 1800ttatcaagga caaggatttt ctcgacaacg
aggaaaacga ggacattctg gaggacatcg 1860tgctcactct taccctgttc
gaagatcggg agatgatcga ggaacgactc aagacatacg 1920ctcacctgtt
cgacgacaag gtcatgaaac aactcaagcg acgtagatac accggctggg
1980gaagactttc gcgaaagctc atcaacggca tcagagacaa gcagtccgga
aagaccattc 2040tggactttct caagtccgat ggctttgcca accgaaactt
catgcagctc attcacgacg 2100attctcttac cttcaaggag gacatccaga
aggcacaagt gtccggtcag ggcgacagct 2160tgcacgaaca tattgccaac
ctggctggtt cgccagccat caagaaaggc attctccaga 2220ctgtcaaggt
tgtcgacgag ctggtgaagg tcatgggacg tcacaagccc gagaacattg
2280tgatcgagat ggccagagag aaccagacaa ctcaaaaggg tcagaaaaac
tcgcgagagc 2340ggatgaagcg aatcgaggaa ggcatcaagg agctgggatc
ccagattctc aaggagcatc 2400ccgtcgagaa cactcaactg cagaacgaga
agctgtatct ctactatctg cagaatggtc 2460gagacatgta cgtggatcag
gaactggaca tcaatcgtct cagcgactac gatgtggacc 2520acattgtccc
tcaatccttt ctcaaggacg attctatcga caacaaggtc cttacacgat
2580ccgacaagaa cagaggcaag tcggacaacg ttcccagcga agaggtggtc
aaaaagatga 2640agaactactg gcgacagctg ctcaacgcca agctcattac
ccagcgaaag ttcgacaatc 2700ttaccaaggc cgagcgaggc ggtctgtccg
agctcgacaa ggctggcttc atcaagcgtc 2760aactcgtcga gaccagacag
atcacaaagc acgtcgcaca gattctcgat tctcggatga 2820acaccaagta
cgacgagaac gacaagctca tccgagaggt caaggtgatt actctcaagt
2880ccaaactggt ctccgatttc cgaaaggact ttcagttcta caaggtgcga
gagatcaaca 2940attaccacca tgcccacgat gcttacctca acgccgtcgt
tggcactgcg ctcatcaaga 3000aataccccaa gctcgaaagc gagttcgttt
acggcgatta caaggtctac gacgttcgaa 3060agatgattgc caagtccgaa
caggagattg gcaaggctac tgccaagtac ttcttttact 3120ccaacatcat
gaactttttc aagaccgaga tcaccttggc caacggagag attcgaaaga
3180gaccacttat cgagaccaac ggcgaaactg gagagatcgt gtgggacaag
ggtcgagact 3240ttgcaaccgt gcgaaaggtt ctgtcgatgc ctcaggtcaa
catcgtcaag aaaaccgagg 3300ttcagactgg cggattctcc aaggagtcga
ttctgcccaa gcgaaactcc gacaagctca 3360tcgctcgaaa gaaagactgg
gatcccaaga aatacggtgg cttcgattct cctaccgtcg 3420cctattccgt
gcttgtcgtt gcgaaggtcg agaagggcaa gtccaaaaag ctcaagtccg
3480tcaaggagct gctcggaatt accatcatgg agcgatcgag cttcgagaag
aatcccatcg 3540acttcttgga agccaagggt tacaaggagg tcaagaaaga
cctcattatc aagctgccca 3600agtactctct gttcgaactg gagaacggtc
gaaagcgtat gctcgcctcc gctggcgagc 3660tgcagaaggg aaacgagctt
gccttgcctt cgaagtacgt caactttctc tatctggctt 3720ctcactacga
gaagctcaag ggttctcccg aggacaacga acagaagcaa ctcttcgttg
3780agcagcacaa acattacctc gacgagatta tcgagcagat ttccgagttt
tcgaagcgag 3840tcatcctggc tgatgccaac ttggacaagg tgctctctgc
ctacaacaag catcgggaca 3900aacccattcg agaacaggcg gagaacatca
ttcacctgtt tactcttacc aacctgggtg 3960ctcctgcagc tttcaagtac
ttcgatacca ctatcgaccg aaagcggtac acatccacca 4020aggaggttct
cgatgccacc ctgattcacc agtccatcac tggcctgtac gagacccgaa
4080tcgacctgtc tcagcttggt ggcgactcca gagccgatcc caagaaaaag
cgaaaggtct 4140aagcggccgc taagcttggc tgttttggcg gatgagagaa
gattttcagc ctgatacaga 4200ttaaatcaga acgcagaagc ggtctgataa
aacagaattt gcctggcggc agtagcgcgg 4260tggtcccacc tgaccccatg
ccgaactcag aagtgaaacg ccgtagcgcc gatggtagtg 4320tggggtctcc
ccatgcgaga gtagggaact gccaggcatc aaataaaacg aaaggctcag
4380tcgaaagact gggcctttcg ttttatctgt tgtttgtcgg tgaacgctct
cctgagtagg 4440acaaatccgc cgggagcgga tttgaacgtt gcgaagcaac
ggcccggagg gtggcgggca 4500ggacgcccgc cataaactgc caggcatcaa
attaagcaga aggccatcct gacggatggc 4560ctttttgcgt ttctacaaac
tcttttgttt atttttctaa atacattcaa atatgtatcc 4620gctcatgaga
caataaccct gataaatgct tcaataatat tgaaaaagga agagtatgag
4680tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc
ttcctgtttt 4740tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa
gatcagttgg gtgcacgagt 4800gggttacatc gaactggatc tcaacagcgg
taagatcctt gagagttttc gccccgaaga 4860acgttttcca atgatgagca
cttttaaagt tctgctatgt ggcgcggtat tatcccgtgt 4920tgacgccggg
caagagcaac tcggtcgccg catacactat tctcagaatg acttggttga
4980gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag
aattatgcag 5040tgctgccata accatgagtg ataacactgc ggccaactta
cttctgacaa cgatcggagg 5100accgaaggag ctaaccgctt ttttgcacaa
catgggggat catgtaactc gccttgatcg 5160ttgggaaccg gagctgaatg
aagccatacc aaacgacgag cgtgacacca cgatgcctgt 5220agcaatggca
acaacgttgc gcaaactatt aactggcgaa ctacttactc tagcttcccg
5280gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc
tgcgctcggc 5340ccttccggct ggctggttta ttgctgataa atctggagcc
ggtgagcgtg ggtctcgcgg 5400tatcattgca gcactggggc cagatggtaa
gccctcccgt atcgtagtta tctacacgac 5460ggggagtcag gcaactatgg
atgaacgaaa tagacagatc gctgagatag gtgcctcact 5520gattaagcat
tggtaactgt cagaccaagt ttactcatat atactttaga ttgatttaaa
5580acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc
tcatgaccaa 5640aatcccttaa cgtgagtttt cgttccactg agcgtcagac
cccgtagaaa agatcaaagg 5700atcttcttga gatccttttt ttctgcgcgt
aatctgctgc ttgcaaacaa aaaaaccacc 5760gctaccagcg gtggtttgtt
tgccggatca agagctacca actctttttc cgaaggtaac 5820tggcttcagc
agagcgcaga taccaaatac tgtccttcta gtgtagccgt agttaggcca
5880ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc
tgttaccagt 5940ggctgctgcc agtggcgata agtcgtgtct taccgggttg
gactcaagac gatagttacc 6000ggataaggcg cagcggtcgg gctgaacggg
gggttcgtgc acacagccca gcttggagcg 6060aacgacctac accgaactga
gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc 6120cgaagggaga
aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac
6180gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt
ttcgccacct 6240ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg
cggagcctat ggaaaaacgc 6300cagcaacgcg gcctttttac ggttcctggc
cttttgctgg ccttttgctc acatgttctt 6360tcctgcgtta tcccctgatt
ctgtggataa ccgtattacc gcctttgagt gagctgatac 6420cgctcgccgc
agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg
6480cctgatgcgg tattttctcc ttacgcatct gtgcggtatt tcacaccgca
tatggtgcac 6540tctcagtaca atctgctctg atgccgcata gttaagccag
tatacactcc gctatcgcta 6600cgtgactggg tcatggctgc gccccgacac
ccgccaacac ccgctgacgc gccctgacgg 6660gcttgtctgc tcccggcatc
cgcttacaga caagctgtga ccgtctccgg gagctgcatg 6720tgtcagaggt
tttcaccgtc atcaccgaaa cgcgcgaggc agcagatcaa ttcgcgcgcg
6780aaggcgaagc ggcatgcata atgtgcctgt caaatggacg aagcagggat
tctgcaaacc 6840ctatgctact ccgtcaagcc gtcaattgtc tgattcgtta
ccaattatga caacttgacg 6900gctacatcat tcactttttc ttcacaaccg
gcacggaact cgctcgggct ggccccggtg 6960cattttttaa atacccgcga
gaaatagagt tgatcgtcaa aaccaacatt gcgaccgacg 7020gtggcgatag
gcatccgggt ggtgctcaaa agcagcttcg cctggctgat acgttggtcc
7080tcgcgccagc ttaagacgct aatccctaac tgctggcgga aaagatgtga
cagacgcgac 7140ggcgacaagc aaacatgctg tgcgacgctg gcgatatcaa
aattgctgtc tgccaggtga 7200tcgctgatgt actgacaagc ctcgcgtacc
cgattatcca tcggtggatg gagcgactcg 7260ttaatcgctt ccatgcgccg
cagtaacaat tgctcaagca gatttatcgc cagcagctcc 7320gaatagcgcc
cttccccttg cccggcgtta atgatttgcc caaacaggtc gctgaaatgc
7380ggctggtgcg cttcatccgg gcgaaagaac cccgtattgg caaatattga
cggccagtta 7440agccattcat gccagtaggc gcgcggacga aagtaaaccc
actggtgata ccattcgcga 7500gcctccggat gacgaccgta gtgatgaatc
tctcctggcg ggaacagcaa aatatcaccc 7560ggtcggcaaa caaattctcg
tccctgattt ttcaccaccc cctgaccgcg aatggtgaga 7620ttgagaatat
aacctttcat tcccagcggt cggtcgataa aaaaatcgag ataaccgttg
7680gcctcaatcg gcgttaaacc cgccaccaga tgggcattaa acgagtatcc
cggcagcagg 7740ggatcatttt gcgcttcagc catacttttc atactcccgc
cattcagaga agaaaccaat 7800tgtccatatt gcatcagaca ttgccgtcac
tgcgtctttt actggctctt ctcgctaacc 7860aaaccggtaa ccccgcttat
taaaagcatt ctgtaacaaa gcgggaccaa agccatgaca 7920aaaacgcgta
acaaaagtgt ctataatcac ggcagaaaag tccacattga ttatttgcac
7980ggcgtcacac tttgctatgc catagcattt ttatccataa gattagcgga
tcctacctga 8040cgctttttat cgcaactctc tactgtttct ccatacccgt
tttttgggct aacaggagga 8100attaaccatg gggggttctc atcatcatca
tcatcatggt atggctagca tgactggtgg 8160acagcaaatg ggtcgggatc
tgtacgacga tgacgataag gatccgagct cgagatctgc 8220agctggtacc atatggg
8237
118 8153 DNA Artificial sequence pRF243
118 catggggcat caccatcacc
accattgttt tttcaaagac gaactggaat tcgacaagaa 60atactccatc ggcctggaca
ttggaaccaa ctctgtcggc tgggctgtca tcaccgacga 120gtacaaggtg
ccctccaaga aattcaaggt cctcggaaac accgatcgac actccatcaa
180gaaaaacctc attggtgccc tgttgttcga ttctggcgag actgccgaag
ctaccagact 240caagcgaact gctcggcgac gttacacccg acggaagaac
cgaatctgct acctgcagga 300gatcttttcc aacgagatgg ccaaggtgga
cgattcgttc tttcatcgac tggaggaatc 360cttcctcgtc gaggaagaca
agaaacacga gcgtcatccc atctttggca acattgtgga 420cgaggttgct
taccacgaga agtatcctac catctaccac ctgcgaaaga aactcgtcga
480ttccaccgac aaggcggatc tcagacttat ctacctcgct ctggcacaca
tgatcaagtt 540tcgaggtcat ttcctcatcg agggcgatct caatcccgac
aacagcgatg tggacaagct 600gttcattcag ctcgttcaga cctacaacca
gctgttcgag gaaaacccca tcaatgcctc 660cggagtcgat gcaaaggcca
tcttgtctgc tcgactctcg aagagcagac gactggagaa 720cctcattgcc
caacttcctg gcgagaaaaa gaacggactg tttggcaacc tcattgccct
780ttctcttggt ctcacaccca acttcaagtc caacttcgat ctggcggagg
acgccaagct 840ccagctgtcc aaggacacct acgacgatga cctcgacaac
ctgcttgcac agattggcga 900tcagtacgcc gacctgtttc tcgctgccaa
gaacctttcg gatgctattc tcttgtctga 960cattctgcga gtcaacaccg
agatcacaaa ggctcccctt tctgcctcca tgatcaagcg 1020atacgacgag
caccatcagg atctcacact gctcaaggct cttgtccgac agcaactgcc
1080cgagaagtac aaggagatct ttttcgatca gtcgaagaac ggctacgctg
gatacatcga 1140cggcggagcc tctcaggaag agttctacaa gttcatcaag
ccaattctcg agaagatgga 1200cggaaccgag gaactgcttg tcaagctcaa
tcgagaggat ctgcttcgga agcaacgaac 1260cttcgacaac ggcagcattc
ctcatcagat ccacctcggt gagctgcacg ccattcttcg 1320acgtcaggaa
gacttctacc cctttctcaa ggacaaccga gagaagatcg agaagattct
1380tacctttcga atcccctact atgttggtcc tcttgccaga ggaaactctc
gatttgcttg 1440gatgactcga aagtccgagg aaaccatcac tccctggaac
ttcgaggaag tcgtggacaa 1500gggtgcctct gcacagtcct tcatcgagcg
aatgaccaac ttcgacaaga atctgcccaa 1560cgagaaggtt cttcccaagc
attcgctgct ctacgagtac tttacagtct acaacgaact 1620caccaaagtc
aagtacgtta ccgagggaat gcgaaagcct gccttcttgt ctggcgaaca
1680gaagaaagcc attgtcgatc tcctgttcaa gaccaaccga aaggtcactg
ttaagcagct 1740caaggaggac tacttcaaga aaatcgagtg tttcgacagc
gtcgagattt ccggagttga 1800ggaccgattc aacgcctctt tgggcaccta
tcacgatctg ctcaagatta tcaaggacaa 1860ggattttctc gacaacgagg
aaaacgagga cattctggag gacatcgtgc tcactcttac 1920cctgttcgaa
gatcgggaga tgatcgagga acgactcaag acatacgctc acctgttcga
1980cgacaaggtc atgaaacaac tcaagcgacg tagatacacc ggctggggaa
gactttcgcg 2040aaagctcatc aacggcatca gagacaagca gtccggaaag
accattctgg actttctcaa 2100gtccgatggc tttgccaacc gaaacttcat
gcagctcatt cacgacgatt ctcttacctt 2160caaggaggac atccagaagg
cacaagtgtc cggtcagggc gacagcttgc acgaacatat 2220tgccaacctg
gctggttcgc cagccatcaa gaaaggcatt ctccagactg tcaaggttgt
2280cgacgagctg gtgaaggtca tgggacgtca caagcccgag aacattgtga
tcgagatggc 2340cagagagaac cagacaactc aaaagggtca gaaaaactcg
cgagagcgga tgaagcgaat 2400cgaggaaggc atcaaggagc tgggatccca
gattctcaag gagcatcccg tcgagaacac 2460tcaactgcag aacgagaagc
tgtatctcta ctatctgcag aatggtcgag acatgtacgt 2520ggatcaggaa
ctggacatca atcgtctcag cgactacgat gtggaccaca ttgtccctca
2580atcctttctc aaggacgatt ctatcgacaa caaggtcctt acacgatccg
acaagaacag 2640aggcaagtcg gacaacgttc ccagcgaaga ggtggtcaaa
aagatgaaga actactggcg 2700acagctgctc aacgccaagc tcattaccca
gcgaaagttc gacaatctta ccaaggccga 2760gcgaggcggt ctgtccgagc
tcgacaaggc tggcttcatc aagcgtcaac tcgtcgagac 2820cagacagatc
acaaagcacg tcgcacagat tctcgattct cggatgaaca ccaagtacga
2880cgagaacgac aagctcatcc gagaggtcaa ggtgattact ctcaagtcca
aactggtctc 2940cgatttccga aaggactttc agttctacaa ggtgcgagag
atcaacaatt accaccatgc 3000ccacgatgct tacctcaacg ccgtcgttgg
cactgcgctc atcaagaaat accccaagct 3060cgaaagcgag ttcgtttacg
gcgattacaa ggtctacgac gttcgaaaga tgattgccaa 3120gtccgaacag
gagattggca aggctactgc caagtacttc ttttactcca acatcatgaa
3180ctttttcaag accgagatca ccttggccaa cggagagatt cgaaagagac
cacttatcga 3240gaccaacggc gaaactggag agatcgtgtg ggacaagggt
cgagactttg caaccgtgcg 3300aaaggttctg tcgatgcctc aggtcaacat
cgtcaagaaa accgaggttc agactggcgg 3360attctccaag gagtcgattc
tgcccaagcg aaactccgac aagctcatcg ctcgaaagaa 3420agactgggat
cccaagaaat acggtggctt cgattctcct accgtcgcct attccgtgct
3480tgtcgttgcg aaggtcgaga agggcaagtc caaaaagctc aagtccgtca
aggagctgct 3540cggaattacc atcatggagc gatcgagctt cgagaagaat
cccatcgact tcttggaagc 3600caagggttac aaggaggtca agaaagacct
cattatcaag ctgcccaagt actctctgtt 3660cgaactggag aacggtcgaa
agcgtatgct cgcctccgct ggcgagctgc agaagggaaa 3720cgagcttgcc
ttgccttcga agtacgtcaa ctttctctat ctggcttctc actacgagaa
3780gctcaagggt tctcccgagg acaacgaaca gaagcaactc ttcgttgagc
agcacaaaca 3840ttacctcgac gagattatcg agcagatttc cgagttttcg
aagcgagtca tcctggctga 3900tgccaacttg gacaaggtgc tctctgccta
caacaagcat cgggacaaac ccattcgaga 3960acaggcggag aacatcattc
acctgtttac tcttaccaac ctgggtgctc ctgcagcttt 4020caagtacttc
gataccacta tcgaccgaaa gcggtacaca tccaccaagg aggttctcga
4080tgccaccctg attcaccagt ccatcactgg cctgtacgag acccgaatcg
acctgtctca 4140gcttggtggc gactccagag ccgatcccaa gaaaaagcga
aaggtctaag cggccgctaa 4200gcttggctgt tttggcggat gagagaagat
tttcagcctg atacagatta aatcagaacg 4260cagaagcggt ctgataaaac
agaatttgcc tggcggcagt agcgcggtgg tcccacctga 4320ccccatgccg
aactcagaag tgaaacgccg tagcgccgat ggtagtgtgg ggtctcccca
4380tgcgagagta gggaactgcc aggcatcaaa taaaacgaaa ggctcagtcg
aaagactggg 4440cctttcgttt tatctgttgt ttgtcggtga acgctctcct
gagtaggaca aatccgccgg 4500gagcggattt gaacgttgcg aagcaacggc
ccggagggtg gcgggcagga cgcccgccat 4560aaactgccag gcatcaaatt
aagcagaagg ccatcctgac ggatggcctt tttgcgtttc 4620tacaaactct
tttgtttatt tttctaaata cattcaaata tgtatccgct catgagacaa
4680taaccctgat aaatgcttca ataatattga aaaaggaaga gtatgagtat
tcaacatttc 4740cgtgtcgccc ttattccctt ttttgcggca ttttgccttc
ctgtttttgc tcacccagaa 4800acgctggtga aagtaaaaga tgctgaagat
cagttgggtg cacgagtggg ttacatcgaa 4860ctggatctca acagcggtaa
gatccttgag agttttcgcc ccgaagaacg ttttccaatg 4920atgagcactt
ttaaagttct gctatgtggc gcggtattat cccgtgttga cgccgggcaa
4980gagcaactcg gtcgccgcat acactattct cagaatgact tggttgagta
ctcaccagtc 5040acagaaaagc atcttacgga tggcatgaca gtaagagaat
tatgcagtgc tgccataacc 5100atgagtgata acactgcggc caacttactt
ctgacaacga tcggaggacc gaaggagcta 5160accgcttttt tgcacaacat
gggggatcat gtaactcgcc ttgatcgttg ggaaccggag 5220ctgaatgaag
ccataccaaa cgacgagcgt gacaccacga tgcctgtagc aatggcaaca
5280acgttgcgca aactattaac tggcgaacta cttactctag cttcccggca
acaattaata 5340gactggatgg aggcggataa agttgcagga ccacttctgc
gctcggccct tccggctggc 5400tggtttattg ctgataaatc tggagccggt
gagcgtgggt ctcgcggtat cattgcagca 5460ctggggccag atggtaagcc
ctcccgtatc gtagttatct acacgacggg gagtcaggca 5520actatggatg
aacgaaatag acagatcgct gagataggtg cctcactgat taagcattgg
5580taactgtcag accaagttta ctcatatata ctttagattg atttaaaact
tcatttttaa 5640tttaaaagga tctaggtgaa gatccttttt gataatctca
tgaccaaaat cccttaacgt 5700gagttttcgt tccactgagc gtcagacccc
gtagaaaaga tcaaaggatc ttcttgagat 5760cctttttttc tgcgcgtaat
ctgctgcttg caaacaaaaa aaccaccgct accagcggtg 5820gtttgtttgc
cggatcaaga gctaccaact ctttttccga aggtaactgg cttcagcaga
5880gcgcagatac caaatactgt ccttctagtg tagccgtagt taggccacca
cttcaagaac 5940tctgtagcac cgcctacata cctcgctctg ctaatcctgt
taccagtggc tgctgccagt 6000ggcgataagt cgtgtcttac cgggttggac
tcaagacgat agttaccgga taaggcgcag 6060cggtcgggct gaacgggggg
ttcgtgcaca cagcccagct tggagcgaac gacctacacc 6120gaactgagat
acctacagcg tgagctatga gaaagcgcca cgcttcccga agggagaaag
6180gcggacaggt atccggtaag cggcagggtc ggaacaggag agcgcacgag
ggagcttcca 6240gggggaaacg cctggtatct ttatagtcct gtcgggtttc
gccacctctg acttgagcgt 6300cgatttttgt gatgctcgtc aggggggcgg
agcctatgga aaaacgccag caacgcggcc 6360tttttacggt tcctggcctt
ttgctggcct tttgctcaca tgttctttcc tgcgttatcc 6420cctgattctg
tggataaccg tattaccgcc tttgagtgag ctgataccgc tcgccgcagc
6480cgaacgaccg agcgcagcga gtcagtgagc gaggaagcgg aagagcgcct
gatgcggtat 6540tttctcctta cgcatctgtg cggtatttca caccgcatat
ggtgcactct cagtacaatc 6600tgctctgatg ccgcatagtt aagccagtat
acactccgct atcgctacgt gactgggtca 6660tggctgcgcc ccgacacccg
ccaacacccg ctgacgcgcc ctgacgggct tgtctgctcc 6720cggcatccgc
ttacagacaa gctgtgaccg tctccgggag ctgcatgtgt cagaggtttt
6780caccgtcatc accgaaacgc gcgaggcagc agatcaattc gcgcgcgaag
gcgaagcggc 6840atgcataatg tgcctgtcaa atggacgaag cagggattct
gcaaacccta tgctactccg 6900tcaagccgtc aattgtctga ttcgttacca
attatgacaa cttgacggct acatcattca 6960ctttttcttc acaaccggca
cggaactcgc tcgggctggc cccggtgcat tttttaaata 7020cccgcgagaa
atagagttga tcgtcaaaac caacattgcg accgacggtg gcgataggca
7080tccgggtggt gctcaaaagc agcttcgcct ggctgatacg ttggtcctcg
cgccagctta 7140agacgctaat ccctaactgc
tggcggaaaa gatgtgacag acgcgacggc gacaagcaaa 7200catgctgtgc
gacgctggcg atatcaaaat tgctgtctgc caggtgatcg ctgatgtact
7260gacaagcctc gcgtacccga ttatccatcg gtggatggag cgactcgtta
atcgcttcca 7320tgcgccgcag taacaattgc tcaagcagat ttatcgccag
cagctccgaa tagcgccctt 7380ccccttgccc ggcgttaatg atttgcccaa
acaggtcgct gaaatgcggc tggtgcgctt 7440catccgggcg aaagaacccc
gtattggcaa atattgacgg ccagttaagc cattcatgcc 7500agtaggcgcg
cggacgaaag taaacccact ggtgatacca ttcgcgagcc tccggatgac
7560gaccgtagtg atgaatctct cctggcggga acagcaaaat atcacccggt
cggcaaacaa 7620attctcgtcc ctgatttttc accaccccct gaccgcgaat
ggtgagattg agaatataac 7680ctttcattcc cagcggtcgg tcgataaaaa
aatcgagata accgttggcc tcaatcggcg 7740ttaaacccgc caccagatgg
gcattaaacg agtatcccgg cagcagggga tcattttgcg 7800cttcagccat
acttttcata ctcccgccat tcagagaaga aaccaattgt ccatattgca
7860tcagacattg ccgtcactgc gtcttttact ggctcttctc gctaaccaaa
ccggtaaccc 7920cgcttattaa aagcattctg taacaaagcg ggaccaaagc
catgacaaaa acgcgtaaca 7980aaagtgtcta taatcacggc agaaaagtcc
acattgatta tttgcacggc gtcacacttt 8040gctatgccat agcattttta
tccataagat tagcggatcc tacctgacgc tttttatcgc 8100aactctctac
tgtttctcca tacccgtttt ttgggctaac aggaggaatt aac
8153
119 8204 DNA Artificial sequence pRF238
119 catggggcat catcatcacc
atcacggcgc cctgttctta ggccagctgg gcgccgcggg 60atccacgatg ggtgcgccga
agaaaaagcg caaagttgaa ttcgacaaga aatactccat 120cggcctggac
attggaacca actctgtcgg ctgggctgtc atcaccgacg agtacaaggt
180gccctccaag aaattcaagg tcctcggaaa caccgatcga cactccatca
agaaaaacct 240cattggtgcc ctgttgttcg attctggcga gactgccgaa
gctaccagac tcaagcgaac 300tgctcggcga cgttacaccc gacggaagaa
ccgaatctgc tacctgcagg agatcttttc 360caacgagatg gccaaggtgg
acgattcgtt ctttcatcga ctggaggaat ccttcctcgt 420cgaggaagac
aagaaacacg agcgtcatcc catctttggc aacattgtgg acgaggttgc
480ttaccacgag aagtatccta ccatctacca cctgcgaaag aaactcgtcg
attccaccga 540caaggcggat ctcagactta tctacctcgc tctggcacac
atgatcaagt ttcgaggtca 600tttcctcatc gagggcgatc tcaatcccga
caacagcgat gtggacaagc tgttcattca 660gctcgttcag acctacaacc
agctgttcga ggaaaacccc atcaatgcct ccggagtcga 720tgcaaaggcc
atcttgtctg ctcgactctc gaagagcaga cgactggaga acctcattgc
780ccaacttcct ggcgagaaaa agaacggact gtttggcaac ctcattgccc
tttctcttgg 840tctcacaccc aacttcaagt ccaacttcga tctggcggag
gacgccaagc tccagctgtc 900caaggacacc tacgacgatg acctcgacaa
cctgcttgca cagattggcg atcagtacgc 960cgacctgttt ctcgctgcca
agaacctttc ggatgctatt ctcttgtctg acattctgcg 1020agtcaacacc
gagatcacaa aggctcccct ttctgcctcc atgatcaagc gatacgacga
1080gcaccatcag gatctcacac tgctcaaggc tcttgtccga cagcaactgc
ccgagaagta 1140caaggagatc tttttcgatc agtcgaagaa cggctacgct
ggatacatcg acggcggagc 1200ctctcaggaa gagttctaca agttcatcaa
gccaattctc gagaagatgg acggaaccga 1260ggaactgctt gtcaagctca
atcgagagga tctgcttcgg aagcaacgaa ccttcgacaa 1320cggcagcatt
cctcatcaga tccacctcgg tgagctgcac gccattcttc gacgtcagga
1380agacttctac ccctttctca aggacaaccg agagaagatc gagaagattc
ttacctttcg 1440aatcccctac tatgttggtc ctcttgccag aggaaactct
cgatttgctt ggatgactcg 1500aaagtccgag gaaaccatca ctccctggaa
cttcgaggaa gtcgtggaca agggtgcctc 1560tgcacagtcc ttcatcgagc
gaatgaccaa cttcgacaag aatctgccca acgagaaggt 1620tcttcccaag
cattcgctgc tctacgagta ctttacagtc tacaacgaac tcaccaaagt
1680caagtacgtt accgagggaa tgcgaaagcc tgccttcttg tctggcgaac
agaagaaagc 1740cattgtcgat ctcctgttca agaccaaccg aaaggtcact
gttaagcagc tcaaggagga 1800ctacttcaag aaaatcgagt gtttcgacag
cgtcgagatt tccggagttg aggaccgatt 1860caacgcctct ttgggcacct
atcacgatct gctcaagatt atcaaggaca aggattttct 1920cgacaacgag
gaaaacgagg acattctgga ggacatcgtg ctcactctta ccctgttcga
1980agatcgggag atgatcgagg aacgactcaa gacatacgct cacctgttcg
acgacaaggt 2040catgaaacaa ctcaagcgac gtagatacac cggctgggga
agactttcgc gaaagctcat 2100caacggcatc agagacaagc agtccggaaa
gaccattctg gactttctca agtccgatgg 2160ctttgccaac cgaaacttca
tgcagctcat tcacgacgat tctcttacct tcaaggagga 2220catccagaag
gcacaagtgt ccggtcaggg cgacagcttg cacgaacata ttgccaacct
2280ggctggttcg ccagccatca agaaaggcat tctccagact gtcaaggttg
tcgacgagct 2340ggtgaaggtc atgggacgtc acaagcccga gaacattgtg
atcgagatgg ccagagagaa 2400ccagacaact caaaagggtc agaaaaactc
gcgagagcgg atgaagcgaa tcgaggaagg 2460catcaaggag ctgggatccc
agattctcaa ggagcatccc gtcgagaaca ctcaactgca 2520gaacgagaag
ctgtatctct actatctgca gaatggtcga gacatgtacg tggatcagga
2580actggacatc aatcgtctca gcgactacga tgtggaccac attgtccctc
aatcctttct 2640caaggacgat tctatcgaca acaaggtcct tacacgatcc
gacaagaaca gaggcaagtc 2700ggacaacgtt cccagcgaag aggtggtcaa
aaagatgaag aactactggc gacagctgct 2760caacgccaag ctcattaccc
agcgaaagtt cgacaatctt accaaggccg agcgaggcgg 2820tctgtccgag
ctcgacaagg ctggcttcat caagcgtcaa ctcgtcgaga ccagacagat
2880cacaaagcac gtcgcacaga ttctcgattc tcggatgaac accaagtacg
acgagaacga 2940caagctcatc cgagaggtca aggtgattac tctcaagtcc
aaactggtct ccgatttccg 3000aaaggacttt cagttctaca aggtgcgaga
gatcaacaat taccaccatg cccacgatgc 3060ttacctcaac gccgtcgttg
gcactgcgct catcaagaaa taccccaagc tcgaaagcga 3120gttcgtttac
ggcgattaca aggtctacga cgttcgaaag atgattgcca agtccgaaca
3180ggagattggc aaggctactg ccaagtactt cttttactcc aacatcatga
actttttcaa 3240gaccgagatc accttggcca acggagagat tcgaaagaga
ccacttatcg agaccaacgg 3300cgaaactgga gagatcgtgt gggacaaggg
tcgagacttt gcaaccgtgc gaaaggttct 3360gtcgatgcct caggtcaaca
tcgtcaagaa aaccgaggtt cagactggcg gattctccaa 3420ggagtcgatt
ctgcccaagc gaaactccga caagctcatc gctcgaaaga aagactggga
3480tcccaagaaa tacggtggct tcgattctcc taccgtcgcc tattccgtgc
ttgtcgttgc 3540gaaggtcgag aagggcaagt ccaaaaagct caagtccgtc
aaggagctgc tcggaattac 3600catcatggag cgatcgagct tcgagaagaa
tcccatcgac ttcttggaag ccaagggtta 3660caaggaggtc aagaaagacc
tcattatcaa gctgcccaag tactctctgt tcgaactgga 3720gaacggtcga
aagcgtatgc tcgcctccgc tggcgagctg cagaagggaa acgagcttgc
3780cttgccttcg aagtacgtca actttctcta tctggcttct cactacgaga
agctcaaggg 3840ttctcccgag gacaacgaac agaagcaact cttcgttgag
cagcacaaac attacctcga 3900cgagattatc gagcagattt ccgagttttc
gaagcgagtc atcctggctg atgccaactt 3960ggacaaggtg ctctctgcct
acaacaagca tcgggacaaa cccattcgag aacaggcgga 4020gaacatcatt
cacctgttta ctcttaccaa cctgggtgct cctgcagctt tcaagtactt
4080cgataccact atcgaccgaa agcggtacac atccaccaag gaggttctcg
atgccaccct 4140gattcaccag tccatcactg gcctgtacga gacccgaatc
gacctgtctc agcttggtgg 4200cgactccaga gccgatccca agaaaaagcg
aaaggtctaa gcggccgcta agcttggctg 4260ttttggcgga tgagagaaga
ttttcagcct gatacagatt aaatcagaac gcagaagcgg 4320tctgataaaa
cagaatttgc ctggcggcag tagcgcggtg gtcccacctg accccatgcc
4380gaactcagaa gtgaaacgcc gtagcgccga tggtagtgtg gggtctcccc
atgcgagagt 4440agggaactgc caggcatcaa ataaaacgaa aggctcagtc
gaaagactgg gcctttcgtt 4500ttatctgttg tttgtcggtg aacgctctcc
tgagtaggac aaatccgccg ggagcggatt 4560tgaacgttgc gaagcaacgg
cccggagggt ggcgggcagg acgcccgcca taaactgcca 4620ggcatcaaat
taagcagaag gccatcctga cggatggcct ttttgcgttt ctacaaactc
4680ttttgtttat ttttctaaat acattcaaat atgtatccgc tcatgagaca
ataaccctga 4740taaatgcttc aataatattg aaaaaggaag agtatgagta
ttcaacattt ccgtgtcgcc 4800cttattccct tttttgcggc attttgcctt
cctgtttttg ctcacccaga aacgctggtg 4860aaagtaaaag atgctgaaga
tcagttgggt gcacgagtgg gttacatcga actggatctc 4920aacagcggta
agatccttga gagttttcgc cccgaagaac gttttccaat gatgagcact
4980tttaaagttc tgctatgtgg cgcggtatta tcccgtgttg acgccgggca
agagcaactc 5040ggtcgccgca tacactattc tcagaatgac ttggttgagt
actcaccagt cacagaaaag 5100catcttacgg atggcatgac agtaagagaa
ttatgcagtg ctgccataac catgagtgat 5160aacactgcgg ccaacttact
tctgacaacg atcggaggac cgaaggagct aaccgctttt 5220ttgcacaaca
tgggggatca tgtaactcgc cttgatcgtt gggaaccgga gctgaatgaa
5280gccataccaa acgacgagcg tgacaccacg atgcctgtag caatggcaac
aacgttgcgc 5340aaactattaa ctggcgaact acttactcta gcttcccggc
aacaattaat agactggatg 5400gaggcggata aagttgcagg accacttctg
cgctcggccc ttccggctgg ctggtttatt 5460gctgataaat ctggagccgg
tgagcgtggg tctcgcggta tcattgcagc actggggcca 5520gatggtaagc
cctcccgtat cgtagttatc tacacgacgg ggagtcaggc aactatggat
5580gaacgaaata gacagatcgc tgagataggt gcctcactga ttaagcattg
gtaactgtca 5640gaccaagttt actcatatat actttagatt gatttaaaac
ttcattttta atttaaaagg 5700atctaggtga agatcctttt tgataatctc
atgaccaaaa tcccttaacg tgagttttcg 5760ttccactgag cgtcagaccc
cgtagaaaag atcaaaggat cttcttgaga tccttttttt 5820ctgcgcgtaa
tctgctgctt gcaaacaaaa aaaccaccgc taccagcggt ggtttgtttg
5880ccggatcaag agctaccaac tctttttccg aaggtaactg gcttcagcag
agcgcagata 5940ccaaatactg tccttctagt gtagccgtag ttaggccacc
acttcaagaa ctctgtagca 6000ccgcctacat acctcgctct gctaatcctg
ttaccagtgg ctgctgccag tggcgataag 6060tcgtgtctta ccgggttgga
ctcaagacga tagttaccgg ataaggcgca gcggtcgggc 6120tgaacggggg
gttcgtgcac acagcccagc ttggagcgaa cgacctacac cgaactgaga
6180tacctacagc gtgagctatg agaaagcgcc acgcttcccg aagggagaaa
ggcggacagg 6240tatccggtaa gcggcagggt cggaacagga gagcgcacga
gggagcttcc agggggaaac 6300gcctggtatc tttatagtcc tgtcgggttt
cgccacctct gacttgagcg tcgatttttg 6360tgatgctcgt caggggggcg
gagcctatgg aaaaacgcca gcaacgcggc ctttttacgg 6420ttcctggcct
tttgctggcc ttttgctcac atgttctttc ctgcgttatc ccctgattct
6480gtggataacc gtattaccgc ctttgagtga gctgataccg ctcgccgcag
ccgaacgacc 6540gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc
tgatgcggta ttttctcctt 6600acgcatctgt gcggtatttc acaccgcata
tggtgcactc tcagtacaat ctgctctgat 6660gccgcatagt taagccagta
tacactccgc tatcgctacg tgactgggtc atggctgcgc 6720cccgacaccc
gccaacaccc gctgacgcgc cctgacgggc ttgtctgctc ccggcatccg
6780cttacagaca agctgtgacc gtctccggga gctgcatgtg tcagaggttt
tcaccgtcat 6840caccgaaacg cgcgaggcag cagatcaatt cgcgcgcgaa
ggcgaagcgg catgcataat 6900gtgcctgtca aatggacgaa gcagggattc
tgcaaaccct atgctactcc gtcaagccgt 6960caattgtctg attcgttacc
aattatgaca acttgacggc tacatcattc actttttctt 7020cacaaccggc
acggaactcg ctcgggctgg ccccggtgca ttttttaaat acccgcgaga
7080aatagagttg atcgtcaaaa ccaacattgc gaccgacggt ggcgataggc
atccgggtgg 7140tgctcaaaag cagcttcgcc tggctgatac gttggtcctc
gcgccagctt aagacgctaa 7200tccctaactg ctggcggaaa agatgtgaca
gacgcgacgg cgacaagcaa acatgctgtg 7260cgacgctggc gatatcaaaa
ttgctgtctg ccaggtgatc gctgatgtac tgacaagcct 7320cgcgtacccg
attatccatc ggtggatgga gcgactcgtt aatcgcttcc atgcgccgca
7380gtaacaattg ctcaagcaga tttatcgcca gcagctccga atagcgccct
tccccttgcc 7440cggcgttaat gatttgccca aacaggtcgc tgaaatgcgg
ctggtgcgct tcatccgggc 7500gaaagaaccc cgtattggca aatattgacg
gccagttaag ccattcatgc cagtaggcgc 7560gcggacgaaa gtaaacccac
tggtgatacc attcgcgagc ctccggatga cgaccgtagt 7620gatgaatctc
tcctggcggg aacagcaaaa tatcacccgg tcggcaaaca aattctcgtc
7680cctgattttt caccaccccc tgaccgcgaa tggtgagatt gagaatataa
cctttcattc 7740ccagcggtcg gtcgataaaa aaatcgagat aaccgttggc
ctcaatcggc gttaaacccg 7800ccaccagatg ggcattaaac gagtatcccg
gcagcagggg atcattttgc gcttcagcca 7860tacttttcat actcccgcca
ttcagagaag aaaccaattg tccatattgc atcagacatt 7920gccgtcactg
cgtcttttac tggctcttct cgctaaccaa accggtaacc ccgcttatta
7980aaagcattct gtaacaaagc gggaccaaag ccatgacaaa aacgcgtaac
aaaagtgtct 8040ataatcacgg cagaaaagtc cacattgatt atttgcacgg
cgtcacactt tgctatgcca 8100tagcattttt atccataaga ttagcggatc
ctacctgacg ctttttatcg caactctcta 8160ctgtttctcc atacccgttt
tttgggctaa caggaggaat taac 82041201149DNAEscherichia
colimisc_feature(1)..(1149)galK gene 120atgagtctga aagaaaaaac
acaatctctg tttgccaacg catttggcta ccctgccact 60cacaccattc aggcgcctgg
ccgcgtgaat ttgattggtg aacacaccga ctacaacgac 120ggtttcgttc
tgccctgcgc gattgattat caaaccgtga tcagttgtgc accacgcgat
180gaccgtaaag ttcgcgtgat ggcagccgat tatgaaaatc agctcgacga
gttttccctc 240gatgcgccca ttgtcgcaca tgaaaactat caatgggcta
actacgttcg tggcgtggtg 300aaacatctgc aactgcgtaa caacagcttc
ggcggcgtgg acatggtgat cagcggcaat 360gtgccgcagg gtgccgggtt
aagttcttcc gcttcactgg aagtcgcggt cggaaccgta 420ttgcagcagc
tttatcatct gccgctggac ggcgcacaaa tcgcgcttaa cggtcaggaa
480gcagaaaacc agtttgtagg ctgtaactgc gggatcatgg atcagctaat
ttccgcgctc 540ggcaagaaag atcatgcctt gctgatcgat tgccgctcac
tggggaccaa agcagtttcc 600atgcccaaag gtgtggctgt cgtcatcatc
aacagtaact tcaaacgtac cctggttggc 660agcgaataca acacccgtcg
tgaacagtgc gaaaccggtg cgcgtttctt ccagcagcca 720gccctgcgtg
atgtcaccat tgaagagttc aacgctgttg cgcatgaact ggacccgatc
780gtggcaaaac gcgtgcgtca tatactgact gaaaacgccc gcaccgttga
agctgccagc 840gcgctggagc aaggcgacct gaaacgtatg ggcgagttga
tggcggagtc tcatgcctct 900atgcgcgatg atttcgaaat caccgtgccg
caaattgaca ctctggtaga aatcgtcaaa 960gctgtgattg gcgacaaagg
tggcgtacgc atgaccggcg gcggatttgg cggctgtatc 1020gtcgcgctga
tcccggaaga gctggtgcct gccgtacagc aagctgtcgc tgaacaatat
1080gaagcaaaaa caggtattaa agagactttt tacgtttgta aaccatcaca
aggagcagga 1140cagtgctga 11491211017DNAEscherichia coli
121atgagagttc tggttaccgg tggtagcggt tacattggaa gtcatacctg
tgtgcaatta 60ctgcaaaacg gtcatgatgt catcattctt gataacctct gtaacagtaa
gcgcagcgta 120ctgcctgtta tcgagcgttt aggcggcaaa catccaacgt
ttgttgaagg cgatattcgt 180aacgaagcgt tgatgaccga gatcctgcac
gatcacgcta tcgacaccgt gatccacttc 240gccgggctga aagccgtggg
cgaatcggta caaaaaccgc tggaatatta cgacaacaat 300gtcaacggca
ctctgcgcct gattagcgcc atgcgcgccg ctaacgtcaa aaactttatt
360tttagctcct ccgccaccgt ttatggcgat cagcccaaaa ttccatacgt
tgaaagcttc 420ccgaccggca caccgcaaag cccttacggc aaaagcaagc
tgatggtgga acagatcctc 480accgatctgc aaaaagccca gccggactgg
agcattgccc tgctgcgcta cttcaacccg 540gttggcgcgc atccgtcggg
cgatatgggc gaagatccgc aaggcattcc gaataacctg 600atgccataca
tcgcccaggt tgctgtaggc cgtcgcgact cgctggcgat ttttggtaac
660gattatccga ccgaagatgg tactggcgta cgcgattaca tccacgtaat
ggatctggcg 720gacggtcacg tcgtggcgat ggaaaaactg gcgaacaagc
caggcgtaca catctacaac 780ctcggcgctg gcgtaggcaa cagcgtgctg
gacgtggtta atgccttcag caaagcctgc 840ggcaaaccgg ttaattatca
ttttgcaccg cgtcgcgagg gcgaccttcc ggcctactgg 900gcggacgcca
gcaaagccga ccgtgaactg aactggcgcg taacgcgcac actcgatgaa
960atggcgcagg acacctggca ctggcagtca cgccatccac agggatatcc cgattaa
10171221047DNAEscherichia coli 122atgacgcaat ttaatcccgt tgatcatcca
catcgccgct acaacccgct caccgggcaa 60tggattctgg tttcaccgca ccgcgctaag
cgcccctggc agggggcgca ggaaacgcca 120gccaaacagg tgttacctgc
gcacgatcca gattgcttcc tctgcgcagg taatgtgcgg 180gtgacaggcg
ataaaaaccc cgattacacc gggacttacg ttttcactaa tgactttgcg
240gctttgatgt ctgacacgcc agatgcgcca gaaagtcacg atccgctgat
gcgttgccag 300agcgcgcgcg gcaccagccg ggtgatctgc ttttcaccgg
atcacagtaa aacgctgcca 360gagctcagcg ttgcagcatt gacggaaatc
gtcaaaacct ggcaggagca aaccgcagaa 420ctggggaaaa cgtacccatg
ggtgcaggtt tttgaaaaca aaggcgcggc gatgggctgc 480tctaacccgc
atccgcacgg tcagatttgg gcaaatagct tcctgcctaa cgaagctgag
540cgcgaagacc gcctgcaaaa agaatatttt gccgaacaga aatcaccaat
gctggtggat 600tatgttcagc gcgagctggc agacggtagc cgtaccgttg
tcgaaaccga acactggtta 660gccgtcgtgc cttactgggc tgcctggccg
ttcgaaacgc tactgctgcc caaagcccac 720gttttacgga tcaccgattt
gaccgacgcc cagcgcagcg atctggcgct ggcgttgaaa 780aagctgacca
gtcgttatga caacctcttc cagtgctcct tcccctactc tatgggctgg
840cacggcgcgc catttaatgg cgaagagaat caacactggc agctgcacgc
gcacttttat 900ccgcctctgc tgcgctccgc caccgtacgt aaatttatgg
ttggttatga aatgctggca 960gagacccagc gagacctgac cgcagaacag
gcagcagagc gtttgcgcgc agtcagcgat 1020atccattttc gcgaatccgg agtgtaa
104712376RNAArtificial sequenceCER domain 123guuuuagagc uagaaauagc
aaguuaaaau aaggcuaguc cguuaucaac uugaaaaagu 60ggcaccgagu cggugc
7612476DNAArtificial sequenceCER encoding DNA PCR 124gttttagagc
tagaaatagc aagttaaaat aaggctagtc cgttatcaac ttgaaaaagt 60ggcaccgagt
cggtgc 7612511714DNAArtificial sequencepRF291 125cgataaaaaa
caaaaaaaaa agcaccgact cggtgccact ttttcaagtt gataacggac 60tagccttatt
ttaacttgct atttctagct ctaaaacgca ggtgtaaaaa taaaaaggcc
120tgcgattacc agcaggcctg ttattaacct aagccttagg acgcttcacg
ccatacttgg 180aacgagcctg cttacggtct ttaacgccgg agcagtcaag
cgcaccacgt acggtgtggt 240aacgaacacc cgggaggtct ttaacacgac
cgccacggat caggatcacg gagtgctcct 300gcaggttgtg accttcacca
ccgatgtagg aagtcacttc gaaaccgtta gtcagacgaa 360cacggcatac
tttacgcagc gcggagttcg gttttttagg agtggtagta tatacacgag
420tacatacgcc acgtttttgc gggcatgctt ccagcgcagg cacgttgctt
ttcgcaactt 480tgcgagcacg tggtttgcgt accagctggt taactgttgc
cattaaatag ctcctggttt 540tagcttttgc ttcgtaaaca cgtaataaaa
cgtcctcaca caatatgagg acgccgaatt 600tagggcgatg ccgaaaaggt
gtcaagaaat atacaacgat cccgccatca cctgcgtccc 660attcgccatg
ccgaagcatg ttgcccagcc ggcgccagcg aggaggctgg gaccatgccg
720gccattattt tgcgttaagt ttctaatcat cacgaaatta tctatcaaaa
ataactaggt 780cccaccgaga ttcgaactcg ggaccttaag atttgcaatc
tcacgcgcta ccgctgtgcc 840ataggaccga agttaaaatt tggccaaaga
aggacctggg caccctggac tgtgggttag 900ggtaatattc cttatggaga
caatgggcta gggtaaatta cctaaaatgg gtcgataaag 960aggggtgttc
ccagttggga agtgtaattg aagacggggt caaaaaagaa aatcaaaaaa
1020aatttaatta agtcatacac aagtcagctt tcttcgagcc tcatataagt
ataagtagtt 1080caacgtatta gcactgtacc cagcatctcc gtatcgagaa
acacaacaac atgccccatt 1140ggacagatca tgcggataca caggttgtgc
agtatcatac atactcgatc agacaggtcg 1200tctgaccatc atacaagctg
aacaagcgct ccatacttgc acgctctcta tatacacagt 1260taaattacat
atccatagtc taacctctaa cagttaatct tctggtaagc ctcccagcca
1320gccttctggt atcgcttggc ctcctcaata ggatctcggt tctggccgta
cagacctcgg 1380ccgacaatta tgatatccgt tccggtagac atgacatcct
caacagttcg gtactgctgt 1440ccgagagcgt ctcccttgtc gtcaagaccc
accccggggg tcagaataag ccagtcctca 1500gagtcgccct taggtcggtt
ctgggcaatg aagccaacca caaactcggg gtcggatcgg 1560gcaagctcaa
tggtctgctt ggagtactcg ccagtggcca gagagccctt gcaagacagc
1620tcggccagca tgagcagacc tctggccagc ttctcgttgg gagaggggac
taggaactcc 1680ttgtactggg agttctcgta gtcagagacg tcctccttct
tctgttcaga gacagtttcc 1740tcggcaccag ctcgcaggcc agcaatgatt
ccggttccgg gtacaccgtg ggcgttggtg 1800atatcggacc actcggcgat
tcggtgacac cggtactggt gcttgacagt gttgccaata 1860tctgcgaact
ttctgtcctc gaacaggaag aaaccgtgct taagagcaag ttccttgagg
1920gggagcacag tgccggcgta ggtgaagtcg tcaatgatgt cgatatgggt
tttgatcatg 1980cacacataag gtccgacctt atcggcaagc tcaatgagct
ccttggtggt ggtaacatcc
2040agagaagcac acaggttggt tttcttggct gccacgagct tgagcactcg
agcggcaaag 2100gcggacttgt ggacgttagc tcgagcttcg taggagggca
ttttggtggt gaagaggaga 2160ctgaaataaa tttagtctgc agaacttttt
atcggaacct tatctggggc agtgaagtat 2220atgttatggt aatagttacg
agttagttga acttatagat agactggact atacggctat 2280cggtccaaat
tagaaagaac gtcaatggct ctctgggcgt cgcctttgcc gacaaaaatg
2340tgatcatgat gaaagccagc aatgacgttg cagctgatat tgttgtcggc
caaccgcgcc 2400gaaaacgcag ctgtcagacc cacagcctcc aacgaagaat
gtatcgtcaa agtgatccaa 2460gcacactcat agttggagtc gtactccaaa
ggcggcaatg acgagtcaga cagatactcg 2520tcgacgttta aaccatcatc
taagggcctc aaaactacct cggaactgct gcgctgatct 2580ggacaccaca
gaggttccga gcactttagg ttgcaccaaa tgtcccacca ggtgcaggca
2640gaaaacgctg gaacagcgtg tacagtttgt cttaacaaaa agtgagggcg
ctgaggtcga 2700gcagggtggt gtgacttgtt atagccttta gagctgcgaa
agcgcgtatg gatttggctc 2760atcaggccag attgagggtc tgtggacaca
tgtcatgtta gtgtacttca atcgccccct 2820ggatatagcc ccgacaatag
gccgtggcct catttttttg ccttccgcac atttccattg 2880ctcggtaccc
acaccttgct tctcctgcac ttgccaacct taatactggt ttacattgac
2940caacatctta caagcggggg gcttgtctag ggtatatata aacagtggct
ctcccaatcg 3000gttgccagtc tcttttttcc tttctttccc cacagattcg
aaatctaaac tacacatcac 3060accatggaca agaaatactc catcggcctg
gacattggaa ccaactctgt cggctgggct 3120gtcatcaccg acgagtacaa
ggtgccctcc aagaaattca aggtcctcgg aaacaccgat 3180cgacactcca
tcaagaaaaa cctcattggt gccctgttgt tcgattctgg cgagactgcc
3240gaagctacca gactcaagcg aactgctcgg cgacgttaca cccgacggaa
gaaccgaatc 3300tgctacctgc aggagatctt ttccaacgag atggccaagg
tggacgattc gttctttcat 3360cgactggagg aatccttcct cgtcgaggaa
gacaagaaac acgagcgtca tcccatcttt 3420ggcaacattg tggacgaggt
tgcttaccac gagaagtatc ctaccatcta ccatctccga 3480aagaaactcg
tcgattccac cgacaaggcg gatctcagac ttatctacct cgctctggca
3540cacatgatca agtttcgagg tcatttcctc atcgagggcg atctcaatcc
cgacaacagc 3600gatgtggaca agctgttcat tcagctcgtt cagacctaca
accagctgtt cgaggaaaac 3660cccatcaatg cctccggagt cgatgcaaag
gccatcttgt ctgctcgact ctcgaagagc 3720agacgactgg agaacctcat
tgcccaactt cctggcgaga aaaagaacgg actgtttggc 3780aacctcattg
ccctttctct tggtctcaca cccaacttca agtccaactt cgatctggcg
3840gaggacgcca agctccagct gtccaaggac acctacgacg atgacctcga
caacctgctt 3900gcacagattg gcgatcagta cgccgacctg tttctcgctg
ccaagaacct ttcggatgct 3960attctcttgt ctgacattct gcgagtcaac
accgagatca caaaggctcc cctttctgcc 4020tccatgatca agcgatacga
cgagcaccat caggatctca cactgctcaa ggctcttgtc 4080cgacagcaac
tgcccgagaa gtacaaggag atctttttcg atcagtcgaa gaacggctac
4140gctggataca tcgacggcgg agcctctcag gaagagttct acaagttcat
caagccaatt 4200ctcgagaaga tggacggaac cgaggaactg cttgtcaagc
tcaatcgaga ggatctgctt 4260cggaagcaac gaaccttcga caacggcagc
attcctcatc agatccacct cggtgagctg 4320cacgccattc ttcgacgtca
ggaagacttc tacccctttc tcaaggacaa ccgagagaag 4380atcgagaaga
ttcttacctt tcgaatcccc tactatgttg gtcctcttgc cagaggaaac
4440tctcgatttg cttggatgac tcgaaagtcc gaggaaacca tcactccctg
gaacttcgag 4500gaagtcgtgg acaagggtgc ctctgcacag tccttcatcg
agcgaatgac caacttcgac 4560aagaatctgc ccaacgagaa ggttcttccc
aagcattcgc tgctctacga gtactttaca 4620gtctacaacg aactcaccaa
agtcaagtac gttaccgagg gaatgcgaaa gcctgccttc 4680ttgtctggcg
aacagaagaa agccattgtc gatctcctgt tcaagaccaa ccgaaaggtc
4740actgttaagc agctcaagga ggactacttc aagaaaatcg agtgtttcga
cagcgtcgag 4800atttccggag ttgaggaccg attcaacgcc tctttgggca
cctatcacga tctgctcaag 4860attatcaagg acaaggattt tctcgacaac
gaggaaaacg aggacattct ggaggacatc 4920gtgctcactc ttaccctgtt
cgaagatcgg gagatgatcg aggaacgact caagacatac 4980gctcacctgt
tcgacgacaa ggtcatgaaa caactcaagc gacgtagata caccggctgg
5040ggaagacttt cgcgaaagct catcaacggc atcagagaca agcagtccgg
aaagaccatt 5100ctggactttc tcaagtccga tggctttgcc aaccgaaact
tcatgcagct cattcacgac 5160gattctctta ccttcaagga ggacatccag
aaggcacaag tgtccggtca gggcgacagc 5220ttgcacgaac atattgccaa
cctggctggt tcgccagcca tcaagaaagg cattctccag 5280actgtcaagg
ttgtcgacga gctggtgaag gtcatgggac gtcacaagcc cgagaacatt
5340gtgatcgaga tggccagaga gaaccagaca actcaaaagg gtcagaaaaa
ctcgcgagag 5400cggatgaagc gaatcgagga aggcatcaag gagctgggat
cccagattct caaggagcat 5460cccgtcgaga acactcaact gcagaacgag
aagctgtatc tctactatct gcagaatggt 5520cgagacatgt acgtggatca
ggaactggac atcaatcgtc tcagcgacta cgatgtggac 5580cacattgtcc
ctcaatcctt tctcaaggac gattctatcg acaacaaggt ccttacacga
5640tccgacaaga acagaggcaa gtcggacaac gttcccagcg aagaggtggt
caaaaagatg 5700aagaactact ggcgacagct gctcaacgcc aagctcatta
cccagcgaaa gttcgacaat 5760cttaccaagg ccgagcgagg cggtctgtcc
gagctcgaca aggctggctt catcaagcgt 5820caactcgtcg agaccagaca
gatcacaaag cacgtcgcac agattctcga ttctcggatg 5880aacaccaagt
acgacgagaa cgacaagctc atccgagagg tcaaggtgat tactctcaag
5940tccaaactgg tctccgattt ccgaaaggac tttcagttct acaaggtgcg
agagatcaac 6000aattaccacc atgcccacga tgcttacctc aacgccgtcg
ttggcactgc gctcatcaag 6060aaatacccca agctcgaaag cgagttcgtt
tacggcgatt acaaggtcta cgacgttcga 6120aagatgattg ccaagtccga
acaggagatt ggcaaggcta ctgccaagta cttcttttac 6180tccaacatca
tgaacttttt caagaccgag atcaccttgg ccaacggaga gattcgaaag
6240agaccactta tcgagaccaa cggcgaaact ggagagatcg tgtgggacaa
gggtcgagac 6300tttgcaaccg tgcgaaaggt tctgtcgatg cctcaggtca
acatcgtcaa gaaaaccgag 6360gttcagactg gcggattctc caaggagtcg
attctgccca agcgaaactc cgacaagctc 6420atcgctcgaa agaaagactg
ggatcccaag aaatacggtg gcttcgattc tcctaccgtc 6480gcctattccg
tgcttgtcgt tgcgaaggtc gagaagggca agtccaaaaa gctcaagtcc
6540gtcaaggagc tgctcggaat taccatcatg gagcgatcga gcttcgagaa
gaatcccatc 6600gacttcttgg aagccaaggg ttacaaggag gtcaagaaag
acctcattat caagctgccc 6660aagtactctc tgttcgaact ggagaacggt
cgaaagcgta tgctcgcctc cgctggcgag 6720ctgcagaagg gaaacgagct
tgccttgcct tcgaagtacg tcaactttct ctatctggct 6780tctcactacg
agaagctcaa gggttctccc gaggacaacg aacagaagca actcttcgtt
6840gagcagcaca aacattacct cgacgagatt atcgagcaga tttccgagtt
ttcgaagcga 6900gtcatcctgg ctgatgccaa cttggacaag gtgctctctg
cctacaacaa gcatcgggac 6960aaacccattc gagaacaggc ggagaacatc
attcacctgt ttactcttac caacctgggt 7020gctcctgcag ctttcaagta
cttcgatacc actatcgacc gaaagcggta cacatccacc 7080aaggaggttc
tcgatgccac cctgattcac cagtccatca ctggcctgta cgagacccga
7140atcgacctgt ctcagcttgg tggcgactcc agagccgatc ccaagaaaaa
gcgaaaggtc 7200taagcggccg caagtgtgga tggggaagtg agtgcccggt
tctgtgtgca caattggcaa 7260tccaagatgg atggattcaa cacagggata
tagcgagcta cgtggtggtg cgaggatata 7320gcaacggata tttatgtttg
acacttgaga atgtacgata caagcactgt ccaagtacaa 7380tactaaacat
actgtacata ctcatactcg tacccgggca acggtttcac ttgagtgcag
7440tggctagtgc tcttactcgt acagtgtgca atactgcgta tcatagtctt
tgatgtatat 7500cgtattcatt catgttagtt gcgtacgagc cggaagcata
aagtgtaaag cctggggtgc 7560ctaatgagtg agctaactca cattaattgc
gttgcgctca ctgcccgctt tccagtcggg 7620aaacctgtcg tgccagctgc
attaatgaat cggccaacgc gcggggagag gcggtttgcg 7680tattgggcgc
tcttccgctt cctcgctcac tgactcgctg cgctcggtcg ttcggctgcg
7740gcgagcggta tcagctcact caaaggcggt aatacggtta tccacagaat
caggggataa 7800cgcaggaaag aacatgtgag caaaaggcca gcaaaaggcc
aggaaccgta aaaaggccgc 7860gttgctggcg tttttccata ggctccgccc
ccctgacgag catcacaaaa atcgacgctc 7920aagtcagagg tggcgaaacc
cgacaggact ataaagatac caggcgtttc cccctggaag 7980ctccctcgtg
cgctctcctg ttccgaccct gccgcttacc ggatacctgt ccgcctttct
8040cccttcggga agcgtggcgc tttctcatag ctcacgctgt aggtatctca
gttcggtgta 8100ggtcgttcgc tccaagctgg gctgtgtgca cgaacccccc
gttcagcccg accgctgcgc 8160cttatccggt aactatcgtc ttgagtccaa
cccggtaaga cacgacttat cgccactggc 8220agcagccact ggtaacagga
ttagcagagc gaggtatgta ggcggtgcta cagagttctt 8280gaagtggtgg
cctaactacg gctacactag aaggacagta tttggtatct gcgctctgct
8340gaagccagtt accttcggaa aaagagttgg tagctcttga tccggcaaac
aaaccaccgc 8400tggtagcggt ggtttttttg tttgcaagca gcagattacg
cgcagaaaaa aaggatctca 8460agaagatcct ttgatctttt ctacggggtc
tgacgctcag tggaacgaaa actcacgtta 8520agggattttg gtcatgagat
tatcaaaaag gatcttcacc tagatccttt taaattaaaa 8580atgaagtttt
aaatcaatct aaagtatata tgagtaaact tggtctgaca gttaccaatg
8640cttaatcagt gaggcaccta tctcagcgat ctgtctattt cgttcatcca
tagttgcctg 8700actccccgtc gtgtagataa ctacgatacg ggagggctta
ccatctggcc ccagtgctgc 8760aatgataccg cgagacccac gctcaccggc
tccagattta tcagcaataa accagccagc 8820cggaagggcc gagcgcagaa
gtggtcctgc aactttatcc gcctccatcc agtctattaa 8880ttgttgccgg
gaagctagag taagtagttc gccagttaat agtttgcgca acgttgttgc
8940cattgctaca ggcatcgtgg tgtcacgctc gtcgtttggt atggcttcat
tcagctccgg 9000ttcccaacga tcaaggcgag ttacatgatc ccccatgttg
tgcaaaaaag cggttagctc 9060cttcggtcct ccgatcgttg tcagaagtaa
gttggccgca gtgttatcac tcatggttat 9120ggcagcactg cataattctc
ttactgtcat gccatccgta agatgctttt ctgtgactgg 9180tgagtactca
accaagtcat tctgagaata gtgtatgcgg cgaccgagtt gctcttgccc
9240ggcgtcaata cgggataata ccgcgccaca tagcagaact ttaaaagtgc
tcatcattgg 9300aaaacgttct tcggggcgaa aactctcaag gatcttaccg
ctgttgagat ccagttcgat 9360gtaacccact cgtgcaccca actgatcttc
agcatctttt actttcacca gcgtttctgg 9420gtgagcaaaa acaggaaggc
aaaatgccgc aaaaaaggga ataagggcga cacggaaatg 9480ttgaatactc
atactcttcc tttttcaata ttattgaagc atttatcagg gttattgtct
9540catgagcgga tacatatttg aatgtattta gaaaaataaa caaatagggg
ttccgcgcac 9600atttccccga aaagtgccac ctgacgcgcc ctgtagcggc
gcattaagcg cggcgggtgt 9660ggtggttacg cgcagcgtga ccgctacact
tgccagcgcc ctagcgcccg ctcctttcgc 9720tttcttccct tcctttctcg
ccacgttcgc cggctttccc cgtcaagctc taaatcgggg 9780gctcccttta
gggttccgat ttagtgcttt acggcacctc gaccccaaaa aacttgatta
9840gggtgatggt tcacgtagtg ggccatcgcc ctgatagacg gtttttcgcc
ctttgacgtt 9900ggagtccacg ttctttaata gtggactctt gttccaaact
ggaacaacac tcaaccctat 9960ctcggtctat tcttttgatt tataagggat
tttgccgatt tcggcctatt ggttaaaaaa 10020tgagctgatt taacaaaaat
ttaacgcgaa ttttaacaaa atattaacgc ttacaatttc 10080cattcgccat
tcaggctgcg caactgttgg gaagggcgat cggtgcgggc ctcttcgcta
10140ttacgccagc tggcgaaagg gggatgtgct gcaaggcgat taagttgggt
aacgccaggg 10200ttttcccagt cacgacgttg taaaacgacg gccagtgaat
tgtaatacga ctcactatag 10260ggcgaattgg gtaccgggcc ccccctcgag
gtcgatggtg tcgataagct tgatatcgaa 10320ttcatgtcac acaaaccgat
cttcgcctca aggaaaccta attctacatc cgagagactg 10380ccgagatcca
gtctacactg attaattttc gggccaataa tttaaaaaaa tcgtgttata
10440taatattata tgtattatat atatacatca tgatgatact gacagtcatg
tcccattgct 10500aaatagacag actccatctg ccgcctccaa ctgatgttct
caatatttaa ggggtcatct 10560cgcattgttt aataataaac agactccatc
taccgcctcc aaatgatgtt ctcaaaatat 10620attgtatgaa cttattttta
ttacttagta ttattagaca acttacttgc tttatgaaaa 10680acacttccta
tttaggaaac aatttataat ggcagttcgt tcatttaaca atttatgtag
10740aataaatgtt ataaatgcgt atgggaaatc ttaaatatgg atagcataaa
tgatatctgc 10800attgcctaat tcgaaatcaa cagcaacgaa aaaaatccct
tgtacaacat aaatagtcat 10860cgagaaatat caactatcaa agaacagcta
ttcacacgtt actattgaga ttattattgg 10920acgagaatca cacactcaac
tgtctttctc tcttctagaa atacaggtac aagtatgtac 10980tattctcatt
gttcatactt ctagtcattt catcccacat attccttgga tttctctcca
11040atgaatgaca ttctatcttg caaattcaac aattataata agatatacca
aagtagcggt 11100atagtggcaa tcaaaaagct tctctggtgt gcttctcgta
tttattttta ttctaatgat 11160ccattaaagg tatatattta tttcttgtta
tataatcctt ttgtttatta catgggctgg 11220atacataaag gtattttgat
ttaatttttt gcttaaattc aatcccccct cgttcagtgt 11280caactgtaat
ggtaggaaat taccatactt ttgaagaagc aaaaaaaatg aaagaaaaaa
11340aaaatcgtat ttccaggtta gacgttccgc agaatctaga atgcggtatg
cggtacattg 11400ttcttcgaac gtaaaagttg cgctccctga gatattgtac
atttttgctt ttacaagtac 11460aagtacatcg tacaactatg tactactgtt
gatgcatcca caacagtttg ttttgttttt 11520ttttgttttt tttttttcta
atgattcatt accgctatgt atacctactt gtacttgtag 11580taagccgggt
tattggcgtt caattaatca tagacttatg aatctgcacg gtgtgcgctg
11640cgagttactt ttagcttatg catgctactt gggtgtaata ttgggatctg
ttcggaaatc 11700aacggatgct caat 1171412620DNAArtificial sequenceCER
forward 126gttttagagc tagaaatagc 2012720DNAArtificial
sequenceuniversal reverse 127gcaccgactc ggtgccactt
2012818DNAArtificial sequenceuniversal forward T7 primer
128taatacgact cactatag 1812934DNAArtificial sequencegalK2-1 forward
primer 129taatacgact cactatagat cagcggcaat gtgc
3413034DNAArtificial sequencegalK2-1 reverse primer 130ttctagctct
aaaactgcgg cacattgccg ctga 34131114DNAArtificial SequencegalK2-1
sgRNA in vitro transcription template 131taatacgact cactatagat
cagcggcaat gtgccgcagt tttagagcta gaaatagcaa 60gttaaaataa ggctagtccg
ttatcaactt gaaaaagtgg caccgagtcg gtgc 11413218DNAT7 phage
132taatacgact cactatag 1813320DNAEscherichia coli 133atcagcggca
atgtgccgca 2013423DNAEscherichia coli 134atcagcggca atgtgccgca ggg
2313596RNAArtificial sequencegalK2-1 sgRNA 135aucagcggca augugccgca
guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc 60cguuaucaac uugaaaaagu
ggcaccgagu cggugc 96136262PRTArtificial
sequencehis-MPG1-dsREDexpress 136Met Gly His His His His His His
Gly Ala Leu Phe Leu Gly Gln Leu1 5 10 15Gly Ala Ala Gly Ser Thr Met
Gly Ala Pro Lys Lys Lys Arg Lys Val 20 25 30Glu Phe Gly Gly Gly Gly
Ala Ser Ser Glu Asp Val Ile Lys Glu Phe 35 40 45Met Arg Phe Lys Val
Arg Met Glu Gly Ser Val Asn Gly His Glu Phe 50 55 60Glu Ile Glu Gly
Glu Gly Glu Gly Arg Pro Tyr Glu Gly Thr Gln Thr65 70 75 80Ala Lys
Leu Lys Val Thr Lys Gly Gly Pro Leu Pro Phe Ala Trp Asp 85 90 95Ile
Leu Ser Pro Gln Phe Gln Tyr Gly Ser Lys Val Tyr Val Lys His 100 105
110Pro Ala Asp Ile Pro Asp Tyr Lys Lys Leu Ser Phe Pro Glu Gly Phe
115 120 125Lys Trp Glu Arg Val Met Asn Phe Glu Asp Gly Gly Val Val
Thr Val 130 135 140Thr Gln Asp Ser Ser Leu Gln Asp Gly Ser Phe Ile
Tyr Lys Val Lys145 150 155 160Phe Ile Gly Val Asn Phe Pro Ser Asp
Gly Pro Val Met Gln Lys Lys 165 170 175Thr Met Gly Trp Glu Ala Ser
Thr Glu Arg Leu Tyr Pro Arg Asp Gly 180 185 190Val Leu Lys Gly Glu
Ile His Lys Ala Leu Lys Leu Lys Asp Gly Gly 195 200 205His Tyr Leu
Val Glu Phe Lys Ser Ile Tyr Met Ala Lys Lys Pro Val 210 215 220Gln
Leu Pro Gly Tyr Tyr Tyr Val Asp Ser Lys Leu Asp Ile Thr Ser225 230
235 240His Asn Glu Asp Tyr Thr Ile Val Glu Gln Tyr Glu Arg Ala Glu
Gly 245 250 255Arg His His Leu Phe Leu 260137256PRTArtificial
sequencepVEC-dsREDexpress 137Met Gly His His His His His His Leu
Leu Ile Ile Leu Arg Arg Arg1 5 10 15Ile Arg Lys Gln Ala His Ala His
Ser Lys Glu Phe Gly Gly Gly Gly 20 25 30Ala Ser Ser Glu Asp Val Ile
Lys Glu Phe Met Arg Phe Lys Val Arg 35 40 45Met Glu Gly Ser Val Asn
Gly His Glu Phe Glu Ile Glu Gly Glu Gly 50 55 60Glu Gly Arg Pro Tyr
Glu Gly Thr Gln Thr Ala Lys Leu Lys Val Thr65 70 75 80Lys Gly Gly
Pro Leu Pro Phe Ala Trp Asp Ile Leu Ser Pro Gln Phe 85 90 95Gln Tyr
Gly Ser Lys Val Tyr Val Lys His Pro Ala Asp Ile Pro Asp 100 105
110Tyr Lys Lys Leu Ser Phe Pro Glu Gly Phe Lys Trp Glu Arg Val Met
115 120 125Asn Phe Glu Asp Gly Gly Val Val Thr Val Thr Gln Asp Ser
Ser Leu 130 135 140Gln Asp Gly Ser Phe Ile Tyr Lys Val Lys Phe Ile
Gly Val Asn Phe145 150 155 160Pro Ser Asp Gly Pro Val Met Gln Lys
Lys Thr Met Gly Trp Glu Ala 165 170 175Ser Thr Glu Arg Leu Tyr Pro
Arg Asp Gly Val Leu Lys Gly Glu Ile 180 185 190His Lys Ala Leu Lys
Leu Lys Asp Gly Gly His Tyr Leu Val Glu Phe 195 200 205Lys Ser Ile
Tyr Met Ala Lys Lys Pro Val Gln Leu Pro Gly Tyr Tyr 210 215 220Tyr
Val Asp Ser Lys Leu Asp Ile Thr Ser His Asn Glu Asp Tyr Thr225 230
235 240Ile Val Glu Gln Tyr Glu Arg Ala Glu Gly Arg His His Leu Phe
Leu 245 250 255138245PRTArtificial sequenceCFFKDEL-dsREDexpress
138Met Gly His His His His His His Cys Phe Phe Lys Asp Glu Leu Glu1
5 10 15Phe Gly Gly Gly Gly Ala Ser Ser Glu Asp Val Ile Lys Glu Phe
Met 20 25 30Arg Phe Lys Val Arg Met Glu Gly Ser Val Asn Gly His Glu
Phe Glu 35 40 45Ile Glu Gly Glu Gly Glu Gly Arg Pro Tyr Glu Gly Thr
Gln Thr Ala 50 55 60Lys Leu Lys Val Thr Lys Gly Gly Pro Leu Pro Phe
Ala Trp Asp Ile65 70 75 80Leu Ser Pro Gln Phe Gln Tyr Gly Ser Lys
Val Tyr Val Lys His Pro 85 90 95Ala Asp Ile Pro Asp Tyr Lys Lys Leu
Ser Phe Pro Glu Gly Phe Lys 100 105 110Trp Glu Arg Val Met Asn Phe
Glu Asp Gly Gly Val Val Thr Val Thr 115 120 125Gln Asp Ser Ser Leu
Gln Asp Gly Ser Phe Ile Tyr Lys Val Lys Phe 130 135 140Ile Gly Val
Asn Phe Pro Ser Asp Gly Pro Val Met Gln Lys Lys
Thr145 150 155 160Met Gly Trp Glu Ala Ser Thr Glu Arg Leu Tyr Pro
Arg Asp Gly Val 165 170 175Leu Lys Gly Glu Ile His Lys Ala Leu Lys
Leu Lys Asp Gly Gly His 180 185 190Tyr Leu Val Glu Phe Lys Ser Ile
Tyr Met Ala Lys Lys Pro Val Gln 195 200 205Leu Pro Gly Tyr Tyr Tyr
Val Asp Ser Lys Leu Asp Ile Thr Ser His 210 215 220Asn Glu Asp Tyr
Thr Ile Val Glu Gln Tyr Glu Arg Ala Glu Gly Arg225 230 235 240His
His Leu Phe Leu 245139257PRTArtificial SequenceTLM-dsREDexpress
139Met Gly His His His His His His Pro Leu Ser Ser Ile Phe Ser Arg1
5 10 15Ile Gly Asp Pro Pro Lys Lys Lys Arg Lys Val Glu Phe Gly Gly
Gly 20 25 30Gly Ala Ser Ser Glu Asp Val Ile Lys Glu Phe Met Arg Phe
Lys Val 35 40 45Arg Met Glu Gly Ser Val Asn Gly His Glu Phe Glu Ile
Glu Gly Glu 50 55 60Gly Glu Gly Arg Pro Tyr Glu Gly Thr Gln Thr Ala
Lys Leu Lys Val65 70 75 80Thr Lys Gly Gly Pro Leu Pro Phe Ala Trp
Asp Ile Leu Ser Pro Gln 85 90 95Phe Gln Tyr Gly Ser Lys Val Tyr Val
Lys His Pro Ala Asp Ile Pro 100 105 110Asp Tyr Lys Lys Leu Ser Phe
Pro Glu Gly Phe Lys Trp Glu Arg Val 115 120 125Met Asn Phe Glu Asp
Gly Gly Val Val Thr Val Thr Gln Asp Ser Ser 130 135 140Leu Gln Asp
Gly Ser Phe Ile Tyr Lys Val Lys Phe Ile Gly Val Asn145 150 155
160Phe Pro Ser Asp Gly Pro Val Met Gln Lys Lys Thr Met Gly Trp Glu
165 170 175Ala Ser Thr Glu Arg Leu Tyr Pro Arg Asp Gly Val Leu Lys
Gly Glu 180 185 190Ile His Lys Ala Leu Lys Leu Lys Asp Gly Gly His
Tyr Leu Val Glu 195 200 205Phe Lys Ser Ile Tyr Met Ala Lys Lys Pro
Val Gln Leu Pro Gly Tyr 210 215 220Tyr Tyr Val Asp Ser Lys Leu Asp
Ile Thr Ser His Asn Glu Asp Tyr225 230 235 240Thr Ile Val Glu Gln
Tyr Glu Arg Ala Glu Gly Arg His His Leu Phe 245 250
255Leu140292PRTArtificial sequenceZebra-dsREDexpress 140Met Gly His
His His His His His Glu Cys Asp Ser Glu Leu Glu Ile1 5 10 15Lys Arg
Tyr Lys Arg Val Arg Val Ala Ser Arg Lys Cys Arg Ala Lys 20 25 30Phe
Lys Gln Leu Leu Gln His Tyr Arg Glu Val Ala Ala Ala Lys Ser 35 40
45Ser Glu Asn Asp Arg Leu Arg Leu Leu Leu Lys Gln Met Cys Glu Phe
50 55 60Gly Gly Gly Gly Ala Ser Ser Glu Asp Val Ile Lys Glu Phe Met
Arg65 70 75 80Phe Lys Val Arg Met Glu Gly Ser Val Asn Gly His Glu
Phe Glu Ile 85 90 95Glu Gly Glu Gly Glu Gly Arg Pro Tyr Glu Gly Thr
Gln Thr Ala Lys 100 105 110Leu Lys Val Thr Lys Gly Gly Pro Leu Pro
Phe Ala Trp Asp Ile Leu 115 120 125Ser Pro Gln Phe Gln Tyr Gly Ser
Lys Val Tyr Val Lys His Pro Ala 130 135 140Asp Ile Pro Asp Tyr Lys
Lys Leu Ser Phe Pro Glu Gly Phe Lys Trp145 150 155 160Glu Arg Val
Met Asn Phe Glu Asp Gly Gly Val Val Thr Val Thr Gln 165 170 175Asp
Ser Ser Leu Gln Asp Gly Ser Phe Ile Tyr Lys Val Lys Phe Ile 180 185
190Gly Val Asn Phe Pro Ser Asp Gly Pro Val Met Gln Lys Lys Thr Met
195 200 205Gly Trp Glu Ala Ser Thr Glu Arg Leu Tyr Pro Arg Asp Gly
Val Leu 210 215 220Lys Gly Glu Ile His Lys Ala Leu Lys Leu Lys Asp
Gly Gly His Tyr225 230 235 240Leu Val Glu Phe Lys Ser Ile Tyr Met
Ala Lys Lys Pro Val Gln Leu 245 250 255Pro Gly Tyr Tyr Tyr Val Asp
Ser Lys Leu Asp Ile Thr Ser His Asn 260 265 270Glu Asp Tyr Thr Ile
Val Glu Gln Tyr Glu Arg Ala Glu Gly Arg His 275 280 285His Leu Phe
Leu 290141259PRTArtificial sequencepep1-dsREDexpress 141Met Gly His
His His His His His Lys Glu Thr Trp Trp Glu Thr Trp1 5 10 15Trp Thr
Glu Trp Ser Gln Pro Lys Lys Lys Arg Lys Val Glu Phe Gly 20 25 30Gly
Gly Gly Ala Ser Ser Glu Asp Val Ile Lys Glu Phe Met Arg Phe 35 40
45Lys Val Arg Met Glu Gly Ser Val Asn Gly His Glu Phe Glu Ile Glu
50 55 60Gly Glu Gly Glu Gly Arg Pro Tyr Glu Gly Thr Gln Thr Ala Lys
Leu65 70 75 80Lys Val Thr Lys Gly Gly Pro Leu Pro Phe Ala Trp Asp
Ile Leu Ser 85 90 95Pro Gln Phe Gln Tyr Gly Ser Lys Val Tyr Val Lys
His Pro Ala Asp 100 105 110Ile Pro Asp Tyr Lys Lys Leu Ser Phe Pro
Glu Gly Phe Lys Trp Glu 115 120 125Arg Val Met Asn Phe Glu Asp Gly
Gly Val Val Thr Val Thr Gln Asp 130 135 140Ser Ser Leu Gln Asp Gly
Ser Phe Ile Tyr Lys Val Lys Phe Ile Gly145 150 155 160Val Asn Phe
Pro Ser Asp Gly Pro Val Met Gln Lys Lys Thr Met Gly 165 170 175Trp
Glu Ala Ser Thr Glu Arg Leu Tyr Pro Arg Asp Gly Val Leu Lys 180 185
190Gly Glu Ile His Lys Ala Leu Lys Leu Lys Asp Gly Gly His Tyr Leu
195 200 205Val Glu Phe Lys Ser Ile Tyr Met Ala Lys Lys Pro Val Gln
Leu Pro 210 215 220Gly Tyr Tyr Tyr Val Asp Ser Lys Leu Asp Ile Thr
Ser His Asn Glu225 230 235 240Asp Tyr Thr Ile Val Glu Gln Tyr Glu
Arg Ala Glu Gly Arg His His 245 250 255Leu Phe
Leu142259PRTArtificial sequencetp10-dsREDexpress 142Met Gly His His
His His His His Ala Gly Tyr Leu Leu Gly Lys Ile1 5 10 15Asn Leu Lys
Ala Cys Ala Ala Cys Ala Lys Lys Ile Leu Glu Phe Gly 20 25 30Gly Gly
Gly Ala Ser Ser Glu Asp Val Ile Lys Glu Phe Met Arg Phe 35 40 45Lys
Val Arg Met Glu Gly Ser Val Asn Gly His Glu Phe Glu Ile Glu 50 55
60Gly Glu Gly Glu Gly Arg Pro Tyr Glu Gly Thr Gln Thr Ala Lys Leu65
70 75 80Lys Val Thr Lys Gly Gly Pro Leu Pro Phe Ala Trp Asp Ile Leu
Ser 85 90 95Pro Gln Phe Gln Tyr Gly Ser Lys Val Tyr Val Lys His Pro
Ala Asp 100 105 110Ile Pro Asp Tyr Lys Lys Leu Ser Phe Pro Glu Gly
Phe Lys Trp Glu 115 120 125Arg Val Met Asn Phe Glu Asp Gly Gly Val
Val Thr Val Thr Gln Asp 130 135 140Ser Ser Leu Gln Asp Gly Ser Phe
Ile Tyr Lys Val Lys Phe Ile Gly145 150 155 160Val Asn Phe Pro Ser
Asp Gly Pro Val Met Gln Lys Lys Thr Met Gly 165 170 175Trp Glu Ala
Ser Thr Glu Arg Leu Tyr Pro Arg Asp Gly Val Leu Lys 180 185 190Gly
Glu Ile His Lys Ala Leu Lys Leu Lys Asp Gly Gly His Tyr Leu 195 200
205Val Glu Phe Lys Ser Ile Tyr Met Ala Lys Lys Pro Val Gln Leu Pro
210 215 220Gly Tyr Tyr Tyr Val Asp Ser Lys Leu Asp Ile Thr Ser His
Asn Glu225 230 235 240Asp Tyr Thr Ile Val Glu Gln Tyr Glu Arg Ala
Glu Gly Arg His His 245 250 255Leu Phe Leu1431442PRTArtificial
sequenceZebra-Cas9 143Met Gly His His His His His His Glu Cys Asp
Ser Glu Leu Glu Ile1 5 10 15Lys Arg Tyr Lys Arg Val Arg Val Ala Ser
Arg Lys Cys Arg Ala Lys 20 25 30Phe Lys Gln Leu Leu Gln His Tyr Arg
Glu Val Ala Ala Ala Lys Ser 35 40 45Ser Glu Asn Asp Arg Leu Arg Leu
Leu Leu Lys Gln Met Cys Glu Phe 50 55 60Asp Lys Lys Tyr Ser Ile Gly
Leu Asp Ile Gly Thr Asn Ser Val Gly65 70 75 80Trp Ala Val Ile Thr
Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe Lys 85 90 95Val Leu Gly Asn
Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile Gly 100 105 110Ala Leu
Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu Lys 115 120
125Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys Tyr
130 135 140Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp
Ser Phe145 150 155 160Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu
Glu Asp Lys Lys His 165 170 175Glu Arg His Pro Ile Phe Gly Asn Ile
Val Asp Glu Val Ala Tyr His 180 185 190Glu Lys Tyr Pro Thr Ile Tyr
His Leu Arg Lys Lys Leu Val Asp Ser 195 200 205Thr Asp Lys Ala Asp
Leu Arg Leu Ile Tyr Leu Ala Leu Ala His Met 210 215 220Ile Lys Phe
Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro Asp225 230 235
240Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr Asn
245 250 255Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp
Ala Lys 260 265 270Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg
Leu Glu Asn Leu 275 280 285Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn
Gly Leu Phe Gly Asn Leu 290 295 300Ile Ala Leu Ser Leu Gly Leu Thr
Pro Asn Phe Lys Ser Asn Phe Asp305 310 315 320Leu Ala Glu Asp Ala
Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp Asp 325 330 335Asp Leu Asp
Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp Leu 340 345 350Phe
Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp Ile 355 360
365Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser Met
370 375 380Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu
Lys Ala385 390 395 400Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys
Glu Ile Phe Phe Asp 405 410 415Gln Ser Lys Asn Gly Tyr Ala Gly Tyr
Ile Asp Gly Gly Ala Ser Gln 420 425 430Glu Glu Phe Tyr Lys Phe Ile
Lys Pro Ile Leu Glu Lys Met Asp Gly 435 440 445Thr Glu Glu Leu Leu
Val Lys Leu Asn Arg Glu Asp Leu Leu Arg Lys 450 455 460Gln Arg Thr
Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu Gly465 470 475
480Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe Leu
485 490 495Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg
Ile Pro 500 505 510Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg
Phe Ala Trp Met 515 520 525Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro
Trp Asn Phe Glu Glu Val 530 535 540Val Asp Lys Gly Ala Ser Ala Gln
Ser Phe Ile Glu Arg Met Thr Asn545 550 555 560Phe Asp Lys Asn Leu
Pro Asn Glu Lys Val Leu Pro Lys His Ser Leu 565 570 575Leu Tyr Glu
Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys Tyr 580 585 590Val
Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln Lys 595 600
605Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr Val
610 615 620Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe
Asp Ser625 630 635 640Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn
Ala Ser Leu Gly Thr 645 650 655Tyr His Asp Leu Leu Lys Ile Ile Lys
Asp Lys Asp Phe Leu Asp Asn 660 665 670Glu Glu Asn Glu Asp Ile Leu
Glu Asp Ile Val Leu Thr Leu Thr Leu 675 680 685Phe Glu Asp Arg Glu
Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala His 690 695 700Leu Phe Asp
Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr Thr705 710 715
720Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp Lys
725 730 735Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly
Phe Ala 740 745 750Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser
Leu Thr Phe Lys 755 760 765Glu Asp Ile Gln Lys Ala Gln Val Ser Gly
Gln Gly Asp Ser Leu His 770 775 780Glu His Ile Ala Asn Leu Ala Gly
Ser Pro Ala Ile Lys Lys Gly Ile785 790 795 800Leu Gln Thr Val Lys
Val Val Asp Glu Leu Val Lys Val Met Gly Arg 805 810 815His Lys Pro
Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln Thr 820 825 830Thr
Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile Glu 835 840
845Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro Val
850 855 860Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr
Leu Gln865 870 875 880Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu
Asp Ile Asn Arg Leu 885 890 895Ser Asp Tyr Asp Val Asp His Ile Val
Pro Gln Ser Phe Leu Lys Asp 900 905 910Asp Ser Ile Asp Asn Lys Val
Leu Thr Arg Ser Asp Lys Asn Arg Gly 915 920 925Lys Ser Asp Asn Val
Pro Ser Glu Glu Val Val Lys Lys Met Lys Asn 930 935 940Tyr Trp Arg
Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys Phe945 950 955
960Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp Lys
965 970 975Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile
Thr Lys 980 985 990His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr
Lys Tyr Asp Glu 995 1000 1005Asn Asp Lys Leu Ile Arg Glu Val Lys
Val Ile Thr Leu Lys Ser 1010 1015 1020Lys Leu Val Ser Asp Phe Arg
Lys Asp Phe Gln Phe Tyr Lys Val 1025 1030 1035Arg Glu Ile Asn Asn
Tyr His His Ala His Asp Ala Tyr Leu Asn 1040 1045 1050Ala Val Val
Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu 1055 1060 1065Ser
Glu Phe Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys 1070 1075
1080Met Ile Ala Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys
1085 1090 1095Tyr Phe Phe Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr
Glu Ile 1100 1105 1110Thr Leu Ala Asn Gly Glu Ile Arg Lys Arg Pro
Leu Ile Glu Thr 1115 1120 1125Asn Gly Glu Thr Gly Glu Ile Val Trp
Asp Lys Gly Arg Asp Phe 1130 1135 1140Ala Thr Val Arg Lys Val Leu
Ser Met Pro Gln Val Asn Ile Val 1145 1150 1155Lys Lys Thr Glu Val
Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile 1160 1165 1170Leu Pro Lys
Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp 1175 1180 1185Trp
Asp Pro Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala 1190 1195
1200Tyr Ser Val Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys
1205 1210 1215Lys Leu Lys Ser Val Lys Glu Leu Leu Gly Ile Thr Ile
Met Glu 1220 1225 1230Arg Ser Ser Phe Glu Lys Asn Pro Ile Asp Phe
Leu Glu Ala Lys 1235 1240 1245Gly Tyr Lys Glu Val Lys Lys Asp Leu
Ile Ile Lys Leu Pro Lys 1250 1255 1260Tyr Ser Leu Phe Glu Leu Glu
Asn Gly Arg Lys Arg Met Leu Ala 1265 1270 1275Ser Ala Gly Glu
Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser 1280 1285 1290Lys Tyr
Val Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu 1295 1300
1305Lys Gly Ser Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu
1310 1315 1320Gln His Lys His Tyr Leu Asp Glu Ile Ile Glu Gln Ile
Ser Glu 1325 1330 1335Phe Ser Lys Arg Val Ile Leu Ala Asp Ala Asn
Leu Asp Lys Val 1340 1345 1350Leu Ser Ala Tyr Asn Lys His Arg Asp
Lys Pro Ile Arg Glu Gln 1355 1360 1365Ala Glu Asn Ile Ile His Leu
Phe Thr Leu Thr Asn Leu Gly Ala 1370 1375 1380Pro Ala Ala Phe Lys
Tyr Phe Asp Thr Thr Ile Asp Arg Lys Arg 1385 1390 1395Tyr Thr Ser
Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln 1400 1405 1410Ser
Ile Thr Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu 1415 1420
1425Gly Gly Asp Ser Arg Ala Asp Pro Lys Lys Lys Arg Lys Val 1430
1435 14401441406PRTArtificial sequencepVEC-Cas9 144Met Gly His His
His His His His Leu Leu Ile Ile Leu Arg Arg Arg1 5 10 15Ile Arg Lys
Gln Ala His Ala His Ser Lys Glu Phe Asp Lys Lys Tyr 20 25 30Ser Ile
Gly Leu Asp Ile Gly Thr Asn Ser Val Gly Trp Ala Val Ile 35 40 45Thr
Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe Lys Val Leu Gly Asn 50 55
60Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile Gly Ala Leu Leu Phe65
70 75 80Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu Lys Arg Thr Ala
Arg 85 90 95Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys Tyr Leu Gln
Glu Ile 100 105 110Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser Phe
Phe His Arg Leu 115 120 125Glu Glu Ser Phe Leu Val Glu Glu Asp Lys
Lys His Glu Arg His Pro 130 135 140Ile Phe Gly Asn Ile Val Asp Glu
Val Ala Tyr His Glu Lys Tyr Pro145 150 155 160Thr Ile Tyr His Leu
Arg Lys Lys Leu Val Asp Ser Thr Asp Lys Ala 165 170 175Asp Leu Arg
Leu Ile Tyr Leu Ala Leu Ala His Met Ile Lys Phe Arg 180 185 190Gly
His Phe Leu Ile Glu Gly Asp Leu Asn Pro Asp Asn Ser Asp Val 195 200
205Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr Asn Gln Leu Phe Glu
210 215 220Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala Lys Ala Ile
Leu Ser225 230 235 240Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn
Leu Ile Ala Gln Leu 245 250 255Pro Gly Glu Lys Lys Asn Gly Leu Phe
Gly Asn Leu Ile Ala Leu Ser 260 265 270Leu Gly Leu Thr Pro Asn Phe
Lys Ser Asn Phe Asp Leu Ala Glu Asp 275 280 285Ala Lys Leu Gln Leu
Ser Lys Asp Thr Tyr Asp Asp Asp Leu Asp Asn 290 295 300Leu Leu Ala
Gln Ile Gly Asp Gln Tyr Ala Asp Leu Phe Leu Ala Ala305 310 315
320Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp Ile Leu Arg Val Asn
325 330 335Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser Met Ile Lys
Arg Tyr 340 345 350Asp Glu His His Gln Asp Leu Thr Leu Leu Lys Ala
Leu Val Arg Gln 355 360 365Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe
Phe Asp Gln Ser Lys Asn 370 375 380Gly Tyr Ala Gly Tyr Ile Asp Gly
Gly Ala Ser Gln Glu Glu Phe Tyr385 390 395 400Lys Phe Ile Lys Pro
Ile Leu Glu Lys Met Asp Gly Thr Glu Glu Leu 405 410 415Leu Val Lys
Leu Asn Arg Glu Asp Leu Leu Arg Lys Gln Arg Thr Phe 420 425 430Asp
Asn Gly Ser Ile Pro His Gln Ile His Leu Gly Glu Leu His Ala 435 440
445Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe Leu Lys Asp Asn Arg
450 455 460Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile Pro Tyr Tyr
Val Gly465 470 475 480Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp
Met Thr Arg Lys Ser 485 490 495Glu Glu Thr Ile Thr Pro Trp Asn Phe
Glu Glu Val Val Asp Lys Gly 500 505 510Ala Ser Ala Gln Ser Phe Ile
Glu Arg Met Thr Asn Phe Asp Lys Asn 515 520 525Leu Pro Asn Glu Lys
Val Leu Pro Lys His Ser Leu Leu Tyr Glu Tyr 530 535 540Phe Thr Val
Tyr Asn Glu Leu Thr Lys Val Lys Tyr Val Thr Glu Gly545 550 555
560Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln Lys Lys Ala Ile Val
565 570 575Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr Val Lys Gln
Leu Lys 580 585 590Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp Ser
Val Glu Ile Ser 595 600 605Gly Val Glu Asp Arg Phe Asn Ala Ser Leu
Gly Thr Tyr His Asp Leu 610 615 620Leu Lys Ile Ile Lys Asp Lys Asp
Phe Leu Asp Asn Glu Glu Asn Glu625 630 635 640Asp Ile Leu Glu Asp
Ile Val Leu Thr Leu Thr Leu Phe Glu Asp Arg 645 650 655Glu Met Ile
Glu Glu Arg Leu Lys Thr Tyr Ala His Leu Phe Asp Asp 660 665 670Lys
Val Met Lys Gln Leu Lys Arg Arg Arg Tyr Thr Gly Trp Gly Arg 675 680
685Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp Lys Gln Ser Gly Lys
690 695 700Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe Ala Asn Arg
Asn Phe705 710 715 720Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe
Lys Glu Asp Ile Gln 725 730 735Lys Ala Gln Val Ser Gly Gln Gly Asp
Ser Leu His Glu His Ile Ala 740 745 750Asn Leu Ala Gly Ser Pro Ala
Ile Lys Lys Gly Ile Leu Gln Thr Val 755 760 765Lys Val Val Asp Glu
Leu Val Lys Val Met Gly Arg His Lys Pro Glu 770 775 780Asn Ile Val
Ile Glu Met Ala Arg Glu Asn Gln Thr Thr Gln Lys Gly785 790 795
800Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile Glu Glu Gly Ile Lys
805 810 815Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro Val Glu Asn
Thr Gln 820 825 830Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu Gln
Asn Gly Arg Asp 835 840 845Met Tyr Val Asp Gln Glu Leu Asp Ile Asn
Arg Leu Ser Asp Tyr Asp 850 855 860Val Asp His Ile Val Pro Gln Ser
Phe Leu Lys Asp Asp Ser Ile Asp865 870 875 880Asn Lys Val Leu Thr
Arg Ser Asp Lys Asn Arg Gly Lys Ser Asp Asn 885 890 895Val Pro Ser
Glu Glu Val Val Lys Lys Met Lys Asn Tyr Trp Arg Gln 900 905 910Leu
Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys Phe Asp Asn Leu Thr 915 920
925Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp Lys Ala Gly Phe Ile
930 935 940Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr Lys His Val
Ala Gln945 950 955 960Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp
Glu Asn Asp Lys Leu 965 970 975Ile Arg Glu Val Lys Val Ile Thr Leu
Lys Ser Lys Leu Val Ser Asp 980 985 990Phe Arg Lys Asp Phe Gln Phe
Tyr Lys Val Arg Glu Ile Asn Asn Tyr 995 1000 1005His His Ala His
Asp Ala Tyr Leu Asn Ala Val Val Gly Thr Ala 1010 1015 1020Leu Ile
Lys Lys Tyr Pro Lys Leu Glu Ser Glu Phe Val Tyr Gly 1025 1030
1035Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile Ala Lys Ser Glu
1040 1045 1050Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe Phe Tyr
Ser Asn 1055 1060 1065Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu
Ala Asn Gly Glu 1070 1075 1080Ile Arg Lys Arg Pro Leu Ile Glu Thr
Asn Gly Glu Thr Gly Glu 1085 1090 1095Ile Val Trp Asp Lys Gly Arg
Asp Phe Ala Thr Val Arg Lys Val 1100 1105 1110Leu Ser Met Pro Gln
Val Asn Ile Val Lys Lys Thr Glu Val Gln 1115 1120 1125Thr Gly Gly
Phe Ser Lys Glu Ser Ile Leu Pro Lys Arg Asn Ser 1130 1135 1140Asp
Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp Pro Lys Lys Tyr 1145 1150
1155Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser Val Leu Val Val
1160 1165 1170Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu Lys Ser
Val Lys 1175 1180 1185Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser
Ser Phe Glu Lys 1190 1195 1200Asn Pro Ile Asp Phe Leu Glu Ala Lys
Gly Tyr Lys Glu Val Lys 1205 1210 1215Lys Asp Leu Ile Ile Lys Leu
Pro Lys Tyr Ser Leu Phe Glu Leu 1220 1225 1230Glu Asn Gly Arg Lys
Arg Met Leu Ala Ser Ala Gly Glu Leu Gln 1235 1240 1245Lys Gly Asn
Glu Leu Ala Leu Pro Ser Lys Tyr Val Asn Phe Leu 1250 1255 1260Tyr
Leu Ala Ser His Tyr Glu Lys Leu Lys Gly Ser Pro Glu Asp 1265 1270
1275Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His Lys His Tyr Leu
1280 1285 1290Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser Lys Arg
Val Ile 1295 1300 1305Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser
Ala Tyr Asn Lys 1310 1315 1320His Arg Asp Lys Pro Ile Arg Glu Gln
Ala Glu Asn Ile Ile His 1325 1330 1335Leu Phe Thr Leu Thr Asn Leu
Gly Ala Pro Ala Ala Phe Lys Tyr 1340 1345 1350Phe Asp Thr Thr Ile
Asp Arg Lys Arg Tyr Thr Ser Thr Lys Glu 1355 1360 1365Val Leu Asp
Ala Thr Leu Ile His Gln Ser Ile Thr Gly Leu Tyr 1370 1375 1380Glu
Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly Asp Ser Arg Ala 1385 1390
1395Asp Pro Lys Lys Lys Arg Lys Val 1400 1405