JP2001145490A - Plasmid cloned icosapentaenoic acid biosynthesis genes and cyanobacteria producing icosapentaenoic acid - Google Patents
Plasmid cloned icosapentaenoic acid biosynthesis genes and cyanobacteria producing icosapentaenoic acidInfo
- Publication number
- JP2001145490A JP2001145490A JP32916999A JP32916999A JP2001145490A JP 2001145490 A JP2001145490 A JP 2001145490A JP 32916999 A JP32916999 A JP 32916999A JP 32916999 A JP32916999 A JP 32916999A JP 2001145490 A JP2001145490 A JP 2001145490A
- Authority
- JP
- Japan
- Prior art keywords
- ala
- leu
- ser
- gly
- val
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 235000020673 eicosapentaenoic acid Nutrition 0.000 title claims abstract description 34
- 239000013612 plasmid Substances 0.000 title claims abstract description 34
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 27
- JAZBEHYOTPTENJ-JLNKQSITSA-N all-cis-5,8,11,14,17-icosapentaenoic acid Chemical compound CC\C=C/C\C=C/C\C=C/C\C=C/C\C=C/CCCC(O)=O JAZBEHYOTPTENJ-JLNKQSITSA-N 0.000 title claims abstract description 19
- 229960005135 eicosapentaenoic acid Drugs 0.000 title claims abstract description 17
- 241000192700 Cyanobacteria Species 0.000 title abstract description 22
- 230000015572 biosynthetic process Effects 0.000 title description 8
- 239000013598 vector Substances 0.000 claims abstract description 14
- 230000001851 biosynthetic effect Effects 0.000 claims abstract description 10
- 108090000790 Enzymes Proteins 0.000 claims abstract description 8
- 102000004190 Enzymes Human genes 0.000 claims abstract description 8
- JAZBEHYOTPTENJ-UHFFFAOYSA-N eicosapentaenoic acid Natural products CCC=CCC=CCC=CCC=CCC=CCCCC(O)=O JAZBEHYOTPTENJ-UHFFFAOYSA-N 0.000 claims abstract description 8
- 238000010367 cloning Methods 0.000 claims abstract description 6
- 241001464430 Cyanobacterium Species 0.000 claims description 13
- 239000002773 nucleotide Substances 0.000 claims 2
- 125000003729 nucleotide group Chemical group 0.000 claims 2
- 235000020777 polyunsaturated fatty acids Nutrition 0.000 abstract description 9
- 241000894006 Bacteria Species 0.000 abstract description 8
- 235000021122 unsaturated fatty acids Nutrition 0.000 abstract description 8
- 150000004670 unsaturated fatty acids Chemical class 0.000 abstract description 7
- 238000004519 manufacturing process Methods 0.000 abstract description 6
- 239000000126 substance Substances 0.000 abstract description 6
- 235000013305 food Nutrition 0.000 abstract description 3
- 229940079593 drug Drugs 0.000 abstract description 2
- 239000003814 drug Substances 0.000 abstract description 2
- 230000000694 effects Effects 0.000 abstract 1
- 150000004671 saturated fatty acids Chemical class 0.000 abstract 1
- 235000003441 saturated fatty acids Nutrition 0.000 abstract 1
- 241000282326 Felis catus Species 0.000 description 61
- 108010050848 glycylleucine Proteins 0.000 description 51
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 32
- 241000880493 Leptailurus serval Species 0.000 description 28
- 108010047495 alanylglycine Proteins 0.000 description 26
- 108010044940 alanylglutamine Proteins 0.000 description 24
- 239000012634 fragment Substances 0.000 description 22
- 108010087924 alanylproline Proteins 0.000 description 20
- 108010017391 lysylvaline Proteins 0.000 description 20
- 108010057821 leucylproline Proteins 0.000 description 19
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 17
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 16
- 108010005233 alanylglutamic acid Proteins 0.000 description 16
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 16
- 108020004414 DNA Proteins 0.000 description 15
- 235000014113 dietary fatty acids Nutrition 0.000 description 15
- 239000000194 fatty acid Substances 0.000 description 15
- 229930195729 fatty acid Natural products 0.000 description 15
- 108010034529 leucyl-lysine Proteins 0.000 description 15
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 15
- 210000004027 cell Anatomy 0.000 description 14
- 150000004665 fatty acids Chemical class 0.000 description 14
- 108010049041 glutamylalanine Proteins 0.000 description 14
- 108010010147 glycylglutamine Proteins 0.000 description 13
- 108010061238 threonyl-glycine Proteins 0.000 description 13
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 12
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 12
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 12
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 12
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 12
- 108010092854 aspartyllysine Proteins 0.000 description 12
- 239000002609 medium Substances 0.000 description 12
- VLKZOEOYAKHREP-UHFFFAOYSA-N n-Hexane Chemical compound CCCCCC VLKZOEOYAKHREP-UHFFFAOYSA-N 0.000 description 12
- 108010070643 prolylglutamic acid Proteins 0.000 description 12
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 11
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 11
- 108010079364 N-glycylalanine Proteins 0.000 description 11
- 108010064235 lysylglycine Proteins 0.000 description 11
- 238000000034 method Methods 0.000 description 11
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 10
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 10
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 10
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 10
- CBEUFCJRFNZMCU-SRVKXCTJSA-N Glu-Met-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O CBEUFCJRFNZMCU-SRVKXCTJSA-N 0.000 description 10
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 10
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 10
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 10
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 10
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 10
- 108010077245 asparaginyl-proline Proteins 0.000 description 10
- 108010068265 aspartyltyrosine Proteins 0.000 description 10
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 10
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 10
- 229930027917 kanamycin Natural products 0.000 description 10
- 229960000318 kanamycin Drugs 0.000 description 10
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 10
- 229930182823 kanamycin A Natural products 0.000 description 10
- 108010056582 methionylglutamic acid Proteins 0.000 description 10
- 108010031719 prolyl-serine Proteins 0.000 description 10
- 239000000523 sample Substances 0.000 description 10
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 9
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 9
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 9
- JXVXYRZQIUPYSA-NHCYSSNCSA-N Pro-Val-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JXVXYRZQIUPYSA-NHCYSSNCSA-N 0.000 description 9
- 108010070944 alanylhistidine Proteins 0.000 description 9
- 108010078144 glutaminyl-glycine Proteins 0.000 description 9
- 108010051242 phenylalanylserine Proteins 0.000 description 9
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 8
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 8
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 8
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 8
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 8
- MLZRSFQRBDNJON-GUBZILKMSA-N Gln-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MLZRSFQRBDNJON-GUBZILKMSA-N 0.000 description 8
- XXLBHPPXDUWYAG-XQXXSGGOSA-N Gln-Ala-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XXLBHPPXDUWYAG-XQXXSGGOSA-N 0.000 description 8
- PAOHIZNRJNIXQY-XQXXSGGOSA-N Gln-Thr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PAOHIZNRJNIXQY-XQXXSGGOSA-N 0.000 description 8
- SOEXCCGNHQBFPV-DLOVCJGASA-N Gln-Val-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SOEXCCGNHQBFPV-DLOVCJGASA-N 0.000 description 8
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 8
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 8
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 8
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 8
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 8
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 8
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 8
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 8
- XGZBEGGGAUQBMB-KJEVXHAQSA-N Tyr-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CC=C(C=C2)O)N)O XGZBEGGGAUQBMB-KJEVXHAQSA-N 0.000 description 8
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 8
- 108010047857 aspartylglycine Proteins 0.000 description 8
- 108010079547 glutamylmethionine Proteins 0.000 description 8
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 8
- 108010089804 glycyl-threonine Proteins 0.000 description 8
- 108010092114 histidylphenylalanine Proteins 0.000 description 8
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 8
- 108010071207 serylmethionine Proteins 0.000 description 8
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 7
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 7
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 7
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 7
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 7
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 7
- 241000863432 Shewanella putrefaciens Species 0.000 description 7
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 7
- 108010041407 alanylaspartic acid Proteins 0.000 description 7
- 108010038633 aspartylglutamate Proteins 0.000 description 7
- 108010081551 glycylphenylalanine Proteins 0.000 description 7
- 239000000203 mixture Substances 0.000 description 7
- 108010048818 seryl-histidine Proteins 0.000 description 7
- 101000768957 Acholeplasma phage L2 Uncharacterized 37.2 kDa protein Proteins 0.000 description 6
- 101000823746 Acidianus ambivalens Uncharacterized 17.7 kDa protein in bps2 3'region Proteins 0.000 description 6
- 101000916369 Acidianus ambivalens Uncharacterized protein in sor 5'region Proteins 0.000 description 6
- 101000769342 Acinetobacter guillouiae Uncharacterized protein in rpoN-murA intergenic region Proteins 0.000 description 6
- 101000823696 Actinobacillus pleuropneumoniae Uncharacterized glycosyltransferase in aroQ 3'region Proteins 0.000 description 6
- 101000786513 Agrobacterium tumefaciens (strain 15955) Uncharacterized protein outside the virF region Proteins 0.000 description 6
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 6
- ZIBWKCRKNFYTPT-ZKWXMUAHSA-N Ala-Asn-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZIBWKCRKNFYTPT-ZKWXMUAHSA-N 0.000 description 6
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 6
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 6
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 6
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 6
- 101000618005 Alkalihalobacillus pseudofirmus (strain ATCC BAA-2126 / JCM 17055 / OF4) Uncharacterized protein BpOF4_00885 Proteins 0.000 description 6
- 102100020724 Ankyrin repeat, SAM and basic leucine zipper domain-containing protein 1 Human genes 0.000 description 6
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 6
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 6
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 6
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 6
- HJCGDIGVVWETRO-ZPFDUUQYSA-N Asp-Lys-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O)C(O)=O HJCGDIGVVWETRO-ZPFDUUQYSA-N 0.000 description 6
- JXGJJQJHXHXJQF-CIUDSAMLSA-N Asp-Met-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O JXGJJQJHXHXJQF-CIUDSAMLSA-N 0.000 description 6
- 101000967489 Azorhizobium caulinodans (strain ATCC 43989 / DSM 5975 / JCM 20966 / LMG 6465 / NBRC 14845 / NCIMB 13405 / ORS 571) Uncharacterized protein AZC_3924 Proteins 0.000 description 6
- 101000823761 Bacillus licheniformis Uncharacterized 9.4 kDa protein in flaL 3'region Proteins 0.000 description 6
- 101000819719 Bacillus methanolicus Uncharacterized N-acetyltransferase in lysA 3'region Proteins 0.000 description 6
- 101000789586 Bacillus subtilis (strain 168) UPF0702 transmembrane protein YkjA Proteins 0.000 description 6
- 101000792624 Bacillus subtilis (strain 168) Uncharacterized protein YbxH Proteins 0.000 description 6
- 101000790792 Bacillus subtilis (strain 168) Uncharacterized protein YckC Proteins 0.000 description 6
- 101000819705 Bacillus subtilis (strain 168) Uncharacterized protein YlxR Proteins 0.000 description 6
- 101000948218 Bacillus subtilis (strain 168) Uncharacterized protein YtxJ Proteins 0.000 description 6
- 101000718627 Bacillus thuringiensis subsp. kurstaki Putative RNA polymerase sigma-G factor Proteins 0.000 description 6
- 101000641200 Bombyx mori densovirus Putative non-structural protein Proteins 0.000 description 6
- 101000947633 Claviceps purpurea Uncharacterized 13.8 kDa protein Proteins 0.000 description 6
- 101000948901 Enterobacteria phage T4 Uncharacterized 16.0 kDa protein in segB-ipI intergenic region Proteins 0.000 description 6
- 101000805958 Equine herpesvirus 4 (strain 1942) Virion protein US10 homolog Proteins 0.000 description 6
- 101000790442 Escherichia coli Insertion element IS2 uncharacterized 11.1 kDa protein Proteins 0.000 description 6
- 101000788354 Escherichia phage P2 Uncharacterized 8.2 kDa protein in gpA 5'region Proteins 0.000 description 6
- 101000770304 Frankia alni UPF0460 protein in nifX-nifW intergenic region Proteins 0.000 description 6
- 101000797344 Geobacillus stearothermophilus Putative tRNA (cytidine(34)-2'-O)-methyltransferase Proteins 0.000 description 6
- 101000748410 Geobacillus stearothermophilus Uncharacterized protein in fumA 3'region Proteins 0.000 description 6
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 6
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 6
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 6
- NKSGKPWXSWBRRX-ACZMJKKPSA-N Glu-Asn-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N NKSGKPWXSWBRRX-ACZMJKKPSA-N 0.000 description 6
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 6
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 6
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 6
- 101000772675 Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) UPF0438 protein HI_0847 Proteins 0.000 description 6
- 101000631019 Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) Uncharacterized protein HI_0350 Proteins 0.000 description 6
- 101000768938 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 8.9 kDa protein in int-C1 intergenic region Proteins 0.000 description 6
- PFOUFRJYHWZJKW-NKIYYHGXSA-N His-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O PFOUFRJYHWZJKW-NKIYYHGXSA-N 0.000 description 6
- 101000785414 Homo sapiens Ankyrin repeat, SAM and basic leucine zipper domain-containing protein 1 Proteins 0.000 description 6
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 6
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 6
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 6
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 6
- 101000782488 Junonia coenia densovirus (isolate pBRJ/1990) Putative non-structural protein NS2 Proteins 0.000 description 6
- 101000811523 Klebsiella pneumoniae Uncharacterized 55.8 kDa protein in cps region Proteins 0.000 description 6
- 101000818409 Lactococcus lactis subsp. lactis Uncharacterized HTH-type transcriptional regulator in lacX 3'region Proteins 0.000 description 6
- 101000878851 Leptolyngbya boryana Putative Fe(2+) transport protein A Proteins 0.000 description 6
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 6
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 6
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 6
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 6
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 6
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 6
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 6
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 6
- XABXVVSWUVCZST-GVXVVHGQSA-N Lys-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN XABXVVSWUVCZST-GVXVVHGQSA-N 0.000 description 6
- 101000758828 Methanosarcina barkeri (strain Fusaro / DSM 804) Uncharacterized protein Mbar_A1602 Proteins 0.000 description 6
- 101001122401 Middle East respiratory syndrome-related coronavirus (isolate United Kingdom/H123990006/2012) Non-structural protein ORF3 Proteins 0.000 description 6
- 101001055788 Mycolicibacterium smegmatis (strain ATCC 700084 / mc(2)155) Pentapeptide repeat protein MfpA Proteins 0.000 description 6
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 6
- 108010047562 NGR peptide Proteins 0.000 description 6
- 101000740670 Orgyia pseudotsugata multicapsid polyhedrosis virus Protein C42 Proteins 0.000 description 6
- GMWNQSGWWGKTSF-LFSVMHDDSA-N Phe-Thr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMWNQSGWWGKTSF-LFSVMHDDSA-N 0.000 description 6
- 101000769182 Photorhabdus luminescens Uncharacterized protein in pnp 3'region Proteins 0.000 description 6
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 6
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 6
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 6
- 101000961392 Pseudescherichia vulneris Uncharacterized 29.9 kDa protein in crtE 3'region Proteins 0.000 description 6
- 101000731030 Pseudomonas oleovorans Poly(3-hydroxyalkanoate) polymerase 2 Proteins 0.000 description 6
- 101001065485 Pseudomonas putida Probable fatty acid methyltransferase Proteins 0.000 description 6
- 101000711023 Rhizobium leguminosarum bv. trifolii Uncharacterized protein in tfuA 3'region Proteins 0.000 description 6
- 101000948156 Rhodococcus erythropolis Uncharacterized 47.3 kDa protein in thcA 5'region Proteins 0.000 description 6
- 101000917565 Rhodococcus fascians Uncharacterized 33.6 kDa protein in fasciation locus Proteins 0.000 description 6
- 101000790284 Saimiriine herpesvirus 2 (strain 488) Uncharacterized 9.5 kDa protein in DHFR 3'region Proteins 0.000 description 6
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 6
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 6
- 101000936719 Streptococcus gordonii Accessory Sec system protein Asp3 Proteins 0.000 description 6
- 101000788499 Streptomyces coelicolor Uncharacterized oxidoreductase in mprA 5'region Proteins 0.000 description 6
- 101001102841 Streptomyces griseus Purine nucleoside phosphorylase ORF3 Proteins 0.000 description 6
- 101000708557 Streptomyces lincolnensis Uncharacterized 17.2 kDa protein in melC2-rnhH intergenic region Proteins 0.000 description 6
- 101000649826 Thermotoga neapolitana Putative anti-sigma factor antagonist TM1081 homolog Proteins 0.000 description 6
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 6
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 6
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 6
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 6
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 6
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 6
- 101000827562 Vibrio alginolyticus Uncharacterized protein in proC 3'region Proteins 0.000 description 6
- 101000778915 Vibrio parahaemolyticus serotype O3:K6 (strain RIMD 2210633) Uncharacterized membrane protein VP2115 Proteins 0.000 description 6
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 6
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 6
- 108010025306 histidylleucine Proteins 0.000 description 6
- 239000007788 liquid Substances 0.000 description 6
- 108010085203 methionylmethionine Proteins 0.000 description 6
- 108010026333 seryl-proline Proteins 0.000 description 6
- 108010073969 valyllysine Proteins 0.000 description 6
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 5
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 5
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 5
- IXTPACPAXIOCRG-ACZMJKKPSA-N Ala-Glu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N IXTPACPAXIOCRG-ACZMJKKPSA-N 0.000 description 5
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 5
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 5
- NYGILGUOUOXGMJ-YUMQZZPRSA-N Asn-Lys-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O NYGILGUOUOXGMJ-YUMQZZPRSA-N 0.000 description 5
- JWQWPRCDYWNVNM-ACZMJKKPSA-N Asn-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N JWQWPRCDYWNVNM-ACZMJKKPSA-N 0.000 description 5
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 5
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 5
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 5
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 5
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 5
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 5
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 5
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 5
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 5
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 5
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 5
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 5
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 5
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 5
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 5
- FBQMBZLJHOQAIH-GUBZILKMSA-N Met-Asp-Met Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O FBQMBZLJHOQAIH-GUBZILKMSA-N 0.000 description 5
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 5
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 5
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 5
- QGVBFDIREUUSHX-IFFSRLJSSA-N Thr-Val-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O QGVBFDIREUUSHX-IFFSRLJSSA-N 0.000 description 5
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 5
- 108010015792 glycyllysine Proteins 0.000 description 5
- 108010037850 glycylvaline Proteins 0.000 description 5
- 108010040030 histidinoalanine Proteins 0.000 description 5
- 230000014759 maintenance of location Effects 0.000 description 5
- 108010012581 phenylalanylglutamate Proteins 0.000 description 5
- 108010004914 prolylarginine Proteins 0.000 description 5
- 108010051110 tyrosyl-lysine Proteins 0.000 description 5
- 108010036211 5-HT-moduline Proteins 0.000 description 4
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 4
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 4
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 4
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 4
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 4
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 4
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 4
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 4
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 4
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 4
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 4
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 4
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 4
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 4
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 4
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 4
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 4
- VHVVPYOJIIQCKS-QEJZJMRPSA-N Ala-Leu-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VHVVPYOJIIQCKS-QEJZJMRPSA-N 0.000 description 4
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 4
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 4
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 4
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 4
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 4
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 4
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 4
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 4
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 4
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 4
- SAHQGRZIQVEJPF-JXUBOQSCSA-N Ala-Thr-Lys Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN SAHQGRZIQVEJPF-JXUBOQSCSA-N 0.000 description 4
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 4
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 4
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 4
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 4
- HJAICMSAKODKRF-GUBZILKMSA-N Arg-Cys-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O HJAICMSAKODKRF-GUBZILKMSA-N 0.000 description 4
- FEZJJKXNPSEYEV-CIUDSAMLSA-N Arg-Gln-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FEZJJKXNPSEYEV-CIUDSAMLSA-N 0.000 description 4
- LMPKCSXZJSXBBL-NHCYSSNCSA-N Arg-Gln-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O LMPKCSXZJSXBBL-NHCYSSNCSA-N 0.000 description 4
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 4
- ATABBWFGOHKROJ-GUBZILKMSA-N Arg-Pro-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O ATABBWFGOHKROJ-GUBZILKMSA-N 0.000 description 4
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 4
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 4
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 4
- RZVVKNIACROXRM-ZLUOBGJFSA-N Asn-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N RZVVKNIACROXRM-ZLUOBGJFSA-N 0.000 description 4
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 4
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 4
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 4
- JEPNYDRDYNSFIU-QXEWZRGKSA-N Asn-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(N)=O)C(O)=O JEPNYDRDYNSFIU-QXEWZRGKSA-N 0.000 description 4
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 4
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 description 4
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 4
- JTXVXGXTRXMOFJ-FXQIFTODSA-N Asn-Pro-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O JTXVXGXTRXMOFJ-FXQIFTODSA-N 0.000 description 4
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 4
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 4
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 4
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 4
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 4
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 4
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 4
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 4
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 4
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 4
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 4
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 4
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 4
- WOKXEQLPBLLWHC-IHRRRGAJSA-N Asp-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 WOKXEQLPBLLWHC-IHRRRGAJSA-N 0.000 description 4
- NOCCABSVTRONIN-CIUDSAMLSA-N Cys-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N NOCCABSVTRONIN-CIUDSAMLSA-N 0.000 description 4
- BYALSSDCQYHKMY-XGEHTFHBSA-N Cys-Arg-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N)O BYALSSDCQYHKMY-XGEHTFHBSA-N 0.000 description 4
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 4
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 4
- RTZKZFJDLAIYFH-UHFFFAOYSA-N Diethyl ether Chemical compound CCOCC RTZKZFJDLAIYFH-UHFFFAOYSA-N 0.000 description 4
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 4
- KZEUVLLVULIPNX-GUBZILKMSA-N Gln-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N KZEUVLLVULIPNX-GUBZILKMSA-N 0.000 description 4
- LPYPANUXJGFMGV-FXQIFTODSA-N Gln-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LPYPANUXJGFMGV-FXQIFTODSA-N 0.000 description 4
- KKCJHBXMYYVWMX-KQXIARHKSA-N Gln-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N KKCJHBXMYYVWMX-KQXIARHKSA-N 0.000 description 4
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 4
- FKXCBKCOSVIGCT-AVGNSLFASA-N Gln-Lys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FKXCBKCOSVIGCT-AVGNSLFASA-N 0.000 description 4
- LHMWTCWZARHLPV-CIUDSAMLSA-N Gln-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LHMWTCWZARHLPV-CIUDSAMLSA-N 0.000 description 4
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 4
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 4
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 4
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 4
- XHWLNISLUFEWNS-CIUDSAMLSA-N Glu-Gln-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XHWLNISLUFEWNS-CIUDSAMLSA-N 0.000 description 4
- XOIATPHFYVWFEU-DCAQKATOSA-N Glu-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XOIATPHFYVWFEU-DCAQKATOSA-N 0.000 description 4
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 4
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 4
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 4
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 4
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 4
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 4
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 4
- YRMZCZIRHYCNHX-RYUDHWBXSA-N Glu-Phe-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O YRMZCZIRHYCNHX-RYUDHWBXSA-N 0.000 description 4
- AAJHGGDRKHYSDH-GUBZILKMSA-N Glu-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O AAJHGGDRKHYSDH-GUBZILKMSA-N 0.000 description 4
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 4
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 4
- RGJKYNUINKGPJN-RWRJDSDZSA-N Glu-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(=O)O)N RGJKYNUINKGPJN-RWRJDSDZSA-N 0.000 description 4
- HBMRTXJZQDVRFT-DZKIICNBSA-N Glu-Tyr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HBMRTXJZQDVRFT-DZKIICNBSA-N 0.000 description 4
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 4
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 4
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 4
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 4
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 4
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 4
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 4
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 4
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 4
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 4
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 4
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 4
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 4
- KOYUSMBPJOVSOO-XEGUGMAKSA-N Gly-Tyr-Ile Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KOYUSMBPJOVSOO-XEGUGMAKSA-N 0.000 description 4
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 4
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 4
- QIVPRLJQQVXCIY-HGNGGELXSA-N His-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](CCC(N)=O)C(O)=O QIVPRLJQQVXCIY-HGNGGELXSA-N 0.000 description 4
- TVMNTHXFRSXZGR-IHRRRGAJSA-N His-Lys-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O TVMNTHXFRSXZGR-IHRRRGAJSA-N 0.000 description 4
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 4
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 4
- QICVAHODWHIWIS-HTFCKZLJSA-N Ile-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N QICVAHODWHIWIS-HTFCKZLJSA-N 0.000 description 4
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 4
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 4
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 4
- PNTWNAXGBOZMBO-MNXVOIDGSA-N Ile-Lys-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PNTWNAXGBOZMBO-MNXVOIDGSA-N 0.000 description 4
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 4
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 4
- AGGIYSLVUKVOPT-HTFCKZLJSA-N Ile-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N AGGIYSLVUKVOPT-HTFCKZLJSA-N 0.000 description 4
- 108010065920 Insulin Lispro Proteins 0.000 description 4
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 4
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 4
- QDSKNVXKLPQNOJ-GVXVVHGQSA-N Leu-Gln-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QDSKNVXKLPQNOJ-GVXVVHGQSA-N 0.000 description 4
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 4
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 4
- QPXBPQUGXHURGP-UWVGGRQHSA-N Leu-Gly-Met Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N QPXBPQUGXHURGP-UWVGGRQHSA-N 0.000 description 4
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 4
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 4
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 4
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 4
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 4
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 4
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 4
- ADJWHHZETYAAAX-SRVKXCTJSA-N Leu-Ser-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ADJWHHZETYAAAX-SRVKXCTJSA-N 0.000 description 4
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 4
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 4
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 4
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 4
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 4
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 4
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 4
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 4
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 4
- GAOJCVKPIGHTGO-UWVGGRQHSA-N Lys-Arg-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O GAOJCVKPIGHTGO-UWVGGRQHSA-N 0.000 description 4
- SSJBMGCZZXCGJJ-DCAQKATOSA-N Lys-Asp-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O SSJBMGCZZXCGJJ-DCAQKATOSA-N 0.000 description 4
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 4
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 4
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 4
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 4
- UWHCKWNPWKTMBM-WDCWCFNPSA-N Lys-Thr-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWHCKWNPWKTMBM-WDCWCFNPSA-N 0.000 description 4
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 4
- HMZPYMSEAALNAE-ULQDDVLXSA-N Lys-Val-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMZPYMSEAALNAE-ULQDDVLXSA-N 0.000 description 4
- FVKRQMQQFGBXHV-QXEWZRGKSA-N Met-Asp-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FVKRQMQQFGBXHV-QXEWZRGKSA-N 0.000 description 4
- HGAJNEWOUHDUMZ-SRVKXCTJSA-N Met-Leu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O HGAJNEWOUHDUMZ-SRVKXCTJSA-N 0.000 description 4
- XOFDBXYPKZUAAM-GUBZILKMSA-N Met-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N XOFDBXYPKZUAAM-GUBZILKMSA-N 0.000 description 4
- GGXZOTSDJJTDGB-GUBZILKMSA-N Met-Ser-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O GGXZOTSDJJTDGB-GUBZILKMSA-N 0.000 description 4
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 4
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 4
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 description 4
- GYEPCBNTTRORKW-PCBIJLKTSA-N Phe-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O GYEPCBNTTRORKW-PCBIJLKTSA-N 0.000 description 4
- JTKGCYOOJLUETJ-ULQDDVLXSA-N Phe-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JTKGCYOOJLUETJ-ULQDDVLXSA-N 0.000 description 4
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 4
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 4
- CQZNGNCAIXMAIQ-UBHSHLNASA-N Pro-Ala-Phe Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O CQZNGNCAIXMAIQ-UBHSHLNASA-N 0.000 description 4
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 4
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 4
- DIFXZGPHVCIVSQ-CIUDSAMLSA-N Pro-Gln-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DIFXZGPHVCIVSQ-CIUDSAMLSA-N 0.000 description 4
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 4
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 4
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 description 4
- GURGCNUWVSDYTP-SRVKXCTJSA-N Pro-Leu-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GURGCNUWVSDYTP-SRVKXCTJSA-N 0.000 description 4
- INDVYIOKMXFQFM-SRVKXCTJSA-N Pro-Lys-Gln Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O INDVYIOKMXFQFM-SRVKXCTJSA-N 0.000 description 4
- BARPGRUZBKFJMA-SRVKXCTJSA-N Pro-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BARPGRUZBKFJMA-SRVKXCTJSA-N 0.000 description 4
- HRIXMVRZRGFKNQ-HJGDQZAQSA-N Pro-Thr-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HRIXMVRZRGFKNQ-HJGDQZAQSA-N 0.000 description 4
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 description 4
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 4
- XDKKMRPRRCOELJ-GUBZILKMSA-N Pro-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 XDKKMRPRRCOELJ-GUBZILKMSA-N 0.000 description 4
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 4
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 4
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 4
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 4
- YPUSXTWURJANKF-KBIXCLLPSA-N Ser-Gln-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YPUSXTWURJANKF-KBIXCLLPSA-N 0.000 description 4
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 4
- KJMOINFQVCCSDX-XKBZYTNZSA-N Ser-Gln-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KJMOINFQVCCSDX-XKBZYTNZSA-N 0.000 description 4
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 4
- YQQKYAZABFEYAF-FXQIFTODSA-N Ser-Glu-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQQKYAZABFEYAF-FXQIFTODSA-N 0.000 description 4
- MOQDPPUMFSMYOM-KKUMJFAQSA-N Ser-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CO)N MOQDPPUMFSMYOM-KKUMJFAQSA-N 0.000 description 4
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 4
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 4
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 4
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 4
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 4
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 4
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 4
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 4
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 4
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 4
- SWIKDOUVROTZCW-GCJQMDKQSA-N Thr-Asn-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O SWIKDOUVROTZCW-GCJQMDKQSA-N 0.000 description 4
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 4
- WNQJTLATMXYSEL-OEAJRASXSA-N Thr-Phe-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WNQJTLATMXYSEL-OEAJRASXSA-N 0.000 description 4
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 4
- GFRIEEKFXOVPIR-RHYQMDGZSA-N Thr-Pro-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O GFRIEEKFXOVPIR-RHYQMDGZSA-N 0.000 description 4
- HYVLNORXQGKONN-NUTKFTJISA-N Trp-Ala-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 HYVLNORXQGKONN-NUTKFTJISA-N 0.000 description 4
- QHEGAOPHISYNDF-XDTLVQLUSA-N Tyr-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHEGAOPHISYNDF-XDTLVQLUSA-N 0.000 description 4
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 4
- QWDCYFDDFPWISL-UHFFFAOYSA-N UNPD207407 Natural products CCC=CCC=CCC=CCC=CCC=CCCCC(=O)OC QWDCYFDDFPWISL-UHFFFAOYSA-N 0.000 description 4
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 4
- DNOOLPROHJWCSQ-RCWTZXSCSA-N Val-Arg-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DNOOLPROHJWCSQ-RCWTZXSCSA-N 0.000 description 4
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 4
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 4
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 4
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 4
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 4
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 4
- IRLYZKKNBFPQBW-XGEHTFHBSA-N Val-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N)O IRLYZKKNBFPQBW-XGEHTFHBSA-N 0.000 description 4
- CFSSLXZJEMERJY-NRPADANISA-N Val-Gln-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CFSSLXZJEMERJY-NRPADANISA-N 0.000 description 4
- YCMXFKWYJFZFKS-LAEOZQHASA-N Val-Gln-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCMXFKWYJFZFKS-LAEOZQHASA-N 0.000 description 4
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 4
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 4
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 4
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 4
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 4
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 4
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 4
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 4
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 4
- 108010070783 alanyltyrosine Proteins 0.000 description 4
- 108010038850 arginyl-isoleucyl-tyrosine Proteins 0.000 description 4
- 108010062796 arginyllysine Proteins 0.000 description 4
- 108010060035 arginylproline Proteins 0.000 description 4
- 238000005119 centrifugation Methods 0.000 description 4
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 4
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 4
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 4
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 4
- 108010020688 glycylhistidine Proteins 0.000 description 4
- 108010087823 glycyltyrosine Proteins 0.000 description 4
- 108010036413 histidylglycine Proteins 0.000 description 4
- 108010028295 histidylhistidine Proteins 0.000 description 4
- 108010018006 histidylserine Proteins 0.000 description 4
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 4
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 4
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 4
- 108010003700 lysyl aspartic acid Proteins 0.000 description 4
- 108010038320 lysylphenylalanine Proteins 0.000 description 4
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 4
- 150000004702 methyl esters Chemical class 0.000 description 4
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 4
- 108010024607 phenylalanylalanine Proteins 0.000 description 4
- 108010053725 prolylvaline Proteins 0.000 description 4
- 102220201851 rs143406017 Human genes 0.000 description 4
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 4
- 239000011780 sodium chloride Substances 0.000 description 4
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 4
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 4
- 108010078580 tyrosylleucine Proteins 0.000 description 4
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 4
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 4
- 241000251468 Actinopterygii Species 0.000 description 3
- 229920001817 Agar Polymers 0.000 description 3
- QHASENCZLDHBGX-ONGXEEELSA-N Ala-Gly-Phe Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QHASENCZLDHBGX-ONGXEEELSA-N 0.000 description 3
- CFPQUJZTLUQUTJ-HTFCKZLJSA-N Ala-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H](C)N CFPQUJZTLUQUTJ-HTFCKZLJSA-N 0.000 description 3
- DGLQWAFPIXDKRL-UBHSHLNASA-N Ala-Met-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N DGLQWAFPIXDKRL-UBHSHLNASA-N 0.000 description 3
- HCBKAOZYACJUEF-XQXXSGGOSA-N Ala-Thr-Gln Chemical compound N[C@@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(N)=O)C(=O)O HCBKAOZYACJUEF-XQXXSGGOSA-N 0.000 description 3
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 3
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 3
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 3
- FAEFJTCTNZTPHX-ACZMJKKPSA-N Asn-Gln-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FAEFJTCTNZTPHX-ACZMJKKPSA-N 0.000 description 3
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 3
- NLDNNZKUSLAYFW-NHCYSSNCSA-N Asn-Lys-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLDNNZKUSLAYFW-NHCYSSNCSA-N 0.000 description 3
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 3
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 3
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 3
- VHUKCUHLFMRHOD-MELADBBJSA-N Asp-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O VHUKCUHLFMRHOD-MELADBBJSA-N 0.000 description 3
- 241000588724 Escherichia coli Species 0.000 description 3
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 3
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 3
- OETQLUYCMBARHJ-CIUDSAMLSA-N Gln-Asn-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OETQLUYCMBARHJ-CIUDSAMLSA-N 0.000 description 3
- ARYKRXHBIPLULY-XKBZYTNZSA-N Gln-Thr-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ARYKRXHBIPLULY-XKBZYTNZSA-N 0.000 description 3
- FLQAKQOBSPFGKG-CIUDSAMLSA-N Glu-Cys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLQAKQOBSPFGKG-CIUDSAMLSA-N 0.000 description 3
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 3
- CQGBSALYGOXQPE-HTUGSXCWSA-N Glu-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O CQGBSALYGOXQPE-HTUGSXCWSA-N 0.000 description 3
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 3
- UZWUBBRJWFTHTD-LAEOZQHASA-N Glu-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O UZWUBBRJWFTHTD-LAEOZQHASA-N 0.000 description 3
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 3
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 3
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 3
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 3
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 3
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 3
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 3
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 3
- FULZDMOZUZKGQU-ONGXEEELSA-N Gly-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN FULZDMOZUZKGQU-ONGXEEELSA-N 0.000 description 3
- KYFGGRHWLFZXPU-KKUMJFAQSA-N His-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N KYFGGRHWLFZXPU-KKUMJFAQSA-N 0.000 description 3
- YKRYHWJRQUSTKG-KBIXCLLPSA-N Ile-Ala-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKRYHWJRQUSTKG-KBIXCLLPSA-N 0.000 description 3
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 3
- PJLLMGWWINYQPB-PEFMBERDSA-N Ile-Asn-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PJLLMGWWINYQPB-PEFMBERDSA-N 0.000 description 3
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 3
- NZGTYCMLUGYMCV-XUXIUFHCSA-N Ile-Lys-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N NZGTYCMLUGYMCV-XUXIUFHCSA-N 0.000 description 3
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 3
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 3
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 3
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 3
- XIRYQRLFHWWWTC-QEJZJMRPSA-N Leu-Ala-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XIRYQRLFHWWWTC-QEJZJMRPSA-N 0.000 description 3
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 3
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 3
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 3
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 3
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 3
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 3
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 3
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 3
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 3
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 3
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 3
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 3
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 3
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 3
- CAODKDAPYGUMLK-FXQIFTODSA-N Met-Asn-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CAODKDAPYGUMLK-FXQIFTODSA-N 0.000 description 3
- JPCHYAUKOUGOIB-HJGDQZAQSA-N Met-Glu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPCHYAUKOUGOIB-HJGDQZAQSA-N 0.000 description 3
- ZIIMORLEZLVRIP-SRVKXCTJSA-N Met-Leu-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZIIMORLEZLVRIP-SRVKXCTJSA-N 0.000 description 3
- RSOMVHWMIAZNLE-HJWJTTGWSA-N Met-Phe-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RSOMVHWMIAZNLE-HJWJTTGWSA-N 0.000 description 3
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 3
- UPJGUQPLYWTISV-GUBZILKMSA-N Pro-Gln-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UPJGUQPLYWTISV-GUBZILKMSA-N 0.000 description 3
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 3
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 3
- IDQFQFVEWMWRQQ-DLOVCJGASA-N Ser-Ala-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IDQFQFVEWMWRQQ-DLOVCJGASA-N 0.000 description 3
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 3
- UBRXAVQWXOWRSJ-ZLUOBGJFSA-N Ser-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)C(=O)N UBRXAVQWXOWRSJ-ZLUOBGJFSA-N 0.000 description 3
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 3
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 3
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 3
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 3
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 3
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 3
- DKDHTRVDOUZZTP-IFFSRLJSSA-N Thr-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DKDHTRVDOUZZTP-IFFSRLJSSA-N 0.000 description 3
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 3
- CRZNCABIJLRFKZ-IUKAMOBKSA-N Thr-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N CRZNCABIJLRFKZ-IUKAMOBKSA-N 0.000 description 3
- XZUBGOYOGDRYFC-XGEHTFHBSA-N Thr-Ser-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O XZUBGOYOGDRYFC-XGEHTFHBSA-N 0.000 description 3
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 3
- AKXBNSZMYAOGLS-STQMWFEESA-N Tyr-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AKXBNSZMYAOGLS-STQMWFEESA-N 0.000 description 3
- FMXFHNSFABRVFZ-BZSNNMDCSA-N Tyr-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FMXFHNSFABRVFZ-BZSNNMDCSA-N 0.000 description 3
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 3
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 3
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 3
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 3
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 3
- LJSZPMSUYKKKCP-UBHSHLNASA-N Val-Phe-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 LJSZPMSUYKKKCP-UBHSHLNASA-N 0.000 description 3
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 3
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 3
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 3
- 239000008272 agar Substances 0.000 description 3
- 108010013835 arginine glutamate Proteins 0.000 description 3
- 108010093581 aspartyl-proline Proteins 0.000 description 3
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 235000019688 fish Nutrition 0.000 description 3
- 238000004817 gas chromatography Methods 0.000 description 3
- 238000002290 gas chromatography-mass spectrometry Methods 0.000 description 3
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 3
- 108010068488 methionylphenylalanine Proteins 0.000 description 3
- 238000002360 preparation method Methods 0.000 description 3
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 3
- 108010090894 prolylleucine Proteins 0.000 description 3
- OFHXPCLWHLXQHT-JKQORVJESA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2,6-diaminohexanoyl]amino]-3-methylbutanoyl]amino]-4-methylpentanoyl]amino]butanedioic acid Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN OFHXPCLWHLXQHT-JKQORVJESA-N 0.000 description 2
- ZEIYPKQQLSUPOT-QORCZRPOSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-phenylpropanoic acid Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 ZEIYPKQQLSUPOT-QORCZRPOSA-N 0.000 description 2
- ARNGIGOPGOEJCH-KKUMJFAQSA-N (3s)-3-[[2-[[(2s)-2-amino-5-(diaminomethylideneamino)pentanoyl]amino]acetyl]amino]-4-[[(1s)-1-carboxy-2-phenylethyl]amino]-4-oxobutanoic acid Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ARNGIGOPGOEJCH-KKUMJFAQSA-N 0.000 description 2
- HXUVTXPOZRFMOY-NSHDSACASA-N 2-[[(2s)-2-[[2-[(2-aminoacetyl)amino]acetyl]amino]-3-phenylpropanoyl]amino]acetic acid Chemical compound NCC(=O)NCC(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 HXUVTXPOZRFMOY-NSHDSACASA-N 0.000 description 2
- QVOBNSFUVPLVPE-ROUUACIJSA-N 2-[[(2s)-2-[[2-[[(2s)-2-amino-3-phenylpropanoyl]amino]acetyl]amino]-3-phenylpropanoyl]amino]acetic acid Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 QVOBNSFUVPLVPE-ROUUACIJSA-N 0.000 description 2
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 2
- 101000621943 Acholeplasma phage L2 Probable integrase/recombinase Proteins 0.000 description 2
- UWQJHXKARZWDIJ-ZLUOBGJFSA-N Ala-Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O UWQJHXKARZWDIJ-ZLUOBGJFSA-N 0.000 description 2
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 2
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 2
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 2
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 description 2
- WRDANSJTFOHBPI-FXQIFTODSA-N Ala-Arg-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N WRDANSJTFOHBPI-FXQIFTODSA-N 0.000 description 2
- KVWLTGNCJYDJET-LSJOCFKGSA-N Ala-Arg-His Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KVWLTGNCJYDJET-LSJOCFKGSA-N 0.000 description 2
- LBJYAILUMSUTAM-ZLUOBGJFSA-N Ala-Asn-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LBJYAILUMSUTAM-ZLUOBGJFSA-N 0.000 description 2
- ZEXDYVGDZJBRMO-ACZMJKKPSA-N Ala-Asn-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZEXDYVGDZJBRMO-ACZMJKKPSA-N 0.000 description 2
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 2
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 2
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 2
- GORKKVHIBWAQHM-GCJQMDKQSA-N Ala-Asn-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GORKKVHIBWAQHM-GCJQMDKQSA-N 0.000 description 2
- FOWHQTWRLFTELJ-FXQIFTODSA-N Ala-Asp-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N FOWHQTWRLFTELJ-FXQIFTODSA-N 0.000 description 2
- DAEFQZCYZKRTLR-ZLUOBGJFSA-N Ala-Cys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O DAEFQZCYZKRTLR-ZLUOBGJFSA-N 0.000 description 2
- YEELWQSXYBJVSV-UWJYBYFXSA-N Ala-Cys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YEELWQSXYBJVSV-UWJYBYFXSA-N 0.000 description 2
- RXTBLQVXNIECFP-FXQIFTODSA-N Ala-Gln-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RXTBLQVXNIECFP-FXQIFTODSA-N 0.000 description 2
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 2
- FVSOUJZKYWEFOB-KBIXCLLPSA-N Ala-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)N FVSOUJZKYWEFOB-KBIXCLLPSA-N 0.000 description 2
- JPGBXANAQYHTLA-DRZSPHRISA-N Ala-Gln-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JPGBXANAQYHTLA-DRZSPHRISA-N 0.000 description 2
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 2
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 2
- FBHOPGDGELNWRH-DRZSPHRISA-N Ala-Glu-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FBHOPGDGELNWRH-DRZSPHRISA-N 0.000 description 2
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 2
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 2
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 2
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 2
- NIZKGBJVCMRDKO-KWQFWETISA-N Ala-Gly-Tyr Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NIZKGBJVCMRDKO-KWQFWETISA-N 0.000 description 2
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 2
- NMXKFWOEASXOGB-QSFUFRPTSA-N Ala-Ile-His Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NMXKFWOEASXOGB-QSFUFRPTSA-N 0.000 description 2
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 2
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 2
- LBYMZCVBOKYZNS-CIUDSAMLSA-N Ala-Leu-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O LBYMZCVBOKYZNS-CIUDSAMLSA-N 0.000 description 2
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 2
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 2
- ZKEHTYWGPMMGBC-XUXIUFHCSA-N Ala-Leu-Leu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O ZKEHTYWGPMMGBC-XUXIUFHCSA-N 0.000 description 2
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 2
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 2
- PIXQDIGKDNNOOV-GUBZILKMSA-N Ala-Lys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O PIXQDIGKDNNOOV-GUBZILKMSA-N 0.000 description 2
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 2
- OQWQTGBOFPJOIF-DLOVCJGASA-N Ala-Lys-His Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N OQWQTGBOFPJOIF-DLOVCJGASA-N 0.000 description 2
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 2
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 2
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 2
- NLOMBWNGESDVJU-GUBZILKMSA-N Ala-Met-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLOMBWNGESDVJU-GUBZILKMSA-N 0.000 description 2
- PVQLRJRPUTXFFX-CIUDSAMLSA-N Ala-Met-Gln Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O PVQLRJRPUTXFFX-CIUDSAMLSA-N 0.000 description 2
- SBYABBIDCYVZFF-BJDJZHNGSA-N Ala-Met-Gln-Gln Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O SBYABBIDCYVZFF-BJDJZHNGSA-N 0.000 description 2
- OMDNCNKNEGFOMM-BQBZGAKWSA-N Ala-Met-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O OMDNCNKNEGFOMM-BQBZGAKWSA-N 0.000 description 2
- MAEQBGQTDWDSJQ-LSJOCFKGSA-N Ala-Met-His Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N MAEQBGQTDWDSJQ-LSJOCFKGSA-N 0.000 description 2
- CJQAEJMHBAOQHA-DLOVCJGASA-N Ala-Phe-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CJQAEJMHBAOQHA-DLOVCJGASA-N 0.000 description 2
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 2
- WEZNQZHACPSMEF-QEJZJMRPSA-N Ala-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 WEZNQZHACPSMEF-QEJZJMRPSA-N 0.000 description 2
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 2
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 2
- GMGWOTQMUKYZIE-UBHSHLNASA-N Ala-Pro-Phe Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GMGWOTQMUKYZIE-UBHSHLNASA-N 0.000 description 2
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 2
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 2
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 2
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 2
- UCDOXFBTMLKASE-HERUPUMHSA-N Ala-Ser-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N UCDOXFBTMLKASE-HERUPUMHSA-N 0.000 description 2
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 2
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 description 2
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 2
- ISCYZXFOCXWUJU-KZVJFYERSA-N Ala-Thr-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O ISCYZXFOCXWUJU-KZVJFYERSA-N 0.000 description 2
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 2
- IEAUDUOCWNPZBR-LKTVYLICSA-N Ala-Trp-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N IEAUDUOCWNPZBR-LKTVYLICSA-N 0.000 description 2
- QDGMZAOSMNGBLP-MRFFXTKBSA-N Ala-Trp-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N QDGMZAOSMNGBLP-MRFFXTKBSA-N 0.000 description 2
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 2
- MUGAESARFRGOTQ-IGNZVWTISA-N Ala-Tyr-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N MUGAESARFRGOTQ-IGNZVWTISA-N 0.000 description 2
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 2
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 2
- SSQHYGLFYWZWDV-UVBJJODRSA-N Ala-Val-Trp Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O SSQHYGLFYWZWDV-UVBJJODRSA-N 0.000 description 2
- 101000618348 Allochromatium vinosum (strain ATCC 17899 / DSM 180 / NBRC 103801 / NCIMB 10441 / D) Uncharacterized protein Alvin_0065 Proteins 0.000 description 2
- KJGNDQCYBNBXDA-GUBZILKMSA-N Arg-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N KJGNDQCYBNBXDA-GUBZILKMSA-N 0.000 description 2
- DCGLNNVKIZXQOJ-FXQIFTODSA-N Arg-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DCGLNNVKIZXQOJ-FXQIFTODSA-N 0.000 description 2
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 2
- GHNDBBVSWOWYII-LPEHRKFASA-N Arg-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GHNDBBVSWOWYII-LPEHRKFASA-N 0.000 description 2
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 2
- XVLLUZMFSAYKJV-GUBZILKMSA-N Arg-Asp-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XVLLUZMFSAYKJV-GUBZILKMSA-N 0.000 description 2
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 2
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 2
- XTGGTAWGUFXJSV-NAKRPEOUSA-N Arg-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N XTGGTAWGUFXJSV-NAKRPEOUSA-N 0.000 description 2
- SNBHMYQRNCJSOJ-CIUDSAMLSA-N Arg-Gln-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SNBHMYQRNCJSOJ-CIUDSAMLSA-N 0.000 description 2
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 2
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 2
- PHHRSPBBQUFULD-UWVGGRQHSA-N Arg-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N PHHRSPBBQUFULD-UWVGGRQHSA-N 0.000 description 2
- SLNCSSWAIDUUGF-LSJOCFKGSA-N Arg-His-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O SLNCSSWAIDUUGF-LSJOCFKGSA-N 0.000 description 2
- YKBHOXLMMPZPHQ-GMOBBJLQSA-N Arg-Ile-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O YKBHOXLMMPZPHQ-GMOBBJLQSA-N 0.000 description 2
- YQGZIRIYGHNSQO-ZPFDUUQYSA-N Arg-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YQGZIRIYGHNSQO-ZPFDUUQYSA-N 0.000 description 2
- FNXCAFKDGBROCU-STECZYCISA-N Arg-Ile-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FNXCAFKDGBROCU-STECZYCISA-N 0.000 description 2
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 2
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 2
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 2
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 2
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 2
- MTYLORHAQXVQOW-AVGNSLFASA-N Arg-Lys-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O MTYLORHAQXVQOW-AVGNSLFASA-N 0.000 description 2
- YTMKMRSYXHBGER-IHRRRGAJSA-N Arg-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YTMKMRSYXHBGER-IHRRRGAJSA-N 0.000 description 2
- NIELFHOLFTUZME-HJWJTTGWSA-N Arg-Phe-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NIELFHOLFTUZME-HJWJTTGWSA-N 0.000 description 2
- LXMKTIZAGIBQRX-HRCADAONSA-N Arg-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O LXMKTIZAGIBQRX-HRCADAONSA-N 0.000 description 2
- HNJNAMGZQZPSRE-GUBZILKMSA-N Arg-Pro-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O HNJNAMGZQZPSRE-GUBZILKMSA-N 0.000 description 2
- DNBMCNQKNOKOSD-DCAQKATOSA-N Arg-Pro-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O DNBMCNQKNOKOSD-DCAQKATOSA-N 0.000 description 2
- OWSMKCJUBAPHED-JYJNAYRXSA-N Arg-Pro-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OWSMKCJUBAPHED-JYJNAYRXSA-N 0.000 description 2
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 2
- VRTWYUYCJGNFES-CIUDSAMLSA-N Arg-Ser-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O VRTWYUYCJGNFES-CIUDSAMLSA-N 0.000 description 2
- JQHASVQBAKRJKD-GUBZILKMSA-N Arg-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JQHASVQBAKRJKD-GUBZILKMSA-N 0.000 description 2
- VJIQPOJMISSUPO-BVSLBCMMSA-N Arg-Trp-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VJIQPOJMISSUPO-BVSLBCMMSA-N 0.000 description 2
- CTAPSNCVKPOOSM-KKUMJFAQSA-N Arg-Tyr-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O CTAPSNCVKPOOSM-KKUMJFAQSA-N 0.000 description 2
- PSUXEQYPYZLNER-QXEWZRGKSA-N Arg-Val-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PSUXEQYPYZLNER-QXEWZRGKSA-N 0.000 description 2
- XEOXPCNONWHHSW-AVGNSLFASA-N Arg-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XEOXPCNONWHHSW-AVGNSLFASA-N 0.000 description 2
- FXGMURPOWCKNAZ-JYJNAYRXSA-N Arg-Val-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FXGMURPOWCKNAZ-JYJNAYRXSA-N 0.000 description 2
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 2
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 2
- PDQBXRSOSCTGKY-ACZMJKKPSA-N Asn-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PDQBXRSOSCTGKY-ACZMJKKPSA-N 0.000 description 2
- NXVGBGZQQFDUTM-XVYDVKMFSA-N Asn-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N NXVGBGZQQFDUTM-XVYDVKMFSA-N 0.000 description 2
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 2
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 2
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 2
- VDCIPFYVCICPEC-FXQIFTODSA-N Asn-Arg-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O VDCIPFYVCICPEC-FXQIFTODSA-N 0.000 description 2
- MEFGKQUUYZOLHM-GMOBBJLQSA-N Asn-Arg-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MEFGKQUUYZOLHM-GMOBBJLQSA-N 0.000 description 2
- PIWWUBYJNONVTJ-ZLUOBGJFSA-N Asn-Asp-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N PIWWUBYJNONVTJ-ZLUOBGJFSA-N 0.000 description 2
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 2
- QISZHYWZHJRDAO-CIUDSAMLSA-N Asn-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N QISZHYWZHJRDAO-CIUDSAMLSA-N 0.000 description 2
- VYLVOMUVLMGCRF-ZLUOBGJFSA-N Asn-Asp-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VYLVOMUVLMGCRF-ZLUOBGJFSA-N 0.000 description 2
- XWFPGQVLOVGSLU-CIUDSAMLSA-N Asn-Gln-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XWFPGQVLOVGSLU-CIUDSAMLSA-N 0.000 description 2
- PQAIOUVVZCOLJK-FXQIFTODSA-N Asn-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PQAIOUVVZCOLJK-FXQIFTODSA-N 0.000 description 2
- NNMUHYLAYUSTTN-FXQIFTODSA-N Asn-Gln-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O NNMUHYLAYUSTTN-FXQIFTODSA-N 0.000 description 2
- FUHFYEKSGWOWGZ-XHNCKOQMSA-N Asn-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O FUHFYEKSGWOWGZ-XHNCKOQMSA-N 0.000 description 2
- SRUUBQBAVNQZGJ-LAEOZQHASA-N Asn-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SRUUBQBAVNQZGJ-LAEOZQHASA-N 0.000 description 2
- ASCGFDYEKSRNPL-CIUDSAMLSA-N Asn-Glu-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O ASCGFDYEKSRNPL-CIUDSAMLSA-N 0.000 description 2
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 2
- JQSWHKKUZMTOIH-QWRGUYRKSA-N Asn-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N JQSWHKKUZMTOIH-QWRGUYRKSA-N 0.000 description 2
- GJFYPBDMUGGLFR-NKWVEPMBSA-N Asn-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC(=O)N)N)C(=O)O GJFYPBDMUGGLFR-NKWVEPMBSA-N 0.000 description 2
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 2
- UYXXMIZGHYKYAT-NHCYSSNCSA-N Asn-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)N)N UYXXMIZGHYKYAT-NHCYSSNCSA-N 0.000 description 2
- ANPFQTJEPONRPL-UGYAYLCHSA-N Asn-Ile-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O ANPFQTJEPONRPL-UGYAYLCHSA-N 0.000 description 2
- KMCRKVOLRCOMBG-DJFWLOJKSA-N Asn-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KMCRKVOLRCOMBG-DJFWLOJKSA-N 0.000 description 2
- YYSYDIYQTUPNQQ-SXTJYALSSA-N Asn-Ile-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YYSYDIYQTUPNQQ-SXTJYALSSA-N 0.000 description 2
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 2
- ACKNRKFVYUVWAC-ZPFDUUQYSA-N Asn-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ACKNRKFVYUVWAC-ZPFDUUQYSA-N 0.000 description 2
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 2
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 2
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 2
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 2
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 2
- ALHMNHZJBYBYHS-DCAQKATOSA-N Asn-Lys-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ALHMNHZJBYBYHS-DCAQKATOSA-N 0.000 description 2
- FTSAJSADJCMDHH-CIUDSAMLSA-N Asn-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FTSAJSADJCMDHH-CIUDSAMLSA-N 0.000 description 2
- FODVBOKTYKYRFJ-CIUDSAMLSA-N Asn-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N FODVBOKTYKYRFJ-CIUDSAMLSA-N 0.000 description 2
- LZLCLRQMUQWUHJ-GUBZILKMSA-N Asn-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N LZLCLRQMUQWUHJ-GUBZILKMSA-N 0.000 description 2
- WXVGISRWSYGEDK-KKUMJFAQSA-N Asn-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N WXVGISRWSYGEDK-KKUMJFAQSA-N 0.000 description 2
- AYOAHKWVQLNPDM-HJGDQZAQSA-N Asn-Lys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AYOAHKWVQLNPDM-HJGDQZAQSA-N 0.000 description 2
- MYVBTYXSWILFCG-BQBZGAKWSA-N Asn-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N MYVBTYXSWILFCG-BQBZGAKWSA-N 0.000 description 2
- OROMFUQQTSWUTI-IHRRRGAJSA-N Asn-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OROMFUQQTSWUTI-IHRRRGAJSA-N 0.000 description 2
- ZVUMKOMKQCANOM-AVGNSLFASA-N Asn-Phe-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVUMKOMKQCANOM-AVGNSLFASA-N 0.000 description 2
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 2
- GFGUPLIETCNQGF-DCAQKATOSA-N Asn-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O GFGUPLIETCNQGF-DCAQKATOSA-N 0.000 description 2
- AWXDRZJQCVHCIT-DCAQKATOSA-N Asn-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O AWXDRZJQCVHCIT-DCAQKATOSA-N 0.000 description 2
- UWFOMGUWGPRVBW-GUBZILKMSA-N Asn-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)N)N UWFOMGUWGPRVBW-GUBZILKMSA-N 0.000 description 2
- IDUUACUJKUXKKD-VEVYYDQMSA-N Asn-Pro-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O IDUUACUJKUXKKD-VEVYYDQMSA-N 0.000 description 2
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 2
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 2
- DOURAOODTFJRIC-CIUDSAMLSA-N Asn-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N DOURAOODTFJRIC-CIUDSAMLSA-N 0.000 description 2
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 2
- NCXTYSVDWLAQGZ-ZKWXMUAHSA-N Asn-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O NCXTYSVDWLAQGZ-ZKWXMUAHSA-N 0.000 description 2
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 2
- HCZQKHSRYHCPSD-IUKAMOBKSA-N Asn-Thr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HCZQKHSRYHCPSD-IUKAMOBKSA-N 0.000 description 2
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 2
- KTDWFWNZLLFEFU-KKUMJFAQSA-N Asn-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O KTDWFWNZLLFEFU-KKUMJFAQSA-N 0.000 description 2
- ZAESWDKAMDVHLL-RCOVLWMOSA-N Asn-Val-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O ZAESWDKAMDVHLL-RCOVLWMOSA-N 0.000 description 2
- SYZWMVSXBZCOBZ-QXEWZRGKSA-N Asn-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N SYZWMVSXBZCOBZ-QXEWZRGKSA-N 0.000 description 2
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 2
- VTYQAQFKMQTKQD-ACZMJKKPSA-N Asp-Ala-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O VTYQAQFKMQTKQD-ACZMJKKPSA-N 0.000 description 2
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 2
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 2
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 2
- CASGONAXMZPHCK-FXQIFTODSA-N Asp-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N CASGONAXMZPHCK-FXQIFTODSA-N 0.000 description 2
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 2
- JGDBHIVECJGXJA-FXQIFTODSA-N Asp-Asp-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JGDBHIVECJGXJA-FXQIFTODSA-N 0.000 description 2
- QOVWVLLHMMCFFY-ZLUOBGJFSA-N Asp-Asp-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QOVWVLLHMMCFFY-ZLUOBGJFSA-N 0.000 description 2
- TVVYVAUGRHNTGT-UGYAYLCHSA-N Asp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O TVVYVAUGRHNTGT-UGYAYLCHSA-N 0.000 description 2
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 2
- BFOYULZBKYOKAN-OLHMAJIHSA-N Asp-Asp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFOYULZBKYOKAN-OLHMAJIHSA-N 0.000 description 2
- QQXOYLWJQUPXJU-WHFBIAKZSA-N Asp-Cys-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O QQXOYLWJQUPXJU-WHFBIAKZSA-N 0.000 description 2
- KVPHTGVUMJGMCX-BIIVOSGPSA-N Asp-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N)C(=O)O KVPHTGVUMJGMCX-BIIVOSGPSA-N 0.000 description 2
- VHQOCWWKXIOAQI-WDSKDSINSA-N Asp-Gln-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VHQOCWWKXIOAQI-WDSKDSINSA-N 0.000 description 2
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 2
- OVPHVTCDVYYTHN-AVGNSLFASA-N Asp-Glu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OVPHVTCDVYYTHN-AVGNSLFASA-N 0.000 description 2
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 2
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 2
- TZOZNVLBTAFJRW-UGYAYLCHSA-N Asp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N TZOZNVLBTAFJRW-UGYAYLCHSA-N 0.000 description 2
- QNFRBNZGVVKBNJ-PEFMBERDSA-N Asp-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N QNFRBNZGVVKBNJ-PEFMBERDSA-N 0.000 description 2
- SEMWSADZTMJELF-BYULHYEWSA-N Asp-Ile-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O SEMWSADZTMJELF-BYULHYEWSA-N 0.000 description 2
- MFTVXYMXSAQZNL-DJFWLOJKSA-N Asp-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)O)N MFTVXYMXSAQZNL-DJFWLOJKSA-N 0.000 description 2
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 2
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 2
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 2
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 2
- KFAFUJMGHVVYRC-DCAQKATOSA-N Asp-Leu-Met Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O KFAFUJMGHVVYRC-DCAQKATOSA-N 0.000 description 2
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 2
- FQHBAQLBIXLWAG-DCAQKATOSA-N Asp-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N FQHBAQLBIXLWAG-DCAQKATOSA-N 0.000 description 2
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 2
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 2
- HXVILZUZXFLVEN-DCAQKATOSA-N Asp-Met-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O HXVILZUZXFLVEN-DCAQKATOSA-N 0.000 description 2
- QTIZKMMLNUMHHU-DCAQKATOSA-N Asp-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QTIZKMMLNUMHHU-DCAQKATOSA-N 0.000 description 2
- UAXIKORUDGGIGA-DCAQKATOSA-N Asp-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O UAXIKORUDGGIGA-DCAQKATOSA-N 0.000 description 2
- GGRSYTUJHAZTFN-IHRRRGAJSA-N Asp-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O GGRSYTUJHAZTFN-IHRRRGAJSA-N 0.000 description 2
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 2
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 description 2
- ZVGRHIRJLWBWGJ-ACZMJKKPSA-N Asp-Ser-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVGRHIRJLWBWGJ-ACZMJKKPSA-N 0.000 description 2
- VNXQRBXEQXLERQ-CIUDSAMLSA-N Asp-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N VNXQRBXEQXLERQ-CIUDSAMLSA-N 0.000 description 2
- XYPJXLLXNSAWHZ-SRVKXCTJSA-N Asp-Ser-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XYPJXLLXNSAWHZ-SRVKXCTJSA-N 0.000 description 2
- QOCFFCUFZGDHTP-NUMRIWBASA-N Asp-Thr-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QOCFFCUFZGDHTP-NUMRIWBASA-N 0.000 description 2
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 description 2
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 2
- YUELDQUPTAYEGM-XIRDDKMYSA-N Asp-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)O)N YUELDQUPTAYEGM-XIRDDKMYSA-N 0.000 description 2
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 2
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 2
- 101000781117 Autographa californica nuclear polyhedrosis virus Uncharacterized 12.4 kDa protein in CTL-LEF2 intergenic region Proteins 0.000 description 2
- 101000708323 Azospirillum brasilense Uncharacterized 28.8 kDa protein in nifR3-like 5'region Proteins 0.000 description 2
- 101000770311 Azotobacter chroococcum mcd 1 Uncharacterized 19.8 kDa protein in nifW 5'region Proteins 0.000 description 2
- 101000748761 Bacillus subtilis (strain 168) Uncharacterized MFS-type transporter YcxA Proteins 0.000 description 2
- 101000765620 Bacillus subtilis (strain 168) Uncharacterized protein YlxP Proteins 0.000 description 2
- 101000916134 Bacillus subtilis (strain 168) Uncharacterized protein YqxJ Proteins 0.000 description 2
- 101000754349 Bordetella pertussis (strain Tohama I / ATCC BAA-589 / NCTC 13251) UPF0065 protein BP0148 Proteins 0.000 description 2
- 101100512078 Caenorhabditis elegans lys-1 gene Proteins 0.000 description 2
- 101000827633 Caldicellulosiruptor sp. (strain Rt8B.4) Uncharacterized 23.9 kDa protein in xynA 3'region Proteins 0.000 description 2
- CURLTUGMZLYLDI-UHFFFAOYSA-N Carbon dioxide Chemical compound O=C=O CURLTUGMZLYLDI-UHFFFAOYSA-N 0.000 description 2
- 101000947628 Claviceps purpurea Uncharacterized 11.8 kDa protein Proteins 0.000 description 2
- 101000686796 Clostridium perfringens Replication protein Proteins 0.000 description 2
- TVYMKYUSZSVOAG-ZLUOBGJFSA-N Cys-Ala-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O TVYMKYUSZSVOAG-ZLUOBGJFSA-N 0.000 description 2
- AEJSNWMRPXAKCW-WHFBIAKZSA-N Cys-Ala-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AEJSNWMRPXAKCW-WHFBIAKZSA-N 0.000 description 2
- PRXCTTWKGJAPMT-ZLUOBGJFSA-N Cys-Ala-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O PRXCTTWKGJAPMT-ZLUOBGJFSA-N 0.000 description 2
- OIMUAKUQOUEPCZ-WHFBIAKZSA-N Cys-Asn-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIMUAKUQOUEPCZ-WHFBIAKZSA-N 0.000 description 2
- KOHBWQDSVCARMI-BWBBJGPYSA-N Cys-Cys-Thr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KOHBWQDSVCARMI-BWBBJGPYSA-N 0.000 description 2
- CFQVGYWKSLKWFX-KBIXCLLPSA-N Cys-Glu-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CFQVGYWKSLKWFX-KBIXCLLPSA-N 0.000 description 2
- GUKYYUFHWYRMEU-WHFBIAKZSA-N Cys-Gly-Asp Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O GUKYYUFHWYRMEU-WHFBIAKZSA-N 0.000 description 2
- DZLQXIFVQFTFJY-BYPYZUCNSA-N Cys-Gly-Gly Chemical compound SC[C@H](N)C(=O)NCC(=O)NCC(O)=O DZLQXIFVQFTFJY-BYPYZUCNSA-N 0.000 description 2
- XVLMKWWVBNESPX-XVYDVKMFSA-N Cys-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N XVLMKWWVBNESPX-XVYDVKMFSA-N 0.000 description 2
- OZSBRCONEMXYOJ-AVGNSLFASA-N Cys-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N OZSBRCONEMXYOJ-AVGNSLFASA-N 0.000 description 2
- VCPHQVQGVSKDHY-FXQIFTODSA-N Cys-Ser-Met Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O VCPHQVQGVSKDHY-FXQIFTODSA-N 0.000 description 2
- NDNZRWUDUMTITL-FXQIFTODSA-N Cys-Ser-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NDNZRWUDUMTITL-FXQIFTODSA-N 0.000 description 2
- YWEHYKGJWHPGPY-XGEHTFHBSA-N Cys-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N)O YWEHYKGJWHPGPY-XGEHTFHBSA-N 0.000 description 2
- SAEVTQWAYDPXMU-KATARQTJSA-N Cys-Thr-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O SAEVTQWAYDPXMU-KATARQTJSA-N 0.000 description 2
- DGQJGBDBFVGLGL-ZKWXMUAHSA-N Cys-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N DGQJGBDBFVGLGL-ZKWXMUAHSA-N 0.000 description 2
- 108010090461 DFG peptide Proteins 0.000 description 2
- 230000003682 DNA packaging effect Effects 0.000 description 2
- 101000788129 Escherichia coli Uncharacterized protein in sul1 3'region Proteins 0.000 description 2
- 101000788370 Escherichia phage P2 Uncharacterized 12.9 kDa protein in GpA 3'region Proteins 0.000 description 2
- 241001123946 Gaga Species 0.000 description 2
- 101000787096 Geobacillus stearothermophilus Uncharacterized protein in gldA 3'region Proteins 0.000 description 2
- REJJNXODKSHOKA-ACZMJKKPSA-N Gln-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N REJJNXODKSHOKA-ACZMJKKPSA-N 0.000 description 2
- NNQHEEQNPQYPGL-FXQIFTODSA-N Gln-Ala-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NNQHEEQNPQYPGL-FXQIFTODSA-N 0.000 description 2
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 2
- OYTPNWYZORARHL-XHNCKOQMSA-N Gln-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N OYTPNWYZORARHL-XHNCKOQMSA-N 0.000 description 2
- KZKBJEUWNMQTLV-XDTLVQLUSA-N Gln-Ala-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZKBJEUWNMQTLV-XDTLVQLUSA-N 0.000 description 2
- LJEPDHWNQXPXMM-NHCYSSNCSA-N Gln-Arg-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O LJEPDHWNQXPXMM-NHCYSSNCSA-N 0.000 description 2
- TWHDOEYLXXQYOZ-FXQIFTODSA-N Gln-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N TWHDOEYLXXQYOZ-FXQIFTODSA-N 0.000 description 2
- BTSPOOHJBYJRKO-CIUDSAMLSA-N Gln-Asp-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BTSPOOHJBYJRKO-CIUDSAMLSA-N 0.000 description 2
- IKDOHQHEFPPGJG-FXQIFTODSA-N Gln-Asp-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IKDOHQHEFPPGJG-FXQIFTODSA-N 0.000 description 2
- SXIJQMBEVYWAQT-GUBZILKMSA-N Gln-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXIJQMBEVYWAQT-GUBZILKMSA-N 0.000 description 2
- CXFUMJQFZVCETK-FXQIFTODSA-N Gln-Cys-Gln Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O CXFUMJQFZVCETK-FXQIFTODSA-N 0.000 description 2
- AJDMYLOISOCHHC-YVNDNENWSA-N Gln-Gln-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AJDMYLOISOCHHC-YVNDNENWSA-N 0.000 description 2
- LVNILKSSFHCSJZ-IHRRRGAJSA-N Gln-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LVNILKSSFHCSJZ-IHRRRGAJSA-N 0.000 description 2
- UFNSPPFJOHNXRE-AUTRQRHGSA-N Gln-Gln-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UFNSPPFJOHNXRE-AUTRQRHGSA-N 0.000 description 2
- ZNZPKVQURDQFFS-FXQIFTODSA-N Gln-Glu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZNZPKVQURDQFFS-FXQIFTODSA-N 0.000 description 2
- NSNUZSPSADIMJQ-WDSKDSINSA-N Gln-Gly-Asp Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NSNUZSPSADIMJQ-WDSKDSINSA-N 0.000 description 2
- GNMQDOGFWYWPNM-LAEOZQHASA-N Gln-Gly-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)CNC(=O)[C@@H](N)CCC(N)=O)C(O)=O GNMQDOGFWYWPNM-LAEOZQHASA-N 0.000 description 2
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 2
- JXFLPKSDLDEOQK-JHEQGTHGSA-N Gln-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O JXFLPKSDLDEOQK-JHEQGTHGSA-N 0.000 description 2
- NXPXQIZKDOXIHH-JSGCOSHPSA-N Gln-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N NXPXQIZKDOXIHH-JSGCOSHPSA-N 0.000 description 2
- ORYMMTRPKVTGSJ-XVKPBYJWSA-N Gln-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O ORYMMTRPKVTGSJ-XVKPBYJWSA-N 0.000 description 2
- PODFFOWWLUPNMN-DCAQKATOSA-N Gln-His-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O PODFFOWWLUPNMN-DCAQKATOSA-N 0.000 description 2
- ICDIMQAMJGDHSE-GUBZILKMSA-N Gln-His-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O ICDIMQAMJGDHSE-GUBZILKMSA-N 0.000 description 2
- HXOLDXKNWKLDMM-YVNDNENWSA-N Gln-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HXOLDXKNWKLDMM-YVNDNENWSA-N 0.000 description 2
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 2
- TYRMVTKPOWPZBC-SXNHZJKMSA-N Gln-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCC(=O)N)N TYRMVTKPOWPZBC-SXNHZJKMSA-N 0.000 description 2
- VZRAXPGTUNDIDK-GUBZILKMSA-N Gln-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VZRAXPGTUNDIDK-GUBZILKMSA-N 0.000 description 2
- VUVKKXPCKILIBD-AVGNSLFASA-N Gln-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VUVKKXPCKILIBD-AVGNSLFASA-N 0.000 description 2
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 2
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 2
- UWKPRVKWEKEMSY-DCAQKATOSA-N Gln-Lys-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWKPRVKWEKEMSY-DCAQKATOSA-N 0.000 description 2
- XZLLTYBONVKGLO-SDDRHHMPSA-N Gln-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XZLLTYBONVKGLO-SDDRHHMPSA-N 0.000 description 2
- CELXWPDNIGWCJN-WDCWCFNPSA-N Gln-Lys-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CELXWPDNIGWCJN-WDCWCFNPSA-N 0.000 description 2
- LUGUNEGJNDEBLU-DCAQKATOSA-N Gln-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LUGUNEGJNDEBLU-DCAQKATOSA-N 0.000 description 2
- BZULIEARJFRINC-IHRRRGAJSA-N Gln-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N BZULIEARJFRINC-IHRRRGAJSA-N 0.000 description 2
- KFHASAPTUOASQN-JYJNAYRXSA-N Gln-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KFHASAPTUOASQN-JYJNAYRXSA-N 0.000 description 2
- HMIXCETWRYDVMO-GUBZILKMSA-N Gln-Pro-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O HMIXCETWRYDVMO-GUBZILKMSA-N 0.000 description 2
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 2
- MQJDLNRXBOELJW-KKUMJFAQSA-N Gln-Pro-Phe Chemical compound N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O MQJDLNRXBOELJW-KKUMJFAQSA-N 0.000 description 2
- UTOQQOMEJDPDMX-ACZMJKKPSA-N Gln-Ser-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O UTOQQOMEJDPDMX-ACZMJKKPSA-N 0.000 description 2
- LGWNISYVKDNJRP-FXQIFTODSA-N Gln-Ser-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGWNISYVKDNJRP-FXQIFTODSA-N 0.000 description 2
- SXFPZRRVWSUYII-KBIXCLLPSA-N Gln-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N SXFPZRRVWSUYII-KBIXCLLPSA-N 0.000 description 2
- ZGHMRONFHDVXEF-AVGNSLFASA-N Gln-Ser-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZGHMRONFHDVXEF-AVGNSLFASA-N 0.000 description 2
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 2
- UXXIVIQGOODKQC-NUMRIWBASA-N Gln-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UXXIVIQGOODKQC-NUMRIWBASA-N 0.000 description 2
- XKPACHRGOWQHFH-IRIUXVKKSA-N Gln-Thr-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XKPACHRGOWQHFH-IRIUXVKKSA-N 0.000 description 2
- RNPGPFAVRLERPP-QEJZJMRPSA-N Gln-Trp-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O RNPGPFAVRLERPP-QEJZJMRPSA-N 0.000 description 2
- OACQOWPRWGNKTP-AVGNSLFASA-N Gln-Tyr-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O OACQOWPRWGNKTP-AVGNSLFASA-N 0.000 description 2
- SGVGIVDZLSHSEN-RYUDHWBXSA-N Gln-Tyr-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O SGVGIVDZLSHSEN-RYUDHWBXSA-N 0.000 description 2
- UQKVUFGUSVYJMQ-IRIUXVKKSA-N Gln-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)N)N)O UQKVUFGUSVYJMQ-IRIUXVKKSA-N 0.000 description 2
- ICRKQMRFXYDYMK-LAEOZQHASA-N Gln-Val-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ICRKQMRFXYDYMK-LAEOZQHASA-N 0.000 description 2
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 2
- HNAUFGBKJLTWQE-IFFSRLJSSA-N Gln-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N)O HNAUFGBKJLTWQE-IFFSRLJSSA-N 0.000 description 2
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 2
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 2
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 2
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 2
- SBYVDRJAXWSXQL-AVGNSLFASA-N Glu-Asn-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SBYVDRJAXWSXQL-AVGNSLFASA-N 0.000 description 2
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 2
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 2
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 2
- RTOOAKXIJADOLL-GUBZILKMSA-N Glu-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N RTOOAKXIJADOLL-GUBZILKMSA-N 0.000 description 2
- RFDHKPSHTXZKLL-IHRRRGAJSA-N Glu-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N RFDHKPSHTXZKLL-IHRRRGAJSA-N 0.000 description 2
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 2
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 2
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 2
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 2
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 2
- BUAKRRKDHSSIKK-IHRRRGAJSA-N Glu-Glu-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BUAKRRKDHSSIKK-IHRRRGAJSA-N 0.000 description 2
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 2
- LYCDZGLXQBPNQU-WDSKDSINSA-N Glu-Gly-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O LYCDZGLXQBPNQU-WDSKDSINSA-N 0.000 description 2
- CAVMESABQIKFKT-IUCAKERBSA-N Glu-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N CAVMESABQIKFKT-IUCAKERBSA-N 0.000 description 2
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 2
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 2
- DVLZZEPUNFEUBW-AVGNSLFASA-N Glu-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N DVLZZEPUNFEUBW-AVGNSLFASA-N 0.000 description 2
- CXRWMMRLEMVSEH-PEFMBERDSA-N Glu-Ile-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CXRWMMRLEMVSEH-PEFMBERDSA-N 0.000 description 2
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 2
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 2
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 2
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 2
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 2
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 2
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 2
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 description 2
- AOCARQDSFTWWFT-DCAQKATOSA-N Glu-Met-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AOCARQDSFTWWFT-DCAQKATOSA-N 0.000 description 2
- MCGNJCNXIMQCMN-DCAQKATOSA-N Glu-Met-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCC(O)=O MCGNJCNXIMQCMN-DCAQKATOSA-N 0.000 description 2
- JZJGEKDPWVJOLD-QEWYBTABSA-N Glu-Phe-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JZJGEKDPWVJOLD-QEWYBTABSA-N 0.000 description 2
- ITVBKCZZLJUUHI-HTUGSXCWSA-N Glu-Phe-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ITVBKCZZLJUUHI-HTUGSXCWSA-N 0.000 description 2
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 2
- JYXKPJVDCAWMDG-ZPFDUUQYSA-N Glu-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N JYXKPJVDCAWMDG-ZPFDUUQYSA-N 0.000 description 2
- BFEZQZKEPRKKHV-SRVKXCTJSA-N Glu-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O BFEZQZKEPRKKHV-SRVKXCTJSA-N 0.000 description 2
- ALMBZBOCGSVSAI-ACZMJKKPSA-N Glu-Ser-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ALMBZBOCGSVSAI-ACZMJKKPSA-N 0.000 description 2
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 2
- TWYSSILQABLLME-HJGDQZAQSA-N Glu-Thr-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYSSILQABLLME-HJGDQZAQSA-N 0.000 description 2
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 2
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 2
- ZGXGVBYEJGVJMV-HJGDQZAQSA-N Glu-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O ZGXGVBYEJGVJMV-HJGDQZAQSA-N 0.000 description 2
- JDAYMLXPUJRSDJ-XIRDDKMYSA-N Glu-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 JDAYMLXPUJRSDJ-XIRDDKMYSA-N 0.000 description 2
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 2
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 2
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 2
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 2
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 2
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 2
- QIZJOTQTCAGKPU-KWQFWETISA-N Gly-Ala-Tyr Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 QIZJOTQTCAGKPU-KWQFWETISA-N 0.000 description 2
- OGCIHJPYKVSMTE-YUMQZZPRSA-N Gly-Arg-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OGCIHJPYKVSMTE-YUMQZZPRSA-N 0.000 description 2
- KRRMJKMGWWXWDW-STQMWFEESA-N Gly-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KRRMJKMGWWXWDW-STQMWFEESA-N 0.000 description 2
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 2
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 2
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 2
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 description 2
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 2
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 2
- VUUOMYFPWDYETE-WDSKDSINSA-N Gly-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN VUUOMYFPWDYETE-WDSKDSINSA-N 0.000 description 2
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 2
- QPDUVFSVVAOUHE-XVKPBYJWSA-N Gly-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)CN)C(O)=O QPDUVFSVVAOUHE-XVKPBYJWSA-N 0.000 description 2
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 2
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 2
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 2
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 2
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 2
- IDOGEHIWMJMAHT-BYPYZUCNSA-N Gly-Gly-Cys Chemical compound NCC(=O)NCC(=O)N[C@@H](CS)C(O)=O IDOGEHIWMJMAHT-BYPYZUCNSA-N 0.000 description 2
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 2
- PDAWDNVHMUKWJR-ZETCQYMHSA-N Gly-Gly-His Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 PDAWDNVHMUKWJR-ZETCQYMHSA-N 0.000 description 2
- IVSWQHKONQIOHA-YUMQZZPRSA-N Gly-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN IVSWQHKONQIOHA-YUMQZZPRSA-N 0.000 description 2
- LPCKHUXOGVNZRS-YUMQZZPRSA-N Gly-His-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O LPCKHUXOGVNZRS-YUMQZZPRSA-N 0.000 description 2
- QSVMIMFAAZPCAQ-PMVVWTBXSA-N Gly-His-Thr Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QSVMIMFAAZPCAQ-PMVVWTBXSA-N 0.000 description 2
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 2
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 2
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 2
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 2
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 2
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 2
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 2
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 2
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 2
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 2
- DBJYVKDPGIFXFO-BQBZGAKWSA-N Gly-Met-Ala Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O DBJYVKDPGIFXFO-BQBZGAKWSA-N 0.000 description 2
- ZWRDOVYMQAAISL-UWVGGRQHSA-N Gly-Met-Lys Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCCN ZWRDOVYMQAAISL-UWVGGRQHSA-N 0.000 description 2
- FJWSJWACLMTDMI-WPRPVWTQSA-N Gly-Met-Val Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O FJWSJWACLMTDMI-WPRPVWTQSA-N 0.000 description 2
- WMGHDYWNHNLGBV-ONGXEEELSA-N Gly-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 WMGHDYWNHNLGBV-ONGXEEELSA-N 0.000 description 2
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 2
- JJGBXTYGTKWGAT-YUMQZZPRSA-N Gly-Pro-Glu Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O JJGBXTYGTKWGAT-YUMQZZPRSA-N 0.000 description 2
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 2
- GAAHQHNCMIAYEX-UWVGGRQHSA-N Gly-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GAAHQHNCMIAYEX-UWVGGRQHSA-N 0.000 description 2
- GLACUWHUYFBSPJ-FJXKBIBVSA-N Gly-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GLACUWHUYFBSPJ-FJXKBIBVSA-N 0.000 description 2
- ISSDODCYBOWWIP-GJZGRUSLSA-N Gly-Pro-Trp Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ISSDODCYBOWWIP-GJZGRUSLSA-N 0.000 description 2
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 2
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 2
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 2
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 2
- FKESCSGWBPUTPN-FOHZUACHSA-N Gly-Thr-Asn Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O FKESCSGWBPUTPN-FOHZUACHSA-N 0.000 description 2
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 2
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 2
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 2
- PNUFMLXHOLFRLD-KBPBESRZSA-N Gly-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 PNUFMLXHOLFRLD-KBPBESRZSA-N 0.000 description 2
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 2
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 2
- NGRPGJGKJMUGDM-XVKPBYJWSA-N Gly-Val-Gln Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NGRPGJGKJMUGDM-XVKPBYJWSA-N 0.000 description 2
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 2
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 2
- 101000976889 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 19.2 kDa protein in cox-rep intergenic region Proteins 0.000 description 2
- BIAKMWKJMQLZOJ-ZKWXMUAHSA-N His-Ala-Ala Chemical compound C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O BIAKMWKJMQLZOJ-ZKWXMUAHSA-N 0.000 description 2
- KZTLOHBDLMIFSH-XVYDVKMFSA-N His-Ala-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O KZTLOHBDLMIFSH-XVYDVKMFSA-N 0.000 description 2
- MAABHGXCIBEYQR-XVYDVKMFSA-N His-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N MAABHGXCIBEYQR-XVYDVKMFSA-N 0.000 description 2
- HVCRQRQPIIRNLY-IUCAKERBSA-N His-Gln-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N HVCRQRQPIIRNLY-IUCAKERBSA-N 0.000 description 2
- VHHYJBSXXMPQGZ-AVGNSLFASA-N His-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N VHHYJBSXXMPQGZ-AVGNSLFASA-N 0.000 description 2
- FMRKUXFLLPKVPG-JYJNAYRXSA-N His-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)O FMRKUXFLLPKVPG-JYJNAYRXSA-N 0.000 description 2
- DGYNAJNQMBFYIF-SZMVWBNQSA-N His-Glu-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CN=CN1 DGYNAJNQMBFYIF-SZMVWBNQSA-N 0.000 description 2
- PGTISAJTWZPFGN-PEXQALLHSA-N His-Gly-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O PGTISAJTWZPFGN-PEXQALLHSA-N 0.000 description 2
- BDFCIKANUNMFGB-PMVVWTBXSA-N His-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 BDFCIKANUNMFGB-PMVVWTBXSA-N 0.000 description 2
- MLZVJIREOKTDAR-SIGLWIIPSA-N His-Ile-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MLZVJIREOKTDAR-SIGLWIIPSA-N 0.000 description 2
- VYUXYMRNGALHEA-DLOVCJGASA-N His-Leu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O VYUXYMRNGALHEA-DLOVCJGASA-N 0.000 description 2
- LVXFNTIIGOQBMD-SRVKXCTJSA-N His-Leu-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O LVXFNTIIGOQBMD-SRVKXCTJSA-N 0.000 description 2
- BSVLMPMIXPQNKC-KBPBESRZSA-N His-Phe-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O BSVLMPMIXPQNKC-KBPBESRZSA-N 0.000 description 2
- SOYCWSKCUVDLMC-AVGNSLFASA-N His-Pro-Arg Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CCCNC(=N)N)C(=O)O SOYCWSKCUVDLMC-AVGNSLFASA-N 0.000 description 2
- FHKZHRMERJUXRJ-DCAQKATOSA-N His-Ser-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 FHKZHRMERJUXRJ-DCAQKATOSA-N 0.000 description 2
- ZHHLTWUOWXHVQJ-YUMQZZPRSA-N His-Ser-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZHHLTWUOWXHVQJ-YUMQZZPRSA-N 0.000 description 2
- PZAJPILZRFPYJJ-SRVKXCTJSA-N His-Ser-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O PZAJPILZRFPYJJ-SRVKXCTJSA-N 0.000 description 2
- GIRSNERMXCMDBO-GARJFASQSA-N His-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O GIRSNERMXCMDBO-GARJFASQSA-N 0.000 description 2
- BRQKGRLDDDQWQJ-MBLNEYKQSA-N His-Thr-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O BRQKGRLDDDQWQJ-MBLNEYKQSA-N 0.000 description 2
- FCPSGEVYIVXPPO-QTKMDUPCSA-N His-Thr-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FCPSGEVYIVXPPO-QTKMDUPCSA-N 0.000 description 2
- DQZCEKQPSOBNMJ-NKIYYHGXSA-N His-Thr-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DQZCEKQPSOBNMJ-NKIYYHGXSA-N 0.000 description 2
- MRVZCDSYLJXKKX-ACRUOGEOSA-N His-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CN=CN3)N MRVZCDSYLJXKKX-ACRUOGEOSA-N 0.000 description 2
- QLBXWYXMLHAREM-PYJNHQTQSA-N His-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N QLBXWYXMLHAREM-PYJNHQTQSA-N 0.000 description 2
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 2
- WUEIUSDAECDLQO-NAKRPEOUSA-N Ile-Ala-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)O)N WUEIUSDAECDLQO-NAKRPEOUSA-N 0.000 description 2
- QTUSJASXLGLJSR-OSUNSFLBSA-N Ile-Arg-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N QTUSJASXLGLJSR-OSUNSFLBSA-N 0.000 description 2
- AZEYWPUCOYXFOE-CYDGBPFRSA-N Ile-Arg-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N AZEYWPUCOYXFOE-CYDGBPFRSA-N 0.000 description 2
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 2
- ZZHGKECPZXPXJF-PCBIJLKTSA-N Ile-Asn-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZZHGKECPZXPXJF-PCBIJLKTSA-N 0.000 description 2
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 2
- HDODQNPMSHDXJT-GHCJXIJMSA-N Ile-Asn-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O HDODQNPMSHDXJT-GHCJXIJMSA-N 0.000 description 2
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 2
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 description 2
- REJKOQYVFDEZHA-SLBDDTMCSA-N Ile-Asp-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N REJKOQYVFDEZHA-SLBDDTMCSA-N 0.000 description 2
- VQUCKIAECLVLAD-SVSWQMSJSA-N Ile-Cys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VQUCKIAECLVLAD-SVSWQMSJSA-N 0.000 description 2
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 2
- HTDRTKMNJRRYOJ-SIUGBPQLSA-N Ile-Gln-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HTDRTKMNJRRYOJ-SIUGBPQLSA-N 0.000 description 2
- QHGBCRCMBCWMBJ-UHFFFAOYSA-N Ile-Glu-Ala-Lys Natural products CCC(C)C(N)C(=O)NC(CCC(O)=O)C(=O)NC(C)C(=O)NC(C(O)=O)CCCCN QHGBCRCMBCWMBJ-UHFFFAOYSA-N 0.000 description 2
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 2
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 2
- LGMUPVWZEYYUMU-YVNDNENWSA-N Ile-Glu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LGMUPVWZEYYUMU-YVNDNENWSA-N 0.000 description 2
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 2
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 2
- XLCZWMJPVGRWHJ-KQXIARHKSA-N Ile-Glu-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N XLCZWMJPVGRWHJ-KQXIARHKSA-N 0.000 description 2
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 2
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 description 2
- LNJLOZYNZFGJMM-DEQVHRJGSA-N Ile-His-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N LNJLOZYNZFGJMM-DEQVHRJGSA-N 0.000 description 2
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 2
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 2
- YGDWPQCLFJNMOL-MNXVOIDGSA-N Ile-Leu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YGDWPQCLFJNMOL-MNXVOIDGSA-N 0.000 description 2
- FCWFBHMAJZGWRY-XUXIUFHCSA-N Ile-Leu-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N FCWFBHMAJZGWRY-XUXIUFHCSA-N 0.000 description 2
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 2
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 2
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 2
- XDUVMJCBYUKNFJ-MXAVVETBSA-N Ile-Lys-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N XDUVMJCBYUKNFJ-MXAVVETBSA-N 0.000 description 2
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 2
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 2
- UOPBQSJRBONRON-STECZYCISA-N Ile-Met-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UOPBQSJRBONRON-STECZYCISA-N 0.000 description 2
- XMYURPUVJSKTMC-KBIXCLLPSA-N Ile-Ser-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XMYURPUVJSKTMC-KBIXCLLPSA-N 0.000 description 2
- SHVFUCSSACPBTF-VGDYDELISA-N Ile-Ser-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SHVFUCSSACPBTF-VGDYDELISA-N 0.000 description 2
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 2
- HJDZMPFEXINXLO-QPHKQPEJSA-N Ile-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N HJDZMPFEXINXLO-QPHKQPEJSA-N 0.000 description 2
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 2
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 2
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 2
- NURNJECQNNCRBK-FLBSBUHZSA-N Ile-Thr-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NURNJECQNNCRBK-FLBSBUHZSA-N 0.000 description 2
- BZUOLKFQVVBTJY-SLBDDTMCSA-N Ile-Trp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BZUOLKFQVVBTJY-SLBDDTMCSA-N 0.000 description 2
- DTPGSUQHUMELQB-GVARAGBVSA-N Ile-Tyr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 DTPGSUQHUMELQB-GVARAGBVSA-N 0.000 description 2
- RMJWFINHACYKJI-SIUGBPQLSA-N Ile-Tyr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RMJWFINHACYKJI-SIUGBPQLSA-N 0.000 description 2
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 2
- 101000827627 Klebsiella pneumoniae Putative low molecular weight protein-tyrosine-phosphatase Proteins 0.000 description 2
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 2
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 2
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 2
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 2
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 2
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 2
- RFUBXQQFJFGJFV-GUBZILKMSA-N Leu-Asn-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RFUBXQQFJFGJFV-GUBZILKMSA-N 0.000 description 2
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 description 2
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 2
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 2
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 2
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 2
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 2
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 2
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 2
- FGNQZXKVAZIMCI-CIUDSAMLSA-N Leu-Asp-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N FGNQZXKVAZIMCI-CIUDSAMLSA-N 0.000 description 2
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 2
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 2
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 2
- PNUCWVAGVNLUMW-CIUDSAMLSA-N Leu-Cys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O PNUCWVAGVNLUMW-CIUDSAMLSA-N 0.000 description 2
- GLBNEGIOFRVRHO-JYJNAYRXSA-N Leu-Gln-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLBNEGIOFRVRHO-JYJNAYRXSA-N 0.000 description 2
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 2
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 2
- WMTOVWLLDGQGCV-GUBZILKMSA-N Leu-Glu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N WMTOVWLLDGQGCV-GUBZILKMSA-N 0.000 description 2
- PRZVBIAOPFGAQF-SRVKXCTJSA-N Leu-Glu-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O PRZVBIAOPFGAQF-SRVKXCTJSA-N 0.000 description 2
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 2
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 2
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 2
- VBZOAGIPCULURB-QWRGUYRKSA-N Leu-Gly-His Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VBZOAGIPCULURB-QWRGUYRKSA-N 0.000 description 2
- BKTXKJMNTSMJDQ-AVGNSLFASA-N Leu-His-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BKTXKJMNTSMJDQ-AVGNSLFASA-N 0.000 description 2
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 2
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 2
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 2
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 2
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 2
- QLDHBYRUNQZIJQ-DKIMLUQUSA-N Leu-Ile-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QLDHBYRUNQZIJQ-DKIMLUQUSA-N 0.000 description 2
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 2
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 2
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 2
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 2
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 2
- REPBGZHJKYWFMJ-KKUMJFAQSA-N Leu-Lys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N REPBGZHJKYWFMJ-KKUMJFAQSA-N 0.000 description 2
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 2
- VVQJGYPTIYOFBR-IHRRRGAJSA-N Leu-Lys-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N VVQJGYPTIYOFBR-IHRRRGAJSA-N 0.000 description 2
- CPONGMJGVIAWEH-DCAQKATOSA-N Leu-Met-Ala Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O CPONGMJGVIAWEH-DCAQKATOSA-N 0.000 description 2
- DDVHDMSBLRAKNV-IHRRRGAJSA-N Leu-Met-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O DDVHDMSBLRAKNV-IHRRRGAJSA-N 0.000 description 2
- ZDBMWELMUCLUPL-QEJZJMRPSA-N Leu-Phe-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ZDBMWELMUCLUPL-QEJZJMRPSA-N 0.000 description 2
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 2
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 2
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 2
- YUTNOGOMBNYPFH-XUXIUFHCSA-N Leu-Pro-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YUTNOGOMBNYPFH-XUXIUFHCSA-N 0.000 description 2
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 2
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 2
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 2
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 2
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 2
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 2
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 2
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 2
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 2
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 2
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 2
- CNWDWAMPKVYJJB-NUTKFTJISA-N Leu-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CNWDWAMPKVYJJB-NUTKFTJISA-N 0.000 description 2
- LSLUTXRANSUGFY-XIRDDKMYSA-N Leu-Trp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O LSLUTXRANSUGFY-XIRDDKMYSA-N 0.000 description 2
- RIHIGSWBLHSGLV-CQDKDKBSSA-N Leu-Tyr-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O RIHIGSWBLHSGLV-CQDKDKBSSA-N 0.000 description 2
- UCRJTSIIAYHOHE-ULQDDVLXSA-N Leu-Tyr-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UCRJTSIIAYHOHE-ULQDDVLXSA-N 0.000 description 2
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 2
- AXVIGSRGTMNSJU-YESZJQIVSA-N Leu-Tyr-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N AXVIGSRGTMNSJU-YESZJQIVSA-N 0.000 description 2
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 2
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 2
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 2
- LMDVGHQPPPLYAR-IHRRRGAJSA-N Leu-Val-His Chemical compound N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O LMDVGHQPPPLYAR-IHRRRGAJSA-N 0.000 description 2
- 239000006142 Luria-Bertani Agar Substances 0.000 description 2
- MPOHDJKRBLVGCT-CIUDSAMLSA-N Lys-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N MPOHDJKRBLVGCT-CIUDSAMLSA-N 0.000 description 2
- JCFYLFOCALSNLQ-GUBZILKMSA-N Lys-Ala-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JCFYLFOCALSNLQ-GUBZILKMSA-N 0.000 description 2
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 2
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 2
- GGAPIOORBXHMNY-ULQDDVLXSA-N Lys-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)O GGAPIOORBXHMNY-ULQDDVLXSA-N 0.000 description 2
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 2
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 2
- HIIZIQUUHIXUJY-GUBZILKMSA-N Lys-Asp-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HIIZIQUUHIXUJY-GUBZILKMSA-N 0.000 description 2
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 2
- LXNPMPIQDNSMTA-AVGNSLFASA-N Lys-Gln-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 LXNPMPIQDNSMTA-AVGNSLFASA-N 0.000 description 2
- NNCDAORZCMPZPX-GUBZILKMSA-N Lys-Gln-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N NNCDAORZCMPZPX-GUBZILKMSA-N 0.000 description 2
- HEWWNLVEWBJBKA-WDCWCFNPSA-N Lys-Gln-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN HEWWNLVEWBJBKA-WDCWCFNPSA-N 0.000 description 2
- NDORZBUHCOJQDO-GVXVVHGQSA-N Lys-Gln-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O NDORZBUHCOJQDO-GVXVVHGQSA-N 0.000 description 2
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 2
- PAMDBWYMLWOELY-SDDRHHMPSA-N Lys-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O PAMDBWYMLWOELY-SDDRHHMPSA-N 0.000 description 2
- VEGLGAOVLFODGC-GUBZILKMSA-N Lys-Glu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VEGLGAOVLFODGC-GUBZILKMSA-N 0.000 description 2
- JZMGVXLDOQOKAH-UWVGGRQHSA-N Lys-Gly-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O JZMGVXLDOQOKAH-UWVGGRQHSA-N 0.000 description 2
- VLMNBMFYRMGEMB-QWRGUYRKSA-N Lys-His-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CNC=N1 VLMNBMFYRMGEMB-QWRGUYRKSA-N 0.000 description 2
- WOEDRPCHKPSFDT-MXAVVETBSA-N Lys-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N WOEDRPCHKPSFDT-MXAVVETBSA-N 0.000 description 2
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 2
- IUWMQCZOTYRXPL-ZPFDUUQYSA-N Lys-Ile-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O IUWMQCZOTYRXPL-ZPFDUUQYSA-N 0.000 description 2
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 2
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 2
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 2
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 2
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 2
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 2
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 2
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 2
- JQSIGLHQNSZZRL-KKUMJFAQSA-N Lys-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N JQSIGLHQNSZZRL-KKUMJFAQSA-N 0.000 description 2
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 2
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 2
- URGPVYGVWLIRGT-DCAQKATOSA-N Lys-Met-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O URGPVYGVWLIRGT-DCAQKATOSA-N 0.000 description 2
- VSTNAUBHKQPVJX-IHRRRGAJSA-N Lys-Met-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O VSTNAUBHKQPVJX-IHRRRGAJSA-N 0.000 description 2
- TWPCWKVOZDUYAA-KKUMJFAQSA-N Lys-Phe-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O TWPCWKVOZDUYAA-KKUMJFAQSA-N 0.000 description 2
- MSSJJDVQTFTLIF-KBPBESRZSA-N Lys-Phe-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O MSSJJDVQTFTLIF-KBPBESRZSA-N 0.000 description 2
- UDXSLGLHFUBRRM-OEAJRASXSA-N Lys-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCCCN)N)O UDXSLGLHFUBRRM-OEAJRASXSA-N 0.000 description 2
- JCVOHUKUYSYBAD-DCAQKATOSA-N Lys-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCCN)N)C(=O)N[C@@H](CS)C(=O)O JCVOHUKUYSYBAD-DCAQKATOSA-N 0.000 description 2
- YSPZCHGIWAQVKQ-AVGNSLFASA-N Lys-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN YSPZCHGIWAQVKQ-AVGNSLFASA-N 0.000 description 2
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 2
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 2
- MGKFCQFVPKOWOL-CIUDSAMLSA-N Lys-Ser-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N MGKFCQFVPKOWOL-CIUDSAMLSA-N 0.000 description 2
- LKDXINHHSWFFJC-SRVKXCTJSA-N Lys-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N LKDXINHHSWFFJC-SRVKXCTJSA-N 0.000 description 2
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 2
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 2
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 2
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 2
- BDFHWFUAQLIMJO-KXNHARMFSA-N Lys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N)O BDFHWFUAQLIMJO-KXNHARMFSA-N 0.000 description 2
- YUTZYVTZDVZBJJ-IHPCNDPISA-N Lys-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 YUTZYVTZDVZBJJ-IHPCNDPISA-N 0.000 description 2
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 2
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 2
- LMKSBGIUPVRHEH-FXQIFTODSA-N Met-Ala-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(N)=O LMKSBGIUPVRHEH-FXQIFTODSA-N 0.000 description 2
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 2
- QAHFGYLFLVGBNW-DCAQKATOSA-N Met-Ala-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN QAHFGYLFLVGBNW-DCAQKATOSA-N 0.000 description 2
- WYEXWKAWMNJKPN-UBHSHLNASA-N Met-Ala-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCSC)N WYEXWKAWMNJKPN-UBHSHLNASA-N 0.000 description 2
- DCHHUGLTVLJYKA-FXQIFTODSA-N Met-Asn-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DCHHUGLTVLJYKA-FXQIFTODSA-N 0.000 description 2
- IHITVQKJXQQGLJ-LPEHRKFASA-N Met-Asn-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N IHITVQKJXQQGLJ-LPEHRKFASA-N 0.000 description 2
- PNDCUTDWYVKBHX-IHRRRGAJSA-N Met-Asp-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PNDCUTDWYVKBHX-IHRRRGAJSA-N 0.000 description 2
- YLLWCSDBVGZLOW-CIUDSAMLSA-N Met-Gln-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O YLLWCSDBVGZLOW-CIUDSAMLSA-N 0.000 description 2
- JYCQGAGDJQYEDB-GUBZILKMSA-N Met-Gln-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O JYCQGAGDJQYEDB-GUBZILKMSA-N 0.000 description 2
- VOOINLQYUZOREH-SRVKXCTJSA-N Met-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N VOOINLQYUZOREH-SRVKXCTJSA-N 0.000 description 2
- DJDFBVNNDAUPRW-GUBZILKMSA-N Met-Glu-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O DJDFBVNNDAUPRW-GUBZILKMSA-N 0.000 description 2
- SJDQOYTYNGZZJX-SRVKXCTJSA-N Met-Glu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SJDQOYTYNGZZJX-SRVKXCTJSA-N 0.000 description 2
- OOSPRDCGTLQLBP-NHCYSSNCSA-N Met-Glu-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OOSPRDCGTLQLBP-NHCYSSNCSA-N 0.000 description 2
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 2
- LRALLISKBZNSKN-BQBZGAKWSA-N Met-Gly-Ser Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LRALLISKBZNSKN-BQBZGAKWSA-N 0.000 description 2
- SXWQMBGNFXAGAT-FJXKBIBVSA-N Met-Gly-Thr Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SXWQMBGNFXAGAT-FJXKBIBVSA-N 0.000 description 2
- MXEASDMFHUKOGE-ULQDDVLXSA-N Met-His-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N MXEASDMFHUKOGE-ULQDDVLXSA-N 0.000 description 2
- GETCJHFFECHWHI-QXEWZRGKSA-N Met-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCSC)N GETCJHFFECHWHI-QXEWZRGKSA-N 0.000 description 2
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 2
- HZVXPUHLTZRQEL-UWVGGRQHSA-N Met-Leu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O HZVXPUHLTZRQEL-UWVGGRQHSA-N 0.000 description 2
- JYPITOUIQVSCKM-IHRRRGAJSA-N Met-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCSC)N JYPITOUIQVSCKM-IHRRRGAJSA-N 0.000 description 2
- LBNFTWKGISQVEE-AVGNSLFASA-N Met-Leu-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCSC LBNFTWKGISQVEE-AVGNSLFASA-N 0.000 description 2
- OBPCXINRFKHSRY-SDDRHHMPSA-N Met-Met-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N OBPCXINRFKHSRY-SDDRHHMPSA-N 0.000 description 2
- FBLBCGLSRXBANI-KKUMJFAQSA-N Met-Phe-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FBLBCGLSRXBANI-KKUMJFAQSA-N 0.000 description 2
- VSJAPSMRFYUOKS-IUCAKERBSA-N Met-Pro-Gly Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O VSJAPSMRFYUOKS-IUCAKERBSA-N 0.000 description 2
- YLDSJJOGQNEQJK-AVGNSLFASA-N Met-Pro-Leu Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YLDSJJOGQNEQJK-AVGNSLFASA-N 0.000 description 2
- ZDJICAUBMUKVEJ-CIUDSAMLSA-N Met-Ser-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O ZDJICAUBMUKVEJ-CIUDSAMLSA-N 0.000 description 2
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 2
- FXBKQTOGURNXSL-HJGDQZAQSA-N Met-Thr-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O FXBKQTOGURNXSL-HJGDQZAQSA-N 0.000 description 2
- TUZSWDCTCGTVDJ-PJODQICGSA-N Met-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 TUZSWDCTCGTVDJ-PJODQICGSA-N 0.000 description 2
- RMLWDZINJUDMEB-IHRRRGAJSA-N Met-Tyr-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N RMLWDZINJUDMEB-IHRRRGAJSA-N 0.000 description 2
- YGNUDKAPJARTEM-GUBZILKMSA-N Met-Val-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O YGNUDKAPJARTEM-GUBZILKMSA-N 0.000 description 2
- VWFHWJGVLVZVIS-QXEWZRGKSA-N Met-Val-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O VWFHWJGVLVZVIS-QXEWZRGKSA-N 0.000 description 2
- 101001130841 Middle East respiratory syndrome-related coronavirus (isolate United Kingdom/H123990006/2012) Non-structural protein ORF5 Proteins 0.000 description 2
- -1 N-3 series unsaturated fatty acids Chemical class 0.000 description 2
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 2
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 2
- AFCARXCZXQIEQB-UHFFFAOYSA-N N-[3-oxo-3-(2,4,6,7-tetrahydrotriazolo[4,5-c]pyridin-5-yl)propyl]-2-[[3-(trifluoromethoxy)phenyl]methylamino]pyrimidine-5-carboxamide Chemical compound O=C(CCNC(=O)C=1C=NC(=NC=1)NCC1=CC(=CC=C1)OC(F)(F)F)N1CC2=C(CC1)NN=N2 AFCARXCZXQIEQB-UHFFFAOYSA-N 0.000 description 2
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 2
- 108091028043 Nucleic acid sequence Proteins 0.000 description 2
- WSXKXSBOJXEZDV-DLOVCJGASA-N Phe-Ala-Asn Chemical compound NC(=O)C[C@@H](C([O-])=O)NC(=O)[C@H](C)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 WSXKXSBOJXEZDV-DLOVCJGASA-N 0.000 description 2
- CYZBFPYMSJGBRL-DRZSPHRISA-N Phe-Ala-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CYZBFPYMSJGBRL-DRZSPHRISA-N 0.000 description 2
- BBDSZDHUCPSYAC-QEJZJMRPSA-N Phe-Ala-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BBDSZDHUCPSYAC-QEJZJMRPSA-N 0.000 description 2
- ULECEJGNDHWSKD-QEJZJMRPSA-N Phe-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 ULECEJGNDHWSKD-QEJZJMRPSA-N 0.000 description 2
- AYPMIIKUMNADSU-IHRRRGAJSA-N Phe-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AYPMIIKUMNADSU-IHRRRGAJSA-N 0.000 description 2
- CGOMLCQJEMWMCE-STQMWFEESA-N Phe-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CGOMLCQJEMWMCE-STQMWFEESA-N 0.000 description 2
- JEGFCFLCRSJCMA-IHRRRGAJSA-N Phe-Arg-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N JEGFCFLCRSJCMA-IHRRRGAJSA-N 0.000 description 2
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 description 2
- MRNRMSDVVSKPGM-AVGNSLFASA-N Phe-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MRNRMSDVVSKPGM-AVGNSLFASA-N 0.000 description 2
- HHOOEUSPFGPZFP-QWRGUYRKSA-N Phe-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HHOOEUSPFGPZFP-QWRGUYRKSA-N 0.000 description 2
- JOXIIFVCSATTDH-IHPCNDPISA-N Phe-Asn-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N JOXIIFVCSATTDH-IHPCNDPISA-N 0.000 description 2
- JIYJYFIXQTYDNF-YDHLFZDLSA-N Phe-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N JIYJYFIXQTYDNF-YDHLFZDLSA-N 0.000 description 2
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 2
- RIYZXJVARWJLKS-KKUMJFAQSA-N Phe-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RIYZXJVARWJLKS-KKUMJFAQSA-N 0.000 description 2
- WIVCOAKLPICYGY-KKUMJFAQSA-N Phe-Asp-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N WIVCOAKLPICYGY-KKUMJFAQSA-N 0.000 description 2
- UMKYAYXCMYYNHI-AVGNSLFASA-N Phe-Gln-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N UMKYAYXCMYYNHI-AVGNSLFASA-N 0.000 description 2
- SXJGROGVINAYSH-AVGNSLFASA-N Phe-Gln-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SXJGROGVINAYSH-AVGNSLFASA-N 0.000 description 2
- HOYQLNNGMHXZDW-KKUMJFAQSA-N Phe-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HOYQLNNGMHXZDW-KKUMJFAQSA-N 0.000 description 2
- MPFGIYLYWUCSJG-AVGNSLFASA-N Phe-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MPFGIYLYWUCSJG-AVGNSLFASA-N 0.000 description 2
- XXAOSEUPEMQJOF-KKUMJFAQSA-N Phe-Glu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XXAOSEUPEMQJOF-KKUMJFAQSA-N 0.000 description 2
- BFYHIHGIHGROAT-HTUGSXCWSA-N Phe-Glu-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFYHIHGIHGROAT-HTUGSXCWSA-N 0.000 description 2
- UAMFZRNCIFFMLE-FHWLQOOXSA-N Phe-Glu-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N UAMFZRNCIFFMLE-FHWLQOOXSA-N 0.000 description 2
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 2
- HNFUGJUZJRYUHN-JSGCOSHPSA-N Phe-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HNFUGJUZJRYUHN-JSGCOSHPSA-N 0.000 description 2
- PMKIMKUGCSVFSV-CQDKDKBSSA-N Phe-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N PMKIMKUGCSVFSV-CQDKDKBSSA-N 0.000 description 2
- PPHFTNABKQRAJV-JYJNAYRXSA-N Phe-His-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PPHFTNABKQRAJV-JYJNAYRXSA-N 0.000 description 2
- SFKOEHXABNPLRT-KBPBESRZSA-N Phe-His-Gly Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)NCC(O)=O SFKOEHXABNPLRT-KBPBESRZSA-N 0.000 description 2
- VADLTGVIOIOKGM-BZSNNMDCSA-N Phe-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CN=CN1 VADLTGVIOIOKGM-BZSNNMDCSA-N 0.000 description 2
- KRYSMKKRRRWOCZ-QEWYBTABSA-N Phe-Ile-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KRYSMKKRRRWOCZ-QEWYBTABSA-N 0.000 description 2
- DVOCGBNHAUHKHJ-DKIMLUQUSA-N Phe-Ile-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O DVOCGBNHAUHKHJ-DKIMLUQUSA-N 0.000 description 2
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 2
- KPEIBEPEUAZWNS-ULQDDVLXSA-N Phe-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KPEIBEPEUAZWNS-ULQDDVLXSA-N 0.000 description 2
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 2
- HQPWNHXERZCIHP-PMVMPFDFSA-N Phe-Leu-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 HQPWNHXERZCIHP-PMVMPFDFSA-N 0.000 description 2
- RMKGXGPQIPLTFC-KKUMJFAQSA-N Phe-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RMKGXGPQIPLTFC-KKUMJFAQSA-N 0.000 description 2
- BNRFQGLWLQESBG-YESZJQIVSA-N Phe-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BNRFQGLWLQESBG-YESZJQIVSA-N 0.000 description 2
- SCKXGHWQPPURGT-KKUMJFAQSA-N Phe-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O SCKXGHWQPPURGT-KKUMJFAQSA-N 0.000 description 2
- JKJSIYKSGIDHPM-WBAXXEDZSA-N Phe-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O JKJSIYKSGIDHPM-WBAXXEDZSA-N 0.000 description 2
- KAJLHCWRWDSROH-BZSNNMDCSA-N Phe-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 KAJLHCWRWDSROH-BZSNNMDCSA-N 0.000 description 2
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 2
- MGLBSROLWAWCKN-FCLVOEFKSA-N Phe-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MGLBSROLWAWCKN-FCLVOEFKSA-N 0.000 description 2
- JLLJTMHNXQTMCK-UBHSHLNASA-N Phe-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 JLLJTMHNXQTMCK-UBHSHLNASA-N 0.000 description 2
- XOHJOMKCRLHGCY-UNQGMJICSA-N Phe-Pro-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOHJOMKCRLHGCY-UNQGMJICSA-N 0.000 description 2
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 2
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 2
- JXQVYPWVGUOIDV-MXAVVETBSA-N Phe-Ser-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JXQVYPWVGUOIDV-MXAVVETBSA-N 0.000 description 2
- IPFXYNKCXYGSSV-KKUMJFAQSA-N Phe-Ser-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N IPFXYNKCXYGSSV-KKUMJFAQSA-N 0.000 description 2
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 description 2
- XNQMZHLAYFWSGJ-HTUGSXCWSA-N Phe-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XNQMZHLAYFWSGJ-HTUGSXCWSA-N 0.000 description 2
- CXMSESHALPOLRE-MEYUZBJRSA-N Phe-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O CXMSESHALPOLRE-MEYUZBJRSA-N 0.000 description 2
- SHUFSZDAIPLZLF-BEAPCOKYSA-N Phe-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O SHUFSZDAIPLZLF-BEAPCOKYSA-N 0.000 description 2
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 2
- VFDRDMOMHBJGKD-UFYCRDLUSA-N Phe-Tyr-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N VFDRDMOMHBJGKD-UFYCRDLUSA-N 0.000 description 2
- CVAUVSOFHJKCHN-BZSNNMDCSA-N Phe-Tyr-Cys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CS)C(O)=O)C1=CC=CC=C1 CVAUVSOFHJKCHN-BZSNNMDCSA-N 0.000 description 2
- ZYNBEWGJFXTBDU-ACRUOGEOSA-N Phe-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N ZYNBEWGJFXTBDU-ACRUOGEOSA-N 0.000 description 2
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 2
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 2
- AJLVKXCNXIJHDV-CIUDSAMLSA-N Pro-Ala-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O AJLVKXCNXIJHDV-CIUDSAMLSA-N 0.000 description 2
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 2
- GRIRJQGZZJVANI-CYDGBPFRSA-N Pro-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 GRIRJQGZZJVANI-CYDGBPFRSA-N 0.000 description 2
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 2
- UVKNEILZSJMKSR-FXQIFTODSA-N Pro-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 UVKNEILZSJMKSR-FXQIFTODSA-N 0.000 description 2
- OBVCYFIHIIYIQF-CIUDSAMLSA-N Pro-Asn-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OBVCYFIHIIYIQF-CIUDSAMLSA-N 0.000 description 2
- VOHFZDSRPZLXLH-IHRRRGAJSA-N Pro-Asn-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VOHFZDSRPZLXLH-IHRRRGAJSA-N 0.000 description 2
- TXPUNZXZDVJUJQ-LPEHRKFASA-N Pro-Asn-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O TXPUNZXZDVJUJQ-LPEHRKFASA-N 0.000 description 2
- SWXSLPHTJVAWDF-VEVYYDQMSA-N Pro-Asn-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWXSLPHTJVAWDF-VEVYYDQMSA-N 0.000 description 2
- CJZTUKSFZUSNCC-FXQIFTODSA-N Pro-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 CJZTUKSFZUSNCC-FXQIFTODSA-N 0.000 description 2
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 2
- KIGGUSRFHJCIEJ-DCAQKATOSA-N Pro-Asp-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O KIGGUSRFHJCIEJ-DCAQKATOSA-N 0.000 description 2
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 2
- XKHCJJPNXFBADI-DCAQKATOSA-N Pro-Asp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O XKHCJJPNXFBADI-DCAQKATOSA-N 0.000 description 2
- SFECXGVELZFBFJ-VEVYYDQMSA-N Pro-Asp-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFECXGVELZFBFJ-VEVYYDQMSA-N 0.000 description 2
- UAYHMOIGIQZLFR-NHCYSSNCSA-N Pro-Gln-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UAYHMOIGIQZLFR-NHCYSSNCSA-N 0.000 description 2
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 2
- PTLOFJZJADCNCD-DCAQKATOSA-N Pro-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 PTLOFJZJADCNCD-DCAQKATOSA-N 0.000 description 2
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 2
- JUJGNDZIKKQMDJ-IHRRRGAJSA-N Pro-His-His Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O JUJGNDZIKKQMDJ-IHRRRGAJSA-N 0.000 description 2
- AQSMZTIEJMZQEC-DCAQKATOSA-N Pro-His-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CO)C(=O)O AQSMZTIEJMZQEC-DCAQKATOSA-N 0.000 description 2
- BODDREDDDRZUCF-QTKMDUPCSA-N Pro-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@@H]2CCCN2)O BODDREDDDRZUCF-QTKMDUPCSA-N 0.000 description 2
- LXLFEIHKWGHJJB-XUXIUFHCSA-N Pro-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 LXLFEIHKWGHJJB-XUXIUFHCSA-N 0.000 description 2
- AUQGUYPHJSMAKI-CYDGBPFRSA-N Pro-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 AUQGUYPHJSMAKI-CYDGBPFRSA-N 0.000 description 2
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 2
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 2
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 2
- HATVCTYBNCNMAA-AVGNSLFASA-N Pro-Leu-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O HATVCTYBNCNMAA-AVGNSLFASA-N 0.000 description 2
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 2
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 2
- WOIFYRZPIORBRY-AVGNSLFASA-N Pro-Lys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WOIFYRZPIORBRY-AVGNSLFASA-N 0.000 description 2
- HBBBLSVBQGZKOZ-GUBZILKMSA-N Pro-Met-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O HBBBLSVBQGZKOZ-GUBZILKMSA-N 0.000 description 2
- QGLFRQCECIWXFA-RCWTZXSCSA-N Pro-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1)O QGLFRQCECIWXFA-RCWTZXSCSA-N 0.000 description 2
- PCWLNNZTBJTZRN-AVGNSLFASA-N Pro-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 PCWLNNZTBJTZRN-AVGNSLFASA-N 0.000 description 2
- KBUAPZAZPWNYSW-SRVKXCTJSA-N Pro-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KBUAPZAZPWNYSW-SRVKXCTJSA-N 0.000 description 2
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 2
- OWQXAJQZLWHPBH-FXQIFTODSA-N Pro-Ser-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O OWQXAJQZLWHPBH-FXQIFTODSA-N 0.000 description 2
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 2
- BJCXXMGGPHRSHV-GUBZILKMSA-N Pro-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BJCXXMGGPHRSHV-GUBZILKMSA-N 0.000 description 2
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 2
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 2
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 2
- WVXQQUWOKUZIEG-VEVYYDQMSA-N Pro-Thr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O WVXQQUWOKUZIEG-VEVYYDQMSA-N 0.000 description 2
- CHYAYDLYYIJCKY-OSUNSFLBSA-N Pro-Thr-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CHYAYDLYYIJCKY-OSUNSFLBSA-N 0.000 description 2
- RJTUIDFUUHPJMP-FHWLQOOXSA-N Pro-Trp-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CC4=CN=CN4)C(=O)O RJTUIDFUUHPJMP-FHWLQOOXSA-N 0.000 description 2
- SHTKRJHDMNSKRM-ULQDDVLXSA-N Pro-Tyr-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O SHTKRJHDMNSKRM-ULQDDVLXSA-N 0.000 description 2
- YHUBAXGAAYULJY-ULQDDVLXSA-N Pro-Tyr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O YHUBAXGAAYULJY-ULQDDVLXSA-N 0.000 description 2
- VEUACYMXJKXALX-IHRRRGAJSA-N Pro-Tyr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VEUACYMXJKXALX-IHRRRGAJSA-N 0.000 description 2
- QKWYXRPICJEQAJ-KJEVXHAQSA-N Pro-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@@H]2CCCN2)O QKWYXRPICJEQAJ-KJEVXHAQSA-N 0.000 description 2
- XRGIDCGRSSWCKE-SRVKXCTJSA-N Pro-Val-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O XRGIDCGRSSWCKE-SRVKXCTJSA-N 0.000 description 2
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 2
- 108010025216 RVF peptide Proteins 0.000 description 2
- 101000974028 Rhizobium leguminosarum bv. viciae (strain 3841) Putative cystathionine beta-lyase Proteins 0.000 description 2
- 101000756519 Rhodobacter capsulatus (strain ATCC BAA-309 / NBRC 16581 / SB1003) Uncharacterized protein RCAP_rcc00048 Proteins 0.000 description 2
- 101000948219 Rhodococcus erythropolis Uncharacterized 11.5 kDa protein in thcD 3'region Proteins 0.000 description 2
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 2
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 2
- IYCBDVBJWDXQRR-FXQIFTODSA-N Ser-Ala-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IYCBDVBJWDXQRR-FXQIFTODSA-N 0.000 description 2
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 2
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 2
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 2
- FCRMLGJMPXCAHD-FXQIFTODSA-N Ser-Arg-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O FCRMLGJMPXCAHD-FXQIFTODSA-N 0.000 description 2
- WXUBSIDKNMFAGS-IHRRRGAJSA-N Ser-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXUBSIDKNMFAGS-IHRRRGAJSA-N 0.000 description 2
- ZXLUWXWISXIFIX-ACZMJKKPSA-N Ser-Asn-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZXLUWXWISXIFIX-ACZMJKKPSA-N 0.000 description 2
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 2
- CNIIKZQXBBQHCX-FXQIFTODSA-N Ser-Asp-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O CNIIKZQXBBQHCX-FXQIFTODSA-N 0.000 description 2
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 2
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 2
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 2
- HEQPKICPPDOSIN-SRVKXCTJSA-N Ser-Asp-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HEQPKICPPDOSIN-SRVKXCTJSA-N 0.000 description 2
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 2
- BLPYXIXXCFVIIF-FXQIFTODSA-N Ser-Cys-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N)CN=C(N)N BLPYXIXXCFVIIF-FXQIFTODSA-N 0.000 description 2
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 2
- ULVMNZOKDBHKKI-ACZMJKKPSA-N Ser-Gln-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ULVMNZOKDBHKKI-ACZMJKKPSA-N 0.000 description 2
- ZOHGLPQGEHSLPD-FXQIFTODSA-N Ser-Gln-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZOHGLPQGEHSLPD-FXQIFTODSA-N 0.000 description 2
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 2
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 2
- QKQDTEYDEIJPNK-GUBZILKMSA-N Ser-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO QKQDTEYDEIJPNK-GUBZILKMSA-N 0.000 description 2
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 2
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 2
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 2
- OQPNSDWGAMFJNU-QWRGUYRKSA-N Ser-Gly-Tyr Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OQPNSDWGAMFJNU-QWRGUYRKSA-N 0.000 description 2
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 2
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 2
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 2
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 2
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 description 2
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 2
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 2
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 2
- AMRRYKHCILPAKD-FXQIFTODSA-N Ser-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N AMRRYKHCILPAKD-FXQIFTODSA-N 0.000 description 2
- VXYQOFXBIXKPCX-BQBZGAKWSA-N Ser-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N VXYQOFXBIXKPCX-BQBZGAKWSA-N 0.000 description 2
- XNXRTQZTFVMJIJ-DCAQKATOSA-N Ser-Met-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNXRTQZTFVMJIJ-DCAQKATOSA-N 0.000 description 2
- ASGYVPAVFNDZMA-GUBZILKMSA-N Ser-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CO)N ASGYVPAVFNDZMA-GUBZILKMSA-N 0.000 description 2
- UGTZYIPOBYXWRW-SRVKXCTJSA-N Ser-Phe-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O UGTZYIPOBYXWRW-SRVKXCTJSA-N 0.000 description 2
- BUYHXYIUQUBEQP-AVGNSLFASA-N Ser-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N BUYHXYIUQUBEQP-AVGNSLFASA-N 0.000 description 2
- KZPRPBLHYMZIMH-MXAVVETBSA-N Ser-Phe-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZPRPBLHYMZIMH-MXAVVETBSA-N 0.000 description 2
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 2
- QPPYAWVLAVXISR-DCAQKATOSA-N Ser-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QPPYAWVLAVXISR-DCAQKATOSA-N 0.000 description 2
- OVQZAFXWIWNYKA-GUBZILKMSA-N Ser-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CO)N OVQZAFXWIWNYKA-GUBZILKMSA-N 0.000 description 2
- QUGRFWPMPVIAPW-IHRRRGAJSA-N Ser-Pro-Phe Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QUGRFWPMPVIAPW-IHRRRGAJSA-N 0.000 description 2
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 2
- VFWQQZMRKFOGLE-ZLUOBGJFSA-N Ser-Ser-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O VFWQQZMRKFOGLE-ZLUOBGJFSA-N 0.000 description 2
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 2
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 2
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 2
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 2
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 2
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 2
- UYLKOSODXYSWMQ-XGEHTFHBSA-N Ser-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CO)N)O UYLKOSODXYSWMQ-XGEHTFHBSA-N 0.000 description 2
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 2
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 2
- FGBLCMLXHRPVOF-IHRRRGAJSA-N Ser-Tyr-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FGBLCMLXHRPVOF-IHRRRGAJSA-N 0.000 description 2
- PQEQXWRVHQAAKS-SRVKXCTJSA-N Ser-Tyr-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=C(O)C=C1 PQEQXWRVHQAAKS-SRVKXCTJSA-N 0.000 description 2
- IAOHCSQDQDWRQU-GUBZILKMSA-N Ser-Val-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IAOHCSQDQDWRQU-GUBZILKMSA-N 0.000 description 2
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 2
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 2
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 2
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 2
- ODRUTDLAONAVDV-IHRRRGAJSA-N Ser-Val-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ODRUTDLAONAVDV-IHRRRGAJSA-N 0.000 description 2
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 2
- 101000936711 Streptococcus gordonii Accessory secretory protein Asp4 Proteins 0.000 description 2
- 101000929863 Streptomyces cinnamonensis Monensin polyketide synthase putative ketoacyl reductase Proteins 0.000 description 2
- 101000788468 Streptomyces coelicolor Uncharacterized protein in mprR 3'region Proteins 0.000 description 2
- 101000845085 Streptomyces violaceoruber Granaticin polyketide synthase putative ketoacyl reductase 1 Proteins 0.000 description 2
- 241000192560 Synechococcus sp. Species 0.000 description 2
- 101000711771 Thiocystis violacea Uncharacterized 76.5 kDa protein in phbC 3'region Proteins 0.000 description 2
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 2
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 2
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 2
- CAGTXGDOIFXLPC-KZVJFYERSA-N Thr-Arg-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N CAGTXGDOIFXLPC-KZVJFYERSA-N 0.000 description 2
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 2
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 2
- IRKWVRSEQFTGGV-VEVYYDQMSA-N Thr-Asn-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IRKWVRSEQFTGGV-VEVYYDQMSA-N 0.000 description 2
- PQLXHSACXPGWPD-GSSVUCPTSA-N Thr-Asn-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PQLXHSACXPGWPD-GSSVUCPTSA-N 0.000 description 2
- KRPKYGOFYUNIGM-XVSYOHENSA-N Thr-Asp-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O KRPKYGOFYUNIGM-XVSYOHENSA-N 0.000 description 2
- JXKMXEBNZCKSDY-JIOCBJNQSA-N Thr-Asp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O JXKMXEBNZCKSDY-JIOCBJNQSA-N 0.000 description 2
- DSLHSTIUAPKERR-XGEHTFHBSA-N Thr-Cys-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O DSLHSTIUAPKERR-XGEHTFHBSA-N 0.000 description 2
- OYTNZCBFDXGQGE-XQXXSGGOSA-N Thr-Gln-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O OYTNZCBFDXGQGE-XQXXSGGOSA-N 0.000 description 2
- WLDUCKSCDRIVLJ-NUMRIWBASA-N Thr-Gln-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O WLDUCKSCDRIVLJ-NUMRIWBASA-N 0.000 description 2
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 2
- DIPIPFHFLPTCLK-LOKLDPHHSA-N Thr-Gln-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O DIPIPFHFLPTCLK-LOKLDPHHSA-N 0.000 description 2
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 2
- CQNFRKAKGDSJFR-NUMRIWBASA-N Thr-Glu-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CQNFRKAKGDSJFR-NUMRIWBASA-N 0.000 description 2
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 2
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 2
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 2
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 2
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 2
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 2
- JQAWYCUUFIMTHE-WLTAIBSBSA-N Thr-Gly-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JQAWYCUUFIMTHE-WLTAIBSBSA-N 0.000 description 2
- SIMKLINEDYOTKL-MBLNEYKQSA-N Thr-His-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C)C(=O)O)N)O SIMKLINEDYOTKL-MBLNEYKQSA-N 0.000 description 2
- SXAGUVRFGJSFKC-ZEILLAHLSA-N Thr-His-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SXAGUVRFGJSFKC-ZEILLAHLSA-N 0.000 description 2
- NCGUQWSJUKYCIT-SZZJOZGLSA-N Thr-His-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NCGUQWSJUKYCIT-SZZJOZGLSA-N 0.000 description 2
- WPAKPLPGQNUXGN-OSUNSFLBSA-N Thr-Ile-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WPAKPLPGQNUXGN-OSUNSFLBSA-N 0.000 description 2
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 2
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 2
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 2
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 2
- ODXKUIGEPAGKKV-KATARQTJSA-N Thr-Leu-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N)O ODXKUIGEPAGKKV-KATARQTJSA-N 0.000 description 2
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 2
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 2
- XIULAFZYEKSGAJ-IXOXFDKPSA-N Thr-Leu-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 XIULAFZYEKSGAJ-IXOXFDKPSA-N 0.000 description 2
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 2
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 2
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 2
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 2
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 2
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 2
- BDGBHYCAZJPLHX-HJGDQZAQSA-N Thr-Lys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BDGBHYCAZJPLHX-HJGDQZAQSA-N 0.000 description 2
- DXPURPNJDFCKKO-RHYQMDGZSA-N Thr-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DXPURPNJDFCKKO-RHYQMDGZSA-N 0.000 description 2
- XNTVWRJTUIOGQO-RHYQMDGZSA-N Thr-Met-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNTVWRJTUIOGQO-RHYQMDGZSA-N 0.000 description 2
- UXUAZXWKIGPUCH-RCWTZXSCSA-N Thr-Met-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O UXUAZXWKIGPUCH-RCWTZXSCSA-N 0.000 description 2
- UGFSAPWZBROURT-IXOXFDKPSA-N Thr-Phe-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N)O UGFSAPWZBROURT-IXOXFDKPSA-N 0.000 description 2
- NZRUWPIYECBYRK-HTUGSXCWSA-N Thr-Phe-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O NZRUWPIYECBYRK-HTUGSXCWSA-N 0.000 description 2
- NDXSOKGYKCGYKT-VEVYYDQMSA-N Thr-Pro-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O NDXSOKGYKCGYKT-VEVYYDQMSA-N 0.000 description 2
- LKJCABTUFGTPPY-HJGDQZAQSA-N Thr-Pro-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O LKJCABTUFGTPPY-HJGDQZAQSA-N 0.000 description 2
- BDENGIGFTNYZSJ-RCWTZXSCSA-N Thr-Pro-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O BDENGIGFTNYZSJ-RCWTZXSCSA-N 0.000 description 2
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 2
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 2
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 2
- HUPLKEHTTQBXSC-YJRXYDGGSA-N Thr-Ser-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUPLKEHTTQBXSC-YJRXYDGGSA-N 0.000 description 2
- QYDKSNXSBXZPFK-ZJDVBMNYSA-N Thr-Thr-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYDKSNXSBXZPFK-ZJDVBMNYSA-N 0.000 description 2
- AAZOYLQUEQRUMZ-GSSVUCPTSA-N Thr-Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O AAZOYLQUEQRUMZ-GSSVUCPTSA-N 0.000 description 2
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 description 2
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 2
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 2
- BEZTUFWTPVOROW-KJEVXHAQSA-N Thr-Tyr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O BEZTUFWTPVOROW-KJEVXHAQSA-N 0.000 description 2
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 2
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 2
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 2
- SBYQHZCMVSPQCS-RCWTZXSCSA-N Thr-Val-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O SBYQHZCMVSPQCS-RCWTZXSCSA-N 0.000 description 2
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 2
- BPGDJSUFQKWUBK-KJEVXHAQSA-N Thr-Val-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BPGDJSUFQKWUBK-KJEVXHAQSA-N 0.000 description 2
- BIJDDZBDSJLWJY-PJODQICGSA-N Trp-Ala-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O BIJDDZBDSJLWJY-PJODQICGSA-N 0.000 description 2
- MVHHTXAUJCIOMZ-WDSOQIARSA-N Trp-Arg-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N MVHHTXAUJCIOMZ-WDSOQIARSA-N 0.000 description 2
- GTNCSPKYWCJZAC-XIRDDKMYSA-N Trp-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N GTNCSPKYWCJZAC-XIRDDKMYSA-N 0.000 description 2
- SSNGFWKILJLTQM-QEJZJMRPSA-N Trp-Gln-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SSNGFWKILJLTQM-QEJZJMRPSA-N 0.000 description 2
- MDDYTWOFHZFABW-SZMVWBNQSA-N Trp-Gln-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 MDDYTWOFHZFABW-SZMVWBNQSA-N 0.000 description 2
- RPVDDQYNBOVWLR-HOCLYGCPSA-N Trp-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RPVDDQYNBOVWLR-HOCLYGCPSA-N 0.000 description 2
- YRSOERSDNRSCBC-XIRDDKMYSA-N Trp-His-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)N[C@@H](CS)C(=O)O)N YRSOERSDNRSCBC-XIRDDKMYSA-N 0.000 description 2
- VDUJEEQMRQCLHB-YTQUADARSA-N Trp-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O VDUJEEQMRQCLHB-YTQUADARSA-N 0.000 description 2
- BOMYCJXTWRMKJA-RNXOBYDBSA-N Trp-Phe-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N BOMYCJXTWRMKJA-RNXOBYDBSA-N 0.000 description 2
- JGLXHHQUSIULAK-OYDLWJJNSA-N Trp-Pro-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H]3CCCN3C(=O)[C@H](CC=3C4=CC=CC=C4NC=3)N)C(O)=O)=CNC2=C1 JGLXHHQUSIULAK-OYDLWJJNSA-N 0.000 description 2
- KBKTUNYBNJWFRL-UBHSHLNASA-N Trp-Ser-Asn Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 KBKTUNYBNJWFRL-UBHSHLNASA-N 0.000 description 2
- IELISNUVHBKYBX-XDTLVQLUSA-N Tyr-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IELISNUVHBKYBX-XDTLVQLUSA-N 0.000 description 2
- CDRYEAWHKJSGAF-BPNCWPANSA-N Tyr-Ala-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O CDRYEAWHKJSGAF-BPNCWPANSA-N 0.000 description 2
- OOEUVMFKKZYSRX-LEWSCRJBSA-N Tyr-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OOEUVMFKKZYSRX-LEWSCRJBSA-N 0.000 description 2
- DXYWRYQRKPIGGU-BPNCWPANSA-N Tyr-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DXYWRYQRKPIGGU-BPNCWPANSA-N 0.000 description 2
- HSVPZJLMPLMPOX-BPNCWPANSA-N Tyr-Arg-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O HSVPZJLMPLMPOX-BPNCWPANSA-N 0.000 description 2
- HKIUVWMZYFBIHG-KKUMJFAQSA-N Tyr-Arg-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O HKIUVWMZYFBIHG-KKUMJFAQSA-N 0.000 description 2
- ADBDQGBDNUTRDB-ULQDDVLXSA-N Tyr-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O ADBDQGBDNUTRDB-ULQDDVLXSA-N 0.000 description 2
- DKKHULUSOSWGHS-UWJYBYFXSA-N Tyr-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N DKKHULUSOSWGHS-UWJYBYFXSA-N 0.000 description 2
- BARBHMSSVWPKPZ-IHRRRGAJSA-N Tyr-Asp-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BARBHMSSVWPKPZ-IHRRRGAJSA-N 0.000 description 2
- BEIGSKUPTIFYRZ-SRVKXCTJSA-N Tyr-Asp-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O BEIGSKUPTIFYRZ-SRVKXCTJSA-N 0.000 description 2
- NLMXVDDEQFKQQU-CFMVVWHZSA-N Tyr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NLMXVDDEQFKQQU-CFMVVWHZSA-N 0.000 description 2
- VFJIWSJKZJTQII-SRVKXCTJSA-N Tyr-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VFJIWSJKZJTQII-SRVKXCTJSA-N 0.000 description 2
- FFCRCJZJARTYCG-KKUMJFAQSA-N Tyr-Cys-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N)O FFCRCJZJARTYCG-KKUMJFAQSA-N 0.000 description 2
- RYSNTWVRSLCAJZ-RYUDHWBXSA-N Tyr-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RYSNTWVRSLCAJZ-RYUDHWBXSA-N 0.000 description 2
- RIJPHPUJRLEOAK-JYJNAYRXSA-N Tyr-Gln-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O RIJPHPUJRLEOAK-JYJNAYRXSA-N 0.000 description 2
- CKHQKYHIZCRTAP-SOUVJXGZSA-N Tyr-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O CKHQKYHIZCRTAP-SOUVJXGZSA-N 0.000 description 2
- MPKPIWFFDWVJGC-IRIUXVKKSA-N Tyr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O MPKPIWFFDWVJGC-IRIUXVKKSA-N 0.000 description 2
- FJKXUIJOMUWCDD-FHWLQOOXSA-N Tyr-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N)O FJKXUIJOMUWCDD-FHWLQOOXSA-N 0.000 description 2
- IWRMTNJCCMEBEX-AVGNSLFASA-N Tyr-Glu-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)O IWRMTNJCCMEBEX-AVGNSLFASA-N 0.000 description 2
- IMXAAEFAIBRCQF-SIUGBPQLSA-N Tyr-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N IMXAAEFAIBRCQF-SIUGBPQLSA-N 0.000 description 2
- KOVXHANYYYMBRF-IRIUXVKKSA-N Tyr-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KOVXHANYYYMBRF-IRIUXVKKSA-N 0.000 description 2
- UNUZEBFXGWVAOP-DZKIICNBSA-N Tyr-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UNUZEBFXGWVAOP-DZKIICNBSA-N 0.000 description 2
- FBHBVXUBTYVCRU-BZSNNMDCSA-N Tyr-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CN=CN1 FBHBVXUBTYVCRU-BZSNNMDCSA-N 0.000 description 2
- JHORGUYURUBVOM-KKUMJFAQSA-N Tyr-His-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O JHORGUYURUBVOM-KKUMJFAQSA-N 0.000 description 2
- WSFXJLFSJSXGMQ-MGHWNKPDSA-N Tyr-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WSFXJLFSJSXGMQ-MGHWNKPDSA-N 0.000 description 2
- QSFJHIRIHOJRKS-ULQDDVLXSA-N Tyr-Leu-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QSFJHIRIHOJRKS-ULQDDVLXSA-N 0.000 description 2
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 2
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 2
- KHCSOLAHNLOXJR-BZSNNMDCSA-N Tyr-Leu-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHCSOLAHNLOXJR-BZSNNMDCSA-N 0.000 description 2
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 2
- JAGGEZACYAAMIL-CQDKDKBSSA-N Tyr-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JAGGEZACYAAMIL-CQDKDKBSSA-N 0.000 description 2
- SBLZVFCEOCWRLS-BPNCWPANSA-N Tyr-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=C(C=C1)O)N SBLZVFCEOCWRLS-BPNCWPANSA-N 0.000 description 2
- FWOVTJKVUCGVND-UFYCRDLUSA-N Tyr-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N FWOVTJKVUCGVND-UFYCRDLUSA-N 0.000 description 2
- OGPKMBOPMDTEDM-IHRRRGAJSA-N Tyr-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N OGPKMBOPMDTEDM-IHRRRGAJSA-N 0.000 description 2
- BGFCXQXETBDEHP-BZSNNMDCSA-N Tyr-Phe-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O BGFCXQXETBDEHP-BZSNNMDCSA-N 0.000 description 2
- PSALWJCUIAQKFW-ACRUOGEOSA-N Tyr-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N PSALWJCUIAQKFW-ACRUOGEOSA-N 0.000 description 2
- BIWVVOHTKDLRMP-ULQDDVLXSA-N Tyr-Pro-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BIWVVOHTKDLRMP-ULQDDVLXSA-N 0.000 description 2
- RGYCVIZZTUBSSG-JYJNAYRXSA-N Tyr-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O RGYCVIZZTUBSSG-JYJNAYRXSA-N 0.000 description 2
- XYBNMHRFAUKPAW-IHRRRGAJSA-N Tyr-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XYBNMHRFAUKPAW-IHRRRGAJSA-N 0.000 description 2
- ITDWWLTTWRRLCC-KJEVXHAQSA-N Tyr-Thr-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ITDWWLTTWRRLCC-KJEVXHAQSA-N 0.000 description 2
- OJCISMMNNUNNJA-BZSNNMDCSA-N Tyr-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 OJCISMMNNUNNJA-BZSNNMDCSA-N 0.000 description 2
- HZWPGKAKGYJWCI-ULQDDVLXSA-N Tyr-Val-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O HZWPGKAKGYJWCI-ULQDDVLXSA-N 0.000 description 2
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 2
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 2
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 2
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 2
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 2
- JYVKKBDANPZIAW-AVGNSLFASA-N Val-Arg-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N JYVKKBDANPZIAW-AVGNSLFASA-N 0.000 description 2
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 2
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 2
- ZMDCGGKHRKNWKD-LAEOZQHASA-N Val-Asn-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZMDCGGKHRKNWKD-LAEOZQHASA-N 0.000 description 2
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 2
- VUTHNLMCXKLLFI-LAEOZQHASA-N Val-Asp-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VUTHNLMCXKLLFI-LAEOZQHASA-N 0.000 description 2
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 2
- CWSIBTLMMQLPPZ-FXQIFTODSA-N Val-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N CWSIBTLMMQLPPZ-FXQIFTODSA-N 0.000 description 2
- XEYUMGGWQCIWAR-XVKPBYJWSA-N Val-Gln-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N XEYUMGGWQCIWAR-XVKPBYJWSA-N 0.000 description 2
- CPTQYHDSVGVGDZ-UKJIMTQDSA-N Val-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N CPTQYHDSVGVGDZ-UKJIMTQDSA-N 0.000 description 2
- JXGWQYWDUOWQHA-DZKIICNBSA-N Val-Gln-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N JXGWQYWDUOWQHA-DZKIICNBSA-N 0.000 description 2
- AGKDVLSDNSTLFA-UMNHJUIQSA-N Val-Gln-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N AGKDVLSDNSTLFA-UMNHJUIQSA-N 0.000 description 2
- UZDHNIJRRTUKKC-DLOVCJGASA-N Val-Gln-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UZDHNIJRRTUKKC-DLOVCJGASA-N 0.000 description 2
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 2
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 2
- MHAHQDBEIDPFQS-NHCYSSNCSA-N Val-Glu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)C(C)C MHAHQDBEIDPFQS-NHCYSSNCSA-N 0.000 description 2
- FOADDSDHGRFUOC-DZKIICNBSA-N Val-Glu-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FOADDSDHGRFUOC-DZKIICNBSA-N 0.000 description 2
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 2
- PMXBARDFIAPBGK-DZKIICNBSA-N Val-Glu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PMXBARDFIAPBGK-DZKIICNBSA-N 0.000 description 2
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 2
- FXVDGDZRYLFQKY-WPRPVWTQSA-N Val-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C FXVDGDZRYLFQKY-WPRPVWTQSA-N 0.000 description 2
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 2
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 2
- WNZSAUMKZQXHNC-UKJIMTQDSA-N Val-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N WNZSAUMKZQXHNC-UKJIMTQDSA-N 0.000 description 2
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 2
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 2
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 2
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 2
- WBAJDGWKRIHOAC-GVXVVHGQSA-N Val-Lys-Gln Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O WBAJDGWKRIHOAC-GVXVVHGQSA-N 0.000 description 2
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 2
- JVGHIFMSFBZDHH-WPRPVWTQSA-N Val-Met-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N JVGHIFMSFBZDHH-WPRPVWTQSA-N 0.000 description 2
- YDVDTCJGBBJGRT-GUBZILKMSA-N Val-Met-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N YDVDTCJGBBJGRT-GUBZILKMSA-N 0.000 description 2
- WMRWZYSRQUORHJ-YDHLFZDLSA-N Val-Phe-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WMRWZYSRQUORHJ-YDHLFZDLSA-N 0.000 description 2
- FMQGYTMERWBMSI-HJWJTTGWSA-N Val-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N FMQGYTMERWBMSI-HJWJTTGWSA-N 0.000 description 2
- WANVRBAZGSICCP-SRVKXCTJSA-N Val-Pro-Met Chemical compound CSCC[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C)C(O)=O WANVRBAZGSICCP-SRVKXCTJSA-N 0.000 description 2
- QIVPZSWBBHRNBA-JYJNAYRXSA-N Val-Pro-Phe Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O QIVPZSWBBHRNBA-JYJNAYRXSA-N 0.000 description 2
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 2
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 2
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 2
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 2
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 2
- HWNYVQMOLCYHEA-IHRRRGAJSA-N Val-Ser-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N HWNYVQMOLCYHEA-IHRRRGAJSA-N 0.000 description 2
- UQMPYVLTQCGRSK-IFFSRLJSSA-N Val-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N)O UQMPYVLTQCGRSK-IFFSRLJSSA-N 0.000 description 2
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 2
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 2
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 2
- USXYVSTVPHELAF-RCWTZXSCSA-N Val-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N)O USXYVSTVPHELAF-RCWTZXSCSA-N 0.000 description 2
- YLBNZCJFSVJDRJ-KJEVXHAQSA-N Val-Thr-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O YLBNZCJFSVJDRJ-KJEVXHAQSA-N 0.000 description 2
- NGXQOQNXSGOYOI-BQFCYCMXSA-N Val-Trp-Gln Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 NGXQOQNXSGOYOI-BQFCYCMXSA-N 0.000 description 2
- JXCOEPXCBVCTRD-JYJNAYRXSA-N Val-Tyr-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JXCOEPXCBVCTRD-JYJNAYRXSA-N 0.000 description 2
- PMKQKNBISAOSRI-XHSDSOJGSA-N Val-Tyr-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N PMKQKNBISAOSRI-XHSDSOJGSA-N 0.000 description 2
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 2
- OWFGFHQMSBTKLX-UFYCRDLUSA-N Val-Tyr-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N OWFGFHQMSBTKLX-UFYCRDLUSA-N 0.000 description 2
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 2
- WBPFYNYTYASCQP-CYDGBPFRSA-N Val-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N WBPFYNYTYASCQP-CYDGBPFRSA-N 0.000 description 2
- 101000711318 Vibrio alginolyticus Uncharacterized 11.6 kDa protein in scrR 3'region Proteins 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- 108010028939 alanyl-alanyl-lysyl-alanine Proteins 0.000 description 2
- 108010047506 alanyl-glutaminyl-glycyl-valine Proteins 0.000 description 2
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 2
- 108010031014 alanyl-histidyl-leucyl-leucine Proteins 0.000 description 2
- 108010045023 alanyl-prolyl-tyrosine Proteins 0.000 description 2
- MBMBGCFOFBJSGT-KUBAVDMBSA-N all-cis-docosa-4,7,10,13,16,19-hexaenoic acid Chemical compound CC\C=C/C\C=C/C\C=C/C\C=C/C\C=C/C\C=C/CCC(O)=O MBMBGCFOFBJSGT-KUBAVDMBSA-N 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 108010057412 arginyl-glycyl-aspartyl-phenylalanine Proteins 0.000 description 2
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 2
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 2
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 2
- 108010084758 arginyl-tyrosyl-aspartic acid Proteins 0.000 description 2
- 108010068380 arginylarginine Proteins 0.000 description 2
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 239000013599 cloning vector Substances 0.000 description 2
- 238000004040 coloring Methods 0.000 description 2
- 108010016616 cysteinylglycine Proteins 0.000 description 2
- 108010060199 cysteinylproline Proteins 0.000 description 2
- 108010009297 diglycyl-histidine Proteins 0.000 description 2
- 108010054812 diprotin A Proteins 0.000 description 2
- 108010054813 diprotin B Proteins 0.000 description 2
- 239000012153 distilled water Substances 0.000 description 2
- 230000032050 esterification Effects 0.000 description 2
- 238000005886 esterification reaction Methods 0.000 description 2
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 2
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 2
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 2
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 2
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 2
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 2
- 108010084264 glycyl-glycyl-cysteine Proteins 0.000 description 2
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 2
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 2
- 108010038983 glycyl-histidyl-lysine Proteins 0.000 description 2
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 2
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 2
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 2
- 108010079413 glycyl-prolyl-glutamic acid Proteins 0.000 description 2
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 2
- 108010077515 glycylproline Proteins 0.000 description 2
- 238000005286 illumination Methods 0.000 description 2
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 2
- 108010053037 kyotorphin Proteins 0.000 description 2
- 108010077158 leucinyl-arginyl-tryptophan Proteins 0.000 description 2
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 2
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 2
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 2
- 108010091871 leucylmethionine Proteins 0.000 description 2
- 108010012058 leucyltyrosine Proteins 0.000 description 2
- 150000002632 lipids Chemical class 0.000 description 2
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 2
- 108010089256 lysyl-aspartyl-glutamyl-leucine Proteins 0.000 description 2
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 2
- 108010043322 lysyl-tryptophyl-alpha-lysine Proteins 0.000 description 2
- 108010009298 lysylglutamic acid Proteins 0.000 description 2
- 108010005942 methionylglycine Proteins 0.000 description 2
- 244000005700 microbiome Species 0.000 description 2
- 108010082795 phenylalanyl-arginyl-arginine Proteins 0.000 description 2
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 2
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 2
- 108010077112 prolyl-proline Proteins 0.000 description 2
- 108010029020 prolylglycine Proteins 0.000 description 2
- 108010015796 prolylisoleucine Proteins 0.000 description 2
- 239000002994 raw material Substances 0.000 description 2
- 108010015840 seryl-prolyl-lysyl-lysine Proteins 0.000 description 2
- 239000000741 silica gel Substances 0.000 description 2
- 229910002027 silica gel Inorganic materials 0.000 description 2
- SQGYOTSLMSWVJD-UHFFFAOYSA-N silver(1+) nitrate Chemical compound [Ag+].[O-]N(=O)=O SQGYOTSLMSWVJD-UHFFFAOYSA-N 0.000 description 2
- 108010005652 splenotritin Proteins 0.000 description 2
- 229960005322 streptomycin Drugs 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 2
- 229910021654 trace metal Inorganic materials 0.000 description 2
- 230000009261 transgenic effect Effects 0.000 description 2
- 108010084932 tryptophyl-proline Proteins 0.000 description 2
- 108010038745 tryptophylglycine Proteins 0.000 description 2
- 108010005834 tyrosyl-alanyl-glycine Proteins 0.000 description 2
- 108010020532 tyrosyl-proline Proteins 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- 108010027345 wheylin-1 peptide Proteins 0.000 description 2
- PAHHYDSPOXDASW-VGWMRTNUSA-N (2s)-6-amino-2-[[(2s)-6-amino-2-[[(2s)-1-[(2s)-2-amino-3-hydroxypropanoyl]pyrrolidine-2-carbonyl]amino]hexanoyl]amino]hexanoic acid Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO PAHHYDSPOXDASW-VGWMRTNUSA-N 0.000 description 1
- VFNKZQNIXUFLBC-UHFFFAOYSA-N 2',7'-dichlorofluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC(Cl)=C(O)C=C1OC1=C2C=C(Cl)C(O)=C1 VFNKZQNIXUFLBC-UHFFFAOYSA-N 0.000 description 1
- YLZOPXRUQYQQID-UHFFFAOYSA-N 3-(2,4,6,7-tetrahydrotriazolo[4,5-c]pyridin-5-yl)-1-[4-[2-[[3-(trifluoromethoxy)phenyl]methylamino]pyrimidin-5-yl]piperazin-1-yl]propan-1-one Chemical compound N1N=NC=2CN(CCC=21)CCC(=O)N1CCN(CC1)C=1C=NC(=NC=1)NCC1=CC(=CC=C1)OC(F)(F)F YLZOPXRUQYQQID-UHFFFAOYSA-N 0.000 description 1
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 1
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 1
- HGRBNYQIMKTUNT-XVYDVKMFSA-N Ala-Asn-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HGRBNYQIMKTUNT-XVYDVKMFSA-N 0.000 description 1
- LZRNYBIJOSKKRJ-XVYDVKMFSA-N Ala-Asp-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LZRNYBIJOSKKRJ-XVYDVKMFSA-N 0.000 description 1
- MIPWEZAIMPYQST-FXQIFTODSA-N Ala-Cys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O MIPWEZAIMPYQST-FXQIFTODSA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 1
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 1
- VBRDBGCROKWTPV-XHNCKOQMSA-N Ala-Glu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N VBRDBGCROKWTPV-XHNCKOQMSA-N 0.000 description 1
- VWEWCZSUWOEEFM-WDSKDSINSA-N Ala-Gly-Ala-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(=O)NCC(O)=O VWEWCZSUWOEEFM-WDSKDSINSA-N 0.000 description 1
- FAJIYNONGXEXAI-CQDKDKBSSA-N Ala-His-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CNC=N1 FAJIYNONGXEXAI-CQDKDKBSSA-N 0.000 description 1
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 1
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 1
- XUCHENWTTBFODJ-FXQIFTODSA-N Ala-Met-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O XUCHENWTTBFODJ-FXQIFTODSA-N 0.000 description 1
- GKAZXNDATBWNBI-DCAQKATOSA-N Ala-Met-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N GKAZXNDATBWNBI-DCAQKATOSA-N 0.000 description 1
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 1
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 1
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 1
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 1
- GCTANJIJJROSLH-GVARAGBVSA-N Ala-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C)N GCTANJIJJROSLH-GVARAGBVSA-N 0.000 description 1
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 1
- KBBKCNHWCDJPGN-GUBZILKMSA-N Arg-Gln-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KBBKCNHWCDJPGN-GUBZILKMSA-N 0.000 description 1
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 1
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 1
- FKQITMVNILRUCQ-IHRRRGAJSA-N Arg-Phe-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O FKQITMVNILRUCQ-IHRRRGAJSA-N 0.000 description 1
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 1
- JBQORRNSZGTLCV-WDSOQIARSA-N Arg-Trp-Lys Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N)=CNC2=C1 JBQORRNSZGTLCV-WDSOQIARSA-N 0.000 description 1
- HUZGPXBILPMCHM-IHRRRGAJSA-N Asn-Arg-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HUZGPXBILPMCHM-IHRRRGAJSA-N 0.000 description 1
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 1
- XQQVCUIBGYFKDC-OLHMAJIHSA-N Asn-Asp-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XQQVCUIBGYFKDC-OLHMAJIHSA-N 0.000 description 1
- ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 description 1
- PPMTUXJSQDNUDE-CIUDSAMLSA-N Asn-Glu-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PPMTUXJSQDNUDE-CIUDSAMLSA-N 0.000 description 1
- PLVAAIPKSGUXDV-WHFBIAKZSA-N Asn-Gly-Cys Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)C(=O)N PLVAAIPKSGUXDV-WHFBIAKZSA-N 0.000 description 1
- YGHCVNQOZZMHRZ-DJFWLOJKSA-N Asn-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)N)N YGHCVNQOZZMHRZ-DJFWLOJKSA-N 0.000 description 1
- BXUHCIXDSWRSBS-CIUDSAMLSA-N Asn-Leu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BXUHCIXDSWRSBS-CIUDSAMLSA-N 0.000 description 1
- NYLBGYLHBDFRHL-VEVYYDQMSA-N Asp-Arg-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NYLBGYLHBDFRHL-VEVYYDQMSA-N 0.000 description 1
- ATYWBXGNXZYZGI-ACZMJKKPSA-N Asp-Asn-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ATYWBXGNXZYZGI-ACZMJKKPSA-N 0.000 description 1
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 1
- ZSJFGGSPCCHMNE-LAEOZQHASA-N Asp-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N ZSJFGGSPCCHMNE-LAEOZQHASA-N 0.000 description 1
- JUWZKMBALYLZCK-WHFBIAKZSA-N Asp-Gly-Asn Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O JUWZKMBALYLZCK-WHFBIAKZSA-N 0.000 description 1
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 1
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 1
- RTXQQDVBACBSCW-CFMVVWHZSA-N Asp-Ile-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RTXQQDVBACBSCW-CFMVVWHZSA-N 0.000 description 1
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 1
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 1
- GWIJZUVQVDJHDI-AVGNSLFASA-N Asp-Phe-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GWIJZUVQVDJHDI-AVGNSLFASA-N 0.000 description 1
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 1
- NAAAPCLFJPURAM-HJGDQZAQSA-N Asp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O NAAAPCLFJPURAM-HJGDQZAQSA-N 0.000 description 1
- KACWACLNYLSVCA-VHWLVUOQSA-N Asp-Trp-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KACWACLNYLSVCA-VHWLVUOQSA-N 0.000 description 1
- 101100316860 Autographa californica nuclear polyhedrosis virus DA18 gene Proteins 0.000 description 1
- 241000555825 Clupeidae Species 0.000 description 1
- IXPSSIBVVKSOIE-SRVKXCTJSA-N Cys-Ser-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N)O IXPSSIBVVKSOIE-SRVKXCTJSA-N 0.000 description 1
- ZKAUCGZIIXXWJQ-BZSNNMDCSA-N Cys-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CS)N)O ZKAUCGZIIXXWJQ-BZSNNMDCSA-N 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 241001646716 Escherichia coli K-12 Species 0.000 description 1
- 241000078280 Escherichia coli S17 Species 0.000 description 1
- DTCCMDYODDPHBG-ACZMJKKPSA-N Gln-Ala-Cys Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O DTCCMDYODDPHBG-ACZMJKKPSA-N 0.000 description 1
- WQWMZOIPXWSZNE-WDSKDSINSA-N Gln-Asp-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O WQWMZOIPXWSZNE-WDSKDSINSA-N 0.000 description 1
- VNCLJDOTEPPBBD-GUBZILKMSA-N Gln-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N VNCLJDOTEPPBBD-GUBZILKMSA-N 0.000 description 1
- JXBZEDIQFFCHPZ-PEFMBERDSA-N Gln-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JXBZEDIQFFCHPZ-PEFMBERDSA-N 0.000 description 1
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 1
- QFXNFFZTMFHPST-DZKIICNBSA-N Gln-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)N)N QFXNFFZTMFHPST-DZKIICNBSA-N 0.000 description 1
- VNTGPISAOMAXRK-CIUDSAMLSA-N Gln-Pro-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O VNTGPISAOMAXRK-CIUDSAMLSA-N 0.000 description 1
- UEILCTONAMOGBR-RWRJDSDZSA-N Gln-Thr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UEILCTONAMOGBR-RWRJDSDZSA-N 0.000 description 1
- JKDBRTNMYXYLHO-JYJNAYRXSA-N Gln-Tyr-Leu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 JKDBRTNMYXYLHO-JYJNAYRXSA-N 0.000 description 1
- QGWXAMDECCKGRU-XVKPBYJWSA-N Gln-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(N)=O)C(=O)NCC(O)=O QGWXAMDECCKGRU-XVKPBYJWSA-N 0.000 description 1
- DIXKFOPPGWKZLY-CIUDSAMLSA-N Glu-Arg-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O DIXKFOPPGWKZLY-CIUDSAMLSA-N 0.000 description 1
- RJONUNZIMUXUOI-GUBZILKMSA-N Glu-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N RJONUNZIMUXUOI-GUBZILKMSA-N 0.000 description 1
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 1
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 1
- WPLGNDORMXTMQS-FXQIFTODSA-N Glu-Gln-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O WPLGNDORMXTMQS-FXQIFTODSA-N 0.000 description 1
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 1
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 1
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 1
- XMPAXPSENRSOSV-RYUDHWBXSA-N Glu-Gly-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XMPAXPSENRSOSV-RYUDHWBXSA-N 0.000 description 1
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 1
- YGLCLCMAYUYZSG-AVGNSLFASA-N Glu-Lys-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 YGLCLCMAYUYZSG-AVGNSLFASA-N 0.000 description 1
- ZWMYUDZLXAQHCK-CIUDSAMLSA-N Glu-Met-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O ZWMYUDZLXAQHCK-CIUDSAMLSA-N 0.000 description 1
- RXESHTOTINOODU-JYJNAYRXSA-N Glu-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)O)N RXESHTOTINOODU-JYJNAYRXSA-N 0.000 description 1
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 1
- BDISFWMLMNBTGP-NUMRIWBASA-N Glu-Thr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O BDISFWMLMNBTGP-NUMRIWBASA-N 0.000 description 1
- MWTGQXBHVRTCOR-GLLZPBPUSA-N Glu-Thr-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MWTGQXBHVRTCOR-GLLZPBPUSA-N 0.000 description 1
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 1
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 1
- XEJTYSCIXKYSHR-WDSKDSINSA-N Gly-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN XEJTYSCIXKYSHR-WDSKDSINSA-N 0.000 description 1
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- BPQYBFAXRGMGGY-LAEOZQHASA-N Gly-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN BPQYBFAXRGMGGY-LAEOZQHASA-N 0.000 description 1
- AQLHORCVPGXDJW-IUCAKERBSA-N Gly-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN AQLHORCVPGXDJW-IUCAKERBSA-N 0.000 description 1
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 1
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 1
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 1
- LPHQAFLNEHWKFF-QXEWZRGKSA-N Gly-Met-Ile Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LPHQAFLNEHWKFF-QXEWZRGKSA-N 0.000 description 1
- IBYOLNARKHMLBG-WHOFXGATSA-N Gly-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IBYOLNARKHMLBG-WHOFXGATSA-N 0.000 description 1
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 1
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 1
- YABRDIBSPZONIY-BQBZGAKWSA-N Gly-Ser-Met Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O YABRDIBSPZONIY-BQBZGAKWSA-N 0.000 description 1
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 1
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 1
- ZNPRMNDAFQKATM-LKTVYLICSA-N His-Ala-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZNPRMNDAFQKATM-LKTVYLICSA-N 0.000 description 1
- VYMGAXSNYUFVCK-GUBZILKMSA-N His-Gln-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N VYMGAXSNYUFVCK-GUBZILKMSA-N 0.000 description 1
- NELVFWFDOKRTOR-SDDRHHMPSA-N His-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O NELVFWFDOKRTOR-SDDRHHMPSA-N 0.000 description 1
- IIVZNQCUUMBBKF-GVXVVHGQSA-N His-Gln-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 IIVZNQCUUMBBKF-GVXVVHGQSA-N 0.000 description 1
- RGPWUJOMKFYFSR-QWRGUYRKSA-N His-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RGPWUJOMKFYFSR-QWRGUYRKSA-N 0.000 description 1
- YXASFUBDSDAXQD-UWVGGRQHSA-N His-Met-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O YXASFUBDSDAXQD-UWVGGRQHSA-N 0.000 description 1
- ZFDKSLBEWYCOCS-BZSNNMDCSA-N His-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)C1=CC=CC=C1 ZFDKSLBEWYCOCS-BZSNNMDCSA-N 0.000 description 1
- YSMZBYPVVYSGOT-SZMVWBNQSA-N His-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N YSMZBYPVVYSGOT-SZMVWBNQSA-N 0.000 description 1
- FOCSWPCHUDVNLP-PMVMPFDFSA-N His-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)[C@H](CC4=CN=CN4)N FOCSWPCHUDVNLP-PMVMPFDFSA-N 0.000 description 1
- VEXZGXHMUGYJMC-UHFFFAOYSA-N Hydrochloric acid Chemical compound Cl VEXZGXHMUGYJMC-UHFFFAOYSA-N 0.000 description 1
- LQSBBHNVAVNZSX-GHCJXIJMSA-N Ile-Ala-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LQSBBHNVAVNZSX-GHCJXIJMSA-N 0.000 description 1
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 1
- IPYVXYDYLHVWHU-GMOBBJLQSA-N Ile-Asn-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N IPYVXYDYLHVWHU-GMOBBJLQSA-N 0.000 description 1
- UMYZBHKAVTXWIW-GMOBBJLQSA-N Ile-Asp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UMYZBHKAVTXWIW-GMOBBJLQSA-N 0.000 description 1
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 1
- DSDPLOODKXISDT-XUXIUFHCSA-N Ile-Leu-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DSDPLOODKXISDT-XUXIUFHCSA-N 0.000 description 1
- RCMNUBZKIIJCOI-ZPFDUUQYSA-N Ile-Met-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RCMNUBZKIIJCOI-ZPFDUUQYSA-N 0.000 description 1
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 description 1
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 1
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 1
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 1
- JTBFQNHKNRZJDS-SYWGBEHUSA-N Ile-Trp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](C)C(=O)O)N JTBFQNHKNRZJDS-SYWGBEHUSA-N 0.000 description 1
- HQLSBZFLOUHQJK-STECZYCISA-N Ile-Tyr-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HQLSBZFLOUHQJK-STECZYCISA-N 0.000 description 1
- GNXGAVNTVNOCLL-SIUGBPQLSA-N Ile-Tyr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GNXGAVNTVNOCLL-SIUGBPQLSA-N 0.000 description 1
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 1
- WIYDLTIBHZSPKY-HJWJTTGWSA-N Ile-Val-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WIYDLTIBHZSPKY-HJWJTTGWSA-N 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- QPRQGENIBFLVEB-BJDJZHNGSA-N Leu-Ala-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QPRQGENIBFLVEB-BJDJZHNGSA-N 0.000 description 1
- DQPQTXMIRBUWKO-DCAQKATOSA-N Leu-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N DQPQTXMIRBUWKO-DCAQKATOSA-N 0.000 description 1
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 1
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 1
- YORLGJINWYYIMX-KKUMJFAQSA-N Leu-Cys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YORLGJINWYYIMX-KKUMJFAQSA-N 0.000 description 1
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 1
- CFZZDVMBRYFFNU-QWRGUYRKSA-N Leu-His-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O CFZZDVMBRYFFNU-QWRGUYRKSA-N 0.000 description 1
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 1
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 1
- NTXYXFDMIHXTHE-WDSOQIARSA-N Leu-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 NTXYXFDMIHXTHE-WDSOQIARSA-N 0.000 description 1
- WQWZXKWOEVSGQM-DCAQKATOSA-N Lys-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN WQWZXKWOEVSGQM-DCAQKATOSA-N 0.000 description 1
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 1
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 1
- JGAMUXDWYSXYLM-SRVKXCTJSA-N Lys-Arg-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGAMUXDWYSXYLM-SRVKXCTJSA-N 0.000 description 1
- NLOZZWJNIKKYSC-WDSOQIARSA-N Lys-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCCN)C(O)=O)=CNC2=C1 NLOZZWJNIKKYSC-WDSOQIARSA-N 0.000 description 1
- DNEJSAIMVANNPA-DCAQKATOSA-N Lys-Asn-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DNEJSAIMVANNPA-DCAQKATOSA-N 0.000 description 1
- OWRUUFUVXFREBD-KKUMJFAQSA-N Lys-His-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O OWRUUFUVXFREBD-KKUMJFAQSA-N 0.000 description 1
- PGLGNCVOWIORQE-SRVKXCTJSA-N Lys-His-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O PGLGNCVOWIORQE-SRVKXCTJSA-N 0.000 description 1
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 1
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 1
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 1
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 1
- SKUOQDYMJFUMOE-ULQDDVLXSA-N Lys-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N SKUOQDYMJFUMOE-ULQDDVLXSA-N 0.000 description 1
- MTBBHUKKPWKXBT-ULQDDVLXSA-N Lys-Met-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MTBBHUKKPWKXBT-ULQDDVLXSA-N 0.000 description 1
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 1
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 1
- YDIKCZBMBPOGFT-PWUSVEHZSA-N Malvidin 3-galactoside Chemical compound [Cl-].COC1=C(O)C(OC)=CC(C=2C(=CC=3C(O)=CC(O)=CC=3[O+]=2)O[C@H]2[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O2)O)=C1 YDIKCZBMBPOGFT-PWUSVEHZSA-N 0.000 description 1
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 1
- BXNZDLVLGYYFIB-FXQIFTODSA-N Met-Asn-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N BXNZDLVLGYYFIB-FXQIFTODSA-N 0.000 description 1
- UYAKZHGIPRCGPF-CIUDSAMLSA-N Met-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N UYAKZHGIPRCGPF-CIUDSAMLSA-N 0.000 description 1
- MTBVQFFQMXHCPC-CIUDSAMLSA-N Met-Glu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MTBVQFFQMXHCPC-CIUDSAMLSA-N 0.000 description 1
- FYRUJIJAUPHUNB-IUCAKERBSA-N Met-Gly-Arg Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N FYRUJIJAUPHUNB-IUCAKERBSA-N 0.000 description 1
- RRIHXWPHQSXHAQ-XUXIUFHCSA-N Met-Ile-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O RRIHXWPHQSXHAQ-XUXIUFHCSA-N 0.000 description 1
- ORRNBLTZBBESPN-HJWJTTGWSA-N Met-Ile-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ORRNBLTZBBESPN-HJWJTTGWSA-N 0.000 description 1
- HZLSUXCMSIBCRV-RVMXOQNASA-N Met-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N HZLSUXCMSIBCRV-RVMXOQNASA-N 0.000 description 1
- USBFEVBHEQBWDD-AVGNSLFASA-N Met-Leu-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O USBFEVBHEQBWDD-AVGNSLFASA-N 0.000 description 1
- IMTUWVJPCQPJEE-IUCAKERBSA-N Met-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN IMTUWVJPCQPJEE-IUCAKERBSA-N 0.000 description 1
- WPTHAGXMYDRPFD-SRVKXCTJSA-N Met-Lys-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O WPTHAGXMYDRPFD-SRVKXCTJSA-N 0.000 description 1
- VBGGTAPDGFQMKF-AVGNSLFASA-N Met-Lys-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O VBGGTAPDGFQMKF-AVGNSLFASA-N 0.000 description 1
- MIAZEQZXAFTCCG-UBHSHLNASA-N Met-Phe-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 MIAZEQZXAFTCCG-UBHSHLNASA-N 0.000 description 1
- CULGJGUDIJATIP-STQMWFEESA-N Met-Tyr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 CULGJGUDIJATIP-STQMWFEESA-N 0.000 description 1
- 108010066427 N-valyltryptophan Proteins 0.000 description 1
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 1
- PDUVELWDJZOUEI-IHRRRGAJSA-N Phe-Cys-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PDUVELWDJZOUEI-IHRRRGAJSA-N 0.000 description 1
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 1
- HBGFEEQFVBWYJQ-KBPBESRZSA-N Phe-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HBGFEEQFVBWYJQ-KBPBESRZSA-N 0.000 description 1
- NPLGQVKZFGJWAI-QWHCGFSZSA-N Phe-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O NPLGQVKZFGJWAI-QWHCGFSZSA-N 0.000 description 1
- BIYWZVCPZIFGPY-QWRGUYRKSA-N Phe-Gly-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O BIYWZVCPZIFGPY-QWRGUYRKSA-N 0.000 description 1
- HTXVATDVCRFORF-MGHWNKPDSA-N Phe-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N HTXVATDVCRFORF-MGHWNKPDSA-N 0.000 description 1
- INHMISZWLJZQGH-ULQDDVLXSA-N Phe-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 INHMISZWLJZQGH-ULQDDVLXSA-N 0.000 description 1
- DNAXXTQSTKOHFO-QEJZJMRPSA-N Phe-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DNAXXTQSTKOHFO-QEJZJMRPSA-N 0.000 description 1
- AUJWXNGCAQWLEI-KBPBESRZSA-N Phe-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AUJWXNGCAQWLEI-KBPBESRZSA-N 0.000 description 1
- FENSZYFJQOFSQR-FIRPJDEBSA-N Phe-Phe-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FENSZYFJQOFSQR-FIRPJDEBSA-N 0.000 description 1
- YVXPUUOTMVBKDO-IHRRRGAJSA-N Phe-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CS)C(=O)O YVXPUUOTMVBKDO-IHRRRGAJSA-N 0.000 description 1
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 1
- GCFNFKNPCMBHNT-IRXDYDNUSA-N Phe-Tyr-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)NCC(=O)O)N GCFNFKNPCMBHNT-IRXDYDNUSA-N 0.000 description 1
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 1
- PXUQTDZNOHRWLI-QOPOCTTISA-O Primulin Natural products O(C)c1c(O)c(OC)cc(-c2c(O[C@H]3[C@H](O)[C@@H](O)[C@@H](O)[C@H](CO)O3)cc3c(O)cc(O)cc3[o+]2)c1 PXUQTDZNOHRWLI-QOPOCTTISA-O 0.000 description 1
- HPXVFFIIGOAQRV-DCAQKATOSA-N Pro-Arg-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O HPXVFFIIGOAQRV-DCAQKATOSA-N 0.000 description 1
- ZSKJPKFTPQCPIH-RCWTZXSCSA-N Pro-Arg-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSKJPKFTPQCPIH-RCWTZXSCSA-N 0.000 description 1
- ORPZXBQTEHINPB-SRVKXCTJSA-N Pro-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H]1CCCN1)C(O)=O ORPZXBQTEHINPB-SRVKXCTJSA-N 0.000 description 1
- OGRYXQOUFHAMPI-DCAQKATOSA-N Pro-Cys-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O OGRYXQOUFHAMPI-DCAQKATOSA-N 0.000 description 1
- LANQLYHLMYDWJP-SRVKXCTJSA-N Pro-Gln-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O LANQLYHLMYDWJP-SRVKXCTJSA-N 0.000 description 1
- XZONQWUEBAFQPO-HJGDQZAQSA-N Pro-Gln-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZONQWUEBAFQPO-HJGDQZAQSA-N 0.000 description 1
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 1
- MRYUJHGPZQNOAD-IHRRRGAJSA-N Pro-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 MRYUJHGPZQNOAD-IHRRRGAJSA-N 0.000 description 1
- DWGFLKQSGRUQTI-IHRRRGAJSA-N Pro-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 DWGFLKQSGRUQTI-IHRRRGAJSA-N 0.000 description 1
- BUEIYHBJHCDAMI-UFYCRDLUSA-N Pro-Phe-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BUEIYHBJHCDAMI-UFYCRDLUSA-N 0.000 description 1
- JLMZKEQFMVORMA-SRVKXCTJSA-N Pro-Pro-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 JLMZKEQFMVORMA-SRVKXCTJSA-N 0.000 description 1
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 1
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 1
- YIPFBJGBRCJJJD-FHWLQOOXSA-N Pro-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 YIPFBJGBRCJJJD-FHWLQOOXSA-N 0.000 description 1
- 108020005091 Replication Origin Proteins 0.000 description 1
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 1
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 1
- NJSPTZXVPZDRCU-UBHSHLNASA-N Ser-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N NJSPTZXVPZDRCU-UBHSHLNASA-N 0.000 description 1
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 1
- ZFVFHHZBCVNLGD-GUBZILKMSA-N Ser-His-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFVFHHZBCVNLGD-GUBZILKMSA-N 0.000 description 1
- CAOYHZOWXFFAIR-CIUDSAMLSA-N Ser-His-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CAOYHZOWXFFAIR-CIUDSAMLSA-N 0.000 description 1
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 1
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 1
- IAORETPTUDBBGV-CIUDSAMLSA-N Ser-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N IAORETPTUDBBGV-CIUDSAMLSA-N 0.000 description 1
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 1
- STIAINRLUUKYKM-WFBYXXMGSA-N Ser-Trp-Ala Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CO)=CNC2=C1 STIAINRLUUKYKM-WFBYXXMGSA-N 0.000 description 1
- 241000135402 Synechococcus elongatus PCC 6301 Species 0.000 description 1
- 241000412075 Synechococcus sp. NKBG15041c Species 0.000 description 1
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 1
- GNHRVXYZKWSJTF-HJGDQZAQSA-N Thr-Asp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GNHRVXYZKWSJTF-HJGDQZAQSA-N 0.000 description 1
- VEWZSFGRQDUAJM-YJRXYDGGSA-N Thr-Cys-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N)O VEWZSFGRQDUAJM-YJRXYDGGSA-N 0.000 description 1
- FHDLKMFZKRUQCE-HJGDQZAQSA-N Thr-Glu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHDLKMFZKRUQCE-HJGDQZAQSA-N 0.000 description 1
- VYEHBMMAJFVTOI-JHEQGTHGSA-N Thr-Gly-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VYEHBMMAJFVTOI-JHEQGTHGSA-N 0.000 description 1
- AYCQVUUPIJHJTA-IXOXFDKPSA-N Thr-His-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O AYCQVUUPIJHJTA-IXOXFDKPSA-N 0.000 description 1
- KLCCPYZXGXHAGS-QTKMDUPCSA-N Thr-His-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCSC)C(=O)O)N)O KLCCPYZXGXHAGS-QTKMDUPCSA-N 0.000 description 1
- KRDSCBLRHORMRK-JXUBOQSCSA-N Thr-Lys-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O KRDSCBLRHORMRK-JXUBOQSCSA-N 0.000 description 1
- APIDTRXFGYOLLH-VQVTYTSYSA-N Thr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)[C@@H](C)O APIDTRXFGYOLLH-VQVTYTSYSA-N 0.000 description 1
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 1
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 1
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 1
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 1
- QAXCHNZDPLSFPC-PJODQICGSA-N Trp-Ala-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QAXCHNZDPLSFPC-PJODQICGSA-N 0.000 description 1
- MQVGIFJSFFVGFW-XEGUGMAKSA-N Trp-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MQVGIFJSFFVGFW-XEGUGMAKSA-N 0.000 description 1
- CXUFDWZBHKUGKK-CABZTGNLSA-N Trp-Ala-Gly Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O)=CNC2=C1 CXUFDWZBHKUGKK-CABZTGNLSA-N 0.000 description 1
- CZWIHKFGHICAJX-BPUTZDHNSA-N Trp-Glu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 CZWIHKFGHICAJX-BPUTZDHNSA-N 0.000 description 1
- GWBWCGITOYODER-YTQUADARSA-N Trp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GWBWCGITOYODER-YTQUADARSA-N 0.000 description 1
- GIAMKIPJSRZVJB-IHPCNDPISA-N Trp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GIAMKIPJSRZVJB-IHPCNDPISA-N 0.000 description 1
- UIRVSEPRMWDVEW-RNXOBYDBSA-N Trp-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N UIRVSEPRMWDVEW-RNXOBYDBSA-N 0.000 description 1
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 description 1
- UBKKNELWDCBNCF-STQMWFEESA-N Tyr-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UBKKNELWDCBNCF-STQMWFEESA-N 0.000 description 1
- QHONGSVIVOFKAC-ULQDDVLXSA-N Tyr-Pro-His Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O QHONGSVIVOFKAC-ULQDDVLXSA-N 0.000 description 1
- ABZWHLRQBSBPTO-RNXOBYDBSA-N Tyr-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC4=CC=C(C=C4)O)N ABZWHLRQBSBPTO-RNXOBYDBSA-N 0.000 description 1
- QVYFTFIBKCDHIE-ACRUOGEOSA-N Tyr-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O QVYFTFIBKCDHIE-ACRUOGEOSA-N 0.000 description 1
- NXPDPYYCIRDUHO-ULQDDVLXSA-N Tyr-Val-His Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=C(O)C=C1 NXPDPYYCIRDUHO-ULQDDVLXSA-N 0.000 description 1
- VKYDVKAKGDNZED-STECZYCISA-N Tyr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N VKYDVKAKGDNZED-STECZYCISA-N 0.000 description 1
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 1
- VDPRBUOZLIFUIM-GUBZILKMSA-N Val-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N VDPRBUOZLIFUIM-GUBZILKMSA-N 0.000 description 1
- UDLYXGYWTVOIKU-QXEWZRGKSA-N Val-Asn-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UDLYXGYWTVOIKU-QXEWZRGKSA-N 0.000 description 1
- QGFPYRPIUXBYGR-YDHLFZDLSA-N Val-Asn-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N QGFPYRPIUXBYGR-YDHLFZDLSA-N 0.000 description 1
- XQVRMLRMTAGSFJ-QXEWZRGKSA-N Val-Asp-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XQVRMLRMTAGSFJ-QXEWZRGKSA-N 0.000 description 1
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 1
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 1
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 1
- YTNGABPUXFEOGU-SRVKXCTJSA-N Val-Pro-Arg Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTNGABPUXFEOGU-SRVKXCTJSA-N 0.000 description 1
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 1
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 1
- 108010087049 alanyl-alanyl-prolyl-valine Proteins 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- OFIDNKMQBYGNIW-UHFFFAOYSA-N arachidonic acid methyl ester Natural products CCCCCC=CCC=CCC=CCC=CCCCC(=O)OC OFIDNKMQBYGNIW-UHFFFAOYSA-N 0.000 description 1
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 1
- 244000062766 autotrophic organism Species 0.000 description 1
- GLMQHZPGHAPYIO-UHFFFAOYSA-L azanium;2-hydroxypropane-1,2,3-tricarboxylate;iron(2+) Chemical compound [NH4+].[Fe+2].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O GLMQHZPGHAPYIO-UHFFFAOYSA-L 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 229940041514 candida albicans extract Drugs 0.000 description 1
- 239000001569 carbon dioxide Substances 0.000 description 1
- 229910002092 carbon dioxide Inorganic materials 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 230000002490 cerebral effect Effects 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- FDJOLVPMNUYSCM-UVKKECPRSA-L cobalt(3+);[(2r,3s,4r,5s)-5-(5,6-dimethylbenzimidazol-1-yl)-4-hydroxy-2-(hydroxymethyl)oxolan-3-yl] [(2r)-1-[3-[(2r,3r,4z,7s,9z,12s,13s,14z,17s,18s,19r)-2,13,18-tris(2-amino-2-oxoethyl)-7,12,17-tris(3-amino-3-oxopropyl)-3,5,8,8,13,15,18,19-octamethyl-2,7, Chemical compound [Co+3].N#[C-].C1([C@H](CC(N)=O)[C@@]2(C)CCC(=O)NC[C@@H](C)OP([O-])(=O)O[C@H]3[C@H]([C@H](O[C@@H]3CO)N3C4=CC(C)=C(C)C=C4N=C3)O)[N-]\C2=C(C)/C([C@H](C\2(C)C)CCC(N)=O)=N/C/2=C\C([C@H]([C@@]/2(CC(N)=O)C)CCC(N)=O)=N\C\2=C(C)/C2=N[C@]1(C)[C@@](C)(CC(N)=O)[C@@H]2CCC(N)=O FDJOLVPMNUYSCM-UVKKECPRSA-L 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000001816 cooling Methods 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 235000020669 docosahexaenoic acid Nutrition 0.000 description 1
- 229940090949 docosahexaenoic acid Drugs 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 235000020774 essential nutrients Nutrition 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 235000019387 fatty acid methyl ester Nutrition 0.000 description 1
- 238000004108 freeze drying Methods 0.000 description 1
- 235000013402 health food Nutrition 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- IXCSERBJSXMMFS-UHFFFAOYSA-N hydrogen chloride Substances Cl.Cl IXCSERBJSXMMFS-UHFFFAOYSA-N 0.000 description 1
- 229910000041 hydrogen chloride Inorganic materials 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 239000004313 iron ammonium citrate Substances 0.000 description 1
- 235000000011 iron ammonium citrate Nutrition 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 238000001819 mass spectrum Methods 0.000 description 1
- OFIDNKMQBYGNIW-ZKWNWVNESA-N methyl arachidonate Chemical compound CCCCC\C=C/C\C=C/C\C=C/C\C=C/CCCC(=O)OC OFIDNKMQBYGNIW-ZKWNWVNESA-N 0.000 description 1
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 1
- 210000000653 nervous system Anatomy 0.000 description 1
- 239000003960 organic solvent Substances 0.000 description 1
- 239000001301 oxygen Substances 0.000 description 1
- 229910052760 oxygen Inorganic materials 0.000 description 1
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 1
- 230000029553 photosynthesis Effects 0.000 description 1
- 238000010672 photosynthesis Methods 0.000 description 1
- 235000008476 powdered milk Nutrition 0.000 description 1
- 230000002265 prevention Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 235000019512 sardine Nutrition 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 235000014102 seafood Nutrition 0.000 description 1
- 235000015170 shellfish Nutrition 0.000 description 1
- 229910001961 silver nitrate Inorganic materials 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 238000000638 solvent extraction Methods 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 238000011426 transformation method Methods 0.000 description 1
- 239000012137 tryptone Substances 0.000 description 1
- 239000002699 waste material Substances 0.000 description 1
- 239000012138 yeast extract Substances 0.000 description 1
Landscapes
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
(57)【要約】 (修正有)
【課題】 イコサペンタエン酸(EPA)などのn-3系列高
度不飽和脂肪酸をグラム陰性細菌で効率よく発現するプ
ラスミド及びこれを導入してn-3系列高度不飽和脂肪酸
を効率よく発現するラン藻の提供。
【解決手段】特定の6種類の塩基配列によってコードさ
れたイコサペンタエン酸生合成酵素群をコードする遺伝
子群を、広域宿主ベクターにクローニングして得られる
プラスミド及び該プラスミドを導入して得られるイコサ
ペンタエン酸を産生するラン藻。
【効果】 EPAなどのn-3系列高度不飽和脂肪酸は、医
薬、食品、飼料等に有用であり、物質生産に使用する生
物としてのラン藻は、価格、環境面からも有利である。
(57) [Summary] (Modified) [Problem] A plasmid that efficiently expresses n-3 series highly unsaturated fatty acids such as eicosapentaenoic acid (EPA) in Gram-negative bacteria and introduces the plasmid into the n-3 series highly unsaturated fatty acid. Provision of cyanobacteria that efficiently express saturated fatty acids. Kind Code: A1 A plasmid obtained by cloning a gene group encoding a group of eicosapentaenoic acid biosynthetic enzymes encoded by six specific base sequences into a broad-range host vector and an eicosapentaenoic acid obtained by introducing the plasmid, Produced cyanobacteria. [Effects] n-3 series polyunsaturated fatty acids such as EPA are useful for medicines, foods, feeds, etc., and cyanobacteria as an organism used for substance production are also advantageous in terms of price and environment.
Description
【0001】[0001]
【発明の属する技術分野】本発明は、イコサペンタエン
酸(以下EPAと称する)産生菌から得られたEPA生合成遺
伝子群を含有するプラスミド及び当該プラスミドを導入
して作製されるイコサペンタエン酸を産生する形質転換
体ラン藻に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a plasmid containing a group of EPA biosynthesis genes obtained from a bacterium producing icosapentaenoic acid (hereinafter referred to as EPA), and a trait for producing icosapentaenoic acid produced by introducing the plasmid. Related to transformed cyanobacteria.
【0002】[0002]
【従来の技術】イコサペンタエン酸・ドコサヘキサエン
酸に代表されるn-3系列高度不飽和脂肪酸は、細胞膜の
構成脂肪酸であり循環系・脳神経系の正常な機能の発現
に重要な役割を果たしていることが知られている。しか
し、人間はこれらを生合成することができないため主に
魚介類からの摂取に頼っている。また、慣習的に魚介類
を摂取しない民族も多く存在している。そのためn-3系
列高度不飽和脂肪酸は、生活習慣病の治療や予防のため
医薬品・健康食品として広く販売されている。さらに、
粉ミルク等の乳幼児向けの食品や養殖魚の稚魚の必須栄
養として飼料に添加するかたちでも用いられている。2. Description of the Related Art N-3 series unsaturated fatty acids such as icosapentaenoic acid and docosahexaenoic acid are constituent fatty acids of cell membranes and play an important role in the expression of normal functions of the circulatory system and the cerebral nervous system. Are known. However, humans cannot rely on biosynthesis of these, and thus rely mainly on their intake from seafood. There are also many ethnic groups who do not conventionally consume fish and shellfish. Therefore, n-3 series polyunsaturated fatty acids are widely sold as medicines and health foods for treatment and prevention of lifestyle-related diseases. further,
It is also used as an essential nutrient for foods for infants, such as powdered milk, and for cultured fry, and is added to feed.
【0003】現在n-3系列高度不飽和脂肪酸の原料とし
ては、主としてマグロ・イワシなどの魚類が用いられて
いる。これらの魚類は自ら高度不飽和脂肪酸を生合成す
るのではなく餌として取り込んだものを蓄積している。
n-3系列高度不飽和脂肪酸を生合成できる生物として知
られているのは主に海洋細菌や海洋微細藻類等の海にお
ける食物連鎖の低次生産者となる海洋微生物であるが、
海洋ラン藻でこれを産生するものは知られていない。At present, fish such as tuna and sardines are mainly used as raw materials for n-3 series polyunsaturated fatty acids. These fish accumulate what they take in as bait rather than biosynthesize polyunsaturated fatty acids themselves.
Known organisms that can biosynthesize n-3 series highly unsaturated fatty acids are marine microorganisms that are mainly low-order producers of the food chain in the sea, such as marine bacteria and marine microalgae,
No marine cyanobacteria producing this is known.
【0004】海洋ラン藻に遺伝子組換え技術を用いてn-
3系列高度不飽和脂肪酸の生合成遺伝群を導入した例と
して、海洋細菌由来のEPA生合成遺伝子群をシネココッ
カス(Synechoccus)sp.NKBG042902に導入した竹山らの報
告(H. Takeyama et al., Microbiolgy,143,2725-2731
(1997))が有る。しかし、導入したプラスミドが約50kb
と巨大なものであったため複製の際問題が生じて安定な
EPA生産はみられなかった。[0004] The marine cyanobacterium is n-
Takeyama et al., Microbiolgy (H. Takeyama et al., Who introduced the EPA biosynthetic genes derived from marine bacteria into Synechoccus sp. NKBG042902 as an example of introducing the biosynthetic genes of the three series of highly unsaturated fatty acids. , 143,2725-2731
(1997)). However, the introduced plasmid is about 50kb
Because of the huge thing, a problem occurred at the time of duplication and it was stable
No EPA production was seen.
【0005】[0005]
【発明が解決しようとする課題】海洋ラン藻は地球レベ
ルでの酸素の生産者として重要であり、その物質生産能
力を利用することにより二酸化炭素の削減にも寄与でき
る。また、ラン藻は光合成による独立栄養生物であるた
め、培養には安価な無機培地を用いることができ照明以
外には特殊な設備を要さない点で産業レベルに載せやす
いと考えられる。さらに、ラン藻は原核生物であるため
菌体全体を物質生産の原料として用いることにより廃棄
物の量を低く押さえることができるのも有利な点であ
る。このような性質を持ったラン藻に有用物質であるn-
3系列高度不飽和脂肪酸を生産させることは非常に意義
のあることと考えられる。本発明は、実用化のためにn-
3系列高度不飽和脂肪酸、とくにEPAの産生能の向上
と導入するプラスミドの安定化を目的とする。The marine cyanobacteria is important as a producer of oxygen on a global level, and can contribute to the reduction of carbon dioxide by utilizing its substance production capacity. Also, since cyanobacteria are autotrophic organisms by photosynthesis, it is considered that an inexpensive inorganic medium can be used for cultivation and no special equipment other than lighting is required, so that they can be easily put on an industrial level. Furthermore, since cyanobacteria are prokaryotes, it is also advantageous that the amount of waste can be kept low by using the whole cells as a raw material for producing substances. N-, a useful substance for cyanobacteria having such properties
The production of three series of unsaturated fatty acids is considered to be very significant. The present invention provides n-
The objective is to improve the ability to produce three-series highly unsaturated fatty acids, particularly EPA, and to stabilize the plasmid to be introduced.
【0006】[0006]
【課題を解決するための手段】本発明は、高度不飽和脂
肪酸を広範な細菌で発現するプラスミド及び該プラスミ
ドを導入し高度不飽和脂肪酸を安定に効率よく産生する
形質転換体ラン藻を提供する。すなわち、本発明は、
配列番号2,4,6,8,10,及び12で示される塩
基配列によってコードされたイコサペンタエン酸生合成
酵素群をコードする遺伝子群を、広域宿主ベクターにク
ローニングして得られるプラスミド及び3,5,7,
9,11,及び13で示される塩基配列で表されるイコ
サペンタエン酸生合成酵素群をコードする遺伝子群を、
広域宿主ベクターにクローニングして得られるプラスミ
ド、並びにこれらのプラスミドを導入して得られるイコ
サペンタエン酸を産生するラン藻に関する。DISCLOSURE OF THE INVENTION The present invention provides a plasmid expressing polyunsaturated fatty acids in a wide range of bacteria, and a transformant cyanobacterium which stably and efficiently produces polyunsaturated fatty acids by introducing the plasmid. . That is, the present invention
Plasmids obtained by cloning the genes encoding the icosapentaenoic acid biosynthetic enzymes encoded by the nucleotide sequences represented by SEQ ID NOs: 2, 4, 6, 8, 10, and 12 into a broad-range host vector; , 7,
Genes encoding the icosapentaenoic acid biosynthetic enzymes represented by the base sequences represented by 9, 11, and 13
The present invention relates to a plasmid obtained by cloning into a broad-range host vector, and a cyanobacterium producing icosapentaenoic acid obtained by introducing these plasmids.
【0007】[0007]
【発明の実施の形態】本発明において用いられるEPA
生合成酵素群をコードする遺伝子群は、例えば実施例1
あるいは特開平8−242867号に記載の方法により
単離することができる。なお、本発明において、EPA
生合成酵素群をコードする遺伝子群は、配列番号2,
4,6,8,10,及び12で示される塩基配列によっ
てコードされるものとストリンジェントな条件でハイブ
リダイズするものを包含する。DESCRIPTION OF THE PREFERRED EMBODIMENTS EPA used in the present invention
Genes encoding biosynthetic enzymes are described in, for example, Example 1.
Alternatively, it can be isolated by the method described in JP-A-8-242867. In the present invention, EPA
The genes encoding the biosynthetic enzymes are SEQ.
Includes those that hybridize under stringent conditions with those encoded by the nucleotide sequences represented by 4, 6, 8, 10, and 12.
【0008】EPA生合成酵素群をコードする遺伝子群
を他の生物に導入するためには、これらの遺伝子を運ん
で発現させる部品を備えたベクターが必要である。原核
生物に上述の遺伝子群を導入する際は、多くの場合には
そのままのプロモーター/ターミネーターが使用できる
ので、それぞれの微生物で複製のできる複製開始点を有
したベクターを用いることができる。一般的な広域宿主
ベクターが使用可能であり、例えばpJRD215(Davidson
et al.,Gene, 51,275-280(1987))及びpBBR1MCSシリー
ズ(Kovach et al., Gene, 166,175-176(1995))等を例
示することができる。これらのベクターへの上述の遺伝
子群のクローニングは、すべての遺伝子群を含むDNA
断片あるいはすべての遺伝子群を含むことになる複数の
DNA断片を用いて、慣用の方法で行うことができる。[0008] In order to introduce the genes encoding the EPA biosynthetic enzymes into other organisms, a vector having components for carrying and expressing these genes is required. When introducing the above-mentioned genes into prokaryotes, in many cases, the promoter / terminator can be used as it is, and therefore, a vector having a replication origin capable of replicating in each microorganism can be used. Common broad-range host vectors can be used, for example, pJRD215 (Davidson
et al., Gene, 51, 275-280 (1987)) and pBBR1MCS series (Kovach et al., Gene, 166, 175-176 (1995)). The cloning of the above-mentioned genes into these vectors is carried out by using DNA containing all the genes.
Using a fragment or a plurality of DNA fragments containing the entire gene group, it can be carried out in a conventional manner.
【0009】一般的にベクターにクローニングした遺伝
子をそれぞれの原核生物に導入する方法としては、形質
転換法・接合法・エレクトロポーレーション法が挙げら
れる。細胞内への遺伝子の導入を確認する方法として
は、直接的には、その発現の結果である生産物、高度不
飽和脂肪酸を検出する。上述のように作製されたプラス
ミドを導入した生物から有機溶媒抽出を行うことにより
EPA等の高度不飽和脂肪酸を得ることができる。間接的
には、導入した遺伝子の一部をプローブやプライマーと
したハイブリダイゼーションやPCRにより遺伝子が導入
されたことを確認することが可能である。In general, methods for introducing a gene cloned into a vector into each prokaryote include a transformation method, a conjugation method, and an electroporation method. As a method for confirming the introduction of a gene into a cell, directly, a product resulting from the expression and a polyunsaturated fatty acid are detected. By performing organic solvent extraction from the organism into which the plasmid prepared as described above has been introduced,
Highly unsaturated fatty acids such as EPA can be obtained. Indirectly, it is possible to confirm that the gene has been introduced by hybridization or PCR using a part of the introduced gene as a probe or primer.
【0010】本発明の上記プラスミドをラン藻に導入す
る際にも、上記の方法を適用して行うことができる。用
いられるラン藻としては、特に制限はないが、例えばシ
ネココッカスsp.NKBG15041c、シネココッカスsp.NKBG04
2902等を挙げることができる。上記プラスミドを導入し
たイコサペンタエン酸を産生するラン藻として、平成11
年11月9日に工業技術院生命工学工業技術研究所に受託
番号FERM P-17634として寄託された、15041c/pJRDEPA-S
が挙げられる。[0010] The above-described method can be applied to the introduction of the plasmid of the present invention into cyanobacteria. The cyanobacteria to be used is not particularly limited. For example, Synechococcus sp.NKBG15041c, Synechococcus sp.NKBG04
2902 and the like. As a cyanobacterium producing icosapentaenoic acid into which the above-described plasmid has been introduced,
15041c / pJRDEPA-S deposited with the National Institute of Advanced Industrial Science and Technology under the accession number FERM P-17634 on November 9, 2015.
Is mentioned.
【0011】[0011]
【発明の効果】本発明では、高度不飽和脂肪酸の生合成
に必須でないORF及び遺伝子をコードしていていない部
分を可能な限り取り除いて短縮化することにより、他の
生物に導入した場合に細胞分裂に伴う脱落を防ぎ遺伝子
群の安定で効率良い発現が達成される。According to the present invention, the cells which are not essential for the biosynthesis of polyunsaturated fatty acids are shortened by removing as much as possible the portions which do not encode the ORF and the gene, so that when introduced into other organisms, Prevents dropout due to division and achieves stable and efficient expression of genes.
【0012】[0012]
【実施例】次に、実施例及び参考例により本発明をさら
に具体的に説明する。Next, the present invention will be described more specifically with reference to examples and reference examples.
【0013】実施例1 小型化プラスミドの作製 特開平8-242867に記載のプラスミドpEPAに挿入されたEP
A生合成遺伝子群のうち、EPA生合成に必須であるORF3、
6、7、8および9のサブクローニングを行った。ORF5、
6、7、8および9については、クローニングベクターpBSI
IKS(+)(Stratagene社製)のXbaI-SpeI部位にXbaI-SpeI
断片(23,045-31,443)、XbaI部位にXbaI-XbaI断片(1
2,314-23,045)SpeI部位にSpeI-NheI断片(31,443-32,5
14)を順次サブクローニングを行いΔX4XbNh/pBS を作
製した。ΔX4XbNh/pBS をNotIで処理したものをT4DNAポ
リメラーゼにより平滑末端を作り、それをXhoIで処理し
て DNA断片Aを得た。 また、ORF3については、R/pSTV28
(HpaI断片7,951-9,129を宝酒造製ベクターpSTV28のSma
I部位に挿入したもの)をEcoRI及びPstIで処理して切り
出した断片をpBSIIKS(+)のEcoRI-PstI部位に挿入してR/
pBSを作製した。R/pBSをPstIで処理後T4DNAポリメラー
ゼにより平滑末端を作りXhoIリンカーを導入してからXh
oIでORF3を含む断片を切り出しDNA断片Bを得た。広域宿
主ベクターであるpJRD215(カナマイシン及びストレプ
トマイシン耐性)のXhoI-StuI部位に断片Aをパッカジン
ラムダDNAパケージングシステム(Promega社製)により
導入した後に、XhoI部位に断片BをDNA ライゲーション
キット (宝酒造製)を用いて導入しプラスミドを完成
させた。これをpJRDEPA-Sと命名した(図1)。Example 1 Preparation of miniaturized plasmid EP inserted into plasmid pEPA described in JP-A-8-242867
ORF3, which is essential for EPA biosynthesis,
6, 7, 8 and 9 were subcloned. ORF5,
For 6, 7, 8 and 9, the cloning vector pBSI
XbaI-SpeI at the XbaI-SpeI site of IKS (+) (Stratagene)
Fragment (23,045-31,443), XbaI-XbaI fragment (1
2,314-23,045) SpeI-NheI fragment (31,443-32,5)
14) was sequentially subcloned to prepare ΔX4XbNh / pBS. ΔX4XbNh / pBS treated with NotI was blunt-ended with T4 DNA polymerase and treated with XhoI to obtain DNA fragment A. For ORF3, R / pSTV28
(HpaI fragment 7,951-9,129 was converted to Sma of Takara Shuzo vector pSTV28.
The fragment cut out after treating with EcoRI and PstI) was inserted into the EcoRI-PstI site of pBSIIKS (+) to
pBS was prepared. After treating R / pBS with PstI, blunt ends were created with T4 DNA polymerase and an XhoI linker was introduced, and then Xh
A fragment containing ORF3 was cut out with oI to obtain a DNA fragment B. Fragment A was introduced into the XhoI-StuI site of pJRD215 (kanamycin and streptomycin resistance), which is a broad-range host vector, using the Pacazine Lambda DNA packaging system (Promega), and fragment B was inserted into the XhoI site using a DNA ligation kit (Takara Shuzo). ) To complete the plasmid. This was named pJRDEPA-S (Figure 1).
【0014】参考例1 pJRDEPA-Sを導入した大腸菌で
のEPA生産 pJRDEPA-Sを用いて大腸菌K12/JM109を常法により形質転
換した。50μg/mlのカナマイシンを含むLB寒天培地(ト
リプトン1%、酵母エキス0.5%、NaCl 1%、寒天1.5
%)を用いて選別を行いJM109/pJRDEPA-Sのコロニーを
得た。このコロニーを50μg/mlのカナマイシンを含む2m
lLB液体培地に接種し25℃で24時間培養した。これを遠
心分離して菌体を集め培地を除いた後、10mMトリス塩酸
緩衝液 pH7.5を加えて懸濁し再度遠心分離を行って洗浄
した。洗浄菌体に少量の10mMトリス塩酸緩衝液 pH7.5を
加えて再懸濁し凍結乾燥をした。乾燥菌体に5%塩化水
素を含むメタノルーを1ml加え80℃1時間処理して脂肪酸
のメチルエステル化を行った。冷却後同量のn-ヘキサン
で3回抽出しn-ヘキサン層を減圧乾固して20μlのメタノ
ールに溶解し試料とした。この試料の一部をガスクロマ
トグラフィー(以下GLCと略す)により分析した。その
結果、標品のEPAメチルエステルと同様の保持時間にピ
ークが検出された。そのピークの面積比から算出される
総脂肪酸に対する割合は5.7%であった。また、このピ
ークはガスクロマトグラフィー質量スペクトル(以下GC
-MSと略す)分析により親イオン(M) m/z316、ベースピ
ークm/z79でありEPAメチルエステル標品と一致した。Reference Example 1 Production of EPA by Escherichia coli Transfected with pJRDEPA-S Escherichia coli K12 / JM109 was transformed by a conventional method using pJRDEPA-S. LB agar medium containing 50 μg / ml kanamycin (1% tryptone, 0.5% yeast extract, 1% NaCl, 1.5% agar
%) To obtain a colony of JM109 / pJRDEPA-S. 2m containing 50μg / ml kanamycin
The cells were inoculated into an lLB liquid medium and cultured at 25 ° C for 24 hours. After the cells were collected by centrifugation to collect the cells and the medium was removed, 10 mM Tris-HCl buffer (pH 7.5) was added to suspend the cells, followed by centrifugation again for washing. A small amount of 10 mM Tris-HCl buffer (pH 7.5) was added to the washed cells, resuspended, and freeze-dried. To the dried cells, 1 ml of methanol containing 5% hydrogen chloride was added, and the mixture was treated at 80 ° C. for 1 hour to perform fatty acid methyl esterification. After cooling, the same amount of n-hexane was extracted three times, and the n-hexane layer was dried under reduced pressure and dissolved in 20 μl of methanol to obtain a sample. A part of this sample was analyzed by gas chromatography (hereinafter abbreviated as GLC). As a result, a peak was detected at a retention time similar to that of the standard EPA methyl ester. The ratio to the total fatty acids calculated from the area ratio of the peak was 5.7%. In addition, this peak has a gas chromatography mass spectrum (GC
The analysis showed that the parent ion (M) was m / z 316 and the base peak was m / z 79, which was consistent with the EPA methyl ester standard.
【0015】実施例2−1 ラン藻のトランスコンジュ
ゲーション ラン藻シネココッカス(Synechococcus) sp. NKBG15041c
(K.Sode et al., Appl. Microbiol. Biotechnol., 37,
369-373 (1992))を3%NaClを含むBG11(表1)液体培地
(BG11M)1,000-1,500 Luxの光照射下23℃で 4-5日間
静置培養を行い(A550<1)、室温で3,000rpm20分間遠
心して集め、BG11M液体培地に懸濁して3回洗った。この
ラン藻の濃度を分光光度計でA550=1に合わせた。上記p
JRDEPA-Sでトランスコンジュゲーション用大腸菌S-17
(Simon et al., Bio/Technolgy,118,640-659(1983) )
を形質転換し、50μg/mlのカナマイシンを含むLB寒天培
地に37℃一晩培養して生育したコロニーをBG11Mに懸濁
したものを分光光度計で濃度を測定しA650=10に合わせ
た。以上のように調製したラン藻と大腸菌を等量ずつラ
ン藻A550=1に対し大腸菌A650=10になるように混合し
た。この菌液を遠心後元の10分の1になるように懸濁し
たものをBG11に15mMNaClを加えた1.2%寒天培地に50μl
ずつスポットし、照明下24〜48時間23℃で培養した。寒
天培地上にできた緑色のコロニーをメスで切り出して1m
lBG11Mに懸濁し75μg/mlカナマイシンを含むBG11M液体
培地に50分の1の濃度になるように加え照明下23℃で培
養した。Example 2-1 Transconjugation of Cyanobacterium Cyanobacterium Synechococcus sp. NKBG15041c
(K. Sode et al., Appl. Microbiol. Biotechnol., 37,
369-373 (1992)) BG11 containing 3% NaCl (Table 1) liquid medium (BG11M) 1,000-1,500 by light irradiation under 23 ° C. of Lux performed 4-5 days static culture (A 550 <1), The cells were collected by centrifugation at 3,000 rpm for 20 minutes at room temperature, suspended in BG11M liquid medium, and washed three times. The concentration of the cyanobacteria was adjusted to A 550 = 1 with a spectrophotometer. Above p
E. coli S-17 for transconjugation with JRDEPA-S
(Simon et al., Bio / Technolgy, 118, 640-659 (1983))
Was cultured on an LB agar medium containing 50 μg / ml of kanamycin at 37 ° C. overnight, and the colonies grown were suspended in BG11M. The concentration was measured with a spectrophotometer, and adjusted to A 650 = 10. The cyanobacterium and Escherichia coli prepared as described above were mixed in equal amounts so that Escherichia coli A650 = 10 with respect to the cyanobacteria A550 = 1. After suspending this bacterial solution to 1/10 of the original after centrifugation, 50 μl of the suspension was added to a 1.2% agar medium obtained by adding 15 mM NaCl to BG11.
Each was spotted and cultured at 23 ° C. for 24 to 48 hours under light. Cut out the green colony formed on the agar medium with a scalpel and 1 m
The cells were suspended in lBG11M and added to a BG11M liquid medium containing 75 μg / ml kanamycin so as to have a concentration of 1/50, and cultured at 23 ° C. under illumination.
【0016】表1 BG11の組成 ━━━━━━━━━━━━━━━━ NaNO3 1.5g MgSO4・7H2O 75mg CaCl2・2H2O 36mg クエン酸 6mg Na2EDTA 1mg Na2CO3 20mg 微量金属混合物 A5* 1ml K2HPO4 30mg クエン酸 鉄アンモニウム 6mg ビタミンB12 1mg 脱イオン蒸留水 -1l pH 7.2-7.4 ━━━━━━━━━━━━━━━━ *微量金属混合物A5 ━━━━━━━━━━━━━━━ H3BO4 2.86g MnCl2・6H2O 1.81g ZnSO4・7H2O 222mg Na2MoO4・2H2O 390mg CuSO4・5H2O 79mg Co(NO3)2・7H2O 49.4mg 脱イオン蒸留水 -1l ━━━━━━━━━━━━━━━Table 1 Composition of BG11 ━━━━━━━━━━━━━━━━ NaNO 3 1.5 g MgSO 4 .7H 2 O 75 mg CaCl 2 .2H 2 O 36 mg Citric acid 6 mg Na 2 EDTA 1 mg Na 2 CO 3 20mg Trace metal mixture A5 * 1ml K 2 HPO 4 30mg Iron ammonium citrate 6mg Vitamin B 12 1mg Deionized distilled water -1L pH 7.2-7.4 ━━━━━━━━━━━━━━━━ * trace metal mixture A5 ━━━━━━━━━━━━━━━ H 3 BO 4 2.86g MnCl 2 · 6H 2 O 1.81g ZnSO 4 · 7H 2 O 222mg Na 2 MoO 4 · 2H 2 O 390mg CuSO 4 · 5H 2 O 79mg Co (NO 3) 2 · 7H 2 O 49.4mg deionized distilled water -1 l ━━━━━━━━━━━━━━━
【0017】実施例2−2 ラン藻のシングルコロニー
アイソレーション pJRDEPA-Sを有するラン藻をA730=3-4に生育させ、10-4-
10-5希釈したものを75μg/mlカナマイシンを含むBG11M
の寒天培地に塗布し23℃照明下約1ヶ月培養してシング
ルコロニーを形成させた。このコロニーを75μg/mlカナ
マイシンを含むBG11M液体培地に移して培養した。得ら
れた組換え体ラン藻を15041c/pJRDEPA-Sと命名した(受
託番号FERM P-17634)。[0017] The cyanobacteria with a single colony isolation pJRDEPA-S of Example 2-2 cyanobacteria grown to A 730 = 3-4, 10 -4 -
10-5 dilution of BG11M containing 75 μg / ml kanamycin
And cultured for about 1 month under 23 ° C. illumination to form a single colony. This colony was transferred to a BG11M liquid medium containing 75 μg / ml kanamycin and cultured. The obtained recombinant cyanobacterium was named 15041c / pJRDEPA-S (Accession No. FERM P-17634).
【0018】実施例2−3 ラン藻の菌体脂質の調製及
び分析 pJRDEPA-S を有するラン藻及び非組換え体のラン藻を培
養しその菌体を実施例1-2と同様に遠心分離で集め洗浄
後凍結乾燥を行った。乾燥菌体に実施例1-2と同様の脂
肪酸のメチルエステル化を行いn-ヘキサン抽出し減圧乾
固後メタノールに溶解して粗試料とした。この粗試料を
シリカゲル薄層プレートにスポットしn-ヘキサン:エチ
ルエーテル(4:1、v/v)で展開してメチルエステルを
分離した。プリムリン発色により検出したメチルエステ
ル画分を掻き取り、メタノール:10%食塩(9:1、v/
v)1ml及び水1mlを加えたものから上述と同様にn-ヘキ
サン抽出を行い試料を調製した。この試料の一部をGLC
にかけて分析した。標品のEPAメチルエステルと保持時
間が一致するピーク及びその約1分前にもほぼ同じ高さ
のピークが検出された。2本ののピークはその面積比か
ら求めた総脂肪酸に対する割合はそれぞれ3.8%及び2.6
%であった。主な脂肪酸の総脂質に占める割合を表2に
示す。残りの試料を硝酸銀シリカゲル薄層プレートにス
ポットしn-ヘキサン:エチルエーテル(85:15、v/v)
で3回展開して2',7'-ジクロロフルオレセイン発色によ
り検出した多価不飽和脂肪酸画分を掻き取り上述と同様
な方法で GC-MS分析用試料を調製した。GC-MS分析によ
り、EPAメチルエステルと保持時間が一致するピークの
分子量は316でEPAと一致した。また、その前のピークは
分子量318で20:4であるが、アラキドン酸メチルエステ
ル(20:4(n-6))のガスクロマトグラフィーの保持時
間とは異なっていた。このピークは、標品の20:4(n-
3)メチルエステルの保持時間及びGC-MSのの解裂パター
ンと一致しており、20:4(n-3)メチルエステルと同定
された。Example 2-3 Preparation and Analysis of Cell Lipids of Cyanobacteria Cyanobacteria having pJRDEPA-S and non-recombinant cyanobacteria were cultured, and the cells were centrifuged in the same manner as in Example 1-2. After washing and freeze-drying. The dried cells were subjected to the same methyl esterification of fatty acids as in Example 1-2, extracted with n-hexane, dried under reduced pressure, and dissolved in methanol to obtain a crude sample. The crude sample was spotted on a silica gel thin layer plate and developed with n-hexane: ethyl ether (4: 1, v / v) to separate a methyl ester. The methyl ester fraction detected by primulin coloring was scraped off, and methanol: 10% saline (9: 1, v / v
v) From 1 ml of water and 1 ml of water, n-hexane extraction was performed in the same manner as above to prepare a sample. Part of this sample was GLC
And analyzed. A peak having the same retention time as that of the EPA methyl ester of the sample and a peak having almost the same height were detected about 1 minute before that. The ratio of the two peaks to the total fatty acids determined from the area ratio was 3.8% and 2.6%, respectively.
%Met. Table 2 shows the ratio of main fatty acids to total lipids. The remaining sample was spotted on a silver nitrate silica gel thin plate and n-hexane: ethyl ether (85:15, v / v)
Then, the polyunsaturated fatty acid fraction detected by 2 ', 7'-dichlorofluorescein coloring was scraped off, and a sample for GC-MS analysis was prepared in the same manner as described above. According to GC-MS analysis, the molecular weight of the peak having the same retention time as EPA methyl ester was 316, which was consistent with EPA. The previous peak had a molecular weight of 318 and a ratio of 20: 4, which was different from the retention time of arachidonic acid methyl ester (20: 4 (n-6)) by gas chromatography. This peak corresponds to the standard 20: 4 (n-
3) It was consistent with the retention time of methyl ester and the cleavage pattern of GC-MS, and was identified as 20: 4 (n-3) methyl ester.
【0019】 表2 pJRDEPA-Sを導入したラン藻のおもな脂肪酸組成(総脂肪酸に対する%) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 脂肪酸 pJRDEPA-Sを導入したラン藻 非組換え体ラン藻 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 16:0 32.6 33.1 16:1 (n-7) 12.2 13.1 18:1 (n-9) 12.8 19.1 18:2 (n-6) 14.9 17.9 18:3 (n-3) 6.7 7.1 20:4 (n-3) 2.6 N.D. 20:5 (n-3) 3.8 N.D. ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ N.D. :検出せずTable 2 Main fatty acid composition of cyanobacteria introduced with pJRDEPA-S (% of total fatty acids)ラ ン Cyanobacteria introduced with fatty acid pJRDEPA-S Non-recombinant cyanobacterium ━━━━━━━━━━━━━━━━━━━━━━━━ ━━━━━━━━━━━━ 16: 0 32.6 33.1 16: 1 (n-7) 12.2 13.1 18: 1 (n-9) 12.8 19.1 18: 2 (n-6) 14.9 17.9 18: 3 (n-3) 6.7 7.1 20: 4 (n-3) 2.6 ND 20: 5 (n-3) 3.8 ND ━━━━━━━━━━━━━━━━━━━━━━━ ━━━━━━━━━━━━━ ND: Not detected
【0020】実施例3. プラスミドの比較 pJRDEPA-Sと同時にpJRDEPA(H. Takeyama et al., Micr
obiolgy,143,2725-2731(1997))を実施例2-1と同様にシ
ネココッカスsp.NKBG15041cに導入して75μg/mlのカナ
マイシンを加えたBG11M液体培地1lで培養を行った。実
施例2-3と同様に脂肪酸メチルエステルを調製しGLCで分
析を行った。その結果を表3に示す。Example 3 Comparison of plasmids pJRDEPA-S and pJRDEPA (H. Takeyama et al., Micr
obiolgy, 143, 2725-2731 (1997)) was introduced into Synechococcus sp. NKBG15041c in the same manner as in Example 2-1 and cultured in 1 l of a BG11M liquid medium supplemented with 75 μg / ml kanamycin. A fatty acid methyl ester was prepared and analyzed by GLC in the same manner as in Example 2-3. The results are shown in Table 3.
【0021】 表3 NKBG15041CのEPA生産能と導入したプラスミド ━━━━━━━━━━━━━━━━━━━━━━━━━━━ プラスミド pJRDEPA-S pJRDEPA ━━━━━━━━━━━━━━━━━━━━━━━━━━━ EPA(%*) 1.79 0.03 サイズ(Kb) 31 47 ━━━━━━━━━━━━━━━━━━━━━━━━━━━ *総脂肪酸に対する% このようにpJRDEPA-Sを導入したNKBG15041cでは、pJRDE
PAを導入したものに比較して飛躍的にEPA生産能が増加
した。Table 3 EPA-producing ability of NKBG15041C and introduced plasmid ━━━━━━━━━━━━━━━━━━━━━━━━━━━ Plasmid pJRDEPA-S pJRDEPA プ ラ ス ミ ド━━━━━━━━━━━━━━━━━━━━━━━ EPA (% *) 1.79 0.03 Size (Kb) 31 47 ━━━━━━━━━━━━━━ ━━━━━━━━━━━━━ *% of total fatty acids In NKBG15041c introduced with pJRDEPA-S, pJRDE
The EPA production capacity increased dramatically compared to the case where PA was introduced.
【0022】参考例2 ラン藻の培養条件 pJRDEPA-S を有するラン藻を通常(23℃、1,000Lux、静
置)、低温(18℃、800Lux、振とう)及び弱照明(23
℃、40Lux、静置)の3条件で培養し、それぞれの脂肪酸
組成を実施例2-3を同様の方法でGLCにより分析した結果
を表3に示す。EPA及び20:4(n-3)の総脂肪酸に対する
割合は、通常に比べ生育を抑制した場合(低温及び弱照
明)の方が高く、それぞれ3.8%・2.6%に対し、6.0%
・5.7%及び5.2%・6.6%であった。Reference Example 2 Cultivation Conditions for Cyanobacteria Cyanobacteria having pJRDEPA-S were cultured at normal temperature (23 ° C., 1,000 Lux, standing), at low temperature (18 ° C., 800 Lux, shaking) and under low light (23 ° C.).
(C, 40 Lux, standing)), and the fatty acid composition of each was analyzed by GLC in the same manner as in Example 2-3. Table 3 shows the results. The ratio of EPA and 20: 4 (n-3) to total fatty acids is higher when growth is suppressed (low temperature and low light) than normal, 6.0% compared to 3.8% and 2.6%, respectively.
・ 5.7%, 5.2% and 6.6%.
【0023】 表4 おもな脂肪酸組成に及ぼす培養条件の影響(総脂肪酸に対する%) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 脂肪酸 低温 通常 弱照明 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 16:0 37.4 32.6 36.4 16:1 (n-7) 9.3 12.2 9.1 18:1 (n-9) 8.3 12.8 7.5 18:2 (n-6) 8.0 14.9 8.9 18:3 (n-3) 4.8 6.7 2.6 20:4 (n-3) 6.0 2.6 5.2 20:5 (n-3) 5.7 3.8 6.6 A730 4.28 4.48 1.51 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 低温:18℃、800Lux、振とう 通常:23℃、1,000lux、静置 弱照明:23℃、40Lux、静置Table 4 Influence of culture conditions on main fatty acid composition (% of total fatty acids)脂肪酸 Fatty acid Low temperature Normal Low light ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 16: 0 37.4 32.6 36.4 16: 1 (n-7) 9.3 12.2 9.1 18: 1 (n-9) 8.3 12.8 7.5 18: 2 (n-6) 8.0 14.9 8.9 18: 3 (n-3) 4.8 6.7 2.6 20: 4 (n-3) 6.0 2.6 5.2 20: 5 ( n-3) 5.7 3.8 6.6 A 730 4.28 4.48 1.51 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Low temperature: 18 ℃, 800Lux, shake Normal: 23 ℃, 1,000lux, stationary Low light: 23 ℃, 40Lux, stationary
【0024】参考例3 pJRDEPAに関しては、ラン藻を継代すると、ラン藻の分
裂にプラスミドの複製が間に合わなくなりプラスミドが
失われていったとの報告がある(H. Takeyama etal., M
icrobiolgy,143,2725-2731(1997))。しかし、 pJRDEPA
-Sを有するラン藻ではそのような現象は観察されず、カ
ナマイシンの存在下で継代を繰り返すことができた。Reference Example 3 With respect to pJRDEPA, it has been reported that when the cyanobacterium was subcultured, the plasmid was lost in time for the replication of the cyanobacterium and the plasmid was lost (H. Takeyama et al., M.
icrobiolgy, 143,2725-2731 (1997)). But pJRDEPA
No such phenomenon was observed in the cyanobacteria having -S, and the passage could be repeated in the presence of kanamycin.
【0025】[0025]
【配列表】 <110> Sagami Chemical Reserach Center, Japan Bioindustry Association, Director-General of Agency of Industrial Science and Technology. <120> A plasmid carrying the eicosapentanoic acid synthesis gene claster and a transgenic cyanobacterium producing eicosapentanoic acid. <130> SO18226 <160> 13[Sequence List] <110> Sagami Chemical Reserach Center, Japan Bioindustry Association, Director-General of Agency of Industrial Science and Technology. <120> A plasmid carrying the eicosapentanoic acid synthesis gene claster and a transgenic cyanobacterium producing eicosapentanoic acid. <130> SO18226 <160> 13
【0026】 <210> 1 <211> 37895 <212> DNA <213> Shewanella putrefaciens SCRC-2874 (FERM BP-1625) <400> 1 Position gatctcttac aaagaaacta tctcaatgtg aatttaacct taattccgtt taattacggc 60 ctgatagagc atcacccaat cagccataaa actgtaaagt gggtactcaa aggtggctgg 120 gcgattcttc tcaaatacaa agtgcccaac ccaagcaaat ccatatccga taacaggtaa 180 aagtagcaat aaaccccagc gctgagttag taatacataa gcgaataata ggatcactaa 240 actactgccg aaatagtgta atattcgaca gtttctatgc tgatgttgag ataaataaaa 300 agggtaaaat tcagcaaaag aacgatagcg cttactcatt actcacacct cggtaaaaaa 360 gcaactcgcc attaacttgg ccaatcgtca gttgttctat cgtctcaaag ttatgccgac 420 taaataactc tatatgtgca ttatgattag caaaaactcc gataccatca agatgaagtt 480 gttcatcaca ccaactcaaa actgcgtcga taagcttact gccatagccc ttgccttgct 540 ccacatttgc gatagcaata aactgtaaaa tgccacattg gccacttggt aagctctcta 600 taatctgatt ttctttgtta ataagtgcct gagttgaata ccaaccagta cttaacaaca 660 tctttaaacg ccaatgccaa aaacgcgctt cacctaaggg aacctgctga gtcactatgc 720 aggctacgcc tatcaatcta tccccaacga acataccaat aagtgcttgc tcctgttgcc 780 agagctcatt gagttcttct cgaatagccc cgcgaagctt ttgctcatac tgcgcttgat 840 caccactaaa aagtgtttcg ataaaaaagg gatcatcatg ataggcgtta tagagaatag 900 aggctgctat gcgtaaatct tctgccgtga gataaactgc acgacactct tccatggctt 960 gatcttccat tgttattgtc cttgaccttg atcacacaac accaatgtaa caagactgta 1020 tagaagtgca attaataatc aattcgtgca ttaagcaggt cagcatttct ttgctaaaca 1080 agctttattg gctttgacaa aactttgcct agactttaac gatagaaatc ataatgaaag 1140 agaaaagcta caacctagag gggaataatc aaacaactgc taagatctag ataatgtaat 1200 aaacaccgag tttatcgacc atacttagat agagtcatag caacgagaat agttatggat 1260 acaacgccgc aagatctatc acacctgttt ttacagctag gattagcaaa tgatcaaccc 1320 gcaattgaac agtttatcaa tgaccatcaa ttagcggaca atatattgct acatcaagca 1380 agcttttgga gcccatcgca aaagcacttc ttaattgagt catttaatga agatgcccag 1440 tggaccgaag tcatcgacca cttagacacc ttattaagaa aaaactaacc attacaacag 1500 caactttaaa ttttgccgta agccatctcc ccccacccca caacagcgtt gttgcttatg 1560 accactggag tacattcgtc tttagtcgtt ttaccatcac catgggtacg ttgagtgcga 1620 taaaaaagca cataaacttc tttatcggcc tgaatatagg cttcgttaaa atcagctgtt 1680 cccattaaag taaccacttg ctctttactc atgcctagag atatctttgt caaattgtca 1740 cggtttttat cttgagtttt ctcccaagca ccgtgattat cccagtcaga ttccccatca 1800 ccaacattga ccacacagcc cgttagccct aagcttgcaa tcccaaaaca tgctaaacct 1860 aataatttat ttttcatttt aacttcctgt tatgacatta tttttgctta gaagaaaagc 1920 aacttacatg ccaaaacaca agctgttgtt ttaaatgact ttatttatta ttagcctttt 1980 aggatatgcc tagagcaata ataattacca atgtttaagg aatttgacta actatgagtc 2040 cgattgagca agtgctaaca gctgctaaaa aaatcaatga acaaggtaga gaaccaacat 2100 tagcattgat taaaaccaaa cttggtaata gcatcccaat gcgcgagtta atccaaggtt 2160 tgcaacagtt taagtctatg agtgcagaag aaagacaagc aatacctagc agcttagcaa 2220 cagcaaaaga aactcaatat ggtcaatcaa gcttatctca atctgaacaa gctgatagga 2280 tcctccagct agaaaacgcc ctcaatgaat taagaaacga atttaatggg ctaaaaagtc 2340 aatttgataa cttacaacaa aacctgatga ataaagagcc tgacaccaaa tgcatgtaat 2400 tgaactacga tttgaatgtt ttgataacac cacgattact gcagcagaaa aagccattaa 2460 tggtttgctt gaagcttatc gagccaatgg ccaggttcta ggtcgtgaat ttgccgttgc 2520 atttaacgat ggtgagttta aagcacgcat gttaacccca gaaaaaagca gcttatctaa 2580 acgctttaat agtccttggg taaatagtgc actcgaagag ctaaccgaag ccaaattgct 2640 tgcgccacgt gaaaagtata ttggccaaga tattaattct gaagcatcta gccaagacac 2700 accaagttgg cagctacttt acacaagtta tgtgcacatg tgctcaccac taagaaatgg 2760 cgacaccttg cagcctattc cactgtatca aattccagca actgccaacg gcgatcataa 2820 acgaatgatc cgttggcaaa cagaatggca agcttgtgat gaattgcaaa tggccgcagc 2880 tactaaagct gaatttgccg cacttgaaga gctaaccagt catcagagtg atctatttag 2940 gcgtggttgg gacttacgtg gcagagtcga atacttgacg aaaattccga cctattacta 3000 tttataccgt gttggcggtg aaagcttagc agtagaaaag cagcgctctt gtcctaagtg 3060 tggcagtcaa gaatggctgc tcgataaacc attattggat atgttccatt ttcgctgtga 3120 cacctgccgc atcgtatcta atatctcttg ggaccattta taactcttcc gagtcttatc 3180 acactagagt ttagtcagca taaaaatggc gcttatattt caattaaaag aaatataagc 3240 gccattttca tcgatactat atatcagcag actattttcc gcgtaaatta gcccacatta 3300 atttcattct ttgccagatc cctggatgat ctagttgtgg catcgactct tcaataggtt 3360 taaccgcagg tgtaaccctt ggagtcaatt cgtttataaa ctcgtttaaa ctgtcactta 3420 atttaacgct ttgtacttca cctggaattt caatccatac gctgccatca ctattattaa 3480 ccgtcaacat tttatcttca tcatcaagaa taccaataaa ccaagtcggc tcttgcttaa 3540 gctttctctt catcattaaa tgaccaatga tgttttgttg taagtattca aaatcagttt 3600 gatcccacac ttggattagc tcaccttggc cccattgtga gtcaaaaaat agcggtgcag 3660 aaaaatgact gccaaaaaat ggattaattt ctgcagataa tgtcatttca agtgctgttt 3720 caacattagc aaattcacca ggttgttgac gtacaaccga ttgccaaaac actgcgccat 3780 cggagcccgc ttcggcgaca acacactcag acttttgtcc ttgcgcataa tatcttggct 3840 gttcaccaag cttatccatg taggcttgtt gatatttaga taaaaaaaga tctaaagcag 3900 gtaaagaaga cacttaagcc agttccaaaa tcagttataa taggggtcta ttttgacatg 3960 gaaaccgtat tgatgacaca acatcatgat ccctacagta acgcccccga actttctgaa 4020 ttaactttag gaaagtcgac cggttatcaa gagcagtatg atgcatcttt actacaagcg 4080 tgccgcgtaa attaaaccgt gatgctatcg gtctaaccaa tgagctacct tttcatggct 4140 gtgatatttg gactggctac gaactgtctt ggctaaatgc taaaggcaag ccaatgattg 4200 ctattgcaga ctttaaccta agttttgata gtaaaaatct gatcgagtct aagtcgttta 4260 agctgtattt aaacagctat aaccaaacac gatttgatag cgttcaagcg gttcaagaac 4320 gtttaactga agacttaagc gcctgtgccc aaggcacagt tacggtaaaa gtgattgaac 4380 ctaagcaatt taaccacctg agagtggttg atatgccagg tacctgcatt gacgatttag 4440 atattgaagt tgatgactat agctttaact ctgactatct caccgacagt gttgatgaca 4500 aagtcatggt tgctgaaacg ctaacgtcaa acttattgaa atcaaactgc ctaatcactt 4560 ctcagcctga ctggggtaca gtgatgatcc gttatcaagg gcctaagata gaccgtgaaa 4620 agctacttag atatctgatt tcatttagac agcacaatga atttcatgag cagtgtgttg 4680 agcgtatatt tgttgattta aagcactatt gccaatgtgc caaacttact gtctatgcac 4740 gttatacccg ccgtggtggt ttagatatca acccatatcg tagcgacttt gaaaaccctg 4800 cagaaaatca gcgcctagcg agacagtaat tgattgcagt acctacaaaa aacaatgcct 4860 ataagccaag cttatgggca tttttatatt atcaacttgt catcaaacct cagccgccaa 4920 gccttttagt tttatcgcta aattaagccg ctctctcagc caaatatttg caggattttg 4980 ctgtaattta tggctccaca ccatgaaata ctctatcggc tctaccgcaa aaggtaagtc 5040 aaatacctgt aagccaaaca gcttggcata ttcgtcagtg tgggcttttg acgcgatagc 5100 taacgcatca ctttttgagg caaccgacat catacttaat attgatgatt gctcgctgtg 5160 catttgcctt gccggtaaca cctgtttagt cagcaagtcg gcaacactta aattgtagcg 5220 gcgcatctta aaaataatat gcttttcatt aaagtattgc tcttgcgtca acccaccttg 5280 gatccttggg tgagcatttc gtgccacaca aactaattta tcctgcatta ctttttgact 5340 cttaaatgcc gcagattctg gcagccaaat atctaaggct aaatccacct tttctagttg 5400 taggtccatc tgcaactctt cttcaatgag cggcggctca cgaaatacaa tattaattgc 5460 agtgccctgt aacacttgct caatttgatc ttgcaagagt tgtattgccg actcgctggc 5520 atacacataa aaagttcgct cacttgaagt ggggtcaaat gcttcaaagc tagtcgcaac 5580 ttgctcaatt gttgacatag cgcccgcgag ctgttgataa agcgtcatcg cacttgcggt 5640 aggtttaact cccctaccca ctcgagtaaa caactcttct ccaacaatac tttttagcct 5700 cgaaatcgca ttactaaccg acgactgagt caaatccagc tcttctgccg cccggctaaa 5760 agatgaggtg cgatacaccg cagtaaaaac gcgaaataaa ttaagatcaa aagctttttg 5820 ctgcgacata aatcagctat ctccttatcc ttatccttat ccttataaaa agttagctcc 5880 agagcactct agctcaaaaa caactcagcg tattaagcca atattttggg aactcaatta 5940 atattcataa taaaagtatt cataatataa ataccaagtc ataatttagc cctaattatt 6000 aatcaattca agttacctat actggcctca attaagcaaa tgtctcatca gtctccctgc 6060 aactaaatgc aatattgaga cataaagctt tgaactgatt caatcttacg agggtaactt 6120 atgaaacaga ctctaatggc tatctcaatc atgtcgcttt tttcattcaa tgcgctagca 6180 gcgcaacatg aacatgacca catcactgtt gattacgaag ggaaagccgc aacagaacac 6240 accatagctc acaaccaagc tgtagctaaa acacttaact ttgccgacac gcgtgcattt 6300 gagcaatcgt ctaaaaatct agtcgccaag tttgataaag caactgccga tatattacgt 6360 gccgaatttg cttttattag cgatgaaatc cctgactcgg ttaacccgtc tctctaccgt 6420 caggctcagc ttaatatggt gcctaatggt ctgtataaag tgagcgatgg catttaccag 6480 gtccgcggta ccgacttatc taaccttaca cttatccgca gtgataacgg ttggatagca 6540 tacgatgttt tgttaaccaa agaagcagca aaagcctcac tacaatttgc gttaaagaat 6600 ctacctaaag atggcgattt acccgttgtt gcgatgattt actcccatag ccatgcggac 6660 cactttggcg gagctcgcgg tgttcaagag atgttccctg atgtcaaagt ctacggctca 6720 gataacatca ctaaagaaat tgtcgatgag aacgtacttg ccggtaacgc catgagccgc 6780 cgcgcagctt atcaatacgg cgcaacactg ggcaaacatg accacggtat tgttgatgct 6840 gcgctaggta aaggtctatc aaaaggtgaa atcacttacg tcgccccaga ctacacctta 6900 aacagtgaag gcaaatggga aacgctgacg attgatggtc tagagatggt gtttatggat 6960 gcctcgggca ccgaagctga gtcagaaatg atcacttata ttccctctaa aaaagcgctc 7020 tggacggcgg agcttaccta tcaaggtatg cacaacattt atacgctgcg cggcgctaaa 7080 gtacgtgatg cgctcaagtg gtcaaaagat atcaacgaaa tgatcaatgc ctttggtcaa 7140 gatgtcgaag tgctgtttgc ctcgcactct gcgccagtgt ggggtaacca agcgatcaac 7200 gatttcttac gcctacagcg tgataactac ggcctagtgc acaatcaaac cttgagactt 7260 gccaacgatg gtgtcggtat acaagatatt ggcgatgcga ttcaagacac gattccagag 7320 tctatctaca agacgtggca taccaatggt taccacggca cttatagcca taacgctaaa 7380 gcggtttata acaagtatct aggctacttc gatatgaacc cagccaacct taatccgctg 7440 ccaaccaagc aagaatctgc caagtttgtc gaatacatgg gcggcgcaga tgccgcaatt 7500 aagcgcgcta aagatgatta cgctcaaggt gaataccgct ttgttgcaac ggcattaaat 7560 aaggtggtga tggccgagcc agaaaatgac tccgctcgtc aattgctagc cgatacctat 7620 gagcaacttg gttatcaagc agaaggggct ggctggagaa acatttactt aactggcgca 7680 caagagctac gagtaggtat tcaagctggc gcgcctaaaa ccgcatcggc agatgtcatc 7740 agtgaaatgg acatgccgac tctatttgac ttcctcgcgg tgaagattga tagtcaacag 7800 gcggctaagc acggcttagt taagatgaat gttatcaccc ctgatactaa agatattctc 7860 tatattgagc taagcaacgg taacttaagc aacgcagtgg tcgacaaaga gcaagcagct 7920 gacgcaaacc ttatggttaa taaagctgac gttaaccgca tcttacttgg ccaagtaacc 7980 ctaaaagcgt tattagccag cggcgatgcc aagctcactg gtgataaaac ggcatttagt 8040 aaaatagccg atagcatggt cgagtttaca cctgacttcg aaatcgtacc aacgcctgtt 8100 aaatgaggca ttaatctcaa caagtgcaag ctagacataa aaatggggcg attagacgcc 8160 ccatttttta tgcaattttg aactagctag tcttagctga agctcgaaca acagctttaa 8220 aattcacttc ttctgctgca atacttattt gctgacactg accaatactc agtgcaaaac 8280 gataactatc atcaagatgg cccagtaaac aatgccaatt atcagcagcg ttcatttgct 8340 gttctttagc ctcaatcaaa cctaaaccag acttttgtgg ctcagcgtta ggcttattag 8400 aactcgactc tagtaaagca agaccaatat cttgttttaa caaaacctgt cgctgattaa 8460 gttgatgctc aaccttgtga tccgcaatag catcggaaat atcaacacaa tggctcaagc 8520 ttttaggtgc attaactcca agaaaagttt cgctcagtgc agagaagtca aacgcaaaag 8580 attttagcga taatgccagc ccaagtcctt tcgctttaat gtaagactcc ttgagcgccc 8640 acaaatcaaa aaagcggtct cgctgcaagg cctctggtaa cgctaacaag gctcgctttt 8700 ctgattcaga gaaataatga ctaagaatag agtggatatt ggtgctgtta cggcaacgct 8760 caatgtcgac gccaaactca atactagcag agtcagtttc ctccttgctt gcctgactgg 8820 cgcctttatt atcagcagtg caaatgccta ctaatagcca atctccacta tgactcacat 8880 taaagtggac cccggtttga gcaaattgcg catcactcaa tctaggctta cctttgtcgc 8940 catattcaaa gcgccattca ttggggcgta tttcactatg ttgtgacaat aaagcgcgca 9000 aatagcctct taccattaaa ccttgagttt tagcttcttg tttaatgtag cgattaacct 9060 taattaactc atcttcaggc agccatgact taaccaactc tgtagtctgg ttatcgcact 9120 cttgtattgt taacggacag aagtataagg aaatcaatcg agaagttagc aatttttcag 9180 gacactcttt aaagcaacaa acataacccc tatttttacc aatttaagat caaaactaaa 9240 gccaaaacta attgagaata gtgtcaaact agctttaaag gaaaaaaata taaaaagaac 9300 attatacttg tataaattat tttacacacc aaagccatga tcttcacaaa attagctccc 9360 tctccctaaa acaagattga ataaaaaaat aaaccttaac tttcatatag ataaaacaaa 9420 ccaatgggat aaagtatatt gaattcattt ttaaggaaaa attcaaattg aattcaagct 9480 cttcagtaaa agcatatttt gccgttagtg tgaaaaaaaa caaatttaaa aaccaacata 9540 gaacaaataa gcagacaata aaaccaaggc gcaacacaaa caacgcgctt acaattttca 9600 caaaaaagca acaagagtaa cgtttagtat ttggatatgg ttattgtaat tgagaatttt 9660 ataacaatta tattaaggga atgagtatgt ttttaaattc aaaactttcg cgctcagtca 9720 aacttgccat atccgcaggc ttaacagcct cgctagctat gcctgttttt gcagaagaaa 9780 ctgctgctga agaacaaata gaaagagtcg cagtgaccgg atcgcgaatc gctaaagcag 9840 agctaactca accagctcca gtcgtcagcc tttcagccga agaactgaca aaatttggta 9900 atcaagattt aggtagcgta ctagcagaat tacctgctat tggtgcaacc aacactatta 9960 ttggtaataa caatagcaac tcaagcgcag gtgttagctc agcagacttg cgtcgtctag 10020 gtgctaacag aaccttagta ttagtcaacg gtaagcgcta cgttgccggc caaccgggct 10080 cagctgaggt agatttgtca actataccaa ctagcatgat ctcgcgagtt gagattgtaa 10140 ccggcggtgc ttcagcaatt tatggttcgg acgctgtatc aggtgttatc aacgttatcc 10200 ttaaagaaga ctttgaaggc tttgagttta acgcacgtac tagcggttct actgaaagtg 10260 taggcactca agagcactct tttgacattt tgggtggtgc aaacgttgca gatggacgtg 10320 gtaatgtaac cttctacgca ggttatgaac gtacaaaaga agtcatggct accgacattc 10380 gccaattcga tgcttgggga acaattaaaa acgaagccga tggtggtgaa gatgatggta 10440 ttccagacag actacgtgta ccacgagttt attctgaaat gattaatgct accggtgtta 10500 tcaatgcatt tggtggtgga attggtcgct caacctttga cagtaacggc aatcctattg 10560 cacaacaaga acgtgatggg actaacagct ttgcatttgg ttcattccct aatggctgtg 10620 acacatgttt caacactgaa gcatacgaaa actatattcc aggggtagaa agaataaacg 10680 ttggctcatc attcaacttt gattttaccg ataacattca attttacact gacttcagat 10740 atgtaaagtc agatattcag caacaatttc agccttcatt ccgttttggt aacattaata 10800 tcaatgttga agataacgcc tttttgaatg acgacttgcg tcagcaaatg ctcgatgcgg 10860 gtcaaaccaa tgctagtttt gccaagtttt ttgatgaatt aggaaatcgc tcagcagaaa 10920 ataaacgcga acttttccgt tacgtaggtg gctttaaagg tggctttgat attagcgaaa 10980 ccatatttga ttacgacctt tactatgttt atggcgagac taataaccgt cgtaaaaccc 11040 ttaatgacct aattcctgat aactttgtcg cagctgtcga ctctgttatt gatcctgata 11100 ctggcttagc agcgtgtcgc tcacaagtag caagcgctca aggcgatgac tatacagatc 11160 ccgcgtctgt aaatggtagc gactgtgttg cttataaccc atttggcatg ggtcaagctt 11220 cagcagaagc ccgcgactgg gtttctgctg atgtgactcg tgaagacaaa ataactcaac 11280 aagtgattgg tggtactctc ggtaccgatt ctgaagaact atttgagctt caaggtggtg 11340 caatcgctat ggttgttggt tttgaatacc gtgaagaaac gtctggttca acaaccgatg 11400 aatttactaa agcaggtttc ttgacaagcg ctgcaacgcc agattcttat ggcgaatacg 11460 acgtgactga gtattttgtt gaggtgaaca tcccagtact aaaagaatta ccttttgcac 11520 atgagttgag ctttgacggt gcataccgta atgctgatta ctcacatgcc ggtaagactg 11580 aagcatggaa agctggtatg ttctactcac cattagagca acttgcatta cgtggtacgg 11640 taggtgaagc agtacgagca ccaaacattg cagaagcctt tagtccacgc tctcctggtt 11700 ttggccgcgt ttcagatcca tgtgatgcag ataacattaa tgacgatccg gatcgcgtgt 11760 caaactgtgc agcattgggg atccctccag gattccaagc taatgataac gtcagtgtag 11820 ataccttatc tggtggtaac ccagatctaa aacctgaaac atcaacatcc tttacaggtg 11880 gtcttgtttg gacaccaacg tttgctgaca atctatcatt cactgtcgat tattatgata 11940 ttcaaattga ggatgctatt ttgtcagtag ccacccagac tgtggctgat aactgtgttg 12000 actcaactgg cggacctgac accgacttct gtagtcaagt tgatcgtaat ccaacgacct 12060 atgatattga acttgttcgc tctggttatc taaatgccgc ggcattgaat accaaaggta 12120 ttgaatttca agctgcatac tcattagatc tagagtcttt caacgcgcct ggtgaactac 12180 gcttcaacct attggggaac caattacttg aactagaacg tcttgaattc caaaatcgtc 12240 ctgatgagat taatgatgaa aaaggcgaag taggtgatcc agagctgcag ttccgcctag 12300 gcatcgatta ccgtctagat gatctaagtg ttagctggaa cacgcgttat attgatagcg 12360 tagtaactta tgatgtctct gaaaatggtg gctctcctga agatttatat ccaggccaca 12420 taggctcaat gacaactcat gacttgagcg ctacatacta catcaatgag aacttcatga 12480 ttaacggtgg tgtacgtaac ctatttgacg cacttccacc tggatacact aacgatgcgc 12540 tatatgatct agttggtcgc cgtgcattcc taggtattaa ggtaatgatg taattaatta 12600 ttacgcctct aactaataaa aatgcaatct cttcgtagag attgcatttt tttatgaaat 12660 ccaatcttaa actggttctc cgagcatctt acgccttaaa aaccccgccc ctcaatgtaa 12720 cgccaaagtt aattgcttac acgcacttac acaaacgaac aatttcatta acacgagaca 12780 cagctcacgc tttttatttt acccttgatt ttactacata aaattgcgtt ttagcgcaca 12840 agtgttctcc caagctggtc gtatctgtaa ttattcagtc ccaggtgatt gtattgaccc 12900 ataagctcag gtagtctgct ctgccattag ctaaacaata ttgacaaaat ggcgataaaa 12960 tgtggcttag cgctaagttc accgtaagtt ttatcggcat taagtcccaa cagattatta 13020 acggaaaccc gctaaactga tggcaaaaat aaatagtgaa cacttggatg aagctactat 13080 tacttcgaat aagtgtacgc aaacagagac tgaggctcgg catagaaatg ccactacaac 13140 acctgagatg cgccgattca tacaagagtc ggatctcagt gttagccaac tgtctaaaat 13200 attaaatatc agtgaagcta ccgtacgtaa gtggcgcaag cgtgactctg tcgaaaactg 13260 tcctaatacc ccgcaccatc tcaataccac gctaacccct ttgcaagaat atgtggttgt 13320 gggcctgcgt tatcaattga aaatgccatt agacagattg ctcaaagcaa cccaagagtt 13380 tatcaatcca aacgtgtcgc gctcaggttt agcaagatgt ttgaagcgtt atggcgtttc 13440 acgggtgagt gatatccaaa gcccacacgt accaatgcgc tactttaatc aaattccagt 13500 cactcaaggc agcgatgtgc aaacctacac cctgcactat gaaacgctgg caaaaacctt 13560 agccttacct agtaccgatg gtgacaatgt ggtgcaagtg gtgtctctca ccattccacc 13620 aaagttaacc gaagaagcac ccagttcaat tttgctcggc attgatcctc atagcgactg 13680 gatctatctc gacatatacc aagatggcaa tacacaagcc acgaatagat atatggctta 13740 tgtgctaaaa cacgggccat tccatttacg aaagttactc gtgcgtaact atcacacctt 13800 tttacagcgc tttcctggag cgacgcaaaa tcgccgcccc tctaaagata tgcctgaaac 13860 aatcaacaag acgcctgaaa cacaggcacc cagtggagac tcataatgag ccagacctct 13920 aaacctacaa actcagcaac tgagcaagca caagactcac aagctgactc tcgtttaaat 13980 aaacgactaa aagatatgcc aattgctatt gttggcatgg cgagtatttt tgcaaactct 14040 cgctatttga ataagttttg ggacttaatc agcgaaaaaa ttgatgcgat tactgaatta 14100 ccatcaactc actggcagcc tgaagaatat tacgacgcag ataaaaccgc agcagacaaa 14160 agctactgta aacgtggtgg ctttttgcca gatgtagact tcaacccaat ggagtttggc 14220 ctgccgccaa acattttgga actgaccgat tcatcgcaac tattatcact catcgttgct 14280 aaagaagtgt tggctgatgc taacttacct gagaattacg accgcgataa aattggtatc 14340 accttaggtg tcggcggtgg tcaaaaaatt agccacagcc taacagcgcg tctgcaatac 14400 ccagtattga agaaagtatt cgccaatagc ggcattagtg acaccgacag cgaaatgctt 14460 atcaagaaat tccaagacca atatgtacac tgggaagaaa actcgttccc aggttcactt 14520 ggtaacgtta ttgcgggccg tatcgccaac cgcttcgatt ttggcggcat gaactgtgtg 14580 gttgatgctg cctgtgctgg atcacttgct gctatgcgta tggcgctaac agagctaact 14640 gaaggtcgct ctgaaatgat gatcaccggt ggtgtgtgta ctgataactc accctctatg 14700 tatatgagct tttcaaaaac gcccgccttt accactaacg aaaccattca gccatttgat 14760 atcgactcaa aaggcatgat gattggtgaa ggtattggca tggtggcgct aaagcgtctt 14820 gaagatgcag agcgcgatgg cgaccgcatt tactctgtaa ttaaaggtgt gggtgcatca 14880 tctgacggta agtttaaatc aatctatgcc cctcgcccat caggccaagc taaagcactt 14940 aaccgtgcct atgatgacgc aggttttgcg ccgcatacct taggtctaat tgaagctcac 15000 ggaacaggta ctgcagcagg tgacgcggca gagtttgccg gcctttgctc agtatttgct 15060 gaaggcaacg ataccaagca acacattgcg ctaggttcag ttaaatcaca aattggtcat 15120 actaaatcaa ctgcaggtac agcaggttta attaaagctg ctcttgcttt gcatcacaag 15180 gtactgccgc cgaccattaa cgttagtcag ccaagcccta aacttgatat cgaaaactca 15240 ccgttttatc taaacactga gactcgtcca tggttaccac gtgttgatgg tacgccgcgc 15300 cgcgcgggta ttagctcatt tggttttggt ggcactaact tccattttgt actagaagag 15360 tacaaccaag aacacagccg tactgatagc gaaaaagcta agtatcgtca acgccaagtg 15420 gcgcaaagct tccttgttag cgcaagcgat aaagcatcgc taattaacga gttaaacgta 15480 ctagcagcat ctgcaagcca agctgagttt atcctcaaag atgcagcagc aaactatggc 15540 gtacgtgagc ttgataaaaa tgcaccacgg atcggtttag ttgcaaacac agctgaagag 15600 ttagcaggcc taattaagca agcacttgcc aaactagcag ctagcgatga taacgcatgg 15660 cagctacctg gtggcactag ctaccgcgcc gctgcagtag aaggtaaagt tgccgcactg 15720 tttgctggcc aaggttcaca atatctcaat atgggccgtg accttacttg ttattaccca 15780 gagatgcgtc agcaatttgt aactgcagat aaagtatttg ccgcaaatga taaaacgccg 15840 ttatcgcaaa ctctgtatcc aaagcctgta tttaataaag atgaattaaa ggctcaagaa 15900 gccattttga ccaataccgc caatgcccaa agcgcaattg gtgcgatttc aatgggtcaa 15960 tacgatttgt ttactgcggc tggctttaat gccgacatgg ttgcaggcca tagctttggt 16020 gagctaagtg cactgtgtgc tgcaggtgtt atttcagctg atgactacta caagctggct 16080 tttgctcgtg gtgaggctat ggcaacaaaa gcaccggcta aagacggcgt tgaagcagat 16140 gcaggagcaa tgtttgcaat cataaccaag agtgctgcag accttgaaac cgttgaagcc 16200 accatcgcta aatttgatgg ggtgaaagtc gctaactata acgcgccaac gcaatcagta 16260 attgcaggcc caacagcaac taccgctgat gcggctaaag cgctaactga gcttggttac 16320 aaagcgatta acctgccagt atcaggtgca ttccacactg aacttgttgg tcacgctcaa 16380 gcgccatttg ctaaagcgat tgacgcagcc aaatttacta aaacaagccg agcactttac 16440 tcaaatgcaa ctggcggact ttatgaaagc actgctgcaa agattaaagc ctcgtttaag 16500 aaacatatgc ttcaatcagt gcgctttact agccagctag aagccatgta caacgacggc 16560 gcccgtgtat ttgttgaatt tggtccaaag aacatcttac aaaaattagt tcaaggcacg 16620 cttgtcaaca ctgaaaatga agtttgcact atctctatca accctaatcc taaagttgat 16680 agtgatctgc agcttaagca agcagcaatg cagctagcgg ttactggtgt ggtactcagt 16740 gaaattgacc cataccaagc cgatattgcc gcaccagcga aaaagtcgcc aatgagcatt 16800 tcgcttaatg ctgctaacca tatcagcaaa gcaactcgcg ctaagatggc caagtcttta 16860 gagacaggta tcgtcacctc gcaaatagaa catgttattg aagaaaaaat cgttgaagtt 16920 gagaaactgg ttgaagtcga aaagatcgtc gaaaaagtgg ttgaagtaga gaaagttgtt 16980 gaggttgaag ctcctgttaa ttcagtgcaa gccaatgcaa ttcaaacccg ttcagttgtc 17040 gctccagtaa tagagaacca agtcgtgtct aaaaacagta agccagcagt ccagagcatt 17100 agtggtgatg cactcagcaa cttttttgct gcacagcagc aaaccgcaca gttgcatcag 17160 cagttcttag ctattccgca gcaatatggt gagacgttca ctacgctgat gaccgagcaa 17220 gctaaactgg caagttctgg tgttgcaatt ccagagagtc tgcaacgctc aatggagcaa 17280 ttccaccaac tacaagcgca aacactacaa agccacaccc agttccttga gatgcaagcg 17340 ggtagcaaca ttgcagcgtt aaacctactc aatagcagcc aagcaactta cgctccagcc 17400 attcacaatg aagcgattca aagccaagtg gttcaaagcc aaactgcagt ccagccagta 17460 atttcaacac aagttaacca tgtgtcagag cagccaactc aagctccagc tccaaaagcg 17520 cagccagcac ctgtgacaac tccagttcaa actgctccgg cacaagttgt tcgtcaagcc 17580 gcaccagttc aagccgctat tgaaccgatt aatacaagtg ttgcgactac aacgccttca 17640 gccttcagcg ccgaaacagc cctgagcgca acaaaagtcc aagccactat gcttgaagtg 17700 gttgctgaga aaaccggtta cccaactgaa atgctagagc ttgaaatgga tatggaagcc 17760 gatttaggca tcgattctat caagcgtgta gaaattcttg gcacagtaca agatgagcta 17820 ccgggtctac ctgagcttag ccctgaagat ctagctgagt gtcgaacgct aggcgaaatc 17880 gttgactata tgggcagtaa actgccggct gaaggctcta tgaattctca gctgtctaca 17940 ggttccgcag ctgcgactcc tgcagcgaat ggtctttctg cggagaaagt tcaagcgact 18000 atgatgtctg tggttgccga aaagactggc tacccaactg aaatgctaga gcttgaaatg 18060 gatatggaag ccgatttagg catagattct atcaagcgcg ttgaaattct tggcacagta 18120 caagatgagc taccgggtct acctgagctt agccctgaag atctagctga gtgtcgtact 18180 ctaggcgaaa tcgttgacta tatgaactct aaactcgctg acggctctaa gctgccggct 18240 gaaggctcta tgaattctca gctgtctaca agtgccgcag ctgcgactcc tgcagcgaat 18300 ggtctctctg cggagaaagt tcaagcgact atgatgtctg tggttgccga aaagactggc 18360 tacccaactg aaatgctaga acttgaaatg gatatggaag ctgaccttgg catcgattca 18420 atcaagcgcg ttgaaattct tggcacagta caagatgagc taccgggttt acctgagcta 18480 aatccagaag atttggcaga gtgtcgtact cttggcgaaa tcgtgactta tatgaactct 18540 aaactcgctg acggctctaa gctgccagct gaaggctcta tgcactatca gctgtctaca 18600 agtaccgctg ctgcgactcc tgtagcgaat ggtctctctg cagaaaaagt tcaagcgacc 18660 atgatgtctg tagttgcaga taaaactggc tacccaactg aaatgcttga acttgaaatg 18720 gatatggaag ccgatttagg tatcgattct atcaagcgcg ttgaaattct tggcacagta 18780 caagatgagc taccgggttt acctgagcta aatccagaag atctagcaga gtgtcgcacc 18840 ctaggcgaaa tcgttgacta tatgggcagt aaactgccgg ctgaaggctc tgctaataca 18900 agtgccgctg cgtctcttaa tgttagtgcc gttgcggcgc ctcaagctgc tgcgactcct 18960 gtatcgaacg gtctctctgc agagaaagtg caaagcacta tgatgtcagt agttgcagaa 19020 aagaccggct acccaactga aatgctagaa cttggcatgg atatggaagc cgatttaggt 19080 atcgactcaa ttaaacgcgt tgagattctt ggcacagtac aagatgagct accgggtcta 19140 ccagagctta atcctgaaga tttagctgag tgccgtacgc tgggcgaaat cgttgactat 19200 atgaactcta agctggctga cggctctaag cttccagctg aaggctctgc taatacaagt 19260 gccactgctg cgactcctgc agtgaatggt ctttctgctg acaaggtaca ggcgactatg 19320 atgtctgtag ttgctgaaaa gaccggctac ccaactgaaa tgctagaact tggcatggat 19380 atggaagcag accttggtat tgattctatt aagcgcgttg aaattcttgg cacagtacaa 19440 gatgagctcc caggtttacc tgagcttaat cctgaagatc tcgctgagtg ccgcacgctt 19500 ggcgaaatcg ttagctatat gaactctcaa ctggctgatg gctctaaact ttctacaagt 19560 gcggctgaag gctctgctga tacaagtgct gcaaatgctg caaagccggc agcaatttcg 19620 gcagaaccaa gtgttgagct tcctcctcat agcgaggtag cgctaaaaaa gcttaatgcg 19680 gcgaacaagc tagaaaattg tttcgccgca gacgcaagtg ttgtgattaa cgatgatggt 19740 cacaacgcag gcgttttagc tgagaaactt attaaacaag gcctaaaagt agccgttgtg 19800 cgtttaccga aaggtcagcc tcaatcgcca ctttcaagcg atgttgctag ctttgagctt 19860 gcctcaagcc aagaatctga gcttgaagcc agtatcactg cagttatcgc gcagattgaa 19920 actcaggttg gcgctattgg tggctttatt cacttgcaac cagaagcgaa tacagaagag 19980 caaacggcag taaacctaga tgcgcaaagt tttactcacg ttagcaatgc gttcttgtgg 20040 gccaaattat tgcaaccaaa gctcgttgct ggagcagatg cgcgtcgctg ttttgtaaca 20100 gtaagccgta tcgacggtgg ctttggttac ctaaatactg acgccctaaa agatgctgag 20160 ctaaaccaag cagcattagc tggtttaact aaaaccttaa gccatgaatg gccacaagtg 20220 ttctgtcgcg cgctagatat tgcaacagat gttgatgcaa cccatcttgc tgatgcaatc 20280 accagtgaac tatttgatag ccaagctcag ctacctgaag tgggcttaag cttaattgat 20340 ggcaaagtta accgcgtaac tctagttgct gctgaagctg cagataaaac agcaaaagca 20400 gagcttaaca gcacagataa aatcttagtg actggtgggg caaaaggggt gacatttgaa 20460 tgtgcactgg cattagcatc tcgcagccag tctcacttta tcttagctgg gcgcagtgaa 20520 ttacaagctt taccaagctg ggctgagggt aagcaaacta gcgagctaaa atcagctgca 20580 atcgcacata ttatttctac tggtcaaaag ccaacgccta agcaagttga agccgctgtg 20640 tggccagtgc aaagcagcat tgaaattaat gccgccctag ccgcctttaa caaagttggc 20700 gcctcagctg aatacgtcag catggatgtt accgatagcg ccgcaatcac agcagcactt 20760 aatggtcgct caaatgagat caccggtctt attcatggcg caggtgtact agccgacaag 20820 catattcaag acaagactct tgctgaactt gctaaagttt atggcactaa agtcaacggc 20880 ctaaaagcgc tgctcgcggc acttgagcca agcaaaatta aattacttgc tatgttctca 20940 tctgcagcag gtttttacgg taatatcggc caaagcgatt acgcgatgtc gaacgatatt 21000 cttaacaagg cagcgctgca gttcaccgct cgcaacccac aagctaaagt catgagcttt 21060 aactggggtc cttgggatgg cggcatggtt aacccagcgc ttaaaaagat gtttaccgag 21120 cgtggtgtgt acgttattcc actaaaagca ggtgcagagc tatttgccac tcagctattg 21180 gctgaaactg gcgtgcagtt gctcattggt acgtcaatgc aaggtggcag cgacactaaa 21240 gcaactgaga ctgcttctgt aaaaaagctt aatgcgggtg aggtgctaag tgcatcgcat 21300 ccgcgtgctg gtgcacaaaa aacaccacta caagctgtca ctgcaacgcg tctgttaacc 21360 ccaagtgcca tggtcttcat tgaagatcac cgcattggcg gtaacagtgt gttgccaacg 21420 gtatgcgcca tcgactggat gcgtgaagcg gcaagcgaca tgcttggcgc tcaagttaag 21480 gtacttgatt acaagctatt aaaaggcatt gtatttgaga ctgatgagcc gcaagagtta 21540 acacttgagc taacgccaga cgattcagac gaagctacgc tacaagcatt aatcagctgt 21600 aatgggcgtc cgcaatacaa ggcgacgctt atcagtgata atgccgatat taagcaactt 21660 aacaagcagt ttgatttaag cgctaaggcg attaccacag caaaagagct ttatagcaac 21720 ggcaccttgt tccacggtcc gcgtctacaa gggatccaat ctgtagtgca gttcgatgat 21780 caaggcttaa ttgctaaagt cgctctgcct aaggttgaac ttagcgattg tggtgagttc 21840 ttgccgcaaa cccacatggg tggcagtcaa ccttttgctg aggacttgct attacaagct 21900 atgctggttt gggctcgcct taaaactggc tcggcaagtt tgccatcaag cattggtgag 21960 tttacctcat accaaccaat ggcctttggt gaaactggta ccatagagct tgaagtgatt 22020 aagcacaaca aacgctcact tgaagcgaat gttgcgctat atcgtgacaa cggcgagtta 22080 agtgccatgt ttaagtcagc taaaatcacc attagcaaaa gcttaaattc agcattttta 22140 cctgctgtct tagcaaacga cagtgaggcg aattagtgga acaaacgcct aaagctagtg 22200 cgatgccgct gcgcatcgca cttatcttac tgccaacacc gcagtttgaa gttaactctg 22260 tcgaccagtc agtattagcc agctatcaaa cactgcagcc tgagctaaat gccctgctta 22320 atagtgcgcc gacacctgaa atgctcagca tcactatctc agatgatagc gatgcaaaca 22380 gctttgagtc gcagctaaat gctgcgacca acgcaattaa caatggctat atcgtcaagc 22440 ttgctacggc aactcacgct ttgttaatgc tgcctgcatt aaaagcggcg caaatgcgga 22500 tccatcctca tgcgcagctt gccgctatgc agcaagctaa atcgacgcca atgagtcaag 22560 tatctggtga gctaaagctt ggcgctaatg cgctaagcct agctcagact aatgcgctgt 22620 ctcatgcttt aagccaagcc aagcgtaact taactgatgt cagcgtgaat gagtgttttg 22680 agaacctcaa aagtgaacag cagttcacag aggtttattc gcttattcag caacttgcta 22740 gccgcaccca tgtgagaaaa gaggttaatc aaggtgtgga acttggccct aaacaagcca 22800 aaagccacta ttggtttagc gaatttcacc aaaaccgtgt tgctgccatc aactttatta 22860 atggccaaca agcaaccagc tatgtgctta ctcaaggttc aggattgtta gctgcgaaat 22920 caatgctaaa ccagcaaaga ttaatgttta tcttgccggg taacagtcag caacaaataa 22980 ccgcatcaat aactcagtta atgcagcaat tagagcgttt gcaggtaact gaggttaatg 23040 agctttctct agaatgccaa ctagagctgc tcagcataat gtatgacaac ttagtcaacg 23100 cagacaaact cactactcgc gatagtaagc ccgcttatca ggctgtgatt caagcaagct 23160 ctgttagcgc tgcaaagcaa gagttaagcg cgcttaacga tgcactcaca gcgctgtttg 23220 ctgagcaaac aaacgccaca tcaacgaata aaggcttaat ccaatacaaa acaccggcgg 23280 gcagttactt aaccctaaca ccgcttggca gcaacaatga caacgcccaa gcgggtcttg 23340 cttttgtcta tccgggtgtg ggaacggttt acgccgatat gcttaatgag ctgcatcagt 23400 acttccctgc gctttacgcc aaacttgagc gtgaaggcga tttaaaggcg atgctacaag 23460 cagaagatat ctatcatctt gaccctaaac atgctgccca aatgagctta ggtgacttag 23520 ccattgctgg cgtggggagc agctacctgt taactcagct gctcaccgat gagtttaata 23580 ttaagcctaa ttttgcatta ggttactcaa tgggtgaagc atcaatgtgg gcaagcttag 23640 gcgtatggca aaacccgcat gcgctgatca gcaaaaccca aaccgacccg ctatttactt 23700 ctgctatttc cggcaaattg accgcggtta gacaagcttg gcagcttgat gataccgcag 23760 cggaaatcca gtggaatagc tttgtggtta gaagtgaagc agcgccgatt gaagccttgc 23820 taaaagatta cccacacgct tacctcgcga ttattcaagg ggatacctgc gtaatcgctg 23880 gctgtgaaat ccaatgtaaa gcgctacttg cagcactggg taaacgcggt attgcagcta 23940 atcgtgtaac ggcgatgcat acgcagcctg cgatgcaaga gcatcaaaat gtgatggatt 24000 tttatctgca accgttaaaa gcagagcttc ctagtgaaat aagctttatc agcgccgctg 24060 atttaactgc caagcaaacg gtgagtgagc aagcacttag cagccaagtc gttgctcagt 24120 ctattgccga caccttctgc caaaccttgg actttaccgc gctagtacat cacgcccaac 24180 atcaaggcgc taagctgttt gttgaaattg gcgcggatag acaaaactgc accttgatag 24240 acaagattgt taaacaagat ggtgccagca gtgtacaaca tcaaccttgt tgcacagtgc 24300 ctatgaacgc aaaaggtagc caagatatta ccagcgtgat taaagcgctt ggccaattaa 24360 ttagccatca ggtgccatta tcggtgcaac catttattga tggactcaag cgcgagctaa 24420 cactttgcca attgaccagc caacagctgg cagcacatgc aaatgttgac agcaagtttg 24480 agtctaacca agaccattta cttcaagggg aagtctaatg tcattaccag acaatgcttc 24540 taaccacctt tctgccaacc agaaaggcgc atctcaggca agtaaaacca gtaagcaaag 24600 caaaatcgcc attgtcggtt tagccactct gtatccagac gctaaaaccc cgcaagaatt 24660 ttggcagaat ttgctggata aacgcgactc tcgcagcacc ttaactaacg aaaaactcgg 24720 cgctaacagc caagattatc aaggtgtgca aggccaatct gaccgttttt attgtaataa 24780 aggcggctac attgagaact tcagctttaa tgctgcaggc tacaaattgc cggagcaaag 24840 cttaaatggc ttggacgaca gcttcctttg ggcgctcgat actagccgta acgcactaat 24900 tgatgctggt attgatatca acggcgctga tttaagccgc gcaggtgtag tcatgggcgc 24960 gctgtcgttc ccaactaccc gctcaaacga tctgtttttg ccaatttatc acagcgccgt 25020 tgaaaaagcc ctgcaagata aactaggcgt aaaggcattt aagctaagcc caactaatgc 25080 tcataccgct cgcgcggcaa atgagagcag cctaaatgca gccaatggtg ccattgccca 25140 taacagctca aaagtggtgg ccgatgcact tggccttggc ggcgcacaac taagcctaga 25200 tgctgcctgt gctagttcgg tttactcatt aaagcttgcc tgcgattacc taagcactgg 25260 caaagccgat atcatgctag caggcgcagt atctggcgcg gatcctttct ttattaatat 25320 gggattctca atcttccacg cctacccaga ccatggtatc tcagtaccgt ttgatgccag 25380 cagtaaaggt ttgtttgctg gcgaaggcgc tggcgtatta gtgcttaaac gtcttgaaga 25440 tgccgagcgc gacaatgaca aaatctatgc ggttgttagc ggcgtaggtc tatcaaacga 25500 cggtaaaggc cagtttgtat taagccctaa tccaaaaggt caggtgaagg cctttgaacg 25560 tgcttatgct gccagtgaca ttgagccaaa agacattgaa gtgattgagt gccacgcaac 25620 aggcacaccg cttggcgata aaattgagct cacttcaatg gaaaccttct ttgaagacaa 25680 gctgcaaggc accgatgcac cgttaattgg ctcagctaag tctaacttag gccacctatt 25740 aactgcagcg ggcatgccgg ggatcatgaa gatgatcttc gccatgaaag aaggttacct 25800 gccgccaagt atcaatatta gtgatgctat cgcttcgccg aaaaaactct tcggtaaacc 25860 aaccctgcct agcatggttc aaggctggcc agataagcca tcgaataatc attttggtgt 25920 aagaacccgt cacgcaggcg tatcggtatt tggctttggt ggctgtaacg cccatctgtt 25980 gcttgagtca tacaacggca aaggaacagt aaaggcagaa gccactcaag taccgcgtca 26040 agctgagccg ctaaaagtgg ttggccttgc ctcgcacttt gggcctctta gcagcattaa 26100 tgcactcaac aatgctgtga cccaagatgg gaatggcttt atcgaactgc cgaaaaagcg 26160 ctggaaaggc cttgaaaagc acagtgaact gttagctgaa tttggcttag catctgcgcc 26220 aaaaggtgct tatgttgata acttcgagct ggacttttta cgctttaaac tgccgccaaa 26280 cgaagatgac cgtttgatct cacagcagct aatgctaatg cgagtaacag acgaagccat 26340 tcgtgatgcc aagcttgagc cggggcaaaa agtagctgta ttagtggcaa tggaaactga 26400 gcttgaactg catcagttcc gcggccgggt taacttgcat actcaattag cgcaaagtct 26460 tgccgccatg ggcgtgagtt tatcaacgga tgaataccaa gcgcttgaag ccatcgccat 26520 ggacagcgtg cttgatgctg ccaagctcaa tcagtacacc agctttattg gtaatattat 26580 ggcgtcacgc gtggcgtcac tatgggactt taatggccca gccttcacta tttcagcagc 26640 agagcaatct gtgagccgct gtatcgatgt ggcgcaaaac ctcatcatgg aggataacct 26700 agatgcggtg gtgattgcag cggtcgatct ctctggtagc tttgagcaag tcattcttaa 26760 aaatgccatt gcacctgtag ccattgagcc aaacctcgaa gcaagcctta atccaacatc 26820 agcaagctgg aatgtcggtg aaggtgctgg cgcggtcgtg cttgttaaaa atgaagctac 26880 atcgggctgc tcatacggcc aaattgatgc acttggcttt gctaaaactg ccgaaacagc 26940 gttggctacc gacaagctac tgagccaaac tgccacagac tttaataagg ttaaagtgat 27000 tgaaactatg gcagcgcctg ctagccaaat tcaattagcg ccaatagtta gctctcaagt 27060 gactcacact gctgcagagc agcgtgttgg tcactgcttt gctgcagcgg gtatggcaag 27120 cctattacac ggcttactta acttaaatac tgtagcccaa accaataaag ccaattgcgc 27180 gcttatcaac aatatcagtg aaaaccaatt atcacagctg ttgattagcc aaacagcgag 27240 cgaacaacaa gcattaaccg cgcgtttaag caatgagctt aaatccgatg ctaaacacca 27300 actggttaag caagtcacct taggtggccg tgatatctac cagcatattg ttgatacacc 27360 gcttgcaagc cttgaaagca ttactcagaa attggcgcaa gcgacagcat cgacagtggt 27420 caaccaagtt aaacctatta aggccgctgg ctcagtcgaa atggctaact cattcgaaac 27480 ggaaagctca gcagagccac aaataacaat tgcagcacaa cagactgcaa acattggcgt 27540 caccgctcag gcaaccaaac gtgaattagg taccccacca atgacaacaa ataccattgc 27600 taatacagca aataatttag acaagactct tgagactgtt gctggcaata ctgttgctag 27660 caaggttggc tctggcgaca tagtcaattt tcaacagaac caacaattgg ctcaacaagc 27720 tcacctcgcc tttcttgaaa gccgcagtgc gggtatgaag gtggctgatg ctttattgaa 27780 gcaacagcta gctcaagtaa caggccaaac tatcgataat caggccctcg atactcaagc 27840 cgtcgatact caaacaagcg agaatgtagc gattgccgca gaatcaccag ttcaagttac 27900 aacacctgtt caagttacaa cacctgttca aatcagtgtt gtggagttaa aaccagatca 27960 cgctaatgtg ccaccataca cgccgccagt gcctgcatta aagccgtgta tctggaacta 28020 tgccgattta gttgagtacg cagaaggcga tatcgccaag gtatttggca gtgattatgc 28080 cattatcgac agctactcgc gccgcgtacg tctaccgacc actgactacc tgttggtatc 28140 gcgcgtgacc aaacttgatg cgaccatcaa tcaatttaag ccatgctcaa tgaccactga 28200 gtacgacatc cctgttgatg cgccgtactt agtagacgga caaatccctt gggcggtagc 28260 agtagaatca ggccaatgtg acttgatgct tattagctat ctcggtatcg actttgagaa 28320 caaaggcgag cgggtttatc gactactcga ttgtaccctc accttcctag gcgacttgcc 28380 acgtggcgga gataccctac gttacgacat taagatcaat aactatgctc gcaacggcga 28440 caccctgctg ttcttcttct cgtatgagtg ttttgttggc gacaagatga tcctcaagat 28500 ggatggcggc tgcgctggct tcttcactga tgaagagctt gccgacggta aaggcgtgat 28560 tcgcacagaa gaagagatta aagctcgcag cctagtgcaa aagcaacgct ttaatccgtt 28620 actagattgt cctaaaaccc aatttagtta tggtgatatt cataagctat taactgctga 28680 tattgagggt tgttttggcc caagccacag tggcgtccac cagccgtcac tttgtttcgc 28740 atctgaaaaa ttcttgatga ttgaacaagt cagcaaggtt gatcgcactg gcggtacttg 28800 gggacttggc ttaattgagg gtcataagca gcttgaagca gaccactggt acttcccatg 28860 tcatttcaag ggcgaccaag tgatggctgg ctcgctaatg gctgaaggtt gtggccagtt 28920 attgcagttc tatatgctgc accttggtat gcatacccaa actaaaaatg gtcgtttcca 28980 acctcttgaa aacgcctcac agcaagtacg ctgtcgcggt caagtgctgc cacaatcagg 29040 cgtgctaact taccgtatgg aagtgactga aatcggtttc agtccacgcc catatgctaa 29100 agctaacatc gatatcttgc ttaatggcaa agcggtagtg gatttccaaa acctaggggt 29160 gatgataaaa gaggaagatg agtgtactcg ttatccactt ttgactgaat caacaacggc 29220 tagcactgca caagtaaacg ctcaaacaag tgcgaaaaag gtatacaagc cagcatcagt 29280 caatgcgcca ttaatggcac aaattcctga tctgactaaa gagccaaaca agggcgttat 29340 tccgatttcc catgttgaag caccaattac gccagactac ccgaaccgtg tacctgatac 29400 agtgccattc acgccgtatc acatgtttga gtttgctaca ggcaatatcg aaaactgttt 29460 cgggccagag ttctcaatct atcgcggcat gatcccacca cgtacaccat gcggtgactt 29520 acaagtgacc acacgtgtga ttgaagttaa cggtaagcgt ggcgacttta aaaagccatc 29580 atcgtgtatc gctgaatatg aagtgcctgc agatgcgtgg tatttcgata aaaacagcca 29640 cggcgcagtg atgccatatt caattttaat ggagatctca ctgcaaccta acggctttat 29700 ctcaggttac atgggcacaa ccctaggctt ccctggcctt gagctgttct tccgtaactt 29760 agacggtagc ggtgagttac tacgtgaagt agatttacgt ggtaaaacca tccgtaacga 29820 ctcacgttta ttatcaacag tgatggccgg cactaacatc atccaaagct ttagcttcga 29880 gctaagcact gacggtgagc ctttctatcg cggcactgcg gtatttggct attttaaagg 29940 tgacgcactt aaagatcagc taggcctaga taacggtaaa gtcactcagc catggcatgt 30000 agctaacggc gttgctgcaa gcactaaggt gaacctgctt gataagagct gccgtcactt 30060 taatgcgcca gctaaccagc cacactatcg tctagccggt ggtcagctga actttatcga 30120 cagtgttgaa attgttgata atggcggcac cgaaggttta ggttacttgt atgccgagcg 30180 caccattgac ccaagtgatt ggttcttcca gttccacttc caccaagatc cggttatgcc 30240 aggctcctta ggtgttgaag caattattga aaccatgcaa gcttacgcta ttagtaaaga 30300 cttgggcgca gatttcaaaa atcctaagtt tggtcagatt ttatcgaaca tcaagtggaa 30360 gtatcgcggt caaatcaatc cgctgaacaa gcagatgtct atggatgtca gcattacttc 30420 aatcaaagat gaagacggta agaaagtcat cacaggtaat gccagcttga gtaaagatgg 30480 tctgcgcata tacgaggtct tcgatatagc tatcagcatc gaagaatctg tataaatcgg 30540 agtgactgtc tggctatttt actcaatttc tgtgtcaaaa gtgctcacct atattcatag 30600 gctgcgcgct tttttctgga aattgagcaa aagtatctgc gtcctaactc gatttataag 30660 aatggtttaa ttgaaaagaa caacagctaa gagccgcaag ctcaatataa ataattaagg 30720 gtcttacaaa taatgaatcc tacagcaact aacgaaatgc tttctccgtg gccatgggct 30780 gtgacagagt caaatatcag ttttgacgtg caagtgatgg aacaacaact taaagatttt 30840 agccgggcat gttacgtggt caatcatgcc gaccacggct ttggtattgc gcaaactgcc 30900 gatatcgtga ctgaacaagc ggcaaacagc acagatttac ctgttagtgc ttttactcct 30960 gcattaggta ccgaaagcct aggcgacaat aatttccgcc gcgttcacgg cgttaaatac 31020 gcttattacg caggcgctat ggcaaacggt atttcatctg aagagctagt gattgcccta 31080 ggtcaagctg gcattttgtg ttcgtttgga gcagccggtc ttattccaag tcgcgttgaa 31140 gcggcaatta accgtattca agcagcgctg ccaaatggcc cttatatgtt taaccttatc 31200 catagtccta gcgagccagc attagagcgt ggcagcgtag agctattttt aaagcataag 31260 gtacgcaccg ttgaagcatc agctttctta ggtctaacac cacaaatcgt ctattaccgt 31320 gcagcaggat tgagccgaga cgcacaaggt aaagttgtgg ttggtaacaa ggttatcgct 31380 aaagtaagtc gcaccgaagt ggctgaaaag tttatgatgc cagcgcccgc aaaaatgcta 31440 caaaaactag ttgatgacgg ttcaattacc gctgagcaaa tggagctggc gcaacttgta 31500 cctatggctg acgacatcac tgcagaggcc gattcaggtg gccatactga taaccgtcca 31560 ttagtaacat tgctgccaac cattttagcg ctgaaagaag aaattcaagc taaataccaa 31620 tacgacactc ctattcgtgt cggttgtggt ggcggtgtgg gtacgcctga tgcagcgctg 31680 gcaacgttta acatgggcgc ggcgtatatt gttaccggct ctatcaacca agcttgtgtt 31740 gaagcgggcg caagtgatca cactcgtaaa ttacttgcca ccactgaaat ggccgatgtg 31800 actatggcac cagctgcaga tatgttcgag atgggcgtaa aactgcaggt ggttaagcgc 31860 ggcacgctat tcccaatgcg cgctaacaag ctatatgaga tctacacccg ttacgattca 31920 atcgaagcga tcccattaga cgagcgtgaa aagcttgaga aacaagtatt ccgctcaagc 31980 ctagatgaaa tatgggcagg tacagtggcg cactttaacg agcgcgaccc taagcaaatc 32040 gaacgcgcag agggtaaccc taagcgtaaa atggcattga ttttccgttg gtacttaggt 32100 ctttctagtc gctggtcaaa ctcaggcgaa gtgggtcgtg aaatggatta tcaaatttgg 32160 gctggccctg ctctcggtgc atttaaccaa tgggcaaaag gcagttactt agataactat 32220 caagaccgaa atgccgtcga tttggcaaag cacttaatgt acggcgcggc ttacttaaat 32280 cgtattaact cgctaacggc tcaaggcgtt aaagtgccag cacagttact tcgctggaag 32340 ccaaaccaaa gaatggccta atacacttac aaagcaccag tctaaaaagc cactaatctt 32400 gattagtggc tttttttatt gtggtcaata tgaggctatt tagcctgtaa gcctgaaaat 32460 atcagcactc tgactttaca agcaaattat aattaaggca gggctctact catttatact 32520 gctagcaaac aagcaagttg cccagtaaaa caacaaggta cctgatttat atcgtcataa 32580 aagttggcta gagattcgtt attgatcttt actgattaga gtcgctctgt ttggaaaaag 32640 gtttctcgtt atcatcaaaa tacactctca aacctttaat caattacaac ttaggctttc 32700 tgcgggcatt tttatcttat ttgccacagc tgtatttgcc tttaggtttt gggtgcaact 32760 accattaatt gaggcctcat tagttaaatt atctgagcaa gagctcacct ctttaaatta 32820 cgcttttcag caaatgagaa agccactaca aaccattaat tacgactatg cggtgtggga 32880 cagaacctac agctatatga aatcaaactc agcgagcgct aaaaggtact atgaaaaaca 32940 tgagtaccca gatgatacgt tcaagagttt aaaagtcgac ggagtattta tattcaaccg 33000 tacaaatcag ccagttttta gtaaaggttt taatcataga aatgatatac cgctggtctt 33060 tgaattaact gactttaaac aacatccaca aaacatcgca ttatctccac aaaccaaaca 33120 ggcacaccca ccggcaagta agccgttaga ctcccctgat gatgtgcctt ctacccatgg 33180 ggttatcgcc acacgatacg gtccagcaat ttatagctct accagcattt taaaatctga 33240 tcgtagcggc tcccaacttg gttatttagt cttcattagg ttaattgatg aatggttcat 33300 cgctgagcta tcgcaataca ctgccgcagg tgttgaaatc gctatggctg atgccgcaga 33360 cgcacaatta gcgagattag gcgcaaacac taagcttaat aaagtaaccg ctacatccga 33420 acggttaata actaatgtcg atggtaagcc tctgttgaag ttagtgcttt accataccaa 33480 taaccaaccg ccgccgatgc tagattacag tataataatt ctattagttg agatgtcatt 33540 tttactgatc ctcgcttatt tcctttactc ctacttctta gtcaggccag ttagaaagct 33600 ggcttcagat attaaaaaaa tggataaaag tcgtgaaatt aaaaagctaa ggtatcacta 33660 ccctattact gagctagtca aagttgcgac tcacttcaac gccctaatgg ggacgattca 33720 ggaacaaact aaacagctta atgaacaagt ttttattgat aaattaacca atattcccaa 33780 tcgtcgcgct tttgagcagc gacttgaaac ctattgccaa ctgctagccc ggcaacaaat 33840 tggctttact ctcatcattg ccgatgtgga tcattttaaa gagtacaacg atactcttgg 33900 gcaccttgct ggggatgaag cattaataaa agtggcacaa acactatcgc aacagtttta 33960 ccgtgcagaa gatatttgtg cccgttttgg tggtgaagaa tttattatgt tatttcgaga 34020 catacctgat gagcccttgc agagaaagct cgatgcgatg ctgcactctt ttgcagagct 34080 caacctacct catccaaact catcaaccgc taattacgtt actgtgagcc ttggggtttg 34140 cacagttgtt gctgttgatg attttgaatt taaaagtgag tcgcatatta ttggcagtca 34200 ggctgcatta atcgcagata aggcgcttta tcatgctaaa gcctgtggtc gtaaccagtt 34260 gtcaaaaact actattactg ttgatgagat tgagcaatta gaagcaaata aaatcggtca 34320 tcaagcctaa actcgttcga gtactttccc ctaagtcaga gctatttgcc acttcaagat 34380 gtggctacaa ggcttactct ttcaaaacct gcatcaatag aacacagcaa aatacaataa 34440 tttaagtcaa tttagcctat taaacagagt taatgacagc tcatggtcgc aacttattag 34500 ctatttctag caatataaaa acttatccat tagtagtaac caataaaaaa actaatatat 34560 aaaactattt aatcattatt ttacagatga ttagctacca cccaccttaa gctggctata 34620 ttcgcactag taaaaataaa cattagatcg ggttcagatc aatttacgag tctcgtataa 34680 aatgtacaat aattcactta atttaatact gcatattttt acaagtagag agcggtgatg 34740 aaacaaaata cgaaaggctt tacattaatt gaattagtca tcgtgattat tattctcggt 34800 atacttgctg ctgtggcact gccgaaattc atcaatgttc aagatgacgc taggatctct 34860 gcgatgagcg gtcagttttc atcatttgaa agtgccgtaa aactatacca tagcggttgg 34920 ttagccaaag gctacaacac tgcggttgaa aagctctcag gctttggcca aggtaatgtt 34980 gcatcaagtg acacaggttt tccgtactca acatcaggca cgagtactga tgtgcataaa 35040 gcttgtggtg aactatggca tggcattacc gatacagact tcacaattgg tgcggttagt 35100 gatggcgatc taatgactgc agatgtcgat attgcttaca cctatcgtgg tgatatgtgt 35160 atctatcgcg atctgtattt tattcagcgc tcattaccta ctaaggtgat gaactacaaa 35220 tttaaaactg gtgaaataga aattattgat gctttctaca accctgacgg ctcaactggt 35280 caattaccat aaatttggcg cttatctaag ttgtacttgc tctgaccgac acaaataatg 35340 tcgtttctca gcatatatca aaatacacag caaaaatttg gggttagcta tatagctaac 35400 cccaaatcat atctaacttt acactgcatc taattccaaa cagtatccag ccaaaagcct 35460 aaactattgt tgactcagcg ctaaaatatg cgatgcaaca aacaagtctt ggatcgcaat 35520 acctgagcta tcaaaaatgg tcacctcatc agcactttga cgtcctgttg cggactcgtt 35580 tatcacctga ccaatctcaa ttatcggcgt atttctgcta tgttgaaact caccaataac 35640 aatagattga gaagcaaagt cgcaaaacaa gcgagcatga ctatataggt cagttggcaa 35700 ctcttgctta cccactttat cagcgcccat tgcagaaata tgcgttcctg cttgtaccca 35760 ctgcgcttca aataaaggcg cttgagctgt ggttgctgtg ataataatat ctgcttgttc 35820 acaagcagct tgtgcatcac aagcttcggc attaatgcct ttttctaata aacgcttaac 35880 caagttttca gttttgctag cactacggcc aactaccaat accttagtta atgaacgaac 35940 cttgctcact gctagcactt catattcagc ctgatgaccg gtaccaaaaa cagttaatac 36000 cgtagcatct tctctcgcga ggtaactcac tgctactgca tcggcagcac cagtgcggta 36060 agcattaacg gtagtggcag caatcaccgn ctgcaacata ccggttaatg gatcgagtaa 36120 aaatacgtta gtgccgtggc atggtaaacc atgtttatgg ttatcaggcc aatagctgcc 36180 tgttttccag ccgacaaggt ttggcgttga agccgacttt aatgagaaca tttcattaag 36240 gttcgcgccc tgtgcattaa ctaccgggaa caaggttgct ttatcatcta cggcagcgac 36300 aaacgcttct ttaacagcga tataagccag ctcatgggag atgagctttg atgtttgcgc 36360 ttcagttaaa tagatcatat taccacccct gcactcgatt ccagatctca tagccaccat 36420 tatcaccatc agtatcaaat acatggtact gagcgtgcat tgaagctgtt gcacaggcgt 36480 ggttcggcaa aatatgtaga cgactaccta ccgggaactg cgctaaatca ataacgccgc 36540 catcaactgc ttcaataatg ccgtgctctt gattaacagt tataacctgt agacctgata 36600 acacgtgacc gctgtcgtca cacactaaac cataaccaca atcttttggc tgctctgcag 36660 tacctctatc acccgaaaga gccatccaac ccgcatcaat gaaaatccag tttttatcag 36720 gattatgacc aataacactg gtcactaccg ttgcggcaat atcagttaac tgacacacgt 36780 ttagccctgc catgactaaa tcgaagaagg tgtacacacc cgctctaacc tcggtgatcc 36840 catcaaggtt ttgatagctt tgcgctgttg gtgttgaacc aatactaacg atgtcacatt 36900 gcatacccgc tgcgcgaatg cgtcagcagc ttgtacagcc gctgcaactt cattttgcgc 36960 cgcatcaatt aattgctgtt tttcaaaaca ttgatatgac tcaccagcgt gagtgagtac 37020 gccgtgaaaa ctcgctgcgc cagacgttag tatctgagca atttcaatca acttatcggc 37080 ttccggtgga ataccaccac gatggccatc acaatcaatt tcaattaatg ctggtatttg 37140 gcagtcataa gaaccacaga aatgatttag ctgatgcgct tgctcaacac tatcaagtaa 37200 aactcttgca ttaatacctt ggtccaacat tttagcaata cgcggcaact taccatcggc 37260 aatacctact gcataaataa tgtctgtgta acctttagat gctaaggcct cggcctcttt 37320 taccgttgat acagtgactg gtgagttttt agtgggtaat aaaaactcgg ctgcttcaag 37380 tgatcttaac gttttaaaat gcggtcttag gtttgcacct aatccttcaa ttttttggcg 37440 tagttgactg aggttattaa taaatactgg cttatttaca tataaaaacg gtgtatcaat 37500 tgcttgatac tgactttgct gagtcgtgga aagtatttga gtagatggca tctttaatat 37560 cctagttcat caatcaatct aacaagtttg atgcctagcc acagtggctt gtattcatga 37620 tgctttggaa aatgcttata ttcaaagtat ttgaaagaca tcaaacttct tgtttaatgc 37680 tcagtatcca ccagcacgca tttattttat attaactatt atcaagatat agattaggtt 37740 caaaccaaat gattagtact gaagatctac gttttatcag cgtaatcgcc agtcatcgca 37800 ccttagctga tgccgctaga acactaaata tcacgccacc atcagtgaca ttaaggttgc 37860 agcatattga aaagaaacta tcgattagcc tgatc 37895 <210> 2 <211> 831 <212> DNA <213> Shewanella putrefaciens SCRC-2874 (FERM BP-1625) <400> 1 atg gta aga ggc tat ttg cgc gct tta ttg tca caa cat agt gaa ata 48 Met Val Arg Gly Tyr Leu Arg Ala Leu Leu Ser Gln His Ser Glu Ile 1 5 10 15 cgc ccc aat gaa tgg cgc ttt gaa tat ggc gac aaa ggt aag cct aga 96 Arg Pro Asn Glu Trp Arg Phe Glu Tyr Gly Asp Lys Gly Lys Pro Arg 20 25 30 ttg agt gat gcg caa ttt gct caa acc ggg gtc cac ttt aat gtg agt 144 Leu Ser Asp Ala Gln Phe Ala Gln Thr Gly Val His Phe Asn Val Ser 35 40 45 cat agt gga gat tgg cta tta gta ggc att tgc act gct gat aat aaa 192 His Ser Gly Asp Trp Leu Leu Val Gly Ile Cys Thr Ala Asp Asn Lys 50 55 60 ggc gcc agt cag gca agc aag gag gaa act gac tct gct agt att gag 240 Gly Ala Ser Gln Ala Ser Lys Glu Glu Thr Asp Ser Ala Ser Ile Glu 65 70 75 80 ttt ggc gtc gac att gag cgt tgc cgt aac agc acc aat atc cac tct 288 Phe Gly Val Asp Ile Glu Arg Cys Arg Asn Ser Thr Asn Ile His Ser 85 90 95 att ctt agt cat tat ttc tct gaa tca gaa aag cga gcc ttg tta gcg 336 Ile Leu Ser His Tyr Phe Ser Glu Ser Glu Lys Arg Ala Leu Leu Ala 100 105 110 tta cca gag gcc ttg cag cga gac cgc ttt ttt gat ttg tgg gcg ctc 384 Leu Pro Glu Ala Leu Gln Arg Asp Arg Phe Phe Asp Leu Trp Ala Leu 115 120 125 aag gag tct tac att aaa gcg aaa gga ctt ggg ctg gca tta tcg cta 432 Lys Glu Ser Tyr Ile Lys Ala Lys Gly Leu Gly Leu Ala Leu Ser Leu 130 135 140 aaa tct ttt gcg ttt gac ttc tct gca ctg agc gaa act ttt ctt gga 480 Lys Ser Phe Ala Phe Asp Phe Ser Ala Leu Ser Glu Thr Phe Leu Gly 145 150 155 160 gtt aat gca cct aaa agc ttg agc cat tgt gtt gat att tcc gat gct 528 Val Asn Ala Pro Lys Ser Leu Ser His Cys Val Asp Ile Ser Asp Ala 165 170 175 att gcg gat cac aag gtt gag cat caa ctt aat cag cga cag gtt ttg 576 Ile Ala Asp His Lys Val Glu His Gln Leu Asn Gln Arg Gln Val Leu 180 185 190 tta aaa caa gat att ggt ctt gct tta cta gag tcg agt tct aat aag 624 Leu Lys Gln Asp Ile Gly Leu Ala Leu Leu Glu Ser Ser Ser Asn Lys 195 200 205 cct aac gct gag cca caa aag tct ggt tta ggt ttg att gag gct aaa 672 Pro Asn Ala Glu Pro Gln Lys Ser Gly Leu Gly Leu Ile Glu Ala Lys 210 215 220 gaa cag caa atg aac gct gct gat aat tgg cat tgt tta ctg ggc cat 720 Glu Gln Gln Met Asn Ala Ala Asp Asn Trp His Cys Leu Leu Gly His 225 230 235 240 ctt gat gat agt tat cgt ttt gca ctg agt att ggt cag tgt cag caa 768 Leu Asp Asp Ser Tyr Arg Phe Ala Leu Ser Ile Gly Gln Cys Gln Gln 245 250 255 ata agt att gca gca gaa gaa gtg aat ttt aaa gct gtt gtt cga gct 816 Ile Ser Ile Ala Ala Glu Glu Val Asn Phe Lys Ala Val Val Arg Ala 260 265 270 tca gct aag act agc 831 Ser Ala Lys Thr Ser 275 <210> 3 <211> 277 <212> PRT <400> 1 Met Val Arg Gly Tyr Leu Arg Ala Leu Leu Ser Gln His Ser Glu Ile 1 5 10 15 Arg Pro Asn Glu Trp Arg Phe Glu Tyr Gly Asp Lys Gly Lys Pro Arg 20 25 30 Leu Ser Asp Ala Gln Phe Ala Gln Thr Gly Val His Phe Asn Val Ser 35 40 45 His Ser Gly Asp Trp Leu Leu Val Gly Ile Cys Thr Ala Asp Asn Lys 50 55 60 Gly Ala Ser Gln Ala Ser Lys Glu Glu Thr Asp Ser Ala Ser Ile Glu 65 70 75 80 Phe Gly Val Asp Ile Glu Arg Cys Arg Asn Ser Thr Asn Ile His Ser 85 90 95 Ile Leu Ser His Tyr Phe Ser Glu Ser Glu Lys Arg Ala Leu Leu Ala 100 105 110 Leu Pro Glu Ala Leu Gln Arg Asp Arg Phe Phe Asp Leu Trp Ala Leu 115 120 125 Lys Glu Ser Tyr Ile Lys Ala Lys Gly Leu Gly Leu Ala Leu Ser Leu 130 135 140 Lys Ser Phe Ala Phe Asp Phe Ser Ala Leu Ser Glu Thr Phe Leu Gly 145 150 155 160 Val Asn Ala Pro Lys Ser Leu Ser His Cys Val Asp Ile Ser Asp Ala 165 170 175 Ile Ala Asp His Lys Val Glu His Gln Leu Asn Gln Arg Gln Val Leu 180 185 190 Leu Lys Gln Asp Ile Gly Leu Ala Leu Leu Glu Ser Ser Ser Asn Lys 195 200 205 Pro Asn Ala Glu Pro Gln Lys Ser Gly Leu Gly Leu Ile Glu Ala Lys 210 215 220 Glu Gln Gln Met Asn Ala Ala Asp Asn Trp His Cys Leu Leu Gly His 225 230 235 240 Leu Asp Asp Ser Tyr Arg Phe Ala Leu Ser Ile Gly Gln Cys Gln Gln 245 250 255 Ile Ser Ile Ala Ala Glu Glu Val Asn Phe Lys Ala Val Val Arg Ala 260 265 270 Ser Ala Lys Thr Ser 275 <210> 4 <211> 864 <212> DNA <213> Shewanella putrefaciens SCRC-2874 (FERM BP-1625) <400> 1 atg gca aaa ata aat agt gaa cac ttg gat gaa gct act att act tcg 48 Met Ala Lys Ile Asn Ser Glu His Leu Asp Glu Ala Thr Ile Thr Ser 1 5 10 15 aat aag tgt acg caa aca gag act gag gct cgg cat aga aat gcc act 96 Asn Lys Cys Thr Gln Thr Glu Thr Glu Ala Arg His Arg Asn Ala Thr 20 25 30 aca aca cct gag atg cgc cga ttc ata caa gag tcg gat ctc agt gtt 144 Thr Thr Pro Glu Met Arg Arg Phe Ile Gln Glu Ser Asp Leu Ser Val 35 40 45 agc caa ctg tct aaa ata tta aat atc agt gaa gct acc gta cgt aag 192 Ser Gln Leu Ser Lys Ile Leu Asn Ile Ser Glu Ala Thr Val Arg Lys 50 55 60 tgg cgc aag cgt gac tct gtc gaa aac tgt cct aat acc ccg cac cat 240 Trp Arg Lys Arg Asp Ser Val Glu Asn Cys Pro Asn Thr Pro His His 65 70 75 80 ctc aat acc acg cta acc cct ttg caa gaa tat gtg gtt gtg ggc ctg 288 Leu Asn Thr Thr Leu Thr Pro Leu Gln Glu Tyr Val Val Val Gly Leu 85 90 95 cgt tat caa ttg aaa atg cca tta gac aga ttg ctc aaa gca acc caa 336 Arg Tyr Gln Leu Lys Met Pro Leu Asp Arg Leu Leu Lys Ala Thr Gln 100 105 110 gag ttt atc aat cca aac gtg tcg cgc tca ggt tta gca aga tgt ttg 384 Glu Phe Ile Asn Pro Asn Val Ser Arg Ser Gly Leu Ala Arg Cys Leu 115 120 125 aag cgt tat ggc gtt tca cgg gtg agt gat atc caa agc cca cac gta 432 Lys Arg Tyr Gly Val Ser Arg Val Ser Asp Ile Gln Ser Pro His Val 130 135 140 cca atg cgc tac ttt aat caa att cca gtc act caa ggc agc gat gtg 480 Pro Met Arg Tyr Phe Asn Gln Ile Pro Val Thr Gln Gly Ser Asp Val 145 150 155 160 caa acc tac acc ctg cac tat gaa acg ctg gca aaa acc tta gcc tta 528 Gln Thr Tyr Thr Leu His Tyr Glu Thr Leu Ala Lys Thr Leu Ala Leu 165 170 175 cct agt acc gat ggt gac aat gtg gtg caa gtg gtg tct ctc acc att 576 Pro Ser Thr Asp Gly Asp Asn Val Val Gln Val Val Ser Leu Thr Ile 180 185 190 cca cca aag tta acc gaa gaa gca ccc agt tca att ttg ctc ggc att 624 Pro Pro Lys Leu Thr Glu Glu Ala Pro Ser Ser Ile Leu Leu Gly Ile 195 200 205 gat cct cat agc gac tgg atc tat ctc gac ata tac caa gat ggc aat 672 Asp Pro His Ser Asp Trp Ile Tyr Leu Asp Ile Tyr Gln Asp Gly Asn 210 215 220 aca caa gcc acg aat aga tat atg gct tat gtg cta aaa cac ggg cca 720 Thr Gln Ala Thr Asn Arg Tyr Met Ala Tyr Val Leu Lys His Gly Pro 225 230 235 240 ttc cat tta cga aag tta ctc gtg cgt aac tat cac acc ttt tta cag 768 Phe His Leu Arg Lys Leu Leu Val Arg Asn Tyr His Thr Phe Leu Gln 245 250 255 cgc ttt cct gga gcg acg caa aat cgc cgc ccc tct aaa gat atg cct 816 Arg Phe Pro Gly Ala Thr Gln Asn Arg Arg Pro Ser Lys Asp Met Pro 260 265 270 gaa aca atc aac aag acg cct gaa aca cag gca ccc agt gga gac tca 864 Glu Thr Ile Asn Lys Thr Pro Glu Thr Gln Ala Pro Ser Gly Asp Ser 275 280 285 <210> 5 <211> 288 <212> PRT <400> 1 Met Ala Lys Ile Asn Ser Glu His Leu Asp Glu Ala Thr Ile Thr Ser 1 5 10 15 Asn Lys Cys Thr Gln Thr Glu Thr Glu Ala Arg His Arg Asn Ala Thr 20 25 30 Thr Thr Pro Glu Met Arg Arg Phe Ile Gln Glu Ser Asp Leu Ser Val 35 40 45 Ser Gln Leu Ser Lys Ile Leu Asn Ile Ser Glu Ala Thr Val Arg Lys 50 55 60 Trp Arg Lys Arg Asp Ser Val Glu Asn Cys Pro Asn Thr Pro His His 65 70 75 80 Leu Asn Thr Thr Leu Thr Pro Leu Gln Glu Tyr Val Val Val Gly Leu 85 90 95 Arg Tyr Gln Leu Lys Met Pro Leu Asp Arg Leu Leu Lys Ala Thr Gln 100 105 110 Glu Phe Ile Asn Pro Asn Val Ser Arg Ser Gly Leu Ala Arg Cys Leu 115 120 125 Lys Arg Tyr Gly Val Ser Arg Val Ser Asp Ile Gln Ser Pro His Val 130 135 140 Pro Met Arg Tyr Phe Asn Gln Ile Pro Val Thr Gln Gly Ser Asp Val 145 150 155 160 Gln Thr Tyr Thr Leu His Tyr Glu Thr Leu Ala Lys Thr Leu Ala Leu 165 170 175 Pro Ser Thr Asp Gly Asp Asn Val Val Gln Val Val Ser Leu Thr Ile 180 185 190 Pro Pro Lys Leu Thr Glu Glu Ala Pro Ser Ser Ile Leu Leu Gly Ile 195 200 205 Asp Pro His Ser Asp Trp Ile Tyr Leu Asp Ile Tyr Gln Asp Gly Asn 210 215 220 Thr Gln Ala Thr Asn Arg Tyr Met Ala Tyr Val Leu Lys His Gly Pro 225 230 235 240 Phe His Leu Arg Lys Leu Leu Val Arg Asn Tyr His Thr Phe Leu Gln 245 250 255 Arg Phe Pro Gly Ala Thr Gln Asn Arg Arg Pro Ser Lys Asp Met Pro 260 265 270 Glu Thr Ile Asn Lys Thr Pro Glu Thr Gln Ala Pro Ser Gly Asp Ser 275 280 285 <210> 6 <211> 8268 <212> DNA <213> Shewanella putrefaciens SCRC-2874 (FERM BP-1625) <400> 1 atg agc cag acc tct aaa cct aca aac tca gca act gag caa gca caa 48 Met Ser Gln Thr Ser Lys Pro Thr Asn Ser Ala Thr Glu Gln Ala Gln 1 5 10 15 gac tca caa gct gac tct cgt tta aat aaa cga cta aaa gat atg cca 96 Asp Ser Gln Ala Asp Ser Arg Leu Asn Lys Arg Leu Lys Asp Met Pro 20 25 30 att gct att gtt ggc atg gcg agt att ttt gca aac tct cgc tat ttg 144 Ile Ala Ile Val Gly Met Ala Ser Ile Phe Ala Asn Ser Arg Tyr Leu 35 40 45 aat aag ttt tgg gac tta atc agc gaa aaa att gat gcg att act gaa 192 Asn Lys Phe Trp Asp Leu Ile Ser Glu Lys Ile Asp Ala Ile Thr Glu 50 55 60 tta cca tca act cac tgg cag cct gaa gaa tat tac gac gca gat aaa 240 Leu Pro Ser Thr His Trp Gln Pro Glu Glu Tyr Tyr Asp Ala Asp Lys 65 70 75 80 acc gca gca gac aaa agc tac tgt aaa cgt ggt ggc ttt ttg cca gat 288 Thr Ala Ala Asp Lys Ser Tyr Cys Lys Arg Gly Gly Phe Leu Pro Asp 85 90 95 gta gac ttc aac cca atg gag ttt ggc ctg ccg cca aac att ttg gaa 336 Val Asp Phe Asn Pro Met Glu Phe Gly Leu Pro Pro Asn Ile Leu Glu 100 105 110 ctg acc gat tca tcg caa cta tta tca ctc atc gtt gct aaa gaa gtg 384 Leu Thr Asp Ser Ser Gln Leu Leu Ser Leu Ile Val Ala Lys Glu Val 115 120 125 ttg gct gat gct aac tta cct gag aat tac gac cgc gat aaa att ggt 432 Leu Ala Asp Ala Asn Leu Pro Glu Asn Tyr Asp Arg Asp Lys Ile Gly 130 135 140 atc acc tta ggt gtc ggc ggt ggt caa aaa att agc cac agc cta aca 480 Ile Thr Leu Gly Val Gly Gly Gly Gln Lys Ile Ser His Ser Leu Thr 145 150 155 160 gcg cgt ctg caa tac cca gta ttg aag aaa gta ttc gcc aat agc ggc 528 Ala Arg Leu Gln Tyr Pro Val Leu Lys Lys Val Phe Ala Asn Ser Gly 165 170 175 att agt gac acc gac agc gaa atg ctt atc aag aaa ttc caa gac caa 576 Ile Ser Asp Thr Asp Ser Glu Met Leu Ile Lys Lys Phe Gln Asp Gln 180 185 190 tat gta cac tgg gaa gaa aac tcg ttc cca ggt tca ctt ggt aac gtt 624 Tyr Val His Trp Glu Glu Asn Ser Phe Pro Gly Ser Leu Gly Asn Val 195 200 205 att gcg ggc cgt atc gcc aac cgc ttc gat ttt ggc ggc atg aac tgt 672 Ile Ala Gly Arg Ile Ala Asn Arg Phe Asp Phe Gly Gly Met Asn Cys 210 215 220 gtg gtt gat gct gcc tgt gct gga tca ctt gct gct atg cgt atg gcg 720 Val Val Asp Ala Ala Cys Ala Gly Ser Leu Ala Ala Met Arg Met Ala 225 230 235 240 cta aca gag cta act gaa ggt cgc tct gaa atg atg atc acc ggt ggt 768 Leu Thr Glu Leu Thr Glu Gly Arg Ser Glu Met Met Ile Thr Gly Gly 245 250 255 gtg tgt act gat aac tca ccc tct atg tat atg agc ttt tca aaa acg 816 Val Cys Thr Asp Asn Ser Pro Ser Met Tyr Met Ser Phe Ser Lys Thr 260 265 270 ccc gcc ttt acc act aac gaa acc att cag cca ttt gat atc gac tca 864 Pro Ala Phe Thr Thr Asn Glu Thr Ile Gln Pro Phe Asp Ile Asp Ser 275 280 285 aaa ggc atg atg att ggt gaa ggt att ggc atg gtg gcg cta aag cgt 912 Lys Gly Met Met Ile Gly Glu Gly Ile Gly Met Val Ala Leu Lys Arg 290 295 300 ctt gaa gat gca gag cgc gat ggc gac cgc att tac tct gta att aaa 960 Leu Glu Asp Ala Glu Arg Asp Gly Asp Arg Ile Tyr Ser Val Ile Lys 305 310 315 320 ggt gtg ggt gca tca tct gac ggt aag ttt aaa tca atc tat gcc cct 1008 Gly Val Gly Ala Ser Ser Asp Gly Lys Phe Lys Ser Ile Tyr Ala Pro 325 330 335 cgc cca tca ggc caa gct aaa gca ctt aac cgt gcc tat gat gac gca 1056 Arg Pro Ser Gly Gln Ala Lys Ala Leu Asn Arg Ala Tyr Asp Asp Ala 340 345 350 ggt ttt gcg ccg cat acc tta ggt cta att gaa gct cac gga aca ggt 1104 Gly Phe Ala Pro His Thr Leu Gly Leu Ile Glu Ala His Gly Thr Gly 355 360 365 act gca gca ggt gac gcg gca gag ttt gcc ggc ctt tgc tca gta ttt 1152 Thr Ala Ala Gly Asp Ala Ala Glu Phe Ala Gly Leu Cys Ser Val Phe 370 375 380 gct gaa ggc aac gat acc aag caa cac att gcg cta ggt tca gtt aaa 1200 Ala Glu Gly Asn Asp Thr Lys Gln His Ile Ala Leu Gly Ser Val Lys 385 390 395 400 tca caa att ggt cat act aaa tca act gca ggt aca gca ggt tta att 1248 Ser Gln Ile Gly His Thr Lys Ser Thr Ala Gly Thr Ala Gly Leu Ile 405 410 415 aaa gct gct ctt gct ttg cat cac aag gta ctg ccg ccg acc att aac 1296 Lys Ala Ala Leu Ala Leu His His Lys Val Leu Pro Pro Thr Ile Asn 420 425 430 gtt agt cag cca agc cct aaa ctt gat atc gaa aac tca ccg ttt tat 1344 Val Ser Gln Pro Ser Pro Lys Leu Asp Ile Glu Asn Ser Pro Phe Tyr 435 440 445 cta aac act gag act cgt cca tgg tta cca cgt gtt gat ggt acg ccg 1392 Leu Asn Thr Glu Thr Arg Pro Trp Leu Pro Arg Val Asp Gly Thr Pro 450 455 460 cgc cgc gcg ggt att agc tca ttt ggt ttt ggt ggc act aac ttc cat 1440 Arg Arg Ala Gly Ile Ser Ser Phe Gly Phe Gly Gly Thr Asn Phe His 465 470 475 480 ttt gta cta gaa gag tac aac caa gaa cac agc cgt act gat agc gaa 1488 Phe Val Leu Glu Glu Tyr Asn Gln Glu His Ser Arg Thr Asp Ser Glu 485 490 495 aaa gct aag tat cgt caa cgc caa gtg gcg caa agc ttc ctt gtt agc 1536 Lys Ala Lys Tyr Arg Gln Arg Gln Val Ala Gln Ser Phe Leu Val Ser 500 505 510 gca agc gat aaa gca tcg cta att aac gag tta aac gta cta gca gca 1584 Ala Ser Asp Lys Ala Ser Leu Ile Asn Glu Leu Asn Val Leu Ala Ala 515 520 525 tct gca agc caa gct gag ttt atc ctc aaa gat gca gca gca aac tat 1632 Ser Ala Ser Gln Ala Glu Phe Ile Leu Lys Asp Ala Ala Ala Asn Tyr 530 535 540 ggc gta cgt gag ctt gat aaa aat gca cca cgg atc ggt tta gtt gca 1680 Gly Val Arg Glu Leu Asp Lys Asn Ala Pro Arg Ile Gly Leu Val Ala 545 550 555 560 aac aca gct gaa gag tta gca ggc cta att aag caa gca ctt gcc aaa 1728 Asn Thr Ala Glu Glu Leu Ala Gly Leu Ile Lys Gln Ala Leu Ala Lys 565 570 575 cta gca gct agc gat gat aac gca tgg cag cta cct ggt ggc act agc 1776 Leu Ala Ala Ser Asp Asp Asn Ala Trp Gln Leu Pro Gly Gly Thr Ser 580 585 590 tac cgc gcc gct gca gta gaa ggt aaa gtt gcc gca ctg ttt gct ggc 1824 Tyr Arg Ala Ala Ala Val Glu Gly Lys Val Ala Ala Leu Phe Ala Gly 595 600 605 caa ggt tca caa tat ctc aat atg ggc cgt gac ctt act tgt tat tac 1872 Gln Gly Ser Gln Tyr Leu Asn Met Gly Arg Asp Leu Thr Cys Tyr Tyr 610 615 620 cca gag atg cgt cag caa ttt gta act gca gat aaa gta ttt gcc gca 1920 Pro Glu Met Arg Gln Gln Phe Val Thr Ala Asp Lys Val Phe Ala Ala 625 630 635 640 aat gat aaa acg ccg tta tcg caa act ctg tat cca aag cct gta ttt 1968 Asn Asp Lys Thr Pro Leu Ser Gln Thr Leu Tyr Pro Lys Pro Val Phe 645 650 655 aat aaa gat gaa tta aag gct caa gaa gcc att ttg acc aat acc gcc 2016 Asn Lys Asp Glu Leu Lys Ala Gln Glu Ala Ile Leu Thr Asn Thr Ala 660 665 670 aat gcc caa agc gca att ggt gcg att tca atg ggt caa tac gat ttg 2064 Asn Ala Gln Ser Ala Ile Gly Ala Ile Ser Met Gly Gln Tyr Asp Leu 675 680 685 ttt act gcg gct ggc ttt aat gcc gac atg gtt gca ggc cat agc ttt 2112 Phe Thr Ala Ala Gly Phe Asn Ala Asp Met Val Ala Gly His Ser Phe 690 695 700 ggt gag cta agt gca ctg tgt gct gca ggt gtt att tca gct gat gac 2160 Gly Glu Leu Ser Ala Leu Cys Ala Ala Gly Val Ile Ser Ala Asp Asp 705 710 715 720 tac tac aag ctg gct ttt gct cgt ggt gag gct atg gca aca aaa gca 2208 Tyr Tyr Lys Leu Ala Phe Ala Arg Gly Glu Ala Met Ala Thr Lys Ala 725 730 735 ccg gct aaa gac ggc gtt gaa gca gat gca gga gca atg ttt gca atc 2256 Pro Ala Lys Asp Gly Val Glu Ala Asp Ala Gly Ala Met Phe Ala Ile 740 745 750 ata acc aag agt gct gca gac ctt gaa acc gtt gaa gcc acc atc gct 2304 Ile Thr Lys Ser Ala Ala Asp Leu Glu Thr Val Glu Ala Thr Ile Ala 755 760 765 aaa ttt gat ggg gtg aaa gtc gct aac tat aac gcg cca acg caa tca 2352 Lys Phe Asp Gly Val Lys Val Ala Asn Tyr Asn Ala Pro Thr Gln Ser 770 775 780 gta att gca ggc cca aca gca act acc gct gat gcg gct aaa gcg cta 2400 Val Ile Ala Gly Pro Thr Ala Thr Thr Ala Asp Ala Ala Lys Ala Leu 785 790 795 800 act gag ctt ggt tac aaa gcg att aac ctg cca gta tca ggt gca ttc 2448 Thr Glu Leu Gly Tyr Lys Ala Ile Asn Leu Pro Val Ser Gly Ala Phe 805 810 815 cac act gaa ctt gtt ggt cac gct caa gcg cca ttt gct aaa gcg att 2496 His Thr Glu Leu Val Gly His Ala Gln Ala Pro Phe Ala Lys Ala Ile 820 825 830 gac gca gcc aaa ttt act aaa aca agc cga gca ctt tac tca aat gca 2544 Asp Ala Ala Lys Phe Thr Lys Thr Ser Arg Ala Leu Tyr Ser Asn Ala 835 840 845 act ggc gga ctt tat gaa agc act gct gca aag att aaa gcc tcg ttt 2592 Thr Gly Gly Leu Tyr Glu Ser Thr Ala Ala Lys Ile Lys Ala Ser Phe 850 855 860 aag aaa cat atg ctt caa tca gtg cgc ttt act agc cag cta gaa gcc 2640 Lys Lys His Met Leu Gln Ser Val Arg Phe Thr Ser Gln Leu Glu Ala 865 870 875 880 atg tac aac gac ggc gcc cgt gta ttt gtt gaa ttt ggt cca aag aac 2688 Met Tyr Asn Asp Gly Ala Arg Val Phe Val Glu Phe Gly Pro Lys Asn 885 890 895 atc tta caa aaa tta gtt caa ggc acg ctt gtc aac act gaa aat gaa 2736 Ile Leu Gln Lys Leu Val Gln Gly Thr Leu Val Asn Thr Glu Asn Glu 900 905 910 gtt tgc act atc tct atc aac cct aat cct aaa gtt gat agt gat ctg 2784 Val Cys Thr Ile Ser Ile Asn Pro Asn Pro Lys Val Asp Ser Asp Leu 915 920 925 cag ctt aag caa gca gca atg cag cta gcg gtt act ggt gtg gta ctc 2832 Gln Leu Lys Gln Ala Ala Met Gln Leu Ala Val Thr Gly Val Val Leu 930 935 940 agt gaa att gac cca tac caa gcc gat att gcc gca cca gcg aaa aag 2880 Ser Glu Ile Asp Pro Tyr Gln Ala Asp Ile Ala Ala Pro Ala Lys Lys 945 950 955 960 tcg cca atg agc att tcg ctt aat gct gct aac cat atc agc aaa gca 2928 Ser Pro Met Ser Ile Ser Leu Asn Ala Ala Asn His Ile Ser Lys Ala 965 970 975 act cgc gct aag atg gcc aag tct tta gag aca ggt atc gtc acc tcg 2976 Thr Arg Ala Lys Met Ala Lys Ser Leu Glu Thr Gly Ile Val Thr Ser 980 985 990 caa ata gaa cat gtt att gaa gaa aaa atc gtt gaa gtt gag aaa ctg 3024 Gln Ile Glu His Val Ile Glu Glu Lys Ile Val Glu Val Glu Lys Leu 995 1000 1005 gtt gaa gtc gaa aag atc gtc gaa aaa gtg gtt gaa gta gag aaa gtt 3072 Val Glu Val Glu Lys Ile Val Glu Lys Val Val Glu Val Glu Lys Val 1010 1015 1020 gtt gag gtt gaa gct cct gtt aat tca gtg caa gcc aat gca att caa 3120 Val Glu Val Glu Ala Pro Val Asn Ser Val Gln Ala Asn Ala Ile Gln 1025 1030 1035 1040 acc cgt tca gtt gtc gct cca gta ata gag aac caa gtc gtg tct aaa 3168 Thr Arg Ser Val Val Ala Pro Val Ile Glu Asn Gln Val Val Ser Lys 1045 1050 1055 aac agt aag cca gca gtc cag agc att agt ggt gat gca ctc agc aac 3216 Asn Ser Lys Pro Ala Val Gln Ser Ile Ser Gly Asp Ala Leu Ser Asn 1060 1065 1070 ttt ttt gct gca cag cag caa acc gca cag ttg cat cag cag ttc tta 3264 Phe Phe Ala Ala Gln Gln Gln Thr Ala Gln Leu His Gln Gln Phe Leu 1075 1080 1085 gct att ccg cag caa tat ggt gag acg ttc act acg ctg atg acc gag 3312 Ala Ile Pro Gln Gln Tyr Gly Glu Thr Phe Thr Thr Leu Met Thr Glu 1090 1095 1100 caa gct aaa ctg gca agt tct ggt gtt gca att cca gag agt ctg caa 3360 Gln Ala Lys Leu Ala Ser Ser Gly Val Ala Ile Pro Glu Ser Leu Gln 1105 1110 1115 1120 cgc tca atg gag caa ttc cac caa cta caa gcg caa aca cta caa agc 3408 Arg Ser Met Glu Gln Phe His Gln Leu Gln Ala Gln Thr Leu Gln Ser 1125 1130 1135 cac acc cag ttc ctt gag atg caa gcg ggt agc aac att gca gcg tta 3456 His Thr Gln Phe Leu Glu Met Gln Ala Gly Ser Asn Ile Ala Ala Leu 1140 1145 1150 aac cta ctc aat agc agc caa gca act tac gct cca gcc att cac aat 3504 Asn Leu Leu Asn Ser Ser Gln Ala Thr Tyr Ala Pro Ala Ile His Asn 1155 1160 1165 gaa gcg att caa agc caa gtg gtt caa agc caa act gca gtc cag cca 3552 Glu Ala Ile Gln Ser Gln Val Val Gln Ser Gln Thr Ala Val Gln Pro 1170 1175 1180 gta att tca aca caa gtt aac cat gtg tca gag cag cca act caa gct 3600 Val Ile Ser Thr Gln Val Asn His Val Ser Glu Gln Pro Thr Gln Ala 1185 1190 1195 1200 cca gct cca aaa gcg cag cca gca cct gtg aca act cca gtt caa act 3648 Pro Ala Pro Lys Ala Gln Pro Ala Pro Val Thr Thr Pro Val Gln Thr 1205 1210 1215 gct ccg gca caa gtt gtt cgt caa gcc gca cca gtt caa gcc gct att 3696 Ala Pro Ala Gln Val Val Arg Gln Ala Ala Pro Val Gln Ala Ala Ile 1220 1225 1230 gaa ccg att aat aca agt gtt gcg act aca acg cct tca gcc ttc agc 3744 Glu Pro Ile Asn Thr Ser Val Ala Thr Thr Thr Pro Ser Ala Phe Ser 1235 1240 1245 gcc gaa aca gcc ctg agc gca aca aaa gtc caa gcc act atg ctt gaa 3792 Ala Glu Thr Ala Leu Ser Ala Thr Lys Val Gln Ala Thr Met Leu Glu 1250 1255 1260 gtg gtt gct gag aaa acc ggt tac cca act gaa atg cta gag ctt gaa 3840 Val Val Ala Glu Lys Thr Gly Tyr Pro Thr Glu Met Leu Glu Leu Glu 1265 1270 1275 1280 atg gat atg gaa gcc gat tta ggc atc gat tct atc aag cgt gta gaa 3888 Met Asp Met Glu Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg Val Glu 1285 1290 1295 att ctt ggc aca gta caa gat gag cta ccg ggt cta cct gag ctt agc 3936 Ile Leu Gly Thr Val Gln Asp Glu Leu Pro Gly Leu Pro Glu Leu Ser 1300 1305 1310 cct gaa gat cta gct gag tgt cga acg cta ggc gaa atc gtt gac tat 3984 Pro Glu Asp Leu Ala Glu Cys Arg Thr Leu Gly Glu Ile Val Asp Tyr 1315 1320 1325 atg ggc agt aaa ctg ccg gct gaa ggc tct atg aat tct cag ctg tct 4032 Met Gly Ser Lys Leu Pro Ala Glu Gly Ser Met Asn Ser Gln Leu Ser 1330 1335 1340 aca ggt tcc gca gct gcg act cct gca gcg aat ggt ctt tct gcg gag 4080 Thr Gly Ser Ala Ala Ala Thr Pro Ala Ala Asn Gly Leu Ser Ala Glu 1345 1350 1355 1360 aaa gtt caa gcg act atg atg tct gtg gtt gcc gaa aag act ggc tac 4128 Lys Val Gln Ala Thr Met Met Ser Val Val Ala Glu Lys Thr Gly Tyr 1365 1370 1375 cca act gaa atg cta gag ctt gaa atg gat atg gaa gcc gat tta ggc 4176 Pro Thr Glu Met Leu Glu Leu Glu Met Asp Met Glu Ala Asp Leu Gly 1380 1385 1390 ata gat tct atc aag cgc gtt gaa att ctt ggc aca gta caa gat gag 4224 Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Gly Thr Val Gln Asp Glu 1395 1400 1405 cta ccg ggt cta cct gag ctt agc cct gaa gat cta gct gag tgt cgt 4272 Leu Pro Gly Leu Pro Glu Leu Ser Pro Glu Asp Leu Ala Glu Cys Arg 1410 1415 1420 act cta ggc gaa atc gtt gac tat atg aac tct aaa ctc gct gac ggc 4320 Thr Leu Gly Glu Ile Val Asp Tyr Met Asn Ser Lys Leu Ala Asp Gly 1425 1430 1435 1440 tct aag ctg ccg gct gaa ggc tct atg aat tct cag ctg tct aca agt 4368 Ser Lys Leu Pro Ala Glu Gly Ser Met Asn Ser Gln Leu Ser Thr Ser 1445 1450 1455 gcc gca gct gcg act cct gca gcg aat ggt ctc tct gcg gag aaa gtt 4416 Ala Ala Ala Ala Thr Pro Ala Ala Asn Gly Leu Ser Ala Glu Lys Val 1460 1465 1470 caa gcg act atg atg tct gtg gtt gcc gaa aag act ggc tac cca act 4464 Gln Ala Thr Met Met Ser Val Val Ala Glu Lys Thr Gly Tyr Pro Thr 1475 1480 1485 gaa atg cta gaa ctt gaa atg gat atg gaa gct gac ctt ggc atc gat 4512 Glu Met Leu Glu Leu Glu Met Asp Met Glu Ala Asp Leu Gly Ile Asp 1490 1495 1500 tca atc aag cgc gtt gaa att ctt ggc aca gta caa gat gag cta ccg 4560 Ser Ile Lys Arg Val Glu Ile Leu Gly Thr Val Gln Asp Glu Leu Pro 1505 1510 1515 1520 ggt tta cct gag cta aat cca gaa gat ttg gca gag tgt cgt act ctt 4608 Gly Leu Pro Glu Leu Asn Pro Glu Asp Leu Ala Glu Cys Arg Thr Leu 1525 1530 1535 ggc gaa atc gtg act tat atg aac tct aaa ctc gct gac ggc tct aag 4656 Gly Glu Ile Val Thr Tyr Met Asn Ser Lys Leu Ala Asp Gly Ser Lys 1540 1545 1550 ctg cca gct gaa ggc tct atg cac tat cag ctg tct aca agt acc gct 4704 Leu Pro Ala Glu Gly Ser Met His Tyr Gln Leu Ser Thr Ser Thr Ala 1555 1560 1565 gct gcg act cct gta gcg aat ggt ctc tct gca gaa aaa gtt caa gcg 4752 Ala Ala Thr Pro Val Ala Asn Gly Leu Ser Ala Glu Lys Val Gln Ala 1570 1575 1580 acc atg atg tct gta gtt gca gat aaa act ggc tac cca act gaa atg 4800 Thr Met Met Ser Val Val Ala Asp Lys Thr Gly Tyr Pro Thr Glu Met 1585 1590 1595 1600 ctt gaa ctt gaa atg gat atg gaa gcc gat tta ggt atc gat tct atc 4848 Leu Glu Leu Glu Met Asp Met Glu Ala Asp Leu Gly Ile Asp Ser Ile 1605 1610 1615 aag cgc gtt gaa att ctt ggc aca gta caa gat gag cta ccg ggt tta 4896 Lys Arg Val Glu Ile Leu Gly Thr Val Gln Asp Glu Leu Pro Gly Leu 1620 1625 1630 cct gag cta aat cca gaa gat cta gca gag tgt cgc acc cta ggc gaa 4944 Pro Glu Leu Asn Pro Glu Asp Leu Ala Glu Cys Arg Thr Leu Gly Glu 1635 1640 1645 atc gtt gac tat atg ggc agt aaa ctg ccg gct gaa ggc tct gct aat 4992 Ile Val Asp Tyr Met Gly Ser Lys Leu Pro Ala Glu Gly Ser Ala Asn 1650 1655 1660 aca agt gcc gct gcg tct ctt aat gtt agt gcc gtt gcg gcg cct caa 5040 Thr Ser Ala Ala Ala Ser Leu Asn Val Ser Ala Val Ala Ala Pro Gln 1665 1670 1675 1680 gct gct gcg act cct gta tcg aac ggt ctc tct gca gag aaa gtg caa 5088 Ala Ala Ala Thr Pro Val Ser Asn Gly Leu Ser Ala Glu Lys Val Gln 1685 1690 1695 agc act atg atg tca gta gtt gca gaa aag acc ggc tac cca act gaa 5136 Ser Thr Met Met Ser Val Val Ala Glu Lys Thr Gly Tyr Pro Thr Glu 1700 1705 1710 atg cta gaa ctt ggc atg gat atg gaa gcc gat tta ggt atc gac tca 5184 Met Leu Glu Leu Gly Met Asp Met Glu Ala Asp Leu Gly Ile Asp Ser 1715 1720 1725 att aaa cgc gtt gag att ctt ggc aca gta caa gat gag cta ccg ggt 5232 Ile Lys Arg Val Glu Ile Leu Gly Thr Val Gln Asp Glu Leu Pro Gly 1730 1735 1740 cta cca gag ctt aat cct gaa gat tta gct gag tgc cgt acg ctg ggc 5280 Leu Pro Glu Leu Asn Pro Glu Asp Leu Ala Glu Cys Arg Thr Leu Gly 1745 1750 1755 1760 gaa atc gtt gac tat atg aac tct aag ctg gct gac ggc tct aag ctt 5328 Glu Ile Val Asp Tyr Met Asn Ser Lys Leu Ala Asp Gly Ser Lys Leu 1765 1770 1775 cca gct gaa ggc tct gct aat aca agt gcc act gct gcg act cct gca 5376 Pro Ala Glu Gly Ser Ala Asn Thr Ser Ala Thr Ala Ala Thr Pro Ala 1780 1785 1790 gtg aat ggt ctt tct gct gac aag gta cag gcg act atg atg tct gta 5424 Val Asn Gly Leu Ser Ala Asp Lys Val Gln Ala Thr Met Met Ser Val 1795 1800 1805 gtt gct gaa aag acc ggc tac cca act gaa atg cta gaa ctt ggc atg 5472 Val Ala Glu Lys Thr Gly Tyr Pro Thr Glu Met Leu Glu Leu Gly Met 1810 1815 1820 gat atg gaa gca gac ctt ggt att gat tct att aag cgc gtt gaa att 5520 Asp Met Glu Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile 1825 1830 1835 1840 ctt ggc aca gta caa gat gag ctc cca ggt tta cct gag ctt aat cct 5568 Leu Gly Thr Val Gln Asp Glu Leu Pro Gly Leu Pro Glu Leu Asn Pro 1845 1850 1855 gaa gat ctc gct gag tgc cgc acg ctt ggc gaa atc gtt agc tat atg 5616 Glu Asp Leu Ala Glu Cys Arg Thr Leu Gly Glu Ile Val Ser Tyr Met 1860 1865 1870 aac tct caa ctg gct gat ggc tct aaa ctt tct aca agt gcg gct gaa 5664 Asn Ser Gln Leu Ala Asp Gly Ser Lys Leu Ser Thr Ser Ala Ala Glu 1875 1880 1885 ggc tct gct gat aca agt gct gca aat gct gca aag ccg gca gca att 5712 Gly Ser Ala Asp Thr Ser Ala Ala Asn Ala Ala Lys Pro Ala Ala Ile 1890 1895 1900 tcg gca gaa cca agt gtt gag ctt cct cct cat agc gag gta gcg cta 5760 Ser Ala Glu Pro Ser Val Glu Leu Pro Pro His Ser Glu Val Ala Leu 1905 1910 1915 1920 aaa aag ctt aat gcg gcg aac aag cta gaa aat tgt ttc gcc gca gac 5808 Lys Lys Leu Asn Ala Ala Asn Lys Leu Glu Asn Cys Phe Ala Ala Asp 1925 1930 1935 gca agt gtt gtg att aac gat gat ggt cac aac gca ggc gtt tta gct 5856 Ala Ser Val Val Ile Asn Asp Asp Gly His Asn Ala Gly Val Leu Ala 1940 1945 1950 gag aaa ctt att aaa caa ggc cta aaa gta gcc gtt gtg cgt tta ccg 5904 Glu Lys Leu Ile Lys Gln Gly Leu Lys Val Ala Val Val Arg Leu Pro 1955 1960 1965 aaa ggt cag cct caa tcg cca ctt tca agc gat gtt gct agc ttt gag 5952 Lys Gly Gln Pro Gln Ser Pro Leu Ser Ser Asp Val Ala Ser Phe Glu 1970 1975 1980 ctt gcc tca agc caa gaa tct gag ctt gaa gcc agt atc act gca gtt 6000 Leu Ala Ser Ser Gln Glu Ser Glu Leu Glu Ala Ser Ile Thr Ala Val 1985 1990 1995 2000 atc gcg cag att gaa act cag gtt ggc gct att ggt ggc ttt att cac 6048 Ile Ala Gln Ile Glu Thr Gln Val Gly Ala Ile Gly Gly Phe Ile His 2005 2010 2015 ttg caa cca gaa gcg aat aca gaa gag caa acg gca gta aac cta gat 6096 Leu Gln Pro Glu Ala Asn Thr Glu Glu Gln Thr Ala Val Asn Leu Asp 2020 2025 2030 gcg caa agt ttt act cac gtt agc aat gcg ttc ttg tgg gcc aaa tta 6144 Ala Gln Ser Phe Thr His Val Ser Asn Ala Phe Leu Trp Ala Lys Leu 2035 2040 2045 ttg caa cca aag ctc gtt gct gga gca gat gcg cgt cgc tgt ttt gta 6192 Leu Gln Pro Lys Leu Val Ala Gly Ala Asp Ala Arg Arg Cys Phe Val 2050 2055 2060 aca gta agc cgt atc gac ggt ggc ttt ggt tac cta aat act gac gcc 6240 Thr Val Ser Arg Ile Asp Gly Gly Phe Gly Tyr Leu Asn Thr Asp Ala 2065 2070 2075 2080 cta aaa gat gct gag cta aac caa gca gca tta gct ggt tta act aaa 6288 Leu Lys Asp Ala Glu Leu Asn Gln Ala Ala Leu Ala Gly Leu Thr Lys 2085 2090 2095 acc tta agc cat gaa tgg cca caa gtg ttc tgt cgc gcg cta gat att 6336 Thr Leu Ser His Glu Trp Pro Gln Val Phe Cys Arg Ala Leu Asp Ile 2100 2105 2110 gca aca gat gtt gat gca acc cat ctt gct gat gca atc acc agt gaa 6384 Ala Thr Asp Val Asp Ala Thr His Leu Ala Asp Ala Ile Thr Ser Glu 2115 2120 2125 cta ttt gat agc caa gct cag cta cct gaa gtg ggc tta agc tta att 6432 Leu Phe Asp Ser Gln Ala Gln Leu Pro Glu Val Gly Leu Ser Leu Ile 2130 2135 2140 gat ggc aaa gtt aac cgc gta act cta gtt gct gct gaa gct gca gat 6480 Asp Gly Lys Val Asn Arg Val Thr Leu Val Ala Ala Glu Ala Ala Asp 2145 2150 2155 2160 aaa aca gca aaa gca gag ctt aac agc aca gat aaa atc tta gtg act 6528 Lys Thr Ala Lys Ala Glu Leu Asn Ser Thr Asp Lys Ile Leu Val Thr 2165 2170 2175 ggt ggg gca aaa ggg gtg aca ttt gaa tgt gca ctg gca tta gca tct 6576 Gly Gly Ala Lys Gly Val Thr Phe Glu Cys Ala Leu Ala Leu Ala Ser 2180 2185 2190 cgc agc cag tct cac ttt atc tta gct ggg cgc agt gaa tta caa gct 6624 Arg Ser Gln Ser His Phe Ile Leu Ala Gly Arg Ser Glu Leu Gln Ala 2195 2200 2205 tta cca agc tgg gct gag ggt aag caa act agc gag cta aaa tca gct 6672 Leu Pro Ser Trp Ala Glu Gly Lys Gln Thr Ser Glu Leu Lys Ser Ala 2210 2215 2220 gca atc gca cat att att tct act ggt caa aag cca acg cct aag caa 6720 Ala Ile Ala His Ile Ile Ser Thr Gly Gln Lys Pro Thr Pro Lys Gln 2225 2230 2235 2240 gtt gaa gcc gct gtg tgg cca gtg caa agc agc att gaa att aat gcc 6768 Val Glu Ala Ala Val Trp Pro Val Gln Ser Ser Ile Glu Ile Asn Ala 2245 2250 2255 gcc cta gcc gcc ttt aac aaa gtt ggc gcc tca gct gaa tac gtc agc 6816 Ala Leu Ala Ala Phe Asn Lys Val Gly Ala Ser Ala Glu Tyr Val Ser 2260 2265 2270 atg gat gtt acc gat agc gcc gca atc aca gca gca ctt aat ggt cgc 6864 Met Asp Val Thr Asp Ser Ala Ala Ile Thr Ala Ala Leu Asn Gly Arg 2275 2280 2285 tca aat gag atc acc ggt ctt att cat ggc gca ggt gta cta gcc gac 6912 Ser Asn Glu Ile Thr Gly Leu Ile His Gly Ala Gly Val Leu Ala Asp 2290 2295 2300 aag cat att caa gac aag act ctt gct gaa ctt gct aaa gtt tat ggc 6960 Lys His Ile Gln Asp Lys Thr Leu Ala Glu Leu Ala Lys Val Tyr Gly 2305 2310 2315 2320 act aaa gtc aac ggc cta aaa gcg ctg ctc gcg gca ctt gag cca agc 7008 Thr Lys Val Asn Gly Leu Lys Ala Leu Leu Ala Ala Leu Glu Pro Ser 2325 2330 2335 aaa att aaa tta ctt gct atg ttc tca tct gca gca ggt ttt tac ggt 7056 Lys Ile Lys Leu Leu Ala Met Phe Ser Ser Ala Ala Gly Phe Tyr Gly 2340 2345 2350 aat atc ggc caa agc gat tac gcg atg tcg aac gat att ctt aac aag 7104 Asn Ile Gly Gln Ser Asp Tyr Ala Met Ser Asn Asp Ile Leu Asn Lys 2355 2360 2365 gca gcg ctg cag ttc acc gct cgc aac cca caa gct aaa gtc atg agc 7152 Ala Ala Leu Gln Phe Thr Ala Arg Asn Pro Gln Ala Lys Val Met Ser 2370 2375 2380 ttt aac tgg ggt cct tgg gat ggc ggc atg gtt aac cca gcg ctt aaa 7200 Phe Asn Trp Gly Pro Trp Asp Gly Gly Met Val Asn Pro Ala Leu Lys 2385 2390 2395 2400 aag atg ttt acc gag cgt ggt gtg tac gtt att cca cta aaa gca ggt 7248 Lys Met Phe Thr Glu Arg Gly Val Tyr Val Ile Pro Leu Lys Ala Gly 2405 2410 2415 gca gag cta ttt gcc act cag cta ttg gct gaa act ggc gtg cag ttg 7296 Ala Glu Leu Phe Ala Thr Gln Leu Leu Ala Glu Thr Gly Val Gln Leu 2420 2425 2430 ctc att ggt acg tca atg caa ggt ggc agc gac act aaa gca act gag 7344 Leu Ile Gly Thr Ser Met Gln Gly Gly Ser Asp Thr Lys Ala Thr Glu 2435 2440 2445 act gct tct gta aaa aag ctt aat gcg ggt gag gtg cta agt gca tcg 7392 Thr Ala Ser Val Lys Lys Leu Asn Ala Gly Glu Val Leu Ser Ala Ser 2450 2455 2460 cat ccg cgt gct ggt gca caa aaa aca cca cta caa gct gtc act gca 7440 His Pro Arg Ala Gly Ala Gln Lys Thr Pro Leu Gln Ala Val Thr Ala 2465 2470 2475 2480 acg cgt ctg tta acc cca agt gcc atg gtc ttc att gaa gat cac cgc 7488 Thr Arg Leu Leu Thr Pro Ser Ala Met Val Phe Ile Glu Asp His Arg 2485 2490 2495 att ggc ggt aac agt gtg ttg cca acg gta tgc gcc atc gac tgg atg 7536 Ile Gly Gly Asn Ser Val Leu Pro Thr Val Cys Ala Ile Asp Trp Met 2500 2505 2510 cgt gaa gcg gca agc gac atg ctt ggc gct caa gtt aag gta ctt gat 7584 Arg Glu Ala Ala Ser Asp Met Leu Gly Ala Gln Val Lys Val Leu Asp 2515 2520 2525 tac aag cta tta aaa ggc att gta ttt gag act gat gag ccg caa gag 7632 Tyr Lys Leu Leu Lys Gly Ile Val Phe Glu Thr Asp Glu Pro Gln Glu 2530 2535 2540 tta aca ctt gag cta acg cca gac gat tca gac gaa gct acg cta caa 7680 Leu Thr Leu Glu Leu Thr Pro Asp Asp Ser Asp Glu Ala Thr Leu Gln 2545 2550 2555 2560 gca tta atc agc tgt aat ggg cgt ccg caa tac aag gcg acg ctt atc 7728 Ala Leu Ile Ser Cys Asn Gly Arg Pro Gln Tyr Lys Ala Thr Leu Ile 2565 2570 2575 agt gat aat gcc gat att aag caa ctt aac aag cag ttt gat tta agc 7776 Ser Asp Asn Ala Asp Ile Lys Gln Leu Asn Lys Gln Phe Asp Leu Ser 2580 2585 2590 gct aag gcg att acc aca gca aaa gag ctt tat agc aac ggc acc ttg 7824 Ala Lys Ala Ile Thr Thr Ala Lys Glu Leu Tyr Ser Asn Gly Thr Leu 2595 2600 2605 ttc cac ggt ccg cgt cta caa ggg atc caa tct gta gtg cag ttc gat 7872 Phe His Gly Pro Arg Leu Gln Gly Ile Gln Ser Val Val Gln Phe Asp 2610 2615 2620 gat caa ggc tta att gct aaa gtc gct ctg cct aag gtt gaa ctt agc 7920 Asp Gln Gly Leu Ile Ala Lys Val Ala Leu Pro Lys Val Glu Leu Ser 2625 2630 2635 2640 gat tgt ggt gag ttc ttg ccg caa acc cac atg ggt ggc agt caa cct 7968 Asp Cys Gly Glu Phe Leu Pro Gln Thr His Met Gly Gly Ser Gln Pro 2645 2650 2655 ttt gct gag gac ttg cta tta caa gct atg ctg gtt tgg gct cgc ctt 8016 Phe Ala Glu Asp Leu Leu Leu Gln Ala Met Leu Val Trp Ala Arg Leu 2660 2665 2670 aaa act ggc tcg gca agt ttg cca tca agc att ggt gag ttt acc tca 8064 Lys Thr Gly Ser Ala Ser Leu Pro Ser Ser Ile Gly Glu Phe Thr Ser 2675 2680 2685 tac caa cca atg gcc ttt ggt gaa act ggt acc ata gag ctt gaa gtg 8112 Tyr Gln Pro Met Ala Phe Gly Glu Thr Gly Thr Ile Glu Leu Glu Val 2690 2695 2700 att aag cac aac aaa cgc tca ctt gaa gcg aat gtt gcg cta tat cgt 8160 Ile Lys His Asn Lys Arg Ser Leu Glu Ala Asn Val Ala Leu Tyr Arg 2705 2710 2715 2720 gac aac ggc gag tta agt gcc atg ttt aag tca gct aaa atc acc att 8208 Asp Asn Gly Glu Leu Ser Ala Met Phe Lys Ser Ala Lys Ile Thr Ile 2725 2730 2735 agc aaa agc tta aat tca gca ttt tta cct gct gtc tta gca aac gac 8256 Ser Lys Ser Leu Asn Ser Ala Phe Leu Pro Ala Val Leu Ala Asn Asp 2740 2745 2750 agt gag gcg aat 8268 Ser Glu Ala Asn 2755 <210> 7 <211> 2756 <212> PRT <400> 1 Met Ser Gln Thr Ser Lys Pro Thr Asn Ser Ala Thr Glu Gln Ala Gln 1 5 10 15 Asp Ser Gln Ala Asp Ser Arg Leu Asn Lys Arg Leu Lys Asp Met Pro 20 25 30 Ile Ala Ile Val Gly Met Ala Ser Ile Phe Ala Asn Ser Arg Tyr Leu 35 40 45 Asn Lys Phe Trp Asp Leu Ile Ser Glu Lys Ile Asp Ala Ile Thr Glu 50 55 60 Leu Pro Ser Thr His Trp Gln Pro Glu Glu Tyr Tyr Asp Ala Asp Lys 65 70 75 80 Thr Ala Ala Asp Lys Ser Tyr Cys Lys Arg Gly Gly Phe Leu Pro Asp 85 90 95 Val Asp Phe Asn Pro Met Glu Phe Gly Leu Pro Pro Asn Ile Leu Glu 100 105 110 Leu Thr Asp Ser Ser Gln Leu Leu Ser Leu Ile Val Ala Lys Glu Val 115 120 125 Leu Ala Asp Ala Asn Leu Pro Glu Asn Tyr Asp Arg Asp Lys Ile Gly 130 135 140 Ile Thr Leu Gly Val Gly Gly Gly Gln Lys Ile Ser His Ser Leu Thr 145 150 155 160 Ala Arg Leu Gln Tyr Pro Val Leu Lys Lys Val Phe Ala Asn Ser Gly 165 170 175 Ile Ser Asp Thr Asp Ser Glu Met Leu Ile Lys Lys Phe Gln Asp Gln 180 185 190 Tyr Val His Trp Glu Glu Asn Ser Phe Pro Gly Ser Leu Gly Asn Val 195 200 205 Ile Ala Gly Arg Ile Ala Asn Arg Phe Asp Phe Gly Gly Met Asn Cys 210 215 220 Val Val Asp Ala Ala Cys Ala Gly Ser Leu Ala Ala Met Arg Met Ala 225 230 235 240 Leu Thr Glu Leu Thr Glu Gly Arg Ser Glu Met Met Ile Thr Gly Gly 245 250 255 Val Cys Thr Asp Asn Ser Pro Ser Met Tyr Met Ser Phe Ser Lys Thr 260 265 270 Pro Ala Phe Thr Thr Asn Glu Thr Ile Gln Pro Phe Asp Ile Asp Ser 275 280 285 Lys Gly Met Met Ile Gly Glu Gly Ile Gly Met Val Ala Leu Lys Arg 290 295 300 Leu Glu Asp Ala Glu Arg Asp Gly Asp Arg Ile Tyr Ser Val Ile Lys 305 310 315 320 Gly Val Gly Ala Ser Ser Asp Gly Lys Phe Lys Ser Ile Tyr Ala Pro 325 330 335 Arg Pro Ser Gly Gln Ala Lys Ala Leu Asn Arg Ala Tyr Asp Asp Ala 340 345 350 Gly Phe Ala Pro His Thr Leu Gly Leu Ile Glu Ala His Gly Thr Gly 355 360 365 Thr Ala Ala Gly Asp Ala Ala Glu Phe Ala Gly Leu Cys Ser Val Phe 370 375 380 Ala Glu Gly Asn Asp Thr Lys Gln His Ile Ala Leu Gly Ser Val Lys 385 390 395 400 Ser Gln Ile Gly His Thr Lys Ser Thr Ala Gly Thr Ala Gly Leu Ile 405 410 415 Lys Ala Ala Leu Ala Leu His His Lys Val Leu Pro Pro Thr Ile Asn 420 425 430 Val Ser Gln Pro Ser Pro Lys Leu Asp Ile Glu Asn Ser Pro Phe Tyr 435 440 445 Leu Asn Thr Glu Thr Arg Pro Trp Leu Pro Arg Val Asp Gly Thr Pro 450 455 460 Arg Arg Ala Gly Ile Ser Ser Phe Gly Phe Gly Gly Thr Asn Phe His 465 470 475 480 Phe Val Leu Glu Glu Tyr Asn Gln Glu His Ser Arg Thr Asp Ser Glu 485 490 495 Lys Ala Lys Tyr Arg Gln Arg Gln Val Ala Gln Ser Phe Leu Val Ser 500 505 510 Ala Ser Asp Lys Ala Ser Leu Ile Asn Glu Leu Asn Val Leu Ala Ala 515 520 525 Ser Ala Ser Gln Ala Glu Phe Ile Leu Lys Asp Ala Ala Ala Asn Tyr 530 535 540 Gly Val Arg Glu Leu Asp Lys Asn Ala Pro Arg Ile Gly Leu Val Ala 545 550 555 560 Asn Thr Ala Glu Glu Leu Ala Gly Leu Ile Lys Gln Ala Leu Ala Lys 565 570 575 Leu Ala Ala Ser Asp Asp Asn Ala Trp Gln Leu Pro Gly Gly Thr Ser 580 585 590 Tyr Arg Ala Ala Ala Val Glu Gly Lys Val Ala Ala Leu Phe Ala Gly 595 600 605 Gln Gly Ser Gln Tyr Leu Asn Met Gly Arg Asp Leu Thr Cys Tyr Tyr 610 615 620 Pro Glu Met Arg Gln Gln Phe Val Thr Ala Asp Lys Val Phe Ala Ala 625 630 635 640 Asn Asp Lys Thr Pro Leu Ser Gln Thr Leu Tyr Pro Lys Pro Val Phe 645 650 655 Asn Lys Asp Glu Leu Lys Ala Gln Glu Ala Ile Leu Thr Asn Thr Ala 660 665 670 Asn Ala Gln Ser Ala Ile Gly Ala Ile Ser Met Gly Gln Tyr Asp Leu 675 680 685 Phe Thr Ala Ala Gly Phe Asn Ala Asp Met Val Ala Gly His Ser Phe 690 695 700 Gly Glu Leu Ser Ala Leu Cys Ala Ala Gly Val Ile Ser Ala Asp Asp 705 710 715 720 Tyr Tyr Lys Leu Ala Phe Ala Arg Gly Glu Ala Met Ala Thr Lys Ala 725 730 735 Pro Ala Lys Asp Gly Val Glu Ala Asp Ala Gly Ala Met Phe Ala Ile 740 745 750 Ile Thr Lys Ser Ala Ala Asp Leu Glu Thr Val Glu Ala Thr Ile Ala 755 760 765 Lys Phe Asp Gly Val Lys Val Ala Asn Tyr Asn Ala Pro Thr Gln Ser 770 775 780 Val Ile Ala Gly Pro Thr Ala Thr Thr Ala Asp Ala Ala Lys Ala Leu 785 790 795 800 Thr Glu Leu Gly Tyr Lys Ala Ile Asn Leu Pro Val Ser Gly Ala Phe 805 810 815 His Thr Glu Leu Val Gly His Ala Gln Ala Pro Phe Ala Lys Ala Ile 820 825 830 Asp Ala Ala Lys Phe Thr Lys Thr Ser Arg Ala Leu Tyr Ser Asn Ala 835 840 845 Thr Gly Gly Leu Tyr Glu Ser Thr Ala Ala Lys Ile Lys Ala Ser Phe 850 855 860 Lys Lys His Met Leu Gln Ser Val Arg Phe Thr Ser Gln Leu Glu Ala 865 870 875 880 Met Tyr Asn Asp Gly Ala Arg Val Phe Val Glu Phe Gly Pro Lys Asn 885 890 895 Ile Leu Gln Lys Leu Val Gln Gly Thr Leu Val Asn Thr Glu Asn Glu 900 905 910 Val Cys Thr Ile Ser Ile Asn Pro Asn Pro Lys Val Asp Ser Asp Leu 915 920 925 Gln Leu Lys Gln Ala Ala Met Gln Leu Ala Val Thr Gly Val Val Leu 930 935 940 Ser Glu Ile Asp Pro Tyr Gln Ala Asp Ile Ala Ala Pro Ala Lys Lys 945 950 955 960 Ser Pro Met Ser Ile Ser Leu Asn Ala Ala Asn His Ile Ser Lys Ala 965 970 975 Thr Arg Ala Lys Met Ala Lys Ser Leu Glu Thr Gly Ile Val Thr Ser 980 985 990 Gln Ile Glu His Val Ile Glu Glu Lys Ile Val Glu Val Glu Lys Leu 995 1000 1005 Val Glu Val Glu Lys Ile Val Glu Lys Val Val Glu Val Glu Lys Val 1010 1015 1020 Val Glu Val Glu Ala Pro Val Asn Ser Val Gln Ala Asn Ala Ile Gln 1025 1030 1035 1040 Thr Arg Ser Val Val Ala Pro Val Ile Glu Asn Gln Val Val Ser Lys 1045 1050 1055 Asn Ser Lys Pro Ala Val Gln Ser Ile Ser Gly Asp Ala Leu Ser Asn 1060 1065 1070 Phe Phe Ala Ala Gln Gln Gln Thr Ala Gln Leu His Gln Gln Phe Leu 1075 1080 1085 Ala Ile Pro Gln Gln Tyr Gly Glu Thr Phe Thr Thr Leu Met Thr Glu 1090 1095 1100 Gln Ala Lys Leu Ala Ser Ser Gly Val Ala Ile Pro Glu Ser Leu Gln 1105 1110 1115 1120 Arg Ser Met Glu Gln Phe His Gln Leu Gln Ala Gln Thr Leu Gln Ser 1125 1130 1135 His Thr Gln Phe Leu Glu Met Gln Ala Gly Ser Asn Ile Ala Ala Leu 1140 1145 1150 Asn Leu Leu Asn Ser Ser Gln Ala Thr Tyr Ala Pro Ala Ile His Asn 1155 1160 1165 Glu Ala Ile Gln Ser Gln Val Val Gln Ser Gln Thr Ala Val Gln Pro 1170 1175 1180 Val Ile Ser Thr Gln Val Asn His Val Ser Glu Gln Pro Thr Gln Ala 1185 1190 1195 1200 Pro Ala Pro Lys Ala Gln Pro Ala Pro Val Thr Thr Pro Val Gln Thr 1205 1210 1215 Ala Pro Ala Gln Val Val Arg Gln Ala Ala Pro Val Gln Ala Ala Ile 1220 1225 1230 Glu Pro Ile Asn Thr Ser Val Ala Thr Thr Thr Pro Ser Ala Phe Ser 1235 1240 1245 Ala Glu Thr Ala Leu Ser Ala Thr Lys Val Gln Ala Thr Met Leu Glu 1250 1255 1260 Val Val Ala Glu Lys Thr Gly Tyr Pro Thr Glu Met Leu Glu Leu Glu 1265 1270 1275 1280 Met Asp Met Glu Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg Val Glu 1285 1290 1295 Ile Leu Gly Thr Val Gln Asp Glu Leu Pro Gly Leu Pro Glu Leu Ser 1300 1305 1310 Pro Glu Asp Leu Ala Glu Cys Arg Thr Leu Gly Glu Ile Val Asp Tyr 1315 1320 1325 Met Gly Ser Lys Leu Pro Ala Glu Gly Ser Met Asn Ser Gln Leu Ser 1330 1335 1340 Thr Gly Ser Ala Ala Ala Thr Pro Ala Ala Asn Gly Leu Ser Ala Glu 1345 1350 1355 1360 Lys Val Gln Ala Thr Met Met Ser Val Val Ala Glu Lys Thr Gly Tyr 1365 1370 1375 Pro Thr Glu Met Leu Glu Leu Glu Met Asp Met Glu Ala Asp Leu Gly 1380 1385 1390 Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Gly Thr Val Gln Asp Glu 1395 1400 1405 Leu Pro Gly Leu Pro Glu Leu Ser Pro Glu Asp Leu Ala Glu Cys Arg 1410 1415 1420 Thr Leu Gly Glu Ile Val Asp Tyr Met Asn Ser Lys Leu Ala Asp Gly 1425 1430 1435 1440 Ser Lys Leu Pro Ala Glu Gly Ser Met Asn Ser Gln Leu Ser Thr Ser 1445 1450 1455 Ala Ala Ala Ala Thr Pro Ala Ala Asn Gly Leu Ser Ala Glu Lys Val 1460 1465 1470 Gln Ala Thr Met Met Ser Val Val Ala Glu Lys Thr Gly Tyr Pro Thr 1475 1480 1485 Glu Met Leu Glu Leu Glu Met Asp Met Glu Ala Asp Leu Gly Ile Asp 1490 1495 1500 Ser Ile Lys Arg Val Glu Ile Leu Gly Thr Val Gln Asp Glu Leu Pro 1505 1510 1515 1520 Gly Leu Pro Glu Leu Asn Pro Glu Asp Leu Ala Glu Cys Arg Thr Leu 1525 1530 1535 Gly Glu Ile Val Thr Tyr Met Asn Ser Lys Leu Ala Asp Gly Ser Lys 1540 1545 1550 Leu Pro Ala Glu Gly Ser Met His Tyr Gln Leu Ser Thr Ser Thr Ala 1555 1560 1565 Ala Ala Thr Pro Val Ala Asn Gly Leu Ser Ala Glu Lys Val Gln Ala 1570 1575 1580 Thr Met Met Ser Val Val Ala Asp Lys Thr Gly Tyr Pro Thr Glu Met 1585 1590 1595 1600 Leu Glu Leu Glu Met Asp Met Glu Ala Asp Leu Gly Ile Asp Ser Ile 1605 1610 1615 Lys Arg Val Glu Ile Leu Gly Thr Val Gln Asp Glu Leu Pro Gly Leu 1620 1625 1630 Pro Glu Leu Asn Pro Glu Asp Leu Ala Glu Cys Arg Thr Leu Gly Glu 1635 1640 1645 Ile Val Asp Tyr Met Gly Ser Lys Leu Pro Ala Glu Gly Ser Ala Asn 1650 1655 1660 Thr Ser Ala Ala Ala Ser Leu Asn Val Ser Ala Val Ala Ala Pro Gln 1665 1670 1675 1680 Ala Ala Ala Thr Pro Val Ser Asn Gly Leu Ser Ala Glu Lys Val Gln 1685 1690 1695 Ser Thr Met Met Ser Val Val Ala Glu Lys Thr Gly Tyr Pro Thr Glu 1700 1705 1710 Met Leu Glu Leu Gly Met Asp Met Glu Ala Asp Leu Gly Ile Asp Ser 1715 1720 1725 Ile Lys Arg Val Glu Ile Leu Gly Thr Val Gln Asp Glu Leu Pro Gly 1730 1735 1740 Leu Pro Glu Leu Asn Pro Glu Asp Leu Ala Glu Cys Arg Thr Leu Gly 1745 1750 1755 1760 Glu Ile Val Asp Tyr Met Asn Ser Lys Leu Ala Asp Gly Ser Lys Leu 1765 1770 1775 Pro Ala Glu Gly Ser Ala Asn Thr Ser Ala Thr Ala Ala Thr Pro Ala 1780 1785 1790 Val Asn Gly Leu Ser Ala Asp Lys Val Gln Ala Thr Met Met Ser Val 1795 1800 1805 Val Ala Glu Lys Thr Gly Tyr Pro Thr Glu Met Leu Glu Leu Gly Met 1810 1815 1820 Asp Met Glu Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile 1825 1830 1835 1840 Leu Gly Thr Val Gln Asp Glu Leu Pro Gly Leu Pro Glu Leu Asn Pro 1845 1850 1855 Glu Asp Leu Ala Glu Cys Arg Thr Leu Gly Glu Ile Val Ser Tyr Met 1860 1865 1870 Asn Ser Gln Leu Ala Asp Gly Ser Lys Leu Ser Thr Ser Ala Ala Glu 1875 1880 1885 Gly Ser Ala Asp Thr Ser Ala Ala Asn Ala Ala Lys Pro Ala Ala Ile 1890 1895 1900 Ser Ala Glu Pro Ser Val Glu Leu Pro Pro His Ser Glu Val Ala Leu 1905 1910 1915 1920 Lys Lys Leu Asn Ala Ala Asn Lys Leu Glu Asn Cys Phe Ala Ala Asp 1925 1930 1935 Ala Ser Val Val Ile Asn Asp Asp Gly His Asn Ala Gly Val Leu Ala 1940 1945 1950 Glu Lys Leu Ile Lys Gln Gly Leu Lys Val Ala Val Val Arg Leu Pro 1955 1960 1965 Lys Gly Gln Pro Gln Ser Pro Leu Ser Ser Asp Val Ala Ser Phe Glu 1970 1975 1980 Leu Ala Ser Ser Gln Glu Ser Glu Leu Glu Ala Ser Ile Thr Ala Val 1985 1990 1995 2000 Ile Ala Gln Ile Glu Thr Gln Val Gly Ala Ile Gly Gly Phe Ile His 2005 2010 2015 Leu Gln Pro Glu Ala Asn Thr Glu Glu Gln Thr Ala Val Asn Leu Asp 2020 2025 2030 Ala Gln Ser Phe Thr His Val Ser Asn Ala Phe Leu Trp Ala Lys Leu 2035 2040 2045 Leu Gln Pro Lys Leu Val Ala Gly Ala Asp Ala Arg Arg Cys Phe Val 2050 2055 2060 Thr Val Ser Arg Ile Asp Gly Gly Phe Gly Tyr Leu Asn Thr Asp Ala 2065 2070 2075 2080 Leu Lys Asp Ala Glu Leu Asn Gln Ala Ala Leu Ala Gly Leu Thr Lys 2085 2090 2095 Thr Leu Ser His Glu Trp Pro Gln Val Phe Cys Arg Ala Leu Asp Ile 2100 2105 2110 Ala Thr Asp Val Asp Ala Thr His Leu Ala Asp Ala Ile Thr Ser Glu 2115 2120 2125 Leu Phe Asp Ser Gln Ala Gln Leu Pro Glu Val Gly Leu Ser Leu Ile 2130 2135 2140 Asp Gly Lys Val Asn Arg Val Thr Leu Val Ala Ala Glu Ala Ala Asp 2145 2150 2155 2160 Lys Thr Ala Lys Ala Glu Leu Asn Ser Thr Asp Lys Ile Leu Val Thr 2165 2170 2175 Gly Gly Ala Lys Gly Val Thr Phe Glu Cys Ala Leu Ala Leu Ala Ser 2180 2185 2190 Arg Ser Gln Ser His Phe Ile Leu Ala Gly Arg Ser Glu Leu Gln Ala 2195 2200 2205 Leu Pro Ser Trp Ala Glu Gly Lys Gln Thr Ser Glu Leu Lys Ser Ala 2210 2215 2220 Ala Ile Ala His Ile Ile Ser Thr Gly Gln Lys Pro Thr Pro Lys Gln 2225 2230 2235 2240 Val Glu Ala Ala Val Trp Pro Val Gln Ser Ser Ile Glu Ile Asn Ala 2245 2250 2255 Ala Leu Ala Ala Phe Asn Lys Val Gly Ala Ser Ala Glu Tyr Val Ser 2260 2265 2270 Met Asp Val Thr Asp Ser Ala Ala Ile Thr Ala Ala Leu Asn Gly Arg 2275 2280 2285 Ser Asn Glu Ile Thr Gly Leu Ile His Gly Ala Gly Val Leu Ala Asp 2290 2295 2300 Lys His Ile Gln Asp Lys Thr Leu Ala Glu Leu Ala Lys Val Tyr Gly 2305 2310 2315 2320 Thr Lys Val Asn Gly Leu Lys Ala Leu Leu Ala Ala Leu Glu Pro Ser 2325 2330 2335 Lys Ile Lys Leu Leu Ala Met Phe Ser Ser Ala Ala Gly Phe Tyr Gly 2340 2345 2350 Asn Ile Gly Gln Ser Asp Tyr Ala Met Ser Asn Asp Ile Leu Asn Lys 2355 2360 2365 Ala Ala Leu Gln Phe Thr Ala Arg Asn Pro Gln Ala Lys Val Met Ser 2370 2375 2380 Phe Asn Trp Gly Pro Trp Asp Gly Gly Met Val Asn Pro Ala Leu Lys 2385 2390 2395 2400 Lys Met Phe Thr Glu Arg Gly Val Tyr Val Ile Pro Leu Lys Ala Gly 2405 2410 2415 Ala Glu Leu Phe Ala Thr Gln Leu Leu Ala Glu Thr Gly Val Gln Leu 2420 2425 2430 Leu Ile Gly Thr Ser Met Gln Gly Gly Ser Asp Thr Lys Ala Thr Glu 2435 2440 2445 Thr Ala Ser Val Lys Lys Leu Asn Ala Gly Glu Val Leu Ser Ala Ser 2450 2455 2460 His Pro Arg Ala Gly Ala Gln Lys Thr Pro Leu Gln Ala Val Thr Ala 2465 2470 2475 2480 Thr Arg Leu Leu Thr Pro Ser Ala Met Val Phe Ile Glu Asp His Arg 2485 2490 2495 Ile Gly Gly Asn Ser Val Leu Pro Thr Val Cys Ala Ile Asp Trp Met 2500 2505 2510 Arg Glu Ala Ala Ser Asp Met Leu Gly Ala Gln Val Lys Val Leu Asp 2515 2520 2525 Tyr Lys Leu Leu Lys Gly Ile Val Phe Glu Thr Asp Glu Pro Gln Glu 2530 2535 2540 Leu Thr Leu Glu Leu Thr Pro Asp Asp Ser Asp Glu Ala Thr Leu Gln 2545 2550 2555 2560 Ala Leu Ile Ser Cys Asn Gly Arg Pro Gln Tyr Lys Ala Thr Leu Ile 2565 2570 2575 Ser Asp Asn Ala Asp Ile Lys Gln Leu Asn Lys Gln Phe Asp Leu Ser 2580 2585 2590 Ala Lys Ala Ile Thr Thr Ala Lys Glu Leu Tyr Ser Asn Gly Thr Leu 2595 2600 2605 Phe His Gly Pro Arg Leu Gln Gly Ile Gln Ser Val Val Gln Phe Asp 2610 2615 2620 Asp Gln Gly Leu Ile Ala Lys Val Ala Leu Pro Lys Val Glu Leu Ser 2625 2630 2635 2640 Asp Cys Gly Glu Phe Leu Pro Gln Thr His Met Gly Gly Ser Gln Pro 2645 2650 2655 Phe Ala Glu Asp Leu Leu Leu Gln Ala Met Leu Val Trp Ala Arg Leu 2660 2665 2670 Lys Thr Gly Ser Ala Ser Leu Pro Ser Ser Ile Gly Glu Phe Thr Ser 2675 2680 2685 Tyr Gln Pro Met Ala Phe Gly Glu Thr Gly Thr Ile Glu Leu Glu Val 2690 2695 2700 Ile Lys His Asn Lys Arg Ser Leu Glu Ala Asn Val Ala Leu Tyr Arg 2705 2710 2715 2720 Asp Asn Gly Glu Leu Ser Ala Met Phe Lys Ser Ala Lys Ile Thr Ile 2725 2730 2735 Ser Lys Ser Leu Asn Ser Ala Phe Leu Pro Ala Val Leu Ala Asn Asp 2740 2745 2750 Ser Glu Ala Asn 2755 <210> 8 <211> 2340 <212> DNA <213> Shewanella putrefaciens SCRC-2874 (FERM BP-1625) <400> 1 gtg gaa caa acg cct aaa gct agt gcg atg ccg ctg cgc atc gca ctt 48 Val Glu Gln Thr Pro Lys Ala Ser Ala Met Pro Leu Arg Ile Ala Leu 1 5 10 15 atc tta ctg cca aca ccg cag ttt gaa gtt aac tct gtc gac cag tca 96 Ile Leu Leu Pro Thr Pro Gln Phe Glu Val Asn Ser Val Asp Gln Ser 20 25 30 gta tta gcc agc tat caa aca ctg cag cct gag cta aat gcc ctg ctt 144 Val Leu Ala Ser Tyr Gln Thr Leu Gln Pro Glu Leu Asn Ala Leu Leu 35 40 45 aat agt gcg ccg aca cct gaa atg ctc agc atc act atc tca gat gat 192 Asn Ser Ala Pro Thr Pro Glu Met Leu Ser Ile Thr Ile Ser Asp Asp 50 55 60 agc gat gca aac agc ttt gag tcg cag cta aat gct gcg acc aac gca 240 Ser Asp Ala Asn Ser Phe Glu Ser Gln Leu Asn Ala Ala Thr Asn Ala 65 70 75 80 att aac aat ggc tat atc gtc aag ctt gct acg gca act cac gct ttg 288 Ile Asn Asn Gly Tyr Ile Val Lys Leu Ala Thr Ala Thr His Ala Leu 85 90 95 tta atg ctg cct gca tta aaa gcg gcg caa atg cgg atc cat cct cat 336 Leu Met Leu Pro Ala Leu Lys Ala Ala Gln Met Arg Ile His Pro His 100 105 110 gcg cag ctt gcc gct atg cag caa gct aaa tcg acg cca atg agt caa 384 Ala Gln Leu Ala Ala Met Gln Gln Ala Lys Ser Thr Pro Met Ser Gln 115 120 125 gta tct ggt gag cta aag ctt ggc gct aat gcg cta agc cta gct cag 432 Val Ser Gly Glu Leu Lys Leu Gly Ala Asn Ala Leu Ser Leu Ala Gln 130 135 140 act aat gcg ctg tct cat gct tta agc caa gcc aag cgt aac tta act 480 Thr Asn Ala Leu Ser His Ala Leu Ser Gln Ala Lys Arg Asn Leu Thr 145 150 155 160 gat gtc agc gtg aat gag tgt ttt gag aac ctc aaa agt gaa cag cag 528 Asp Val Ser Val Asn Glu Cys Phe Glu Asn Leu Lys Ser Glu Gln Gln 165 170 175 ttc aca gag gtt tat tcg ctt att cag caa ctt gct agc cgc acc cat 576 Phe Thr Glu Val Tyr Ser Leu Ile Gln Gln Leu Ala Ser Arg Thr His 180 185 190 gtg aga aaa gag gtt aat caa ggt gtg gaa ctt ggc cct aaa caa gcc 624 Val Arg Lys Glu Val Asn Gln Gly Val Glu Leu Gly Pro Lys Gln Ala 195 200 205 aaa agc cac tat tgg ttt agc gaa ttt cac caa aac cgt gtt gct gcc 672 Lys Ser His Tyr Trp Phe Ser Glu Phe His Gln Asn Arg Val Ala Ala 210 215 220 atc aac ttt att aat ggc caa caa gca acc agc tat gtg ctt act caa 720 Ile Asn Phe Ile Asn Gly Gln Gln Ala Thr Ser Tyr Val Leu Thr Gln 225 230 235 240 ggt tca gga ttg tta gct gcg aaa tca atg cta aac cag caa aga tta 768 Gly Ser Gly Leu Leu Ala Ala Lys Ser Met Leu Asn Gln Gln Arg Leu 245 250 255 atg ttt atc ttg ccg ggt aac agt cag caa caa ata acc gca tca ata 816 Met Phe Ile Leu Pro Gly Asn Ser Gln Gln Gln Ile Thr Ala Ser Ile 260 265 270 act cag tta atg cag caa tta gag cgt ttg cag gta act gag gtt aat 864 Thr Gln Leu Met Gln Gln Leu Glu Arg Leu Gln Val Thr Glu Val Asn 275 280 285 gag ctt tct cta gaa tgc caa cta gag ctg ctc agc ata atg tat gac 912 Glu Leu Ser Leu Glu Cys Gln Leu Glu Leu Leu Ser Ile Met Tyr Asp 290 295 300 aac tta gtc aac gca gac aaa ctc act act cgc gat agt aag ccc gct 960 Asn Leu Val Asn Ala Asp Lys Leu Thr Thr Arg Asp Ser Lys Pro Ala 305 310 315 320 tat cag gct gtg att caa gca agc tct gtt agc gct gca aag caa gag 1008 Tyr Gln Ala Val Ile Gln Ala Ser Ser Val Ser Ala Ala Lys Gln Glu 325 330 335 tta agc gcg ctt aac gat gca ctc aca gcg ctg ttt gct gag caa aca 1056 Leu Ser Ala Leu Asn Asp Ala Leu Thr Ala Leu Phe Ala Glu Gln Thr 340 345 350 aac gcc aca tca acg aat aaa ggc tta atc caa tac aaa aca ccg gcg 1104 Asn Ala Thr Ser Thr Asn Lys Gly Leu Ile Gln Tyr Lys Thr Pro Ala 355 360 365 ggc agt tac tta acc cta aca ccg ctt ggc agc aac aat gac aac gcc 1152 Gly Ser Tyr Leu Thr Leu Thr Pro Leu Gly Ser Asn Asn Asp Asn Ala 370 375 380 caa gcg ggt ctt gct ttt gtc tat ccg ggt gtg gga acg gtt tac gcc 1200 Gln Ala Gly Leu Ala Phe Val Tyr Pro Gly Val Gly Thr Val Tyr Ala 385 390 395 400 gat atg ctt aat gag ctg cat cag tac ttc cct gcg ctt tac gcc aaa 1248 Asp Met Leu Asn Glu Leu His Gln Tyr Phe Pro Ala Leu Tyr Ala Lys 405 410 415 ctt gag cgt gaa ggc gat tta aag gcg atg cta caa gca gaa gat atc 1296 Leu Glu Arg Glu Gly Asp Leu Lys Ala Met Leu Gln Ala Glu Asp Ile 420 425 430 tat cat ctt gac cct aaa cat gct gcc caa atg agc tta ggt gac tta 1344 Tyr His Leu Asp Pro Lys His Ala Ala Gln Met Ser Leu Gly Asp Leu 435 440 445 gcc att gct ggc gtg ggg agc agc tac ctg tta act cag ctg ctc acc 1392 Ala Ile Ala Gly Val Gly Ser Ser Tyr Leu Leu Thr Gln Leu Leu Thr 450 455 460 gat gag ttt aat att aag cct aat ttt gca tta ggt tac tca atg ggt 1440 Asp Glu Phe Asn Ile Lys Pro Asn Phe Ala Leu Gly Tyr Ser Met Gly 465 470 475 480 gaa gca tca atg tgg gca agc tta ggc gta tgg caa aac ccg cat gcg 1488 Glu Ala Ser Met Trp Ala Ser Leu Gly Val Trp Gln Asn Pro His Ala 485 490 495 ctg atc agc aaa acc caa acc gac ccg cta ttt act tct gct att tcc 1536 Leu Ile Ser Lys Thr Gln Thr Asp Pro Leu Phe Thr Ser Ala Ile Ser 500 505 510 ggc aaa ttg acc gcg gtt aga caa gct tgg cag ctt gat gat acc gca 1584 Gly Lys Leu Thr Ala Val Arg Gln Ala Trp Gln Leu Asp Asp Thr Ala 515 520 525 gcg gaa atc cag tgg aat agc ttt gtg gtt aga agt gaa gca gcg ccg 1632 Ala Glu Ile Gln Trp Asn Ser Phe Val Val Arg Ser Glu Ala Ala Pro 530 535 540 att gaa gcc ttg cta aaa gat tac cca cac gct tac ctc gcg att att 1680 Ile Glu Ala Leu Leu Lys Asp Tyr Pro His Ala Tyr Leu Ala Ile Ile 545 550 555 560 caa ggg gat acc tgc gta atc gct ggc tgt gaa atc caa tgt aaa gcg 1728 Gln Gly Asp Thr Cys Val Ile Ala Gly Cys Glu Ile Gln Cys Lys Ala 565 570 575 cta ctt gca gca ctg ggt aaa cgc ggt att gca gct aat cgt gta acg 1776 Leu Leu Ala Ala Leu Gly Lys Arg Gly Ile Ala Ala Asn Arg Val Thr 580 585 590 gcg atg cat acg cag cct gcg atg caa gag cat caa aat gtg atg gat 1824 Ala Met His Thr Gln Pro Ala Met Gln Glu His Gln Asn Val Met Asp 595 600 605 ttt tat ctg caa ccg tta aaa gca gag ctt cct agt gaa ata agc ttt 1872 Phe Tyr Leu Gln Pro Leu Lys Ala Glu Leu Pro Ser Glu Ile Ser Phe 610 615 620 atc agc gcc gct gat tta act gcc aag caa acg gtg agt gag caa gca 1920 Ile Ser Ala Ala Asp Leu Thr Ala Lys Gln Thr Val Ser Glu Gln Ala 625 630 635 640 ctt agc agc caa gtc gtt gct cag tct att gcc gac acc ttc tgc caa 1968 Leu Ser Ser Gln Val Val Ala Gln Ser Ile Ala Asp Thr Phe Cys Gln 645 650 655 acc ttg gac ttt acc gcg cta gta cat cac gcc caa cat caa ggc gct 2016 Thr Leu Asp Phe Thr Ala Leu Val His His Ala Gln His Gln Gly Ala 660 665 670 aag ctg ttt gtt gaa att ggc gcg gat aga caa aac tgc acc ttg ata 2064 Lys Leu Phe Val Glu Ile Gly Ala Asp Arg Gln Asn Cys Thr Leu Ile 675 680 685 gac aag att gtt aaa caa gat ggt gcc agc agt gta caa cat caa cct 2112 Asp Lys Ile Val Lys Gln Asp Gly Ala Ser Ser Val Gln His Gln Pro 690 695 700 tgt tgc aca gtg cct atg aac gca aaa ggt agc caa gat att acc agc 2160 Cys Cys Thr Val Pro Met Asn Ala Lys Gly Ser Gln Asp Ile Thr Ser 705 710 715 720 gtg att aaa gcg ctt ggc caa tta att agc cat cag gtg cca tta tcg 2208 al Ile Lys Ala Leu Gly Gln Leu Ile Ser His Gln Val Pro Leu Ser 725 730 735 gtg caa cca ttt att gat gga ctc aag cgc gag cta aca ctt tgc caa 2256 Val Gln Pro Phe Ile Asp Gly Leu Lys Arg Glu Leu Thr Leu Cys Gln 740 745 750 ttg acc agc caa cag ctg gca gca cat gca aat gtt gac agc aag ttt 2304 Leu Thr Ser Gln Gln Leu Ala Ala His Ala Asn Val Asp Ser Lys Phe 755 760 765 gag tct aac caa gac cat tta ctt caa ggg gaa gtc 2340 Glu Ser Asn Gln Asp His Leu Leu Gln Gly Glu Val 770 775 780 <210> 9 <211> 780 <212> PRT <400> 1 Val Glu Gln Thr Pro Lys Ala Ser Ala Met Pro Leu Arg Ile Ala Leu 1 5 10 15 Ile Leu Leu Pro Thr Pro Gln Phe Glu Val Asn Ser Val Asp Gln Ser 20 25 30 Val Leu Ala Ser Tyr Gln Thr Leu Gln Pro Glu Leu Asn Ala Leu Leu 35 40 45 Asn Ser Ala Pro Thr Pro Glu Met Leu Ser Ile Thr Ile Ser Asp Asp 50 55 60 Ser Asp Ala Asn Ser Phe Glu Ser Gln Leu Asn Ala Ala Thr Asn Ala 65 70 75 80 Ile Asn Asn Gly Tyr Ile Val Lys Leu Ala Thr Ala Thr His Ala Leu 85 90 95 Leu Met Leu Pro Ala Leu Lys Ala Ala Gln Met Arg Ile His Pro His 100 105 110 Ala Gln Leu Ala Ala Met Gln Gln Ala Lys Ser Thr Pro Met Ser Gln 115 120 125 Val Ser Gly Glu Leu Lys Leu Gly Ala Asn Ala Leu Ser Leu Ala Gln 130 135 140 Thr Asn Ala Leu Ser His Ala Leu Ser Gln Ala Lys Arg Asn Leu Thr 145 150 155 160 Asp Val Ser Val Asn Glu Cys Phe Glu Asn Leu Lys Ser Glu Gln Gln 165 170 175 Phe Thr Glu Val Tyr Ser Leu Ile Gln Gln Leu Ala Ser Arg Thr His 180 185 190 Val Arg Lys Glu Val Asn Gln Gly Val Glu Leu Gly Pro Lys Gln Ala 195 200 205 Lys Ser His Tyr Trp Phe Ser Glu Phe His Gln Asn Arg Val Ala Ala 210 215 220 Ile Asn Phe Ile Asn Gly Gln Gln Ala Thr Ser Tyr Val Leu Thr Gln 225 230 235 240 Gly Ser Gly Leu Leu Ala Ala Lys Ser Met Leu Asn Gln Gln Arg Leu 245 250 255 Met Phe Ile Leu Pro Gly Asn Ser Gln Gln Gln Ile Thr Ala Ser Ile 260 265 270 Thr Gln Leu Met Gln Gln Leu Glu Arg Leu Gln Val Thr Glu Val Asn 275 280 285 Glu Leu Ser Leu Glu Cys Gln Leu Glu Leu Leu Ser Ile Met Tyr Asp 290 295 300 Asn Leu Val Asn Ala Asp Lys Leu Thr Thr Arg Asp Ser Lys Pro Ala 305 310 315 320 Tyr Gln Ala Val Ile Gln Ala Ser Ser Val Ser Ala Ala Lys Gln Glu 325 330 335 Leu Ser Ala Leu Asn Asp Ala Leu Thr Ala Leu Phe Ala Glu Gln Thr 340 345 350 Asn Ala Thr Ser Thr Asn Lys Gly Leu Ile Gln Tyr Lys Thr Pro Ala 355 360 365 Gly Ser Tyr Leu Thr Leu Thr Pro Leu Gly Ser Asn Asn Asp Asn Ala 370 375 380 Gln Ala Gly Leu Ala Phe Val Tyr Pro Gly Val Gly Thr Val Tyr Ala 385 390 395 400 Asp Met Leu Asn Glu Leu His Gln Tyr Phe Pro Ala Leu Tyr Ala Lys 405 410 415 Leu Glu Arg Glu Gly Asp Leu Lys Ala Met Leu Gln Ala Glu Asp Ile 420 425 430 Tyr His Leu Asp Pro Lys His Ala Ala Gln Met Ser Leu Gly Asp Leu 435 440 445 Ala Ile Ala Gly Val Gly Ser Ser Tyr Leu Leu Thr Gln Leu Leu Thr 450 455 460 Asp Glu Phe Asn Ile Lys Pro Asn Phe Ala Leu Gly Tyr Ser Met Gly 465 470 475 480 Glu Ala Ser Met Trp Ala Ser Leu Gly Val Trp Gln Asn Pro His Ala 485 490 495 Leu Ile Ser Lys Thr Gln Thr Asp Pro Leu Phe Thr Ser Ala Ile Ser 500 505 510 Gly Lys Leu Thr Ala Val Arg Gln Ala Trp Gln Leu Asp Asp Thr Ala 515 520 525 Ala Glu Ile Gln Trp Asn Ser Phe Val Val Arg Ser Glu Ala Ala Pro 530 535 540 Ile Glu Ala Leu Leu Lys Asp Tyr Pro His Ala Tyr Leu Ala Ile Ile 545 550 555 560 Gln Gly Asp Thr Cys Val Ile Ala Gly Cys Glu Ile Gln Cys Lys Ala 565 570 575 Leu Leu Ala Ala Leu Gly Lys Arg Gly Ile Ala Ala Asn Arg Val Thr 580 585 590 Ala Met His Thr Gln Pro Ala Met Gln Glu His Gln Asn Val Met Asp 595 600 605 Phe Tyr Leu Gln Pro Leu Lys Ala Glu Leu Pro Ser Glu Ile Ser Phe 610 615 620 Ile Ser Ala Ala Asp Leu Thr Ala Lys Gln Thr Val Ser Glu Gln Ala 625 630 635 640 Leu Ser Ser Gln Val Val Ala Gln Ser Ile Ala Asp Thr Phe Cys Gln 645 650 655 Thr Leu Asp Phe Thr Ala Leu Val His His Ala Gln His Gln Gly Ala 660 665 670 Lys Leu Phe Val Glu Ile Gly Ala Asp Arg Gln Asn Cys Thr Leu Ile 675 680 685 Asp Lys Ile Val Lys Gln Asp Gly Ala Ser Ser Val Gln His Gln Pro 690 695 700 Cys Cys Thr Val Pro Met Asn Ala Lys Gly Ser Gln Asp Ile Thr Ser 705 710 715 720 Val Ile Lys Ala Leu Gly Gln Leu Ile Ser His Gln Val Pro Leu Ser 725 730 735 Val Gln Pro Phe Ile Asp Gly Leu Lys Arg Glu Leu Thr Leu Cys Gln 740 745 750 Leu Thr Ser Gln Gln Leu Ala Ala His Ala Asn Val Asp Ser Lys Phe 755 760 765 Glu Ser Asn Gln Asp His Leu Leu Gln Gly Glu Val 770 775 780 <210> 10 <211> 6015 <212> DNA <213> Shewanella putrefaciens SCRC-2874 (FERM BP-1625) <400> 1 atg tca tta cca gac aat gct tct aac cac ctt tct gcc aac cag aaa 48 Met Ser Leu Pro Asp Asn Ala Ser Asn His Leu Ser Ala Asn Gln Lys 1 5 10 15 ggc gca tct cag gca agt aaa acc agt aag caa agc aaa atc gcc att 96 Gly Ala Ser Gln Ala Ser Lys Thr Ser Lys Gln Ser Lys Ile Ala Ile 20 25 30 gtc ggt tta gcc act ctg tat cca gac gct aaa acc ccg caa gaa ttt 144 Val Gly Leu Ala Thr Leu Tyr Pro Asp Ala Lys Thr Pro Gln Glu Phe 35 40 45 tgg cag aat ttg ctg gat aaa cgc gac tct cgc agc acc tta act aac 192 Trp Gln Asn Leu Leu Asp Lys Arg Asp Ser Arg Ser Thr Leu Thr Asn 50 55 60 gaa aaa ctc ggc gct aac agc caa gat tat caa ggt gtg caa ggc caa 240 Glu Lys Leu Gly Ala Asn Ser Gln Asp Tyr Gln Gly Val Gln Gly Gln 65 70 75 80 tct gac cgt ttt tat tgt aat aaa ggc ggc tac att gag aac ttc agc 288 Ser Asp Arg Phe Tyr Cys Asn Lys Gly Gly Tyr Ile Glu Asn Phe Ser 85 90 95 ttt aat gct gca ggc tac aaa ttg ccg gag caa agc tta aat ggc ttg 336 Phe Asn Ala Ala Gly Tyr Lys Leu Pro Glu Gln Ser Leu Asn Gly Leu 100 105 110 gac gac agc ttc ctt tgg gcg ctc gat act agc cgt aac gca cta att 384 Asp Asp Ser Phe Leu Trp Ala Leu Asp Thr Ser Arg Asn Ala Leu Ile 115 120 125 gat gct ggt att gat atc aac ggc gct gat tta agc cgc gca ggt gta 432 Asp Ala Gly Ile Asp Ile Asn Gly Ala Asp Leu Ser Arg Ala Gly Val 130 135 140 gtc atg ggc gcg ctg tcg ttc cca act acc cgc tca aac gat ctg ttt 480 Val Met Gly Ala Leu Ser Phe Pro Thr Thr Arg Ser Asn Asp Leu Phe 145 150 155 160 ttg cca att tat cac agc gcc gtt gaa aaa gcc ctg caa gat aaa cta 528 Leu Pro Ile Tyr His Ser Ala Val Glu Lys Ala Leu Gln Asp Lys Leu 165 170 175 ggc gta aag gca ttt aag cta agc cca act aat gct cat acc gct cgc 576 Gly Val Lys Ala Phe Lys Leu Ser Pro Thr Asn Ala His Thr Ala Arg 180 185 190 gcg gca aat gag agc agc cta aat gca gcc aat ggt gcc att gcc cat 624 Ala Ala Asn Glu Ser Ser Leu Asn Ala Ala Asn Gly Ala Ile Ala His 195 200 205 aac agc tca aaa gtg gtg gcc gat gca ctt ggc ctt ggc ggc gca caa 672 Asn Ser Ser Lys Val Val Ala Asp Ala Leu Gly Leu Gly Gly Ala Gln 210 215 220 cta agc cta gat gct gcc tgt gct agt tcg gtt tac tca tta aag ctt 720 Leu Ser Leu Asp Ala Ala Cys Ala Ser Ser Val Tyr Ser Leu Lys Leu 225 230 235 240 gcc tgc gat tac cta agc act ggc aaa gcc gat atc atg cta gca ggc 768 Ala Cys Asp Tyr Leu Ser Thr Gly Lys Ala Asp Ile Met Leu Ala Gly 245 250 255 gca gta tct ggc gcg gat cct ttc ttt att aat atg gga ttc tca atc 816 Ala Val Ser Gly Ala Asp Pro Phe Phe Ile Asn Met Gly Phe Ser Ile 260 265 270 ttc cac gcc tac cca gac cat ggt atc tca gta ccg ttt gat gcc agc 864 Phe His Ala Tyr Pro Asp His Gly Ile Ser Val Pro Phe Asp Ala Ser 275 280 285 agt aaa ggt ttg ttt gct ggc gaa ggc gct ggc gta tta gtg ctt aaa 912 Ser Lys Gly Leu Phe Ala Gly Glu Gly Ala Gly Val Leu Val Leu Lys 290 295 300 cgt ctt gaa gat gcc gag cgc gac aat gac aaa atc tat gcg gtt gtt 960 Arg Leu Glu Asp Ala Glu Arg Asp Asn Asp Lys Ile Tyr Ala Val Val 305 310 315 320 agc ggc gta ggt cta tca aac gac ggt aaa ggc cag ttt gta tta agc 1008 Ser Gly Val Gly Leu Ser Asn Asp Gly Lys Gly Gln Phe Val Leu Ser 325 330 335 cct aat cca aaa ggt cag gtg aag gcc ttt gaa cgt gct tat gct gcc 1056 Pro Asn Pro Lys Gly Gln Val Lys Ala Phe Glu Arg Ala Tyr Ala Ala 340 345 350 agt gac att gag cca aaa gac att gaa gtg att gag tgc cac gca aca 1104 Ser Asp Ile Glu Pro Lys Asp Ile Glu Val Ile Glu Cys His Ala Thr 355 360 365 ggc aca ccg ctt ggc gat aaa att gag ctc act tca atg gaa acc ttc 1152 Gly Thr Pro Leu Gly Asp Lys Ile Glu Leu Thr Ser Met Glu Thr Phe 370 375 380 ttt gaa gac aag ctg caa ggc acc gat gca ccg tta att ggc tca gct 1200 Phe Glu Asp Lys Leu Gln Gly Thr Asp Ala Pro Leu Ile Gly Ser Ala 385 390 395 400 aag tct aac tta ggc cac cta tta act gca gcg ggc atg ccg ggg atc 1248 Lys Ser Asn Leu Gly His Leu Leu Thr Ala Ala Gly Met Pro Gly Ile 405 410 415 atg aag atg atc ttc gcc atg aaa gaa ggt tac ctg ccg cca agt atc 1296 Met Lys Met Ile Phe Ala Met Lys Glu Gly Tyr Leu Pro Pro Ser Ile 420 425 430 aat att agt gat gct atc gct tcg ccg aaa aaa ctc ttc ggt aaa cca 1344 Asn Ile Ser Asp Ala Ile Ala Ser Pro Lys Lys Leu Phe Gly Lys Pro 435 440 445 acc ctg cct agc atg gtt caa ggc tgg cca gat aag cca tcg aat aat 1392 Thr Leu Pro Ser Met Val Gln Gly Trp Pro Asp Lys Pro Ser Asn Asn 450 455 460 cat ttt ggt gta aga acc cgt cac gca ggc gta tcg gta ttt ggc ttt 1440 His Phe Gly Val Arg Thr Arg His Ala Gly Val Ser Val Phe Gly Phe 465 470 475 ggt ggc tgt aac gcc cat ctg ttg ctt gag tca tac aac ggc aaa gga 1488 Gly Gly Cys Asn Ala His Leu Leu Leu Glu Ser Tyr Asn Gly Lys Gly 480 485 490 495 aca gta aag gca gaa gcc act caa gta ccg cgt caa gct gag ccg cta 1536 Thr Val Lys Ala Glu Ala Thr Gln Val Pro Arg Gln Ala Glu Pro Leu 500 505 510 aaa gtg gtt ggc ctt gcc tcg cac ttt ggg cct ctt agc agc att aat 1584 Lys Val Val Gly Leu Ala Ser His Phe Gly Pro Leu Ser Ser Ile Asn 515 520 525 gca ctc aac aat gct gtg acc caa gat ggg aat ggc ttt atc gaa ctg 1632 Ala Leu Asn Asn Ala Val Thr Gln Asp Gly Asn Gly Phe Ile Glu Leu 530 535 540 ccg aaa aag cgc tgg aaa ggc ctt gaa aag cac agt gaa ctg tta gct 1680 Pro Lys Lys Arg Trp Lys Gly Leu Glu Lys His Ser Glu Leu Leu Ala 545 550 555 gaa ttt ggc tta gca tct gcg cca aaa ggt gct tat gtt gat aac ttc 1728 Glu Phe Gly Leu Ala Ser Ala Pro Lys Gly Ala Tyr Val Asp Asn Phe 560 565 570 575 gag ctg gac ttt tta cgc ttt aaa ctg ccg cca aac gaa gat gac cgt 1776 Glu Leu Asp Phe Leu Arg Phe Lys Leu Pro Pro Asn Glu Asp Asp Arg 580 585 590 ttg atc tca cag cag cta atg cta atg cga gta aca gac gaa gcc att 1824 Leu Ile Ser Gln Gln Leu Met Leu Met Arg Val Thr Asp Glu Ala Ile 595 600 605 cgt gat gcc aag ctt gag ccg ggg caa aaa gta gct gta tta gtg gca 1872 Arg Asp Ala Lys Leu Glu Pro Gly Gln Lys Val Ala Val Leu Val Ala 610 615 620 atg gaa act gag ctt gaa ctg cat cag ttc cgc ggc cgg gtt aac ttg 1920 Met Glu Thr Glu Leu Glu Leu His Gln Phe Arg Gly Arg Val Asn Leu 625 630 635 cat act caa tta gcg caa agt ctt gcc gcc atg ggc gtg agt tta tca 1968 His Thr Gln Leu Ala Gln Ser Leu Ala Ala Met Gly Val Ser Leu Ser 640 645 650 655 acg gat gaa tac caa gcg ctt gaa gcc atc gcc atg gac agc gtg ctt 2016 Thr Asp Glu Tyr Gln Ala Leu Glu Ala Ile Ala Met Asp Ser Val Leu 660 665 670 gat gct gcc aag ctc aat cag tac acc agc ttt att ggt aat att atg 2064 Asp Ala Ala Lys Leu Asn Gln Tyr Thr Ser Phe Ile Gly Asn Ile Met 675 680 685 gcg tca cgc gtg gcg tca cta tgg gac ttt aat ggc cca gcc ttc act 2112 Ala Ser Arg Val Ala Ser Leu Trp Asp Phe Asn Gly Pro Ala Phe Thr 690 695 700 att tca gca gca gag caa tct gtg agc cgc tgt atc gat gtg gcg caa 2160 Ile Ser Ala Ala Glu Gln Ser Val Ser Arg Cys Ile Asp Val Ala Gln 705 710 715 aac ctc atc atg gag gat aac cta gat gcg gtg gtg att gca gcg gtc 2208 Asn Leu Ile Met Glu Asp Asn Leu Asp Ala Val Val Ile Ala Ala Val 720 725 730 735 gat ctc tct ggt agc ttt gag caa gtc att ctt aaa aat gcc att gca 2256 Asp Leu Ser Gly Ser Phe Glu Gln Val Ile Leu Lys Asn Ala Ile Ala 740 745 750 cct gta gcc att gag cca aac ctc gaa gca agc ctt aat cca aca tca 2304 Pro Val Ala Ile Glu Pro Asn Leu Glu Ala Ser Leu Asn Pro Thr Ser 755 760 765 gca agc tgg aat gtc ggt gaa ggt gct ggc gcg gtc gtg ctt gtt aaa 2352 Ala Ser Trp Asn Val Gly Glu Gly Ala Gly Ala Val Val Leu Val Lys 770 775 780 aat gaa gct aca tcg ggc tgc tca tac ggc caa att gat gca ctt ggc 2400 Asn Glu Ala Thr Ser Gly Cys Ser Tyr Gly Gln Ile Asp Ala Leu Gly 785 790 795 ttt gct aaa act gcc gaa aca gcg ttg gct acc gac aag cta ctg agc 2448 Phe Ala Lys Thr Ala Glu Thr Ala Leu Ala Thr Asp Lys Leu Leu Ser 800 805 810 815 caa act gcc aca gac ttt aat aag gtt aaa gtg att gaa act atg gca 2496 Gln Thr Ala Thr Asp Phe Asn Lys Val Lys Val Ile Glu Thr Met Ala 820 825 830 gcg cct gct agc caa att caa tta gcg cca ata gtt agc tct caa gtg 2544 Ala Pro Ala Ser Gln Ile Gln Leu Ala Pro Ile Val Ser Ser Gln Val 835 840 845 act cac act gct gca gag cag cgt gtt ggt cac tgc ttt gct gca gcg 2592 Thr His Thr Ala Ala Glu Gln Arg Val Gly His Cys Phe Ala Ala Ala 850 855 860 ggt atg gca agc cta tta cac ggc tta ctt aac tta aat act gta gcc 2640 Gly Met Ala Ser Leu Leu His Gly Leu Leu Asn Leu Asn Thr Val Ala 865 870 875 caa acc aat aaa gcc aat tgc gcg ctt atc aac aat atc agt gaa aac 2688 Gln Thr Asn Lys Ala Asn Cys Ala Leu Ile Asn Asn Ile Ser Glu Asn 880 885 890 895 caa tta tca cag ctg ttg att agc caa aca gcg agc gaa caa caa gca 2736 Gln Leu Ser Gln Leu Leu Ile Ser Gln Thr Ala Ser Glu Gln Gln Ala 900 905 910 tta acc gcg cgt tta agc aat gag ctt aaa tcc gat gct aaa cac caa 2784 Leu Thr Ala Arg Leu Ser Asn Glu Leu Lys Ser Asp Ala Lys His Gln 915 920 925 ctg gtt aag caa gtc acc tta ggt ggc cgt gat atc tac cag cat att 2832 Leu Val Lys Gln Val Thr Leu Gly Gly Arg Asp Ile Tyr Gln His Ile 930 935 940 gtt gat aca ccg ctt gca agc ctt gaa agc att act cag aaa ttg gcg 2880 Val Asp Thr Pro Leu Ala Ser Leu Glu Ser Ile Thr Gln Lys Leu Ala 945 950 955 caa gcg aca gca tcg aca gtg gtc aac caa gtt aaa cct att aag gcc 2928 Gln Ala Thr Ala Ser Thr Val Val Asn Gln Val Lys Pro Ile Lys Ala 960 965 970 975 gct ggc tca gtc gaa atg gct aac tca ttc gaa acg gaa agc tca gca 2976 Ala Gly Ser Val Glu Met Ala Asn Ser Phe Glu Thr Glu Ser Ser Ala 980 985 990 gag cca caa ata aca att gca gca caa cag act gca aac att ggc gtc 3024 Glu Pro Gln Ile Thr Ile Ala Ala Gln Gln Thr Ala Asn Ile Gly Val 995 1000 1005 acc gct cag gca acc aaa cgt gaa tta ggt acc cca cca atg aca aca 3072 Thr Ala Gln Ala Thr Lys Arg Glu Leu Gly Thr Pro Pro Met Thr Thr 1010 1015 1020 aat acc att gct aat aca gca aat aat tta gac aag act ctt gag act 3120 Asn Thr Ile Ala Asn Thr Ala Asn Asn Leu Asp Lys Thr Leu Glu Thr 1025 1030 1035 gtt gct ggc aat act gtt gct agc aag gtt ggc tct ggc gac ata gtc 3168 Val Ala Gly Asn Thr Val Ala Ser Lys Val Gly Ser Gly Asp Ile Val 1040 1045 1050 1055 aat ttt caa cag aac caa caa ttg gct caa caa gct cac ctc gcc ttt 3216 Asn Phe Gln Gln Asn Gln Gln Leu Ala Gln Gln Ala His Leu Ala Phe 1060 1065 1070 ctt gaa agc cgc agt gcg ggt atg aag gtg gct gat gct tta ttg aag 3264 Leu Glu Ser Arg Ser Ala Gly Met Lys Val Ala Asp Ala Leu Leu Lys 1075 1080 1085 caa cag cta gct caa gta aca ggc caa act atc gat aat cag gcc ctc 3312 Gln Gln Leu Ala Gln Val Thr Gly Gln Thr Ile Asp Asn Gln Ala Leu 1090 1095 1100 gat act caa gcc gtc gat act caa aca agc gag aat gta gcg att gcc 3360 Asp Thr Gln Ala Val Asp Thr Gln Thr Ser Glu Asn Val Ala Ile Ala 1105 1110 1115 gca gaa tca cca gtt caa gtt aca aca cct gtt caa gtt aca aca cct 3408 Ala Glu Ser Pro Val Gln Val Thr Thr Pro Val Gln Val Thr Thr Pro 1120 1125 1130 1135 gtt caa atc agt gtt gtg gag tta aaa cca gat cac gct aat gtg cca 3456 Val Gln Ile Ser Val Val Glu Leu Lys Pro Asp His Ala Asn Val Pro 1140 1145 1150 cca tac acg ccg cca gtg cct gca tta aag ccg tgt atc tgg aac tat 3504 Pro Tyr Thr Pro Pro Val Pro Ala Leu Lys Pro Cys Ile Trp Asn Tyr 1155 1160 1165 gcc gat tta gtt gag tac gca gaa ggc gat atc gcc aag gta ttt ggc 3552 Ala Asp Leu Val Glu Tyr Ala Glu Gly Asp Ile Ala Lys Val Phe Gly 1170 1175 1180 agt gat tat gcc att atc gac agc tac tcg cgc cgc gta cgt cta ccg 3600 Ser Asp Tyr Ala Ile Ile Asp Ser Tyr Ser Arg Arg Val Arg Leu Pro 1185 1190 1195 acc act gac tac ctg ttg gta tcg cgc gtg acc aaa ctt gat gcg acc 3648 Thr Thr Asp Tyr Leu Leu Val Ser Arg Val Thr Lys Leu Asp Ala Thr 1200 1205 1210 1215 atc aat caa ttt aag cca tgc tca atg acc act gag tac gac atc cct 3696 Ile Asn Gln Phe Lys Pro Cys Ser Met Thr Thr Glu Tyr Asp Ile Pro 1220 1225 1230 gtt gat gcg ccg tac tta gta gac gga caa atc cct tgg gcg gta gca 3744 Val Asp Ala Pro Tyr Leu Val Asp Gly Gln Ile Pro Trp Ala Val Ala 1235 1240 1245 gta gaa tca ggc caa tgt gac ttg atg ctt att agc tat ctc ggt atc 3792 Val Glu Ser Gly Gln Cys Asp Leu Met Leu Ile Ser Tyr Leu Gly Ile 1250 1255 1260 gac ttt gag aac aaa ggc gag cgg gtt tat cga cta ctc gat tgt acc 3840 Asp Phe Glu Asn Lys Gly Glu Arg Val Tyr Arg Leu Leu Asp Cys Thr 1265 1270 1275 ctc acc ttc cta ggc gac ttg cca cgt ggc gga gat acc cta cgt tac 3888 Leu Thr Phe Leu Gly Asp Leu Pro Arg Gly Gly Asp Thr Leu Arg Tyr 1280 1285 1290 1295 gac att aag atc aat aac tat gct cgc aac ggc gac acc ctg ctg ttc 3936 Asp Ile Lys Ile Asn Asn Tyr Ala Arg Asn Gly Asp Thr Leu Leu Phe 1300 1305 1310 ttc ttc tcg tat gag tgt ttt gtt ggc gac aag atg atc ctc aag atg 3984 Phe Phe Ser Tyr Glu Cys Phe Val Gly Asp Lys Met Ile Leu Lys Met 1315 1320 1325 gat ggc ggc tgc gct ggc ttc ttc act gat gaa gag ctt gcc gac ggt 4032 Asp Gly Gly Cys Ala Gly Phe Phe Thr Asp Glu Glu Leu Ala Asp Gly 1330 1335 1340 aaa ggc gtg att cgc aca gaa gaa gag att aaa gct cgc agc cta gtg 4080 Lys Gly Val Ile Arg Thr Glu Glu Glu Ile Lys Ala Arg Ser Leu Val 1345 1350 1355 caa aag caa cgc ttt aat ccg tta cta gat tgt cct aaa acc caa ttt 4128 Gln Lys Gln Arg Phe Asn Pro Leu Leu Asp Cys Pro Lys Thr Gln Phe 1360 1365 1370 1375 agt tat ggt gat att cat aag cta tta act gct gat att gag ggt tgt 4176 Ser Tyr Gly Asp Ile His Lys Leu Leu Thr Ala Asp Ile Glu Gly Cys 1380 1385 1390 ttt ggc cca agc cac agt ggc gtc cac cag ccg tca ctt tgt ttc gca 4224 Phe Gly Pro Ser His Ser Gly Val His Gln Pro Ser Leu Cys Phe Ala 1395 1400 1405 tct gaa aaa ttc ttg atg att gaa caa gtc agc aag gtt gat cgc act 4272 Ser Glu Lys Phe Leu Met Ile Glu Gln Val Ser Lys Val Asp Arg Thr 1410 1415 1420 ggc ggt act tgg gga ctt ggc tta att gag ggt cat aag cag ctt gaa 4320 Gly Gly Thr Trp Gly Leu Gly Leu Ile Glu Gly His Lys Gln Leu Glu 1425 1430 1435 gca gac cac tgg tac ttc cca tgt cat ttc aag ggc gac caa gtg atg 4368 Ala Asp His Trp Tyr Phe Pro Cys His Phe Lys Gly Asp Gln Val Met 1440 1445 1450 1455 gct ggc tcg cta atg gct gaa ggt tgt ggc cag tta ttg cag ttc tat 4416 Ala Gly Ser Leu Met Ala Glu Gly Cys Gly Gln Leu Leu Gln Phe Tyr 1460 1465 1470 atg ctg cac ctt ggt atg cat acc caa act aaa aat ggt cgt ttc caa 4464 Met Leu His Leu Gly Met His Thr Gln Thr Lys Asn Gly Arg Phe Gln 1475 1480 1485 cct ctt gaa aac gcc tca cag caa gta cgc tgt cgc ggt caa gtg ctg 4512 Pro Leu Glu Asn Ala Ser Gln Gln Val Arg Cys Arg Gly Gln Val Leu 1490 1495 1500 cca caa tca ggc gtg cta act tac cgt atg gaa gtg act gaa atc ggt 4560 Pro Gln Ser Gly Val Leu Thr Tyr Arg Met Glu Val Thr Glu Ile Gly 1505 1510 1515 ttc agt cca cgc cca tat gct aaa gct aac atc gat atc ttg ctt aat 4608 Phe Ser Pro Arg Pro Tyr Ala Lys Ala Asn Ile Asp Ile Leu Leu Asn 1520 1525 1530 1535 ggc aaa gcg gta gtg gat ttc caa aac cta ggg gtg atg ata aaa gag 4656 Gly Lys Ala Val Val Asp Phe Gln Asn Leu Gly Val Met Ile Lys Glu 1540 1545 1550 gaa gat gag tgt act cgt tat cca ctt ttg act gaa tca aca acg gct 4704 Glu Asp Glu Cys Thr Arg Tyr Pro Leu Leu Thr Glu Ser Thr Thr Ala 1555 1560 1565 agc act gca caa gta aac gct caa aca agt gcg aaa aag gta tac aag 4752 Ser Thr Ala Gln Val Asn Ala Gln Thr Ser Ala Lys Lys Val Tyr Lys 1570 1575 1580 cca gca tca gtc aat gcg cca tta atg gca caa att cct gat ctg act 4800 Pro Ala Ser Val Asn Ala Pro Leu Met Ala Gln Ile Pro Asp Leu Thr 1585 1590 1595 aaa gag cca aac aag ggc gtt att ccg att tcc cat gtt gaa gca cca 4848 Lys Glu Pro Asn Lys Gly Val Ile Pro Ile Ser His Val Glu Ala Pro 1600 1605 1610 1615 att acg cca gac tac ccg aac cgt gta cct gat aca gtg cca ttc acg 4896 Ile Thr Pro Asp Tyr Pro Asn Arg Val Pro Asp Thr Val Pro Phe Thr 1620 1625 1630 ccg tat cac atg ttt gag ttt gct aca ggc aat atc gaa aac tgt ttc 4944 Pro Tyr His Met Phe Glu Phe Ala Thr Gly Asn Ile Glu Asn Cys Phe 1635 1640 1645 ggg cca gag ttc tca atc tat cgc ggc atg atc cca cca cgt aca cca 4992 Gly Pro Glu Phe Ser Ile Tyr Arg Gly Met Ile Pro Pro Arg Thr Pro 1650 1655 1660 tgc ggt gac tta caa gtg acc aca cgt gtg att gaa gtt aac ggt aag 5040 Cys Gly Asp Leu Gln Val Thr Thr Arg Val Ile Glu Val Asn Gly Lys 1665 1670 1675 cgt ggc gac ttt aaa aag cca tca tcg tgt atc gct gaa tat gaa gtg 5088 Arg Gly Asp Phe Lys Lys Pro Ser Ser Cys Ile Ala Glu Tyr Glu Val 1680 1685 1690 1695 cct gca gat gcg tgg tat ttc gat aaa aac agc cac ggc gca gtg atg 5136 Pro Ala Asp Ala Trp Tyr Phe Asp Lys Asn Ser His Gly Ala Val Met 1700 1705 1710 cca tat tca att tta atg gag atc tca ctg caa cct aac ggc ttt atc 5184 Pro Tyr Ser Ile Leu Met Glu Ile Ser Leu Gln Pro Asn Gly Phe Ile 1715 1720 1725 tca ggt tac atg ggc aca acc cta ggc ttc cct ggc ctt gag ctg ttc 5232 Ser Gly Tyr Met Gly Thr Thr Leu Gly Phe Pro Gly Leu Glu Leu Phe 1730 1735 1740 ttc cgt aac tta gac ggt agc ggt gag tta cta cgt gaa gta gat tta 5280 Phe Arg Asn Leu Asp Gly Ser Gly Glu Leu Leu Arg Glu Val Asp Leu 1745 1750 1755 cgt ggt aaa acc atc cgt aac gac tca cgt tta tta tca aca gtg atg 5328 Arg Gly Lys Thr Ile Arg Asn Asp Ser Arg Leu Leu Ser Thr Val Met 1760 1765 1770 1775 gcc ggc act aac atc atc caa agc ttt agc ttc gag cta agc act gac 5376 Ala Gly Thr Asn Ile Ile Gln Ser Phe Ser Phe Glu Leu Ser Thr Asp 1780 1785 1790 ggt gag cct ttc tat cgc ggc act gcg gta ttt ggc tat ttt aaa ggt 5424 Gly Glu Pro Phe Tyr Arg Gly Thr Ala Val Phe Gly Tyr Phe Lys Gly 1795 1800 1805 gac gca ctt aaa gat cag cta ggc cta gat aac ggt aaa gtc act cag 5472 Asp Ala Leu Lys Asp Gln Leu Gly Leu Asp Asn Gly Lys Val Thr Gln 1810 1815 1820 cca tgg cat gta gct aac ggc gtt gct gca agc act aag gtg aac ctg 5520 Pro Trp His Val Ala Asn Gly Val Ala Ala Ser Thr Lys Val Asn Leu 1825 1830 1835 ctt gat aag agc tgc cgt cac ttt aat gcg cca gct aac cag cca cac 5568 Leu Asp Lys Ser Cys Arg His Phe Asn Ala Pro Ala Asn Gln Pro His 1840 1845 1850 1855 tat cgt cta gcc ggt ggt cag ctg aac ttt atc gac agt gtt gaa att 5616 Tyr Arg Leu Ala Gly Gly Gln Leu Asn Phe Ile Asp Ser Val Glu Ile 1860 1865 1870 gtt gat aat ggc ggc acc gaa ggt tta ggt tac ttg tat gcc gag cgc 5664 Val Asp Asn Gly Gly Thr Glu Gly Leu Gly Tyr Leu Tyr Ala Glu Arg 1875 1880 1885 acc att gac cca agt gat tgg ttc ttc cag ttc cac ttc cac caa gat 5712 Thr Ile Asp Pro Ser Asp Trp Phe Phe Gln Phe His Phe His Gln Asp 1890 1895 1900 ccg gtt atg cca ggc tcc tta ggt gtt gaa gca att att gaa acc atg 5760 Pro Val Met Pro Gly Ser Leu Gly Val Glu Ala Ile Ile Glu Thr Met 1905 1910 1915 caa gct tac gct att agt aaa gac ttg ggc gca gat ttc aaa aat cct 5808 Gln Ala Tyr Ala Ile Ser Lys Asp Leu Gly Ala Asp Phe Lys Asn Pro 1920 1925 1930 1935 aag ttt ggt cag att tta tcg aac atc aag tgg aag tat cgc ggt caa 5856 Lys Phe Gly Gln Ile Leu Ser Asn Ile Lys Trp Lys Tyr Arg Gly Gln 1940 1945 1950 atc aat ccg ctg aac aag cag atg tct atg gat gtc agc att act tca 5904 Ile Asn Pro Leu Asn Lys Gln Met Ser Met Asp Val Ser Ile Thr Ser 1955 1960 1965 atc aaa gat gaa gac ggt aag aaa gtc atc aca ggt aat gcc agc ttg 5952 Ile Lys Asp Glu Asp Gly Lys Lys Val Ile Thr Gly Asn Ala Ser Leu 1970 1975 1980 agt aaa gat ggt ctg cgc ata tac gag gtc ttc gat ata gct atc agc 6000 Ser Lys Asp Gly Leu Arg Ile Tyr Glu Val Phe Asp Ile Ala Ile Ser 1985 1990 1995 atc gaa gaa tct gta 6015 Ile Glu Glu Ser Val 2000 <210> 11 <211> 2005 <212> PRT <400> 1 Met Ser Leu Pro Asp Asn Ala Ser Asn His Leu Ser Ala Asn Gln Lys 1 5 10 15 Gly Ala Ser Gln Ala Ser Lys Thr Ser Lys Gln Ser Lys Ile Ala Ile 20 25 30 Val Gly Leu Ala Thr Leu Tyr Pro Asp Ala Lys Thr Pro Gln Glu Phe 35 40 45 Trp Gln Asn Leu Leu Asp Lys Arg Asp Ser Arg Ser Thr Leu Thr Asn 50 55 60 Glu Lys Leu Gly Ala Asn Ser Gln Asp Tyr Gln Gly Val Gln Gly Gln 65 70 75 80 Ser Asp Arg Phe Tyr Cys Asn Lys Gly Gly Tyr Ile Glu Asn Phe Ser 85 90 95 Phe Asn Ala Ala Gly Tyr Lys Leu Pro Glu Gln Ser Leu Asn Gly Leu 100 105 110 Asp Asp Ser Phe Leu Trp Ala Leu Asp Thr Ser Arg Asn Ala Leu Ile 115 120 125 Asp Ala Gly Ile Asp Ile Asn Gly Ala Asp Leu Ser Arg Ala Gly Val 130 135 140 Val Met Gly Ala Leu Ser Phe Pro Thr Thr Arg Ser Asn Asp Leu Phe 145 150 155 160 Leu Pro Ile Tyr His Ser Ala Val Glu Lys Ala Leu Gln Asp Lys Leu 165 170 175 Gly Val Lys Ala Phe Lys Leu Ser Pro Thr Asn Ala His Thr Ala Arg 180 185 190 Ala Ala Asn Glu Ser Ser Leu Asn Ala Ala Asn Gly Ala Ile Ala His 195 200 205 Asn Ser Ser Lys Val Val Ala Asp Ala Leu Gly Leu Gly Gly Ala Gln 210 215 220 Leu Ser Leu Asp Ala Ala Cys Ala Ser Ser Val Tyr Ser Leu Lys Leu 225 230 235 240 Ala Cys Asp Tyr Leu Ser Thr Gly Lys Ala Asp Ile Met Leu Ala Gly 245 250 255 Ala Val Ser Gly Ala Asp Pro Phe Phe Ile Asn Met Gly Phe Ser Ile 260 265 270 Phe His Ala Tyr Pro Asp His Gly Ile Ser Val Pro Phe Asp Ala Ser 275 280 285 Ser Lys Gly Leu Phe Ala Gly Glu Gly Ala Gly Val Leu Val Leu Lys 290 295 300 Arg Leu Glu Asp Ala Glu Arg Asp Asn Asp Lys Ile Tyr Ala Val Val 305 310 315 320 Ser Gly Val Gly Leu Ser Asn Asp Gly Lys Gly Gln Phe Val Leu Ser 325 330 335 Pro Asn Pro Lys Gly Gln Val Lys Ala Phe Glu Arg Ala Tyr Ala Ala 340 345 350 Ser Asp Ile Glu Pro Lys Asp Ile Glu Val Ile Glu Cys His Ala Thr 355 360 365 Gly Thr Pro Leu Gly Asp Lys Ile Glu Leu Thr Ser Met Glu Thr Phe 370 375 380 Phe Glu Asp Lys Leu Gln Gly Thr Asp Ala Pro Leu Ile Gly Ser Ala 385 390 395 400 Lys Ser Asn Leu Gly His Leu Leu Thr Ala Ala Gly Met Pro Gly Ile 405 410 415 Met Lys Met Ile Phe Ala Met Lys Glu Gly Tyr Leu Pro Pro Ser Ile 420 425 430 Asn Ile Ser Asp Ala Ile Ala Ser Pro Lys Lys Leu Phe Gly Lys Pro 435 440 445 Thr Leu Pro Ser Met Val Gln Gly Trp Pro Asp Lys Pro Ser Asn Asn 450 455 460 His Phe Gly Val Arg Thr Arg His Ala Gly Val Ser Val Phe Gly Phe 465 470 475 480 Gly Gly Cys Asn Ala His Leu Leu Leu Glu Ser Tyr Asn Gly Lys Gly 485 490 495 Thr Val Lys Ala Glu Ala Thr Gln Val Pro Arg Gln Ala Glu Pro Leu 500 505 510 Lys Val Val Gly Leu Ala Ser His Phe Gly Pro Leu Ser Ser Ile Asn 515 520 525 Ala Leu Asn Asn Ala Val Thr Gln Asp Gly Asn Gly Phe Ile Glu Leu 530 535 540 Pro Lys Lys Arg Trp Lys Gly Leu Glu Lys His Ser Glu Leu Leu Ala 545 550 555 560 Glu Phe Gly Leu Ala Ser Ala Pro Lys Gly Ala Tyr Val Asp Asn Phe 565 570 575 Glu Leu Asp Phe Leu Arg Phe Lys Leu Pro Pro Asn Glu Asp Asp Arg 580 585 590 Leu Ile Ser Gln Gln Leu Met Leu Met Arg Val Thr Asp Glu Ala Ile 595 600 605 Arg Asp Ala Lys Leu Glu Pro Gly Gln Lys Val Ala Val Leu Val Ala 610 615 620 Met Glu Thr Glu Leu Glu Leu His Gln Phe Arg Gly Arg Val Asn Leu 625 630 635 640 His Thr Gln Leu Ala Gln Ser Leu Ala Ala Met Gly Val Ser Leu Ser 645 650 655 Thr Asp Glu Tyr Gln Ala Leu Glu Ala Ile Ala Met Asp Ser Val Leu 660 665 670 Asp Ala Ala Lys Leu Asn Gln Tyr Thr Ser Phe Ile Gly Asn Ile Met 675 680 685 Ala Ser Arg Val Ala Ser Leu Trp Asp Phe Asn Gly Pro Ala Phe Thr 690 695 700 Ile Ser Ala Ala Glu Gln Ser Val Ser Arg Cys Ile Asp Val Ala Gln 705 710 715 720 Asn Leu Ile Met Glu Asp Asn Leu Asp Ala Val Val Ile Ala Ala Val 725 730 735 Asp Leu Ser Gly Ser Phe Glu Gln Val Ile Leu Lys Asn Ala Ile Ala 740 745 750 Pro Val Ala Ile Glu Pro Asn Leu Glu Ala Ser Leu Asn Pro Thr Ser 755 760 765 Ala Ser Trp Asn Val Gly Glu Gly Ala Gly Ala Val Val Leu Val Lys 770 775 780 Asn Glu Ala Thr Ser Gly Cys Ser Tyr Gly Gln Ile Asp Ala Leu Gly 785 790 795 800 Phe Ala Lys Thr Ala Glu Thr Ala Leu Ala Thr Asp Lys Leu Leu Ser 805 810 815 Gln Thr Ala Thr Asp Phe Asn Lys Val Lys Val Ile Glu Thr Met Ala 820 825 830 Ala Pro Ala Ser Gln Ile Gln Leu Ala Pro Ile Val Ser Ser Gln Val 835 840 845 Thr His Thr Ala Ala Glu Gln Arg Val Gly His Cys Phe Ala Ala Ala 850 855 860 Gly Met Ala Ser Leu Leu His Gly Leu Leu Asn Leu Asn Thr Val Ala 865 870 875 880 Gln Thr Asn Lys Ala Asn Cys Ala Leu Ile Asn Asn Ile Ser Glu Asn 885 890 895 Gln Leu Ser Gln Leu Leu Ile Ser Gln Thr Ala Ser Glu Gln Gln Ala 900 905 910 Leu Thr Ala Arg Leu Ser Asn Glu Leu Lys Ser Asp Ala Lys His Gln 915 920 925 Leu Val Lys Gln Val Thr Leu Gly Gly Arg Asp Ile Tyr Gln His Ile 930 935 940 Val Asp Thr Pro Leu Ala Ser Leu Glu Ser Ile Thr Gln Lys Leu Ala 945 950 955 960 Gln Ala Thr Ala Ser Thr Val Val Asn Gln Val Lys Pro Ile Lys Ala 965 970 975 Ala Gly Ser Val Glu Met Ala Asn Ser Phe Glu Thr Glu Ser Ser Ala 980 985 990 Glu Pro Gln Ile Thr Ile Ala Ala Gln Gln Thr Ala Asn Ile Gly Val 995 1000 1005 Thr Ala Gln Ala Thr Lys Arg Glu Leu Gly Thr Pro Pro Met Thr Thr 1010 1015 1020 Asn Thr Ile Ala Asn Thr Ala Asn Asn Leu Asp Lys Thr Leu Glu Thr 1025 1030 1035 1040 Val Ala Gly Asn Thr Val Ala Ser Lys Val Gly Ser Gly Asp Ile Val 1045 1050 1055 Asn Phe Gln Gln Asn Gln Gln Leu Ala Gln Gln Ala His Leu Ala Phe 1060 1065 1070 Leu Glu Ser Arg Ser Ala Gly Met Lys Val Ala Asp Ala Leu Leu Lys 1075 1080 1085 Gln Gln Leu Ala Gln Val Thr Gly Gln Thr Ile Asp Asn Gln Ala Leu 1090 1095 1100 Asp Thr Gln Ala Val Asp Thr Gln Thr Ser Glu Asn Val Ala Ile Ala 1105 1110 1115 1120 Ala Glu Ser Pro Val Gln Val Thr Thr Pro Val Gln Val Thr Thr Pro 1125 1130 1135 Val Gln Ile Ser Val Val Glu Leu Lys Pro Asp His Ala Asn Val Pro 1140 1145 1150 Pro Tyr Thr Pro Pro Val Pro Ala Leu Lys Pro Cys Ile Trp Asn Tyr 1155 1160 1165 Ala Asp Leu Val Glu Tyr Ala Glu Gly Asp Ile Ala Lys Val Phe Gly 1170 1175 1180 Ser Asp Tyr Ala Ile Ile Asp Ser Tyr Ser Arg Arg Val Arg Leu Pro 1185 1190 1195 1200 Thr Thr Asp Tyr Leu Leu Val Ser Arg Val Thr Lys Leu Asp Ala Thr 1205 1210 1215 Ile Asn Gln Phe Lys Pro Cys Ser Met Thr Thr Glu Tyr Asp Ile Pro 1220 1225 1230 Val Asp Ala Pro Tyr Leu Val Asp Gly Gln Ile Pro Trp Ala Val Ala 1235 1240 1245 Val Glu Ser Gly Gln Cys Asp Leu Met Leu Ile Ser Tyr Leu Gly Ile 1250 1255 1260 Asp Phe Glu Asn Lys Gly Glu Arg Val Tyr Arg Leu Leu Asp Cys Thr 1265 1270 1275 1280 Leu Thr Phe Leu Gly Asp Leu Pro Arg Gly Gly Asp Thr Leu Arg Tyr 1285 1290 1295 Asp Ile Lys Ile Asn Asn Tyr Ala Arg Asn Gly Asp Thr Leu Leu Phe 1300 1305 1310 Phe Phe Ser Tyr Glu Cys Phe Val Gly Asp Lys Met Ile Leu Lys Met 1315 1320 1325 Asp Gly Gly Cys Ala Gly Phe Phe Thr Asp Glu Glu Leu Ala Asp Gly 1330 1335 1340 Lys Gly Val Ile Arg Thr Glu Glu Glu Ile Lys Ala Arg Ser Leu Val 1345 1350 1355 1360 Gln Lys Gln Arg Phe Asn Pro Leu Leu Asp Cys Pro Lys Thr Gln Phe 1365 1370 1375 Ser Tyr Gly Asp Ile His Lys Leu Leu Thr Ala Asp Ile Glu Gly Cys 1380 1385 1390 Phe Gly Pro Ser His Ser Gly Val His Gln Pro Ser Leu Cys Phe Ala 1395 1400 1405 Ser Glu Lys Phe Leu Met Ile Glu Gln Val Ser Lys Val Asp Arg Thr 1410 1415 1420 Gly Gly Thr Trp Gly Leu Gly Leu Ile Glu Gly His Lys Gln Leu Glu 1425 1430 1435 1440 Ala Asp His Trp Tyr Phe Pro Cys His Phe Lys Gly Asp Gln Val Met 1445 1450 1455 Ala Gly Ser Leu Met Ala Glu Gly Cys Gly Gln Leu Leu Gln Phe Tyr 1460 1465 1470 Met Leu His Leu Gly Met His Thr Gln Thr Lys Asn Gly Arg Phe Gln 1475 1480 1485 Pro Leu Glu Asn Ala Ser Gln Gln Val Arg Cys Arg Gly Gln Val Leu 1490 1495 1500 Pro Gln Ser Gly Val Leu Thr Tyr Arg Met Glu Val Thr Glu Ile Gly 1505 1510 1515 1520 Phe Ser Pro Arg Pro Tyr Ala Lys Ala Asn Ile Asp Ile Leu Leu Asn 1525 1530 1535 Gly Lys Ala Val Val Asp Phe Gln Asn Leu Gly Val Met Ile Lys Glu 1540 1545 1550 Glu Asp Glu Cys Thr Arg Tyr Pro Leu Leu Thr Glu Ser Thr Thr Ala 1555 1560 1565 Ser Thr Ala Gln Val Asn Ala Gln Thr Ser Ala Lys Lys Val Tyr Lys 1570 1575 1580 Pro Ala Ser Val Asn Ala Pro Leu Met Ala Gln Ile Pro Asp Leu Thr 1585 1590 1595 1600 Lys Glu Pro Asn Lys Gly Val Ile Pro Ile Ser His Val Glu Ala Pro 1605 1610 1615 Ile Thr Pro Asp Tyr Pro Asn Arg Val Pro Asp Thr Val Pro Phe Thr 1620 1625 1630 Pro Tyr His Met Phe Glu Phe Ala Thr Gly Asn Ile Glu Asn Cys Phe 1635 1640 1645 Gly Pro Glu Phe Ser Ile Tyr Arg Gly Met Ile Pro Pro Arg Thr Pro 1650 1655 1660 Cys Gly Asp Leu Gln Val Thr Thr Arg Val Ile Glu Val Asn Gly Lys 1665 1670 1675 1680 Arg Gly Asp Phe Lys Lys Pro Ser Ser Cys Ile Ala Glu Tyr Glu Val 1685 1690 1695 Pro Ala Asp Ala Trp Tyr Phe Asp Lys Asn Ser His Gly Ala Val Met 1700 1705 1710 Pro Tyr Ser Ile Leu Met Glu Ile Ser Leu Gln Pro Asn Gly Phe Ile 1715 1720 1725 Ser Gly Tyr Met Gly Thr Thr Leu Gly Phe Pro Gly Leu Glu Leu Phe 1730 1735 1740 Phe Arg Asn Leu Asp Gly Ser Gly Glu Leu Leu Arg Glu Val Asp Leu 1745 1750 1755 1760 Arg Gly Lys Thr Ile Arg Asn Asp Ser Arg Leu Leu Ser Thr Val Met 1765 1770 1775 Ala Gly Thr Asn Ile Ile Gln Ser Phe Ser Phe Glu Leu Ser Thr Asp 1780 1785 1790 Gly Glu Pro Phe Tyr Arg Gly Thr Ala Val Phe Gly Tyr Phe Lys Gly 1795 1800 1805 Asp Ala Leu Lys Asp Gln Leu Gly Leu Asp Asn Gly Lys Val Thr Gln 1810 1815 1820 Pro Trp His Val Ala Asn Gly Val Ala Ala Ser Thr Lys Val Asn Leu 1825 1830 1835 1840 Leu Asp Lys Ser Cys Arg His Phe Asn Ala Pro Ala Asn Gln Pro His 1845 1850 1855 Tyr Arg Leu Ala Gly Gly Gln Leu Asn Phe Ile Asp Ser Val Glu Ile 1860 1865 1870 Val Asp Asn Gly Gly Thr Glu Gly Leu Gly Tyr Leu Tyr Ala Glu Arg 1875 1880 1885 Thr Ile Asp Pro Ser Asp Trp Phe Phe Gln Phe His Phe His Gln Asp 1890 1895 1900 Pro Val Met Pro Gly Ser Leu Gly Val Glu Ala Ile Ile Glu Thr Met 1905 1910 1915 1920 Gln Ala Tyr Ala Ile Ser Lys Asp Leu Gly Ala Asp Phe Lys Asn Pro 1925 1930 1935 Lys Phe Gly Gln Ile Leu Ser Asn Ile Lys Trp Lys Tyr Arg Gly Gln 1940 1945 1950 Ile Asn Pro Leu Asn Lys Gln Met Ser Met Asp Val Ser Ile Thr Ser 1955 1960 1965 Ile Lys Asp Glu Asp Gly Lys Lys Val Ile Thr Gly Asn Ala Ser Leu 1970 1975 1980 Ser Lys Asp Gly Leu Arg Ile Tyr Glu Val Phe Asp Ile Ala Ile Ser 1985 1990 1995 2000 Ile Glu Glu Ser Val 2005 <210> 12 <211> 1626 <212> DNA <213> Shewanella putrefaciens SCRC-2874 (FERM BP-1625) <400> 1 atg aat cct aca gca act aac gaa atg ctt tct ccg tgg cca tgg gct 48 Met Asn Pro Thr Ala Thr Asn Glu Met Leu Ser Pro Trp Pro Trp Ala 1 5 10 15 gtg aca gag tca aat atc agt ttt gac gtg caa gtg atg gaa caa caa 96 Val Thr Glu Ser Asn Ile Ser Phe Asp Val Gln Val Met Glu Gln Gln 20 25 30 ctt aaa gat ttt agc cgg gca tgt tac gtg gtc aat cat gcc gac cac 144 Leu Lys Asp Phe Ser Arg Ala Cys Tyr Val Val Asn His Ala Asp His 35 40 45 ggc ttt ggt att gcg caa act gcc gat atc gtg act gaa caa gcg gca 192 Gly Phe Gly Ile Ala Gln Thr Ala Asp Ile Val Thr Glu Gln Ala Ala 50 55 60 aac agc aca gat tta cct gtt agt gct ttt act cct gca tta ggt acc 240 Asn Ser Thr Asp Leu Pro Val Ser Ala Phe Thr Pro Ala Leu Gly Thr 65 70 75 80 gaa agc cta ggc gac aat aat ttc cgc cgc gtt cac ggc gtt aaa tac 288 Glu Ser Leu Gly Asp Asn Asn Phe Arg Arg Val His Gly Val Lys Tyr 85 90 95 gct tat tac gca ggc gct atg gca aac ggt att tca tct gaa gag cta 336 Ala Tyr Tyr Ala Gly Ala Met Ala Asn Gly Ile Ser Ser Glu Glu Leu 100 105 110 gtg att gcc cta ggt caa gct ggc att ttg tgt tcg ttt gga gca gcc 384 Val Ile Ala Leu Gly Gln Ala Gly Ile Leu Cys Ser Phe Gly Ala Ala 115 120 125 ggt ctt att cca agt cgc gtt gaa gcg gca att aac cgt att caa gca 432 Gly Leu Ile Pro Ser Arg Val Glu Ala Ala Ile Asn Arg Ile Gln Ala 130 135 140 gcg ctg cca aat ggc cct tat atg ttt aac ctt atc cat agt cct agc 480 Ala Leu Pro Asn Gly Pro Tyr Met Phe Asn Leu Ile His Ser Pro Ser 145 150 155 160 gag cca gca tta gag cgt ggc agc gta gag cta ttt tta aag cat aag 528 Glu Pro Ala Leu Glu Arg Gly Ser Val Glu Leu Phe Leu Lys His Lys 165 170 175 gta cgc acc gtt gaa gca tca gct ttc tta ggt cta aca cca caa atc 576 Val Arg Thr Val Glu Ala Ser Ala Phe Leu Gly Leu Thr Pro Gln Ile 180 185 190 gtc tat tac cgt gca gca gga ttg agc cga gac gca caa ggt aaa gtt 624 Val Tyr Tyr Arg Ala Ala Gly Leu Ser Arg Asp Ala Gln Gly Lys Val 195 200 205 gtg gtt ggt aac aag gtt atc gct aaa gta agt cgc acc gaa gtg gct 672 Val Val Gly Asn Lys Val Ile Ala Lys Val Ser Arg Thr Glu Val Ala 210 215 220 gaa aag ttt atg atg cca gcg ccc gca aaa atg cta caa aaa cta gtt 720 Glu Lys Phe Met Met Pro Ala Pro Ala Lys Met Leu Gln Lys Leu Val 225 230 235 240 gat gac ggt tca att acc gct gag caa atg gag ctg gcg caa ctt gta 768 Asp Asp Gly Ser Ile Thr Ala Glu Gln Met Glu Leu Ala Gln Leu Val 245 250 255 cct atg gct gac gac atc act gca gag gcc gat tca ggt ggc cat act 816 Pro Met Ala Asp Asp Ile Thr Ala Glu Ala Asp Ser Gly Gly His Thr 260 265 270 gat aac cgt cca tta gta aca ttg ctg cca acc att tta gcg ctg aaa 864 Asp Asn Arg Pro Leu Val Thr Leu Leu Pro Thr Ile Leu Ala Leu Lys 275 280 285 gaa gaa att caa gct aaa tac caa tac gac act cct att cgt gtc ggt 912 Glu Glu Ile Gln Ala Lys Tyr Gln Tyr Asp Thr Pro Ile Arg Val Gly 290 295 300 tgt ggt ggc ggt gtg ggt acg cct gat gca gcg ctg gca acg ttt aac 960 Cys Gly Gly Gly Val Gly Thr Pro Asp Ala Ala Leu Ala Thr Phe Asn 305 310 315 320 atg ggc gcg gcg tat att gtt acc ggc tct atc aac caa gct tgt gtt 1008 Met Gly Ala Ala Tyr Ile Val Thr Gly Ser Ile Asn Gln Ala Cys Val 325 330 335 gaa gcg ggc gca agt gat cac act cgt aaa tta ctt gcc acc act gaa 1056 Glu Ala Gly Ala Ser Asp His Thr Arg Lys Leu Leu Ala Thr Thr Glu 340 345 350 atg gcc gat gtg act atg gca cca gct gca gat atg ttc gag atg ggc 1104 Met Ala Asp Val Thr Met Ala Pro Ala Ala Asp Met Phe Glu Met Gly 355 360 365 gta aaa ctg cag gtg gtt aag cgc ggc acg cta ttc cca atg cgc gct 1152 Val Lys Leu Gln Val Val Lys Arg Gly Thr Leu Phe Pro Met Arg Ala 370 375 380 aac aag cta tat gag atc tac acc cgt tac gat tca atc gaa gcg atc 1200 Asn Lys Leu Tyr Glu Ile Tyr Thr Arg Tyr Asp Ser Ile Glu Ala Ile 385 390 395 400 cca tta gac gag cgt gaa aag ctt gag aaa caa gta ttc cgc tca agc 1248 Pro Leu Asp Glu Arg Glu Lys Leu Glu Lys Gln Val Phe Arg Ser Ser 405 410 415 cta gat gaa ata tgg gca ggt aca gtg gcg cac ttt aac gag cgc gac 1296 Leu Asp Glu Ile Trp Ala Gly Thr Val Ala His Phe Asn Glu Arg Asp 420 425 430 cct aag caa atc gaa cgc gca gag ggt aac cct aag cgt aaa atg gca 1344 Pro Lys Gln Ile Glu Arg Ala Glu Gly Asn Pro Lys Arg Lys Met Ala 435 440 445 ttg att ttc cgt tgg tac tta ggt ctt tct agt cgc tgg tca aac tca 1392 Leu Ile Phe Arg Trp Tyr Leu Gly Leu Ser Ser Arg Trp Ser Asn Ser 450 455 460 ggc gaa gtg ggt cgt gaa atg gat tat caa att tgg gct ggc cct gct 1440 Gly Glu Val Gly Arg Glu Met Asp Tyr Gln Ile Trp Ala Gly Pro Ala 465 470 475 480 ctc ggt gca ttt aac caa tgg gca aaa ggc agt tac tta gat aac tat 1488 Leu Gly Ala Phe Asn Gln Trp Ala Lys Gly Ser Tyr Leu Asp Asn Tyr 485 490 495 caa gac cga aat gcc gtc gat ttg gca aag cac tta atg tac ggc gcg 1536 Gln Asp Arg Asn Ala Val Asp Leu Ala Lys His Leu Met Tyr Gly Ala 500 505 510 gct tac tta aat cgt att aac tcg cta acg gct caa ggc gtt aaa gtg 1584 Ala Tyr Leu Asn Arg Ile Asn Ser Leu Thr Ala Gln Gly Val Lys Val 515 520 525 cca gca cag tta ctt cgc tgg aag cca aac caa aga atg gcc 1626 Pro Ala Gln Leu Leu Arg Trp Lys Pro Asn Gln Arg Met Ala 530 535 540 <210> 13 <211> 542 <212> PRT <400> 1 Met Asn Pro Thr Ala Thr Asn Glu Met Leu Ser Pro Trp Pro Trp Ala 1 5 10 15 Val Thr Glu Ser Asn Ile Ser Phe Asp Val Gln Val Met Glu Gln Gln 20 25 30 Leu Lys Asp Phe Ser Arg Ala Cys Tyr Val Val Asn His Ala Asp His 35 40 45 Gly Phe Gly Ile Ala Gln Thr Ala Asp Ile Val Thr Glu Gln Ala Ala 50 55 60 Asn Ser Thr Asp Leu Pro Val Ser Ala Phe Thr Pro Ala Leu Gly Thr 65 70 75 80 Glu Ser Leu Gly Asp Asn Asn Phe Arg Arg Val His Gly Val Lys Tyr 85 90 95 Ala Tyr Tyr Ala Gly Ala Met Ala Asn Gly Ile Ser Ser Glu Glu Leu 100 105 110 Val Ile Ala Leu Gly Gln Ala Gly Ile Leu Cys Ser Phe Gly Ala Ala 115 120 125 Gly Leu Ile Pro Ser Arg Val Glu Ala Ala Ile Asn Arg Ile Gln Ala 130 135 140 Ala Leu Pro Asn Gly Pro Tyr Met Phe Asn Leu Ile His Ser Pro Ser 145 150 155 160 Glu Pro Ala Leu Glu Arg Gly Ser Val Glu Leu Phe Leu Lys His Lys 165 170 175 Val Arg Thr Val Glu Ala Ser Ala Phe Leu Gly Leu Thr Pro Gln Ile 180 185 190 Val Tyr Tyr Arg Ala Ala Gly Leu Ser Arg Asp Ala Gln Gly Lys Val 195 200 205 Val Val Gly Asn Lys Val Ile Ala Lys Val Ser Arg Thr Glu Val Ala 210 215 220 Glu Lys Phe Met Met Pro Ala Pro Ala Lys Met Leu Gln Lys Leu Val 225 230 235 240 Asp Asp Gly Ser Ile Thr Ala Glu Gln Met Glu Leu Ala Gln Leu Val 245 250 255 Pro Met Ala Asp Asp Ile Thr Ala Glu Ala Asp Ser Gly Gly His Thr 260 265 270 Asp Asn Arg Pro Leu Val Thr Leu Leu Pro Thr Ile Leu Ala Leu Lys 275 280 285 Glu Glu Ile Gln Ala Lys Tyr Gln Tyr Asp Thr Pro Ile Arg Val Gly 290 295 300 Cys Gly Gly Gly Val Gly Thr Pro Asp Ala Ala Leu Ala Thr Phe Asn 305 310 315 320 Met Gly Ala Ala Tyr Ile Val Thr Gly Ser Ile Asn Gln Ala Cys Val 325 330 335 Glu Ala Gly Ala Ser Asp His Thr Arg Lys Leu Leu Ala Thr Thr Glu 340 345 350 Met Ala Asp Val Thr Met Ala Pro Ala Ala Asp Met Phe Glu Met Gly 355 360 365 Val Lys Leu Gln Val Val Lys Arg Gly Thr Leu Phe Pro Met Arg Ala 370 375 380 Asn Lys Leu Tyr Glu Ile Tyr Thr Arg Tyr Asp Ser Ile Glu Ala Ile 385 390 395 400 Pro Leu Asp Glu Arg Glu Lys Leu Glu Lys Gln Val Phe Arg Ser Ser 405 410 415 Leu Asp Glu Ile Trp Ala Gly Thr Val Ala His Phe Asn Glu Arg Asp 420 425 430 Pro Lys Gln Ile Glu Arg Ala Glu Gly Asn Pro Lys Arg Lys Met Ala 435 440 445 Leu Ile Phe Arg Trp Tyr Leu Gly Leu Ser Ser Arg Trp Ser Asn Ser 450 455 460 Gly Glu Val Gly Arg Glu Met Asp Tyr Gln Ile Trp Ala Gly Pro Ala 465 470 475 480 Leu Gly Ala Phe Asn Gln Trp Ala Lys Gly Ser Tyr Leu Asp Asn Tyr 485 490 495 Gln Asp Arg Asn Ala Val Asp Leu Ala Lys His Leu Met Tyr Gly Ala 500 505 510 Ala Tyr Leu Asn Arg Ile Asn Ser Leu Thr Ala Gln Gly Val Lys Val 515 520 525 Pro Ala Gln Leu Leu Arg Trp Lys Pro Asn Gln Arg Met Ala 530 535 540[0026] <210> 1 <211> 37895 <212> DNA <213> Shewanella putrefaciens SCRC-2874 (FERM BP-1625) <400> 1 Position gatctcttac aaagaaacta tctcaatgtg aatttaacct taattccgtt taattacggc 60 ctgatagagc atcacccaat cagccataaa actgtaaagt gggtactcaa aggtggctgg 120 gcgattcttc tcaaatacaa agtgcccaac ccaagcaaat ccatatccga taacaggtaa 180 aagtagcaat aaaccccagc gctgagttag taatacataa gcgaataata ggatcactaa 240 actactgccg aaatagtgta atattcgaca gtttctatgc tgatgttgag ataaataaaa 300 agggtaaaat tcagcaaaag aacgatagcg cttactcatt actcacacct cggtaaaaaa 360 gcaactcgcc attaacttgg ccaatcgtca gttgttctat cgtctcaaag ttatgccgac 420 taaataactc tatatgtgca ttatgattag caaaaactcc gataccatca agatgaagtt 480 gttcatcaca ccaactcaaa actgcgtcga taagcttact gccatagccc ttgccttgct 540 ccacatttgc gatagcaata aactgtaaaa tgccacattg gccacttggt aagctctcta 600 taatctgatt ttctttgtta ataagtgcct gagttgaata ccaaccagta cttaacaaca 660 tctttaaacg ccaatgccaa aaacgcgctt cacctaaggg aacctgctga gtcactatgc 720 aggctacgcc tatcaatcta tccccaacga acataccaat aagtgcttgc tcctgttgcc 780 agagctcatt gagttcttct cgaatagccc cgcgaagctt ttgctcatac tgcgcttgat 840 caccactaaa aagtgtttcg ataaaaaagg gatcatcatg ataggcgtta tagagaatag 900 aggctgctat gcgtaaatct tctgccgtga gataaactgc acgacactct tccatggctt 960 gatcttccat tgttattgtc cttgaccttg atcacacaac accaatgtaa caagactgta 1020 tagaagtgca attaataatc aattcgtgca ttaagcaggt cagcatttct ttgctaaaca 1080 agctttattg gctttgacaa aactttgcct agactttaac gatagaaatc ataatgaaag 1140 agaaaagcta caacctagag gggaataatc aaacaactgc taagatctag ataatgtaat 1200 aaacaccgag tttatcgacc atacttagat agagtcatag caacgagaat agttatggat 1260 acaacgccgc aagatctatc acacctgttt ttacagctag gattagcaaa tgatcaaccc 1320 gcaattgaac agtttatcaa tgaccatcaa ttagcggaca atatattgct acatcaagca 1380 agcttttgga gcccatcgca aaagcacttc ttaattgagt catttaatga agatgcccag 1440 tggaccgaag tcatcgacca cttagacacc ttattaagaa aaaactaacc attacaacag 1500 caactttaaa ttttgccgta agccatctcc ccccacccca caacagcgtt gttgcttatg 1560 accactggag tacattcgtc tttagtcgtt ttaccatcac catgggtacg ttgagtgcga 1620 taaaaaagca cataaacttc tttatcggcc tgaatatagg cttcgttaaa atcagctgtt 1680 cccattaaag taaccacttg ctctttactc atgcctagag atatctttgt caaattgtca 1740 cggtttttat cttgagtttt ctcccaagca ccgtgattat cccagtcaga ttccccatca 1800 ccaacattga ccacacagcc cgttagccct aagcttgcaa tcccaaaaca tgctaaacct 1860 aataatttat ttttcatttt aacttcctgt tatgacatta tttttgctta gaagaaaagc 1920 aacttacatg ccaaaacaca agctgttgtt ttaaatgact ttatttatta ttagcctttt 1980 aggatatgcc tagagcaata ataattacca atgtttaagg aatttgacta actatgagtc 2040 cgattgagca agtgctaaca gctgctaaaa aaatcaatga acaaggtaga gaaccaacat 2100 tagcattgat taaaaccaaa cttggtaata gcatcccaat gcgcgagtta atccaaggtt 2160 tgcaacagtt taagtctatg agtgcagaag aaagacaagc aatacctagc agcttagcaa 2220 cagcaaaaga aactcaatat ggtcaatcaa gcttatctca atctgaacaa gctgatagga 2280 tcctccagct agaaaacgcc ctcaatgaat taagaaacga atttaatggg ctaaaaagtc 2340 aatttgataa cttacaacaa aacctgatga ataaagagcc tgacaccaaa tgcatgtaat 2400 tgaactacga tttgaatgtt ttgataacac cacgattact gcagcagaaa aagccattaa 2460 tggtttgctt gaagcttatc gagccaatgg ccaggttcta ggtcgtgaat ttgccgttgc 2520 atttaacgat ggtgagttta aagcacgcat gttaacccca gaaaaaagca gcttatctaa 2580 acgctttaat agtccttggg taaatagtgc actcgaagag ctaaccgaag ccaaattgct 2640 tgcgccacgt gaaaagtata ttggccaaga tattaattct gaagcatcta gccaagacac 2700 accaagttgg cagctacttt acacaagtta tgtgcacatg tgctcaccac taagaaatgg 2760 cgacaccttg cagcctattc cactgtatca aattccagca actgccaacg gcgatcataa 2820 acgaatgatc cgttggcaaa cagaatggca agcttgtgat gaattgcaaa tggccgcagc 2880 tactaaagct gaatttgccg cacttgaaga gctaaccagt catcagagtg atctatttag 2940 gcgtggttgg gacttacgtg gcagagtcga atacttgacg aaaattccga cctattacta 3000 tttataccgt gttggcggtg aaagcttagc agtagaaaag cagcgctctt gtcctaagtg 3060 tggcagtcaa gaatggctgc tcgataaacc attattggat atgttccatt ttcgctgtga 3120 cacctgccgc atcgtatcta atatctcttg ggaccattta taactcttcc gagtcttatc 3180 acactagagt ttagtcagca taaaaatggc gcttatattt caattaaaag aaatataagc 3240 gccattttca tcgatactat atatcagcag actattttcc gcgtaaatta gcccacatta 3300 atttcattct ttgccagatc cctggatgat ctagttgtgg catcgactct tcaataggtt 3360 taaccgcagg tgtaaccctt ggagtcaatt cgtttataaa ctcgtttaaa ctgtcactta 3420 atttaacgct ttgtacttca cctggaattt caatccatac gctgccatca ctattattaa 3480 ccgtcaacat tttatcttca tcatcaagaa taccaataaa ccaagtcggc tcttgcttaa 3540 gctttctctt catcattaaa tgaccaatga tgttttgttg taagtattca aaatcagttt 3600 gatcccacac ttggattagc tcaccttggc cccattgtga gtcaaaaaat agcggtgcag 3660 aaaaatgact gccaaaaaat ggattaattt ctgcagataa tgtcatttca agtgctgttt 3720 caacattagc aaattcacca ggttgttgac gtacaaccga ttgccaaaac actgcgccat 3780 cggagcccgc ttcggcgaca acacactcag acttttgtcc ttgcgcataa tatcttggct 3840 gttcaccaag cttatccatg taggcttgtt gatatttaga taaaaaaaga tctaaagcag 3900 gtaaagaaga cacttaagcc agttccaaaa tcagttataa taggggtcta ttttgacatg 3960 gaaaccgtat tgatgacaca acatcatgat ccctacagta acgcccccga actttctgaa 4020 ttaactttag gaaagtcgac cggttatcaa gagcagtatg atgcatcttt actacaagcg 4080 tgccgcgtaa attaaaccgt gatgctatcg gtctaaccaa tgagctacct tttcatggct 4140 gtgatatttg gactggctac gaactgtctt ggctaaatgc taaaggcaag ccaatgattg 4200 ctattgcaga ctttaaccta agttttgata gtaaaaatct gatcgagtct aagtcgttta 4260 agctgtattt aaacagctat aaccaaacac gatttgatag cgttcaagcg gttcaagaac 4320 gtttaactga agacttaagc gcctgtgccc aaggcacagt tacggtaaaa gtgattgaac 4380 ctaagcaatt taaccacctg agagtggttg atatgccagg tacctgcatt gacgatttag 4440 atattgaagt tgatgactat agctttaact ctgactatct caccgacagt gttgatgaca 4500 aagtcatggt tgctgaaacg ctaacgtcaa acttattgaa atcaaactgc ctaatcactt 4560 ctcagcctga ctggggtaca gtgatgatcc gttatcaagg gcctaagata gaccgtgaaa 4620 agctacttag atatctgatt tcatttagac agcacaatga atttcatgag cagtgtgttg 4680 agcgtatatt tgttgattta aagcactatt gccaatgtgc caaacttact gtctatgcac 4740 gttatacccg ccgtggtggt ttagatatca acccatatcg tagcgacttt gaaaaccctg 4800 cagaaaatca gcgcctagcg agacagtaat tgattgcagt acctacaaaa aacaatgcct 4860 ataagccaag cttatgggca tttttatatt atcaacttgt catcaaacct cagccgccaa 4920 gccttttagt tttatcgcta aattaagccg ctctctcagc caaatatttg caggattttg 4980 ctgtaattta tggctccaca ccatgaaata ctctatcggc tctaccgcaa aaggtaagtc 5040 aaatacctgt aagccaaaca gcttggcata ttcgtcagtg tgggcttttg acgcgatagc 5100 taacgcatca ctttttgagg caaccgacat catacttaat attgatgatt gctcgctgtg 5160 catttgcctt gccggtaaca cctgtttagt cagcaagtcg gcaacactta aattgtagcg 5220 gcgcatctta aaaataatat gcttttcatt aaagtattgc tcttgcgtca acccaccttg 5280 gatccttggg tgagcatttc gtgccacaca aactaattta tcctgcatta ctttttgact 5340 cttaaatgcc gcagattctg gcagccaaat atctaaggct aaatccacct tttctagttg 5400 taggtccatc tgcaactctt cttcaatgag cggcggctca cgaaatacaa tattaattgc 5460 agtgccctgt aacacttgct caatttgatc ttgcaagagt tgtattgccg actcgctggc 5520 atacacataa aaagttcgct cacttgaagt ggggtcaaat gcttcaaagc tagtcgcaac 5580 ttgctcaatt gttgacatag cgcccgcgag ctgttgataa agcgtcatcg cacttgcggt 5640 aggtttaact cccctaccca ctcgagtaaa caactcttct ccaacaatac tttttagcct 5700 cgaaatcgca ttactaaccg acgactgagt caaatccagc tcttctgccg cccggctaaa 5760 agatgaggtg cgatacaccg cagtaaaaac gcgaaataaa ttaagatcaa aagctttttg 5820 ctgcgacata aatcagctat ctccttatcc ttatccttat ccttataaaa agttagctcc 5880 agagcactct agctcaaaaa caactcagcg tattaagcca atattttggg aactcaatta 5940 atattcataa taaaagtatt cataatataa ataccaagtc ataatttagc cctaattatt 6000 aatcaattca agttacctat actggcctca attaagcaaa tgtctcatca gtctccctgc 6060 aactaaatgc aatattgaga cataaagctt tgaactgatt caatcttacg agggtaactt 6120 atgaaacaga ctctaatggc tatctcaatc atgtcgcttt tttcattcaa tgcgctagca 6180 gcgcaacatg aacatgacca catcactgtt gattacgaag ggaaagccgc aacagaacac 6240 accatagctc acaaccaagc tgtagctaaa acacttaact ttgccgacac gcgtgcattt 6300 gagcaatcgt ctaaaaatct agtcgccaag tttgataaag caactgccga tatattacgt 6360 gccgaatttg cttttattag cgatgaaatc cctgactcgg ttaacccgtc tctctaccgt 6420 caggctcagc ttaatatggt gcctaatggt ctgtataaag tgagcgatgg catttaccag 6480 gtccgcggta ccgacttatc taaccttaca cttatccgca gtgataacgg ttggatagca 6540 tacgatgttt tgttaaccaa agaagcagca aaagcctcac tacaatttgc gttaaagaat 6600 ctacctaaag atggcgattt acccgttgtt gcgatgattt actcccatag ccatgcggac 6660 cactttggcg gagctcgcgg tgttcaagag atgttccctg atgtcaaagt ctacggctca 6720 gataacatca ctaaagaaat tgtcgatgag aacgtacttg ccggtaacgc catgagccgc 6780 cgcgcagctt atcaatacgg cgcaacactg ggcaaacatg accacggtat tgttgatgct 6840 gcgctaggta aaggtctatc aaaaggtgaa atcacttacg tcgccccaga ctacacctta 6900 aacagtgaag gcaaatggga aacgctgacg attgatggtc tagagatggt gtttatggat 6960 gcctcgggca ccgaagctga gtcagaaatg atcacttata ttccctctaa aaaagcgctc 7020 tggacggcgg agcttaccta tcaaggtatg cacaacattt atacgctgcg cggcgctaaa 7080 gtacgtgatg cgctcaagtg gtcaaaagat atcaacgaaa tgatcaatgc ctttggtcaa 7140 gatgtcgaag tgctgtttgc ctcgcactct gcgccagtgt ggggtaacca agcgatcaac 7200 gatttcttac gcctacagcg tgataactac ggcctagtgc acaatcaaac cttgagactt 7260 gccaacgatg gtgtcggtat acaagatatt ggcgatgcga ttcaagacac gattccagag 7320 tctatctaca agacgtggca taccaatggt taccacggca cttatagcca taacgctaaa 7380 gcggtttata acaagtatct aggctacttc gatatgaacc cagccaacct taatccgctg 7440 ccaaccaagc aagaatctgc caagtttgtc gaatacatgg gcggcgcaga tgccgcaatt 7500 aagcgcgcta aagatgatta cgctcaaggt gaataccgct ttgttgcaac ggcattaaat 7560 aaggtggtga tggccgagcc agaaaatgac tccgctcgtc aattgctagc cgatacctat 7620 gagcaacttg gttatcaagc agaaggggct ggctggagaa acatttactt aactggcgca 7680 caagagctac gagtaggtat tcaagctggc gcgcctaaaa ccgcatcggc agatgtcatc 7740 agtgaaatgg acatgccgac tctatttgac ttcctcgcgg tgaagattga tagtcaacag 7800 gcggctaagc acggcttagt taagatgaat gttatcaccc ctgatactaa agatattctc 7860 tatattgagc taagcaacgg taacttaagc aacgcagtgg tcgacaaaga gcaagcagct 7920 gacgcaaacc ttatggttaa taaagctgac gttaaccgca tcttacttgg ccaagtaacc 7980 ctaaaagcgt tattagccag cggcgatgcc aagctcactg gtgataaaac ggcatttagt 8040 aaaatagccg atagcatggt cgagtttaca cctgacttcg aaatcgtacc aacgcctgtt 8100 aaatgaggca ttaatctcaa caagtgcaag ctagacataa aaatggggcg attagacgcc 8160 ccatttttta tgcaattttg aactagctag tcttagctga agctcgaaca acagctttaa 8220 aattcacttc ttctgctgca atacttattt gctgacactg accaatactc agtgcaaaac 8280 gataactatc atcaagatgg cccagtaaac aatgccaatt atcagcagcg ttcatttgct 8340 gttctttagc ctcaatcaaa cctaaaccag acttttgtgg ctcagcgtta ggcttattag 8400 aactcgactc tagtaaagca agaccaatat cttgttttaa caaaacctgt cgctgattaa 8460 gttgatgctc aaccttgtga tccgcaatag catcggaaat atcaacacaa tggctcaagc 8520 ttttaggtgc attaactcca agaaaagttt cgctcagtgc agagaagtca aacgcaaaag 8580 attttagcga taatgccagc ccaagtcctt tcgctttaat gtaagactcc ttgagcgccc 8640 acaaatcaaa aaagcggtct cgctgcaagg cctctggtaa cgctaacaag gctcgctttt 8700 ctgattcaga gaaataatga ctaagaatag agtggatatt ggtgctgtta cggcaacgct 8760 caatgtcgac gccaaactca atactagcag agtcagtttc ctccttgctt gcctgactgg 8820 cgcctttatt atcagcagtg caaatgccta ctaatagcca atctccacta tgactcacat 8880 taaagtggac cccggtttga gcaaattgcg catcactcaa tctaggctta cctttgtcgc 8940 catattcaaa gcgccattca ttggggcgta tttcactatg ttgtgacaat aaagcgcgca 9000 aatagcctct taccattaaa ccttgagttt tagcttcttg tttaatgtag cgattaacct 9060 taattaactc atcttcaggc agccatgact taaccaactc tgtagtctgg ttatcgcact 9120 cttgtattgt taacggacag aagtataagg aaatcaatcg agaagttagc aatttttcag 9180 gacactcttt aaagcaacaa acataacccc tatttttacc aatttaagat caaaactaaa 9240 gccaaaacta attgagaata gtgtcaaact agctttaaag gaaaaaaata taaaaagaac 9300 attatacttg tataaattat tttacacacc aaagccatga tcttcacaaa attagctccc 9360 tctccctaaa acaagattga ataaaaaaat aaaccttaac tttcatatag ataaaacaaa 9420 ccaatgggat aaagtatatt gaattcattt ttaaggaaaa attcaaattg aattcaagct 9480 cttcagtaaa agcatatttt gccgttagtg tgaaaaaaaa caaatttaaa aaccaacata 9540 gaacaaataa gcagacaata aaaccaaggc gcaacacaaa caacgcgctt acaattttca 9600 caaaaaagca acaagagtaa cgtttagtat ttggatatgg ttattgtaat tgagaatttt 9660 ataacaatta tattaaggga atgagtatgt ttttaaattc aaaactttcg cgctcagtca 9720 aacttgccat atccgcaggc ttaacagcct cgctagctat gcctgttttt gcagaagaaa 9780 ctgctgctga agaacaaata gaaagagtcg cagtgaccgg atcgcgaatc gctaaagcag 9840 agctaactca accagctcca gtcgtcagcc tttcagccga agaactgaca aaatttggta 9900 atcaagattt aggtagcgta ctagcagaat tacctgctat tggtgcaacc aacactatta 9960 ttggtaataa caatagcaac tcaagcgcag gtgttagctc agcagacttg cgtcgtctag 10020 gtgctaacag aaccttagta ttagtcaacg gtaagcgcta cgttgccggc caaccgggct 10080 cagctgaggt agatttgtca actataccaa ctagcatgat ctcgcgagtt gagattgtaa 10140 ccggcggtgc ttcagcaatt tatggttcgg acgctgtatc aggtgttatc aacgttatcc 10200 ttaaagaaga ctttgaaggc tttgagttta acgcacgtac tagcggttct actgaaagtg 10260 taggcactca agagcactct tttgacattt tgggtggtgc aaacgttgca gatggacgtg 10320 gtaatgtaac cttctacgca ggttatgaac gtacaaaaga agtcatggct accgacattc 10380 gccaattcga tgcttgggga acaattaaaa acgaagccga tggtggtgaa gatgatggta 10440 ttccagacag actacgtgta ccacgagttt attctgaaat gattaatgct accggtgtta 10500 tcaatgcatt tggtggtgga attggtcgct caacctttga cagtaacggc aatcctattg 10560 cacaacaaga acgtgatggg actaacagct ttgcatttgg ttcattccct aatggctgtg 10620 acacatgttt caacactgaa gcatacgaaa actatattcc aggggtagaa agaataaacg 10680 ttggctcatc attcaacttt gattttaccg ataacattca attttacact gacttcagat 10740 atgtaaagtc agatattcag caacaatttc agccttcatt ccgttttggt aacattaata 10800 tcaatgttga agataacgcc tttttgaatg acgacttgcg tcagcaaatg ctcgatgcgg 10860 gtcaaaccaa tgctagtttt gccaagtttt ttgatgaatt aggaaatcgc tcagcagaaa 10920 ataaacgcga acttttccgt tacgtaggtg gctttaaagg tggctttgat attagcgaaa 10980 ccatatttga ttacgacctt tactatgttt atggcgagac taataaccgt cgtaaaaccc 11040 ttaatgacct aattcctgat aactttgtcg cagctgtcga ctctgttatt gatcctgata 11100 ctggcttagc agcgtgtcgc tcacaagtag caagcgctca aggcgatgac tatacagatc 11160 ccgcgtctgt aaatggtagc gactgtgttg cttataaccc atttggcatg ggtcaagctt 11220 cagcagaagc ccgcgactgg gtttctgctg atgtgactcg tgaagacaaa ataactcaac 11280 aagtgattgg tggtactctc ggtaccgatt ctgaagaact atttgagctt caaggtggtg 11340 caatcgctat ggttgttggt tttgaatacc gtgaagaaac gtctggttca acaaccgatg 11400 aatttactaa agcaggtttc ttgacaagcg ctgcaacgcc agattcttat ggcgaatacg 11460 acgtgactga gtattttgtt gaggtgaaca tcccagtact aaaagaatta ccttttgcac 11520 atgagttgag ctttgacggt gcataccgta atgctgatta ctcacatgcc ggtaagactg 11580 aagcatggaa agctggtatg ttctactcac cattagagca acttgcatta cgtggtacgg 11640 taggtgaagc agtacgagca ccaaacattg cagaagcctt tagtccacgc tctcctggtt 11700 ttggccgcgt ttcagatcca tgtgatgcag ataacattaa tgacgatccg gatcgcgtgt 11760 caaactgtgc agcattgggg atccctccag gattccaagc taatgataac gtcagtgtag 11820 ataccttatc tggtggtaac ccagatctaa aacctgaaac atcaacatcc tttacaggtg 11880 gtcttgtttg gacaccaacg tttgctgaca atctatcatt cactgtcgat tattatgata 11940 ttcaaattga ggatgctatt ttgtcagtag ccacccagac tgtggctgat aactgtgttg 12000 actcaactgg cggacctgac accgacttct gtagtcaagt tgatcgtaat ccaacgacct 12060 atgatattga acttgttcgc tctggttatc taaatgccgc ggcattgaat accaaaggta 12120 ttgaatttca agctgcatac tcattagatc tagagtcttt caacgcgcct ggtgaactac 12180 gcttcaacct attggggaac caattacttg aactagaacg tcttgaattc caaaatcgtc 12240 ctgatgagat taatgatgaa aaaggcgaag taggtgatcc agagctgcag ttccgcctag 12300 gcatcgatta ccgtctagat gatctaagtg ttagctggaa cacgcgttat attgatagcg 12360 tagtaactta tgatgtctct gaaaatggtg gctctcctga agatttatat ccaggccaca 12420 taggctcaat gacaactcat gacttgagcg ctacatacta catcaatgag aacttcatga 12480 ttaacggtgg tgtacgtaac ctatttgacg cacttccacc tggatacact aacgatgcgc 12540 tatatgatct agttggtcgc cgtgcattcc taggtattaa ggtaatgatg taattaatta 12600 ttacgcctct aactaataaa aatgcaatct cttcgtagag attgcatttt tttatgaaat 12660 ccaatcttaa actggttctc cgagcatctt acgccttaaa aaccccgccc ctcaatgtaa 12720 cgccaaagtt aattgcttac acgcacttac acaaacgaac aatttcatta acacgagaca 12780 cagctcacgc tttttatttt acccttgatt ttactacata aaattgcgtt ttagcgcaca 12840 agtgttctcc caagctggtc gtatctgtaa ttattcagtc ccaggtgatt gtattgaccc 12900 ataagctcag gtagtctgct ctgccattag ctaaacaata ttgacaaaat ggcgataaaa 12960 tgtggcttag cgctaagttc accgtaagtt ttatcggcat taagtcccaa cagattatta 13020 acggaaaccc gctaaactga tggcaaaaat aaatagtgaa cacttggatg aagctactat 13080 tacttcgaat aagtgtacgc aaacagagac tgaggctcgg catagaaatg ccactacaac 13140 acctgagatg cgccgattca tacaagagtc ggatctcagt gttagccaac tgtctaaaat 13200 attaaatatc agtgaagcta ccgtacgtaa gtggcgcaag cgtgactctg tcgaaaactg 13260 tcctaatacc ccgcaccatc tcaataccac gctaacccct ttgcaagaat atgtggttgt 13320 gggcctgcgt tatcaattga aaatgccatt agacagattg ctcaaagcaa cccaagagtt 13380 tatcaatcca aacgtgtcgc gctcaggttt agcaagatgt ttgaagcgtt atggcgtttc 13440 acgggtgagt gatatccaaa gcccacacgt accaatgcgc tactttaatc aaattccagt 13500 cactcaaggc agcgatgtgc aaacctacac cctgcactat gaaacgctgg caaaaacctt 13560 agccttacct agtaccgatg gtgacaatgt ggtgcaagtg gtgtctctca ccattccacc 13620 aaagttaacc gaagaagcac ccagttcaat tttgctcggc attgatcctc atagcgactg 13680 gatctatctc gacatatacc aagatggcaa tacacaagcc acgaatagat atatggctta 13740 tgtgctaaaa cacgggccat tccatttacg aaagttactc gtgcgtaact atcacacctt 13800 tttacagcgc tttcctggag cgacgcaaaa tcgccgcccc tctaaagata tgcctgaaac 13860 aatcaacaag acgcctgaaa cacaggcacc cagtggagac tcataatgag ccagacctct 13920 aaacctacaa actcagcaac tgagcaagca caagactcac aagctgactc tcgtttaaat 13980 aaacgactaa aagatatgcc aattgctatt gttggcatgg cgagtatttt tgcaaactct 14040 cgctatttga ataagttttg ggacttaatc agcgaaaaaa ttgatgcgat tactgaatta 14100 ccatcaactc actggcagcc tgaagaatat tacgacgcag ataaaaccgc agcagacaaa 14160 agctactgta aacgtggtgg ctttttgcca gatgtagact tcaacccaat ggagtttggc 14220 ctgccgccaa acattttgga actgaccgat tcatcgcaac tattatcact catcgttgct 14280 aaagaagtgt tggctgatgc taacttacct gagaattacg accgcgataa aattggtatc 14340 accttaggtg tcggcggtgg tcaaaaaatt agccacagcc taacagcgcg tctgcaatac 14400 ccagtattga agaaagtatt cgccaatagc ggcattagtg acaccgacag cgaaatgctt 14460 atcaagaaat tccaagacca atatgtacac tgggaagaaa actcgttccc aggttcactt 14520 ggtaacgtta ttgcgggccg tatcgccaac cgcttcgatt ttggcggcat gaactgtgtg 14580 gttgatgctg cctgtgctgg atcacttgct gctatgcgta tggcgctaac agagctaact 14640 gaaggtcgct ctgaaatgat gatcaccggt ggtgtgtgta ctgataactc accctctatg 14700 tatatgagct tttcaaaaac gcccgccttt accactaacg aaaccattca gccatttgat 14760 atcgactcaa aaggcatgat gattggtgaa ggtattggca tggtggcgct aaagcgtctt 14820 gaagatgcag agcgcgatgg cgaccgcatt tactctgtaa ttaaaggtgt gggtgcatca 14880 tctgacggta agtttaaatc aatctatgcc cctcgcccat caggccaagc taaagcactt 14940 aaccgtgcct atgatgacgc aggttttgcg ccgcatacct taggtctaat tgaagctcac 15000 ggaacaggta ctgcagcagg tgacgcggca gagtttgccg gcctttgctc agtatttgct 15060 gaaggcaacg ataccaagca acacattgcg ctaggttcag ttaaatcaca aattggtcat 15120 actaaatcaa ctgcaggtac agcaggttta attaaagctg ctcttgcttt gcatcacaag 15180 gtactgccgc cgaccattaa cgttagtcag ccaagcccta aacttgatat cgaaaactca 15240 ccgttttatc taaacactga gactcgtcca tggttaccac gtgttgatgg tacgccgcgc 15300 cgcgcgggta ttagctcatt tggttttggt ggcactaact tccattttgt actagaagag 15360 tacaaccaag aacacagccg tactgatagc gaaaaagcta agtatcgtca acgccaagtg 15420 gcgcaaagct tccttgttag cgcaagcgat aaagcatcgc taattaacga gttaaacgta 15480 ctagcagcat ctgcaagcca agctgagttt atcctcaaag atgcagcagc aaactatggc 15540 gtacgtgagc ttgataaaaa tgcaccacgg atcggtttag ttgcaaacac agctgaagag 15600 ttagcaggcc taattaagca agcacttgcc aaactagcag ctagcgatga taacgcatgg 15660 cagctacctg gtggcactag ctaccgcgcc gctgcagtag aaggtaaagt tgccgcactg 15720 tttgctggcc aaggttcaca atatctcaat atgggccgtg accttacttg ttattaccca 15780 gagatgcgtc agcaatttgt aactgcagat aaagtatttg ccgcaaatga taaaacgccg 15840 ttatcgcaaa ctctgtatcc aaagcctgta tttaataaag atgaattaaa ggctcaagaa 15900 gccattttga ccaataccgc caatgcccaa agcgcaattg gtgcgatttc aatgggtcaa 15960 tacgatttgt ttactgcggc tggctttaat gccgacatgg ttgcaggcca tagctttggt 16020 gagctaagtg cactgtgtgc tgcaggtgtt atttcagctg atgactacta caagctggct 16080 tttgctcgtg gtgaggctat ggcaacaaaa gcaccggcta aagacggcgt tgaagcagat 16140 gcaggagcaa tgtttgcaat cataaccaag agtgctgcag accttgaaac cgttgaagcc 16200 accatcgcta aatttgatgg ggtgaaagtc gctaactata acgcgccaac gcaatcagta 16260 attgcaggcc caacagcaac taccgctgat gcggctaaag cgctaactga gcttggttac 16320 aaagcgatta acctgccagt atcaggtgca ttccacactg aacttgttgg tcacgctcaa 16380 gcgccatttg ctaaagcgat tgacgcagcc aaatttacta aaacaagccg agcactttac 16440 tcaaatgcaa ctggcggact ttatgaaagc actgctgcaa agattaaagc ctcgtttaag 16500 aaacatatgc ttcaatcagt gcgctttact agccagctag aagccatgta caacgacggc 16560 gcccgtgtat ttgttgaatt tggtccaaag aacatcttac aaaaattagt tcaaggcacg 16620 cttgtcaaca ctgaaaatga agtttgcact atctctatca accctaatcc taaagttgat 16680 agtgatctgc agcttaagca agcagcaatg cagctagcgg ttactggtgt ggtactcagt 16740 gaaattgacc cataccaagc cgatattgcc gcaccagcga aaaagtcgcc aatgagcatt 16800 tcgcttaatg ctgctaacca tatcagcaaa gcaactcgcg ctaagatggc caagtcttta 16860 gagacaggta tcgtcacctc gcaaatagaa catgttattg aagaaaaaat cgttgaagtt 16920 gagaaactgg ttgaagtcga aaagatcgtc gaaaaagtgg ttgaagtaga gaaagttgtt 16980 gaggttgaag ctcctgttaa ttcagtgcaa gccaatgcaa ttcaaacccg ttcagttgtc 17040 gctccagtaa tagagaacca agtcgtgtct aaaaacagta agccagcagt ccagagcatt 17100 agtggtgatg cactcagcaa cttttttgct gcacagcagc aaaccgcaca gttgcatcag 17160 cagttcttag ctattccgca gcaatatggt gagacgttca ctacgctgat gaccgagcaa 17220 gctaaactgg caagttctgg tgttgcaatt ccagagagtc tgcaacgctc aatggagcaa 17280 ttccaccaac tacaagcgca aacactacaa agccacaccc agttccttga gatgcaagcg 17340 ggtagcaaca ttgcagcgtt aaacctactc aatagcagcc aagcaactta cgctccagcc 17400 attcacaatg aagcgattca aagccaagtg gttcaaagcc aaactgcagt ccagccagta 17460 atttcaacac aagttaacca tgtgtcagag cagccaactc aagctccagc tccaaaagcg 17520 cagccagcac ctgtgacaac tccagttcaa actgctccgg cacaagttgt tcgtcaagcc 17580 gcaccagttc aagccgctat tgaaccgatt aatacaagtg ttgcgactac aacgccttca 17640 gccttcagcg ccgaaacagc cctgagcgca acaaaagtcc aagccactat gcttgaagtg 17700 gttgctgaga aaaccggtta cccaactgaa atgctagagc ttgaaatgga tatggaagcc 17760 gatttaggca tcgattctat caagcgtgta gaaattcttg gcacagtaca agatgagcta 17820 ccgggtctac ctgagcttag ccctgaagat ctagctgagt gtcgaacgct aggcgaaatc 17880 gttgactata tgggcagtaa actgccggct gaaggctcta tgaattctca gctgtctaca 17940 ggttccgcag ctgcgactcc tgcagcgaat ggtctttctg cggagaaagt tcaagcgact 18000 atgatgtctg tggttgccga aaagactggc tacccaactg aaatgctaga gcttgaaatg 18060 gatatggaag ccgatttagg catagattct atcaagcgcg ttgaaattct tggcacagta 18120 caagatgagc taccgggtct acctgagctt agccctgaag atctagctga gtgtcgtact 18180 ctaggcgaaa tcgttgacta tatgaactct aaactcgctg acggctctaa gctgccggct 18240 gaaggctcta tgaattctca gctgtctaca agtgccgcag ctgcgactcc tgcagcgaat 18300 ggtctctctg cggagaaagt tcaagcgact atgatgtctg tggttgccga aaagactggc 18360 tacccaactg aaatgctaga acttgaaatg gatatggaag ctgaccttgg catcgattca 18420 atcaagcgcg ttgaaattct tggcacagta caagatgagc taccgggttt acctgagcta 18480 aatccagaag atttggcaga gtgtcgtact cttggcgaaa tcgtgactta tatgaactct 18540 aaactcgctg acggctctaa gctgccagct gaaggctcta tgcactatca gctgtctaca 18600 agtaccgctg ctgcgactcc tgtagcgaat ggtctctctg cagaaaaagt tcaagcgacc 18660 atgatgtctg tagttgcaga taaaactggc tacccaactg aaatgcttga acttgaaatg 18720 gatatggaag ccgatttagg tatcgattct atcaagcgcg ttgaaattct tggcacagta 18780 caagatgagc taccgggttt acctgagcta aatccagaag atctagcaga gtgtcgcacc 18840 ctaggcgaaa tcgttgacta tatgggcagt aaactgccgg ctgaaggctc tgctaataca 18900 agtgccgctg cgtctcttaa tgttagtgcc gttgcggcgc ctcaagctgc tgcgactcct 18960 gtatcgaacg gtctctctgc agagaaagtg caaagcacta tgatgtcagt agttgcagaa 19020 aagaccggct acccaactga aatgctagaa cttggcatgg atatggaagc cgatttaggt 19080 atcgactcaa ttaaacgcgt tgagattctt ggcacagtac aagatgagct accgggtcta 19140 ccagagctta atcctgaaga tttagctgag tgccgtacgc tgggcgaaat cgttgactat 19200 atgaactcta agctggctga cggctctaag cttccagctg aaggctctgc taatacaagt 19260 gccactgctg cgactcctgc agtgaatggt ctttctgctg acaaggtaca ggcgactatg 19320 atgtctgtag ttgctgaaaa gaccggctac ccaactgaaa tgctagaact tggcatggat 19380 atggaagcag accttggtat tgattctatt aagcgcgttg aaattcttgg cacagtacaa 19440 gatgagctcc caggtttacc tgagcttaat cctgaagatc tcgctgagtg ccgcacgctt 19500 ggcgaaatcg ttagctatat gaactctcaa ctggctgatg gctctaaact ttctacaagt 19560 gcggctgaag gctctgctga tacaagtgct gcaaatgctg caaagccggc agcaatttcg 19620 gcagaaccaa gtgttgagct tcctcctcat agcgaggtag cgctaaaaaa gcttaatgcg 19680 gcgaacaagc tagaaaattg tttcgccgca gacgcaagtg ttgtgattaa cgatgatggt 19740 cacaacgcag gcgttttagc tgagaaactt attaaacaag gcctaaaagt agccgttgtg 19800 cgtttaccga aaggtcagcc tcaatcgcca ctttcaagcg atgttgctag ctttgagctt 19860 gcctcaagcc aagaatctga gcttgaagcc agtatcactg cagttatcgc gcagattgaa 19920 actcaggttg gcgctattgg tggctttatt cacttgcaac cagaagcgaa tacagaagag 19980 caaacggcag taaacctaga tgcgcaaagt tttactcacg ttagcaatgc gttcttgtgg 20040 gccaaattat tgcaaccaaa gctcgttgct ggagcagatg cgcgtcgctg ttttgtaaca 20100 gtaagccgta tcgacggtgg ctttggttac ctaaatactg acgccctaaa agatgctgag 20160 ctaaaccaag cagcattagc tggtttaact aaaaccttaa gccatgaatg gccacaagtg 20220 ttctgtcgcg cgctagatat tgcaacagat gttgatgcaa cccatcttgc tgatgcaatc 20280 accagtgaac tatttgatag ccaagctcag ctacctgaag tgggcttaag cttaattgat 20340 ggcaaagtta accgcgtaac tctagttgct gctgaagctg cagataaaac agcaaaagca 20400 gagcttaaca gcacagataa aatcttagtg actggtgggg caaaaggggt gacatttgaa 20460 tgtgcactgg cattagcatc tcgcagccag tctcacttta tcttagctgg gcgcagtgaa 20520 ttacaagctt taccaagctg ggctgagggt aagcaaacta gcgagctaaa atcagctgca 20580 atcgcacata ttatttctac tggtcaaaag ccaacgccta agcaagttga agccgctgtg 20640 tggccagtgc aaagcagcat tgaaattaat gccgccctag ccgcctttaa caaagttggc 20700 gcctcagctg aatacgtcag catggatgtt accgatagcg ccgcaatcac agcagcactt 20760 aatggtcgct caaatgagat caccggtctt attcatggcg caggtgtact agccgacaag 20820 catattcaag acaagactct tgctgaactt gctaaagttt atggcactaa agtcaacggc 20880 ctaaaagcgc tgctcgcggc acttgagcca agcaaaatta aattacttgc tatgttctca 20940 tctgcagcag gtttttacgg taatatcggc caaagcgatt acgcgatgtc gaacgatatt 21000 cttaacaagg cagcgctgca gttcaccgct cgcaacccac aagctaaagt catgagcttt 21060 aactggggtc cttgggatgg cggcatggtt aacccagcgc ttaaaaagat gtttaccgag 21120 cgtggtgtgt acgttattcc actaaaagca ggtgcagagc tatttgccac tcagctattg 21180 gctgaaactg gcgtgcagtt gctcattggt acgtcaatgc aaggtggcag cgacactaaa 21240 gcaactgaga ctgcttctgt aaaaaagctt aatgcgggtg aggtgctaag tgcatcgcat 21300 ccgcgtgctg gtgcacaaaa aacaccacta caagctgtca ctgcaacgcg tctgttaacc 21360 ccaagtgcca tggtcttcat tgaagatcac cgcattggcg gtaacagtgt gttgccaacg 21420 gtatgcgcca tcgactggat gcgtgaagcg gcaagcgaca tgcttggcgc tcaagttaag 21480 gtacttgatt acaagctatt aaaaggcatt gtatttgaga ctgatgagcc gcaagagtta 21540 acacttgagc taacgccaga cgattcagac gaagctacgc tacaagcatt aatcagctgt 21600 aatgggcgtc cgcaatacaa ggcgacgctt atcagtgata atgccgatat taagcaactt 21660 aacaagcagt ttgatttaag cgctaaggcg attaccacag caaaagagct ttatagcaac 21720 ggcaccttgt tccacggtcc gcgtctacaa gggatccaat ctgtagtgca gttcgatgat 21780 caaggcttaa ttgctaaagt cgctctgcct aaggttgaac ttagcgattg tggtgagttc 21840 ttgccgcaaa cccacatggg tggcagtcaa ccttttgctg aggacttgct attacaagct 21900 atgctggttt gggctcgcct taaaactggc tcggcaagtt tgccatcaag cattggtgag 21960 tttacctcat accaaccaat ggcctttggt gaaactggta ccatagagct tgaagtgatt 22020 aagcacaaca aacgctcact tgaagcgaat gttgcgctat atcgtgacaa cggcgagtta 22080 agtgccatgt ttaagtcagc taaaatcacc attagcaaaa gcttaaattc agcattttta 22140 cctgctgtct tagcaaacga cagtgaggcg aattagtgga acaaacgcct aaagctagtg 22200 cgatgccgct gcgcatcgca cttatcttac tgccaacacc gcagtttgaa gttaactctg 22260 tcgaccagtc agtattagcc agctatcaaa cactgcagcc tgagctaaat gccctgctta 22320 atagtgcgcc gacacctgaa atgctcagca tcactatctc agatgatagc gatgcaaaca 22380 gctttgagtc gcagctaaat gctgcgacca acgcaattaa caatggctat atcgtcaagc 22440 ttgctacggc aactcacgct ttgttaatgc tgcctgcatt aaaagcggcg caaatgcgga 22500 tccatcctca tgcgcagctt gccgctatgc agcaagctaa atcgacgcca atgagtcaag 22560 tatctggtga gctaaagctt ggcgctaatg cgctaagcct agctcagact aatgcgctgt 22620 ctcatgcttt aagccaagcc aagcgtaact taactgatgt cagcgtgaat gagtgttttg 22680 agaacctcaa aagtgaacag cagttcacag aggtttattc gcttattcag caacttgcta 22740 gccgcaccca tgtgagaaaa gaggttaatc aaggtgtgga acttggccct aaacaagcca 22800 aaagccacta ttggtttagc gaatttcacc aaaaccgtgt tgctgccatc aactttatta 22860 atggccaaca agcaaccagc tatgtgctta ctcaaggttc aggattgtta gctgcgaaat 22920 caatgctaaa ccagcaaaga ttaatgttta tcttgccggg taacagtcag caacaaataa 22980 ccgcatcaat aactcagtta atgcagcaat tagagcgttt gcaggtaact gaggttaatg 23040 agctttctct agaatgccaa ctagagctgc tcagcataat gtatgacaac ttagtcaacg 23100 cagacaaact cactactcgc gatagtaagc ccgcttatca ggctgtgatt caagcaagct 23160 ctgttagcgc tgcaaagcaa gagttaagcg cgcttaacga tgcactcaca gcgctgtttg 23220 ctgagcaaac aaacgccaca tcaacgaata aaggcttaat ccaatacaaa acaccggcgg 23280 gcagttactt aaccctaaca ccgcttggca gcaacaatga caacgcccaa gcgggtcttg 23340 cttttgtcta tccgggtgtg ggaacggttt acgccgatat gcttaatgag ctgcatcagt 23400 acttccctgc gctttacgcc aaacttgagc gtgaaggcga tttaaaggcg atgctacaag 23460 cagaagatat ctatcatctt gaccctaaac atgctgccca aatgagctta ggtgacttag 23520 ccattgctgg cgtggggagc agctacctgt taactcagct gctcaccgat gagtttaata 23580 ttaagcctaa ttttgcatta ggttactcaa tgggtgaagc atcaatgtgg gcaagcttag 23640 gcgtatggca aaacccgcat gcgctgatca gcaaaaccca aaccgacccg ctatttactt 23700 ctgctatttc cggcaaattg accgcggtta gacaagcttg gcagcttgat gataccgcag 23760 cggaaatcca gtggaatagc tttgtggtta gaagtgaagc agcgccgatt gaagccttgc 23820 taaaagatta cccacacgct tacctcgcga ttattcaagg ggatacctgc gtaatcgctg 23880 gctgtgaaat ccaatgtaaa gcgctacttg cagcactggg taaacgcggt attgcagcta 23940 atcgtgtaac ggcgatgcat acgcagcctg cgatgcaaga gcatcaaaat gtgatggatt 24000 tttatctgca accgttaaaa gcagagcttc ctagtgaaat aagctttatc agcgccgctg 24060 atttaactgc caagcaaacg gtgagtgagc aagcacttag cagccaagtc gttgctcagt 24120 ctattgccga caccttctgc caaaccttgg actttaccgc gctagtacat cacgcccaac 24180 atcaaggcgc taagctgttt gttgaaattg gcgcggatag acaaaactgc accttgatag 24240 acaagattgt taaacaagat ggtgccagca gtgtacaaca tcaaccttgt tgcacagtgc 24300 ctatgaacgc aaaaggtagc caagatatta ccagcgtgat taaagcgctt ggccaattaa 24360 ttagccatca ggtgccatta tcggtgcaac catttattga tggactcaag cgcgagctaa 24420 cactttgcca attgaccagc caacagctgg cagcacatgc aaatgttgac agcaagtttg 24480 agtctaacca agaccattta cttcaagggg aagtctaatg tcattaccag acaatgcttc 24540 taaccacctt tctgccaacc agaaaggcgc atctcaggca agtaaaacca gtaagcaaag 24600 caaaatcgcc attgtcggtt tagccactct gtatccagac gctaaaaccc cgcaagaatt 24660 ttggcagaat ttgctggata aacgcgactc tcgcagcacc ttaactaacg aaaaactcgg 24720 cgctaacagc caagattatc aaggtgtgca aggccaatct gaccgttttt attgtaataa 24780 aggcggctac attgagaact tcagctttaa tgctgcaggc tacaaattgc cggagcaaag 24840 cttaaatggc ttggacgaca gcttcctttg ggcgctcgat actagccgta acgcactaat 24900 tgatgctggt attgatatca acggcgctga tttaagccgc gcaggtgtag tcatgggcgc 24960 gctgtcgttc ccaactaccc gctcaaacga tctgtttttg ccaatttatc acagcgccgt 25020 tgaaaaagcc ctgcaagata aactaggcgt aaaggcattt aagctaagcc caactaatgc 25080 tcataccgct cgcgcggcaa atgagagcag cctaaatgca gccaatggtg ccattgccca 25140 taacagctca aaagtggtgg ccgatgcact tggccttggc ggcgcacaac taagcctaga 25200 tgctgcctgt gctagttcgg tttactcatt aaagcttgcc tgcgattacc taagcactgg 25260 caaagccgat atcatgctag caggcgcagt atctggcgcg gatcctttct ttattaatat 25320 gggattctca atcttccacg cctacccaga ccatggtatc tcagtaccgt ttgatgccag 25380 cagtaaaggt ttgtttgctg gcgaaggcgc tggcgtatta gtgcttaaac gtcttgaaga 25440 tgccgagcgc gacaatgaca aaatctatgc ggttgttagc ggcgtaggtc tatcaaacga 25500 cggtaaaggc cagtttgtat taagccctaa tccaaaaggt caggtgaagg cctttgaacg 25560 tgcttatgct gccagtgaca ttgagccaaa agacattgaa gtgattgagt gccacgcaac 25620 aggcacaccg cttggcgata aaattgagct cacttcaatg gaaaccttct ttgaagacaa 25680 gctgcaaggc accgatgcac cgttaattgg ctcagctaag tctaacttag gccacctatt 25740 aactgcagcg ggcatgccgg ggatcatgaa gatgatcttc gccatgaaag aaggttacct 25800 gccgccaagt atcaatatta gtgatgctat cgcttcgccg aaaaaactct tcggtaaacc 25860 aaccctgcct agcatggttc aaggctggcc agataagcca tcgaataatc attttggtgt 25920 aagaacccgt cacgcaggcg tatcggtatt tggctttggt ggctgtaacg cccatctgtt 25980 gcttgagtca tacaacggca aaggaacagt aaaggcagaa gccactcaag taccgcgtca 26040 agctgagccg ctaaaagtgg ttggccttgc ctcgcacttt gggcctctta gcagcattaa 26100 tgcactcaac aatgctgtga cccaagatgg gaatggcttt atcgaactgc cgaaaaagcg 26160 ctggaaaggc cttgaaaagc acagtgaact gttagctgaa tttggcttag catctgcgcc 26220 aaaaggtgct tatgttgata acttcgagct ggacttttta cgctttaaac tgccgccaaa 26280 cgaagatgac cgtttgatct cacagcagct aatgctaatg cgagtaacag acgaagccat 26340 tcgtgatgcc aagcttgagc cggggcaaaa agtagctgta ttagtggcaa tggaaactga 26400 gcttgaactg catcagttcc gcggccgggt taacttgcat actcaattag cgcaaagtct 26460 tgccgccatg ggcgtgagtt tatcaacgga tgaataccaa gcgcttgaag ccatcgccat 26520 ggacagcgtg cttgatgctg ccaagctcaa tcagtacacc agctttattg gtaatattat 26580 ggcgtcacgc gtggcgtcac tatgggactt taatggccca gccttcacta tttcagcagc 26640 agagcaatct gtgagccgct gtatcgatgt ggcgcaaaac ctcatcatgg aggataacct 26700 agatgcggtg gtgattgcag cggtcgatct ctctggtagc tttgagcaag tcattcttaa 26760 aaatgccatt gcacctgtag ccattgagcc aaacctcgaa gcaagcctta atccaacatc 26820 agcaagctgg aatgtcggtg aaggtgctgg cgcggtcgtg cttgttaaaa atgaagctac 26880 atcgggctgc tcatacggcc aaattgatgc acttggcttt gctaaaactg ccgaaacagc 26940 gttggctacc gacaagctac tgagccaaac tgccacagac tttaataagg ttaaagtgat 27000 tgaaactatg gcagcgcctg ctagccaaat tcaattagcg ccaatagtta gctctcaagt 27060 gactcacact gctgcagagc agcgtgttgg tcactgcttt gctgcagcgg gtatggcaag 27120 cctattacac ggcttactta acttaaatac tgtagcccaa accaataaag ccaattgcgc 27180 gcttatcaac aatatcagtg aaaaccaatt atcacagctg ttgattagcc aaacagcgag 27240 cgaacaacaa gcattaaccg cgcgtttaag caatgagctt aaatccgatg ctaaacacca 27300 actggttaag caagtcacct taggtggccg tgatatctac cagcatattg ttgatacacc 27360 gcttgcaagc cttgaaagca ttactcagaa attggcgcaa gcgacagcat cgacagtggt 27420 caaccaagtt aaacctatta aggccgctgg ctcagtcgaa atggctaact cattcgaaac 27480 ggaaagctca gcagagccac aaataacaat tgcagcacaa cagactgcaa acattggcgt 27540 caccgctcag gcaaccaaac gtgaattagg taccccacca atgacaacaa ataccattgc 27600 taatacagca aataatttag acaagactct tgagactgtt gctggcaata ctgttgctag 27660 caaggttggc tctggcgaca tagtcaattt tcaacagaac caacaattgg ctcaacaagc 27720 tcacctcgcc tttcttgaaa gccgcagtgc gggtatgaag gtggctgatg ctttattgaa 27780 gcaacagcta gctcaagtaa caggccaaac tatcgataat caggccctcg atactcaagc 27840 cgtcgatact caaacaagcg agaatgtagc gattgccgca gaatcaccag ttcaagttac 27900 aacacctgtt caagttacaa cacctgttca aatcagtgtt gtggagttaa aaccagatca 27960 cgctaatgtg ccaccataca cgccgccagt gcctgcatta aagccgtgta tctggaacta 28020 tgccgattta gttgagtacg cagaaggcga tatcgccaag gtatttggca gtgattatgc 28080 cattatcgac agctactcgc gccgcgtacg tctaccgacc actgactacc tgttggtatc 28140 gcgcgtgacc aaacttgatg cgaccatcaa tcaatttaag ccatgctcaa tgaccactga 28200 gtacgacatc cctgttgatg cgccgtactt agtagacgga caaatccctt gggcggtagc 28260 agtagaatca ggccaatgtg acttgatgct tattagctat ctcggtatcg actttgagaa 28320 caaaggcgag cgggtttatc gactactcga ttgtaccctc accttcctag gcgacttgcc 28380 acgtggcgga gataccctac gttacgacat taagatcaat aactatgctc gcaacggcga 28440 caccctgctg ttcttcttct cgtatgagtg ttttgttggc gacaagatga tcctcaagat 28500 ggatggcggc tgcgctggct tcttcactga tgaagagctt gccgacggta aaggcgtgat 28560 tcgcacagaa gaagagatta aagctcgcag cctagtgcaa aagcaacgct ttaatccgtt 28620 actagattgt cctaaaaccc aatttagtta tggtgatatt cataagctat taactgctga 28680 tattgagggt tgttttggcc caagccacag tggcgtccac cagccgtcac tttgtttcgc 28740 atctgaaaaa ttcttgatga ttgaacaagt cagcaaggtt gatcgcactg gcggtacttg 28800 gggacttggc ttaattgagg gtcataagca gcttgaagca gaccactggt acttcccatg 28860 tcatttcaag ggcgaccaag tgatggctgg ctcgctaatg gctgaaggtt gtggccagtt 28920 attgcagttc tatatgctgc accttggtat gcatacccaa actaaaaatg gtcgtttcca 28980 acctcttgaa aacgcctcac agcaagtacg ctgtcgcggt caagtgctgc cacaatcagg 29040 cgtgctaact taccgtatgg aagtgactga aatcggtttc agtccacgcc catatgctaa 29100 agctaacatc gatatcttgc ttaatggcaa agcggtagtg gatttccaaa acctaggggt 29160 gatgataaaa gaggaagatg agtgtactcg ttatccactt ttgactgaat caacaacggc 29220 tagcactgca caagtaaacg ctcaaacaag tgcgaaaaag gtatacaagc cagcatcagt 29280 caatgcgcca ttaatggcac aaattcctga tctgactaaa gagccaaaca agggcgttat 29340 tccgatttcc catgttgaag caccaattac gccagactac ccgaaccgtg tacctgatac 29400 agtgccattc acgccgtatc acatgtttga gtttgctaca ggcaatatcg aaaactgttt 29460 cgggccagag ttctcaatct atcgcggcat gatcccacca cgtacaccat gcggtgactt 29520 acaagtgacc acacgtgtga ttgaagttaa cggtaagcgt ggcgacttta aaaagccatc 29580 atcgtgtatc gctgaatatg aagtgcctgc agatgcgtgg tatttcgata aaaacagcca 29640 cggcgcagtg atgccatatt caattttaat ggagatctca ctgcaaccta acggctttat 29700 ctcaggttac atgggcacaa ccctaggctt ccctggcctt gagctgttct tccgtaactt 29760 agacggtagc ggtgagttac tacgtgaagt agatttacgt ggtaaaacca tccgtaacga 29820 ctcacgttta ttatcaacag tgatggccgg cactaacatc atccaaagct ttagcttcga 29880 gctaagcact gacggtgagc ctttctatcg cggcactgcg gtatttggct attttaaagg 29940 tgacgcactt aaagatcagc taggcctaga taacggtaaa gtcactcagc catggcatgt 30000 agctaacggc gttgctgcaa gcactaaggt gaacctgctt gataagagct gccgtcactt 30060 taatgcgcca gctaaccagc cacactatcg tctagccggt ggtcagctga actttatcga 30120 cagtgttgaa attgttgata atggcggcac cgaaggttta ggttacttgt atgccgagcg 30180 caccattgac ccaagtgatt ggttcttcca gttccacttc caccaagatc cggttatgcc 30240 aggctcctta ggtgttgaag caattattga aaccatgcaa gcttacgcta ttagtaaaga 30300 cttgggcgca gatttcaaaa atcctaagtt tggtcagatt ttatcgaaca tcaagtggaa 30360 gtatcgcggt caaatcaatc cgctgaacaa gcagatgtct atggatgtca gcattacttc 30420 aatcaaagat gaagacggta agaaagtcat cacaggtaat gccagcttga gtaaagatgg 30480 tctgcgcata tacgaggtct tcgatatagc tatcagcatc gaagaatctg tataaatcgg 30540 agtgactgtc tggctatttt actcaatttc tgtgtcaaaa gtgctcacct atattcatag 30600 gctgcgcgct tttttctgga aattgagcaa aagtatctgc gtcctaactc gatttataag 30660 aatggtttaa ttgaaaagaa caacagctaa gagccgcaag ctcaatataa ataattaagg 30720 gtcttacaaa taatgaatcc tacagcaact aacgaaatgc tttctccgtg gccatgggct 30780 gtgacagagt caaatatcag ttttgacgtg caagtgatgg aacaacaact taaagatttt 30840 agccgggcat gttacgtggt caatcatgcc gaccacggct ttggtattgc gcaaactgcc 30900 gatatcgtga ctgaacaagc ggcaaacagc acagatttac ctgttagtgc ttttactcct 30960 gcattaggta ccgaaagcct aggcgacaat aatttccgcc gcgttcacgg cgttaaatac 31020 gcttattacg caggcgctat ggcaaacggt atttcatctg aagagctagt gattgcccta 31080 ggtcaagctg gcattttgtg ttcgtttgga gcagccggtc ttattccaag tcgcgttgaa 31140 gcggcaatta accgtattca agcagcgctg ccaaatggcc cttatatgtt taaccttatc 31200 catagtccta gcgagccagc attagagcgt ggcagcgtag agctattttt aaagcataag 31260 gtacgcaccg ttgaagcatc agctttctta ggtctaacac cacaaatcgt ctattaccgt 31320 gcagcaggat tgagccgaga cgcacaaggt aaagttgtgg ttggtaacaa ggttatcgct 31380 aaagtaagtc gcaccgaagt ggctgaaaag tttatgatgc cagcgcccgc aaaaatgcta 31440 caaaaactag ttgatgacgg ttcaattacc gctgagcaaa tggagctggc gcaacttgta 31500 cctatggctg acgacatcac tgcagaggcc gattcaggtg gccatactga taaccgtcca 31560 ttagtaacat tgctgccaac cattttagcg ctgaaagaag aaattcaagc taaataccaa 31620 tacgacactc ctattcgtgt cggttgtggt ggcggtgtgg gtacgcctga tgcagcgctg 31680 gcaacgttta acatgggcgc ggcgtatatt gttaccggct ctatcaacca agcttgtgtt 31740 gaagcgggcg caagtgatca cactcgtaaa ttacttgcca ccactgaaat ggccgatgtg 31800 actatggcac cagctgcaga tatgttcgag atgggcgtaa aactgcaggt ggttaagcgc 31860 ggcacgctat tcccaatgcg cgctaacaag ctatatgaga tctacacccg ttacgattca 31920 atcgaagcga tcccattaga cgagcgtgaa aagcttgaga aacaagtatt ccgctcaagc 31980 ctagatgaaa tatgggcagg tacagtggcg cactttaacg agcgcgaccc taagcaaatc 32040 gaacgcgcag agggtaaccc taagcgtaaa atggcattga ttttccgttg gtacttaggt 32100 ctttctagtc gctggtcaaa ctcaggcgaa gtgggtcgtg aaatggatta tcaaatttgg 32160 gctggccctg ctctcggtgc atttaaccaa tgggcaaaag gcagttactt agataactat 32220 caagaccgaa atgccgtcga tttggcaaag cacttaatgt acggcgcggc ttacttaaat 32280 cgtattaact cgctaacggc tcaaggcgtt aaagtgccag cacagttact tcgctggaag 32340 ccaaaccaaa gaatggccta atacacttac aaagcaccag tctaaaaagc cactaatctt 32400 gattagtggc tttttttatt gtggtcaata tgaggctatt tagcctgtaa gcctgaaaat 32460 atcagcactc tgactttaca agcaaattat aattaaggca gggctctact catttatact 32520 gctagcaaac aagcaagttg cccagtaaaa caacaaggta cctgatttat atcgtcataa 32580 aagttggcta gagattcgtt attgatcttt actgattaga gtcgctctgt ttggaaaaag 32640 gtttctcgtt atcatcaaaa tacactctca aacctttaat caattacaac ttaggctttc 32700 tgcgggcatt tttatcttat ttgccacagc tgtatttgcc tttaggtttt gggtgcaact 32760 accattaatt gaggcctcat tagttaaatt atctgagcaa gagctcacct ctttaaatta 32820 cgcttttcag caaatgagaa agccactaca aaccattaat tacgactatg cggtgtggga 32880 cagaacctac agctatatga aatcaaactc agcgagcgct aaaaggtact atgaaaaaca 32940 tgagtaccca gatgatacgt tcaagagttt aaaagtcgac ggagtattta tattcaaccg 33000 tacaaatcag ccagttttta gtaaaggttt taatcataga aatgatatac cgctggtctt 33060 tgaattaact gactttaaac aacatccaca aaacatcgca ttatctccac aaaccaaaca 33120 ggcacaccca ccggcaagta agccgttaga ctcccctgat gatgtgcctt ctacccatgg 33180 ggttatcgcc acacgatacg gtccagcaat ttatagctct accagcattt taaaatctga 33240 tcgtagcggc tcccaacttg gttatttagt cttcattagg ttaattgatg aatggttcat 33300 cgctgagcta tcgcaataca ctgccgcagg tgttgaaatc gctatggctg atgccgcaga 33360 cgcacaatta gcgagattag gcgcaaacac taagcttaat aaagtaaccg ctacatccga 33420 acggttaata actaatgtcg atggtaagcc tctgttgaag ttagtgcttt accataccaa 33480 taaccaaccg ccgccgatgc tagattacag tataataatt ctattagttg agatgtcatt 33540 tttactgatc ctcgcttatt tcctttactc ctacttctta gtcaggccag ttagaaagct 33600 ggcttcagat attaaaaaaa tggataaaag tcgtgaaatt aaaaagctaa ggtatcacta 33660 ccctattact gagctagtca aagttgcgac tcacttcaac gccctaatgg ggacgattca 33720 ggaacaaact aaacagctta atgaacaagt ttttattgat aaattaacca atattcccaa 33780 tcgtcgcgct tttgagcagc gacttgaaac ctattgccaa ctgctagccc ggcaacaaat 33840 tggctttact ctcatcattg ccgatgtgga tcattttaaa gagtacaacg atactcttgg 33900 gcaccttgct ggggatgaag cattaataaa agtggcacaa acactatcgc aacagtttta 33960 ccgtgcagaa gatatttgtg cccgttttgg tggtgaagaa tttattatgt tatttcgaga 34020 catacctgat gagcccttgc agagaaagct cgatgcgatg ctgcactctt ttgcagagct 34080 caacctacct catccaaact catcaaccgc taattacgtt actgtgagcc ttggggtttg 34140 cacagttgtt gctgttgatg attttgaatt taaaagtgag tcgcatatta ttggcagtca 34200 ggctgcatta atcgcagata aggcgcttta tcatgctaaa gcctgtggtc gtaaccagtt 34260 gtcaaaaact actattactg ttgatgagat tgagcaatta gaagcaaata aaatcggtca 34320 tcaagcctaa actcgttcga gtactttccc ctaagtcaga gctatttgcc acttcaagat 34380 gtggctacaa ggcttactct ttcaaaacct gcatcaatag aacacagcaa aatacaataa 34440 tttaagtcaa tttagcctat taaacagagt taatgacagc tcatggtcgc aacttattag 34500 ctatttctag caatataaaa acttatccat tagtagtaac caataaaaaa actaatatat 34560 aaaactattt aatcattatt ttacagatga ttagctacca cccaccttaa gctggctata 34620 ttcgcactag taaaaataaa cattagatcg ggttcagatc aatttacgag tctcgtataa 34680 aatgtacaat aattcactta atttaatact gcatattttt acaagtagag agcggtgatg 34740 aaacaaaata cgaaaggctt tacattaatt gaattagtca tcgtgattat tattctcggt 34800 atacttgctg ctgtggcact gccgaaattc atcaatgttc aagatgacgc taggatctct 34860 gcgatgagcg gtcagttttc atcatttgaa agtgccgtaa aactatacca tagcggttgg 34920 ttagccaaag gctacaacac tgcggttgaa aagctctcag gctttggcca aggtaatgtt 34980 gcatcaagtg acacaggttt tccgtactca acatcaggca cgagtactga tgtgcataaa 35040 gcttgtggtg aactatggca tggcattacc gatacagact tcacaattgg tgcggttagt 35100 gatggcgatc taatgactgc agatgtcgat attgcttaca cctatcgtgg tgatatgtgt 35160 atctatcgcg atctgtattt tattcagcgc tcattaccta ctaaggtgat gaactacaaa 35220 tttaaaactg gtgaaataga aattattgat gctttctaca accctgacgg ctcaactggt 35280 caattaccat aaatttggcg cttatctaag ttgtacttgc tctgaccgac acaaataatg 35340 tcgtttctca gcatatatca aaatacacag caaaaatttg gggttagcta tatagctaac 35400 cccaaatcat atctaacttt acactgcatc taattccaaa cagtatccag ccaaaagcct 35460 aaactattgt tgactcagcg ctaaaatatg cgatgcaaca aacaagtctt ggatcgcaat 35520 acctgagcta tcaaaaatgg tcacctcatc agcactttga cgtcctgttg cggactcgtt 35580 tatcacctga ccaatctcaa ttatcggcgt atttctgcta tgttgaaact caccaataac 35640 aatagattga gaagcaaagt cgcaaaacaa gcgagcatga ctatataggt cagttggcaa 35700 ctcttgctta cccactttat cagcgcccat tgcagaaata tgcgttcctg cttgtaccca 35760 ctgcgcttca aataaaggcg cttgagctgt ggttgctgtg ataataatat ctgcttgttc 35820 acaagcagct tgtgcatcac aagcttcggc attaatgcct ttttctaata aacgcttaac 35880 caagttttca gttttgctag cactacggcc aactaccaat accttagtta atgaacgaac 35940 cttgctcact gctagcactt catattcagc ctgatgaccg gtaccaaaaa cagttaatac 36000 cgtagcatct tctctcgcga ggtaactcac tgctactgca tcggcagcac cagtgcggta 36060 agcattaacg gtagtggcag caatcaccgn ctgcaacata ccggttaatg gatcgagtaa 36120 aaatacgtta gtgccgtggc atggtaaacc atgtttatgg ttatcaggcc aatagctgcc 36180 tgttttccag ccgacaaggt ttggcgttga agccgacttt aatgagaaca tttcattaag 36240 gttcgcgccc tgtgcattaa ctaccgggaa caaggttgct ttatcatcta cggcagcgac 36300 aaacgcttct ttaacagcga tataagccag ctcatgggag atgagctttg atgtttgcgc 36360 ttcagttaaa tagatcatat taccacccct gcactcgatt ccagatctca tagccaccat 36420 tatcaccatc agtatcaaat acatggtact gagcgtgcat tgaagctgtt gcacaggcgt 36480 ggttcggcaa aatatgtaga cgactaccta ccgggaactg cgctaaatca ataacgccgc 36540 catcaactgc ttcaataatg ccgtgctctt gattaacagt tataacctgt agacctgata 36600 acacgtgacc gctgtcgtca cacactaaac cataaccaca atcttttggc tgctctgcag 36660 tacctctatc acccgaaaga gccatccaac ccgcatcaat gaaaatccag tttttatcag 36720 gattatgacc aataacactg gtcactaccg ttgcggcaat atcagttaac tgacacacgt 36780 ttagccctgc catgactaaa tcgaagaagg tgtacacacc cgctctaacc tcggtgatcc 36840 catcaaggtt ttgatagctt tgcgctgttg gtgttgaacc aatactaacg atgtcacatt 36900 gcatacccgc tgcgcgaatg cgtcagcagc ttgtacagcc gctgcaactt cattttgcgc 36960 cgcatcaatt aattgctgtt tttcaaaaca ttgatatgac tcaccagcgt gagtgagtac 37020 gccgtgaaaa ctcgctgcgc cagacgttag tatctgagca atttcaatca acttatcggc 37080 ttccggtgga ataccaccac gatggccatc acaatcaatt tcaattaatg ctggtatttg 37140 gcagtcataa gaaccacaga aatgatttag ctgatgcgct tgctcaacac tatcaagtaa 37200 aactcttgca ttaatacctt ggtccaacat tttagcaata cgcggcaact taccatcggc 37260 aatacctact gcataaataa tgtctgtgta acctttagat gctaaggcct cggcctcttt 37320 taccgttgat acagtgactg gtgagttttt agtgggtaat aaaaactcgg ctgcttcaag 37380 tgatcttaac gttttaaaat gcggtcttag gtttgcacct aatccttcaa ttttttggcg 37440 tagttgactg aggttattaa taaatactgg cttatttaca tataaaaacg gtgtatcaat 37500 tgcttgatac tgactttgct gagtcgtgga aagtatttga gtagatggca tctttaatat 37560 cctagttcat caatcaatct aacaagtttg atgcctagcc acagtggctt gtattcatga 37620 tgctttggaa aatgcttata ttcaaagtat ttgaaagaca tcaaacttct tgtttaatgc 37680 tcagtatcca ccagcacgca tttattttat attaactatt atcaagatat agattaggtt 37740 caaaccaaat gattagtact gaagatctac gttttatcag cgtaatcgcc agtcatcgca 37800 ccttagctga tgccgctaga acactaaata tcacgccacc atcagtgaca ttaaggttgc 37860 agcatattga aaagaaacta tcgattagcc tgatc 37895 <210> 2 <211> 831 <212> DNA <213> Shewanella putrefaciens SCRC-2874 (FERM BP-1625) <400> 1 atg gta aga ggc tat ttg cgc gct tta ttg tca caa cat agt gaa ata 48 Met Val Arg Gly Tyr Leu Arg Ala Leu Leu Ser Gln His Ser Glu Ile 1 5 10 15 cgc ccc aat gaa tgg cgc ttt gaa tat ggc gac aaa ggt aag cct aga 96 Arg Pro Asn Glu Trp Arg Phe Glu Tyr Gly Asp Lys Gly Lys Pro Arg 20 25 30 ttg agt gat gcg caa ttt gct caa acc ggg gtc cac ttt aat gtg agt 144 Leu Ser Asp Ala Gln Phe Ala Gln Thr Gly Val His Phe Asn Val Ser 35 40 45 cat agt gga gat tgg cta tta gta ggc att tgc act gct gat aat aaa 192 His Ser Gly Asp Trp Leu Leu Val Gly Ile Cys Thr Ala Asp Asn Lys 50 55 60 ggc gcc agt cag gca agc aag gag gaa act gac tct gct agt att gag 240 Gly Ala Ser Gln Ala Ser Lys Glu Glu Thr Asp Ser Ala Ser Ile Glu 65 70 75 80 ttt ggc gtc gac att gag cgt tgc cgt aac agc acc aat atc cac tct 288 Phe Gly Val Asp Ile Glu Arg Cys Arg Asn Ser Thr Asn Ile His Ser 85 90 95 att ctt agt cat tat ttc tct gaa tca gaa aag cga gcc ttg tta gcg 336 Ile Leu Ser His Tyr Phe Ser Glu Ser Glu Lys Arg Ala Leu Leu Ala 100 105 110 tta cca gag gcc ttg cag cga gac cgc ttt ttt gat ttg tgg gcg ctc 384 Leu Pro Glu Ala Leu Gln Arg Asp Arg Phe Phe Asp Leu Trp Ala Leu 115 120 125 aag gag tct tac att aaa gcg aaa gga ctt ggg ctggg tcg cta 432 Lys Glu Ser Tyr Ile Lys Ala Lys Gly Leu Gly Leu Ala Leu Ser Leu 130 135 140 aaa tct ttt gcg ttt gac ttc tct gca ctg agc gaa act ttt ctt gga 480 Lys Ser Phe Ala Phe Asp Phe Ser Ala Leu Ser Glu Thr Phe Leu Gly 145 150 155 160 gtt aat gca cct aaa agc ttg agc cat tgt gtt gat att tcc gat gct 528 Val Asn Ala Pro Lys Ser Leu Ser His Cys Val Asp Ile Ser Asp Ala 165 170 175 att gcg gat cac aag gtt gag cat caa ctt aat cag cga cag gtt ttg 576 Ile Ala Asp His Lys Val Glu His Gln Leu Asn Gln Arg Gln Val Leu 180 185 190 tta aaa caa gat att ggt ctt gct tta cta gag tcg agt tct aat aag 624 Leu Lys Gln Asp Ile Gly Leu Ala Leu Leu Glu Ser Ser Ser Asn Lys 195 200 205 cct aac gct gag cca caa aag tct ggt tta ggt ttg att gag gct aaa 672 Pro Asn Ala Glu Pro Gln Lys Ser Gly Leu Gly Leu Ile Glu Ala Lys 210 215 220 gaa cag caa atg aac gct gct gat aat tgg cat tgt tta ctg ggc cat 720 Glu Gln Gln Met Asn Ala Ala Asp Asn Trp His Cys Leu Leu Gly His 225 230 235 240 ctt gat gat agt tat cgt ttt gca ctg agt att ggt cag tgt cag caa 768 Leu Asp Asp Ser Tyr Arg Phe Ala Leu Ser Ile Gly Gln Cys Gln Gln 245 250 255 ata agt att gca gca gaa gaa gtg aat ttt aaa gct gtt gtt cga gct 816 Ile Ser Ile Ala Glu Glu Val Asn Phe Lys Ala Val Val Arg Ala 260 265 270 tca gct aag act agc 831 Ser Ala Lys Thr Ser 275 <210> 3 <211> 277 <212> PRT <400> 1 Met Val Arg Gly Tyr Leu Arg Ala Leu Leu Ser Gln His Ser Glu Ile 1 5 10 15 Arg Pro Asn Glu Trp Arg Phe Glu Tyr Gly Asp Lys Gly Lys Pro Arg 20 25 30 Leu Ser Asp Ala Gln Phe Ala Gln Thr Gly Val His Phe Asn Val Ser 35 40 45 His Ser Gly Asp Trp Leu Leu Val Gly Ile Cys Thr Ala Asp Asn Lys 50 55 60 Gly Ala Ser Gln Ala Ser Lys Glu Glu Thr Asp Ser Ala Ser Ile Glu 65 70 75 80 Phe Gly Val Asp Ile Glu Arg Cys Arg Asn Ser Thr Asn Ile His Ser 85 90 95 Ile Leu Ser His Tyr Phe Ser Glu Ser Glu Lys Arg Ala Leu Leu Ala 100 105 110 Leu Pro Glu Ala Leu Gln Arg Asp Arg Phe Phe Asp Leu Trp Ala Leu 115 120 125 Lys Glu Ser Tyr Ile Lys Ala Lys Gly Leu Gly Leu Ala Leu Ser Leu 130 135 140 Lys Ser Phe Ala Phe Asp Phe Ser Ala Leu Ser Glu Thr Phe Leu Gly 145 150 155 160 Val Asn Ala Pro Lys Ser Leu Ser His Cys Val Asp Ile Ser Asp Ala 165 170 175 Ile Ala Asp His Lys Val Glu His Gln Leu Asn Gln Arg Gln Val Leu 180 185 190 Leu Lys Gln Asp Ile Gly Leu Ala Leu Leu Glu Ser Ser Ser Asn Lys 195 200 205 Pro Asn Ala Gl u Pro Gln Lys Ser Gly Leu Gly Leu Ile Glu Ala Lys 210 215 220 Glu Gln Gln Met Asn Ala Ala Asp Asn Trp His Cys Leu Leu Gly His 225 230 235 240 Leu Asp Asp Ser Tyr Arg Phe Ala Leu Ser Ile Gly Gln Cys Gln Gln 245 250 255 Ile Ser Ile Ala Ala Glu Glu Val Asn Phe Lys Ala Val Val Arg Ala 260 265 270 Ser Ala Lys Thr Ser 275 <210> 4 <211> 864 <212> DNA <213> Shewanella putrefaciens SCRC-2874 (FERM BP-1625) <400> 1 atg gca aaa ata aat agt gaa cac ttg gat gaa gct act att act tcg 48 Met Ala Lys Ile Asn Ser Glu His Leu Asp Glu Ala Thr Ile Thr Ser 1 5 10 15 aat aag tgt acg caa aca gag act gag gct cgg cat aga aat gcc act 96 Asn Lys Cys Thr Gln Thr Glu Thr Glu Ala Arg His Arg Asn Ala Thr 20 25 30 aca aca cct gag atg cgc cga ttc ata caa gag tcg gat ctc agt gtt 144 Thr Thr Pro Glu Met Arg Arg Phe Ile Gln Glu Ser Asp Leu Ser Val 35 40 45 agc caa ctg tct aaa ata tta aat atc agt gaa gct acc gta cgt aag 192 Ser Gln Leu Ser Lys Ile Leu Asn Ile Ser Glu Ala Thr Val Arg Lys 50 55 60 tgg cgc aag cgt gac tct gtc gaa aac tgt cct aat acc ccg cac cat 240 Trp Arg Lys Arg Asp Ser Val Glu Asn Cys Pro Asn Thr Pro His His 65 70 75 80 ctc aat acc acg cta acc cct ttg caa gaa tat gtg gtt gtg ggc ctg 288 Leu Asn Thr Thr Leu Thr Pro Leu Gln Glu Tyr Val Val Val Gly Leu 85 90 95 cgt tat caa ttg aaa atg cca tta gac aga ttg ctc aaa gca acc caa 336 Arg Tyr Gln Leu Lys Met Pro Leu Asp Arg Leu Leu Lys Ala Thr Gln 100 105 110 gag ttt atc aat cca aac gtg tcg cgc tca ggt tta gca aga tgt ttg 384 Glu Phe Ile Asn Pro Asn Val Ser Arg Ser Gly Leu Ala Arg Cys Leu 115 120 125 aag cgt tat ggc gtt tca cgg gtg agt gat atc caag cac gta 432 Lys Arg Tyr Gly Val Ser Arg Val Ser Asp Ile Gln Ser Pro His Val 130 135 140 cca atg cgc tac ttt aat caa att cca gtc act caa ggc agc gat gtg 480 Pro Met Arg Tyr Phe Asn Gln Ile Pro Val Thr Gln Gly Ser Asp Val 145 150 155 160 caa acc tac acc ctg cac tat gaa acg ctg gca aaa acc tta gcc tta 528 Gln Thr Tyr Thr Leu His Tyr Glu Thr Leu Ala Lys Thr Leu Ala Leu 165 170 175 cct agt acc gat ggt gac aat gtg gtg caa gtg gtg tct ctc acc att 576 Pro Ser Thr Asp Gly Asp Asn Val Val Gln Val Val Ser Leu Thr Ile 180 185 190 cca cca aag tta acc gaa gaa gca ccc agt tca att ttg ctc ggc att 624 Pro Pro Lys Leu Thr Glu Glu Ala Pro Ser Ser Ile Leu Leu Gly Ile 195 200 205 gat cct cat agc gac tgg atc tat ctc gac ata tac caa gat ggc aat 672 Asp Pro His Ser Asp Trp Ile Tyr Leu Asp Ile Tyr Gln Asp Gly Asn 210 215 220 aca caa gcc acg aat aga tat atg gct tat gtg cta aaa cac ggg cca 720 Thr Gln Ala Thr Asn Arg Tyr Met Ala Tyr Val Leu Lys His Gly Pro 225 230 235 240 ttc cat tta cga aag tta ctc gtg cgt aac tat cac acc ttt tta cag 768 Phe His Leu Arg Lys Leu Leu Val Arg Asn Tyr His Thr Phe Leu Gln 245 250 255 cgc ttt cct gga gcg acg caa aat cgc cgc ccc tct aaa gat atg cct 816 Arg Phe Pro Gly Ala Thr Gln Asn Arg Arg Pro Ser Lys Asp Met Pro 260 265 270 270 gaa aca atc aac aag acg cct gaa aca cag gca ccc agt gga gac tca 864 Glu Thr Ile Asn Lys Thr Pro Glu Thr Gln Ala Pro Ser Gly Asp Ser 275 280 285 <210> 5 <211> 288 <212> PRT <400> 1 Met Ala Lys Ile Asn Ser Glu His Leu Asp Glu Ala Thr Ile Thr Ser 1 5 10 15 Asn Lys Cys Thr Gln Thr Glu Thr Glu Ala Arg His Arg Asn Ala Thr 20 25 30 Thr Thr Pro Glu Met Arg Arg Phe Ile Gln Glu Ser Asp Leu Ser Val 35 40 45 Ser Gln Leu Ser Lys Ile Leu Asn Ile Ser Glu Ala Thr Val Arg Lys 50 55 60 Trp Arg Lys Arg Asp Ser Val Glu Asn Cys Pro Asn Thr Pro His His 65 70 75 80 Leu Asn Thr Thr Leu Thr Pro Leu Gln Glu Tyr Val Val Val Gly Leu 85 90 95 Arg Tyr Gln Leu Lys Met Pro Leu Asp Arg Leu Leu Lys Ala Thr Gln 100 105 110 Glu Phe Ile Asn Pro Asn Val Ser Arg Ser Gly Leu Ala Arg Cys Leu 115 120 125 Lys Arg Tyr Gly Val Ser Arg Val Ser Asp Ile Gln Ser Pro His Val 130 135 140 Pro Met Arg Tyr Phe Asn Gln Ile Pro Val Thr Gln Gly Ser Asp Val 145 150 155 160 Gln Thr Tyr Thr Leu His Tyr Glu Thr Leu Ala Lys Thr Leu Ala Leu 165 170 175 Pro Ser Thr Asp Gly Asp Asn Val Val Gln Val Val Ser Leu Thr Ile 180 185 190 Pro Pro Lys Leu Thr Glu Glu Ala Pro Ser Ser Ile Leu Leu Gly Ile 195 200 205 Asp Pro His S er Asp Trp Ile Tyr Leu Asp Ile Tyr Gln Asp Gly Asn 210 215 220 Thr Gln Ala Thr Asn Arg Tyr Met Ala Tyr Val Leu Lys His Gly Pro 225 230 235 240 Phe His Leu Arg Lys Leu Leu Val Arg Asn Tyr His Thr Phe Leu Gln 245 250 255 Arg Phe Pro Gly Ala Thr Gln Asn Arg Arg Pro Ser Lys Asp Met Pro 260 265 270 Glu Thr Ile Asn Lys Thr Pro Glu Thr Gln Ala Pro Ser Gly Asp Ser 275 280 285 <210> 6 <211> 8268 <212> DNA <213> Shewanella putrefaciens SCRC-2874 (FERM BP-1625) <400> 1 atg agc cag acc tct aaa cct aca aac tca gca act gag caa gca caa 48 Met Ser Gln Thr Ser Lys Pro Thr Asn Ser Ala Thr Glu Gln Ala Gln 1 5 10 15 gac tca caa gct gac tct cgt tta aat aaa cga cta aaa gat atg cca 96 Asp Ser Gln Ala Asp Ser Arg Leu Asn Lys Arg Leu Lys Asp Met Pro 20 25 30 att gct att gtt ggc atg gcg agt att ttt gca aac tct cgc tat ttg 144 Ile Ala Ile Val Gly Met Ala Ser Ile Phe Ala Asn Ser Arg Tyr Leu 35 40 45 aat aag ttt tgg gac tta atc agc gaa aaa att gat gcg att act gaa 192 Asn Lys Phe Trp Asp Leu Ile Ser Glu Lys Ile Asp Ala Ile Thr Glu 50 55 60 tta cca tca act cac tgg cag cct gaa gaa tat tac gac gca gat aaa 240 Leu Pro Ser Thr His Trp Gln Pro Glu Glu Tyr Tyr Asp Ala Asp Lys 65 70 75 80 acc gca gca gac aaa agc tac tgt aaa cgt ggt ggc ttt ttg cca gat 288 Thr Ala Ala Asp Lys Ser Tyr Cys Lys Arg Gly Gly Phe Leu Pro Asp 85 90 95 gta gac ttc aac cca atg gag ttt ggc ctg ccg cca aac att ttg gaa 336 Val Asp Phe Asn Pro Met Glu Phe Gly Leu Pro Pro Asn Ile Leu Glu 100 105 110 ctg acc gat tca tcg caa cta tta tca ctc atc gtt gct aaa gaa gtg 384 Leu Thr Asp Ser Ser Gln Leu Leu Ser Leu Ile Val Ala Lys Glu Val 115 120 125 ttg gct gat gct aac tta cct gag aat tac gac cgc gat aaa att ggt 432 Leu Ala Asp Ala Asn Leu Pro Glu Asn Tyr Asp Arg Asp Lys Ile Gly 130 135 140 atc acc tta ggt gtc ggc ggt ggt caa aaa att agc cac agc cta aca 480 Ile Thr Leu Gly Val Gly Gly Gly Gln Lys Ile Ser His Ser Leu Thr 145 150 155 160 gcg cgt ctg caa tac cca gta ttg aag aaa gta ttc gcc aat agc ggc 528 Ala Arg Leu Gln Tyr Pro Val Leu Lys Lys Val Phe Ala Asn Ser Gly 165 170 175 att agt gac acc gac agc gaa atg ctt atc aag aaa ttc caa gac caa 576 Ile Ser Asp Thr Asp Ser Glu Met Leu Ile Lys Lys Phe Gln Asp Gln 180 185 190 tat gta cac tgg gaa gaa aac tcg ttc cca ggt tca ctt ggt ayr gtt 624 His Trp Glu Glu Asn Ser Phe Pro Gly Ser Leu Gly Asn Val 195 200 205 att gcg ggc cgt atc gcc aac cgc ttc gat ttt ggc ggc atg aac tgt 672 Ile Ala Gly Arg Ile Ala Asn Arg Phe Asp Phe Gly Gly Met Asn210 215 220 gtg gtt gat gct gcc tgt gct gga tca ctt gct gct atg cgt atg gcg 720 Val Val Asp Ala Ala Cys Ala Gly Ser Leu Ala Ala Met Arg Met Ala 225 230 235 240 cta aca gag cta act gaa ggt cgc tct gaa atg atg atc acc ggt ggt 768 Leu Thr Glu Leu Thr Glu Gly Arg Ser Glu Met Met Ile Thr Gly Gly 245 250 255 gtg tgt act gat aac tca ccc tct atg tat atg agc ttt tca aaa acg 816 Val Cys Thr Asp Asn Ser Pro Ser Met Tyr Met Ser Phe Ser Lys Thr 260 265 270 ccc gcc ttt acc act aac gaa acc att cag cca ttt gat atc gac tca 864 Pro Ala Phe Thr Thr Asn Glu Thr Ile Gln Pro Phe Asp Ile Asp Ser 275 280 285 aaa ggc atg atg att ggt gaa ggt att ggc atg gtg gcg cta aag cgt 912 Lys Gly Met Met Ile Gly Glu Gly Ile Gly Met Val Ala Leu Lys Arg 290 295 300 ctt gaa gat gca gag cgc gat ggc attac ttac 960 Leu Glu Asp Ala Glu Arg Asp Gly Asp Arg Ile Tyr Ser Val Ile Lys 305 310 315 320 ggt gtg ggt gca tca tct gac ggt aag ttt aaa tca atc tat gcc cct 1008 Gly Val Gly Ala Ser Ser Asp Gly Lys Phe Lys Ser Ile Tyr Ala Pro 325 330 335 cgc cca tca ggc caa gct aaa gca ctt aac cgt gcc tat gat gac gca 1056 Arg Pro Ser Gly Gln Ala Lys Ala Leu Asn Arg Ala Tyr Asp Asp Ala 340 345 350 ggt ttt gcg ccg cat acctta ggt cta att gaa gct cac gga aca ggt 1104 Gly Phe Ala Pro His Thr Leu Gly Leu Ile Glu Ala His Gly Thr Gly 355 360 365 act gca gca ggt gac gcg gca gag ttt gcc ggc ctt tgc tca gta ttt 1152 Thr Ala Ala Gly Asp Ala Ala Glu Phe Ala Gly Leu Cys Ser Val Phe 370 375 380 gct gaa ggc aac gat acc aag caa cac att gcg cta ggt tca gtt aaa 1200 Ala Glu Gly Asn Asp Thrys Lys Gln His Ile Ala Leu Gly Ser Val Lys 385 390 395 400 tca caa att ggt cat act aaa tca act gca ggt aca gca ggt tta att 1248 Ser Gln Ile Gly His Thr Lys Ser Thr Ala Gly Thr Ala Gly Leu Ile 405 410 415 aaa gct gct ctt gct ttg cat cac aag gta ctg ccg ccg acc att aac 1296 Lys Ala Ala Leu Ala Leu His His Lys Val Leu Pro Pro Thr Ile Asn 420 425 430 gtt agt cag cca agc cct aaa ctt gat atc gaa aac tca ccg ttt tat 1344 Val Ser Gln Pro Ser Pro Lys Leu Asp Ile Glu Asn Ser Pro Phe Tyr 435 440 445 cta aac act gag act cgt cca tgg tta cca cgt gtt gat ggt acg ccg 1392 Leu Asn Thr Glu Thr Arg Pro Trp Leu Pro Arg Val Asp Gly Thr Pro 450 455 460 cgc cgc gcg ggt att agc tca ttt ggt ttt ggt ggc act aac ttc cat 1440 Arg Arg Ala Gly Ile Ser Ser Phe Gly Phe Gly Gly Thr Asn Phe His 465 470 475 475 480 ttt gta cta gaa gag tac aac caa gaa cac agc cgt act gat gaa 1488 Phe Val Leu Glu Glu Tyr Asn Gln Glu His Ser Arg Thr Asp Ser Glu 485 490 aaa aaa gct aag tat cgt caa cgc caa gtg gcg caa agc ttc ctt gtt agc 1536 Lys Ala Lys Tyr Arg Gln Arg Gln Val Ala Phe Leu Val Ser 500 505 510 gca agc gat aaa gca tcg cta att aac gag tta aac gta cta gca gca 1584 Ala Ser Asp Lys Ala Ser Leu Ile Asn Glu Leu Asn Val Leu Ala Ala 515 520 525 tct gca agc caa gct gag ttt atc ctc aaa gat gca gca gca aac tat 1632 Ser Ala Ser Gln Ala Glu Phe Ile Leu Lys Asp Ala Ala Ala Asn Tyr 530 535 540 ggc gta cgt gag ctt gat aaa aat gca cca cgg atc ggt tta gtt gca 1680 Gly Val Arg Glu Leu Asp Lys Asn Ala Pro Arg Ile Gly Leu Val Ala 545 550 555 560 aac aca gct gaa gag tta gca ggc cta att aag caa gca ctt gcc aaa 1728 Asn Thr Ala Glu Glu Leu Ala Gly Leu Ile Lys Gln Ala Le A Lys 565 570 575 cta gca gct agc gat gat aac gca tgg cag cta cct ggt ggc act agc 1776 Leu Ala Ala Ser Asp Asp Asn Ala Trp Gln Leu Pro Gly Gly Thr Ser 580 585 590 tac cgc gcc gct gca gta gatt ggt gcc gca ctg ttt gct ggc 1824 Tyr Arg Ala Ala Ala Val Glu Gly Lys Val Ala Ala Leu Phe Ala Gly 595 600 605 caa ggt tca caa tat ctc aat atg ggc cgt gac ctt act tgt tat tac 1872 Gln Gly Ser Gln Tyr Le Met Gly Arg Asp Leu Thr Cys Tyr Tyr 610 615 620 cca gag atg cgt cag caa ttt gta act gca gat aaa gta ttt gcc gca 1920 Pro Glu Met Arg Gln Gln Phe Val Thr Ala Asp Lys Val Phe Ala Ala 625 630 630 635 640 aat gat aaa acg ccg tta tcg caa act ctg tat cca aag cct gta ttt 1968 Asn Asp Lys Thr Pro Leu Ser Gln Thr Leu Tyr Pro Lys Pro Val Phe 645 650 655 aat aaa gat gaa tta aag gct caa gaa gcc att ttg acc a at acc gcc 2016 Asn Lys Asp Glu Leu Lys Ala Gln Glu Ala Ile Leu Thr Asn Thr Ala 660 665 670 aat gcc caa agc gca att ggt gcg att tca atg ggt caa tac gat ttg 2064 Asn Ala Gln Ser Ala Ile Gly Ala Ile Ser Met Gly Gln Tyr Asp Leu 675 680 685 ttt act gcg gct ggc ttt aat gcc gac atg gtt gca ggc cat agc ttt 2112 Phe Thr Ala Ala Gly Phe Asn Ala Asp Met Val Ala Gly His Ser Phe 690 695 700 ggt gag cta agt gca ctg tgt gct gca ggt gtt att tca gct gat gac 2160 Gly Glu Leu Ser Ala Leu Cys Ala Ala Gly Val Ile Ser Ala Asp Asp 705 710 715 720 tac tac aag ctg gct ttt gct cgt ggt gag gct atg gca aca aaa gca 2208yr Tyr Lys Leu Ala Phe Ala Arg Gly Glu Ala Met Ala Thr Lys Ala 725 730 735 ccg gct aaa gac ggc gtt gaa gca gat gca gga gca atg ttt gca atc 2256 Pro Ala Lys Asp Gly Val Glu Ala Asp Ala Gly Ala Met Phe Ile 740 745 750 ata acc aag agt gct gca gac ctt gaa acc gtt gaa gcc acc atc gct 2304 Ile Thr Lys Ser Ala Ala Asp Leu Glu Thr Val Glu Ala Thr Ile Ala 755 760 765 aaa ttt gat ggg gtg aaa gtc gct aa c tat aac gcg cca acg caa tca 2352 Lys Phe Asp Gly Val Lys Val Ala Asn Tyr Asn Ala Pro Thr Gln Ser 770 775 780 gta att gca ggc cca aca gca act acc gct gat gcg gct aaa gcg cta 2400 Val Ile Ala Gly Pro Thr Ala Thr Thr Ala Asp Ala Ala Lys Ala Leu 785 790 795 800 act gag ctt ggt tac aaa gcg att aac ctg cca gta tca ggt gca ttc 2448 Thr Glu Leu Gly Tyr Lys Ala Ile Asn Leu Pro Val Ser Gly Ala Phe 805 810 815 cac act gaa ctt gtt ggt cac gct caa gcg cca ttt gct aaa gcg att 2496 His Thr Glu Leu Val Gly His Ala Gln Ala Pro Phe Ala Lys Ala Ile 820 825 830 gac gca gcc aaa ttt act aaa aca agc cga gca cttac tca aat gca 2544 Asp Ala Ala Lys Phe Thr Lys Thr Ser Arg Ala Leu Tyr Ser Asn Ala 835 840 845 act ggc gga ctt tat gaa agc act gct gca aag att aaa gcc tcg ttt 2592 Thr Gly Gly Leu Tyr Glu Ser Thr Ala Ala Lys Ile Lys Ala Ser Phe 850 855 860 aag aaa cat atg ctt caa tca gtg cgc ttt act agc cag cta gaa gcc 2640 Lys Lys His Met Leu Gln Ser Val Arg Phe Thr Ser Gln Leu Glu Ala 865 870 875 875 880 atg tac aac gac ggc gcc cgt gta ttt gtt gaa ttt ggt cca aag aac 2688 Met Tyr Asn Asp Gly Ala Arg Val Phe Val Glu Phe Gly Pro Lys Asn 885 890 895 atc tta caa aaa tta gtt caa ggc acg ctt gtc aac gaa aat Ile Leu Gln Lys Leu Val Gln Gly Thr Leu Val Asn Thr Glu Asn Glu 900 905 910 gtt tgc act atc tct atc aac cct aat cct aaa gtt gat agt gat ctg 2784 Val Cys Thr Ile Ser Ile Asn Pro Asn Pro Lys Val Asp Ser Asp Leu 915 920 925 cag ctt aag caa gca gca atg cag cta gcg gtt act ggt gtg gta ctc 2832 Gln Leu Lys Gln Ala Ala Met Gln Leu Ala Val Thr Gly Val Val Leu 930 935 940 agt gaa att gac cca tac caa gcc att gcc gca cca gcg aaa aag 2880 Ser Glu Ile Asp Pro Tyr Gln Ala Asp Ile Ala Ala Pro Ala Lys Lys 945 950 955 960 tcg cca atg agc att tcg ctt aat gct gct aac cat atc agc aaa gca 2928 Ser Pro Met Ser Ser Leu Asn Ala Ala Asn His Ile Ser Lys Ala 965 970 975 act cgc gct aag atg gcc aag tct tta gag aca ggt atc gtc acc tcg 2976 Thr Arg Ala Lys Met Ala Lys Ser Leu Glu Thr Gly Ile Val Thr Ser 980 985 990 caa ata gaa cat gtt att gaa gaa aaa atc gtt gaa gtt gag aaa ctg 3024 Gln Ile Glu His Val Ile Glu Glu Lys Ile Val Glu Val Glu Lys Leu 995 1000 1005 gtt gaa gtc gaa aag atc gtc gga aaa gta gag aaa gtt 3072 Val Glu Val Glu Lys Ile Val Glu Lys Val Val Glu Val Glu Lys Val 1010 1015 1020 gtt gag gtt gaa gct cct gtt aat tca gtg caa gcc aat gca att caa 3120 Val Glu Val Glu Ala Pro Val Asn Ser Val Gln Ala Asn Ala Ile Gln 1025 1030 1035 1040 acc cgt tca gtt gtc gct cca gta ata gag aac caa gtc gtg tct aaa 3168 Thr Arg Ser Val Val Ala Pro Val Ile Glu Asn Gln Val Val Ser Lys 1045 1050 1055 aac agt aag cca gca gtc cag agc att agt ggt gat gca ctc agc aac 3216 Asn Ser Lys Pro Ala Val Gln Ser Ile Ser Gly Asp Ala Leu Ser Asn 1060 1065 1070 ttt ttt gct gca cag cag caa acc gca cag ttg cat cag cag ttc tta 3264 Phe Phe Ala Ala Gln Gln Gln Thr Ala Gln Leu His Gln Gln Phe Leu 1075 1080 1085 gct att ccg cag caa tat ggt gag acg ttc act acg ctg atg acc gag 3312 Ala Ile Pro Gln Gln Tyr Gly Glu Thr Phe Thr Thr Leu Met Thr Glu 1090 1095 1100 caa gct aaa ctg gca agt tct ggt gtt gca att cca gag agt ctg caa 3360 Gln Ala Lys Leu Ala Ser Ser Gly Val Ala Ile Pro Glu Ser Leu Gln 1105 1110 1115 1120 cgc tca atg gag caa ttc cac caa cta caa gcg caa aca cta caa agc 3408 Arg Ser Met Glu Gln Phe His Gln Leu Gln Ala Gln Thr Leu Gln Ser 1125 1130 1135 cac acc cag ttc ctt gag atg caa gcg ggt agc aac att gca 3456 His Thr Gln Phe Leu Glu Met Gln Ala Gly Ser Asn Ile Ala Ala Leu 1140 1145 1150 aac cta ctc aat agc agc caa gca act tac gct cca gcc att cac aat 3504 Asn Leu Leu Asn Ser Ser Gln Ala Thr Tyr Ala Pro Ala Ile His Asn 1155 1160 1165 gaa gcg att caa agc caa gtg gtt caa agc caa act gca gtc cag cca 3552 Glu Ala Ile Gln Ser Gln Val Val Gln Ser Gln Thr Ala Val Gln Pro 1170 1175 1180 gta att tca aca caa gtt aac cat gtg tca gag cag cca act caa gct 3600 Val Ile Ser Thr Gln Val Asn His Val Ser Glu Gln Pro Thr Gln Ala 1185 1190 1195 1200 cca gct cca aaa gcg cag cca gca cct gtg aca act cca g tt caa act 3648 Pro Ala Pro Lys Ala Gln Pro Ala Pro Val Thr Thr Pro Val Gln Thr 1205 1210 1215 gct ccg gca caa gtt gtt cgt caa gcc gca cca gtt caa gcc gct att 3696 Ala Pro Ala Gln Val Val Arg Gln Ala Ala Pro Val Gln Ala Ala Ile 1220 1225 1230 gaa ccg att aat aca agt gtt gcg act aca acg cct tca gcc ttc agc 3744 Glu Pro Ile Asn Thr Ser Val Ala Thr Thr Thr Thr Pro Ser Ala Phe Ser 1235 1240 1245 gcc gaa aca gcc ctg agc gca aca aaa gtc caa gcc act atg ctt gaa 3792 Ala Glu Thr Ala Leu Ser Ala Thr Lys Val Gln Ala Thr Met Leu Glu 1250 1255 1260 gtg gtt gct gag aaa acc ggt tac cca act gaa atg cta gag ctt gaa 3840 Val Ala Glu Lys Thr Gly Tyr Pro Thr Glu Met Leu Glu Leu Glu 1265 1270 1275 1280 atg gat atg gaa gcc gat tta ggc atc gat tct atc aag cgt gta gaa 3888 Met Asp Met Glu Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg Glu 1285 1290 1295 att ctt ggc aca gta caa gat gag cta ccg ggt cta cct gag ctt agc 3936 Ile Leu Gly Thr Val Gln Asp Glu Leu Pro Gly Leu Pro Glu Leu Ser 1300 1305 1310 cct gaga gat cta gct gag tgt cga acg cta ggc gaa atc gtt gac tat 3984 Pro Glu Asp Leu Ala Glu Cys Arg Thr Leu Gly Glu Ile Val Asp Tyr 1315 1320 1325 atg ggc agt aaa ctg ccg gct gaa ggc tct 40 atg aat tct ca Met Gly Ser Lys Leu Pro Ala Glu Gly Ser Met Asn Ser Gln Leu Ser 1330 1335 1340 aca ggt tcc gca gct gcg act cct gca gcg aat ggt ctt tct gcg gag 4080 Thr Gly Ser Ala Ala Ala Thr Pro Ala Ala Asn Gly Leu Ser Ala Glu 1345 1350 1355 1360 aaa gtt caa gcg act atg atg tct gtg gtt gcc gaa aag act ggc tac 4128 Lys Val Gln Ala Thr Met Met Ser Val Val Ala Glu Lys Thr Gly Tyr 1365 1370 1375 cca act gaa atg cta gag ctt gaa atg gat atg gaa gcc gat tta ggc 4176 Pro Thr Glu Met Leu Glu Leu Glu Met Asp Met Glu Ala Asp Leu Gly 1380 1385 1390 ata gat tct atc aag cgc gtt gaa att ctt ggc aca gta caa gat gag 4224 Ile Asp Arg Val Glu Ile Leu Gly Thr Val Gln Asp Glu 1395 1400 1405 cta ccg ggt cta cct gag ctt agc cct gaa gat cta gct gag tgt cgt 4272 Leu Pro Gly Leu Pro Glu Leu Ser Pro Glu Asp Leu A la Glu Cys Arg 1410 1415 1420 act cta ggc gaa atc gtt gac tat atg aac tct aaa ctc gct gac ggc 4320 Thr Leu Gly Glu Ile Val Asp Tyr Met Asn Ser Lys Leu Ala Asp Gly 1425 1430 1435 1440 tct aag ctggag ggc tct atg aat tct cag ctg tct aca agt 4368 Ser Lys Leu Pro Ala Glu Gly Ser Met Asn Ser Gln Leu Ser Thr Ser 1445 1450 1455 gcc gca gct gcg act cct gca gcg aat ggt ctc tct gcg gag aaa gtt 4416 Ala Ala Ala Ala Thr Pro Ala Ala Asn Gly Leu Ser Ala Glu Lys Val 1460 1465 1470 caa gcg act atg atg tct gtg gtt gcc gaa aag act ggc tac cca act 4464 Gln Ala Thr Met Met Ser Val Val Ala Glu Lys Thr Gly Tyr Pro Thr 1475 1480 1485 gaa atg cta gaa ctt gaa atg gat atg gaa gct gac ctt ggc atc gat 4512 Glu Met Leu Glu Leu Glu Met Asp Met Glu Ala Asp Leu Gly Ile Asp 1490 1495 1500 tca atc aag cgc gtt gaa att g gat gag cta ccg 4560 Ser Ile Lys Arg Val Glu Ile Leu Gly Thr Val Gln Asp Glu Leu Pro 1505 1510 1515 1520 ggt tta cct gag cta aat cca gaa gat ttg gca gag tgt cgt act ctt 4608 Gly Leu Pro Glu Leu Asn Pro Glu Asp Leu Ala Glu Cys Arg Thr Leu 1525 1530 1535 ggc gaa atc gtg act tat atg aac tct aaa ctc gct gac ggc tct aag 4656 Gly Glu Ile Val Thr Tyr Met Asn Ser Lys Leu Ala Asp Gly Lys 1540 1545 1550 ctg cca gct gaa ggc tct atg cac tat cag ctg tct aca agt acc gct 4704 Leu Pro Ala Glu Gly Ser Met His Tyr Gln Leu Ser Thr Ser Thr Ala 1555 1560 1565 gct gcg act cct gta gcg aat ggt ctc tct gca gaa aaa gtt caa gcg 4752 Ala Ala Thr Pro Val Ala Asn Gly Leu Ser Ala Glu Lys Val Gln Ala 1570 1575 1580 acc atg atg tct gta gtt gca gat aaa act ggc tac cca act gaa atg 4800 Thr Met Met Ser Val Val Ala Asp Lys Thr Gly Tyr Pro Thr Glu Met 1585 1590 1595 1600 ctt gaa ctt gaa atg gat atg gaa gcc gat tta ggt atc gat tct atc 4848 Leu Glu Leu Glu Met Asp Met Glu Ala Asp Leu Gly Ile Asp Ser Ile 1605 1610 a cgc gtt gaa att ctt ggc aca gta caa gat gag cta ccg ggt tta 4896 Lys Arg Val Glu Ile Leu Gly Thr Val Gln Asp Glu Leu Pro Gly Leu 1620 1625 1630 cct gag cta aat cca gaa gat cta gca gag tgt cgc acc cta ggc gaa 4944 Pro Glu Leu Asn Pro Glu Asp Leu Ala Glu Cys Arg Thr Leu Gly Glu 1635 1640 1645 atc gtt gac tat atg ggc agt aaa ctg ccg gct gaa ggc tct gct aat Tap Ast 4992yr Val Gly Ser Lys Leu Pro Ala Glu Gly Ser Ala Asn 1650 1655 1660 aca agt gcc gct gcg tct ctt aat gtt agt gcc gtt gcg gcg cct caa 5040 Thr Ser Ala Ala Ala Ala Ser Leu Asn Val Ser Ala Val Ala Ala Pro Gln 1665 1670 1675 1680 gct gct gcg act cct gta tcg aac ggt ctc tct gca gag aaa gtg caa 5088 Ala Ala Ala Thr Pro Val Ser Asn Gly Leu Ser Ala Glu Lys Val Gln 1685 1690 1695 agc act atg atg tca gta gtt gca gaa aag acc ggc tac cca act gaa 5136 Ser Thr Met Met Ser Val Val Ala Glu Lys Thr Gly Tyr Pro Thr Glu 1700 1705 1710 atg cta gaa ctt ggc atg gat atg gaa gcc gat tta ggt atc gac tca 5184 Met Leu Glu Leu Gly Met Asp Met Glu Ala Asp Leu Gly Ile Asp Ser 1715 1720 1725 att aaa cgc gtt gag att ctt ggc aca gta caa gat gag cta ccg ggt 5232 Ile Lys Arg Val Glu Ile Leu Gly Thr Val Gln Asp Glu Leu Pro Gly 1730 1735 1740 cta cca gag ctt aat cct gaa gat tta gct gag tgc cgt acg ctg ggc 5280 Leu Pro Glu Leu Asn Pro Glu Asp Leu Ala Glu Cys Arg Thr Leu Gly 1745 1750 1755 1760 gaa atc gtt gac tat atg act gct ag tct gac ggc tct aag ctt 5328 Glu Ile Val Asp Tyr Met Asn Ser Lys Leu Ala Asp Gly Ser Lys Leu 1765 1770 1775 cca gct gaa ggc tct gct aat aca agt gcc act gct gcg act cct gca 5376 Pro Ala Glu Gly Ser Ala Asn Thr Ser Ala Thr Ala Ala Thr Pro Ala 1780 1785 1790 gtg aat ggt ctt tct gct gac aag gta cag gcg act atg atg tct gta 5424 Val Asn Gly Leu Ser Ala Asp Lys Val Gln Ala Thr Met Met Ser Val 1795 1800 1805 gtt gct gaa aag acc ggc tac cca act gaa atg cta gaa ctt ggc atg 5472 Val Ala Glu Lys Thr Gly Tyr Pro Thr Glu Met Leu Glu Leu Gly Met 1810 1815 1820 gat atg gaa gca gac ctt ggt att gat tct att aag cgc gtt gtt ata Asp Met Glu Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile 1825 1830 1835 1840 ctt ggc aca gta caa gat gag ctc cca ggt tta cct gag ctt aat cct 5568 Leu Gly Thr Val Val Gln Asp Glu Leu Pro Gly Leu Pro Glu Leu Asn Pro 1845 1850 1855 gaa gat ctc gct gag tgc cgc acg ctt ggc gaa atc gtt agc tat atg 5616 Glu Asp Leu Ala Glu Cys Arg Thr Leu Gly Glu Ile Val Ser Tyr Met 1860 1870 tct caa ctg gct gat ggc tct aaa ctt tct aca agt gcg gct gaa 5664 Asn Ser Gln Leu Ala Asp Gly Ser Lys Leu Ser Thr Ser Ala Ala Glu 1875 1880 1885 ggc tct gct gat aca agt gct gca aat gct gca aagcag g att 5712 Gly Ser Ala Asp Thr Ser Ala Ala Asn Ala Ala Lys Pro Ala Ala Ile 1890 1895 1900 tcg gca gaa cca agt gtt gag ctt cct cct cat agc gag gta gcg cta 5760 Ser Ala Glu Pro Ser Val Glu Leu Pro Pro His Ser Glu Val Ala Leu 1905 1910 1915 1920 aaa aag ctt aat gcg gcg aac aag cta gaa aat tgt ttc gcc gca gac 5808 Lys Lys Leu Asn Ala Ala Asn Lys Leu Glu Asn Cys Phe Ala Ala Asp 1925 1930 1935 gca ag tgt gat gat ggt cac aac gca ggc gtt tta gct 5856 Ala Ser Val Val Ile Asn Asp Asp Gly His Asn Ala Gly Val Leu Ala 1940 1945 1950 gag aaa ctt att aaa caa ggc cta aaa gta gcc gtt gt g cgt tta ccg 5904 Glu Lys Leu Ile Lys Gln Gly Leu Lys Val Ala Val Val Arg Leu Pro 1955 1960 1965 aaa ggt cag cct caa tcg cca ctt tca agc gat gtt gct agc ttt gag 5952 Lys Gly Gln Pro Gln Ser Pro Leu Ser Ser Asp Val Ala Ser Phe Glu 1970 1975 1980 ctt gcc tca agc caa gaa tct gag ctt gaa gcc agt atc act gca gtt 6000 Leu Ala Ser Ser Gln Glu Ser Glu Leu Glu Ala Ser Ile Throra Ala Val 1985 1990 1995 2000 atc gcg cag att gaa act cag gtt ggc gct att ggt ggc ttt att cac 6048 Ile Ala Gln Ile Glu Thr Gln Val Gly Ala Ile Gly Gly Phe Ile His 2005 2010 2015 ttg caa cca gaa gcg aat aca gaa gag caa acg gca gta a96 cta gat Leu Gln Pro Glu Ala Asn Thr Glu Glu Gln Thr Ala Val Asn Leu Asp 2020 2025 2030 gcg caa agt ttt act cac gtt agc aat gcg ttc ttg tgg gcc aaa tta 6144 Ala Gln Ser Phe Thr His Val Ser Asn Ala Phe Leu Trp Ala Lys Leu 2035 2040 2045 ttg caa cca aag ctc gtt gct gga gca gat gcg cgt cgc tgt ttt gta 6192 Leu Gln Pro Lys Leu Val Ala Gly Ala Asp Ala Arg Arg Cys Phe Val 2050 2055 2060 aca gta a gc cgt atc gac ggt ggc ttt ggt tac cta aat act gac gcc 6240 Thr Val Ser Arg Ile Asp Gly Gly Phe Gly Tyr Leu Asn Thr Asp Ala 2065 2070 2075 2080 cta aaa gat gct gag cta aac caa gca gca tta gct ggt tta act aaa 6288 Leu Lys Asp Ala Glu Leu Asn Gln Ala Ala Leu Ala Gly Leu Thr Lys 2085 2090 2095 acc tta agc cat gaa tgg cca caa gtg ttc tgt cgc gcg cta gat att 6336 Thr Leu Ser His Glu Trp Pro Gln Val Phe Cys Ar Ala Leu Asp Ile 2100 2105 2110 gca aca gat gtt gat gca acc cat ctt gct gat gca atc acc agt gaa 6384 Ala Thr Asp Val Asp Ala Thr His Leu Ala Asp Ala Ile Thr Ser Glu 2115 2120 2125 cta ttt gat agc caa gct cag cta cct gaa gtg ggc tta agc tta att 6432 Leu Phe Asp Ser Gln Ala Gln Leu Pro Glu Val Gly Leu Ser Leu Ile 2130 2135 2140 gat ggc aaa gtt aac cgc gta act cta gtt gct gct gaa gct gca gat Lys Asn Arg Val Thr Leu Val Ala Ala Glu Ala Ala Asp 2145 2150 2155 2160 aaa aca gca aaa gca gag ctt aac agc aca gat aaa atc tta gtg act 6528 Lys Thr Ala Lys Ala Glu Leu Asn Ser Thr As p Lys Ile Leu Val Thr 2165 2170 2175 ggt ggg gca aaa ggg gtg aca ttt gaa tgt gca ctg gca tta gca tct 6576 Gly Gly Ala Lys Gly Val Thr Phe Glu Cys Ala Leu Ala Leu Ala Ser 2180 2185 2190 cgc agc cag tc ttt atc tta gct ggg cgc agt gaa tta caa gct 6624 Arg Ser Gln Ser His Phe Ile Leu Ala Gly Arg Ser Glu Leu Gln Ala 2195 2200 2205 tta cca agc tgg gct gag ggt aag caa act agc gag cta aaa tca gct 6672u Ser Trp Ala Glu Gly Lys Gln Thr Ser Glu Leu Lys Ser Ala 2210 2215 2220 gca atc gca cat att att tct act ggt caa aag cca acg cct aag caa 6720 Ala Ile Ala His Ile Ile Ser Thr Gly Gln Lys Pro Thr Pro Lys Gln 2225 2230 2235 2240 gtt gaa gcc gct gtg tgg cca gtg caa agc agc att gaa att aat gcc 6768 Val Glu Ala Ala Val Trp Pro Val Gln Ser Ser Ile Glu Ile Asn Ala 2245 2250 2255 gcc cta gcc gcc ttt ac gcc gtt gg tca gct gaa tac gtc agc 6816 Ala Leu Ala Ala Phe Asn Lys Val Gly Ala Ser Ala Glu Tyr Val Ser 2260 2265 2270 atg gat gtt acc gat agc gcc gca atc aca gca gca ctt aat ggt cgc 6864 Met Asp Val Thr Asp Ser Ala Ala Ile Thr Ala Ala Leu Asn Gly Arg 2275 2280 2285 tca aat gag atc acc ggt ctt att cat ggc gca ggt gta cta gcc gac 6912 Ser Asn Glu Ile Thr Gly Leu Ile His Gly Ala Gly Val Leu Ala Asp 2290 2295 2300 aag cat att caa gac aag act ctt gct gaa ctt gct aaa gtt tat ggc 6960 Lys His Ile Gln Asp Lys Thr Leu Ala Glu Leu Ala Lys Val Tyr Gly 2305 2310 2315 2320 act aaa gtc aac ggc cta aaa g ctg ctc gcg gca ctt gag cca agc 7008 Thr Lys Val Asn Gly Leu Lys Ala Leu Leu Ala Ala Leu Glu Pro Ser 2325 2330 2335 aaa att aaa tta ctt gct atg ttc tca tct gca gca ggt ttt tac ggt 7056 Lys Leu Lys Ala Met Phe Ser Ser Ala Ala Gly Phe Tyr Gly 2340 2345 2350 aat atc ggc caa agc gat tac gcg atg tcg aac gat att ctt aac aag 7104 Asn Ile Gly Gln Ser Asp Tyr Ala Met Ser Asn Asp Ile Leu Asn Lys 2355 2360 gca gcg ctg cag ttc acc gct cgc aac cca caa gct aaa gtc atg agc 7152 Ala Ala Leu Gln Phe Thr Ala Arg Asn Pro Gln Ala Lys Val Met Ser 2370 2375 2380 ttt aac tgg ggt cct tgg gat ggc ggc atg gtt aac cca gcg ctt aaa 7200 Phe Asn Trp Gly Pro Trp Asp Gly Gly Met Val Asn Pro Ala Leu Lys 2385 2390 2395 2400 aag atg ttt acc gag cgt ggt gtg tac gtt att cca cta aaa gca ggt 7248 Lys Thr Glu Arg Gly Val Tyr Val Ile Pro Leu Lys Ala Gly 2405 2410 2415 gca gag cta ttt gcc act cag cta ttg gct gaa act ggc gtg cag ttg 7296 Ala Glu Leu Phe Ala Thr Gln Leu Leu Ala Glu Thr Gly Val Gln Leu 2420 2425 2430 ctc att ggt acg tca atg caa ggt ggc agc gac act aaa gca act gag 7344 Leu Ile Gly Thr Ser Met Gln Gly Gly Ser Asp Thr Lys Ala Thr Glu 2435 2440 2445 act gct tct gta aaa aag ctt aat gcg ggt gag gtg cta agt gca tcg 7392 Thr Ala Ser Val Lys Lys Leu Asn Ala Gly Glu Val Leu Ser Ala Ser 2450 2455 2460 cat ccg cgt gct ggt gca caa aaa aca cca cta caa gct gtc act gca 7440 His Pro Arg Ala Gly Ala Gln Lys Thr Pro Leu Gln Ala Val Thr Ala 2465 2470 2475 2480 acg cgt ctg tta acc cca agt gcc atg gtc ttc att gaa gat cac cgc 7488 Thr Arg Leu Leu Thr Pro Ser Ala Met Val Phe Ile Glu Asp His A rg 2485 2490 2495 att ggc ggt aac agt gtg ttg cca acg gta tgc gcc atc gac tgg atg 7536 Ile Gly Gly Asn Ser Val Leu Pro Thr Val Cys Ala Ile Asp Trp Met 2500 2505 2510 cgt gaa gcg gca agc gc atg ctt caa gtt aag gta ctt gat 7584 Arg Glu Ala Ala Ser Asp Met Leu Gly Ala Gln Val Lys Val Leu Asp 2515 2520 2525 tac aag cta tta aaa ggc att gta ttt gag act gat gag ccg caa gag 7632 Tyr Lys Leu Leu Lys Gly Val Phe Glu Thr Asp Glu Pro Gln Glu 2530 2535 2540 tta aca ctt gag cta acg cca gac gat tca gac gaa gct acg cta caa 7680 Leu Thr Leu Glu Leu Thr Pro Asp Asp Ser Asp Glu Ala Thr Leu Gln 2545 2550 2555 2560 gca tta atc agc tgt aat ggg cgt ccg caa tac aag gcg acg ctt atc 7728 Ala Leu Ile Ser Cys Asn Gly Arg Pro Gln Tyr Lys Ala Thr Leu Ile 2565 2570 2575 agt gat aat gcc gat att aag caa ctt aac aag cag ttt gat agc 7776 Ser Asp Asn Ala Asp Ile Lys Gln Leu Asn Lys Gln Phe Asp Leu Ser 2580 2585 2590 gct aag gcg att acc aca gca aaa gag ctt tat agc aac ggc acc ttg 7824 Ala Lys Ala Ile Thr Thr Ala Lys Glu Leu Tyr Ser Asn Gly Thr Leu 2595 2600 2605 ttc cac ggt ccg cgt cta caa ggg atc caa tct gta gtg cag ttc gat 7872 Phe His Gly Pro Arg Leu Gln Gly Ile Gln Ser Val Val Gln Phe Asp 2610 2615 2620 gat caa ggc tta att gct aaa gtc gct ctg cct aag gtt gaa ctt agc 7920 Asp Gln Gly Leu Ile Ala Lys Val Ala Leu Pro Lys Val Glu Leu Ser 2625 2630 2635 2640 gat tgt ggt gag ttc ttg ccg caa acc cac agt caa cct 7968 Asp Cys Gly Glu Plu Leu Pro Gln Thr His Met Gly Gly Ser Gln Pro 2645 2650 2655 ttt gct gag gac ttg cta tta caa gct atg ctg gtt tgg gct cgc ctt 8016 Phe Ala Glu Asp Leu Leu Leu Gln Ala Mla Leu Val Trp Ala Arg Leu 2660 2665 2670 aaa act ggc tcg gca agt ttg cca tca agc att ggt gag ttt acc tca 8064 Lys Thr Gly Ser Ala Ser Leu Pro Ser Ser Ile Gly Glu Phe Thr Ser 2675 2680 2685 tac caa cca atg gcc ttt ggt gaa act ggt acc ata gag ctt gaa gtg 8112 Tyr Gln Pro Met Ala Phe Gly Glu Thr Gly Thr Ile Glu Leu Glu Val 2690 2695 2700 att aag cac aac aaa cgc tca ctt gaa gcg aat gtt gcg cta tat cgt 8160 Ile Lys His Asn Lys Arg Ser Leu Glu Ala Asn Val Ala Leu Tyr Arg 2705 2710 2715 2720 gac aac ggc gag tta agt gcc atg ttt aag tca gct aaa atc acc att 8208 Asp Asn Gly Glu Leu Ser Ala Met Phe Lys Ser Ala Lys Ile Thr Ile 2725 2730 2735 agc aaa agc tta aat tca gca ttt tta cct gct gtc tta gca aac gac 8256 Ser Lys Ser Leu Asn Ser Ala Phe Leu Pro Ala Val Leu Ala Asn Asp 2740 2745 2750 agt gag gg aat 8268 Ser Glu Ala Asn 2755 <210> 7 <211> 2756 <212> PRT <400> 1 Met Ser Gln Thr Ser Lys Pro Thr Asn Ser Ala Thr Glu Gln Ala Gln 1 5 10 15 Asp Ser Gln Ala Asp Ser Arg Leu Asn Lys Arg Leu Lys Asp Met Pro 20 25 30 Ile Ala Ile Val Gly Met Ala Ser Ile Phe Ala Asn Ser Arg Tyr Leu 35 40 45 Asn Lys Phe Trp Asp Leu Ile Ser Glu Lys Ile Asp Ala Ile Thr Glu 50 55 60 Leu Pro Ser Thr His Trp Gln Pro Glu Glu Tyr Tyr Asp Ala Asp Lys 65 70 75 80 Thr Ala Ala Asp Lys Ser Tyr Cys Lys Arg Gly Gly Phe Leu Pro Asp 85 90 95 Val Asp Phe Asn Pro Met Glu Phe Gly Leu Pro Pro Asn Ile Leu Glu 100 105 110 Leu Thr Asp Ser Ser Gln Leu Leu Ser Leu Ile Val Ala Lys Glu Val 115 120 125 Leu Ala Asp Ala Asn Leu Pro Glu Asn Tyr Asp Arg Asp Lys Ile Gly 130 135 140 Ile Thr Leu Gly Val Gly Gly Gly Gln Lys Ile Ser His Ser Leu Thr 145 150 155 160 Ala Arg Leu Gln Tyr Pro Val Leu Lys Lys Val Phe Ala Asn Ser Gly 165 170 175 Ile Ser Asp Thr Asp Ser Glu Met Leu Ile Lys Lys Phe Gln Asp Gln 180 185 190 Tyr Val His Trp Glu Glu Asn Ser Phe Pro Gly Ser Leu Gly Asn Val 195 200 205 Ile Ala Gly Ar g Ile Ala Asn Arg Phe Asp Phe Gly Gly Met Asn Cys 210 215 220 Val Val Asp Ala Ala Cys Ala Gly Ser Leu Ala Ala Met Arg Met Ala 225 230 235 240 Leu Thr Glu Leu Thr Glu Gly Arg Ser Glu Met Met Ile Thr Gly Gly 245 250 255 Val Cys Thr Asp Asn Ser Pro Ser Met Tyr Met Ser Phe Ser Lys Thr 260 265 270 Pro Ala Phe Thr Thr Asn Glu Thr Ile Gln Pro Phe Asp Ile Asp Ser 275 280 285 Lys Gly Met Met Ile Gly Glu Gly Ile Gly Met Val Ala Leu Lys Arg 290 295 300 Leu Glu Asp Ala Glu Arg Asp Gly Asp Arg Ile Tyr Ser Val Ile Lys 305 310 315 320 Gly Val Gly Ala Ser Ser Asp Gly Lys Phe Lys Ser Ile Tyr Ala Pro 325 330 335 Arg Pro Ser Gly Gln Ala Lys Ala Leu Asn Arg Ala Tyr Asp Asp Ala 340 345 350 Gly Phe Ala Pro His Thr Leu Gly Leu Ile Glu Ala His Gly Thr Gly 355 360 365 Thr Ala Ala Gly Asp Ala Ala Glu Phe Ala Gly Leu Cys Ser Val Phe 370 375 380 Ala Glu Gly Asn Asp Thr Lys Gln His Ile Ala Leu Gly Ser Val Lys 385 390 395 400 Ser Gln Ile Gly His Thr Lys Ser Thr Ala Gly Thr Ala Gly Leu Ile 405 410 415 Lys Ala Ala L eu Ala Leu His His Lys Val Leu Pro Pro Thr Ile Asn 420 425 430 Val Ser Gln Pro Ser Pro Lys Leu Asp Ile Glu Asn Ser Pro Phe Tyr 435 440 445 Leu Asn Thr Glu Thr Arg Pro Trp Leu Pro Arg Val Asp Gly Thr Pro 450 455 460 Arg Arg Ala Gly Ile Ser Ser Phe Gly Phe Gly Gly Thr Asn Phe His 465 470 475 480 Phe Val Leu Glu Glu Tyr Asn Gln Glu His Ser Arg Thr Asp Ser Glu 485 490 495 Lys Ala Lys Tyr Arg Gln Arg Gln Val Ala Gln Ser Phe Leu Val Ser 500 505 510 Ala Ser Asp Lys Ala Ser Leu Ile Asn Glu Leu Asn Val Leu Ala Ala 515 520 525 Ser Ala Ser Gln Ala Glu Phe Ile Leu Lys Asp Ala Ala Ala Asn Tyr 530 535 540 Gly Val Arg Glu Leu Asp Lys Asn Ala Pro Arg Ile Gly Leu Val Ala 545 550 555 560 Asn Thr Ala Glu Glu Leu Ala Gly Leu Ile Lys Gln Ala Leu Ala Lys 565 570 575 Leu Ala Ala Ser Asp Asp Asn Ala Trp Gln Leu Pro Gly Gly Thr Ser 580 585 590 Tyr Arg Ala Ala Ala Val Glu Gly Lys Val Ala Ala Leu Phe Ala Gly 595 600 605 Gln Gly Ser Gln Tyr Leu Asn Met Gly Arg Asp Leu Thr Cys Tyr Tyr 610 615 620 620 Pro Glu Met Arg G ln Gln Phe Val Thr Ala Asp Lys Val Phe Ala Ala 625 630 635 640 Asn Asp Lys Thr Pro Leu Ser Gln Thr Leu Tyr Pro Lys Pro Val Phe 645 650 655 Asn Lys Asp Glu Leu Lys Ala Gln Glu Ala Ile Leu Thr Asn Thr Ala 660 665 670 Asn Ala Gln Ser Ala Ile Gly Ala Ile Ser Met Gly Gln Tyr Asp Leu 675 680 685 Phe Thr Ala Ala Gly Phe Asn Ala Asp Met Val Ala Gly His Ser Phe 690 695 700 Gly Glu Leu Ser Ala Leu Cys Ala Ala Gly Val Ile Ser Ala Asp Asp 705 710 715 715 720 Tyr Tyr Lys Leu Ala Phe Ala Arg Gly Glu Alu Met Ala Thr Lys Ala 725 730 735 Pro Ala Lys Asp Gly Val Glu Ala Asp Ala Gly Ala Met Phe Ala Ile 740 745 750 Ile Thr Lys Ser Ala Ala Asp Leu Glu Thr Val Glu Ala Thr Ile Ala 755 760 765 Lys Phe Asp Gly Val Lys Val Ala Asn Tyr Asn Ala Pro Thr Gln Ser 770 775 780 Val Ile Ala Gly Pro Thr Ala Thr Thr Ala Asp Ala Ala Lys Ala Leu 785 790 795 800 Thr Glu Leu Gly Tyr Lys Ala Ile Asn Leu Pro Val Ser Gly Ala Phe 805 810 815 His Thr Glu Leu Val Gly His Ala Gln Ala Pro Phe Ala Lys Ala Ile 820 825 830 Asp Ala Ala LysPhe Thr Lys Thr Ser Arg Ala Leu Tyr Ser Asn Ala 835 840 845 Thr Gly Gly Leu Tyr Glu Ser Thr Ala Ala Lys Ile Lys Ala Ser Phe 850 855 860 Lys Lys His Met Leu Gln Ser Val Arg Phe Thr Ser Gln Leu Glu Ala 865 870 875 880 Met Tyr Asn Asp Gly Ala Arg Val Phe Val Glu Phe Gly Pro Lys Asn 885 890 895 Ile Leu Gln Lys Leu Val Gln Gly Thr Leu Val Asn Thr Glu Asn Glu 900 905 910 Val Cys Thr Ile Ser Ile Asn Pro Asn Pro Lys Val Asp Ser Asp Leu 915 920 925 925 Gln Leu Lys Gln Ala Ala Met Gln Leu Ala Val Thr Gly Val Val Leu 930 935 940 Ser Glu Ile Asp Pro Tyr Gln Ala Asp Ile Ala Ala Pro Ala Lys Lys 945 950 955 960 Ser Pro Met Ser Ile Ser Leu Asn Ala Ala Asn His Ile Ser Lys Ala 965 970 975 Thr Arg Ala Lys Met Ala Lys Ser Leu Glu Thr Gly Ile Val Thr Ser 980 985 990 Gln Ile Glu His Val Ile Glu Glu Lys Ile Val Glu Val Glu Lys Leu 995 1000 1005 Val Glu Val Glu Lys Ile Val Glu Lys Val Val Glu Val Glu Lys Val 1010 1015 1020 Val Glu Val Glu Ala Pro Val Asn Ser Val Gln Ala Asn Ala Ile Gln 1025 1030 1035 1040 Thr Arg Ser Val Val Ala Pro Val Ile Glu Asn Gln Val Val Ser Lys 1045 1050 1055 Asn Ser Lys Pro Ala Val Gln Ser Ile Ser Gly Asp Ala Leu Ser Asn 1060 1065 1070 Phe Phe Ala Ala Gln Gln Gln Thr Ala Gln Leu His Gln Gln Phe Leu 1075 1080 1085 Ala Ile Pro Gln Gln Tyr Gly Glu Thr Phe Thr Thr Leu Met Thr Glu 1090 1095 1100 Gln Ala Lys Leu Ala Ser Ser Gly Val Ala Ile Pro Glu Ser Leu Gln 1105 1110 1115 1120 Arg Ser Met Glu Gln Phe His Gln Leu Gln Ala Gln Thr Leu Gln Ser 1125 1130 1135 His Thr Gln Phe Leu Glu Met Gln Ala Gly Ser Asn Ile Ala Ala Leu 1140 1145 1150 Asn Leu Leu Asn Ser Ser Gln Ala Thr Tyr Ala Pro Ala Ile His Asn 1155 1160 1165 Glu Ala Ile Gln Ser Gln Val Val Gln Ser Gln Thr Ala Val Gln Pro 1170 1175 1180 Val Ile Ser Thr Gln Val Asn His Val Ser Glu Gln Pro Thr Gln Ala 1185 1190 1195 1200 Pro Ala Pro Lys Ala Gln Pro Ala Pro Val Thr Thr Pro Val Gln Thr 1205 1210 1215 Ala Pro Ala Gln Val Val Arg Gln Ala Ala Pro Val Gln Ala Ala Ile 1220 1225 1230 Glu Pro Ile Asn Thr Ser Val Ala Thr Thr Thr Pro Ser Ala Phe Ser 1235 1240 1245 Ala Glu Thr Ala Leu Ser Ala Thr Lys Val Gln Ala Thr Met Leu Glu 1250 1255 1260 Val Val Ala Glu Lys Thr Gly Tyr Pro Thr Glu Met Leu Glu Leu Glu 1265 1270 1275 1280 Met Asp Met Glu Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg Val Glu 1285 1290 1295 Ile Leu Gly Thr Val Gln Asp Glu Leu Pro Gly Leu Pro Glu Leu Ser 1300 1305 1310 Pro Glu Asp Leu Ala Glu Cys Arg Thr Leu Gly Glu Ile Val Asp Tyr 1315 1320 1325 Met Gly Ser Lys Leu Pro Ala Glu Gly Ser Met Asn Ser Gln Leu Ser 1330 1335 1340 Thr Gly Ser Ala Ala Ala Thr Pro Ala Ala Asn Gly Leu Ser Ala Glu 1345 1350 1355 1360 Lys Val Gln Ala Thr Met Met Ser Val Val Ala Glu Lys Thr Gly Tyr 1365 1370 1375 Pro Thr Glu Met Leu Glu Leu Glu Met Asp Met Glu Ala Asp Leu Gly 1380 1385 1390 Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Gly Thr Val Gln Asp Glu 1395 1400 1405 Leu Pro Gly Leu Pro Glu Leu Ser Pro Glu Asp Leu Ala Glu Cys Arg 1410 1415 1420 Thr Leu Gly Glu Ile Val Asp Tyr Met Asn Ser Lys Leu Ala Asp Gly 1425 1430 1435 1440 Ser Ly s Leu Pro Ala Glu Gly Ser Met Asn Ser Gln Leu Ser Thr Ser 1445 1450 1455 Ala Ala Ala Ala Thr Pro Ala Ala Asn Gly Leu Ser Ala Glu Lys Val 1460 1465 1470 Gln Ala Thr Met Met Ser Val Val Ala Glu Lys Thr Gly Tyr Pro Thr 1475 1480 1485 Glu Met Leu Glu Leu Glu Met Asp Met Glu Ala Asp Leu Gly Ile Asp 1490 1495 1500 Ser Ile Lys Arg Val Glu Ile Leu Gly Thr Val Gln Asp Glu Leu Pro 1505 1510 1515 1520 Gly Leu Pro Glu Leu Asn Pro Glu Asp Leu Ala Glu Cys Arg Thr Leu 1525 1530 1535 Gly Glu Ile Val Thr Tyr Met Asn Ser Lys Leu Ala Asp Gly Ser Lys 1540 1545 1550 Leu Pro Ala Glu Gly Ser Met His Tyr Gln Leu Ser Thr Ser Thr Ala 1555 1560 1565 Ala Ala Thr Pro Val Ala Asn Gly Leu Ser Ala Glu Lys Val Gln Ala 1570 1575 1580 Thr Met Met Ser Val Val Ala Asp Lys Thr Gly Tyr Pro Thr Glu Met 1585 1590 1595 1600 Leu Glu Leu Glu Met Asp Met Glu Ala Asp Leu Gly Ile Asp Ser Ile 1605 1610 1615 Lys Arg Val Glu Ile Leu Gly Thr Val Gln Asp Glu Leu Pro Gly Leu 1620 1625 1630 Pro Glu Leu Asn Pro Glu Asp Leu Ala Glu Cys Arg Thr Leu Gly Glu 1635 1640 1645 Ile Val Asp Tyr Met Gly Ser Lys Leu Pro Ala Glu Gly Ser Ala Asn 1650 1655 1660 Thr Ser Ala Ala Ala Ser Leu Asn Val Ser Ala Val Ala Ala Pro Gln 1665 1670 1675 1680 Ala Ala Ala Ala Thr Pro Val Ser Asn Gly Leu Ser Ala Glu Lys Val Gln 1685 1690 1695 Ser Thr Met Met Ser Val Val Ala Glu Lys Thr Gly Tyr Pro Thr Glu 1700 1705 1710 Met Leu Glu Leu Gly Met Asp Met Glu Ala Asp Leu Gly Ile Asp Ser 1715 1720 1725 Ile Lys Arg Val Glu Ile Leu Gly Thr Val Gln Asp Glu Leu Pro Gly 1730 1735 1740 Leu Pro Glu Leu Asn Pro Glu Asp Leu Ala Glu Cys Arg Thr Leu Gly 1745 1750 1755 1760 Glu Ile Val Asp Tyr Met Asn Ser Lys Leu Ala Asp Gly Ser Lys Leu 1765 1770 1775 Pro Ala Glu Gly Ser Ala Asn Thr Ser Ala Thr Ala Ala Thr Pro Ala 1780 1785 1790 Val Asn Gly Leu Ser Ala Asp Lys Val Gln Ala Thr Met Met Ser Val 1795 1800 1805 Val Ala Glu Lys Thr Gly Tyr Pro Thr Glu Met Leu Glu Leu Gly Met 1810 1815 1820 Asp Met Glu Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile 1825 1830 1835 1840 Leu G ly Thr Val Gln Asp Glu Leu Pro Gly Leu Pro Glu Leu Asn Pro 1845 1850 1855 Glu Asp Leu Ala Glu Cys Arg Thr Leu Gly Glu Ile Val Ser Tyr Met 1860 1865 1870 Asn Ser Gln Leu Ala Asp Gly Ser Lys Leu Ser Thr Ser Ala Ala Glu 1875 1880 1885 Gly Ser Ala Asp Thr Ser Ala Ala Asn Ala Ala Lys Pro Ala Ala Ile 1890 1895 1900 Ser Ala Glu Pro Ser Val Glu Leu Pro Pro His Ser Glu Val Ala Leu 1905 1910 1915 1920 Lys Lys Leu Asn Ala Ala Asn Lys Leu Glu Asn Cys Phe Ala Ala Asp 1925 1930 1935 Ala Ser Val Val Ile Asn Asp Asp Gly His Asn Ala Gly Val Leu Ala 1940 1945 1950 Glu Lys Leu Ile Lys Gln Gly Leu Lys Val Ala Val Val Arg Leu Pro 1955 1960 1965 Lys Gly Gln Pro Gln Ser Pro Leu Ser Ser Asp Val Ala Ser Phe Glu 1970 1975 1980 Leu Ala Ser Ser Gln Glu Ser Glu Leu Glu Ala Ser Ile Thr Ala Val 1985 1990 1995 2000 Ile Ala Gla Iln Glu Thr Gln Val Gly Ala Ile Gly Gly Phe Ile His 2005 2010 2015 Leu Gln Pro Glu Ala Asn Thr Glu Glu Gln Thr Ala Val Asn Leu Asp 2020 2025 2030 Ala Gln Ser Phe Thr His Val Ser Asn Ala Phe Le u Trp Ala Lys Leu 2035 2040 2045 Leu Gln Pro Lys Leu Val Ala Gly Ala Asp Ala Arg Arg Cys Phe Val 2050 2055 2060 Thr Val Ser Arg Ile Asp Gly Gly Phe Gly Tyr Leu Asn Thrh Asp Ala 2065 2070 2075 2080 Leu Lys Asp Ala Glu Leu Asn Gln Ala Ala Leu Ala Gly Leu Thr Lys 2085 2090 2095 Thr Leu Ser His Glu Trp Pro Gln Val Phe Cys Arg Ala Leu Asp Ile 2100 2105 2110 Ala Thr Asp Val Asp Ala Thr His Leu Ala Asp Ala Ile Thr Ser Glu 2115 2120 2125 Leu Phe Asp Ser Gln Ala Gln Leu Pro Glu Val Gly Leu Ser Leu Ile 2130 2135 2140 Asp Gly Lys Val Asn Arg Val Thr Leu Val Ala Ala Glu Ala Ala Asp 2145 2150 2155 2160 Lys Thr Ala Lys Ala Glu Leu Asn Ser Thr Asp Lys Ile Leu Val Thr 2165 2170 2175 Gly Gly Ala Lys Gly Val Thr Phe Glu Cys Ala Leu Ala Leu Ala Ser 2180 2185 2190 Arg Ser Gln Ser His Phe Ile Leu Ala Gly Arg Ser Glu Leu Gln Ala 2195 2200 2205 Leu Pro Ser Trp Ala Glu Gly Lys Gln Thr Ser Glu Leu Lys Ser Ala 2210 2215 2220 Ala Ile Ala His Ile Ile Ser Thr Gly Gln Lys Pro Thr Pro Lys Gln 2225 2230 2235 2240 Val Glu Ala Ala Val Trp Pro Val Gln Ser Ser Ile Glu Ile Asn Ala 2245 2250 2255 Ala Leu Ala Ala Phe Asn Lys Val Gly Ala Ser Ala Glu Tyr Val Ser 2260 2265 2270 Met Asp Val Thr Asp Ser Ala Ala Ile Thr Ala Ala Leu Asn Gly Arg 2275 2280 2285 Ser Asn Glu Ile Thr Gly Leu Ile His Gly Ala Gly Val Leu Ala Asp 2290 2295 2300 Lys His Ile Gln Asp Lys Thr Leu Ala Glu Leu Ala Lys Val Tyr Gly 2305 2310 2315 2320 Thr Lys Val Asn Gly Leu Lys Ala Leu Leu Ala Ala Leu Glu Pro Ser 2325 2330 2335 Lys Ile Lys Leu Leu Ala Met Phe Ser Ser Ala Ala Gly Phe Tyr Gly 2340 2345 2350 Asn Ile Gly Gln Ser Asp Tyr Ala Met Ser Asn Asp Ile Leu Asn Lys 2355 2360 2365 Ala Ala Leu Gln Phe Thr Ala Arg Asn Pro Gln Ala Lys Val Met Ser 2370 2375 2380 Phe Asn Trp Gly Pro Trp Asp Gly Gly Met Val Asn Pro Ala Leu Lys 2385 2390 2395 2400 Lys Met Phe Thr Glu Arg Gly Val Tyr Val Ile Pro Leu Lys Ala Gly 2405 2410 2415 Ala Glu Leu Phe Ala Thr Gln Leu Leu Ala Glu Thr Gly Val Gln Leu 2420 2425 2430 Leu Ile Gly Thr Ser Met Gln Gly Gly Ser Asp T hr Lys Ala Thr Glu 2435 2440 2445 Thr Ala Ser Val Lys Lys Leu Asn Ala Gly Glu Val Leu Ser Ala Ser 2450 2455 2460 His Pro Arg Ala Gly Ala Gln Lys Thr Pro Leu Gln Ala Val Thr Ala 2465 2470 2475 2480 Thr Arg Leu Leu Thr Pro Ser Ala Met Val Phe Ile Glu Asp His Arg 2485 2490 2495 Ile Gly Gly Asn Ser Val Leu Pro Thr Val Cys Ala Ile Asp Trp Met 2500 2505 2510 Arg Glu Ala Ala Ser Asp Met Leu Gly Ala Gln Val Lys Val Leu Asp 2515 2520 2525 Tyr Lys Leu Leu Lys Gly Ile Val Phe Glu Thr Asp Glu Pro Gln Glu 2530 2535 2540 Leu Thr Leu Glu Leu Thr Pro Asp Asp Ser Asp Glu Ala Thr Leu Gln 2545 2550 2555 2560 Ala Leu Ile Ser Cys Asn Gly Arg Pro Gln Tyr Lys Ala Thr Leu Ile 2565 2570 2575 Ser Asp Asn Ala Asp Ile Lys Gln Leu Asn Lys Gln Phe Asp Leu Ser 2580 2585 2585 2590 Ala Lys Ala Ile Thr Thr Ala Lys Glu Leu Tyr Ser Asn Gly Thr Leu 2595 2600 2605 Phe His Gly Pro Arg Leu Gln Gly Ile Gln Ser Val Val Gln Phe Asp 2610 2615 2620 Asp Gln Gly Leu Ile Ala Lys Val Ala Leu Pro Lys Val Glu Leu Ser 2625 2630 2635 2640 Asp Cys Gly Glu Phe Leu Pro Gln Thr His Met Gly Gly Ser Gln Pro 2645 2650 2655 Phe Ala Glu Asp Leu Leu Leu Gln Ala Met Leu Val Trp Ala Arg Leu 2660 2665 2670 Lys Thr Gly Ser Ala Ser Leu Pro Ser Ser Ile Gly Glu Phe Thr Ser 2675 2680 2685 Tyr Gln Pro Met Ala Phe Gly Glu Thr Gly Thr Ile Glu Leu Glu Val 2690 2695 2700 Ile Lys His Asn Lys Arg Ser Leu Glu Ala Asn Val Ala Leu Tyr Arg 2705 2710 2715 2720 Asp Asn Gly Glu Leu Ser Ala Met Phe Lys Ser Ala Lys Ile Thr Ile 2725 2730 2735 Ser Lys Ser Leu Asn Ser Ala Phe Leu Pro Ala Val Leu Ala Asn Asp 2740 2745 2750 Ser Glu Ala Asn 2755 <210> 8 <211> 2340 <212> DNA <213> Shewanella putrefaciens SCRC-2874 (FERM BP-1625) <400> 1 gtg gaa caa acg cct aaa gct agt gcg atg ccg ctg cgc atc gca ctt 48 Val Glu Gln Thr Pro Lys Ala Ser Ala Met Pro Leu Arg Ile Ala Leu 1 5 10 15 atc tta ctg cca aca ccg cag ttt gaa gtt aac tct gtc gac cag tca 96 Ile Leu Leu Pro Thr Pro Gln Phe Glu Val Asn Ser Val Asp Gln Ser 20 25 30 gta tta gcc agc tat caa aca ctg cag cct gag cta aat gcc ctg ctt 144 Val Leu Ala Ser Tyr Gln Thr Leu Gln Pro Glu Leu Asn Ala Leu Leu 35 40 45 aat agt gcg ccg aca cct gaa atg ctc agc atc act atc tca gat gat 192 Asn Ser Ala Pro Thr Pro Glu Met Leu Ser Ile Thr Ile Ser Asp Asp 50 55 60 agc gat gca aac agc ttt gag tcg cag cta aat gct gcg acc aac gca 240 Ser Asp Ala Asn Ser Phe Glu Ser Gln Leu Asn Ala Ala Thr Asn Ala 65 70 75 80 att aac aat ggc tat atc gtc aag ctt gct acg gca act cac gct ttg 288 Ile Asn Asn Gly Tyr Ile Val Lys Leu Ala Thr Ala Thr His Ala Leu 85 90 95 tta atg ctg cct gca tta aaa gcg gcg caa atg cgg atc cat cct cat 336 Leu Met Leu Pro Ala Leu Lys Ala Ala Gln Met Arg Ile His Pro His 100 105 110 gcg cag ctt gcc gct atg cag caa gct aaa tcg acg cca atg agt caa 384 Ala Gln Leu Ala Ala Met Gln Gln Ala Lys Ser Thr Pro Met Ser Gln 115 120 125 gta tct ggt gag cta aag ctt ggc gct aat gcg cta agc cta gct cag 432 Val Ser Gly Glu Leu Lys Leu Gly Ala Asn Ala Leu Ser Leu Ala Gln 130 135 140 act aat gcg ctg tct cat gct tta agc caa gcc aag cgt aac tta act 480 Thr Asn Ala Leu Ser His Ala Leu Ser Gln Ala Lys Arg Asn Leu Thr 145 150 155 160 gat gtc agc gtg aat gag tgt ttt gag aac ctc aaa agt gaa cag cag 528 Asp Val Ser Val Asn Glu Cys Phe Glu Asn Leu Lys Ser Glu Gln Gln 165 170 175 tttt aca gag gtt tat tcg ctt att cag caa ctt gct agc cgc acc cat 576 Phe Thr Glu Val Tyr Ser Leu Ile Gln Gln Leu Ala Ser Arg Thr His 180 185 190 gtg aga aaa gag gtt aat caa ggt gtg gaa ctt ggc cct aaa caa gcc 624 Val Arg Lys Glu Val Asn Gln Gly Val Glu Leu Gly Pro Lys Gln Ala 195 200 205 aaa agc cac tat tgg ttt agc gaa ttt cac caa aac cgt gtt gct gcc 672 Lys Ser His Tyr Trp Phe Ser Glu Phe His Gln Asn Arg Val Ala Ala 210 215 220 atc aac ttt att aat ggc caa caa gca acc agc tat gtg ctt act caa 720 Ile Asn Phe Ile Asn Gly Gln Gln Ala Thr Ser Tyr Val Leu Thr Gln 225 230 235 240 ggt tca gga ttg tta gct gcg aaa tca atg cta aac cag caa aga tta 768 Gly Ser Gly Leu Leu Ala Ala Lys Ser Met Leu Asn Gln Gln Arg Leu 245 250 255 atg ttt atc ttg ccg ggt aac agt cag caa caa ata acc gca tca ata 816 Met Phe Ile Leu Pro Gly Asn Ser Gln Gln Gln Ile Thr Ala Ser Ile 260 265 270 act cag tta atg cag caa tta gag cgt ttg cag gta act gag gtt aat 864 Thr Gln Leu Met Gln Gln Leu Glu Arg Leu Gln Val Thr Glu Val Asn 275 280 285 gag ctt tct cta gaa tgc caa cta gag ctg ctc agc ata atg tat gac 912 Glu Leu Ser Leu Glu Cys Gln Leu Glu Leu Leu Ser Ile Met Tyr Asp 290 295 300 aac tta gtc aac gca gac aaa ctc gat agt g 960 Asn Leu Val Asn Ala Asp Lys Leu Thr Thr Arg Asp Ser Lys Pro Ala 305 310 315 320 tat cag gct gtg att caa gca agc tct gtt agc gct gca aag caa gag 1008 Tyr Gln Ala Val Ile Gln Ala Ser Ser Val Ser Al a Ala Lys Gln Glu 325 330 335 tta agc gcg ctt aac gat gca ctc aca gcg ctg ttt gct gag caa aca 1056 Leu Ser Ala Leu Asn Asp Ala Leu Thr Ala Leu Phe Ala Glu Gln Thr 340 345 350 aac gcc aca tca ac aaa ggc tta atc caa tac aaa aca ccg gcg 1104 Asn Ala Thr Ser Thr Asn Lys Gly Leu Ile Gln Tyr Lys Thr Pro Ala 355 360 365 ggc agt tac tta acc cta aca ccg ctt ggc agc aac aat gac aac gcc 1152 Gly Seryr Leu Thr Leu Thr Pro Leu Gly Ser Asn Asn Asp Asn Ala 370 375 380 caa gcg ggt ctt gct ttt gtc tat ccg ggt gtg gga acg gtt tac gcc 1200 Gln Ala Gly Leu Ala Phe Val Tyr Pro Gly Val Gly Thr Val Tyr Ala 385 390 395 400 gat atg ctt aat gag ctg cat cag tac ttc cct gcg ctt tac gcc aaa 1248 Asp Met Leu Asn Glu Leu His Gln Tyr Phe Pro Ala Leu Tyr Ala Lys 405 410 415 ctt gag cgt gaa ggc gat tta agag gat caa gca gaa gat atc 1296 Leu Glu Arg Glu Gly Asp Leu Lys Ala Met Leu Gln Ala Glu Asp Ile 420 425 430 tat cat ctt gac cct aaa cat gct gcc caa atg agc tta ggt gac tta 1344 Tyr His Leu Asp Pro Lys His Ala Ala Gln Met Ser Leu Gly Asp Leu 435 440 445 gcc att gct ggc gtg ggg agc agc tac ctg tta act cag ctg ctc acc 1392 Ala Ile Ala Gly Val Gly Ser Ser Tyr Leu Leu Thr Gln Leu Leu Thr 450 455 460 gat g ttt aat att aag cct aat ttt gca tta ggt tac tca atg ggt 1440 Asp Glu Phe Asn Ile Lys Pro Asn Phe Ala Leu Gly Tyr Ser Met Gly 465 470 475 475 480 gaa gca tca atg tgg gca agc tta ggc gta tgg caaac gcg 1488 Glu Ala Ser Met Trp Ala Ser Leu Gly Val Trp Gln Asn Pro His Ala 485 490 495 ctg atc agc aaa acc caa acc gac ccg cta ttt act tct gct att tcc 1536 Leu Ile Ser Lys Thr Gln Thr Asp Pro Leu Phe Thr Ser Ala Ile Ser 500 505 510 ggc aaa ttg acc gcg gtt aga caa gct tgg cag ctt gat gat acc gca 1584 Gly Lys Leu Thr Ala Val Arg Gln Ala Trp Gln Leu Asp Asp Thr Ala 515 520 525 gcg gaa atc cag tgg aatg ttt gtg gtt aga agt gaa gca gcg ccg 1632 Ala Glu Ile Gln Trp Asn Ser Phe Val Val Arg Ser Glu Ala Ala Pro 530 535 540 att gaa gcc ttg cta aaa gat tac cca cac gct tac ctc gcg att att 1680 Ile Ala Leu Leu Lys Asp Tyr Pro His Ala Tyr Leu Ala Ile Ile 545 550 555 560 caa ggg gat acc tgc gta atc gct ggc tgt gaa atc caa tgt aaa gcg 1728 Gln Gly Asp Thr Cys Val Ile Ala Gly Cys Glu Ile Gln Cy Ala 565 570 575 cta ctt gca gca ctg ggt aaa cgc ggt att gca gct aat cgt gta acg 1776 Leu Leu Ala Ala Leu Gly Lys Arg Gly Ile Ala Ala Asn Arg Val Thr 580 585 590 gcg atg cat acg cag cct gcg atg cat caa aat gtg atg gat 1824 Ala Met His Thr Gln Pro Ala Met Gln Glu His Gln Asn Val Met Asp 595 600 605 ttt tat ctg caa ccg tta aaa gca gag ctt cct agt gaa ata agc ttt 1872 Phe Tyr Leu Gln Pro Leu Lys Ala Glu Leu Pro Ser Glu Ile Ser Phe 610 615 620 atc agc gcc gct gat tta act gcc aag caa acg gtg agt gag caa gca 1920 Ile Ser Ala Ala Ala Asp Leu Thr Ala Lys Gln Thr Val Ser Glu Gln Ala 625 630 635 635 640 ctt agc agc caa gtc gtt gct cag tct att gcc gac acc ttc tgc caa 1968 Leu Ser Ser Gln Val Val Ala Gln Ser Ile Ala Asp Thr Phe Cys Gln 645 650 655 acc ttg gac ttt acc gcg cta gta cat cac gcc caa cat c aa ggc gct 2016 Thr Leu Asp Phe Thr Ala Leu Val His His Ala Gln His Gln Gly Ala 660 665 670 aag ctg ttt gtt gaa att ggc gcg gat aga caa aac tgc acc ttg ata 2064 Lys Leu Phe Val Glu Ile Gly Ala Asg Arg Gln Asn Cys Thr Leu Ile 675 680 685 gac aag att gtt aaa caa gat ggt gcc agc agt gta caa cat caa cct 2112 Asp Lys Ile Val Lys Gln Asp Gly Ala Ser Ser Val Gln His Gln Pro 690 695 700 tgt tgc aca gtg cct atg aac gca aaa ggt agc caa gat att acc agc 2160 Cys Cys Thr Val Pro Met Asn Ala Lys Gly Ser Gln Asp Ile Thr Ser 705 710 715 720 gtg att aaa gcg ctt ggc caa tta att agc cat cag gtg cca tta tcg 2 al Ile Lys Ala Leu Gly Gln Leu Ile Ser His Gln Val Pro Leu Ser 725 730 735 gtg caa cca ttt att gat gga ctc aag cgc gag cta aca ctt tgc caa 2256 Val Gln Pro Phe Ile Asp Gly Leu Lys Arg Glu Leu Thr Leu Cys Gln 740 745 750 ttg acc agc caa cag ctg gca gca cat gca aat gtt gac agc aag ttt 2304 Leu Thr Ser Gln Gln Leu Ala Ala His Ala Asn Val Asp Ser Lys Phe 755 760 765 gag tct aac caa gac cat tta ctt caa ggg gaa gtc 2340 Glu Ser Asn Gln Asp His Leu Leu Gln Gly Glu Val 770 775 780 <210> 9 <211> 780 <212> PRT <400> 1 Val Glu Gln Thr Pro Lys Ala Ser Ala Met Pro Leu Arg Ile Ala Leu 1 5 10 15 Ile Leu Leu Pro Thr Pro Gln Phe Glu Val Asn Ser Val Asp Gln Ser 20 25 30 Val Leu Ala Ser Tyr Gln Thr Leu Gln Pro Glu Leu Asn Ala Leu Leu 35 40 45 Asn Ser Ala Pro Thr Pro Glu Met Leu Ser Ile Thr Ile Ser Asp Asp 50 55 60 Ser Asp Ala Asn Ser Phe Glu Ser Gln Leu Asn Ala Ala Thr Asn Ala 65 70 75 80 Ile Asn Asn Gly Tyr Ile Val Lys Leu Ala Thr Ala Thr His Ala Leu 85 90 95 Leu Met Leu Pro Ala Leu Lys Ala Ala Gln Met Arg Ile His Pro His 100 105 110 Ala Gln Leu Ala Ala Met Gln Gln Ala Lys Ser Thr Pro Met Ser Gln 115 120 125 Val Ser Gly Glu Leu Lys Leu Gly Ala Asn Ala Leu Ser Leu Ala Gln 130 135 140 Thr Asn Ala Leu Ser His Ala Leu Ser Gln Ala Lys Arg Asn Leu Thr 145 150 155 160 Asp Val Ser Val Asn Glu Cys Phe Glu Asn Leu Lys Ser Glu Gln Gln 165 170 175 Phe Thr Glu Val Tyr Ser Leu Ile Gln Gln Leu Ala Ser Arg Thr His 180 185 190 Val Arg Lys Glu Val Asn Gln Gly Val Glu Leu Gly Pro Lys Gln Ala 195 200 205 Lys Ser His Ty r Trp Phe Ser Glu Phe His Gln Asn Arg Val Ala Ala 210 215 220 Ile Asn Phe Ile Asn Gly Gln Gln Ala Thr Ser Tyr Val Leu Thr Gln 225 230 235 240 Gly Ser Gly Leu Leu Ala Ala Lys Ser Met Leu Asn Gln Gln Arg Leu 245 250 255 Met Phe Ile Leu Pro Gly Asn Ser Gln Gln Gln Ile Thr Ala Ser Ile 260 265 270 Thr Gln Leu Met Gln Gln Leu Glu Arg Leu Gln Val Thr Glu Val Asn 275 280 280 285 Glu Leu Ser Leu Glu Cys Gln Leu Glu Leu Leu Ser Ile Met Tyr Asp 290 295 300 Asn Leu Val Asn Ala Asp Lys Leu Thr Thr Arg Asp Ser Lys Pro Ala 305 310 315 320 Tyr Gln Ala Val Ile Gln Ala Ser Ser Val Ser Ala Ala Lys Gln Glu 325 330 335 Leu Ser Ala Leu Asn Asp Ala Leu Thr Ala Leu Phe Ala Glu Gln Thr 340 345 350 Asn Ala Thr Ser Thr Asn Lys Gly Leu Ile Gln Tyr Lys Thr Pro Ala 355 360 365 Gly Ser Tyr Leu Thr Leu Thr Pro Leu Gly Ser Asn Asn Asp Asn Ala 370 375 380 Gln Ala Gly Leu Ala Phe Val Tyr Pro Gly Val Gly Thr Val Tyr Ala 385 390 395 400 Asp Met Leu Asn Glu Leu His Gln Tyr Phe Pro Ala Leu Tyr Ala Lys 405 410 415 Leu Glu Arg G lu Gly Asp Leu Lys Ala Met Leu Gln Ala Glu Asp Ile 420 425 430 Tyr His Leu Asp Pro Lys His Ala Ala Gln Met Ser Leu Gly Asp Leu 435 440 445 Ala Ile Ala Gly Val Gly Ser Ser Tyr Leu Leu Thr Gln Leu Leu Thr 450 455 460 Asp Glu Phe Asn Ile Lys Pro Asn Phe Ala Leu Gly Tyr Ser Met Gly 465 470 475 480 Glu Ala Ser Met Trp Ala Ser Leu Gly Val Trp Gln Asn Pro His Ala 485 490 490 495 Leu Ile Ser Lys Thr Gln Thr Asp Pro Leu Phe Thr Ser Ala Ile Ser 500 505 510 510 Gly Lys Leu Thr Ala Val Arg Gln Ala Trp Gln Leu Asp Asp Thr Ala 515 520 525 Ala Glu Ile Gln Trp Asn Ser Phe Val Val Arg Ser Glu Ala Ala Pro 530 535 540 Ile Glu Ala Leu Leu Lys Asp Tyr Pro His Ala Tyr Leu Ala Ile Ile 545 550 555 560 Gln Gly Asp Thr Cys Val Ile Ala Gly Cys Glu Ile Gln Cys Lys Ala 565 570 575 Leu Leu Ala Ala Leu Gly Lys Arg Gly Ile Ala Ala Asn Arg Val Thr 580 585 590 Ala Met His Thr Gln Pro Ala Met Gln Glu His Gln Asn Val Met Asp 595 600 605 Phe Tyr Leu Gln Pro Leu Lys Ala Glu Leu Pro Ser Glu Ile Ser Phe 610 615 620 Ile Ser Ala Ala A sp Leu Thr Ala Lys Gln Thr Val Ser Glu Gln Ala 625 630 635 640 Leu Ser Ser Gln Val Val Ala Gln Ser Ile Ala Asp Thr Phe Cys Gln 645 650 655 Thr Leu Asp Phe Thr Ala Leu Val His His Ala Gln His Gln Gly Ala 660 665 670 Lys Leu Phe Val Glu Ile Gly Ala Asp Arg Gln Asn Cys Thr Leu Ile 675 680 685 Asp Lys Ile Val Lys Gln Asp Gly Ala Ser Ser Val Gln His Gln Pro 690 695 700 Cys Cys Thr Val Pro Met Asn Ala Lys Gly Ser Gln Asp Ile Thr Ser 705 710 715 715 720 Val Ile Lys Ala Leu Gly Gln Leu Ile Ser His Gln Val Pro Leu Ser 725 730 735 Val Gln Pro Phe Ile Asp Gly Leu Lys Arg Glu Leu Thr Leu Cys Gln 740 745 750 Leu Thr Ser Gln Gln Leu Ala Ala His Ala Asn Val Asp Ser Lys Phe 755 760 765 Glu Ser Asn Gln Asp His Leu Leu Gln Gly Glu Val 770 775 780 <210> 10 <211> 6015 <212> DNA <213> Shewanella putrefaciens SCRC-2874 (FERM BP-1625) <400> 1 atg tca tta cca gac aat gct tct aac cac ctt tct gcc aac cag aaa 48 Met Ser Leu Pro Asp Asn Ala Ser Asn His Leu Ser Ala Asn Gln Lys 1 5 10 15 ggc gca tct cag gca agt aaa acc agt aag caa agc aaa atc gcc att 96 Gly Ala Ser Gln Ala Ser Lys Thr Ser Lys Gln Ser Lys Ile Ala Ile 20 25 30 gtc ggt tta gcc act ctg tat cca gac gct aaa acc ccg caa gaa ttt 144 Val Gly Leu Ala Thr Leu Tyr Pro Asp Ala Lys Thr Pro Gln Glu Phe 35 40 45 tgg cag aat ttg ctg gat aaa cgc gac tct cgc agc acc tta act aac 192 Trp Gln Asn Leu Leu Asp Lys Arg Asp Ser Arg Ser Thr Leu Thr Asn 50 55 60 gaa aaa ctc ggc gct aac agc caa gat tat caa ggt gtg caa ggc caa 240 Glu Lys Leu Gly Ala Asn Ser Gln Asp Tyr Gln Gly Val Gln Gly Gln 65 70 75 80 tct gac cgt ttt tat tgt aat aaa ggc ggc tac attag ttc agc 288 Ser Asp Arg Phe Tyr Cys Asn Lys Gly Gly Tyr Ile Glu Asn Phe Ser 85 90 95 ttt aat gct gca ggc tac aaa ttg ccg gag caa agc tta aat ggc ttg 336 Phe Asn Ala Ala Gly Tyr Lysu Ser Leu Asn Gly Leu 100 105 110 gac gac agc ttc ctt tgg gcg ctc gat act agc cgt aac gca cta att 384 Asp Asp Ser Phe Leu Trp Ala Leu Asp Thr Ser Arg Asn Ala Leu Ile 115 120 125 gat gct ggt att gat atc aac ggc gct gat tta agc cg ggt gta 432 Asp Ala Gly Ile Asp Ile Asn Gly Ala Asp Leu Ser Arg Ala Gly Val 130 135 140 gtc atg ggc gcg ctg tcg ttc cca act acc cgc tca aac gat ctg ttt 480 Val Met Gly Ala Leu Ser Phe Pro Thr Thrg Ser Asn Asp Leu Phe 145 150 155 160 ttg cca att tat cac agc gcc gtt gaa aaa gcc ctg caa gat aaa cta 528 Leu Pro Ile Tyr His Ser Ala Val Glu Lys Ala Leu Gln Asp Lys Leu 165 170 175 ggc gta aag gcatt aag cta agc cca act aat gct cat acc gct cgc 576 Gly Val Lys Ala Phe Lys Leu Ser Pro Thr Asn Ala His Thr Ala Arg 180 185 190 gcg gca aat gag agc agc cta aat gca gcc aat ggt gcc att gcc cat 624 Ala Ala Asn Glu Ser Ser Leu Asn Ala Ala Asn Gly Ala Ile Ala His 195 200 205 aac agc tca aaa gtg gtg gcc gat gca ctt ggc ctt ggc ggc gca caa 672 Asn Ser Ser Lys Val Val Ala Asp Ala Leu Gly Leu Gly Gly Ala Gln 210 215 220 cta agc cta gat gct gcc tgt gct agt tcg gtt tac tca tta aag ctt 720 Leu Ser Leu Asp Ala Ala Cys Ala Ser Ser Val Tyr Ser Leu Lys Leu 225 230 235 240 gcc tgc gat tac cta agc act ggc aaa gcc gat atc atg cta gca ggc 768 Ala Cys Asp Tyr Leu Ser Thr Gly Lys Ala Asp Ile Met Leu Ala Gly 245 250 255 gca gta tct ggc gcg gat cct ttc ttt att aat atg gga ttc tca atc 816 Ala Val Ser Gly Ala Pro Phe Phe Ile Asn Met Gly Phe Ser Ile 260 265 270 ttc cac gcc tac cca gac cat ggt atc tca gta ccg ttt gat gcc agc 864 Phe His Ala Tyr Pro Asp His Gly Ile Ser Val Pro Phe Asp Ala Ser 275 280 285 agt aaa ggt ttg ttt gct ggc gaa ggc gct ggc gta tta gtg ctt aaa 912 Ser Lys Gly Leu Phe Ala Gly Glu Gly Ala Gly Val Leu Val Leu Lys 290 295 300 cgt ctt gaa gat gcc gag cgc gac agat gac gat gtt 960 Arg Leu Glu Asp Ala Glu Arg Asp Asn Asp Lys Ile Tyr Ala Val Val 305 310 315 320 agc ggc gta ggt cta tca aac gac ggt aaa ggc cag ttt gta tta agc 1008 Ser Gly Val Gly Leu Ser Asn Asp Gly Lys Gl n Phe Val Leu Ser 325 330 335 cct aat cca aaa ggt cag gtg aag gcc ttt gaa cgt gct tat gct gcc 1056 Pro Asn Pro Lys Gly Gln Val Lys Ala Phe Glu Arg Ala Tyr Ala Ala 340 345 350 agt gac att gag cca aaa gac att gaa gtg att gag tgc cac gca aca 1104 Ser Asp Ile Glu Pro Lys Asp Ile Glu Val Ile Glu Cys His Ala Thr 355 360 365 ggc aca ccg ctt ggc gat aaa att gag ctc act tca atg gaa acc ttc 1152 Gly Thr Leu Gly Asp Lys Ile Glu Leu Thr Ser Met Glu Thr Phe 370 375 380 ttt gaa gac aag ctg caa ggc acc gat gca ccg tta att ggc tca gct 1200 Phe Glu Asp Lys Leu Gln Gly Thr Asp Ala Pro Leu Ile Gly Ser Ala 385 390 395 400 aag tct aac tta ggc cac cta tta act gca gcg ggc atg ccg ggg atc 1248 Lys Ser Asn Leu Gly His Leu Leu Thr Ala Ala Gly Met Pro Gly Ile 405 410 415 atg aag atg atc ttc gcc atg aaa gaa ggt ctg ccg cca agt atc 1296 Met Lys Met Ile Phe Ala Met Lys Glu Gly Tyr Leu Pro Pro Ser Ile 420 425 430 aat att agt gat gct atc gct tcg ccg aaa aaa ctc ttc ggt aaa cca 1344 Asn Ile Ser Asp Ala Ile A Ser Pro Lys Lys Leu Phe Gly Lys Pro 435 440 445 acc ctg cct agc atg gtt caa ggc tgg cca gat aag cca tcg aat aat 1392 Thr Leu Pro Ser Met Val Gln Gly Trp Pro Asp Lys Pro Ser Asn Asn 450 455 460 cat ttt ggt gta aga acc cgt cac gca ggc gta tcg gta ttt ggc ttt 1440 His Phe Gly Val Arg Thr Arg His Ala Gly Val Ser Val Phe Gly Phe 465 470 475 ggt ggc tgt aac gcc cat ctg ttg ctt gag tca tac aac ggc ga g 1488 Gly Gly Cys Asn Ala His Leu Leu Leu Glu Ser Tyr Asn Gly Lys Gly 480 485 490 495 aca gta aag gca gaa gcc act caa gta ccg cgt caa gct gag ccg cta 1536 Thr Val Lys Ala Glu Ala Thr Gln Val Pro Arg Gln Ala Glu Pro Leu 500 505 510 aaa gtg gtt ggc ctt gcc tcg cac ttt ggg cct ctt agc agc att aat 1584 Lys Val Val Gly Leu Ala Ser His Phe Gly Pro Leu Ser Ser Ile Asn 515 520 525 gca ctc aac aat gct gtg acc caa gat ggg aat ggc ttt atc gaa ctg 1632 Ala Leu Asn Asn Ala Val Thr Gln Asp Gly Asn Gly Phe Ile Glu Leu 530 535 540 ccg aaa aag cgc tgg aaa ggc ctt gaa aag cac agt gag ctg tta gct 1 Lys Arg Trp Lys Gly Leu Glu Lys His Ser Glu Leu Leu Ala 545 550 555 gaa ttt ggc tta gca tct gcg cca aaa ggt gct tat gtt gat aac ttc 1728 Glu Phe Gly Leu Ala Ser Ala Pro Lys Gly Ala Tyr Val Asp Asn 560 565 570 575 gag ctg gac ttt tta cgc ttt aaa ctg ccg cca aac gaa gat gac cgt 1776 Glu Leu Asp Phe Leu Arg Phe Lys Leu Pro Pro Asn Glu Asp Asp Arg 580 585 590 ttg atc tca cag cg ctag atg gta aca gac gaa gcc att 1824 Leu Ile Ser Gln Gln Leu Met Leu Met Arg Val Thr Asp Glu Ala Ile 595 600 605 cgt gat gcc aag ctt gag ccg ggg caa aaa gta gct gta tta gtg gca 1872 Arg Asp Ala Lys Leu Glu Gly Gln Lys Val Ala Val Leu Val Ala 610 615 620 atg gaa act gag ctt gaa ctg cat cag ttc cgc ggc cgg gtt aac ttg 1920 Met Glu Thr Glu Leu Glu Leu His Gln Phe Arg Gly Arg Val Asn Leu 625 630 635 cat act caa tta gcg caa agt ctt gcc gcc atg ggc gtg agt tta tca 1968 His Thr Gln Leu Ala Gln Ser Leu Ala Ala Met Gly Val Ser Leu Ser 640 645 650 655 655 acg gat gaa tac caa gcg ctt gaa gcc atc gcc atg gac ag c gtg ctt 2016 Thr Asp Glu Tyr Gln Ala Leu Glu Ala Ile Ala Met Asp Ser Val Leu 660 665 670 gat gct gcc aag ctc aat cag tac acc agc ttt att ggt aat att atg 2064 Asp Ala Ala Lys Leu Asn Gln Tyr Thr Ser Phe Ile Gly Asn Ile Met 675 680 685 gcg tca cgc gtg gcg tca cta tgg gac ttt aat ggc cca gcc ttc act 2112 Ala Ser Arg Val Ala Ser Leu Trp Asp Phe Asn Gly Pro Ala Phe Thr 690 695 700 att tca gca gca caa tct gtg agc cgc tgt atc gat gtg gcg caa 2160 Ile Ser Ala Ala Glu Gln Ser Val Ser Arg Cys Ile Asp Val Ala Gln 705 710 715 aac ctc atc atg gag gat aac cta gat gcg gtg gtg Att gca gc Asg Ile Met Glu Asp Asn Leu Asp Ala Val Val Ile Ala Ala Val 720 725 730 735 gat ctc tct ggt agc ttt gag caa gtc att ctt aaa aat gcc att gca 2256 Asp Leu Ser Gly Ser Phe Glu Gln Val Ile Leu Lys Asn Ala Ile Ala 740 745 750 cct gta gcc att gag cca aac ctc gaa gca agc ctt aat cca aca tca 2304 Pro Val Ala Ile Glu Pro Asn Leu Glu Ala Ser Leu Asn Pro Thr Ser 755 760 765 gca agc tgg aat gtc ggt gaa ggt gc t ggc gcg gtc gtg ctt gtt aaa 2352 Ala Ser Trp Asn Val Gly Glu Gly Ala Gly Ala Val Val Leu Val Lys 770 775 780 aat gaa gct aca tcg ggc tgc tca tac ggc caa att gat gca ctt ggc 2400 Asn Gly Cys Ser Tyr Gly Gln Ile Asp Ala Leu Gly 785 790 795 ttt gct aaa act gcc gaa aca gcg ttg gct acc gac aag cta ctg agc 2448 Phe Ala Lys Thr Ala Glu Thr Ala Leu Ala Thr Asp Lys Leu Leu Ser 800 805 815 caa act gcc aca gac ttt aat aag gtt aaa gtg att gaa act atg gca 2496 Gln Thr Ala Thr Asp Phe Asn Lys Val Lys Val Ile Glu Thr Met Ala 820 825 830 gcg cct gct agc caa att caa tta gcg cca atg gtt tct caa gtg 2544 Ala Pro Ala Ser Gln Ile Gln Leu Ala Pro Ile Val Ser Ser Gln Val 835 840 845 act cac act gct gca gag cag cgt gtt ggt cac tgc ttt gct gca gcg 2592 Thr His Thr Ala Ala Glu Gln Arg Val Gly His Cys Phe Ala Ala Ala 850 855 860 ggt atg gca agc cta tta cac ggc tta ctt aac tta aat act gta gcc 2640 Gly Met Ala Ser Leu Leu Leu His Gly Leu Leu Asn Leu Asn Thr Val Ala 865 870 875 caa acc aat aaa gcc aat tgc gcg ctt atc aac aat atc agt gaa aac 2688 Gln Thr Asn Lys Ala Asn Cys Ala Leu Ile Asn Asn Ile Ser Glu Asn 880 885 890 895 caa tta tca cag ctg ttg att agc caa aca gcg caca gca gca gca gca gca caca Gln Leu Ser Gln Leu Leu Ile Ser Gln Thr Ala Ser Glu Gln Gln Ala 900 905 910 tta acc gcg cgt tta agc aat gag ctt aaa tcc gat gct aaa cac caa 2784 Leu Thr Ala Arg Leu Ser Asn Glu Leu Lys Ser Asp Ala Lys His Gln 915 920 925 ctg gtt aag caa gtc acc tta ggt ggc cgt gat atc tac cag cat att 2832 Leu Val Lys Gln Val Thr Leu Gly Gly Arg Asp Ile Tyr Gln His Ile 930 935 940 gtt gat aca ccg ctt gca agc cttga agc att act cag aaa ttg gcg 2880 Val Asp Thr Pro Leu Ala Ser Leu Glu Ser Ile Thr Gln Lys Leu Ala 945 950 955 caa gcg aca gca tcg aca gtg gtc aac caa gtt aaa cct att aag gcc 2928 Gln Ala Thr Ala Ser Thr Val Val Asn Gln Val Lys Pro Ile Lys Ala 960 965 970 975 gct ggc tca gtc gaa atg gct aac tca ttc gaa acg gaa agc tca gca 2976 Ala Gly Ser Val Glu Met Ala Asn Ser Phe Glu Thr Glu Ser Ser Ala 980 985 990 gag cca caa ata aca att gca gca caa cag act gca aac att ggc gtc 3024 Glu Pro Gln Ile Thr Ile Ala Ala Gln Gln Thr Ala Asn Ile Gly Val 995 1000 1005 acc gct cag gca acc aaa cgt gaa tta ggt acc cca cca atg aca aca 3072 Thr Ala Gln Ala Thr Lys Arg Glu Leu Gly Thr Pro Pro Met Thr Thr 1010 1015 1020 aat acc att gct aat aca gca aat aat tta gac aag act ctt gag act 3120 Asn Thr Ile Ala Asn Thr Ala Asn Asn Leu Asp Lys Thr Leu Glu Thr 1025 1030 1035 gtt gct ggc aat act gtt gct agc aag gtt ggc tct ggc gac ata gtc 3168 Val Ala Gly Asn Thr Val Ala Ser Lys Val Gly Ser Gly Asp Ile Val 1040 1045 1050 1055 aat ttt caa cag aac caa caa ttg gct caa caa gct cac ctc gcc ttt 3216 Asn Phe Gln Gln Asn Gln Gln Leu Ala Gln Gln Ala His Leu Ala Phe 1060 1065 1070 ctt gaa agc cgc agt gcg ggt atg aag gtg gct gat ag Leu Glu Ser Arg Ser Ala Gly Met Lys Val Ala Asp Ala Leu Leu Lys 1075 1080 1085 caa cag cta gct caa gta aca ggc caa act atc gat aat cag gcc ctc 3312 Gln Gln Leu Ala Gln Val Thr Gl y Gln Thr Ile Asp Asn Gln Ala Leu 1090 1095 1100 gat act caa gcc gtc gat act caa aca agc gag aat gta gcg att gcc 3360 Asp Thr Gln Ala Val Asp Thr Gln Thr Ser Glu Asn Val Ala Ile Ala 1105 1110 1115 gca gaa tca cca gtt caa gtt aca aca cct gtt caa gtt aca aca cct 3408 Ala Glu Ser Pro Val Gln Val Thr Thr Pro Val Gln Val Thr Thr Pro 1120 1125 1130 1135 gtt caa atc agt gtt gtg gag tta aaa cca gat cac gct aat gtg cca 3456 Val Gln Ile Ser Val Val Glu Leu Lys Pro Asp His Ala Asn Val Pro 1140 1145 1150 cca tac acg ccg cca gtg cct gca tta aag ccg tgt atc tgg aac tat 3504 Pro Tyr Thr Pro Pro Val Pro Ala Leu Lys Pro Cys Ile Trp Asn Tyr 1155 1160 1165 gcc gat tta gtt gag tac gca gaa ggc gat atc gcc aag gta ttt ggc 3552 Ala Asp Leu Val Glu Tyr Ala Glu Gly Asp Ile Ala Lys Val Phe Gly 1170 1175 1180 agt gat atc gat atc gat agc tac tcg cgc cgc gta cgt cta ccg 3600 Ser Asp Tyr Ala Ile Ile Asp Ser Tyr Ser Arg Arg Val Arg Leu Pro 1185 1190 1195 acc act gac tac ctg ttg gta tcg cgc gtg acc aaa ctt gat gc g acc 3648 Thr Thr Asp Tyr Leu Leu Val Ser Arg Val Thr Lys Leu Asp Ala Thr 1200 1205 1210 1215 atc aat caa ttt aag cca tgc tca atg acc act gag tac gac atc cct 3696 Ile Asn Gln Phe Lys Pro Cys Ser Met Thr Thr Glu Tyr Asp Ile Pro 1220 1225 1230 gtt gat gcg ccg tac tta gta gac gga caa atc cct tgg gcg gta gca 3744 Val Asp Ala Pro Tyr Leu Val Asp Gly Gln Ile Pro Trp Ala Val Ala 1235 1240 1245 gta gaa tca ggc caa tgt gac ttg atg ctt att agc tat ctc ggt atc 3792 Val Glu Ser Gly Gln Cys Asp Leu Met Leu Ile Ser Tyr Leu Gly Ile 1250 1255 1260 gac ttt gag aac aaa ggc gag cgg gtt tat cga cta ctc gat tgt acc 3840 Glu Asn Lys Gly Glu Arg Val Tyr Arg Leu Leu Asp Cys Thr 1265 1270 1275 ctc acc ttc cta ggc gac ttg cca cgt ggc gga gat acc cta cgt tac 3888 Leu Thr Phe Leu Gly Asp Leu Pro Arg Gly Gly Asp Thr Leu Arg 1280 1285 1290 1295 gac att aag atc aat aac tat gct cgc aac ggc gac acc ctg ctg ttc 3936 Asp Ile Lys Ile Asn Asn Tyr Ala Arg Asn Gly Asp Thr Leu Leu Phe 1300 1305 1310 ttc ttc tcg tat gag tgt ttt gtt ggc gac aag atg atc ctc aag atg 3984 Phe Phe Ser Tyr Glu Cys Phe Val Gly Asp Lys Met Ile Leu Lys Met 1315 1320 1325 gat ggc ggc tgc gct ggc ttc ttc act gat gaga gag gtt Asp Gly Gly Cys Ala Gly Phe Phe Thr Asp Glu Glu Leu Ala Asp Gly 1330 1335 1340 aaa ggc gtg att cgc aca gaa gaa gag att aaa gct cgc agc cta gtg 4080 Lys Gly Val Ile Arg Thr Glu Glu Glu Ig Leu Val 1345 1350 1355 caa aag caa cgc ttt aat ccg tta cta gat tgt cct aaa acc caa ttt 4128 Gln Lys Gln Arg Phe Asn Pro Leu Leu Asp Cys Pro Lys Thr Gln Phe 1360 1365 1370 1375 agt tat ggt gat att cat aag cta tta act gct gat att gag ggt tgt 4176 Ser Tyr Gly Asp Ile His Lys Leu Leu Thr Ala Asp Ile Glu Gly Cys 1380 1385 1390 ttt ggc cca agc cac agt ggc gtc cac cag ccg tca ctt tgt ttc gca 4224 Phe Gly Ser Ser Gly Val His Gln Pro Ser Leu Cys Phe Ala 1395 1400 1405 tct gaa aaa ttc ttg atg att gaa caa gtc agc aag gtt gat cgc act 4272 Ser Glu Lys Phe Leu Met Ile Glu Gln Val Ser Lys Va l Asp Arg Thr 1410 1415 1420 ggc ggt act tgg gga ctt ggc tta att gag ggt cat aag cag ctt gaa 4320 Gly Gly Thr Trp Gly Leu Gly Leu Ile Glu Gly His Lys Gln Leu Glu 1425 1430 1435 gca gac cac tgg tac ttcc tgt cat ttc aag ggc gac caa gtg atg 4368 Ala Asp His Trp Tyr Phe Pro Cys His Phe Lys Gly Asp Gln Val Met 1440 1445 1450 1455 gct ggc tcg cta atg gct gaa ggt tgt ggc cag tta ttg cag ttc Gtat 4416 Leu Met Ala Glu Gly Cys Gly Gln Leu Leu Gln Phe Tyr 1460 1465 1470 atg ctg cac ctt ggt atg cat acc caa act aaa aat ggt cgt ttc caa 4464 Met Leu His Leu Gly Met His Thr Gln Thr Lys Asn Gly Arg Phe Gln 1475 1480 1485 cct ctt gaa aac gcc tca cag caa gta cgc tgt cgc ggt caa gtg ctg 4512 Pro Leu Glu Asn Ala Ser Gln Gln Val Arg Cys Arg Gly Gln Val Leu 1490 1495 1500 cca caa tca ggc gtg cta act tac cgtg act gaa atc ggt 4560 Pro Gln Ser Gly Val Leu Thr Tyr Arg Met Glu Val Thr Glu Ile Gly 1505 1510 1515 ttc agt cca cgc cca tat gct aaa gct aac atc gat atc ttg ctt aat 4608 Phe Ser Pro Arg Pro Tyr Ala Lys Ala Asn Ile Asp Ile Leu Leu Asn 1520 1525 1530 1535 ggc aaa gcg gta gtg gat ttc caa aac cta ggg gtg atg ata aaa gag 4656 Gly Lys Ala Val Val Asp Phe Gln Asn Leu Gly Val Met Glu 1540 1545 1550 gaa gat gag tgt act cgt tat cca ctt ttg act gaa tca aca acg gct 4704 Glu Asp Glu Cys Thr Arg Tyr Pro Leu Leu Thr Glu Ser Thr Thr Ala 1555 1560 1565 agc act gca caa gta aac gct caa aca agt gcg aaa aag gta tac aag 4752 Ser Thr Ala Gln Val Asn Ala Gln Thr Ser Ala Lys Lys Val Tyr Lys 1570 1575 1580 cca gca tca gtc aat gcg cca tta atg gca caa att cct gat ctg act 4800 Pro Ala Ser Val Asn Ala Pro Leu Met Ala Gln Ile Pro Asp Leu Thr 1585 1590 1595 aaa gag cca aac aag ggc gtt att ccg att tcc cat gtt gaa gca cca 4848 Lys Glu Pro Asn Lys Gly Val Ile Pro Ile Ser His Val Glu Ala Pro 1600 1605 1610 1615 att acg cca gac tac ccg aac cgt gta cct gat aca gtg cca ttc acg 4896 Ile Thr Pro Asp Tyr Pro Asn Arg Val Pro Asp Thr Val Pro Phe Thr 1620 1625 1630 ccg tat cac atg ttt gag ttt gct aca ggc aat atc gaa aac tgt ttc 4944 Pro Tyr His Met Phe Glu Phe Ala Thr Gly Asn Ile Glu Asn Cys Phe 1635 1640 1645 ggg cca gag ttc tca atc tat cgc ggc atg atc cca cca cgt aca cca 4992 Gly Pro Glu Ile Tyr Arg Gly Met Ile Pro Pro Arg Thr Pro 1650 1655 1660 tgc ggt gac tta caa gtg acc aca cgt gtg att gaa gtt aac ggt aag 5040 Cys Gly Asp Leu Gln Val Thr Thr Arg Val Ile Glu Val Asn Gly Lys 1665 1670 1675 cgt ggc gac ttt aaa aag cca tca tcg tgt atc gct gaa tat gaa gtg 5088 Arg Gly Asp Phe Lys Lys Pro Ser Ser Cys Ile Ala Glu Tyr Glu Val 1680 1685 1690 1695 cct gca gat gcg tgg tat ttc gat cac aa aac aac gca gtg atg 5136 Pro Ala Asp Ala Trp Tyr Phe Asp Lys Asn Ser His Gly Ala Val Met 1700 1705 1710 cca tat tca att tta atg gag atc tca ctg caa cct aac ggc ttt atc 5184 Pro Tyr Ser Ile Leu Met Glu Ile Ser Leu Gln Pro Asn Gly Phe Ile 1715 1720 1725 tca ggt tac atg ggc aca acc cta ggc ttc cct ggc ctt gag ctg ttc 5232 Ser Gly Tyr Met Gly Thr Thr Leu Gly Phe Pro Gly Leu Glu Leu Phe 1730 1735 1740 ttc cgt aac tta gac ggt agc ggt gag tta cta cgt gaa gta gat tta 5280 Phe Arg Asn Leu Asp Gly Ser Gly Glu Leu Leu Arg Glu Val Asp Leu 1745 1750 1755 cgt ggt aaa acc atc cgt aac gac tca cgt tca aca gtg atg 5328 Arg Gly Lys Thr Ile Arg Asn Asp Ser Arg Leu Leu Ser Thr Val Met 1760 1765 1770 1775 gcc ggc act aac atc atc caa agc ttt agc ttc gag cta agc act gac 5376 Ala Gly Thr Asn Ile Ile Gln Ser Phe Ser Phe Glu Leu Ser Thr Asp 1780 1785 1790 ggt gag cct ttc tat cgc ggc act gcg gta ttt ggc tat ttt aaa ggt 5424 Gly Glu Pro Phe Tyr Arg Gly Thr Ala Val Phe Gly Tyr Phe Lys Gly 1795 1800 1805 gac gtt aaa gat cag cta ggc cta gat aac ggt aaa gtc act cag 5472 Asp Ala Leu Lys Asp Gln Leu Gly Leu Asp Asn Gly Lys Val Thr Gln 1810 1815 1820 cca tgg cat gta gct aac ggc gtt gct gca agc act aag gtg aac gt Pro Trp His Val Ala Asn Gly Val Ala Ala Ser Thr Lys Val Asn Leu 1825 1830 1835 ctt gat aag agc tgc cgt cac ttt aat gcg cca gct aac cag cca cac 5568 Leu Asp Lys Ser Cys Arg His Phe Asn Ala Pro Ala Asn Gln Pro His 1840 1845 1850 1855 tat cgt cta gcc ggt ggt cag ctg aac ttt atc gac agt gtt gaa att 5616 Tyr Arg Leu Ala Gly Gly Gln Leu Asn Phe Ile Asp Ser Val Glu Ile 1860 1865 1870 gtt gat aat ggc ggc acc gaa ggt tta ggt tac ttg tat gcc gag cgc 5664 Val Asp Asn Gly Gly Thr Glu Gly Leu Gly Tyr Leu Tyr Ala Glu Arg 1875 1880 1885 acc att gac cca agt gat tgg ttc ttc cag ttc cac ttc cac cttc gat 5712 Thr Ile Asp Pro Ser Asp Trp Phe Phe Gln Phe His Phe His Gln Asp 1890 1895 1900 ccg gtt atg cca ggc tcc tta ggt gtt gaa gca att att gaa acc atg 5760 Pro Val Met Pro Gly Ser Leu Gly Val Glu Ala Ile Ile Glu Thr Met 1905 1910 1915 caa gct tac gct att agt aaa gac ttg ggc gca gat ttc aaa aat cct 5808 Gln Ala Tyr Ala Ile Ser Lys Asp Leu Gly Ala Asp Phe Lys Asn Pro 1920 1925 1930 1935 aag ttt ggt cagt att tcg aac atc aag tgg aag tat cgc ggt caa 5856 Lys Phe Gly Gln Ile Leu Ser Asn Ile Lys Trp Lys Tyr Arg Gly Gln 1940 1945 1950 atc aat ccg ctg aac aag cag atg tct atg gat gtc ag c att act tca 5904 Ile Asn Pro Leu Asn Lys Gln Met Ser Met Asp Val Ser Ile Thr Ser 1955 1960 1965 atc aaa gat gaa gac ggt aag aaa gtc atc aca ggt aat gcc agc ttg 5952 Ile Lys Asp Glu Asp Gly Lys Lys Val Ile Thr Gly Asn Ala Ser Leu 1970 1975 1980 agt aaa gat ggt ctg cgc ata tac gag gtc ttc gat ata gct atc agc 6000 Ser Lys Asp Gly Leu Arg Ile Tyr Glu Val Phe Asp Ile Ala Ile Ser 1985 1990 1995 atc gaa gaa gta 6015 Ile Glu Glu Ser Val 2000 <210> 11 <211> 2005 <212> PRT <400> 1 Met Ser Leu Pro Asp Asn Ala Ser Asn His Leu Ser Ala Asn Gln Lys 1 5 10 15 Gly Ala Ser Gln Ala Ser Lys Thr Ser Lys Gln Ser Lys Ile Ala Ile 20 25 30 Val Gly Leu Ala Thr Leu Tyr Pro Asp Ala Lys Thr Pro Gln Glu Phe 35 40 45 Trp Gln Asn Leu Leu Asp Lys Arg Asp Ser Arg Ser Thr Leu Thr Asn 50 55 60 Glu Lys Leu Gly Ala Asn Ser Gln Asp Tyr Gln Gly Val Gln Gly Gln 65 70 75 80 Ser Asp Arg Phe Tyr Cys Asn Lys Gly Gly Tyr Ile Glu Asn Phe Ser 85 90 95 Phe Asn Ala Ala Gly Tyr Lys Leu Pro Glu Gln Ser Leu Asn Gly Leu 100 105 110 Asp Asp Ser Phe Leu Trp Ala Leu Asp Thr Ser Arg Asn Ala Leu Ile 115 120 125 Asp Ala Gly Ile Asp Ile Asn Gly Ala Asp Leu Ser Arg Ala Gly Val 130 135 140 Val Met Gly Ala Leu Ser Phe Pro Thr Thr Arg Ser Asn Asp Leu Phe 145 150 155 160 Leu Pro Ile Tyr His Ser Ala Val Glu Lys Ala Leu Gln Asp Lys Leu 165 170 175 Gly Val Lys Ala Phe Lys Leu Ser Pro Thr Asn Ala His Thr Ala Arg 180 185 190 Ala Ala Asn Glu Ser Ser Leu Asn Ala Ala Asn Gly Ala Ile Ala His 195 200 205 Asn Ser Ser L ys Val Val Ala Asp Ala Leu Gly Leu Gly Gly Ala Gln 210 215 220 Leu Ser Leu Asp Ala Ala Cys Ala Ser Ser Val Tyr Ser Leu Lys Leu 225 230 235 240 Ala Cys Asp Tyr Leu Ser Thr Gly Lys Ala Asp Ile Met Leu Ala Gly 245 250 255 Ala Val Ser Gly Ala Asp Pro Phe Phe Ile Asn Met Gly Phe Ser Ile 260 265 270 Phe His Ala Tyr Pro Asp His Gly Ile Ser Val Pro Phe Asp Ala Ser 275 280 285 Ser Lys Gly Leu Phe Ala Gly Glu Gly Ala Gly Val Leu Val Leu Lys 290 295 300 Arg Leu Glu Asp Ala Glu Arg Asp Asn Asp Lys Ile Tyr Ala Val Val 305 310 315 320 Ser Gly Val Gly Leu Ser Asn Asp Gly Lys Gly Gln Phe Val Leu Ser 325 330 335 Pro Asn Pro Lys Gly Gln Val Lys Ala Phe Glu Arg Ala Tyr Ala Ala 340 345 350 Ser Asp Ile Glu Pro Lys Asp Ile Glu Val Ile Glu Cys His Ala Thr 355 360 365 Gly Thr Pro Leu Gly Asp Lys Ile Glu Leu Thr Ser Met Glu Thr Phe 370 375 380 Phe Glu Asp Lys Leu Gln Gly Thr Asp Ala Pro Leu Ile Gly Ser Ala 385 390 395 400 Lys Ser Asn Leu Gly His Leu Leu Thr Ala Ala Gly Met Pro Gly Ile 405 410 415 Met Lys MetIle Phe Ala Met Lys Glu Gly Tyr Leu Pro Pro Ser Ile 420 425 430 Asn Ile Ser Asp Ala Ile Ala Ser Pro Lys Lys Leu Phe Gly Lys Pro 435 440 445 Thr Leu Pro Ser Met Val Gln Gly Trp Pro Asp Lys Pro Ser Asn Asn 450 455 460 His Phe Gly Val Arg Thr Arg His Ala Gly Val Ser Val Phe Gly Phe 465 470 475 480 Gly Gly Cys Asn Ala His Leu Leu Leu Glu Ser Tyr Asn Gly Lys Gly 485 490 490 495 Thr Val Lys Ala Glu Ala Thr Gln Val Pro Arg Gln Ala Glu Pro Leu 500 505 510 Lys Val Val Gly Leu Ala Ser His Phe Gly Pro Leu Ser Ser Ile Asn 515 520 525 Ala Leu Asn Asn Ala Val Thr Gln Asp Gly Asn Gly Phe Ile Glu Leu 530 535 540 Pro Lys Lys Arg Trp Lys Gly Leu Glu Lys His Ser Glu Leu Leu Ala 545 550 555 560 Glu Phe Gly Leu Ala Ser Ala Pro Lys Gly Ala Tyr Val Asp Asn Phe 565 570 575 Glu Leu Asp Phe Leu Arg Phe Lys Leu Pro Pro Asn Glu Asp Asp Arg 580 585 590 Leu Ile Ser Gln Gln Leu Met Leu Met Arg Val Thr Asp Glu Ala Ile 595 600 605 Arg Asp Ala Lys Leu Glu Pro Gly Gln Lys Val Ala Val Leu Val Ala 610 615 615 620 Met Glu Thr GluLeu Glu Leu His Gln Phe Arg Gly Arg Val Asn Leu 625 630 635 640 His Thr Gln Leu Ala Gln Ser Leu Ala Ala Met Gly Val Ser Leu Ser 645 650 655 Thr Asp Glu Tyr Gln Ala Leu Glu Ala Ile Ala Met Asp Ser Val Leu 660 665 670 Asp Ala Ala Lys Leu Asn Gln Tyr Thr Ser Phe Ile Gly Asn Ile Met 675 680 685 Ala Ser Arg Val Ala Ser Leu Trp Asp Phe Asn Gly Pro Ala Phe Thr 690 695 700 Ile Ser Ala Ala Glu Gln Ser Val Ser Arg Cys Ile Asp Val Ala Gln 705 710 715 715 720 Asn Leu Ile Met Glu Asp Asn Leu Asp Ala Val Val Ile Ala Ala Val 725 730 735 Asp Leu Ser Gly Ser Phe Glu Gln Val Ile Leu Lys Asn Ala Ile Ala 740 745 750 Pro Val Ala Ile Glu Pro Asn Leu Glu Ala Ser Leu Asn Pro Thr Ser 755 760 765 Ala Ser Trp Asn Val Gly Glu Gly Ala Gly Ala Val Val Leu Val Lys 770 775 780 780 Asn Glu Ala Thr Ser Gly Cys Ser Tyr Gly Gln Ile Asp Ala Leu Gly 785 790 795 800 Phe Ala Lys Thr Ala Glu Thr Ala Leu Ala Thr Asp Lys Leu Leu Ser 805 810 815 Gln Thr Ala Thr Asp Phe Asn Lys Val Lys Val Ile Glu Thr Met Ala 820 825 830 Ala Pro Ala Ser Gln Ile Gln Leu Ala Pro Ile Val Ser Ser Gln Val 835 840 845 Thr His Thr Ala Ala Glu Gln Arg Val Gly His Cys Phe Ala Ala Ala 850 855 860 Gly Met Ala Ser Leu Leu His Gly Leu Leu Asn Leu Asn Thr Val Ala 865 870 875 880 Gln Thr Asn Lys Ala Asn Cys Ala Leu Ile Asn Asn Ile Ser Glu Asn 885 890 895 Gln Leu Ser Gln Leu Leu Ile Ser Gln Thr Ala Ser Glu Gln Gln Ala 900 905 910 910 Leu Thr Ala Arg Leu Ser Asn Glu Leu Lys Ser Asp Ala Lys His Gln 915 920 925 925 Leu Val Lys Gln Val Thr Leu Gly Gly Arg Asp Ile Tyr Gln His Ile 930 935 940 Val Asp Thr Pro Leu Ala Ser Leu Glu Ser Ile Thr Gln Lys Leu Ala 945 950 955 960 Gln Ala Thr Ala Ser Thr Val Val Asn Gln Val Lys Pro Ile Lys Ala 965 970 975 Ala Gly Ser Val Glu Met Ala Asn Ser Phe Glu Thr Glu Ser Ser Ala 980 985 990 Glu Pro Gln Ile Thr Ile Ala Ala Gln Gln Thr Ala Asn Ile Gly Val 995 1000 1005 Thr Ala Gln Ala Thr Lys Arg Glu Leu Gly Thr Pro Pro Met Thr 1010 1015 1020 Asn Thr Ile Ala Asn Thr Ala Asn Asn Leu Asp Lys Thr Leu Glu Thr 1025 1030 1035 1040 Val Al a Gly Asn Thr Val Ala Ser Lys Val Gly Ser Gly Asp Ile Val 1045 1050 1055 Asn Phe Gln Gln Asn Gln Gln Leu Ala Gln Gln Ala His Leu Ala Phe 1060 1065 1070 Leu Glu Ser Arg Ser Ala Gly Met Lys Val Ala Asp Ala Leu Leu Lys 1075 1080 1085 Gln Gln Leu Ala Gln Val Thr Gly Gln Thr Ile Asp Asn Gln Ala Leu 1090 1095 1100 Asp Thr Gln Ala Val Asp Thr Gln Thr Ser Glu Asn Val Ala Ile Ala 1105 1110 1115 1120 Ala Glu Ser Pro Val Gln Val Thr Thr Pro Val Gln Val Thr Thr Pro 1125 1130 1135 Val Gln Ile Ser Val Val Glu Leu Lys Pro Asp His Ala Asn Val Pro 1140 1145 1150 Pro Tyr Thr Pro Pro Val Pro Ala Leu Lys Pro Cys Ile Trp Asn Tyr 1155 1160 1165 Ala Asp Leu Val Glu Tyr Ala Glu Gly Asp Ile Ala Lys Val Phe Gly 1170 1175 1180 Ser Asp Tyr Ala Ile Ile Asp Ser Tyr Ser Arg Arg Val Arg Leu Pro 1185 1190 1195 1200 Thr Thr Asp Tyr Leu Leu Val Ser Arg Val Thr Lys Leu Asp Ala Thr 1205 1210 1215 Ile Asn Gln Phe Lys Pro Cys Ser Met Thr Thr Glu Tyr Asp Ile Pro 1220 1225 1230 Val Asp Ala Pro Tyr Leu Val Asp Gly Gln Ile Pro Trp Ala Val Ala 1235 1240 1245 Val Glu Ser Gly Gln Cys Asp Leu Met Leu Ile Ser Tyr Leu Gly Ile 1250 1255 1260 Asp Phe Glu Asn Lys Gly Glu Glu Arg Val Tyr Arg Leu Leu Asp Cys Thr 1265 1270 1275 1280 Leu Thr Phe Leu Gly Asp Leu Pro Arg Gly Gly Asp Thr Leu Arg Tyr 1285 1290 1295 Asp Ile Lys Ile Asn Asn Tyr Ala Arg Asn Gly Asp Thr Leu Leu Phe 1300 1305 1310 Phe Phe Ser Tyr Glu Cys Phe Val Gly Asp Lys Met Ile Leu Lys Met 1315 1320 1325 Asp Gly Gly Cys Ala Gly Phe Phe Thr Asp Glu Glu Leu Ala Asp Gly 1330 1335 1340 Lys Gly Val Ile Arg Thr Glu Glu Glu Ile Lys Ala Arg Ser Leu Val 1345 1350 1355 1360 Gln Lys Gln Arg Phe Asn Pro Leu Leu Asp Cys Pro Lys Thr Gln Phe 1365 1370 1375 Ser Tyr Gly Asp Ile His Lys Leu Leu Thr Ala Asp Ile Glu Gly Cys 1380 1385 1390 Phe Gly Pro Ser His Ser Gly Val His Gln Pro Ser Leu Cys Phe Ala 1395 1400 1405 Ser Glu Lys Phe Leu Met Ile Glu Gln Val Ser Lys Val Asp Arg Thr 1410 1415 1420 Gly Gly Thr Trp Gly Leu Gly Leu Ile Glu Gly His Lys Gln Leu Glu 1425 1430 1435 1440 Ala A sp His Trp Tyr Phe Pro Cys His Phe Lys Gly Asp Gln Val Met 1445 1450 1455 Ala Gly Ser Leu Met Ala Glu Gly Cys Gly Gln Leu Leu Gln Phe Tyr 1460 1465 1470 Met Leu His Leu Gly Met His Thr Gln Thr Lys Asn Gly Arg Phe Gln 1475 1480 1485 Pro Leu Glu Asn Ala Ser Gln Gln Val Arg Cys Arg Gly Gln Val Leu 1490 1495 1500 Pro Gln Ser Gly Val Leu Thr Tyr Arg Met Glu Val Thr Glu Ile Gly 1505 1510 1515 1520 Phe Ser Pro Arg Pro Tyr Ala Lys Ala Asn Ile Asp Ile Leu Leu Asn 1525 1530 1535 Gly Lys Ala Val Val Asp Phe Gln Asn Leu Gly Val Met Ile Lys Glu 1540 1545 1550 Glu Asp Glu Cys Thr Arg Tyr Pro Leu Leu Thr Glu Ser Thr Thr Ala 1555 1560 1565 Ser Thr Ala Gln Val Asn Ala Gln Thr Ser Ala Lys Lys Val Tyr Lys 1570 1575 1580 Pro Ala Ser Val Asn Ala Pro Leu Met Ala Gln Ile Pro Asp Leu Thr 1585 1590 1595 1600 Lys Glu Pro Asn Lys Gly Val Ile Pro Ile Ser His Val Glu Ala Pro 1605 1610 1615 Ile Thr Pro Asp Tyr Pro Asn Arg Val Pro Asp Thr Val Pro Phe Thr 1620 1625 1630 Pro Tyr His Met Phe Glu Phe Ala Thr Gly Asn Il e Glu Asn Cys Phe 1635 1640 1645 Gly Pro Glu Phe Ser Ile Tyr Arg Gly Met Ile Pro Pro Arg Thr Pro 1650 1655 1660 Cys Gly Asp Leu Gln Val Thr Thr Arg Val Ile Glu Val Asn Gly Lys 1665 1670 1675 1680 Arg Gly Asp Phe Lys Lys Pro Ser Ser Cys Ile Ala Glu Tyr Glu Val 1685 1690 1695 Pro Ala Asp Ala Trp Tyr Phe Asp Lys Asn Ser His Gly Ala Val Met 1700 1705 1710 Pro Tyr Ser Ile Leu Met Glu Ile Ser Leu Gln Pro Asn Gly Phe Ile 1715 1720 1725 Ser Gly Tyr Met Gly Thr Thr Leu Gly Phe Pro Gly Leu Glu Leu Phe 1730 1735 1740 Phe Arg Asn Leu Asp Gly Ser Gly Glu Leu Leu Arg Glu Val Asp Leu 1745 1750 1755 1760 Arg Gly Lys Thr Ile Arg Asn Asp Ser Arg Leu Leu Ser Thr Val Met 1765 1770 1775 Ala Gly Thr Asn Ile Ile Gln Ser Phe Ser Phe Glu Leu Ser Thr Asp 1780 1785 1790 Gly Glu Pro Phe Tyr Arg Gly Thr Ala Val Phe Gly Tyr Phe Lys Gly 1795 1800 1805 Asp Ala Leu Lys Asp Gln Leu Gly Leu Asp Asn Gly Lys Val Thr Gln 1810 1815 1820 Pro Trp His Val Ala Asn Gly Val Ala Ala Ser Thr Lys Val Asn Leu 1825 1830 1835 1840 Leu Asp Lys Ser Cys Arg His Phe Asn Ala Pro Ala Asn Gln Pro His 1845 1850 1855 Tyr Arg Leu Ala Gly Gly Gln Leu Asn Phe Ile Asp Ser Val Glu Ile 1860 1865 1870 Val Asp Asn Gly Gly Thr Glu Gly Leu Gly Tyr Leu Tyr Ala Glu Arg 1875 1880 1885 Thr Ile Asp Pro Ser Asp Trp Phe Phe Gln Phe His Phe His Gln Asp 1890 1895 1900 Pro Val Met Pro Gly Ser Leu Gly Val Glu Ala Ile Ile Glu Thr Met 1905 1910 1915 1920 Gln Ala Tyr Ala Ile Ser Lys Asp Leu Gly Ala Asp Phe Lys Asn Pro 1925 1930 1935 Lys Phe Gly Gln Ile Leu Ser Asn Ile Lys Trp Lys Tyr Arg Gly Gln 1940 1945 1950 Ile Asn Pro Leu Asn Lys Gln Met Ser Met Asp Val Ser Ile Thr Ser 1955 1960 1965 Ile Lys Asp Glu Asp Gly Lys Lys Val Ile Thr Gly Asn Ala Ser Leu 1970 1975 1980 Ser Lys Asp Gly Leu Arg Ile Tyr Glu Val Phe Asp Ile Ala Ile Ser 1985 1990 1995 2000 Ile Glu Glu Ser Val 2005 <210> 12 <211> 1626 <212> DNA <213> Shewanella putrefaciens SCRC-2874 (FERM BP-1625) <400> 1 atg aat cct aca gca act aac gaa atg ctt tct ccg tgg cca tgg gct 48 Met Asn Pro Thr Ala Thr Asn Glu Met Leu Ser Pro Trp Pro Trp Ala 1 5 10 15 gtg aca gag tca aat atc agt ttt gac gtg caa gtg atg gaa caa caa 96 Val Thr Glu Ser Asn Ile Ser Phe Asp Val Gln Val Met Glu Gln Gln 20 25 30 ctt aaa gat ttt agc cgg gca tgt tac gtg gtc aat cat gcc gac cac 144 Leu Lys Asp Phe Ser Arg Ala Cys Tyr Val Val Asn His Ala Asp His 35 40 45 ggc ttt ggt att gcg caa act gcc gat atc gtg act gaa caa gcg gca 192 Gly Phe Gly Ile Ala Gln Thr Ala Asp Ile Val Thr Glu Gln Ala Ala 50 55 60 aac agc aca gat tta cct gtt agt gct ttt act cct gca tta ggt acc 240 Asn Ser Thr Asp Leu Pro Val Ser Ala Phe Thr Pro Ala Leu Gly Thr 65 70 75 80 gaa agc cta ggc gac aat aat ttc cgc cgc gtt cac ggc gtt aaa tac 288 Glu Ser Leu Gly Asp Asn Asn Phe Arg Arg Val His Gly Val Lys Tyr 85 90 95 gct tat tac gca ggc gct atg gca aac ggt att tca tct gaa gag cta 336 Ala Tyr Tyr Ala Gly Ala Met Ala Asn Gly Ile Ser Ser Glu Glu Leu 100 105 110 ggt att gcc cta ggt caa gct ggc att ttg tgt tcg ttt gga gca gcc 384 Val Ile Ala Leu Gly Gln Ala Gly Ile Leu Cys Ser Phe Gly Ala Ala 115 120 125 ggt ctt att cca agt cgc gtt gaa gcg gcat atac caa gca 432 Gly Leu Ile Pro Ser Arg Val Glu Ala Ala Ile Asn Arg Ile Gln Ala 130 135 140 gcg ctg cca aat ggc cct tat atg ttt aac ctt atc cat agt cct agc 480 Ala Leu Pro Asn Gly Pro Tyr Met Phe Asn Leu Ile His Ser Pro Ser 145 150 155 160 gag cca gca tta gag cgt ggc agc gta gag cta ttt tta aag cat aag 528 Glu Pro Ala Leu Glu Arg Gly Ser Val Glu Leu Phe Leu Lys His Lys 165 170 175 gta cgc acc gtt gaa gca tca gct ttc tta ggt cta aca cca caa atc 576 Val Arg Thr Val Glu Ala Ser Ala Phe Leu Gly Leu Thr Pro Gln Ile 180 185 190 gtc tat tac cgt gca gca gga ttg agc cga gac gca caa ggt aaa gtt 624 Val Tyr Tyr Arg Ala Ala Gly Leu Ser Arg Asp Ala Gln Gly Lys Val 195 200 205 gtg gtt ggt aac aag gtt atc gct aaa gta agt cgc acc gaa gtg gct 672 Val Val Gly Asn Lys Val Ile Ala Lys Val Ser Arg Thr Glu Val Ala210 215 220 gaa aag ttt atg atg cca gcg ccc gca aaa atg cta caa aaa cta gtt 720 Glu Lys Phe Met Met Pro Ala Pro Ala Lys Met Leu Gln Lys Leu Val 225 230 235 240 gat gac ggt tca att acc gct gag caa atg gag ctg gcg caa ctt gta 768 Asp Asp Gly Ser Ile Thr Ala Glu Gln Met Glu Leu Ala Gln Leu Val 245 250 255 cct atg gct gac gac atc act gca gag gcc gat tca ggt ggc cat act 816 Pro Met Ala Asp Asp Ile Thr Ala Glu Ala Asp Ser Gly Gly His Thr 260 265 270 gat aac cgt cca tta gta aca ttg ctg cca acc att tta gcg ctg aaa 864 Asp Asn Arg Pro Leu Val Thr Leu Leu Pro Thr Ile Leu Ala Leu Lys 275 280 285 285 gaa gaa att caa gct aaa tac caa tac gac act cct att cgt gtc ggt 912 Glu Glu Ile Gln Ala Lys Tyr Gln Tyr Asp Thr Pro Ile Arg Val Gly 290 295 300 tgt ggt ggc ggt gtg ggt acg cct gat gca gcg ctg gca ac ttt 960 Cys Gly Gly Gly Val Gly Thr Pro Asp Ala Ala Leu Ala Thr Phe Asn 305 310 315 320 atg ggc gcg gcg tat att gtt acc ggc tct atc aac caa gct tgt gtt 1008 Met Gly Ala Ala Tyr Ile Val Thr Gly Ser Ile As n Gln Ala Cys Val 325 330 335 gaa gcg ggc gca agt gat cac act cgt aaa tta ctt gcc acc act gaa 1056 Glu Ala Gly Ala Ser Asp His Thr Arg Lys Leu Leu Ala Thr Thr Glu 340 345 350 atg gcc gat gtg act atg gca cca gct gca gat atg ttc gag atg ggc 1104 Met Ala Asp Val Thr Met Ala Pro Ala Ala Asp Met Phe Glu Met Gly 355 360 365 gta aaa ctg cag gtg gtt aag cgc ggc acg cta ttc cca atg cgc Lec Val 1152 Gln Val Val Lys Arg Gly Thr Leu Phe Pro Met Arg Ala 370 375 380 aac aag cta tat gag atc tac acc cgt tac gat tca atc gaa gcg atc 1200 Asn Lys Leu Tyr Glu Ile Tyr Thr Arg Tyr Asp Ser Ile Glu Ala Ile 385 390 395 400 cca tta gac gag cgt gaa aag ctt gag aaa caa gta ttc cgc tca agc 1248 Pro Leu Asp Glu Arg Glu Lys Leu Glu Lys Gln Val Phe Arg Ser Ser 405 410 415 cta gat gaa ata tgg gca ggt aca gtg g ttt aac gag cgc gac 1296 Leu Asp Glu Ile Trp Ala Gly Thr Val Ala His Phe Asn Glu Arg Asp 420 425 430 cct aag caa atc gaa cgc gca gag ggt aac cct aag cgt aaa atg gca 1344 Pro Lys Gln Ile Glu Arg Ala Glu Gly Asn Pro Lys Arg Lys Met Ala 435 440 445 ttg att ttc cgt tgg tac tta ggt ctt tct agt cgc tgg tca aac tca 1392 Leu Ile Phe Arg Trp Tyr Leu Gly Leu Ser Ser Arg Trp Ser Asn Ser 450 455 460 ggc ga gtk ggt cgt gaa atg gat tat caa att tgg gct ggc cct gct 1440 Gly Glu Val Gly Arg Glu Met Asp Tyr Gln Ile Trp Ala Gly Pro Ala 465 470 470 475 480 ctc ggt gca ttt aac caa tgg gca aaa ggc agt tac tat 1488 Leu Gly Ala Phe Asn Gln Trp Ala Lys Gly Ser Tyr Leu Asp Asn Tyr 485 490 495 caa gac cga aat gcc gtc gat ttg gca aag cac tta atg tac ggc gcg 1536 Gln Asp Arg Asn Ala Val Asp Leu Ala Lys Met Tyr Gly Ala 500 505 510 gct tac tta aat cgt att aac tcg cta acg gct caa ggc gtt aaa gtg 1584 Ala Tyr Leu Asn Arg Ile Asn Ser Leu Thr Ala Gln Gly Val Lys Val 515 520 525 525 cca gca cag tta ctt cg aag cca aac caa aga atg gcc 1626 Pro Ala Gln Leu Leu Arg Trp Lys Pro Asn Gln Arg Met Ala 530 535 540 <210> 13 <211> 542 <212> PRT <400> 1 Met Asn Pro Thr Ala Thr Asn Glu Met Leu Ser Pro Trp Pro Trp Ala 1 5 10 15 Val Thr Glu Ser Asn Ile Ser Phe Asp Val Gln Val Met Glu Gln Gln 20 25 30 Leu Lys Asp Phe Ser Arg Ala Cys Tyr Val Val Asn His Ala Asp His 35 40 45 Gly Phe Gly Ile Ala Gln Thr Ala Asp Ile Val Thr Glu Gln Ala Ala 50 55 60 Asn Ser Thr Asp Leu Pro Val Ser Ala Phe Thr Pro Ala Leu Gly Thr 65 70 75 80 Glu Ser Leu Gly Asp Asn Asn Phe Arg Arg Val His Gly Val Lys Tyr 85 90 95 Ala Tyr Tyr Ala Gly Ala Met Ala Asn Gly Ile Ser Ser Glu Glu Leu 100 105 110 Val Ile Ala Leu Gly Gln Ala Gly Ile Leu Cys Ser Phe Gly Ala Ala 115 120 125 Gly Leu Ile Pro Ser Arg Val Glu Ala Ala Ile Asn Arg Ile Gln Ala 130 135 140 Ala Leu Pro Asn Gly Pro Tyr Met Phe Asn Leu Ile His Ser Pro Ser 145 150 155 160 Glu Pro Ala Leu Glu Arg Gly Ser Val Glu Leu Phe Leu Lys His Lys 165 170 175 Val Arg Thr Val Glu Ala Ser Ala Phe Leu Gly Leu Thr Pro Gln Ile 180 185 190 Val Tyr Tyr Arg Ala Ala Gly Leu Ser Arg Asp Ala Gln Gly Lys Val 195 200 205 Val Val Gly As n Lys Val Ile Ala Lys Val Ser Arg Thr Glu Val Ala 210 215 220 Glu Lys Phe Met Met Pro Ala Pro Ala Lys Met Leu Gln Lys Leu Val 225 230 235 240 Asp Asp Gly Ser Ile Thr Ala Glu Gln Met Glu Leu Ala Gln Leu Val 245 250 255 Pro Met Ala Asp Asp Ile Thr Ala Glu Ala Asp Ser Gly Gly His Thr 260 265 270 Asp Asn Arg Pro Leu Val Thr Leu Leu Pro Thr Ile Leu Ala Leu Lys 275 280 285 Glu Glu Ile Gln Ala Lys Tyr Gln Tyr Asp Thr Pro Ile Arg Val Gly 290 295 300 Cys Gly Gly Gly Val Gly Thr Pro Asp Ala Ala Leu Ala Thr Phe Asn 305 310 315 320 Met Gly Ala Ala Tyr Ile Val Thr Gly Ser Ile Asn Gln Ala Cys Val 325 330 335 Glu Ala Gly Ala Ser Asp His Thr Arg Lys Leu Leu Ala Thr Thr Glu 340 345 350 Met Ala Asp Val Thr Met Ala Pro Ala Ala Asp Met Phe Glu Met Gly 355 360 365 Val Lys Leu Gln Val Val Lys Arg Gly Thr Leu Phe Pro Met Arg Ala 370 375 380 Asn Lys Leu Tyr Glu Ile Tyr Thr Arg Tyr Asp Ser Ile Glu Ala Ile 385 390 395 400 Pro Leu Asp Glu Arg Glu Lys Leu Glu Lys Gln Val Phe Arg Ser Ser 405 410 415 Leu Asp Glu I le Trp Ala Gly Thr Val Ala His Phe Asn Glu Arg Asp 420 425 430 Pro Lys Gln Ile Glu Arg Ala Glu Gly Asn Pro Lys Arg Lys Met Ala 435 440 445 Leu Ile Phe Arg Trp Tyr Leu Gly Leu Ser Ser Arg Trp Ser Asn Ser 450 455 460 Gly Glu Val Gly Arg Glu Met Asp Tyr Gln Ile Trp Ala Gly Pro Ala 465 470 475 480 Leu Gly Ala Phe Asn Gln Trp Ala Lys Gly Ser Tyr Leu Asp Asn Tyr 485 490 495 Gln Asp Arg Asn Ala Val Asp Leu Ala Lys His Leu Met Tyr Gly Ala 500 505 510 Ala Tyr Leu Asn Arg Ile Asn Ser Leu Thr Ala Gln Gly Val Lys Val 515 520 520 Pro Ala Gln Leu Leu Arg Trp Lys Pro Asn Gln Arg Met Ala 530 535 540
【図1】 本発明のプラスミドの一態様であるpJRD
EPA−Sの構造を示す。FIG. 1 shows pJRD which is an embodiment of the plasmid of the present invention.
1 shows the structure of EPA-S.
【手続補正書】[Procedure amendment]
【提出日】平成12年8月29日(2000.8.2
9)[Submission date] August 29, 2000 (2008.2.
9)
【手続補正1】[Procedure amendment 1]
【補正対象書類名】明細書[Document name to be amended] Statement
【補正対象項目名】0013[Correction target item name] 0013
【補正方法】変更[Correction method] Change
【補正内容】[Correction contents]
【0013】実施例1 小型化プラスミドの作製 特開平8-242867に記載のプラスミドpEPAに挿入されたEP
A生合成遺伝子群のうち、EPA生合成に必須であるORF3、
6、7、8および9のサブクローニングを行った。ORF5、
6、7、8および9については、クローニングベクターpBSI
IKS(+)(Stratagene社製)のXbaI-SpeI部位にXbaI-SpeI
断片(23,045-31,443)、XbaI部位にXbaI-XbaI断片(1
2,314-23,045)SpeI部位にSpeI-NheI断片(31,443-32,5
14)を順次サブクローニングを行いΔX4XbNh/pBS を作
製した。ΔX4XbNh/pBS をNotIで処理したものをT4DNAポ
リメラーゼにより平滑末端を作り、それをXhoIで処理し
て DNA断片Aを得た。 また、ORF3については、R/pSTV28
(HpaI断片7,951-9,129を宝酒造製ベクターpSTV28のSma
I部位に挿入したもの)をEcoRI及びPstIで処理して切り
出した断片をpBSIIKS(+)のEcoRI-PstI部位に挿入してR/
pBSを作製した。R/pBSをPstIで処理後T4DNAポリメラー
ゼにより平滑末端を作りXhoIリンカーを導入してからXh
oIでORF3を含む断片を切り出しDNA断片Bを得た。広域宿
主ベクターであるpJRD215(カナマイシン及びストレプ
トマイシン耐性)のXhoI-StuI部位に断片Aをパッカジン
ラムダDNAパッケージングシステム(Promega社製)によ
り導入した後に、XhoI部位に断片BをDNA ライゲーショ
ンキット (宝酒造製)を用いて導入しプラスミドを完
成させた。これをpJRDEPA-Sと命名した(図1)。Example 1 Preparation of miniaturized plasmid EP inserted into plasmid pEPA described in JP-A-8-242867
ORF3, which is essential for EPA biosynthesis,
6, 7, 8 and 9 were subcloned. ORF5,
For 6, 7, 8 and 9, the cloning vector pBSI
XbaI-SpeI at the XbaI-SpeI site of IKS (+) (Stratagene)
Fragment (23,045-31,443), XbaI-XbaI fragment (1
2,314-23,045) SpeI-NheI fragment (31,443-32,5)
14) was sequentially subcloned to prepare ΔX4XbNh / pBS. ΔX4XbNh / pBS treated with NotI was blunt-ended with T4 DNA polymerase and treated with XhoI to obtain DNA fragment A. For ORF3, R / pSTV28
(HpaI fragment 7,951-9,129 was converted to Sma of Takara Shuzo vector pSTV28.
The fragment cut out after treating with EcoRI and PstI) was inserted into the EcoRI-PstI site of pBSIIKS (+) to
pBS was prepared. After treating R / pBS with PstI, blunt ends were created with T4 DNA polymerase and an XhoI linker was introduced, and then Xh
A fragment containing ORF3 was cut out with oI to obtain a DNA fragment B. Fragment A is introduced into the XhoI-StuI site of pJRD215 (kanamycin and streptomycin resistance), which is a broad-range host vector, using the Pacazine Lambda DNA packaging system (Promega), and fragment B is inserted into the XhoI site using a DNA ligation kit (Takara Shuzo). ) To complete the plasmid. This was named pJRDEPA-S (Figure 1).
【手続補正2】[Procedure amendment 2]
【補正対象書類名】明細書[Document name to be amended] Statement
【補正対象項目名】0025[Correction target item name] 0025
【補正方法】変更[Correction method] Change
【補正内容】[Correction contents]
【0025】[0025]
【配列表】 <110> Sagami Chemical Reserach Center, Japan Bioindustry Association, Director-General of Agency of Industrial Science and Technology. <120> A plasmid carrying the eicosapentaenoic acid synthesis gene claste r and a transgenic cyanobacterium producing eicosapentaenoic acid. <130> SO18226 <160> 13 [Sequence List] <110> Sagami Chemical Reserach Center, Japan Bioindustry Association, Director-General of Agency of Industrial Science and Technology. <120> A plasmid carrying the eicosapenta e noic acid synthesis gene claste r and a transgenic cyanobacterium producing eicosapenta e noic acid. <130> SO18226 <160> 13
───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.7 識別記号 FI テーマコート゛(参考) (C12N 1/12 (C12P 7/64 C12R 1:89) C12R 1:89) (C12P 7/64 C12N 15/00 ZNAA C12R 1:89) C12R 1:89) (72)発明者 湯 玲子 神奈川県横浜市緑区霧が丘3−24−4− 108 (72)発明者 山田 章子 神奈川県相模原市若松6−1−27−B− 305 (72)発明者 松永 是 東京都小金井市本町4−20−15 (72)発明者 竹山 春子 東京都府中市幸町2−40−C−106 (72)発明者 倉根 隆一郎 茨城県つくば市東1丁目1番3 工業技術 院生命工学工業技術研究所内 Fターム(参考) 4B024 AA03 BA07 CA04 DA06 GA14 HA01 4B064 AD90 CA02 CA08 CA19 CC24 DA16 4B065 AA01X AA01Y AA26X AB01 BA02 BA22 CA13 ──────────────────────────────────────────────────続 き Continued on the front page (51) Int.Cl. 7 Identification symbol FI Theme coat ゛ (Reference) (C12N 1/12 (C12P 7/64 C12R 1:89) C12R 1:89) (C12P 7/64 C12N 15 / 00 ZNAA C12R 1:89) C12R 1:89) (72) Inventor Reiko Yu 3-24-4-108, Kirigaoka, Midori-ku, Yokohama-shi, Kanagawa Prefecture (72) Inventor Akiko Yamada 6-1-, Wakamatsu, Sagamihara-shi, Kanagawa Prefecture 27-B-305 (72) Inventor, Makoto Matsunaga 4-20-15, Honmachi, Koganei-shi, Tokyo (72) Inventor Haruko Takeyama, 2-40-C-106, Sachimachi, Fuchu-shi, Tokyo (72) Inventor, Ryuichiro Kurane 1-3-1-3 Higashi, Tsukuba City, Ibaraki Prefecture F-term (Reference) 4B024 AA03 BA07 CA04 DA06 GA14 HA01 4B064 AD90 CA02 CA08 CA19 CC24 DA16 4B065 AA01X AA01Y AA26X AB01 BA02 BA22 CA 13
Claims (3)
2で示される塩基配列によってコードされたイコサペン
タエン酸生合成酵素群をコードする遺伝子群を、広域宿
主ベクターにクローニングして得られるプラスミド。1. SEQ ID NOs: 2, 4, 6, 8, 10, and 1
A plasmid obtained by cloning a gene group encoding a group of eicosapentaenoic acid biosynthetic enzymes encoded by the nucleotide sequence represented by 2 into a broad-range host vector.
3で示される塩基配列で表されるイコサペンタエン酸生
合成酵素群をコードする遺伝子群を、広域宿主ベクター
にクローニングして得られるプラスミド。2. SEQ ID NOs: 3, 5, 7, 9, 11, and 1
A plasmid obtained by cloning a gene group encoding the icosapentaenoic acid biosynthetic enzyme group represented by the nucleotide sequence represented by 3 into a broad-range host vector.
入して得られるイコサペンタエン酸を産生するラン藻。3. A cyanobacterium producing icosapentaenoic acid obtained by introducing the plasmid according to claim 1 or 2.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP32916999A JP4221476B2 (en) | 1999-11-19 | 1999-11-19 | Plasmid cloned icosapentaenoic acid biosynthesis genes and cyanobacteria producing icosapentaenoic acid |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP32916999A JP4221476B2 (en) | 1999-11-19 | 1999-11-19 | Plasmid cloned icosapentaenoic acid biosynthesis genes and cyanobacteria producing icosapentaenoic acid |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| JP2001145490A true JP2001145490A (en) | 2001-05-29 |
| JP4221476B2 JP4221476B2 (en) | 2009-02-12 |
Family
ID=18218426
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP32916999A Expired - Lifetime JP4221476B2 (en) | 1999-11-19 | 1999-11-19 | Plasmid cloned icosapentaenoic acid biosynthesis genes and cyanobacteria producing icosapentaenoic acid |
Country Status (1)
| Country | Link |
|---|---|
| JP (1) | JP4221476B2 (en) |
Cited By (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7807849B2 (en) | 2004-04-22 | 2010-10-05 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cells |
| US7834250B2 (en) | 2004-04-22 | 2010-11-16 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cells |
| US8809559B2 (en) | 2008-11-18 | 2014-08-19 | Commonwelath Scientific And Industrial Research Organisation | Enzymes and methods for producing omega-3 fatty acids |
| US8816111B2 (en) | 2012-06-15 | 2014-08-26 | Commonwealth Scientific And Industrial Research Organisation | Lipid comprising polyunsaturated fatty acids |
| US8816106B2 (en) | 2006-08-29 | 2014-08-26 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of fatty acids |
| US9718759B2 (en) | 2013-12-18 | 2017-08-01 | Commonwealth Scientific And Industrial Research Organisation | Lipid comprising docosapentaenoic acid |
| US10005713B2 (en) | 2014-06-27 | 2018-06-26 | Commonwealth Scientific And Industrial Research Organisation | Lipid compositions comprising triacylglycerol with long-chain polyunsaturated fatty acids at the sn-2 position |
-
1999
- 1999-11-19 JP JP32916999A patent/JP4221476B2/en not_active Expired - Lifetime
Cited By (39)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US9453183B2 (en) | 2004-04-22 | 2016-09-27 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cell |
| US7807849B2 (en) | 2004-04-22 | 2010-10-05 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cells |
| US7932438B2 (en) | 2004-04-22 | 2011-04-26 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cells |
| US8071341B2 (en) | 2004-04-22 | 2011-12-06 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cells |
| US8106226B2 (en) | 2004-04-22 | 2012-01-31 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cells |
| US8158392B1 (en) | 2004-04-22 | 2012-04-17 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cells |
| US8535917B2 (en) | 2004-04-22 | 2013-09-17 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cells |
| US8575377B2 (en) | 2004-04-22 | 2013-11-05 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cell |
| US8778644B2 (en) | 2004-04-22 | 2014-07-15 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cell |
| US10443079B2 (en) | 2004-04-22 | 2019-10-15 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cell |
| US10781463B2 (en) | 2004-04-22 | 2020-09-22 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cells |
| US9963723B2 (en) | 2004-04-22 | 2018-05-08 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cells |
| US8853432B2 (en) | 2004-04-22 | 2014-10-07 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cell |
| US11220698B2 (en) | 2004-04-22 | 2022-01-11 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cells |
| US7834250B2 (en) | 2004-04-22 | 2010-11-16 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cells |
| US9994880B2 (en) | 2004-04-22 | 2018-06-12 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cell |
| US9951357B2 (en) | 2004-04-22 | 2018-04-24 | Commonweatlh Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cell |
| US9970033B2 (en) | 2004-04-22 | 2018-05-15 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cell |
| US9458410B2 (en) | 2004-04-22 | 2016-10-04 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cell |
| US11597953B2 (en) | 2004-04-22 | 2023-03-07 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cells |
| US9926579B2 (en) | 2004-04-22 | 2018-03-27 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cell |
| US8816106B2 (en) | 2006-08-29 | 2014-08-26 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of fatty acids |
| US10513717B2 (en) | 2006-08-29 | 2019-12-24 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of fatty acids |
| US9938486B2 (en) | 2008-11-18 | 2018-04-10 | Commonwealth Scientific And Industrial Research Organisation | Enzymes and methods for producing omega-3 fatty acids |
| US8809559B2 (en) | 2008-11-18 | 2014-08-19 | Commonwelath Scientific And Industrial Research Organisation | Enzymes and methods for producing omega-3 fatty acids |
| US9932289B2 (en) | 2012-06-15 | 2018-04-03 | Commonwealth Scientific And Industrial Research Ogranisation | Process for producing ethyl esters of polyunsaturated fatty acids |
| US9556102B2 (en) | 2012-06-15 | 2017-01-31 | Commonwealth Scientific And Industrial Research Organisation | Process for producing ethyl esters of polyunsaturated fatty acids |
| US9550718B2 (en) | 2012-06-15 | 2017-01-24 | Commonwealth Scientific And Industrial Research Organisation | Lipid comprising polyunsaturated fatty acids |
| US8946460B2 (en) | 2012-06-15 | 2015-02-03 | Commonwealth Scientific And Industrial Research Organisation | Process for producing polyunsaturated fatty acids in an esterified form |
| US8816111B2 (en) | 2012-06-15 | 2014-08-26 | Commonwealth Scientific And Industrial Research Organisation | Lipid comprising polyunsaturated fatty acids |
| US10335386B2 (en) | 2012-06-15 | 2019-07-02 | Commonwealth Scientific And Industrial Research Organisation | Lipid comprising polyunsaturated fatty acids |
| US9718759B2 (en) | 2013-12-18 | 2017-08-01 | Commonwealth Scientific And Industrial Research Organisation | Lipid comprising docosapentaenoic acid |
| US10190073B2 (en) | 2013-12-18 | 2019-01-29 | Commonwealth Scientific And Industrial Research Organisation | Lipid comprising long chain polyunsaturated fatty acids |
| US10125084B2 (en) | 2013-12-18 | 2018-11-13 | Commonwealth Scientific And Industrial Research Organisation | Lipid comprising docosapentaenoic acid |
| US10800729B2 (en) | 2013-12-18 | 2020-10-13 | Commonwealth Scientific And Industrial Research Organisation | Lipid comprising long chain polyunsaturated fatty acids |
| US9725399B2 (en) | 2013-12-18 | 2017-08-08 | Commonwealth Scientific And Industrial Research Organisation | Lipid comprising long chain polyunsaturated fatty acids |
| US11623911B2 (en) | 2013-12-18 | 2023-04-11 | Commonwealth Scientific And Industrial Research Organisation | Lipid comprising docosapentaenoic acid |
| US10793507B2 (en) | 2014-06-27 | 2020-10-06 | Commonwealth Scientific And Industrial Research Organisation | Lipid compositions comprising triacylglycerol with long-chain polyunsaturated fatty acids at the SN-2 position |
| US10005713B2 (en) | 2014-06-27 | 2018-06-26 | Commonwealth Scientific And Industrial Research Organisation | Lipid compositions comprising triacylglycerol with long-chain polyunsaturated fatty acids at the sn-2 position |
Also Published As
| Publication number | Publication date |
|---|---|
| JP4221476B2 (en) | 2009-02-12 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| KR101539470B1 (en) | Chimera PUFA Polyketide Synthetic System and its Use | |
| KR101506347B1 (en) | Plant seed oils containing polyunsaturated fatty acids | |
| CA2359629C (en) | Schizochytrium pks genes | |
| AU673359B2 (en) | Gene which codes for eicosapentaenoic acid synthetase group and process for producing eicosapentaenoic acid | |
| TWI337619B (en) | Pufa polyketide synthase systems and uses thereof | |
| KR20070084187A (en) | PPFFA Polyketide Synthase System and Uses thereof | |
| US20070244192A1 (en) | Plant seed oils containing polyunsaturated fatty acids | |
| KR101234198B1 (en) | PUFA Polyketide Synthase Systems and Uses Thereof | |
| CN101473038A (en) | Plant seed oil containing polyunsaturated fatty acids | |
| AU727694B2 (en) | Process for producing icosapentaenoic acid by genetic recombination | |
| JP2001145490A (en) | Plasmid cloned icosapentaenoic acid biosynthesis genes and cyanobacteria producing icosapentaenoic acid | |
| CA2209987A1 (en) | Gene coding for eicosapentaenoic acid synthesizing enzymes and process for production of eicosapentaenoic acid | |
| JP2001169780A (en) | Genes of docosahexaenoic acid-producing bacteria | |
| JP2000217582A (en) | Genes encoding enzymes involved in biosynthesis of icosapentaenoic acid and / or docosahexaenoic acid | |
| US10087430B2 (en) | Factors for the production and accumulation of polyunsaturated fatty acids (PUFAS) derived from PUFA synthases | |
| JPH0646864A (en) | Gene encoding icosapentaenoic acid synthase group and method for producing icosapentaenoic acid | |
| JPH08242867A (en) | Genes encoding icosapentaenoic acid biosynthetic enzymes and method for producing icosapentaenoic acid | |
| Streit et al. | Are Key Growth Factors for Alfalfa Root Colonization by Rhizobium meliloti 1021 | |
| Taguchi et al. | Microbiology & Immunology fields |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20041202 |
|
| A711 | Notification of change in applicant |
Free format text: JAPANESE INTERMEDIATE CODE: A711 Effective date: 20041202 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A821 Effective date: 20041202 |
|
| A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20080708 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20080905 |
|
| TRDD | Decision of grant or rejection written | ||
| A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20081007 |
|
| A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 |
|
| R150 | Certificate of patent or registration of utility model |
Ref document number: 4221476 Country of ref document: JP Free format text: JAPANESE INTERMEDIATE CODE: R150 Free format text: JAPANESE INTERMEDIATE CODE: R150 |
|
| S533 | Written request for registration of change of name |
Free format text: JAPANESE INTERMEDIATE CODE: R313533 |
|
| R350 | Written notification of registration of transfer |
Free format text: JAPANESE INTERMEDIATE CODE: R350 |
|
| EXPY | Cancellation because of completion of term |