NO20130590L - Rekombinant adenovirus, isolert vertscelle inneholdende denne, sammensetning som omfatter det rekombinante adenoviruset og avendelse - Google Patents
Rekombinant adenovirus, isolert vertscelle inneholdende denne, sammensetning som omfatter det rekombinante adenoviruset og avendelseInfo
- Publication number
- NO20130590L NO20130590L NO20130590A NO20130590A NO20130590L NO 20130590 L NO20130590 L NO 20130590L NO 20130590 A NO20130590 A NO 20130590A NO 20130590 A NO20130590 A NO 20130590A NO 20130590 L NO20130590 L NO 20130590L
- Authority
- NO
- Norway
- Prior art keywords
- thr
- leu
- ala
- ser
- asn
- Prior art date
Links
- 241000701161 unidentified adenovirus Species 0.000 title claims 22
- 108090000623 proteins and genes Proteins 0.000 claims abstract 4
- 241000990167 unclassified Simian adenoviruses Species 0.000 claims abstract 3
- 239000013598 vector Substances 0.000 claims abstract 3
- 108010050848 glycylleucine Proteins 0.000 claims 109
- 241001122767 Theaceae Species 0.000 claims 81
- 108010057821 leucylproline Proteins 0.000 claims 36
- 108010093581 aspartyl-proline Proteins 0.000 claims 33
- 108010061238 threonyl-glycine Proteins 0.000 claims 33
- 108010037850 glycylvaline Proteins 0.000 claims 32
- 108010064235 lysylglycine Proteins 0.000 claims 31
- 108010051242 phenylalanylserine Proteins 0.000 claims 31
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 claims 29
- 108010089804 glycyl-threonine Proteins 0.000 claims 29
- 108010026333 seryl-proline Proteins 0.000 claims 29
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 claims 28
- 241000880493 Leptailurus serval Species 0.000 claims 27
- 108010047495 alanylglycine Proteins 0.000 claims 27
- 108010047857 aspartylglycine Proteins 0.000 claims 27
- 108010034529 leucyl-lysine Proteins 0.000 claims 26
- 108010012058 leucyltyrosine Proteins 0.000 claims 25
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 claims 24
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 claims 24
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 claims 22
- 108010049041 glutamylalanine Proteins 0.000 claims 22
- 108010003700 lysyl aspartic acid Proteins 0.000 claims 22
- 108010020532 tyrosyl-proline Proteins 0.000 claims 22
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 claims 21
- 108010047562 NGR peptide Proteins 0.000 claims 21
- 241001217856 Chimpanzee adenovirus Species 0.000 claims 20
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 claims 20
- 108010087924 alanylproline Proteins 0.000 claims 20
- 108010029599 tyrosyl-glutamyl-tryptophan Proteins 0.000 claims 20
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 claims 19
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 claims 19
- 108010077245 asparaginyl-proline Proteins 0.000 claims 19
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 claims 18
- 108010065920 Insulin Lispro Proteins 0.000 claims 18
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 claims 18
- 108010051110 tyrosyl-lysine Proteins 0.000 claims 18
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 claims 17
- IPWKGIFRRBGCJO-IMJSIDKUSA-N Ala-Ser Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](CO)C([O-])=O IPWKGIFRRBGCJO-IMJSIDKUSA-N 0.000 claims 17
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 claims 17
- 108010079364 N-glycylalanine Proteins 0.000 claims 17
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 claims 17
- 108010040443 aspartyl-aspartic acid Proteins 0.000 claims 17
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 claims 16
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 claims 16
- QHAJMRDEWNAIBQ-FXQIFTODSA-N Asp-Arg-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O QHAJMRDEWNAIBQ-FXQIFTODSA-N 0.000 claims 16
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 claims 16
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 claims 16
- NRFTYDWKWGJLAR-MELADBBJSA-N Tyr-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O NRFTYDWKWGJLAR-MELADBBJSA-N 0.000 claims 16
- 108010069205 aspartyl-phenylalanine Proteins 0.000 claims 16
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 claims 16
- 108010073969 valyllysine Proteins 0.000 claims 16
- DDPKBJZLAXLQGZ-KBIXCLLPSA-N Ala-Val-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DDPKBJZLAXLQGZ-KBIXCLLPSA-N 0.000 claims 15
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 claims 15
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 claims 15
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 claims 15
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 claims 15
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 claims 15
- 108010081551 glycylphenylalanine Proteins 0.000 claims 15
- 108010090894 prolylleucine Proteins 0.000 claims 15
- AEZCCDMZZJOGII-DCAQKATOSA-N Asn-Met-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O AEZCCDMZZJOGII-DCAQKATOSA-N 0.000 claims 14
- AIJAPFVDBFYNKN-WHFBIAKZSA-N Gly-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN)C(=O)N AIJAPFVDBFYNKN-WHFBIAKZSA-N 0.000 claims 14
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 claims 14
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 claims 14
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 claims 14
- IALSFJSONJZBKB-HRCADAONSA-N Pro-Tyr-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N3CCC[C@@H]3C(=O)O IALSFJSONJZBKB-HRCADAONSA-N 0.000 claims 14
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 claims 14
- ISLDRLHVPXABBC-IEGACIPQSA-N Thr-Leu-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ISLDRLHVPXABBC-IEGACIPQSA-N 0.000 claims 14
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 claims 14
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 claims 14
- 108010036533 arginylvaline Proteins 0.000 claims 14
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 claims 14
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 claims 13
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 claims 13
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 claims 13
- CASGONAXMZPHCK-FXQIFTODSA-N Asp-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N CASGONAXMZPHCK-FXQIFTODSA-N 0.000 claims 13
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 claims 13
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 claims 13
- WECYCNFPGZLOOU-FXQIFTODSA-N Pro-Asn-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O WECYCNFPGZLOOU-FXQIFTODSA-N 0.000 claims 13
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 claims 13
- SWIKDOUVROTZCW-GCJQMDKQSA-N Thr-Asn-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O SWIKDOUVROTZCW-GCJQMDKQSA-N 0.000 claims 13
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 claims 13
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 claims 13
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 claims 13
- PFNZJEPSCBAVGX-CYDGBPFRSA-N Val-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N PFNZJEPSCBAVGX-CYDGBPFRSA-N 0.000 claims 13
- PMKQKNBISAOSRI-XHSDSOJGSA-N Val-Tyr-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N PMKQKNBISAOSRI-XHSDSOJGSA-N 0.000 claims 13
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 claims 13
- 108010041407 alanylaspartic acid Proteins 0.000 claims 13
- 108010038633 aspartylglutamate Proteins 0.000 claims 13
- 108010036413 histidylglycine Proteins 0.000 claims 13
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 claims 13
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 claims 12
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 claims 12
- OROMFUQQTSWUTI-IHRRRGAJSA-N Asn-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OROMFUQQTSWUTI-IHRRRGAJSA-N 0.000 claims 12
- UTLCRGFJFSZWAW-OLHMAJIHSA-N Asp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UTLCRGFJFSZWAW-OLHMAJIHSA-N 0.000 claims 12
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 claims 12
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 claims 12
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 claims 12
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 claims 12
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 claims 12
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 claims 12
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 claims 12
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 claims 12
- DGDCHPCRMWEOJR-FQPOAREZSA-N Thr-Ala-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DGDCHPCRMWEOJR-FQPOAREZSA-N 0.000 claims 12
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 claims 12
- JONPRIHUYSPIMA-UWJYBYFXSA-N Tyr-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JONPRIHUYSPIMA-UWJYBYFXSA-N 0.000 claims 12
- 108010013835 arginine glutamate Proteins 0.000 claims 12
- 108010091092 arginyl-glycyl-proline Proteins 0.000 claims 12
- 108010068265 aspartyltyrosine Proteins 0.000 claims 12
- 108010033011 des-Arg- enterostatin Proteins 0.000 claims 12
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 claims 12
- 108010087823 glycyltyrosine Proteins 0.000 claims 12
- 108010079317 prolyl-tyrosine Proteins 0.000 claims 12
- 108010015385 valyl-prolyl-proline Proteins 0.000 claims 12
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 claims 11
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 claims 11
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 claims 11
- AOAKQKVICDWCLB-UWJYBYFXSA-N Ala-Tyr-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AOAKQKVICDWCLB-UWJYBYFXSA-N 0.000 claims 11
- ZXKNLCPUNZPFGY-LEWSCRJBSA-N Ala-Tyr-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N ZXKNLCPUNZPFGY-LEWSCRJBSA-N 0.000 claims 11
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 claims 11
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 claims 11
- JJGRJMKUOYXZRA-LPEHRKFASA-N Asn-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O JJGRJMKUOYXZRA-LPEHRKFASA-N 0.000 claims 11
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 claims 11
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 claims 11
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 claims 11
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 claims 11
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 claims 11
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 claims 11
- UUTGYDAKPISJAO-JYJNAYRXSA-N Glu-Tyr-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 UUTGYDAKPISJAO-JYJNAYRXSA-N 0.000 claims 11
- QAMMIGULQSIRCD-IRXDYDNUSA-N Gly-Phe-Tyr Chemical compound C([C@H](NC(=O)C[NH3+])C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C([O-])=O)C1=CC=CC=C1 QAMMIGULQSIRCD-IRXDYDNUSA-N 0.000 claims 11
- AVQOSMRPITVTRB-CIUDSAMLSA-N His-Asn-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AVQOSMRPITVTRB-CIUDSAMLSA-N 0.000 claims 11
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 claims 11
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 claims 11
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 claims 11
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 claims 11
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 claims 11
- NHDMNXBBSGVYGP-PYJNHQTQSA-N Met-His-Ile Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)CC1=CN=CN1 NHDMNXBBSGVYGP-PYJNHQTQSA-N 0.000 claims 11
- AYPMIIKUMNADSU-IHRRRGAJSA-N Phe-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AYPMIIKUMNADSU-IHRRRGAJSA-N 0.000 claims 11
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 claims 11
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 claims 11
- ZSLFCBHEINFXRS-LPEHRKFASA-N Ser-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ZSLFCBHEINFXRS-LPEHRKFASA-N 0.000 claims 11
- GIAZPLMMQOERPN-YUMQZZPRSA-N Val-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(O)=O GIAZPLMMQOERPN-YUMQZZPRSA-N 0.000 claims 11
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 claims 11
- DVLWZWNAQUBZBC-ZNSHCXBVSA-N Val-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N)O DVLWZWNAQUBZBC-ZNSHCXBVSA-N 0.000 claims 11
- 108010029539 arginyl-prolyl-proline Proteins 0.000 claims 11
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 claims 11
- 108010018006 histidylserine Proteins 0.000 claims 11
- 108010038320 lysylphenylalanine Proteins 0.000 claims 11
- 108010022588 methionyl-lysyl-proline Proteins 0.000 claims 11
- 108010005942 methionylglycine Proteins 0.000 claims 11
- 108010024607 phenylalanylalanine Proteins 0.000 claims 11
- 108010031719 prolyl-serine Proteins 0.000 claims 11
- 108010029020 prolylglycine Proteins 0.000 claims 11
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 claims 11
- 108010058119 tryptophyl-glycyl-glycine Proteins 0.000 claims 11
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 claims 10
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 claims 10
- ZIBWKCRKNFYTPT-ZKWXMUAHSA-N Ala-Asn-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZIBWKCRKNFYTPT-ZKWXMUAHSA-N 0.000 claims 10
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 claims 10
- PEIBBAXIKUAYGN-UBHSHLNASA-N Ala-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 PEIBBAXIKUAYGN-UBHSHLNASA-N 0.000 claims 10
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 claims 10
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 claims 10
- BEHQTVDBCLSCBY-CFMVVWHZSA-N Asn-Tyr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BEHQTVDBCLSCBY-CFMVVWHZSA-N 0.000 claims 10
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 claims 10
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 claims 10
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 claims 10
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 claims 10
- DSDPLOODKXISDT-XUXIUFHCSA-N Ile-Leu-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DSDPLOODKXISDT-XUXIUFHCSA-N 0.000 claims 10
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 claims 10
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 claims 10
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 claims 10
- FGZVGOAAROXFAB-IXOXFDKPSA-N Leu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N)O FGZVGOAAROXFAB-IXOXFDKPSA-N 0.000 claims 10
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 claims 10
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 claims 10
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 claims 10
- TWPCWKVOZDUYAA-KKUMJFAQSA-N Lys-Phe-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O TWPCWKVOZDUYAA-KKUMJFAQSA-N 0.000 claims 10
- IPTUBUUIFRZMJK-ACRUOGEOSA-N Lys-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 IPTUBUUIFRZMJK-ACRUOGEOSA-N 0.000 claims 10
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 claims 10
- DDYIRGBOZVKRFR-AVGNSLFASA-N Phe-Asp-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DDYIRGBOZVKRFR-AVGNSLFASA-N 0.000 claims 10
- BNRFQGLWLQESBG-YESZJQIVSA-N Phe-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BNRFQGLWLQESBG-YESZJQIVSA-N 0.000 claims 10
- SFECXGVELZFBFJ-VEVYYDQMSA-N Pro-Asp-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFECXGVELZFBFJ-VEVYYDQMSA-N 0.000 claims 10
- WLJYLAQSUSIQNH-GUBZILKMSA-N Pro-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@@H]1CCCN1 WLJYLAQSUSIQNH-GUBZILKMSA-N 0.000 claims 10
- BJCXXMGGPHRSHV-GUBZILKMSA-N Pro-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BJCXXMGGPHRSHV-GUBZILKMSA-N 0.000 claims 10
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 claims 10
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 claims 10
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 claims 10
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 claims 10
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 claims 10
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 claims 10
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 claims 10
- QIVPZSWBBHRNBA-JYJNAYRXSA-N Val-Pro-Phe Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O QIVPZSWBBHRNBA-JYJNAYRXSA-N 0.000 claims 10
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 claims 10
- LMVWCLDJNSBOEA-FKBYEOEOSA-N Val-Tyr-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N LMVWCLDJNSBOEA-FKBYEOEOSA-N 0.000 claims 10
- 108010005233 alanylglutamic acid Proteins 0.000 claims 10
- 108010070944 alanylhistidine Proteins 0.000 claims 10
- 108010092854 aspartyllysine Proteins 0.000 claims 10
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 claims 10
- 108010015792 glycyllysine Proteins 0.000 claims 10
- 108010017391 lysylvaline Proteins 0.000 claims 10
- 108010085203 methionylmethionine Proteins 0.000 claims 10
- 108010015796 prolylisoleucine Proteins 0.000 claims 10
- 108010071207 serylmethionine Proteins 0.000 claims 10
- 108010001055 thymocartin Proteins 0.000 claims 10
- 108010003137 tyrosyltyrosine Proteins 0.000 claims 10
- XQJAFSDFQZPYCU-UWJYBYFXSA-N Ala-Asn-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N XQJAFSDFQZPYCU-UWJYBYFXSA-N 0.000 claims 9
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 claims 9
- YCYXHLZRUSJITQ-SRVKXCTJSA-N Arg-Pro-Pro Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 YCYXHLZRUSJITQ-SRVKXCTJSA-N 0.000 claims 9
- KSBHCUSPLWRVEK-ZLUOBGJFSA-N Asn-Asn-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KSBHCUSPLWRVEK-ZLUOBGJFSA-N 0.000 claims 9
- HUAOKVVEVHACHR-CIUDSAMLSA-N Asn-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N HUAOKVVEVHACHR-CIUDSAMLSA-N 0.000 claims 9
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 claims 9
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 claims 9
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 claims 9
- XTHUKRLJRUVVBF-WHFBIAKZSA-N Cys-Gly-Ser Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O XTHUKRLJRUVVBF-WHFBIAKZSA-N 0.000 claims 9
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 claims 9
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 claims 9
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 claims 9
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 claims 9
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 claims 9
- MPCKIRSXNKACRF-GUBZILKMSA-N Met-Pro-Asn Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O MPCKIRSXNKACRF-GUBZILKMSA-N 0.000 claims 9
- LWPMGKSZPKFKJD-DZKIICNBSA-N Phe-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O LWPMGKSZPKFKJD-DZKIICNBSA-N 0.000 claims 9
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 claims 9
- LCUOTSLIVGSGAU-AVGNSLFASA-N Pro-His-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LCUOTSLIVGSGAU-AVGNSLFASA-N 0.000 claims 9
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 claims 9
- GFHXZNVJIKMAGO-IHRRRGAJSA-N Pro-Phe-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GFHXZNVJIKMAGO-IHRRRGAJSA-N 0.000 claims 9
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 claims 9
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 claims 9
- GDUZTEQRAOXYJS-SRVKXCTJSA-N Ser-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GDUZTEQRAOXYJS-SRVKXCTJSA-N 0.000 claims 9
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 claims 9
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 claims 9
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 claims 9
- WRQLCVIALDUQEQ-UNQGMJICSA-N Thr-Phe-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WRQLCVIALDUQEQ-UNQGMJICSA-N 0.000 claims 9
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 claims 9
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 claims 9
- AKLNEFNQWLHIGY-QWRGUYRKSA-N Tyr-Gly-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N)O AKLNEFNQWLHIGY-QWRGUYRKSA-N 0.000 claims 9
- BVWPHWLFGRCECJ-JSGCOSHPSA-N Val-Gly-Tyr Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N BVWPHWLFGRCECJ-JSGCOSHPSA-N 0.000 claims 9
- WSUWDIVCPOJFCX-TUAOUCFPSA-N Val-Met-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N WSUWDIVCPOJFCX-TUAOUCFPSA-N 0.000 claims 9
- PWCJARIQERIIGF-BZSNNMDCSA-N Val-Met-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N PWCJARIQERIIGF-BZSNNMDCSA-N 0.000 claims 9
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 claims 9
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 claims 9
- 239000000835 fiber Substances 0.000 claims 9
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 claims 9
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 claims 9
- 108010009298 lysylglutamic acid Proteins 0.000 claims 9
- 108010054155 lysyllysine Proteins 0.000 claims 9
- 108010077112 prolyl-proline Proteins 0.000 claims 9
- GNYUVVJYGJFKHN-RVMXOQNASA-N Arg-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GNYUVVJYGJFKHN-RVMXOQNASA-N 0.000 claims 8
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 claims 8
- AIFHRTPABBBHKU-RCWTZXSCSA-N Arg-Thr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AIFHRTPABBBHKU-RCWTZXSCSA-N 0.000 claims 8
- QMQZYILAWUOLPV-JYJNAYRXSA-N Arg-Tyr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)CC1=CC=C(O)C=C1 QMQZYILAWUOLPV-JYJNAYRXSA-N 0.000 claims 8
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 claims 8
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 claims 8
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 claims 8
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 claims 8
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 claims 8
- PWUMCBLVWPCKNO-MGHWNKPDSA-N Ile-Leu-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PWUMCBLVWPCKNO-MGHWNKPDSA-N 0.000 claims 8
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 claims 8
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 claims 8
- RRSLQOLASISYTB-CIUDSAMLSA-N Leu-Cys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O RRSLQOLASISYTB-CIUDSAMLSA-N 0.000 claims 8
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 claims 8
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 claims 8
- QUCDKEKDPYISNX-HJGDQZAQSA-N Lys-Asn-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QUCDKEKDPYISNX-HJGDQZAQSA-N 0.000 claims 8
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 claims 8
- MLQVJYMFASXBGZ-IHRRRGAJSA-N Pro-Asn-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O MLQVJYMFASXBGZ-IHRRRGAJSA-N 0.000 claims 8
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 claims 8
- KLSOMAFWRISSNI-OSUNSFLBSA-N Pro-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 KLSOMAFWRISSNI-OSUNSFLBSA-N 0.000 claims 8
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 claims 8
- CPRLKHJUFAXVTD-ULQDDVLXSA-N Pro-Leu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CPRLKHJUFAXVTD-ULQDDVLXSA-N 0.000 claims 8
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 claims 8
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 claims 8
- YIUWWXVTYLANCJ-NAKRPEOUSA-N Ser-Ile-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YIUWWXVTYLANCJ-NAKRPEOUSA-N 0.000 claims 8
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 claims 8
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 claims 8
- XNXRTQZTFVMJIJ-DCAQKATOSA-N Ser-Met-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNXRTQZTFVMJIJ-DCAQKATOSA-N 0.000 claims 8
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 claims 8
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 claims 8
- PZVGOVRNGKEFCB-KKHAAJSZSA-N Thr-Asn-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N)O PZVGOVRNGKEFCB-KKHAAJSZSA-N 0.000 claims 8
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 claims 8
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 claims 8
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 claims 8
- KZURUCDWKDEAFZ-XVSYOHENSA-N Thr-Phe-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O KZURUCDWKDEAFZ-XVSYOHENSA-N 0.000 claims 8
- NDXSOKGYKCGYKT-VEVYYDQMSA-N Thr-Pro-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O NDXSOKGYKCGYKT-VEVYYDQMSA-N 0.000 claims 8
- PWPJLBWYRTVYQS-PMVMPFDFSA-N Trp-Phe-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PWPJLBWYRTVYQS-PMVMPFDFSA-N 0.000 claims 8
- IIJWXEUNETVJPV-IHRRRGAJSA-N Tyr-Arg-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N)O IIJWXEUNETVJPV-IHRRRGAJSA-N 0.000 claims 8
- NVZVJIUDICCMHZ-BZSNNMDCSA-N Tyr-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O NVZVJIUDICCMHZ-BZSNNMDCSA-N 0.000 claims 8
- GZWPQZDVTBZVEP-BZSNNMDCSA-N Tyr-Tyr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O GZWPQZDVTBZVEP-BZSNNMDCSA-N 0.000 claims 8
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 claims 8
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 claims 8
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 claims 8
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 claims 8
- NZGOVKLVQNOEKP-YDHLFZDLSA-N Val-Phe-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NZGOVKLVQNOEKP-YDHLFZDLSA-N 0.000 claims 8
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 claims 8
- RLVTVHSDKHBFQP-ULQDDVLXSA-N Val-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 RLVTVHSDKHBFQP-ULQDDVLXSA-N 0.000 claims 8
- 108010062796 arginyllysine Proteins 0.000 claims 8
- 108010016616 cysteinylglycine Proteins 0.000 claims 8
- 108010069495 cysteinyltyrosine Proteins 0.000 claims 8
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 claims 8
- 108010092114 histidylphenylalanine Proteins 0.000 claims 8
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 claims 8
- 108010073025 phenylalanylphenylalanine Proteins 0.000 claims 8
- 108010070643 prolylglutamic acid Proteins 0.000 claims 8
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 claims 7
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 claims 7
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 claims 7
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 claims 7
- VQAVBBCZFQAAED-FXQIFTODSA-N Ala-Pro-Asn Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N VQAVBBCZFQAAED-FXQIFTODSA-N 0.000 claims 7
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 claims 7
- NABSCJGZKWSNHX-RCWTZXSCSA-N Arg-Arg-Thr Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NABSCJGZKWSNHX-RCWTZXSCSA-N 0.000 claims 7
- KWTVWJPNHAOREN-IHRRRGAJSA-N Arg-Asn-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KWTVWJPNHAOREN-IHRRRGAJSA-N 0.000 claims 7
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 claims 7
- DDPXDCKYWDGZAL-BQBZGAKWSA-N Asn-Gly-Arg Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N DDPXDCKYWDGZAL-BQBZGAKWSA-N 0.000 claims 7
- LTZIRYMWOJHRCH-GUDRVLHUSA-N Asn-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N LTZIRYMWOJHRCH-GUDRVLHUSA-N 0.000 claims 7
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 claims 7
- MYVBTYXSWILFCG-BQBZGAKWSA-N Asn-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N MYVBTYXSWILFCG-BQBZGAKWSA-N 0.000 claims 7
- YUUIAUXBNOHFRJ-IHRRRGAJSA-N Asn-Phe-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O YUUIAUXBNOHFRJ-IHRRRGAJSA-N 0.000 claims 7
- SZNGQSBRHFMZLT-IHRRRGAJSA-N Asn-Pro-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SZNGQSBRHFMZLT-IHRRRGAJSA-N 0.000 claims 7
- HOQGTAIGQSDCHR-SRVKXCTJSA-N Asp-Asn-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HOQGTAIGQSDCHR-SRVKXCTJSA-N 0.000 claims 7
- LGGHQRZIJSYRHA-GUBZILKMSA-N Asp-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N LGGHQRZIJSYRHA-GUBZILKMSA-N 0.000 claims 7
- SQJSYLDKQBZQTG-FXQIFTODSA-N Cys-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N SQJSYLDKQBZQTG-FXQIFTODSA-N 0.000 claims 7
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 claims 7
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 claims 7
- FMBWLLMUPXTXFC-SDDRHHMPSA-N Glu-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N)C(=O)O FMBWLLMUPXTXFC-SDDRHHMPSA-N 0.000 claims 7
- UMZHHILWZBFPGL-LOKLDPHHSA-N Glu-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O UMZHHILWZBFPGL-LOKLDPHHSA-N 0.000 claims 7
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 claims 7
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 claims 7
- YSDLIYZLOTZZNP-UWVGGRQHSA-N Gly-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN YSDLIYZLOTZZNP-UWVGGRQHSA-N 0.000 claims 7
- FXLVSYVJDPCIHH-STQMWFEESA-N Gly-Phe-Arg Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FXLVSYVJDPCIHH-STQMWFEESA-N 0.000 claims 7
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 claims 7
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 claims 7
- ONSARSFSJHTMFJ-STQMWFEESA-N Gly-Trp-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ONSARSFSJHTMFJ-STQMWFEESA-N 0.000 claims 7
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 claims 7
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 claims 7
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 claims 7
- SWBUZLFWGJETAO-KKUMJFAQSA-N His-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O SWBUZLFWGJETAO-KKUMJFAQSA-N 0.000 claims 7
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 claims 7
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 claims 7
- CAHCWMVNBZJVAW-NAKRPEOUSA-N Ile-Pro-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)O)N CAHCWMVNBZJVAW-NAKRPEOUSA-N 0.000 claims 7
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 claims 7
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 claims 7
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 claims 7
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 claims 7
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 claims 7
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 claims 7
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 claims 7
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 claims 7
- YLBUMXYVQCHBPR-ULQDDVLXSA-N Met-Leu-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YLBUMXYVQCHBPR-ULQDDVLXSA-N 0.000 claims 7
- BEZJTLKUMFMITF-AVGNSLFASA-N Met-Lys-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCNC(N)=N BEZJTLKUMFMITF-AVGNSLFASA-N 0.000 claims 7
- WYNIRYZIFZGWQD-BPUTZDHNSA-N Met-Trp-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N WYNIRYZIFZGWQD-BPUTZDHNSA-N 0.000 claims 7
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 claims 7
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 claims 7
- KIAWKQJTSGRCSA-AVGNSLFASA-N Phe-Asn-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KIAWKQJTSGRCSA-AVGNSLFASA-N 0.000 claims 7
- MQVFHOPCKNTHGT-MELADBBJSA-N Phe-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O MQVFHOPCKNTHGT-MELADBBJSA-N 0.000 claims 7
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 claims 7
- XNMYNGDKJNOKHH-BZSNNMDCSA-N Phe-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XNMYNGDKJNOKHH-BZSNNMDCSA-N 0.000 claims 7
- SHUFSZDAIPLZLF-BEAPCOKYSA-N Phe-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O SHUFSZDAIPLZLF-BEAPCOKYSA-N 0.000 claims 7
- OBVCYFIHIIYIQF-CIUDSAMLSA-N Pro-Asn-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OBVCYFIHIIYIQF-CIUDSAMLSA-N 0.000 claims 7
- YKQNVTOIYFQMLW-IHRRRGAJSA-N Pro-Cys-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 YKQNVTOIYFQMLW-IHRRRGAJSA-N 0.000 claims 7
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 claims 7
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 claims 7
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 claims 7
- UGDMQJSXSSZUKL-IHRRRGAJSA-N Pro-Ser-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O UGDMQJSXSSZUKL-IHRRRGAJSA-N 0.000 claims 7
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 claims 7
- DYJTXTCEXMCPBF-UFYCRDLUSA-N Pro-Tyr-Phe Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O DYJTXTCEXMCPBF-UFYCRDLUSA-N 0.000 claims 7
- FIDNSJUXESUDOV-JYJNAYRXSA-N Pro-Tyr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O FIDNSJUXESUDOV-JYJNAYRXSA-N 0.000 claims 7
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 claims 7
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 claims 7
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 claims 7
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 claims 7
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 claims 7
- HKHCTNFKZXAMIF-KKUMJFAQSA-N Ser-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=C(O)C=C1 HKHCTNFKZXAMIF-KKUMJFAQSA-N 0.000 claims 7
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 claims 7
- QGXCWPNQVCYJEL-NUMRIWBASA-N Thr-Asn-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGXCWPNQVCYJEL-NUMRIWBASA-N 0.000 claims 7
- VXMHQKHDKCATDV-VEVYYDQMSA-N Thr-Asp-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VXMHQKHDKCATDV-VEVYYDQMSA-N 0.000 claims 7
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 claims 7
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 claims 7
- CSZFFQBUTMGHAH-UAXMHLISSA-N Thr-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O CSZFFQBUTMGHAH-UAXMHLISSA-N 0.000 claims 7
- REJRKTOJTCPDPO-IRIUXVKKSA-N Thr-Tyr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O REJRKTOJTCPDPO-IRIUXVKKSA-N 0.000 claims 7
- XVHAUVJXBFGUPC-RPTUDFQQSA-N Thr-Tyr-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XVHAUVJXBFGUPC-RPTUDFQQSA-N 0.000 claims 7
- JRXKIVGWMMIIOF-YDHLFZDLSA-N Tyr-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JRXKIVGWMMIIOF-YDHLFZDLSA-N 0.000 claims 7
- OSMTVLSRTQDWHJ-JBACZVJFSA-N Tyr-Glu-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=C(O)C=C1 OSMTVLSRTQDWHJ-JBACZVJFSA-N 0.000 claims 7
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 claims 7
- JAGGEZACYAAMIL-CQDKDKBSSA-N Tyr-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JAGGEZACYAAMIL-CQDKDKBSSA-N 0.000 claims 7
- OBKOPLHSRDATFO-XHSDSOJGSA-N Tyr-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OBKOPLHSRDATFO-XHSDSOJGSA-N 0.000 claims 7
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 claims 7
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 claims 7
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 claims 7
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 claims 7
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 claims 7
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 claims 7
- 108010060035 arginylproline Proteins 0.000 claims 7
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 claims 7
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 claims 7
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 claims 7
- 108010090333 leucyl-lysyl-proline Proteins 0.000 claims 7
- 108010034507 methionyltryptophan Proteins 0.000 claims 7
- 108010048818 seryl-histidine Proteins 0.000 claims 7
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 claims 7
- CXURGFRDGROIKG-UHFFFAOYSA-N 3,3-bis(chloromethyl)oxetane Chemical compound ClCC1(CCl)COC1 CXURGFRDGROIKG-UHFFFAOYSA-N 0.000 claims 6
- VBDMWOKJZDCFJM-FXQIFTODSA-N Ala-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N VBDMWOKJZDCFJM-FXQIFTODSA-N 0.000 claims 6
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 claims 6
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 claims 6
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 claims 6
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 claims 6
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 claims 6
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 claims 6
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 claims 6
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 claims 6
- OCOZPTHLDVSFCZ-BPUTZDHNSA-N Arg-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N OCOZPTHLDVSFCZ-BPUTZDHNSA-N 0.000 claims 6
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 claims 6
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 claims 6
- FKQITMVNILRUCQ-IHRRRGAJSA-N Arg-Phe-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O FKQITMVNILRUCQ-IHRRRGAJSA-N 0.000 claims 6
- LXMKTIZAGIBQRX-HRCADAONSA-N Arg-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O LXMKTIZAGIBQRX-HRCADAONSA-N 0.000 claims 6
- XMGVWQWEWWULNS-BPUTZDHNSA-N Arg-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XMGVWQWEWWULNS-BPUTZDHNSA-N 0.000 claims 6
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 claims 6
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 claims 6
- PCKRJVZAQZWNKM-WHFBIAKZSA-N Asn-Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O PCKRJVZAQZWNKM-WHFBIAKZSA-N 0.000 claims 6
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 claims 6
- SUEIIIFUBHDCCS-PBCZWWQYSA-N Asn-His-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SUEIIIFUBHDCCS-PBCZWWQYSA-N 0.000 claims 6
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 claims 6
- TZFQICWZWFNIKU-KKUMJFAQSA-N Asn-Leu-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 TZFQICWZWFNIKU-KKUMJFAQSA-N 0.000 claims 6
- WXVGISRWSYGEDK-KKUMJFAQSA-N Asn-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N WXVGISRWSYGEDK-KKUMJFAQSA-N 0.000 claims 6
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 claims 6
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 claims 6
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 claims 6
- KZYSHAMXEBPJBD-JRQIVUDYSA-N Asn-Thr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZYSHAMXEBPJBD-JRQIVUDYSA-N 0.000 claims 6
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 claims 6
- GHWWTICYPDKPTE-NGZCFLSTSA-N Asn-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N GHWWTICYPDKPTE-NGZCFLSTSA-N 0.000 claims 6
- RYEWQKQXRJCHIO-SRVKXCTJSA-N Asp-Asn-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 RYEWQKQXRJCHIO-SRVKXCTJSA-N 0.000 claims 6
- SEMWSADZTMJELF-BYULHYEWSA-N Asp-Ile-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O SEMWSADZTMJELF-BYULHYEWSA-N 0.000 claims 6
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 claims 6
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 claims 6
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 claims 6
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 claims 6
- UXIYYUMGFNSGBK-XPUUQOCRSA-N Cys-Gly-Val Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O UXIYYUMGFNSGBK-XPUUQOCRSA-N 0.000 claims 6
- 108020004414 DNA Proteins 0.000 claims 6
- HUWSBFYAGXCXKC-CIUDSAMLSA-N Glu-Ala-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O HUWSBFYAGXCXKC-CIUDSAMLSA-N 0.000 claims 6
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 claims 6
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 claims 6
- JDUKCSSHWNIQQZ-IHRRRGAJSA-N Glu-Phe-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JDUKCSSHWNIQQZ-IHRRRGAJSA-N 0.000 claims 6
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 claims 6
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 claims 6
- CEXINUGNTZFNRY-BYPYZUCNSA-N Gly-Cys-Gly Chemical compound [NH3+]CC(=O)N[C@@H](CS)C(=O)NCC([O-])=O CEXINUGNTZFNRY-BYPYZUCNSA-N 0.000 claims 6
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 claims 6
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 claims 6
- NTXIJPDAHXSHNL-ONGXEEELSA-N His-Gly-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NTXIJPDAHXSHNL-ONGXEEELSA-N 0.000 claims 6
- ODPKZZLRDNXTJZ-WHOFXGATSA-N Ile-Gly-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ODPKZZLRDNXTJZ-WHOFXGATSA-N 0.000 claims 6
- UAQSZXGJGLHMNV-XEGUGMAKSA-N Ile-Gly-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N UAQSZXGJGLHMNV-XEGUGMAKSA-N 0.000 claims 6
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 claims 6
- NURNJECQNNCRBK-FLBSBUHZSA-N Ile-Thr-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NURNJECQNNCRBK-FLBSBUHZSA-N 0.000 claims 6
- DTPGSUQHUMELQB-GVARAGBVSA-N Ile-Tyr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 DTPGSUQHUMELQB-GVARAGBVSA-N 0.000 claims 6
- UYODHPPSCXBNCS-XUXIUFHCSA-N Ile-Val-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C UYODHPPSCXBNCS-XUXIUFHCSA-N 0.000 claims 6
- VKOAHIRLIUESLU-ULQDDVLXSA-N Leu-Arg-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VKOAHIRLIUESLU-ULQDDVLXSA-N 0.000 claims 6
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 claims 6
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 claims 6
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 claims 6
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 claims 6
- ARRIJPQRBWRNLT-DCAQKATOSA-N Leu-Met-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ARRIJPQRBWRNLT-DCAQKATOSA-N 0.000 claims 6
- LQUIENKUVKPNIC-ULQDDVLXSA-N Leu-Met-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LQUIENKUVKPNIC-ULQDDVLXSA-N 0.000 claims 6
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 claims 6
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 claims 6
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 claims 6
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 claims 6
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 claims 6
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 claims 6
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 claims 6
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 claims 6
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 claims 6
- YFQSSOAGMZGXFT-MEYUZBJRSA-N Lys-Thr-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YFQSSOAGMZGXFT-MEYUZBJRSA-N 0.000 claims 6
- MUYQDMBLDFEVRJ-LSJOCFKGSA-N Met-Ala-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 MUYQDMBLDFEVRJ-LSJOCFKGSA-N 0.000 claims 6
- JQEBITVYKUCBMC-SRVKXCTJSA-N Met-Arg-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JQEBITVYKUCBMC-SRVKXCTJSA-N 0.000 claims 6
- OXIWIYOJVNOKOV-SRVKXCTJSA-N Met-Met-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CCCNC(N)=N OXIWIYOJVNOKOV-SRVKXCTJSA-N 0.000 claims 6
- DSZFTPCSFVWMKP-DCAQKATOSA-N Met-Ser-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN DSZFTPCSFVWMKP-DCAQKATOSA-N 0.000 claims 6
- PNHRPOWKRRJATF-IHRRRGAJSA-N Met-Tyr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 PNHRPOWKRRJATF-IHRRRGAJSA-N 0.000 claims 6
- NTIZESTWPVYFNL-UHFFFAOYSA-N Methyl isobutyl ketone Chemical compound CC(C)CC(C)=O NTIZESTWPVYFNL-UHFFFAOYSA-N 0.000 claims 6
- AGYXCMYVTBYGCT-ULQDDVLXSA-N Phe-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O AGYXCMYVTBYGCT-ULQDDVLXSA-N 0.000 claims 6
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 claims 6
- IUVYJBMTHARMIP-PCBIJLKTSA-N Phe-Asp-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IUVYJBMTHARMIP-PCBIJLKTSA-N 0.000 claims 6
- MGECUMGTSHYHEJ-QEWYBTABSA-N Phe-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGECUMGTSHYHEJ-QEWYBTABSA-N 0.000 claims 6
- KNYPNEYICHHLQL-ACRUOGEOSA-N Phe-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 KNYPNEYICHHLQL-ACRUOGEOSA-N 0.000 claims 6
- JKJSIYKSGIDHPM-WBAXXEDZSA-N Phe-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O JKJSIYKSGIDHPM-WBAXXEDZSA-N 0.000 claims 6
- WKLMCMXFMQEKCX-SLFFLAALSA-N Phe-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O WKLMCMXFMQEKCX-SLFFLAALSA-N 0.000 claims 6
- WWPAHTZOWURIMR-ULQDDVLXSA-N Phe-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 WWPAHTZOWURIMR-ULQDDVLXSA-N 0.000 claims 6
- LTAWNJXSRUCFAN-UNQGMJICSA-N Phe-Thr-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LTAWNJXSRUCFAN-UNQGMJICSA-N 0.000 claims 6
- VIIRRNQMMIHYHQ-XHSDSOJGSA-N Phe-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N VIIRRNQMMIHYHQ-XHSDSOJGSA-N 0.000 claims 6
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 claims 6
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 claims 6
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 claims 6
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 claims 6
- IWIANZLCJVYEFX-RYUDHWBXSA-N Pro-Phe Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 IWIANZLCJVYEFX-RYUDHWBXSA-N 0.000 claims 6
- WVXQQUWOKUZIEG-VEVYYDQMSA-N Pro-Thr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O WVXQQUWOKUZIEG-VEVYYDQMSA-N 0.000 claims 6
- GBUNEGKQPSAMNK-QTKMDUPCSA-N Pro-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2)O GBUNEGKQPSAMNK-QTKMDUPCSA-N 0.000 claims 6
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 claims 6
- 108010054530 RGDN peptide Proteins 0.000 claims 6
- DWUIECHTAMYEFL-XVYDVKMFSA-N Ser-Ala-His Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DWUIECHTAMYEFL-XVYDVKMFSA-N 0.000 claims 6
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 claims 6
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 claims 6
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 claims 6
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 claims 6
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 claims 6
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 claims 6
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 claims 6
- YXEYTHXDRDAIOJ-CWRNSKLLSA-N Ser-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CO)N)C(=O)O YXEYTHXDRDAIOJ-CWRNSKLLSA-N 0.000 claims 6
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 claims 6
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 claims 6
- QNJZOAHSYPXTAB-VEVYYDQMSA-N Thr-Asn-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O QNJZOAHSYPXTAB-VEVYYDQMSA-N 0.000 claims 6
- PQLXHSACXPGWPD-GSSVUCPTSA-N Thr-Asn-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PQLXHSACXPGWPD-GSSVUCPTSA-N 0.000 claims 6
- YOSLMIPKOUAHKI-OLHMAJIHSA-N Thr-Asp-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O YOSLMIPKOUAHKI-OLHMAJIHSA-N 0.000 claims 6
- NOWXWJLVGTVJKM-PBCZWWQYSA-N Thr-Asp-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O NOWXWJLVGTVJKM-PBCZWWQYSA-N 0.000 claims 6
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 claims 6
- YUPVPKZBKCLFLT-QTKMDUPCSA-N Thr-His-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N)O YUPVPKZBKCLFLT-QTKMDUPCSA-N 0.000 claims 6
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 claims 6
- KRDSCBLRHORMRK-JXUBOQSCSA-N Thr-Lys-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O KRDSCBLRHORMRK-JXUBOQSCSA-N 0.000 claims 6
- SCSVNSNWUTYSFO-WDCWCFNPSA-N Thr-Lys-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O SCSVNSNWUTYSFO-WDCWCFNPSA-N 0.000 claims 6
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 claims 6
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 claims 6
- JAWUQFCGNVEDRN-MEYUZBJRSA-N Thr-Tyr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O JAWUQFCGNVEDRN-MEYUZBJRSA-N 0.000 claims 6
- NIWAGRRZHCMPOY-GMVOTWDCSA-N Trp-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N NIWAGRRZHCMPOY-GMVOTWDCSA-N 0.000 claims 6
- UTQBQJNSNXJNIH-IHPCNDPISA-N Trp-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N UTQBQJNSNXJNIH-IHPCNDPISA-N 0.000 claims 6
- JVTHMUDOKPQBOT-NSHDSACASA-N Trp-Gly-Gly Chemical compound C1=CC=C2C(C[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O)=CNC2=C1 JVTHMUDOKPQBOT-NSHDSACASA-N 0.000 claims 6
- WTRQBSSQBKRNKV-MNSWYVGCSA-N Trp-Thr-Tyr Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)[C@H](O)C)C(O)=O)C1=CC=C(O)C=C1 WTRQBSSQBKRNKV-MNSWYVGCSA-N 0.000 claims 6
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 claims 6
- LMKKMCGTDANZTR-BZSNNMDCSA-N Tyr-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LMKKMCGTDANZTR-BZSNNMDCSA-N 0.000 claims 6
- MNWINJDPGBNOED-ULQDDVLXSA-N Tyr-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 MNWINJDPGBNOED-ULQDDVLXSA-N 0.000 claims 6
- RGYCVIZZTUBSSG-JYJNAYRXSA-N Tyr-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O RGYCVIZZTUBSSG-JYJNAYRXSA-N 0.000 claims 6
- QFXVAFIHVWXXBJ-AVGNSLFASA-N Tyr-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O QFXVAFIHVWXXBJ-AVGNSLFASA-N 0.000 claims 6
- BIVIUZRBCAUNPW-JRQIVUDYSA-N Tyr-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O BIVIUZRBCAUNPW-JRQIVUDYSA-N 0.000 claims 6
- WMRWZYSRQUORHJ-YDHLFZDLSA-N Val-Phe-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WMRWZYSRQUORHJ-YDHLFZDLSA-N 0.000 claims 6
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 claims 6
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 claims 6
- 108010011559 alanylphenylalanine Proteins 0.000 claims 6
- 108010054813 diprotin B Proteins 0.000 claims 6
- 108010059898 glycyl-tyrosyl-lysine Proteins 0.000 claims 6
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 claims 6
- 108010056582 methionylglutamic acid Proteins 0.000 claims 6
- 108010084572 phenylalanyl-valine Proteins 0.000 claims 6
- 108010012581 phenylalanylglutamate Proteins 0.000 claims 6
- 108010053725 prolylvaline Proteins 0.000 claims 6
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 claims 6
- STACJSVFHSEZJV-GHCJXIJMSA-N Ala-Asn-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STACJSVFHSEZJV-GHCJXIJMSA-N 0.000 claims 5
- RGQCNKIDEQJEBT-CQDKDKBSSA-N Ala-Leu-Tyr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 RGQCNKIDEQJEBT-CQDKDKBSSA-N 0.000 claims 5
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 claims 5
- HYIDEIQUCBKIPL-CQDKDKBSSA-N Ala-Phe-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N HYIDEIQUCBKIPL-CQDKDKBSSA-N 0.000 claims 5
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 claims 5
- MTDDMSUUXNQMKK-BPNCWPANSA-N Ala-Tyr-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N MTDDMSUUXNQMKK-BPNCWPANSA-N 0.000 claims 5
- BGGAIXWIZCIFSG-XDTLVQLUSA-N Ala-Tyr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O BGGAIXWIZCIFSG-XDTLVQLUSA-N 0.000 claims 5
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 claims 5
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 claims 5
- DCGLNNVKIZXQOJ-FXQIFTODSA-N Arg-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DCGLNNVKIZXQOJ-FXQIFTODSA-N 0.000 claims 5
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 claims 5
- PNIGSVZJNVUVJA-BQBZGAKWSA-N Arg-Gly-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O PNIGSVZJNVUVJA-BQBZGAKWSA-N 0.000 claims 5
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 claims 5
- YLVGUOGAFAJMKP-JYJNAYRXSA-N Arg-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YLVGUOGAFAJMKP-JYJNAYRXSA-N 0.000 claims 5
- QHUOOCKNNURZSL-IHRRRGAJSA-N Arg-Tyr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O QHUOOCKNNURZSL-IHRRRGAJSA-N 0.000 claims 5
- IZSMEUDYADKZTJ-KJEVXHAQSA-N Arg-Tyr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IZSMEUDYADKZTJ-KJEVXHAQSA-N 0.000 claims 5
- QTAIIXQCOPUNBQ-QXEWZRGKSA-N Arg-Val-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QTAIIXQCOPUNBQ-QXEWZRGKSA-N 0.000 claims 5
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 claims 5
- VKCOHFFSTKCXEQ-OLHMAJIHSA-N Asn-Asn-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VKCOHFFSTKCXEQ-OLHMAJIHSA-N 0.000 claims 5
- COUZKSSMBFADSB-AVGNSLFASA-N Asn-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N COUZKSSMBFADSB-AVGNSLFASA-N 0.000 claims 5
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 claims 5
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 claims 5
- ZTRJUKDEALVRMW-SRVKXCTJSA-N Asn-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZTRJUKDEALVRMW-SRVKXCTJSA-N 0.000 claims 5
- PTSDPWIHOYMRGR-UGYAYLCHSA-N Asn-Ile-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O PTSDPWIHOYMRGR-UGYAYLCHSA-N 0.000 claims 5
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 claims 5
- NYGILGUOUOXGMJ-YUMQZZPRSA-N Asn-Lys-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O NYGILGUOUOXGMJ-YUMQZZPRSA-N 0.000 claims 5
- MVXJBVVLACEGCG-PCBIJLKTSA-N Asn-Phe-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVXJBVVLACEGCG-PCBIJLKTSA-N 0.000 claims 5
- YNQMEIJEWSHOEO-SRVKXCTJSA-N Asn-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O YNQMEIJEWSHOEO-SRVKXCTJSA-N 0.000 claims 5
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 claims 5
- JDHOJQJMWBKHDB-CIUDSAMLSA-N Asp-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N JDHOJQJMWBKHDB-CIUDSAMLSA-N 0.000 claims 5
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 claims 5
- PSLSTUMPZILTAH-BYULHYEWSA-N Asp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PSLSTUMPZILTAH-BYULHYEWSA-N 0.000 claims 5
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 claims 5
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 claims 5
- WDMNFNXKGSLIOB-GUBZILKMSA-N Asp-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N WDMNFNXKGSLIOB-GUBZILKMSA-N 0.000 claims 5
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 claims 5
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 claims 5
- MJJIHRWNWSQTOI-VEVYYDQMSA-N Asp-Thr-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MJJIHRWNWSQTOI-VEVYYDQMSA-N 0.000 claims 5
- GFMJUESGWILPEN-MELADBBJSA-N Cys-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CS)N)C(=O)O GFMJUESGWILPEN-MELADBBJSA-N 0.000 claims 5
- JUNZLDGUJZIUCO-IHRRRGAJSA-N Cys-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O JUNZLDGUJZIUCO-IHRRRGAJSA-N 0.000 claims 5
- BOMGEMDZTNZESV-QWRGUYRKSA-N Cys-Tyr-Gly Chemical compound SC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 BOMGEMDZTNZESV-QWRGUYRKSA-N 0.000 claims 5
- 101710145505 Fiber protein Proteins 0.000 claims 5
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 claims 5
- SVZIKUHLRKVZIF-GUBZILKMSA-N Glu-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N SVZIKUHLRKVZIF-GUBZILKMSA-N 0.000 claims 5
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 claims 5
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 claims 5
- XMPAXPSENRSOSV-RYUDHWBXSA-N Glu-Gly-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XMPAXPSENRSOSV-RYUDHWBXSA-N 0.000 claims 5
- CXRWMMRLEMVSEH-PEFMBERDSA-N Glu-Ile-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CXRWMMRLEMVSEH-PEFMBERDSA-N 0.000 claims 5
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 claims 5
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 claims 5
- CHDWDBPJOZVZSE-KKUMJFAQSA-N Glu-Phe-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O CHDWDBPJOZVZSE-KKUMJFAQSA-N 0.000 claims 5
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 claims 5
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 claims 5
- XUDLUKYPXQDCRX-BQBZGAKWSA-N Gly-Arg-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O XUDLUKYPXQDCRX-BQBZGAKWSA-N 0.000 claims 5
- XUORRGAFUQIMLC-STQMWFEESA-N Gly-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN)O XUORRGAFUQIMLC-STQMWFEESA-N 0.000 claims 5
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 claims 5
- FMVLWTYYODVFRG-BQBZGAKWSA-N Gly-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN FMVLWTYYODVFRG-BQBZGAKWSA-N 0.000 claims 5
- IWAXHBCACVWNHT-BQBZGAKWSA-N Gly-Asp-Arg Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IWAXHBCACVWNHT-BQBZGAKWSA-N 0.000 claims 5
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 claims 5
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 claims 5
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 claims 5
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 claims 5
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 claims 5
- GULGDABMYTYMJZ-STQMWFEESA-N Gly-Trp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O GULGDABMYTYMJZ-STQMWFEESA-N 0.000 claims 5
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 claims 5
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 claims 5
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 claims 5
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 claims 5
- CIWILNZNBPIHEU-DCAQKATOSA-N His-Arg-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O CIWILNZNBPIHEU-DCAQKATOSA-N 0.000 claims 5
- ALPXXNRQBMRCPZ-MEYUZBJRSA-N His-Thr-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ALPXXNRQBMRCPZ-MEYUZBJRSA-N 0.000 claims 5
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 claims 5
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 claims 5
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 claims 5
- NZGTYCMLUGYMCV-XUXIUFHCSA-N Ile-Lys-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N NZGTYCMLUGYMCV-XUXIUFHCSA-N 0.000 claims 5
- NNVXABCGXOLIEB-PYJNHQTQSA-N Ile-Met-His Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NNVXABCGXOLIEB-PYJNHQTQSA-N 0.000 claims 5
- AGGIYSLVUKVOPT-HTFCKZLJSA-N Ile-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N AGGIYSLVUKVOPT-HTFCKZLJSA-N 0.000 claims 5
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 claims 5
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 claims 5
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 claims 5
- XIRYQRLFHWWWTC-QEJZJMRPSA-N Leu-Ala-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XIRYQRLFHWWWTC-QEJZJMRPSA-N 0.000 claims 5
- JUWJEAPUNARGCF-DCAQKATOSA-N Leu-Arg-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JUWJEAPUNARGCF-DCAQKATOSA-N 0.000 claims 5
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 claims 5
- WXHFZJFZWNCDNB-KKUMJFAQSA-N Leu-Asn-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXHFZJFZWNCDNB-KKUMJFAQSA-N 0.000 claims 5
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 claims 5
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 claims 5
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 claims 5
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 claims 5
- DDVHDMSBLRAKNV-IHRRRGAJSA-N Leu-Met-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O DDVHDMSBLRAKNV-IHRRRGAJSA-N 0.000 claims 5
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 claims 5
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 claims 5
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 claims 5
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 claims 5
- VQHUBNVKFFLWRP-ULQDDVLXSA-N Leu-Tyr-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 VQHUBNVKFFLWRP-ULQDDVLXSA-N 0.000 claims 5
- NRQRKMYZONPCTM-CIUDSAMLSA-N Lys-Asp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O NRQRKMYZONPCTM-CIUDSAMLSA-N 0.000 claims 5
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 claims 5
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 claims 5
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 claims 5
- OHXUUQDOBQKSNB-AVGNSLFASA-N Lys-Val-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OHXUUQDOBQKSNB-AVGNSLFASA-N 0.000 claims 5
- ACYHZNZHIZWLQF-BQBZGAKWSA-N Met-Asn-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ACYHZNZHIZWLQF-BQBZGAKWSA-N 0.000 claims 5
- GODBLDDYHFTUAH-CIUDSAMLSA-N Met-Asp-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O GODBLDDYHFTUAH-CIUDSAMLSA-N 0.000 claims 5
- MVBZBRKNZVJEKK-DTWKUNHWSA-N Met-Gly-Pro Chemical compound CSCC[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N MVBZBRKNZVJEKK-DTWKUNHWSA-N 0.000 claims 5
- BMHIFARYXOJDLD-WPRPVWTQSA-N Met-Gly-Val Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O BMHIFARYXOJDLD-WPRPVWTQSA-N 0.000 claims 5
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 claims 5
- SODXFJOPSCXOHE-IHRRRGAJSA-N Met-Leu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O SODXFJOPSCXOHE-IHRRRGAJSA-N 0.000 claims 5
- VSJAPSMRFYUOKS-IUCAKERBSA-N Met-Pro-Gly Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O VSJAPSMRFYUOKS-IUCAKERBSA-N 0.000 claims 5
- WXJLBSXNUHIGSS-OSUNSFLBSA-N Met-Thr-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WXJLBSXNUHIGSS-OSUNSFLBSA-N 0.000 claims 5
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 claims 5
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 claims 5
- 101710173835 Penton protein Proteins 0.000 claims 5
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 claims 5
- FRPVPGRXUKFEQE-YDHLFZDLSA-N Phe-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FRPVPGRXUKFEQE-YDHLFZDLSA-N 0.000 claims 5
- MJQFZGOIVBDIMZ-WHOFXGATSA-N Phe-Ile-Gly Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O MJQFZGOIVBDIMZ-WHOFXGATSA-N 0.000 claims 5
- PEFJUUYFEGBXFA-BZSNNMDCSA-N Phe-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 PEFJUUYFEGBXFA-BZSNNMDCSA-N 0.000 claims 5
- SRILZRSXIKRGBF-HRCADAONSA-N Phe-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N SRILZRSXIKRGBF-HRCADAONSA-N 0.000 claims 5
- FKFCKDROTNIVSO-JYJNAYRXSA-N Phe-Pro-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O FKFCKDROTNIVSO-JYJNAYRXSA-N 0.000 claims 5
- ZYNBEWGJFXTBDU-ACRUOGEOSA-N Phe-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N ZYNBEWGJFXTBDU-ACRUOGEOSA-N 0.000 claims 5
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 claims 5
- OYEUSRAZOGIDBY-JYJNAYRXSA-N Pro-Arg-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OYEUSRAZOGIDBY-JYJNAYRXSA-N 0.000 claims 5
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 claims 5
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 claims 5
- UREQLMJCKFLLHM-NAKRPEOUSA-N Pro-Ile-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UREQLMJCKFLLHM-NAKRPEOUSA-N 0.000 claims 5
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 claims 5
- MRYUJHGPZQNOAD-IHRRRGAJSA-N Pro-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 MRYUJHGPZQNOAD-IHRRRGAJSA-N 0.000 claims 5
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 claims 5
- MHBSUKYVBZVQRW-HJWJTTGWSA-N Pro-Phe-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MHBSUKYVBZVQRW-HJWJTTGWSA-N 0.000 claims 5
- NAIPAPCKKRCMBL-JYJNAYRXSA-N Pro-Pro-Phe Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=CC=C1 NAIPAPCKKRCMBL-JYJNAYRXSA-N 0.000 claims 5
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 claims 5
- YHUBAXGAAYULJY-ULQDDVLXSA-N Pro-Tyr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O YHUBAXGAAYULJY-ULQDDVLXSA-N 0.000 claims 5
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 claims 5
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 claims 5
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 claims 5
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 claims 5
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 claims 5
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 claims 5
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 claims 5
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 claims 5
- VXYQOFXBIXKPCX-BQBZGAKWSA-N Ser-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N VXYQOFXBIXKPCX-BQBZGAKWSA-N 0.000 claims 5
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 claims 5
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 claims 5
- RTXKJFWHEBTABY-IHPCNDPISA-N Ser-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)[C@H](CO)N RTXKJFWHEBTABY-IHPCNDPISA-N 0.000 claims 5
- 241001217859 Simian adenovirus 1 Species 0.000 claims 5
- 241000745906 Simian adenovirus 25 Species 0.000 claims 5
- JMZKMSTYXHFYAK-VEVYYDQMSA-N Thr-Arg-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O JMZKMSTYXHFYAK-VEVYYDQMSA-N 0.000 claims 5
- YLXAMFZYJTZXFH-OLHMAJIHSA-N Thr-Asn-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YLXAMFZYJTZXFH-OLHMAJIHSA-N 0.000 claims 5
- JTEICXDKGWKRRV-HJGDQZAQSA-N Thr-Asn-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JTEICXDKGWKRRV-HJGDQZAQSA-N 0.000 claims 5
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 claims 5
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 claims 5
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 claims 5
- ODXKUIGEPAGKKV-KATARQTJSA-N Thr-Leu-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N)O ODXKUIGEPAGKKV-KATARQTJSA-N 0.000 claims 5
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 claims 5
- OHDXOXIZXSFCDN-RCWTZXSCSA-N Thr-Met-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OHDXOXIZXSFCDN-RCWTZXSCSA-N 0.000 claims 5
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 claims 5
- NYQIZWROIMIQSL-VEVYYDQMSA-N Thr-Pro-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O NYQIZWROIMIQSL-VEVYYDQMSA-N 0.000 claims 5
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 claims 5
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 claims 5
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 claims 5
- PELIQFPESHBTMA-WLTAIBSBSA-N Thr-Tyr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 PELIQFPESHBTMA-WLTAIBSBSA-N 0.000 claims 5
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 claims 5
- ABRICLFKFRFDKS-IHPCNDPISA-N Trp-Ser-Tyr Chemical compound C([C@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 ABRICLFKFRFDKS-IHPCNDPISA-N 0.000 claims 5
- MNMYOSZWCKYEDI-JRQIVUDYSA-N Tyr-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MNMYOSZWCKYEDI-JRQIVUDYSA-N 0.000 claims 5
- BVDHHLMIZFCAAU-BZSNNMDCSA-N Tyr-Cys-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BVDHHLMIZFCAAU-BZSNNMDCSA-N 0.000 claims 5
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 claims 5
- HSBZWINKRYZCSQ-KKUMJFAQSA-N Tyr-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O HSBZWINKRYZCSQ-KKUMJFAQSA-N 0.000 claims 5
- SOAUMCDLIUGXJJ-SRVKXCTJSA-N Tyr-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O SOAUMCDLIUGXJJ-SRVKXCTJSA-N 0.000 claims 5
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 claims 5
- AOIZTZRWMSPPAY-KAOXEZKKSA-N Tyr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O AOIZTZRWMSPPAY-KAOXEZKKSA-N 0.000 claims 5
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 claims 5
- NWDOPHYLSORNEX-QXEWZRGKSA-N Val-Asn-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N NWDOPHYLSORNEX-QXEWZRGKSA-N 0.000 claims 5
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 claims 5
- XQVRMLRMTAGSFJ-QXEWZRGKSA-N Val-Asp-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XQVRMLRMTAGSFJ-QXEWZRGKSA-N 0.000 claims 5
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 claims 5
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 claims 5
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 claims 5
- HPANGHISDXDUQY-ULQDDVLXSA-N Val-Lys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HPANGHISDXDUQY-ULQDDVLXSA-N 0.000 claims 5
- MJFSRZZJQWZHFQ-SRVKXCTJSA-N Val-Met-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N MJFSRZZJQWZHFQ-SRVKXCTJSA-N 0.000 claims 5
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 claims 5
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 claims 5
- YLBNZCJFSVJDRJ-KJEVXHAQSA-N Val-Thr-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O YLBNZCJFSVJDRJ-KJEVXHAQSA-N 0.000 claims 5
- 108010086434 alanyl-seryl-glycine Proteins 0.000 claims 5
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 claims 5
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 claims 5
- 108010078274 isoleucylvaline Proteins 0.000 claims 5
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 claims 5
- 108010016686 methionyl-alanyl-serine Proteins 0.000 claims 5
- 108010068488 methionylphenylalanine Proteins 0.000 claims 5
- 108010073101 phenylalanylleucine Proteins 0.000 claims 5
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 claims 5
- 108010078580 tyrosylleucine Proteins 0.000 claims 5
- 108010009962 valyltyrosine Proteins 0.000 claims 5
- GJLXVWOMRRWCIB-MERZOTPQSA-N (2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-acetamido-5-(diaminomethylideneamino)pentanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanamide Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(N)=O)C1=CC=C(O)C=C1 GJLXVWOMRRWCIB-MERZOTPQSA-N 0.000 claims 4
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 claims 4
- PJNSIUPOXFBHDM-GUBZILKMSA-N Ala-Arg-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O PJNSIUPOXFBHDM-GUBZILKMSA-N 0.000 claims 4
- LBJYAILUMSUTAM-ZLUOBGJFSA-N Ala-Asn-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LBJYAILUMSUTAM-ZLUOBGJFSA-N 0.000 claims 4
- SHYYAQLDNVHPFT-DLOVCJGASA-N Ala-Asn-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SHYYAQLDNVHPFT-DLOVCJGASA-N 0.000 claims 4
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 claims 4
- IVKWMMGFLAMMKJ-XVYDVKMFSA-N Ala-His-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N IVKWMMGFLAMMKJ-XVYDVKMFSA-N 0.000 claims 4
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 claims 4
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 claims 4
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 claims 4
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 claims 4
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 claims 4
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 claims 4
- DWYROCSXOOMOEU-CIUDSAMLSA-N Ala-Met-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DWYROCSXOOMOEU-CIUDSAMLSA-N 0.000 claims 4
- CQJHFKKGZXKZBC-BPNCWPANSA-N Ala-Pro-Tyr Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CQJHFKKGZXKZBC-BPNCWPANSA-N 0.000 claims 4
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 claims 4
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 claims 4
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 claims 4
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 claims 4
- PGNNQOJOEGFAOR-KWQFWETISA-N Ala-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 PGNNQOJOEGFAOR-KWQFWETISA-N 0.000 claims 4
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 claims 4
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 claims 4
- WESHVRNMNFMVBE-FXQIFTODSA-N Arg-Asn-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)CN=C(N)N WESHVRNMNFMVBE-FXQIFTODSA-N 0.000 claims 4
- AWMAZIIEFPFHCP-RCWTZXSCSA-N Arg-Pro-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWMAZIIEFPFHCP-RCWTZXSCSA-N 0.000 claims 4
- BECXEHHOZNFFFX-IHRRRGAJSA-N Arg-Ser-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BECXEHHOZNFFFX-IHRRRGAJSA-N 0.000 claims 4
- PJOPLXOCKACMLK-KKUMJFAQSA-N Arg-Tyr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O PJOPLXOCKACMLK-KKUMJFAQSA-N 0.000 claims 4
- CGWVCWFQGXOUSJ-ULQDDVLXSA-N Arg-Tyr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O CGWVCWFQGXOUSJ-ULQDDVLXSA-N 0.000 claims 4
- WVCJSDCHTUTONA-FXQIFTODSA-N Asn-Asp-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WVCJSDCHTUTONA-FXQIFTODSA-N 0.000 claims 4
- XVVOVPFMILMHPX-ZLUOBGJFSA-N Asn-Asp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XVVOVPFMILMHPX-ZLUOBGJFSA-N 0.000 claims 4
- ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 claims 4
- KLKHFFMNGWULBN-VKHMYHEASA-N Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)NCC(O)=O KLKHFFMNGWULBN-VKHMYHEASA-N 0.000 claims 4
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 claims 4
- FBODFHMLALOPHP-GUBZILKMSA-N Asn-Lys-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O FBODFHMLALOPHP-GUBZILKMSA-N 0.000 claims 4
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 claims 4
- KSGAFDTYQPKUAP-GMOBBJLQSA-N Asn-Met-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KSGAFDTYQPKUAP-GMOBBJLQSA-N 0.000 claims 4
- RAUPFUCUDBQYHE-AVGNSLFASA-N Asn-Phe-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RAUPFUCUDBQYHE-AVGNSLFASA-N 0.000 claims 4
- HZZIFFOVHLWGCS-KKUMJFAQSA-N Asn-Phe-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O HZZIFFOVHLWGCS-KKUMJFAQSA-N 0.000 claims 4
- YXVAESUIQFDBHN-SRVKXCTJSA-N Asn-Phe-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O YXVAESUIQFDBHN-SRVKXCTJSA-N 0.000 claims 4
- RBOBTTLFPRSXKZ-BZSNNMDCSA-N Asn-Phe-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RBOBTTLFPRSXKZ-BZSNNMDCSA-N 0.000 claims 4
- VWADICJNCPFKJS-ZLUOBGJFSA-N Asn-Ser-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O VWADICJNCPFKJS-ZLUOBGJFSA-N 0.000 claims 4
- NCXTYSVDWLAQGZ-ZKWXMUAHSA-N Asn-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O NCXTYSVDWLAQGZ-ZKWXMUAHSA-N 0.000 claims 4
- IPPFAOCLQSGHJV-WFBYXXMGSA-N Asn-Trp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O IPPFAOCLQSGHJV-WFBYXXMGSA-N 0.000 claims 4
- KDFQZBWWPYQBEN-ZLUOBGJFSA-N Asp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N KDFQZBWWPYQBEN-ZLUOBGJFSA-N 0.000 claims 4
- BLQBMRNMBAYREH-UWJYBYFXSA-N Asp-Ala-Tyr Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O BLQBMRNMBAYREH-UWJYBYFXSA-N 0.000 claims 4
- FAEIQWHBRBWUBN-FXQIFTODSA-N Asp-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N FAEIQWHBRBWUBN-FXQIFTODSA-N 0.000 claims 4
- UQBGYPFHWFZMCD-ZLUOBGJFSA-N Asp-Asn-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UQBGYPFHWFZMCD-ZLUOBGJFSA-N 0.000 claims 4
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 claims 4
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 claims 4
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 claims 4
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 claims 4
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 claims 4
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 claims 4
- KFAFUJMGHVVYRC-DCAQKATOSA-N Asp-Leu-Met Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O KFAFUJMGHVVYRC-DCAQKATOSA-N 0.000 claims 4
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 claims 4
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 claims 4
- SARSTIZOZFBDOM-FXQIFTODSA-N Asp-Met-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O SARSTIZOZFBDOM-FXQIFTODSA-N 0.000 claims 4
- LKVKODXGSAFOFY-VEVYYDQMSA-N Asp-Met-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LKVKODXGSAFOFY-VEVYYDQMSA-N 0.000 claims 4
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 claims 4
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 claims 4
- IWLZBRTUIVXZJD-OLHMAJIHSA-N Asp-Thr-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O IWLZBRTUIVXZJD-OLHMAJIHSA-N 0.000 claims 4
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 claims 4
- LXAUHIRMWXQRKI-XHNCKOQMSA-N Glu-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O LXAUHIRMWXQRKI-XHNCKOQMSA-N 0.000 claims 4
- NTBDVNJIWCKURJ-ACZMJKKPSA-N Glu-Asp-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NTBDVNJIWCKURJ-ACZMJKKPSA-N 0.000 claims 4
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 claims 4
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 claims 4
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 claims 4
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 claims 4
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 claims 4
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 claims 4
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 claims 4
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 claims 4
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 claims 4
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 claims 4
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 claims 4
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 claims 4
- MXIULRKNFSCJHT-STQMWFEESA-N Gly-Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 MXIULRKNFSCJHT-STQMWFEESA-N 0.000 claims 4
- OOCFXNOVSLSHAB-IUCAKERBSA-N Gly-Pro-Pro Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OOCFXNOVSLSHAB-IUCAKERBSA-N 0.000 claims 4
- GLACUWHUYFBSPJ-FJXKBIBVSA-N Gly-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GLACUWHUYFBSPJ-FJXKBIBVSA-N 0.000 claims 4
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 claims 4
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 claims 4
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 claims 4
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 claims 4
- PNUFMLXHOLFRLD-KBPBESRZSA-N Gly-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 PNUFMLXHOLFRLD-KBPBESRZSA-N 0.000 claims 4
- BDFCIKANUNMFGB-PMVVWTBXSA-N His-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 BDFCIKANUNMFGB-PMVVWTBXSA-N 0.000 claims 4
- IWXMHXYOACDSIA-PYJNHQTQSA-N His-Ile-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O IWXMHXYOACDSIA-PYJNHQTQSA-N 0.000 claims 4
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 claims 4
- CWJQMCPYXNVMBS-STECZYCISA-N Ile-Arg-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N CWJQMCPYXNVMBS-STECZYCISA-N 0.000 claims 4
- YPQDTQJBOFOTJQ-SXTJYALSSA-N Ile-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N YPQDTQJBOFOTJQ-SXTJYALSSA-N 0.000 claims 4
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 claims 4
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 claims 4
- DBXXASNNDTXOLU-MXAVVETBSA-N Ile-Leu-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DBXXASNNDTXOLU-MXAVVETBSA-N 0.000 claims 4
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 claims 4
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 claims 4
- UOPBQSJRBONRON-STECZYCISA-N Ile-Met-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UOPBQSJRBONRON-STECZYCISA-N 0.000 claims 4
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 claims 4
- WXLYNEHOGRYNFU-URLPEUOOSA-N Ile-Thr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N WXLYNEHOGRYNFU-URLPEUOOSA-N 0.000 claims 4
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 claims 4
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 claims 4
- SUPVSFFZWVOEOI-CQDKDKBSSA-N Leu-Ala-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-CQDKDKBSSA-N 0.000 claims 4
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 claims 4
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 claims 4
- RIMMMMYKGIBOSN-DCAQKATOSA-N Leu-Asn-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O RIMMMMYKGIBOSN-DCAQKATOSA-N 0.000 claims 4
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 claims 4
- XVSJMWYYLHPDKY-DCAQKATOSA-N Leu-Asp-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O XVSJMWYYLHPDKY-DCAQKATOSA-N 0.000 claims 4
- PPBKJAQJAUHZKX-SRVKXCTJSA-N Leu-Cys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(C)C PPBKJAQJAUHZKX-SRVKXCTJSA-N 0.000 claims 4
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 claims 4
- JRJLGNFWYFSJHB-HOCLYGCPSA-N Leu-Gly-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JRJLGNFWYFSJHB-HOCLYGCPSA-N 0.000 claims 4
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 claims 4
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 claims 4
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 claims 4
- BGGTYDNTOYRTTR-MEYUZBJRSA-N Leu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(C)C)N)O BGGTYDNTOYRTTR-MEYUZBJRSA-N 0.000 claims 4
- RVOMPSJXSRPFJT-DCAQKATOSA-N Lys-Ala-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVOMPSJXSRPFJT-DCAQKATOSA-N 0.000 claims 4
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 claims 4
- FLCMXEFCTLXBTL-DCAQKATOSA-N Lys-Asp-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N FLCMXEFCTLXBTL-DCAQKATOSA-N 0.000 claims 4
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 claims 4
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 claims 4
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 claims 4
- MTBLFIQZECOEBY-IHRRRGAJSA-N Lys-Met-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O MTBLFIQZECOEBY-IHRRRGAJSA-N 0.000 claims 4
- XFOAWKDQMRMCDN-ULQDDVLXSA-N Lys-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)CC1=CC=CC=C1 XFOAWKDQMRMCDN-ULQDDVLXSA-N 0.000 claims 4
- TVHCDSBMFQYPNA-RHYQMDGZSA-N Lys-Thr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TVHCDSBMFQYPNA-RHYQMDGZSA-N 0.000 claims 4
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 claims 4
- ZMYHJISLFYTQGK-FXQIFTODSA-N Met-Asp-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMYHJISLFYTQGK-FXQIFTODSA-N 0.000 claims 4
- DBXMFHGGHMXYHY-DCAQKATOSA-N Met-Leu-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O DBXMFHGGHMXYHY-DCAQKATOSA-N 0.000 claims 4
- FDGAMQVRGORBDV-GUBZILKMSA-N Met-Ser-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCSC FDGAMQVRGORBDV-GUBZILKMSA-N 0.000 claims 4
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 claims 4
- 108010002311 N-glycylglutamic acid Proteins 0.000 claims 4
- JVTMTFMMMHAPCR-UBHSHLNASA-N Phe-Ala-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JVTMTFMMMHAPCR-UBHSHLNASA-N 0.000 claims 4
- ULECEJGNDHWSKD-QEJZJMRPSA-N Phe-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 ULECEJGNDHWSKD-QEJZJMRPSA-N 0.000 claims 4
- JNRFYJZCMHHGMH-UBHSHLNASA-N Phe-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JNRFYJZCMHHGMH-UBHSHLNASA-N 0.000 claims 4
- VHWOBXIWBDWZHK-IHRRRGAJSA-N Phe-Arg-Asp Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 VHWOBXIWBDWZHK-IHRRRGAJSA-N 0.000 claims 4
- YYRCPTVAPLQRNC-ULQDDVLXSA-N Phe-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CC1=CC=CC=C1 YYRCPTVAPLQRNC-ULQDDVLXSA-N 0.000 claims 4
- OXUMFAOVGFODPN-KKUMJFAQSA-N Phe-Asn-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N OXUMFAOVGFODPN-KKUMJFAQSA-N 0.000 claims 4
- SWZKMTDPQXLQRD-XVSYOHENSA-N Phe-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWZKMTDPQXLQRD-XVSYOHENSA-N 0.000 claims 4
- BEEVXUYVEHXWRQ-YESZJQIVSA-N Phe-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O BEEVXUYVEHXWRQ-YESZJQIVSA-N 0.000 claims 4
- KAJLHCWRWDSROH-BZSNNMDCSA-N Phe-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 KAJLHCWRWDSROH-BZSNNMDCSA-N 0.000 claims 4
- QARPMYDMYVLFMW-KKUMJFAQSA-N Phe-Pro-Glu Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 QARPMYDMYVLFMW-KKUMJFAQSA-N 0.000 claims 4
- ZLAKUZDMKVKFAI-JYJNAYRXSA-N Phe-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O ZLAKUZDMKVKFAI-JYJNAYRXSA-N 0.000 claims 4
- GKRCCTYAGQPMMP-IHRRRGAJSA-N Phe-Ser-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O GKRCCTYAGQPMMP-IHRRRGAJSA-N 0.000 claims 4
- GLJZDMZJHFXJQG-BZSNNMDCSA-N Phe-Ser-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLJZDMZJHFXJQG-BZSNNMDCSA-N 0.000 claims 4
- BPIMVBKDLSBKIJ-FCLVOEFKSA-N Phe-Thr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BPIMVBKDLSBKIJ-FCLVOEFKSA-N 0.000 claims 4
- GTMSCDVFQLNEOY-BZSNNMDCSA-N Phe-Tyr-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N GTMSCDVFQLNEOY-BZSNNMDCSA-N 0.000 claims 4
- AHXPYZRZRMQOAU-QXEWZRGKSA-N Pro-Asn-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1)C(O)=O AHXPYZRZRMQOAU-QXEWZRGKSA-N 0.000 claims 4
- GDXZRWYXJSGWIV-GMOBBJLQSA-N Pro-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 GDXZRWYXJSGWIV-GMOBBJLQSA-N 0.000 claims 4
- XUSDDSLCRPUKLP-QXEWZRGKSA-N Pro-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 XUSDDSLCRPUKLP-QXEWZRGKSA-N 0.000 claims 4
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 claims 4
- GNADVDLLGVSXLS-ULQDDVLXSA-N Pro-Phe-His Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O GNADVDLLGVSXLS-ULQDDVLXSA-N 0.000 claims 4
- JLMZKEQFMVORMA-SRVKXCTJSA-N Pro-Pro-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 JLMZKEQFMVORMA-SRVKXCTJSA-N 0.000 claims 4
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 claims 4
- HZWAHWQZPSXNCB-BPUTZDHNSA-N Ser-Arg-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O HZWAHWQZPSXNCB-BPUTZDHNSA-N 0.000 claims 4
- LQESNKGTTNHZPZ-GHCJXIJMSA-N Ser-Ile-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O LQESNKGTTNHZPZ-GHCJXIJMSA-N 0.000 claims 4
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 claims 4
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 claims 4
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 claims 4
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 claims 4
- NQZFFLBPNDLTPO-DLOVCJGASA-N Ser-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CO)N NQZFFLBPNDLTPO-DLOVCJGASA-N 0.000 claims 4
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 claims 4
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 claims 4
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 claims 4
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 claims 4
- NERYDXBVARJIQS-JYBASQMISA-N Ser-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N)O NERYDXBVARJIQS-JYBASQMISA-N 0.000 claims 4
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 claims 4
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 claims 4
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 claims 4
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 claims 4
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 claims 4
- ZUUDNCOCILSYAM-KKHAAJSZSA-N Thr-Asp-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZUUDNCOCILSYAM-KKHAAJSZSA-N 0.000 claims 4
- LYGKYFKSZTUXGZ-ZDLURKLDSA-N Thr-Cys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)NCC(O)=O LYGKYFKSZTUXGZ-ZDLURKLDSA-N 0.000 claims 4
- CQNFRKAKGDSJFR-NUMRIWBASA-N Thr-Glu-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CQNFRKAKGDSJFR-NUMRIWBASA-N 0.000 claims 4
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 claims 4
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 claims 4
- IGGFFPOIFHZYKC-PBCZWWQYSA-N Thr-His-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O IGGFFPOIFHZYKC-PBCZWWQYSA-N 0.000 claims 4
- CRZNCABIJLRFKZ-IUKAMOBKSA-N Thr-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N CRZNCABIJLRFKZ-IUKAMOBKSA-N 0.000 claims 4
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 claims 4
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 claims 4
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 claims 4
- HPQHHRLWSAMMKG-KATARQTJSA-N Thr-Lys-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N)O HPQHHRLWSAMMKG-KATARQTJSA-N 0.000 claims 4
- QOLYAJSZHIJCTO-VQVTYTSYSA-N Thr-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(O)=O QOLYAJSZHIJCTO-VQVTYTSYSA-N 0.000 claims 4
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 claims 4
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 claims 4
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 claims 4
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 claims 4
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 claims 4
- WMIUTJPFHMMUGY-ZFWWWQNUSA-N Trp-Pro-Gly Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)NCC(=O)O WMIUTJPFHMMUGY-ZFWWWQNUSA-N 0.000 claims 4
- GBEAUNVBIMLWIB-IHPCNDPISA-N Trp-Ser-Phe Chemical compound C([C@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=CC=C1 GBEAUNVBIMLWIB-IHPCNDPISA-N 0.000 claims 4
- BURPTJBFWIOHEY-UWJYBYFXSA-N Tyr-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BURPTJBFWIOHEY-UWJYBYFXSA-N 0.000 claims 4
- NSTPFWRAIDTNGH-BZSNNMDCSA-N Tyr-Asn-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NSTPFWRAIDTNGH-BZSNNMDCSA-N 0.000 claims 4
- SLCSPPCQWUHPPO-JYJNAYRXSA-N Tyr-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SLCSPPCQWUHPPO-JYJNAYRXSA-N 0.000 claims 4
- USYGMBIIUDLYHJ-GVARAGBVSA-N Tyr-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 USYGMBIIUDLYHJ-GVARAGBVSA-N 0.000 claims 4
- NXRGXTBPMOGFID-CFMVVWHZSA-N Tyr-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O NXRGXTBPMOGFID-CFMVVWHZSA-N 0.000 claims 4
- BXPOOVDVGWEXDU-WZLNRYEVSA-N Tyr-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXPOOVDVGWEXDU-WZLNRYEVSA-N 0.000 claims 4
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 claims 4
- QFHRUCJIRVILCK-YJRXYDGGSA-N Tyr-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O QFHRUCJIRVILCK-YJRXYDGGSA-N 0.000 claims 4
- LVFZXRQQQDTBQH-IRIUXVKKSA-N Tyr-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LVFZXRQQQDTBQH-IRIUXVKKSA-N 0.000 claims 4
- GAKBTSMAPGLQFA-JNPHEJMOSA-N Tyr-Thr-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 GAKBTSMAPGLQFA-JNPHEJMOSA-N 0.000 claims 4
- YKBUNNNRNZZUID-UFYCRDLUSA-N Tyr-Val-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YKBUNNNRNZZUID-UFYCRDLUSA-N 0.000 claims 4
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 claims 4
- LIQJSDDOULTANC-QSFUFRPTSA-N Val-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LIQJSDDOULTANC-QSFUFRPTSA-N 0.000 claims 4
- IQQYYFPCWKWUHW-YDHLFZDLSA-N Val-Asn-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N IQQYYFPCWKWUHW-YDHLFZDLSA-N 0.000 claims 4
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 claims 4
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 claims 4
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 claims 4
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 claims 4
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 claims 4
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 claims 4
- BZDGLJPROOOUOZ-XGEHTFHBSA-N Val-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N)O BZDGLJPROOOUOZ-XGEHTFHBSA-N 0.000 claims 4
- BGTDGENDNWGMDQ-KJEVXHAQSA-N Val-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N)O BGTDGENDNWGMDQ-KJEVXHAQSA-N 0.000 claims 4
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 claims 4
- 108010070783 alanyltyrosine Proteins 0.000 claims 4
- 108010060199 cysteinylproline Proteins 0.000 claims 4
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 claims 4
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 claims 4
- 108010077515 glycylproline Proteins 0.000 claims 4
- 108010040030 histidinoalanine Proteins 0.000 claims 4
- 108010028295 histidylhistidine Proteins 0.000 claims 4
- 108010053037 kyotorphin Proteins 0.000 claims 4
- 108010047079 phenylalanyl-leucyl-arginyl-phenylalanine Proteins 0.000 claims 4
- 108010018625 phenylalanylarginine Proteins 0.000 claims 4
- 108010038745 tryptophylglycine Proteins 0.000 claims 4
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 claims 3
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 claims 3
- NJPMYXWVWQWCSR-ACZMJKKPSA-N Ala-Glu-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NJPMYXWVWQWCSR-ACZMJKKPSA-N 0.000 claims 3
- YEVZMOUUZINZCK-LKTVYLICSA-N Ala-Glu-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O YEVZMOUUZINZCK-LKTVYLICSA-N 0.000 claims 3
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 claims 3
- JDIQCVUDDFENPU-ZKWXMUAHSA-N Ala-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CNC=N1 JDIQCVUDDFENPU-ZKWXMUAHSA-N 0.000 claims 3
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 claims 3
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 claims 3
- LBYMZCVBOKYZNS-CIUDSAMLSA-N Ala-Leu-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O LBYMZCVBOKYZNS-CIUDSAMLSA-N 0.000 claims 3
- VGMNWQOPSFBBBG-XUXIUFHCSA-N Ala-Leu-Leu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VGMNWQOPSFBBBG-XUXIUFHCSA-N 0.000 claims 3
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 claims 3
- CHFFHQUVXHEGBY-GARJFASQSA-N Ala-Lys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CHFFHQUVXHEGBY-GARJFASQSA-N 0.000 claims 3
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 claims 3
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 claims 3
- XSTZMVAYYCJTNR-DCAQKATOSA-N Ala-Met-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XSTZMVAYYCJTNR-DCAQKATOSA-N 0.000 claims 3
- DEWWPUNXRNGMQN-LPEHRKFASA-N Ala-Met-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N DEWWPUNXRNGMQN-LPEHRKFASA-N 0.000 claims 3
- JAQNUEWEJWBVAY-WBAXXEDZSA-N Ala-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 JAQNUEWEJWBVAY-WBAXXEDZSA-N 0.000 claims 3
- GMGWOTQMUKYZIE-UBHSHLNASA-N Ala-Pro-Phe Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GMGWOTQMUKYZIE-UBHSHLNASA-N 0.000 claims 3
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 claims 3
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 claims 3
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 claims 3
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 claims 3
- IIABBYGHLYWVOS-FXQIFTODSA-N Arg-Asn-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O IIABBYGHLYWVOS-FXQIFTODSA-N 0.000 claims 3
- XLWSGICNBZGYTA-CIUDSAMLSA-N Arg-Glu-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XLWSGICNBZGYTA-CIUDSAMLSA-N 0.000 claims 3
- FFEUXEAKYRCACT-PEDHHIEDSA-N Arg-Ile-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)CC)C(O)=O FFEUXEAKYRCACT-PEDHHIEDSA-N 0.000 claims 3
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 claims 3
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 claims 3
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 claims 3
- RIQBRKVTFBWEDY-RHYQMDGZSA-N Arg-Lys-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RIQBRKVTFBWEDY-RHYQMDGZSA-N 0.000 claims 3
- WCZXPVPHUMYLMS-VEVYYDQMSA-N Arg-Thr-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O WCZXPVPHUMYLMS-VEVYYDQMSA-N 0.000 claims 3
- HRCIIMCTUIAKQB-XGEHTFHBSA-N Arg-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O HRCIIMCTUIAKQB-XGEHTFHBSA-N 0.000 claims 3
- MOGMYRUNTKYZFB-UNQGMJICSA-N Arg-Thr-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MOGMYRUNTKYZFB-UNQGMJICSA-N 0.000 claims 3
- BWMMKQPATDUYKB-IHRRRGAJSA-N Arg-Tyr-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=C(O)C=C1 BWMMKQPATDUYKB-IHRRRGAJSA-N 0.000 claims 3
- LFWOQHSQNCKXRU-UFYCRDLUSA-N Arg-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 LFWOQHSQNCKXRU-UFYCRDLUSA-N 0.000 claims 3
- QCTOLCVIGRLMQS-HRCADAONSA-N Arg-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O QCTOLCVIGRLMQS-HRCADAONSA-N 0.000 claims 3
- XMZZGVGKGXRIGJ-JYJNAYRXSA-N Arg-Tyr-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O XMZZGVGKGXRIGJ-JYJNAYRXSA-N 0.000 claims 3
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 claims 3
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 claims 3
- POOCJCRBHHMAOS-FXQIFTODSA-N Asn-Arg-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O POOCJCRBHHMAOS-FXQIFTODSA-N 0.000 claims 3
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 claims 3
- KXEGPPNPXOKKHK-ZLUOBGJFSA-N Asn-Asp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KXEGPPNPXOKKHK-ZLUOBGJFSA-N 0.000 claims 3
- JZRLLSOWDYUKOK-SRVKXCTJSA-N Asn-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N JZRLLSOWDYUKOK-SRVKXCTJSA-N 0.000 claims 3
- OLGCWMNDJTWQAG-GUBZILKMSA-N Asn-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(N)=O OLGCWMNDJTWQAG-GUBZILKMSA-N 0.000 claims 3
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 claims 3
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 claims 3
- VXLBDJWTONZHJN-YUMQZZPRSA-N Asn-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N VXLBDJWTONZHJN-YUMQZZPRSA-N 0.000 claims 3
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 claims 3
- GIQCDTKOIPUDSG-GARJFASQSA-N Asn-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N)C(=O)O GIQCDTKOIPUDSG-GARJFASQSA-N 0.000 claims 3
- WCRQQIPFSXFIRN-LPEHRKFASA-N Asn-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N WCRQQIPFSXFIRN-LPEHRKFASA-N 0.000 claims 3
- NJSNXIOKBHPFMB-GMOBBJLQSA-N Asn-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)N)N NJSNXIOKBHPFMB-GMOBBJLQSA-N 0.000 claims 3
- VCJCPARXDBEGNE-GUBZILKMSA-N Asn-Pro-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 VCJCPARXDBEGNE-GUBZILKMSA-N 0.000 claims 3
- IDUUACUJKUXKKD-VEVYYDQMSA-N Asn-Pro-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O IDUUACUJKUXKKD-VEVYYDQMSA-N 0.000 claims 3
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 claims 3
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 claims 3
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 claims 3
- QYRMBFWDSFGSFC-OLHMAJIHSA-N Asn-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QYRMBFWDSFGSFC-OLHMAJIHSA-N 0.000 claims 3
- GOPFMQJUQDLUFW-LKXGYXEUSA-N Asn-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O GOPFMQJUQDLUFW-LKXGYXEUSA-N 0.000 claims 3
- FMNBYVSGRCXWEK-FOHZUACHSA-N Asn-Thr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O FMNBYVSGRCXWEK-FOHZUACHSA-N 0.000 claims 3
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 claims 3
- DXHINQUXBZNUCF-MELADBBJSA-N Asn-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O DXHINQUXBZNUCF-MELADBBJSA-N 0.000 claims 3
- MJIJBEYEHBKTIM-BYULHYEWSA-N Asn-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MJIJBEYEHBKTIM-BYULHYEWSA-N 0.000 claims 3
- JZLFYAAGGYMRIK-BYULHYEWSA-N Asn-Val-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O JZLFYAAGGYMRIK-BYULHYEWSA-N 0.000 claims 3
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 claims 3
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 claims 3
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 claims 3
- ZLGKHJHFYSRUBH-FXQIFTODSA-N Asp-Arg-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLGKHJHFYSRUBH-FXQIFTODSA-N 0.000 claims 3
- AXXCUABIFZPKPM-BQBZGAKWSA-N Asp-Arg-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O AXXCUABIFZPKPM-BQBZGAKWSA-N 0.000 claims 3
- CNKAZIGBGQIHLL-GUBZILKMSA-N Asp-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N CNKAZIGBGQIHLL-GUBZILKMSA-N 0.000 claims 3
- YNQIDCRRTWGHJD-ZLUOBGJFSA-N Asp-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(O)=O YNQIDCRRTWGHJD-ZLUOBGJFSA-N 0.000 claims 3
- FRYULLIZUDQONW-IMJSIDKUSA-N Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O FRYULLIZUDQONW-IMJSIDKUSA-N 0.000 claims 3
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 claims 3
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 claims 3
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 claims 3
- ICZWAZVKLACMKR-CIUDSAMLSA-N Asp-His-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CN=CN1 ICZWAZVKLACMKR-CIUDSAMLSA-N 0.000 claims 3
- TZOZNVLBTAFJRW-UGYAYLCHSA-N Asp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N TZOZNVLBTAFJRW-UGYAYLCHSA-N 0.000 claims 3
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 claims 3
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 claims 3
- BKOIIURTQAJHAT-GUBZILKMSA-N Asp-Pro-Pro Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 BKOIIURTQAJHAT-GUBZILKMSA-N 0.000 claims 3
- GGRSYTUJHAZTFN-IHRRRGAJSA-N Asp-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O GGRSYTUJHAZTFN-IHRRRGAJSA-N 0.000 claims 3
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 claims 3
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 claims 3
- VNXQRBXEQXLERQ-CIUDSAMLSA-N Asp-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N VNXQRBXEQXLERQ-CIUDSAMLSA-N 0.000 claims 3
- GXHDGYOXPNQCKM-XVSYOHENSA-N Asp-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GXHDGYOXPNQCKM-XVSYOHENSA-N 0.000 claims 3
- BJDHEININLSZOT-KKUMJFAQSA-N Asp-Tyr-Lys Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(O)=O BJDHEININLSZOT-KKUMJFAQSA-N 0.000 claims 3
- PLOKOIJSGCISHE-BYULHYEWSA-N Asp-Val-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLOKOIJSGCISHE-BYULHYEWSA-N 0.000 claims 3
- TVYMKYUSZSVOAG-ZLUOBGJFSA-N Cys-Ala-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O TVYMKYUSZSVOAG-ZLUOBGJFSA-N 0.000 claims 3
- KBKGRMNVKPSQIF-XDTLVQLUSA-N Glu-Ala-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KBKGRMNVKPSQIF-XDTLVQLUSA-N 0.000 claims 3
- TUTIHHSZKFBMHM-WHFBIAKZSA-N Glu-Asn Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(O)=O TUTIHHSZKFBMHM-WHFBIAKZSA-N 0.000 claims 3
- VAZZOGXDUQSVQF-NUMRIWBASA-N Glu-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)O VAZZOGXDUQSVQF-NUMRIWBASA-N 0.000 claims 3
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 claims 3
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 claims 3
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 claims 3
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 claims 3
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 claims 3
- AQNYKMCFCCZEEL-JYJNAYRXSA-N Glu-Lys-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AQNYKMCFCCZEEL-JYJNAYRXSA-N 0.000 claims 3
- YBTCBQBIJKGSJP-BQBZGAKWSA-N Glu-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O YBTCBQBIJKGSJP-BQBZGAKWSA-N 0.000 claims 3
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 claims 3
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 claims 3
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 claims 3
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 claims 3
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 claims 3
- VHPVBPCCWVDGJL-IRIUXVKKSA-N Glu-Thr-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VHPVBPCCWVDGJL-IRIUXVKKSA-N 0.000 claims 3
- RZMXBFUSQNLEQF-QEJZJMRPSA-N Glu-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N RZMXBFUSQNLEQF-QEJZJMRPSA-N 0.000 claims 3
- BPCLDCNZBUYGOD-BPUTZDHNSA-N Glu-Trp-Glu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 BPCLDCNZBUYGOD-BPUTZDHNSA-N 0.000 claims 3
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 claims 3
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 claims 3
- FKJQNJCQTKUBCD-XPUUQOCRSA-N Gly-Ala-His Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O FKJQNJCQTKUBCD-XPUUQOCRSA-N 0.000 claims 3
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 claims 3
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 claims 3
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 claims 3
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 claims 3
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 claims 3
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 claims 3
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 claims 3
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 claims 3
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 claims 3
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 claims 3
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 claims 3
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 claims 3
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 claims 3
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 claims 3
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 claims 3
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 claims 3
- XHVONGZZVUUORG-WEDXCCLWSA-N Gly-Thr-Lys Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN XHVONGZZVUUORG-WEDXCCLWSA-N 0.000 claims 3
- GWNIGUKSRJBIHX-STQMWFEESA-N Gly-Tyr-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)CN)O GWNIGUKSRJBIHX-STQMWFEESA-N 0.000 claims 3
- 101710094396 Hexon protein Proteins 0.000 claims 3
- VSLXGYMEHVAJBH-DLOVCJGASA-N His-Ala-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O VSLXGYMEHVAJBH-DLOVCJGASA-N 0.000 claims 3
- ZJSMFRTVYSLKQU-DJFWLOJKSA-N His-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ZJSMFRTVYSLKQU-DJFWLOJKSA-N 0.000 claims 3
- OEROYDLRVAYIMQ-YUMQZZPRSA-N His-Gly-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O OEROYDLRVAYIMQ-YUMQZZPRSA-N 0.000 claims 3
- WTJBVCUCLWFGAH-JUKXBJQTSA-N His-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N WTJBVCUCLWFGAH-JUKXBJQTSA-N 0.000 claims 3
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 claims 3
- BZAQOPHNBFOOJS-DCAQKATOSA-N His-Pro-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O BZAQOPHNBFOOJS-DCAQKATOSA-N 0.000 claims 3
- LQSBBHNVAVNZSX-GHCJXIJMSA-N Ile-Ala-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LQSBBHNVAVNZSX-GHCJXIJMSA-N 0.000 claims 3
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 claims 3
- PDTMWFVVNZYWTR-NHCYSSNCSA-N Ile-Gly-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O PDTMWFVVNZYWTR-NHCYSSNCSA-N 0.000 claims 3
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 claims 3
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 claims 3
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 claims 3
- QSXSHZIRKTUXNG-STECZYCISA-N Ile-Val-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QSXSHZIRKTUXNG-STECZYCISA-N 0.000 claims 3
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 claims 3
- NHHKSOGJYNQENP-SRVKXCTJSA-N Leu-Cys-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N NHHKSOGJYNQENP-SRVKXCTJSA-N 0.000 claims 3
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 claims 3
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 claims 3
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 claims 3
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 claims 3
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 claims 3
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 claims 3
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 claims 3
- RIHIGSWBLHSGLV-CQDKDKBSSA-N Leu-Tyr-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O RIHIGSWBLHSGLV-CQDKDKBSSA-N 0.000 claims 3
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 claims 3
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 claims 3
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 claims 3
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 claims 3
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 claims 3
- PHHYNOUOUWYQRO-XIRDDKMYSA-N Lys-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N PHHYNOUOUWYQRO-XIRDDKMYSA-N 0.000 claims 3
- NTBFKPBULZGXQL-KKUMJFAQSA-N Lys-Asp-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTBFKPBULZGXQL-KKUMJFAQSA-N 0.000 claims 3
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 claims 3
- ODUQLUADRKMHOZ-JYJNAYRXSA-N Lys-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)O ODUQLUADRKMHOZ-JYJNAYRXSA-N 0.000 claims 3
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 claims 3
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 claims 3
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 claims 3
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 claims 3
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 claims 3
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 claims 3
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 claims 3
- UDXSLGLHFUBRRM-OEAJRASXSA-N Lys-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCCCN)N)O UDXSLGLHFUBRRM-OEAJRASXSA-N 0.000 claims 3
- JCVOHUKUYSYBAD-DCAQKATOSA-N Lys-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCCN)N)C(=O)N[C@@H](CS)C(=O)O JCVOHUKUYSYBAD-DCAQKATOSA-N 0.000 claims 3
- QBHGXFQJFPWJIH-XUXIUFHCSA-N Lys-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN QBHGXFQJFPWJIH-XUXIUFHCSA-N 0.000 claims 3
- LOGFVTREOLYCPF-RHYQMDGZSA-N Lys-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-RHYQMDGZSA-N 0.000 claims 3
- MIROMRNASYKZNL-ULQDDVLXSA-N Lys-Pro-Tyr Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MIROMRNASYKZNL-ULQDDVLXSA-N 0.000 claims 3
- BDFHWFUAQLIMJO-KXNHARMFSA-N Lys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N)O BDFHWFUAQLIMJO-KXNHARMFSA-N 0.000 claims 3
- RQILLQOQXLZTCK-KBPBESRZSA-N Lys-Tyr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O RQILLQOQXLZTCK-KBPBESRZSA-N 0.000 claims 3
- PPNCMJARTHYNEC-MEYUZBJRSA-N Lys-Tyr-Thr Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)CC1=CC=C(O)C=C1 PPNCMJARTHYNEC-MEYUZBJRSA-N 0.000 claims 3
- LRALLISKBZNSKN-BQBZGAKWSA-N Met-Gly-Ser Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LRALLISKBZNSKN-BQBZGAKWSA-N 0.000 claims 3
- IRVONVRHHJXWTK-RWMBFGLXSA-N Met-Lys-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N IRVONVRHHJXWTK-RWMBFGLXSA-N 0.000 claims 3
- DPUOLKQSMYLRDR-UBHSHLNASA-N Phe-Arg-Ala Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 DPUOLKQSMYLRDR-UBHSHLNASA-N 0.000 claims 3
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 claims 3
- HXSUFWQYLPKEHF-IHRRRGAJSA-N Phe-Asn-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HXSUFWQYLPKEHF-IHRRRGAJSA-N 0.000 claims 3
- OXKJSGGTHFMGDT-UFYCRDLUSA-N Phe-Phe-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)C1=CC=CC=C1 OXKJSGGTHFMGDT-UFYCRDLUSA-N 0.000 claims 3
- XNQMZHLAYFWSGJ-HTUGSXCWSA-N Phe-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XNQMZHLAYFWSGJ-HTUGSXCWSA-N 0.000 claims 3
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 claims 3
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 claims 3
- XWYXZPHPYKRYPA-GMOBBJLQSA-N Pro-Asn-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XWYXZPHPYKRYPA-GMOBBJLQSA-N 0.000 claims 3
- FUVBEZJCRMHWEM-FXQIFTODSA-N Pro-Asn-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FUVBEZJCRMHWEM-FXQIFTODSA-N 0.000 claims 3
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 claims 3
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 claims 3
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 claims 3
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 claims 3
- APIAILHCTSBGLU-JYJNAYRXSA-N Pro-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@@H]2CCCN2 APIAILHCTSBGLU-JYJNAYRXSA-N 0.000 claims 3
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 claims 3
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 claims 3
- AJJDPGVVNPUZCR-RHYQMDGZSA-N Pro-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1)O AJJDPGVVNPUZCR-RHYQMDGZSA-N 0.000 claims 3
- CWZUFLWPEFHWEI-IHRRRGAJSA-N Pro-Tyr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O CWZUFLWPEFHWEI-IHRRRGAJSA-N 0.000 claims 3
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 claims 3
- PGSWNLRYYONGPE-JYJNAYRXSA-N Pro-Val-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PGSWNLRYYONGPE-JYJNAYRXSA-N 0.000 claims 3
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 claims 3
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 claims 3
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 claims 3
- COAHUSQNSVFYBW-FXQIFTODSA-N Ser-Asn-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O COAHUSQNSVFYBW-FXQIFTODSA-N 0.000 claims 3
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 claims 3
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 claims 3
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 claims 3
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 claims 3
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 claims 3
- IAORETPTUDBBGV-CIUDSAMLSA-N Ser-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N IAORETPTUDBBGV-CIUDSAMLSA-N 0.000 claims 3
- JJUNLJTUIKFPRF-BPUTZDHNSA-N Ser-Met-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CO)N JJUNLJTUIKFPRF-BPUTZDHNSA-N 0.000 claims 3
- HJAXVYLCKDPPDF-SRVKXCTJSA-N Ser-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N HJAXVYLCKDPPDF-SRVKXCTJSA-N 0.000 claims 3
- JLKWJWPDXPKKHI-FXQIFTODSA-N Ser-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC(=O)N)C(=O)O JLKWJWPDXPKKHI-FXQIFTODSA-N 0.000 claims 3
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 claims 3
- LDEBVRIURYMKQS-WISUUJSJSA-N Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CO LDEBVRIURYMKQS-WISUUJSJSA-N 0.000 claims 3
- PQEQXWRVHQAAKS-SRVKXCTJSA-N Ser-Tyr-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=C(O)C=C1 PQEQXWRVHQAAKS-SRVKXCTJSA-N 0.000 claims 3
- RCOUFINCYASMDN-GUBZILKMSA-N Ser-Val-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O RCOUFINCYASMDN-GUBZILKMSA-N 0.000 claims 3
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 claims 3
- 241000991297 Simian adenovirus 6 Species 0.000 claims 3
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 claims 3
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 claims 3
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 claims 3
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 claims 3
- UNURFMVMXLENAZ-KJEVXHAQSA-N Thr-Arg-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UNURFMVMXLENAZ-KJEVXHAQSA-N 0.000 claims 3
- LXWZOMSOUAMOIA-JIOCBJNQSA-N Thr-Asn-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O LXWZOMSOUAMOIA-JIOCBJNQSA-N 0.000 claims 3
- MFEBUIFJVPNZLO-OLHMAJIHSA-N Thr-Asp-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MFEBUIFJVPNZLO-OLHMAJIHSA-N 0.000 claims 3
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 claims 3
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 claims 3
- KWQBJOUOSNJDRR-XAVMHZPKSA-N Thr-Cys-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N)O KWQBJOUOSNJDRR-XAVMHZPKSA-N 0.000 claims 3
- AYCQVUUPIJHJTA-IXOXFDKPSA-N Thr-His-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O AYCQVUUPIJHJTA-IXOXFDKPSA-N 0.000 claims 3
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 claims 3
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 claims 3
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 claims 3
- BDGBHYCAZJPLHX-HJGDQZAQSA-N Thr-Lys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BDGBHYCAZJPLHX-HJGDQZAQSA-N 0.000 claims 3
- DCRHJDRLCFMEBI-RHYQMDGZSA-N Thr-Lys-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O DCRHJDRLCFMEBI-RHYQMDGZSA-N 0.000 claims 3
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 claims 3
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 claims 3
- DSGIVWSDDRDJIO-ZXXMMSQZSA-N Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DSGIVWSDDRDJIO-ZXXMMSQZSA-N 0.000 claims 3
- CSNBWOJOEOPYIJ-UVOCVTCTSA-N Thr-Thr-Lys Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O CSNBWOJOEOPYIJ-UVOCVTCTSA-N 0.000 claims 3
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 claims 3
- NJGMALCNYAMYCB-JRQIVUDYSA-N Thr-Tyr-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJGMALCNYAMYCB-JRQIVUDYSA-N 0.000 claims 3
- BZTSQFWJNJYZSX-JRQIVUDYSA-N Thr-Tyr-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O BZTSQFWJNJYZSX-JRQIVUDYSA-N 0.000 claims 3
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 claims 3
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 claims 3
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 claims 3
- PNKDNKGMEHJTJQ-BPUTZDHNSA-N Trp-Arg-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N PNKDNKGMEHJTJQ-BPUTZDHNSA-N 0.000 claims 3
- WACMTVIJWRNVSO-CWRNSKLLSA-N Trp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O WACMTVIJWRNVSO-CWRNSKLLSA-N 0.000 claims 3
- PWIQCLSQVQBOQV-AAEUAGOBSA-N Trp-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 PWIQCLSQVQBOQV-AAEUAGOBSA-N 0.000 claims 3
- FEZASNVQLJQBHW-CABZTGNLSA-N Trp-Gly-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O)=CNC2=C1 FEZASNVQLJQBHW-CABZTGNLSA-N 0.000 claims 3
- LFGHEUIUSIRJAE-TUSQITKMSA-N Trp-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)N LFGHEUIUSIRJAE-TUSQITKMSA-N 0.000 claims 3
- GEGYPBOPIGNZIF-CWRNSKLLSA-N Trp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O GEGYPBOPIGNZIF-CWRNSKLLSA-N 0.000 claims 3
- SEXRBCGSZRCIPE-LYSGOOTNSA-N Trp-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O SEXRBCGSZRCIPE-LYSGOOTNSA-N 0.000 claims 3
- HTGJDTPQYFMKNC-VFAJRCTISA-N Trp-Thr-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)[C@@H](C)O)=CNC2=C1 HTGJDTPQYFMKNC-VFAJRCTISA-N 0.000 claims 3
- UPUNWAXSLPBMRK-XTWBLICNSA-N Trp-Thr-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UPUNWAXSLPBMRK-XTWBLICNSA-N 0.000 claims 3
- DVLHKUWLNKDINO-PMVMPFDFSA-N Trp-Tyr-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DVLHKUWLNKDINO-PMVMPFDFSA-N 0.000 claims 3
- NOXKHHXSHQFSGJ-FQPOAREZSA-N Tyr-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NOXKHHXSHQFSGJ-FQPOAREZSA-N 0.000 claims 3
- MICSYKFECRFCTJ-IHRRRGAJSA-N Tyr-Arg-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O MICSYKFECRFCTJ-IHRRRGAJSA-N 0.000 claims 3
- MTEQZJFSEMXXRK-CFMVVWHZSA-N Tyr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N MTEQZJFSEMXXRK-CFMVVWHZSA-N 0.000 claims 3
- GAYLGYUVTDMLKC-UWJYBYFXSA-N Tyr-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GAYLGYUVTDMLKC-UWJYBYFXSA-N 0.000 claims 3
- NLMXVDDEQFKQQU-CFMVVWHZSA-N Tyr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NLMXVDDEQFKQQU-CFMVVWHZSA-N 0.000 claims 3
- ZRPLVTZTKPPSBT-AVGNSLFASA-N Tyr-Glu-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZRPLVTZTKPPSBT-AVGNSLFASA-N 0.000 claims 3
- GGXUDPQWAWRINY-XEGUGMAKSA-N Tyr-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GGXUDPQWAWRINY-XEGUGMAKSA-N 0.000 claims 3
- GITNQBVCEQBDQC-KKUMJFAQSA-N Tyr-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O GITNQBVCEQBDQC-KKUMJFAQSA-N 0.000 claims 3
- KGSDLCMCDFETHU-YESZJQIVSA-N Tyr-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O KGSDLCMCDFETHU-YESZJQIVSA-N 0.000 claims 3
- CNNVVEPJTFOGHI-ACRUOGEOSA-N Tyr-Lys-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNNVVEPJTFOGHI-ACRUOGEOSA-N 0.000 claims 3
- KYPMKDGKAYQCHO-RYUDHWBXSA-N Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 KYPMKDGKAYQCHO-RYUDHWBXSA-N 0.000 claims 3
- FGVFBDZSGQTYQX-UFYCRDLUSA-N Tyr-Phe-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O FGVFBDZSGQTYQX-UFYCRDLUSA-N 0.000 claims 3
- XYBNMHRFAUKPAW-IHRRRGAJSA-N Tyr-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XYBNMHRFAUKPAW-IHRRRGAJSA-N 0.000 claims 3
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 claims 3
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 claims 3
- UBTBGUDNDFZLGP-SRVKXCTJSA-N Val-Arg-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UBTBGUDNDFZLGP-SRVKXCTJSA-N 0.000 claims 3
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 claims 3
- IDKGBVZGNTYYCC-QXEWZRGKSA-N Val-Asn-Pro Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(O)=O IDKGBVZGNTYYCC-QXEWZRGKSA-N 0.000 claims 3
- PVPAOIGJYHVWBT-KKHAAJSZSA-N Val-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N)O PVPAOIGJYHVWBT-KKHAAJSZSA-N 0.000 claims 3
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 claims 3
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 claims 3
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 claims 3
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 claims 3
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 claims 3
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 claims 3
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 claims 3
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 claims 3
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 claims 3
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 claims 3
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 claims 3
- SDHZOOIGIUEPDY-JYJNAYRXSA-N Val-Ser-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 SDHZOOIGIUEPDY-JYJNAYRXSA-N 0.000 claims 3
- USXYVSTVPHELAF-RCWTZXSCSA-N Val-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N)O USXYVSTVPHELAF-RCWTZXSCSA-N 0.000 claims 3
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 claims 3
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 claims 3
- 150000001413 amino acids Chemical group 0.000 claims 3
- 210000000234 capsid Anatomy 0.000 claims 3
- 239000012634 fragment Substances 0.000 claims 3
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 claims 3
- 108010055341 glutamyl-glutamic acid Proteins 0.000 claims 3
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 claims 3
- 108010082286 glycyl-seryl-alanine Proteins 0.000 claims 3
- 108010084389 glycyltryptophan Proteins 0.000 claims 3
- STKYPAFSDFAEPH-LURJTMIESA-N glycylvaline Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CN STKYPAFSDFAEPH-LURJTMIESA-N 0.000 claims 3
- 108010044374 isoleucyl-tyrosine Proteins 0.000 claims 3
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 claims 3
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 claims 3
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 claims 3
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 claims 3
- 108010029384 tryptophyl-histidine Proteins 0.000 claims 3
- 108010084932 tryptophyl-proline Proteins 0.000 claims 3
- YSGSDAIMSCVPHG-UHFFFAOYSA-N valyl-methionine Chemical compound CSCCC(C(O)=O)NC(=O)C(N)C(C)C YSGSDAIMSCVPHG-UHFFFAOYSA-N 0.000 claims 3
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 claims 2
- PIDRBUDUWHBYSR-UHFFFAOYSA-N 1-[2-[[2-[(2-amino-4-methylpentanoyl)amino]-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O PIDRBUDUWHBYSR-UHFFFAOYSA-N 0.000 claims 2
- 241000844498 Agatea Species 0.000 claims 2
- PIPTUBPKYFRLCP-NHCYSSNCSA-N Ala-Ala-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PIPTUBPKYFRLCP-NHCYSSNCSA-N 0.000 claims 2
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 claims 2
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 claims 2
- DWINFPQUSSHSFS-UVBJJODRSA-N Ala-Arg-Trp Chemical compound N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C12)C(=O)O DWINFPQUSSHSFS-UVBJJODRSA-N 0.000 claims 2
- CCUAQNUWXLYFRA-IMJSIDKUSA-N Ala-Asn Chemical compound C[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CC(N)=O CCUAQNUWXLYFRA-IMJSIDKUSA-N 0.000 claims 2
- PXKLCFFSVLKOJM-ACZMJKKPSA-N Ala-Asn-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXKLCFFSVLKOJM-ACZMJKKPSA-N 0.000 claims 2
- JYEBJTDTPNKQJG-FXQIFTODSA-N Ala-Asn-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N JYEBJTDTPNKQJG-FXQIFTODSA-N 0.000 claims 2
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 claims 2
- VIGKUFXFTPWYER-BIIVOSGPSA-N Ala-Cys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N VIGKUFXFTPWYER-BIIVOSGPSA-N 0.000 claims 2
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 claims 2
- VBRDBGCROKWTPV-XHNCKOQMSA-N Ala-Glu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N VBRDBGCROKWTPV-XHNCKOQMSA-N 0.000 claims 2
- NIZKGBJVCMRDKO-KWQFWETISA-N Ala-Gly-Tyr Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NIZKGBJVCMRDKO-KWQFWETISA-N 0.000 claims 2
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 claims 2
- CFPQUJZTLUQUTJ-HTFCKZLJSA-N Ala-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H](C)N CFPQUJZTLUQUTJ-HTFCKZLJSA-N 0.000 claims 2
- QQACQIHVWCVBBR-GVARAGBVSA-N Ala-Ile-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QQACQIHVWCVBBR-GVARAGBVSA-N 0.000 claims 2
- QPBSRMDNJOTFAL-AICCOOGYSA-N Ala-Leu-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QPBSRMDNJOTFAL-AICCOOGYSA-N 0.000 claims 2
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 claims 2
- OMFMCIVBKCEMAK-CYDGBPFRSA-N Ala-Leu-Val-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O OMFMCIVBKCEMAK-CYDGBPFRSA-N 0.000 claims 2
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 claims 2
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 claims 2
- CUOMGDPDITUMIJ-HZZBMVKVSA-N Ala-Phe-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 CUOMGDPDITUMIJ-HZZBMVKVSA-N 0.000 claims 2
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 claims 2
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 claims 2
- BHTBAVZSZCQZPT-GUBZILKMSA-N Ala-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N BHTBAVZSZCQZPT-GUBZILKMSA-N 0.000 claims 2
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 claims 2
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 claims 2
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 claims 2
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 claims 2
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 claims 2
- LTTLSZVJTDSACD-OWLDWWDNSA-N Ala-Thr-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O LTTLSZVJTDSACD-OWLDWWDNSA-N 0.000 claims 2
- CREYEAPXISDKSB-FQPOAREZSA-N Ala-Thr-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CREYEAPXISDKSB-FQPOAREZSA-N 0.000 claims 2
- ZDILXFDENZVOTL-BPNCWPANSA-N Ala-Val-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZDILXFDENZVOTL-BPNCWPANSA-N 0.000 claims 2
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 claims 2
- XPSGESXVBSQZPL-SRVKXCTJSA-N Arg-Arg-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XPSGESXVBSQZPL-SRVKXCTJSA-N 0.000 claims 2
- GHNDBBVSWOWYII-LPEHRKFASA-N Arg-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GHNDBBVSWOWYII-LPEHRKFASA-N 0.000 claims 2
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 claims 2
- OHYQKYUTLIPFOX-ZPFDUUQYSA-N Arg-Glu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OHYQKYUTLIPFOX-ZPFDUUQYSA-N 0.000 claims 2
- NXDXECQFKHXHAM-HJGDQZAQSA-N Arg-Glu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NXDXECQFKHXHAM-HJGDQZAQSA-N 0.000 claims 2
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 claims 2
- IYMAXBFPHPZYIK-BQBZGAKWSA-N Arg-Gly-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IYMAXBFPHPZYIK-BQBZGAKWSA-N 0.000 claims 2
- FLYANDHDFRGGTM-PYJNHQTQSA-N Arg-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FLYANDHDFRGGTM-PYJNHQTQSA-N 0.000 claims 2
- OGSQONVYSTZIJB-WDSOQIARSA-N Arg-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O OGSQONVYSTZIJB-WDSOQIARSA-N 0.000 claims 2
- HNJNAMGZQZPSRE-GUBZILKMSA-N Arg-Pro-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O HNJNAMGZQZPSRE-GUBZILKMSA-N 0.000 claims 2
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 claims 2
- JPAWCMXVNZPJLO-IHRRRGAJSA-N Arg-Ser-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JPAWCMXVNZPJLO-IHRRRGAJSA-N 0.000 claims 2
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 claims 2
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 claims 2
- JWCCFNZJIRZUCL-AVGNSLFASA-N Arg-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N JWCCFNZJIRZUCL-AVGNSLFASA-N 0.000 claims 2
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 claims 2
- QQEWINYJRFBLNN-DLOVCJGASA-N Asn-Ala-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QQEWINYJRFBLNN-DLOVCJGASA-N 0.000 claims 2
- NUHQMYUWLUSRJX-BIIVOSGPSA-N Asn-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N NUHQMYUWLUSRJX-BIIVOSGPSA-N 0.000 claims 2
- VDCIPFYVCICPEC-FXQIFTODSA-N Asn-Arg-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O VDCIPFYVCICPEC-FXQIFTODSA-N 0.000 claims 2
- MFFOYNGMOYFPBD-DCAQKATOSA-N Asn-Arg-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MFFOYNGMOYFPBD-DCAQKATOSA-N 0.000 claims 2
- APHUDFFMXFYRKP-CIUDSAMLSA-N Asn-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N APHUDFFMXFYRKP-CIUDSAMLSA-N 0.000 claims 2
- NVGWESORMHFISY-SRVKXCTJSA-N Asn-Asn-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NVGWESORMHFISY-SRVKXCTJSA-N 0.000 claims 2
- BVLIJXXSXBUGEC-SRVKXCTJSA-N Asn-Asn-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVLIJXXSXBUGEC-SRVKXCTJSA-N 0.000 claims 2
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 claims 2
- PAXHINASXXXILC-SRVKXCTJSA-N Asn-Asp-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)O PAXHINASXXXILC-SRVKXCTJSA-N 0.000 claims 2
- QRHYAUYXBVVDSB-LKXGYXEUSA-N Asn-Cys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QRHYAUYXBVVDSB-LKXGYXEUSA-N 0.000 claims 2
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 claims 2
- BKDDABUWNKGZCK-XHNCKOQMSA-N Asn-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O BKDDABUWNKGZCK-XHNCKOQMSA-N 0.000 claims 2
- UBKOVSLDWIHYSY-ACZMJKKPSA-N Asn-Glu-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UBKOVSLDWIHYSY-ACZMJKKPSA-N 0.000 claims 2
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 claims 2
- ACKNRKFVYUVWAC-ZPFDUUQYSA-N Asn-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ACKNRKFVYUVWAC-ZPFDUUQYSA-N 0.000 claims 2
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 claims 2
- MYCSPQIARXTUTP-SRVKXCTJSA-N Asn-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N MYCSPQIARXTUTP-SRVKXCTJSA-N 0.000 claims 2
- NUCUBYIUPVYGPP-XIRDDKMYSA-N Asn-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CC(N)=O)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O NUCUBYIUPVYGPP-XIRDDKMYSA-N 0.000 claims 2
- KHCNTVRVAYCPQE-CIUDSAMLSA-N Asn-Lys-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O KHCNTVRVAYCPQE-CIUDSAMLSA-N 0.000 claims 2
- MDDXKBHIMYYJLW-FXQIFTODSA-N Asn-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N MDDXKBHIMYYJLW-FXQIFTODSA-N 0.000 claims 2
- RLHANKIRBONJBK-IHRRRGAJSA-N Asn-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N RLHANKIRBONJBK-IHRRRGAJSA-N 0.000 claims 2
- OMSMPWHEGLNQOD-UWVGGRQHSA-N Asn-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OMSMPWHEGLNQOD-UWVGGRQHSA-N 0.000 claims 2
- CDGHMJJJHYKMPA-DLOVCJGASA-N Asn-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC(=O)N)N CDGHMJJJHYKMPA-DLOVCJGASA-N 0.000 claims 2
- LSJQOMAZIKQMTJ-SRVKXCTJSA-N Asn-Phe-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LSJQOMAZIKQMTJ-SRVKXCTJSA-N 0.000 claims 2
- FTNRWCPWDWRPAV-BZSNNMDCSA-N Asn-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CC(N)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FTNRWCPWDWRPAV-BZSNNMDCSA-N 0.000 claims 2
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 claims 2
- GMUOCGCDOYYWPD-FXQIFTODSA-N Asn-Pro-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O GMUOCGCDOYYWPD-FXQIFTODSA-N 0.000 claims 2
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 claims 2
- MYTHOBCLNIOFBL-SRVKXCTJSA-N Asn-Ser-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYTHOBCLNIOFBL-SRVKXCTJSA-N 0.000 claims 2
- HCZQKHSRYHCPSD-IUKAMOBKSA-N Asn-Thr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HCZQKHSRYHCPSD-IUKAMOBKSA-N 0.000 claims 2
- XCBKBPRFACFFOO-AQZXSJQPSA-N Asn-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O XCBKBPRFACFFOO-AQZXSJQPSA-N 0.000 claims 2
- QNNBHTFDFFFHGC-KKUMJFAQSA-N Asn-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QNNBHTFDFFFHGC-KKUMJFAQSA-N 0.000 claims 2
- XEGZSHSPQNDNRH-JRQIVUDYSA-N Asn-Tyr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XEGZSHSPQNDNRH-JRQIVUDYSA-N 0.000 claims 2
- LRCIOEVFVGXZKB-BZSNNMDCSA-N Asn-Tyr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LRCIOEVFVGXZKB-BZSNNMDCSA-N 0.000 claims 2
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 claims 2
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 claims 2
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 claims 2
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 claims 2
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 claims 2
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 claims 2
- ZEDBMCPXPIYJLW-XHNCKOQMSA-N Asp-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O ZEDBMCPXPIYJLW-XHNCKOQMSA-N 0.000 claims 2
- RRKCPMGSRIDLNC-AVGNSLFASA-N Asp-Glu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RRKCPMGSRIDLNC-AVGNSLFASA-N 0.000 claims 2
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 claims 2
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 claims 2
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 claims 2
- ILQCHXURSRRIRY-YUMQZZPRSA-N Asp-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)O)N ILQCHXURSRRIRY-YUMQZZPRSA-N 0.000 claims 2
- GBSUGIXJAAKZOW-GMOBBJLQSA-N Asp-Ile-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GBSUGIXJAAKZOW-GMOBBJLQSA-N 0.000 claims 2
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 claims 2
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 claims 2
- WQSXAPPYLGNMQL-IHRRRGAJSA-N Asp-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N WQSXAPPYLGNMQL-IHRRRGAJSA-N 0.000 claims 2
- LIJXJYGRSRWLCJ-IHRRRGAJSA-N Asp-Phe-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LIJXJYGRSRWLCJ-IHRRRGAJSA-N 0.000 claims 2
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 claims 2
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 claims 2
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 claims 2
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 claims 2
- XYPJXLLXNSAWHZ-SRVKXCTJSA-N Asp-Ser-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XYPJXLLXNSAWHZ-SRVKXCTJSA-N 0.000 claims 2
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 claims 2
- XAPPCWUWHNWCPQ-PBCZWWQYSA-N Asp-Thr-His Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O XAPPCWUWHNWCPQ-PBCZWWQYSA-N 0.000 claims 2
- NAAAPCLFJPURAM-HJGDQZAQSA-N Asp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O NAAAPCLFJPURAM-HJGDQZAQSA-N 0.000 claims 2
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 claims 2
- WOKXEQLPBLLWHC-IHRRRGAJSA-N Asp-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 WOKXEQLPBLLWHC-IHRRRGAJSA-N 0.000 claims 2
- CPMKYMGGYUFOHS-FSPLSTOPSA-N Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(O)=O CPMKYMGGYUFOHS-FSPLSTOPSA-N 0.000 claims 2
- XWKPSMRPIKKDDU-RCOVLWMOSA-N Asp-Val-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O XWKPSMRPIKKDDU-RCOVLWMOSA-N 0.000 claims 2
- GGBQDSHTXKQSLP-NHCYSSNCSA-N Asp-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N GGBQDSHTXKQSLP-NHCYSSNCSA-N 0.000 claims 2
- QDFBJJABJKOLTD-FXQIFTODSA-N Cys-Asn-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QDFBJJABJKOLTD-FXQIFTODSA-N 0.000 claims 2
- FWYBFUDWUUFLDN-FXQIFTODSA-N Cys-Asp-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N FWYBFUDWUUFLDN-FXQIFTODSA-N 0.000 claims 2
- XZKJEOMFLDVXJG-KATARQTJSA-N Cys-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)N)O XZKJEOMFLDVXJG-KATARQTJSA-N 0.000 claims 2
- MKMKILWCRQLDFJ-DCAQKATOSA-N Cys-Lys-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MKMKILWCRQLDFJ-DCAQKATOSA-N 0.000 claims 2
- YXPNKXFOBHRUBL-BJDJZHNGSA-N Cys-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N YXPNKXFOBHRUBL-BJDJZHNGSA-N 0.000 claims 2
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 claims 2
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 claims 2
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 claims 2
- HJIFPJUEOGZWRI-GUBZILKMSA-N Glu-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N HJIFPJUEOGZWRI-GUBZILKMSA-N 0.000 claims 2
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 claims 2
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 claims 2
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 claims 2
- GRHXUHCFENOCOS-ZPFDUUQYSA-N Glu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N GRHXUHCFENOCOS-ZPFDUUQYSA-N 0.000 claims 2
- YBAFDPFAUTYYRW-YUMQZZPRSA-N Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O YBAFDPFAUTYYRW-YUMQZZPRSA-N 0.000 claims 2
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 claims 2
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 claims 2
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 claims 2
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 claims 2
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 claims 2
- LHIPZASLKPYDPI-AVGNSLFASA-N Glu-Phe-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LHIPZASLKPYDPI-AVGNSLFASA-N 0.000 claims 2
- DCBSZJJHOTXMHY-DCAQKATOSA-N Glu-Pro-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DCBSZJJHOTXMHY-DCAQKATOSA-N 0.000 claims 2
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 claims 2
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 claims 2
- DDXZHOHEABQXSE-NKIYYHGXSA-N Glu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O DDXZHOHEABQXSE-NKIYYHGXSA-N 0.000 claims 2
- STDOKNKEXOLSII-SZMVWBNQSA-N Glu-Trp-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)NC(=O)[C@H](CCC(=O)O)N STDOKNKEXOLSII-SZMVWBNQSA-N 0.000 claims 2
- MIWJDJAMMKHUAR-ZVZYQTTQSA-N Glu-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N MIWJDJAMMKHUAR-ZVZYQTTQSA-N 0.000 claims 2
- HHSKZJZWQFPSKN-AVGNSLFASA-N Glu-Tyr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O HHSKZJZWQFPSKN-AVGNSLFASA-N 0.000 claims 2
- QLNKFGTZOBVMCS-JBACZVJFSA-N Glu-Tyr-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O QLNKFGTZOBVMCS-JBACZVJFSA-N 0.000 claims 2
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 claims 2
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 claims 2
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 claims 2
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 claims 2
- CLODWIOAKCSBAN-BQBZGAKWSA-N Gly-Arg-Asp Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O CLODWIOAKCSBAN-BQBZGAKWSA-N 0.000 claims 2
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 claims 2
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 claims 2
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 claims 2
- RPLLQZBOVIVGMX-QWRGUYRKSA-N Gly-Asp-Phe Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RPLLQZBOVIVGMX-QWRGUYRKSA-N 0.000 claims 2
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 claims 2
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 claims 2
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 claims 2
- QPCVIQJVRGXUSA-LURJTMIESA-N Gly-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QPCVIQJVRGXUSA-LURJTMIESA-N 0.000 claims 2
- KGVHCTWYMPWEGN-FSPLSTOPSA-N Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CN KGVHCTWYMPWEGN-FSPLSTOPSA-N 0.000 claims 2
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 claims 2
- DKEXFJVMVGETOO-LURJTMIESA-N Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CN DKEXFJVMVGETOO-LURJTMIESA-N 0.000 claims 2
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 claims 2
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 claims 2
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 claims 2
- BXDLTKLPPKBVEL-FJXKBIBVSA-N Gly-Thr-Met Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O BXDLTKLPPKBVEL-FJXKBIBVSA-N 0.000 claims 2
- YJDALMUYJIENAG-QWRGUYRKSA-N Gly-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN)O YJDALMUYJIENAG-QWRGUYRKSA-N 0.000 claims 2
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 claims 2
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 claims 2
- HTZKFIYQMHJWSQ-INTQDDNPSA-N His-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N HTZKFIYQMHJWSQ-INTQDDNPSA-N 0.000 claims 2
- DFHVLUKTTVTCKY-PBCZWWQYSA-N His-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N)O DFHVLUKTTVTCKY-PBCZWWQYSA-N 0.000 claims 2
- JFFAPRNXXLRINI-NHCYSSNCSA-N His-Asp-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JFFAPRNXXLRINI-NHCYSSNCSA-N 0.000 claims 2
- FIMNVXRZGUAGBI-AVGNSLFASA-N His-Glu-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FIMNVXRZGUAGBI-AVGNSLFASA-N 0.000 claims 2
- FYTCLUIYTYFGPT-YUMQZZPRSA-N His-Gly-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FYTCLUIYTYFGPT-YUMQZZPRSA-N 0.000 claims 2
- CSTNMMIHMYJGFR-IHRRRGAJSA-N His-His-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CN=CN1 CSTNMMIHMYJGFR-IHRRRGAJSA-N 0.000 claims 2
- BRQKGRLDDDQWQJ-MBLNEYKQSA-N His-Thr-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O BRQKGRLDDDQWQJ-MBLNEYKQSA-N 0.000 claims 2
- FBVHRDXSCYELMI-PBCZWWQYSA-N His-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O FBVHRDXSCYELMI-PBCZWWQYSA-N 0.000 claims 2
- 241000598171 Human adenovirus sp. Species 0.000 claims 2
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 claims 2
- DPTBVFUDCPINIP-JURCDPSOSA-N Ile-Ala-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DPTBVFUDCPINIP-JURCDPSOSA-N 0.000 claims 2
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 claims 2
- VZIFYHYNQDIPLI-HJWJTTGWSA-N Ile-Arg-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N VZIFYHYNQDIPLI-HJWJTTGWSA-N 0.000 claims 2
- HDODQNPMSHDXJT-GHCJXIJMSA-N Ile-Asn-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O HDODQNPMSHDXJT-GHCJXIJMSA-N 0.000 claims 2
- LEDRIAHEWDJRMF-CFMVVWHZSA-N Ile-Asn-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LEDRIAHEWDJRMF-CFMVVWHZSA-N 0.000 claims 2
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 claims 2
- CCHSQWLCOOZREA-GMOBBJLQSA-N Ile-Asp-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N CCHSQWLCOOZREA-GMOBBJLQSA-N 0.000 claims 2
- UCGDDTHMMVWVMV-FSPLSTOPSA-N Ile-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(O)=O UCGDDTHMMVWVMV-FSPLSTOPSA-N 0.000 claims 2
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 claims 2
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 claims 2
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 claims 2
- JWBXCSQZLLIOCI-GUBZILKMSA-N Ile-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CC(C)C JWBXCSQZLLIOCI-GUBZILKMSA-N 0.000 claims 2
- OVDKXUDMKXAZIV-ZPFDUUQYSA-N Ile-Lys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OVDKXUDMKXAZIV-ZPFDUUQYSA-N 0.000 claims 2
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 claims 2
- DNKDIDZHXZAGRY-HJWJTTGWSA-N Ile-Met-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N DNKDIDZHXZAGRY-HJWJTTGWSA-N 0.000 claims 2
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 claims 2
- VISRCHQHQCLODA-NAKRPEOUSA-N Ile-Pro-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N VISRCHQHQCLODA-NAKRPEOUSA-N 0.000 claims 2
- CIJLNXXMDUOFPH-HJWJTTGWSA-N Ile-Pro-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CIJLNXXMDUOFPH-HJWJTTGWSA-N 0.000 claims 2
- XOZOSAUOGRPCES-STECZYCISA-N Ile-Pro-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XOZOSAUOGRPCES-STECZYCISA-N 0.000 claims 2
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 claims 2
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 claims 2
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 claims 2
- BLFXHAFTNYZEQE-VKOGCVSHSA-N Ile-Trp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N BLFXHAFTNYZEQE-VKOGCVSHSA-N 0.000 claims 2
- DZMWFIRHFFVBHS-ZEWNOJEFSA-N Ile-Tyr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N DZMWFIRHFFVBHS-ZEWNOJEFSA-N 0.000 claims 2
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 claims 2
- PWWVAXIEGOYWEE-UHFFFAOYSA-N Isophenergan Chemical compound C1=CC=C2N(CC(C)N(C)C)C3=CC=CC=C3SC2=C1 PWWVAXIEGOYWEE-UHFFFAOYSA-N 0.000 claims 2
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 claims 2
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 claims 2
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 claims 2
- MDVZJYGNAGLPGJ-KKUMJFAQSA-N Leu-Asn-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MDVZJYGNAGLPGJ-KKUMJFAQSA-N 0.000 claims 2
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 claims 2
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 claims 2
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 claims 2
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 claims 2
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 claims 2
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 claims 2
- XBCWOTOCBXXJDG-BZSNNMDCSA-N Leu-His-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 XBCWOTOCBXXJDG-BZSNNMDCSA-N 0.000 claims 2
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 claims 2
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 claims 2
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 claims 2
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 claims 2
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 claims 2
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 claims 2
- HDHQQEDVWQGBEE-DCAQKATOSA-N Leu-Met-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HDHQQEDVWQGBEE-DCAQKATOSA-N 0.000 claims 2
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 claims 2
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 claims 2
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 claims 2
- UCXQIIIFOOGYEM-ULQDDVLXSA-N Leu-Pro-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCXQIIIFOOGYEM-ULQDDVLXSA-N 0.000 claims 2
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 claims 2
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 claims 2
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 claims 2
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 claims 2
- IDGRADDMTTWOQC-WDSOQIARSA-N Leu-Trp-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IDGRADDMTTWOQC-WDSOQIARSA-N 0.000 claims 2
- SUYRAPCRSCCPAK-VFAJRCTISA-N Leu-Trp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SUYRAPCRSCCPAK-VFAJRCTISA-N 0.000 claims 2
- RDFIVFHPOSOXMW-ACRUOGEOSA-N Leu-Tyr-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RDFIVFHPOSOXMW-ACRUOGEOSA-N 0.000 claims 2
- AXVIGSRGTMNSJU-YESZJQIVSA-N Leu-Tyr-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N AXVIGSRGTMNSJU-YESZJQIVSA-N 0.000 claims 2
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 claims 2
- SWWCDAGDQHTKIE-RHYQMDGZSA-N Lys-Arg-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWWCDAGDQHTKIE-RHYQMDGZSA-N 0.000 claims 2
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 claims 2
- RLZDUFRBMQNYIJ-YUMQZZPRSA-N Lys-Cys-Gly Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N RLZDUFRBMQNYIJ-YUMQZZPRSA-N 0.000 claims 2
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 claims 2
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 claims 2
- ORVFEGYUJITPGI-IHRRRGAJSA-N Lys-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN ORVFEGYUJITPGI-IHRRRGAJSA-N 0.000 claims 2
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 claims 2
- ALEVUGKHINJNIF-QEJZJMRPSA-N Lys-Phe-Ala Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ALEVUGKHINJNIF-QEJZJMRPSA-N 0.000 claims 2
- AIXUQKMMBQJZCU-IUCAKERBSA-N Lys-Pro Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O AIXUQKMMBQJZCU-IUCAKERBSA-N 0.000 claims 2
- WGILOYIKJVQUPT-DCAQKATOSA-N Lys-Pro-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WGILOYIKJVQUPT-DCAQKATOSA-N 0.000 claims 2
- LKDXINHHSWFFJC-SRVKXCTJSA-N Lys-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N LKDXINHHSWFFJC-SRVKXCTJSA-N 0.000 claims 2
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 claims 2
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 claims 2
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 claims 2
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 claims 2
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 claims 2
- HMZPYMSEAALNAE-ULQDDVLXSA-N Lys-Val-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMZPYMSEAALNAE-ULQDDVLXSA-N 0.000 claims 2
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 claims 2
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 claims 2
- WYEXWKAWMNJKPN-UBHSHLNASA-N Met-Ala-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCSC)N WYEXWKAWMNJKPN-UBHSHLNASA-N 0.000 claims 2
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 claims 2
- CTVJSFRHUOSCQQ-DCAQKATOSA-N Met-Arg-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CTVJSFRHUOSCQQ-DCAQKATOSA-N 0.000 claims 2
- TZLYIHDABYBOCJ-FXQIFTODSA-N Met-Asp-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O TZLYIHDABYBOCJ-FXQIFTODSA-N 0.000 claims 2
- CHQWUYSNAOABIP-ZPFDUUQYSA-N Met-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N CHQWUYSNAOABIP-ZPFDUUQYSA-N 0.000 claims 2
- WPTDJKDGICUFCP-XUXIUFHCSA-N Met-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCSC)N WPTDJKDGICUFCP-XUXIUFHCSA-N 0.000 claims 2
- HOZNVKDCKZPRER-XUXIUFHCSA-N Met-Lys-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HOZNVKDCKZPRER-XUXIUFHCSA-N 0.000 claims 2
- IILAGWCGKJSBGB-IHRRRGAJSA-N Met-Phe-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IILAGWCGKJSBGB-IHRRRGAJSA-N 0.000 claims 2
- KYXDADPHSNFWQX-VEVYYDQMSA-N Met-Thr-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O KYXDADPHSNFWQX-VEVYYDQMSA-N 0.000 claims 2
- QQPMHUCGDRJFQK-RHYQMDGZSA-N Met-Thr-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QQPMHUCGDRJFQK-RHYQMDGZSA-N 0.000 claims 2
- UZBQXELAFPCGRV-SZMVWBNQSA-N Met-Trp-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UZBQXELAFPCGRV-SZMVWBNQSA-N 0.000 claims 2
- GHQFLTYXGUETFD-UFYCRDLUSA-N Met-Tyr-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N GHQFLTYXGUETFD-UFYCRDLUSA-N 0.000 claims 2
- CQRGINSEMFBACV-WPRPVWTQSA-N Met-Val-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O CQRGINSEMFBACV-WPRPVWTQSA-N 0.000 claims 2
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 claims 2
- BBDSZDHUCPSYAC-QEJZJMRPSA-N Phe-Ala-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BBDSZDHUCPSYAC-QEJZJMRPSA-N 0.000 claims 2
- SEPNOAFMZLLCEW-UBHSHLNASA-N Phe-Ala-Val Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O SEPNOAFMZLLCEW-UBHSHLNASA-N 0.000 claims 2
- CGOMLCQJEMWMCE-STQMWFEESA-N Phe-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CGOMLCQJEMWMCE-STQMWFEESA-N 0.000 claims 2
- JEGFCFLCRSJCMA-IHRRRGAJSA-N Phe-Arg-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N JEGFCFLCRSJCMA-IHRRRGAJSA-N 0.000 claims 2
- HCTXJGRYAACKOB-SRVKXCTJSA-N Phe-Asn-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HCTXJGRYAACKOB-SRVKXCTJSA-N 0.000 claims 2
- AWAYOWOUGVZXOB-BZSNNMDCSA-N Phe-Asn-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 AWAYOWOUGVZXOB-BZSNNMDCSA-N 0.000 claims 2
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 claims 2
- KJJROSNFBRWPHS-JYJNAYRXSA-N Phe-Glu-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KJJROSNFBRWPHS-JYJNAYRXSA-N 0.000 claims 2
- OYQBFWWQSVIHBN-FHWLQOOXSA-N Phe-Glu-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O OYQBFWWQSVIHBN-FHWLQOOXSA-N 0.000 claims 2
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 claims 2
- ZKSLXIGKRJMALF-MGHWNKPDSA-N Phe-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N ZKSLXIGKRJMALF-MGHWNKPDSA-N 0.000 claims 2
- CWFGECHCRMGPPT-MXAVVETBSA-N Phe-Ile-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O CWFGECHCRMGPPT-MXAVVETBSA-N 0.000 claims 2
- BYAIIACBWBOJCU-URLPEUOOSA-N Phe-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BYAIIACBWBOJCU-URLPEUOOSA-N 0.000 claims 2
- KLXQWABNAWDRAY-ACRUOGEOSA-N Phe-Lys-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 KLXQWABNAWDRAY-ACRUOGEOSA-N 0.000 claims 2
- WZEWCHQHNCMBEN-PMVMPFDFSA-N Phe-Lys-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N WZEWCHQHNCMBEN-PMVMPFDFSA-N 0.000 claims 2
- FQUUYTNBMIBOHS-IHRRRGAJSA-N Phe-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FQUUYTNBMIBOHS-IHRRRGAJSA-N 0.000 claims 2
- YVIVIQWMNCWUFS-UFYCRDLUSA-N Phe-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N YVIVIQWMNCWUFS-UFYCRDLUSA-N 0.000 claims 2
- OWSLLRKCHLTUND-BZSNNMDCSA-N Phe-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OWSLLRKCHLTUND-BZSNNMDCSA-N 0.000 claims 2
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 claims 2
- FGWUALWGCZJQDJ-URLPEUOOSA-N Phe-Thr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGWUALWGCZJQDJ-URLPEUOOSA-N 0.000 claims 2
- ABEFOXGAIIJDCL-SFJXLCSZSA-N Phe-Thr-Trp Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 ABEFOXGAIIJDCL-SFJXLCSZSA-N 0.000 claims 2
- GAMLAXHLYGLQBJ-UFYCRDLUSA-N Phe-Val-Tyr Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC1=CC=C(C=C1)O)C(C)C)CC1=CC=CC=C1 GAMLAXHLYGLQBJ-UFYCRDLUSA-N 0.000 claims 2
- HFZNNDWPHBRNPV-KZVJFYERSA-N Pro-Ala-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HFZNNDWPHBRNPV-KZVJFYERSA-N 0.000 claims 2
- LCRSGSIRKLXZMZ-BPNCWPANSA-N Pro-Ala-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LCRSGSIRKLXZMZ-BPNCWPANSA-N 0.000 claims 2
- OCSACVPBMIYNJE-GUBZILKMSA-N Pro-Arg-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O OCSACVPBMIYNJE-GUBZILKMSA-N 0.000 claims 2
- AMBLXEMWFARNNQ-DCAQKATOSA-N Pro-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 AMBLXEMWFARNNQ-DCAQKATOSA-N 0.000 claims 2
- SWXSLPHTJVAWDF-VEVYYDQMSA-N Pro-Asn-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWXSLPHTJVAWDF-VEVYYDQMSA-N 0.000 claims 2
- CJZTUKSFZUSNCC-FXQIFTODSA-N Pro-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 CJZTUKSFZUSNCC-FXQIFTODSA-N 0.000 claims 2
- QVIZLAUEAMQKGS-GUBZILKMSA-N Pro-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 QVIZLAUEAMQKGS-GUBZILKMSA-N 0.000 claims 2
- WFLWKEUBTSOFMP-FXQIFTODSA-N Pro-Cys-Cys Chemical compound OC(=O)[C@H](CS)NC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 WFLWKEUBTSOFMP-FXQIFTODSA-N 0.000 claims 2
- LCWXSALTPTZKNM-CIUDSAMLSA-N Pro-Cys-Glu Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O LCWXSALTPTZKNM-CIUDSAMLSA-N 0.000 claims 2
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 claims 2
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 claims 2
- FEVDNIBDCRKMER-IUCAKERBSA-N Pro-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEVDNIBDCRKMER-IUCAKERBSA-N 0.000 claims 2
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 claims 2
- OCYROESYHWUPBP-CIUDSAMLSA-N Pro-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)[C@@H]1CCC[NH2+]1 OCYROESYHWUPBP-CIUDSAMLSA-N 0.000 claims 2
- RYJRPPUATSKNAY-STECZYCISA-N Pro-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@@H]2CCCN2 RYJRPPUATSKNAY-STECZYCISA-N 0.000 claims 2
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 claims 2
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 claims 2
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 claims 2
- SXMSEHDMNIUTSP-DCAQKATOSA-N Pro-Lys-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SXMSEHDMNIUTSP-DCAQKATOSA-N 0.000 claims 2
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 claims 2
- NTXFLJULRHQMDC-GUBZILKMSA-N Pro-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 NTXFLJULRHQMDC-GUBZILKMSA-N 0.000 claims 2
- QGLFRQCECIWXFA-RCWTZXSCSA-N Pro-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1)O QGLFRQCECIWXFA-RCWTZXSCSA-N 0.000 claims 2
- VGVCNKSUVSZEIE-IHRRRGAJSA-N Pro-Phe-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O VGVCNKSUVSZEIE-IHRRRGAJSA-N 0.000 claims 2
- AWQGDZBKQTYNMN-IHRRRGAJSA-N Pro-Phe-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)O)C(=O)O AWQGDZBKQTYNMN-IHRRRGAJSA-N 0.000 claims 2
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 claims 2
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 claims 2
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 claims 2
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 claims 2
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 claims 2
- CHYAYDLYYIJCKY-OSUNSFLBSA-N Pro-Thr-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CHYAYDLYYIJCKY-OSUNSFLBSA-N 0.000 claims 2
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 claims 2
- DGDCSVGVWWAJRS-AVGNSLFASA-N Pro-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 DGDCSVGVWWAJRS-AVGNSLFASA-N 0.000 claims 2
- SSJMZMUVNKEENT-IMJSIDKUSA-N Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CO SSJMZMUVNKEENT-IMJSIDKUSA-N 0.000 claims 2
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 claims 2
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 claims 2
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 claims 2
- WXUBSIDKNMFAGS-IHRRRGAJSA-N Ser-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXUBSIDKNMFAGS-IHRRRGAJSA-N 0.000 claims 2
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 claims 2
- OBXVZEAMXFSGPU-FXQIFTODSA-N Ser-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)CN=C(N)N OBXVZEAMXFSGPU-FXQIFTODSA-N 0.000 claims 2
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 claims 2
- HEQPKICPPDOSIN-SRVKXCTJSA-N Ser-Asp-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HEQPKICPPDOSIN-SRVKXCTJSA-N 0.000 claims 2
- MPPHJZYXDVDGOF-BWBBJGPYSA-N Ser-Cys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CO MPPHJZYXDVDGOF-BWBBJGPYSA-N 0.000 claims 2
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 claims 2
- QKQDTEYDEIJPNK-GUBZILKMSA-N Ser-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO QKQDTEYDEIJPNK-GUBZILKMSA-N 0.000 claims 2
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 claims 2
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 claims 2
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 claims 2
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 claims 2
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 claims 2
- UGHCUDLCCVVIJR-VGDYDELISA-N Ser-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N UGHCUDLCCVVIJR-VGDYDELISA-N 0.000 claims 2
- JEHPKECJCALLRW-CUJWVEQBSA-N Ser-His-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEHPKECJCALLRW-CUJWVEQBSA-N 0.000 claims 2
- YMDNFPNTIPQMJP-NAKRPEOUSA-N Ser-Ile-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O YMDNFPNTIPQMJP-NAKRPEOUSA-N 0.000 claims 2
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 claims 2
- MQQBBLVOUUJKLH-HJPIBITLSA-N Ser-Ile-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQQBBLVOUUJKLH-HJPIBITLSA-N 0.000 claims 2
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 claims 2
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 claims 2
- GVIGVIOEYBOTCB-XIRDDKMYSA-N Ser-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC(C)C)C(O)=O)=CNC2=C1 GVIGVIOEYBOTCB-XIRDDKMYSA-N 0.000 claims 2
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 claims 2
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 claims 2
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 claims 2
- XZKQVQKUZMAADP-IMJSIDKUSA-N Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(O)=O XZKQVQKUZMAADP-IMJSIDKUSA-N 0.000 claims 2
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 claims 2
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 claims 2
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 claims 2
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 claims 2
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 claims 2
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 claims 2
- OJFFAQFRCVPHNN-JYBASQMISA-N Ser-Thr-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OJFFAQFRCVPHNN-JYBASQMISA-N 0.000 claims 2
- VAIWUNAAPZZGRI-IHPCNDPISA-N Ser-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CO)N VAIWUNAAPZZGRI-IHPCNDPISA-N 0.000 claims 2
- GSCVDSBEYVGMJQ-SRVKXCTJSA-N Ser-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)O GSCVDSBEYVGMJQ-SRVKXCTJSA-N 0.000 claims 2
- QYBRQMLZDDJBSW-AVGNSLFASA-N Ser-Tyr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYBRQMLZDDJBSW-AVGNSLFASA-N 0.000 claims 2
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 claims 2
- ODRUTDLAONAVDV-IHRRRGAJSA-N Ser-Val-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ODRUTDLAONAVDV-IHRRRGAJSA-N 0.000 claims 2
- DFTCYYILCSQGIZ-GCJQMDKQSA-N Thr-Ala-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFTCYYILCSQGIZ-GCJQMDKQSA-N 0.000 claims 2
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 claims 2
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 claims 2
- UKBSDLHIKIXJKH-HJGDQZAQSA-N Thr-Arg-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UKBSDLHIKIXJKH-HJGDQZAQSA-N 0.000 claims 2
- JNQZPAWOPBZGIX-RCWTZXSCSA-N Thr-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N JNQZPAWOPBZGIX-RCWTZXSCSA-N 0.000 claims 2
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 claims 2
- DCLBXIWHLVEPMQ-JRQIVUDYSA-N Thr-Asp-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DCLBXIWHLVEPMQ-JRQIVUDYSA-N 0.000 claims 2
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 claims 2
- HJOSVGCWOTYJFG-WDCWCFNPSA-N Thr-Glu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O HJOSVGCWOTYJFG-WDCWCFNPSA-N 0.000 claims 2
- KBLYJPQSNGTDIU-LOKLDPHHSA-N Thr-Glu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O KBLYJPQSNGTDIU-LOKLDPHHSA-N 0.000 claims 2
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 claims 2
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 claims 2
- WBCCCPZIJIJTSD-TUBUOCAGSA-N Thr-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H]([C@@H](C)O)N WBCCCPZIJIJTSD-TUBUOCAGSA-N 0.000 claims 2
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 claims 2
- YJCVECXVYHZOBK-KNZXXDILSA-N Thr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H]([C@@H](C)O)N YJCVECXVYHZOBK-KNZXXDILSA-N 0.000 claims 2
- IHAPJUHCZXBPHR-WZLNRYEVSA-N Thr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N IHAPJUHCZXBPHR-WZLNRYEVSA-N 0.000 claims 2
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 claims 2
- WVVOFCVMHAXGLE-LFSVMHDDSA-N Thr-Phe-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O WVVOFCVMHAXGLE-LFSVMHDDSA-N 0.000 claims 2
- VGYVVSQFSSKZRJ-OEAJRASXSA-N Thr-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=CC=C1 VGYVVSQFSSKZRJ-OEAJRASXSA-N 0.000 claims 2
- JMBRNXUOLJFURW-BEAPCOKYSA-N Thr-Phe-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N)O JMBRNXUOLJFURW-BEAPCOKYSA-N 0.000 claims 2
- MEBDIIKMUUNBSB-RPTUDFQQSA-N Thr-Phe-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MEBDIIKMUUNBSB-RPTUDFQQSA-N 0.000 claims 2
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 claims 2
- OLFOOYQTTQSSRK-UNQGMJICSA-N Thr-Pro-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLFOOYQTTQSSRK-UNQGMJICSA-N 0.000 claims 2
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 claims 2
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 claims 2
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 claims 2
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 claims 2
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 claims 2
- QYDKSNXSBXZPFK-ZJDVBMNYSA-N Thr-Thr-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYDKSNXSBXZPFK-ZJDVBMNYSA-N 0.000 claims 2
- PJCYRZVSACOYSN-ZJDVBMNYSA-N Thr-Thr-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O PJCYRZVSACOYSN-ZJDVBMNYSA-N 0.000 claims 2
- QJIODPFLAASXJC-JHYOHUSXSA-N Thr-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O QJIODPFLAASXJC-JHYOHUSXSA-N 0.000 claims 2
- ZOCJFNXUVSGBQI-HSHDSVGOSA-N Thr-Trp-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O ZOCJFNXUVSGBQI-HSHDSVGOSA-N 0.000 claims 2
- XEVHXNLPUBVQEX-DVJZZOLTSA-N Thr-Trp-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N)O XEVHXNLPUBVQEX-DVJZZOLTSA-N 0.000 claims 2
- LVRFMARKDGGZMX-IZPVPAKOSA-N Thr-Tyr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=C(O)C=C1 LVRFMARKDGGZMX-IZPVPAKOSA-N 0.000 claims 2
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 claims 2
- BRBCKMMXKONBAA-KWBADKCTSA-N Trp-Ala-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 BRBCKMMXKONBAA-KWBADKCTSA-N 0.000 claims 2
- SCQBNMKLZVCXNX-ZFWWWQNUSA-N Trp-Arg-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N SCQBNMKLZVCXNX-ZFWWWQNUSA-N 0.000 claims 2
- GRQCSEWEPIHLBI-JQWIXIFHSA-N Trp-Asn Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 GRQCSEWEPIHLBI-JQWIXIFHSA-N 0.000 claims 2
- RNFZZCMCRDFNAE-WFBYXXMGSA-N Trp-Asn-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O RNFZZCMCRDFNAE-WFBYXXMGSA-N 0.000 claims 2
- UKINEYBQXPMOJO-UBHSHLNASA-N Trp-Asn-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N UKINEYBQXPMOJO-UBHSHLNASA-N 0.000 claims 2
- WVHUFSCKCBQKJW-HKUYNNGSSA-N Trp-Gly-Tyr Chemical compound C([C@H](NC(=O)CNC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 WVHUFSCKCBQKJW-HKUYNNGSSA-N 0.000 claims 2
- OTWIOROMZLNAQC-XIRDDKMYSA-N Trp-His-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O OTWIOROMZLNAQC-XIRDDKMYSA-N 0.000 claims 2
- BOBZBMOTRORUPT-XIRDDKMYSA-N Trp-Ser-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 BOBZBMOTRORUPT-XIRDDKMYSA-N 0.000 claims 2
- LNGFWVPNKLWATF-ZVZYQTTQSA-N Trp-Val-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LNGFWVPNKLWATF-ZVZYQTTQSA-N 0.000 claims 2
- PZXUIGWOEWWFQM-SRVKXCTJSA-N Tyr-Asn-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O PZXUIGWOEWWFQM-SRVKXCTJSA-N 0.000 claims 2
- PEVVXUGSAKEPEN-AVGNSLFASA-N Tyr-Asn-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PEVVXUGSAKEPEN-AVGNSLFASA-N 0.000 claims 2
- WEFIPBYPXZYPHD-HJPIBITLSA-N Tyr-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WEFIPBYPXZYPHD-HJPIBITLSA-N 0.000 claims 2
- CNLKDWSAORJEMW-KWQFWETISA-N Tyr-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O CNLKDWSAORJEMW-KWQFWETISA-N 0.000 claims 2
- CDHQEOXPWBDFPL-QWRGUYRKSA-N Tyr-Gly-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDHQEOXPWBDFPL-QWRGUYRKSA-N 0.000 claims 2
- KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 claims 2
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 claims 2
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 claims 2
- CDKZJGMPZHPAJC-ULQDDVLXSA-N Tyr-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDKZJGMPZHPAJC-ULQDDVLXSA-N 0.000 claims 2
- PMHLLBKTDHQMCY-ULQDDVLXSA-N Tyr-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMHLLBKTDHQMCY-ULQDDVLXSA-N 0.000 claims 2
- SOEGLGLDSUHWTI-STECZYCISA-N Tyr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 SOEGLGLDSUHWTI-STECZYCISA-N 0.000 claims 2
- BIWVVOHTKDLRMP-ULQDDVLXSA-N Tyr-Pro-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BIWVVOHTKDLRMP-ULQDDVLXSA-N 0.000 claims 2
- XGZBEGGGAUQBMB-KJEVXHAQSA-N Tyr-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CC=C(C=C2)O)N)O XGZBEGGGAUQBMB-KJEVXHAQSA-N 0.000 claims 2
- VPEFOFYNHBWFNQ-UFYCRDLUSA-N Tyr-Pro-Tyr Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 VPEFOFYNHBWFNQ-UFYCRDLUSA-N 0.000 claims 2
- RWOKVQUCENPXGE-IHRRRGAJSA-N Tyr-Ser-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RWOKVQUCENPXGE-IHRRRGAJSA-N 0.000 claims 2
- HRHYJNLMIJWGLF-BZSNNMDCSA-N Tyr-Ser-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 HRHYJNLMIJWGLF-BZSNNMDCSA-N 0.000 claims 2
- UMSZZGTXGKHTFJ-SRVKXCTJSA-N Tyr-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UMSZZGTXGKHTFJ-SRVKXCTJSA-N 0.000 claims 2
- MDXLPNRXCFOBTL-BZSNNMDCSA-N Tyr-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MDXLPNRXCFOBTL-BZSNNMDCSA-N 0.000 claims 2
- MFEVVAXTBZELLL-GGVZMXCHSA-N Tyr-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MFEVVAXTBZELLL-GGVZMXCHSA-N 0.000 claims 2
- YMZYSCDRTXEOKD-IHPCNDPISA-N Tyr-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N YMZYSCDRTXEOKD-IHPCNDPISA-N 0.000 claims 2
- DJIJBQYBDKGDIS-JYJNAYRXSA-N Tyr-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O DJIJBQYBDKGDIS-JYJNAYRXSA-N 0.000 claims 2
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 claims 2
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 claims 2
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 claims 2
- VJOWWOGRNXRQMF-UVBJJODRSA-N Val-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 VJOWWOGRNXRQMF-UVBJJODRSA-N 0.000 claims 2
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 claims 2
- ZMDCGGKHRKNWKD-LAEOZQHASA-N Val-Asn-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZMDCGGKHRKNWKD-LAEOZQHASA-N 0.000 claims 2
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 claims 2
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 claims 2
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 claims 2
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 claims 2
- FOADDSDHGRFUOC-DZKIICNBSA-N Val-Glu-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FOADDSDHGRFUOC-DZKIICNBSA-N 0.000 claims 2
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 claims 2
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 claims 2
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 claims 2
- MDYSKHBSPXUOPV-JSGCOSHPSA-N Val-Gly-Phe Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MDYSKHBSPXUOPV-JSGCOSHPSA-N 0.000 claims 2
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 claims 2
- PTFPUAXGIKTVNN-ONGXEEELSA-N Val-His-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N PTFPUAXGIKTVNN-ONGXEEELSA-N 0.000 claims 2
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 claims 2
- VHRLUTIMTDOVCG-PEDHHIEDSA-N Val-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](C(C)C)N VHRLUTIMTDOVCG-PEDHHIEDSA-N 0.000 claims 2
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 claims 2
- WDIWOIRFNMLNKO-ULQDDVLXSA-N Val-Leu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WDIWOIRFNMLNKO-ULQDDVLXSA-N 0.000 claims 2
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 claims 2
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 claims 2
- ILMVQSHENUZYIZ-JYJNAYRXSA-N Val-Met-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N ILMVQSHENUZYIZ-JYJNAYRXSA-N 0.000 claims 2
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 claims 2
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 claims 2
- VCIYTVOBLZHFSC-XHSDSOJGSA-N Val-Phe-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N VCIYTVOBLZHFSC-XHSDSOJGSA-N 0.000 claims 2
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 claims 2
- USLVEJAHTBLSIL-CYDGBPFRSA-N Val-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C USLVEJAHTBLSIL-CYDGBPFRSA-N 0.000 claims 2
- QWCZXKIFPWPQHR-JYJNAYRXSA-N Val-Pro-Tyr Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QWCZXKIFPWPQHR-JYJNAYRXSA-N 0.000 claims 2
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 claims 2
- GVRKWABULJAONN-VQVTYTSYSA-N Val-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVRKWABULJAONN-VQVTYTSYSA-N 0.000 claims 2
- PQSNETRGCRUOGP-KKHAAJSZSA-N Val-Thr-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O PQSNETRGCRUOGP-KKHAAJSZSA-N 0.000 claims 2
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 claims 2
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 claims 2
- STTYIMSDIYISRG-UHFFFAOYSA-N Valyl-Serine Chemical compound CC(C)C(N)C(=O)NC(CO)C(O)=O STTYIMSDIYISRG-UHFFFAOYSA-N 0.000 claims 2
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 claims 2
- 108010045023 alanyl-prolyl-tyrosine Proteins 0.000 claims 2
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 claims 2
- 108010069926 arginyl-glycyl-serine Proteins 0.000 claims 2
- 108010010430 asparagine-proline-alanine Proteins 0.000 claims 2
- 210000004027 cell Anatomy 0.000 claims 2
- 108010004073 cysteinylcysteine Proteins 0.000 claims 2
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 claims 2
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 claims 2
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 claims 2
- YMAWOPBAYDPSLA-UHFFFAOYSA-N glycylglycine Chemical compound [NH3+]CC(=O)NCC([O-])=O YMAWOPBAYDPSLA-UHFFFAOYSA-N 0.000 claims 2
- 108010085325 histidylproline Proteins 0.000 claims 2
- 108010027338 isoleucylcysteine Proteins 0.000 claims 2
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 claims 2
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 claims 2
- 230000010076 replication Effects 0.000 claims 2
- 108010048397 seryl-lysyl-leucine Proteins 0.000 claims 2
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 claims 2
- 108010072986 threonyl-seryl-lysine Proteins 0.000 claims 2
- 230000009261 transgenic effect Effects 0.000 claims 2
- 108010080629 tryptophan-leucine Proteins 0.000 claims 2
- 108010071635 tyrosyl-prolyl-arginine Proteins 0.000 claims 2
- 108010027345 wheylin-1 peptide Proteins 0.000 claims 2
- ZEIYPKQQLSUPOT-QORCZRPOSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-phenylpropanoic acid Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 ZEIYPKQQLSUPOT-QORCZRPOSA-N 0.000 claims 1
- RVLOMLVNNBWRSR-KNIFDHDWSA-N (2s)-2-aminopropanoic acid;(2s)-2,6-diaminohexanoic acid Chemical compound C[C@H](N)C(O)=O.NCCCC[C@H](N)C(O)=O RVLOMLVNNBWRSR-KNIFDHDWSA-N 0.000 claims 1
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 claims 1
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 claims 1
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 claims 1
- IMMKUCQIKKXKNP-DCAQKATOSA-N Ala-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCN=C(N)N IMMKUCQIKKXKNP-DCAQKATOSA-N 0.000 claims 1
- UCIYCBSJBQGDGM-LPEHRKFASA-N Ala-Arg-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N UCIYCBSJBQGDGM-LPEHRKFASA-N 0.000 claims 1
- GORKKVHIBWAQHM-GCJQMDKQSA-N Ala-Asn-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GORKKVHIBWAQHM-GCJQMDKQSA-N 0.000 claims 1
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 claims 1
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 claims 1
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 claims 1
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 claims 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 claims 1
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 claims 1
- CXISPYVYMQWFLE-VKHMYHEASA-N Ala-Gly Chemical compound C[C@H]([NH3+])C(=O)NCC([O-])=O CXISPYVYMQWFLE-VKHMYHEASA-N 0.000 claims 1
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 claims 1
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 claims 1
- LBFXVAXPDOBRKU-LKTVYLICSA-N Ala-His-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LBFXVAXPDOBRKU-LKTVYLICSA-N 0.000 claims 1
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 claims 1
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 claims 1
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 claims 1
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 claims 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 claims 1
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 claims 1
- FUKFQILQFQKHLE-DCAQKATOSA-N Ala-Lys-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O FUKFQILQFQKHLE-DCAQKATOSA-N 0.000 claims 1
- BLTRAARCJYVJKV-QEJZJMRPSA-N Ala-Lys-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(O)=O BLTRAARCJYVJKV-QEJZJMRPSA-N 0.000 claims 1
- OMDNCNKNEGFOMM-BQBZGAKWSA-N Ala-Met-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O OMDNCNKNEGFOMM-BQBZGAKWSA-N 0.000 claims 1
- GFEDXKNBZMPEDM-KZVJFYERSA-N Ala-Met-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFEDXKNBZMPEDM-KZVJFYERSA-N 0.000 claims 1
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 claims 1
- RNHKOQHGYMTHFR-UBHSHLNASA-N Ala-Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 RNHKOQHGYMTHFR-UBHSHLNASA-N 0.000 claims 1
- WPWUFUBLGADILS-WDSKDSINSA-N Ala-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O WPWUFUBLGADILS-WDSKDSINSA-N 0.000 claims 1
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 claims 1
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 claims 1
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 claims 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 claims 1
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 claims 1
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 claims 1
- QKHWNPQNOHEFST-VZFHVOOUSA-N Ala-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N)O QKHWNPQNOHEFST-VZFHVOOUSA-N 0.000 claims 1
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 claims 1
- XMIAMUXIMWREBJ-HERUPUMHSA-N Ala-Trp-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XMIAMUXIMWREBJ-HERUPUMHSA-N 0.000 claims 1
- SFPRJVVDZNLUTG-OWLDWWDNSA-N Ala-Trp-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFPRJVVDZNLUTG-OWLDWWDNSA-N 0.000 claims 1
- ALZVPLKYDKJKQU-XVKPBYJWSA-N Ala-Tyr Chemical compound C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 ALZVPLKYDKJKQU-XVKPBYJWSA-N 0.000 claims 1
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 claims 1
- GCTANJIJJROSLH-GVARAGBVSA-N Ala-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C)N GCTANJIJJROSLH-GVARAGBVSA-N 0.000 claims 1
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 claims 1
- OLDOLPWZEMHNIA-PJODQICGSA-N Arg-Ala-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OLDOLPWZEMHNIA-PJODQICGSA-N 0.000 claims 1
- ZTKHZAXGTFXUDD-VEVYYDQMSA-N Arg-Asn-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZTKHZAXGTFXUDD-VEVYYDQMSA-N 0.000 claims 1
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 claims 1
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 claims 1
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 claims 1
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 claims 1
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 claims 1
- RFXXUWGNVRJTNQ-QXEWZRGKSA-N Arg-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N RFXXUWGNVRJTNQ-QXEWZRGKSA-N 0.000 claims 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 claims 1
- ZATRYQNPUHGXCU-DTWKUNHWSA-N Arg-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZATRYQNPUHGXCU-DTWKUNHWSA-N 0.000 claims 1
- VRZDJJWOFXMFRO-ZFWWWQNUSA-N Arg-Gly-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O VRZDJJWOFXMFRO-ZFWWWQNUSA-N 0.000 claims 1
- CRCCTGPNZUCAHE-DCAQKATOSA-N Arg-His-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CN=CN1 CRCCTGPNZUCAHE-DCAQKATOSA-N 0.000 claims 1
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 claims 1
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 claims 1
- NOZYDJOPOGKUSR-AVGNSLFASA-N Arg-Leu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O NOZYDJOPOGKUSR-AVGNSLFASA-N 0.000 claims 1
- PZBSKYJGKNNYNK-ULQDDVLXSA-N Arg-Leu-Tyr Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O PZBSKYJGKNNYNK-ULQDDVLXSA-N 0.000 claims 1
- MJINRRBEMOLJAK-DCAQKATOSA-N Arg-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N MJINRRBEMOLJAK-DCAQKATOSA-N 0.000 claims 1
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 claims 1
- LCBSSOCDWUTQQV-SDDRHHMPSA-N Arg-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LCBSSOCDWUTQQV-SDDRHHMPSA-N 0.000 claims 1
- YTMKMRSYXHBGER-IHRRRGAJSA-N Arg-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YTMKMRSYXHBGER-IHRRRGAJSA-N 0.000 claims 1
- DPLFNLDACGGBAK-KKUMJFAQSA-N Arg-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N DPLFNLDACGGBAK-KKUMJFAQSA-N 0.000 claims 1
- SLQQPJBDBVPVQV-JYJNAYRXSA-N Arg-Phe-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O SLQQPJBDBVPVQV-JYJNAYRXSA-N 0.000 claims 1
- YFHATWYGAAXQCF-JYJNAYRXSA-N Arg-Pro-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YFHATWYGAAXQCF-JYJNAYRXSA-N 0.000 claims 1
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 claims 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 claims 1
- FBXMCPLCVYUWBO-BPUTZDHNSA-N Arg-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N FBXMCPLCVYUWBO-BPUTZDHNSA-N 0.000 claims 1
- SYFHFLGAROUHNT-VEVYYDQMSA-N Arg-Thr-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SYFHFLGAROUHNT-VEVYYDQMSA-N 0.000 claims 1
- UGJLILSJKSBVIR-ZFWWWQNUSA-N Arg-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)NCC(O)=O)=CNC2=C1 UGJLILSJKSBVIR-ZFWWWQNUSA-N 0.000 claims 1
- VLIJAPRTSXSGFY-STQMWFEESA-N Arg-Tyr-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 VLIJAPRTSXSGFY-STQMWFEESA-N 0.000 claims 1
- XEOXPCNONWHHSW-AVGNSLFASA-N Arg-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XEOXPCNONWHHSW-AVGNSLFASA-N 0.000 claims 1
- WTUZDHWWGUQEKN-SRVKXCTJSA-N Arg-Val-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O WTUZDHWWGUQEKN-SRVKXCTJSA-N 0.000 claims 1
- XUTOXNRSAGLAKO-UHFFFAOYSA-N Asn Val Asn Pro Chemical compound NC(=O)CC(N)C(=O)NC(C(C)C)C(=O)NC(CC(N)=O)C(=O)N1CCCC1C(O)=O XUTOXNRSAGLAKO-UHFFFAOYSA-N 0.000 claims 1
- SWLOHUMCUDRTCL-ZLUOBGJFSA-N Asn-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N SWLOHUMCUDRTCL-ZLUOBGJFSA-N 0.000 claims 1
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 claims 1
- ORXCYAFUCSTQGY-FXQIFTODSA-N Asn-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N ORXCYAFUCSTQGY-FXQIFTODSA-N 0.000 claims 1
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 claims 1
- HUZGPXBILPMCHM-IHRRRGAJSA-N Asn-Arg-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HUZGPXBILPMCHM-IHRRRGAJSA-N 0.000 claims 1
- JEPNYDRDYNSFIU-QXEWZRGKSA-N Asn-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(N)=O)C(O)=O JEPNYDRDYNSFIU-QXEWZRGKSA-N 0.000 claims 1
- LJUOLNXOWSWGKF-ACZMJKKPSA-N Asn-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N LJUOLNXOWSWGKF-ACZMJKKPSA-N 0.000 claims 1
- IOTKDTZEEBZNCM-UGYAYLCHSA-N Asn-Asn-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOTKDTZEEBZNCM-UGYAYLCHSA-N 0.000 claims 1
- NLCDVZJDEXIDDL-BIIVOSGPSA-N Asn-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O NLCDVZJDEXIDDL-BIIVOSGPSA-N 0.000 claims 1
- HZYFHQOWCFUSOV-IMJSIDKUSA-N Asn-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O HZYFHQOWCFUSOV-IMJSIDKUSA-N 0.000 claims 1
- QISZHYWZHJRDAO-CIUDSAMLSA-N Asn-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N QISZHYWZHJRDAO-CIUDSAMLSA-N 0.000 claims 1
- VYLVOMUVLMGCRF-ZLUOBGJFSA-N Asn-Asp-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VYLVOMUVLMGCRF-ZLUOBGJFSA-N 0.000 claims 1
- XQQVCUIBGYFKDC-OLHMAJIHSA-N Asn-Asp-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XQQVCUIBGYFKDC-OLHMAJIHSA-N 0.000 claims 1
- XVAPVJNJGLWGCS-ACZMJKKPSA-N Asn-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVAPVJNJGLWGCS-ACZMJKKPSA-N 0.000 claims 1
- GNKVBRYFXYWXAB-WDSKDSINSA-N Asn-Glu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O GNKVBRYFXYWXAB-WDSKDSINSA-N 0.000 claims 1
- GFFRWIJAFFMQGM-NUMRIWBASA-N Asn-Glu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFFRWIJAFFMQGM-NUMRIWBASA-N 0.000 claims 1
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 claims 1
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 claims 1
- ANPFQTJEPONRPL-UGYAYLCHSA-N Asn-Ile-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O ANPFQTJEPONRPL-UGYAYLCHSA-N 0.000 claims 1
- YYSYDIYQTUPNQQ-SXTJYALSSA-N Asn-Ile-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YYSYDIYQTUPNQQ-SXTJYALSSA-N 0.000 claims 1
- HXWUJJADFMXNKA-BQBZGAKWSA-N Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(N)=O HXWUJJADFMXNKA-BQBZGAKWSA-N 0.000 claims 1
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 claims 1
- LWXJVHTUEDHDLG-XUXIUFHCSA-N Asn-Leu-Leu-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O LWXJVHTUEDHDLG-XUXIUFHCSA-N 0.000 claims 1
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 claims 1
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 claims 1
- FTSAJSADJCMDHH-CIUDSAMLSA-N Asn-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FTSAJSADJCMDHH-CIUDSAMLSA-N 0.000 claims 1
- QGABLMITFKUQDF-DCAQKATOSA-N Asn-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N QGABLMITFKUQDF-DCAQKATOSA-N 0.000 claims 1
- VITDJIPIJZAVGC-VEVYYDQMSA-N Asn-Met-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VITDJIPIJZAVGC-VEVYYDQMSA-N 0.000 claims 1
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 claims 1
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 claims 1
- AWXDRZJQCVHCIT-DCAQKATOSA-N Asn-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O AWXDRZJQCVHCIT-DCAQKATOSA-N 0.000 claims 1
- SUIJFTJDTJKSRK-IHRRRGAJSA-N Asn-Pro-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUIJFTJDTJKSRK-IHRRRGAJSA-N 0.000 claims 1
- VHQSGALUSWIYOD-QXEWZRGKSA-N Asn-Pro-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O VHQSGALUSWIYOD-QXEWZRGKSA-N 0.000 claims 1
- SONUFGRSSMFHFN-IMJSIDKUSA-N Asn-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O SONUFGRSSMFHFN-IMJSIDKUSA-N 0.000 claims 1
- VBKIFHUVGLOJKT-FKZODXBYSA-N Asn-Thr Chemical compound C[C@@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)N)O VBKIFHUVGLOJKT-FKZODXBYSA-N 0.000 claims 1
- WUQXMTITJLFXAU-JIOCBJNQSA-N Asn-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N)O WUQXMTITJLFXAU-JIOCBJNQSA-N 0.000 claims 1
- UXHYOWXTJLBEPG-GSSVUCPTSA-N Asn-Thr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UXHYOWXTJLBEPG-GSSVUCPTSA-N 0.000 claims 1
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 claims 1
- RGGVDKVXLBOLNS-JQWIXIFHSA-N Asn-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(N)=O)N)C(O)=O)=CNC2=C1 RGGVDKVXLBOLNS-JQWIXIFHSA-N 0.000 claims 1
- FHCRKXCTKSHNOE-QEJZJMRPSA-N Asn-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FHCRKXCTKSHNOE-QEJZJMRPSA-N 0.000 claims 1
- FYRVDDJMNISIKJ-UWVGGRQHSA-N Asn-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FYRVDDJMNISIKJ-UWVGGRQHSA-N 0.000 claims 1
- QIRJQYQOIKBPBZ-IHRRRGAJSA-N Asn-Tyr-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QIRJQYQOIKBPBZ-IHRRRGAJSA-N 0.000 claims 1
- SKQTXVZTCGSRJS-SRVKXCTJSA-N Asn-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O SKQTXVZTCGSRJS-SRVKXCTJSA-N 0.000 claims 1
- YSYTWUMRHSFODC-QWRGUYRKSA-N Asn-Tyr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O YSYTWUMRHSFODC-QWRGUYRKSA-N 0.000 claims 1
- DATSKXOXPUAOLK-KKUMJFAQSA-N Asn-Tyr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DATSKXOXPUAOLK-KKUMJFAQSA-N 0.000 claims 1
- DPWDPEVGACCWTC-SRVKXCTJSA-N Asn-Tyr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O DPWDPEVGACCWTC-SRVKXCTJSA-N 0.000 claims 1
- ZAESWDKAMDVHLL-RCOVLWMOSA-N Asn-Val-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O ZAESWDKAMDVHLL-RCOVLWMOSA-N 0.000 claims 1
- JNCRAQVYJZGIOW-QSFUFRPTSA-N Asn-Val-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNCRAQVYJZGIOW-QSFUFRPTSA-N 0.000 claims 1
- LMIWYCWRJVMAIQ-NHCYSSNCSA-N Asn-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N LMIWYCWRJVMAIQ-NHCYSSNCSA-N 0.000 claims 1
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 claims 1
- WSOKZUVWBXVJHX-CIUDSAMLSA-N Asp-Arg-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O WSOKZUVWBXVJHX-CIUDSAMLSA-N 0.000 claims 1
- NYLBGYLHBDFRHL-VEVYYDQMSA-N Asp-Arg-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NYLBGYLHBDFRHL-VEVYYDQMSA-N 0.000 claims 1
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 claims 1
- QRULNKJGYQQZMW-ZLUOBGJFSA-N Asp-Asn-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QRULNKJGYQQZMW-ZLUOBGJFSA-N 0.000 claims 1
- BUVNWKQBMZLCDW-UGYAYLCHSA-N Asp-Asn-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BUVNWKQBMZLCDW-UGYAYLCHSA-N 0.000 claims 1
- XACXDSRQIXRMNS-OLHMAJIHSA-N Asp-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)O XACXDSRQIXRMNS-OLHMAJIHSA-N 0.000 claims 1
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 claims 1
- JGDBHIVECJGXJA-FXQIFTODSA-N Asp-Asp-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JGDBHIVECJGXJA-FXQIFTODSA-N 0.000 claims 1
- BFOYULZBKYOKAN-OLHMAJIHSA-N Asp-Asp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFOYULZBKYOKAN-OLHMAJIHSA-N 0.000 claims 1
- CKAJHWFHHFSCDT-WHFBIAKZSA-N Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O CKAJHWFHHFSCDT-WHFBIAKZSA-N 0.000 claims 1
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 claims 1
- IJHUZMGJRGNXIW-CIUDSAMLSA-N Asp-Glu-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IJHUZMGJRGNXIW-CIUDSAMLSA-N 0.000 claims 1
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 claims 1
- SWTQDYFZVOJVLL-KKUMJFAQSA-N Asp-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N)O SWTQDYFZVOJVLL-KKUMJFAQSA-N 0.000 claims 1
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 claims 1
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 claims 1
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 claims 1
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 claims 1
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 claims 1
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 claims 1
- VMVUDJUXJKDGNR-FXQIFTODSA-N Asp-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N VMVUDJUXJKDGNR-FXQIFTODSA-N 0.000 claims 1
- JXGJJQJHXHXJQF-CIUDSAMLSA-N Asp-Met-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O JXGJJQJHXHXJQF-CIUDSAMLSA-N 0.000 claims 1
- DJCAHYVLMSRBFR-QXEWZRGKSA-N Asp-Met-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(O)=O DJCAHYVLMSRBFR-QXEWZRGKSA-N 0.000 claims 1
- GYWQGGUCMDCUJE-DLOVCJGASA-N Asp-Phe-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O GYWQGGUCMDCUJE-DLOVCJGASA-N 0.000 claims 1
- PCJOFZYFFMBZKC-PCBIJLKTSA-N Asp-Phe-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PCJOFZYFFMBZKC-PCBIJLKTSA-N 0.000 claims 1
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 claims 1
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 claims 1
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 claims 1
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 claims 1
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 claims 1
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 claims 1
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 claims 1
- CXEFNHOVIIDHFU-IHPCNDPISA-N Asp-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC(=O)O)N CXEFNHOVIIDHFU-IHPCNDPISA-N 0.000 claims 1
- USENATHVGFXRNO-SRVKXCTJSA-N Asp-Tyr-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 USENATHVGFXRNO-SRVKXCTJSA-N 0.000 claims 1
- BPAUXFVCSYQDQX-JRQIVUDYSA-N Asp-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)O)N)O BPAUXFVCSYQDQX-JRQIVUDYSA-N 0.000 claims 1
- HTSSXFASOUSJQG-IHPCNDPISA-N Asp-Tyr-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O HTSSXFASOUSJQG-IHPCNDPISA-N 0.000 claims 1
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 claims 1
- GXIUDSXIUSTSLO-QXEWZRGKSA-N Asp-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N GXIUDSXIUSTSLO-QXEWZRGKSA-N 0.000 claims 1
- QPDUWAUSSWGJSB-NGZCFLSTSA-N Asp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N QPDUWAUSSWGJSB-NGZCFLSTSA-N 0.000 claims 1
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 claims 1
- 101100177112 Caenorhabditis elegans his-70 gene Proteins 0.000 claims 1
- 101100315624 Caenorhabditis elegans tyr-1 gene Proteins 0.000 claims 1
- 108090000565 Capsid Proteins Proteins 0.000 claims 1
- 102100023321 Ceruloplasmin Human genes 0.000 claims 1
- CVLIHKBUPSFRQP-WHFBIAKZSA-N Cys-Gly-Ala Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C)C(O)=O CVLIHKBUPSFRQP-WHFBIAKZSA-N 0.000 claims 1
- NLDWTJBJFVWBDQ-KKUMJFAQSA-N Cys-Lys-Phe Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 NLDWTJBJFVWBDQ-KKUMJFAQSA-N 0.000 claims 1
- NMWZMKLDGZXRKP-BZSNNMDCSA-N Cys-Phe-Phe Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NMWZMKLDGZXRKP-BZSNNMDCSA-N 0.000 claims 1
- HJXSYJVCMUOUNY-SRVKXCTJSA-N Cys-Ser-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N HJXSYJVCMUOUNY-SRVKXCTJSA-N 0.000 claims 1
- DRXOWZZHCSBUOI-YJRXYDGGSA-N Cys-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CS)N)O DRXOWZZHCSBUOI-YJRXYDGGSA-N 0.000 claims 1
- KFYPRIGJTICABD-XGEHTFHBSA-N Cys-Thr-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N)O KFYPRIGJTICABD-XGEHTFHBSA-N 0.000 claims 1
- ALTQTAKGRFLRLR-GUBZILKMSA-N Cys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N ALTQTAKGRFLRLR-GUBZILKMSA-N 0.000 claims 1
- 241000282326 Felis catus Species 0.000 claims 1
- 241001123946 Gaga Species 0.000 claims 1
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 claims 1
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 claims 1
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 claims 1
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 claims 1
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 claims 1
- OJGLIOXAKGFFDW-SRVKXCTJSA-N Glu-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N OJGLIOXAKGFFDW-SRVKXCTJSA-N 0.000 claims 1
- VPKBCVUDBNINAH-GARJFASQSA-N Glu-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VPKBCVUDBNINAH-GARJFASQSA-N 0.000 claims 1
- WOSRKEJQESVHGA-CIUDSAMLSA-N Glu-Arg-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O WOSRKEJQESVHGA-CIUDSAMLSA-N 0.000 claims 1
- YKLNMGJYMNPBCP-ACZMJKKPSA-N Glu-Asn-Asp Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YKLNMGJYMNPBCP-ACZMJKKPSA-N 0.000 claims 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 claims 1
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 claims 1
- RJONUNZIMUXUOI-GUBZILKMSA-N Glu-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N RJONUNZIMUXUOI-GUBZILKMSA-N 0.000 claims 1
- BUVMZWZNWMKASN-QEJZJMRPSA-N Glu-Asn-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCC(O)=O)N)C(O)=O)=CNC2=C1 BUVMZWZNWMKASN-QEJZJMRPSA-N 0.000 claims 1
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 claims 1
- PAQUJCSYVIBPLC-AVGNSLFASA-N Glu-Asp-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PAQUJCSYVIBPLC-AVGNSLFASA-N 0.000 claims 1
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 claims 1
- LSPKYLAFTPBWIL-BYPYZUCNSA-N Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(O)=O LSPKYLAFTPBWIL-BYPYZUCNSA-N 0.000 claims 1
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 claims 1
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 claims 1
- WVTIBGWZUMJBFY-GUBZILKMSA-N Glu-His-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O WVTIBGWZUMJBFY-GUBZILKMSA-N 0.000 claims 1
- SNFUTDLOCQQRQD-ZKWXMUAHSA-N Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O SNFUTDLOCQQRQD-ZKWXMUAHSA-N 0.000 claims 1
- QIQABBIDHGQXGA-ZPFDUUQYSA-N Glu-Ile-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QIQABBIDHGQXGA-ZPFDUUQYSA-N 0.000 claims 1
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 claims 1
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 claims 1
- WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 claims 1
- DWBBKNPKDHXIAC-SRVKXCTJSA-N Glu-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCC(O)=O DWBBKNPKDHXIAC-SRVKXCTJSA-N 0.000 claims 1
- BBBXWRGITSUJPB-YUMQZZPRSA-N Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O BBBXWRGITSUJPB-YUMQZZPRSA-N 0.000 claims 1
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 claims 1
- YRMZCZIRHYCNHX-RYUDHWBXSA-N Glu-Phe-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O YRMZCZIRHYCNHX-RYUDHWBXSA-N 0.000 claims 1
- ITVBKCZZLJUUHI-HTUGSXCWSA-N Glu-Phe-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ITVBKCZZLJUUHI-HTUGSXCWSA-N 0.000 claims 1
- JYXKPJVDCAWMDG-ZPFDUUQYSA-N Glu-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N JYXKPJVDCAWMDG-ZPFDUUQYSA-N 0.000 claims 1
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 claims 1
- UQHGAYSULGRWRG-WHFBIAKZSA-N Glu-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(O)=O UQHGAYSULGRWRG-WHFBIAKZSA-N 0.000 claims 1
- WIKMTDVSCUJIPJ-CIUDSAMLSA-N Glu-Ser-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WIKMTDVSCUJIPJ-CIUDSAMLSA-N 0.000 claims 1
- GTFYQOVVVJASOA-ACZMJKKPSA-N Glu-Ser-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N GTFYQOVVVJASOA-ACZMJKKPSA-N 0.000 claims 1
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 claims 1
- JWNZHMSRZXXGTM-XKBZYTNZSA-N Glu-Ser-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWNZHMSRZXXGTM-XKBZYTNZSA-N 0.000 claims 1
- TZXOPHFCAATANZ-QEJZJMRPSA-N Glu-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N TZXOPHFCAATANZ-QEJZJMRPSA-N 0.000 claims 1
- TWYSSILQABLLME-HJGDQZAQSA-N Glu-Thr-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYSSILQABLLME-HJGDQZAQSA-N 0.000 claims 1
- ZGXGVBYEJGVJMV-HJGDQZAQSA-N Glu-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O ZGXGVBYEJGVJMV-HJGDQZAQSA-N 0.000 claims 1
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 claims 1
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 claims 1
- JLCYOCDGIUZMKQ-JBACZVJFSA-N Glu-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CCC(=O)O)N JLCYOCDGIUZMKQ-JBACZVJFSA-N 0.000 claims 1
- DXMOIVCNJIJQSC-QEJZJMRPSA-N Glu-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N DXMOIVCNJIJQSC-QEJZJMRPSA-N 0.000 claims 1
- SFKMXFWWDUGXRT-NWLDYVSISA-N Glu-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N)O SFKMXFWWDUGXRT-NWLDYVSISA-N 0.000 claims 1
- HJTSRYLPAYGEEC-SIUGBPQLSA-N Glu-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N HJTSRYLPAYGEEC-SIUGBPQLSA-N 0.000 claims 1
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 claims 1
- UZWUBBRJWFTHTD-LAEOZQHASA-N Glu-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O UZWUBBRJWFTHTD-LAEOZQHASA-N 0.000 claims 1
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 claims 1
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 claims 1
- QIZJOTQTCAGKPU-KWQFWETISA-N Gly-Ala-Tyr Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 QIZJOTQTCAGKPU-KWQFWETISA-N 0.000 claims 1
- OGCIHJPYKVSMTE-YUMQZZPRSA-N Gly-Arg-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OGCIHJPYKVSMTE-YUMQZZPRSA-N 0.000 claims 1
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 claims 1
- DWUKOTKSTDWGAE-BQBZGAKWSA-N Gly-Asn-Arg Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DWUKOTKSTDWGAE-BQBZGAKWSA-N 0.000 claims 1
- CIMULJZTTOBOPN-WHFBIAKZSA-N Gly-Asn-Asn Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CIMULJZTTOBOPN-WHFBIAKZSA-N 0.000 claims 1
- DJTXYXZNNDDEOU-WHFBIAKZSA-N Gly-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)C(=O)N DJTXYXZNNDDEOU-WHFBIAKZSA-N 0.000 claims 1
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 claims 1
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 claims 1
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 claims 1
- GNBMOZPQUXTCRW-STQMWFEESA-N Gly-Asn-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)CN)C(O)=O)=CNC2=C1 GNBMOZPQUXTCRW-STQMWFEESA-N 0.000 claims 1
- LURCIJSJAKFCRO-QWRGUYRKSA-N Gly-Asn-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LURCIJSJAKFCRO-QWRGUYRKSA-N 0.000 claims 1
- SCCPDJAQCXWPTF-VKHMYHEASA-N Gly-Asp Chemical compound NCC(=O)N[C@H](C(O)=O)CC(O)=O SCCPDJAQCXWPTF-VKHMYHEASA-N 0.000 claims 1
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 claims 1
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 claims 1
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 claims 1
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 claims 1
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 claims 1
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 claims 1
- AYBKPDHHVADEDA-YUMQZZPRSA-N Gly-His-Asn Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O AYBKPDHHVADEDA-YUMQZZPRSA-N 0.000 claims 1
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 claims 1
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 claims 1
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 claims 1
- FCKPEGOCSVZPNC-WHOFXGATSA-N Gly-Ile-Phe Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FCKPEGOCSVZPNC-WHOFXGATSA-N 0.000 claims 1
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 claims 1
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 claims 1
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 claims 1
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 claims 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 claims 1
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 claims 1
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 claims 1
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 claims 1
- ZWRDOVYMQAAISL-UWVGGRQHSA-N Gly-Met-Lys Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCCN ZWRDOVYMQAAISL-UWVGGRQHSA-N 0.000 claims 1
- FJWSJWACLMTDMI-WPRPVWTQSA-N Gly-Met-Val Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O FJWSJWACLMTDMI-WPRPVWTQSA-N 0.000 claims 1
- JBCLFWXMTIKCCB-VIFPVBQESA-N Gly-Phe Chemical compound NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-VIFPVBQESA-N 0.000 claims 1
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 claims 1
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 claims 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 claims 1
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 claims 1
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 claims 1
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 claims 1
- NIOPEYHPOBWLQO-KBPBESRZSA-N Gly-Trp-Glu Chemical compound NCC(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CCC(O)=O)C(O)=O NIOPEYHPOBWLQO-KBPBESRZSA-N 0.000 claims 1
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 claims 1
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 claims 1
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 claims 1
- MJNWEIMBXKKCSF-XVYDVKMFSA-N His-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N MJNWEIMBXKKCSF-XVYDVKMFSA-N 0.000 claims 1
- VCDNHBNNPCDBKV-DLOVCJGASA-N His-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VCDNHBNNPCDBKV-DLOVCJGASA-N 0.000 claims 1
- ZIMTWPHIKZEHSE-UWVGGRQHSA-N His-Arg-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O ZIMTWPHIKZEHSE-UWVGGRQHSA-N 0.000 claims 1
- HIAHVKLTHNOENC-HGNGGELXSA-N His-Glu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HIAHVKLTHNOENC-HGNGGELXSA-N 0.000 claims 1
- NQKRILCJYCASDV-QWRGUYRKSA-N His-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 NQKRILCJYCASDV-QWRGUYRKSA-N 0.000 claims 1
- IDXZDKMBEXLFMB-HGNGGELXSA-N His-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CNC=N1 IDXZDKMBEXLFMB-HGNGGELXSA-N 0.000 claims 1
- FSOXZQBMPBQKGJ-QSFUFRPTSA-N His-Ile-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]([NH3+])CC1=CN=CN1 FSOXZQBMPBQKGJ-QSFUFRPTSA-N 0.000 claims 1
- GIRSNERMXCMDBO-GARJFASQSA-N His-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O GIRSNERMXCMDBO-GARJFASQSA-N 0.000 claims 1
- UWSMZKRTOZEGDD-CUJWVEQBSA-N His-Thr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O UWSMZKRTOZEGDD-CUJWVEQBSA-N 0.000 claims 1
- DAKSMIWQZPHRIB-BZSNNMDCSA-N His-Tyr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DAKSMIWQZPHRIB-BZSNNMDCSA-N 0.000 claims 1
- MUENHEQLLUDKSC-PMVMPFDFSA-N His-Tyr-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CNC=N1 MUENHEQLLUDKSC-PMVMPFDFSA-N 0.000 claims 1
- FFYYUUWROYYKFY-IHRRRGAJSA-N His-Val-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O FFYYUUWROYYKFY-IHRRRGAJSA-N 0.000 claims 1
- GGXUJBKENKVYNV-ULQDDVLXSA-N His-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N GGXUJBKENKVYNV-ULQDDVLXSA-N 0.000 claims 1
- DRKZDEFADVYTLU-AVGNSLFASA-N His-Val-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DRKZDEFADVYTLU-AVGNSLFASA-N 0.000 claims 1
- 241001135569 Human adenovirus 5 Species 0.000 claims 1
- 241000193096 Human adenovirus B3 Species 0.000 claims 1
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 claims 1
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 claims 1
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 claims 1
- HYXQKVOADYPQEA-CIUDSAMLSA-N Ile-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HYXQKVOADYPQEA-CIUDSAMLSA-N 0.000 claims 1
- YOTNPRLPIPHQSB-XUXIUFHCSA-N Ile-Arg-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOTNPRLPIPHQSB-XUXIUFHCSA-N 0.000 claims 1
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 claims 1
- HZMLFETXHFHGBB-UGYAYLCHSA-N Ile-Asn-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZMLFETXHFHGBB-UGYAYLCHSA-N 0.000 claims 1
- QIHJTGSVGIPHIW-QSFUFRPTSA-N Ile-Asn-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N QIHJTGSVGIPHIW-QSFUFRPTSA-N 0.000 claims 1
- UMYZBHKAVTXWIW-GMOBBJLQSA-N Ile-Asp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UMYZBHKAVTXWIW-GMOBBJLQSA-N 0.000 claims 1
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 claims 1
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 claims 1
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 claims 1
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 claims 1
- SPQWWEZBHXHUJN-KBIXCLLPSA-N Ile-Glu-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O SPQWWEZBHXHUJN-KBIXCLLPSA-N 0.000 claims 1
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 claims 1
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 claims 1
- QNBYCZTZNOVDMI-HGNGGELXSA-N Ile-His Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 QNBYCZTZNOVDMI-HGNGGELXSA-N 0.000 claims 1
- KEKTTYCXKGBAAL-VGDYDELISA-N Ile-His-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N KEKTTYCXKGBAAL-VGDYDELISA-N 0.000 claims 1
- SVBAHOMTJRFSIC-SXTJYALSSA-N Ile-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVBAHOMTJRFSIC-SXTJYALSSA-N 0.000 claims 1
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 claims 1
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 claims 1
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 claims 1
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 claims 1
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 claims 1
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 claims 1
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 claims 1
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 claims 1
- GLYJPWIRLBAIJH-FQUUOJAGSA-N Ile-Lys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N GLYJPWIRLBAIJH-FQUUOJAGSA-N 0.000 claims 1
- CKRFDMPBSWYOBT-PPCPHDFISA-N Ile-Lys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CKRFDMPBSWYOBT-PPCPHDFISA-N 0.000 claims 1
- OTSVBELRDMSPKY-PCBIJLKTSA-N Ile-Phe-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OTSVBELRDMSPKY-PCBIJLKTSA-N 0.000 claims 1
- WYUHAXJAMDTOAU-IAVJCBSLSA-N Ile-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N WYUHAXJAMDTOAU-IAVJCBSLSA-N 0.000 claims 1
- XLXPYSDGMXTTNQ-DKIMLUQUSA-N Ile-Phe-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(O)=O XLXPYSDGMXTTNQ-DKIMLUQUSA-N 0.000 claims 1
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 claims 1
- XQLGNKLSPYCRMZ-HJWJTTGWSA-N Ile-Phe-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)O)N XQLGNKLSPYCRMZ-HJWJTTGWSA-N 0.000 claims 1
- NLZVTPYXYXMCIP-XUXIUFHCSA-N Ile-Pro-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O NLZVTPYXYXMCIP-XUXIUFHCSA-N 0.000 claims 1
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 claims 1
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 claims 1
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 claims 1
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 claims 1
- DRCKHKZYDLJYFQ-YWIQKCBGSA-N Ile-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DRCKHKZYDLJYFQ-YWIQKCBGSA-N 0.000 claims 1
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 claims 1
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 claims 1
- DLEBSGAVWRPTIX-PEDHHIEDSA-N Ile-Val-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)[C@@H](C)CC DLEBSGAVWRPTIX-PEDHHIEDSA-N 0.000 claims 1
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 claims 1
- WIYDLTIBHZSPKY-HJWJTTGWSA-N Ile-Val-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WIYDLTIBHZSPKY-HJWJTTGWSA-N 0.000 claims 1
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 claims 1
- QLROSWPKSBORFJ-BQBZGAKWSA-N L-Prolyl-L-glutamic acid Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 QLROSWPKSBORFJ-BQBZGAKWSA-N 0.000 claims 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 claims 1
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 claims 1
- QPRQGENIBFLVEB-BJDJZHNGSA-N Leu-Ala-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QPRQGENIBFLVEB-BJDJZHNGSA-N 0.000 claims 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 claims 1
- QUAAUWNLWMLERT-IHRRRGAJSA-N Leu-Arg-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O QUAAUWNLWMLERT-IHRRRGAJSA-N 0.000 claims 1
- MLTRLIITQPXHBJ-BQBZGAKWSA-N Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O MLTRLIITQPXHBJ-BQBZGAKWSA-N 0.000 claims 1
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 claims 1
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 claims 1
- VIWUBXKCYJGNCL-SRVKXCTJSA-N Leu-Asn-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 VIWUBXKCYJGNCL-SRVKXCTJSA-N 0.000 claims 1
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 claims 1
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 claims 1
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 claims 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 claims 1
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 claims 1
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 claims 1
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 claims 1
- NFNVDJGXRFEYTK-YUMQZZPRSA-N Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O NFNVDJGXRFEYTK-YUMQZZPRSA-N 0.000 claims 1
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 claims 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 claims 1
- LAGPXKYZCCTSGQ-JYJNAYRXSA-N Leu-Glu-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LAGPXKYZCCTSGQ-JYJNAYRXSA-N 0.000 claims 1
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 claims 1
- LESXFEZIFXFIQR-LURJTMIESA-N Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(O)=O LESXFEZIFXFIQR-LURJTMIESA-N 0.000 claims 1
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 claims 1
- KEVYYIMVELOXCT-KBPBESRZSA-N Leu-Gly-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KEVYYIMVELOXCT-KBPBESRZSA-N 0.000 claims 1
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 claims 1
- CFZZDVMBRYFFNU-QWRGUYRKSA-N Leu-His-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O CFZZDVMBRYFFNU-QWRGUYRKSA-N 0.000 claims 1
- XQXGNBFMAXWIGI-MXAVVETBSA-N Leu-His-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 XQXGNBFMAXWIGI-MXAVVETBSA-N 0.000 claims 1
- OHZIZVWQXJPBJS-IXOXFDKPSA-N Leu-His-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OHZIZVWQXJPBJS-IXOXFDKPSA-N 0.000 claims 1
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 claims 1
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 claims 1
- QLDHBYRUNQZIJQ-DKIMLUQUSA-N Leu-Ile-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QLDHBYRUNQZIJQ-DKIMLUQUSA-N 0.000 claims 1
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 claims 1
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 claims 1
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 claims 1
- UCNNZELZXFXXJQ-BZSNNMDCSA-N Leu-Leu-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCNNZELZXFXXJQ-BZSNNMDCSA-N 0.000 claims 1
- OTXBNHIUIHNGAO-UWVGGRQHSA-N Leu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN OTXBNHIUIHNGAO-UWVGGRQHSA-N 0.000 claims 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 claims 1
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 claims 1
- BJWKOATWNQJPSK-SRVKXCTJSA-N Leu-Met-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BJWKOATWNQJPSK-SRVKXCTJSA-N 0.000 claims 1
- GNRPTBRHRRZCMA-RWMBFGLXSA-N Leu-Met-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N GNRPTBRHRRZCMA-RWMBFGLXSA-N 0.000 claims 1
- ZDBMWELMUCLUPL-QEJZJMRPSA-N Leu-Phe-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ZDBMWELMUCLUPL-QEJZJMRPSA-N 0.000 claims 1
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 claims 1
- HGUUMQWGYCVPKG-DCAQKATOSA-N Leu-Pro-Cys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N HGUUMQWGYCVPKG-DCAQKATOSA-N 0.000 claims 1
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 claims 1
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 claims 1
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 claims 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 claims 1
- LRKCBIUDWAXNEG-CSMHCCOUSA-N Leu-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRKCBIUDWAXNEG-CSMHCCOUSA-N 0.000 claims 1
- RNYLNYTYMXACRI-VFAJRCTISA-N Leu-Thr-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O RNYLNYTYMXACRI-VFAJRCTISA-N 0.000 claims 1
- HGLKOTPFWOMPOB-MEYUZBJRSA-N Leu-Thr-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HGLKOTPFWOMPOB-MEYUZBJRSA-N 0.000 claims 1
- UCRJTSIIAYHOHE-ULQDDVLXSA-N Leu-Tyr-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UCRJTSIIAYHOHE-ULQDDVLXSA-N 0.000 claims 1
- ARNIBBOXIAWUOP-MGHWNKPDSA-N Leu-Tyr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ARNIBBOXIAWUOP-MGHWNKPDSA-N 0.000 claims 1
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 claims 1
- CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 claims 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 claims 1
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 claims 1
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 claims 1
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 claims 1
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 claims 1
- NPBGTPKLVJEOBE-IUCAKERBSA-N Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N NPBGTPKLVJEOBE-IUCAKERBSA-N 0.000 claims 1
- CLBGMWIYPYAZPR-AVGNSLFASA-N Lys-Arg-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O CLBGMWIYPYAZPR-AVGNSLFASA-N 0.000 claims 1
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 claims 1
- DGWXCIORNLWGGG-CIUDSAMLSA-N Lys-Asn-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O DGWXCIORNLWGGG-CIUDSAMLSA-N 0.000 claims 1
- QUYCUALODHJQLK-CIUDSAMLSA-N Lys-Asp-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUYCUALODHJQLK-CIUDSAMLSA-N 0.000 claims 1
- YEIYAQQKADPIBJ-GARJFASQSA-N Lys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O YEIYAQQKADPIBJ-GARJFASQSA-N 0.000 claims 1
- DRCILAJNUJKAHC-SRVKXCTJSA-N Lys-Glu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DRCILAJNUJKAHC-SRVKXCTJSA-N 0.000 claims 1
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 claims 1
- WGLAORUKDGRINI-WDCWCFNPSA-N Lys-Glu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGLAORUKDGRINI-WDCWCFNPSA-N 0.000 claims 1
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 claims 1
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 claims 1
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 claims 1
- NNKLKUUGESXCBS-KBPBESRZSA-N Lys-Gly-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NNKLKUUGESXCBS-KBPBESRZSA-N 0.000 claims 1
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 claims 1
- IVFUVMSKSFSFBT-NHCYSSNCSA-N Lys-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN IVFUVMSKSFSFBT-NHCYSSNCSA-N 0.000 claims 1
- XREQQOATSMMAJP-MGHWNKPDSA-N Lys-Ile-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XREQQOATSMMAJP-MGHWNKPDSA-N 0.000 claims 1
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 claims 1
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 claims 1
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 claims 1
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 claims 1
- VUTWYNQUSJWBHO-BZSNNMDCSA-N Lys-Leu-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VUTWYNQUSJWBHO-BZSNNMDCSA-N 0.000 claims 1
- IPSDPDAOSAEWCN-RHYQMDGZSA-N Lys-Met-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IPSDPDAOSAEWCN-RHYQMDGZSA-N 0.000 claims 1
- AZOFEHCPMBRNFD-BZSNNMDCSA-N Lys-Phe-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 AZOFEHCPMBRNFD-BZSNNMDCSA-N 0.000 claims 1
- CRIODIGWCUPXKU-AVGNSLFASA-N Lys-Pro-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O CRIODIGWCUPXKU-AVGNSLFASA-N 0.000 claims 1
- YSPZCHGIWAQVKQ-AVGNSLFASA-N Lys-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN YSPZCHGIWAQVKQ-AVGNSLFASA-N 0.000 claims 1
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 claims 1
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 claims 1
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 claims 1
- GIKFNMZSGYAPEJ-HJGDQZAQSA-N Lys-Thr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O GIKFNMZSGYAPEJ-HJGDQZAQSA-N 0.000 claims 1
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 claims 1
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 claims 1
- BIWVMACFGZFIEB-VFAJRCTISA-N Lys-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCCCN)N)O BIWVMACFGZFIEB-VFAJRCTISA-N 0.000 claims 1
- SUZVLFWOCKHWET-CQDKDKBSSA-N Lys-Tyr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O SUZVLFWOCKHWET-CQDKDKBSSA-N 0.000 claims 1
- USPJSTBDIGJPFK-PMVMPFDFSA-N Lys-Tyr-Trp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O USPJSTBDIGJPFK-PMVMPFDFSA-N 0.000 claims 1
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 claims 1
- QLFAPXUXEBAWEK-NHCYSSNCSA-N Lys-Val-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QLFAPXUXEBAWEK-NHCYSSNCSA-N 0.000 claims 1
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 claims 1
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 claims 1
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 claims 1
- UASDAHIAHBRZQV-YUMQZZPRSA-N Met-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N UASDAHIAHBRZQV-YUMQZZPRSA-N 0.000 claims 1
- FRWZTWWOORIIBA-FXQIFTODSA-N Met-Asn-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FRWZTWWOORIIBA-FXQIFTODSA-N 0.000 claims 1
- UAPZLLPGGOOCRO-IHRRRGAJSA-N Met-Asn-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N UAPZLLPGGOOCRO-IHRRRGAJSA-N 0.000 claims 1
- HDNOQCZWJGGHSS-VEVYYDQMSA-N Met-Asn-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HDNOQCZWJGGHSS-VEVYYDQMSA-N 0.000 claims 1
- UROWNMBTQGGTHB-DCAQKATOSA-N Met-Leu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UROWNMBTQGGTHB-DCAQKATOSA-N 0.000 claims 1
- OSZTUONKUMCWEP-XUXIUFHCSA-N Met-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC OSZTUONKUMCWEP-XUXIUFHCSA-N 0.000 claims 1
- KMSMNUFBNCHMII-IHRRRGAJSA-N Met-Leu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN KMSMNUFBNCHMII-IHRRRGAJSA-N 0.000 claims 1
- WTHGNAAQXISJHP-AVGNSLFASA-N Met-Lys-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WTHGNAAQXISJHP-AVGNSLFASA-N 0.000 claims 1
- ZYTPOUNUXRBYGW-YUMQZZPRSA-N Met-Met Chemical compound CSCC[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CCSC ZYTPOUNUXRBYGW-YUMQZZPRSA-N 0.000 claims 1
- MIAZEQZXAFTCCG-UBHSHLNASA-N Met-Phe-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 MIAZEQZXAFTCCG-UBHSHLNASA-N 0.000 claims 1
- LQTGGXSOMDSWTQ-UNQGMJICSA-N Met-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCSC)N)O LQTGGXSOMDSWTQ-UNQGMJICSA-N 0.000 claims 1
- MQASRXPTQJJNFM-JYJNAYRXSA-N Met-Pro-Phe Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MQASRXPTQJJNFM-JYJNAYRXSA-N 0.000 claims 1
- NHXXGBXJTLRGJI-GUBZILKMSA-N Met-Pro-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O NHXXGBXJTLRGJI-GUBZILKMSA-N 0.000 claims 1
- PCTFVQATEGYHJU-FXQIFTODSA-N Met-Ser-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O PCTFVQATEGYHJU-FXQIFTODSA-N 0.000 claims 1
- FIZZULTXMVEIAA-IHRRRGAJSA-N Met-Ser-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FIZZULTXMVEIAA-IHRRRGAJSA-N 0.000 claims 1
- CIIJWIAORKTXAH-FJXKBIBVSA-N Met-Thr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O CIIJWIAORKTXAH-FJXKBIBVSA-N 0.000 claims 1
- KZKVVWBOGDKHKE-QTKMDUPCSA-N Met-Thr-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 KZKVVWBOGDKHKE-QTKMDUPCSA-N 0.000 claims 1
- IHRFZLQEQVHXFA-RHYQMDGZSA-N Met-Thr-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCCN IHRFZLQEQVHXFA-RHYQMDGZSA-N 0.000 claims 1
- YIGCDRZMZNDENK-UNQGMJICSA-N Met-Thr-Phe Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YIGCDRZMZNDENK-UNQGMJICSA-N 0.000 claims 1
- OOXVBECOTYHTCK-WDSOQIARSA-N Met-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCSC)N OOXVBECOTYHTCK-WDSOQIARSA-N 0.000 claims 1
- HOTNHEUETJELDL-BPNCWPANSA-N Met-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N HOTNHEUETJELDL-BPNCWPANSA-N 0.000 claims 1
- JHVNNUIQXOGAHI-KJEVXHAQSA-N Met-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N)O JHVNNUIQXOGAHI-KJEVXHAQSA-N 0.000 claims 1
- YGNUDKAPJARTEM-GUBZILKMSA-N Met-Val-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O YGNUDKAPJARTEM-GUBZILKMSA-N 0.000 claims 1
- 108010087066 N2-tryptophyllysine Proteins 0.000 claims 1
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 claims 1
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 claims 1
- LGBVMDMZZFYSFW-HJWJTTGWSA-N Phe-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CC=CC=C1)N LGBVMDMZZFYSFW-HJWJTTGWSA-N 0.000 claims 1
- LJUUGSWZPQOJKD-JYJNAYRXSA-N Phe-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O LJUUGSWZPQOJKD-JYJNAYRXSA-N 0.000 claims 1
- HHOOEUSPFGPZFP-QWRGUYRKSA-N Phe-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HHOOEUSPFGPZFP-QWRGUYRKSA-N 0.000 claims 1
- CDNPIRSCAFMMBE-SRVKXCTJSA-N Phe-Asn-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CDNPIRSCAFMMBE-SRVKXCTJSA-N 0.000 claims 1
- LXVFHIBXOWJTKZ-BZSNNMDCSA-N Phe-Asn-Tyr Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O LXVFHIBXOWJTKZ-BZSNNMDCSA-N 0.000 claims 1
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 claims 1
- WMGVYPPIMZPWPN-SRVKXCTJSA-N Phe-Asp-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N WMGVYPPIMZPWPN-SRVKXCTJSA-N 0.000 claims 1
- IQXOZIDWLZYYAW-IHRRRGAJSA-N Phe-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IQXOZIDWLZYYAW-IHRRRGAJSA-N 0.000 claims 1
- DHZOGDVYRQOGAC-BZSNNMDCSA-N Phe-Cys-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N DHZOGDVYRQOGAC-BZSNNMDCSA-N 0.000 claims 1
- JXWLMUIXUXLIJR-QWRGUYRKSA-N Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JXWLMUIXUXLIJR-QWRGUYRKSA-N 0.000 claims 1
- FMMIYCMOVGXZIP-AVGNSLFASA-N Phe-Glu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O FMMIYCMOVGXZIP-AVGNSLFASA-N 0.000 claims 1
- WPTYDQPGBMDUBI-QWRGUYRKSA-N Phe-Gly-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O WPTYDQPGBMDUBI-QWRGUYRKSA-N 0.000 claims 1
- HGNGAMWHGGANAU-WHOFXGATSA-N Phe-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HGNGAMWHGGANAU-WHOFXGATSA-N 0.000 claims 1
- WKTSCAXSYITIJJ-PCBIJLKTSA-N Phe-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O WKTSCAXSYITIJJ-PCBIJLKTSA-N 0.000 claims 1
- KXUZHWXENMYOHC-QEJZJMRPSA-N Phe-Leu-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUZHWXENMYOHC-QEJZJMRPSA-N 0.000 claims 1
- TXKWKTWYTIAZSV-KKUMJFAQSA-N Phe-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N TXKWKTWYTIAZSV-KKUMJFAQSA-N 0.000 claims 1
- KZRQONDKKJCAOL-DKIMLUQUSA-N Phe-Leu-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZRQONDKKJCAOL-DKIMLUQUSA-N 0.000 claims 1
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 claims 1
- DNAXXTQSTKOHFO-QEJZJMRPSA-N Phe-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DNAXXTQSTKOHFO-QEJZJMRPSA-N 0.000 claims 1
- DMEYUTSDVRCWRS-ULQDDVLXSA-N Phe-Lys-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DMEYUTSDVRCWRS-ULQDDVLXSA-N 0.000 claims 1
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 claims 1
- YVXPUUOTMVBKDO-IHRRRGAJSA-N Phe-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CS)C(=O)O YVXPUUOTMVBKDO-IHRRRGAJSA-N 0.000 claims 1
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 claims 1
- JHSRGEODDALISP-XVSYOHENSA-N Phe-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JHSRGEODDALISP-XVSYOHENSA-N 0.000 claims 1
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 claims 1
- CDHURCQGUDNBMA-UBHSHLNASA-N Phe-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDHURCQGUDNBMA-UBHSHLNASA-N 0.000 claims 1
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 claims 1
- FZHBZMDRDASUHN-NAKRPEOUSA-N Pro-Ala-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1)C(O)=O FZHBZMDRDASUHN-NAKRPEOUSA-N 0.000 claims 1
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 claims 1
- ORPZXBQTEHINPB-SRVKXCTJSA-N Pro-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H]1CCCN1)C(O)=O ORPZXBQTEHINPB-SRVKXCTJSA-N 0.000 claims 1
- TXPUNZXZDVJUJQ-LPEHRKFASA-N Pro-Asn-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O TXPUNZXZDVJUJQ-LPEHRKFASA-N 0.000 claims 1
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 claims 1
- HXOLCSYHGRNXJJ-IHRRRGAJSA-N Pro-Asp-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HXOLCSYHGRNXJJ-IHRRRGAJSA-N 0.000 claims 1
- YFNOUBWUIIJQHF-LPEHRKFASA-N Pro-Asp-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O YFNOUBWUIIJQHF-LPEHRKFASA-N 0.000 claims 1
- XJROSHJRQTXWAE-XGEHTFHBSA-N Pro-Cys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XJROSHJRQTXWAE-XGEHTFHBSA-N 0.000 claims 1
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 claims 1
- BEPSGCXDIVACBU-IUCAKERBSA-N Pro-His Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1NCCC1)C1=CN=CN1 BEPSGCXDIVACBU-IUCAKERBSA-N 0.000 claims 1
- BBFRBZYKHIKFBX-GMOBBJLQSA-N Pro-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BBFRBZYKHIKFBX-GMOBBJLQSA-N 0.000 claims 1
- FJLODLCIOJUDRG-PYJNHQTQSA-N Pro-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 FJLODLCIOJUDRG-PYJNHQTQSA-N 0.000 claims 1
- LNOWDSPAYBWJOR-PEDHHIEDSA-N Pro-Ile-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LNOWDSPAYBWJOR-PEDHHIEDSA-N 0.000 claims 1
- LXLFEIHKWGHJJB-XUXIUFHCSA-N Pro-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 LXLFEIHKWGHJJB-XUXIUFHCSA-N 0.000 claims 1
- BCNRNJWSRFDPTQ-HJWJTTGWSA-N Pro-Ile-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BCNRNJWSRFDPTQ-HJWJTTGWSA-N 0.000 claims 1
- AUQGUYPHJSMAKI-CYDGBPFRSA-N Pro-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 AUQGUYPHJSMAKI-CYDGBPFRSA-N 0.000 claims 1
- ZKQOUHVVXABNDG-IUCAKERBSA-N Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 ZKQOUHVVXABNDG-IUCAKERBSA-N 0.000 claims 1
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 claims 1
- OFGUOWQVEGTVNU-DCAQKATOSA-N Pro-Lys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OFGUOWQVEGTVNU-DCAQKATOSA-N 0.000 claims 1
- ULWBBFKQBDNGOY-RWMBFGLXSA-N Pro-Lys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N2CCC[C@@H]2C(=O)O ULWBBFKQBDNGOY-RWMBFGLXSA-N 0.000 claims 1
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 claims 1
- WFIVLLFYUZZWOD-RHYQMDGZSA-N Pro-Lys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WFIVLLFYUZZWOD-RHYQMDGZSA-N 0.000 claims 1
- WOIFYRZPIORBRY-AVGNSLFASA-N Pro-Lys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WOIFYRZPIORBRY-AVGNSLFASA-N 0.000 claims 1
- HBBBLSVBQGZKOZ-GUBZILKMSA-N Pro-Met-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O HBBBLSVBQGZKOZ-GUBZILKMSA-N 0.000 claims 1
- BLJMJZOMZRCESA-GUBZILKMSA-N Pro-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BLJMJZOMZRCESA-GUBZILKMSA-N 0.000 claims 1
- RPLMFKUKFZOTER-AVGNSLFASA-N Pro-Met-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 RPLMFKUKFZOTER-AVGNSLFASA-N 0.000 claims 1
- ZUZINZIJHJFJRN-UBHSHLNASA-N Pro-Phe-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 ZUZINZIJHJFJRN-UBHSHLNASA-N 0.000 claims 1
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 claims 1
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 claims 1
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 claims 1
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 claims 1
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 claims 1
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 claims 1
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 claims 1
- OIDKVWTWGDWMHY-RYUDHWBXSA-N Pro-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 OIDKVWTWGDWMHY-RYUDHWBXSA-N 0.000 claims 1
- DLZBBDSPTJBOOD-BPNCWPANSA-N Pro-Tyr-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O DLZBBDSPTJBOOD-BPNCWPANSA-N 0.000 claims 1
- VEUACYMXJKXALX-IHRRRGAJSA-N Pro-Tyr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VEUACYMXJKXALX-IHRRRGAJSA-N 0.000 claims 1
- QKWYXRPICJEQAJ-KJEVXHAQSA-N Pro-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@@H]2CCCN2)O QKWYXRPICJEQAJ-KJEVXHAQSA-N 0.000 claims 1
- AWJGUZSYVIVZGP-YUMQZZPRSA-N Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 AWJGUZSYVIVZGP-YUMQZZPRSA-N 0.000 claims 1
- STGVYUTZKGPRCI-GUBZILKMSA-N Pro-Val-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 STGVYUTZKGPRCI-GUBZILKMSA-N 0.000 claims 1
- OQSGBXGNAFQGGS-CYDGBPFRSA-N Pro-Val-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OQSGBXGNAFQGGS-CYDGBPFRSA-N 0.000 claims 1
- XRGIDCGRSSWCKE-SRVKXCTJSA-N Pro-Val-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O XRGIDCGRSSWCKE-SRVKXCTJSA-N 0.000 claims 1
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 claims 1
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 claims 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 claims 1
- 108010003201 RGH 0205 Proteins 0.000 claims 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 claims 1
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 claims 1
- PZZJMBYSYAKYPK-UWJYBYFXSA-N Ser-Ala-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PZZJMBYSYAKYPK-UWJYBYFXSA-N 0.000 claims 1
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 claims 1
- RZEQTVHJZCIUBT-WDSKDSINSA-N Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N RZEQTVHJZCIUBT-WDSKDSINSA-N 0.000 claims 1
- FCRMLGJMPXCAHD-FXQIFTODSA-N Ser-Arg-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O FCRMLGJMPXCAHD-FXQIFTODSA-N 0.000 claims 1
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 claims 1
- LTFSLKWFMWZEBD-IMJSIDKUSA-N Ser-Asn Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O LTFSLKWFMWZEBD-IMJSIDKUSA-N 0.000 claims 1
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 claims 1
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 claims 1
- DKKGAAJTDKHWOD-BIIVOSGPSA-N Ser-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)C(=O)O DKKGAAJTDKHWOD-BIIVOSGPSA-N 0.000 claims 1
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 claims 1
- ICHZYBVODUVUKN-SRVKXCTJSA-N Ser-Asn-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ICHZYBVODUVUKN-SRVKXCTJSA-N 0.000 claims 1
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 claims 1
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 claims 1
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 claims 1
- BRIZMMZEYSAKJX-QEJZJMRPSA-N Ser-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N BRIZMMZEYSAKJX-QEJZJMRPSA-N 0.000 claims 1
- WBINSDOPZHQPPM-AVGNSLFASA-N Ser-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)O WBINSDOPZHQPPM-AVGNSLFASA-N 0.000 claims 1
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 claims 1
- WEQAYODCJHZSJZ-KKUMJFAQSA-N Ser-His-Tyr Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 WEQAYODCJHZSJZ-KKUMJFAQSA-N 0.000 claims 1
- BXLYSRPHVMCOPS-ACZMJKKPSA-N Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CO BXLYSRPHVMCOPS-ACZMJKKPSA-N 0.000 claims 1
- DJACUBDEDBZKLQ-KBIXCLLPSA-N Ser-Ile-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O DJACUBDEDBZKLQ-KBIXCLLPSA-N 0.000 claims 1
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 claims 1
- NFDYGNFETJVMSE-BQBZGAKWSA-N Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CO NFDYGNFETJVMSE-BQBZGAKWSA-N 0.000 claims 1
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 claims 1
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 claims 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 claims 1
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 claims 1
- NNFMANHDYSVNIO-DCAQKATOSA-N Ser-Lys-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NNFMANHDYSVNIO-DCAQKATOSA-N 0.000 claims 1
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 claims 1
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 claims 1
- QJKPECIAWNNKIT-KKUMJFAQSA-N Ser-Lys-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QJKPECIAWNNKIT-KKUMJFAQSA-N 0.000 claims 1
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 claims 1
- QSHKTZVJGDVFEW-GUBZILKMSA-N Ser-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CO)N QSHKTZVJGDVFEW-GUBZILKMSA-N 0.000 claims 1
- RXSWQCATLWVDLI-XGEHTFHBSA-N Ser-Met-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RXSWQCATLWVDLI-XGEHTFHBSA-N 0.000 claims 1
- PPQRSMGDOHLTBE-UWVGGRQHSA-N Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PPQRSMGDOHLTBE-UWVGGRQHSA-N 0.000 claims 1
- JAWGSPUJAXYXJA-IHRRRGAJSA-N Ser-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=CC=C1 JAWGSPUJAXYXJA-IHRRRGAJSA-N 0.000 claims 1
- UGTZYIPOBYXWRW-SRVKXCTJSA-N Ser-Phe-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O UGTZYIPOBYXWRW-SRVKXCTJSA-N 0.000 claims 1
- XVWDJUROVRQKAE-KKUMJFAQSA-N Ser-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=CC=C1 XVWDJUROVRQKAE-KKUMJFAQSA-N 0.000 claims 1
- ZKBKUWQVDWWSRI-BZSNNMDCSA-N Ser-Phe-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKBKUWQVDWWSRI-BZSNNMDCSA-N 0.000 claims 1
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 claims 1
- GZGFSPWOMUKKCV-NAKRPEOUSA-N Ser-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO GZGFSPWOMUKKCV-NAKRPEOUSA-N 0.000 claims 1
- OVQZAFXWIWNYKA-GUBZILKMSA-N Ser-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CO)N OVQZAFXWIWNYKA-GUBZILKMSA-N 0.000 claims 1
- DINQYZRMXGWWTG-GUBZILKMSA-N Ser-Pro-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DINQYZRMXGWWTG-GUBZILKMSA-N 0.000 claims 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 claims 1
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 claims 1
- AABIBDJHSKIMJK-FXQIFTODSA-N Ser-Ser-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O AABIBDJHSKIMJK-FXQIFTODSA-N 0.000 claims 1
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 claims 1
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 claims 1
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 claims 1
- STIAINRLUUKYKM-WFBYXXMGSA-N Ser-Trp-Ala Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CO)=CNC2=C1 STIAINRLUUKYKM-WFBYXXMGSA-N 0.000 claims 1
- YXGCIEUDOHILKR-IHRRRGAJSA-N Ser-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CO)N YXGCIEUDOHILKR-IHRRRGAJSA-N 0.000 claims 1
- ILVGMCVCQBJPSH-WDSKDSINSA-N Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CO ILVGMCVCQBJPSH-WDSKDSINSA-N 0.000 claims 1
- 240000002033 Tacca leontopetaloides Species 0.000 claims 1
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 claims 1
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 claims 1
- HYLXOQURIOCKIH-VQVTYTSYSA-N Thr-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N HYLXOQURIOCKIH-VQVTYTSYSA-N 0.000 claims 1
- VIBXMCZWVUOZLA-OLHMAJIHSA-N Thr-Asn-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VIBXMCZWVUOZLA-OLHMAJIHSA-N 0.000 claims 1
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 claims 1
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 claims 1
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 claims 1
- JVTHIXKSVYEWNI-JRQIVUDYSA-N Thr-Asn-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JVTHIXKSVYEWNI-JRQIVUDYSA-N 0.000 claims 1
- IOWJRKAVLALBQB-IWGUZYHVSA-N Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O IOWJRKAVLALBQB-IWGUZYHVSA-N 0.000 claims 1
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 claims 1
- UZJDBCHMIQXLOQ-HEIBUPTGSA-N Thr-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O UZJDBCHMIQXLOQ-HEIBUPTGSA-N 0.000 claims 1
- FHDLKMFZKRUQCE-HJGDQZAQSA-N Thr-Glu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHDLKMFZKRUQCE-HJGDQZAQSA-N 0.000 claims 1
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 claims 1
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 claims 1
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 claims 1
- SXAGUVRFGJSFKC-ZEILLAHLSA-N Thr-His-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SXAGUVRFGJSFKC-ZEILLAHLSA-N 0.000 claims 1
- WPAKPLPGQNUXGN-OSUNSFLBSA-N Thr-Ile-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WPAKPLPGQNUXGN-OSUNSFLBSA-N 0.000 claims 1
- LCCSEJSPBWKBNT-OSUNSFLBSA-N Thr-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N LCCSEJSPBWKBNT-OSUNSFLBSA-N 0.000 claims 1
- BQBCIBCLXBKYHW-CSMHCCOUSA-N Thr-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@@H]([NH3+])[C@@H](C)O BQBCIBCLXBKYHW-CSMHCCOUSA-N 0.000 claims 1
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 claims 1
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 claims 1
- IJVNLNRVDUTWDD-MEYUZBJRSA-N Thr-Leu-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IJVNLNRVDUTWDD-MEYUZBJRSA-N 0.000 claims 1
- UUSQVWOVUYMLJA-PPCPHDFISA-N Thr-Lys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UUSQVWOVUYMLJA-PPCPHDFISA-N 0.000 claims 1
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 claims 1
- QNCFWHZVRNXAKW-OEAJRASXSA-N Thr-Lys-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNCFWHZVRNXAKW-OEAJRASXSA-N 0.000 claims 1
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 claims 1
- PCMDGXKXVMBIFP-VEVYYDQMSA-N Thr-Met-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMDGXKXVMBIFP-VEVYYDQMSA-N 0.000 claims 1
- MCDVZTRGHNXTGK-HJGDQZAQSA-N Thr-Met-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O MCDVZTRGHNXTGK-HJGDQZAQSA-N 0.000 claims 1
- XNTVWRJTUIOGQO-RHYQMDGZSA-N Thr-Met-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNTVWRJTUIOGQO-RHYQMDGZSA-N 0.000 claims 1
- SIEZEMFJLYRUMK-YTWAJWBKSA-N Thr-Met-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N)O SIEZEMFJLYRUMK-YTWAJWBKSA-N 0.000 claims 1
- KPNSNVTUVKSBFL-ZJDVBMNYSA-N Thr-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KPNSNVTUVKSBFL-ZJDVBMNYSA-N 0.000 claims 1
- IQHUITKNHOKGFC-MIMYLULJSA-N Thr-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IQHUITKNHOKGFC-MIMYLULJSA-N 0.000 claims 1
- NZRUWPIYECBYRK-HTUGSXCWSA-N Thr-Phe-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O NZRUWPIYECBYRK-HTUGSXCWSA-N 0.000 claims 1
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 claims 1
- HSQXHRIRJSFDOH-URLPEUOOSA-N Thr-Phe-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HSQXHRIRJSFDOH-URLPEUOOSA-N 0.000 claims 1
- WNQJTLATMXYSEL-OEAJRASXSA-N Thr-Phe-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WNQJTLATMXYSEL-OEAJRASXSA-N 0.000 claims 1
- BDYBHQWMHYDRKJ-UNQGMJICSA-N Thr-Phe-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)O)N)O BDYBHQWMHYDRKJ-UNQGMJICSA-N 0.000 claims 1
- VEIKMWOMUYMMMK-FCLVOEFKSA-N Thr-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VEIKMWOMUYMMMK-FCLVOEFKSA-N 0.000 claims 1
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 claims 1
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 claims 1
- BDENGIGFTNYZSJ-RCWTZXSCSA-N Thr-Pro-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O BDENGIGFTNYZSJ-RCWTZXSCSA-N 0.000 claims 1
- YGZWVPBHYABGLT-KJEVXHAQSA-N Thr-Pro-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YGZWVPBHYABGLT-KJEVXHAQSA-N 0.000 claims 1
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 claims 1
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 claims 1
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 claims 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 claims 1
- KHTIUAKJRUIEMA-HOUAVDHOSA-N Thr-Trp-Asp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O)=CNC2=C1 KHTIUAKJRUIEMA-HOUAVDHOSA-N 0.000 claims 1
- IJKNKFJZOJCKRR-GBALPHGKSA-N Thr-Trp-Ser Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 IJKNKFJZOJCKRR-GBALPHGKSA-N 0.000 claims 1
- WCRFXRIWBFRZBR-GGVZMXCHSA-N Thr-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WCRFXRIWBFRZBR-GGVZMXCHSA-N 0.000 claims 1
- OMRWDMWXRWTQIU-YJRXYDGGSA-N Thr-Tyr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CS)C(=O)O)N)O OMRWDMWXRWTQIU-YJRXYDGGSA-N 0.000 claims 1
- RPECVQBNONKZAT-WZLNRYEVSA-N Thr-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H]([C@@H](C)O)N RPECVQBNONKZAT-WZLNRYEVSA-N 0.000 claims 1
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 claims 1
- CKHWEVXPLJBEOZ-VQVTYTSYSA-N Thr-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@@H]([NH3+])[C@@H](C)O CKHWEVXPLJBEOZ-VQVTYTSYSA-N 0.000 claims 1
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 claims 1
- CURFABYITJVKEW-QTKMDUPCSA-N Thr-Val-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O CURFABYITJVKEW-QTKMDUPCSA-N 0.000 claims 1
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 claims 1
- BDWDMRSGCXEDMR-WFBYXXMGSA-N Trp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N BDWDMRSGCXEDMR-WFBYXXMGSA-N 0.000 claims 1
- TWJDQTTXXZDJKV-BPUTZDHNSA-N Trp-Arg-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O TWJDQTTXXZDJKV-BPUTZDHNSA-N 0.000 claims 1
- IUFQHOCOKQIOMC-XIRDDKMYSA-N Trp-Asn-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N IUFQHOCOKQIOMC-XIRDDKMYSA-N 0.000 claims 1
- PXQPYPMSLBQHJJ-WFBYXXMGSA-N Trp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N PXQPYPMSLBQHJJ-WFBYXXMGSA-N 0.000 claims 1
- IQGJAHMZWBTRIF-UBHSHLNASA-N Trp-Asp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N IQGJAHMZWBTRIF-UBHSHLNASA-N 0.000 claims 1
- DEZKIRSBKKXUEV-NYVOZVTQSA-N Trp-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)N DEZKIRSBKKXUEV-NYVOZVTQSA-N 0.000 claims 1
- NXJZCPKZIKTYLX-XEGUGMAKSA-N Trp-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NXJZCPKZIKTYLX-XEGUGMAKSA-N 0.000 claims 1
- OENGVSDBQHHGBU-QEJZJMRPSA-N Trp-Glu-Asn Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OENGVSDBQHHGBU-QEJZJMRPSA-N 0.000 claims 1
- OGZRZMJASKKMJZ-XIRDDKMYSA-N Trp-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N OGZRZMJASKKMJZ-XIRDDKMYSA-N 0.000 claims 1
- RWAYYYOZMHMEGD-XIRDDKMYSA-N Trp-Leu-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 RWAYYYOZMHMEGD-XIRDDKMYSA-N 0.000 claims 1
- KRCPXGSWDOGHAM-XIRDDKMYSA-N Trp-Lys-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O KRCPXGSWDOGHAM-XIRDDKMYSA-N 0.000 claims 1
- YTVJTXJTNRWJCR-JBACZVJFSA-N Trp-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N YTVJTXJTNRWJCR-JBACZVJFSA-N 0.000 claims 1
- XDQGKIMTRSVSBC-WDSOQIARSA-N Trp-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CNC2=CC=CC=C12 XDQGKIMTRSVSBC-WDSOQIARSA-N 0.000 claims 1
- YBRHKUNWEYBZGT-WLTAIBSBSA-N Trp-Thr Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(O)=O)=CNC2=C1 YBRHKUNWEYBZGT-WLTAIBSBSA-N 0.000 claims 1
- HHPSUFUXXBOFQY-AQZXSJQPSA-N Trp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O HHPSUFUXXBOFQY-AQZXSJQPSA-N 0.000 claims 1
- DDHFMBDACJYSKW-AQZXSJQPSA-N Trp-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O DDHFMBDACJYSKW-AQZXSJQPSA-N 0.000 claims 1
- WBZOZLNLXVBCNW-LTHWPDAASA-N Trp-Thr-Ile Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)[C@@H](C)O)=CNC2=C1 WBZOZLNLXVBCNW-LTHWPDAASA-N 0.000 claims 1
- OOEUVMFKKZYSRX-LEWSCRJBSA-N Tyr-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OOEUVMFKKZYSRX-LEWSCRJBSA-N 0.000 claims 1
- CYDVHRFXDMDMGX-KKUMJFAQSA-N Tyr-Asn-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O CYDVHRFXDMDMGX-KKUMJFAQSA-N 0.000 claims 1
- VTFWAGGJDRSQFG-MELADBBJSA-N Tyr-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O VTFWAGGJDRSQFG-MELADBBJSA-N 0.000 claims 1
- JWHOIHCOHMZSAR-QWRGUYRKSA-N Tyr-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JWHOIHCOHMZSAR-QWRGUYRKSA-N 0.000 claims 1
- VFJIWSJKZJTQII-SRVKXCTJSA-N Tyr-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VFJIWSJKZJTQII-SRVKXCTJSA-N 0.000 claims 1
- TZXFLDNBYYGLKA-BZSNNMDCSA-N Tyr-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 TZXFLDNBYYGLKA-BZSNNMDCSA-N 0.000 claims 1
- NQJDICVXXIMMMB-XDTLVQLUSA-N Tyr-Glu-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O NQJDICVXXIMMMB-XDTLVQLUSA-N 0.000 claims 1
- LOOCQRRBKZTPKO-AVGNSLFASA-N Tyr-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LOOCQRRBKZTPKO-AVGNSLFASA-N 0.000 claims 1
- WVRUKYLYMFGKAN-IHRRRGAJSA-N Tyr-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 WVRUKYLYMFGKAN-IHRRRGAJSA-N 0.000 claims 1
- HVHJYXDXRIWELT-RYUDHWBXSA-N Tyr-Glu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O HVHJYXDXRIWELT-RYUDHWBXSA-N 0.000 claims 1
- NMKJPMCEKQHRPD-IRXDYDNUSA-N Tyr-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NMKJPMCEKQHRPD-IRXDYDNUSA-N 0.000 claims 1
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 claims 1
- AUEJLPRZGVVDNU-STQMWFEESA-N Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-STQMWFEESA-N 0.000 claims 1
- AVIQBBOOTZENLH-KKUMJFAQSA-N Tyr-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N AVIQBBOOTZENLH-KKUMJFAQSA-N 0.000 claims 1
- VTCKHZJKWQENKX-KBPBESRZSA-N Tyr-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O VTCKHZJKWQENKX-KBPBESRZSA-N 0.000 claims 1
- OFHKXNKJXURPSY-ULQDDVLXSA-N Tyr-Met-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O OFHKXNKJXURPSY-ULQDDVLXSA-N 0.000 claims 1
- AUZADXNWQMBZOO-JYJNAYRXSA-N Tyr-Pro-Arg Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 AUZADXNWQMBZOO-JYJNAYRXSA-N 0.000 claims 1
- GQVZBMROTPEPIF-SRVKXCTJSA-N Tyr-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GQVZBMROTPEPIF-SRVKXCTJSA-N 0.000 claims 1
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 claims 1
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 claims 1
- NHOVZGFNTGMYMI-KKUMJFAQSA-N Tyr-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NHOVZGFNTGMYMI-KKUMJFAQSA-N 0.000 claims 1
- UUBKSZNKJUJQEJ-JRQIVUDYSA-N Tyr-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O UUBKSZNKJUJQEJ-JRQIVUDYSA-N 0.000 claims 1
- ZZDYJFVIKVSUFA-WLTAIBSBSA-N Tyr-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ZZDYJFVIKVSUFA-WLTAIBSBSA-N 0.000 claims 1
- CLEGSEJVGBYZBJ-MEYUZBJRSA-N Tyr-Thr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CLEGSEJVGBYZBJ-MEYUZBJRSA-N 0.000 claims 1
- AKKYBQGHUAWPJR-MNSWYVGCSA-N Tyr-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)O AKKYBQGHUAWPJR-MNSWYVGCSA-N 0.000 claims 1
- QRCBQDPRKMYTMB-IHPCNDPISA-N Tyr-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N QRCBQDPRKMYTMB-IHPCNDPISA-N 0.000 claims 1
- RGJZPXFZIUUQDN-BPNCWPANSA-N Tyr-Val-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O RGJZPXFZIUUQDN-BPNCWPANSA-N 0.000 claims 1
- AEOFMCAKYIQQFY-YDHLFZDLSA-N Tyr-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AEOFMCAKYIQQFY-YDHLFZDLSA-N 0.000 claims 1
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 claims 1
- HZWPGKAKGYJWCI-ULQDDVLXSA-N Tyr-Val-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O HZWPGKAKGYJWCI-ULQDDVLXSA-N 0.000 claims 1
- RVGVIWNHABGIFH-IHRRRGAJSA-N Tyr-Val-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O RVGVIWNHABGIFH-IHRRRGAJSA-N 0.000 claims 1
- 108010064997 VPY tripeptide Proteins 0.000 claims 1
- REJBPZVUHYNMEN-LSJOCFKGSA-N Val-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N REJBPZVUHYNMEN-LSJOCFKGSA-N 0.000 claims 1
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 claims 1
- CVUDMNSZAIZFAE-TUAOUCFPSA-N Val-Arg-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N CVUDMNSZAIZFAE-TUAOUCFPSA-N 0.000 claims 1
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 claims 1
- AUMNPAUHKUNHHN-BYULHYEWSA-N Val-Asn-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N AUMNPAUHKUNHHN-BYULHYEWSA-N 0.000 claims 1
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 claims 1
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 claims 1
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 claims 1
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 claims 1
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 claims 1
- SRWWRLKBEJZFPW-IHRRRGAJSA-N Val-Cys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N SRWWRLKBEJZFPW-IHRRRGAJSA-N 0.000 claims 1
- UPJONISHZRADBH-XPUUQOCRSA-N Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O UPJONISHZRADBH-XPUUQOCRSA-N 0.000 claims 1
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 claims 1
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 claims 1
- WDIGUPHXPBMODF-UMNHJUIQSA-N Val-Glu-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N WDIGUPHXPBMODF-UMNHJUIQSA-N 0.000 claims 1
- YTPLVNUZZOBFFC-SCZZXKLOSA-N Val-Gly-Pro Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N1CCC[C@@H]1C(O)=O YTPLVNUZZOBFFC-SCZZXKLOSA-N 0.000 claims 1
- BNQVUHQWZGTIBX-IUCAKERBSA-N Val-His Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CC1=CN=CN1 BNQVUHQWZGTIBX-IUCAKERBSA-N 0.000 claims 1
- FEFZWCSXEMVSPO-LSJOCFKGSA-N Val-His-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](C)C(O)=O FEFZWCSXEMVSPO-LSJOCFKGSA-N 0.000 claims 1
- PNVLWFYAPWAQMU-CIUDSAMLSA-N Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)C(C)C PNVLWFYAPWAQMU-CIUDSAMLSA-N 0.000 claims 1
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 claims 1
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 claims 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 claims 1
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 claims 1
- XPKCFQZDQGVJCX-RHYQMDGZSA-N Val-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N)O XPKCFQZDQGVJCX-RHYQMDGZSA-N 0.000 claims 1
- MBGFDZDWMDLXHQ-GUBZILKMSA-N Val-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MBGFDZDWMDLXHQ-GUBZILKMSA-N 0.000 claims 1
- OJOMXGVLFKYDKP-QXEWZRGKSA-N Val-Met-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OJOMXGVLFKYDKP-QXEWZRGKSA-N 0.000 claims 1
- ZXYPHBKIZLAQTL-QXEWZRGKSA-N Val-Pro-Asp Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZXYPHBKIZLAQTL-QXEWZRGKSA-N 0.000 claims 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 claims 1
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 claims 1
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 claims 1
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 claims 1
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 claims 1
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 claims 1
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 claims 1
- PFMSJVIPEZMKSC-DZKIICNBSA-N Val-Tyr-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PFMSJVIPEZMKSC-DZKIICNBSA-N 0.000 claims 1
- PDASTHRLDFOZMG-JYJNAYRXSA-N Val-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 PDASTHRLDFOZMG-JYJNAYRXSA-N 0.000 claims 1
- OWFGFHQMSBTKLX-UFYCRDLUSA-N Val-Tyr-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N OWFGFHQMSBTKLX-UFYCRDLUSA-N 0.000 claims 1
- WBPFYNYTYASCQP-CYDGBPFRSA-N Val-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N WBPFYNYTYASCQP-CYDGBPFRSA-N 0.000 claims 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 claims 1
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 claims 1
- ZVDPYSVOZFINEE-BQBZGAKWSA-N alpha-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(O)=O ZVDPYSVOZFINEE-BQBZGAKWSA-N 0.000 claims 1
- 239000000427 antigen Substances 0.000 claims 1
- 102000036639 antigens Human genes 0.000 claims 1
- 108091007433 antigens Proteins 0.000 claims 1
- 108010057412 arginyl-glycyl-aspartyl-phenylalanine Proteins 0.000 claims 1
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 claims 1
- 108010068380 arginylarginine Proteins 0.000 claims 1
- 210000004899 c-terminal region Anatomy 0.000 claims 1
- 239000003795 chemical substances by application Substances 0.000 claims 1
- 230000000295 complement effect Effects 0.000 claims 1
- 239000003814 drug Substances 0.000 claims 1
- 239000003937 drug carrier Substances 0.000 claims 1
- 101150110946 gatC gene Proteins 0.000 claims 1
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 claims 1
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 claims 1
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 claims 1
- 108010078326 glycyl-glycyl-valine Proteins 0.000 claims 1
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 claims 1
- 230000028993 immune response Effects 0.000 claims 1
- 239000012678 infectious agent Substances 0.000 claims 1
- 208000015181 infectious disease Diseases 0.000 claims 1
- 230000002458 infectious effect Effects 0.000 claims 1
- 108010073093 leucyl-glycyl-glycyl-glycine Proteins 0.000 claims 1
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 claims 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 claims 1
- 108010091871 leucylmethionine Proteins 0.000 claims 1
- 238000004519 manufacturing process Methods 0.000 claims 1
- 108700023046 methionyl-leucyl-phenylalanine Proteins 0.000 claims 1
- 108020004707 nucleic acids Proteins 0.000 claims 1
- 150000007523 nucleic acids Chemical class 0.000 claims 1
- 102000039446 nucleic acids Human genes 0.000 claims 1
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 claims 1
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 claims 1
- 108091035539 telomere Proteins 0.000 claims 1
- 102000055501 telomere Human genes 0.000 claims 1
- 108700004896 tripeptide FEG Proteins 0.000 claims 1
- 230000001105 regulatory effect Effects 0.000 abstract 1
Classifications
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/12—Viral antigens
- A61K39/235—Adenoviridae
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/12—Antivirals
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P35/00—Antineoplastic agents
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P37/00—Drugs for immunological or allergic disorders
- A61P37/02—Immunomodulators
- A61P37/04—Immunostimulants
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P43/00—Drugs for specific purposes, not provided for in groups A61P1/00-A61P41/00
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
- C12N15/86—Viral vectors
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N7/00—Viruses; Bacteriophages; Compositions thereof; Preparation or purification thereof
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/51—Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
- A61K2039/525—Virus
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/51—Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
- A61K2039/525—Virus
- A61K2039/5256—Virus expressing foreign proteins
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/57—Medicinal preparations containing antigens or antibodies characterised by the type of response, e.g. Th1, Th2
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/10011—Adenoviridae
- C12N2710/10022—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/10011—Adenoviridae
- C12N2710/10311—Mastadenovirus, e.g. human or simian adenoviruses
- C12N2710/10321—Viruses as such, e.g. new isolates, mutants or their genomic sequences
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/10011—Adenoviridae
- C12N2710/10311—Mastadenovirus, e.g. human or simian adenoviruses
- C12N2710/10322—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/10011—Adenoviridae
- C12N2710/10311—Mastadenovirus, e.g. human or simian adenoviruses
- C12N2710/10341—Use of virus, viral particle or viral elements as a vector
- C12N2710/10343—Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/10011—Adenoviridae
- C12N2710/10311—Mastadenovirus, e.g. human or simian adenoviruses
- C12N2710/10361—Methods of inactivation or attenuation
- C12N2710/10362—Methods of inactivation or attenuation by genetic engineering
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/20011—Papillomaviridae
- C12N2710/20022—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2740/00—Reverse transcribing RNA viruses
- C12N2740/00011—Details
- C12N2740/10011—Retroviridae
- C12N2740/16011—Human Immunodeficiency Virus, HIV
- C12N2740/16111—Human Immunodeficiency Virus, HIV concerning HIV env
- C12N2740/16122—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2740/00—Reverse transcribing RNA viruses
- C12N2740/00011—Details
- C12N2740/10011—Retroviridae
- C12N2740/17011—Spumavirus, e.g. chimpanzee foamy virus
- C12N2740/17022—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2750/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssDNA viruses
- C12N2750/00011—Details
- C12N2750/14011—Parvoviridae
- C12N2750/14111—Dependovirus, e.g. adenoassociated viruses
- C12N2750/14122—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2760/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses negative-sense
- C12N2760/00011—Details
- C12N2760/14011—Filoviridae
- C12N2760/14111—Ebolavirus, e.g. Zaire ebolavirus
- C12N2760/14134—Use of virus or viral component as vaccine, e.g. live-attenuated or inactivated virus, VLP, viral protein
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2760/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses negative-sense
- C12N2760/00011—Details
- C12N2760/16011—Orthomyxoviridae
- C12N2760/16111—Influenzavirus A, i.e. influenza A virus
- C12N2760/16134—Use of virus or viral component as vaccine, e.g. live-attenuated or inactivated virus, VLP, viral protein
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2760/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses negative-sense
- C12N2760/00011—Details
- C12N2760/20011—Rhabdoviridae
- C12N2760/20111—Lyssavirus, e.g. rabies virus
- C12N2760/20122—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2830/00—Vector systems having a special element relevant for transcription
- C12N2830/001—Vector systems having a special element relevant for transcription controllable enhancer/promoter combination
- C12N2830/002—Vector systems having a special element relevant for transcription controllable enhancer/promoter combination inducible enhancer/promoter combination, e.g. hypoxia, iron, transcription factor
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2830/00—Vector systems having a special element relevant for transcription
- C12N2830/55—Vector systems having a special element relevant for transcription from bacteria
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Medicinal Chemistry (AREA)
- Biomedical Technology (AREA)
- Virology (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- Immunology (AREA)
- Veterinary Medicine (AREA)
- Public Health (AREA)
- Animal Behavior & Ethology (AREA)
- Pharmacology & Pharmacy (AREA)
- Biophysics (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Gastroenterology & Hepatology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Communicable Diseases (AREA)
- Oncology (AREA)
- Epidemiology (AREA)
- Mycology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Medicines Containing Material From Animals Or Micro-Organisms (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Peptides Or Proteins (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
Abstract
Det beskrives en rekombinant vektor omfattende simianadenovirussekvenser og heterologt gen under kontroll av regulatoriske sekvenser. Det beskrives videre en cellelinje som uttrykker ett eller flere simianadenovirusgener. Det beskrives fremgangsmåter for anvendelse av vektorene og cellelinjene.
Description
OPPFINNELSENS BAKGRUNN
Adenovirus er et dobbeltrådet DNA-virus med genomstørrelse på rundt 36 kilobaser (kb), som har funnet utstrakt anvendelse for genoverføringsapplikasjoner pga. dets evne til å oppnå meget effektiv genoverføring i en varietet av mål vev, og har stor transgenkapasitet. Konvensjonelt blir El-gener av adenovirus deletert og erstattet med en transgenkassett bestående av den valgte promoter, cDNA-sekvensen fra genet av interesse og et poly-A-signal, som resulterer i et replikasjonsdefekt, rekombinant virus. Adenovirus har en karakteristisk morfologi med et icosahedralkapsid bestående av tre hovedproteiner, hexon (IT), pentonbaser (HI) og en knoppet (knobbed) fiber (IV), sammen med et antall andre mindre proteiner, VI, VUt, IX, Illa og IVa2 [W. C. Russel, J. Gen. Virol, F81: 2573-2604 (Nov. 2000)]. Virusgenomet er et lineært, dobbeltrådet DNA med et terminalprotein festet kovalent ved 5' ende, som har inverterte terminalrepetisjoner (ITR). Virus-DNA er intimt assosiert med det sterkt basiske protein-VII og et lite peptid kalt mu. Et annet protein, V, er pakket med dette DNA-proteinkompleks og gir en strukturell forbindelse til kapsidet via protein-VI. Viruset inneholder også en viruskodet protease, som er nødvendig for prosessering av noen av de strukturelle proteiner for å gi modent, infeksiøst virus.
Rekombinante adenovirus er beskrevet for avlevering til molekyler hos vertsceller. Se U.S. patent nr. 6 083 716, som beskriver genomet av to sjimpanse-adenovirus.
Hva som behøves i teknikken, er mer effektive vektorer som unngår effekten av foreksisterende immunitet mot valgte adenovirusserotyper i populasjonen, og/eller som er brukbare for gjentatt administrering og for titerforsterkning ved den andre vaksinasjonen, om dette er nødvendig.
Oppsummering av oppfinnelsen
Foreliggende oppfinnelse omfatter rekombinante adenovirus, isolert vertscelle som omfatter nevnte rekombinante andenovirus, sammensetning omfattende det rekombinante adenovirus samt en anvendelse.
Foreliggende oppfinnelse tilveiebringer de isolerte nukleinsyresekvenser og aminosyresekvenser av seks simianadenovirus, vektorer som inneholder disse sekvenser og cellelinjer som uttrykker simian-adenovirusgener. Tilveiebrakt er også et antall fremgangsmåter for anvendelse av disse vektorene og celler ifølge foreliggende oppfinnelse.
Fremgangsmåtene ifølge oppfinnelsen involverer avlevering av ett eller flere utvalgte heterologe gener til en pattedyrpasient ved tilførsel av en vektor ifølge oppfinnelsen. Fordi de forskjellige vektorkonstruksjoner er avledet fra simian-adenovirus og ikke fra humant adenovirus, blir immunsystemet hos den ikke-simian humane- eller veterinærpasient ikke primet til å respondere umiddelbart på vektoren som et fremmed antigen. Anvendelsen av preparatene ifølge oppfinnelsen tillater således mer stabil ekspresjon av det valgte transgen, når dette tilføres til en ikke-simian pasient. Ifølge oppfinnelsen tillater anvendelsen av preparatene for vaksinasjon, presentasjon av et utvalgt antigen for å frembringe beskyttende immunresponser. Uten ønske om å være bundet av noen teori, er adenovirusenes evner ifølge oppfinnelsen til å transdusere humandendrittiske celler i det minste delvis ansvarlig for evnen hos de rekombinante konstruksjoner ifølge oppfinnelsen til å indusere en immunrespons. De rekombinante simianadenovirus ifølge oppfinnelsen kan også anvendes for produksjon av heterologe genprodukter in vitro. Slike genprodukter er i seg selv anvendelige i et antall formål, som beskrevet her.
Disse og andre utførelsesformer og fordeler ved oppfinnelsen er beskrevet mer detaljert nedenfor.
Kort beskrivelse av figurene
Figur 1 viser en oppstilling av aminosyresekvensene av LI - og en del av L2-løkkene i kapsidproteinhexon av sjimpanseadenovirus Cl [SEKV. ID nr. 13], sjimpanseadenovirus C68 (Pan9) [SEKV. ID nr. 14], og de nye Pan5 [SEKV. ID nr. 15], Pan6 [SEKV. ID nr. 16] ogPan7 [SEKV. ID nr. 17] sjimpanseadenovirus-sekvensene ifølge oppfinnelsen. Det intervenerende bevarte området er en del av pidestall-doménet som er bevart mellom adenovirusserotypene. Figur 2 viser en oppstilling av aminosyresekvensene av fiberknoppdoménene av sjimpanse-C68 (Pan-9) [SEKV. ID nr. 18], Pan6 [SEKV. ID nr. 19], Pan7 [SEKV. ID nr. 20] ogPan5 [SEKV. ID nr. 21], og de humane adenovirusserotypene 2 [SEKV. ID nr. 22] og 5 [SEKV. ID nr. 23].
DETALJERT BESKRIVELSE AV OPPFINNELSEN
Oppfinnelsen tilveiebringer nye nukleinsyre- og aminosyresekvenser fra Ad-Pan5 [SEKV. ID nr. 1-4, 15 og 21], Ad-Pan6 [SEKV. ID nr. 5-8, 16, 19], og Ad-serotype
Pan7 [SEKV. ID nr. 9-12, 17, 20], som opprinnelig ble isolert fra sjimpanselymfeknuter. I flere tilfeller gjennom beskrivelsen er disse adenovirus alternativt angitt som C5, C6 henholdsvis Cl. Det tilveiebringes videre sekvenser fra adenovirus SV-1 [SEKV. ID nr. 24-28], som opprinnelig ble isolert fra cynomolgusapenyreceller. Oppfinnelsen tilveiebringer også sekvenser fra adenovirus SV-25 [SEKV. ID nr. 29-33] og SV-39 [SEKV. ID nr. 34-37], som opprinnelig ble isolert fra rhesusapenyreceller.
Den foreliggende oppfinnelse tilveiebringer nye adenovirusvektorer og pakking av cellelinjer for tilveiebringelse av disse vektorer for anvendelse ved in vzZro-produksjon av rekombinante proteiner eller fragmenter eller andre reagenser. Oppfinnelsen tilveiebringer videre preparater for anvendelse ved avlevering av et heterologt molekyl for terapeutisk- eller vaksineformål. Slike terapeutiske eller vaksinepreparater inneholder de adenovirale vektorer som bærer et innskutt, heterologt molekyl. I tillegg er de nye sekvensene ifølge oppfinnelsen anvendelige ved tilveiebringelse av de essensielle hjelperfunksjoner nødvendige for fremstilling av rekombinante adenoassosierte virale (AAV) vektorer. Oppfinnelsen tilveiebringer således hjelperkonstruksjoner, fremgangsmåter og cellelinjer som anvender disse sekvenser i slike produksjonsfremgangsmåter.
Uttrykket "vesentlig homologi" eller "vesentlig likhet" anvendt under henvisning til en nukleinsyre eller et fragment derav, indikerer at det er nukleotidsekvensidentitet i det minste i omkring 95 % til 99 % av de oppstilte sekvensene, når optimalt oppstilt med egnete nukleotidinnskudd eller -delesjoner med en annen nukleinsyre (eller dens komplementære tråd).
Uttrykket "vesentlig homologi" eller "vesentlig likhet" under henvisning til aminosyrer eller fragmenter derav, indikerer at det er nukleotidsekvensidentitet i det minste i omkring 95 % til 99 % av de oppstilte sekvensene, når optimalt oppstilt med egnete aminosyreinnskudd eller -delesjoner med en annen aminosyre (eller dens komplementære tråd). Fortrinnsvis er homologien over fullengdesekvens eller et protein derav, eller et fragment derav som minst er 8 aminosyrer, eller mer ønskelig minst 15
aminosyrer i lengde. Eksempler på egnete fragmenter er beskrevet heri.
Uttrykket "prosent sekvensidentitet" eller "identisk" innenfor konteksten nukleinsyresekvenser, henviser til restene i de to sekvensene som er de samme når de oppstilles for maksimal overensstemmelse. Sekvenslengdeidentitetsammenlikning kan være over hele genomets lengde (for eksempel omkring 36 kbp), hele lengden av en åpen leseramme av et gen, protein, subenhet eller enzym (se for eksempel tabellene som viser de adenovirale kodende regioner), eller et fragment på minst 500 til 5000 nukleotider, om ønskelig. Imidlertid kan identitet blant mindre fragmenter, for eksempel på minst omkring 9 nukleotider, vanligvis på minst omkring 20 til 24 nukleotider, minst omkring 28 til 32 nukleotider, minst omkring 36 eller flere nukleotider, kan også være ønskelig. På tilsvarende måte kan "prosent sekvensidentitet" lett bestemmes for aminosyresekvenser over hele proteinlengden eller et fragment derav. Hensiktsmessig er et fragment minst omkring 8 aminosyrer i lengde og kan være opptil omkring 700
aminosyrer. Eksempler på egnete fragmenter er beskrevet heri.
Identitet bestemmes enkelt ved anvendelse av slike algoritmer og dataprogrammer som er definert heri ved standard parameterinnstillinger. Fortrinnsvis er en slik identitet over hele lengden av proteinet, enzymet, subenheten eller over et fragment på minst 8 aminosyrer i lengde. Imidlertid kan identiteten være basert på kortere områder, der dette er hensiktsmessig for anvendelsen av de identiske genprodukter.
Som beskrevet heri, gjennomføres oppstillinger ved anvendelse av et antall offentlige eller kommersielt tilgjengelige "Multiple Sequence Alignment Programs", som "Clustal W", tilgjengelig via web-servere på internett. Alternativt benyttes også vektor NTI-elementer. Det foreligger også et antall algoritmer som er kjente i teknikken og som kan anvendes for måling av nukleotidsekvensidentitet, inkludert de som er som finnes i programmene beskrevet ovenfor. Som et annet eksempel kan polynukleotidsekvenser sammenliknes ved anvendelse av FASTA, et program i GCG-versjon 6.1. FASTA tilveiebringer oppstillinger og prosentidentitet for områdene som gir best overlapping mellom sekvensene som finnes og de sekvenser som ønskes bestemt. For eksempel kan prosent sekvensidentitet mellom nukleinsyresekvens bestemmes ved anvendelse av FASTA med dennes standard parametre (en ordstørrelse på 6 og NOPAM-faktoren for bedømmelsesmatrisen) som er tilveiebrakt i GCG-versjon 6.1, herved inkorporert ved referanse. Tilsvarende programmer er tilgjengelige for gjennomføring aminosyreoppstillinger. Generelt er disse programmene benyttet ved standardinnstillinger, selv om fagmannen på området kan endre disse etter behov.
Alternativt kan fagmannen på området benytte en annen algoritme, eller et annet dataprogram som gir i det minste det nivå av identitet eller oppstilling som tilveiebringes heri ved de nevnte algoritmer eller programmer.
Som benyttet ifølge foreliggende oppfinnelses beskrivelse og krav, skal uttrykket "omfatte" og varianten "omfatter" og "omfattende", blant andre varianter inkludere andre komponenter, elementer, integere, trinn og liknende. Uttrykket "består av" eller "bestående av" utelukker andre komponenter, elementer, integere, trinn og liknende.
I. Simian- adenovirussekvenser
Foreliggende oppfinnelse tilveiebringer nukleinsyresekvenser og aminosyresekvenser av Pan5, Pan6, Pan7, SVI, SV25 og SV39, som er isolerte fra det andre virale materialet de er assosiert med i naturen.
A. Nukleinsyresekvenser
Pan5 nukleinsyresekvensene ifølge oppfinnelsen inkluderer nukleotidene 1 til 36462 av SEKV. ID nr. 1. Pan6 nukleinsyresekvensene ifølge oppfinnelsen inkluderer nukleotidene 1 til 36604 av SEKV. ID nr. 5. Pan7 nukleinsyresekvensene ifølge oppfinnelsen inkluderer nukleotidene 1 til 36535 av SEKV. ID nr. 9. SV1-nukleinsyresekvensene ifølge oppfinnelsen inkluderer nukleotidene 1 til 34264 av SEKV. ID nr. 24. SV25-nukleinsyresekvensene ifølge oppfinnelsen inkluderer nukleotidene 1 til 31044 av SEKV. ID nr. 29. SV39-nukleinsyresekvensene ifølge oppfinnelsen inkluderer nukleotidene 1 til 34115 av SEKV. ID nr. 34. Se sekvensopplistingen, som er inkorporert ved referanse heri.
Nukleinsyresekvensene ifølge oppfinnelsen omfatter videre tråden som er komplementær til sekvensene i SEKV. ID nr. 5, 9, 24, 29 og 34, så vel som de korresponderende RNA- og cDNA-sekvensene til sekvensene av disse sekvenstall og deres komplementære tråder. Videre inkludert i foreliggende oppfinnelse er nukleinsyresekvenser som er mer enn 95 % til 98 %, og mer foretrukket omkring 99 % til 99,9 % homologe eller identiske til sekvensopplistingen. Også inkludert i nukleinsyresekvensene ifølge oppfinnelsen er naturlige varianter og konstruerte modifikasjoner av sekvensene tilveiebrakt i SEKV. ID nr. 5, 9, 24, 29 og 34, og deres komplementære tråder. Slike modifikasjoner inkluderer for eksempel merkelapper kjent innen teknikken, metylering og substituering av én eller flere av de naturlig forekommende nukleotider med et degenerert nukleotid.
Oppfinnelsen omfatter videre sekvensfragmenter av Pan5, Pan6, Pan7, SVI, SV25 og SV39, deres komplementære tråd, cDNA og RNA som er komplementære dertil. Egnete fragmenter har en lengde på minst 15 nukleotider og omfatter tradisjonelle fragmenter, dvs. fragmenter som er av biologisk interesse. For eksempel kan et funksjonelt fragment uttrykke et ønsket, adenoviralt produkt eller kan være anvendelig i fremstilling av rekombinante, virale vektorer. Slike fragmenter inkluderer gensekvensene og fragmentene som er opplistet i tabellen nedenfor.
De påfølgende tabellene gir transkriptområdene og åpne leserammer i simian-adenovirussekvensene ifølge oppfinnelsen. For visse gener er transkriptene og de åpne leserammene (ORFer) lokalisert på tråden som er komplementær til den presentert i SEKV. ID nr. 5, 9, 24, 29 og 34. Se for eksempel E2b, E4 og E2a. De kalkulerte molekylvektene for de kodete proteiner er også vist. Merk at den åpne leserammen Ela Pan5 [nt 576-1436 av SEKV. ID nr. 1], pan6 [nt 576 til 1437 av SEKV. ID nr. 5] og Pan7 [nt 576 til 1437 av SEKV. ID nr. 9] inneholder interne spleiseseter. Disse spleisesetene er angitt i de påfølgende tabellene.
Pan5-, Pan6-, Pan7-, SVI-, SV25- og SV39-adenovirale nukleinsyresekvenser er nyttige som terapeutiske midler og ved konstruksjon av et antall vektorsystemer og vertsceller. Som anvendt heri inkluderer en vektor ethvert egnet nukleinsyremolekyl, inkluderende nakent DNA, et plasmid, et virus, et kosmid eller en episom. Disse sekvenser og produkter kan anvendes alene eller i kombinasjon med andre adenovirale sekvenser eller fragmenter, eller i kombinasjon med elementer fra andre adenovirale eller ikke-adenovirale sekvenser. De adenovirale sekvenser ifølge oppfinnelsen er også nyttige som antisens-avleveringsvektorer, genterapivektorer eller vaksinevektorer. Oppfinnelsen tilveiebringer således i tillegg nukleinsyremolekyler,
genavleveringsvektorer og vertsceller inneholdende Ad-sekvensene ifølge oppfinnelsen.
For eksempel omfatter oppfinnelsen et nukleinsyremolekyl inneholdende simian-Ad-ITR-sekvenser ifølge oppfinnelsen. I et annet eksempel tilveiebringer oppfinnelsen et nukleinsyremolekyl inneholdende simian-Ad-sekvenser ifølge oppfinnelsen, som koder et ønsket Ad-genprodukt. Ytterligere andre nukleinsyremolekyler som konstrueres ved anvendelse av sekvensene ifølge oppfinnelsen, vil være åpenbare for fagmannen i lys av de her gitte informasjoner.
I en utførelsesform kan simian-Ad-genområdene som identifiseres her, benyttes i et antall vektorer for avlevering av et heterologt molekyl til en celle. For eksempel genereres det vektorer for ekspresjon av et adenoviralt kapsidprotein (eller et fragment derav) for generering av en viralvektor i en pakket vertscelle. Slike vektorer kan konstrueres for ekspresjon in trans. Alternativt konstrueres slike vektorer for å gi celler som stabilt inneholder sekvenser som uttrykker ønskete adenovirale funksjoner, for eksempel én eller flere av Ela, Elb, de terminalrepeterende sekvenser, E2a, E2b, E4, E40RF6-området.
I tillegg er de adenovirale gensekvenser og fragmenter derav nyttige for å tilveiebringe de hjelperfunksjoner nødvendige for fremstilling av hjelperavhengige virus (for eksempel adenovirale vektorer der vesentlige funksjoner eller adenoassosierte virus (AAV)) er fjernet. For slike produksjonsfremgangsmåter benyttes simian-adenovirale sekvenser ifølge oppfinnelsen med en fremgangsmåte tilsvarende det som er beskrevet for humant Ad. På grunn av forskjeller i sekvensene mellom de simian-adenovirale sekvensene ifølge oppfinnelsen og de av humant Ad, eliminerer imidlertid bruken av sekvensene ifølge oppfinnelsen den vesentlige muligheten for homolog rekombinering med hjelperfunksjoner i vertscelle som bærer humant Ad El-funksjoner, for eksempel 293-celler, som kan produsere infeksiøs, adenoviral forurensing under rAAV-produksjon.
Fremgangsmåter for produksjon av rAAV ved anvendelse av adenovirale hjelperfunksjoner er utstrakt beskrevet i litteraturen med humane adenovirale serotyper. Se for eksempel U.S. patent nr. 6 258 595 og referansene sitert deri. Se også U.S. patent nr. 5 871 982; WO 99/14354; WO 99/15685; WO 99/47691. Disse fremgangsmåtene kan også anvendes ved fremstilling av ikke-humant AAV, inkludert ikke-humane primat AAV-serotyper. Simian-adenovirale gensekvenser ifølge oppfinnelsen som tilveiebringer de nødvendige hjelperfunksjoner (for eksempel Ela, Elb, E2a og/eller E40RF6) kan være særlig nyttig for å gi den nødvendige adenovirale funksjon, mens muligheten for rekombinasjon med et hvilket som helst annet adenovirus til stede i rAAV-pakkende celle, som typisk er av human opprinnelse, minimaliseres eller elimineres. Valgte gener eller åpne leserammer av de adenovirale sekvenser ifølge oppfinnelsen kan således benyttes i disse rAAV-
produksj onsfremgangsmåter.
Alternativt kan rekombinante adenovirale simian-vektorer ifølge oppfinnelsen anvendes i disse fremgangsmåtene. Slike rekombinante adenovirale simian-vektorer kan for eksempel inkludere et hybrid sjimp Ad/AAV der sjimp Ad-sekvensene flankerer en rAAV-ekspresjonskassett bestående for eksempel av AAV 3' og/eller 5' ITRer og et transgen under kontroll av regulatoriske sekvenser som kontrollerer dens ekspresjon. Fagmannen på området vil erkjenne at også andre simian-adenovirale vektorer og/eller gensekvenser ifølge oppfinnelsen vil være nyttige for fremstilling av rAAV og andre virus, avhengig av adenoviral hjelper.
I nok en annen utførelsesform konstrueres nukleinsyremolekyler for avlevering og ekspresjon av valgte adenovirale genprodukter i en vertscelle for å oppnå en ønsket fysiologisk effekt. For eksempel kan et nukleinsyremolekyl inneholdende sekvenser som koder et adenovirus Ela-protein ifølge oppfinnelsen, avleveres til et individ for anvendelse som cancerterapeutikum. Eventuelt formuleres et slikt molekyl i en lipidbasebærer og har fortrinnsvis cancerceller som mål. En slik formulering kan kombineres med andre cancerterapeutika (for eksempel cisplatin, taxol eller liknende). Ytterligere andre anvendelser for adenovirale sekvenser som er gitt her, vil være åpenbare for fagmannen.
I tillegg vil fagmannen lett forstå at Ad-sekvensene ifølge oppfinnelsen lett kan tilpasses for anvendelse i et antall virale og ikke-virale vektorsystemer for in vitro-, ex vivo- eller in vzvo-avlevering av terapeutiske og immunogene molekyler. For eksempel kan Pan5-, Pan6-, Pan7-, SVI-, SV25- og/eller SV39-simian-Ad-genomene ifølge oppfinnelsen kan anvendes i et antall rAd- og ikke-rAd-vektorsystemer. Slike vektorsystemer kan for eksempel inkludere plasmider, lentivirus, retrovirus, poxvirus, vaksinevirus og adenoassosierte virale systemer, blant andre. Valg av disse vektorsystemene er ingen begrensning for foreliggende oppfinnelse.
Oppfinnelsen tilveiebringer videre molekyler nyttige for produksjon av simian-proteiner og simian-avledete proteiner ifølge oppfinnelsen. Slike molekyler som bærer polynukleotider som inkluderer simian-Ad-DNA-sekvensene ifølge oppfinnelsen, kan foreligge i form av nakent DNA, et plasmid, et virus eller et hvilket som helst annet genetisk element.
B. Simian-adenovirale proteiner ifølge foreliggende oppfinnelse
Foreliggende oppfinnelse tilveiebringer videre genprodukter av de ovenfor beskrevne adenovirus som proteiner, enzymer og fragmenter derav, og som kodes av de adenovirale nukleinsyrer ifølge oppfinnelsen. Oppfinnelsen omfatter videre Pan5-, Pan6-, Pan7-, SVI-, SV25- og/eller SV39-proteiner, enzymer og fragmenter derav, med den aminosyresekvens som kodes av disse nukleinsyresekvenser som er generert ved andre fremgangsmåter. Slike proteiner inkluderer de som er kodet av den åpne leseramme som er identifisert i tabellene ovenfor, i figurene 1 og 2, og fragmentene derav.
I et aspekt tilveiebringer oppfinnelsen således unike simian-adenovirale proteiner som i det vesentlige er rene, dvs. frie for andre virale eller proteinøse proteiner. Fortrinnsvis er disse proteiner minst 10 % homogene, mer foretrukket 60 % homogene, og mest foretrukket 95 % homogene.
I en utførelsesform tilveiebringer oppfinnelsen unike simian-avledete kapsidproteiner. Som anvendt heri inkluderer et simian-avledet kapsidprotein et hvilket som helst adenoviralt kapsidprotein inneholdende et Pan5-, Pan6-, Pan7-, SVI-, SV25- eller SV39-kapsidprotein eller et fragment derav, som definert ovenfor, uten begrensning omfattende kimære kapsidproteiner, fusjonsproteiner, kunstige kapsidproteiner, syntetiske kapsidproteiner og rekombinerte kapsidproteiner, uten begrensning av midler for å generere disse produktene.
Hensiktsmessig inneholder disse simian-avledete kapsidproteiner ett eller flere Pan5-, Pan6-, Pan7-, SVI-, SV25- eller SV39-områder eller fragmenter derav (for eksempel hexon, penton, fiber eller et fragment derav) i kombinasjon med kapsidområder eller fragmenter derav av forskjellige adenovirale serotyper, eller modifiserte simian-kapsidproteiner eller fragmenter, som beskrevet heri. En "modifikasjon av et kapsidprotein forbundet med endret tropisme" som anvendt heri, inkluderer et endret kapsidprotein, for eksempel et penton, et hexon eller et fiberproteinområde, eller et fragment derav som knoppdoméne av fiberområdet, eller et polynukleotid som koder dette, slik at spesifisiteten endres. Det simian-avledete kapsidet kan konstrueres med én eller flere simian-Ad'ene ifølge oppfinnelsen, eller andre Ad-serotyper som kan være av human eller ikke-human opprinnelse. Slik Ad kan oppnås fra et antall kilder inkludert ATCC, kommersielle og akademiske kilder, eller Ad-sekvensene kan oppnås fra GenBank eller andre hensiktsmessige kilder.
Aminosyresekvensene av simian-adenoviruspentonproteinene ifølge oppfinnelsen tilveiebringes også. AdPan5-pentonproteinet tilveiebringes i SEKV. ID nr. 2. AdPan7-pentonproteinet tilveiebringes i SEKV. ID nr. 6. AdPan6-pentonproteinet tilveiebringes i SEKV. ID nr. 10. SVI-pentonproteinet tilveiebringes i SEKV. ID nr. 23. SV25-pentonproteinet tilveiebringes i SEKV. ID nr. 30. SV39-pentonproteinet tilveiebringes i SEKV. ID nr. 35. Ethvert av disse pentonproteinene eller unike fragmenter derav, kan hensiktsmessig anvendes for et antall formål. Eksempler på egnete fragmenter inkluderer pentonet som har N-terminal- og/eller C-terminalavkortninger på omkring 50, 100, 150 eller 200 aminosyrer, basert på aminosyrenummereringen tilveiebrakt ovenfor, og i SEKV. ID nr. 2, SEKV. ID nr. 6, SEKV. ID nr. 25, SEKV. ID nr. 30 eller SEKV. ID nr. 35. Andre hensiktsmessige fragmenter inkluderer kortere indre C- eller N-terminale fragmenter. Pentonproteinet kan videre modifiseres for en rekke forskjellige formål kjent for dem øvet i faget.
Oppfinnelsen tilveiebringer videre aminosyresekvenser av hexonproteinet av Pan5 [SEKV. ID nr. 3], Pan6 [SEKV. ID nr. 7], Pan7 [SEKV. ID nr. 11], SVI [SEKV. ID nr. 26], SV25 [SEKV. ID nr. 31] og/eller SV39 [SEKV. ID nr. 36]. Dette hexonprotein,
eller unike fragmenter derav, kan hensiktsmessig anvendes for en rekke forskjellige
formål. Eksempler på egnete fragmenter inkluderer hexoner med N-terminale og/eller C-terminale avkortninger på omkring 50, 100, 150, 200, 300, 400 eller 500 aminosyrer, basert på aminosyrenummereringen tilveiebrakt ovenfor, og i SEKV. ID nr. 3, 7, 11, 26, 31 og 36. Andre hensiktsmessige fragmenter inkluderer kortere indre C- eller N-terminale fragmenter. For eksempel er et egnet fragment løkkeområdet (doménet) av hexonproteinet, angitt som DEI og FG1 eller et hypervariabelt område derav. Slike fragmenter inkluderer områdene som overspenner aminosyrerestene omkring 125 til 443; omkring 138 til 441, eller mindre fragmenter, slik som de som spenner over restene 138 til rest 163; omkring 170 til omkring 176; omkring 195 til omkring 203; omkring 233 til omkring 246; omkring 253 til omkring 264; omkring 287 til omkring 297; og omkring 404 til omkring 430 av simian-hexonproteinene, under henvisning til SEKV. ID nr. 3,7, 11, 26, 31 eller 36. Andre egnete fragmenter kan lett identifiseres av fagmannen. Hexonproteinet kan videre være modifisert for et antall formål, som velkjent for fagmannen. På grunn av at hexonproteinet er determinant for en adenovirusserotype, vil slike kunstige hexonproteiner resultere i adenovirus med kunstige serotyper. Andre kunstige kapsidproteiner kan også konstrueres ved anvendelse av sjimp-Ad-pentonsekvenser og/eller fibersekvenser ifølge oppfinnelsen og/eller fragmenter derav.
I et eksempel kan det være ønskelig å generere et adenovirus med endret hexonprotein ved anvendelse av en hexonproteinsekvens ifølge oppfinnelsen. En egnet fremgangsmåte for å endre hexonproteiner er beskrevet i U.S. patent nr. 5 922 315, som det henvises til vedrørende detaljer. I denne fremgangsmåten blir minst ett adenovirushexonløkkeområde byttet ut med minst ett løkkeområde av en annen adenovirusserotype. Ved minst ett løkkeområde av et slikt endret adenovirus, er således hexonproteinet et simian-Ad-hexonløkkeområde ifølge oppfinnelsen (for eksempel Pan7). I en utførelsesform erstattes et løkkeområde av Pan7 hexonproteinet med et løkkeområde fra en annen adenovirusserotype. I nok en utførelsesform benyttes løkkeområdet av Pan7-hexoner til å erstatte et løkkeområde fra en annen adenovirusserotype. Egnete adenovirusserotyper kan lett velges blant humane og ikke-humane serotyper, som beskrevet her. Pan7 velges kun som illustrasjon; andre simian-Ad-hexonproteiner ifølge oppfinnelsen kan endres på tilsvarende måte eller benyttes for å endre et annet Ad-hexon. Valget av en egnet serotype er ingen begrensning for oppfinnelsen. Ytterligere andre anvendelser for hexonproteinsekvensene ifølge oppfinnelsen vil lett fremgå for fagmannen.
Oppfinnelsen omfatter videre fiberproteinene av simian-adenovirus ifølge oppfinnelsen. Fiberproteinet av Ad-Pan5 har aminosyresekvens SEKV. ID nr. 4. Fiberproteinet Ad-Pan6 har aminosyresekvens SEKV. ID nr. 8. Fiberproteinet Ad-Pan7 har aminosyresekvens SEKV. ID nr. 12. SV-1 har to fiberproteiner; fiber 2 har aminosyresekvens SEKV. ID nr. 27 og fiber 1 har aminosyresekvens SEKV. ID nr. 28. SV-25 har også to fiberproteiner; fiber 2 har aminosyresekvens SEKV. ID nr. 32, og fiber 1 har aminosyresekvens SEKV. ID nr. 33. Fiberproteinet av SV-39 har aminosyresekvens SEKV. ID nr. 37.
Dette fiberproteinet, eller unike fragmenter derav, kan hensiktsmessig anvendes for en rekke forskjellige formål. Et egnet fragment er fiberknoppen som spenner over aminosyrene 247 til 425 av SEKV. ID nr. 4, 8, 12, 28, 32, 33 og 37. Se figur 2.
Eksempler på andre egnete fragmenter inkluderer fiberen med N-terminale og/eller C-terminale avkortninger på omkring 50, 100, 150 eller 200 aminosyrer, basert på aminosyrenummereringen tilveiebrakt ovenfor, og i SEKV. ID nr. 4, 8, 12, 28, 32, 33 og 37. Andre hensiktsmessige fragmenter inkluderer indre fragmenter. Fiberproteinet kan være modifisert ved anvendelse av et antall teknikker velkjente for fagmannen.
Oppfinnelsen omfatter videre unike fragmenter av proteinene ifølge oppfinnelsen, som er minst 8 aminosyrer lange. Fragmenter med andre ønskete lengder kan imidlertid lett benyttes. I tillegg omfatter oppfinnelsen slike modifikasjoner som kan innføres for å øke utbyttet og/eller ekspresjonen av etPan5-, Pan6-, Pan7-, SVI-, SV25- eller SV39-genprodukt, for eksempel konstruksjon av et fusjonsmolekyl der hele eller et fragment av Pan5-, Pan6-, Pan7-, SVI-, SV25- eller SV39-genproduktet er fusert (enten direkte eller via en linker) til en fusjonspartner for å øke utbyttet. Andre egnete modifikasjoner inkluderer, uten begrensning, avkortning av et kodende område (for eksempel et protein eller enzym) for å eliminere et pre- eller pro-protein som vanligvis spaltes, og for å gi det modne protein eller enzym og/eller mutasjon av et kodende område for å gi et utskillbart genprodukt. Ytterligere andre modifikasjoner vil lett fremgå for fagmannen. Oppfinnelsen omfatter videre proteiner med minst 95 % til 99 % identitet med de her tilveiebrakte Pan5-, Pan6-, Pan7-, SVI-, SV25- eller SV39-proteiner.
Som beskrevet heri, er vektorer ifølge oppfinnelsen inneholdende de adenovirale kapsidproteiner ifølge oppfinnelsen, spesielt godt egnet for anvendelse der de nøytraliserende antistoffer reduserer effektiviteten for andre Ad-serotypebaserte vektorer, så vel som andre virale vektorer. rAd-vektorene ifølge oppfinnelsen er spesielt fordelaktige ved readministrering for gjentatt genterapi eller for forsterking av immunresponsen (vaksinetitere).
Under visse omstendigheter kan det være ønskelig å benytte ett eller flere av Pan5-, Pan6-, Pan7-, SVI-, SV25- og/eller SV39-genproduktene (for eksempel et kapsidprotein eller et fragment derav) for å generere et antistoff. Uttrykket "antistoff' slik det benyttes her, henviser til et immunglobulinmolekyl som spesifikt er stand til å binde til en epitop. Antistoffene ifølge oppfinnelsen binder således fortrinnsvis spesifikt og uten kryssreaktivitet til en Pan5-, Pan6-, Pan7-, SVI-, SV25- og/eller SV39-epitop. Antistoffene ifølge oppfinnelsen foreligger i et antall former inkludert for eksempel høyaffinitetspolyklonale antistoffer, monoklonale antistoffer, syntetiske antistoffer, kimære antistoffer, rekombinante antistoffer og humaniserte antistoffer. Slike antistoffer stammer fra immunglobulinklassene IgG, IgM, IgA, IgD og IgE.
Slike antistoffer kan genereres ved anvendelse av et antall fremgangsmåter som er velkjente i teknikken. Egnete antistoffer kan genereres ved velkjente, konvensjonelle teknikker, for eksempel ved Kohler og Milstein, og mange kjente modifikasjoner derav. Tilsvarende ønskelige høytiterantistoffer genereres ved anvendelse av kjente rekombinante teknikker på de monoklonale eller polyklonale antistoffer som er avledet fra disse antigener (se for eksempel PCT patent søknad nr. PCT/GB85/00392; britisk patentsøknadspublikasjonnr. GB2188638A; Amit et al, 1986 Science, 233:747-753; Queen et al, 1989 Proe. Nati. Acad. Sei. USA, 86:10029-10033; WO 90/07861; og Riechmann et al, Nature, 332:323-327 (1988); Huse et al, 1988a Science, 246:1275-1281). Alternativt kan antistoffer fremstilles ved å manipulere de komplementaritets-bestemmende områder av animalske eller humane antistoffer mot antigenet ifølge oppfinnelsen. (Se for eksempel E. Mark og Padlin, Humanization ofMonoclonal Antibodies, kapitel 4, The Handbook of Experimental Pharmacology, bind 113, The Pharmacology ofMonoclonal Antibodies, Springer-Verlag (juni 1994); Harlow et al,
1999, Using Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, NY; Harlow et al, 1989, Antibodies: A Laboratory Manual, Cold Spring Harbor, New York; Houston et al, 1988, Proe. Nati. Acad. Sei. USA 85:5879-5883; og Bird et al, 1988, Science 242:423-426. Videre tilveiebringer foreliggende oppfinnelse anti-idiotype antistoffer (Ab2) og anti-anti-idiotype antistoffer (Ab3). Se for eksempel M. Wettendorff et al., Modulation of anti-tumor immunity by anti-idiotypic antibodies. In Idiotypic Network and Diseases, red. ved J. Cerny og J. Hiernaux, 1990
J. Am. Soc. Microbiol, Washington DC: s. 203-229). Disse anti-idiotype- og anti-anti-idiotype-antistoffer fremstilles ved bruk av teknikker velkjente for fagmannen. Disse antistoffer kan benyttes for et antall formål, inkludert diagnostiske og kliniske fremgangsmåter og sett. Under visse omstendigheter kan det være ønskelig å innføre en påvisbar merkelapp eller en tag på et Pan5-, Pan6-, Pan7-, SVI-, SV25- eller SV39-genprodukt, -antistoff eller en annen konstruksjon ifølge oppfinnelsen. Som benyttet heri er en påvisbar merkelapp et molekyl i stand til, alene eller i interaksjon med andre molekyler, å gi et påvisbart signal. Helst er merkelappen visuelt påvisbar, for eksempel ved fluorescens, for lett anvendelse i immunohistokjemisk analyse eller immunfluorescensmikroskopi. For eksempel inkluderer hensiktsmessige merkelapper fluorescein isotiocyanat (FITC), fycoerytrin (PE), allofycocyanin (APC), corifosfin-0
(CPO) eller tandemfargestoffer, PE-cyanin-5 (PC5), og PE-Texasrødt (ECD). Alle disse fluorescente fargestoffer er kommersielt tilgjengelige, og deres anvendelser er velkjente i teknikken. Andre nyttige merkelapper inkluderer en kolloid gullmerkelapp. Ytterligere andre nyttige merkelapper inkluderer radioaktive forbindelser eller elementer. I tillegg
inkluderer merkelapper et antall enzymsystemer som virker ved frigivelse av et kollorimetrisk signal i en analyse, for eksempel glukoseoksidase (som benytter glukose som substrat) frigir peroksid som et produkt som i nærvær av peroksidase og en hydrogendonor som tetrametylbenzidin (TMB), gir et oksidert TMB som ses som blå farge. Andre eksempler inkluderer pepperotperoksidase (HRP) eller alkalisk fosfatase (AP), og hexokinase i forbindelse med glukose-6-fosfatdehydrogenase som reagerer med ATP, glukose og NAD<+>, og gir blant andre produkter, NADH som påvises som øket absorbans ved 340 nm bølgelengde.
Andre merkelappsystemer som anvendes i fremgangsmåtene ifølge oppfinnelsen, kan påvises på annen måte, ved for eksempel farget lateks mikropartikler (Bang Laboratories, Indiana) hvori et innleiret fargestoff benyttes i stedet for enzymer for å gi konjugater med mål sekvensen, og gir et visuelt signal som indikerer nærvær av det resulterende kompleks i anvendelige analyser.
Fremgangsmåter for kobling eller assosiering av merkelappen med et ønsket molekyl er også konvensjonelle, og velkjente for fagmannen. Kjente fremgangsmåter for merkelappfesting er beskrevet. (Se for eksempel Handbook of Fluorescent probes and Research Chemicals, 6. utg., R. P. M. Haugland, Molecular Probes, Inc., Eugene, OR, 1996; Pierce Catalog and Handbook, Life Science and Analytical Research Products, Pierce Chemicals Company, Rockford, IL, 1994/1995). Valget av merkelapp og koblingsfremgangsmåte er således ingen begrensning for foreliggende oppfinnelse. Sekvensene, proteinene og fragmentene ifølge oppfinnelsen kan fremstilles på en hvilken som helst hensiktsmessig måte, inkludert rekombinant produksjon, kjemisk syntese eller andre syntetiske midler. Egnete produksjonsteknikker er velkjente for fagmannen. Se for eksempel Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press, Cold Spring Harbor, NY. Alternativt kan peptider også syntetiseres ved den velkjente fastfasepeptidsyntesefremgangsmåten, (Merrifield, J. Am. Chem. Soc, 85:2149 (1969), s. 27-62). Disse og andre hensiktsmessige produksjonsfremgangsmåter ligger innenfor fagmannens kunnskapsområde, og er ingen begrensning for oppfinnelsen.
I tillegg vil fagmannen på området lett forstå at Ad-sekvensene ifølge oppfinnelsen lett kan tilpasses for bruk i et antall virale eller ikke-virale vektorsystemer for in vitro-, ex vivo- eller in vzvo-avlevering av terapeutiske og immunogene molekyler. I en utførelsesform benyttes for eksempel simian-Ad-kapsidproteinene og andre simian-adenovirusproteiner, som beskrevet heri, til ikke-viral, proteinbasert avlevering av gener, proteiner og andre ønskelige diagnostiske, terapeutiske og immunogene molekyler. I en slik utførelsesform er et protein ifølge oppfinnelsen direkte eller indirekte forbundet med et molekyl målrettet for celler med en reseptor for adenovirus. For slik målretting velges fortrinnsvis et kapsidprotein, slik som et hexon, penton, en fiber eller et fragment derav ved en ligand for celleoverflatereseptorer. Egnete molekyler for avlevering velges blant terapeutiske molekyler og deres genprodukter, som beskrevet heri. Et utvalg av linkere inkludert lipid, polyLys og liknende kan benyttes som linkere. For eksempel kan simian-pentonprotein lett benyttes for et slikt formål ved fremstilling av et fusjonsprotein ved anvendelse av simian-pentonsekvensen på en måte analog til det som er beskrevet av Medina-Kauwe LK, et al, Gene Ther., des. 2001; 8(23):1753-1761. Alternativt kan aminosyresekvensene av simian-Ad-protein-DC benyttes for målrettete vektorer på en celleflatereseptor, som beskrevet i U.S. søknadsnummer 20 010 047 081. Egnete Ugandere inkluderer et CD40-antigen, en RGD-inneholdende eller polylysininneholdende sekvens, og liknende. Ytterligere andre simian-Ad-proteiner inkluderer for eksempel hexonproteinet og/eller fiberproteinet, kan anvendes for disse og tilsvarende formål.
Ytterligere andre adenovirale proteiner ifølge oppfinnelsen kan benyttes alene eller i kombinasjon med andre adenovirale proteiner for et antall formål som lett vil være åpenbare for fagmannen. I tillegg kan ytterligere andre anvendelser for de adenovirale proteiner ifølge oppfinnelsen lett være åpenbare for fagmannen.
n. Rekombinante adenoviralvektorer
Sammensetningene ifølge foreliggende oppfinnelse inkluderer vektorer som avgir et heterologt molekyl til celler, enten for terapeutiske eller for vaksineformål. Som anvendt heri, kan en vektor inkludere et hvilket som helst genetisk element inkludert, uten begrensning, nakent DNA, en phag, et transposon, et kosmid, et episom, et plasmid eller et virus. Slike vektorer inneholder simian-adenovirus-DNA av Pan5, Pan6, Pan7, SVI, SV25 og/eller SV39 og et minigen. Med "minigen" menes kombinasjonen av et valgt, heterologt gen og de andre regulatoriske elementer nødvendige for å drive translasjonen, transkripsjonen og/eller ekspresjonen av genproduktet i en vertscelle.
Typisk blir en adenoviral vektor ifølge oppfinnelsen konstruert slik at minigenet er lokalisert i et nukleinsyremolekyl inneholdende andre adenovirale sekvenser i området nativt til et valgt adenoviralt gen. Minigenet kan skytes inn i et foreliggende genområde for å bryte opp funksjonen i dette området, dersom dette er ønskelig. Alternativt kan minigenet innføres på setet for et partielt eller fullt deletert, adenoviralt gen. For eksempel kan minigenet være lokalisert på setet for et slikt, som setet for en funksjonell El-delesjon eller funksjonell E3-delesjon, blant andre som kan velges. Uttrykkes "funksjonelt deletert" eller "funksjonell delesjon" betyr at en tilstrekkelig mengde av genområdet er fjernet eller på annen måte skadet, for eksempel ved mutasjon eller modifikasjon, slik at genområdet ikke lenger er i stand til å gi funksjonelle produkter av genekspresjon. Dersom ønskelig kan hele genområdet fjernes. Andre egnete seter for genoppbryting eller delesjon er diskutert på annet sted i foreliggende beskrivelse.
For eksempel kan en produksjonsvektor som er nyttig for generering av et rekombinant virus, inneholde minigenet og enten den 5' og 3' ende av det adenovirale genomet, eller både 5' og 3' ende av dette adenovirale genom. Det adenovirale genomets 5' ende inneholder de 5' cis-elementene som er nødvendige for pakking og replikering; dvs. de 5' inverterte repetisjonsterminal ( YTR) sekvensene (som virker som replikasjonsopprinnelser) og de native 5' pakkingsforbedringsdoméner (som inneholder sekvensene nødvendige for pakking av lineære Ad-genomer og enhancerelementene for El-promoteren). Den 3' ende av det adenovirale genom inkluderer de 3' cis-elementene (inkludert ITRene) som er nødvendige for pakking og enkapsidering. Hensiktsmessig er et rekombinant adenovirus inneholdende både de 5'- og 3'-adenovirale cis-elementene og minigenet, lokalisert mellom de 5'- og 3'-adenovirale sekvensene. En hvilken som helst adenoviral vektor ifølge oppfinnelsen kan også inneholde ytterligere adenovirale sekvenser.
Hensiktsmessig inneholder disse adenovirale vektorer ifølge oppfinnelsen ett eller flere adenovirale elementer avledet fra et adenoviralt genom ifølge oppfinnelsen. I en utførelsesform inneholder vektorene adenovirale ITRer fra Pan5, Pan6, Pan7, SVI, SV25 eller SV39 og ytterligere adenovirale sekvenser fra den samme adenovirale serotype. I en annen utførelsesform inneholder vektorene adenovirale sekvenser avledet fra en annen adenoviral serotype enn den som gir ITRene. Som definert heri, henviser en pseudotypet adenovirus til en adenovirus der kapsidproteinet av adenoviruset er fra en annen serotype enn den serotype som gir ITRene. Valget av serotype for ITRene og serotypen for en hvilken som helst annen adenoviral sekvens som er til stede i vektoren, er ingen begrensning for oppfinnelsen. Et antall adenovirusstammer er tilgjengelige fra "American Type Culture Collection", Manassas, Virginia, eller tilgjengelig på forespørsel fra et antall kommersielle og institusjonelle kilder. Videre er sekvensene for mange slike stammer tilgjengelige fra et antall databaser, inkludert for eksempel PubMed og GenBank. Homologe adenovirusvektorer fremstilt fra andre simian- eller fra humant adenovirus er beskrevet i publisert litteratur. Se for eksempel U.S. patent nr. 5 240 846. DNA-sekvensene for et antall adenovirustyper er tilgjengelige fra GenBank inkludert type Ad5 (GenBank tilgangsnr. M73260). Adenovirussekvensene kan være oppnådd fra en hvilken som helst kjent adenovirusserotype, som serotypene 2, 3, 4, 7, 12 og 40, og videre inkludert en hvilken som helst av de i dag identifiserte humane typer. Tilsvarende adenovirus kjent for å infektere ikke-humane dyr (for eksempel aper), kan også benyttes i vektorkonstruksj onene ifølge foreliggende oppfinnelse. Se for eksempel U.S. patent nr. 6 083 716.
De virale sekvensene, hjelpervirus, hvis nødvendig, og rekombinante virale partikler, og andre vektorkomponenter og sekvenser anvendt i vektorkonstruksj onen beskrevet heri, oppnås som beskrevet ovenfor. DNA-sekvensene for Pan5-, Pan6-, Pan7-, SVI-, SV25-og/eller SV39-simian-adenovirussekvensene ifølge oppfinnelsen anvendes for vektorkonstruksj oner og cellelinjer nyttige i fremstillingen av slike vektorer.
Modifikasjoner av nukleinsyresekvenser som danner vektorene ifølge oppfinnelsen inkludert sekvensdelesjoner, innskudd eller andre mutasjoner, kan genereres ved anvendelse av standard molekylbiologiteknikker og ligger innenfor oppfinnelsens ramme.
A. " Minigenet"
Fremgangsmåtene som benyttes for valg av transgenet, kloningen og konstruksjonen av "minigenet" og dets innføring i den virale vektor, ligger innenfor teknikkens kunnskapsområder gitt den heri beskrevne lære.
1. Transgenet
Transgenet er en nukleinsyresekvens, heterolog til vektorsekvensen som flankerer transgenet, som koder et polypeptid, protein eller et annet produkt av interesse. Nukleinsyrekodingssekvensen er operativt forbundet til regulatoriske komponenter på en måte som tillater transgen transkripsjon, translasjon og/eller ekspresjon i en vertscelle.
Sammensetningen av transgensekvensen vil avhenge av den tilsiktete anvendelse for den resulterende vektor. For eksempel inkluderer én type transgensekvens en rapportørsekvens som ved ekspresjon gir et påvisbart signal. En slik rapportørsekvens inkluderer, uten begrensning, DNA-sekvenser som koder P-lactamase, P-galactosidase (LacZ), alkalisk fosfatase, tymidinkinase, grønn fluorescent protein (GFP), kloramfenikolacetyltransferase (CAT), luciferase, membranbundne proteiner som inkluderer for eksempel CD2, CD4, CD8, influensahemagglutininproteinet og andre velkjente i teknikken, mot hvilke det eksisterer høyaffinitetsantistoffer rettet mot disse, eller slike kan fremstilles ved konvensjonelle midler, og fusjonsproteiner omfattende et membranbundet protein, hensiktsmessig fusert til et antigen tag-doméne fra blant annet hemagglutinin eller Mye. Når disse kodende sekvensene er assosiert med regulatoriske elementer som driver deres ekspresjon, gir påvisbare signaler ved konvensjonelle midler, inkludert enzymatisk, radiografisk, kollorimetrisk, fluorescens eller andre spektrografiske analyser, fluorescentaktiverende celler sorterende analyser og immunologiske analyser, inkludert enzymforbundet immunosorbentanalyse (ELISA), radioimmunoanalyse (RIA) og immunohistokjemi. Der for eksempel markørsekvensene er LacZ-genet, blir nærværet av vektoren som bærer signalet påvist ved analyse på P-galaktosidaseaktivitet. Der transgenet er GFP eller luciferase, kan vektoren som bærer signalet måles visuelt ved farge- eller lysproduksjon i et luminometer.
Transgenet er imidlertid ønskelig en ikke-markørsekvens som koder et produkt som er nyttig i biologi og medisin, slik som proteiner, peptider, RNA, enzymer eller katalytiske RNAer. Ønskelige RNA-molekyler inkluderer tRNA, dsRNA, ribosomalt RNA,
katalytiske RNAer og antisens RNAer. Ett eksempel på nyttig RNA-sekvens er en sekvens som utvisker ekspresjonen av en målrettet nukleinsyresekvens i det behandlete dyret.
Transgenet kan benyttes for behandling, for eksempel for genetiske defekter, som et cancerterapeutikum eller en vaksine, for induksjon av en immunrespons, og/eller for profylaktiske vaksineformål. Som anvendt heri, henviser induksjon av en immunrespons til et molekyls evne (for eksempel et genprodukt) til å indusere en T-celle- og/eller en humoral immunrespons til molekylet. Oppfinnelsen inkluderer videre anvendelse av flere transgener, for eksempel for å korrigere eller forbedre en tilstand forårsaket av et multisubenhetsprotein. I visse situasjoner kan forskjellige transgener benyttes for å kode hver proteinsubenhet eller å kode forskjellige peptider eller proteiner. Det er ønskelig når størrelsen av det DNA som koder proteinsubenheten er stor, for eksempel for et immunglobulin, den plateavledete vekstfaktoren eller et dystrofinprotein. For at cellen skal kunne produsere multisubenhetproteinet blir en celle infisert med det rekombinante virus inneholdende alle de forskjellige subenheter. Alternativt kan forskjellige subenheter av et protein kodes ved det samme transgen. I dette tilfellet inkluderer et enkelt transgen det DNA som koder hver av subenhetene, der DNA for hver subenhet er separert av et internt ribozyminngangssete (IRES). Dette er ønskelig når størrelsen av det DNA som koder hver av subenheten er lite, for eksempel den totale størrelsen av det DNA som koder subenheten og IRES er mindre enn 5 kilobaser. Som et alternativ til en IRES kan det angjeldende DNA være separert ved sekvenser som koder et 2A-peptid, som spalter seg selv i en posttranslasjonal hendelse. Se for eksempel M. L. Donnelly et al., J. Gen. Virol. 78(Pt 1): 13-21 (jan. 1997); Furier, S., et al, Gene Ther., 8(11):864-873 (juni 2001); Klump, H., et al, Gene Ther. 8(10):811-817 (mai 2001). Dette 2A-peptidet er betydelig mindre enn et IRES, som gjør det velegnet for anvendelse når rommet er en begrensende faktor. Imidlertid kan det valgte transgen kode et hvilket som helst biologisk aktivt produkt eller annet produkt, for eksempel et produkt som er ønskelig for studie.
Egnete transgener kan lett velges av fagmannen. Valget av transgenet anses ikke som begrensende for oppfinnelsen.
2. Regulatoriske elementer
I tillegg til hovedelementene som er definert ovenfor for minigenet, inkluderer vektoren også konvensjonelle kontrollelementer nødvendige, og som er operativt forbundet med transgenet på en måte som tillater dets transkripsjon, translasjon og/eller ekspresjon i en celle transfektert med plasmidvektoren eller infektert med det virus som produseres ifølge oppfinnelsen. Som anvendt heri inkluderer "operativt forbundet" sekvenser, både ekspresjonskontrollsekvenser som er sammenfallende med genet av interesse og ekspresjonskontrollsekvenser som virker i trans eller på en avstand for å kontrollere genet av interesse.
Ekspresjonskontroll sekvenser inkluderer egnete transkripsjonsoppstart, terminering,
promoter og enhancersekvenser; effektiv RNA-prosesseringssignaler som spleise- og polyadenylerings (polyA)signaler; sekvenser som stabiliserer cytoplasmisk mRNA;
sekvenser som forbedrer translasjonseffektiviteten (dvs. Kozak konsensussekvens);
sekvenser som forbedrer proteinstabilitet; og hvis ønskelig sekvenser som forbedrer sekresjon av det kodete produktet. Et stort antall ekspresjonskontrollsekvenser inkludert native promotere, konstitutive, induserbare og/eller vevsspesifikke, er velkjente i teknikken og kan anvendes.
Eksempler på konstitutive promotere inkluderer uten begrensning det retrovirale Rous sarkomvirus (RSV) LTR-promoteren (eventuelt med RSV-enhanceren),
cytomegalovirus (CMV) promoteren (eventuelt med CMV-enhanceren), (se for eksempel Boshart et al, Cell, 41:521-530 (1985)), SV40-promoteren,
dihydrofolatreduktasepromoteren, P-aktin-promoteren, fosfoglyserolkinase (PGK) promoteren og EFla-promoteren (Invitrogen).
Induserbare promotere tillater regulering av genekspresjon og kan reguleres ved eksogent tilførte forbindelser, omgivelsesfaktorer som temperatur eller nærvær av en spesifikk, fysiologisk tilstand, for eksempel en akutt fase, en spesiell differensieringstilstand for cellen, eller kun ved replikering av celler. Induserbare promotere og systemer er tilgjengelige fra et antall kommersielle kilder som uten begrensning inkluderer Invitrogen, Clontech og Ariad. Mange andre systemer er beskrevet og kan lett velges av fagmannen. For eksempel inkluderer induserbare promotere den sink-induserbare saue-metalltionin (MT) promoter og det deksametason (Dex) induserbare musebrysttumorvirus (MMTV) promoter. Andre induserbare systemer inkluderer T7-polymerasepromotersystemet (WO 98/10088); ecdyson-insektpromoteren (No et al, Proe. Nati. Acad. Sei. USA, 93:3346-3351 (1996)), det tetrasyklinrepressive systemet (Gossen et al, Proe. Nati. Acad. Sei. USA, 89:5547-5551
(1992)), det tetrasyklininduserbare systemet (Gossen et al, Science, 268:1766-1769
(1995)), se også Harvey et al, Curr. Opin. Chem. Biol, 2:512-518 (1998)). Andre systemer inkluderer FK506-dimeren, VP 16 eller p65 som benytter castradiol, difenol murisleron, det RU486-induserbare systemet (Wang et al, Nat. Biotech., 15:239-243
(1997) og Wang et al, Gene Ther., 4:432-441 (1997)), og det rapamycininduserbare systemet (Magari et al, J. Clin. Invest., 100:2865-2872 (1997)). Noen induserbare promotere øker sin effektivitet med tiden. I slike tilfeller kan effektiviteten forbedres i slike systemer ved innføring av flere repressorer i tandem, for eksempel TetR forbundet med en TetR via en IRES. Alternativt kan man vente minst tre dager før man avsøker det hele for den ønskete funksjon. Ekspresjon av ønskete proteiner kan forbedres ved kjente midler for å bedre effektiviteten av dette systemet. For eksempel ved anvendelse av Woodchuck Hepatitis Virus Posttranscriptional Regulatory Element (WPRE).
I en annen utførelsesform kan den native promoter for transgenet benyttes. Den native promoter kan være foretrukket når det er ønskelig at ekspresjonen av transgenet etterlikner den native ekspresjonen. Den native promoteren kan benyttes når ekspresjonen av transgenet må reguleres midlertidig eller utviklingsmessig, eller på en vevsspesifikk måte, eller som respons på spesifikk transkripsjonsstimulering. I en ytterligere utførelsesform kan andre native ekspresjonskontroll elementer som enhancerelementer, polyadenyleringsseter eller Kozaks konsensussekvenser også anvendes for å etterlikne den native ekspresjonen.
Andre utførelsesformer av transgenet inkluderer et transgen, operativt forbundet med en vevsspesifikk promoter. Hvis ekspresjon i skjelettmuskelen er ønskelig, bør det for
eksempel benyttes en promoter som er aktiv i muskelen. Disse inkluderer promotere fra gener som koder skjelett P-aktin, myosin lettkjede 2A, dystrofin, muskelkreatinkinase så vel som syntetiske muskelpromotere, som aktiverer høyere enn naturlig forekommende promotere (se Li et al, Nat. Biotech., 17:241-245 (1999)). Eksempler på promotere som virker vevs-spesifikt er kjent for lever (albumin, Miyatake etal., J. Virol., 71:5124-32
(1997); hepatitt-B viruskjernepromoter, Sandig etal, Gene Ther., 3:1002-9 (1996); <x-fetoprotein (AFP), Arbuthnot etal, Hum. Gene Ther., 7:1503-14 (1996)), benosteokalsin (Stein et al, Mol. Biol. Rep., 24:185-96 (1997)); bensialoprotein (Chen etal, J. Bone Miner. Res., 11:654-65 (1996)), lymfocytter (CD2, Hansal etal, J. Immunol, 161:1063-8 (1998)); immunoglobulin tungkjede; T-celle reseptorkjede), neuronal som neuronspesifikk enolase (NSE) promoteren (Andersen etal, Cell. Mol. Neurobiol, 13:503-15 (1993)), neurofilament lettkjedegen (Piccioli etal, Proe. Nati. Acad. Sei. USA, 88:5611-5 (1991)), og det neuronspesifikke vgf-genet (Piccioli etal, Neuron, 15:373-84 (1995)), blant andre.
Eventuelt kan vektorer som bærer transgener som koder terapeutisk nyttige eller immunogene produkter også inkludere valgbare markører eller rapportørgener som kan inkludere sekvenser som koder geneticin, hygromicin- eller purimycinresistens, blant andre. Slike valgbare rapportører eller markørgener (fortrinnsvis lokalisert utenfor det virale genom som skal pakkes til en viral pakke), kan anvendes for å signalisere nærvær av plasmidet i bakterieceller, som ampicillinresistens. Andre vektorkomponenter kan inkludere replikasjonsopprinnelse. Valg av disse og andre promotere og vektorelementer er konvensjonelle og mange slike sekvenser er tilgjengelige (se for eksempel Sambrook et al., og referanser sitert deri).
Disse vektorer genereres ved anvendelse av teknikker og sekvenser som gis her, i forbindelse med teknikker som er velkjente for fagfolk på området. Slike teknikker inkluderer konvensjonelle kloningsteknikker av cDNA, som de som er beskrevet i litteraturen (Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press, Cold Spring Harbor, NY), ved anvendelse av overlappende oligonukleotidsekvenser av adenovirusgenomene, polymerasekjedereaksjon, og enhver egnet fremgangsmåte som gir den ønskete nukleotidsekvens.
Ul. Fremstilling av den rekombinante viralpartikkel
I en utførelsesform anvendes simian-adenoviralplasmidene (eller andre vektorer) for fremstilling av rekombinante adenovirale partikler. I en utførelsesform er de rekombinante adenovirus funksjonelt deletert i Ela- eller Elb-genene og bærer eventuelt andre mutasjoner, for eksempel temperaturfølsomme mutasjoner eller delesjoner i andre gener. I andre utførelsesformer er det ønskelig å beholde et intakt Ela- og/eller Elb-område i de rekombinante adenovirus. Et slikt intakt El-område kan være lokalisert i sin native plassering i det adenovirale genom, eller plassert på setet for en delesjon i det native adenovirale genom (for eksempel i E3-regionen).
Ved konstruksjon av nyttige simian-adenovirusvektorer for avlevering av et gen til den humane (eller annet pattedyr) celle, kan et spektrum av adenovirusnukleinsyresekvenser anvendes i vektorene. For eksempel kan hele eller en del av det adenovirusforsinkete
tidlige gen E3 være eliminert fra simian-adenovirussekvensene som utgjør en del av det rekombinante virus. Funksjonen av simian-E3 antas å være irrelevant for funksjonen og fremstillingen av den rekombinante viruspartikkelen. Simian-adenovirusvektorer kan også konstrueres til å ha en delesjon av minst ORF6-området av E4-genet, og mer ønskelig på grunn av funksjonsoverskuddet i dette området: hele E4-området. Ytterligere en annen vektor ifølge oppfinnelsen inneholder en delesjon i det forsinkete tidlig-gen E2a. Delesjoner kan også foretas i et hvilket som helst av de sene gener LI til og med L5 i simian-adenovirusgenomet. Tilsvarende kan delesjoner i de mellomliggende gener YK og IVa2være nyttige for visse formål. Andre delesjoner kan foretas i andre strukturelle eller ikke-strukturelle adenovirusgener. De ovenfor
diskuterte delesjoner kan anvendes individuelt, dvs. at en adenovirussekvens for anvendelse ifølge oppfinnelsen kan inneholde delesjoner i kun ett enkelt område. Alternativt kan delesjonen av hele gener eller deler derav, effektive for å ødelegge deres biologiske aktivitet, anvendes i en hvilken som helst kombinasjon. For eksempel kan i ett eksempel på en vektor, adenovirussekvensen ha delesjoner av El-genene og E4-genet, eller El-, E2a- og E3-genene, eller El - og E3-genene, eller El-, E2a- og E4-genene, med eller uten delesjon av E3, osv. Som diskutert ovenfor, kan slike delesjoner anvendes i kombinasjon med andre mutasjoner som temperaturfølsomme mutasjoner for å oppnå et ønsket resultat.
En adenoviral vektor som mangler en hvilken som helst vesentlig adenoviral sekvens (for eksempel Ela, Elb, E2a, E2b, E4 ORF6, LI, L2, L3. L4 og L5) kan dyrkes i nærvær av de manglende adenovirale genprodukter som kreves for viral infektivitet og propagering av en adenoviral partikkel. Disse hjelperfunksjoner kan tilveiebringes ved å dyrke den adenovirale vektor i nærvær av én eller flere hjelperkonstruksjoner (for eksempel et plasmid eller et virus) eller en pakkende vertscelle. Se for eksempel teknikkene beskrevet for fremstilling av en "minimal" human Ad-vektor i W096/13597, publisert 9. mai 1996, hvortil det vises for detaljer.
1. Hjelpervirus
Avhengig av simian-adenovirusgeninneholdet i de virale vektorer som anvendes for å bære minigenet, kan et hjelperadenovirus eller et ikke-replikerende virusfragment være nødvendig for å gi tilstrekkelige simian-adenovirusgensekvenser som er nødvendige for å gi en infeksiøs, rekombinant viral partikkel inneholdende minigenet. Nyttige hjelpervirus inneholder utvalgte adenovirusgensekvenser som ikke er til stede i adenovirusvektorkonstruksjonen og/eller ikke uttrykkes av den pakkende cellelinjen hvori vektoren er transfektert. I en utførelsesform er hjelpervirusene replikasjonsdefekte og inneholder et antall adenovirusgener i tillegg til sekvensene beskrevet ovenfor. Et slikt hjelpervirus benyttes helst i kombinasjon med en El-uttrykkende cellelinje.
Hjelpervirus kan også tilformes til polykationkonjugater som beskrevet av Wu et al, J. Biol. Chem., 264:16985-16987 (1989); K. J. Fisher og J. M. Wilson, Biochem. J., 299:49 (1. april 1994). Hjelpervirus kan evt. inneholde et andre rapportør-minigen. Et antall slike rapportørgener er kjente i teknikken. Nærvær av et rapportørgen på et hjelpervirus som er forskjellig fra transgenene på adenovirusvektorene, tillater både Ad-vektoren og hjelperviruset uavhengig kan overvåkes. Dette andre rapportørgenet benyttes for å muliggjøre separering mellom det resulterende rekombinante virus og hjelperviruset ved rensing.
2. Kompi etteringscellelinj er
For å generere rekombinante simian-adenovirus (Ad) som er deletert i et hvilket som helst av genene som beskrevet ovenfor, må funksjonen av det deleterte genområdet, hvis det er vesentlig for replikasjonen og infektiviteten for viruset, tilføres det rekombinante virus via et hjelpervirus eller en cellelinje, dvs. en kompletterings- eller pakkingscellelinje. I mange tilfeller kan en cellelinje som uttrykker den humane El benyttes for å transkomplettere sjimp-Ad-vektoren. Dette er spesielt fordelaktig fordi, pga. forskjellen mellom sjimp-Ad-sekvensene ifølge oppfinnelsen og de humane AdEl-sekvensene funnet i dagens tilgjengelige pakkingsceller, forhindrer bruk av de vanlige humane El-inneholdende celler generering av replikasjonskompetente adenovirus under replikerings- og fremstillingsprosessen. Imidlertid vil det under visse omstendigheter være ønskelig å benytte en cellelinje som uttrykker El-genproduktene som kan benyttes for fremstilling av et El-deletert simian-adenovirus. Slike cellelinjer er beskrevet. Se for eksempel U.S. patent nr. 6 083 716.
Hvis ønskelig kan sekvensene som er gitt her anvendes for å generere en pakkende celle eller cellelinje som i det minste uttrykker adenovirus-El-genet fra Pan5, Pan6, Pan7,
SVI, SV25 eller SV39 under den transkripsjonale kontroll av en promoter for ekspresjonen i en utvalgt opphavscellelinje. Induserbare eller konstitutive promotere kan anvendes til dette formål. Eksempler på slike promotere er beskrevet i detalj annet sted i foreliggende beskrivelse. En opphavscelle velges for generering av en ny cellelinje som uttrykker et hvilket som helst ønsket AdPan5-, Pan6-, Pan7-, SVI-, SV25- eller SV39-gen. Uten begrensning kan en slik opphavscellelinje være HeLa (ATCC aksessnr. CCL2), A549 (ATCC aksessnr. CCL 185), HEK 293, KB (CCL 17), Detroit (for eksempel Detroit 510, CCL 72) og WI-38 (CCL 75) celler, blant andre. Disse cellelinjer er alle tilgjengelige fra "American Type Culture Collection", Manassas, Virginia 20110-2209. Andre egnete opphavscellelinjer kan oppnås fra andre kilder.
Slike El-uttrykkende cellelinjer er nyttige ved generering av rekombinante simian-adenovirus-El-deleterte vektorer. I tillegg, eller alternativt tilveiebringer oppfinnelsen cellelinjer som uttrykker ett eller flere simian-adenovirale genprodukter, for eksempel kan Ela, Elb, E2a og/eller E4 ORF6 konstrueres ved anvendelse av i det vesentligste de samme prosedyrer for anvendelse ved generering av rekombinante simian-virale vektorer. Slike cellelinjer kan anvendes for transkomplettering av adenovirusvektorer som er deleterte i de essensielle gener som koder disse produkter, eller for å tilveiebringe hjelperfunksjoner nødvendige for pakking av et hjelperavhengig virus (for eksempel adenoassosiert virus). Fremstillingen av en vertscelle ifølge oppfinnelsen involverer teknikker som sammensetning av valgte DNA-sekvenser. Denne sammensetningen kan gjennomføres ved anvendelse av konvensjonelle teknikker. Slike teknikker inkluderer cDNA- og genomisk kloning, alt velkjent og beskrevet av Sambrook et al., supra, ved bruk av overlappende oligonukleotidsekvenser fra adenovirale genomer kombinert med polymerasekjedereaksjon, syntetiske fremgangsmåter og hvilke som helst andre egnete fremgangsmåter som vil gi den ønskete nukleotidsekvens.
I nok et annet alternativ tilveiebringes de essensielle adenovirale genprodukter intrans ved adenovirale vektor og/eller hjelpervirus. I et slikt tilfelle kan en egnet vertscelle velges fra en hvilken som helst biologisk organisme inkludert prokaryote (for eksempel bakterielle) celler, og eukaryote celler inkludert insektceller, gjærceller og pattedyrceller. Spesielt ønskelige vertsceller er valgt blant hvilke som helst pattedyrsarter, inkludert, men uten begrensning, celler som A549, WEFfl, 3T3, 10T1/2, HEK293-celler eller PERC6 (av hvilke begge uttrykker funksjonelt adenoviralt El)
(Fallaux, F. J. etal, (1998), Hum. Gene Ther., 9:1909-1917), Saos, C2C12, L-celler,
HT1080, HepG2 og primære fibroblast-, hepatocyt- og myoblastceller avledet fra pattedyr, inkludert menneske, ape, mus, rotte, kanin og hamster. Valget av pattedyrspesier for å tilveiebringe cellene er ingen begrensning ved oppfinnelsen; ei heller pattedyrcelletype, dvs. fibroblast-, hepatocyt-, tumorcelle, osv. 3. Sammensetning av viralpartikkel og transfeksjon av en cellelinje Ved avlevering av vektoren som omfatter minigenet ved transfeksjon blir vektoren generelt avlevert i mengde på fra omkring 5 ug til 100 ug DNA, og fortrinnsvis omkring 10 til 50 ug DNA til omkring lx IO4 celler til omkring lxlO<13>celler, og fortrinnsvis omkring 10<5>celler. Imidlertid kan disse relative mengder av vektor-DNA til vertscellen justeres, tatt i betraktning faktorer som den valgte vektor, avleveringsmetoder og de valgte vertsceller.
Vektoren kan være en hvilken som helst vektor som er kjent i teknikken eller beskrevet ovenfor, inkludert nakent DNA, et plasmid, en phag, et transposon, kosmider, episomer, virus, osv. Innføring i vektorens vertscelle kan oppnås på enhver måte som kjent i teknikken, eller som beskrevet ovenfor inkludert transfeksjon og infeksjon. Ett eller flere av de adenovirale gener kan stabilt integreres i vertscellens genom, stabilt uttrykkes som episomer eller uttrykkes transient. Genproduktene kan alle uttrykkes transgent, på et episom eller integreres stabilt, eller noen av genproduktene kan uttrykkes stabilt, mens andre uttrykkes transient. Videre kan promoterne for hvert av de virale gener velges uavhengig fra en konstitutiv promoter, en induserbar promoter eller en nativ, adenoviral promoter. Disse promoterne kan reguleres ved en spesifikk fysiologisk tilstand for organismen eller cellen (dvs. ved differensieringstilstander eller i replikerende eller hvilende celler) eller ved for eksempel eksogent tilsatte faktorer.
Innføring av molekylene (som plasmider eller virus) i vertscellen kan også gjennomføres ved anvendelse av teknikker som er kjent av fagmannen, og som diskutert her i beskrivelsen. I en foretrukket utførelsesform benyttes standard transfeksjonsteknikker, for eksempel CaPCvtransfeksjon eller elektroporering.
Sammensetning av de valgte adenovirus-DNA-sekvenser (så vel som transgenet) og andre vektorelementer til forskjellige mellomplasmider, og anvendelse av plasmidene og vektorene for fremstilling av en rekombinant, viral partikkel, oppnås ved anvendelse av konvensjonelle teknikker. Slike teknikker inkluderer konvensjonelle kloningsteknikker og cDNA, slik som dem som er beskrevet i litteraturen (Sambrook et al., suprd), bruk av overlappende oligonukleotidsekvenser av adenovirusgenomene, polymerasekjedereaksjon, og en hvilken som helst egnet fremgangsmåte som gir den ønskete nukleotidsekvens. Standard transfeksjons- og kotransfeksjonsteknikker benyttes, for eksempel CaPCvpresipiteringsteknikker. Andre konvensjonelle metoder som benyttes inkluderer homolog rekombinering av de virale genomer, plakkdannelse av virus i agaroversjikt, fremgangsmåter for måling av signalgenerering, og liknende.
Etter konstruksjon og sammensetning av den ønskete minigenholdige, virale vektor, transfekteres for eksempel vektoren in vitro i nærvær av et hjelpervirus inn i den pakkende cellelinjen. Homolog rekombinasjon inntrer mellom hjelper- og vektorsekvensene, noe som tillater at adenovirus-transgen-sekvensene i vektorene replikeres og pakkes til virionkapsider, noe som resulterer i de rekombinante, virale vektorpartikler. Den gjengse metode for fremstilling av slike viruspartikler er transfeksjonsbasert. Imidlertid er oppfinnelsen ikke begrenset til slike metoder.
De resulterende rekombinante simianadenovirus er nyttige ved overføring av et valgt transgen til en valgt celle. I in vzvo-forsøk med de rekombinante virus dyrket i de pakkende cellelinjer demonstrerer de El-deleterte, rekombinante simianadenovirale vektorer ifølge oppfinnelsen anvendelighet ved overføring av et transgen til en ikke-simian og fortrinnsvis en human celle.
IV. Anvendelse av de rekombinante adenovirusvektorene
De rekombinante simianadenovirusvektorer ifølge oppfinnelsen er nyttige for genoverføring til en human eller ikke-simian veterinærpasient in vitro, ex vivo og in vivo.
De rekombinante adenovirusvektorer som er beskrevet her, kan anvendes som ekspresjonsvektorer for fremstilling av produktene som kodes av de heterologe gener in vitro. For eksempel kan de rekombinante adenovirus som inneholder et gen innskutt i plasseringen av en El-delesjon, transfekteres inn i en El-uttrykkende cellelinje, som beskrevet ovenfor. Alternativt kan replikasjonskompetente adenovirus benyttes i en annen valgt cellelinje. De transfekterte celler dyrkes så på konvensjonell måte og tillater det rekombinante adenovirus å uttrykke genproduktet fra promotere. Genproduktet kan så gjenvinnes fra kulturmediet ved kjente, konvensjonelle fremgangsmåter som proteinisolering og gjenvinning fra kultur.
En Pan5-, Pan6-, Pan7-, SVI-, SV25- eller SV39-avledet rekombinant simianadenoviral vektor ifølge oppfinnelsen gir en effektiv genoverføringsvehikkel som kan avlevere et valgt transgen til en valgt vertscelle in vivo eller ex vivo, selv når organismen har nøytraliserende antistoffer mot én eller flere AAV-serotyper. I en utførelsesform blandes rAAV og cellene ex vivo; de infekterte celler dyrkes ved konvensjonelle fremgangsmåter; og de transduserte celler reinfuseres i pasienten. Disse sammensetningene er spesielt egnet for genavlevering for terapeutiske formål og for immunisering inkludert indusering av beskyttende immunitet.
Mer vanlig vil Pan5-, Pan6-, Pan7-, SVI-, SV25- eller SV39-rekombinante adenovirale vektorer ifølge oppfinnelsen anvendes for avlevering av terapeutiske eller immunogene molekyler, som beskrevet ovenfor. Det vil lett forstås for begge anvendelse at de rekombinante, adenovirale vektorer ifølge oppfinnelsen er spesielt godt egnet for anvendelse i regimer som involverer gjentatt avlevering av de rekombinante, adenovirale vektorer. Slike regimer involverer karakteristisk avlevering av en serie virale vektorer hvori de virale kapsider er endret. De virale kapsider kan forandres for hver etterfølgende tilførsel, om det på forhånd er valgt et antall tilførsler av et spesielt serotypekapsid (for eksempel en, to, tre, fire eller flere). Således kan et regime involvere avlevering av en rAd med et første simiankapsid, avlevering av en rAd med et andre simiankapsid, og avlevering med et tredje simiankapsid. Et antall andre regimer som benytter Ad-kapsider ifølge oppfinnelsen alene, i kombinasjon med et annet, eller i kombinasjon med andre Ad-serotyper, vil være åpenbart for fagmannen. Eventuelt kan et slikt regime involvere tilførsel av rAd med kapsider av andre, ikke-humane primatadenovirus, humant adenovirus eller kunstige serotyper, som beskrevet her. Hver fase av regimet kan involvere tilførsel av en serie injeksjoner (eller andre avleveringsruter) med et enkelt Ad-serotypekapsid, fulgt av en serie med et annet Ad-serotypekapsid. Alternativt kan de rekombinante Ad-vektorer ifølge oppfinnelsen benyttes i regimer som involverer andre ikke-adenoviralmedierte avleveringssystemer, inkludert andre virale systemer, ikke-virale avleveringssystemer, protein, peptider og andre biologisk aktive molekyler.
De påfølgende avsnitt vil fokusere på eksempler på molekyler som kan avleveres via de adenovirale vektorer ifølge oppfinnelsen.
A. Ad-mediert avlevering av terapeutiske molekyler
I en utførelsesform tilføres de ovenfor beskrevne rekombinante vektorer til mennesker i henhold til publiserte fremgangsmåter for genterapi. En simianviral vektor som bærer det valgte transgen, kan tilføres til en pasient, fortrinnsvis suspendert i en biologisk kompatibel oppløsning, eller en farmasøytisk akseptabel avleveringsbærer. En egnet bærer inkluderer steril saltoppløsning. Andre vandige eller ikke-vandige isotoniske, sterile injeksjonsoppløsninger og vandige og ikke-vandige sterile suspensjoner som er kjent for å være farmasøytisk akseptable bærere, og som er velkjente for fagmannen, kan anvendes for dette formål. De simianadenovirale vektorer tilføres i tilstrekkelige mengder til å transdusere målcellen, og å gi tilstrekkelig nivå for genoverføring og ekspresjon for å tilveiebringe en terapeutisk fordel uten urimelige, ugunstige elementer, eller med medisinsk akseptable, fysiologiske effekter, som lett kan bestemmes av fagmannen på området. Konvensjonelle og farmasøytisk akseptable tilførselsruter inkluderer, men er ikke begrenset til direkte avlevering til retina og andre intraokkulære avleveringsmetoder, direkte avlevering til lever, inhalering, intranasal, intravenøs, intramuskulær, intratrakeal, subkutan, intradermal, rektal, oral eller andre parenterale veier for tilførsel. Tilførselsveier kan kombineres dersom ønskelig, eller justeres avhengig av transgenet eller betingelsene. Tilførselsveien vil primært avhenge av tilstanden som behandles.
Doseringene av den virale vektor vil avhenge av faktorer som tilstanden som behandles, alder, vekt og pasientens helse, og kan således variere blant pasienter. For eksempel er en terapeutisk effektiv voksen, human- eller veterinærdose av den virale vektor, generelt i området omkring 100 ul til omkring 100 ml bærer inneholdende konsentrasjoner fra omkring lxlO<6>til omkring lxlO<15>parti<k>ler, omkring lxlO<11>til lxlO13 partikler eller omkring lxl09til lxlO12 partikkelvirus. Doseringer vil avhenge av dyrets størrelse og administreringsmodus. For eksempel vil en egnet human- eller veterinærdose (for et dyr på omkring 80 kg) for intramuskulær injeksjon ligge i området omkring lxlO<9>til 5xl0<12>partikler/ml for et enkelt sete. Eventuelt kan flere administreringsseter anvendes. I et annet eksempel kan en egnet human- eller veterinærdose ligge i området omkring lxlO<11>til omkring lxlO<15>partikler for oral formulering. Fagmannen på området kan justere disse doser avhengig av administreringsvei, og den terapeutiske eller vaksinesammensetningen der den rekombinante vektor benyttes. Ekspresjonsnivåene for transgenet, eller for et immunogen, nivået for sirkulerende antistoff kan overvåkes for bestemmelse av frekvensen av doseadministrering. Ytterligere andre fremgangsmåter for bestemmelse av tidsforløpet for administreringsfrekvens, vil være lett å se for fagmannen.
En eventuell fremgangsmåte involverer samtidig tilførsel til pasienten, enten samtidig med, før eller etter tilførsel av den virale vektor av en egnet mengde av en kortvirkende immunmodulator. Den valgte immunmodulator defineres her som et middel i stand til å inhibere dannelse av nøytraliserende antistoffer rettet mot den rekombinante vektor ifølge oppfinnelsen, eller som er i stand til å inhibere cytolytisk T-lymfocytt (CTL) eliminering av vektoren. Immunmodulatoren kan interferere med interaksjonene mellom T-hjelper sub-settene (THieller TH2) og B-celler for å inhibere nøytralisering av antistoffdannelse. Alternativt kan immunmodulatorer inhibere interaksjoner mellom THi-celler og CTLer for å redusere opptreden av CTL-eliminering av vektoren. Et antall nyttige immunmodulatorer og doseringer for anvendelse av disse, er for eksempel beskrevet av Yang et al, J. Virol, 70(9) (september 1996); WO96/12406, publisert 2.
mai 1996; samt PCT/US96/03035, idet det vises til alle disse vedrørende detaljer.
1. Terapeutiske transgener
Nyttige terapeutiske produkter som kodes av transgenet, inkluderer hormoner og vekst-og differensieringsfaktorer som uten begrensning inkluderer insulin, glukagon, veksthormon (GH), paratyroid hormon (PTH), veksthormonfrigivende faktor (GRF), follikkel stimulerende hormon (FSH), luteiniserende hormon (LH), humant korionisk gonadotropin (hCG), vaskulær endotelial vekstfaktor (VEGF), angjopoietiner, angjostatin, granulocyt kolonistimulerende faktor (GCSF), erytropoietin (EPO), konnektiv vev-vekstfaktor (CTGF), basisk fibroblast vekstfaktor (bFGF), sur fibroblast vekstfaktor (aFGF), epidermal vekstfaktor (EGF), transformerende vekstfaktor (TGF), plateavledet vekstfaktor (PDGF), insulin vekstfaktor I og II (IGF-I og IGF-II), enhver av den transformerende vekstfaktor superfamilie, inkludert TGF, aktiviner, inhibiner, eller ethvert av de benmorfogene proteiner (BMP) BMPene 1-15, enhver av heregluin/neuregulin/ARIA/neu differensierings faktor (NDF) familie av vekstfaktorer, nervevekstfaktor (NGF), hjerneavledet neurotrofisk faktor (BDNF), neurotrofiner NT-3 og NT-4/5, siliær neurotrofisk faktor (CNTF), glialcellelinjeavledet neurotrofisk faktor (GDNF), neurturin, agrin, enhver fra familien av semaforiner/collapsiner, netrin-I og netrin-2, hepatocytt vekstfaktor (HOF), efriner, noggin, sonisk hedgehog og tyrosinhydroksylase.
Andre nyttige transgene produkter inkluderer proteiner som regulerer immunsystemet og inkluderer uten begrensning cytokiner og lymfokiner som trombopoietin (TPO), interleukiner (IL) IL-1 til IL-25 (inkludert for eksempel IL-2, IL-4, IL-12 og IL-18), monocytkemoattrakterende protein, leukemiinhibitorisk faktor, granulocytmakrofagkolonistimulerende faktor, Fas-ligand, tumornekrosefaktorer og interferoner, og stamcellefaktor, flk-2/flt3-ligand. Genprodukter som fremstilles ved immunsystemet er også nyttige ifølge oppfinnelsen. Disse inkluderer uten begrensning immunglobuliner IgG, IgM, IgA, IgD og IgE, kimære immunglobuliner, humaniserte antistoffer, enkelkjedeantistoffer, T-celle reseptorer, kimære T-celle reseptorer, enkelkjede T-celle reseptorer, klasse I- og klasse U-MHC-molekyler, så vel som konstruerte immunglobuliner og MHC-molekyler. Nyttige genprodukter inkluderer også komplementregulatoriske proteiner som komplementregulatoriske proteiner, membran-kofaktorprotein (MCP), nedbrytningsakselererende faktor (DAF), CR1, CR2 og CD59.
Ytterligere andre nyttige genprodukter inkluderer enhver reseptor for hormoner, vekstfaktorer, cytokiner, lymfokiner, regulatoriske proteiner og immunsystemproteiner. Oppfinnelsen omfatter reseptorer for kolesterolregulering, inkludert lavtetthets lipoprotein (LDL) reseptor, høytetthets lipoprotein (HDL)reseptor, meget lavtetthets lipoprotein (VLDL) reseptor, og scavenger reseptoren. Oppfinnelsen omfatter også genprodukter som medlemmer av steroid hormonreseptorsuperfamilien inkludert glukokortikoidreseptorer og østrogenreseptorer, Vitamin D reseptorer og andre nukleære reseptorer. I tillegg inkluderer nyttige genprodukter transkripsjonsfaktorer som jun, fos, max, mad, serumresponsfaktor (SRF), API, AP2, myb, MyoD og myogenin, ETS-boksinneholdende proteiner, TFE3, E2F, ATF1, ATF2, ATF3, ATF4, ZF5, NFAT, CREB, HNF-4, C/EBP, SP1, CCAAT-boksbindende proteiner, interferonreguleringsfaktor (IRF-1), Wilms tumorprotein, ETS-bindende protein, STAT, GATA-boksbindende proteiner, for eksempel GATA-3 og gaffelhodefamilien av vingete heliksproteiner.
Andre nyttige genprodukter inkluderer karbamoyl syntetase L ornitintranskarbamylase, argjnosuksinatsyntetase, arginosuksinatlyase, argjnase, fumarylacetacetathydrolase, fenylalaninhydroksylase, a-1 antitrypsin, glukose-6fosfatase, porfobilinogen deaminase, faktor-Vin, faktor-IX, cystation P-syntase, forgrenet kjedeketosyredekarboksylase, albumin, isovaleryl-coA dehydrogenase, propionyl CoA karboksylase, metylmalonyl CoA mutase, glutaryl CoA dehydrogenase, insulin, P-glukosidase, pyruvatkarboksylat, hepatisk fosforylase, fosforylase kinase, glysindekarboksylase, H-protein, T-protein, en cystisk fibrose transmembranregulator (CFTR) sekvens og en dystrofin cDNA-sekvens.
Andre nyttige genprodukter inkluderer ikke-naturlig forekommende polypeptider som kimære eller hybride polypeptider med en ikke-naturlig opptredende aminosyresekvens inneholdende innskudd, delesjoner eller aminosyresubstitusjoner. For eksempel kunne enkelkjedekonstruerte immunglobuliner være nyttige hos visse immunkompromitterte pasienter. Andre typer ikke-naturlig forekommende gensekvenser inkluderer antisensmolekyler og katalytiske nukleinsyrer som ribozymer, som ville kunne benyttes for å redusere overekspresjon av et mål.
Reduksjon og/eller modulering av en genekspresjon er spesielt ønskelig for behandling av hyperproliferative tilstander som karakteriseres ved hyperprolifererende celler, slik som cancere og psoriasis er. Målpolypeptider inkluderer disse polypeptider som fremstilles utelukkende eller ved høyere nivåer i hyperproliferative celler, sammenliknet med normale celler. Målantigener inkluderer polypeptider kodet av onkogener som myb, mye, fyn og translokasjongenet bcr/abl, ras, src, P53, neu, trk og EGRF. I tillegg til onkogenprodukter som målantigener, inkluderer målpolypeptider for anticancerbehandling og beskyttende regimer variable områder av antistoffer dannet av B-cellelymfomer og variable områder av T-cellereseptorer av T-cellelymfomer, som i visse utførelsesformer også benyttes som målantigener for autoimmune sykdommer. Andre tumorassosierte polypeptider kan benyttes som målpolypeptider, for eksempel polypeptider som finnes i høyere nivåer i tumorceller inkludert polypeptider som gjenkjennes av monoklonalt antistoff 17-1A og folatbindingspolypeptider.
Andre egnete terapeutiske polypeptider og proteiner inkluderer dem som kan være nyttige for behandling av individer som lider av autoimmune sykdommer og lidelser ved å tilveiebringe en bred basert, beskyttende immunrespons mot mål assosiert med autoimmunitet, inkludert cellereseptorer og celler som gir cellerettete antistoffer. T-cellemedierte autoimmune sykdommer inkluderer reumatoid artritt (RA), multippel sklerose (MS), Sjogrens syndrom, sarkoidose, insulinavhengig diabetes mellitus (JDDM), autoimmun tyroiditt, reaktiv artritt, ankyloserende spondylitt, skleroderma, polymyositt, dermatomyositt, psoriasis, vaskulitt, Wegeners granulomatose, Crohns sykdom og ulserativ kolitt. Hver av disse sykdommer karakteriseres ved T-cellereseptorer (TCR) som binder til endogene antigener og starter den inflammatoriske kaskade som assosieres med autoimmune sykdommer.
Simianadenoviralvektorene ifølge oppfinnelsen er spesielt velegnet for terapeutiske regimer hvori flere adenoviralmedierte avleveringer av transgener er ønskelig, for eksempel i regimer som involverer ny avlevering av det samme transgen, eller i kombinasjonsregimer som involverer avlevering av andre transgener. Slike regimer kan involvere administrering av en Pan5-, Pan6-, Pan7-, SVI-, SV25- eller SV39-simianadenoviral vektor, fulgt av readministrering med en vektor fra det samme serotypeadenovirus. Spesielt egnete regimer involverer administrering av en Pan5-, Pan6-, Pan7-, SVI-, SV25- eller SV39-simianadenoviral vektor ifølge oppfinnelsen, hvori serotypen av den virale vektor som avleveres ved den første tilførselen, skiller seg fra serotypen av den virale vektor som anvendes i én eller flere av de påfølgende tilførsler. For eksempel involverer et terapeutisk regime administrering av en Pan5-, Pan6-, Pan7-, SVI-, SV25- eller SV39-vektor og gjentatt tilførsel med én eller flere adenovirale vektorer av den samme eller ulike serotyper. I et annet eksempel involverer et terapeutisk regime tilførsel av en adenoviral vektor, etterfulgt av gjentatt tilførsel med en Pan5-, Pan6-, Pan7-, SVI-, SV25- eller SV39-vektor ifølge oppfinnelsen, som skiller seg fra serotypen av den først avleverte adenovirale vektor, og eventuell ytterligere tilførsel med en annen vektor som er den samme, eller fortrinnsvis skiller seg når det gjelder serotype fra vektoren i de forutgående administreringstrinn. Disse regimer er ikke begrenset til avlevering av adenovirale vektorer konstruert ved anvendelse av Pan5-, Pan6-, Pan7-, SVI-, SV25- eller SV39-simianserotypene ifølge oppfinnelsen. Tvert imot kan disse regimer lett benytte vektorer av andre adenovirale serotyper, inkludert uten begrensning, andre simianadenovirale serotyper, for eksempel Pan9 eller C68, Cl, osv.), eller ikke-human primatadenovirale serotyper, eller humane adenovirale serotyper, i kombinasjon med én eller flere av Pan5-, Pan6-, Pan7-, SVI-, SV25- eller SV39-vektorene ifølge oppfinnelsen. Eksempler på slike simian-, andre ikke-humane primat- og humanadenovirale serotyper er diskutert annet sted i denne beskrivelsen. Videre kan disse terapeutiske regimer involvere enten samtidig eller sekvensielt, avlevering av Pan5-, Pan6-, Pan7-, SVI-, SV25- og/eller SV39-adenovirale vektorer ifølge oppfinnelsen, i kombinasjon med ikke-adenovirale vektorer, ikke-virale vektorer, og/eller et antall andre terapeutisk nyttige forbindelser eller molekyler. Oppfinnelsen er ikke begrenset til disse terapeutiske regimer, et spektrum av slike vil lett være åpenbart for fagmannen på området.
B. Ad-formidlet avlevering av immunogene transgener
De rekombinante simianadenovirus kan også anvendes som immunogene preparater. Som anvendt heri er et immunogent preparat et preparat der et humoralt (for eksempel antistoff) eller cellulært (for eksempel en cytotoksisk T-celle) respons er reist mot et transgenprodukt avlevert av det immunogene preparat etter avlevering til et pattedyr, og fortrinnsvis en primat. Foreliggende oppfinnelse tilveiebringer et rekombinant simian-Ad som i enhver av sine adenovirussekvensdelesjoner kan inneholde et gen som koder et ønsket immunogen. Simianadenovirus vil sannsynligvis være bedre egnet for anvendelse som levende, rekombinant virusvaksine i forskjellige animalske arter, sammenliknet med adenovirus av human opprinnelse, men er ikke begrenset til en sådan anvendelse. De rekombinante adenovirus kan anvendes som profylaktiske eller terapeutiske vaksiner mot ethvert patogen, for hvilke antigenet eller antigenene er identifisert(e), som er avgjørende for induksjon av en immunrespons og i stand til å begrense spredning av patogenet, og for hvilken den angjeldende cDNA er tilgjengelig.
Slike vaksine- (eller immunogene) sammensetninger formuleres i en egnet avleveringsbærer som beskrevet ovenfor. Generelt ligger dosene for de immunogene preparatene i de områder definert ovenfor for terapeutiske preparater. Immunitetsnivå for det valgte gen kan overvåkes for bestemmelse av behov for forsterkere, om dette er nødvendig. Etter bedømmelse av antistofftitere i serum, kan det evt. være ønskelig ned forsterket immunisering.
Eventuelt kan et vaksinepreparat ifølge oppfinnelsen formuleres til å inneholde andre komponenter, inkludert for eksempel adjuvanser, stabilisatorer, pH-justerere, preserveringsmidler og liknende. Slike komponenter er velkjente på vaksineområdet. Eksempler på egnete adjuvanser inkluderer uten begrensning, liposomer, alum, monofosforyl lipid A og enhver biologisk aktiv faktor som cytokin, interleukin, kemokin, ligander og evt. kombinasjoner derav. Visse av disse biologisk aktive faktorer kan uttrykkes in vivo, for eksempel via et plasmid eller en viral vektor. En slik adjuvans kan for eksempel administreres med en primer DNA-vaksine som koder et antigen for å øke den antigenspesifikke immunrespons sammenliknet med den immunrespons som genereres ved priming med en DNA-vaksine som kun koder antigenet.
De rekombinante adenovirus kan administreres i en "immunogenisk mengde", dvs. en mengde rekombinant adenovirus som er effektiv i en administreringsvei for å transfektere de ønskete celler og gi tilstrekkelige nivåer av ekspresjon av det valgte gen for å indusere en immunrespons. Der beskyttende immunitet er til stede, blir de rekombinante adenovirus betraktet som vaksinepreparater nyttige for prevensjon av infeksjon og/eller gjenopptredende sykdom.
Alternativt eller i tillegg kan vektorene ifølge oppfinnelsen inneholde et transgen som koder et peptid, polypeptid eller protein som induserer en immunrespons mot et valgt immunogen. De rekombinante adenovirus ifølge oppfinnelsen er ventet å være meget effektive i indusering av cytolytiske T-celler og antistoffer mot det innførte, heterologe antigeniske protein som uttrykkes av vektoren.
For eksempel kan immunogener være valgt fra et antall virale familier. Eksempler på ønskelige, virale familier mot hvilke en immunrespons ville være ønskelig, inkluderer picornavirusfamilien, som inkluderer genera rhinovirus som er ansvarlige for omkring 50 % av tilfellene av vanlig forkjølelse; genera enterovirus som inkluderer poliovirus, coxsackievirus, echovirus, og humant enterovirus som hepatitt-A-virus; og genera aptovirus, som er ansvarlige for munn- og klovsyke, primært i ikke-humane dyr. Innen picornavirusfamilien av virus inkluderer målantigenene VP1, VP2, VP3, VP4 og VPG. En annen viral familie inkluderer calsivirusfamilien, som omfatter Norwalk-gruppen av virus som er et viktig kausativt middel ved epidemisk gastroenteritt. Ytterligere en viralfamilie som er ønskelig for anvendelse ved målrettete antigener for indusering av immunresponer hos mennesker og ikke-humane dyr er togavirusfamilien, som inkluderer genera alfavirus, som inkluderer Sindbisvirus, RossRiver virus, og venezuelisk, østlig og vestlig Equine encefalitt, og rubivirus, inkludert Rubellavirus. Flaviviridae-familien inkluderer denguefeber, gulfeber, japansk encefalitt, St. Louis encefalitt og flåttbåret encefalittvirus. Andre målantigener kan genereres fra hepatitt-C eller coronavirus-familien som inkluderer et antall ikke-humane virus som infeksiøst bronkittvirus (fjærfe), porcin transmitterbar gastroenterisk virus (gris), porcin hemagglutinerende encefalomyelittvirus (gris), felin infeksiøs peritonittvirus (katter), felin enterisk koronavirus (katter), valpekoronavirus (hunder), og humant respiratorisk koronavirus, som kan forårsake vanlig forkjølelse og/eller ikke-A-, B- eller C-hepatitt. Innen koronavirusfamilien inkluderer målantigenene El (også benevnt M- eller matriksprotein), E2 (også benevnt S- eller Spikeprotein), E3 (også benevnt HE- eller hemagglutinelterose) glykoprotein (ikke til stede i alle koronavirus) eller N (nukleokapsid). Ytterligere andre antigener kan målrettes mot rabdovirusfamilien, som inkluderer genera vesiculovirus (for eksempel vesiculært stomatittvirus), og den generelle lyssavirus (for eksempel rabies). Innen rabdovirusfamilien kan egnete antigener avledes fra G-proteinet eller N-proteinet. Familien filoviridae som inkluderer hemorragisk febervirus, slik som Marburg- og Ebolavirus, kan være en egnet kilde for antigener. Paramyxovirusfamilien inkluderer parainfluensavirus type-1, parainfluensavirus type-3, bovint parainfluensavirus type-3, rubulavirus (kusmavirus), parainfluensavirus type-2, parainfluensavirus type-4, Newcastle sykdomsvirus (kylling), kvegpest, morbillivirus som inkluderer meslinger og canine distemper, og pneumovirus som inkluderer respiratorisk syncytialvirus. Influensaviruset er klassifisert innen familien ortomyxovirus og er en egnet kilde for antigener (for eksempel HA-proteinet, NI-proteinet). Bunyavirusfamilien inkluderer genera bunyavirus (California encefalitt, La Crosse), flebovirus (Rift Valley feber), hantavirus (puremala er et hemahagin-febervirus), nairovirus (Nairobi sauesykdom) og forskjellige ikke-betegnete bungavirus. Arenavirusfamilien tilveiebringer en kilde for antigener mot LCM og Lassa febervirus. Reovirusfamilien inkluderer genera reovirus, rotavirus (som forårsaker akutt gastroenteritt hos barn), orbivirus, og cultivirus (Colorado flåttfeber, Lebombo (mennesker), ekvin encefalose, blåtunge).
Retrovirusfamilien inkluderer subfamilien oncorivirinal som omfatter slike human- og veterinærsykdommer som felin leukemivirus, HTL-VI og HTL-VIJ, lentivirinal (som inkluderer humant immunodefektvirus (HIV), simian immunodefektvirus (SIV), felin immunodefektvirus (FIV), ekvin infeksiøst anemi vi rus og spumavirinal). Blant lentivirus er mange egnete antigener beskrevet og kan lett velges. Eksempler på egnete HJV- og SlV-antigener inkluderer uten begrensning gag-, pol-, Vif-, Vpx-, VPR-, Env-, Tat-, Nef- og Rev-proteiner, så vel som forskjellige fragmenter derav. For eksempel kan egnete fragmenter av Env-proteinet inkludere enhver av dets subenheter som gpl20, gpl60, gp41 eller mindre fragmenter derav, for eksempel med en lengde på minst omkring 8 aminosyrer. Tilsvarende kan fragmenter av tat-proteinet velges. (Se for eksempel U.S. patent nr. 5 891 994 og U.S. patent nr. 6 193 981). Se også HIV- og SIV-proteinene som beskrevet av D. H. Barouch etal., J. Virol., 75(5):2462-2467 (mars 2001), og R. R. Amara et al, Science, 292:69-74 (6. april 2001). I et annet eksempel kan HIV- og/eller SIV-immunogene proteiner eller peptider benyttes for å danne fusjonsproteiner eller andre immunogene molekyler. Se for eksempel HJV-1 Tat-og/eller -Nef fusjonsproteiner og immuniseringsregimene beskrevet i WO 01/54719, publisert 2. august 2001, samt WO 99/16884, publisert 8. april 1999. Oppfinnelsen er ikke begrenset til de HIV- og/eller SIV-immunogene proteiner eller -peptider som er beskrevet heri. I tillegg er en varietet av modifikasjoner av disse proteiner beskrevet eller kan lett oppnås av fagmannen. Se for eksempel det modifiserte gag-protein beskrevet i U.S. patent nr. 5 972 596. Videre kan ethvert HIV- og/eller SIV-immunogen avleveres alene eller i kombinasjon. Slike kombinasjoner kan inkludere ekspresjon fra en eller flere vektorer. Eventuelt kan andre kombinasjoner involvere avlevering av ett eller flere uttrykte immunogener ved avlevering av ett eller flere av immunogenene i proteinform. Slike kombinasjoner er diskutert i større detalj nedenfor.
Papovavirusfamilien inkluderer subfamilien polyomavirus (BKU- og JCU-virus), og subfamilien papillomavirus (assosiert med cancere eller malign progresjon av papilloma). Adenovirusfamilien inkluderer virus (EX, AD7, ARD, O.B.) som forårsaker respiratorisk sykdom og/eller enteritt. Parvovirusfamilien inkluderer felint parvovirus (felint enteritt), felint panleucopeniavirus, valpeparvovirus og porcint parvovirus. Herpesvirusfamilien inkluderer subfamilien a-herpesvirinae, som omfatter genera simpleksvirus (HSV-I, HSV-II), varicellovirus (pseudorabies, varicella zoster) og subfamilien P-herpesvirinae, som inkluderer genera cytomegalovirus (HCMV, muromegalovirus) og subfamilien y-herpesvirinae, som inkluderer genera lymphocryptovirus, EBV (Burkitts lymfom), infeksiøs rhinotracheitis, Mareks sykdomsvirus og radinovirus. Poxvirusfamilien inkluderer subfamilien kordopoxvirinae, som omfatter genera ortopoxvirus (Variola (småkopper) og vaccinia (kukopper)), parapoxvirus, avipoxvirus, capripoxvirus, leporipoxvirus, suipoxvirus og subfamilien entomopoxvirinae. Hepadnavirusfamilien inkluderer Hepatitt-B-virus. Et ikke-klassifisert virus som kan være en egnet kilde for antigener er hepatitt 8-virus. Ytterligere andre virale kilder kan inkludere avianinfeksiøst, bursalt sykdomsvirus og porcin respiratorisk- og reproduktivt syndromvirus. A-virusfamilien inkluderer ekvin arteritt virus og forskjellige encefalittvirus.
Foreliggende oppfinnelse omfatter også immunogener nyttige for immunisering av et menneske eller ikke-humant dyr mot andre patogener, inkludert bakterier, sopp, parasittmikroorganismer eller multicellulære parasitter som infiserer human- og ikke-human vertebrater, eller fra en cancercelle eller tumorcelle. Eksempler på bakterielle patogener inkluderer patogeniske grampositive cocci, inkludert pneumokokker; stafylokokker; og streptokokker. Patogene gramnegative cocci inkluderer meningokokker; gonokokker. Patogene enteriske gramnegative bacilli inkluderer enterobacteriaceae; pseudomonas, acinetobacteria og eikenella; melioidosis; salmonella; shigella; haemophilus; moraxella; H. ducreyi (som forårsaker sj anker); brucella; Franisella tularensis (som forårsaker tularemi); yersinia (pasteurella); streptobacillus moniliformis and spirillum; grampositive bacilli inkluderer listeria monocytogener; erysipelothrix rhusiopathiae; Corynebacterium diphtheria (difteri); cholera; B. anthracis (antrax); donovanosis (granuloma inguinale); og bartonellose. Sykdommer forårsaket av patogene anaerobe bakterier inkluderer tetanus; botulisme; andre klostridia; tuberkulose; lepra; og andre mycobakterier. Patogene spirochetale sykdommer inkluderer syfilis; treponematoser; yaws, pinta og endemisk syfilis; og leptospirose. Andre infeksjoner som forårsakes av høyere patogene bakterier og patogenisk sopp inkluderer actinomycosis; nocardiosis; cryptococcosis, blastomycosis, histoplasmosis og coccidioidomycosis; candidiasis, aspergjllosis og mucormycosis; sporotrichosis; paracoccidiodomycosis, petriellidiosis, torulopsosis, mycetoma og chromomycosis; og dermatophytosis. Rickettsiale infeksjoner inkluderer tyfusfeber, Rocky Mountain flekkfeber, Q-feber og rickettsiale kopper. Eksempler på mykoplasma og klamydisk infeksjon inkluderer: mykoplasma pneumoniae; lymphogranuloma venereum; psittacosis; og perinatalklamydial infeksjoner. Patogene eukaryoter omfatter patogene protozoer og helminter og infeksjoner dannet av disse inkluderer: amebiasis; malaria; leishmaniasis; trypanosomiasis; toxoplasmosis; Pneumocystis carinii; Trichans; toxoplasma gondii; babesiosis; giardiasis; trichinosis; filariasis; schistosomiasis;
nematodes; trematodes eller flukes; og cestode (bendelorm) infeksjoner.
Mange av disse organismer og/eller toksiner som fremstilles av disse er identifisert av "Centers for Disease Control" [(CDC), Department of Health and Human Services, USA], som midler som har et potensiale for anvendelse ved biologiske angrep. For eksempel inkluderer noen av disse biologiske midler Bacillus anthracis (antrax), Clostridium botulinum og dettes toksin (botulisme), Yersinia pestis (pest), vari ola major (småkopper), Francisella tularensis (tularemi), og viral hemorrhagjsk feber [filovirus (for eksempel Ebola, Marburg], og arenavirus [for eksempel Lassa, Machupo]), der alle i dag er klassifisert som midler kategori A; Coxiella burnetti (Q-feber); Brucella specier (brucellose), Burkholderia matlei (snive), Burkholderiapseudomallei (meloidosis), ricinus communis og dennes toksin (ricintoksin), Clostridium per/ ringens og dennes toksin (epsilontoksin), Staphylococcus species og deres toksiner (enterotoksin B), Chlamydiapsittaci (psittacosis), vannsikkerhetstrusler (for eksempel Vibrio cholerae, Crytosporidium parvum), tyfusfeber ( Richettsiapowozekil), og viral encefalitt
(a-virus, for eksempel Venezuelansk ekvin encefalitt; østlig ekvin encefalitt; vestlig ekvin encefalitt); alle i dag klassifisert som midler kategori B; og nipanvirus og hantavirus som i dag er klassifisert som midler kategori C. I tillegg kan andre organismer som er klassifisert slik eller på annen måte identifiseres og/eller benyttes for et slikt formål i fremtiden. Det vil være lett forståelig at de virale vektorer og andre konstruksjoner som er beskrevet her er nyttige for å avgi antigener fra disse organismer,
virus, deres toksiner eller andre biprodukter, som vil forhindre og/eller behandle infeksjoner eller andre ugunstige reaksjoner med disse biologiske midler.
Tilførsel av vektorene ifølge oppfinnelsen for avlevering av immunogener mot det variable området av T-cellene, utløser en immunrespons inkludert CTL for å eliminere disse T-celler. I RA er flere spesifikt variable områder av TCR som er involvert i sykdommen,karakterisert. Disse TCR inkluderer V-3, V-14, V-17 og Va-17. Avlevering av en nukleinsyresekvens vil således som koder minst ett av disse polypeptider, utløse en immunrespons målrettet mot T-celler involvert i RA. Ved MS er flere spesifikt variable områder av TCR som er involvert i sykdommenkarakterisert. Disse TCR inkluderer V-7 og Va-10. Avlevering av en nukleinsyresekvens som koder minst ett av disse polypeptider, vil således utløse en immunrespons målrettet mot T-celler involvert i MS. I skleroderma er detkarakterisertflere spesifikt variable områder av TCR involvert i sykdommen. Disse TCR inkluderer V-6, V-8, V-14 og Va-16, Va-3C, Va-7, Va-14, Va-15, Va-16, Va28 og Va-12. Avlevering av en rekombinant simianadenovirus som koder minst ett av disse polypeptider vil således utløse en immunrespons målrettet mot T-celler involvert i skleroderma.
C. Ad-formidlete avleveringsfremgangsmåter
De terapeutiske nivå eller immunitetsnivå for det valgte gen kan overvåkes for bestemmelse av behov for forsterkere om nødvendig. Etter en bedømmelse av CD8<+>T-cellerespons eller eventuelt antistofftitrene i serum, kan eventuelle forsterkerimmuniseringer være ønskelig. De rekombinante simianadenovirale vektorer ifølge oppfinnelsen kan eventuelt avleveres i en enkelt tilførsel eller i forskjellige kombinasjonsregimer, for eksempel i kombinasjon med et regime eller et behandlingsforløp som involverer andre aktive bestanddeler, eller i et prime-boost regime. Et antall slike regimer er beskrevet i teknikken og kan lett velges.
For eksempel kan prime-boost regimer involvere tilførsel av en DNA (for eksempel plasmid) basert vektor for å prime immunsystemet for en andre boostertilførsel med et tradisjonelt antigen, for eksempel et protein eller et rekombinant virus som bærer sekvensene som koder et slikt antigen. Se for eksempel WO 00/11140, publisert 2. mars 2000 for detaljer. Alternativt kan et immuniseringsregime involvere tilførsel av en rekombinant simianadenoviralvektor ifølge oppfinnelsen for å forsterke immunresponsen mot en vektor (enten viral eller DNA-basert) som bærer et antigen eller et protein. I nok et alternativ involverer et immuniseringsregime tilførsel av et protein, fulgt av forsterkning med en vektor som koder antigenet.
I en utførelsesform tilveiebringer oppfinnelsen en fremgangsmåte for priming og forsterkning av en immunrespons mot et valgt antigen ved å avlevere en plasmid-DNA-vektor som bærer nevnte antigen, fulgt av forsterkning med en simianadenoviral vektor ifølge oppfinnelsen. I en utførelsesform involverer prime-boost regimet ekspresjon av flere proteiner fra prime- og/eller boost-bæreren. Se for eksempel R. R. Amara, Science, 292:69-7' 4 (6. april 2001) som beskriver et multiproteinregjme for å uttrykke proteinsub enhetene som kan anvendes for å generere en immunrespons mot HIV og SIV. For eksempel kan en DNA-prime avgi Gag, Pol, Vif, VPX, Vpr, Env, Tat og Rev fra et enkelt transkript. Alternativt blir SIV Gag, Pol og HIV-1 Env avgitt i en rekombinant adenoviruskonstruksjon ifølge oppfinnelsen. Ytterligere andre regimer er beskrevet i WO 99/16884 og WO 01/54719.
Prime-boost regimer er imidlertid ikke begrenset til immunisering mot HIV eller avlevering av disse antigener. For eksempel kan priming involvere avlevering med en første sjimpvektor ifølge oppfinnelsen, fulgt av forsterkning med en andre sjimpvektor, eller med et preparat inneholdende antigenet selv i proteinform. I et eksempel kan prime-boost regimet tilveiebringe en beskyttende immunrespons mot viruset, bakterien eller andre organismer, hvorfra antigenet er avledet. I en annen ønskelig utførelsesform tilveiebringer prime-boost regimet en terapeutisk effekt som kan måles ved anvendelse av konvensjonelle analyser for påvisning av nærværet av tilstanden, mot hvilken terapi administreres.
Primingpreparatet kan administreres ved forskjellige seter i kroppen på doseavhengig måte, som avhenger av antigenet, som den ønskete immunresponsen målrettes mot. Oppfinnelsen er ikke begrenset til mengde eller sete for en eller flere injeksjoner eller den farmasøytiske bærer. Tvert imot kan regimet involvere et priming- og/eller forsterkertrinn, der hvert trinn kan inkludere en enkelt dose eller en dosering som administreres hver time, daglig, ukentlig, månedlig eller årlig. Som et eksempel kan pattedyr motta en eller flere doser inneholdende mellom omkring 10 ug til omkring 50 ug plasmid i bærer. En ønskelig mengde av et DNA-preparat ligger mellom omkring 1 Hg til 10000 ug av DNA-vektoren. Doser kan variere fra omkring 1 ug til 1000 ug DNA/kg individkroppsvekt. Mengde eller sete for avlevering velges helst basert på pattedyrets identitet og tilstand.
Vektorens doseringsenhet som er egnet for avlevering av antigenet til pattedyret, er beskrevet heri. Vektoren tilpasses for tilførsel ved suspendering eller oppløsning i en farmasøytisk eller fysiologisk akseptabel bærer som isotonisk saltoppløsning, oppløsning av isotoniske salter eller andre formuleringer som vil være åpenbare for fagmannen. Den egnete bærer vil være åpenbar for fagmannen, og vil i stor del avhenge av tilførselsvei. Preparatene ifølge oppfinnelsen kan administreres til et pattedyr i henhold til de veier beskrevet ovenfor, i en formulering med opprettholdt frigivelse ved anvendelse av en bionedbrytbar, biokompatibel polymer, eller ved på-stedet-avlevering ved anvendelse av misceller, géler og liposomer. Eventuelt inkluderer primingstrinnet ifølge oppfinnelsen også tilførsel med primingspreparat av en egnet adjuvansmengde, som definert heri.
Fortrinnsvis blir forsterkerpreparatet tilført omkring 2 til 27 uker etter tilførsel av primingspreparatet til pattedyrindividet. Administreringen av forsterkerpreparatet gjennomføres ved anvendelse av en effektiv mengde av et forsterkerpreparat, inneholdende eller i stand til å avlevere det samme antigen som ble tilført ved priming DNA-vaksinen. Forsterkingspreparatet kan bestå av en rekombinant viral vektor, avledet fra den samme virale kilde (for eksempel adenovirale sekvenser ifølge oppfinnelsen) eller fra en annen kilde. Alternativt kan "forsterkerpreparatet" være et preparat inneholdende det samme antigen som kodet i priming-DNA-vaksinen, men i form av et protein eller peptid, der preparatet induserer en immunrespons hos verten. I en annen utførelsesform inneholder forsterkerpreparatet en DNA-sekvens som koder antigenet under kontroll av en regulatorisk sekvens som styrer ekspresjonen i en pattedyrcelle, for eksempel vektorer som velkjente bakterievektorer eller virale vektorer. Primærkravene for forsterkerpreparatet er at preparatets antigen er det samme antigen, eller et kryssreaktivt antigen som det som kodes av primingpreparatet.
I en annen utførelsesform er simianadenoviralvektorene ifølge oppfinnelsen også velegnete for anvendelse i et antall andre immuniseringer og terapeutiske regimer. Slike regimer kan involvere avlevering av simianadenovirale vektorer ifølge oppfinnelsen, samtidig eller sekvensielt med Ad-vektorer eller forskjellige serotypekapsider, regimer der de adenovirale vektorer ifølge oppfinnelsen avgis samtidig eller sekvensielt med ikke Ad-vektorer, regimer der de adenovirale vektorene ifølge oppfinnelsen avgis samtidig eller sekvensielt med proteiner, peptider og/eller andre biologisk nyttige terapeutiske eller immunogene forbindelser. Slike anvendelser vil være åpenbare for fagmannen.
De følgende eksempler illustrerer kloning av simianadenovirus og konstruksjonen av eksempler på rekombinante adenovirusvektorer ifølge oppfinnelsen. Disse eksemplene er kun illustrerende og skal ikke på noen måte begrense oppfinnelsens ramme.
Eksempel 1 - Viral propagering
Pan5 (ATCC aksessnr. VR-591)-, Pan6 (ATCC aksessnr. VR-592)- ogPan7 (ATCC aksessnr. VR-593)-virus, opprinnelig isolert fra sjimpanse lymfeknuter, ble propagert i 293-celler (ATCC CRL1573). Typisk ble disse celler dyrket i Dulbeccos modifiserte Eagles medium (DMEM; Sigma, St. Louis, MO), supplert med 10 % føtalt kalveserum (FCS) (Sigma eller Hyclone, Logan, UT) og 1 % penicillin-streptomycin (Sigma). Infeksjon av 293-celler gjennomføres i DMEM supplert med 2 % FCS de første 24 timer, hvoretter FCS tilsettes for å bringe sluttkonsentrasjonen til 10 %. Infiserte celler høstet når 100 % av cellene viser virusindusert, cytopatisk effekt (CPE), og de samles deretter og konsentreres ved sentrifugering. Cellepelletts resuspenderes i 10 mM Tris (pH 8,0) og lyseres ved tre sykluser av frysing/tining. Virusprepareringen oppnås etter to ultrasentrifugeringstrinn på cesiumkloridtetthetsgradienter, og virusforråd fortynnes til 1 til 5xl012 partikler/ml i 10 mM Tris/100 mMNaCl/50 % glyserol og lagres ved -70
°C.
293-cellenes evne til å propagere disse adenovirus overskred forventningene som var basert på kunnskap om andre ikke-humane adenovirusserotyper.
Eksempel 2 - Karakterisering av viralgenomisk- DNA
Genomisk DNA ble isolert fra rensete viruspreparater fra eksempel 1 og fordøyd med Hindni- eller 5a/wHl-restriksjonsenzymer i henhold til produsentenes anbefalinger.
Resultatene (ikke vist) viste at Pan5-, Pan6-, Pan7-genomene ifølge oppfinnelsen og det publiserte Pan9 (C68) genomet viste forskjellige restriksjonsmønstre og derfor er distinkte fra hverandre.
Nukleotidsekvensene for Pan5, Pan6 og Pan7 ble bestemt. Nukleotidsekvensen for topptråden av Pan5 DNA er angitt i SEKV. ID nr. 1. Nukleotidsekvensen for topptråden av Pan6 DNA er angitt i SEKV. ID nr. 5. Nukleotidsekvensen for topptråden av Pan7 DNA er angitt i SEKV. ID nr. 9.
Regulatoriske, kodende områder i de virale DNA-sekvenser ble identifisert ved homologi til kjente adenovirale sekvenser ved anvendelse av "Clustal W"-programmet som beskrevet ovenfor ved konvensjonelle innstillinger. Se tabellene ovenfor som gir de adenovirale sekvenser. Åpne leserammer ble translaterte og de predikterte aminosyresekvenser undersøkt for homologi med tidligere beskrevne adenovirale proteinsekvenser, Ad4, Ad5, Ad7, Adl2 og Ad40.
Analyse av sekvensen viste en genomorganisering tilsvarende den til stede i humant adenovirus med den største likhet overfor humant Ad4. Imidlertid ble vesentlige forskjeller i de hexonhypervariable områder registrert mellom sjimpanseadenovirus og andre kjente adenovirus inkludert AdHu4. Disse forskjeller passer godt inn i de serologiske kryssreaktivitetsdata som er oppnådd (se nedenfor).
En oppstilling av en del av hexonsekvensene er vist i figur 1. Delen som vises er fra området av hexon som tilsvarer de utover anordnete forlengete løkker DEI og FG1, der den største variabiliteten mellom serotypene observeres.
En intervenerende del som bidrar til hexonbasen (tilsvarende residier 308-368 av den publiserte AdC68-sekvens; U.S. patent nr. 6 083 716), og er høyt bevart mellom serotypene, er også til stede. Den påfølgende tabell oppsummerer parvise sammenlikninger av aminosyrene i hexonproteinene.
Analyse av fiberknoppdoménet (som er ansvarlig for reseptorbindingen) i sjimpanseadenovirus, viser en total likhet i struktur (figur 2).
Graden av sekvenslikhet mellom El-proteinene av HuAd5 og C68 (se tabellene nedenfor) er lik dem mellom HuAd5 og Pan5, Pan6 og Pan7.
Replikasjonsdefekte versjoner av AdC5, AdC6 og AdC7 ble skapt ved molekylære kloningsmetoder som beskrives i de påfølgende eksempler, hvori minigenkassetter ble innskutt i stedet for Ela- og Elb-genene. De molekylære kloner av de rekombinante virus ble ivaretatt og dyrket i 293-celler for storskalarensing ved anvendelse av den publiserte cesiumklorid (CsCl) sedimenteringsmetode (K. Fisher et al, J. Virol., 70:520
(1996)). Vektorutbyttene var basert på 50 plate (150 mm) preps hvor omkring
lxlO<9>293-celler ble infisert med de tilsvarende virus. Utbytter ble bestemt ved å måle viralpartikkelkonsentrasj onene spektrofotometrisk. Etter å ha konstruert El-deleterte vektorer, ble det bestemt at HEK293-celler (som uttrykker humant adenovirusserotype 5E1-funksjoner) transkompletterer El-delesjonene av de nye virale vektorer og tillater produksjon av høytiterforråd. Eksempler på virusutbytter for noen av disse rekombinante virus er vist i tabellen nedenfor.
Transgenene for disse vektorene, P-galaktosidase (LacZ), grønt fluorescentprotein (GFP), a-l-antitrypsin (Al AT), ebola glykoprotein (ebo), et løselig ebola-glykoproteinvariant som mangler transmembrane og cytoplasmiske doméner (sEbo), og tre delesjonsmutanter av ebola glykoproteinet (EboA2, EboA3 og EboA4), ble uttrykt av cytomegaloviruspromoteren (CMV).
Evnen hos humant adenovirus El til å transkomplettere de El-deleterte sjimpansevirus ifølge oppfinnelsen, er meget fordelaktig da dette tillater produksjon av El-deleterte sjimpanseadenovirale vektorer ifølge oppfinnelsen, mens risikoen for homolog rekombinering pga. sekvensforskjellene mellom humant Ad og det her beskrevne sjimpanseadenovirus reduseres eller elimineres.
Eksempel 3 - Serologiske studier av Pan5-. Pan6- oe Pan7- virus På grunn av forskjellene i det hexonhypervariable området, ble det antatt C5-, C6- og C7-virus ville være serologjsk distinkte fra humant adenovirus inkludert AdHu4.
1. Kryssreaktivitet av villtypevims
Replikasjonskompetente virus ble anvendt for avsøking av villtypevims for å bestemmelse av antistoff kryssreaktivitet, og inhibering av cytopatiske defekter (CPE) ble målt. Kort sagt ble preparater av adenovirus (AdHu5, Pan5, Pan6, Pan7 og AdC68) lagret ved 5xl0<12>partikler/ml, fortynnet 1/600 for analysene. Denne konsentrasjonen av virus ble valgt fordi den resulterte i 100 % CPE innen 48 timer i fravær av nøytralisering. Før tilsetting av virus til 293-celler (4xl0<4>celler/brønn i en 96 brønners plate) ble det tilsatt 1:20 fortynninger av serum. Analysene avleses som nærvær eller fravær av CPE; full nøytralisering avleses som ingen cytopatisk effekt. Resultatene er oppsummert i tabellen nedenfor. Det faktum at 9/36 humant serum nøytralisert AdHu5 induserte CPE, er konsistent med tidligere beregninger for nøytraliserende antistoffer i den humane populasjon. Tallene antyder det totale antall individer som viser nøytralisering (numerator) versus det totale antall avsøkt (denominator). ND = ikke bestemt.
Av alle avsøkte humane sera var 35/36 negative for nøytralisering mot AdC68, mens 36/36 var negative for nøytralisering mot Pan5, Pan6 og Pan7. Av 52 rhesusaper som ble avsøkt, viste ingen nøytralisering mot noe sjimpanseadenovirus; rhesusapen er den foretrukne prekliniske modell for evaluering av HIV-vaksine. Mellom 9 og 12 av 20 sjimpanser hadde vesentlig nøytralisering mot én eller en annen av sjimpanseadenoviruser, er konsistent med det faktum at disse i realiteten er endemisk sjimpanse-spesifikke patogener. Interessant er at det er sjimpanser med nøytraliserende antistoffer kun mot Pan5, Pan6 eller AdC68, noe som støtter hypotesen at flere av disse sjimpanseadenovirale vektorer ikke vil kryssnøytralisere hverandre, og er distinkte serotyper.
Den samme analyse ble gjennomført for 20 sjimpanseserumprøver. Femti prosent (50 %) av prøvene reagerte serologt, i forskjellig grad mot Pan5; 40 % mot Pan6; 55 % mot
Pan7 og 60 % mot C68. Blant de positive serumprøver hadde én av dem sterk nøytraliserende aktivitet mot alle fire sjimpvirus.
2. Kryssnøytralisering med rekombinante virus
Høytiterpolyklonale antistoffer ble oppnådd mot hver av simianadenovirusene for mer nøyaktig måling av graden av kryssnøytralisering blant de forskjellige serotyper. Dette ble gjort ved intramuskulær immunisering av kaniner ved anvendelse av rekombinant virus inneholdende GFP basert på det tidligere beskrevne C68-sjimpanseadenovirus som en adjuvans. Dette serum ble så anvendt for å analysere på nøytraliserende aktivitet mot hver av de tre sjimpanseadenoviruser ifølge oppfinnelsen, AdC5, AdC6 og AdC7. En kanin ble injisert med 5xl0<12>viralpartikler pr. kg med C68CMVGFP-vektor intramuskulært og forsterket fem uker senere ved anvendelse av den samme dosen. En blodtapping samlet den niende uke viste ekstremt potent nøytraliserende aktivitet mot C68 så vel som Pan5 og Pan7, men ikke mot Pan6 (se tabellen nedenfor), noe som antyder at administreringen av en C68 (eller Pan5- eller Pan7-) basert vaksine kunne være effektiv, etterfulgt av anvendelse av en vektorbasert forsterkning på Pan6. Imidlertid er det funnet at dette nivå av inter-sammenheng ikke nødvendigvis er forhindrende for readministrering i en innstilling der antivirale antistofftitere ikke var så høye som det ble oppnådd i denne kaninen. I den påfølgende tabellen indikerer + 33 % CPE; ++ 66 % CPE; +++ 100 % CPE.
3. Kvantitativ analyse for påvisning av nøytraliserende antistoff Resultatene ble vurdert ved en mer kvantitativt basert analyse for å påvise nøytraliserende antistoff, og som er basert på transduksjon av en GFP-vektor. Kort sagt ble grupper av C57BL76-mus immunisert intramuskulært eller intravenøst med 5,0xl0<10>partikler/ml Pan5, Pan6, Pan7 eller C68. Sera fra tappinger på dag 28 ble testet på kryssnøytraliserende aktivitet mot C68CMVEGFP ved fortynninger på 1:20 og 1:80. Som en oppsummering og når et farmasøytisk preparat av human immunglobulin ble testet på serologiske reaksjoner mot Pan5, Pan6, Pan7 og C68, ble det påvist noen lave nivå av nøytraliserende aktiviteter mot Pan7 og C68. Det ble ikke påvist noen nøytraliserende aktivitet mot Pan5 eller Pan6. Serumprøver fra 36 humane individer ble kjørt gjennom den samme analyse. Serumprøvene ble testet i en fortynning på 1:20. Resultatene antydet at kun ett individ hadde klar nøytraliserende aktivitet mot C68. Det ble ikke påvist nøytraliserende aktivitet for Pan5, Pan6 eller Pan7.
4. In vitro kryssnøytralisering
Kryssnøytralisering av simianadenovirusene ved høytiterkaninpolyklonale antistoff mot hver av adenovirusene Pan5, Pan6, Pan7 og C68, ble undersøkt.
Kaninene ble immunisert med intramuskulære injeksjoner av IO<13>partikler av hver av sjimpanseadenovirusene og forsterket 40 dager senere med den samme dose med ufullstendig Freunds adjuvans. Sera ble analysert på nærvær av nøytraliserende antistoff ved inkubering av to ganger seriefortynninger med IO<9>genomkopier av hver av de egnete sjimpanseadenovirusvektorer som uttrykte GFP og testet på forminskning av GFP-ekspresjon anvendt på 293-celler. Serumfortynningen som produserte en 50 % reduksjon av GFP-ekspresjon, ble bedømt som den nøytraliserende antistofftiter mot det spesielle virus.
Resultatene er oppsummert i tabellen. Disse data er konsistente med forventningen fra sekvensanalyse av hexonaminosyresekvensene som antydet at Ad-Pan6 sannsynligvis ville være den mest serologt distinkte, sammenliknet med de andre sjimpanseadenovirus.
For bestemmelse av hvorvidt antistoffer som kryssreagerer med simianadenovirus sannsynlig ville være av lav prevalens i mennesker, ble simianadenovirus SVI, SV39 og SV25 testet for evnen til å motstå nøytralisering når de var inkubert med kommersielt tilgjengelige sammenslåtte immunglobuliner (lg). Den samme analyse ble også gjennomført med AdHu5 og sjimpanseadenovirus Pan5, Pan6, Pan7 og C68.1 en videre studie ble sera fra mus immunisert med en av sjimpansevirusene C5, C6, C7 og C68, og deres evne til å kryssnøytralisere simianadenovirusene SVI5, SV23, SA17 og bavianadenovirus ble testet. Det ble ikke observert noen kryssreaktivitet i noen av disse tilfeller.
Eksempel 4 - Generering av rekombinant El- deletert Pan5- vektor Et modifisert pX-plasmid ble fremstilt ved å ødelegge Fspl-setet i bla-genområdet av pX (Clontech) ved seterettet mutagenese. Det resulterende modifiserte plasmid, benevnt pX', er et sirkulært plasmid på 3000 bp inneholdende et fl-ori og et ampicillinresistensgen (AmpR-cds).
A. Fremstilling av Pan5 adenovimsplasmid
En polylinker for sekvensiell kloning av Pan5-DNA-fragmenter i pX' skapes.
Polylinkeren innsettes i stedet for den eksisterende pX'-polylinkeren etter fordøying
med Mlulog EcoRI. Det butte Fse/-fragmentet av Pan5 innføres i Smal- og Fsel- setene av polylinkeren. Fragmentet inneholder den 5' ende av det adenovirale genomet (bp 1 til 3606, SEKV. ID nr. 1). S«a5/-Fs/?/-fragmentet av Pan5 (bp 455 til 3484, SEKV. ID nr.
1) erstattes med en kort sekvens flankert av I- Ceu- og Pl- Sce- setene fra pShuttle (Clontech), for å eliminere El-området av det adenovirale genomet. Det butte EcoRI-fragmentet av Pan5 (bp 28658 til 36462, SEKV. ID nr. 1), innføres i EcoRI- og EcoRV-setene av polylinkeren (for å gi den 3' ende av det adenovirale genomet); Fsel- Mlul-fragment (bp 3606 til 15135, SEKV. ID nr. 1) innføres i polylinkeren; og MluI- EcoRI-fragmentet innføres i polylinkeren (bp 15135 til 28568, SEKV. ID nr. 1). Eventuelt innføres et ønsket transgen i I- Ceul- og Pl- Scel- setene i den nylig skapte pX'Pan5AEl-vektoren.
B. Alternativ fremgangsmåte for generering av pX' Pan5AEl
Det opprinneølige plasmid pX er avledet fra pAdX-adenovirusplasmid tilgjengelig fra Clontech, som beskrevet ovenfor. Deretter ble et PacI- XhoI- omrhåQ av pX' deletert og den butt-endete Pan5-polylinkeren ble innsatt i Fspl- setet for å generere pX'PLNK (2994 bp). Det 5' ende FseAområdet av Pan5 (bp 1-3607, SEKV. ID nr. 1), ble innsatt i Smal- og Fsel- setene av pX'LNK for å generere pX'Pan5-5' plasmid (6591 bp). SnaBi-AfafeT-området av pX'Pan5-5' ble utskåret og erstattet med Cew/Sce-kassetten, som var PCR-amplifisert fra pRCS for å skape pX'Pan5-5'AEl (4374 bp). I korthet ble en sekvens inneholdende I- Ceul- og Pl- Scel "sjeldne" kutterseter PCR-forsterket fra pRCS (3113 bp). Den 3' PCR-primeren ble innført ved et Nael- sete i PCR-produktet.
For å forlenge Pan5-DNA i pX'Pan5-5'AEl (4374 bp), ble Fse/-Mw/-området av Pan5 (bp 3607-15135, SEKV. ID nr. 1) tilsatt for å skape pX'Pan5-5'Mlu (15900 bp). Den gjenværende MluI-3' ende av Pan5-sekvensen (bp 15135-36462, SEKV. ID nr. 1), ble tilsatt til vektoren mellom Mlul- og EcoRV-setene av vektorpolylinkeren for å danne pX'Pan5AEl, som inneholder fullengde Pan5-sekvensen inneholdende en delesjon i El-området.
C. Generering av rekombinante virus
For å generere de rekombinante adenovirus fra pX'Pan5AEl, ble plasmidet transfektert med en hjelper som uttrykker El, eller fra en El-uttrykkende pakkende cellelinje, slik som 293-cellelinjen eller en cellelinje fremstilt som beskrevet heri. Ekspresjonen av El i den pakkende cellene tillater replikering og pakking av Pan5AEl i et virionkapsid. I en annen utførelsesform blir den pakkende cellen som er transfektert med pX'Pan5AEl transfektert med en adenovirusvektor, som beskrevet ovenfor, som bærer transgenet av interesse. Homolog rekombinering inntrer mellom hjelperen og plasmidet, noe som tillater adenovirustransgensekvensene i vektoren replikeres og pakkes i virionkapsider, noe som resulterer i det rekombinante adenovirus.
Transfeksjon etterfølges av et agaroverlegg i to uker, hvoretter virusene plakkes, ekspanderes og avsøkes for ekspresjon av transgenet. Flere ytterligere runder av plakkrensing etterfølges av ytterligere kulturekspansjon. Til slutt høstes cellene, en virusekstrakt fremstilles og det rekombinante sjimpanseadenovirus inneholdende det ønskete transgen renses ved utstrakt tetthetsultrasentrifugering i en CsCi-gradient eller ved alternative midler, velkjente for fagmannen.
Eksempel 5 - Generering av rekombinant El- deletert Pan6- vektor
A. Strategi for konstruksjon av Pan6-adenoviralt plasmid
1. Kloning av terminalfragmenter
Pan6-virus deproteineres ved pronase- og proteanase K-behandling og fenolekstrahering. Syntetiske 12 bp Pme-I-linkere ligeres på det virale DNA, som beskrevet avBerkner og Sharp i Nucleic Acids Research, 11:6003 (1983). Det virale DNA fordøyes så meddal for å isolere et 5' terminalfragment (6043 bp). Kå6Xbal 5' fragmentet ligeres så inn i pX-linken ved Smal- og Xbal- setene for å danne pX-AdPan6-0-16.5. Det virale DNA med Pmel-linkerne fordøyes også med PacI for å isolere 6475 bp 3' terminalfragmentet og klones inn i pX-linken ved PacI- og Smal-setene, noe som resulterer i pXAdPan6-82-100.
2. Delesjon av El fra den 5' klonen
For å deletere El (m.u. 1.2-9), ble BsiWi-^&al-fragmentet i pX-AdPan6-0-16.5 erstattet med et PCR-fragment som spenner over m.u.9-16.7-fragmentet behandlet med BsiWi og^&al, noe som fører til pX-Ad-Pan6 m.u.0-1, 9-16.5. 3. Fusjon av 5'- og 3 '- klonene og dannelse av et ankersete for å akseptere det midtre Hindlll- rfagmentet
Først blir 5' klonen pX-Ad-Pan6 m.u.0-1, 9-16.5 ytterligere ekspandert ved innføring av det andre^al-fragment (4350 bp, m.u. 16.5-28) fra Pan6-genomet inn i A7>al-setet i pXAd-Pan6 m.u.0-1, 9-16.5. Denne konstruksjonen kalles pXAd-Pan6-mu 0-1, 9-28.
Deretter blir 3'-klonen også ekspandert ved innføring av 15026 bp MluI/PacI-fragmentet som dekker m.u.41-82 fra Pan6-genomet i MluI/PacI-setene av pXAdPan6-82-100, for å generere pXAdPan6-m.u.41-100.
Så blir et 8167 bp HindUUEco 47IH Pan6-fragment isolert fra pXAd-Pan6-mu 01, 9-28 og subklonet inn i pXAdPan6-m.u.41-100 ved FlinsIII- og^Z>al-butt-setene. Denne 5'-og 3'-fusjonsklonen kalles pXAdPan6muO-l, 9-19.5, 64-100.
4. Dropping av det midtre genomfragmentet inn i fusjonsklonen
Et 16335 bp ///'«fiffn-fragment (m.u. 19.5-64) fra Pan6 innføres i i/z/fcÆD-setet av pXAdPan6muO-l, 9-19.5, 64-100 for å danne pXAdPan6-0-l, 9-100. 5. Innføring av en PKGFP- selektiv markør i det endelige konstrukt for direkte kloning av genet av interesse og grønn/ hvit seleksjon av rekombinante transformanter En minigenkassett som uttrykker GFP under en lac-promoter og som er flankert med gjenkjenningsseter for "sjeldne" intronkodende restriksjonsenzymer, PI-Sce I og I-Ceu L ble isolert fra pShuttle-pkGFP (bare) ved Sap I- og Dra JU-fordøying fulgt av innfyllingsreaksjon. pShuttle-pkGFP (bare) plasmidet er 4126 bp langt, og inneholder et ColEl-Ori, et kanamycinresistensgen, plac, en LacZ-promoter-GFPmut3-l eds (Clontech), og et GFPmut3-l eds (Clontech). Denne kassetten subklones inn i Srf I kuttet og avstumpet pXAdPan6-0-l, 9-100. Dette siste konstrukt kalles pX-Pan6-pkGFP m.u.0-1, 9-100, som er nyttig for generering av rekombinante El-deleterte Pan6-molekylære kloner som bærer gener av interesse ved direkte ligering og grønn/hvit seleksjon i kombinasjon med de generiske pShuttlepkGFP-vektorene.
B. Alternativ strategi for generering av Pan6-plasmid
1. Kloning av 5' terminalfragment
Pan6-viruset deproteineres ved pronase- og proteanase K-behandling og ved fenolekstrahering, som beskrevet ovenfor, og syntetiske 12 bp Pmel-linkere ligeres på det virale DNA, som beskrevet. AdPan6 5' ^Z>al-fragmentet isoleres og ligeres inn i pX for å danne pX-AdPan6-0-16.5 (9022 bp), som beskrevet i del A ovenfor.
2. Delesjon av El fra den 5' klon
For å deletere El (m.u. 1.2.-9), ble pX-AdPan6-0-16.5 fordøyd med SnaBI og Ndel for å fjerne områdene som koder Ela- og Elb-proteinene (3442-6310 bp). Denne vektor fordøyes deretter med BsiWI for å forberede avstumping med minigenkassetten som bærer en selektiv markør.
3. Innføring av en selektiv markør
En minigenkassett som uttrykker GFP under en lac-promoter og som er flankert med
gjenkjenningsseter for "sjeldne" intronkodende restriksjonsenzymer, PI-XceI og I-Ceul, ble isolert fra pShuttle-pkGFP, som beskrevet ovenfor. DRAIII-SapI-fragmentet ligeres så med den fordøyde pX-AdPan6-0-16.5 for å danne pX-AdPan6 MU 0-16.5AE1 (7749
bp).
4. Forlengelse av Pan6- adenovirale sekvenser
pX-AdPan6 MU 0-16.5AE1 ble underkastet ATørl-fordøying for å tillate innføring av en A7>aI-RsrII-linker. Et^Z>aI/RsrII fordøyingsfragment fra AdPan6-genomet ble isolert (mu 28-100, 26240 bp) og ligert inn i den Xba/RsrU-fordøyde pX-AdPan6 MU 0-16.5AE1 for å gi pX-AdPan6 MU 0-1, 9-16.5, 28-100. Et andre ^Jal-fragment fra Pan6-genomet (mu 16.5-28, 4350 bp) ble så legert inn i dette plasmid for å danne pX-AdPan6 MU 0-1, 9-100 (38551 bp).
C. Generering av rekombinante adenovirus
For å generere de rekombinante adenovirus fra et El-deletert Pan6-plasmid, fremstilt som beskrevet i del A eller b, ble plasmidet kotransfektert med en hjelper som uttrykker El, eller fra en El-uttrykkende pakkende cellelinje som en 293-cellelilnje eller en linje fremstilt som beskrevet her. Ekspresjonen av El i den pakkende cellen tillater replikering og pakking av Pan6-pkGFP mu.0-1, 9-100 inn i et virionkapsid. Alternativt blir den pakkende celle som er transfektert med px-Pan6-pkGFP mu.0-1, 9-100, transfektert med en adenovirusvektor, som beskrevet ovenfor og som bærer et annet transgen av interesse.
Eksempel 6 - Generering av rekombinant El- deletert Pan7- vektor A. Generering av Pan 7- plasmider
En syntetisk linker inneholdende restriksjonssetene PacI-Smal-Fsel-MluI-EcoRV-PacI ble klonet inn i pBR322 som var kuttet med EcoRI og Ndel. Den venstre ende (bpl til 3618) av AdPan7 ble klonet inn i linkeren mellom Smal- og Fsel-setene. Adenoviruset El ble så skåret ut fra den klonete, venstre ende ved kutting med SnaBI og Ndel og innføring av en I-CeuI-GFP-PI-Scel-kassett fra pShuttle (Clontech) i stedet. Det resulterende plasmid ble kuttet med Fsel og Mlul og et AdPan7-fragment Fsel (bp 3618) til Mlul (bp 155114) ble innført for å forlenge den venstre ende. Konstruktet (pPan7pGFP) ble fullført ved innføring av det 21421 bp AdPan7-høyre ende fragment fra Mlul-setet (bp 15114) inn i plasmidet ovenfor mellom Mlul og EcoRV for å generere en fullstendig molekylær klon av El-deletert adenovirus Pan7, egnet for generering av rekombinant adenovirus. Eventuelt innføres et ønsket transgen i I-Ceul-og PI-SceI-setene av det nydannete Pan7-vektorplasmidet.
B. Konstruering av El- deletert Pan 7- viralvektorer
For å generere rekombinante adenovirus fra pPan7AEl, ble plasmidet kotransfektert med en hjelper som uttrykker El, eller fra en El-uttrykkende pakkende cellelinje som en 293-cellelinje eller en cellelinje fremstilt som beskrevet her. Ekspresjonen av El iden pakkende celle tillater replikering og pakking av Pan7AEl inn i et virionkapsid. I andre utførelsesformer blir den pakkende celle som er transfektert med px'-Pan7AEl, transfektert med en adenovirusvektor, som beskrevet ovenfor og som bærer transgenet av interesse. Homolog rekombinering inntrer mellom hjelperen og plasmidet, noe som tillater at adenovirustransgensekvensen i vektoren kan replikeres og pakkes i virionkapsider og derved resultere i det rekombinante adenovirus. Transfeksjon og rensing gjennomføres som beskrevet ovenfor.
Eksempel 7 - Generering av plasmidvektorer som uttrykker El- genene Plasmidvektorer som koder Pan5-El-området konstrueres, og disse plasmider anvendes for å generere stabile cellelinjer som uttrykker virale El-proteiner.
Pan5-El-området klones inn i pX', i det vesentlige som beskrevet i eksempel 4 ovenfor, før erstatning av dette området med fragmentet fra pShuttle (Clontech). Ekspresjonsplasmidet inneholdende den Pan5 -adenovirale genomsekvens spenner over minst bp 1 til 3959 i den Pan5-genomiske sekvens. Ekspresjonsplasmidet inneholder således sekvensen som koder Ela og Elb av sjimpanse-Ad-Pan5 under kontroll av en heterolog promoter. Tilsvarende ekspresjonsplasmider kan genereres ved anvendelse av Ad-Pan6- og Ad-Pan7-El-områdene, identifisert i tabellene ovenfor.
Eksempel 8 - Generering av cellelinjer uttrykkende siimpanseadenovirus- El- proteiner Cellelinjer uttrykkende virale El-proteiner genereres ved å transfektere HeLa (ATCC aksess nr. CCL2) med plasmidet fra eksempel 6. Disse cellelinjer er nyttige for fremstilling av El-deleterte, rekombinante sjimpanseadenovirus ved ko-transfeksjon av genomisk viralt DNA og ekspresjonsplasmidene beskrevet ovenfor. Transfeksjon av disse cellelinjer så vel som isolering og rensing av rekombinant sjimpanseadenovirus fra disse utføres ved fremgangsmåter konvensjonelle for adenovirus, dvs. humane adenovirus (se for eksempel Horwitz, supra, og annen standard litteratur).
A. Cellelinjer uttrykkende Pan5- El- proteiner
HeLa-celler i 10 cm skåler transfekteres med 10 ug pX-Pan51-El-DNA ved anvendelse av et "Cellphect" sett (Pharmacia, Uppsala, Sverige) og ved å følge produsentens protokoll. 22 timer etter transfeksjon ble cellene underkastet et treminutters glyserol sjokk (15 % glyserol i Hepes-bufret saltoppløsning, pH 7,5), vasket en gang i DMEM (HeLa) eller F12K (A549; Life Technologies, Inc., Grand Island, NY)
mediasupplert med 10 % FCS, 1 % Pen-Strep, også inkubert i seks timer ved 37 °C i det ovenfor beskrevne media. De transfekterte cellene splittes så til duplikat 15 cm plater i
forhold på 1:20, 1:40, 1:80, 1:160 og 1:320. Etter inkubering ved 37 °C over natten ble disse media supplert med G418 (Life Technologies) i en konsentrasjon på 1 ug/ml. Disse media erstattes hver femte dag og kloner isoleres 20 dager etter transfeksjon.
HeLa-El-cell eki oner isoleres og analyseres for evne til å forbedre adenoassosiert virus (AAV) infeksjon og ekspresjon av rekombinant LacZ-protein, som beskrevet nedenfor.
B. AAV- forbedringsanalyse for å avsøke El- uttrykkende cellelinjer
AAV krever adenoviruskodete proteiner for å fullføre sin livssyklus. De adenovirale El-proteiner så vel som det E4-områdekodete ORF6-protein er nødvendig for forbedring av AAV-infeksjonen. En analyse for El-ekspresjon basert på AAV-forbedring anvendes. I korthet omfatter fremgangsmåten for å identifisere adenovirale El-uttrykkende celler trinnene å infektere i separate kulturer av en putativ adenovirus El-uttrykkende celle, og en celle ikke inneholdende noen adenovirussekvens, med både et adenovirusassosiert virus (AAV) som uttrykker et markørgen, og en AAV som uttrykker ORF6 av E4-genet av humant adenovirus i et egnet tidsrom. Markørgenaktiviteten i de resulterende celler måles og cellene med betydelig større målbar markøraktivitet enn kontrollcellene velges som bekreftete El-uttrykkende celler. I det påfølgende forsøk er markørgenet et lacZ-gen og markøraktiviteten er opptreden av blå farge.
For eksempel infekteres cellelinjene beskrevet ovenfor, såvel som ikke-transfekterte kontrollceller (HeLa), med 100 genomer pr. celle av en AAV-vektor som bærer et markørgen, for eksempel AV.LacZ [Fisher etal, J. Virol, 70:520 (1996)], og en AAV-vektor som uttrykker ORF6-området av humant 5 (AV.orfS). Plasmid-DNA-sekvensen genererer en ny, rekombinant adenoassosiert virus (rAAV) inneholdende LacZ-transgenet og Ad-E4-ORF6, som er en åpen leseramme hvis ekspresjonsprodukter letter enkeltrådet (ss) til dobbeltrådet (ds) konvertering av rAAV-genomisk-DNA. Disse vektorene inkuberes i medium inneholdende 2 % FCS og 1 % Pen-Strep ved 37 °C i fire timer, på hvilket tidspunkt et likt mediumvolum inneholdende 10 % FCS tilsettes. Det skal være klart for fagmannen på området at ethvert markørgen (eller rapportørgen) kan anvendes i den første AAV-vektoren i denne analysen, for eksempel alkalisk fosfatase, luciferase og andre. En antistoffenzymanalyse kan også anvendes for mengde-bestemmelse av antigennivåene, der markøren uttrykker et antigen. Analysen er ikke begrenset av identiteten av markørgenet. 20 til 24 timer etter infeksjon farges cellene for LacZ-aktivitet ved anvendelse av standard fremgangsmåter. Etter 4 timer observeres cellene mikroskopisk og cellelinjer med betydelig flere blå celler enn De A549- eller HeLa-cellekontrollene anføres som positive.
Eksempel 9 - Avlevering av transgen til vertscelle
Det resulterende, rekombinante sjimpanseadenovirus, som beskrevet i eksempel 4, 5 eller 6 ovenfor, benyttes så for avlevering av transgenet til en mammalsk- og fortrinnsvis human celle. For eksempel, etter rensing av det rekombinante virus, infekteres humane, embryonale nyre-293-celler ved en MOI på 50 partikler/celle. GFP-ekspresjonen ble dokumentert 24 timer etter infeksjon.
A. Genoverføring i musemodeller via Pan6-, Pan 7- og Pan9- vektorer Genoverføringseffektiviteter og toksikologiske profiler for rekombinante sjimpanseadenovirus ble sammenliknet ved museieverstyrt genoverføring, muselunge styrt genoverføring og musemuskel styrt genoverføring.
El-deleterte adenovirale vektorer inneholdende LacZ under kontroll av CMV-promoteren, ble konstruert ved anvendelse av teknikkene her for humant Ad5, sjimpansePan6, sjimpansePan7 og sjimpansePan9 (C68). Vektorene ble avlevert til nakne immunmanglende NCR-mus (80 pr. studie) som følger. For leverstudien ble 100 ul (lxlO<11>partikler) injisert i nålevenen. For lungestudien ble 50 ul (5xl0<10>partikler) avlevert intratrakealt. For muskelstudien ble 25 ul (5xl0<10>partikler) injisert i den bakre tibialis. Mus ble avlivet på dag 3, 7, 14 og 28 etter vektorinjeksjon (5 dyr pr. gruppe på hvert tidspunkt). Ved hver nekropsi ble lever/lunge/muskelvev samlet og tilberedt for kryoblokker og parafininnleiring. Kryoblokkene ble oppskåret for X-gal farging og parafinsnittene ble H&E-farget for histopatisk analyse. På hvert tidspunkt ble det gjennomført terminalblødning. Serumprøver ble underkastet leverfunksjonstester.
I dette forsøk ble det observert at sjimpanseadenovirusene Pan6, Pan7 og Pan9 var mindre effektive enn huAd5 ved genoverføring til lever og lunge. Det kan imidlertid være ønskelig under visse omstendigheter for derved å redusere leveitoksisiteten som ble observert for huAd5. Genoverføringseffektiviteten i muskel varierte mindre mellom serotypene.
B. Musestudie på gjennomførbarheten av readministrering av adenovimsvektorer vedserotypeskifting mellom AdHu5-, Pan6-, Pan7- ogPan9- vektorer Mus ble tilført (C57/B16; 4/gruppe) LacZ-vektorer basert på huAd5, Pan6, Pan7 og Pan9 (H5.040CMVLacZ, Pan6.000CMVLacZ, Pan7.000CMVLacZ, Pan9.000CMVLacZ; 10<11>partikler/injeksjon) via nålevenen. 30 dager senere ble musene igjen tilført adenovimsvektorer uttrykkende al-antitrypsin
(H5.040CMVhAlAT, Pan6.000CMVhAlAT, lxlO<11>partikler, Pan7.000CMVhAlAT, Pan9.000CMVhAlAT, IO<11>partikler/injeksjon). Vellykket transduksjon ved den readministrerte vektor ble overvåket ved måling av serum al-antitrypsin på dagene 3 og 7 etter readministrering.
Adenovirusvektorers evne til å transdusere leveren hos mus i nærvær av nøytraliserende antistoffer mot de andre serotyper, basert på henholdsvis huAd5, Pan6, Pan7 og Pan9, ble bestemt. Resultatene er vist i tabellen her.
Vektorers evne til å transdusere murin lever i nærvær av nøytraliserende antistoffer til andre serotyper.
Immunisering med huAd5 forhindret således ikke readministrering med noen av
sjimpanseadenovirusvektorene Pan6, Pan7 eller Pan9 (C68). Dette forsøket synes også å indikere at Pan7 ligger mellom Pan6 og Pan9 i spekteret for antigenisk sammenheng og kryssreagerer med begge; Pan6 og Pan9 nøytraliserer imidlertid ikke hverandre. Dette er et overraskende resultat basert på homologisammenlikninger, som antyder at Pan6 er heller forskjellig fra Pan7 og Pan9. Evaluering av antisera generert mot Pan9 antydet
ingen kryssnøytralisering mot Pan6, men en viss nøytralisering mot Pan9, noe som skulle argumentere for at Pan6 er distinkt fra de andre.
Eksempel 10 - Generering av rekombinant El- deletert SV25- vektor Et plasmid ble konstruert som inneholdt det fullstendige SV25-genomet bortsett fra en konstruert El-delesjon. Ved setet for El-delesjonen ble det innført gjenkjenningsseter for restriksjonsenzymene I-Ceul og Pi-Scel som ville tillate innføring av et transgen fra et shuttleplasmid der transgenekspresjonskassetten er flankert med disse to enzymgj enkj enningsseter.
En syntetisk linker inneholdende restriksjonssetene Swal-SnaBI-Spel-Aflll-EcoRV-Swal ble klonet inn i pBR322 som ble kuttet med EcoRI og Ndel. Dette ble utført ved å annile to syntetiske oligomerer SV25T (5'-AAT TTA AAT ACG TAG CGC ACT AGT CGC GCT AAG CGC GGA TAT CAT TTA AA-3', SEKV. ID nr. 38) og SV25B (5'-TAT TTA AAT GAT ATC CGC GCT TAA GCG CGA CTA GTG CGC TAC GTA TTT A-3', SEKV. ID nr. 39) og innføring derav i pBR322 fordøyd med EcoRI og Ndel. Den venstre ende (bp 1 til 1057, SEKV. ID nr. 29) av AdSV25 ble kolonet inn i linkeren ovenfor mellom SnaBI- og Spel-setene. Den høyre ende (bp 28059 til 31042, SEKV. ID nr. 29) av AdSV25 ble klonet inn i linkeren mellom Aflll- og EcoRV-setene. Adenovirus El ble så utskåret mellom £coRI-setet (bp 547) til Xhol (bp 2031) fra den klonete venstre ende som følger. En PCR-generert I-CeuI-PI Scel-kassett fra pShuttle (Clontech) ble innsatt mellom EcoRI- og Spel-setene. 10154 bp Xhol-fragmentet fra AdSV25 (bp 2031 til 12185, SEKV. ID nr. 29) ble så innført i Spel-setet. Det resulterende plasmid ble fordøyd med HindBI og konstruktet (SV25) ble komplettert ved innføring av 18344 bp Ad SV25 #/«<ÆII-fragmentet (bp 11984 til 30328, SEKV. ID nr. 29) for å generere en fullstendig molekylær klon av El-deletert adenovirus SV25, egnet for generering av rekombinante adenovirus. Eventuelt innføres et ønsket transgen i I-Ceul- og PI-SceI-setene av det nydannete pSV25-vektorplasmid.
For å generere et AdSV25 som bærer et markørgen, ble en GFP (grønn fluorescent protein) ekspresjonskassett tidligere klonet inn plasmidet pShuttle (Clontech) utskåret med restriksjonsenzymene I-Ceul og ligert inn i pSV25 (eller et annet av Ad sjimp-plasmidene beskrevet heri) fordøyd med de samme enzymer. Det resulterende plasmid (pSV25GFP) ble fordøyd med Swal for å separere det bakterielle plasmidskjelett, og transfektert inn i den El-kompleterende HEK293-cellelinjen. Omkring 10 dager senere ble en cytopatisk effekt observert, noe som antyder nærvær av replikative virus. Den vellykkete generering av en AdSV25-basert adenoviral vektor som uttrykker GFP, ble bekreftet ved å påføre supernatanten fra den transfekterte kultur på friske cellekulturer. Nærværet av sekundærtinfekterte celler ble bestemt ved observasjon av grønn fluorescens i en populasjon av cellene.
Eksempel 11 - Konstruksjon av E3- deleterte Pan5-, Pan6-, Pan7- oe C68- vektorer Forbedring av kloningsegenskapene til de adenovirale vektorene, kan skje ved at E3-området tas ut fordi dette området koder gener som ikke er nødvendige for propageringen av virus i kultur. For dette formål ble det laget E3-deleterte versjoner av Pan5, Pan6, Pan7 og C68 (et 3,5 kb Nru-AvrU-fragment inneholdende E31-9 er deletert).
A. E3- deletert Pan5- basert vektor
El-deletert pPan5-pkGFP-plasmid ble behandlet med Avrll-endonuklease for å isolere et 5,8 kb-fragment inneholdende E3-området og resirkulere pPan5-pkGFP med Avrll-delesjon for å danne konstruktet pPan5-pkGFP-E3-AvrII. Deretter ble 5,8 kb-AvrU-fragmentet subklonet inn i pSL-Pan5-E3-AvrU for en ytterligere delesjon av E3-området ved Nrul-fordøying. Dette førte til en plasmid pSL-Pan5-E3-delesjon. Det endelige konstruktet pPan5-E3-pkGFP ble fremstilt ved fjerning av et 4,3 kb AvrO/Spel-fragment fra pSL-Pan5-E3-delesjonsplasmidet og innføring i pPan5-pkGFP-E3-AvrU-setet. I dette endelige konstrukt var det gjennomført en 3,1 kb-delesjon i E3-området.
B. E3- delesjon i Pan6- basert vektor
El-deletert pPan6-pkGFP-molekylær klon ble fordøyd med Sbfl og Noti for å isolere et 19,3 kb-fragment, og ligert tilbake ved Sbfl-setet. Det resulterende konstruktet pPan6-Sbfl-E3 ble behandlet med Eco47IU og Swal, for å generere pPan6-E3. Til slutt ble 21 kb Sbfl-fragmentet fra Sbfl-fordøying av pPan6-pkGFP subklonet inn i pPan6-E3 for å danne pPan6-E3-pkGFP med en 4 kb-delesjon i E3.
C. E3- deletert Pan 7- og Pa9- vektorer
Den samme strategi ble anvendt for å oppnå E3-delesj oner i begge vektorer. Først ble et 5,8 kb Avrll-fragment som spente over E3-området subklonet til pSL-1180, fulgt av delesjon av E3 ved Nrul-fordøying. De resulterende plasmider ble behandlet med Spel og AvrU for å oppnå 4,4 kb-fragmenter, og klonet inn i pPan7-pkGFP og pPan9-pkGFP ved Avrll-setene for å erstatte de opprinnelige E3 -holdige AvrU-fragmenter. De endelige pPan7-E3-pkGFP- og pPan9-E3-pkGFP-konstruktene har 3,5 kb E3-delesjoner.
Eksempel 12 - Konstruksjon av E3- oe E4- deletert Pan7- vektor
Selv om delesj onen av El-området av adenovirus (første generasjon adenovirusvektorer) gjør dem replikasjonsinkompetente, er ekspresjonen av de adenovirale vektor skjelettgener ikke fullt avskaffet. Delesjon av E4-området forminsker denne gjenværende genekspresjonen betydelig, og kan gi en sikkerhetsfordel. En E4-deletert Pan7-vektor inneholdende en 2,5 kb delesjon (et PvuII-Agel-fragment inneholdende E40RF1-ORF7 er deletert) ble konstruert. Høytiter-forråd av dette virus ble generert ved anvendelse av en HEK 293-basert cellelinje, som i tillegg til El uttrykker et essensielt E4-gen (orf6).
1. E4- delesjon i den molekylære Pan7- klon
Et 19 kb A7>al-fragment ble deletert fra pPan7-pkGFP for å danne pPan7-A7>aI hvorfra et 2,5 kb E4-fragment ble deletert ved partiell Agel- og PvuU-fordøying, noe som resulterte i at pPan7-^Z>aI-E4.pPan7-E4-pkGFP-plasmid ble generert fra pPan7-A7>aI-E4 i to sekvensielle kloningstrinn, tilsetting av 19 kb Xbal og 15 kb I-CeuJ/MluI-fragmenter, der begge kom fra pPan7-pkGFP-konstruktet.
2. Innføring av E3- og E4- delesjoner i Pan9- vektor
Et 11 kb-plasmid, pPan9-£coRI inneholdende E4-området, ble dannet ved å hente 11 kb EcoRI-fragmentet fra pPan9-pkGFP etter £coRI-fordøying og selvligering. E4-området ble deletert fra dette konstrukt ved Agel-fordøying/innfylt og PvuU-partial fordøying og selvligering for å generere pPan9-£coRI-E4. Et 23 kb EcoRI-fragment ble isolert fra pPan9-pkGFP og innført i pPan9-£coRI-E4 ved £coRI-setet, fulgt av tilsetting av 5,8 kb AvrU-fragment fra pPan9-pkGFP, for å danne det endelige produkt pPan9-E3-E4-pkGF. Sammenliknet med genomstørrelse av villtype pPan9, kan denne E1-E3-E4-deleterte vektoren ha en transgen kapasitet opp til 8 kb. 3. Innføring av minigenkassetter med gener av interesse, inkludert rapportørgener, glyko- og nukleære proteiner av Ebo inn i molekylære kloner av Pan- vektorer En meget effektiv, direkte kloning og grønn/hvit seleksjonsprosedyre ble anvendt for å skape molekylære kloner av rekombinante virus. I korthet ble genene av interesse klonet inn pShuttlepkGFP ved avsøking av hvite kolonier for rekombinanter. Deretter ble minigenkassettene overført til sjimpanseadenovirusskjelettplasmider, pPanXpkGFP med forskjellige delesjoner, enkelt ved å bytte med pkGFP-kassett på I-Ceul- og PI-Scel-setene og avsøking av noen få, hvite kolonier for korrekte kombinanter. 4. Gjenvinning av molekylære kloner av Pan- vektorer med flere delesjoner i tidlige områder og viruspropagering
For gjenvinning av E1-E3-deleterte molekylære koner av sjimpanseadenovirusvektorer, ble klonene lineariserte med egnete restriksjonsenzymer og transfektert inn i regulære 293-celler. Når først en full cytopatisk effekt (CPE) var observert i de transfekterte celler, ble rålysat høstet og ekspandert i 293-celler til storskala infeksjoner. Virusene ble renset ved CsCl-sedimenteringsfremgangsmåten.
For E1-E4- og El-E3-E4-deleterte Pan-vektorer, ble 10-3 celler, en 293-basert E1-E4-kompleterende cellelinje anvendt for gjenvinning og propagering av vektorer. E4-ORF6-genekspresjon i 10-3 celler ble indusert ved tilsetting av 150 uM ZnSC>4 til kulturmediet.
Eksempel 13 - Vaksinering med adenovirusvektorer som uttrykker villtvpe- og variant-EboZ GP
AdHu5- eller AdC7-vektorer som uttrykker Ebola konvoluttkimærer ble produsert for in vivo immuniseringsforsøk i C57BL/6-mus. Rekombinante virus med forskjellige viral skjeletter, ble skapt ved molekylære kloningsmetoder hvori minigenkassettene ble innført i stedet for El-delesj onen. De molekylære kloner av alle rekombinante virus ble gjenvunnet og dyrket opp i 293-celler for storskalarensing ved anvendelse av CsCl-sedimenteringsfremgangsmåten.
Fem EboZ-varianter kodet av AdHu5 eller AdPan7 (C7), ble selektert og produsert for å evaluere deres relative immunogenisitet etter en intramuskulær Ad-injeksjon. wt-Ebo, en oppløselig Ebo-variant, EboAl, EboA2, EboA3, EboA4, EboA5S, EboA6S, EboA7S og EboA8S ble evaluert i de første vaksinestudier. For de data som er oppsummert i den påfølgende tabell, ble antall virale partikler (pr. ml eller totalt) som var produsert og forsterket fra infekterte 293-celler fastslått ved spektrofotometriavlesninger.
Vektor ble administrert intramuskulært (IO<11>genomkopier/celle) i C57BL76-mus og nærværet av virusnøytraliserende antistoff (VNAO) ble evaluert 28 dager senere som et første mål på en immunrespons generert mot Ebola-konvoluttglykoproteinet. VNA er definert som serumantistoff i stand til å inhibere transduksjon av HeLa-celler formidlet av HIV-basert vektor, pseudotypet med villtype Ebola konvolutten.
VNA til EboZ pseudotypene ble påvist med AdPan7 ( Cl), som ga høyere titere enn AdHu5. EboZA3 utløste den høyeste VNA uttrykt ved de transgene mål. For de data som er oppsummert i den påfølgende tabell, er nøytraliserende antistoffritere mot HIV-EboZ-GFP-pseudotypene (resiprok fortynning) anført (N = 5 dyr/gruppe).
Eksempel 14 - Pan7 formidlet ekspresjon av Ebolaproteiner
For å evaluere Pan7-vektorer som uttrykker Ebola konvoluttproteiner og det nukleære Ebola antigenet, har det blitt foretatt studier på mus. Disse er rettet mot evaluering av nøytraliserende antistoffer i C57BL/6-mus, injisert intramuskulært (IM) med AdHu5
eller Pan7 som uttrykker hver av fire Ebola env-konstrukter.
A. Evaluering av CTL fra C5 7BL/ 6- mus injisert IM med AdHu5 eller Pan 7 som uttrykker Ebola env- konstruktene
1. Utfordringsforsøk i mus med Ebola virus
Nøytraliserende antistoff (NAB) responser på Ebola konvolutten ble analysert ved å se på immunisert museserumformidlet nøytralisering av en lentiviral (HIV) vektor pseudotypet med de forskjellige konstruktene (eEbo, NTD2, NTD3, NTD4) av Ebola konvoluttglykoproteinet. C57BL/6- eller BALB/c-mus mottok en enkelt intramuskulær injeksjon på 5xl0<10>partikler/mus av Cl (AdPan7) som koder Ebola konvoluttvarianten. Nøytraliserende antistoff ble bedømt 30 dager etter vaksinering. I korthet ble Ebola Zaire pseudotypet HTV-vektor, som koder for P-galaktosidase (EboZ-HIV-LacZ), inkubert i to timer ved 37 °C med forskjellige fortynninger av varmeinaktivert museserum. Etter inkuberingen med serum, ble EboZ-HTV-LacZ så benyttet for å infektere HeLa-celler i 16 timer ved 37 °C. Infektiviteten ble fastslått ved X-gal farging av transduserte HeLa-celler som var positive for P-galaktosidase. Nøytraliserende titer representerer den serumresiproke fortynning der en 50 % reduksjon i antall P-galaktosidasepositive blå celler ble observert. Sera ble samlet 30 dager etter immunisering, som besto av en enkelt intramuskulær (IM) administrasjon av 5xl0<10>partikler/dyr. Nøytraliserende antistoff mot Ebola pseudotypet HIV kunne påvises fra alle grupper med antistofftitere fra 20 for Ad-EboZ (AdHu5 uttrykkende EboZ), Ad-NTD3 (AdHu5 uttrykkende NTD3) og C7-sEbo (AdPan7 uttrykkende oppløselig EboZ) til over 130 for C7-NTD3 (AdPan7 uttrykkende oppløselig NTD3) og C7-NTD4 (AdPan7 uttrykkende oppløselig NTD3). Den samme immuniseringsstrategi i BALB/c-mus resulterte i lavere nøytraliserende antistofftitere for Ad- og C7-NTD2, og NTD4.
B. Cellulær immunrespons
Den cellulære immunrespons på Ebola konvolutten i C57BL/6-mus ble evaluert 8 dager etter en enkelt IM administrering av 5xl0<10>partikler av C7-LacZ- eller C7-Ebola-konvoluttvariant pr. dyr. Mus ble vaksinert IM med 5xl0<10>partikler Cl, som koder LacZ eller Ebola konvoluttvariant. Spleniske lymfocytter fra immuniserte mus ble samlet 8 dager etter vaksinering og stimulert in vitro med materceller (spleniske lymfocytter fra ikke-behandlete mus infektert med human adenovirusserotype 5 som koder for villtype Ebola konvolutten og så bestrålt). Standard 5-hr CTL-analyser ble utført ved anvendelse av<51>Cr-merket, syngeniske C57-celler transfektert med en ekspressor av EboZ.
En positiv MHC-begrenset cytotoksisk T-lymfocytt (CTL) ble observert fra alle Pan7 som koder for Ebola konvoluttvarianter med en høyere respons fra NTD2-, NTD3- eller NTD4-immuniserte mus. Effektorceller fra Cl kodende Ebola konvoluttvariantimmuniserte mus gjenkjente EboZ-transfekterte målceller og ga gjentatte CTL-responser opptil 30 % spesifikk lysering. Mindre enn 5 % lysering ble observert med effektorceller fra naive eller LacZ-immuniserte kontrollmus, som bekrefter at lysering var spesifikk for Ebola konvoluttantigener.
C. Beskyttelsesstudier
De mest direkte midler for å bedømme Cl (AdPan7) som koder for EboZ-variantene som en vellykket vaksine i mus, var å bedømme beskyttelsen mot vekttap og død etter letal utfordring med museadaptert Ebola Zaire-virus. BALB/c-mus ble immunisert med en enkelt dose av 5xl0<10>partikler/dyr, som gjennomført tidligere og vaksinerte dyr ble utfordret med 200 LD50museadapterte Ebola Zaire 21 dager senere. Alle kontrollmus
(bærer og C7-LacZ) døde mellom dag 5 og dag 9 etter utfordring. Derimot overlevde alle musene unntatt én (fra C7-sEbo gruppen), utfordringen med Ebola Zaire.
Vekttap ble observert fra mus vaksinert med C7-sEbo fra dag 4 til dag 7. Sykdomstegn som pilo-ereksjon og fra lett til alvorlig letargi ble også observert hos mus vaksinert med C7-sEbo, NTD2 og NTD3 mellom dag 4 og dag 7. Mus immunisert med C7-EboZ og C7-NTD4 viste ingen tegn på sykdom. Totalt sett beskyttet en enkelt dose av C7-EboZ og C7-NTD4 immuniserte mus fullstendig mot sykdom og død, muligens pga. en signifikant T-celle formidlet immunitet.
Alle dokumenter nevnt i beskrivelsen er inkorporert heri ved referanse. Tallrike modifikasjoner og varianter av den foreliggende oppfinnelse ligger innenfor dens ramme og er ansett å være åpenbare for fagmannen. Slike modifikasjoner og endringer av preparater og prosesser ifølge oppfinnelsen, som valg av forskjellige minigen eller valg eller doseringer av vektorene eller immunmodulatorene, anses å ligge innenfor rammen av de vedlagte krav.
Claims (11)
1.
Rekombinant adenovirus, karakterisert ved at den har et kapsid som omfatter et AdPan 5 hexon protein som har aminosyresekvensen til SEQ ID NO:3, eller et fragment derav, et fiber protein og et penton protein, der nevnte fiber og penton protein er henholdsvis avledet fra human eller simian opprinnelse, adenoviruset omfatter ytterligere adenovirus sekvenser funksjonelt deletert i Ela og/eller Elb genene, 5' og 3' adenovirus cis-el em enter nødvendig for replikasjon og "encapsidiation" og en transgen heterolog til adenoviruset og operativt koplet til sekvensene som dirigerer ekspresjon av nevnte gen i en vertscelle.
2.
Rekombinant adenovirus i følge krav 1, karakterisert v e d at fiberproteinet er AdPan5 fiberproteinfragmentet fra SEQ ID NO:4.
3.
Rekombinant adenovirus i følge et hvilket som helst av kravene 1 til 2, karakterisert ved at penton proteinet er et AdPan5 penton protein.
4.
Rekombinant adenovirus i følge et hvilket som helst av kravene 1 til 3, karakterisert ved at kapsidet er et AdPan5 kapsid.
5.
Rekombinant adenovirus i følge et hvilket som helst av kravene 1 til 4, karakterisert ved at nevnte rekombinante adenovirus er et pseudotype adenovirus som omfatter 5' og 3'adenovirus cis-elementer nødvendig for replikasjon og "encapsidation", hvor nevnte cis-elementer omfatter en adenovirus 5' invertert terminal repetisjon og en adenovirus 3' invertert terminal repetisjon fra en adenovirus heterologt til AdPan5.
6.
Rekombinant adenovirus i følge krav 1, karakterisert v e d at ved at fragmentet til AdPan5 hexon proteinet er en N-terminal eller C-terminal trunkering på omtrent 50 aminosyrers lengde av AdPan5 hexon proteinsekvensen til SEQ ID NO:3.
7.
Rekombinant adenovirus i følge et hvilket som helst av kravene 1 til 6, karakterisert ved at adenovirusets cis elementer omfatter 5' inverterte terminal repetisjons (ITR) sekvenser og 3' ITR fra Pan5 SEQ ID NO:l eller en sekvens som er komplementær til denne.
8.
Adenovirus i følge krav 7, karakterisert ved at minst ett simiankapsid protein er valgt fra:
penton proteinet fra Pan5 SEQ ID NO:2 og
fiber proteinet fra Pan5 SEQ ID NO:4.
9.
Isolert vertscelle, karakterisert ved at den omfatter et rekombinant adenovirus i følge et hvilket som helst av kravene 1 til 8.
10.
Sammensetning, karakterisert ved at den omfatter et rekombinant adenovirus i følge et hvilket som helst av kravene 1 til 8 og en farmasøytisk akseptabel bærer.
11.
Anvendelse av et rekombinant adenovirus i følge et hvilket som helst av kravene 1 til 8, for fremstilling av et medikament formulert for å reise immunrespons mot et infeksiøst agens i en pattedyrsvert, hvori nevnte transgene eller heterologe sekvens koder for et antigen fra det infeksiøse agenset.
SEQUENCE LISTING
<110> The Trustees of the University of Pennsylvania Wilson, James M.
Gao, Guangping
Roy, Soumitra
<120> Simian Adenovirus Nucleic Acid and Amino Acid Sequences, Vectors Containin' <130> UPN-02677PCT
<150> US 60/331,951
<151> 2001-11-21
<150> US 60/366,798
<151> 2002-03-22
<160> 39
<170> Patentln version 3.1
<210> 1
<211> 36462
<212> DNA
<213> chimpanzee adenovirus serotype Pan5
<220>
<221> CDS
<222> (13898)..(15490)
<223> L2 Penton
<220>
<221> CDS
<222> (18315)..(21116)
<223> L3 Hexon
<220>
<221> CDS
<222> (32035)..(33372)
<223> L5 Fiber
<400> 1
catcatcaat aatatacctc aaacttttgg tgcgcgttaa tatgcaaatg aggtatttga 60 atttggggat gcggggcggt gattggctgc gggagcggcg accgttaggg gcggggcggg 120 tgacgttttg atgacgtggc cgtgaggcgg agccggtttg caagttctcg tgggaaaagt 180 gacgtcaaac gaggtgtggt ttgaacacgg aaatactcaa ttttcccgcg ctctctgaca 240 ggaaatgagg tgtttctggg cggatgcaag tgaaaacggg ccattttcgc gcgaaaactg 300 aatgaggaag tgaaaatctg agtaattccg cgtttatggc agggaggagt atttgccgag 360 ggccgagtag actttgaccg attacgtggg ggtttcgatt accgtatttt tcacctaaat 420 ttccgcgtac ggtgtcaaag tccggtgttt ttacgtaggt gtcagctgat cgccagggta 480 tttaaacctg cgctctctag tcaagaggcc actcttgagt gccagcgagt agagttttct 540 cctccgcgcc gcgagtcaga tctacacttt gaaagatgag gcacctgaga gacctgcccg 600 gtaatgtttt cctggctact gggaacgaga ttctggaact ggtggtggac gccatgatgg 660 gtgacgaccc tccggagccc cctaccccat ttgaagcgcc ttcgctgtac gatttgtatg 720 atctggaggt ggatgtgccc gagaacgacc ccaacgagga ggcggtgaat gatttgttta 780 gcgatgccgc gctgctggct gccgagcagg ctaatacgga ctctggctca gacagcgatt 840 cctctctcca taccccgaga cccggcagag gtgagaaaaa gatccccgag cttaaagggg 900 aagagctcga cctgcgctgc tatgaggaat gcttgcctcc gagcgatgat gaggaggacg 960 aggaggcgat tcgagctgca gcgaaccagg gagtgaaaac agcgagcgag ggctttagcc 1020 tggactgtcc tactctgccc ggacacggct gtaagtcttg tgaatttcat cgcatgaata 1080 ctggagataa gaatgtgatg tgtgccctgt gctatatgag agcttacaac cattgtgttt 1140 acagtaagtg tgattaactt tagctgggga ggcagagggt gactgggtgc tgactggttt 1200 atttatgtat atgtttttta tgtgtaggtc ccgtctctga cgtagatgag acccccacta 1260 cagagtgcat ttcatcaccc ccagaaattg gcgaggaacc gcccgaagat attattcata 1320 gaccagttgc agtgagagtc accgggcgta gagcagctgt ggagagtttg gatgacttgc 1380 tacagggtgg ggatgaacct ttggacttgt gtacccggaa acgccccagg cactaagtgc 1440 cacacatgtg tgtttactta aggtgatgtc agtatttata gggtgtggag tgcaataaaa 1500 tccgtgttga ctttaagtgc gtggtttatg actcaggggt ggggactgtg ggtatataag 1560 caggtgcaga cctgtgtggt cagttcagag caggactcat ggagatctgg acagtcttgg 1620 aagactttca ccagactaga cagctgctag agaactcatc ggagggagtc tcttacctgt 1680 ggagattctg cttcggtggg cctctagcta agctagtcta tagggccaag caggattata 1740 aggatcaatt tgaggatatt ttgagagagt gtcctggtat ttttgactct ctcaacttgg 1800 gccatcagtc tcactttaac cagagtattc tgagagccct tgacttttct actcctggca 1860 gaactaccgc cgcggtagcc ttttttgcct ttatccttga caaatggagt caagaaaccc 1920 atttcagcag ggattaccgt ctggactgct tagcagtagc tttgtggaga acatggaggt 1980 gccagcgcct gaatgcaatc tccggctact tgccagtaca gccggtagac acgctgagga 2040 tcctgagtct ccagtcaccc caggaacacc aacgccgcca gcagccgcag caggagcagc 2100 agcaagagga ggaccgagaa gagaacctga gagccggtct ggaccctccg gtggcggagg 2160 aggaggagta gctgacttgt ttcccgagct gcgccgggtg ctgactaggt cttccagtgg 2220 acgggagagg gggattaagc gggagaggca tgaggagact agccacagaa ctgaactgac 2280 tgtcagtctg atgagtcgca ggcgcccaga atcggtgtgg tggcatgagg tgcagtcgca 2340 ggggatagat gaggtctcag tgatgcatga gaaatattcc ctagaacaag tcaagacttg 2400 ttggttggag cccgaggatg attgggaggt agccatcagg aattatgcca agctggctct 2460 gaggccagac aagaagtaca agattaccaa actgattaat atcagaaatt cctgctacat 2520 ttcagggaat ggggccgagg tggagatcag tacccaggag agggtggcct tcagatgctg 2580 catgatgaat atgtacccgg gggtggtggg catggaggga gtcaccttta tgaacgcgag 2640 gttcaggggt gatgggtata atggggtggt ctttatggcc aacaccaagc tgacagtgca 2700 cggatgctcc ttctttggct tcaataacat gtgcattgag gcctggggca gtgtttcagt 2760 gaggggatgc agtttttcag ccaactggat gggggtcgtg ggcagaacca agagcatggt 2820 gtcagtgaag aaatgcctgt tcgagaggtg ccacctgggg gtgatgagcg agggcgaagc 2880 caaagtcaaa cactgcgcct ctaccgagac gggctgcttt gtactgatca agggcaatgc 2940 caaagtcaag cataatatga tctgtggggc ctcggatgag cgcggctacc agatgctgac 3000 ctgcgccggt gggaacagcc atatgctagc caccgtgcat gtggcctcgc acccccgcaa 3060 gacatggccc gagttcgagc acaacgtcat gacccgctgc aatgtgcacc tggggtcccg 3120 ccgaggcatg ttcatgccct accagtgcaa catgcaattt gtgaaggtgc tgctggagcc 3180 cgatgccatg tccagagtga gcctgacggg ggtgtttgac atgaatgtgg agctgtggaa 3240 aattctgaga tatgatgaat ccaagaccag gtgccgggcc tgcgaatgcg gaggcaagca 3300 cgccaggctt cagcccgtgt gtgtggaggt gacggaggac ctgcgacccg atcatttggt 3360 gttgtcctgc aacgggacgg agttcggctc cagcggggaa gaatctgact agagtgagta 3420 gtgtttggga ctgggtggga gcctgcatga tgggcagaat gactaaaatc tgtgtttttc 3480 tgcgcagcag catgagcgga agcgcctcct ttgagggagg ggtattcagc ccttatctga 3540 cggggcgtct cccctcctgg gcgggagtgc gtcagaatgt gatgggatcc acggtggacg 3600 gccggcccgt gcagcccgcg aactcttcaa ccctgaccta cgcgaccctg agctcctcgt 3660 ccgtggacgc agctgccgcc gcagctgctg cttccgccgc cagcgccgtg cgcggaatgg 3720 ccctgggcgc cggctactac agctctctgg tggccaactc gagttccacc aataatcccg 3780 ccagcctgaa cgaggagaag ctgctgctgc tgatggccca gctcgaggcc ctgacccagc 3840 gcctgggcga gctgacccag caggtggctc agctgcaggc ggagacgcgg gccgcggttg 3900 ccacggtgaa aaccaaataa aaaatgaatc aataaataaa cggagacggt tgttgatttt 3960 aacacagagt cttgaatctt tatttgattt ttcgcgcgcg gtaggccctg gaccaccggt 4020 ctcgatcatt gagcacccgg tggatctttt ccaggacccg gtagaggtgg gcttggatgt 4080 tgaggtacat gggcatgagc ccgtcccggg ggtggaggta gctccattgc agggcctcgt 4140 gctcgggggt ggtgttgtaa atcacccagt catagcaggg gcgcagggcg tggtgctgca 4200 cgatgtcctt gaggaggaga ctgatggcca cgggcagccc cttggtgtag gtgttgacga 4260 acctgttgag ctgggaggga tgcatgcggg gggagatgag atgcatcttg gcctggatct 4320 tgagattggc gatgttcccg cccagatccc gccgggggtt catgttgtgc aggaccacca 4380 gcacggtgta tccggtgcac ttggggaatt tgtcatgcaa cttggaaggg aaggcgtgaa 4440 agaatttgga gacgcccttg tgaccgccca ggttttccat gcactcatcc atgatgatgg 4500 cgatgggccc gtgggcggcg gcttgggcaa agacgtttcg ggggtcggac acatcgtagt 4560 tgtggtcctg ggtgagctcg tcataggcca ttttaatgaa tttggggcgg agggtgcccg 4620 actgggggac gaaggtgccc tcgatcccgg gggcgtagtt gccctcgcag atctgcatct 4680 cccaggcctt gagctcggag ggggggatca tgtccacctg cggggcgatg aaaaaaacgg 4740 tttccggggc gggggagatg agctgggccg aaagcaggtt ccggagcagc tgggacttgc 4800 cgcagccggt ggggccgtag atgaccccga tgaccggctg caggtggtag ttgagggaga 4860 gacagctgcc gtcctcgcgg aggagggggg ccacctcgtt catcatctcg cgcacatgca 4920 tgttctcgcg cacgagttcc gccaggaggc gctcgccccc aagcgagagg agctcttgca 4980 gcgaggcgaa gtttttcagc ggcttgagcc cgtcggccat gggcattttg gagagggtct 5040 gttgcaagag ttccagacgg tcccagagct cggtgatgtg ctctagggca tctcgatcca 5100 gcagacctcc tcgtttcgcg ggttggggcg actgcgggag tagggcacca ggcgatgggc 5160 gtccagcgag gccagggtcc ggtccttcca ggggcgcagg gtccgcgtca gcgtggtctc 5220 cgtcacggtg aaggggtgcg cgccgggctg ggcgcttgcg agggtgcgct tcaggctcat 5280 ccggctggtc gagaaccgct cccggtcggc gccctgcgcg tcggccaggt agcaattgag 5340 catgagttcg tagttgagcg cctcggccgc gtggcccttg gcgcggagct tacctttgga 5400 agtgtgtccg cagacgggac agaggaggga cttgagggcg tagagcttgg gggcgaggaa 5460 gacggactcg ggggcgtagg cgtccgcgcc gcagctggcg cagacggtct cgcactccac 5520 gagccaggtg aggtctggcc ggtcggggtc aaaaacgagg tttcctccgt gctttttgat 5580 gcgtttctta cctctggtct ccatgagctc gtgtccccgc tgggtgacaa agaggctgtc 5640 cgtgtccccg tagaccgact ttatgggccg gtcctcgagc ggggtgccgc ggtcctcgtc 5700 gtagaggaac cccgcccact ccgagacgaa ggcccgggtc caggccagca cgaaggaggc 5760 cacgtgggag gggtagcggt cgttgtccac cagcgggtcc accttctcca gggtatgcaa 5820 gcacatgtcc ccctcgtcca catccaggaa ggtgattggc ttgtaagtgt aggccacgtg 5880 accgggggtc ccggccgggg gggtataaaa gggggcgggc ccctgctcgt cctcactgtc 5940 ttccggatcg ctgtccagga gcgccagctg ttggggtagg tattccctct cgaaggcggg 6000 catgacctcg gcactcaggt tgtcagtttc tagaaacgag gaggatttga tattgacggt 6060 gccgttggag acgcctttca tgagcccctc gtccatctgg tcagaaaaga cgatcttttt 6120 gttgtcgagc ttggtggcga aggagccgta gagggcgttg gagagcagct tggcgatgga 6180 gcgcatggtc tggttctttt ccttgtcggc gcgctccttg gcggcgatgt tgagctgcac 6240 gtactcgcgc gccacgcact tccattcggg gaagacggtg gtgagcttgt cgggcacgat 6300 tctgacccgc cagccgcggt tgtgcagggt gatgaggtcc acgctggtgg ccacctcgcc 6360 gcgcaggggc tcgttggtcc agcagaggcg cccgcccttg cgcgagcaga aggggggcag 6420 cgggtccagc atgagctcgt cgggggggtc ggcgtccacg gtgaagatgc cgggcaggag 6480 ctcggggtcg aagtagctga tgcaggtgcc cagatcgtcc agcgccgctt gccagtcgcg 6540 cacggccagc gcgcgctcgt aggggctgag gggcgtgccc cagggcatgg ggtgcgtgag 6600 cgcggaggcg tacatgccgc agatgtcgta gacgtagagg ggctcctcga ggacgccgat 6660 gtaggtgggg tagcagcgcc ccccgcggat gctggcgcgc acgtagtcgt acagctcgtg 6720 cgagggcgcg aggagcccgg tgccgaggtt ggagcgctgc ggcttttcgg cgcggtagac 6780 gatctggcgg aagatggcgt gggagttgga ggagatggtg ggcctctgga agatgttgaa 6840 gtgggcgtgg ggcagtccga ccgagtccct gatgaagtgg gcgtaggagt cctgcagctt 6900 ggcgacgagc tcggcggtga cgaggacgtc cagggcgcag tagtcgaggg tctcttggat 6960 gatgtcgtac ttgagctggc ccttctgctt ccacagctcg cggttgagaa ggaactcttc 7020 gcggtccttc cagtactctt cgagggggaa cccgtcctga tcggcacggt aagagcccac 7080 catgtagaac tggttgacgg ccttgtaggc gcagcagccc ttctccacgg ggagggcgta 7140 agcttgcgcg gccttgcgca gggaggtgtg ggtgagggcg aaggtgtcgc gcaccatgac 7200 cttgaggaac tggtgcttga agtcgaggtc gtcgcagccg ccctgctccc agagctggaa 7260 gtccgtgcgc ttcttgtagg cggggttggg caaagcgaaa gtaacatcgt tgaagaggat 7320 cttgcccgcg cggggcatga agttgcgagt gatgcggaaa ggctggggca cctcggcccg 7380 gttgttgatg acctgggcgg cgaggacgat ctcgtcgaag ccgttgatgt tgtgcccgac 7440 gatgtagagt tccacgaatc gcgggcggcc cttgacgtgg ggcagcttct tgagctcgtc 7500 gtaggtgagc tcggcggggt cgctgaggcc gtgctgctcg agggcccagt cggcgaggtg 7560 ggggttggcg ccgaggaagg aagtccagag atccacggcc agggcggtct gcaagcggtc 7620 ccggtactga cggaactgct ggcccacggc cattttttcg ggggtgacgc agtagaaggt 7680 gcgggggtcg ccgtgccagc ggtcccactt gagctggagg gcgaggtcgt gggcgagctc 7740 gacgagcggc gggtccccgg agagtttcat gaccagcatg aaggggacga gctgcttgcc 7800 gaaggacccc atccaggtgt aggtttccac gtcgtaggtg aggaagagcc tttcggtgcg 7860 aggatgcgag ccgatgggga agaactggat ctcctgccac cagttggagg aatggctgtt 7920 gatgtgatgg aagtagaaat gccgacggcg cgccgagcac tcgtgcttgt gtttatacaa 7980 gcgtccgcag tgctcgcaac gctgcacggg atgcacgtgc tgcacgagct gtacctgggt 8040 tcctttgacg aggaatttca gtgggcagtg gagcgctggc ggctgcatct ggtgctgtac 8100 tacgtcctgg ccatcggcgt ggccatcgtc tgcctcgatg gtggtcatgc tgacgaggcc 8160 gcgcgggagg caggtccaga cctcggctcg gacgggtcgg agagcgagga cgagggcgcg 8220 caggccggag ctgtccaggg tcctgagacg ctgcggagtc aggtcagtgg gcagcggcgg 8280 cgcgcggttg acttgcagga gcttttccag ggcgcgcggg aggtccagat ggtacttgat 8340 ctccacggcg ccgttggtgg cgacgtccac ggcttgcagg gtcccgtgcc cctggggcgc 8400 caccaccgtg ccccgtttct tcttgggtgc tggcggcggc ggctccatgc ttagaagcgg 8460 cggcgaggac gcgcgccggg cggcaggggc ggctcggggc ccggaggcag gggcggcagg 8520 ggcacgtcgg cgccgcgcgc gggcaggttc tggtactgcg cccggagaag actggcgtga 8580 gcgacgacgc gacggttgac gtcctggatc tgacgcctct gggtgaaggc cacgggaccc 8640 gtgagtttga acctgaaaga gagttcgaca gaatcaatct cggtatcgtt gacggcggcc 8700 tgccgcagga tctcttgcac gtcgcccgag ttgtcctggt aggcgatctc ggtcatgaac 8760 tgctcgatct cctcctcctg aaggtctccg cgaccggcgc gctcgacggt ggccgcgagg 8820 tcgttggaga tgcggcccat gagctgcgag aaggcgttca tgccggcctc gttccagacg 8880 cggctgtaga ccacggctcc gtcggggtcg cgcgcgcgca tgaccacctg ggcgaggttg 8940 agctcgacgt ggcgcgtgaa gaccgcgtag ttgcagaggc gctggtagag gtagttgagc 9000 gtggtggcga tgtgctcggt gacgaagaag tacatgatcc agcggcggag cggcatctcg 9060 ctgacgtcgc ccagggcttc caagcgctcc atggcctcgt agaagtccac ggcgaagttg 9120 aaaaactggg agttgcgcgc cgagacggtc aactcctcct ccagaagacg gatgagctcg 9180 gcgatggtgg cgcgcacctc gcgctcgaag gccccggggg gctcctcttc ttccatctcc 9240 tcctcctctt ccatctcctc cactaacatc tcttctactt cctcctcagg aggcggcggc 9300 gggggagggg ccctgcgtcg ccggcggcgc acgggcagac ggtcgatgaa gcgctcgatg 9360 gtctccccgc gccggcgacg catggtctcg gtgacggcgc gcccgtcctc gcggggccgc 9420 agcgtgaaga cgccgccgcg catctccagg tggccgccgg gggggtctcc gttgggcagg 9480 gagagggcgc tgacgatgca tcttatcaat tggcccgtag ggactccgcg caaggacctg 9540 agcgtctcga gatccacggg atccgaaaac cgctgaacga aggcttcgag ccagtcgcag 9600 tcgcaaggta ggctgagccc ggtttcttgt tcttcgggta tttggtcggg aggcgggcgg 9660 gcgatgctgc tggtgatgaa gttgaagtag gcggtcctga gacggcggat ggtggcgagg 9720 agcaccaggt ccttgggccc ggcttgctgg atgcgcagac ggtcggccat gccccaggcg 9780 tggtcctgac acctggcgag gtccttgtag tagtcctgca tgagccgctc cacgggcacc 9840 tcctcctcgc ccgcgcggcc gtgcatgcgc gtgagcccga acccgcgctg cggctggacg 9900 agcgccaggt cggcgacgac gcgctcggcg aggatggcct gctggatctg ggtgagggtg 9960 gtctggaagt cgtcgaagtc gacgaagcgg tggtaggctc cggtgttgat ggtgtaggag 10020 cagttggcca tgacggacca gttgacggtc tggtggccgg ggcgcacgag ctcgtggtac 10080 ttgaggcgcg agtaggcgcg cgtgtcgaag atgtagtcgt tgcaggtgcg cacgaggtac 10140 tggtatccga cgaggaagtg cggcggcggc tggcggtaga gcggccatcg ctcggtggcg 10200 ggggcgccgg gcgcgaggtc ctcgagcatg aggcggtggt agccgtagat gtacctggac 10260 atccaggtga tgccggcggc ggtggtggag gcgcgcggga actcgcggac gcggttccag 10320 atgttgcgca gcggcaggaa gtagttcatg gtggccgcgg tctggcccgt gaggcgcgcg 10380 cagtcgtgga tgctctagac atacgggcaa aaacgaaagc ggtcagcggc tcgactccgt 10440 ggcctggagg ctaagcgaac gggttgggct gcgcgtgtac cccggttcga gtccctgctc 10500 gaatcaggct ggagccgcag ctaacgtggt actggcactc ccgtctcgac ccaagcctgc 10560 taacgaaacc tccaggatac ggaggcgggt cgttttggcc attttcgtca ggccggaaat 10620 gaaactagta agcgcggaaa gcggccgtcc gcgatggctc gctgccgtag tctggagaaa 10680 gaatcgccag ggttgcgttg cggtgtgccc cggttcgagc ctcagcgctc ggcgccggcc 10740 ggattccgcg gctaacgtgg gcgtggctgc cccgtcgttt ccaagacccc ttagccagcc 10800 gacttctcca gttacggagc gagcccctct ttttcttgtg tttttgccag atgcatcccg 10860 tactgcggca gatgcgcccc caccctccac cacaaccgcc cctaccgcag cagcagcaac 10920 agccggcgct tctgcccccg ccccagcagc agcagccagc cactaccgcg gcggccgccg 10980 tgagcggagc cggcgttcag tatgacctgg ccttggaaga gggcgagggg ctggcgcggc 11040 tgggggcgtc gtcgccggag cggcacccgc gcgtgcagat gaaaagggac gctcgcgagg 11100 cctacgtgcc caagcagaac ctgttcagag acaggagcgg cgaggagccc gaggagatgc 11160 gcgcctcccg cttccacgcg gggcgggagc tgcggcgcgg cctggaccga aagcgggtgc 11220 tgagggacga ggatttcgag gcggacgagc tgacggggat cagccccgcg cgcgcgcacg 11280 tggccgcggc caacctggtc acggcgtacg agcagaccgt gaaggaggag agcaacttcc 11340 aaaaatcctt caacaaccac gtgcgcacgc tgatcgcgcg cgaggaggtg accctgggcc 11400 tgatgcacct gtgggacctg ctggaggcca tcgtgcagaa ccccacgagc aagccgctga 11460 cggcgcagct gtttctggtg gtgcagcaca gtcgggacaa cgagacgttc agggaggcgc 11520 tgctgaatat caccgagccc gagggccgct ggctcctgga cctggtgaac attctgcaga 11580 gcatcgtggt gcaggagcgc gggctgccgc tgtccgagaa gctggcggcc atcaacttct 11640 cggtgctgag cctgggcaag tactacgcta ggaagatcta caagaccccg tacgtgccca 11700 tagacaagga ggtgaagatc gacgggtttt acatgcgcat gaccctgaaa gtgctgaccc 11760 tgagcgacga tctgggggtg taccgcaacg acaggatgca ccgcgcggtg agcgccagcc 11820 gccggcgcga gctgagcgac caggagctga tgcacagcct gcagcgggcc ctgaccgggg 11880 ccgggaccga gggggagagc tactttgaca tgggcgcgga cctgcgctgg cagcctagcc 11940 gccgggcctt ggaagctgcc ggcggttccc cctacgtgga ggaggtggac gatgaggagg 12000 aggagggcga gtacctggaa gactgatggc gcgaccgtat ttttgctaga tgcagcaaca 12060 gccaccgccg cctcctgatc ccgcgatgcg ggcggcgctg cagagccagc cgtccggcat 12120 taactcctcg gacgattgga cccaggccat gcaacgcatc atggcgctga cgacccgcaa 12180 tcccgaagcc tttagacagc agcctcaggc caaccgactc tcggccatcc tggaggccgt 12240 ggtgccctcg cgctcgaacc ccacgcacga gaaggtgctg gccatcgtga acgcgctggt 12300 ggagaacaag gccatccgcg gcgacgaggc cgggctggtg tacaacgcgc tgctggagcg 12360 cgtggcccgc tacaacagca ccaacgtgca gacgaacctg gaccgcatgg tgaccgacgt 12420 gcgcgaggcg gtgtcgcagc gcgagcggtt ccaccgcgag tcgaacctgg gctccatggt 12480 ggcgctgaac gccttcctga gcacgcagcc cgccaacgtg ccccggggcc aggaggacta 12540 caccaacttc atcagcgcgc tgcggctgat ggtggccgag gtgccccaga gcgaggtgta 12600 ccagtcgggg ccggactact tcttccagac cagtcgccag ggcttgcaga ccgtgaacct 12660 gagccaggct ttcaagaact tgcagggact gtggggcgtg caggccccgg tcggggaccg 12720 cgcgacggtg tcgagcctgc tgacgccgaa ctcgcgcctg ctgctgctgc tggtggcgcc 12780 cttcacggac agcggcagcg tgagccgcga ctcgtacctg ggctacctgc ttaacctgta 12840 ccgcgaggcc atcgggcagg cgcacgtgga cgagcagacc taccaggaga tcacccacgt 12900 gagccgcgcg ctgggccagg aggacccggg caacctggag gccaccctga acttcctgct 12960 gaccaaccgg tcgcagaaga tcccgcccca gtacgcgctg agcaccgagg aggagcgcat 13020 cctgcgctac gtgcagcaga gcgtggggct gttcctgatg caggaggggg ccacgcccag 13080 cgccgcgctc gacatgaccg cgcgcaacat ggagcccagc atgtacgccc gcaaccgccc 13140 gttcatcaat aagctgatgg actacttgca tcgggcggcc gccatgaact cggactactt 13200 taccaacgcc atcttgaacc cgcactggct cccgccgccc gggttctaca cgggcgagta 13260 cgacatgccc gaccccaacg acgggttcct gtgggacgac gtggacagca gcgtgttctc 13320 gccgcgcccc accaccacca ccgtgtggaa gaaagagggc ggggaccggc ggccgtcctc 13380 ggcgctgtcc ggtcgcgcgg gtgctgccgc ggcggtgccc gaggccgcca gccccttccc 13440 gagcctgccc ttttcgctga acagcgtgcg cagcagcgag ctgggtcggc tgacgcggcc 13500 gcgcctgctg ggcgaggagg agtacctgaa cgactccttg cttcggcccg agcgcgagaa 13560 gaacttcccc aataacggga tagagagcct ggtggacaag atgagccgct ggaagacgta 13620 cgcgcacgag cacagggacg agccccgagc tagcagcagc accggcgcca cccgtagacg 13680 ccagcggcac gacaggcagc ggggtctggt gtgggacgat gaggattccg ccgacgacag 13740 cagcgtgttg gacttgggtg ggagtggtgg tggtaacccg ttcgctcacc tgcgcccccg 13800 tatcgggcgc ctgatgtaag aatctgaaaa aataaaagac ggtactcacc aaggccatgg 13860 cgaccagcgt gcgttcttct ctgttgtttg tagtagt atg atg agg cgc gtg tac 13915
Met Met Arg Arg Val Tyr 1 5
ccg gag ggt cct cct ccc tcg tac gag age gtg atg cag cag gcg gtg 13963 Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser Val Met Gin Gin Ala Val 10 15 20
gcg gcg gcg atg cag ccc ccg etg gag gcg cct tac gtg ccc ccg egg 14011 Ala Ala Ala Met Gin Pro Pro Leu Glu Ala Pro Tyr Val Pro Pro Arg 25 30 35
tac etg gcg cct acg gag ggg egg aac age att cgt tac tcg gag etg 14059 Tyr Leu Ala Pro Thr Glu Gly Arg Asn Ser Ile Arg Tyr Ser Glu Leu 40 45 50
gca ccc ttg tac gat acc acc egg ttg tac etg gtg gac aac aag tcg 14107 Ala Pro Leu Tyr Asp Thr Thr Arg Leu Tyr Leu Val Asp Asn Lys Ser 55 60 65 70
gcg gac atc gcc tcg etg aac tac cag aac gac cac age aac ttc etg 14155 Ala Asp Ile Ala Ser Leu Asn Tyr Gin Asn Asp His Ser Asn Phe Leu 75 80 85
acc acc gtg gtg cag aac aac gat ttc acc ccc acg gag gcc age acc 14203 Thr Thr Val Val Gin Asn Asn Asp Phe Thr Pro Thr Glu Ala Ser Thr 90 95 100
cag acc atc aac ttt gac gag cgc tcg egg tgg ggc ggc cag etg aaa 14251 Gin Thr Ile Asn Phe Asp Glu Arg Ser Arg Trp Gly Gly Gin Leu Lys 105 110 115
acc atc atg cac acc aac atg ccc aac gtg aac gag ttc atg tac age 14299 Thr Ile Met His Thr Asn Met Pro Asn Val Asn Glu Phe Met Tyr Ser 120 125 130
aac aag ttc aag gcg egg gtg atg gtc tcg cgc aag acc ccc aac ggg 14347 Asn Lys Phe Lys Ala Arg Val Met Val Ser Arg Lys Thr Pro Asn Gly 135 140 145 150
gtc aca gta aca gat ggt agt cag gac gag etg acc tac gag tgg gtg 14395 Val Thr Val Thr Asp Gly Ser Gin Asp Glu Leu Thr Tyr Glu Trp Val 155 160 165
gag ttt gag etg ccc gag ggc aac ttc tcg gtg acc atg acc atc gat 14443 Glu Phe Glu Leu Pro Glu Gly Asn Phe Ser Val Thr Met Thr Ile Asp 170 175 180
etg atg aac aac gcc atc atc gac aac tac ttg gcg gtg ggg egg cag 14491 Leu Met Asn Asn Ala Ile Ile Asp Asn Tyr Leu Ala Val Gly Arg Gin 185 190 195
aac ggg gtg etg gag age gac atc ggc gtg aag ttc gac acg cgc aac 14539 Asn Gly Val Leu Glu Ser Asp Ile Gly Val Lys Phe Asp Thr Arg Asn 200 205 210
ttc egg etg ggc tgg gac ccc gtg acc gag etg gtg atg ccg ggc gtg 14587 Phe Arg Leu Gly Trp Asp Pro Val Thr Glu Leu Val Met Pro Gly Val 215 220 225 230
tac acc aac gag gcc ttc cac ccc gac atc gtc etg etg ccc ggc tgc 14635 Tyr Thr Asn Glu Ala Phe His Pro Asp Ile Val Leu Leu Pro Gly Cys 235 240 245
ggc gtg gac ttc acc gag age cgc etc age aac etg etg ggc atc cgc 14683 Gly Val Asp Phe Thr Glu Ser Arg Leu Ser Asn Leu Leu Gly Ile Arg 250 255 260
aag egg cag ccc ttc cag gag ggc ttc cag atc etg tac gag gac etg 14731 Lys Arg Gin Pro Phe Gin Glu Gly Phe Gin Ile Leu Tyr Glu Asp Leu 265 270 275
gag ggg ggc aac atc ccc gcg etg etg gac gtg gac gcc tac gag aaa 14779 Glu Gly Gly Asn Ile Pro Ala Leu Leu Asp Val Asp Ala Tyr Glu Lys 280 285 290
age aag gag gat age gcc gcc gcg gcg acc gca gcc gtg gcc acc gcc 14827 Ser Lys Glu Asp Ser Ala Ala Ala Ala Thr Ala Ala Val Ala Thr Ala 295 300 305 310
tet acc gag gtg egg ggc gat aat ttt get age gcc gcg aca etg gca 14875 Ser Thr Glu Val Arg Gly Asp Asn Phe Ala Ser Ala Ala Thr Leu Ala 315 320 325
gcg gcc gag gcg get gaa acc gaa agt aag ata gtg atc cag ccg gtg 14923 Ala Ala Glu Ala Ala Glu Thr Glu Ser Lys Ile Val Ile Gin Pro Val 330 335 340
gag aag gac age aag gag agg age tac aac gtg etc gcg gac aag aaa 14971 Glu Lys Asp Ser Lys Glu Arg Ser Tyr Asn Val Leu Ala Asp Lys Lys 345 350 355
aac acc gcc tac cgc age tgg tac etg gcc tac aac tac ggc gac ccc 15019 Asn Thr Ala Tyr Arg Ser Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro 360 365 370
gag aag ggc gtg cgc tcc tgg acg etg etc acc acc tcg gac gtc acc 15067 Glu Lys Gly Val Arg Ser Trp Thr Leu Leu Thr Thr Ser Asp Val Thr 375 380 385 390
tgc ggc gtg gag caa gtc tac tgg tcg etg ccc gac atg atg caa gac 15115 Cys Gly Val Glu Gin Val Tyr Trp Ser Leu Pro Asp Met Met Gin Asp 395 400 405
ccg gtc acc ttc cgc tcc acg cgt caa gtt age aac tac ccg gtg gtg 15163 Pro Val Thr Phe Arg Ser Thr Arg Gin Val Ser Asn Tyr Pro Val Val 410 415 420
ggc gcc gag etc etg ccc gtc tac tcc aag age ttc ttc aac gag cag 15211 Gly Ala Glu Leu Leu Pro Val Tyr Ser Lys Ser Phe Phe Asn Glu Gin 425 430 435
gcc gtc tac tcg cag cag etg cgc gcc ttc acc tcg etc acg cac gtc 15259 Ala Val Tyr Ser Gin Gin Leu Arg Ala Phe Thr Ser Leu Thr His Val 440 445 450
ttc aac cgc ttc ccc gag aac cag atc etc gtt cgc ccg ccc gcg ccc 15307 Phe Asn Arg Phe Pro Glu Asn Gin Ile Leu Val Arg Pro Pro Ala Pro 455 460 465 470
acc att acc acc gtc agt gaa aac gtt cct get etc aca gat cac ggg 15355 Thr Ile Thr Thr Val Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly 475 480 485
acc etg ccg etg cgc age agt atc egg gga gtc cag cgc gtg acc gtc 15403 Thr Leu Pro Leu Arg Ser Ser Ile Arg Gly Val Gin Arg Val Thr Val 490 495 500
act gac gcc aga cgc cgc acc tgc ccc tac gtc tac aag gcc etg ggc 15451 Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly 505 510 515
gta gtc gcg ccg cgc gtc etc tcg age cgc acc ttc taa aaaatgtcca 15500 Val Val Ala Pro Arg Val Leu Ser Ser Arg Thr Phe
520 525 530
ttctcatctc gcccagtaat aacaceggtt ggggcctgcg cgcgcccagc aagatgtacg 15560 gaggegcteg ccaacgctcc acgcaacacc ccgtgcgcgt gcgcgggcac ttccgcgctc 15620 cctggggcgc cctcaagggc cgcgtgcgct cgcgcaccac cgtcgacgac gtgatcgacc 15680 aggtggtggc cgacgcgcgc aactacacgc ccgccgccgc gcccgtctcc acegtggaeg 15740 ccgtcatcga cagegtggtg gccgacgcgc gccggtacgc ccgcgccaag agccggcggc 15800 ggcgcatcgc ccggcggcac cggagcaccc ccgccatgcg cgcggcgcga gccttgctgc 15860 gcagggccag gegcaeggga cgcagggcca tgetcaggge ggccagacgc gcggcctccg 15920 gcagcagcag cgccggcagg acccgcagac gcgcggccac ggcggcggcg gcggccatcg 15980 ccagcatgtc ccgcccgcgg cgcggcaacg tgtactgggt gcgcgacgcc gccaccggtg 16040 tgcgcgtgcc cgtgcgcacc cgcccccctc gcacttgaag atgetgaett egegatgttg 16100 atgtgtccca geggegagga ggatgtccaa gegcaaatte aaggaagaga tgctccaggt 16160 catcgcgcct gagatctacg gcccggcggc ggtgaaggag gaaagaaagc cccgcaaact 16220 gaagcgggtc aaaaaggaca aaaaggagga ggaagatgtg gacggactgg tggagtttgt 16280 gcgcgagttc gccccccggc ggcgcgtgca gtggcgcggg cggaaagtga aaccggtgct 16340 gcgacccggc accacggtgg tcttcacgcc cggcgagcgt tccggctccg cctccaagcg 16400 ctcctacgac gaggtgtacg gggacgagga catcctcgag caggcggccg aacgtctggg 16460 cgagtttgct tacggcaagc gcagccgccc cgcgcccttg aaagaggagg cggtgtccat 16520 cccgctggac cacggcaacc ccacgccgag cctgaagccg gtgaccctgc agcaggtgct 16580 gcctggtgcg gcgccgcgcc ggggcttcaa gcgcgagggc ggcgaggatc tgtacccgac 16640 catgcagctg atggtgccca agcgccagaa gctggaggac gtgctggagc acatgaaggt 16700 ggaccccgag gtgcagcccg aggtcaaggt gcggcccatc aagcaggtgg ccccgggcct 16760 gggcgtgcag accgtggaca tcaagatccc cacggagccc atggaaacgc agaccgagcc 16820 cgtgaagccc agcaccagca ccatggaggt gcagacggat ccctggatgc cggcaccggc 16880 ttccaccacc cgccgaagac gcaagtacgg cgcggccagc ctgctgatgc ccaactacgc 16940 gctgcatcct tccatcatcc ccacgccggg ctaccgcggc acgcgcttct accgcggcta 17000 caccagcagc cgccgccgca agaccaccac ccgccgccgc cgtcgtcgca cccgccgcag 17060 cagcaccgcg acttccgccg ccgccctggt gcggagagtg taccgcagcg ggcgcgagcc 17120 tctgaccctg ccgcgcgcgc gctaccaccc gagcatcgcc atttaactac cgcctcctac 17180 ttgcagatat ggccctcaca tgccgcctcc gcgtccccat tacgggctac cgaggaagaa 17240 agccgcgccg tagaaggctg acggggaacg ggctgcgtcg ccatcaccac cggcggcggc 17300 gcgccatcag caagcggttg gggggaggct tcctgcccgc gctgatgccc atcatcgccg 17360 cggcgatcgg ggcgatcccc ggcatagctt ccgtggcggt gcaggcctct cagcgccact 17420 gagacacagc ttggaaaatt tgtaataaaa aatggactga cgctcctggt cctgtgatgt 17480 gtgtttttag atggaagaca tcaatttttc gtccctggca ccgcgacacg gcacgcggcc 17540 gtttatgggc acctggagcg acatcggcaa cagccaactg aacgggggcg ccttcaattg 17600 gagcagtctc tggagcgggc ttaagaattt cgggtccacg ctcaaaacct atggcaacaa 17660 ggcgtggaac agcagcacag ggcaggcgct gagggaaaag ctgaaagagc agaacttcca 17720 gcagaaggtg gtcgatggcc tggcctcggg catcaacggg gtggtggacc tggccaacca 17780 ggccgtgcag aaacagatca acagccgcct ggacgcggtc ccgcccgcgg ggtccgtgga 17840 gatgccccag gtggaggagg agctgcctcc cctggacaag cgcggcgaca agcgaccgcg 17900 tcccgacgcg gaggagacgc tgctgacgca cacggacgag ccgcccccgt acgaggaggc 17960 ggtgaaactg ggtctgccca ccacgcggcc cgtggcgcct ctggccaccg gggtgctgaa 18020 acccagcagc agcagcagcc agcccgcgac cctggacttg cctccgcctg cttcccgccc 18080 ctccacagtg gctaagcccc tgccgccggt ggccgtcgcg tcgcgcgccc cccgaggccg 18140 cccccaggcg aactggcaga gcactctgaa cagcatcgtg ggtctgggag tgcagagtgt 18200 gaagcgccgc cgctgctatt aaaagacact gtagcgctta acttgcttgt ctgtgtgtat 18260 atgtatgtcc gccgaccaga aggaggagga agaggcgcgt cgccgagttg caag atg 18317
Met
gcc acc cca tcg atg etg ccc cag tgg gcg tac atg cac atc gcc gga 18365 Ala Thr Pro Ser Met Leu Pro Gin Trp Ala Tyr Met His Ile Ala Gly 535 540 545
cag gac get tcg gag tac etg agt ccg ggt etg gtg cag ttc gcc cgc 18413 Gin Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gin Phe Ala Arg 550 555 560
gcc aca gac acc tac ttc agt etg ggg aac aag ttt agg aac ccc acg 18461 Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro Thr 565 570 575
gtg gcg ccc acg cac gat gtg acc acc gac cgc age cag egg etg acg 18509 Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gin Arg Leu Thr 580 585 590 595
etg cgc ttc gtg ccc gtg gac cgc gag gac aac acc tac tcg tac aaa 18557 Leu Arg Phe Val Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr Lys 600 605 610
gtg cgc tac acg etg gcc gtg ggc gac aac cgc gtg etg gac atg gcc 18605 Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met Ala 615 620 625
age acc tac ttt gac atc cgc ggc gtg etg gat egg ggc cct age ttc 18653 Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Ser Phe 630 635 640
aaa ccc tac tcc ggc acc get tac aac age etg get ccc aag gga gcg 18701 Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly Ala 645 650 655
ccc aac act tgc cag tgg aca tat aaa get gat ggt gat act ggt aca 18749 Pro Asn Thr Cys Gin Trp Thr Tyr Lys Ala Asp Gly Asp Thr Gly Thr 660 665 670 675
gaa aaa acc tat aca tat gga aat gcg cct gtg caa ggc att agt att 18797 Glu Lys Thr Tyr Thr Tyr Gly Asn Ala Pro Val Gin Gly Ile Ser Ile 680 685 690
aca aaa gat ggt att caa ett gga act gac act gat gat cag ccc att 18845 Thr Lys Asp Gly Ile Gin Leu Gly Thr Asp Thr Asp Asp Gin Pro Ile 695 700 705
tat gca gat aaa act tat caa cca gag cct caa gtg ggt gat get gaa 18893 Tyr Ala Asp Lys Thr Tyr Gin Pro Glu Pro Gin Val Gly Asp Ala Glu 710 715 720
tgg eat gac atc act ggt act gat gaa aaa tat gga ggc aga get etc 18941 Trp His Asp Ile Thr Gly Thr Asp Glu Lys Tyr Gly Gly Arg Ala Leu 725 730 735
aag cct gac acc aaa atg aag ccc tgc tat ggt tet ttt gcc aag cct 18989 Lys Pro Asp Thr Lys Met Lys Pro Cys Tyr Gly Ser Phe Ala Lys Pro 740 745 750 755 acc aat aaa gaa gga ggt cag gca aat gtg aaa acc gaa aca ggc ggt 19037 Thr Asn Lys Glu Gly Gly Gin Ala Asn Val Lys Thr Glu Thr Gly Gly 760 765 770
acc aaa gaa tat gac att gac atg gca ttc ttc gat aat ega agt gca 19085 Thr Lys Glu Tyr Asp Ile Asp Met Ala Phe Phe Asp Asn Arg Ser Ala 775 780 785
get gcg get ggc etg gcc cca gaa att gtt ttg tat act gag aat gtg 19133 Ala Ala Ala Gly Leu Ala Pro Glu Ile Val Leu Tyr Thr Glu Asn Val 790 795 800
gat etg gaa act cca gat act eat att gta tac aag gcg ggc aca gat 19181 Asp Leu Glu Thr Pro Asp Thr His Ile Val Tyr Lys Ala Gly Thr Asp 805 810 815
gac age age tet tet atc aat ttg ggt cag cag tcc atg ccc aac aga 19229 Asp Ser Ser Ser Ser Ile Asn Leu Gly Gin Gin Ser Met Pro Asn Arg 820 825 830 835
ccc aac tac att ggc ttt aga gac aac ttt atc ggg etc atg tac tac 19277 Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr 840 845 850
aac age act ggc aac atg ggc gtg etg get ggt cag gcc tcc cag etg 19325 Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gin Ala Ser Gin Leu 855 860 865
aat get gtg gtg gac ttg cag gac aga aac act gaa etg tcc tac cag 19373 Asn Ala Val Val Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser Tyr Gin 870 875 880
etc ttg ett gac tet etg ggc gac aga acc agg tat ttc agt atg tgg 19421 Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp 885 890 895
aat cag gcg gtg gac age tat gac ccc gat gtg cgc att att gaa aat 19469 Asn Gin Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn 900 905 910 915
cac ggt gtg gag gat gaa etc cct aac tat tgc ttc ccc etg gat get 19517 His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asp Ala 920 925 930
gtg ggt aga act gat act tac cag gga att aag gcc aat ggt get gat 19565 Val Gly Arg Thr Asp Thr Tyr Gin Gly Ile Lys Ala Asn Gly Ala Asp 935 940 945
caa acc acc tgg acc aaa gat gat act gtt aat gat get aat gaa ttg 19613 Gin Thr Thr Trp Thr Lys Asp Asp Thr Val Asn Asp Ala Asn Glu Leu 950 955 960
ggc aag ggc aat cct ttc gcc atg gag atc aac atc cag gcc aac etg 19661 Gly Lys Gly Asn Pro Phe Ala Met Glu Ile Asn Ile Gin Ala Asn Leu 965 970 975
tgg egg aac ttc etc tac gcg aac gtg gcg etg tac etg ccc gac tcc 19709 Trp Arg Asn Phe Leu Tyr Ala Asn Val Ala Leu Tyr Leu Pro Asp Ser 980 985 990 995
tac aag tac acg ccg gcc aac atc acg etg ccg acc aac acc aac 19754 Tyr Lys Tyr Thr Pro Ala Asn Ile Thr Leu Pro Thr Asn Thr Asn 1000 1005 1010 acc tac gat tac atg aac ggc cgc gtg gtg gcg ccc tcg etg gtg 19799 Thr Tyr Asp Tyr Met Asn Gly Arg Val Val Ala Pro Ser Leu Val 1015 1020 1025
gac gcc tac atc aac atc ggg gcg cgc tgg tcg etg gac ccc atg 19844 Asp Ala Tyr Ile Asn Ile Gly Ala Arg Trp Ser Leu Asp Pro Met 1030 1035 1040
gac aac gtc aac ccc ttc aac cac cac cgc aac gcg ggc etg cgc 19889 Asp Asn Val Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg 1045 1050 1055
tac cgc tcc atg etc etg ggc aac ggg cgc tac gtg ccc ttc cac 19934 Tyr Arg Ser Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His 1060 1065 1070
atc cag gtg ccc caa aag ttc ttc gcc atc aag age etc etg etc 19979 Ile Gin Val Pro Gin Lys Phe Phe Ala Ile Lys Ser Leu Leu Leu 1075 1080 1085
etg ccc ggg tcc tac acc tac gag tgg aac ttc cgc aag gac gtc 20024 Leu Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val 1090 1095 1100
aac atg atc etg cag age tcc etc ggc aac gac etg cgc acg gac 20069 Asn Met Ile Leu Gin Ser Ser Leu Gly Asn Asp Leu Arg Thr Asp 1105 1110 1115
ggg gcc tcc atc gcc ttc acc age atc aac etc tac gcc acc ttc 20114 Gly Ala Ser Ile Ala Phe Thr Ser Ile Asn Leu Tyr Ala Thr Phe 1120 1125 1130
ttc ccc atg gcg cac aac acc gcc tcc acg etc gag gcc atg etg 20159 Phe Pro Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu 1135 1140 1145
cgc aac gac acc aac gac cag tcc ttc aac gac tac etc tcg gcg 20204 Arg Asn Asp Thr Asn Asp Gin Ser Phe Asn Asp Tyr Leu Ser Ala 1150 1155 1160
gcc aac atg etc tac ccc atc ccg gcc aac gcc acc aac gtg ccc 20249 Ala Asn Met Leu Tyr Pro Ile Pro Ala Asn Ala Thr Asn Val Pro 1165 1170 1175
atc tcc atc ccc tcg cgc aac tgg gcc gcc ttc cgc gga tgg tcc 20294 Ile Ser Ile Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp Ser 1180 1185 1190
ttc acg cgc etc aag acc cgc gag acg ccc tcg etc ggc tcc ggg 20339 Phe Thr Arg Leu Lys Thr Arg Glu Thr Pro Ser Leu Gly Ser Gly 1195 1200 1205
ttc gac ccc tac ttc gtc tac tcg ggc tcc atc ccc tac etc gac 20384 Phe Asp Pro Tyr Phe Val Tyr Ser Gly Ser Ile Pro Tyr Leu Asp 1210 1215 1220
ggc acc ttc tac etc aac cac acc ttc aag aag gtc tcc atc acc 20429 Gly Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser Ile Thr 1225 1230 1235
ttc gac tcc tcc gtc age tgg ccc ggc aac gac cgc etc etg acg 20474 Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr 1240 1245 1250 ccc aac gag ttc gaa atc aag cgc acc gtc gac gga gag ggg tac 20519 Pro Asn Glu Phe Glu Ile Lys Arg Thr Val Asp Gly Glu Gly Tyr 1255 1260 1265
aac gtg gcc cag tgc aac atg acc aag gac tgg ttc etg gtc cag 20564 Asn Val Ala Gin Cys Asn Met Thr Lys Asp Trp Phe Leu Val Gin 1270 1275 1280
atg etg gcc cac tac aac atc ggc tac cag ggc ttc tac gtg ccc 20609 Met Leu Ala His Tyr Asn Ile Gly Tyr Gin Gly Phe Tyr Val Pro 1285 1290 1295
gag ggc tac aag gac cgc atg tac tcc ttc ttc cgc aac ttc cag 20654 Glu Gly Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gin 1300 1305 1310
ccc atg age cgc cag gtc gtg gac gag gtc aac tac aag gac tac 20699 Pro Met Ser Arg Gin Val Val Asp Glu Val Asn Tyr Lys Asp Tyr 1315 1320 1325
cag gcc gtc acc etg gcc tac cag cac aac aac tcg ggc ttc gtc 20744 Gin Ala Val Thr Leu Ala Tyr Gin His Asn Asn Ser Gly Phe Val 1330 1335 1340
ggc tac etc gcg ccc acc atg cgc cag gga cag ccc tac ccc gcc 20789 Gly Tyr Leu Ala Pro Thr Met Arg Gin Gly Gin Pro Tyr Pro Ala 1345 1350 1355
aac tac ccc tac ccg etc atc ggc aag age gcc gtc gcc age gtc 20834 Asn Tyr Pro Tyr Pro Leu Ile Gly Lys Ser Ala Val Ala Ser Val 1360 1365 1370
acc cag aaa aag ttc etc tgc gac egg gtc atg tgg cgc atc ccc 20879 Thr Gin Lys Lys Phe Leu Cys Asp Arg Val Met Trp Arg Ile Pro 1375 1380 1385
ttc tcc age aac ttc atg tcc atg ggc gcg etc acc gac etc ggc 20924 Phe Ser Ser Asn Phe Met Ser Met Gly Ala Leu Thr Asp Leu Gly 1390 1395 1400
cag aac atg etc tac gcc aac tcc gcc cac gcg eta gac atg aat 20969 Gin Asn Met Leu Tyr Ala Asn Ser Ala His Ala Leu Asp Met Asn 1405 1410 1415
ttc gaa gtc gac ccc atg gat gag tcc acc ett etc tat gtt gtc 21014 Phe Glu Val Asp Pro Met Asp Glu Ser Thr Leu Leu Tyr Val Val 1420 1425 1430
ttc gaa gtc ttc gac gtc gtc ega gtg cac cag ccc cac cgc ggc 21059 Phe Glu Val Phe Asp Val Val Arg Val His Gin Pro His Arg Gly 1435 1440 1445
gtc atc gag gcc gtc tac etg cgc acg ccc ttc tcg gcc ggc aac 21104 Val Ile Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly Asn 1450 1455 1460
gcc acc acc taa gccccgctct tgcttcttgc aagatgaegg cetgtgeggg 21156 Ala Thr Thr
ctccggcgag caggagetea gggccatcct ccgcgacctg ggctgeggge cctgcttcct 21216 gggcaccttc gaeaageget tecegggatt catggccccg cacaagctgg cctgcgccat 21276 cgtcaacacg gccggccgcg agaccggggg cgagcactgg ctggccttcg cctggaaccc 21336 gcgctcccac acctgctacc tcttcgaccc cttcgggttc tcggacgagc gcctcaagca 21396 gatctaccag ttcgagtacg agggcctgct gcgccgcagc gccctggcca ccgaggaccg 21456 ctgcgtcacc ctggaaaagt ccacccagac cgtgcagggt ccgcgctcgg ccgcctgcgg 21516 gctcttctgc tgcatgttcc tgcacgcctt cgtgcactgg cccgaccgcc ccatggacaa 21576 gaaccccacc atgaacttgc tgacgggggt gcccaacggc atgctccagt cgccccaggt 21636 ggaacccacc ctgcgccgca accaggaggc gctctaccgc ttcctcaacg cccactccgc 21696 ctactttcgc tcccaccgcg cgcgcatcga gaaggccacc gccttcgacc gcatgaatca 21756 agacatgtaa accgtgtgtg tatgtgaatg ctttattcat aataaacagc acatgtttat 21816 gccacctttt ctgaggctct gactttattt agaaatcgaa ggggttctgc cggctctcgg 21876 cgtgccccgc gggcagggat acgttgcgga actggtactt gggcagccac ttgaactcgg 21936 ggatcagcag cttcggcacg gggaggtcgg ggaacgagtc gctccacagc ttgcgcgtga 21996 gttgcagggc gcccagcagg tcgggcgcgg agatcttgaa atcgcagttg ggacccgcgt 22056 tctgcgcgcg ggagttgcgg tacacggggt tgcagcactg gaacaccatc agggccgggt 22116 gcttcacgct cgccagcacc gtcgcgtcgg tgatgccctc cacgtccaga tcctcggcgt 22176 tggccatccc gaagggggtc atcttgcagg tctgccgccc catgctgggc acgcagccgg 22236 gcttgtggtt gcaatcgcag tgcaggggga tcagcatcat ctgggcctgc tcggagctca 22296 tgcccgggta catggccttc atgaaagcct ccagctggcg gaaggcctgc tgcgccttgc 22356 cgccctcggt gaagaagacc ccgcaggact tgctagagaa ctggttggtg gcgcagccgg 22416 cgtcgtgcac gcagcagcgc gcgtcgttgt tggccagctg caccacgctg cgcccccagc 22476 ggttctgggt gatcttggcc cggtcggggt tctccttcag cgcgcgctgc ccgttctcgc 22536 tcgccacatc catctcgatc gtgtgctcct tctggatcat cacggtcccg tgcaggcatc 22596 gcagcttgcc ctcggcctcg gtgcacccgt gcagccacag cgcgcagccg gtgcactccc 22656 agttcttgtg ggcgatctgg gagtgcgagt gcacgaagcc ctgcaggaag cggcccatca 22716 tcgtggtcag ggtcttgttg ctggtgaagg tcagcgggat gccgcggtgc tcctcgttca 22776 catacaggtg gcagatgcgg cggtacacct cgccctgctc gggcatcagc tggaaggcgg 22836 acttcaggtc gctctccacg cggtaccggt ccatcagcag cgtcatgact tccatgccct 22896 tctcccaggc cgagacgatc ggcaggctca gggggttctt caccgccgtt gtcatcttag 22956 tcgccgccgc tgaggtcagg gggtcgttct cgtccagggt ctcaaacact cgcttgccgt 23016 ccttctcggt gatgcgcacg gggggaaagc tgaagcccac ggccgccagc tcctcctcgg 23076 cctgcctttc gtcctcgctg tcctggctga tgtcttgcaa aggcacatgc ttggtcttgc 23136 ggggtttctt tttgggcggc agaggcggcg gcggagacgt gctgggcgag cgcgagttct 23196 cgctcaccac gactatttct tcttcttggc cgtcgtccga gaccacgcgg cggtaggcat 23256 gcctcttctg gggcagaggc ggaggcgacg ggctctcgcg gttcggcggg cggctggcag 23316 agccccttcc gcgttcgggg gtgcgctcct ggcggcgctg ctctgactga cttcctccgc 23376 ggccggccat tgtgttctcc tagggagcaa caagcatgga gactcagcca tcgtcgccaa 23436 catcgccatc tgcccccgcc gccgccgacg agaaccagca gcagaatgaa agcttaaccg 23496 ccccgccgcc cagccccacc tccgacgccg ccgcggcccc agacatgcaa gagatggagg 23556 aatccatcga gattgacctg ggctacgtga cgcccgcgga gcacgaggag gagctggcag 23616 cgcgcttttc agccccggaa gagaaccacc aagagcagcc agagcaggaa gcagagagcg 23676 agcagcagca ggctgggctc gagcatggcg actacctgag cggggcagag gacgtgctca 23736 tcaagcatct ggcccgccaa tgcatcatcg tcaaggacgc gctgctcgac cgcgccgagg 23796 tgcccctcag cgtggcggag ctcagccgcg cctacgagcg caacctcttc tcgccgcgcg 23856 tgccccccaa gcgccagccc aacggcacct gcgagcccaa cccgcgcctc aacttctacc 23916 cggtcttcgc ggtgcccgag gccctggcca cctaccacct ctttttcaag aaccaaagga 23976 tccccgtctc ctgccgcgcc aaccgcaccc gcgccgacgc cctgctcaac ctgggtcccg 24036 gcgcccgcct acctgatatc gcctccttgg aagaggttcc caagatcttc gagggtctgg 24096 gcagcgacga gactcgggcc gcgaacgctc tgcaaggaag cggagaggag catgagcacc 24156 acagcgccct ggtggagttg gaaggcgaca acgcgcgcct ggcggtgctc aagcgcacgg 24216 tcgagctgac ccacttcgcc tacccggcgc tcaacctgcc ccccaaggtc atgagcgccg 24276 tcatggacca ggtgctcatc aagcgcgcct cgcccctctc ggatgaggac atgcaggacc 24336 ccgagagctc ggacgagggc aagcccgtgg tcagcgacga gcagctggcg cgctggctgg 24396 gagcgagtag caccccccag agcttggaag agcggcgcaa gctcatgatg gccgtggtcc 24456 tggtgaccgt ggagctggag tgtctgcgcc gcttcttcgc cgacgcagag accctgcgca 24516 aggtcgagga gaacctgcac tacctcttca ggcacgggtt tgtgcgccag gcctgcaaga 24576 tctccaacgt ggagctgacc aacctggtct cctacatggg catcctgcac gagaaccgcc 24636 tggggcagaa cgtgctgcac accaccctgc gcggggaggc ccgccgcgac tacatccgcg 24696 actgcgtcta cctgtacctc tgccacacct ggcagacggg catgggcgtg tggcagcagt 24756 gcctggagga gcagaacctg aaagagctct gcaagctcct gcagaagaac ctgaaggccc 24816 tgtggaccgg gttcgacgag cgcaccaccg cctcggacct ggccgacctc atcttccccg 24876 agcgcctgcg gctgacgctg cgcaacggac tgcccgactt tatgagtcaa agcatgttgc 24936 aaaactttcg ctctttcatc ctcgaacgct ccgggatcct gcccgccacc tgctccgcgc 24996 tgccctcgga cttcgtgccg ctgaccttcc gcgagtgccc cccgccgctc tggagccact 25056 gctacctgct gcgcctggcc aactacctgg cctaccactc ggacgtgatc gaggacgtca 25116 gcggcgaggg tctgctcgag tgccactgcc gctgcaacct ctgcacgccg caccgctccc 25176 tggcctgcaa cccccagctg ctgagcgaga cccagatcat cggcaccttc gagttgcaag 25236 gccccggcga gggcaagggg ggtctgaaac tcaccccggg gctgtggacc tcggcctact 25296 tgcgcaagtt cgtgcccgag gactaccatc ccttcgagat caggttctac gaggaccaat 25356 cccagccgcc caaggccgaa ctgtcggcct gcgtcatcac ccagggggcc atcctggccc 25416 aattgcaagc catccagaaa tcccgccaag aatttctgct gaaaaagggc cacggggtct 25476 acctggaccc ccagaccgga gaggagctca accccagctt cccccaggat gccccgagga 25536 agcagcaaga agctgaaagt ggagctgccg ccgccggagg atttggagga agactgggag 25596 agcagtcagg cagaggagga ggagatggaa gactgggaca gcactcaggc agaggaggac 25656 agcctgcaag acagtctgga agacgaggtg gaggaggagg cagaggaaga agcagccgcc 25716 gccagaccgt cgtcctcggc ggagaaagca agcagcacgg ataccatctc cgctccgggt 25776 cggggtcgcg gcgaccgggc ccacagtagg tgggacgaga ccgggcgctt cccgaacccc 25836 accacccaga ccggtaagaa ggagcggcag ggatacaagt cctggcgggg gcacaaaaac 25896 gccatcgtct cctgcttgca agcctgcggg ggcaacatct ccttcacccg ccgctacctg 25956 ctcttccacc gcggggtgaa cttcccccgc aacatcttgc attactaccg tcacctccac 26016 agcccctact actgtttcca agaagaggca gaaacccagc agcagcagaa aaccagcggc 26076 agcagcagct agaaaatcca cagcggcggc aggtggactg aggatcgcag cgaacgagcc 26136 ggcgcagacc cgggagctga ggaaccggat ctttcccacc ctctatgcca tcttccagca 26196 gagtcggggg caggagcagg aactgaaagt caagaaccgt tctctgcgct cgctcacccg 26256 cagttgtctg tatcacaaga gcgaagacca acttcagcgc actctcgagg acgccgaggc 26316 tctcttcaac aagtactgcg cgctcactct taaagagtag cccgcgcccg cccacacacg 26376 gaaaaaggcg ggaattacgt caccacctgc gcccttcgcc cgaccatcat catgagcaaa 26436 gagattccca cgccttacat gtggagctac cagccccaga tgggcctggc cgccggcgcc 26496 gcccaggact actccacccg catgaactgg ctcagcgccg ggcccgcgat gatctcacgg 26556 gtgaatgaca tccgcgcccg ccgaaaccag atactcctag aacagtcagc gatcaccgcc 26616 acgccccgcc atcaccttaa tccgcgtaat tggcccgccg ccctggtgta ccaggaaatt 26676 ccccagccca cgaccgtact acttccgcga gacgcccagg ccgaagtcca gctgactaac 26736 tcaggtgtcc agctggccgg cggcgccgcc ctgtgtcgtc accgccccgc tcagggtata 26796 aagcggctgg tgatccgagg cagaggcaca cagctcaacg acgaggtggt gagctcttcg 26856 ctgggtctgc gacctgacgg agtcttccaa ctcgccggat cggggagatc ttccttcacg 26916 cctcgtcagg ccgtcctgac tttggagagt tcgtcctcgc agccccgctc gggtggcatc 26976 ggcactctcc agttcgtgga ggagttcact ccctcggtct acttcaaccc cttctccggc 27036 tcccccggcc actacccgga cgagttcatc ccgaacttcg acgccatcag cgagtcggtg 27096 gacggctacg attgaatgtc ccatggtggc gcagctgacc tagctcggct tcgacacctg 27156 gaccactgcc gccgcttccg ctgcttcgct cgggatctcg ccgagtttgc ctactttgag 27216 ctgcccgagg agcaccctca gggcccggcc cacggagtgc ggatcatcgt cgaagggggc 27276 ctcgactccc acctgcttcg gatcttcagc cagcgaccga tcctggtcga gcgcgagcaa 27336 ggacagaccc ttctgaccct gtactgcatc tgcaaccacc ccggcctgca tgaaagtctt 27396 tgttgtctgc tgtgtactga gtataataaa agctgagatc agcgactact ccggactcga 27456 ttgtggtgtt cctgctatca accggtccct gttcttcacc gggaacgaga ccgagctcca 27516 gcttcagtgt aagccccaca agaagtacct cacctggctg ttccagggct ccccgatcgc 27576 cgttgtcaac cactgcgaca acgacggagt cctgctgagc ggccccgcca accttacttt 27636 ttccacccgc agaagcaagc tccagctctt ccaacccttc ctccccggga cctatcagtg 27696 cgtctcggga ccctgccatc acaccttcca cctgatcccg aataccacag cgccgctccc 27756 cgctactaac aaccaaacta cccaccatcg ccaccgtcgc gacctttctg aatctaacac 27816 taccacccac accggaggtg agctccgagg tcgaccaacc tctgggattt actacggccc 27876 ctgggaggtg gtggggttaa tagcgctagg cctagttgtg ggtgggcttt tggctctctg 27936 ctacctatac ctcccttgct gttcgtactt agtggtgctg tgttgctggt ttaagaaatg 27996 gggaagatca ccctagtgag ctgcggtgcg ctggtggcgg tggtggtgtt ttcgattgtg 28056 ggactgggcg gcgcggctgt agtgaaggag aaggccgatc cctgcttgca tttcaatccc 28116 gacaattgcc agctgagttt tcagcccgat ggcaatcggt gcgcggtgct gatcaagtgc 28176 ggatgggaat gcgagaacgt gagaatcgag tacaataaca agactcggaa caatactctc 28236 gcgtccgtgt ggcagcccgg ggaccccgag tggtacaccg tctctgtccc cggtgctgac 28296 ggctccccgc gcaccgtgaa caatactttc atttttgcgc acatgtgcga cacggtcatg 28356 tggatgagca agcagtacga tatgtggccc cccacgaagg agaacatcgt ggtcttctcc 28416 atcgcttaca gcgcgtgcac ggcgctaatc accgctatcg tgtgcctgag cattcacatg 28476 ctcatcgcta ttcgccccag aaataatgcc gaaaaagaga aacagccata acacgttttt 28536 tcacacacct ttttcagacc atggcctctg ttaaattttt gcttttattt gccagtctca 28596 ttactgttat aagtaatgag aaactcacta tttacattgg cactaaccac actttagacg 28656 gaattccaaa atcctcatgg tattgctatt ttgatcaaga tccagactta actatagaac 28716 tgtgtggtaa caagggaaaa aatacaagca ttcatttaat taactttaat tgcggagaca 28776 atttgaaatt aattaatatc actaaagagt atggaggtat gtattactat gttgcagaaa 28836 ataacaacat gcagttttat gaagttactg taactaatcc caccacacct agaacaacaa 28896 caaccaccac cacaaaaact acacctgtta ccactatgca gctcactacc aataacattt 28956 ttgccatgcg tcaaatggtc aacaatagca ctcaacccac cccacccagt gaggaaattc 29016 ccaaatccat gattggcatt attgttgctg tagtggtgtg catgttgatc atcgccttgt 29076 gcatggtgta ctatgccttc tgctacagaa agcacagact gaacgacaag ctggaacact 29136 tactaagtgt tgaattttaa ttttttagaa ccatgaagat cctaggcctt ttaatttttt 29196 ctatcattac ctctgctcta tgcaattctg acaatgagga cgttactgtc gttgtcggaa 29256 ccaattatac actgaaaggt ccagcgaagg gtatgctttc gtggtattgc tggtttggaa 29316 ctgacgagca acagacagag ctctgcaatg ctcaaaaagg caaaacctca aattctaaaa 29376 tctctaatta tcaatgcaat ggcactgact tagtactgct caatgtcacg aaagcatatg 29436 ctggcagcta cacctgccct ggagatgata ctgagaacat gattttttac aaagtggaag 29496 tggttgatcc cactactcca cctccaccca ccacaactac tcacaccaca cacacagaac 29556 aaaccacagc agaggaggca gcaaagttag ccttgcaggt ccaagacagt tcatttgttg 29616 gcattacccc tacacctgat cagcggtgtc cggggctgct cgtcagcggc attgtcggtg 29676 tgctttcggg attagcagtc ataatcatct gcatgttcat ttttgcttgc tgctatagaa 29736 ggctttaccg acaaaaatca gacccactgc tgaacctcta tgtttaattt tttccagagc 29796 catgaaggca gttagcactc tagttttttg ttctttgatt ggcactgttt ttagtgttag 29856 ctttttgaaa caaatcaatg ttactgaggg ggaaaatgtg acactggtag gcgtagaggg 29916 tgctcaaaat accacctgga caaaattcca tctagatggg tggaaagaaa tttgcacctg 29976 gaatgtcagt acttatacat gtgaaggagt taatcttacc attgtcaatg tcagccaaat 30036 tcaaaagggt tggattaaag ggcaatctgt tagtgttagc aatagtgggt actataccca 30096 gcatactctt atctatgaca ttatagttat accactgcct acacctagcc cacctagcac 30156 taccacacag acaacccaca ctacacaaac aaccacatac agtacatcaa atcagcctac 30216 caccactaca acagcagagg ttgccagctc gtctggggtc cgagtggcat ttttgatgtt 30276 ggccccatct agcagtccca ctgctagtac caatgagcag actactgaat ttttgtccac 30336 tgtcgagagc cacaccacag ctacctcgag tgccttctct agcaccgcca atctatcctc 30396 gctttcctct acaccaatca gtcccgctac tactcctacc cccgctattc tccccactcc 30456 cctgaagcaa acagacggcg acatgcaatg gcagatcacc ctgctcattg tgatcgggtt 30516 ggtcatcctg gccgtgttgc tctactacat cttctgccgc cgcattccca acgcgcaccg 30576 caagccggcc tacaagccca tcgttgtcgg gcagccggag ccgcttcagg tggaaggggg 30636 tctaaggaat cttctcttct cttttacagt atggtgattg aattatgatt cctagacaaa 30696 tcttgatcac tattcttatc tgcctcctcc aagtctgtgc caccctcgct ctggtggcca 30756 acgccagtcc agactgtatt gggcccttcg cctcctacgt gctctttgcc ttcatcacct 30816 gcatctgctg ctgtagcata gtctgcctgc ttatcacctt cttccagttc attgactgga 30876 tctttgtgcg catcgcctac ctgcgccacc acccccagta ccgcgaccag cgagtggcgc 30936 ggctgctcag gatcctctga taagcatgcg ggctctgcta cttctcgcgc ttctgctgtt 30996 agtgctcccc cgtcccgtcg acccccggac ccccacccag tcccccgagg aggtccgcaa 31056 atgcaaattc caagaaccct ggaaattcct caaatgctac cgccaaaaat cagacatgca 31116 tcccagctgg atcatgatca ttgggatcgt gaacattctg gcctgcaccc tcatctcctt 31176 tgtgatttac ccctgctttg actttggttg gaactcgcca gaggcgctct atctcccgcc 31236 tgaacctgac acaccaccac agcaacctca ggcacacgca ctaccaccac caccacagcc 31296 taggccacaa tacatgccca tattagacta tgaggccgag ccacagcgac ccatgctccc 31356 cgctattagt tacttcaatc taaccggcgg agatgactga cccactggcc aacaacaacg 31416 tcaacgacct tctcctggac atggacggcc gcgcctcgga gcagcgactc gcccaacttc 31476 gcattcgcca gcagcaggag agagccgtca aggagctgca ggacggcata gccatccacc 31536 agtgcaagaa aggcatcttc tgcctggtga aacaggccaa gatctcctac gaggtcaccc 31596 agaccgacca tcgcctctcc tacgagctcc tgcagcagcg ccagaagttc acctgcctgg 31656 tcggagtcaa ccccatcgtc atcacccagc agtcgggcga taccaagggg tgcatccact 31716 gctcctgcga ctcccccgac tgcgtccaca ctctgatcaa gaccctctgc ggcctccgcg 31776 acctcctccc catgaactaa tcaccccctt atccagtgaa ataaagatca tattgatgat 31836 ttgagtttaa taaaaataaa gaatcactta cttgaaatct gataccaggt ctctgtccat 31896 gttttctgcc aacaccactt cactcccctc ttcccagctc tggtactgca ggccccggcg 31956 ggctgcaaac ttcctccaca ccctgaaggg gatgtcaaat tcctcctgtc cctcaatctt 32016 cattttatct tctatcag atg tcc aaa aag cgc gtc egg gtg gat gat gac 32067
Met Ser Lys Lys Arg Val Arg Val Asp Asp Asp 1465 1470
ttc gac ccc gtc tac ccc tac gat gca gac aac gca ccg acc gtg 32112 Phe Asp Pro Val Tyr Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val 1475 1480 1485
ccc ttc atc aac ccc ccc ttc gtc tet tea gat gga ttc caa gag 32157 Pro Phe Ile Asn Pro Pro Phe Val Ser Ser Asp Gly Phe Gin Glu 1490 1495 1500
aag ccc etg ggg gtg etg tcc etg cgt etg gcc gat ccc gtc acc 32202 Lys Pro Leu Gly Val Leu Ser Leu Arg Leu Ala Asp Pro Val Thr 1505 1510 1515
acc aag aac ggg gaa atc acc etc aag etg gga gat ggg gtg gac 32247 Thr Lys Asn Gly Glu Ile Thr Leu Lys Leu Gly Asp Gly Val Asp 1520 1525 1530
etc gac tcc tcg gga aaa etc atc tcc aac acg gcc acc aag gcc 32292 Leu Asp Ser Ser Gly Lys Leu Ile Ser Asn Thr Ala Thr Lys Ala 1535 1540 1545
gcc gcc cct etc agt ttt tcc aac aac acc att tcc ett aac atg 32337 Ala Ala Pro Leu Ser Phe Ser Asn Asn Thr Ile Ser Leu Asn Met 1550 1555 1560
gat acc cct ttt tac aac aac aat gga aag tta ggc atg aaa gtc 32382 Asp Thr Pro Phe Tyr Asn Asn Asn Gly Lys Leu Gly Met Lys Val 1565 1570 1575
act get cca etg aag ata eta gac aca gac ttg eta aaa aca ett 32427 Thr Ala Pro Leu Lys Ile Leu Asp Thr Asp Leu Leu Lys Thr Leu 1580 1585 1590
gtt gta get tat gga caa ggt tta gga aca aac acc act ggt gcc 32472 Val Val Ala Tyr Gly Gin Gly Leu Gly Thr Asn Thr Thr Gly Ala 1595 1600 1605
ett gtt gcc caa eta gca tcc cca ett get ttt gat age aat age 32517 Leu Val Ala Gin Leu Ala Ser Pro Leu Ala Phe Asp Ser Asn Ser 1610 1615 1620
aaa att gcc ett aat tta ggc aat gga cca ttg aaa gtg gat gca 32562 Lys Ile Ala Leu Asn Leu Gly Asn Gly Pro Leu Lys Val Asp Ala 1625 1630 1635
aat aga etg aac atc aat tgc aat aga gga etc tat gtt act acc 32607 Asn Arg Leu Asn Ile Asn Cys Asn Arg Gly Leu Tyr Val Thr Thr 1640 1645 1650
aca aaa gat gca etg gaa gcc aat ata agt tgg get aat get atg 32652 Thr Lys Asp Ala Leu Glu Ala Asn Ile Ser Trp Ala Asn Ala Met 1655 1660 1665
aca ttt ata gga aat gcc atg ggt gtc aat att gat aca caa aaa 32697 Thr Phe Ile Gly Asn Ala Met Gly Val Asn Ile Asp Thr Gin Lys 1670 1675 1680
ggc ttg caa ttt ggc acc act agt acc gtc gca gat gtt aaa aac 32742 Gly Leu Gin Phe Gly Thr Thr Ser Thr Val Ala Asp Val Lys Asn 1685 1690 1695
get tac ccc ata caa atc aaa ett gga get ggt etc aca ttt gac 32787 Ala Tyr Pro Ile Gin Ile Lys Leu Gly Ala Gly Leu Thr Phe Asp 1700 1705 1710
age aca ggt gca att gtt gca tgg aac aaa gat gat gac aag ett 32832 Ser Thr Gly Ala Ile Val Ala Trp Asn Lys Asp Asp Asp Lys Leu 1715 1720 1725
aca eta tgg acc aca gcc gac ccc tet cca aat tgt cac ata tat 32877 Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys His Ile Tyr 1730 1735 1740
tet gaa aag gat get aag ett aca ett tgc ttg aca aag tgt ggc 32922 Ser Glu Lys Asp Ala Lys Leu Thr Leu Cys Leu Thr Lys Cys Gly 1745 1750 1755
agt cag att etg ggc act gtt tcc etc ata get gtt gat act ggc 32967 Ser Gin Ile Leu Gly Thr Val Ser Leu Ile Ala Val Asp Thr Gly 1760 1765 1770
agt tta aat ccc ata aca gga aca gta acc act get ett gtc tea 33012 Ser Leu Asn Pro Ile Thr Gly Thr Val Thr Thr Ala Leu Val Ser 1775 1780 1785
ett aaa ttc gat gca aat gga gtt ttg caa age age tea aca eta 33057 Leu Lys Phe Asp Ala Asn Gly Val Leu Gin Ser Ser Ser Thr Leu 1790 1795 1800
gac tea gac tat tgg aat ttc aga cag gga gat gtt aca cct get 33102 Asp Ser Asp Tyr Trp Asn Phe Arg Gin Gly Asp Val Thr Pro Ala 1805 1810 1815
gaa gcc tat act aat get ata ggt ttc atg ccc aat eta aaa gca 33147 Glu Ala Tyr Thr Asn Ala Ile Gly Phe Met Pro Asn Leu Lys Ala 1820 1825 1830
tac cct aaa aac aca agt gga get gca aaa agt cac att gtt ggg 33192 Tyr Pro Lys Asn Thr Ser Gly Ala Ala Lys Ser His Ile Val Gly 1835 1840 1845
aaa gtg tac eta eat ggg gat aca ggc aaa cca etg gac etc att 33237 Lys Val Tyr Leu His Gly Asp Thr Gly Lys Pro Leu Asp Leu Ile 1850 1855 1860
att act ttc aat gaa aca agt gat gaa tet tgc act tac tgt att 33282 Ile Thr Phe Asn Glu Thr Ser Asp Glu Ser Cys Thr Tyr Cys Ile 1865 1870 1875
aac ttt caa tgg cag tgg ggg get gat caa tat aaa aat gaa aca 33327 Asn Phe Gin Trp Gin Trp Gly Ala Asp Gin Tyr Lys Asn Glu Thr 1880 1885 1890
ett gcc gtc agt tea ttc acc ttt tcc tat att get aaa gaa taa 33372 Leu Ala Val Ser Ser Phe Thr Phe Ser Tyr Ile Ala Lys Glu 1895 1900 1905
accccactct gtaccccatc tctgtctatg gaaaaaactc tgaaacacaa aataaaataa 33432 agttcaagtg ttttattgat tcaacagttt tacaggattc gagcagttat ttttcctcca 33492 ccctcccagg acatggaata caccaccctc tccccccgca cagecttgaa catctgaatg 33552 ccattggtga tggacatget tttggtctcc acgttccaca cagtttcaga gcgagccagt 33612 etegggtegg tcagggagat gaaaccctcc gggcactccc gcatctgcac ctcacagctc 33672 aacagctgag gattgtcctc ggtggtcggg ateaeggtta tctggaagaa gcagaagagc 33732 ggeggtggga atcatagtcc gegaaeggga teggceggtg gtgtegcate aggccccgca 33792 geagtcgetg tcgccgccgc tccgtcaagc tgctgctcag ggggtccggg tccagggact 33852 ccctcagcat gatgcccacg gccctcagca teagtegtet ggtgeggegg gcgcagcagc 33912 geatgeggat ctcgctcagg tegctgcagt aegtgcaaca caggaccacc aggttgttca 33972 acagtccata gttcaacacg ctccagccga aactcatcgc gggaaggatg ctacccacgt 34032 ggcegtegta ccagatcctc aggtaaatca agtggcgccc cctccagaac acgctgccca 34092 tgtacatgat ctccttgggc atgtggcggt tcaccacctc ccggtaccac atcaccctct 34152 ggttgaacat gcagccccgg atgatcctgc ggaaccacag ggccagcacc gccccgcccg 34212 ccatgcagcg aagagacccc gggtcccgac aatggcaatg gaggacccac cgctcgtacc 34272 egtggateat ctgggagctg aacaagtcta tgttggcaca gcacaggcat atgctcatgc 34332 atctcttcag cactctcagc tecteggggg tcaaaaccat atcccagggc aeggggaact 34392 cttgcaggac agcgaacccc gcagaacagg gcaatcctcg cacataactt acattgtgca 34452 tggacagggt atcgcaatca ggcagcaccg ggtgatcctc caccagagaa gcgcgggtct 34512 cggtctcctc acagcgtggt aagggggccg gccgatacgg gtgatggcgg gacgcggctg 34572 atcgtgttcg cgaccgtgtt atgatgcagt tgctttcgga cattttcgta cttgctgtag 34632 cagaacctgg tccgggcgct gcacaccgat cgccggcggc ggtcccggcg cttggaacgc 34692 tcggtgttga agttgtaaaa cagccactct ctcagaccgt gcagcagatc tagggcctca 34752 ggagtgatga agatcccatc atgcctgatg gctctaatca catcgaccac cgtggaatgg 34812 gccagaccca gccagatgat gcaattttgt tgggtttcgg tgacggcggg ggagggaaga 34872 acaggaagaa ccatgattaa cttttaatcc aaacggtctc ggagcacttc aaaatgaaga 34932 tcgcggagat ggcacctctc gcccccgctg tgttggtgga aaataacagc caggtcaaag 34992 gtgatacggt tctcgagatg ttccacggtg gcttccagca aagcctccac gcgcacatcc 35052 agaaacaaga caatagcgaa agcgggaggg ttctctaatt cctcaatcat catgttacac 35112 tcctgcacca tccccagata attttcattt ttccagcctt gaatgattcg aactagttcc 35172 tgaggtaaat ccaagccagc catgataaag agctcgcgca gagcgccctc caccggcatt 35232 cttaagcaca ccctcataat tccaagatat tctgctcctg gttcacctgc agcagattga 35292 caagcggaat atcaaaatct ctgccgcgat ccctaagctc ctccctcagc aataactgta 35352 agtactcttt catatcctct ccgaaatttt tagccatagg accaccagga ataagattag 35412 ggcaagccac agtacagata aaccgaagtc ctccccagtg agcattgcca aatgcaagac 35472 tgctataagc atgctggcta gacccggtga tatcttccag ataactggac agaaaatcgc 35532 ccaggcaatt tttaagaaaa tcaacaaaag aaaaatcctc caggtgcacg tttagagcct 35592 cgggaacaac gatggagtaa atgcaagcgg tgcgttccag catggttagt tagctgatct 35652 gtagaaaaaa acaaaaatga acattaaacc atgctagcct ggcgaacagg tgggtaaatc 35712 gttctctcca gcaccaggca ggccacgggg tctccggcac gaccctcgta aaaattgtcg 35772 ctatgattga aaaccatcac agagagacgt tcccggtggc cggcgtgaat gattcgacaa 35832 gatgaataca cccccggaac attggcgtcc gcgagtgaaa aaaagcgccc aaggaagcaa 35892 taaggcacta caatgctcag tctcaagtcc agcaaagcga tgccatgcgg atgaagcaca 35952 aaattctcag gtgcgtacaa aatgtaatta ctcccctcct gcacaggcag caaagccccc 36012 gatccctcca ggtacacata caaagcctca gcgtccatag cttaccgagc agcagcacac 36072 aacaggcgca agagtcagag aaaggctgag ctctaacctg tccacccgct ctctgctcaa 36132 tatatagccc agatctacac tgacgtaaag gccaaagtct aaaaataccc gccaaataat 36192 cacacacgcc cagcacacgc ccagaaaccg gtgacacact caaaaaaata cgcgcacttc 36252 ctcaaacgcc caaactgccg tcatttccgg gttcccacgc tacgtcatca aaattcgact 36312 ttcaaattcc gtcgaccgtt aaaaacgtcg cccgccccgc ccctaacggt cgccgctccc 36372 gcagccaatc accgccccgc atccccaaat tcaaatacct catttgcata ttaacgcgca 36432 ccaaaagttt gaggtatatt attgatgatg 36462
<210> 2
<211> 530
<212> PRT
<213> chimpanzee adenovirus serotype Pan5
<400> 2
Met Met Arg Arg Val Tyr Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser 15 10 15
Val Met Gin Gin Ala Val Ala Ala Ala Met Gin Pro Pro Leu Glu Ala 20 25 30
Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu Gly Arg Asn Ser 35 40 45
Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Arg Leu Tyr 50 55 60
Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu Asn Tyr Gin Asn 65 70 75 80
Asp His Ser Asn Phe Leu Thr Thr Val Val Gin Asn Asn Asp Phe Thr 85 90 95
Pro Thr Glu Ala Ser Thr Gin Thr Ile Asn Phe Asp Glu Arg Ser Arg 100 105 110
Trp Gly Gly Gin Leu Lys Thr Ile Met His Thr Asn Met Pro Asn Val 115 120 125
Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala Arg Val Met Val Ser 130 135 140
Arg Lys Thr Pro Asn Gly Val Thr Val Thr Asp Gly Ser Gin Asp Glu 145 150 155 160
Leu Thr Tyr Glu Trp Val Glu Phe Glu Leu Pro Glu Gly Asn Phe Ser 165 170 175 Val Thr Met Thr Ile Asp Leu Met Asn Asn Ala Ile Ile Asp Asn Tyr 180 185 190
Leu Ala Val Gly Arg Gin Asn Gly Val Leu Glu Ser Asp Ile Gly Val 195 200 205
Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro Val Thr Glu 210 215 220
Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His Pro Asp Ile 225 230 235 240
Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Glu Ser Arg Leu Ser 245 250 255
Asn Leu Leu Gly Ile Arg Lys Arg Gin Pro Phe Gin Glu Gly Phe Gin 260 265 270
Ile Leu Tyr Glu Asp Leu Glu Gly Gly Asn Ile Pro Ala Leu Leu Asp 275 280 285
Val Asp Ala Tyr Glu Lys Ser Lys Glu Asp Ser Ala Ala Ala Ala Thr 290 295 300
Ala Ala Val Ala Thr Ala Ser Thr Glu Val Arg Gly Asp Asn Phe Ala 305 310 315 320
Ser Ala Ala Thr Leu Ala Ala Ala Glu Ala Ala Glu Thr Glu Ser Lys 325 330 335
Ile Val Ile Gin Pro Val Glu Lys Asp Ser Lys Glu Arg Ser Tyr Asn 340 345 350
Val Leu Ala Asp Lys Lys Asn Thr Ala Tyr Arg Ser Trp Tyr Leu Ala 355 360 365
Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser Trp Thr Leu Leu 370 375 380
Thr Thr Ser Asp Val Thr Cys Gly Val Glu Gin Val Tyr Trp Ser Leu 385 390 395 400
Pro Asp Met Met Gin Asp Pro Val Thr Phe Arg Ser Thr Arg Gin Val 405 410 415
Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Leu Pro Val Tyr Ser Lys 420 425 430 Ser Phe Phe Asn Glu Gin Ala Val Tyr Ser Gin Gin Leu Arg Ala Phe 435 440 445
Thr Ser Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn Gin Ile Leu 450 455 460
Val Arg Pro Pro Ala Pro Thr Ile Thr Thr Val Ser Glu Asn Val Pro 465 470 475 480
Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser Ser Ile Arg Gly 485 490 495
Val Gin Arg Val Thr Val Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr 500 505 510
Val Tyr Lys Ala Leu Gly Val Val Ala Pro Arg Val Leu Ser Ser Arg 515 520 525
Thr Phe
530
<210> 3
<211> 933
<212> PRT
<213> chimpanzee adenovirus serotype Pan5
<400> 3
Met Ala Thr Pro Ser Met Leu Pro Gin Trp Ala Tyr Met His Ile Ala 15 10 15
Gly Gin Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gin Phe Ala 20 25 30
Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro 35 40 45
Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gin Arg Leu 50 55 60
Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr 65 70 75 80
Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met 85 90 95 Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Ser 100 105 110
Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly 115 120 125
Ala Pro Asn Thr Cys Gin Trp Thr Tyr Lys Ala Asp Gly Asp Thr Gly 130 135 140
Thr Glu Lys Thr Tyr Thr Tyr Gly Asn Ala Pro Val Gin Gly Ile Ser 145 150 155 160
Ile Thr Lys Asp Gly Ile Gin Leu Gly Thr Asp Thr Asp Asp Gin Pro 165 170 175
Ile Tyr Ala Asp Lys Thr Tyr Gin Pro Glu Pro Gin Val Gly Asp Ala 180 185 190
Glu Trp His Asp Ile Thr Gly Thr Asp Glu Lys Tyr Gly Gly Arg Ala 195 200 205
Leu Lys Pro Asp Thr Lys Met Lys Pro Cys Tyr Gly Ser Phe Ala Lys 210 215 220
Pro Thr Asn Lys Glu Gly Gly Gin Ala Asn Val Lys Thr Glu Thr Gly 225 230 235 240
Gly Thr Lys Glu Tyr Asp Ile Asp Met Ala Phe Phe Asp Asn Arg Ser 245 250 255
Ala Ala Ala Ala Gly Leu Ala Pro Glu Ile Val Leu Tyr Thr Glu Asn 260 265 270
Val Asp Leu Glu Thr Pro Asp Thr His Ile Val Tyr Lys Ala Gly Thr 275 280 285
Asp Asp Ser Ser Ser Ser Ile Asn Leu Gly Gin Gin Ser Met Pro Asn 290 295 300
Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr 305 310 315 320
Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gin Ala Ser Gin 325 330 335
Leu Asn Ala Val Val Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser Tyr 340 345 350 Gin Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met 355 360 365
Trp Asn Gin Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu 370 375 380
Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asp 385 390 395 400
Ala Val Gly Arg Thr Asp Thr Tyr Gin Gly Ile Lys Ala Asn Gly Ala 405 410 415
Asp Gin Thr Thr Trp Thr Lys Asp Asp Thr Val Asn Asp Ala Asn Glu 420 425 430
Leu Gly Lys Gly Asn Pro Phe Ala Met Glu Ile Asn Ile Gin Ala Asn 435 440 445
Leu Trp Arg Asn Phe Leu Tyr Ala Asn Val Ala Leu Tyr Leu Pro Asp 450 455 460
Ser Tyr Lys Tyr Thr Pro Ala Asn Ile Thr Leu Pro Thr Asn Thr Asn 465 470 475 480
Thr Tyr Asp Tyr Met Asn Gly Arg Val Val Ala Pro Ser Leu Val Asp 485 490 495
Ala Tyr Ile Asn Ile Gly Ala Arg Trp Ser Leu Asp Pro Met Asp Asn 500 505 510
Val Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser 515 520 525
Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His Ile Gin Val Pro 530 535 540
Gin Lys Phe Phe Ala Ile Lys Ser Leu Leu Leu Leu Pro Gly Ser Tyr 545 550 555 560
Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met Ile Leu Gin Ser 565 570 575
Ser Leu Gly Asn Asp Leu Arg Thr Asp Gly Ala Ser Ile Ala Phe Thr 580 585 590
Ser Ile Asn Leu Tyr Ala Thr Phe Phe Pro Met Ala His Asn Thr Ala 595 600 605 Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp Gin Ser Phe 610 615 620
Asn Asp Tyr Leu Ser Ala Ala Asn Met Leu Tyr Pro Ile Pro Ala Asn 625 630 635 640
Ala Thr Asn Val Pro Ile Ser Ile Pro Ser Arg Asn Trp Ala Ala Phe 645 650 655
Arg Gly Trp Ser Phe Thr Arg Leu Lys Thr Arg Glu Thr Pro Ser Leu 660 665 670
Gly Ser Gly Phe Asp Pro Tyr Phe Val Tyr Ser Gly Ser Ile Pro Tyr 675 680 685
Leu Asp Gly Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser Ile 690 695 700
Thr Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr 705 710 715 720
Pro Asn Glu Phe Glu Ile Lys Arg Thr Val Asp Gly Glu Gly Tyr Asn 725 730 735
Val Ala Gin Cys Asn Met Thr Lys Asp Trp Phe Leu Val Gin Met Leu 740 745 750
Ala His Tyr Asn Ile Gly Tyr Gin Gly Phe Tyr Val Pro Glu Gly Tyr 755 760 765
Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gin Pro Met Ser Arg 770 775 780
Gin Val Val Asp Glu Val Asn Tyr Lys Asp Tyr Gin Ala Val Thr Leu 785 790 795 800
Ala Tyr Gin His Asn Asn Ser Gly Phe Val Gly Tyr Leu Ala Pro Thr 805 810 815
Met Arg Gin Gly Gin Pro Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu Ile 820 825 830
Gly Lys Ser Ala Val Ala Ser Val Thr Gin Lys Lys Phe Leu Cys Asp 835 840 845
Arg Val Met Trp Arg Ile Pro Phe Ser Ser Asn Phe Met Ser Met Gly 850 855 860 Ala Leu Thr Asp Leu Gly Gin Asn Met Leu Tyr Ala Asn Ser Ala His 865 870 875 880
Ala Leu Asp Met Asn Phe Glu Val Asp Pro Met Asp Glu Ser Thr Leu 885 890 895
Leu Tyr Val Val Phe Glu Val Phe Asp Val Val Arg Val His Gin Pro 900 905 910
His Arg Gly Val Ile Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala 915 920 925
Gly Asn Ala Thr Thr
930
<210> 4
<211> 445
<212> PRT
<213> chimpanzee adenovirus serotype Pan5
<400> 4
Met Ser Lys Lys Arg Val Arg Val Asp Asp Asp Phe Asp Pro Val Tyr 15 10 15
Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val Pro Phe Ile Asn Pro Pro 20 25 30
Phe Val Ser Ser Asp Gly Phe Gin Glu Lys Pro Leu Gly Val Leu Ser 35 40 45
Leu Arg Leu Ala Asp Pro Val Thr Thr Lys Asn Gly Glu Ile Thr Leu 50 55 60
Lys Leu Gly Asp Gly Val Asp Leu Asp Ser Ser Gly Lys Leu Ile Ser 65 70 75 80
Asn Thr Ala Thr Lys Ala Ala Ala Pro Leu Ser Phe Ser Asn Asn Thr 85 90 95
Ile Ser Leu Asn Met Asp Thr Pro Phe Tyr Asn Asn Asn Gly Lys Leu 100 105 110
Gly Met Lys Val Thr Ala Pro Leu Lys Ile Leu Asp Thr Asp Leu Leu 115 120 125 Lys Thr Leu Val Val Ala Tyr Gly Gin Gly Leu Gly Thr Asn Thr Thr 130 135 140
Gly Ala Leu Val Ala Gin Leu Ala Ser Pro Leu Ala Phe Asp Ser Asn 145 150 155 160
Ser Lys Ile Ala Leu Asn Leu Gly Asn Gly Pro Leu Lys Val Asp Ala 165 170 175
Asn Arg Leu Asn Ile Asn Cys Asn Arg Gly Leu Tyr Val Thr Thr Thr 180 185 190
Lys Asp Ala Leu Glu Ala Asn Ile Ser Trp Ala Asn Ala Met Thr Phe 195 200 205
Ile Gly Asn Ala Met Gly Val Asn Ile Asp Thr Gin Lys Gly Leu Gin 210 215 220
Phe Gly Thr Thr Ser Thr Val Ala Asp Val Lys Asn Ala Tyr Pro Ile 225 230 235 240
Gin Ile Lys Leu Gly Ala Gly Leu Thr Phe Asp Ser Thr Gly Ala Ile 245 250 255
Val Ala Trp Asn Lys Asp Asp Asp Lys Leu Thr Leu Trp Thr Thr Ala 260 265 270
Asp Pro Ser Pro Asn Cys His Ile Tyr Ser Glu Lys Asp Ala Lys Leu 275 280 285
Thr Leu Cys Leu Thr Lys Cys Gly Ser Gin Ile Leu Gly Thr Val Ser 290 295 300
Leu Ile Ala Val Asp Thr Gly Ser Leu Asn Pro Ile Thr Gly Thr Val 305 310 315 320
Thr Thr Ala Leu Val Ser Leu Lys Phe Asp Ala Asn Gly Val Leu Gin 325 330 335
Ser Ser Ser Thr Leu Asp Ser Asp Tyr Trp Asn Phe Arg Gin Gly Asp 340 345 350
Val Thr Pro Ala Glu Ala Tyr Thr Asn Ala Ile Gly Phe Met Pro Asn 355 360 365
Leu Lys Ala Tyr Pro Lys Asn Thr Ser Gly Ala Ala Lys Ser His Ile 370 375 380 Val Gly Lys Val Tyr Leu His Gly Asp Thr Gly Lys Pro Leu Asp Leu 385 390 395 400
Ile Ile Thr Phe Asn Glu Thr Ser Asp Glu Ser Cys Thr Tyr Cys Ile 405 410 415
Asn Phe Gin Trp Gin Trp Gly Ala Asp Gin Tyr Lys Asn Glu Thr Leu 420 425 430
Ala Val Ser Ser Phe Thr Phe Ser Tyr Ile Ala Lys Glu
435 440 445
<210> 5
<211> 36604
<212> DNA
<213> chimpanzee adenovirus serotype Pan6
<220>
<221> CDS
<222> (13878)..(15467)
<223> L2 Penton
<220>
<221> CDS
<222> (18284)..(21112)
<223> L3 Hexon
<220>
<221> CDS
<222> (32162)..(33493)
<223> L5 Fiber
<400> 5
catcatcaat aatatacctc aaacttttgg tgcgcgttaa tatgcaaatg agctgtttga 60 atttggggag ggaggaaggt gattggctgc gggagcggcg accgttaggg gcggggcggg 120 tgacgttttg atgacgtggc tatgaggcgg agccggtttg caagttctcg tgggaaaagt 180 gacgtcaaac gaggtgtggt ttgaacacgg aaatactcaa ttttcccgcg ctctctgaca 240 ggaaatgagg tgtttctggg cggatgcaag tgaaaacggg ccattttcgc gcgaaaactg 300 aatgaggaag tgaaaatctg agtaatttcg cgtttatggc agggaggagt atttgccgag 360 ggccgagtag actttgaccg attacgtggg ggtttcgatt accgtatttt tcacctaaat 420 ttccgcgtac ggtgtcaaag tccggtgttt ttacgtaggc gtcagctgat cgccagggta 480 tttaaacctg cgctctctag tcaagaggcc actcttgagt gccagcgagt agagttttct 540 cctccgcgcc gcgagtcaga tctacacttt gaaagatgag gcacctgaga gacctgcccg 600 gtaatgtttt cctggctact gggaacgaga ttctggaatt ggtggtggac gccatgatgg 660 gtgacgaccc tccagagccc cctaccccat ttgaggcgcc ttcgctgtac gatttgtatg 720 atctggaggt ggatgtgccc gagagcgacc ctaacgagga ggcggtgaat gatttgttta 780 gcgatgccgc gctgctggct gccgagcagg ctaatacgga ctctggctca gacagcgatt 840 cctctctcca taccccgaga cccggcagag gtgagaaaaa gatccccgag cttaaagggg 900 aagagctcga cctgcgctgc tatgaggaat gcttgcctcc gagcgatgat gaggaggacg 960 aggaggcgat tcgagctgcg gtgaaccagg gagtgaaaac tgcgggcgag agctttagcc 1020 tggactgtcc tactctgccc ggacacggct gtaagtcttg tgaatttcat cgcatgaata 1080 ctggagataa gaatgtgatg tgtgccctgt gctatatgag agcttacaac cattgtgttt 1140 acagtaagtg tgattaactt tagttgggaa ggcagagggt gactgggtgc tgactggttt 1200 atttatgtat atgttttttt atgtgtaggt cccgtctctg acgtagatga gacccccact 1260 tcagagtgca tttcatcacc cccagaaatt ggcgaggaac cgcccgaaga tattattcat 1320 agaccagttg cagtgagagt caccgggcgg agagcagctg tggagagttt ggatgacttg 1380 ctacagggtg gggatgaacc tttggacttg tgtacccgga aacgccccag gcactaagtg 1440 ccacacatgt gtgtttactt aaggtgatgt cagtatttat agggtgtgga gtgcaataaa 1500 atccgtgttg actttaagtg cgtgttttat gactcagggg tggggactgt gggtatataa 1560 gcaggtgcag acctgtgtgg tcagttcaga gcaggactca tggagatctg gactgtcttg 1620 gaagactttc accagactag acagttgcta gagaactcat cggagggagt ctcttacctg 1680 tggagattct gcttcggtgg gcctctagct aagctagtct atagggccaa acaggattat 1740 aaggaacaat ttgaggatat tttgagagag tgtcctggta tttttgactc tctcaacttg 1800 ggccatcagt ctcactttaa ccagagtatt ctgagagccc ttgacttttc tactcctggc 1860 agaactaccg ccgcggtagc cttttttgcc tttattcttg acaaatggag tcaagaaacc 1920 catttcagca gggattaccg tctggactgc ttagcagtag ctttgtggag aacatggagg 1980 tgccagcgcc tgaatgcaat ctccggctac ttgccagtac agccggtaga cacgctgagg 2040 atcctgagtc tccagtcacc ccaggaacac caacgccgcc agcagccgca gcaggagcag 2100 cagcaagagg aggaccgaga agagaacccg agagccggtc tggaccctcc ggtggcggag 2160 gaggaggagt agctgacttg tttcccgagc tgcgccgggt gctgactagg tcttccagtg 2220 gacgggagag ggggattaag cgggagaggc atgaggagac tagccacaga actgaactga 2280 ctgtcagtct gatgagccgc aggcgcccag aatcggtgtg gtggcatgag gtgcagtcgc 2340 aggggataga tgaggtctcg gtgatgcatg agaaatattc cctagaacaa gtcaagactt 2400 gttggttgga gcccgaggat gattgggagg tagccatcag gaattatgcc aagctggctc 2460 tgaagccaga caagaagtac aagattacca aactgattaa tatcagaaat tcctgctaca 2520 tttcagggaa tggggccgag gtggagatca gtacccagga gagggtggcc ttcagatgtt 2580 gtatgatgaa tatgtacccg ggggtggtgg gcatggaggg agtcaccttt atgaacacga 2640 ggttcagggg tgatgggtat aatggggtgg tctttatggc caacaccaag ctgacagtgc 2700 acggatgctc cttctttggc ttcaataaca tgtgcatcga ggcctggggc agtgtttcag 2760 tgaggggatg cagcttttca gccaactgga tgggggtcgt gggcagaacc aagagcaagg 2820 tgtcagtgaa gaaatgcctg ttcgagaggt gccacctggg ggtgatgagc gagggcgaag 2880 ccaaagtcaa acactgcgcc tctaccgaga cgggctgctt tgtgctgatc aagggcaatg 2940 cccaagtcaa gcataacatg atctgtgggg cctcggatga gcgcggctac cagatgctga 3000 cctgcgccgg tgggaacagc catatgctgg ccaccgtgca tgtggcctcg cacccccgca 3060 agacatggcc cgagttcgag cacaacgtca tgacccgctg caatgtgcac ctgggctccc 3120 gccgaggcat gttcatgccc taccagtgca acatgcaatt tgtgaaggtg ctgctggagc 3180 ccgatgccat gtccagagtg agcctgacgg gggtgtttga catgaatgtg gagctgtgga 3240 aaattctgag atatgatgaa tccaagacca ggtgccgggc ctgcgaatgc ggaggcaagc 3300 acgccaggct tcagcccgtg tgtgtggagg tgacggagga cctgcgaccc gatcatttgg 3360 tgttgtcctg caacgggacg gagttcggct ccagcgggga agaatctgac tagagtgagt 3420 agtgtttggg gctgggtgtg agcctgcatg aggggcagaa tgactaaaat ctgtggtttt 3480 ctgtgtgttg cagcagcatg agcggaagcg cctcctttga gggaggggta ttcagccctt 3540 atctgacggg gcgtctcccc tcctgggcgg gagtgcgtca gaatgtgatg ggatccacgg 3600 tggacggccg gcccgtgcag cccgcgaact cttcaaccct gacctacgcg accctgagct 3660 cctcgtccgt ggacgcagct gccgccgcag ctgctgcttc cgccgccagc gccgtgcgcg 3720 gaatggccct gggcgccggc tactacagct ctctggtggc caactcgagt tccaccaata 3780 atcccgccag cctgaacgag gagaagctgc tgctgctgat ggcccagctc gaggccctga 3840 cccagcgcct gggcgagctg acccagcagg tggctcagct gcaggcggag acgcgggccg 3900 cggttgccac ggtgaaaacc aaataaaaaa tgaatcaata aataaacgga gacggttgtt 3960 gattttaaca cagagtcttg aatctttatt tgatttttcg cgcgcggtag gccctggacc 4020 accggtctcg atcattgagc acccggtgga tcttttccag gacccggtag aggtgggctt 4080 ggatgttgag gtacatgggc atgagcccgt cccgggggtg gaggtagctc cattgcaggg 4140 cctcgtgctc ggggatggtg ttgtaaatca cccagtcata gcaggggcgc agggcgtggt 4200 gctgcacgat gtccttgagg aggagactga tggccacggg cagccccttg gtgtaggtgt 4260 tgacgaacct gttgagctgg gagggatgca tgcgggggga gatgagatgc atcttggcct 4320 ggatcttgag attggcgatg ttcccgccca gatcccgccg ggggttcatg ttgtgcagga 4380 ccaccagcac ggtgtatccg gtgcacttgg ggaatttgtc atgcaacttg gaagggaagg 4440 cgtgaaagaa tttggagacg cccttgtgac cgcccaggtt ttccatgcac tcatccatga 4500 tgatggcgat gggcccgtgg gcggcggcct gggcaaagac gtttcggggg tcggacacat 4560 cgtagttgtg gtcctgggtg agctcgtcat aggccatttt aatgaatttg gggcggaggg 4620 tgcccgactg ggggacgaag gtgccctcga tcccgggggc gtagttgccc tcgcagatct 4680 gcatctccca ggccttgagc tcggaggggg ggatcatgtc cacctgcggg gcgatgaaaa 4740 aaacggtttc cggggcgggg gagatgagct gggccgaaag caggttccgg agcagctggg 4800 acttgccgca accggtgggg ccgtagatga ccccgatgac cggctgcagg tggtagttga 4860 gggagagaca gctgccgtcc tcgcggagga ggggggccac ctcgttcatc atctcgcgca 4920 catgcatgtt ctcgcgcacg agttccgcca ggaggcgctc gccccccagc gagaggagct 4980 cttgcagcga ggcgaagttt ttcagcggct tgagtccgtc ggccatgggc attttggaga 5040 gggtctgttg caagagttcc agacggtccc agagctcggt gatgtgctct agggcatctc 5100 gatccagcag acctcctcgt ttcgcgggtt ggggcgactg cgggagtagg gcaccaggcg 5160 atgggcgtcc agcgaggcca gggtccggtc cttccagggc cgcagggtcc gcgtcagcgt 5220 ggtctccgtc acggtgaagg ggtgcgcgcc gggctgggcg cttgcgaggg tgcgcttcag 5280 gctcatccgg ctggtcgaga accgctcccg gtcggcgccc tgcgcgtcgg ccaggtagca 5340 attgagcatg agttcgtagt tgagcgcctc ggccgcgtgg cccttggcgc ggagcttacc 5400 tttggaagtg tgtccgcaga cgggacagag gagggacttg agggcgtaga gcttgggggc 5460 gaggaagacg gactcggggg cgtaggcgtc cgcgccgcag ctggcgcaga cggtctcgca 5520 ctccacgagc caggtgaggt cggggcggtt ggggtcaaaa acgaggtttc ctccgtgctt 5580 tttgatgcgt ttcttacctc tggtctccat gagctcgtgt ccccgctggg tgacaaagag 5640 gctgtccgtg tccccgtaga ccgactttat gggccggtcc tcgagcgggg tgccgcggtc 5700 ctcgtcgtag aggaaccccg cccactccga gacgaaggcc cgggtccagg ccagcacgaa 5760 ggaggccacg tgggaggggt agcggtcgtt gtccaccagc gggtccacct tctccagggt 5820 atgcaagcac atgtccccct cgtccacatc caggaaggtg attggcttgt aagtgtaggc 5880 cacgtgaccg ggggtcccgg ccgggggggt ataaaagggg gcgggcccct gctcgtcctc 5940 actgtcttcc ggatcgctgt ccaggagcgc cagctgttgg ggtaggtatt ccctctcgaa 6000 ggcgggcatg acctcggcac tcaggttgtc agtttctaga aacgaggagg atttgatatt 6060 gacggtgccg ttggagacgc ctttcatgag cccctcgtcc atttggtcag aaaagacgat 6120 ctttttgttg tcgagcttgg tggcgaagga gccgtagagg gcgttggaga gcagcttggc 6180 gatggagcgc atggtctggt tcttttcctt gtcggcgcgc tccttggcgg cgatgttgag 6240 ctgcacgtac tcgcgcgcca cgcacttcca ttcggggaag acggtggtga gctcgtcggg 6300 cacgattctg acccgccagc cgcggttgtg cagggtgatg aggtccacgc tggtggccac 6360 ctcgccgcgc aggggctcgt tggtccagca gaggcgcccg cccttgcgcg agcagaaggg 6420 gggcagcggg tccagcatga gctcgtcggg ggggtcggcg tccacggtga agatgccggg 6480 caggagctcg gggtcgaagt agctgatgca ggtgcccaga ttgtccagcg ccgcttgcca 6540 gtcgcgcacg gccagcgcgc gctcgtaggg gctgaggggc gtgccccagg gcatggggtg 6600 cgtgagcgcg gaggcgtaca tgccgcagat gtcgtagacg tagaggggct cctcgaggac 6660 gccgatgtag gtggggtagc agcgcccccc gcggatgctg gcgcgcacgt agtcgtacag 6720 ctcgtgcgag ggcgcgagga gccccgtgcc gaggttggag cgttgcggct tttcggcgcg 6780 gtagacgatc tggcggaaga tggcgtggga gttggaggag atggtgggcc tttggaagat 6840 gttgaagtgg gcgtggggca ggccgaccga gtccctgatg aagtgggcgt aggagtcctg 6900 cagcttggcg acgagctcgg cggtgacgag gacgtccagg gcgcagtagt cgagggtctc 6960 ttggatgatg tcatacttga gctggccctt ctgcttccac agctcgcggt tgagaaggaa 7020 ctcttcgcgg tccttccagt actcttcgag ggggaacccg tcctgatcgg cacggtaaga 7080 gcccaccatg tagaactggt tgacggcctt gtaggcgcag cagcccttct ccacggggag 7140 ggcgtaagct tgcgcggcct tgcgcaggga ggtgtgggtg agggcgaagg tgtcgcgcac 7200 catgaccttg aggaactggt gcttgaagtc gaggtcgtcg cagccgccct gctcccagag 7260 ttggaagtcc gtgcgcttct tgtaggcggg gttaggcaaa gcgaaagtaa catcgttgaa 7320 gaggatcttg cccgcgcggg gcatgaagtt gcgagtgatg cggaaaggct ggggcacctc 7380 ggcccggttg ttgatgacct gggcggcgag gacgatctcg tcgaagccgt tgatgttgtg 7440 cccgacgatg tagagttcca cgaatcgcgg gcggcccttg acgtggggca gcttcttgag 7500 ctcgtcgtag gtgagctcgg cggggtcgct gagcccgtgc tgctcgaggg cccagtcggc 7560 gacgtggggg ttggcgctga ggaaggaagt ccagagatcc acggccaggg cggtctgcaa 7620 gcggtcccgg tactgacgga actgttggcc cacggccatt ttttcggggg tgacgcagta 7680 gaaggtgcgg gggtcgccgt gccagcggtc ccacttgagc tggagggcga ggtcgtgggc 7740 gagctcgacg agcggcgggt ccccggagag tttcatgacc agcatgaagg ggacgagctg 7800 cttgccgaag gaccccatcc aggtgtaggt ttccacatcg taggtgagga agagcctttc 7860 ggtgcgagga tgcgagccga tggggaagaa ctggatctcc tgccaccagt tggaggaatg 7920 gctgttgatg tgatggaagt agaaatgccg acggcgcgcc gagcactcgt gcttgtgttt 7980 atacaagcgt ccgcagtgct cgcaacgctg cacgggatgc acgtgctgca cgagctgtac 8040 ctgggttcct ttggcgagga atttcagtgg gcagtggagc gctggcggct gcatctcgtg 8100 ctgtactacg tcttggccat cggcgtggcc atcgtctgcc tcgatggtgg tcatgctgac 8160 gagcccgcgc gggaggcagg tccagacctc ggctcggacg ggtcggagag cgaggacgag 8220 ggcgcgcagg ccggagctgt ccagggtcct gagacgctgc ggagtcaggt cagtgggcag 8280 cggcggcgcg cggttgactt gcaggagctt ttccagggcg cgcgggaggt ccagatggta 8340 cttgatctcc acggcgccgt tggtggctac gtccacggct tgcagggtgc cgtgcccctg 8400 gggcgccacc accgtgcccc gtttcttctt gggcgctgct tccatgtcgg tcagaagcgg 8460 cggcgaggac gcgcgccggg cggcaggggc ggctcggggc ccggaggcag gggcggcagg 8520 ggcacgtcgg cgccgcgcgc gggcaggttc tggtactgcg cccggagaag actggcgtga 8580 gcgacgacgc gacggttgac gtcctggatc tgacgcctct gggtgaaggc cacgggaccc 8640 gtgagtttga acctgaaaga gagttcgaca gaatcaatct cggtatcgtt gacggcggcc 8700 tgccgcagga tctcttgcac gtcgcccgag ttgtcctggt aggcgatctc ggtcatgaac 8760 tgctcgatct cctcctcctg aaggtctccg cggccggcgc gctcgacggt ggccgcgagg 8820 tcgttggaga tgcggcccat gagctgcgag aaggcgttca tgccggcctc gttccagacg 8880 cggctgtaga ccacggctcc gtcggggtcg cgcgcgcgca tgaccacctg ggcgaggttg 8940 agctcgacgt ggcgcgtgaa gaccgcgtag ttgcagaggc gctggtagag gtagttgagc 9000 gtggtggcga tgtgctcggt gacgaagaag tacatgatcc agcggcggag cggcatctcg 9060 ctgacgtcgc ccagggcttc caagcgttcc atggcctcgt agaagtccac ggcgaagttg 9120 aaaaactggg agttgcgcgc cgagacggtc aactcctcct ccagaagacg gatgagctcg 9180 gcgatggtgg cgcgcacctc gcgctcgaag gccccggggg gctcctcttc catctcctcc 9240 tcttcctcct ccactaacat ctcttctact tcctcctcag gaggcggtgg cgggggaggg 9300 gccctgcgtc gccggcggcg cacgggcaga cggtcgatga agcgctcgat ggtctccccg 9360 cgccggcgac gcatggtctc ggtgacggcg cgcccgtcct cgcggggccg cagcatgaag 9420 acgccgccgc gcatctccag gtggccgccg ggggggtctc cgttgggcag ggagagggcg 9480 ctgacgatgc atcttatcaa ttgacccgta gggactccgc gcaaggacct gagcgtctcg 9540 agatccacgg gatccgaaaa ccgctgaacg aaggcttcga gccagtcgca gtcgcaaggt 9600 aggctgagcc cggtttcttg ttcttcgggt atttggtcgg gaggcgggcg ggcgatgctg 9660 ctggtgatga agttgaagta ggcggtcctg agacggcgga tggtggcgag gagcaccagg 9720 tccttgggcc cggcttgctg gatgcgcaga cggtcggcca tgccccaggc gtggtcctga 9780 cacctggcga ggtccttgta gtagtcctgc atgagccgct ccacgggcac ctcctcctcg 9840 cccgcgcggc cgtgcatgcg cgtgagcccg aacccgcgct gcggctggac gagcgccagg 9900 tcggcgacga cgcgctcggt gaggatggcc tgctggatct gggtgagggt ggtctggaag 9960 tcgtcgaagt cgacgaagcg gtggtaggct ccggtgttga tggtgtagga gcagttggcc 10020 atgacggacc agttgacggt ctggtggccg ggtcgcacga gctcgtggta cttgaggcgc 10080 gagtaggcgc gcgtgtcgaa gatgtagtcg ttgcaggcgc gcacgaggta ctggtatccg 10140 acgaggaagt gcggcggcgg ctggcggtag agcggccatc gctcggtggc gggggcgccg 10200 ggcgcgaggt cctcgagcat gaggcggtgg tagccgtaga tgtacctgga catccaggtg 10260 atgccggcgg cggtggtgga ggcgcgcggg aactcgcgga cgcggttcca gatgttgcgc 10320 agcggcagga agtagttcat ggtggccgcg gtctggcccg tgaggcgcgc gcagtcgtgg 10380 atgctctaga catacgggca aaaacgaaag cggtcagcgg ctcgactccg tggcctggag 10440 gctaagcgaa cgggttgggc tgcgcgtgta ccccggttcg aatctcgaat caggctggag 10500 ccgcagctaa cgtggtactg gcactcccgt ctcgacccaa gcctgctaac gaaacctcca 10560 ggatacggag gcgggtcgtt ttttggcctt ggtcgctggt catgaaaaac tagtaagcgc 10620 ggaaagcggc cgcccgcgat ggctcgctgc cgtagtctgg agaaagaatc gccagggttg 10680 cgttgcggtg tgccccggtt cgagcctcag cgctcggcgc cggccggatt ccgcggctaa 10740 cgtgggcgtg gctgccccgt cgtttccaag accccttagc cagccgactt ctccagttac 10800 ggagcgagcc cctctttttt tttcttgtgt ttttgccaga tgcatcccgt actgcggcag 10860 atgcgccccc accctccacc acaaccgccc ctaccgcagc agcagcaaca gccggcgctt 10920 ctgcccccgc cccagcagca gccagccact accgcggcgg ccgccgtgag cggagccggc 10980 gttcagtatg acctggcctt ggaagagggc gaggggctgg cgcggctggg ggcgtcgtcg 11040 ccggagcggc acccgcgcgt gcagatgaaa agggacgctc gcgaggccta cgtgcccaag 11100 cagaacctgt tcagagacag gagcggcgag gagcccgagg agatgcgcgc ctcccgcttc 11160 cacgcggggc gggagctgcg gcgcggcctg gaccgaaagc gggtgctgag ggacgaggat 11220 ttcgaggcgg acgagctgac ggggatcagc cccgcgcgcg cgcacgtggc cgcggccaac 11280 ctggtcacgg cgtacgagca gaccgtgaag gaggagagca acttccaaaa atccttcaac 11340 aaccacgtgc gcacgctgat cgcgcgcgag gaggtgaccc tgggcctgat gcacctgtgg 11400 gacctgctgg aggccatcgt gcagaacccc acgagcaagc cgctgacggc gcagctgttt 11460 ctggtggtgc agcacagtcg ggacaacgag acgttcaggg aggcgctgct gaatatcacc 11520 gagcccgagg gccgctggct cctggacctg gtgaacattt tgcagagcat cgtggtgcag 11580 gagcgcgggc tgccgctgtc cgagaagctg gcggccatca acttctcggt gctgagtctg 11640 ggcaagtact acgctaggaa gatctacaag accccgtacg tgcccataga caaggaggtg 11700 aagatcgacg ggttttacat gcgcatgacc ctgaaagtgc tgaccctgag cgacgatctg 11760 ggggtgtacc gcaacgacag gatgcaccgc gcggtgagcg ccagccgccg gcgcgagctg 11820 agcgaccagg agctgatgca cagcctgcag cgggccctga ccggggccgg gaccgagggg 11880 gagagctact ttgacatggg cgcggacctg cgctggcagc ccagccgccg ggccttggaa 11940 gctgccggcg gttcccccta cgtggaggag gtggacgatg aggaggagga gggcgagtac 12000 ctggaagact gatggcgcga ccgtattttt gctagatgca gcaacagcca ccgccgccgc 12060 ctcctgatcc cgcgatgcgg gcggcgctgc agagccagcc gtccggcatt aactcctcgg 12120 acgattggac ccaggccatg caacgcatca tggcgctgac gacccgcaat cccgaagcct 12180 ttagacagca gcctcaggcc aaccggctct cggccatcct ggaggccgtg gtgccctcgc 12240 gctcgaaccc cacgcacgag aaggtgctgg ccatcgtgaa cgcgctggtg gagaacaagg 12300 ccatccgcgg tgacgaggcc gggctggtgt acaacgcgct gctggagcgc gtggcccgct 12360 acaacagcac caacgtgcag acgaacctgg accgcatggt gaccgacgtg cgcgaggcgg 12420 tgtcgcagcg cgagcggttc caccgcgagt cgaacctggg ctccatggtg gcgctgaacg 12480 ccttcctgag cacgcagccc gccaacgtgc cccggggcca ggaggactac accaacttca 12540 tcagcgcgct gcggctgatg gtggccgagg tgccccagag cgaggtgtac cagtcggggc 12600 cggactactt cttccagacc agtcgccagg gcttgcagac cgtgaacctg agccaggctt 12660 tcaagaactt gcagggactg tggggcgtgc aggccccggt cggggaccgc gcgacggtgt 12720 cgagcctgct gacgccgaac tcgcgcctgc tgctgctgct ggtggcgccc ttcacggaca 12780 gcggcagcgt gagccgcgac tcgtacctgg gctacctgct taacctgtac cgcgaggcca 12840 tcggacaggc gcacgtggac gagcagacct accaggagat cacccacgtg agccgcgcgc 12900 tgggccagga ggacccgggc aacctggagg ccaccctgaa cttcctgctg accaaccggt 12960 cgcagaagat cccgccccag tacgcgctga gcaccgagga ggagcgcatc ctgcgctacg 13020 tgcagcagag cgtggggctg ttcctgatgc aggagggggc cacgcccagc gcggcgctcg 13080 acatgaccgc gcgcaacatg gagcccagca tgtacgcccg caaccgcccg ttcatcaata 13140 agctgatgga ctacttgcat cgggcggccg ccatgaactc ggactacttt accaacgcca 13200 tcttgaaccc gcactggctc ccgccgcccg ggttctacac gggcgagtac gacatgcccg 13260 accccaacga cgggttcctg tgggacgacg tggacagcag cgtgttctcg ccgcgtccag 13320 gaaccaatgc cgtgtggaag aaagagggcg gggaccggcg gccgtcctcg gcgctgtccg 13380 gtcgcgcggg tgctgccgcg gcggtgcccg aggccgccag ccccttcccg agcctgccct 13440 tttcgctgaa cagcgtgcgc agcagcgagc tgggtcggct gacgcgaccg cgcctgctgg 13500 gcgaggagga gtacctgaac gactccttgt tgaggcccga gcgcgagaag aacttcccca 13560 ataacgggat agagagcctg gtggacaaga tgagccgctg gaagacgtac gcgcacgagc 13620 acagggacga gccccgagct agcagcgcag gcacccgtag acgccagcgg cacgacaggc 13680 agcggggact ggtgtgggac gatgaggatt ccgccgacga cagcagcgtg ttggacttgg 13740 gtgggagtgg tggtaacccg ttcgctcacc tgcgcccccg tatcgggcgc ctgatgtaag 13800 aatctgaaaa aataaaagac ggtactcacc aaggccatgg cgaccagcgt gcgttcttct 13860 ctgttgtttg tagtagt atg atg agg cgc gtg tac ccg gag ggt cct cct 13910
Met Met Arg Arg Val Tyr Pro Glu Gly Pro Pro 15 10
ccc tcg tac gag age gtg atg cag cag gcg gtg gcg gcg gcg atg cag 13958 Pro Ser Tyr Glu Ser Val Met Gin Gin Ala Val Ala Ala Ala Met Gin 15 20 25
ccc ccg etg gag gcg cct tac gtg ccc ccg egg tac etg gcg cct acg 14006 Pro Pro Leu Glu Ala Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr 30 35 40
gag ggg egg aac age att cgt tac tcg gag etg gca ccc ttg tac gat 14054 Glu Gly Arg Asn Ser Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp 45 50 55
acc acc egg ttg tac etg gtg gac aac aag tcg gca gac atc gcc tcg 14102 Thr Thr Arg Leu Tyr Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser 60 65 70 75
etg aac tac cag aac gac cac age aac ttc etg acc acc gtg gtg cag 14150 Leu Asn Tyr Gin Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gin 80 85 90
aac aac gat ttc acc ccc acg gag gcc age acc cag acc atc aac ttt 14198 Asn Asn Asp Phe Thr Pro Thr Glu Ala Ser Thr Gin Thr Ile Asn Phe 95 100 105
gac gag cgc tcg egg tgg ggc ggc cag etg aaa acc atc atg cac acc 14246 Asp Glu Arg Ser Arg Trp Gly Gly Gin Leu Lys Thr Ile Met His Thr 110 115 120
aac atg ccc aac gtg aac gag ttc atg tac age aac aag ttc aag gcg 14294 Asn Met Pro Asn Val Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala 125 130 135
egg gtg atg gtc tcg cgc aag acc ccc aac ggg gtg gat gat gat tat 14342 Arg Val Met Val Ser Arg Lys Thr Pro Asn Gly Val Asp Asp Asp Tyr 140 145 150 155
gat ggt agt cag gac gag etg acc tac gag tgg gtg gag ttt gag etg 14390 Asp Gly Ser Gin Asp Glu Leu Thr Tyr Glu Trp Val Glu Phe Glu Leu 160 165 170
ccc gag ggc aac ttc tcg gtg acc atg acc atc gat etg atg aac aac 14438 Pro Glu Gly Asn Phe Ser Val Thr Met Thr Ile Asp Leu Met Asn Asn 175 180 185
gcc atc atc gac aac tac ttg gcg gtg ggg egg cag aac ggg gtg etg 14486 Ala Ile Ile Asp Asn Tyr Leu Ala Val Gly Arg Gin Asn Gly Val Leu 190 195 200
gag age gac atc ggc gtg aag ttc gac acg cgc aac ttc egg etg ggc 14534 Glu Ser Asp Ile Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly 205 210 215
tgg gac ccc gtg acc gag etg gtg atg ccg ggc gtg tac acc aac gag 14582 Trp Asp Pro Val Thr Glu Leu Val Met Pro Gly Val Tyr Thr Asn Glu 220 225 230 235
gcc ttc cac ccc gac atc gtc etg etg ccc ggc tgc ggc gtg gac ttc 14630 Ala Phe His Pro Asp Ile Val Leu Leu Pro Gly Cys Gly Val Asp Phe 240 245 250
acc gag age cgc etc age aac etg etg ggc atc cgc aag egg cag ccc 14678 Thr Glu Ser Arg Leu Ser Asn Leu Leu Gly Ile Arg Lys Arg Gin Pro 255 260 265
ttc cag gag ggc ttc cag atc etg tac gag gac etg gag ggg ggc aac 14726 Phe Gin Glu Gly Phe Gin Ile Leu Tyr Glu Asp Leu Glu Gly Gly Asn 270 275 280
atc ccc gcg etc ttg gat gtc gaa gcc tac gag aaa age aag gag gat 14774 Ile Pro Ala Leu Leu Asp Val Glu Ala Tyr Glu Lys Ser Lys Glu Asp 285 290 295
age acc gcc gcg gcg acc gca gcc gtg gcc acc gcc tet acc gag gtg 14822 Ser Thr Ala Ala Ala Thr Ala Ala Val Ala Thr Ala Ser Thr Glu Val 300 305 310 315
c<3999°3 at aat ttt get age get gcg gca gcg gcc gag gcg get gaa 14870 Arg Gly Asp Asn Phe Ala Ser Ala Ala Ala Ala Ala Glu Ala Ala Glu 320 325 330
acc gaa agt aag ata gtc atc cag ccg gtg gag aag gac age aag gac 14918 Thr Glu Ser Lys Ile Val Ile Gin Pro Val Glu Lys Asp Ser Lys Asp 335 340 345
agg age tac aac gtg etc gcg gac aag aaa aac acc gcc tac cgc age 14966 Arg Ser Tyr Asn Val Leu Ala Asp Lys Lys Asn Thr Ala Tyr Arg Ser 350 355 360
tgg tac etg gcc tac aac tac ggc gac ccc gag aag ggc gtg cgc tcc 15014 Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser 365 370 375
tgg acg etg etc acc acc tcg gac gtc acc tgc ggc gtg gag caa gtc 15062 Trp Thr Leu Leu Thr Thr Ser Asp Val Thr Cys Gly Val Glu Gin Val 380 385 390 395
tac tgg tcg etg ccc gac atg atg caa gac ccg gtc acc ttc cgc tcc 15110 Tyr Trp Ser Leu Pro Asp Met Met Gin Asp Pro Val Thr Phe Arg Ser 400 405 410
acg cgt caa gtt age aac tac ccg gtg gtg ggc gcc gag etc etg ccc 15158 Thr Arg Gin Val Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Leu Pro 415 420 425
gtc tac tcc aag age ttc ttc aac gag cag gcc gtc tac tcg cag cag 15206 Val Tyr Ser Lys Ser Phe Phe Asn Glu Gin Ala Val Tyr Ser Gin Gin 430 435 440
etg cgc gcc ttc acc tcg etc acg cac gtc ttc aac cgc ttc ccc gag 15254 Leu Arg Ala Phe Thr Ser Leu Thr His Val Phe Asn Arg Phe Pro Glu 445 450 455
aac cag atc etc gtc cgc ccg ccc gcg ccc acc att acc acc gtc agt 15302 Asn Gin Ile Leu Val Arg Pro Pro Ala Pro Thr Ile Thr Thr Val Ser 460 465 470 475
gaa aac gtt cct get etc aca gat cac ggg acc etg ccg etg cgc age 15350 Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser 480 485 490
agt atc egg gga gtc cag cgc gtg acc gtc act gac gcc aga cgc cgc 15398 Ser Ile Arg Gly Val Gin Arg Val Thr Val Thr Asp Ala Arg Arg Arg 495 500 505
acc tgc ccc tac gtc tac aag gcc etg ggc gta gtc gcg ccg cgc gtc 15446 Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly Val Val Ala Pro Arg Val 510 515 520
etc tcg age cgc acc ttc taa aaaatgtcca ttctcatctc gcccagtaat 15497 Leu Ser Ser Arg Thr Phe
525
aacaceggtt ggggcctgcg cgcgcccagc aagatgtacg gaggegcteg ccaacgctcc 15557 acgcaacacc ccgtgcgcgt gcgcgggcac ttccgcgctc cctggggcgc cctcaagggc 15617 cgcgtgcgct cgcgcaccac cgtcgacgac gtgatcgacc aggtggtggc cgacgcgcgc 15677 aactacacgc ccgccgccgc gcccgtctcc acegtggaeg ccgtcatcga cagegtggtg 15737 gccgacgcgc gccggtacgc ccgcaccaag agccggcggc ggcgcatcgc ccggcggcac 15797 cggagcaccc ccgccatgcg cgcggcgcga gccttgctgc gcagggccag gegcaeggga 15857 cgcagggcca tgetcaggge ggccagacgc gcggcctccg gcagcagcag cgccggcagg 15917 acccgcagac gcgcggccac ggcggcggcg gcggccatcg ccagcatgtc ccgcccgcgg 15977 cgcggcaacg tgtactgggt gcgcgacgcc gccaccggtg tgcgcgtgcc cgtgcgcacc 16037 cgcccccctc gcacttgaag atgetgaett egegatgttg atgtgtccca geggegagga 16097 ggatgtccaa gegcaaatae aaggaagaga tgctccaggt catcgcgcct gagatctacg 16157 gccccgcggc ggeggtgaag gaggaaagaa agccccgcaa actgaagegg gtcaaaaagg 16217 acaaaaagga ggaggaagat gaeggactgg tggagtttgt gegegagtte gccccccggc 16277 ggcgcgtgca gtggcgcggg eggaaagtga aaceggtget gcggcccggc accaeggtgg 16337 tcttcacgcc eggegagegt tccggctccg cctccaagcg ctcctacgac gaggtgtacg 16397 gggacgagga catcctcgag caggeggteg agegtctggg egagtttgeg taeggcaage 16457 gcagccgccc cgcgcccttg aaagaggagg eggtgtceat cccgctggac cacggcaacc 16517 ccacgccgag cctgaagccg gtgaccctgc ageaggtget accgagcgcg gcgccgcgcc 16577 ggggcttcaa gegegaggge ggegaggate tgtacccgac catgcagctg atggtgccca 16637 agegccagaa getggaggae gtgctggagc acatgaaggt ggaccccgag gtgcagcccg 16697 aggtcaaggt gcggcccatc aagcaggtgg ccccgggcct gggcgtgcag acegtggaca 16757 tcaagatccc cacggagccc atggaaaege agaccgagcc cgtgaagccc agcaccagca 16817 ccatggaggt geagaeggat ccctggatgc cagcaccagc ttccaccagc actcgccgaa 16877 gaegcaagta cggcgcggcc agcctgctga tgcccaacta cgcgctgcat ccttccatca 16937 tccccacgcc gggctaccgc ggcacgcgct tctaccgcgg ctacaccagc agccgccgcc 16997 gcaagaccac cacccgccgc cgtcgtcgca gccgccgcag cagcaccgcg acttccgcct 17057 tggtgcggag agtgtatcgc agcgggcgcg agcctctgac cctgccgcgc gcgcgctacc 17117 acccgagcat cgccatttaa ctaccgcctc ctacttgcag atatggccct cacatgccgc 17177 ctccgcgtcc ccattacggg ctaccgagga agaaagccgc gccgtagaag gctgacgggg 17237 aacgggctgc gtcgccatca ccaccggcgg cggcgcgcca tcagcaagcg gttgggggga 17297 ggcttcctgc ccgcgctgat ccccatcatc gccgcggcga tcggggcgat ccccggcata 17357 gcttccgtgg cggtgcaggc ctctcagcgc cactgagaca caaaaaagca tggatttgta 17417 ataaaaaaaa aaatggactg acgctcctgg tcctgtgatg tgtgttttta gatggaagac 17477 atcaattttt cgtccctggc accgcgacac ggcacgcggc cgtttatggg cacctggagc 17537 gacatcggca acagccaact gaacgggggc gccttcaatt ggagcagtct ctggagcggg 17597 cttaagaatt tcgggtccac gctcaaaacc tatggcaaca aggcgtggaa cagcagcaca 17657 gggcaggcgc tgagggaaaa gctgaaagaa cagaacttcc agcagaaggt ggttgatggc 17717 ctggcctcag gcatcaacgg ggtggttgac ctggccaacc aggccgtgca gaaacagatc 17777 aacagccgcc tggacgcggt cccgcccgcg gggtccgtgg agatgcccca ggtggaggag 17837 gagctgcctc ccctggacaa gcgcggcgac aagcgaccgc gtcccgacgc ggaggagacg 17897 ctgctgacgc acacggacga gccgcccccg tacgaggagg cggtgaaact gggcctgccc 17957 accacgcggc ccgtggcgcc tctggccacc ggagtgctga aacccagcag cagccagccc 18017 gcgaccctgg acttgcctcc gcctcgcccc tccacagtgg ctaagcccct gccgccggtg 18077 gccgtcgcgt cgcgcgcccc ccgaggccgc ccccaggcga actggcagag cactctgaac 18137 agcatcgtgg gtctgggagt gcagagtgtg aagcgccgcc gctgctatta aaagacactg 18197 tagcgcttaa cttgcttgtc tgtgtgtata tgtatgtccg ccgaccagaa ggaggagtgt 18257 gaagaggcgc gtcgccgagt tgcaag atg gcc acc cca tcg atg etg ccc cag 18310
Met Ala Thr Pro Ser Met Leu Pro Gin 530 535
tgg gcg tac atg cac atc gcc gga cag gac get tcg gag tac etg agt 18358 Trp Ala Tyr Met His Ile Ala Gly Gin Asp Ala Ser Glu Tyr Leu Ser 540 545 550
ccg ggt etg gtg cag ttc gcc cgc gcc aca gac acc tac ttc agt etg 18406 Pro Gly Leu Val Gin Phe Ala Arg Ala Thr Asp Thr Tyr Phe Ser Leu 555 560 565 570
ggg aac aag ttt agg aac ccc acg gtg gcg ccc acg cac gat gtg acc 18454 Gly Asn Lys Phe Arg As n Pro Thr Val Ala Pro Thr His Asp Val Thr 575 580 585
acc gac cgc age cag egg etg acg etg cgc ttc gtg ccc gtg gac cgc 18502 Thr Asp Arg Ser Gin Arg Leu Thr Leu Arg Phe Val Pro Val Asp Arg 590 595 600
gag gac aac acc tac tcg tac aaa gtg cgc tac acg etg gcc gtg ggc 18550 Glu Asp Asn Thr Tyr Ser Tyr Lys Val Arg Tyr Thr Leu Ala Val Gly 605 610 615
gac aac cgc gtg etg gac atg gcc age acc tac ttt gac atc cgc ggc 18598 Asp Asn Arg Val Leu Asp Met Ala Ser Thr Tyr Phe Asp Ile Arg Gly 620 625 630
gtg etg gac egg ggc cct age ttc aaa ccc tac tet ggc acc gcc tac 18646 Val Leu Asp Arg Gly Pro Ser Phe Lys Pro Tyr Ser Gly Thr Ala Tyr 635 640 645 650
aac age eta get ccc aag gga get ccc aat tcc age cag tgg gag caa 18694 Asn Ser Leu Ala Pro Lys Gly Ala Pro Asn Ser Ser Gin Trp Glu Gin 655 660 665
gca aaa aca ggc aat ggg gga act atg gaa aca cac aca tat ggt gtg 18742 Ala Lys Thr Gly Asn Gly Gly Thr Met Glu Thr His Thr Tyr Gly Val 670 675 680
gcc cca atg ggc gga gag aat att aca aaa gat ggt ett caa att gga 18790 Ala Pro Met Gly Gly Glu Asn Ile Thr Lys Asp Gly Leu Gin Ile Gly 685 690 695
act gac gtt aca gcg aat cag aat aaa cca att tat gcc gac aaa aca 18838 Thr Asp Val Thr Ala Asn Gin Asn Lys Pro Ile Tyr Ala Asp Lys Thr 700 705 710
ttt caa cca gaa ccg caa gta gga gaa gaa aat tgg caa gaa act gaa 18886 Phe Gin Pro Glu Pro Gin Val Gly Glu Glu Asn Trp Gin Glu Thr Glu 715 720 725 730
aac ttt tat ggc ggt aga get ett aaa aaa gac aca aac atg aaa cct 18934 Asn Phe Tyr Gly Gly Arg Ala Leu Lys Lys Asp Thr Asn Met Lys Pro 735 740 745
tgc tat ggc tcc tat get aga ccc acc aat gaa aaa gga ggt caa get 18982 Cys Tyr Gly Ser Tyr Ala Arg Pro Thr Asn Glu Lys Gly Gly Gin Ala 750 755 760
aaa ett aaa gtt gga gat gat gga gtt cca acc aaa gaa ttc gac ata 19030 Lys Leu Lys Val Gly Asp Asp Gly Val Pro Thr Lys Glu Phe Asp Ile 765 770 775
gac etg get ttc ttt gat act ccc ggt ggc acc gtg aac ggt caa gac 19078 Asp Leu Ala Phe Phe Asp Thr Pro Gly Gly Thr Val Asn Gly Gin Asp 780 785 790
gag tat aaa gca gac att gtc atg tat acc gaa aac acg tat ttg gaa 19126 Glu Tyr Lys Ala Asp Ile Val Met Tyr Thr Glu Asn Thr Tyr Leu Glu 795 800 805 810
act cca gac acg eat gtg gta tac aaa cca ggc aag gat gat gca agt 19174 Thr Pro Asp Thr His Val Val Tyr Lys Pro Gly Lys Asp Asp Ala Ser 815 820 825
tet gaa att aac etg gtt cag cag tet atg ccc aac aga ccc aac tac 19222 Ser Glu Ile Asn Leu Val Gin Gin Ser Met Pro Asn Arg Pro Asn Tyr 830 835 840
att ggg ttc agg gac aac ttt atc ggt ett atg tac tac aac age act 19270 Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser Thr 845 850 855
ggc aat atg ggt gtg ett get ggt cag gcc tcc cag etg aat get gtg 19318 Gly Asn Met Gly Val Leu Ala Gly Gin Ala Ser Gin Leu Asn Ala Val 860 865 870
gtt gat ttg caa gac aga aac acc gag etg tcc tac cag etc ttg ett 19366 Val Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser Tyr Gin Leu Leu Leu 875 880 885 890
gac tet ttg ggt gac aga acc egg tat ttc agt atg tgg aac cag gcg 19414 Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gin Ala 895 900 905
gtg gac agt tat gac ccc gat gtg cgc atc atc gaa aac eat ggt gtg 19462 Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly Val 910 915 920
gag gat gaa ttg cca aac tat tgc ttc ccc ttg gac ggc tet ggc act 19510 Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asp Gly Ser Gly Thr 925 930 935
aac gcc gca tac caa ggt gtg aaa gta aaa gat ggt caa gat ggt gat 19558 Asn Ala Ala Tyr Gin Gly Val Lys Val Lys Asp Gly Gin Asp Gly Asp 940 945 950
gtt gag agt gaa tgg gaa aat gac gat act gtt gca get ega aat caa 19606 Val Glu Ser Glu Trp Glu Asn Asp Asp Thr Val Ala Ala Arg Asn Gin 955 960 965 970
tta tgt aaa ggt aac att ttc gcc atg gag att aat etc cag get aac 19654 Leu Cys Lys Gly Asn Ile Phe Ala Met Glu Ile Asn Leu Gin Ala Asn 975 980 985
etg tgg aga agt ttc etc tac tcg aac gtg gcc etg tac etg ccc gac 19702 Leu Trp Arg Ser Phe Leu Tyr Ser Asn Val Ala Leu Tyr Leu Pro Asp 990 995 1000
tcc tac aag tac acg ccg acc aac gtc acg etg ccg acc aac acc 19747 Ser Tyr Lys Tyr Thr Pro Thr Asn Val Thr Leu Pro Thr Asn Thr 1005 1010 1015
aac acc tac gat tac atg aat ggc aga gtg aca cct ccc tcg etg 19792 Asn Thr Tyr Asp Tyr Met Asn Gly Arg Val Thr Pro Pro Ser Leu 1020 1025 1030
gta gac gcc tac etc aac atc ggg gcg cgc tgg tcg etg gac ccc 19837 Val Asp Ala Tyr Leu Asn Ile Gly Ala Arg Trp Ser Leu Asp Pro 1035 1040 1045
atg gac aac gtc aac ccc ttc aac cac cac cgc aac gcg ggc etg 19882 Met Asp Asn Val Asn Pro Phe Asn His His Arg Asn Ala Gly Leu 1050 1055 1060
cgc tac cgc tcc atg etc etg ggc aac ggg cgc tac gtg ccc ttc 19927 Arg Tyr Arg Ser Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe 1065 1070 1075
cac atc cag gtg ccc caa aag ttt ttc gcc atc aag age etc etg 19972 His Ile Gin Val Pro Gin Lys Phe Phe Ala Ile Lys Ser Leu Leu 1080 1085 1090
etc etg ccc ggg tcc tac acc tac gag tgg aac ttc cgc aag gac 20017 Leu Leu Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp 1095 1100 1105
gtc aac atg atc etg cag age tcc eta ggc aac gac etg cgc acg 20062 Val Asn Met Ile Leu Gin Ser Ser Leu Gly Asn Asp Leu Arg Thr 1110 1115 1120
gac ggg gcc tcc atc gcc ttc acc age atc aac etc tac gcc acc 20107 Asp Gly Ala Ser Ile Ala Phe Thr Ser Ile Asn Leu Tyr Ala Thr 1125 1130 1135
ttc ttc ccc atg gcg cac aac acc gcc tcc acg etc gag gcc atg 20152 Phe Phe Pro Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met 1140 1145 1150
etg cgc aac gac acc aac gac cag tcc ttc aac gac tac etc tcg 20197 Leu Arg Asn Asp Thr Asn Asp Gin Ser Phe Asn Asp Tyr Leu Ser 1155 1160 1165
gcg gcc aac atg etc tac ccc atc ccg gcc aac gcc acc aac gtg 20242 Ala Ala Asn Met Leu Tyr Pro Ile Pro Ala Asn Ala Thr Asn Val 1170 1175 1180
ccc atc tcc atc ccc tcg cgc aac tgg gcc gcc ttc cgc gga tgg 20287 Pro Ile Ser Ile Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp 1185 1190 1195
tcc ttc acg cgc etg aag acc cgc gag acg ccc tcg etc ggc tcc 20332 Ser Phe Thr Arg Leu Lys Thr Arg Glu Thr Pro Ser Leu Gly Ser 1200 1205 1210
ggg ttc gac ccc tac ttc gtc tac tcg ggc tcc atc ccc tac eta 20377 Gly Phe Asp Pro Tyr Phe Val Tyr Ser Gly Ser Ile Pro Tyr Leu 1215 1220 1225
gac ggc acc ttc tac etc aac cac acc ttc aag aag gtc tcc atc 20422 Asp Gly Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser Ile 1230 1235 1240
acc ttc gac tcc tcc gtc age tgg ccc ggc aac gac cgc etc etg 20467 Thr Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu 1245 1250 1255
acg ccc aac gag ttc gaa atc aag cgc acc gtc gac gga gag gga 20512 Thr Pro Asn Glu Phe Glu Ile Lys Arg Thr Val Asp Gly Glu Gly 1260 1265 1270
tac aac gtg gcc cag tgc aac atg acc aag gac tgg ttc etg gtc 20557 Tyr Asn Val Ala Gin Cys Asn Met Thr Lys Asp Trp Phe Leu Val 1275 1280 1285
cag atg etg gcc cac tac aac atc ggc tac cag ggc ttc tac gtg 20602 Gin Met Leu Ala His Tyr Asn Ile Gly Tyr Gin Gly Phe Tyr Val 1290 1295 1300
ccc gag ggc tac aag gac cgc atg tac tcc ttc ttc cgc aac ttc 20647 Pro Glu Gly Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe 1305 1310 1315
cag ccc atg age cgc cag gtc gtg gac gag gtc aac tac aag gac 20692 Gin Pro Met Ser Arg Gin Val Val Asp Glu Val Asn Tyr Lys Asp 1320 1325 1330
tac cag gcc gtc acc etg gcc tac cag cac aac aac tcg ggc ttc 20737 Tyr Gin Ala Val Thr Leu Ala Tyr Gin His Asn Asn Ser Gly Phe 1335 1340 1345
gtc ggc tac etc gcg ccc acc atg cgc cag ggc cag ccc tac ccc 20782 Val Gly Tyr Leu Ala Pro Thr Met Arg Gin Gly Gin Pro Tyr Pro 1350 1355 1360
gcc aac tac ccc tac ccg etc atc ggc aag age gcc gtc gcc age 20827 Ala Asn Tyr Pro Tyr Pro Leu Ile Gly Lys Ser Ala Val Ala Ser 1365 1370 1375
gtc acc cag aaa aag ttc etc tgc gac egg gtc atg tgg cgc atc 20872 Val Thr Gin Lys Lys Phe Leu Cys Asp Arg Val Met Trp Arg Ile 1380 1385 1390
ccc ttc tcc age aac ttc atg tcc atg ggc gcg etc acc gac etc 20917 Pro Phe Ser Ser Asn Phe Met Ser Met Gly Ala Leu Thr Asp Leu 1395 1400 1405
ggc cag aac atg etc tac gcc aac tcc gcc cac gcg eta gac atg 20962 Gly Gin Asn Met Leu Tyr Ala Asn Ser Ala His Ala Leu Asp Met 1410 1415 1420
aat ttc gaa gtc gac ccc atg gat gag tcc acc ett etc tat gtt 21007 Asn Phe Glu Val Asp Pro Met Asp Glu Ser Thr Leu Leu Tyr Val 1425 1430 1435
gtc ttc gaa gtc ttc gac gtc gtc ega gtg cac cag ccc cac cgc 21052 Val Phe Glu Val Phe Asp Val Val Arg Val His Gin Pro His Arg 1440 1445 1450
ggc gtc atc gaa gcc gtc tac etg cgc acg ccc ttc tcg gcc ggc 21097 Gly Val Ile Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly 1455 1460 1465
aac gcc acc acc taa gccgctcttg ettcttgeaa gatgaeggeg ggctccggcg 21152 Asn Ala Thr Thr
1470
ageaggaget cagggccatc ctccgcgacc tgggctgegg gccctgcttc ctgggcacct 21212 tegacaageg cttccctgga ttcatggccc cgcacaagct ggcctgcgcc ategtgaaca 21272 cggccggccg egagaceggg ggegagcact ggctggcctt cgcctggaac ccgcgctccc 21332 aeacatgeta cctcttcgac cccttcgggt teteggaega gcgcctcaag cagatctacc 21392 agttcgagta egagggectg ctgcgtcgca gcgccctggc caccgaggac cgctgcgtca 21452 ccctggaaaa gtccacccag acegtgcagg gtccgcgctc ggccgcctgc gggetcttet 21512 gctgcatgtt cctgcacgcc ttegtgcact ggcccgaccg ccccatggac aagaacccca 21572 ccatgaactt actgaegggg gtgcccaacg gcatgctcca gtcgccccag gtggaaccca 21632 ccctgcgccg caaccaggaa gcgctctacc gcttcctcaa tgcccactcc gcctactttc 21692 gctcccaccg cgcgcgcatc gagaaggcca ccgccttcga eegcatgaat caagacatgt 21752 aaaaaacegg tgtgtgtatg tgaatgcttt attcataata aacagcacat gtttatgcca 21812 ccttctctga ggctctgact ttatttagaa atcgaagggg ttctgccggc teteggcatg 21872 gcccgcgggc agggataegt tgeggaaetg gtacttgggc agccacttga acteggggat 21932 cagcagcttg ggcaegggga ggteggggaa cgagtcgctc cacagcttgc gcgtgagttg 21992 cagggcgccc ageaggtegg gegeggagat cttgaaatcg cagttgggac ccgcgttctg 22052 cgcgcgagag ttgcggtaca cggggttgca gcactggaac accatcaggg ccgggtgctt 22112 cacgcttgcc agcaccgtcg cgtcggtgat gccctccacg tccagatcct cggcgttggc 22172 catcccgaag ggggtcatct tgcaggtctg ccgccccatg ctgggcacgc agccgggctt 22232 gtggttgcaa tcgcagtgca gggggatcag catcatctgg gcctgctcgg agctcatgcc 22292 cgggtacatg gccttcatga aagcctccag ctggcggaag gcctgctgcg ccttgccgcc 22352 ctcggtgaag aagaccccgc aggacttgct agagaactgg ttggtggcgc agccggcgtc 22412 gtgcacgcag cagcgcgcgt cgttgttggc cagctgcacc acgctgcgcc cccagcggtt 22472 ctgggtgatc ttggcccggt tggggttctc cttcagcgcg cgctgcccgt tctcgctcgc 22532 cacatccatc tcgatagtgt gctccttctg gatcatcacg gtcccgtgca ggcaccgcag 22592 cttgccctcg gcttcggtgc agccgtgcag ccacagcgcg cagccggtgc actcccagtt 22652 cttgtgggcg atctgggagt gcgagtgcac gaagccctgc aggaagcggc ccatcatcgc 22712 ggtcagggtc ttgttgctgg tgaaggtcag cgggatgccg cggtgctcct cgttcacata 22772 caggtggcag atgcggcggt acacctcgcc ctgctcgggc atcagctgga aggcggactt 22832 caggtcgctc tccacgcggt accggtccat cagcagcgtc atcacttcca tgcccttctc 22892 ccaggccgaa acgatcggca ggctcagggg gttcttcacc gccattgtca tcttagtcgc 22952 cgccgccgag gtcagggggt cgttctcgtc cagggtctca aacactcgct tgccgtcctt 23012 ctcgatgatg cgcacggggg gaaagctgaa gcccacggcc gccagctcct cctcggcctg 23072 cctttcgtcc tcgctgtcct ggctgatgtc ttgcaaaggc acatgcttgg tcttgcgggg 23132 tttctttttg ggcggcagag gcggcggcga tgtgctggga gagcgcgagt tctcgttcac 23192 cacgactatt tcttcttctt ggccgtcgtc cgagaccacg cggcggtagg catgcctctt 23252 ctggggcaga ggcggaggcg acgggctctc gcggttcggc gggcggctgg cagagcccct 23312 tccgcgttcg ggggtgcgct cctggcggcg ctgctctgac tgacttcctc cgcggccggc 23372 cattgtgttc tcctagggag caacaacaag catggagact cagccatcgt cgccaacatc 23432 gccatctgcc cccgccgcca ccgccgacga gaaccagcag cagaatgaaa gcttaaccgc 23492 cccgccgccc agccccacct ccgacgccgc ggccccagac atgcaagaga tggaggaatc 23552 catcgagatt gacctgggct acgtgacgcc cgcggagcac gaggaggagc tggcagcgcg 23612 cttttcagcc ccggaagaga accaccaaga gcagccagag caggaagcag agaacgagca 23672 gaaccaggct gggcacgagc atggcgacta cctgagcggg gcagaggacg tgctcatcaa 23732 gcatctggcc cgccaatgca tcatcgtcaa ggacgcgctg ctcgaccgcg ccgaggtgcc 23792 cctcagcgtg gcggagctca gccgcgccta cgagcgcaac ctcttctcgc cgcgcgtgcc 23852 ccccaagcgc cagcccaacg gcacctgtga gcccaacccg cgcctcaact tctacccggt 23912 cttcgcggtg cccgaggccc tggccaccta ccacctcttt ttcaagaacc aaaggatccc 23972 cgtctcctgc cgcgccaacc gcacccgcgc cgacgccctg ctcaacctgg gccccggcgc 24032 ccgcctacct gatatcacct ccttggaaga ggttcccaag atcttcgagg gtctgggcag 24092 cgacgagact cgggccgcga acgctctgca aggaagcgga gaggagcatg agcaccacag 24152 cgccctggtg gagttggaag gcgacaacgc gcgcctggcg gtcctcaagc gcacggtcga 24212 gctgacccac ttcgcctacc cggcgctcaa cctgcccccc aaggtcatga gcgccgtcat 24272 ggaccaggtg ctcatcaagc gcgcctcgcc cctctcggag gaggagatgc aggaccccga 24332 gagttcggac gagggcaagc ccgtggtcag cgacgagcag ctggcgcgct ggctgggagc 24392 gagtagcacc ccccagagcc tggaagagcg gcgcaagctc atgatggccg tggtcctggt 24452 gaccgtggag ctggagtgtc tgcgccgctt ctttgccgac gcggagaccc tgcgcaaggt 24512 cgaggagaac ctgcactacc tcttcaggca cgggttcgtg cgccaggcct gcaagatctc 24572 caacgtggag ctgaccaacc tggtctccta catgggcatc ctgcacgaga accgcctggg 24632 gcaaaacgtg ctgcacacca ccctgcgcgg ggaggcccgc cgcgactaca tccgcgactg 24692 cgtctacctg tacctctgcc acacctggca gacgggcatg ggcgtgtggc agcagtgcct 24752 ggaggagcag aacctgaaag agctctgcaa gctcctgcag aagaacctca aggccctgtg 24812 gaccgggttc gacgagcgta ccaccgcctc ggacctggcc gacctcatct tccccgagcg 24872 cctgcggctg acgctgcgca acgggctgcc cgactttatg agccaaagca tgttgcaaaa 24932 ctttcgctct ttcatcctcg aacgctccgg gatcctgccc gccacctgct ccgcgctgcc 24992 ctcggacttc gtgccgctga ccttccgcga gtgccccccg ccgctctgga gccactgcta 25052 cttgctgcgc ctggccaact acctggccta ccactcggac gtgatcgagg acgtcagcgg 25112 cgagggtctg ctggagtgcc actgccgctg caacctctgc acgccgcacc gctccctggc 25172 ctgcaacccc cagctgctga gcgagaccca gatcatcggc accttcgagt tgcaaggccc 25232 cggcgacggc gagggcaagg ggggtctgaa actcaccccg gggctgtgga cctcggccta 25292 cttgcgcaag ttcgtgcccg aggactacca tcccttcgag atcaggttct acgaggacca 25352 atcccagccg cccaaggccg agctgtcggc ctgcgtcatc acccaggggg ccatcctggc 25412 ccaattgcaa gccatccaga aatcccgcca agaatttctg ctgaaaaagg gccacggggt 25472 ctacttggac ccccagaccg gagaggagct caaccccagc ttcccccagg atgccccgag 25532 gaagcagcaa gaagctgaaa gtggagctgc cgccgccgga ggatttggag gaagactggg 25592 agagcagtca ggcagaggag gaggagatgg aagactggga cagcactcag gcagaggagg 25652 acagcctgca agacagtctg gaggaggaag acgaggtgga ggaggcagag gaagaagcag 25712 ccgccgccag accgtcgtcc tcggcggaga aagcaagcag cacggatacc atctccgctc 25772 cgggtcgggg tcgcggcggc cgggcccaca gtaggtggga egagaceggg cgcttcccga 25832 accccaccac ccagaccggt aagaaggagc ggcagggata caagtcctgg egggggcaca 25892 aaaacgccat cgtctcctgc ttgcaagcct gcgggggcaa catctccttc acccggcgct 25952 acctgctctt ccaccgcggg gtgaacttcc cccgcaacat cttgcattac taccgtcacc 26012 tccacagccc ctactactgt ttccaagaag aggcagaaac ccagcagcag cagaaaacca 26072 gcggcagcag cagctagaaa atccacagcg gcggcaggtg gactgaggat cgcggcgaac 26132 gagccggcgc agacccggga gctgaggaac cggatctttc ccaccctcta tgccatcttc 26192 cagcagagtc gggggcagga gcaggaactg aaagtcaaga accgttctct gcgctcgctc 26252 acccgcagtt gtctgtatca caagagcgaa gaccaacttc agcgcactct cgaggacgcc 26312 gaggctctct tcaacaagta ctgcgcgctc actcttaaag agtagcccgc gcccgcccac 26372 acacggaaaa aggcgggaat tacgtcacca cctgcgccct tcgcccgacc atcatgagca 26432 aagagattcc cacgccttac atgtggagct accagcccca gatgggcctg gccgccggcg 26492 ccgcccagga ctactccacc cgcatgaact ggctcagtgc cgggcccgcg atgatctcac 26552 gggtgaatga catccgcgcc caccgaaacc agatactcct agaacagtca gcgatcaccg 26612 ccacgccccg ccatcacctt aatccgcgta attggcccgc cgccctggtg taccaggaaa 26672 ttccccagcc cacgaccgta ctacttccgc gagacgccca ggccgaagtc cagctgacta 26732 actcaggtgt ccagctggcc ggcggcgccg ccctgtgtcg tcaccgcccc gctcagggta 26792 taaagcggct ggtgatccga ggcagaggca cacagctcaa cgacgaggtg gtgagctctt 26852 cgctgggtct gcgacctgac ggagtcttcc aactcgccgg atcggggaga tcttccttca 26912 cgcctcgtca ggccgtcctg actttggaga gttcgtcctc gcagccccgc tcgggcggca 26972 tcggcactct ccagttcgtg gaggagttca ctccctcggt ctacttcaac cccttctccg 27032 gctcccccgg ccactacccg gacgagttca tcccgaactt cgacgccatc agcgagtcgg 27092 tggacggcta cgattgaatg tcccatggtg gcgcagctga cctagctcgg cttcgacacc 27152 tggaccactg ccgccgcttc cgctgcttcg ctcgggatct cgccgagttt gcctactttg 27212 agctgcccga ggagcaccct cagggcccag cccacggagt gcggatcatc gtcgaagggg 27272 gcctcgactc ccacctgctt cggatcttca gccagcgacc gatcctggtc gagcgcgaac 27332 aaggacagac ccttcttact ttgtactgca tctgcaacca ccccggcctg catgaaagtc 27392 tttgttgtct gctgtgtact gagtataata aaagctgaga tcagcgacta ctccggactc 27452 gattgtggtg ttcctgctat caaccggtcc ctgttcttca ccgggaacga gaccgagctc 27512 cagctccagt gtaagcccca caagaagtac ctcacctggc tgttccaggg ctccccgatc 27572 gccgttgtca accactgcga caacgacgga gtcctgctga gcggccctgc caaccttact 27632 ttttccaccc gcagaagcaa gctccagctc ttccaaccct tcctccccgg gacctatcag 27692 tgcgtctcag gaccctgcca tcacaccttc cacctgatcc cgaataccac agcgccgctc 27752 cccgctacta acaaccaaac tacccaccaa cgccaccgtc gcgacctttc ctctgaatct 27812 aataccacta ccggaggtga gctccgaggt cgaccaacct ctgggattta ctacggcccc 27872 tgggaggtgg tggggttaat agcgctaggc ctagttgcgg gtgggctttt ggttctctgc 27932 tacctatacc tcccttgctg ttcgtactta gtggtgctgt gttgctggtt taagaaatgg 27992 ggaagatcac cctagtgagc tgcggtgcgc tggtggcggt gttgctttcg attgtgggac 28052 tgggcggcgc ggctgtagtg aaggagaagg ccgatccctg cttgcatttc aatcccaaca 28112 aatgccagct gagttttcag cccgatggca atcggtgcgc ggtactgatc aagtgcggat 28172 gggaatgcga gaacgtgaga atcgagtaca ataacaagac tcggaacaat actctcgcgt 28232 ccgtgtggca gcccggggac cccgagtggt acaccgtctc tgtccccggt gctgacggct 28292 ccccgcgcac cgtgaataat actttcattt ttgcgcacat gtgcaacacg gtcatgtgga 28352 tgagcaagca gtacgatatg tggcccccca cgaaggagaa catcgtggtc ttctccatcg 28412 cttacagcct gtgcacggcg ctaatcaccg ctatcgtgtg cctgagcatt cacatgctca 28472 tcgctattcg ccccagaaat aatgccgaga aagagaaaca gccataacac gttttttcac 28532 acaccttgtt tttacagaca atgcgtctgt taaatttttt aaacattgtg ctcagtattg 28592 cttatgcctc tggttatgca aacatacaga aaacccttta tgtaggatct gatggtacac 28652 tagagggtac ccaatcacaa gccaaggttg catggtattt ttatagaacc aacactgatc 28712 cagttaaact ttgtaagggt gaattgccgc gtacacataa aactccactt acatttagtt 28772 gcagcaataa taatcttaca cttttttcaa ttacaaaaca atatactggt acttattaca 28832 gtacaaactt tcatacagga caagataaat attatactgt taaggtagaa aatcctacca 28892 ctcctagaac taccaccacc accactactg caaagcccac tgtgaaaact acaactagga 28952 ccaccacaac tacagaaacc accaccagca caacacttgc tgcaactaca cacacacaca 29012 ctaagctaac cttacagacc actaatgatt tgatcgccct gctgcaaaag ggggataaca 29072 gcaccacttc caatgaggag atacccaaat ccatgattgg cattattgtt gctgtagtgg 29132 tgtgcatgtt gatcatcgcc ttgtgcatgg tgtactatgc cttctgctac agaaagcaca 29192 gactgaacga caagctggaa cacttactaa gtgttgaatt ttaatttttt agaaccatga 29252 agatcctagg cctttttagt ttttctatca ttacctctgc tctttgtgaa tcagtggata 29312 gagatgttac tattaccact ggttctaatt atacactgaa agggccaccc tcaggtatgc 29372 tttcgtggta ttgctatttt ggaactgaca ctgatcaaac tgaattatgc aattttcaaa 29432 aaggcaaaac ctcaaactct aaaatctcta attatcaatg caatggcact gatctgatac 29492 tactcaatgt cacgaaagca tatggtggca gttattattg ccctggacaa aacactgaag 29552 aaatgatttt ttacaaagtg gaagtggttg atcccactac accacccacc accacaacta 29612 ttcataccac acacacagaa caaacaccag aggcaacaga agcagagttg gccttccagg 29672 ttcacggaga ttcctttgct gtcaataccc ctacacccga tcagcggtgt ccggggccgc 29732 tagtcagcgg cattgtcggt gtgctttcgg gattagcagt cataatcatc tgcatgttca 29792 tttttgcttg ctgctataga aggctttacc gacaaaaatc agacccactg ctgaacctct 29852 atgtttaatt ttttccagag ccatgaaggc agttagcgct ctagtttttt gttctttgat 29912 tggcattgtt tttaatagta aaattaccag agttagcttt attaaacatg ttaatgtaac 29972 tgaaggagat aacatcacac tagcaggtgt agaaggtgct caaaacacca cctggacaaa 30032 ataccatcta ggatggagag atatttgcac ctggaatgta acttattatt gcataggagt 30092 taatcttacc attgttaacg ctaaccaatc tcagaatggg ttaattaaag gacagagtgt 30152 tagtgtgacc agtgatgggt actataccca gcatagtttt aactacaaca ttactgtcat 30212 accactgcct acgcctagcc cacctagcac taccacacag acaaccacat acagtacatc 30272 aaatcagcct accaccacta cagcagcaga ggttgccagc tcgtctgggg tccgagtggc 30332 atttttgatg ttggccccat ctagcagtcc cactgctagt accaatgagc agactactga 30392 atttttgtcc actgtcgaga gccacaccac agctacctcc agtgccttct ctagcaccgc 30452 caatctctcc tcgctttcct ctacaccaat cagccccgct actactccta gccccgctcc 30512 tcttcccact cccctgaagc aaacagacgg cggcatgcaa tggcagatca ccctgctcat 30572 tgtgatcggg ttggtcatcc tggccgtgtt gctctactac atcttctgcc gccgcattcc 30632 caacgcgcac cgcaagccgg cctacaagcc catcgttatc gggcagccgg agccgcttca 30692 ggtggaaggg ggtctaagga atcttctctt ctcttttaca gtatggtgat tgaactatga 30752 ttcctagaca attcttgatc actattctta tctgcctcct ccaagtctgt gccaccctcg 30812 ctctggtggc caacgccagt ccagactgta ttgggccctt cgcctcctac gtgctctttg 30872 ccttcgtcac ctgcatctgc tgctgtagca tagtctgcct gcttatcacc ttcttccagt 30932 tcattgactg gatctttgtg cgcatcgcct acctgcgcca ccacccccag taccgcgacc 30992 agcgagtggc gcagctgctc aggctcctct gataagcatg cgggctctgc tacttctcgc 31052 gcttctgctg ttagtgctcc cccgtcccgt cgacccccgg tcccccactc agtcccccga 31112 ggaggttcgc aaatgcaaat tccaagaacc ctggaaattc ctcaaatgct accgccaaaa 31172 atcagacatg catcccagct ggatcatgat cattgggatc gtgaacattc tggcctgcac 31232 cctcatctcc tttgtgattt acccctgctt tgactttggt tggaactcgc cagaggcgct 31292 ctatctcccg cctgaacctg acacaccacc acagcagcaa cctcaggcac acgcactacc 31352 accaccacag cctaggccac aatacatgcc catattagac tatgaggccg agccacagcg 31412 acccatgctc cccgctatta gttacttcaa tctaaccggc ggagatgact gacccactgg 31472 ccaataacaa cgtcaacgac cttctcctgg acatggacgg ccgcgcctcg gagcagcgac 31532 tcgcccaact tcgcattcgt cagcagcagg agagagccgt caaggagctg caggacggca 31592 tagccatcca ccagtgcaag agaggcatct tctgcctggt gaaacaggcc aagatctcct 31652 acgaggtcac ccagaccgac catcgcctct cctacgagct cctgcagcag cgccagaagt 31712 tcacctgcct ggtcggagtc aaccccatcg tcatcaccca gcagtcgggc gataccaagg 31772 ggtgcatcca ctgctcctgc gactcccccg actgcgtcca cactctgatc aagaccctct 31832 gcggcctccg cgacctcctc cccatgaact aatcaccccc ttatccagtg aaataaagat 31892 catattgatg atgatttaaa taaaaaaaat aatcatttga tttgaaataa agatacaatc 31952 atattgatga tttgagttta acaaaaataa agaatcactt acttgaaatc tgataccagg 32012 tctctgtcca tgttttctgc caacaccacc tcactcccct cttcccagct ctggtactgc 32072 aggccccggc gggctgcaaa cttcctccac acgctgaagg ggatgtcaaa ttcctcctgt 32132 ccctcaatct tcattttatc ttctatcag atg tcc aaa aag cgc gtc egg gtg 32185
Met Ser Lys Lys Arg Val Arg Val 1475
gat gat gac ttc gac ccc gtc tac ccc tac gat gca gac aac gca 32230 Asp Asp Asp Phe Asp Pro Val Tyr Pro Tyr Asp Ala Asp Asn Ala 1480 1485 1490
ccg acc gtg ccc ttc atc aac ccc ccc ttc gtc tet tea gat gga 32275 Pro Thr Val Pro Phe Ile Asn Pro Pro Phe Val Ser Ser Asp Gly 1495 1500 1505
ttc caa gag aag ccc etg ggg gtg ttg tcc etg ega etg get gac 32320 Phe Gin Glu Lys Pro Leu Gly Val Leu Ser Leu Arg Leu Ala Asp 1510 1515 1520
ccc gtc acc acc aag aac ggg gaa atc acc etc aag etg gga gag 32365 Pro Val Thr Thr Lys Asn Gly Glu Ile Thr Leu Lys Leu Gly Glu 1525 1530 1535
ggg gtg gac etc gac tcg tcg gga aaa etc atc tcc aac acg gcc 32410 Gly Val Asp Leu Asp Ser Ser Gly Lys Leu Ile Ser Asn Thr Ala 1540 1545 1550
acc aag gcc gcc gcc cct etc agt att tea aac aac acc att tcc 32455 Thr Lys Ala Ala Ala Pro Leu Ser Ile Ser Asn Asn Thr Ile Ser 1555 1560 1565
ett aaa act get gcc cct ttc tac aac aac aat gga act tta age 32500 Leu Lys Thr Ala Ala Pro Phe Tyr Asn Asn Asn Gly Thr Leu Ser 1570 1575 1580
etc aat gtc tcc aca cca tta gca gta ttt ccc aca ttt aac act 32545 Leu Asn Val Ser Thr Pro Leu Ala Val Phe Pro Thr Phe Asn Thr 1585 1590 1595
tta ggc ata agt ett gga aac ggt ett cag act tea aat aag ttg 32590 Leu Gly Ile Ser Leu Gly Asn Gly Leu Gin Thr Ser Asn Lys Leu 1600 1605 1610
ttg act gta caa eta act eat cct ett aca ttc age tea aat age 32635 Leu Thr Val Gin Leu Thr His Pro Leu Thr Phe Ser Ser Asn Ser 1615 1620 1625
atc aca gta aaa aca gac aaa ggg eta tat att aac tcc agt gga 32680 Ile Thr Val Lys Thr Asp Lys Gly Leu Tyr Ile Asn Ser Ser Gly 1630 1635 1640
aac aga gga ett gag get aat ata age eta aaa aga gga eta gtt 32725 Asn Arg Gly Leu Glu Ala Asn Ile Ser Leu Lys Arg Gly Leu Val 1645 1650 1655
ttt gac ggt aat get att gca aca tat att gga aat ggc tta gac 32770 Phe Asp Gly Asn Ala Ile Ala Thr Tyr Ile Gly Asn Gly Leu Asp 1660 1665 1670
tat gga tet tat gat agt gat gga aaa aca aga ccc gta att acc 32815 Tyr Gly Ser Tyr Asp Ser Asp Gly Lys Thr Arg Pro Val Ile Thr 1675 1680 1685
aaa att gga gca gga tta aat ttt gat get aac aaa gca ata get 32860 Lys Ile Gly Ala Gly Leu Asn Phe Asp Ala Asn Lys Ala Ile Ala 1690 1695 1700
gtc aaa eta ggc aca ggt tta agt ttt gac tcc get ggt gcc ttg 32905 Val Lys Leu Gly Thr Gly Leu Ser Phe Asp Ser Ala Gly Ala Leu 1705 1710 1715
aca get gga aac aaa cag gat gac aag eta aca ett tgg act acc 32950 Thr Ala Gly Asn Lys Gin Asp Asp Lys Leu Thr Leu Trp Thr Thr 1720 1725 1730
cct gac cca age cct aat tgt caa tta ett tea gac aga gat gcc 32995 Pro Asp Pro Ser Pro Asn Cys Gin Leu Leu Ser Asp Arg Asp Ala 1735 1740 1745
aaa ttt act etc tgt ett aca aaa tgc ggt agt caa ata eta ggc 33040 Lys Phe Thr Leu Cys Leu Thr Lys Cys Gly Ser Gin Ile Leu Gly 1750 1755 1760
act gtg gca gtg gcg get gtt act gta gga tea gca eta aat cca 33085 Thr Val Ala Val Ala Ala Val Thr Val Gly Ser Ala Leu Asn Pro 1765 1770 1775
att aat gac aca gtc aaa age gcc ata gtt ttc ett aga ttt gat 33130 Ile Asn Asp Thr Val Lys Ser Ala Ile Val Phe Leu Arg Phe Asp 1780 1785 1790
tcc gat ggt gta etc atg tea aac tea tea atg gta ggt gat tac 33175 Ser Asp Gly Val Leu Met Ser Asn Ser Ser Met Val Gly Asp Tyr 1795 1800 1805
tgg aac ttt agg gag gga cag acc act caa agt gta gcc tat aca 33220 Trp Asn Phe Arg Glu Gly Gin Thr Thr Gin Ser Val Ala Tyr Thr 1810 1815 1820
aat get gtg gga ttc atg cca aat ata ggt gca tat cca aaa acc 33265 Asn Ala Val Gly Phe Met Pro Asn Ile Gly Ala Tyr Pro Lys Thr 1825 1830 1835
caa agt aaa aca cct aaa aat age ata gtc agt cag gta tat tta 33310 Gin Ser Lys Thr Pro Lys Asn Ser Ile Val Ser Gin Val Tyr Leu 1840 1845 1850
act gga gaa act act atg cca atg aca eta acc ata act ttc aat 33355 Thr Gly Glu Thr Thr Met Pro Met Thr Leu Thr Ile Thr Phe Asn 1855 1860 1865
ggc act gat gaa aaa gac aca acc cca gtt age acc tac tet atg 33400 Gly Thr Asp Glu Lys Asp Thr Thr Pro Val Ser Thr Tyr Ser Met 1870 1875 1880
act ttt aca tgg cag tgg act gga gac tat aag gac aaa aat att 33445 Thr Phe Thr Trp Gin Trp Thr Gly Asp Tyr Lys Asp Lys Asn Ile 1885 1890 1895
acc ttt get acc aac tea ttc tet ttt tcc tac atc gcc cag gaa 33490 Thr Phe Ala Thr Asn Ser Phe Ser Phe Ser Tyr Ile Ala Gin Glu 1900 1905 1910
taa tcccacccag caagccaacc ccttttccca ccacctttgt ctatatggaa 33543 actctgaaac agaaaaataa agttcaagtg ttttattgaa tcaacagttt tacaggactc 33603 gagcagttat ttttcctcca ccctcccagg acatggaata caccaccctc tccccccgca 33663 cagecttgaa catctgaatg ccattggtga tggacatget tttggtctcc acgttccaca 33723 cagtttcaga gcgagccagt eteggategg tcagggagat gaaaccctcc gggcactccc 33783 gcatctgcac ctcacagctc aacagctgag gattgtcctc ggtggtcggg ateaeggtta 33843 tctggaagaa gcagaagagc ggeggtggga atcatagtcc gegaaeggga teggceggtg 33903 gtgtegcate aggccccgca geagtcgetg ccgccgccgc tccgtcaagc tgctgctcag 33963 ggggttcggg tccagggact ccctcagcat gatgcccacg gccctcagca teagtegtet 34023 ggtgeggegg gcgcagcagc geatgegaat ctcgctcagg teactgeagt aegtgcaaca 34083 caggaccacc aggttgttca acagtccata gttcaacacg ctccagccga aactcatcgc 34143 gggaaggatg ctacccacgt ggcegtegta ccagatcctc aggtaaatca agtggegetc 34203 cctccagaag acgctgccca tgtacatgat ctccttgggc atgtggcggt tcaccacctc 34263 ccggtaccac atcaccctct ggttgaacat gcagccccgg atgatcctgc ggaaccacag 34323 ggccagcacc gccccgcccg ccatgcagcg aagagacccc ggatcccggc aatgacaatg 34383 gaggacccac cgctcgtacc egtggateat ctgggagctg aacaagtcta tgttggcaca 34443 gcacaggcat atgctcatgc atctcttcag cactctcagc tecteggggg tcaaaaccat 34503 atcccagggc aeggggaact ettgeaggae agcgaacccc gcagaacagg gcaatcctcg 34563 cacataactt acattgtgca tggacagggt ategcaatea ggcagcaccg ggtgatcetc 34623 caccagagaa gegegggtet cggtctcctc aeagegtggt aagggggeeg geegataegg 34683 gtgatggegg gaegeggetg ategtgttet cgaccgtgtc atgatgcagt tgctttegga 34743 cattttegta cttgctgtag cagaacctgg tccgggcgct gcacaccgat cgccggcggc 34803 ggtctcggcg ettggaaege teggtgttaa agttgtaaaa cagccactct ctcagaccgt 34863 gcagcagatc tagggectea ggagtgatga agatcccatc atgcctgata getctgatea 34923 catcgaccac cgtggaatgg gccaggccca gccagatgat gcaattttgt tgggtttegg 34983 tgacggcggg ggagggaaga acaggaagaa ccatgattaa cttttaatcc aaaeggtetc 35043 ggagcacttc aaaatgaagg teaeggagat ggcacctctc gcccccgctg tgttggtgga 35103 aaataacagc caggtcaaag gtgatacggt tctcgagatg ttccacggtg gcttccagca 35163 aagcctccac gcgcacatcc agaaacaaga caatagcgaa agcgggaggg ttctctaatt 35223 cctcaaccat catgttacac tcctgcacca tccccagata attttcattt ttccagcctt 35283 gaatgattcg aactagttcc tgaggtaaat ccaagccagc catgataaaa agctcgcgca 35343 gagcaccctc caccggcatt cttaagcaca ccctcataat tccaagatat tctgctcctg 35403 gttcacctgc agcagattga caagcggaat atcaaaatct ctgccgcgat ccctgagctc 35463 ctccctcagc aataactgta agtactcttt catatcgtct ccgaaatttt tagccatagg 35523 acccccagga ataagagaag ggcaagccac attacagata aaccgaagtc ccccccagtg 35583 agcattgcca aatgtaagat tgaaataagc atgctggcta gacccggtga tatcttccag 35643 ataactggac agaaaatcgg gtaagcaatt tttaagaaaa tcaacaaaag aaaaatcttc 35703 caggtgcacg tttagggcct cgggaacaac gatggagtaa gtgcaagggg tgcgttccag 35763 catggttagt tagctgatct gtaaaaaaac aaaaaataaa acattaaacc atgctagcct 35823 ggcgaacagg tgggtaaatc gttctctcca gcaccaggca ggccacgggg tctccggcgc 35883 gaccctcgta aaaattgtcg ctatgattga aaaccatcac agagagacgt tcccggtggc 35943 cggcgtgaat gattcgagaa gaagcataca cccccggaac attggagtcc gtgagtgaaa 36003 aaaagcggcc gaggaagcaa tgaggcacta caacgctcac tctcaagtcc agcaaagcga 36063 tgccatgcgg atgaagcaca aaattttcag gtgcgtaaaa aatgtaatta ctcccctcct 36123 gcacaggcag cgaagctccc gatccctcca gatacacata caaagcctca gcgtccatag 36183 cttaccgagc ggcagcagca gcggcacaca acaggcgcaa gagtcagaga aaagactgag 36243 ctctaacctg tccgcccgct ctctgctcaa tatatagccc cagatctaca ctgacgtaaa 36303 ggccaaagtc taaaaatacc cgccaaataa tcacacacgc ccagcacacg cccagaaacc 36363 ggtgacacac tcagaaaaat acgcgcactt cctcaaacgg ccaaactgcc gtcatttccg 36423 ggttcccacg ctacgtcatc aaaacacgac tttcaaattc cgtcgaccgt taaaaacatc 36483 acccgccccg cccctaacgg tcgccgctcc cgcagccaat caccttcctc cctccccaaa 36543 ttcaaacagc tcatttgcat attaacgcgc accaaaagtt tgaggtatat tattgatgat 36603 g 36604
<210> 6
<211> 529
<212> PRT
<213> chimpanzee adenovirus serotype Pan6
<400> 6
Met Met Arg Arg Val Tyr Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser 15 10 15
Val Met Gin Gin Ala Val Ala Ala Ala Met Gin Pro Pro Leu Glu Ala 20 25 30
Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu Gly Arg Asn Ser 35 40 45
Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Arg Leu Tyr 50 55 60
Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu Asn Tyr Gin Asn 65 70 75 80
Asp His Ser Asn Phe Leu Thr Thr Val Val Gin Asn Asn Asp Phe Thr 85 90 95
Pro Thr Glu Ala Ser Thr Gin Thr Ile Asn Phe Asp Glu Arg Ser Arg 100 105 110
Trp Gly Gly Gin Leu Lys Thr Ile Met His Thr Asn Met Pro Asn Val 115 120 125
Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala Arg Val Met Val Ser 130 135 140
Arg Lys Thr Pro Asn Gly Val Asp Asp Asp Tyr Asp Gly Ser Gin Asp 145 150 155 160
Glu Leu Thr Tyr Glu Trp Val Glu Phe Glu Leu Pro Glu Gly Asn Phe 165 170 175
Ser Val Thr Met Thr Ile Asp Leu Met Asn Asn Ala Ile Ile Asp Asn 180 185 190
Tyr Leu Ala Val Gly Arg Gin Asn Gly Val Leu Glu Ser Asp Ile Gly 195 200 205
Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro Val Thr 210 215 220
Glu Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His Pro Asp 225 230 235 240
Ile Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Glu Ser Arg Leu 245 250 255 Ser Asn Leu Leu Gly Ile Arg Lys Arg Gin Pro Phe Gin Glu Gly Phe 260 265 270
Gin Ile Leu Tyr Glu Asp Leu Glu Gly Gly Asn Ile Pro Ala Leu Leu 275 280 285
Asp Val Glu Ala Tyr Glu Lys Ser Lys Glu Asp Ser Thr Ala Ala Ala 290 295 300
Thr Ala Ala Val Ala Thr Ala Ser Thr Glu Val Arg Gly Asp Asn Phe 305 310 315 320
Ala Ser Ala Ala Ala Ala Ala Glu Ala Ala Glu Thr Glu Ser Lys Ile 325 330 335
Val Ile Gin Pro Val Glu Lys Asp Ser Lys Asp Arg Ser Tyr Asn Val 340 345 350
Leu Ala Asp Lys Lys Asn Thr Ala Tyr Arg Ser Trp Tyr Leu Ala Tyr 355 360 365
Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser Trp Thr Leu Leu Thr 370 375 380
Thr Ser Asp Val Thr Cys Gly Val Glu Gin Val Tyr Trp Ser Leu Pro 385 390 395 400
Asp Met Met Gin Asp Pro Val Thr Phe Arg Ser Thr Arg Gin Val Ser 405 410 415
Asn Tyr Pro Val Val Gly Ala Glu Leu Leu Pro Val Tyr Ser Lys Ser 420 425 430
Phe Phe Asn Glu Gin Ala Val Tyr Ser Gin Gin Leu Arg Ala Phe Thr 435 440 445
Ser Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn Gin Ile Leu Val 450 455 460
Arg Pro Pro Ala Pro Thr Ile Thr Thr Val Ser Glu Asn Val Pro Ala 465 470 475 480
Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser Ser Ile Arg Gly Val 485 490 495
Gin Arg Val Thr Val Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr Val 500 505 510 Tyr Lys Ala Leu Gly Val Val Ala Pro Arg Val Leu Ser Ser Arg Thr 515 520 525
Phe
<210> 7
<211> 942
<212> PRT
<213> chimpanzee adenovirus serotype Pan6
<400> 7
Met Ala Thr Pro Ser Met Leu Pro Gin Trp Ala Tyr Met His Ile Ala 15 10 15
Gly Gin Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gin Phe Ala 20 25 30
Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro 35 40 45
Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gin Arg Leu 50 55 60
Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr 65 70 75 80
Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met 85 90 95
Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Ser 100 105 110
Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly 115 120 125
Ala Pro Asn Ser Ser Gin Trp Glu Gin Ala Lys Thr Gly Asn Gly Gly 130 135 140
Thr Met Glu Thr His Thr Tyr Gly Val Ala Pro Met Gly Gly Glu Asn 145 150 155 160
Ile Thr Lys Asp Gly Leu Gin Ile Gly Thr Asp Val Thr Ala Asn Gin 165 170 175 Asn Lys Pro Ile Tyr Ala Asp Lys Thr Phe Gin Pro Glu Pro Gin Val 180 185 190
Gly Glu Glu Asn Trp Gin Glu Thr Glu Asn Phe Tyr Gly Gly Arg Ala 195 200 205
Leu Lys Lys Asp Thr Asn Met Lys Pro Cys Tyr Gly Ser Tyr Ala Arg 210 215 220
Pro Thr Asn Glu Lys Gly Gly Gin Ala Lys Leu Lys Val Gly Asp Asp 225 230 235 240
Gly Val Pro Thr Lys Glu Phe Asp Ile Asp Leu Ala Phe Phe Asp Thr 245 250 255
Pro Gly Gly Thr Val Asn Gly Gin Asp Glu Tyr Lys Ala Asp Ile Val 260 265 270
Met Tyr Thr Glu Asn Thr Tyr Leu Glu Thr Pro Asp Thr His Val Val 275 280 285
Tyr Lys Pro Gly Lys Asp Asp Ala Ser Ser Glu Ile Asn Leu Val Gin 290 295 300
Gin Ser Met Pro Asn Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe 305 310 315 320
Ile Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala 325 330 335
Gly Gin Ala Ser Gin Leu Asn Ala Val Val Asp Leu Gin Asp Arg Asn 340 345 350
Thr Glu Leu Ser Tyr Gin Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr 355 360 365
Arg Tyr Phe Ser Met Trp Asn Gin Ala Val Asp Ser Tyr Asp Pro Asp 370 375 380
Val Arg Ile Ile Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr 385 390 395 400
Cys Phe Pro Leu Asp Gly Ser Gly Thr Asn Ala Ala Tyr Gin Gly Val 405 410 415
Lys Val Lys Asp Gly Gin Asp Gly Asp Val Glu Ser Glu Trp Glu Asn 420 425 430 Asp Asp Thr Val Ala Ala Arg Asn Gin Leu Cys Lys Gly Asn Ile Phe 435 440 445
Ala Met Glu Ile Asn Leu Gin Ala Asn Leu Trp Arg Ser Phe Leu Tyr 450 455 460
Ser Asn Val Ala Leu Tyr Leu Pro Asp Ser Tyr Lys Tyr Thr Pro Thr 465 470 475 480
Asn Val Thr Leu Pro Thr Asn Thr Asn Thr Tyr Asp Tyr Met Asn Gly 485 490 495
Arg Val Thr Pro Pro Ser Leu Val Asp Ala Tyr Leu Asn Ile Gly Ala 500 505 510
Arg Trp Ser Leu Asp Pro Met Asp Asn Val Asn Pro Phe Asn His His 515 520 525
Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met Leu Leu Gly Asn Gly Arg 530 535 540
Tyr Val Pro Phe His Ile Gin Val Pro Gin Lys Phe Phe Ala Ile Lys 545 550 555 560
Ser Leu Leu Leu Leu Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg 565 570 575
Lys Asp Val Asn Met Ile Leu Gin Ser Ser Leu Gly Asn Asp Leu Arg 580 585 590
Thr Asp Gly Ala Ser Ile Ala Phe Thr Ser Ile Asn Leu Tyr Ala Thr 595 600 605
Phe Phe Pro Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu 610 615 620
Arg Asn Asp Thr Asn Asp Gin Ser Phe Asn Asp Tyr Leu Ser Ala Ala 625 630 635 640
Asn Met Leu Tyr Pro Ile Pro Ala Asn Ala Thr Asn Val Pro Ile Ser 645 650 655
Ile Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg 660 665 670
Leu Lys Thr Arg Glu Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr 675 680 685 Phe Val Tyr Ser Gly Ser Ile Pro Tyr Leu Asp Gly Thr Phe Tyr Leu 690 695 700
Asn His Thr Phe Lys Lys Val Ser Ile Thr Phe Asp Ser Ser Val Ser 705 710 715 720
Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro Asn Glu Phe Glu Ile Lys 725 730 735
Arg Thr Val Asp Gly Glu Gly Tyr Asn Val Ala Gin Cys Asn Met Thr 740 745 750
Lys Asp Trp Phe Leu Val Gin Met Leu Ala His Tyr Asn Ile Gly Tyr 755 760 765
Gin Gly Phe Tyr Val Pro Glu Gly Tyr Lys Asp Arg Met Tyr Ser Phe 770 775 780
Phe Arg Asn Phe Gin Pro Met Ser Arg Gin Val Val Asp Glu Val Asn 785 790 795 800
Tyr Lys Asp Tyr Gin Ala Val Thr Leu Ala Tyr Gin His Asn Asn Ser 805 810 815
Gly Phe Val Gly Tyr Leu Ala Pro Thr Met Arg Gin Gly Gin Pro Tyr 820 825 830
Pro Ala Asn Tyr Pro Tyr Pro Leu Ile Gly Lys Ser Ala Val Ala Ser 835 840 845
Val Thr Gin Lys Lys Phe Leu Cys Asp Arg Val Met Trp Arg Ile Pro 850 855 860
Phe Ser Ser Asn Phe Met Ser Met Gly Ala Leu Thr Asp Leu Gly Gin 865 870 875 880
Asn Met Leu Tyr Ala Asn Ser Ala His Ala Leu Asp Met Asn Phe Glu 885 890 895
Val Asp Pro Met Asp Glu Ser Thr Leu Leu Tyr Val Val Phe Glu Val 900 905 910
Phe Asp Val Val Arg Val His Gin Pro His Arg Gly Val Ile Glu Ala 915 920 925
Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr 930 935 940 <210> 8
<211> 443
<212> PRT
<213> chimpanzee adenovirus serotype Pan6
<400> 8
Met Ser Lys Lys Arg Val Arg Val Asp Asp Asp Phe Asp Pro Val Tyr 15 10 15
Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val Pro Phe Ile Asn Pro Pro 20 25 30
Phe Val Ser Ser Asp Gly Phe Gin Glu Lys Pro Leu Gly Val Leu Ser 35 40 45
Leu Arg Leu Ala Asp Pro Val Thr Thr Lys Asn Gly Glu Ile Thr Leu 50 55 60
Lys Leu Gly Glu Gly Val Asp Leu Asp Ser Ser Gly Lys Leu Ile Ser 65 70 75 80
Asn Thr Ala Thr Lys Ala Ala Ala Pro Leu Ser Ile Ser Asn Asn Thr 85 90 95
Ile Ser Leu Lys Thr Ala Ala Pro Phe Tyr Asn Asn Asn Gly Thr Leu 100 105 110
Ser Leu Asn Val Ser Thr Pro Leu Ala Val Phe Pro Thr Phe Asn Thr 115 120 125
Leu Gly Ile Ser Leu Gly Asn Gly Leu Gin Thr Ser Asn Lys Leu Leu 130 135 140
Thr Val Gin Leu Thr His Pro Leu Thr Phe Ser Ser Asn Ser Ile Thr 145 150 155 160
Val Lys Thr Asp Lys Gly Leu Tyr Ile Asn Ser Ser Gly Asn Arg Gly 165 170 175
Leu Glu Ala Asn Ile Ser Leu Lys Arg Gly Leu Val Phe Asp Gly Asn 180 185 190
Ala Ile Ala Thr Tyr Ile Gly Asn Gly Leu Asp Tyr Gly Ser Tyr Asp 195 200 205 Ser Asp Gly Lys Thr Arg Pro Val Ile Thr Lys Ile Gly Ala Gly Leu 210 215 220
Asn Phe Asp Ala Asn Lys Ala Ile Ala Val Lys Leu Gly Thr Gly Leu 225 230 235 240
Ser Phe Asp Ser Ala Gly Ala Leu Thr Ala Gly Asn Lys Gin Asp Asp 245 250 255
Lys Leu Thr Leu Trp Thr Thr Pro Asp Pro Ser Pro Asn Cys Gin Leu 260 265 270
Leu Ser Asp Arg Asp Ala Lys Phe Thr Leu Cys Leu Thr Lys Cys Gly 275 280 285
Ser Gin Ile Leu Gly Thr Val Ala Val Ala Ala Val Thr Val Gly Ser 290 295 300
Ala Leu Asn Pro Ile Asn Asp Thr Val Lys Ser Ala Ile Val Phe Leu 305 310 315 320
Arg Phe Asp Ser Asp Gly Val Leu Met Ser Asn Ser Ser Met Val Gly 325 330 335
Asp Tyr Trp Asn Phe Arg Glu Gly Gin Thr Thr Gin Ser Val Ala Tyr 340 345 350
Thr Asn Ala Val Gly Phe Met Pro Asn Ile Gly Ala Tyr Pro Lys Thr 355 360 365
Gin Ser Lys Thr Pro Lys Asn Ser Ile Val Ser Gin Val Tyr Leu Thr 370 375 380
Gly Glu Thr Thr Met Pro Met Thr Leu Thr Ile Thr Phe Asn Gly Thr 385 390 395 400
Asp Glu Lys Asp Thr Thr Pro Val Ser Thr Tyr Ser Met Thr Phe Thr 405 410 415
Trp Gin Trp Thr Gly Asp Tyr Lys Asp Lys Asn Ile Thr Phe Ala Thr 420 425 430
Asn Ser Phe Ser Phe Ser Tyr Ile Ala Gin Glu
435 440
<210> 9
<211> 36535
<212> DNA
<213> chimpanzee adenovirus serotype Pan7
<220>
<221> CDS
<222> (13874)..(15469)
<223> L2 Penton
<220>
<221> CDS
<222> (18288)..(21086)
<223> L3 Hexon
<220>
<221> CDS
<222> (32094)..(33425)
<223> L5 Fiber
<400> 9
catcatcaat aatatacctc aaacttttgg tgcgcgttaa tatgcaaatg agctgtttga 60 atttggggag ggaggaaggt gattggccga gagacgggcg accgttaggg gcggggcggg 120 tgacgttttt aatacgtggc cgtgaggcgg agccggtttg caagttctcg tgggaaaagt 180 gacgtcaaac gaggtgtggt ttgaacacgg aaatactcaa ttttcccgcg ctctctgaca 240 ggaaatgagg tgtttctggg cggatgcaag tgaaaacggg ccattttcgc gcgaaaactg 300 aatgaggaag tgaaaatctg agtaatttcg cgtttatggc agggaggagt atttgccgag 360 ggccgagtag actttgaccg attacgtggg ggtttcgatt accgtatttt tcacctaaat 420 ttccgcgtac ggtgtcaaag tccggtgttt ttacgtaggc gtcagctgat cgccagggta 480 tttaaacctg cgctctctag tcaagaggcc actcttgagt gccagcgagt agagttttct 540 cctccgcgcc gcgagtcaga tctacacttt gaaagatgag gcacctgaga gacctgcccg 600 gtaatgtttt cctggctact gggaacgaga ttctggaatt ggtggtggac gccatgatgg 660 gtggcgaccc tcctgagccc cctaccccat ttgaggcgcc ttcgctgtac gatttgtatg 720 atctggaggt ggatgtgccc gagaacgacc ccaacgagga ggcggtgaat gatttgttta 780 gcgatgccgc gctgctggct gccgagcagg ctaatacgga ctctggctca gacagcgatt 840 cctctctcca taccccgaga cccggcagag gtgagaaaaa gatccccgag cttaaagggg 900 aagagctcga cctgcgctgc tatgaggaat gcttgcctcc gagcgatgat gaggaggacg 960 aggaggcgat tcgagctgca tcgaaccagg gagtgaaagc tgcgggcgaa agctttagcc 1020 tggactgtcc tactctgccc ggacacggct gtaagtcttg tgaatttcat cgcatgaata 1080 ctggagataa gaatgtgatg tgtgccctgt gctatatgag agcttacaac cattgtgttt 1140 acagtaagtg tgattaactt tagttgggaa ggcagagggt gactgggtgc tgactggttt 1200 atttatgtat atgttttttt atgtgtaggt cccgtctctg acgtagatga gacccccact 1260 tcagagtgca tttcatcacc cccagaaatt ggcgaggaac cgcccgaaga tattattcat 1320 agaccagttg cagtgagagt caccgggcgg agagcagctg tggagagttt ggatgacttg 1380 ctacagggtg gggatgaacc tttggacttg tgtacccgga aacgccccag gcactaagtg 1440 ccacacatgt gtgtttactt aaggtgatgt cagtatttat agggtgtgga gtgcaataaa 1500 atccgtgttg actttaagtg cgtggtttat gactcagggg tggggactgt gggtatataa 1560 gcaggtgcag acctgtgtgg tcagttcaga gcaggactca tggagatctg gacggtcttg 1620 gaagactttc accagactag acagctgcta gagaactcat cggagggggt ctcttacctg 1680 tggagattct gcttcggtgg gcctctagct aagctagtct atagggccaa acaggattat 1740 aaggatcaat ttgaggatat tttgagagag tgtcctggta tttttgactc tctcaacttg 1800 ggccatcagt ctcactttaa ccagagtatt ctgagagccc ttgacttttc tactcctggc 1860 agaactaccg ccgcggtagc cttttttgcc tttatccttg acaaatggag tcaagaaacc 1920 catttcagca gggattaccg tctggactgc ttagcagtag ctttgtggag aacatggagg 1980 tgccagcgcc tgaatgcaat ctccggctac ttgccagtac agccggtaga cacgctgagg 2040 atcctgagtc tccagtcacc ccaggaacac caacgccgcc agcagccgca gcaggagcag 2100 cagcaagagg aggaggagga tcgagaagag aacccgagag ccggtctgga ccctccggtg 2160 gcggaggagg aggagtagct gacttgtttc ccgagctgcg ccgggtgctg actaggtctt 2220 ccagtggacg ggagaggggg attaagcggg agaggcatga ggagactagc cacagaactg 2280 aactgactgt cagtctgatg agccgcaggc gcccagaatc ggtgtggtgg catgaggttc 2340 agtcgcaggg gatagatgag gtctcggtga tgcatgagaa atattccctg gaacaagtca 2400 agacttgttg gttggagcct gaggatgatt gggaggtagc catcaggaat tatgccaagc 2460 tggctctgaa gccagacaag aagtacaaga ttaccaaact gattaatatc agaaattcct 2520 gctacatttc agggaatggg gccgaggtgg agatcagtac ccaggagagg gtggccttca 2580 gatgttgtat gatgaatatg tacccggggg tggtgggcat ggagggagtc acctttatga 2640 acgcgaggtt caggggtgat gggtataatg gggtggtctt tatggccaac accaagctga 2700 cagtgcacgg atgctccttc tttgggttca ataacatgtg catcgaggcc tggggcagtg 2760 tttcagtgag gggatgcagc ttttcagcca actggatggg ggtcgtgggc agaaccaaga 2820 gcaaggtgtc agtgaagaaa tgcctgttcg agaggtgcca cctgggggtg atgagcgagg 2880 gcgaagccaa agtcaaacac tgcgcctcta ctgagacggg ctgctttgtg ctgatcaagg 2940 gcaatgccca agtcaagcat aacatgatct gtggggcctc ggatgagcgc ggctaccaga 3000 tgctgacctg cgccggtggg aacagccata tgctggccac cgtgcatgtg acctcgcacc 3060 cccgcaagac atggcccgag ttcgagcaca acgtcatgac ccgatgcaat gtgcacctgg 3120 ggtcccgccg aggcatgttc atgccctacc agtgcaacat gcaatttgtg aaggtgctgc 3180 tggagcccga tgccatgtcc agagtgagcc tgacgggggt gtttgacatg aatgtggagc 3240 tgtggaaaat tctgagatat gatgaatcca agaccaggtg ccgggcctgc gaatgcggag 3300 gcaagcacgc caggcttcag cccgtgtgtg tggaggtgac ggaggacctg cgacccgatc 3360 atttggtgtt gtcctgcaac gggacggagt tcggctccag cggggaagaa tctgactaga 3420 gtgagtagtg tttgggggag gtggagggct tgtatgaggg gcagaatgac taaaatctgt 3480 gtttttctgt gtgttgcagc agcatgagcg gaagcgcctc ctttgaggga ggggtattca 3540 gcccttatct gacggggcgt ctcccctcct gggcgggagt gcgtcagaat gtgatgggat 3600 ccacggtgga cggccggccc gtgcagcccg cgaactcttc aaccctgacc tacgcgaccc 3660 tgagctcctc gtccgtggac gcagctgccg ccgcagctgc tgcttccgcc gccagcgccg 3720 tgcgcggaat ggccctgggc gccggctact acagctctct ggtggccaac tcgacttcca 3780 ccaataatcc cgccagcctg aacgaggaga agctgctgct gctgatggcc cagctcgagg 3840 ccctgaccca gcgcctgggc gagctgaccc agcaggtggc tcagctgcag gcggagacgc 3900 gggccgcggt tgccacggtg aaaaccaaat aaaaaatgaa tcaataaata aacggagacg 3960 gttgttgatt ttaacacaga gtcttgaatc tttatttgat ttttcgcgcg cggtaggccc 4020 tggaccaccg gtctcgatca ttgagcaccc ggtggatttt ttccaggacc cggtagaggt 4080 gggcttggat gttgaggtac atgggcatga gcccgtcccg ggggtggagg tagctccatt 4140 gcagggcctc gtgctcgggg gtggtgttgt aaatcaccca gtcatagcag gggcgcaggg 4200 cgtggtgctg cacgatgtcc ttgaggagga gactgatggc cacgggcagc cccttggtgt 4260 aggtgttgac gaacctgttg agctgggagg gatgcatgcg gggggagatg agatgcatct 4320 tggcctggat cttgagattg gcgatgttcc cgcccagatc ccgccggggg ttcatgttgt 4380 gcaggaccac cagcacggtg tatccggtgc acttggggaa tttgtcatgc aacttggaag 4440 ggaaggcgtg aaagaatttg gagacgccct tgtgaccgcc caggttttcc atgcactcat 4500 ccatgatgat ggcgatgggc ccgtgggcgg cggcctgggc aaagacgttt cgggggtcgg 4560 acacatcgta gttgtggtcc tgggtgagct cgtcataggc cattttaatg aatttggggc 4620 ggagggtgcc cgactggggg acgaaggtgc cctcgatccc gggggcgtag ttgccctcgc 4680 agatctgcat ctcccaggcc ttgagctcgg agggggggat catgtccacc tgcggggcga 4740 tgaaaaaaac ggtttccggg gcgggggaga tgagctgggc cgaaagcagg ttccggagca 4800 gctgggactt gccgcagccg gtggggccgt agatgacccc gatgaccggc tgcaggtggt 4860 agttgaggga gagacagctg ccgtcctcgc ggaggagggg ggccacctcg ttcatcatct 4920 cgcgcacatg catgttctcg cgcacgagtt ccgccaggag gcgctcgccc cccagcgaga 4980 ggagctcttg cagcgaggcg aagtttttca gcggcttgag yccgtcggcc atgggcattt 5040 tggagagggt ctgttgcaag agttccagac ggtcccagag ctcggtgatg tgctctaggg 5100 catctcgatc cagcagacct cctcgtttcg cgggttgggg cgactgcggg agtagggcac 5160 caggcgatgg gcgtccagcg aggccagggt ccggtccttc cagggtcgca gggtccgcgt 5220 cagcgtggtc tccgtcacgg tgaaggggtg cgcgccgggc tgggcgcttg cgagggtgcg 5280 cttcaggctc atccggctgg tcgagaaccg ctcccggtcg gcgccctgcg cgtcggccag 5340 gtagcaattg agcatgagtt cgtagttgag cgcctcggcc gcgtggccct tggcgcggag 5400 cttacctttg gaagtgtgtc cgcagacggg acagaggagg gacttgaggg cgtagagctt 5460 gggggcgagg aagacggact cgggggcgta ggcgtccgcg ccgcagctgg cgcagacggt 5520 ctcgcactcc acgagccagg tgaggtcggg ccggttgggg tcaaaaacga ggtttcctcc 5580 gtgctttttg atgcgtttct tacctctggt ctccatgagc tcgtgtcccc gctgggtgac 5640 aaagaggctg tccgtgtccc cgtagaccga ctttatgggc cggtcctcga gcggggtgcc 5700 gcggtcctcg tcgtagagga accccgccca ctccgagacg aaggcccggg tccaggccag 5760 cacgaaggag gccacgtggg aggggtagcg gtcgttgtcc accagcgggt ccaccttctc 5820 cagggtatgc aagcacatgt ccccctcgtc cacatccagg aaggtgattg gcttgtaagt 5880 gtaggccacg tgaccggggg tcccggccgg gggggtataa aagggggcgg gcccctgctc 5940 gtcctcactg tcttccggat cgctgtccag gagcgccagc tgttggggta ggtattccct 6000 ctcgaaggct ggcataacct cggcactcag gttgtcagtt tctagaaacg aggaggattt 6060 gatattgacg gtgccgttgg agacgccttt catgagcccc tcgtccatct ggtcagaaaa 6120 gacgatcttt ttgttgtcga gcttggtggc gaaggagccg tagagggcgt tggagaggag 6180 cttggcgatg gagcgcatgg tctggttctt ttccttgtcg gcgcgctcct tggcggcgat 6240 gttgagctgc acgtactcgc gcgccacgca cttccattcg gggaagacgg tggtgagctc 6300 gtcgggcacg attctgaccc gccagccgcg gttgtgcagg gtgatgaggt ccacgctggt 6360 ggccacctcg ccgcgcaggg gctcgttggt ccagcagagg cgcccgccct tgcgcgagca 6420 gaaggggggc agcgggtcca gcatgagctc gtcggggggg tcggcgtcca cggtgaagat 6480 gccgggcaga agctcggggt cgaagtagct gatgcaggtg tccagatcgt ccagcgccgc 6540 ttgccagtcg cgcacggcca gcgcgcgctc gtaggggctg aggggcgtgc cccagggcat 6600 ggggtgcgtg agcgcggagg cgtacatgcc gcagatgtcg tagacgtaga ggggctcctc 6660 gaggacgccg atgtaggtgg ggtagcagcg ccccccgcgg atgctggcgc gcacgtagtc 6720 gtacagctcg tgcgagggcg cgaggagccc cgtgccgagg ttggagcgtt gcggcttttc 6780 ggcgcggtag acgatctggc ggaagatggc gtgggagttg gaggagatgg tgggcctctg 6840 gaagatgttg aagtgggcgt ggggcaggcc gaccgagtcc ctgatgaagt gggcgtagga 6900 gtcctgcagc ttggcgacga gctcggcggt gacgaggacg tccagggcgc agtagtcgag 6960 ggtctcttgg atgatgtcgt acttgagctg gcccttctgc ttccacagct cgcggttgag 7020 aaggaactct tcgcggtcct tccagtactc ttcgaggggg aacccgtcct gatcggcacg 7080 gtaagagccc accatgtaga actggttgac ggccttgtag gcgcagcagc ccttctccac 7140 ggggagggcg taagcttgtg cggccttgcg cagggaggtg tgggtgaggg cgaaggtgtc 7200 gcgcaccatg accttgagga actggtgctt gaagtcgagg tcgtcgcagc cgccctgctc 7260 ccagagctgg aagtccgtgc gcttcttgta ggcggggttg ggcaaagcga aagtaacatc 7320 gttgaagagg atcttgcccg cgcggggcat gaagttgcga gtgatgcgga aaggctgggg 7380 cacctcggcc cggttgttga tgacctgggc ggcgaggacg atctcgtcga agccgttgat 7440 gttgtgcccg acgatgtaga gttccacgaa tcgcgggcgg cccttaacgt ggggcagctt 7500 cttgagctcg tcgtaggtga gctcggcggg gtcgctgagc ccgtgctgct cgagggccca 7560 gtcggcgacg tgggggttgg cgctgaggaa ggaagtccag agatccacgg ccagggcggt 7620 ctgcaagcgg tcccggtact gacggaactg ctggcccacg gccatttttt cgggggtgac 7680 gcagtagaag gtgcgggggt cgccgtgcca gcggtcccac ttgagctgga gggcgaggtc 7740 gtgggcgagc tcgacgagcg gcgggtcccc ggagagtttc atgaccagca tgaaggggac 7800 gagctgcttg ccgaaggacc ccatccaggt gtaggtttcc acatcgtagg tgaggaagag 7860 cctttcggtg cgaggatgcg agccgatggg gaagaactgg atctcctgcc accagttgga 7920 ggaatggctg ttgatgtgat ggaagtagaa atgccgacgg cgcgccgagc actcgtgctt 7980 gtgtttatac aagcgtccgc agtgctcgca acgctgcacg ggatgcacgt gctgcacgag 8040 ctgtacctgg gttcctttga cgaggaattt cagtgggcag tggagcgctg gcggctgcat 8100 ctggtgctgt actacgtcct ggccatcggc gtggccatcg tctgcctcga tggtggtcat 8160 gctgacgagc ccgcgcggga ggcaggtcca gacttcggct cggacgggtc ggagagcgag 8220 gacgagggcg cgcaggccgg agctgtccag ggtcctgaga cgctgcggag tcaggtcagt 8280 gggcagcggc ggcgcgcggt tgacttgcag gagcttttcc agggcgcgcg ggaggtccag 8340 atggtacttg atctccacgg cgccgttggt ggcgacgtcc acggcttgca gggtcccgtg 8400 cccctggggc gccaccaccg tgccccgttt cttcttgggc gctgcttcca tgccggtcag 8460 aagcggcggc gaggacgcgc gccgggcggc aggggcggct cgggacccgg aggcaggggc 8520 ggcaggggca cgtcggcgcc gcgcgcgggc aggttctggt actgcgcccg gagaagactg 8580 gcgtgagcga cgacgcgacg gttgacgtcc tggatctgac gcctctgggt gaaggccacg 8640 ggacccgtga gtttgaacct gaaagagagt tcgacagaat caatctcggt atcgttgacg 8700 gcggcctgcc gcaggatctc ttgcacgtcg cccgagttgt cctggtaggc gatctcggtc 8760 atgaactgct cgatctcctc ctcctgaagg tctccgcggc cggcgcgctc gacggtggcc 8820 gcgaggtcgt tggagatgcg gcccatgagc tgcgagaagg cgttcatgcc ggcctcgttc 8880 cagacgcggc tgtagaccac ggctccgtcg gggtcgcgcg cgcgcatgac cacctgggcg 8940 aggttgagct cgacgtggcg cgtgaagacc gcgtagttgc agaggcgctg gtagaggtag 9000 ttgagcgtgg tggcgatgtg ctcggtgacg aagaagtaca tgatccagcg gcggagcggc 9060 atctcgctga cgtcgcccag ggcttccaag cgctccatgg cctcgtagaa gtccacggcg 9120 aagttgaaaa actgggagtt gcgcgccgag acggtcaact cctcctccag aagacggatg 9180 agctcagcga tggtggcgcg cacctcgcgc tcgaaggccc cggggggctc ctcttcttcc 9240 atctcttcct cctccactaa catctcttct acttcctcct caggaggcgg cggcggggga 9300 ggggccctgc gtcgccggcg gcgcacgggc agacggtcga tgaagcgctc gatggtctcc 9360 ccgcgccggc gacgcatggt ctcggtgacg gcgcgcccgt cctcgcgggg ccgcagcgtg 9420 aagacgccgc cgcgcatctc caggtggccg ccgggggggt ctccgttggg cagggagagg 9480 gcgctgacga tgcatcttat caattggccc gtagggactc cgcgcaagga cctgagcgtc 9540 tcgagatcca cgggatccga aaaccgctga acgaaggctt cgagccagtc gcagtcgcaa 9600 ggtaggctga gcccggtttc ttgttcttcg gggatttcgg gaggcgggcg ggcgatgctg 9660 ctggtgatga agttgaagta ggcggtcctg agacggcgga tggtggcgag gagcaccagg 9720 tccttgggcc cggcttgctg gatgcgcaga cggtcggcca tgccccaggc gtggtcctga 9780 cacctggcga ggtccttgta gtagtcctgc atgagccgct ccacgggcac ctcctcctcg 9840 cccgcgcggc cgtgcatgcg cgtgagcccg aacccgcgct ggggctggac gagcgccagg 9900 tcggcgacga cgcgctcggc gaggatggcc tgctgtatct gggtgagggt ggtctggaag 9960 tcgtcgaagt cgacgaagcg gtggtaggct ccggtgttga tggtatagga gcagttggcc 10020 atgacggacc agttgacggt ctggtggccg ggtcgcacga gctcgtggta cttgaggcgc 10080 gagtaggcgc gcgtgtcgaa gatgtagtcg ttgcaggtgc gcacgaggta ctggtatccg 10140 acgaggaagt gcggcggcgg ctggcggtag agcggccatc gctcggtggc gggggcgccg 10200 ggcgcgaggt cctcgagcat gaggcggtgg tagccgtaga tgtacctgga catccaggtg 10260 atgccggcgg cggtggtgga ggcgcgcggg aactcgcgga cgcggttcca gatgttgcgc 10320 agcggcagga agtagttcat ggtggccgcg gtctggcccg tgaggcgcgc gcagtcgtgg 10380 atgctctaga catacgggca aaaacgaaag cggtcagcgg ctcgactccg tggcctggag 10440 gctaagcgaa cgggttgggc tgcgcgtgta ccccggttcg aatctcgaat caggctggag 10500 ccgcagctaa cgtggtactg gcactcccgt ctcgacccaa gcctgctaac gaaacctcca 10560 ggatacggag gcgggtcgtt ttttggcctt ggtcgctggt catgaaaaac tagtaagcgc 10620 ggaaagcgac cgcccgcgat ggctcgctgc cgtagtctgg agaaagaatc gccagggttg 10680 cgttgcggtg tgccccggtt cgagcctcag cgctcggcgc cggccggatt ccgcggctaa 10740 cgtgggcgtg gctgccccgt cgtttccaag accccttagc cagccgactt ctccagttac 10800 ggagcgagcc cctctttttc ttgtgttttt gccagatgca tcccgtactg cggcagatgc 10860 gcccccaccc tccacctcaa ccgcccctac cgccgcagca gcagcaacag ccggcgcttc 10920 tgcccccgcc ccagcagcag ccagccacta ccgcggcggc cgccgtgagc ggagccggcg 10980 ttcagtatga cctggccttg gaagagggcg aggggctggc gcggctgggg gcgtcgtcgc 11040 cggagcggca cccgcgcgtg cagatgaaaa gggacgctcg cgaggcctac gtgcccaagc 11100 agaacctgtt cagagacagg agcggcgagg agcccgagga gatgcgcgcc tcccgcttcc 11160 acgcggggcg ggagctgcgg cgcggcctgg accgaaagcg ggtgctgagg gacgaggatt 11220 tcgaggcgga cgagctgacg gggatcagcc ccgcgcgcgc gcacgtggcc gcggccaacc 11280 tggtcacggc gtacgagcag accgtgaagg aggagagcaa cttccaaaaa tccttcaaca 11340 accacgtgcg cacgctgatc gcgcgcgagg aggtgaccct gggcctgatg cacctgtggg 11400 acctgctgga ggccatcgtg cagaacccca cgagcaagcc gctgacggcg cagctgtttc 11460 tggtggtgca gcacagtcgg gacaacgaga cgttcaggga ggcgctgctg aatatcaccg 11520 agcccgaggg ccgctggctc ctggacctgg tgaacattct gcagagcatc gtggtgcagg 11580 agcgcgggct gccgctgtcc gagaagctgg cggctatcaa cttctcggtg ctgagcctgg 11640 gcaagtacta cgctaggaag atctacaaga ccccgtacgt gcccatagac aaggaggtga 11700 agatcgacgg gttttacatg cgcatgaccc tgaaagtgct gaccctgagc gacgatctgg 11760 gggtgtaccg caacgacagg atgcaccgcg cggtgagcgc cagccgccgg cgcgagctga 11820 gcgaccagga gctgatgcac agcctgcagc gggccctgac cggggccggg accgaggggg 11880 agagctactt tgacatgggc gcggacctgc gctggcagcc cagccgccgg gccttggaag 11940 ctgccggcgg ttccccctac gtggaggagg tggacgatga ggaggaggag ggcgagtacc 12000 tggaagactg atggcgcgac cgtatttttg ctagatgcag caacagccac cgcctcctga 12060 tcccgcgatg cgggcggcgc tgcagagcca gccgtccggc attaactcct cggacgattg 12120 gacccaggcc atgcaacgca tcatggcgct gacgacccgc aatcccgaag cctttagaca 12180 gcagcctcag gccaaccggc tctcggccat cctggaggcc gtggtgccct cgcgctcgaa 12240 ccccacgcac gagaaggtgc tggccatcgt gaacgcgctg gtggagaaca aggccatccg 12300 cggcgacgag gccgggctgg tgtacaacgc gctgctggag cgcgtggccc gctacaacag 12360 caccaacgtg cagacgaacc tggaccgcat ggtgaccgac gtgcgcgagg cggtgtcgca 12420 gcgcgagcgg ttccaccgcg agtcgaacct gggctccatg gtggcgctga acgccttcct 12480 gagcacgcag cccgccaacg tgccccgggg ccaggaggac tacaccaact tcatcagcgc 12540 gctgcggctg atggtggccg aggtgcccca gagcgaggtg taccagtcgg ggccggacta 12600 cttcttccag accagtcgcc agggcttgca gaccgtgaac ctgagccagg ctttcaagaa 12660 cttgcaggga ctgtggggcg tgcaggcccc ggtcggggac cgcgcgacgg tgtcgagcct 12720 gctgacgccg aactcgcgcc tgctgctgct gctggtggcg cccttcacgg acagcggcag 12780 cgtgagccgc gactcgtacc tgggctacct gcttaacctg taccgcgagg ccatcgggca 12840 ggcgcacgtg gacgagcaga cctaccagga gatcacccac gtgagccgcg cgctgggcca 12900 ggaggacccg ggcaacctgg aggccaccct gaacttcctg ctgaccaacc ggtcgcagaa 12960 gatcccgccc cagtacgcgc tgagcaccga ggaggagcgc atcctgcgct acgtgcagca 13020 gagcgtgggg ctgttcctga tgcaggaggg ggccacgccc agcgccgcgc tcgacatgac 13080 cgcgcgcaac atggagccca gcatgtacgc tcgcaaccgc ccgttcatca ataagctgat 13140 ggactacttg catcgggcgg ccgccatgaa ctcggactac tttaccaacg ccatcttgaa 13200 cccgcactgg ctcccgccgc ccgggttcta cacgggcgag tacgacatgc ccgaccccaa 13260 cgacgggttc ctgtgggacg acgtggacag cagcgtgttc tcgccgcgcc ccgccaccac 13320 cgtgtggaag aaagagggcg gggaccggcg gccgtcctcg gcgctgtccg gtcgcgcggg 13380 tgctgccgcg gcggtgcctg aggccgccag ccccttcccg agcctgccct tttcgctgaa 13440 cagcgtgcgc agcagcgagc tgggtcggct gacgcggccg cgcctgctgg gcgaggagga 13500 gtacctgaac gactccttgt tgaggcccga gcgcgagaag aacttcccca ataacgggat 13560 agagagcctg gtggacaaga tgagccgctg gaagacgtac gcgcacgagc acagggacga 13620 gccccgagct agcagcagcg caggcacccg tagacgccag cgacacgaca ggcagcgggg 13680 tctggtgtgg gacgatgagg attccgccga cgacagcagc gtgttggact tgggtgggag 13740 tggtggtggt aacccgttcg ctcacttgcg cccccgtatc gggcgcctga tgtaagaatc 13800 tgaaaaaata aaaaacggta ctcaccaagg ccatggcgac cagcgtgcgt tcttctctgt 13860 tgtttgtagt agt atg atg agg cgc gtg tac ccg gag ggt cct cct ccc 13909
Met Met Arg Arg Val Tyr Pro Glu Gly Pro Pro Pro 15 10
tcg tac gag age gtg atg cag cag gcg gtg gcg gcg gcg atg cag ccc 13957 Ser Tyr Glu Ser Val Met Gin Gin Ala Val Ala Ala Ala Met Gin Pro 15 20 25
ccg etg gag gcg cct tac gtg ccc ccg egg tac etg gcg cct acg gag 14005 Pro Leu Glu Ala Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu 30 35 40
ggg egg aac age att cgt tac tcg gag etg gca ccc ttg tac gat acc 14053 Gly Arg Asn Ser Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr 45 50 55 60
acc egg ttg tac etg gtg gac aac aag tcg gcg gac atc gcc tcg etg 14101 Thr Arg Leu Tyr Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu 65 70 75
aac tac cag aac gac cac age aac ttc etg acc acc gtg gtg cag aac 14149 Asn Tyr Gin Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gin Asn 80 85 90
aac gat ttc acc ccc acg gag gcc age acc cag acc atc aac ttt gac 14197 Asn Asp Phe Thr Pro Thr Glu Ala Ser Thr Gin Thr Ile Asn Phe Asp 95 100 105
gag cgc tcg egg tgg ggc ggc cag etg aaa acc atc atg cac acc aac 14245 Glu Arg Ser Arg Trp Gly Gly Gin Leu Lys Thr Ile Met His Thr Asn 110 115 120
atg ccc aac gtg aac gag ttc atg tac age aac aag ttc aag gcg egg 14293 Met Pro Asn Val Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala Arg 125 130 135 140
gtg atg gtc tcg cgc aag acc ccc aat ggg gtc gcg gtg gat gag aat 14341 Val Met Val Ser Arg Lys Thr Pro Asn Gly Val Ala Val Asp Glu Asn 145 150 155
tat gat ggt agt cag gac gag etg act tac gag tgg gtg gag ttt gag 14389 Tyr Asp Gly Ser Gin Asp Glu Leu Thr Tyr Glu Trp Val Glu Phe Glu 160 165 170
etg ccc gag ggc aac ttc tcg gtg acc atg acc atc gat etg atg aac 14437 Leu Pro Glu Gly Asn Phe Ser Val Thr Met Thr Ile Asp Leu Met Asn 175 180 185
aac gcc atc atc gac aac tac ttg gcg gtg ggg cgt cag aac ggg gtg 14485 Asn Ala Ile Ile Asp Asn Tyr Leu Ala Val Gly Arg Gin Asn Gly Val 190 195 200
etg gag age gac atc ggc gtg aag ttc gac acg cgc aac ttc egg etg 14533 Leu Glu Ser Asp Ile Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu 205 210 215 220
ggc tgg gac ccc gtg acc gag etg gtg atg ccg ggc gtg tac acc aac 14581 Gly Trp Asp Pro Val Thr Glu Leu Val Met Pro Gly Val Tyr Thr Asn 225 230 235
gag gcc ttc cac ccc gac atc gtc etg etg ccc ggc tgc ggc gtg gac 14629 Glu Ala Phe His Pro Asp Ile Val Leu Leu Pro Gly Cys Gly Val Asp 240 245 250
ttc acc gag age cgc etc age aac etg etg ggc atc cgc aag egg cag 14677 Phe Thr Glu Ser Arg Leu Ser Asn Leu Leu Gly Ile Arg Lys Arg Gin 255 260 265
ccc ttc cag gag ggc ttc cag atc etg tac gag gac etg gag ggg ggc 14725 Pro Phe Gin Glu Gly Phe Gin Ile Leu Tyr Glu Asp Leu Glu Gly Gly 270 275 280
aac atc ccc gcg etc ttg gat gtc gaa gcc tat gag aaa age aag gag 14773 Asn Ile Pro Ala Leu Leu Asp Val Glu Ala Tyr Glu Lys Ser Lys Glu 285 290 295 300
gag gcc gcc gca gcg gcg acc gca gcc gtg gcc acc gcc tet acc gag 14821 Glu Ala Ala Ala Ala Ala Thr Ala Ala Val Ala Thr Ala Ser Thr Glu 305 310 315
gtg egg ggc gat aat ttt get age gcc gcg gca gtg gcc gag gcg get 14869 Val Arg Gly Asp Asn Phe Ala Ser Ala Ala Ala Val Ala Glu Ala Ala 320 325 330
gaa acc gaa agt aag ata gtc atc cag ccg gtg gag aag gac age aag 14917 Glu Thr Glu Ser Lys Ile Val Ile Gin Pro Val Glu Lys Asp Ser Lys 335 340 345
gac agg age tac aac gtg etc gcg gac aag aaa aac acc gcc tac cgc 14965 Asp Arg Ser Tyr Asn Val Leu Ala Asp Lys Lys Asn Thr Ala Tyr Arg 350 355 360
age tgg tac etg gcc tac aac tac ggc gac ccc gag aag ggc gtg cgc 15013 Ser Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg 365 370 375 380
tcc tgg acg etg etc acc acc tcg gac gtc acc tgc ggc gtg gag caa 15061 Ser Trp Thr Leu Leu Thr Thr Ser Asp Val Thr Cys Gly Val Glu Gin 385 390 395
gtc tac tgg tcg etg ccc gac atg atg caa gac ccg gtc acc ttc cgc 15109 Val Tyr Trp Ser Leu Pro Asp Met Met Gin Asp Pro Val Thr Phe Arg 400 405 410
tcc acg cgt caa gtt age aac tac ccg gtg gtg ggc gcc gag etc etg 15157 Ser Thr Arg Gin Val Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Leu 415 420 425
ccc gtc tac tcc aag age ttc ttc aac gag cag gcc gtc tac tcg cag 15205 Pro Val Tyr Ser Lys Ser Phe Phe Asn Glu Gin Ala Val Tyr Ser Gin 430 435 440
cag etg cgc gcc ttc acc tcg etc acg cac gtc ttc aac cgc ttc ccc 15253 Gin Leu Arg Ala Phe Thr Ser Leu Thr His Val Phe Asn Arg Phe Pro 445 450 455 460
gag aac cag atc etc gtc cgc ccg ccc gcg ccc acc att acc acc gtc 15301 Glu Asn Gin Ile Leu Val Arg Pro Pro Ala Pro Thr Ile Thr Thr Val 465 470 475
agt gaa aac gtt cct get etc aca gat cac ggg acc etg ccg etg cgc 15349 Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg 480 485 490
age agt atc egg gga gtc cag cgc gtg acc gtc act gac gcc aga cgc 15397 Ser Ser Ile Arg Gly Val Gin Arg Val Thr Val Thr Asp Ala Arg Arg 495 500 505
cgc acc tgc ccc tac gtc tac aag gcc etg ggc gta gtc gcg ccg cgc 15445 Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly Val Val Ala Pro Arg 510 515 520
gtc etc tcg age cgc acc ttc taa aaaatgtcca ttctcatctc gcccagtaat 15499 Val Leu Ser Ser Arg Thr Phe
525 530
aacaceggtt ggggcctgcg cgcgcccagc aagatgtacg gaggegcteg ccaacgctcc 15559 acgcaacacc ccgtgcgcgt gcgcgggcac ttccgcgctc cctggggcgc cctcaagggc 15619 cgcgtgcgct cgcgcaccac cgtcgacgac gtgatcgacc aggtggtggc cgacgcgcgc 15679 aactacacgc ccgccgccgc gcccgcctcc accgtggacg ccgtcatcga cagcgtggtg 15739 gccgatgcgc gccggtacgc ccgcgccaag agccggcggc ggcgcatcgc ccggcggcac 15799 cggagcaccc ccgccatgcg cgcggcgcga gccttgctgc gcagggccag gcgcacggga 15859 cgcagggcca tgctcagggc ggccagacgc gcggcctccg gcagcagcag cgccggcagg 15919 acccgcagac gcgcggccac ggcggcggcg gcggccatcg ccagcatgtc ccgcccgcgg 15979 cgcggcaacg tgtactgggt gcgcgacgcc gccaccggtg tgcgcgtgcc cgtgcgcacc 16039 cgcccccctc gcacttgaag atgctgactt cgcgatgttg atgtgtccca gcggcgagga 16099 ggatgtccaa gcgcaaatac aaggaagaga tgctccaggt catcgcgcct gagatctacg 16159 gccccgcggt gaaggaggaa agaaagcccc gcaaactgaa gcgggtcaaa aaggacaaaa 16219 aggaggagga agatgtggac ggactggtgg agtttgtgcg cgagttcgcc ccccggcggc 16279 gcgtgcagtg gcgcgggcgg aaagtgaaac cggtgctgcg gcccggcacc acggtggtct 16339 tcacgcccgg cgagcgttcc ggctccgcct ccaagcgctc ctacgacgag gtgtacgggg 16399 acgaggacat cctcgagcag gcggtcgagc gtctgggcga gtttgcttac ggcaagcgca 16459 gccgccccgc gcccttgaaa gaggaggcgg tgtccatccc gctggaccac ggcaacccca 16519 cgccgagcct gaagccggtg accctgcagc aggtgctgcc gagcgcggcg ccgcgccggg 16579 gcttcaagcg cgagggcggc gaggatctgt acccgaccat gcagctgatg gtgcccaagc 16639 gccagaagct ggaggacgtg ctggagcaca tgaaggtgga ccccgaggtg cagcccgagg 16699 tcaaggtgcg gcccatcaag caggtggccc cgggcctggg cgtgcagacc gtggacatca 16759 agatccccac ggagcccatg gaaacgcaga ccgagcccgt gaagcccagc accagcacca 16819 tggaggtgca gacggatccc tggatgccgg cgccggcttc caccactcgc cgaagacgca 16879 agtacggcgc ggccagcctg ctgatgccca actacgcgct gcatccttcc atcatcccca 16939 cgccgggcta ccgcggcacg cgcttctacc gcggctacac cagcagccgc cgcaagacca 16999 ccacccgccg ccgccgtcgt cgcacccgcc gcagcagcac cgcgacttcc gccgccgccc 17059 tggtgcggag agtgtaccgc agcgggcgcg agcctctgac cctgccgcgc gcgcgctacc 17119 acccgagcat cgccatttaa ctctgccgtc gcctcctact tgcagatatg gccctcacat 17179 gccgcctccg cgtccccatt acgggctacc gaggaagaaa gccgcgccgt agaaggctga 17239 cggggaacgg gctgcgtcgc catcaccacc ggcggcggcg cgccatcagc aagcggttgg 17299 ggggaggctt cctgcccgcg ctgatcccca tcatcgccgc ggcgatcggg gcgatccccg 17359 gcatagcttc cgtggcggtg caggcctctc agcgccactg agacacagct tggaaaattt 17419 gtaataaaaa aatggactga cgctcctggt cctgtgatgt gtgtttttag atggaagaca 17479 tcaatttttc gtccctggca ccgcgacacg gcacgcggcc gtttatgggc acctggagcg 17539 acatcggcaa cagccaactg aacgggggcg ccttcaattg gagcagtctc tggagcgggc 17599 ttaagaattt cgggtccacg ctcaaaacct atggcaacaa ggcgtggaac agcagcacag 17659 ggcaggcgct gagggaaaag ctgaaagagc agaacttcca gcagaaggtg gtcgatggcc 17719 tggcctcggg catcaacggg gtggtggacc tggccaacca ggccgtgcag aaacagatca 17779 acagccgcct ggacgcggtc ccgcccgcgg ggtccgtgga gatgccccag gtggaggagg 17839 agctgcctcc cctggacaag cgcggcgaca agcgaccgcg tcccgacgcg gaggagacgc 17899 tgctgacgca cacggacgag ccgcccccgt acgaggaggc ggtgaaactg ggtctgccca 17959 ccacgcggcc cgtggcgcct ctggccaccg gggtgctgaa acccagcagc agcagccagc 18019 ccgcgaccct ggacttgcct ccgcctgctt cccgcccctc cacagtggct aagcccctgc 18079 cgccggtggc cgtcgcgtcg cgcgcccccc gaggccgccc ccaggcgaac tggcagagca 18139 ctctgaacag catcgtgggt ctgggagtgc agagtgtgaa gcgccgccgc tgctattaaa 18199 agacactgta gcgcttaact tgcttgtctg tgtgtatatg tatgtccgcc gaccagaagg 18259 aggaagaggc gcgtcgccga gttgcaag atg gcc acc cca tcg atg etg ccc 18311
Met Ala Thr Pro Ser Met Leu Pro 535
cag tgg gcg tac atg cac atc gcc gga cag gac get tcg gag tac etg 18359 Gin Trp Ala Tyr Met His Ile Ala Gly Gin Asp Ala Ser Glu Tyr Leu 540 545 550 555
agt ccg ggt etg gtg cag ttc gcc cgc gcc aca gac acc tac ttc agt 18407 Ser Pro Gly Leu Val Gin Phe Ala Arg Ala Thr Asp Thr Tyr Phe Ser 560 565 570
etg ggg aac aag ttt agg aac ccc acg gtg gcg ccc acg cac gat gtg 18455 Leu Gly Asn Lys Phe Arg Asn Pro Thr Val Ala Pro Thr His Asp Val 575 580 585
acc acc gac cgc age cag egg etg acg etg cgc ttc gtg ccc gtg gac 18503 Thr Thr Asp Arg Ser Gin Arg Leu Thr Leu Arg Phe Val Pro Val Asp 590 595 600
cgc gag gac aac acc tac tcg tac aaa gtg cgc tac acg etg gcc gtg 18551 Arg Glu Asp Asn Thr Tyr Ser Tyr Lys Val Arg Tyr Thr Leu Ala Val 605 610 615
ggc gac aac cgc gtg etg gac atg gcc age acc tac ttt gac atc cgc 18599 Gly Asp Asn Arg Val Leu Asp Met Ala Ser Thr Tyr Phe Asp Ile Arg 620 625 630 635
ggc gtg etg gat egg ggg ccc age ttc aaa ccc tac tcc ggc acc gcc 18647 Gly Val Leu Asp Arg Gly Pro Ser Phe Lys Pro Tyr Ser Gly Thr Ala 640 645 650
tac aac age etg get ccc aag gga gcg ccc aac act tgc cag tgg aca 18695 Tyr Asn Ser Leu Ala Pro Lys Gly Ala Pro Asn Thr Cys Gin Trp Thr 655 660 665
tat aaa get ggt gat act gat aca gaa aaa acc tat aca tat gga aat 18743 Tyr Lys Ala Gly Asp Thr Asp Thr Glu Lys Thr Tyr Thr Tyr Gly Asn 670 675 680
gca cct gtg caa ggc att age att aca aag gat ggt att caa ett gga 18791 Ala Pro Val Gin Gly Ile Ser Ile Thr Lys Asp Gly Ile Gin Leu Gly 685 690 695
act gac age gat ggt cag gca atc tat gca gac gaa act tat caa cca 18839 Thr Asp Ser Asp Gly Gin Ala Ile Tyr Ala Asp Glu Thr Tyr Gin Pro 700 705 710 715
gag cct caa gtg ggt gat get gaa tgg eat gac atc act ggt act gat 18887 Glu Pro Gin Val Gly Asp Ala Glu Trp His Asp Ile Thr Gly Thr Asp 720 725 730
gaa aaa tat gga ggc aga get ett aag cct gac acc aaa atg aag cct 18935 Glu Lys Tyr Gly Gly Arg Ala Leu Lys Pro Asp Thr Lys Met Lys Pro 735 740 745
tgc tat ggt tet ttt gcc aag cct acc aat aaa gaa gga ggc cag gca 18983 Cys Tyr Gly Ser Phe Ala Lys Pro Thr Asn Lys Glu Gly Gly Gin Ala 750 755 760
aat gtg aaa acc gaa aca ggc ggt acc aaa gaa tat gac att gac atg 19031 Asn Val Lys Thr Glu Thr Gly Gly Thr Lys Glu Tyr Asp Ile Asp Met 765 770 775
gca ttc ttc gat aat ega agt gca get gcc gcc ggc eta gcc cca gaa 19079 Ala Phe Phe Asp Asn Arg Ser Ala Ala Ala Ala Gly Leu Ala Pro Glu 780 785 790 795
att gtt ttg tat act gag aat gtg gat etg gaa act cca gat acc eat 19127 Ile Val Leu Tyr Thr Glu Asn Val Asp Leu Glu Thr Pro Asp Thr His 800 805 810
att gta tac aag gca ggt aca gat gac agt age tet tet atc aat ttg 19175 Ile Val Tyr Lys Ala Gly Thr Asp Asp Ser Ser Ser Ser Ile Asn Leu 815 820 825
ggt cag cag tcc atg ccc aac aga ccc aac tac att ggc ttc aga gac 19223 Gly Gin Gin Ser Met Pro Asn Arg Pro Asn Tyr Ile Gly Phe Arg Asp 830 835 840
aac ttt atc ggt etg atg tac tac aac age act ggc aat atg ggt gta 19271 Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val 845 850 855
etg get gga cag gcc tcc cag etg aat get gtg gtg gac ttg cag gac 19319 Leu Ala Gly Gin Ala Ser Gin Leu Asn Ala Val Val Asp Leu Gin Asp 860 865 870 875
aga aac acc gaa etg tcc tac cag etc ttg ett gac tet etg ggt gac 19367 Arg Asn Thr Glu Leu Ser Tyr Gin Leu Leu Leu Asp Ser Leu Gly Asp 880 885 890
aga acc agg tat ttc agt atg tgg aat cag gcg gtg gac agt tat gac 19415 Arg Thr Arg Tyr Phe Ser Met Trp Asn Gin Ala Val Asp Ser Tyr Asp 895 900 905
ccc gat gtg cgc att att gaa aat cac ggt gtg gag gat gaa ett cct 19463 Pro Asp Val Arg Ile Ile Glu Asn His Gly Val Glu Asp Glu Leu Pro 910 915 920
aac tat tgc ttc ccc etg gat get gtg ggt aga act gat act tac cag 19511 Asn Tyr Cys Phe Pro Leu Asp Ala Val Gly Arg Thr Asp Thr Tyr Gin 925 930 935
gga att aag gcc aat ggt gat aat caa acc acc tgg acc aaa gat gat 19559 Gly Ile Lys Ala Asn Gly Asp Asn Gin Thr Thr Trp Thr Lys Asp Asp 940 945 950 955
act gtt aat gat get aat gaa ttg ggc aag ggc aat cct ttc gcc atg 19607 Thr Val Asn Asp Ala Asn Glu Leu Gly Lys Gly Asn Pro Phe Ala Met 960 965 970
gag atc aac atc cag gcc aac etg tgg egg aac ttc etc tac gcg aac 19655 Glu Ile Asn Ile Gin Ala Asn Leu Trp Arg Asn Phe Leu Tyr Ala Asn 975 980 985
gtg gcg etg tac etg ccc gac tcc tac aag tac acg ccg gcc aac atc 19703 Val Ala Leu Tyr Leu Pro Asp Ser Tyr Lys Tyr Thr Pro Ala Asn Ile 990 995 1000
acg etg ccc acc aac acc aac acc tac gat tac atg aac ggc cgc 19748 Thr Leu Pro Thr Asn Thr Asn Thr Tyr Asp Tyr Met Asn Gly Arg 1005 1010 1015
gtg gtg gcg ccc tcg etg gtg gac gcc tac atc aac atc ggg gcg 19793 Val Val Ala Pro Ser Leu Val Asp Ala Tyr Ile Asn Ile Gly Ala 1020 1025 1030
cgc tgg tcg etg gac ccc atg gac aac gtc aac ccc ttc aac cac 19838 Arg Trp Ser Leu Asp Pro Met Asp Asn Val Asn Pro Phe Asn His 1035 1040 1045
cac cgc aac gcg ggc etg ega tac cgc tcc atg etc etg ggc aac 19883 His Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met Leu Leu Gly Asn 1050 1055 1060
ggg cgc tac gtg ccc ttc cac atc cag gtg ccc caa aag ttt ttc 19928 Gly Arg Tyr Val Pro Phe His Ile Gin Val Pro Gin Lys Phe Phe 1065 1070 1075
gcc atc aag age etc etg etc etg ccc ggg tcc tac acc tac gag 19973 Ala Ile Lys Ser Leu Leu Leu Leu Pro Gly Ser Tyr Thr Tyr Glu 1080 1085 1090
tgg aac ttc cgc aag gac gtc aac atg atc etg cag age tcc etc 20018 Trp Asn Phe Arg Lys Asp Val Asn Met Ile Leu Gin Ser Ser Leu 1095 1100 1105
ggc aac gac etg cgc acg gac ggg gcc tcc atc gcc ttc acc age 20063 Gly Asn Asp Leu Arg Thr Asp Gly Ala Ser Ile Ala Phe Thr Ser 1110 1115 1120
atc aac etc tac gcc acc ttc ttc ccc atg gcg cac aac acc gcc 20108 Ile Asn Leu Tyr Ala Thr Phe Phe Pro Met Ala His Asn Thr Ala 1125 1130 1135
tcc acg etc gag gcc atg etg cgc aac gac acc aac gac cag tcc 20153 Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp Gin Ser 1140 1145 1150
ttc aac gac tac etc tcg gcg gcc aac atg etc tac ccc atc ccg 20198 Phe Asn Asp Tyr Leu Ser Ala Ala Asn Met Leu Tyr Pro Ile Pro 1155 1160 1165
gcc aac gcc acc aac gtg ccc atc tcc atc ccc tcg cgc aac tgg 20243 Ala Asn Ala Thr Asn Val Pro Ile Ser Ile Pro Ser Arg Asn Trp 1170 1175 1180
gcc gcc ttc cgc ggc tgg tcc ttc acg cgc etc aag acc cgc gag 20288 Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg Leu Lys Thr Arg Glu 1185 1190 1195
acg ccc tcg etc ggc tcc ggg ttc gac ccc tac ttc gtc tac tcg 20333 Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr Phe Val Tyr Ser 1200 1205 1210
ggc tcc atc ccc tac etc gac ggc acc ttc tac etc aac cac acc 20378 Gly Ser Ile Pro Tyr Leu Asp Gly Thr Phe Tyr Leu Asn His Thr 1215 1220 1225
ttc aag aag gtc tcc atc acc ttc gac tcc tcc gtc age tgg ccc 20423 Phe Lys Lys Val Ser Ile Thr Phe Asp Ser Ser Val Ser Trp Pro 1230 1235 1240
ggc aac gac cgc etc etg acg ccc aac gag ttc gaa atc aag cgc 20468 Gly Asn Asp Arg Leu Leu Thr Pro Asn Glu Phe Glu Ile Lys Arg 1245 1250 1255
acc gtc gac gga gag ggg tac aac gtg gcc cag tgc aac atg acc 20513 Thr Val Asp Gly Glu Gly Tyr Asn Val Ala Gin Cys Asn Met Thr 1260 1265 1270
aag gac tgg ttc etg gtc cag atg etg gcc cac tac aac atc ggc 20558 Lys Asp Trp Phe Leu Val Gin Met Leu Ala His Tyr Asn Ile Gly 1275 1280 1285
tac cag ggc ttc tac gtg ccc gag ggc tac aag gac cgc atg tac 20603 Tyr Gin Gly Phe Tyr Val Pro Glu Gly Tyr Lys Asp Arg Met Tyr 1290 1295 1300
tcc ttc ttc cgc aac ttc cag ccc atg age cgc cag gtc gtg gac 20648 Ser Phe Phe Arg Asn Phe Gin Pro Met Ser Arg Gin Val Val Asp 1305 1310 1315
gag gtc aac tac aag gac tac cag gcc gtc acc etg gcc tac cag 20693 Glu Val Asn Tyr Lys Asp Tyr Gin Ala Val Thr Leu Ala Tyr Gin 1320 1325 1330
cac aac aac tcg ggc ttc gtc ggc tac etc gcg ccc acc atg cgc 20738 His Asn Asn Ser Gly Phe Val Gly Tyr Leu Ala Pro Thr Met Arg 1335 1340 1345
cag ggc cag ccc tac ccc gcc aac tac ccc tac ccg etc atc ggc 20783 Gin Gly Gin Pro Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu Ile Gly 1350 1355 1360
aag age gcc gtc gcc age gtc acc cag aaa aag ttc etc tgc gac 20828 Lys Ser Ala Val Ala Ser Val Thr Gin Lys Lys Phe Leu Cys Asp 1365 1370 1375
egg gtc atg tgg cgc atc ccc ttc tcc age aac ttc atg tcc atg 20873 Arg Val Met Trp Arg Ile Pro Phe Ser Ser Asn Phe Met Ser Met 1380 1385 1390
ggc gcg etc acc gac etc ggc cag aac atg etc tac gcc aac tcc 20918 Gly Ala Leu Thr Asp Leu Gly Gin Asn Met Leu Tyr Ala Asn Ser 1395 1400 1405
gcc cac gcg eta gac atg aat ttc gaa gtc gac ccc atg gat gag 20963 Ala His Ala Leu Asp Met Asn Phe Glu Val Asp Pro Met Asp Glu 1410 1415 1420
tcc acc ett etc tat gtt gtc ttc gaa gtc ttc gac gtc gtc ega 21008 Ser Thr Leu Leu Tyr Val Val Phe Glu Val Phe Asp Val Val Arg 1425 1430 1435
gtg cac cag ccc cac cgc ggc gtc atc gag gcc gtc tac etg cgc 21053 Val His Gin Pro His Arg Gly Val Ile Glu Ala Val Tyr Leu Arg 1440 1445 1450
acg ccc ttc tcg gcc ggc aac gcc acc acc taa gcctcttgct 21096 Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr
1455 1460
tettgeaaga tgaeggcetg cgcgggctcc ggegagcagg agetcaggge catcctccgc 21156 gaectggget gcgggccctg cttcctgggc accttcgaca agcgcttccc gggattcatg 21216 gccccgcaca agctggcctg cgccatcgtc aacacggccg gccgcgagac egggggegag 21276 cactggctgg ccttcgcctg gaacccgcgc tcccacacct gctacctctt cgaccccttc 21336 gggttctegg acgagcgcct caagcagatc taccagttcg agtacgaggg cctgctgcgt 21396 cgcagcgccc tggccaccga ggaccgctgc gtcaccctgg aaaagtccac ccagaccgtg 21456 cagggtccgc gctcggccgc ctgcgggctc ttctgctgca tgttcctgca cgccttcgtg 21516 cactggcccg accgccccat ggacaagaac cccaccatga acttgetgae gggggtgece 21576 aaeggcatgc tccagtcgcc ccaggtggaa cccaccctgc gccgcaacca ggaggegetc 21636 taccgcttcc tcaacgccca ctccgcctac tttcgctccc accgcgcgcg catcgagaag 21696 gccaccgcct tcgaccgcat gaatcaagac atgtaateeg gtgtgtgtat gtgaatgett 21756 tattcatcat aataaacagc acatgtttat gccaccttct ctgaggetet gactttattt 21816 agaaatcgaa ggggttctgc cggctctcgg catggcccgc gggcagggat aegttgegga 21876 actggtactt gggcagccac ttgaactegg ggatcagcag cttcggcacg gggaggtegg 21936 ggaacgagtc gctccacagc ttgegegtga gttgeaggge gcccagcagg tegggegegg 21996 agatcttgaa ategcagttg ggacccgcgt tctgcgcgcg agagttaegg taeaeggggt 22056 tgcagcactg gaacaccatc agggcegggt gcttcacgct cgccagcacc gtcgegtegg 22116 tgatgccctc cacgtccaga tcctcggcgt tggccatccc gaagggggtc atettgeagg 22176 tctgccgccc catgctgggc acgcagccgg gcttgtggtt geaategcag tgcaggggga 22236 tcagcatcat ctgggcctgc teggagctea tgccegggta catggccttc atgaaagect 22296 ccagctggcg gaaggcctgc tgcgccttgc cgccctcggt gaagaagacc ccgcaggact 22356 tgetagagaa ctggttggtg gcgcagccag cgtcgtgcac gcagcagcgc gcgtcgttgt 22416 tggccagctg caccacgctg cgcccccagc ggttctgggt gatcttggcc eggteggggt 22476 tctccttcag cgcgcgctgc ccgttctcgc tcgccacatc catctcgatc gtgtgctcct 22536 tctggatcat cacggtcccg tgcaggcacc gcagcttgcc ctcggcctcg gtgcacccgt 22596 gcagccacag cgcgcagccg gtgctctccc agttcttgtg ggcgatctgg gagtgegagt 22656 gcacgaagcc ctgcaggaag cggcccatca tegtggtcag ggtcttgttg ctggtgaagg 22716 tcagcggaat gccgcggtgc tcctcgttca catacaggtg gcagatacgg cggtacacct 22776 cgccctgctc gggcatcagc tggaaggcgg acttcaggtc gctctccacg cggtaccggt 22836 ccatcagcag cgtcatcact tccatgccct tctcccaggc cgaaacgatc ggcaggctca 22896 gggggttctt caccgttgtc atcttagtcg ccgccgccga agtcaggggg tcgttctcgt 22956 ccagggtctc aaacactcgc ttgccgtcct tctcggtgat gcgcacgggg ggaaagctga 23016 agcccacggc cgccagctcc tcctcggcct gcctttcgtc ctcgctgtcc tggctgatgt 23076 cttgcaaagg cacatgcttg gtcttgcggg gtttcttttt gggcggcaga ggcggcggcg 23136 gagacgtgct gggcgagcgc gagttctcgc tcaccacgac tatttcttct ccttggccgt 23196 cgtccgagac cacgcggcgg taggcatgcc tcttctgggg cagaggcgga ggcgacgggc 23256 tctcgcggtt cggcgggcgg ctggcagagc cccttccgcg ttcgggggtg cgctcctggc 23316 ggcgctgctc tgactgactt cctccgcggc cggccattgt gttctcctag ggagcaagca 23376 tggagactca gccatcgtcg ccaacatcgc catctgcccc cgccgccgcc gacgagaacc 23436 agcagcagca gaatgaaagc ttaaccgccc cgccgcccag ccccacctcc gacgccgcag 23496 ccccagacat gcaagagatg gaggaatcca tcgagattga cctgggctac gtgacgcccg 23556 cggagcacga ggaggagctg gcagcgcgct tttcagcccc ggaagagaac caccaagagc 23616 agccagagca ggaagcagag agcgagcaga accaggctgg gctcgagcat ggcgactacc 23676 tgagcggggc agaggacgtg ctcatcaagc atctggcccg ccaatgcatc atcgtcaagg 23736 acgcgctgct cgaccgcgcc gaggtgcccc tcagcgtggc ggagctcagc cgcgcctacg 23796 agcgcaacct cttctcgccg cgcgtgcccc ccaagcgcca gcccaacggc acctgcgagc 23856 ccaacccgcg cctcaacttc tacccggtct tcgcggtgcc cgaggccctg gccacctacc 23916 acctcttttt caagaaccaa aggatccccg tctcctgccg cgccaaccgc acccgcgccg 23976 acgccctgct caacctgggc cccggcgccc gcctacctga tatcgcctcc ttggaagagg 24036 ttcccaagat cttcgagggt ctgggcagcg acgagactcg ggccgcgaac gctctgcaag 24096 gaagcggaga ggagcatgag caccacagcg ccctggtgga gttggaaggc gacaacgcgc 24156 gcctggcggt cctcaagcgc acggtcgagc tgacccactt cgcctacccg gcgctcaacc 24216 tgccccccaa ggtcatgagc gccgtcatgg accaggtgct catcaagcgc gcctcgcccc 24276 tctcggagga ggagatgcag gaccccgaga gctcggacga gggcaagccc gtggtcagcg 24336 acgagcagct ggcgcgctgg ctgggagcga gtagcacccc ccagagcctg gaagagcggc 24396 gcaagctcat gatggccgtg gtcctggtga ccgtggagct ggagtgtctg cgccgcttct 24456 tcgccgacgc ggagaccctg cgcaaggtcg aggagaacct gcactacctc ttcagacacg 24516 ggttcgtgcg ccaggcctgc aagatctcca acgtggagct gaccaacctg gtctcctaca 24576 tgggcatcct gcacgagaac cgcctggggc agaacgtgct gcacaccacc ctgcgcgggg 24636 aggcccgccg cgactacatc cgcgactgcg tctacctgta cctctgccac acctggcaga 24696 cgggcatggg cgtgtggcag cagtgcctgg aggagcagaa cctgaaagag ctctgcaagc 24756 tcctgcagaa gaacctcaag gccctgtgga ccgggttcga cgagcgcacc accgccgcgg 24816 acctggccga cctcatcttc cccgagcgcc tgcggctgac gctgcgcaac gggctgcccg 24876 actttatgag ccaaagcatg ttgcaaaact ttcgctcttt catcctcgaa cgctccggga 24936 tcctgcccgc cacctgctcc gcgctgccct cggacttcgt gccgctgacc ttccgcgagt 24996 gccccccgcc gctctggagc cactgctacc tgctgcgcct ggccaactac ctggcctacc 25056 actcggacgt gatcgaggac gtcagcggcg agggcctgct cgagtgccac tgccgctgca 25116 acctctgcac gccgcaccgc tccctggcct gcaaccccca gctgctgagc gagacccaga 25176 tcatcggcac cttcgagttg caaggccccg gcgagggcaa ggggggtctg aaactcaccc 25236 cggggctgtg gacctcggcc tacttgcgca agttcgtgcc cgaggactac catcccttcg 25296 agatcaggtt ctacgaggac caatcccagc cgcccaaggc cgagctgtcg gcctgcgtca 25356 tcacccaggg ggccatcctg gcccaattgc aagccatcca gaaatcccgc caagaatttc 25416 tgctgaaaaa gggccacggg gtctacttgg acccccagac cggagaggag ctcaacccca 25476 gcttccccca ggatgccccg aggaagcagc aagaagctga aagtggagct gccgccgccg 25536 ccggaggatt tggaggaaga ctgggagagc agtcaggcag aggaggagga gatggaagac 25596 tgggacagca ctcaggcaga ggaggacagc ctgcaagaca gtctggagga ggaagacgag 25656 gtggaggagg cagaggaaga agcagccgcc gccagaccgt cgtcctcggc ggaggaggag 25716 aaagcaagca gcacggatac catctccgct ccgggtcggg gtcgcggcgg ccgggcccac 25776 agtagatggg acgagaccgg gcgcttcccg aaccccacca cccagaccgg taagaaggag 25836 cggcagggat acaagtcctg gcgggggcac aaaaacgcca tcgtctcctg cttgcaagcc 25896 tgcgggggca acatctcctt cacccggcgc tacctgctct tccaccgcgg ggtgaacttc 25956 ccccgcaaca tcttgcatta ctaccgtcac ctccacagcc cctactactg tttccaagaa 26016 gaggcagaaa cccagcagca gcagcagcag cagaaaacca gcggcagcag ctagaaaatc 26076 cacagcggcg gcaggtggac tgaggatcgc ggcgaacgag ccggcgcaga cccgggagct 26136 gaggaaccgg atctttccca ccctctatgc catcttccag cagagtcggg ggcaagagca 26196 ggaactgaaa gtcaagaacc gttctctgcg ctcgctcacc cgcagttgtc tgtatcacaa 26256 gagcgaagac caacttcagc gcactctcga ggacgccgag gctctcttca acaagtactg 26316 cgcgctcact cttaaagagt agcccgcgcc cgcccacaca cggaaaaagg cgggaattac 26376 gtcaccacct gcgcccttcg cccgaccatc atcatgagca aagagattcc cacgccttac 26436 atgtggagct accagcccca gatgggcctg gccgccggcg ccgcccagga ctactccacc 26496 cgcatgaact ggctcagtgc cgggcccgcg atgatctcac gggtgaatga catccgcgcc 26556 caccgaaacc agatactcct agaacagtca gcgatcaccg ccacgccccg ccatcacctt 26616 aatccgcgta attggcccgc cgccctggtg taccaggaaa ttccccagcc cacgaccgta 26676 ctacttccgc gagacgccca ggccgaagtc cagctgacta actcaggtgt ccagctggcc 26736 ggcggcgccg ccctgtgtcg tcaccgcccc gctcagggta taaagcggct ggtgatccga 26796 ggcagaggca cacagctcaa cgacgaggtg gtgagctctt cgctgggtct gcgacctgac 26856 ggagtcttcc aactcgccgg atcggggaga tcttccttca cgcctcgtca ggccgtcctg 26916 actttggaga gttcgtcctc gcagccccgc tcgggtggca tcggcactct ccagttcgtg 26976 gaggagttca ctccctcggt ctacttcaac cccttctccg gctcccccgg ccactacccg 27036 gacgagttca tcccgaactt cgacgccatc agcgagtcgg tggacggcta cgattgaatg 27096 tcccatggtg gcgcggctga cctagctcgg cttcgacacc tggaccactg ccgccgcttc 27156 cgctgcttcg ctcgggatct cgccgagttt gcctactttg agctgcccga ggagcaccct 27216 cagggcccgg cccacggagt gcggatcgtc gtcgaagggg gtctcgactc ccacctgctt 27276 cggatcttca gccagcgtcc gatcctggcc gagcgcgagc aaggacagac ccttctgacc 27336 ctgtactgca tctgcaacca ccccggcctg catgaaagtc tttgttgtct gctgtgtact 27396 gagtataata aaagctgaga tcagcgacta ctccggactt ccgtgtgttc ctgctatcaa 27456 ccagtccctg ttcttcaccg ggaacgagac cgagctccag ctccagtgta agccccacaa 27516 gaagtacctc acctggctgt tccagggctc tccgatcgcc gttgtcaacc actgcgacaa 27576 cgacggagtc ctgctgagcg gccctgccaa ccttactttt tccacccgca gaagcaagct 27636 ccagctcttc caacccttcc tccccgggac ctatcagtgc gtctcgggac cctgccatca 27696 caccttccac ctgatcccga ataccacagc gtcgctcccc gctactaaca accaaactac 27756 ccaccaacgc caccgtcgcg acctttcctc tgggtctaat accactaccg gaggtgagct 27816 ccgaggtcga ccaacctctg ggatttacta cggcccctgg gaggtggtag ggttaatagc 27876 gctaggccta gttgcgggtg ggcttttggc tctctgctac ctatacctcc cttgctgttc 27936 gtacttagtg gtgctgtgtt gctggtttaa gaaatgggga agatcaccct agtgagctgc 27996 ggtgtgctgg tggcggtggt gctttcgatt gtgggactgg gcggcgcggc tgtagtgaag 28056 gagaaggccg atccctgctt gcatttcaat cccgacaaat gccagctgag ttttcagccc 28116 gatggcaatc ggtgcgcggt gctgatcaag tgcggatggg aatgcgagaa cgtgagaatc 28176 gagtacaata acaagactcg gaacaatact ctcgcgtccg tgtggcagcc cggggacccc 28236 gagtggtaca ccgtctctgt ccccggtgct gacggctccc cgcgcaccgt gaataatact 28296 ttcatttttg cgcacatgtg cgacacggtc atgtggatga gcaagcagta cgatatgtgg 28356 ccccccacga aggagaacat cgtggtcttc tccatcgctt acagcgtgtg cacggcgcta 28416 atcaccgcta tcgtgtgcct gagcattcac atgctcatcg ctattcgccc cagaaataat 28476 gccgaaaaag aaaaacagcc ataacacgtt ttttcacaca cctttttcag accatggcct 28536 ctgttaaatt tttgctttta tttgccagtc tcattgccgt cattcatgga atgagtaatg 28596 agaaaattac tatttacact ggcactaatc acacattgaa aggtccagaa aaagccacag 28656 aagtttcatg gtattgttat tttaatgaat cagatgtatc tactgaactc tgtggaaaca 28716 ataacaaaaa aaatgagagc attactctca tcaagtttca atgtggatct gacttaaccc 28776 taattaacat cactagagac tatgtaggta tgtattatgg aactacagca ggcatttcgg 28836 acatggaatt ttatcaagtt tctgtgtctg aacccaccac gcctagaatg accacaacca 28896 caaaaactac acctgttacc actatacagc tcactaccaa tggctttctt gccatgcttc 28956 aagtggctga aaatagcacc agcattcaac ccaccccacc cagtgaggaa attcccagat 29016 ccatgattgg cattattgtt gctgtagtgg tgtgcatgtt gatcatcgcc ttgtgcatgg 29076 tgtactatgc cttctgctac agaaagcaca gactgaacga caagctggaa cacttactaa 29136 gtgttgaatt ttaatttttt agaaccatga agatcctagg ccttttagtt ttttctatca 29196 ttacctctgc tctatgcaat tctgacaatg aggacgttac tgtcgttgtc ggatcaaatt 29256 atacactaaa aggtccagca aaaggtatgc tttcgtggta ttgttggttc ggaactgacg 29316 agcaacagac agaactttgc aatgctcaaa aaggcaaaac ctcaaattct aaaatctcta 29376 attatcaatg caatggcact gacttagtat tgctcaatgt cacgaaagca tatgctggca 29436 gttacacctg ccctggagat gatgccgaca atatgatttt ttacaaagtg gaagtggttg 29496 atcccactac tccaccgccc accaccacaa ctactcatac cacacacaca gaacaaacac 29556 cagaggcagc agaagcagag ttggccttcc aggttcacgg agattccttt gctgtcaata 29616 cccctacacc cgatcagcgg tgtccggggc tgctcgtcag cggcattgtc ggtgtgcttt 29676 cgggattagc agtcataatc atctgcatgt tcatttttgc ttgctgctat agaaggcttt 29736 accgacaaaa atcagaccca ctgctgaacc tctatgttta attttttcca gagccatgaa 29796 ggcagttagc gctctagttt tttgttcttt gattggcatt gtttttagtg ctgggttttt 29856 gaaaaatctt accatttatg aaggtgagaa tgccactcta gtgggcatca gtggtcaaaa 29916 tgtcagctgg ctaaaatacc atctagatgg gtggaaagac atttgcgatt ggaatgtcac 29976 tgtgtataca tgtaatggag ttaacctcac cattactaat gccacccaag atcagaatgg 30036 taggtttaag ggccagagtt tcactagaaa taatgggtat gaatcccata acatgtttat 30096 ctatgacgtc actgtcatca gaaatgagac tgccaccacc acacagatgc ccactacaca 30156 cagttctacc actactacca tgcaaaccac acagacaacc actacatcaa ctcagcatat 30216 gaccaccact acagcagcaa agccaagtag tgcagcgcct cagccccagg ctttggcttt 30276 gaaagctgca caacctagta caactactag gaccaatgag cagactactg aatttttgtc 30336 cactgtcgag agccacacca cagctacctc cagtgccttc tctagcaccg ccaatctctc 30396 ctcgctttcc tctacaccaa tcagtcccgc tactactccc accccagctc ttctccccac 30456 tcccctgaag caaactgagg acagcggcat gcaatggcag atcaccctgc tcattgtgat 30516 cgggttggtc atcctggccg tgttgctcta ctacatcttc tgccgccgca ttcccaacgc 30576 gcaccgcaaa ccggcctaca agcccatcgt tatcgggcag ccggagccgc ttcaggtgga 30636 agggggtcta aggaatcttc tcttctcttt tacagtatgg tgattgaact atgattccta 30696 gacaattctt gatcactatt cttatctgcc tcctccaagt ctgtgccacc ctcgctctgg 30756 tggccaacgc cagtccagac tgtattgggc ccttcgcctc ctacgtgctc tttgccttca 30816 tcacctgcat ctgctgctgt agcatagtct gcctgcttat caccttcttc cagttcattg 30876 actggatctt tgtgcgcatc gcctacctgc gccaccaccc ccagtaccgc gaccagcgag 30936 tggcgcggct gctcaggctc ctctgataag catgcgggct ctgctacttc tcgcgcttct 30996 gctgttagtg ctcccccgcc ccgtcgaccc ccggtccccc actcagtccc ccgaagaggt 31056 ccgcaaatgc aaattccaag aaccctggaa attcctcaaa tgctaccgcc aaaaatcaga 31116 catgcttccc agctggatca tgatcattgg gatcgtgaac attctggcct gcaccctcat 31176 ctcctttgtg atttacccct gctttgactt tggttggaac tcgccagagg cgctctatct 31236 cccgcctgaa cctgacacac caccacagca acctcaggca cacgcactac caccaccaca 31296 gcctaggcca caatacatgc ccatattaga ctatgaggcc gagccacagc gacccatgct 31356 ccccgctatt agttacttca atctaaccgg cggagatgac tgacccactg gccaacaaca 31416 acgtcaacga ccttctcctg gacatggacg gccgcgcctc ggagcagcga ctcgcccaac 31476 ttcgcattcg ccagcagcag gagagagccg tcaaggagct gcaggacggc atagccatcc 31536 accagtgcaa gaaaggcatc ttctgcctgg tgaaacaggc caagatctcc tacgaggtca 31596 ccccgaccga ccatcgcctc tcctacgagc tcctgcagca gcgccagaag ttcacctgcc 31656 tggtcggagt caaccccatc gtcatcaccc agcagtcggg cgataccaag gggtgcatcc 31716 actgctcctg cgactccccc gactgcgtcc acactctgat caagaccctc tgcggcctcc 31776 gcgacctcct ccccatgaac taatcacccc cttatccagt gaaataaata tcatattgat 31836 gatgatttaa ataaaaaata atcatttgat ttgaaataaa gatacaatca tattgatgat 31896 ttgagtttta aaaaataaag aatcacttac ttgaaatctg ataccaggtc tctgtccatg 31956 ttttctgcca acaccacctc actcccctct tcccagctct ggtactgcag accccggcgg 32016 gctgcaaact tcctccacac gctgaagggg atgtcaaatt cctcctgtcc ctcaatcttc 32076 attttatctt ctatcag atg tcc aaa aag cgc gtc egg gtg gat gat gac 32126
Met Ser Lys Lys Arg Val Arg Val Asp Asp Asp 1465 1470
ttc gac ccc gtc tac ccc tac gat gca gac aac gca ccg acc gtg 32171 Phe Asp Pro Val Tyr Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val 1475 1480 1485
ccc ttc atc aac ccc ccc ttc gtc tet tea gat gga ttc caa gag 32216 Pro Phe Ile Asn Pro Pro Phe Val Ser Ser Asp Gly Phe Gin Glu 1490 1495 1500
aag ccc etg ggg gtg etg tcc etg ega etg get gac ccc gtc acc 32261 Lys Pro Leu Gly Val Leu Ser Leu Arg Leu Ala Asp Pro Val Thr 1505 1510 1515
acc aag aac ggg gaa atc acc etc aag etg gga gag ggg gtg gac 32306 Thr Lys Asn Gly Glu Ile Thr Leu Lys Leu Gly Glu Gly Val Asp 1520 1525 1530
etc gac tcc tcg gga aaa etc atc tcc aac acg gcc acc aag gcc 32351 Leu Asp Ser Ser Gly Lys Leu Ile Ser Asn Thr Ala Thr Lys Ala 1535 1540 1545
gcc gcc cct etc agt ttt tcc aac aac acc att tcc ett aac atg 32396 Ala Ala Pro Leu Ser Phe Ser Asn Asn Thr Ile Ser Leu Asn Met 1550 1555 1560
gat acc cct ett tat acc aaa gat gga aaa tta tcc tta caa gtt 32441 Asp Thr Pro Leu Tyr Thr Lys Asp Gly Lys Leu Ser Leu Gin Val 1565 1570 1575
tet cca ccg tta aac ata tta aaa tea acc att etg aac aca tta 32486 Ser Pro Pro Leu Asn Ile Leu Lys Ser Thr Ile Leu Asn Thr Leu 1580 1585 1590
get gta get tat gga tea ggt tta gga etg agt ggt ggc act get 32531 Ala Val Ala Tyr Gly Ser Gly Leu Gly Leu Ser Gly Gly Thr Ala 1595 1600 1605
ett gca gta cag ttg gcc tet cca etc act ttt gat gaa aaa gga 32576 Leu Ala Val Gin Leu Ala Ser Pro Leu Thr Phe Asp Glu Lys Gly 1610 1615 1620
aat att aaa att aac eta gcc agt ggt cca tta aca gtt gat gca 32621 Asn Ile Lys Ile Asn Leu Ala Ser Gly Pro Leu Thr Val Asp Ala 1625 1630 1635
agt ega ett agt atc aac tgc aaa aga ggg gtc act gtc act acc 32666 Ser Arg Leu Ser Ile Asn Cys Lys Arg Gly Val Thr Val Thr Thr 1640 1645 1650
tea gga gat gca att gaa age aac ata age tgg cct aaa ggt ata 32711 Ser Gly Asp Ala Ile Glu Ser Asn Ile Ser Trp Pro Lys Gly Ile 1655 1660 1665
aga ttt gaa ggt aat ggc ata get gca aac att ggc aga gga ttg 32756 Arg Phe Glu Gly Asn Gly Ile Ala Ala Asn Ile Gly Arg Gly Leu 1670 1675 1680
gaa ttt gga acc act agt aca gag act gat gtc aca gat gca tac 32801 Glu Phe Gly Thr Thr Ser Thr Glu Thr Asp Val Thr Asp Ala Tyr 1685 1690 1695
cca att caa gtt aaa ttg ggt act ggc ett acc ttt gac agt aca 32846 Pro Ile Gin Val Lys Leu Gly Thr Gly Leu Thr Phe Asp Ser Thr 1700 1705 1710
ggc gcc att gtt get tgg aac aaa gag gat gat aaa ett aca tta 32891 Gly Ala Ile Val Ala Trp Asn Lys Glu Asp Asp Lys Leu Thr Leu 1715 1720 1725
tgg acc aca gcc gac ccc tcg cca aat tgc aaa ata tac tet gaa 32936 Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys Lys Ile Tyr Ser Glu 1730 1735 1740
aaa gat gcc aaa etc aca ett tgc ttg aca aag tgt gga agt caa 32981 Lys Asp Ala Lys Leu Thr Leu Cys Leu Thr Lys Cys Gly Ser Gin 1745 1750 1755
att etg ggt act gtg act gta ttg gca gtg aat aat gga agt etc 33026 Ile Leu Gly Thr Val Thr Val Leu Ala Val Asn Asn Gly Ser Leu 1760 1765 1770
aac cca atc aca aac aca gta age act gca etc gtc tcc etc aag 33071 Asn Pro Ile Thr Asn Thr Val Ser Thr Ala Leu Val Ser Leu Lys 1775 1780 1785
ttt gat gca agt gga gtt ttg eta age age tcc aca tta gac aaa 33116 Phe Asp Ala Ser Gly Val Leu Leu Ser Ser Ser Thr Leu Asp Lys 1790 1795 1800
gaa tat tgg aac ttc aga aag gga gat gtt aca cct get gag ccc 33161 Glu Tyr Trp Asn Phe Arg Lys Gly Asp Val Thr Pro Ala Glu Pro 1805 1810 1815
tat act aat get ata ggt ttt atg cct aac ata aag gcc tat cct 33206 Tyr Thr Asn Ala Ile Gly Phe Met Pro Asn Ile Lys Ala Tyr Pro 1820 1825 1830
aaa aac aca tet gca get tea aaa age eat att gtc agt caa gtt 33251 Lys Asn Thr Ser Ala Ala Ser Lys Ser His Ile Val Ser Gin Val 1835 1840 1845
tat etc aat ggg gat gag gcc aaa cca etg atg etg att att act 33296 Tyr Leu Asn Gly Asp Glu Ala Lys Pro Leu Met Leu Ile Ile Thr 1850 1855 1860
ttt aat gaa act gag gat gca act tgc acc tac agt atc act ttt 33341 Phe Asn Glu Thr Glu Asp Ala Thr Cys Thr Tyr Ser Ile Thr Phe 1865 1870 1875
caa tgg aaa tgg gat agt act aag tac aca ggt gaa aca ett get 33386 Gin Trp Lys Trp Asp Ser Thr Lys Tyr Thr Gly Glu Thr Leu Ala 1880 1885 1890
acc age tcc ttc acc ttc tcc tac atc gcc caa gaa tga acactgtatc 33435 Thr Ser Ser Phe Thr Phe Ser Tyr Ile Ala Gin Glu
1895 1900 1905
ccaccctgca tgccaaccct tcccacccca ctctgtctat ggaaaaaact ctgaagcaca 33495 aaataaaata aagttcaagt gttttattga ttcaacagtt ttacaggatt cgagcagtta 33555 tttttcctcc accctcccag gacatggaat acaccaccct ctccccccgc acagccttga 33615 acatctgaat gccattggtg atggacatgc ttttggtctc cacgttccac acagtttcag 33675 agegagccag tetegggteg gtcagggaga tgaaaccctc cgggcactcc cgcatctgca 33735 cctcacagct caacagctga ggattgtcct eggtggtegg gatcaeggtt atctggaaga 33795 agcagaagag eggeggtggg aatcatagtc egegaaeggg ateggceggt ggtgtcgeat 33855 caggccccgc ageagtcget gccgccgccg ctccgtcaag ctgctgctca gggggtcegg 33915 gtccagggac tccctcagca tgatgcccac ggccctcagc atcagtcgtc tggtgcggcg 33975 ggcgcagcag cgcatgcgga tctcgctcag gtcgctgcag tacgtgcaac acaggaccac 34035 caggttgttc aacagtccat agttcaacac gctccagccg aaactcatcg cgggaaggat 34095 gctacccacg tggccgtcgt accagatcct caggtaaatc aagtggcgct ccctccagaa 34155 cacgctgccc acgtacatga tctccttggg catgtggcgg ttcaccacct cccggtacca 34215 catcaccctc tggttgaaca tgcagccccg gatgatcctg cggaaccaca gggccagcac 34275 cgccccgccc gccatgcagc gaagagaccc cgggtcccgg caatggcaat ggaggaccca 34335 ccgctcgtac ccgtggatca tctgggagct gaacaagtct atgttggcac agcacaggca 34395 tatgctcatg catctcttca gcactctcag ctcctcgggg gtcaaaacca tatcccaggg 34455 cacggggaac tcttgcagga cagcgaaccc cgcagaacag ggcaatcctc gcacataact 34515 tacattgtgc atggacaggg tatcgcaatc aggcagcacc gggtgatcct ccaccagaga 34575 agcgcgggtc tcggtctcct cacagcgtgg taagggggcc ggccgatacg ggtgatggcg 34635 ggacgcggct gatcgtgttc gcgaccgtgt catgatgcag ttgctttcgg acattttcgt 34695 acttgctgta gcagaacctg gtccgggcgc tgcacaccga tcgccggcgg cggtcccggc 34755 gcttggaacg ctcggtgttg aaattgtaaa acagccactc tctcagaccg tgcagcagat 34815 ctagggcctc aggagtgatg aagatcccat catgcctgat agctctgatc acatcgacca 34875 ccgtggaatg ggccagaccc agccagatga tgcaattttg ttgggtttcg gtgacggcgg 34935 gggagggaag aacaggaaga accatgatta acttttaatc caaacggtct cggagcactt 34995 caaaatgaag gtcgcggaga tggcacctct cgcccccgct gtgttggtgg aaaataacag 35055 ccaggtcaaa ggtgatacgg ttctcgagat gttccacggt ggcttccagc aaagcctcca 35115 cgcgcacatc cagaaacaag acaatagcga aagcgggagg gttctctaat tcctcaatca 35175 tcatgttaca ctcctgcacc atccccagat aattttcatt tttccagcct tgaatgattc 35235 gaactagttc ctgaggtaaa tccaagccag ccatgataaa gagctcgcgc agagcgccct 35295 ccaccggcat tcttaagcac accctcataa ttccaagata ttctgctcct ggttcacctg 35355 cagcagattg acaagcggaa tatcaaaatc tctgccgcga tccctaagct cctccctcag 35415 caataactgt aagtactctt tcatatcctc tccgaaattt ttagccatag gaccaccagg 35475 aataagatta gggcaagcca cagtacagat aaaccgaagt cctccccagt gagcattgcc 35535 aaatgcaaga ctgctataag catgctggct agacccggtg atatcttcca gataactgga 35595 cagaaaatca cccaggcaat ttttaagaaa atcaacaaaa gaaaaatcct ccaggtgcac 35655 gtttagagcc tcgggaacaa cgatgaagta aatgcaagcg gtgcgttcca gcatggttag 35715 ttagctgatc tgtaaaaaac aaaaaataaa acattaaacc atgctagcct ggcgaacagg 35775 tgggtaaatc gttctctcca gcaccaggca ggccacgggg tctccggcgc gaccctcgta 35835 aaaattgtcg ctatgattga aaaccatcac agagagacgt tcccggtggc cggcgtgaat 35895 gattcgacaa gatgaataca cccccggaac attggcgtcc gcgagtgaaa aaaagcgccc 35955 gaggaagcaa taaggcacta caatgctcag tctcaagtcc agcaaagcga tgccatgcgg 36015 atgaagcaca aaatcctcag gtgcgtacaa aatgtaatta ctcccctcct gcacaggcag 36075 cgaagccccc gatccctcca gatacacata caaagcctca gcgtccatag cttaccgagc 36135 agcagcacac aacaggcgca agagtcagag aaaggctgag ctctaacctg tccacccgct 36195 ctctgctcaa tatatagccc agatctacac tgacgtaaag gccaaagtct aaaaataccc 36255 gccaaataat cacacacgcc cagcacacgc ccagaaaccg gtgacacact caaaaaaata 36315 cgcgcacttc ctcaaacgcc caaactgccg tcatttccgg gttcccacgc tacgtcatcg 36375 gaattcgact ttcaaattcc gtcgaccgtt aaaaacgtca cccgccccgc ccctaacggt 36435 cgcccgtctc tcggccaatc accttcctcc ctccccaaat tcaaacagct catttgcata 36495 ttaacgcgca ccaaaagttt gaggtatatt attgatgatg 36535
<210> 10
<211> 531
<212> PRT
<213> chimpanzee adenovirus serotype Pan7
<400> 10
Met Met Arg Arg Val Tyr Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser 15 10 15
Val Met Gin Gin Ala Val Ala Ala Ala Met Gin Pro Pro Leu Glu Ala 20 25 30
Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu Gly Arg Asn Ser 35 40 45
Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Arg Leu Tyr 50 55 60
Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu Asn Tyr Gin Asn 65 70 75 80
Asp His Ser Asn Phe Leu Thr Thr Val Val Gin Asn Asn Asp Phe Thr 85 90 95
Pro Thr Glu Ala Ser Thr Gin Thr Ile Asn Phe Asp Glu Arg Ser Arg 100 105 110 Trp Gly Gly Gin Leu Lys Thr Ile Met His Thr Asn Met Pro Asn Val 115 120 125
Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala Arg Val Met Val Ser 130 135 140
Arg Lys Thr Pro Asn Gly Val Ala Val Asp Glu Asn Tyr Asp Gly Ser 145 150 155 160
Gin Asp Glu Leu Thr Tyr Glu Trp Val Glu Phe Glu Leu Pro Glu Gly 165 170 175
Asn Phe Ser Val Thr Met Thr Ile Asp Leu Met Asn Asn Ala Ile Ile 180 185 190
Asp Asn Tyr Leu Ala Val Gly Arg Gin Asn Gly Val Leu Glu Ser Asp 195 200 205
Ile Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro 210 215 220
Val Thr Glu Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His 225 230 235 240
Pro Asp Ile Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Glu Ser 245 250 255
Arg Leu Ser Asn Leu Leu Gly Ile Arg Lys Arg Gin Pro Phe Gin Glu 260 265 270
Gly Phe Gin Ile Leu Tyr Glu Asp Leu Glu Gly Gly Asn Ile Pro Ala 275 280 285
Leu Leu Asp Val Glu Ala Tyr Glu Lys Ser Lys Glu Glu Ala Ala Ala 290 295 300
Ala Ala Thr Ala Ala Val Ala Thr Ala Ser Thr Glu Val Arg Gly Asp 305 310 315 320
Asn Phe Ala Ser Ala Ala Ala Val Ala Glu Ala Ala Glu Thr Glu Ser 325 330 335
Lys Ile Val Ile Gin Pro Val Glu Lys Asp Ser Lys Asp Arg Ser Tyr 340 345 350
Asn Val Leu Ala Asp Lys Lys Asn Thr Ala Tyr Arg Ser Trp Tyr Leu 355 360 365 Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser Trp Thr Leu 370 375 380
Leu Thr Thr Ser Asp Val Thr Cys Gly Val Glu Gin Val Tyr Trp Ser 385 390 395 400
Leu Pro Asp Met Met Gin Asp Pro Val Thr Phe Arg Ser Thr Arg Gin 405 410 415
Val Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Leu Pro Val Tyr Ser 420 425 430
Lys Ser Phe Phe Asn Glu Gin Ala Val Tyr Ser Gin Gin Leu Arg Ala 435 440 445
Phe Thr Ser Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn Gin Ile 450 455 460
Leu Val Arg Pro Pro Ala Pro Thr Ile Thr Thr Val Ser Glu Asn Val 465 470 475 480
Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser Ser Ile Arg 485 490 495
Gly Val Gin Arg Val Thr Val Thr Asp Ala Arg Arg Arg Thr Cys Pro 500 505 510
Tyr Val Tyr Lys Ala Leu Gly Val Val Ala Pro Arg Val Leu Ser Ser 515 520 525
Arg Thr Phe
530
<210> 11
<211> 932
<212> PRT
<213> chimpanzee adenovirus serotype Pan7
<400> 11
Met Ala Thr Pro Ser Met Leu Pro Gin Trp Ala Tyr Met His Ile Ala 15 10 15
Gly Gin Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gin Phe Ala 20 25 30 Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro 35 40 45
Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gin Arg Leu 50 55 60
Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr 65 70 75 80
Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met 85 90 95
Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Ser 100 105 110
Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly 115 120 125
Ala Pro Asn Thr Cys Gin Trp Thr Tyr Lys Ala Gly Asp Thr Asp Thr 130 135 140
Glu Lys Thr Tyr Thr Tyr Gly Asn Ala Pro Val Gin Gly Ile Ser Ile 145 150 155 160
Thr Lys Asp Gly Ile Gin Leu Gly Thr Asp Ser Asp Gly Gin Ala Ile 165 170 175
Tyr Ala Asp Glu Thr Tyr Gin Pro Glu Pro Gin Val Gly Asp Ala Glu 180 185 190
Trp His Asp Ile Thr Gly Thr Asp Glu Lys Tyr Gly Gly Arg Ala Leu 195 200 205
Lys Pro Asp Thr Lys Met Lys Pro Cys Tyr Gly Ser Phe Ala Lys Pro 210 215 220
Thr Asn Lys Glu Gly Gly Gin Ala Asn Val Lys Thr Glu Thr Gly Gly 225 230 235 240
Thr Lys Glu Tyr Asp Ile Asp Met Ala Phe Phe Asp Asn Arg Ser Ala 245 250 255
Ala Ala Ala Gly Leu Ala Pro Glu Ile Val Leu Tyr Thr Glu Asn Val 260 265 270
Asp Leu Glu Thr Pro Asp Thr His Ile Val Tyr Lys Ala Gly Thr Asp 275 280 285 Asp Ser Ser Ser Ser Ile Asn Leu Gly Gin Gin Ser Met Pro Asn Arg 290 295 300
Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr 305 310 315 320
Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gin Ala Ser Gin Leu 325 330 335
Asn Ala Val Val Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser Tyr Gin 340 345 350
Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp 355 360 365
Asn Gin Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn 370 375 380
His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asp Ala 385 390 395 400
Val Gly Arg Thr Asp Thr Tyr Gin Gly Ile Lys Ala Asn Gly Asp Asn 405 410 415
Gin Thr Thr Trp Thr Lys Asp Asp Thr Val Asn Asp Ala Asn Glu Leu 420 425 430
Gly Lys Gly Asn Pro Phe Ala Met Glu Ile Asn Ile Gin Ala Asn Leu 435 440 445
Trp Arg Asn Phe Leu Tyr Ala Asn Val Ala Leu Tyr Leu Pro Asp Ser 450 455 460
Tyr Lys Tyr Thr Pro Ala Asn Ile Thr Leu Pro Thr Asn Thr Asn Thr 465 470 475 480
Tyr Asp Tyr Met Asn Gly Arg Val Val Ala Pro Ser Leu Val Asp Ala 485 490 495
Tyr Ile Asn Ile Gly Ala Arg Trp Ser Leu Asp Pro Met Asp Asn Val 500 505 510
Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met 515 520 525
Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His Ile Gin Val Pro Gin 530 535 540 Lys Phe Phe Ala Ile Lys Ser Leu Leu Leu Leu Pro Gly Ser Tyr Thr 545 550 555 560
Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met Ile Leu Gin Ser Ser 565 570 575
Leu Gly Asn Asp Leu Arg Thr Asp Gly Ala Ser Ile Ala Phe Thr Ser 580 585 590
Ile Asn Leu Tyr Ala Thr Phe Phe Pro Met Ala His Asn Thr Ala Ser 595 600 605
Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp Gin Ser Phe Asn 610 615 620
Asp Tyr Leu Ser Ala Ala Asn Met Leu Tyr Pro Ile Pro Ala Asn Ala 625 630 635 640
Thr Asn Val Pro Ile Ser Ile Pro Ser Arg Asn Trp Ala Ala Phe Arg 645 650 655
Gly Trp Ser Phe Thr Arg Leu Lys Thr Arg Glu Thr Pro Ser Leu Gly 660 665 670
Ser Gly Phe Asp Pro Tyr Phe Val Tyr Ser Gly Ser Ile Pro Tyr Leu 675 680 685
Asp Gly Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser Ile Thr 690 695 700
Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro 705 710 715 720
Asn Glu Phe Glu Ile Lys Arg Thr Val Asp Gly Glu Gly Tyr Asn Val 725 730 735
Ala Gin Cys Asn Met Thr Lys Asp Trp Phe Leu Val Gin Met Leu Ala 740 745 750
His Tyr Asn Ile Gly Tyr Gin Gly Phe Tyr Val Pro Glu Gly Tyr Lys 755 760 765
Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gin Pro Met Ser Arg Gin 770 775 780
Val Val Asp Glu Val Asn Tyr Lys Asp Tyr Gin Ala Val Thr Leu Ala 785 790 795 800 Tyr Gin His Asn Asn Ser Gly Phe Val Gly Tyr Leu Ala Pro Thr Met 805 810 815
Arg Gin Gly Gin Pro Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu Ile Gly 820 825 830
Lys Ser Ala Val Ala Ser Val Thr Gin Lys Lys Phe Leu Cys Asp Arg 835 840 845
Val Met Trp Arg Ile Pro Phe Ser Ser Asn Phe Met Ser Met Gly Ala 850 855 860
Leu Thr Asp Leu Gly Gin Asn Met Leu Tyr Ala Asn Ser Ala His Ala 865 870 875 880
Leu Asp Met Asn Phe Glu Val Asp Pro Met Asp Glu Ser Thr Leu Leu 885 890 895
Tyr Val Val Phe Glu Val Phe Asp Val Val Arg Val His Gin Pro His 900 905 910
Arg Gly Val Ile Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly 915 920 925
Asn Ala Thr Thr
930
<210> 12
<211> 443
<212> PRT
<213> chimpanzee adenovirus serotype Pan7
<400> 12
Met Ser Lys Lys Arg Val Arg Val Asp Asp Asp Phe Asp Pro Val Tyr 15 10 15
Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val Pro Phe Ile Asn Pro Pro 20 25 30
Phe Val Ser Ser Asp Gly Phe Gin Glu Lys Pro Leu Gly Val Leu Ser 35 40 45
Leu Arg Leu Ala Asp Pro Val Thr Thr Lys Asn Gly Glu Ile Thr Leu 50 55 60 Lys Leu Gly Glu Gly Val Asp Leu Asp Ser Ser Gly Lys Leu Ile Ser 65 70 75 80
Asn Thr Ala Thr Lys Ala Ala Ala Pro Leu Ser Phe Ser Asn Asn Thr 85 90 95
Ile Ser Leu Asn Met Asp Thr Pro Leu Tyr Thr Lys Asp Gly Lys Leu 100 105 110
Ser Leu Gin Val Ser Pro Pro Leu Asn Ile Leu Lys Ser Thr Ile Leu 115 120 125
Asn Thr Leu Ala Val Ala Tyr Gly Ser Gly Leu Gly Leu Ser Gly Gly 130 135 140
Thr Ala Leu Ala Val Gin Leu Ala Ser Pro Leu Thr Phe Asp Glu Lys 145 150 155 160
Gly Asn Ile Lys Ile Asn Leu Ala Ser Gly Pro Leu Thr Val Asp Ala 165 170 175
Ser Arg Leu Ser Ile Asn Cys Lys Arg Gly Val Thr Val Thr Thr Ser 180 185 190
Gly Asp Ala Ile Glu Ser Asn Ile Ser Trp Pro Lys Gly Ile Arg Phe 195 200 205
Glu Gly Asn Gly Ile Ala Ala Asn Ile Gly Arg Gly Leu Glu Phe Gly 210 215 220
Thr Thr Ser Thr Glu Thr Asp Val Thr Asp Ala Tyr Pro Ile Gin Val 225 230 235 240
Lys Leu Gly Thr Gly Leu Thr Phe Asp Ser Thr Gly Ala Ile Val Ala 245 250 255
Trp Asn Lys Glu Asp Asp Lys Leu Thr Leu Trp Thr Thr Ala Asp Pro 260 265 270
Ser Pro Asn Cys Lys Ile Tyr Ser Glu Lys Asp Ala Lys Leu Thr Leu 275 280 285
Cys Leu Thr Lys Cys Gly Ser Gin Ile Leu Gly Thr Val Thr Val Leu 290 295 300
Ala Val Asn Asn Gly Ser Leu Asn Pro Ile Thr Asn Thr Val Ser Thr 305 310 315 320 Ala Leu Val Ser Leu Lys Phe Asp Ala Ser Gly Val Leu Leu Ser Ser 325 330 335
Ser Thr Leu Asp Lys Glu Tyr Trp Asn Phe Arg Lys Gly Asp Val Thr 340 345 350
Pro Ala Glu Pro Tyr Thr Asn Ala Ile Gly Phe Met Pro Asn Ile Lys 355 360 365
Ala Tyr Pro Lys Asn Thr Ser Ala Ala Ser Lys Ser His Ile Val Ser 370 375 380
Gin Val Tyr Leu Asn Gly Asp Glu Ala Lys Pro Leu Met Leu Ile Ile 385 390 395 400
Thr Phe Asn Glu Thr Glu Asp Ala Thr Cys Thr Tyr Ser Ile Thr Phe 405 410 415
Gin Trp Lys Trp Asp Ser Thr Lys Tyr Thr Gly Glu Thr Leu Ala Thr 420 425 430
Ser Ser Phe Thr Phe Ser Tyr Ile Ala Gin Glu 435 440
<210> 13
<211> 338
<212> PRT
<213> simian serotype Cl
<400> 13
Ala Pro Lys Gly Ala Pro Asn Thr Ser Gin Trp Leu Asp Lys Gly Val 15 10 15
Thr Thr Thr Asp Asn Asn Thr Glu Asn Gly Asp Glu Glu Asp Glu Val 20 25 30
Ala Glu Glu Gly Glu Glu Glu Lys Gin Ala Thr Tyr Thr Phe Gly Asn 35 40 45
Ala Pro Val Lys Ala Glu Ala Glu Ile Thr Lys Glu Gly Leu Pro Ile 50 55 60
Gly Leu Glu Val Pro Ser Glu Gly Asp Pro Lys Pro Ile Tyr Ala Asp 65 70 75 80 Lys Leu Tyr Gin Pro Glu Pro Gin Val Gly Glu Glu Ser Trp Thr Asp 85 90 95
Thr Asp Gly Thr Asp Glu Lys Tyr Gly Gly Arg Ala Leu Lys Pro Glu 100 105 110
Thr Lys Met Lys Pro Cys Tyr Gly Ser Phe Ala Lys Pro Thr Asn Val 115 120 125
Lys Gly Gly Gin Ala Lys Val Lys Lys Val Glu Glu Gly Lys Val Glu 130 135 140
Tyr Asp Ile Asp Met Asn Phe Phe Asp Leu Arg Ser Gin Lys Thr Gly 145 150 155 160
Leu Lys Pro Lys Ile Val Met Tyr Ala Glu Asn Val Asp Leu Glu Thr 165 170 175
Pro Asp Thr His Val Val Tyr Lys Pro Gly Ala Ser Asp Ala Ser Ser 180 185 190
His Ala Asn Leu Gly Gin Gin Ser Met Pro Asn Arg Pro Asn Tyr Ile 195 200 205
Gly Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser Thr Gly 210 215 220
Asn Met Gly Val Leu Ala Gly Gin Ala Ser Gin Leu Asn Ala Val Val 225 230 235 240
Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser Tyr Gin Leu Leu Leu Asp 245 250 255
Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gin Ala Val 260 265 270
Asp Ser Tyr Asp Pro Asp Val Arg Val Ile Glu Asn His Gly Val Glu 275 280 285
Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asp Gly Val Gly Pro Arg 290 295 300
Thr Asp Ser Tyr Lys Gly Ile Glu Thr Asn Gly Asp Glu Asn Thr Thr 305 310 315 320
Trp Lys Asp Leu Asp Pro Asn Gly Ile Ser Glu Leu Ala Lys Gly Asn 325 330 335 Pro Phe
<210> 14
<211> 315
<212> PRT
<213> chimpanzee adenovirus Pan-9
<400> 14
Ala Pro Lys Gly Ala Pro Asn Thr Cys Gin Trp Thr Tyr Lys Ala Asp 15 10 15
Gly Glu Thr Ala Thr Glu Lys Thr Tyr Thr Tyr Gly Asn Ala Pro Val 20 25 30
Gin Gly Ile Asn Ile Thr Lys Asp Gly Ile Gin Leu Gly Thr Asp Thr 35 40 45
Asp Asp Gin Pro Ile Tyr Ala Asp Lys Thr Tyr Gin Pro Glu Pro Gin 50 55 60
Val Gly Asp Ala Glu Trp His Asp Ile Thr Gly Thr Asp Glu Lys Tyr 65 70 75 80
Gly Gly Arg Ala Leu Lys Pro Asp Thr Lys Met Lys Pro Cys Tyr Gly 85 90 95
Ser Phe Ala Lys Pro Thr Asn Lys Glu Gly Gly Gin Ala Asn Val Lys 100 105 110
Thr Gly Thr Gly Thr Thr Lys Glu Tyr Asp Ile Asp Met Ala Phe Phe 115 120 125
Asp Asn Arg Ser Ala Ala Ala Ala Gly Leu Ala Pro Glu Ile Val Leu 130 135 140
Tyr Thr Glu Asn Val Asp Leu Glu Thr Pro Asp Thr His Ile Val Tyr 145 150 155 160
Lys Ala Gly Thr Asp Asp Ser Ser Ser Ser Ile Asn Leu Gly Gin Gin 165 170 175
Ala Met Pro Asn Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile 180 185 190 Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly 195 200 205
Gin Ala Ser Gin Leu Asn Ala Val Val Asp Leu Gin Asp Arg Asn Thr 210 215 220
Glu Leu Ser Tyr Gin Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg 225 230 235 240
Tyr Phe Ser Met Trp Asn Gin Ala Val Asp Ser Tyr Asp Pro Asp Val 245 250 255
Arg Ile Ile Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys 260 265 270
Phe Pro Leu Asp Ala Val Gly Arg Thr Asp Thr Tyr Gin Gly Ile Lys 275 280 285
Ala Asn Gly Thr Asp Gin Thr Thr Trp Thr Lys Asp Asp Ser Val Asn 290 295 300
Asp Ala Asn Glu Ile Gly Lys Gly Asn Pro Phe
305 310 315
<210> 15
<211> 315
<212> PRT
<213> chimpanzee adenovirus Pan-5
<400> 15
Ala Pro Lys Gly Ala Pro Asn Thr Cys Gin Trp Thr Tyr Lys Ala Asp 15 10 15
Gly Asp Thr Gly Thr Glu Lys Thr Tyr Thr Tyr Gly Asn Ala Pro Val 20 25 30
Gin Gly Ile Ser Ile Thr Lys Asp Gly Ile Gin Leu Gly Thr Asp Thr 35 40 45
Asp Asp Gin Pro Ile Tyr Ala Asp Lys Thr Tyr Gin Pro Glu Pro Gin 50 55 60
Val Gly Asp Ala Glu Trp His Asp Ile Thr Gly Thr Asp Glu Lys Tyr 65 70 75 80 Gly Gly Arg Ala Leu Lys Pro Asp Thr Lys Met Lys Pro Cys Tyr Gly 85 90 95
Ser Phe Ala Lys Pro Thr Asn Lys Glu Gly Gly Gin Ala Asn Val Lys 100 105 110
Thr Glu Thr Gly Gly Thr Lys Glu Tyr Asp Ile Asp Met Ala Phe Phe 115 120 125
Asp Asn Arg Ser Ala Ala Ala Ala Gly Leu Ala Pro Glu Ile Val Leu 130 135 140
Tyr Thr Glu Asn Val Asp Leu Glu Thr Pro Asp Thr His Ile Val Tyr 145 150 155 160
Lys Ala Gly Thr Asp Asp Ser Ser Ser Ser Ile Asn Leu Gly Gin Gin 165 170 175
Ser Met Pro Asn Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile 180 185 190
Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly 195 200 205
Gin Ala Ser Gin Leu Asn Ala Val Val Asp Leu Gin Asp Arg Asn Thr 210 215 220
Glu Leu Ser Tyr Gin Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg 225 230 235 240
Tyr Phe Ser Met Trp Asn Gin Ala Val Asp Ser Tyr Asp Pro Asp Val 245 250 255
Arg Ile Ile Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys 260 265 270
Phe Pro Leu Asp Ala Val Gly Arg Thr Asp Thr Tyr Gin Gly Ile Lys 275 280 285
Ala Asn Gly Ala Asp Gin Thr Thr Trp Thr Lys Asp Asp Thr Val Asn 290 295 300
Asp Ala Asn Glu Leu Gly Lys Gly Asn Pro Phe
305 310 315
<210> 16
<211> 324
<212> PRT
<213> chimpanzee adenovirus Pan-6
<400> 16
Ala Pro Lys Gly Ala Pro Asn Ser Ser Gin Trp Glu Gin Ala Lys Thr 15 10 15
Gly Asn Gly Gly Thr Met Glu Thr His Thr Tyr Gly Val Ala Pro Met 20 25 30
Gly Gly Glu Asn Ile Thr Lys Asp Gly Leu Gin Ile Gly Thr Asp Val 35 40 45
Thr Ala Asn Gin Asn Lys Pro Ile Tyr Ala Asp Lys Thr Phe Gin Pro 50 55 60
Glu Pro Gin Val Gly Glu Glu Asn Trp Gin Glu Thr Glu Asn Phe Tyr 65 70 75 80
Gly Gly Arg Ala Leu Lys Lys Asp Thr Lys Met Lys Pro Cys Tyr Gly 85 90 95
Ser Tyr Ala Arg Pro Thr Asn Glu Lys Gly Gly Gin Ala Lys Leu Lys 100 105 110
Val Gly Asp Asp Gly Val Pro Thr Lys Glu Phe Asp Ile Asp Leu Ala 115 120 125
Phe Phe Asp Thr Pro Gly Gly Thr Val Asn Gly Gin Asp Glu Tyr Lys 130 135 140
Ala Asp Ile Val Met Tyr Thr Glu Asn Thr Tyr Leu Glu Thr Pro Asp 145 150 155 160
Thr His Val Val Tyr Lys Pro Gly Lys Asp Asp Ala Ser Ser Glu Ile 165 170 175
Asn Leu Val Gin Gin Ser Met Pro Asn Arg Pro Asn Tyr Ile Gly Phe 180 185 190
Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met 195 200 205
Gly Val Leu Ala Gly Gin Ala Ser Gin Leu Asn Ala Val Val Asp Leu 210 215 220 Gin Asp Arg Asn Thr Glu Leu Ser Tyr Gin Leu Leu Leu Asp Ser Leu 225 230 235 240
Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gin Ala Val Asp Ser 245 250 255
Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly Val Glu Asp Glu 260 265 270
Leu Pro Asn Tyr Cys Phe Pro Leu Asp Gly Ser Gly Thr Asn Ala Ala 275 280 285
Tyr Gin Gly Val Lys Val Lys Asp Gly Gin Asp Gly Asp Val Glu Ser 290 295 300
Glu Trp Glu Asn Asp Asp Thr Val Ala Ala Arg Asn Gin Leu Cys Lys 305 310 315 320
Gly Asn Ile Phe
<210> 17
<211> 314
<212> PRT
<213> chimpanzee adenovirus Pan-7
<400> 17
Ala Pro Lys Gly Ala Pro Asn Thr Cys Gin Trp Thr Tyr Lys Ala Gly 15 10 15
Asp Thr Asp Thr Glu Lys Thr Tyr Thr Tyr Gly Asn Ala Pro Val Gin 20 25 30
Gly Ile Ser Ile Thr Lys Asp Gly Ile Gin Leu Gly Thr Asp Ser Asp 35 40 45
Gly Gin Ala Ile Tyr Ala Asp Glu Thr Tyr Gin Pro Glu Pro Gin Val 50 55 60
Gly Asp Ala Glu Trp His Asp Ile Thr Gly Thr Asp Glu Lys Tyr Gly 65 70 75 80
Gly Arg Ala Leu Lys Pro Asp Thr Lys Met Lys Pro Cys Tyr Gly Ser 85 90 95 Phe Ala Lys Pro Thr Asn Lys Glu Gly Gly Gin Ala Asn Val Lys Thr 100 105 110
Glu Thr Gly Gly Thr Lys Glu Tyr Asp Ile Asp Met Ala Phe Phe Asp 115 120 125
Asn Arg Ser Ala Ala Ala Ala Gly Leu Ala Pro Glu Ile Val Leu Tyr 130 135 140
Thr Glu Asn Val Asp Leu Glu Thr Pro Asp Thr His Ile Val Tyr Lys 145 150 155 160
Ala Gly Thr Asp Asp Ser Ser Ser Ser Ile Asn Leu Gly Gin Gin Ser 165 170 175
Met Pro Asn Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly 180 185 190
Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gin 195 200 205
Ala Ser Gin Leu Asn Ala Val Val Asp Leu Gin Asp Arg Asn Thr Glu 210 215 220
Leu Ser Tyr Gin Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr 225 230 235 240
Phe Ser Met Trp Asn Gin Ala Val Asp Ser Tyr Asp Pro Asp Val Arg 245 250 255
Ile Ile Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe 260 265 270
Pro Leu Asp Ala Val Gly Arg Thr Asp Thr Tyr Gin Gly Ile Lys Ala 275 280 285
Asn Gly Asp Asn Gin Thr Thr Trp Thr Lys Asp Asp Thr Val Asn Asp 290 295 300
Ala Asn Glu Leu Gly Lys Gly Asn Pro Phe
305 310
<210> 18
<211> 179
<212> PRT
<213> chimpanzee adenovirus Pan9
<400> 18
Thr Leu Trp Thr Thr Pro Asp Pro Ser Pro Asn Cys Gin Ile Leu Ala 15 10 15
Glu Asn Asp Ala Lys Leu Thr Leu Cys Leu Thr Lys Cys Gly Ser Gin 20 25 30
Ile Leu Ala Thr Val Ser Val Leu Val Val Gly Ser Gly Asn Leu Asn 35 40 45
Pro Ile Thr Gly Thr Val Ser Ser Ala Gin Val Phe Leu Arg Phe Asp 50 55 60
Ala Asn Gly Val Leu Leu Thr Glu His Ser Thr Leu Lys Lys Tyr Trp 65 70 75 80
Gly Tyr Arg Gin Gly Asp Ser Ile Asp Gly Thr Pro Tyr Thr Asn Ala 85 90 95
Val Gly Phe Met Pro Asn Leu Lys Ala Tyr Pro Lys Ser Gin Ser Ser 100 105 110
Thr Thr Lys Asn Asn Ile Val Gly Gin Val Tyr Met Asn Gly Asp Val 115 120 125
Ser Lys Pro Met Leu Leu Thr Ile Thr Leu Asn Gly Thr Asp Asp Ser 130 135 140
Asn Ser Thr Tyr Ser Met Ser Phe Ser Tyr Thr Trp Thr Asn Gly Ser 145 150 155 160
Tyr Val Gly Ala Thr Phe Gly Ala Asn Ser Tyr Thr Phe Ser Tyr Ile 165 170 175
Ala Gin Glu
<210> 19
<211> 185
<212> PRT
<213> chimpanzee adenovirus Pan6
<400> 19
Thr Leu Trp Thr Thr Pro Asp Pro Ser Pro Asn Cys Gin Leu Leu Ser 15 10 15
Asp Arg Asp Ala Lys Phe Thr Leu Cys Leu Thr Lys Cys Gly Ser Gin 20 25 30
Ile Leu Gly Thr Val Ala Val Ala Ala Val Thr Val Gly Ser Ala Leu 35 40 45
Asn Pro Ile Asn Asp Thr Val Lys Ser Ala Ile Val Phe Leu Arg Phe 50 55 60
Asp Ser Asp Gly Val Leu Met Ser Asn Ser Ser Met Val Gly Asp Tyr 65 70 75 80
Trp Asn Phe Arg Glu Gly Gin Thr Thr Gin Ser Val Ala Tyr Thr Asn 85 90 95
Ala Val Gly Phe Met Pro Asn Ile Gly Ala Tyr Pro Lys Thr Gin Ser 100 105 110
Lys Thr Pro Lys Asn Ser Ile Val Ser Gin Val Tyr Leu Thr Gly Glu 115 120 125
Thr Thr Met Pro Met Thr Leu Thr Ile Thr Phe Asn Gly Thr Asp Glu 130 135 140
Lys Asp Thr Thr Pro Val Ser Thr Tyr Ser Met Thr Phe Thr Trp Gin 145 150 155 160
Trp Thr Gly Asp Tyr Lys Asp Lys Asn Ile Thr Phe Ala Thr Asn Ser 165 170 175
Phe Ser Phe Ser Tyr Ile Ala Gin Glu
180 185
<210> 20
<211> 179
<212> PRT
<213> chimpanzee adenovirus Pan7
<400> 20
Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys Lys Ile Tyr Ser 15 10 15 Glu Lys Asp Ala Lys Leu Thr Leu Cys Leu Thr Lys Cys Gly Ser Gin 20 25 30
Ile Leu Gly Thr Val Thr Val Leu Ala Val Asn Asn Gly Ser Leu Asn 35 40 45
Pro Ile Thr Asn Thr Val Ser Thr Ala Leu Val Ser Leu Lys Phe Asp 50 55 60
Ala Ser Gly Val Leu Leu Ser Ser Ser Thr Leu Asp Lys Glu Tyr Trp 65 70 75 80
Asn Phe Arg Lys Gly Asp Val Thr Pro Ala Glu Pro Tyr Thr Asn Ala 85 90 95
Ile Gly Phe Met Pro Asn Ile Lys Ala Tyr Pro Lys Asn Thr Ser Ala 100 105 110
Ala Ser Lys Ser His Ile Val Ser Gin Val Tyr Leu Asn Gly Asp Glu 115 120 125
Ala Lys Pro Leu Met Leu Ile Ile Thr Phe Asn Glu Thr Glu Asp Ala 130 135 140
Thr Cys Thr Tyr Ser Ile Thr Phe Gin Trp Lys Trp Asp Ser Thr Lys 145 150 155 160
Tyr Thr Gly Glu Thr Leu Ala Thr Ser Ser Phe Thr Phe Ser Tyr Ile 165 170 175
Ala Gin Glu
<210> 21
<211> 179
<212> PRT
<213> chimpanzee adenovirus Pan5
<400> 21
Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys His Ile Tyr Ser 15 10 15
Glu Lys Asp Ala Lys Leu Thr Leu Cys Leu Thr Lys Cys Gly Ser Gin 20 25 30 Ile Leu Gly Thr Val Ser Leu Ile Ala Val Asp Thr Gly Ser Leu Asn 35 40 45
Pro Ile Thr Gly Thr Val Thr Thr Ala Leu Val Ser Leu Lys Phe Asp 50 55 60
Ala Asn Gly Val Leu Gin Ser Ser Ser Thr Leu Asp Ser Asp Tyr Trp 65 70 75 80
Asn Phe Arg Gin Gly Asp Val Thr Pro Ala Glu Ala Tyr Thr Asn Ala 85 90 95
Ile Gly Phe Met Pro Asn Leu Lys Ala Tyr Pro Lys Asn Thr Ser Gly 100 105 110
Ala Ala Lys Ser His Ile Val Gly Lys Val Tyr Leu His Gly Asp Thr 115 120 125
Gly Lys Pro Leu Asp Leu Ile Ile Thr Phe Asn Glu Thr Ser Asp Glu 130 135 140
Ser Cys Thr Tyr Cys Ile Asn Phe Gin Trp Gin Trp Gly Ala Asp Gin 145 150 155 160
Tyr Lys Asn Glu Thr Leu Ala Val Ser Ser Phe Thr Phe Ser Tyr Ile 165 170 175
Ala Lys Glu
<210> 22
<211> 183
<212> PRT
<213> human adenovirus Ad 2
<400> 22
Thr Leu Trp Thr Thr Pro Asp Pro Ser Pro Asn Cys Arg Ile His Ser 15 10 15
Asp Asn Asp Cys Lys Phe Thr Leu Val Leu Thr Lys Cys Gly Ser Gin 20 25 30
Val Leu Ala Thr Val Ala Ala Leu Ala Val Ser Gly Asp Leu Ser Ser 35 40 45 Met Thr Gly Thr Val Ala Ser Val Ser Ile Phe Leu Arg Phe Asp Gin 50 55 60
Asn Gly Val Leu Met Glu Asn Ser Ser Leu Lys Lys His Tyr Trp Asn 65 70 75 80
Phe Arg Asn Gly Asn Ser Thr Asn Ala Asn Pro Tyr Thr Asn Ala Val 85 90 95
Gly Phe Met Pro Asn Leu Leu Ala Tyr Pro Lys Thr Gin Ser Gin Thr 100 105 110
Ala Lys Asn Asn Ile Val Ser Gin Val Tyr Leu His Gly Asp Lys Thr 115 120 125
Lys Pro Met Ile Leu Thr Ile Thr Leu Asn Gly Thr Ser Glu Ser Thr 130 135 140
Glu Thr Ser Glu Val Ser Thr Tyr Ser Met Ser Phe Thr Trp Ser Trp 145 150 155 160
Glu Ser Gly Lys Tyr Thr Thr Glu Thr Phe Ala Thr Asn Ser Tyr Thr 165 170 175
Phe Ser Tyr Ile Ala Gin Glu
180
<210> 23
<211> 182
<212> PRT
<213> human adenovirus Ad 5
<400> 23
Thr Leu Trp Thr Thr Pro Ala Pro Ser Pro Asn Cys Arg Leu Asn Ala 15 10 15
Glu Lys Asp Ala Lys Leu Thr Leu Val Leu Thr Lys Cys Gly Ser Gin 20 25 30
Ile Leu Ala Thr Val Ser Val Leu Ala Val Lys Gly Ser Leu Ala Pro 35 40 45
Ile Ser Gly Thr Val Gin Ser Ala His Leu Ile Ile Arg Phe Asp Glu 50 55 60 Asn Gly Val Leu Ile Asn Asn Ser Phe Leu Asp Pro Glu Tyr Trp Asn 65 70 75 80
Phe Arg Asn Gly Asp Leu Thr Glu Gly Thr Ala Tyr Thr Asn Ala Val 85 90 95
Gly Phe Met Pro Asn Leu Ser Ala Tyr Pro Lys Ser His Gly Lys Thr 100 105 110
Ala Lys Ser Asn Ile Val Ser Gin Val Tyr Leu Asn Gly Asp Lys Thr 115 120 125
Lys Pro Val Thr Leu Thr Ile Thr Leu Asn Gly Thr Gin Glu Thr Gly 130 135 140
Asp Thr Thr Pro Ser Ala Tyr Ser Met Ser Phe Ser Trp Asp Trp Ser 145 150 155 160
Gly His Asn Tyr Ile Asn Glu Ile Phe Ala Thr Ser Ser Tyr Thr Glu 165 170 175
Ser Tyr Ile Ala Gin Glu
180
<210> 24
<211> 34264
<212> DNA
<213> simian adenovirus SV-1
<220>
<221> CDS
<222> (12454)..(13965)
<223> L2 Penton
<220>
<221> CDS
<222> (16841)..(19636)
<223> L3 Hexon
<220>
<221> CDS
<222> (28059)..(29150)
<223> L5 Fiber #2
<220>
<221> CDS
<222> (29183)..(30865)
<223> L5 Fiber #1
<400> 24
tccttattct ggaaacgtgc caatatgata atgagcgggg aggagcgagg cggggccggg 60 gtgacgtgcg gtgacgtggg gtgacgcggg gtggcgcgag ggcggggcgg gagtggggag 120 gcgcttagtt tttacgtatg cggaaggagg ttttataccg gaagttgggt aatttgggcg 180 tatacttgta agttttgtgt aatttggcgc gaaaaccggg taatgaggaa gttgaggtta 240 atatgtactt tttatgactg ggcggaattt ctgctgatca gcagtgaact ttgggcgctg 300 acggggaggt ttcgctacgt ggcagtacca cgagaaggct caaaggtccc atttattgta 360 ctcctcagcg ttttcgctgg gtatttaaac gctgtcagat catcaagagg ccactcttga 420 gtgccggcga gtagagtttt ctcctccgcg ctgccgcgat gaggctggtt cccgagatgt 480 acggtgtttt ctgcagcgag acggcccgga actcagatga gctgcttaat acagatctgc 540 tggatgttcc caactcgcct gtggcttcgc ctccgtcgct tcatgatctt ttcgatgtgg 600 aagtggatcc accgcaagat cccaacgagg acgcggtaaa cagtatgttc cctgaatgtc 660 tgtttgaggc ggctgaggag ggttctcaca gcagtgaaga gagcagacgg ggagaggaac 720 tggacttgaa atgctacgag gaatgtctgc cttctagcga ttctgaaacg gaacagacag 780 ggggagacgg ctgtgagtcg gcaatgaaaa atgaacttgt attagactgt ccagaacatc 840 ctggtcatgg ctgccgtgcc tgtgcttttc atagaaatgc cagcggaaat cctgagactc 900 tatgtgctct gtgttatctg cgccttacca gcgattttgt atacagtaag taaagtgttt 960 tcattggcgt acggtagggg attcgttgaa gtgctttgtg acttattatg tgtcattatt 1020 tctaggtgac gtgtccgacg tggaagggga aggagataga tcaggggctg ctaattctcc 1080 ttgcactttg ggggctgtgg ttccagttgg catttttaaa ccgagtggtg gaggagaacg 1140 agccggagga gaccgagaat ctgagagccg gcctggaccc tccagtggaa gactaggtgc 1200 tgaggatgat cctgaagagg ggactagtgg gggtgctagg aaaaagcaaa aaactgagcc 1260 tgaacctaga aactttttga atgagttgac tgtaagccta atgaatcggc agcgtcctga 1320 gacggtgttt tggactgagt tggaggatga gttcaagaag ggggaattaa acctcttgta 1380 caagtatggg tttgagcagt tgaaaactca ctggttggag ccgtgggagg atatggaaat 1440 ggctctagac acctttgcta aagtggctct gcggccggat aaagtttaca ctattcgccg 1500 cactgttaat ataaaaaaga gtgtttatgt tatcggccat ggagctctgg tgcaggtgca 1560 gaccccagac cgggtggctt tcaattgcgg catgcagagt ttgggccccg gggtgatagg 1620 tttgaatgga gttacatttc aaaatgtcag gtttactggt gatgatttta atggctctgt 1680 gtttgtgact agcacccagc taaccctcca cggtgtttac ttttttaact ttaacaatac 1740 atgtgtggag tcatggggta gggtgtctct gaggggctgc agttttcatg gttgctggaa 1800 ggcggtggtg ggaagaatta aaagtgtcat gtctgtgaag aaatgcatat ttgaacgctg 1860 tgtgatagct ctagcagtag aggggtacgg acggatcagg aataacgccg catctgagaa 1920 tggatgtttt cttttgctga aaggtacggc cagcgttaag cataatatga tttgcggcag 1980 cggcctgtgc ccctcgcagc tcttaacttg cgcagatgga aactgtcaca ccttgcgcac 2040 cgtgcacata gtgtcccact cgcgccgcac ctggccaaca tttgagcaca atatgctcat 2100 gcgttgcgcc gttcacctag gtgctagacg cggcgtgttt atgccttatc aatgtaactt 2160 tagtcatact aagattttgc tggaaactga ttccttccct cgagtatgtt tcaatggggt 2220 gtttgacatg tcaatggaac tttttaaagt gataagatat gatgaaacca agtctcgttg 2280 tcgctcatgt gaatgcggag ctaatcattt gaggttgtat cctgtaaccc tgaacgttac 2340 cgaggagctg aggacggacc accacatgct gtcttgcctg cgtaccgact atgaatccag 2400 cgatgaggag tgaggtgagg ggcggagcca caaagggtat aaaggggcat gaggggtggg 2460 cgcggtgttt caaaatgagc gggacgacgg acggcaatgc gtttgagggg ggagtgttca 2520 gcccatatct gacatctcgt cttccttcct gggcaggagt tcgtcagaat gtagtgggct 2580 ccaccgtgga cggacggccg gtcgcccctg caaattccgc caccctcacc tatgccaccg 2640 tgggatcatc gttggacact gccgcggcag ctgccgcttc tgctgccgct tctactgctc 2700 gcggcatggc ggctgatttt ggactatata accaactggc cactgcagct gtggcgtctc 2760 ggtctctggt tcaagaagat gccctgaatg tgatcttgac tcgcctggag atcatgtcac 2820 gtcgcctgga cgaactggct gcgcagatat cccaagctaa ccccgatacc gcttcagaat 2880 cttaaaataa agacaaacaa atttgttgaa aagtaaaatg gctttatttg ttttttttgg 2940 ctcggtaggc tcgggtccac ctgtctcggt cgttaaggac tttgtgtatg ttttccaaaa 3000 cacggtacag atgggcttgg atgttcaagt acatgggcat gaggccatct ttggggtgga 3060 gataggacca ctgaagagcg tcatgttccg gggtggtatt gtaaatcacc cagtcgtagc 3120 agggtttttg agcgtggaac tggaatatgt ccttcaggag caggctaatg gccaagggta 3180 gacccttagt gtaggtgttt acaaagcggt tgagctggga gggatgcatg cggggggaga 3240 tgatatgcat cttggcttgg attttgaggt tagctatgtt accacccagg tctctgcggg 3300 ggttcatgtt atgaaggacc accagcacgg tatagccagt gcatttgggg aacttgtcat 3360 gcagtttgga ggggaaggcg tggaagaatt tagatacccc cttgtgcccc cctaggtttt 3420 ccatgcactc atccataata atggcaatgg gacccctggc ggccgcttta gcaaacacgt 3480 tttgggggtt ggaaacatca tagttttgct ctagagtgag ctcatcatag gccatcttta 3540 caaagcgggg taggagggtg cccgactggg ggatgatagt tccatctggg cctggagcgt 3600 agttgccctc acagatctgc atctcccagg ccttaatttc cgaggggggg atcatgtcca 3660 cctggggggc gataaaaaac acggtttctg gcggggggtt aatgagctgg gtggaaagca 3720 agttacgcaa cagctgggat ttgccgcaac cggtgggacc gtagatgacc ccgatgacgg 3780 gttgcagctg gtagttcaga gaggaacagc tgccgtcggg gcgcaggagg ggagctacct 3840 cattcatcat gcttctgaca tgtttatttt cactcactaa gttttgcaag agcctctccc 3900 cacccaggga taagagttct tccaggctgt tgaagtgttt cagcggtttc aggccgtcgg 3960 ccatgggcat cttttcaagc gactgacgaa gcaagtacag tcggtcccag agctcggtga 4020 cgtgctctat ggaatctcga tccagcagac ttcttggttt cgggggttgg gccgactttc 4080 gctgtagggc accagccggt gggcgtccag ggccgcgagg gttctgtcct tccagggtct 4140 cagcgttcgg gtgagggtgg tctcggtgac ggtgaaggga tgagccccgg gctgggcgct 4200 tgcgagggtg cgcttcaggc tcatcctgct ggtgctgaag cgggcgtcgt ctccctgtga 4260 gtcggccaga tagcaacgaa gcatgaggtc gtagctgagg gactcggccg cgtgtccctt 4320 ggcgcgcagc tttcccttgg aaacgtgctg acatttggtg cagtgcagac acttgagggc 4380 gtagagtttt ggggccagga agaccgactc gggcgagtag gcgtcggctc cgcactgagc 4440 gcagacggtc tcgcactcca ccagccacgt gagctcgggt ttagcgggat caaaaaccaa 4500 gttgcctcca ttttttttga tgcgtttctt accttgcgtc tccatgagtc tgtgtcccgc 4560 ttccgtgaca aaaaggctgt cggtatcccc gtagaccgac ttgagggggc gatcttccaa 4620 aggtgttccg aggtcttccg cgtacaggaa ctgggaccac tccgagacaa aggctcgggt 4680 ccaggctaac acgaaggagg cgatctgcga ggggtatctg tcgttttcaa tgagggggtc 4740 caccttttcc agggtgtgca gacacaggtc gtcctcctcc gcgtccacga aggtgattgg 4800 cttgtaagtg taggtcacgt gacccgcacc cccccaaggg gtataaaagg gggcgtgccc 4860 actctccccg tcactttctt ccgcatcgct gtggaccaga gccagctgtt cgggtgagta 4920 ggccctctca aaagccggca tgatttcggc gctcaagttg tcagtttcta caaacgaggt 4980 ggatttgata ttcacgtgcc ccgcggcgat gcttttgatg gtggaggggt ccatctgatc 5040 agaaaacacg atctttttat tgtcaagttt ggtggcgaaa gacccgtaga gggcgttgga 5100 aagcaacttg gcgatggagc gcagggtctg atttttctcc cgatcggccc tctccttggc 5160 ggcgatgttg agttgcacgt actcgcgggc cacgcaccgc cactcgggga acacggcggt 5220 gcgctcgtcg ggcaggatgc gcacgcgcca gccgcggttg tgcagggtga tgaggtccac 5280 gctggtggcc acctccccgc ggaggggctc gttggtccaa cacaatcgcc ccccttttct 5340 ggagcagaac ggaggcaggg gatctagcaa gttggcgggc ggggggtcgg cgtcgatggt 5400 aaatatgccg ggtagcagaa ttttattaaa ataatcgatt tcggtgtccg tgtcttgcaa 5460 cgcgtcttcc cacttcttca ccgccagggc cctttcgtag ggattcaggg gcggtcccca 5520 gggcatgggg tgggtcaggg ccgaggcgta catgccgcag atgtcgtaca cgtacagggg 5580 ctccctcaac accccgatgt aagtggggta acagcgcccc ccgcggatgc tggctcgcac 5640 gtagtcgtac atctcgtgag agggagccat gagcccgtct cccaagtggg tcttgtgggg 5700 tttttcggcc cggtagagga tctgcctgaa gatggcgtgg gagttggaag agatagtggg 5760 gcgttggaag acgttaaagt tggctccggg cagtcccacg gagtcttgga tgaactgggc 5820 gtaggattcc cggagcttgt ccaccagggc tgcggttacc agcacgtcga gagcgcagta 5880 gtccaacgtc tcgcggacca ggttgtaggc cgtctcttgt tttttctccc acagttcgcg 5940 attgaggagg tattcctcgc ggtctttcca gtactcttcg gcgggaaatc ctttttcgtc 6000 cgctcggtaa gaacctaaca tgtaaaattc gttcacggct ttgtatggac aacagccttt 6060 ttctaccggc agggcgtacg cttgagcggc ctttctgaga gaggtgtggg tgagggcgaa 6120 ggtgtcccgc accatcactt tcaggtactg atgtttgaag tccgtgtcgt cgcaggcgcc 6180 ctgttcccac agcgtgaagt cggtgcgctt tttctgcctg ggattgggga gggcgaatgt 6240 gacgtcgtta aagaggattt tcccggcgcg gggcatgaag ttgcgagaga tcctgaaggg 6300 tccgggcacg tccgagcggt tgttgatgac ttgcgccgcc aggacgatct cgtcgaagcc 6360 gttgatgttg tggcccacga tgtaaagttc gataaagcgc ggctgtccct tgagggccgg 6420 cgcttttttc aactcctcgt aggtgagaca gtccggcgag gagagaccca gctccgcccg 6480 ggcccagtcg gagagctgag ggttagccgc gaggaaagag ctccacaggt caagggctag 6540 cagagtttgc aagcggtcgc ggaactcgcg aaactttttc cccacggcca ttttctccgg 6600 cgtcaccacg tagaaagtgc aggggcggtc gttccagacg tcccatcgga gctctagggc 6660 cagctcgcag gcttgacgaa cgagggtctc ctcgcccgag acgtgcatga ccagcatgaa 6720 gggtaccaac tgtttcccga acgagcccat ccatgtgtag gtttctacgt cgtaggtgac 6780 aaagagccgc tgggtgcgcg cgtgggagcc gatcgggaag aagctgatct cctgccacca 6840 gttggaggaa tgggtgttga tgtggtgaaa gtagaagtcc cgccggcgca cagagcattc 6900 gtgctgatgt ttgtaaaagc gaccgcagta gtcgcagcgc tgcacgctct gtatctcctg 6960 aatgagatgc gcttttcgcc cgcgcaccag aaaccggagg gggaagttga gacgggggct 7020 tggtggggcg gcatcccctt cgccttggcg gtgggagtct gcgtctgcgc cctccttctc 7080 tgggtggacg acggtgggga cgacgacgcc ccgggtgccg caagtccaga tctccgccac 7140 ggaggggcgc aggcgttgca ggaggggacg cagctgcccg ctgtccaggg agtcgagggc 7200 ggccgcgctg aggtcggcgg gaagcgtttg caagttcact ttcagaagac cggtaagagc 7260 gtgagccagg tgcacatggt acttgatttc caggggggtg ttggaagagg cgtccacggc 7320 gtagaggagg ccgtgtccgc gcggggccac caccgtgccc cgaggaggtt ttatctcact 7380 cgtcgagggc gagcgccggg gggtagaggc ggctctgcgc cggggggcag cggaggcagt 7440 ggcacgtttt cgtgaggatt cggcagcggt tgatgacgag cccggagact gctggcgtgg 7500 gcgacgacgc ggcggttgag gtcctggatg tgccgtctct gcgtgaagac caccggcccc 7560 cgggtcctga acctgaaaga gagttccaca gaatcaatgt ctgcatcgtt aacggcggcc 7620 tgcctgagga tctcctgtac gtcgcccgag ttgtcttgat aggcgatctc ggccatgaac 7680 tgctccactt cttcctcgcg gaggtcgccg tggcccgctc gctccacggt ggcggccagg 7740 tcgttggaga tgcgacgcat gagttgagag aaggcgttga ggccgttctc gttccacacg 7800 cggctgtaca ccacgtttcc gaaggagtcg cgcgctcgca tgaccacctg ggccacgttg 7860 agttccacgt ggcgggcgaa gacggcgtag tttctgaggc gctggaagag gtagttgagc 7920 gtggtggcga tgtgctcgca gacgaagaag tacatgatcc agcgccgcag ggtcatctcg 7980 ttgatgtctc cgatggcttc gagacgctcc atggcctcgt agaagtcgac ggcgaagttg 8040 aaaaattggg agttgcgggc ggccaccgtg agttcttctt gcaggaggcg gatgagatcg 8100 gcgaccgtgt cgcgcacctc ctgctcgaaa gcgccccgag gcgcctctgc ttcttcctcc 8160 ggctcctcct cttccagggg cacgggttcc tccggcagct ctgcgacggg gacggggcgg 8220 cgacgtcgtc gtctgaccgg caggcggtcc acgaagcgct cgatcatttc gccgcgccgg 8280 cgacgcatgg tctcggtgac ggcgcgtccg ttttcgcgag gtcgcagttc gaagacgccg 8340 ccgcgcagag cgcccccgtg cagggagggt aagtggttag ggccgtcggg cagggacacg 8400 gcgctgacga tgcattttat caattgctgc gtaggcactc cgtgcaggga tctgagaacg 8460 tcgaggtcga cgggatccga gaacttctct aggaaagcgt ctatccaatc gcagtcgcaa 8520 ggtaagctga ggacggtggg ccgctggggg gcgtccgcgg gcagttggga ggtgatgctg 8580 ctgatgatgt aattaaagta ggcggtcttc aggcggcgga tggtggcgag gaggaccacg 8640 tctttgggcc cggcctgttg aatgcgcagg cgctcggcca tgccccaggc ctcgctctga 8700 cagcgacgca ggtctttgta gtagtcttgc atcagtctct ccaccggaac ctctgcttct 8760 cccctgtctg ccatgcgagt cgagccgaac ccccgcaggg gctgcagcaa cgctaggtcg 8820 gccacgaccc tctcggccag cacggcctgt tggatctgcg tgagggtggt ctggaagtcg 8880 tccaggtcca cgaagcggtg ataggccccc gtgttgatgg tgtaggtgca gttggccatg 8940 acggaccagt tgacgacttg catgccgggt tgggtgatct ccgtgtactt gaggcgcgag 9000 taggcgcggg actcgaacac gtagtcgttg catgtgcgta ccagatactg gtagccaacc 9060 aggaagtggg gaggcggttc tcggtacagg ggccagccga ctgtggcggg ggcgccgggg 9120 gacaggtcgt ccagcatgag gcgatggtag tggtagatgt agcgggagag ccaggtgatg 9180 ccggccgagg tggtcgcggc cctggtgaat tcgcggacgc ggttccagat gttgcgcagg 9240 gggcgaaagc gctccatggt gggcacgctc tgccccgtga ggcgggcgca atcttgtacg 9300 ctctagatgg aaaaaagaca gggcggtcat cgactccctt ccgtagctcg gggggtaaag 9360 tcgcaagggt gcggcggcgg ggaaccccgg ttcgagaccg gccggatccg ccgctcccga 9420 tgcgcctggc cccgcatcca cgacgtccgc gtcgagaccc agccgcgacg ctccgcccca 9480 atacggaggg gagtcttttg gtgttttttc gtagatgcat ccggtgctgc ggcagatgcg 9540 acctcagacg cccaccacca ccgccgcggc ggcagtaaac ctgagcggag gcggtgacag 9600 ggaggaggag gagctggctt tagacctgga agagggagag gggctggccc ggctgggagc 9660 gccgtcccca gagagacacc ctagggttca gctcgtgagg gacgccaggc aggcttttgt 9720 gccgaagcag aacctgttta gggaccgcag cggtcaggag gcggaggaga tgcgcgattg 9780 caggtttcgg gcgggtagag agctgagggc gggcttcgat cgggagcggc tcctgagggc 9840 ggaggatttc gagcccgacg agcgttctgg ggtgagcccg gcccgcgctc acgtctcggc 9900 ggccaacctg gtgagcgcgt acgagcagac ggtgaacgag gagcgcaact tccaaaagag 9960 ctttaacaat cacgtgagga ccctgatcgc gagggaggag gtgaccatcg ggctgatgca 10020 tctgtgggac ttcgtggagg cctacgtgca gaacccggcc agcaaacctc tgacggccca 10080 gctgttcctg atcgtgcagc acagccgcga caacgagacg ttccgcgacg ccatgttgaa 10140 catcgcggag cccgagggtc gctggctctt ggatctgatt aacatcctgc agagcatcgt 10200 ggtgcaggag aggggcctca gcttagcgga caaggtggcg gccattaact attcgatgca 10260 gagcctgggg aagttctacg ctcgcaagat ctacaagagc ccttacgtgc ccatagacaa 10320 ggaggtgaag atagacagct tttacatgcg catggcgctg aaggtgctga cgctgagcga 10380 cgacctcggc gtgtaccgta acgacaagat ccacaaggcg gtgagcgcca gccgccggcg 10440 ggagctgagc gacagggagc tgatgcacag cctgcagagg gcgctggcgg gcgccgggga 10500 cgaggagcgc gaggcttact tcgacatggg agccgatctg cagtggcgtc ccagcgcgcg 10560 cgccttggag gcggcgggct accccgacga ggaggatcgg gacgatttgg aggaggcagg 10620 cgagtacgag gacgaagcct gaccgggcag gtgttgtttt agatgcagcg gccggcggac 10680 ggggccaccg cggatcccgc acttttggca tccatgcaga gtcaaccttc gggcgtgacc 10740 gcctccgatg actgggcggc ggccatggac cgcattatgg cgctgactac ccgcaacccc 10800 gaggctttta gacagcaacc ccaggccaac cgtttttcgg ccatcttgga agcggtggtg 10860 ccctcccgca ccaaccccac acacgagaaa gtcctgacta tcgtgaacgc cctggtagac 10920 agcaaggcca tccgccgcga cgaggcgggc ttgatttaca acgctctgct ggaacgggtg 10980 gcgcgctaca acagcactaa cgttcagacc aatctggatc gcctcaccac cgacgtgaag 11040 gaggcgctgg ctcagaagga gcggtttctg agggacagca atctgggctc tctggtggca 11100 ctcaacgcct tcctgagcac gcagccggcc aacgtgcccc gcgggcagga ggactacgtg 11160 agcttcatca gcgctctgag gctgctggtg tccgaggtgc cccagagcga ggtgtatcag 11220 tctgggccgg attacttctt ccagacgtcc cgacagggct tgcaaacggt gaacctgact 11280 caggccttta aaaacttgca aggcatgtgg ggcgttaagg ccccggtggg cgatcgagcc 11340 accatctcca gtctgctgac ccccaacact cgcctgctgc tgctcttgat cgcgccgttc 11400 accaacagta gcactatcag ccgtgactcg tacctgggtc atctcatcac tttgtaccgc 11460 gaggccatcg gtcaggctca gatcgacgag cacacatatc aggagatcac taacgtgagc 11520 cgggccctgg gtcaggaaga taccggcagc ctggaagcca cgttgaactt tttgctaacc 11580 aaccggaggc aaaaaatacc ctcccagttt acgttaagcg ccgaggagga gaggattctg 11640 cgatacgtgc agcagtccgt gagtctgtac ttgatgcggg agggcgccac cgcttccacg 11700 gctttagaca tgacggctcg gaacatggaa ccgtcctttt actccgccca ccggccgttc 11760 attaaccgtc tgatggacta cttccatcgc gcggccgcca tgaacgggga gtacttcacc 11820 aatgccatcc tgaatccgca ttggatgccc ccgtccggct tctacaccgg cgagtttgac 11880 ctgcccgaag ccgacgacgg ctttctttgg gacgacgtgt ccgacagcat tttcacgccg 11940 ggcaatcgcc gattccagaa gaaggagggc ggagacgagc tccccctctc cagcgtggag 12000 gcggcctcta ggggagagag tccctttccc agtctgtctt ccgccagcag tggtcgggta 12060 acgcgcccgc ggttgccggg ggagagcgac tacctgaacg accccttgct gcggccggct 12120 aggaagaaaa atttccccaa caacggggtg gaaagcttgg tggataaaat gaatcgttgg 12180 aagacctacg cccaggagca gcgggagtgg gaggacagtc agccgcgacc gctggttccg 12240 ccgcactggc gtcgtcagag agaagacccg gacgactccg cagacgatag tagcgtgttg 12300 gacctgggag ggagcggagc caaccccttt gctcacttgc aacccaaggg gcgttccagt 12360 cgcctctact aataaaaaag acgcggaaac ttaccagagc catggccaca gcgtgtgtcc 12420 tttcttcctc tctttcttcc tcggcgcggc aga atg aga aga gcg gtg aga gtc 12474
Met Arg Arg Ala Val Arg Val 1 5
acg ccg gcg gcg tat gag ggt ccg ccc cct tet tac gaa age gtg atg 12522 Thr Pro Ala Ala Tyr Glu Gly Pro Pro Pro Ser Tyr Glu Ser Val Met 10 15 20
gga tea gcg aac gtg ccg gcc acg etg gag gcg cct tac gtt cct ccc 12570 Gly Ser Ala Asn Val Pro Ala Thr Leu Glu Ala Pro Tyr Val Pro Pro 25 30 35
aga tac etg gga cct acg gag ggc aga aac age atc cgt tac tcc gag 12618 Arg Tyr Leu Gly Pro Thr Glu Gly Arg Asn Ser Ile Arg Tyr Ser Glu 40 45 50 55
etg gca ccc etg tac gat acc acc aag gtg tac etg gtg gac aac aag 12666 Leu Ala Pro Leu Tyr Asp Thr Thr Lys Val Tyr Leu Val Asp Asn Lys 60 65 70
tcg gcg gac atc gcc tcc etg aat tat caa aac gat cac age aat ttt 12714 Ser Ala Asp Ile Ala Ser Leu Asn Tyr Gin Asn Asp His Ser Asn Phe 75 80 85
etg act acc gtg gtg cag aac aat gac ttc acc ccg acg gag gcg ggc 12762 Leu Thr Thr Val Val Gin Asn Asn Asp Phe Thr Pro Thr Glu Ala Gly 90 95 100
acg cag acc att aac ttt gac gag cgt tcc cgc tgg ggc ggt cag etg 12810 Thr Gin Thr Ile Asn Phe Asp Glu Arg Ser Arg Trp Gly Gly Gin Leu 105 110 115
aaa acc atc etg cac acc aac atg ccc aac atc aac gag ttc atg tcc 12858 Lys Thr Ile Leu His Thr Asn Met Pro Asn Ile Asn Glu Phe Met Ser 120 125 130 135
acc aac aag ttc agg gcc agg etg atg gtt aaa aag get gaa aac cag 12906 Thr Asn Lys Phe Arg Ala Arg Leu Met Val Lys Lys Ala Glu Asn Gin 140 145 150
cct ccc gag tac gaa tgg ttt gag ttc acc att ccc gag ggc aac tat 12954 Pro Pro Glu Tyr Glu Trp Phe Glu Phe Thr Ile Pro Glu Gly Asn Tyr 155 160 165
tcc gag acc atg act atc gat etg atg aac aat gcg atc gtg gac aat 13002 Ser Glu Thr Met Thr Ile Asp Leu Met Asn Asn Ala Ile Val Asp Asn 170 175 180
tac etg caa gtg ggg agg cag aac ggg gta ttg gaa age gat atc ggc 13050 Tyr Leu Gin Val Gly Arg Gin Asn Gly Val Leu Glu Ser Asp Ile Gly 185 190 195
gta aaa ttt gat acc aga aac ttc ega etg ggg tgg gat ccc gtg acc 13098 Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro Val Thr 200 205 210 215
aag etg gtg atg cca ggc gtg tac acc aac gag get ttt cac ccc gac 13146 Lys Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His Pro Asp 220 225 230
atc gtg etg etg ccg ggg tgc ggt gtg gac ttc act cag age cgt ttg 13194 Ile Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Gin Ser Arg Leu 235 240 245
agt aac etg tta ggg atc aga aag cgc cgc ccc ttc caa gag ggc ttt 13242 Ser Asn Leu Leu Gly Ile Arg Lys Arg Arg Pro Phe Gin Glu Gly Phe 250 255 260
cag atc atg tat gag gac etg gaa gga ggt aac att cca ggt ttg eta 13290 Gin Ile Met Tyr Glu Asp Leu Glu Gly Gly Asn Ile Pro Gly Leu Leu 265 270 275
gac gtg ccg gcg tat gaa gag agt gtt aaa cag gcg gag gcg cag gga 13338 Asp Val Pro Ala Tyr Glu Glu Ser Val Lys Gin Ala Glu Ala Gin Gly 280 285 290 295
ega gag att ega ggc gac acc ttt gcc acg gaa cct cac gaa etg gta 13386 Arg Glu Ile Arg Gly Asp Thr Phe Ala Thr Glu Pro His Glu Leu Val 300 305 310
ata aaa cct etg gaa caa gac agt aaa aaa egg agt tac aac att ata 13434 Ile Lys Pro Leu Glu Gin Asp Ser Lys Lys Arg Ser Tyr Asn Ile Ile 315 320 325
tcc ggc act atg aat acc ttg tac egg age tgg ttt etg get tac aac 13482 Ser Gly Thr Met Asn Thr Leu Tyr Arg Ser Trp Phe Leu Ala Tyr Asn 330 335 340
tac ggg gat ccc gaa aag gga gtg aga tea tgg acc ata etc acc acc 13530 Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser Trp Thr Ile Leu Thr Thr 345 350 355
acg gac gtg acc tgc ggc tcg cag caa gtg tac tgg tcc etg ccg gat 13578 Thr Asp Val Thr Cys Gly Ser Gin Gin Val Tyr Trp Ser Leu Pro Asp 360 365 370 375
atg atg caa gac ccg gtc acc ttc cgc ccc tcc acc caa gtc age aac 13626 Met Met Gin Asp Pro Val Thr Phe Arg Pro Ser Thr Gin Val Ser Asn 380 385 390
ttc ccg gtg gtg ggc acc gag etg etg ccc gtc eat gcc aag age ttc 13674 Phe Pro Val Val Gly Thr Glu Leu Leu Pro Val His Ala Lys Ser Phe 395 400 405
tac aac gaa cag gcc gtc tac tcg caa etc att cgc cag tcc acc gcg 13722 Tyr Asn Glu Gin Ala Val Tyr Ser Gin Leu Ile Arg Gin Ser Thr Ala 410 415 420
ett acc cac gtg ttc aat cgc ttt ccc gag aac cag att etg gtg cgc 13770 Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn Gin Ile Leu Val Arg 425 430 435
cct ccc get cct acc att acc acc gtc agt gaa aac gtt ccc gcc etc 13818 Pro Pro Ala Pro Thr Ile Thr Thr Val Ser Glu Asn Val Pro Ala Leu 440 445 450 455
aca gat cac gga acc etg ccg etg cgc age agt atc agt gga gtt cag 13866 Thr Asp His Gly Thr Leu Pro Leu Arg Ser Ser Ile Ser Gly Val Gin 460 465 470
cgc gtg acc atc acc gac gcc aga cgt ega acc tgt ccc tac gtt tac 13914 Arg Val Thr Ile Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr Val Tyr 475 480 485
aaa get ett ggc gta gtg get cct aaa gtg etc tet agt cgc acc ttc 13962 Lys Ala Leu Gly Val Val Ala Pro Lys Val Leu Ser Ser Arg Thr Phe 490 495 500
taa acatgtccat cctcatctct cccgataaca acaccggctg gggactgggc 14015 teeggcaaga tgtaeggegg agccaaaagg cgctccagtc agcacccagt tegagttegg 14075 ggccacttcc gtgctccctg gggagettae aagegaggae tctcgggccg aaeggeggta 14135 gacgatacca tagatgcegt gattgecgae gcccgccggt acaaccccgg accggtcgct 14195 agcgccgcct ccaccgtgga ttcegtgate gaeagegtgg tagctggege tegggcctat 14255 gctcgccgca agaggegget geateggaga cgtcgcccca ccgccgccat gctggcagcc 14315 agggcegtgc tgaggeggge eeggagggta ggcagaaggg ctatgcgccg cgctgccgcc 14375 aacgccgccg ccgggagggc ccgccgacag gctgcccgcc aggctgctgc cgccatcgct 14435 agcatggcca gacccaggag agggaacgtg tactgggtgc gcgattctgt gacgggagtc 14495 cgagtgccgg tgcgcagccg acctccccga agttagaaga tccaagctgc gaagacggcg 14555 gtactgagtc tccctgttgt tatcagccca acatgagcaa gcgcaagttt aaagaagaac 14615 tgctgcagac gctggtgcct gagatctatg gccctccgga cgtgaagcct gacattaagc 14675 cccgcgatat caagcgtgtt aaaaagcggg aaaagaaaga ggaactcgcg gtggtagacg 14735 atggcggagt ggaatttatt aggagtttcg ccccgcgacg cagggttcaa tggaaagggc 14795 ggcgggtaca acgcgttttg aggccgggca ccgcggtagt ttttaccccg ggagagcggt 14855 cggccgttag gggtttcaaa aggcagtacg acgaggtgta cggcgacgag gacatattgg 14915 aacaggcggc tcaacagatc ggagaatttg cctacggaaa gcgttcgcgt cgcgaagacc 14975 tggccatcgc tttagacagc ggcaacccca cgcccagcct caaacctgtg acgctgcagc 15035 aggtgctccc cgtgagcgcc agcacggaca gcaagagggg aataaaaaga gaaatggaag 15095 atctgcagcc caccatccag ctcatggtcc ctaaacggca gaggctggaa gaggtcctgg 15155 agaaaatgaa agtggaccca agcatagagc cggacgtcaa agtcaggccg atcaaagaag 15215 tggcccctgg tctcggggtg cagacggtgg atatccagat ccccgtcacg tcagcttcga 15275 ccgccgtgga agccatggaa acgcaaacgg aaacccctgc cgcgatcggt accagggaag 15335 tggcgttgca aaccgacccc tggtacgaat acgccgcccc tcggcgtcag aggcgacccg 15395 ctcgttacgg ccccgccaac gccatcatgc cagaatatgc gctgcatccg tctatcctgc 15455 ccacccccgg ctaccgggga gtgacgtatc gcccgtcagg aacccgccgc cgaacccgtc 15515 gccgccgccg ctcccgtcgt gctctggccc ccgtgtcggt gcgccgcgta acacgccggg 15575 gaaagacagt taccattccc aacccgcgct accaccctag catcctttaa tgactctgcc 15635 gttttgcaga tggctctgac ttgccgcgtg cgccttcccg ttccgcacta tcgaggaaga 15695 tctcgtcgta ggagaggcat ggcgggtagt ggtcgccggc gggctttgcg caggcgcatg 15755 aaaggcggaa ttttacccgc tctgataccc ataatcgccg ccgccatcgg tgccataccc 15815 ggcgtcgctt cagtggcctt gcaagcagct cgtaataaat aaacgaaggc ttttgcactt 15875 atgtcctggt cctgactatt ttatgcagaa agagcatgga agacatcaat tttacgtcgc 15935 tggctccgcg gcacggctcg cggccgctca tgggcacctg gaacgacatc ggcaccagtc 15995 agctcaacgg gggcgctttc aattggggga gcctttggag cggcattaaa aactttggct 16055 ccacgattaa atcctacggc agcaaagcct ggaacagtag tgctggtcag atgctccgag 16115 ataaactgaa ggacaccaac ttccaagaaa aagtggtcaa tggggtggtg accggcatcc 16175 acggcgcggt agatctcgcc aaccaagcgg tgcagaaaga gattgacagg cgtttggaaa 16235 gctcgcgggt gccgccgcag agaggggatg aggtggaggt cgaggaagta gaagtagagg 16295 aaaagctgcc cccgctggag aaagttcccg gtgcgcctcc gagaccgcag aagcgaccca 16355 ggccagaact agaagaaact ctggtgacgg agagcaagga gcctccctcg tacgagcaag 16415 ccttgaaaga gggcgcctct ccaccctacc caatgacaaa accgatcgcg cctatggctc 16475 ggccggtgta cgggaaggac tacaagcctg tcacgctaga gctccccccg ccgccaccgc 16535 cgccccccac gcgcccgacc gttccccccc ccctgccggc tccgtcggcg ggacccgtgt 16595 ccgcacccgt cgccgtgcct ctgccagccg cccgcccagt ggccgtggcc actgccagaa 16655 accccagagg ccagagagga gccaactggc aaagcacgct gaacagcatc gtgggcctgg 16715 gagtgaaaag cctgaaacgc cgccgttgct attattaaaa gtgtagctaa aaaatttccc 16775 gttgtatacg cctcctatgt taccgccaga gacgcgtgac tgtcgccgcg agcgccgctt 16835 tcaag atg gcc acc cca tcg atg atg ccg cag tgg tet tac atg cac atc 16885
Met Ala Thr Pro Ser Met Met Pro Gin Trp Ser Tyr Met His Ile 505 510 515
gcc ggg cag gac gcc tcg gag tac etg age ccc ggt etc gtg cag ttc 16933 Ala Gly Gin Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gin Phe 520 525 530
gcc cgc gcc acc gac acc tac ttc age ttg gga aac aag ttt aga aac 16981 Ala Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn 535 540 545 550
ccc acc gtg gcc ccc acc cac gat gta acc acg gac cgc tcg caa agg 17029 Pro Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gin Arg 555 560 565
etg acc etg cgt ttt gtg ccc gta gac egg gag gac acc gcg tac tet 17077 Leu Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Thr Ala Tyr Ser 570 575 580
tac aaa gtg cgc tac acg etg gcc gta ggg gac aac ega gtg etg gac 17125 Tyr Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp 585 590 595
atg gcc age acc tac ttt gac atc egg gga gtg etg gat cgc ggt ccc 17173 Met Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro 600 605 610
agt ttt aag ccc tac tcg ggt acc gcg tac aat tcc etg get ccc aag 17221 Ser Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys 615 620 625 630
ggc get ccc aac cct gca gaa tgg acg aat tea gac age aaa gtt aaa 17269 Gly Ala Pro Asn Pro Ala Glu Trp Thr Asn Ser Asp Ser Lys Val Lys 635 640 645
gtg agg gca cag gcg cct ttt gtt age tcg tat ggt get aca gcg att 17317 Val Arg Ala Gin Ala Pro Phe Val Ser Ser Tyr Gly Ala Thr Ala Ile 650 655 660
aca aaa gag ggt att cag gtg gga gta acc tta aca gac tcc gga tea 17365 Thr Lys Glu Gly Ile Gin Val Gly Val Thr Leu Thr Asp Ser Gly Ser 665 670 675
aca cca cag tat gca gat aaa acg tat cag cct gag ccg caa att gga 17413 Thr Pro Gin Tyr Ala Asp Lys Thr Tyr Gin Pro Glu Pro Gin Ile Gly 680 685 690
gaa eta cag tgg aac age gat gtt gga acc gat gac aaa ata gca gga 17461 Glu Leu Gin Trp Asn Ser Asp Val Gly Thr Asp Asp Lys Ile Ala Gly 695 700 705 710
aga gtg eta aag aaa aca acg ccc atg ttc cct tgt tac ggc tea tat 17509 Arg Val Leu Lys Lys Thr Thr Pro Met Phe Pro Cys Tyr Gly Ser Tyr 715 720 725
gcc agg ccc act aat gaa aaa gga gga cag gca aca ccg tcc get agt 17557 Ala Arg P ro Thr Asn Glu Lys Gly Gly Gin Ala Thr Pro Ser Ala Ser 730 735 740
caa gac gtg caa aat ccc gaa tta caa ttt ttt gcc tet act aat gtc 17605 Gin Asp Val Gin Asn Pro Glu Leu Gin Phe Phe Ala Ser Thr Asn Val 745 750 755
gcc aat aca cca aaa gca gtt eta tat gcg gag gac gtg tea att gaa 17653 Ala Asn Thr Pro Lys Ala Val Leu Tyr Ala Glu Asp Val Ser Ile Glu 760 765 770
gcg cca gac act cac ttg gtg ttc aaa cca aca gtc act gaa ggc att 17701 Ala Pro Asp Thr His Leu Val Phe Lys Pro Thr Val Thr Glu Gly Ile 775 780 785 790
aca agt tea gag get eta etg acc caa caa get get ccc aac cgt cca 17749 Thr Ser Ser Glu Ala Leu Leu Thr Gin Gin Ala Ala Pro Asn Arg Pro 795 800 805
aac tac ata gcc ttt aga gat aat ttt att ggt etc atg tac tac aat 17797 Asn Tyr Ile Ala Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn 810 815 820
age aca ggt aac atg gga gta etg gca ggc cag get tet cag eta aat 17845 Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gin Ala Ser Gin Leu Asn 825 830 835
gca gtt gtt gac etg caa gac aga aat act gag etg tcc tac caa etc 17893 Ala Val Val Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser Tyr Gin Leu 840 845 850
atg ttg gac gcc etc gga gac cgc agt egg tac ttt tet atg tgg aac 17941 Met Leu Asp Ala Leu Gly Asp Arg Ser Arg Tyr Phe Ser Met Trp Asn 855 860 865 870
caa get gtg gat agt tac gat cct gat gta aga atc ata gaa aac eat 17989 Gin Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His 875 880 885
ggc gta gaa gat gaa ttg cct aat tat tgc ttt cct ttg gga ggc atg 18037 Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Gly Gly Met 890 895 900
gca gta acc gac acc tac tcg cct ata aag gtt aat gga gga ggc aat 18085 Ala Val Thr Asp Thr Tyr Ser Pro Ile Lys Val Asn Gly Gly Gly Asn 905 910 915
gga tgg gaa gcc aat aac ggc gtt ttc acc gaa aga gga gtg gaa ata 18133 Gly Trp Glu Ala Asn Asn Gly Val Phe Thr Glu Arg Gly Val Glu Ile 920 925 930
ggt tea ggg aac atg ttt gcc atg gag att aac etg caa gcc aac eta 18181 Gly Ser Gly Asn Met Phe Ala Met Glu Ile Asn Leu Gin Ala Asn Leu 935 940 945 950
tgg cgt age ttt etg tac tcc aat att ggg etg tac etg cca gac tet 18229 Trp Arg Ser Phe Leu Tyr Ser Asn Ile Gly Leu Tyr Leu Pro Asp Ser 955 960 965
etc aaa atc act cct gac aac atc aca etc cca gag aac aaa aac acc 18277 Leu Lys Ile Thr Pro Asp Asn Ile Thr Leu Pro Glu Asn Lys Asn Thr 970 975 980
tat cag tat atg aac ggt cgc gtg acg cca ccc ggg etg gtt gac acc 18325 Tyr Gin Tyr Met Asn Gly Arg Val Thr Pro Pro Gly Leu Val Asp Thr 985 990 995
tac gtt aac gtg ggc gcg cgc tgg tcc ccc gat gtc atg gac agt 18370 Tyr Val Asn Val Gly Ala Arg Trp Ser Pro Asp Val Met Asp Ser 1000 1005 1010
att aac cct ttt aat cac cac cgc aac gcc gga etc cgc tac cgt 18415 Ile Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg 1015 1020 1025
tcc atg etc etg gga aac gga cgc tac gtg ccc ttc cac atc cag 18460 Ser Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His Ile Gin 1030 1035 1040
gtg ccc cag aaa ttc ttt gca att aaa aac etg etg etg etc ccc 18505 Val Pro Gin Lys Phe Phe Ala Ile Lys Asn Leu Leu Leu Leu Pro 1045 1050 1055
ggt tcc tac acc tac gag tgg aac ttc cgc aag gac gtg aac atg 18550 Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met 1060 1065 1070
atc ttg cag age tcg etg ggc aat gac etg ega gtg gac ggg gcc 18595 Ile Leu Gin Ser Ser Leu Gly Asn Asp Leu Arg Val Asp Gly Ala 1075 1080 1085
age atc cgc ttc gac age atc aac etg tac gcc aac ttt ttc ccc 18640 Ser Ile Arg Phe Asp Ser Ile Asn Leu Tyr Ala Asn Phe Phe Pro 1090 1095 1100
atg gcc cac aac acg gcc tcc acc etg gaa gcc atg etg cgc aac 18685 Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu Arg Asn 1105 1110 1115
gac acc aac gac caa tet ttc aac gac tac etg tgc gcg gcc aac 18730 Asp Thr Asn Asp Gin Ser Phe Asn Asp Tyr Leu Cys Ala Ala Asn 1120 1125 1130
atg etg tac ccc atc ccc gcc aac gcc acc age gtg ccc atc tcc 18775 Met Leu Tyr Pro Ile Pro Ala Asn Ala Thr Ser Val Pro Ile Ser 1135 1140 1145
att ccc tet cgc aac tgg gca gcc ttc agg ggc tgg agt ttc acc 18820 Ile Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr 1150 1155 1160
cgc etc aaa acc aag gag acc ccc tcg etg ggc tcc ggg ttc gac 18865 Arg Leu Lys Thr Lys Glu Thr Pro Ser Leu Gly Ser Gly Phe Asp 1165 1170 1175
ccc tac ttc gtc tac tcc ggc tcc atc ccc tac etg gac ggc acc 18910 Pro Tyr Phe Val Tyr Ser Gly Ser Ile Pro Tyr Leu Asp Gly Thr 1180 1185 1190
ttc tac etc aac eat act ttc aaa aag gtg tea atc atg ttc gac 18955 Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser Ile Met Phe Asp 1195 1200 1205
tcc tcc gtc age tgg ccc ggc aac gac cgt etg etg acg ccc aac 19000 Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro Asn 1210 1215 1220
gag ttc gaa atc aag cgt tcg gtg gac ggt gaa ggg tac aac gtg 19045 Glu Phe Glu Ile Lys Arg Ser Val Asp Gly Glu Gly Tyr Asn Val 1225 1230 1235
get cag age aac atg acc aag gac tgg ttc etg att cag atg etc 19090 Ala Gin Ser Asn Met Thr Lys Asp Trp Phe Leu Ile Gin Met Leu 1240 1245 1250
age cac tac aac atc ggc tac cag ggc ttc tac gtg ccc gaa aat 19135 Ser His Tyr Asn Ile Gly Tyr Gin Gly Phe Tyr Val Pro Glu Asn 1255 1260 1265
tac aag gac cgc atg tac tet ttc ttc aga aac ttc caa ccc atg 19180 Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gin Pro Met 1270 1275 1280
age cgc caa att gta gat tea acg get tac act aat tat cag gat 19225 Ser Arg Gin Ile Val Asp Ser Thr Ala Tyr Thr Asn Tyr Gin Asp 1285 1290 1295
gtg aaa etg cca tac cag eat aac aac tea ggg ttc gtg ggc tac 19270 Val Lys Leu Pro Tyr Gin His Asn Asn Ser Gly Phe Val Gly Tyr 1300 1305 1310
atg gga ccc acc atg ega gag ggg cag gcc tac ccg gcc aac tat 19315 Met Gly Pro Thr Met Arg Glu Gly Gin Ala Tyr Pro Ala Asn Tyr 1315 1320 1325
ccc tat ccc etg att ggg gcc acc gcc gtg ccc age etc acg cag 19360 Pro Tyr Pro Leu Ile Gly Ala Thr Ala Val Pro Ser Leu Thr Gin 1330 1335 1340
aaa aag ttc etc tgc gac egg gtg atg tgg agg atc ccc ttc tet 19405 Lys Lys Phe Leu Cys Asp Arg Val Met Trp Arg Ile Pro Phe Ser 1345 1350 1355
age aac ttc atg tet atg ggc tcc etc acc gac etg ggg cag aac 19450 Ser Asn Phe Met Ser Met Gly Ser Leu Thr Asp Leu Gly Gin Asn 1360 1365 1370
atg etg tac gcc aac tcc get cac gcc ttg gat atg acc ttt gag 19495 Met Leu Tyr Ala Asn Ser Ala His Ala Leu Asp Met Thr Phe Glu 1375 1380 1385
gtg gat ccc atg gat gag ccc acg ett etc tat gtt etg ttt gaa 19540 Val Asp Pro Met Asp Glu Pro Thr Leu Leu Tyr Val Leu Phe Glu 1390 1395 1400
gtc ttc gac gtg gtg cgc atc cac cag ccg cac cgc ggc gtc atc 19585 Val Phe Asp Val Val Arg Ile His Gin Pro His Arg Gly Val Ile 1405 1410 1415
gag gcc gtc tac etg cgc aca cct ttc tet gcc ggt aac gcc acc 19630 Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr 1420 1425 1430
acc taa agaagccgat gggctccagc gaacaggagc tgcaggccat tgttcgcgac 19686 Thr
ctgggctgcg ggccctactt tttgggcacc ttcgacaagc gttttcccgg cttcatgtcc 19746 ccccacaagc cggcctgtgc catcgttaac acggccggac gggagaccgg gggggtccac 19806 tggctcgcct tcgcctggaa cccgcgtaac cgcacctgct acctgttcga cccttttggt 19866 ttctccgacg aaaggctgaa gcagatctac cagttcgagt acgaggggct cctcaagcgc 19926 agcgctctgg cctccacgcc cgaccactgc gtcaccctgg aaaagtccac ccaaacggtc 19986 caggggcccc tctcggccgc ctgcgggctc ttctgttgca tgtttttgca cgccttcgtg 20046 cactggcctc acacccccat ggatcacaac cccaccatgg atctgctcac cggagtgccc 20106 aacagcatgc ttcacagccc ccaggtcgcc cccaccctgc gccgtaacca ggaacacctg 20166 tatcgctttc tggggaaaca ctctgcctat tttcgccgcc accggcagcg catcgaacgg 20226 gccacggcct tcgaaagcat gagccaaaga gtgtaatcaa taaaaaacat ttttatttga 20286 catgatacgc gcttctggcg ttttattaaa aatcgaaggg ttcgagggag gggtcctcgt 20346 gcccgctggg gagggacacg ttgcgatact ggaaacgggc gctccaacga aactcgggga 20406 tcaccagccg cggcaggggc acgtcttcta ggttctgctt ccaaaactgc cgcaccagct 20466 gcagggctcc catgacgtcg ggcgccgata tcttgaagtc gcagttaggg ccggagctcc 20526 cgcggctgtt gcggaacacg gggttggcac actggaacac cagcacgccg gggttgtgga 20586 tactggccag ggccgtcggg tcggtcacct ccgacgcatc cagatcctcg gcgttgctca 20646 gggcaaacgg ggtcagcttg cacatctgcc gcccaatctg gggtactagg tcgcgcttgt 20706 tgaggcagtc gcagcgcaga gggatcagga tgcgtcgctg cccgcgttgc atgatagggt 20766 aactcgccgc caggaactcc tccatttgac ggaaggccat ctgggctttg ccgccctcgg 20826 tgtagaatag cccgcaggac ttgctagaga atacgttatg accgcagttg acgtcctccg 20886 cgcagcagcg ggcgtcttcg ttcttcagct gaaccacgtt gcggccccaa cggttctgga 20946 ccaccttggc tctagtgggg tgctccttca gcgcccgctg tccgttctcg ctggttacat 21006 ccatttccaa cacgtgctcc ttgcagacca tctccactcc gtggaagcaa aacaggacgc 21066 cctcctgctg ggtactgcga tgctcccata cggcgcatcc ggtgggctcc cagctcttgt 21126 gttttacccc cgcgtaggct tccatgtaag ccataaggaa tctgcccatc agctcggtga 21186 aggtcttctg gttggtgaag gttagcggca ggccgcggtg ctcctcgttc aaccaagttt 21246 gacagatctt gcggtacacc gctccctggt cgggcagaaa cttaaaagcc gctctgctgt 21306 cgttgtctac gtggaacttc tccattaaca tcatcatggt ttccataccc ttctcccacg 21366 ctgtcaccag tggtttgctg tcggggttct tcaccaacac ggcggtagag gggccctcgc 21426 cggccccgac gtccttcatg gtcattcttt gaaactccac ggagccgtcc gcgcgacgta 21486 ctctgcgcac cggagggtag ctgaagccca cctccaccac ggtgccttcg ccctcgctgt 21546 cggagacaat ctccggggat ggcggcggcg cgggtgtcgc cttgcgagcc ttcttcttgg 21606 gagggagctg aggcgcctcc tgctcgcgct cggggctcat ctcccgcaag tagggggtaa 21666 tggagctgcc tgcttggttc tgacggttgg ccattgtatc ctaggcagaa agacatggag 21726 cttatgcgcg aggaaacttt aaccgccccg tcccccgtca gcgacgaaga tgtcatcgtc 21786 gaacaggacc cgggctacgt tacgccgccc gaggatctgg aggggcctga ccggcgcgac 21846 gctagtgagc ggcaggaaaa tgagaaagag gaggcctgct acctcctgga aggcgacgtt 21906 ttgctaaagc atttcgccag gcagagcacc atagttaagg aggccttgca agaccgctcc 21966 gaggtgccct tggacgtcgc cgcgctctcc caggcctacg aggcgaacct tttctcgcct 22026 cgagtgcctc cgaagagaca gcccaacggc acctgcgagc ccaacccgcg actcaacttc 22086 taccccgtgt tcgccgtacc agaggcgctg gccacctatc acattttttt caaaaaccaa 22146 cgcatccccc tatcgtgccg ggccaaccgc accgcggccg ataggaatct caggcttaaa 22206 aacggagcca acatacctga tatcacgtcg ctggaggaag tgcccaagat tttcgagggt 22266 ctgggtcgag atgagaagcg ggcggcgaac gctctgcaga aagaacagaa agagagtcag 22326 aacgtgctgg tggagctgga gggggacaac gcgcgtctgg ccgtcctcaa acgctgcata 22386 gaagtctccc acttcgccta ccccgccctc aacttgccac ccaaagttat gaaatcggtc 22446 atggatcagc tgctcatcaa gagagctgag cccctggatc ccgaccaccc cgaggcggaa 22506 aactcagagg acggaaagcc cgtcgtcagc gacgaggagc tcgagcggtg gctggaaacc 22566 agggaccccc aacagttgca agagaggcgc aagatgatga tggcggccgt gctggtcacc 22626 gtggagctgg aatgcctgca acggtttttc agcgacgtgg agacgctacg caaaatcggg 22686 gaatccctgc actacacctt ccgccagggc tacgtccgcc aggcctgcaa gatctccaac 22746 gtggagctca gcaacctggt ctcctacatg ggcatcctcc acgagaaccg gctggggcag 22806 agcgtgctgc actgcacctt gcaaggcgag gcgcggcggg actacgtgcg agactgcatc 22866 tacctcttcc tcaccctcac ctggcagacc gccatgggcg tctggcagca gtgcttggaa 22926 gagagaaacc tcaaagagct agacaaactc ctctgccgcc agcggcgcgc cctgtggtcc 22986 ggtttcagcg agcgcacggt cgccagcgct ctggcggaca tcatcttccc ggagcgcctg 23046 atgaaaacct tgcaaaacgg cctgccggat ttcatcagtc aaagcatttt gcaaaacttc 23106 cgctcttttg tcctggaacg ctccgggatc ttgcccgcca tgagctgcgc gctaccttct 23166 gactttgtcc ccctctccta ccgcgagtgc cctcccccac tgtggagcca ctgctacctc 23226 ttccaactgg ccaactttct ggcctaccac tccgacctca tggaagacgt aagcggagag 23286 ggtttactgg agtgccactg ccgctgcaac ctgtgcaccc cccacagatc gctggcctgc 23346 aacaccgagc tactcagcga aacccaggtc ataggtacct tcgagatcca ggggccccag 23406 cagcaagagg gtgcttccgg cttgaagctc actccggcgc tgtggacctc ggcttactta 23466 cgcaaatttg tagccgagga ctaccacgcc cacaaaattc agttttacga agaccaatct 23526 cgaccaccga aagcccccct cacggcctgc gtcatcaccc agagcaagat cctggcccaa 23586 ttgcaatcca tcaaccaagc gcgccgcgat ttccttttga aaaagggtcg gggggtgtac 23646 ctggaccccc agaccggcga ggaactcaac ccgtccacac tctccgtcga agcagccccc 23706 ccgagacatg ccgcccaagg gaaccgccaa gcagctgatc gctcggcaga gagcgaagaa 23766 gcaagagctg ctccagcagc aggtggagga cgaggaagag atgtgggaca gccaggcaga 23826 ggaggtgtca gaggacgagg aggagatgga aagctgggac agcctagacg aggaggagga 23886 cgagctttca gaggaagagg cgaccgaaga aaaaccacct gcatccagcg cgccttctct 23946 gagccgacag ccgaagcccc ggcccccgac gcccccggcc ggctcactca aagccagccg 24006 taggtgggac gccaccgaat ctccagcggc agcggcaacg gcagcgggta aggccaaacg 24066 cgagcggcgg gggtattgct cctggcgggc ccacaaaagc agtattgtga actgcttgca 24126 acactgcggg ggaaacatct cctttgcccg acgctacctc ctcttccatc acggtgtggc 24186 cttccctcgc aacgttctct attattaccg tcatctctac agcccctacg aaacgctcgg 24246 agaaaaaagc taaggcctcc tccgccgcga ggaaaaactc cgccgccgct gccgccgcca 24306 aggatccacc ggccaccgaa gagctgagaa agcgcatctt tcccactctg tatgctatct 24366 ttcagcaaag ccgcgggcag caccctcagc gcgaactgaa aataaaaaac cgctccttcc 24426 gctcgctcac ccgcagctgt ctgtaccaca agagagaaga ccagctgcag cgcaccctgg 24486 acgacgccga agcactgttc agcaaatact gctcagcgtc tcttaaagac taaaagaccc 24546 gcgctttttc cccctcggcc gccaaaaccc acgtcatcgc cagcatgagc aaggagattc 24606 ccacccccta catgtggagc tatcagcccc agatgggcct ggccgcgggg gccgcccagg 24666 actactccag caagatgaac tggctcagcg ccggccccca catgatctca cgagttaacg 24726 gcatccgagc ccaccgaaac cagattctct tagaacaggc ggcaatcacc gccacacccc 24786 ggcgccaact caacccgcct agttggcccg ccgcccaggt gtatcaggaa aatccccgcc 24846 cgaccacagt cctcctgcca cgcgacgcgg aggccgaagt cctcatgact aactctgggg 24906 tacaattagc gggcgggtcc aggtacgcca ggtacagagg tcgggccgct ccttactctc 24966 ccgggagtat aaagagggtg atcattcgag gccgaggtat ccagctcaac gacgagacgg 25026 tgagctcctc aaccggtctc agacctgacg gagtcttcca gctcggagga gcgggccgct 25086 cttccttcac cactcgccag gcctacctga ccctgcagag ctcttcctcg cagccgcgct 25146 ccgggggaat cggcactctc cagttcgtgg aagagttcgt tccctccgtc tacttcaacc 25206 ccttctccgg ctcgcctgga cgctacccgg acgccttcat tcccaacttt gacgcagtga 25266 gtgaatccgt ggacggctac gactgatgac agatggtgcg gccgtgagag ctcggctgcg 25326 acatctgcat cactgccgtc agcctcgctg ctacgctcgg gaggcgatcg tcttcagcta 25386 ctttgagctg ccggacgagc accctcaggg tccggctcac gggttgaaac tcgagatcga 25446 gaacgcgctc gagtctcgcc tcatcgacac cttcaccgcc cgacctctcc tggtagaaat 25506 ccaacggggg atcactacca tcaccctgtt ctgcatctgc cccacgcccg gattacatga 25566 agatctgtgt tgtcatcttt gcgctcagtt taataaaaac tgaacttttt gccgcacctt 25626 caacgccatc tgtgatttct acaacaaaaa gttcttctgg caaaggtaca caaactgtat 25686 tttattctaa ttctacctca tctatcgtgc tgaactgcgc ctgcactaac gaacttatcc 25746 agtggattgc aaacggtagt gtgtgcaagt acttttgggg gaacgatata gttagtagaa 25806 ataacagcct ttgcgagcac tgcaactcct ccacactaat cctttatccc ccatttgtta 25866 ctggatggta tatgtgcgtt ggctccggtt taaatcctag ttgctttcat aagtggtttc 25926 tacaaaaaga gacccttccc aacaattctg tttctttttt cgccctatcc tactgctgtt 25986 ctccctctgg ttactctttc aaacctctaa ttggtatttt agctttgata ctcataatct 26046 ttattaactt tataataatt aacaacttac agtaaacatg cttgttctac tgctcgccac 26106 atctttcgct ctctctcacg ccagaacaag tattgttggc gcaggttaca atgcaactct 26166 tcaatctgct tacatgccag attccgacca gataccccat attacgtggt acttacaaac 26226 ctccaaacct aattcttcat tttatgaagg aaacaaactc tgcgatgact ccgacaacag 26286 aacgcacaca tttccccacc cttcactaca attcgaatgc gtaaacaaaa gcttgaagct 26346 ttacaactta aagccttcag attctggctt gtaccatgct gtagttgaaa aaagtaattt 26406 agaagtccac agtgattaca ttgaattgac ggttgtggac ctgccacctc caaaatgtga 26466 ggtttcctcc tcttaccttg aagttcaagg cgtggatgcc tactgcctca tacacattaa 26526 ctgcagcaac tctaaatatc cagctagaat ttactataat ggacaggaaa gtaatctttt 26586 ttattattta acaacaagcg ctggtaacgg taaacagtta cctgactatt ttactgctgt 26646 tgttgaattt tccacctaca gagaaacgta tgccaagcgg ccttacaatt tctcataccc 26706 gtttaacgac ctttgcaatg aaatacaagc gctcgaaact ggaactgatt ttactccaat 26766 tttcattgct gccattgttg taagcttaat taccattatt gtcagcctag cattttactg 26826 cttttacaag cccaaaaacc ctaagtttga aaaacttaaa ctaaaacctg tcattcaaca 26886 agtgtgattt tgttttccag catggtagct gcatttctac ttctcctctg tctacccatc 26946 attttcgtct cttcaacttt cgccgcagtt tcccacctgg aaccagagtg cctaccgcct 27006 tttgacgtgt atctgattct cacctttgtt tgttgtatat ccatttgcag tatagcctgc 27066 ttttttataa caatctttca agccgccgac tatttttacg tgcgaattgc ttactttaga 27126 caccatcctg aatacagaaa tcaaaacgtt gcctccttac tttgtttggc atgattaagt 27186 tattgctgat acttaattat ttacccctaa tcaactgtaa ttgtccattc accaaaccct 27246 ggtcattcta cacctgttat gataaaatcc ccgacactcc tgttgcttgg ctttacgcag 27306 ccaccgccgc tttggtattt atatctactt gccttggagt aaaattgtat tttattttac 27366 acactgggtg gctacatccc agagaagatt tacctagata tcctcttgta aacgcttttc 27426 aattacagcc tctgcctcct cctgatcttc ttcctcgagc tccctctatt gtgagctact 27486 ttcaactcac cggtggagat gactgactct caggacatta atattagtgt ggaaagaata 27546 gctgctcagc gtcagcgaga aacgcgagtg ttggaatacc tggaactaca gcaacttaaa 27606 gagtcccact ggtgtgagaa aggagtgctg tgccatgtta agcaggcagc cctttcctac 27666 gatgtcagcg ttcagggaca tgaactgtct tacactttgc ctttgcagaa acaaaccttc 27726 tgcaccatga tgggctctac ctccatcaca atcacccaac aagccgggcc tgtagagggg 27786 gctatcctct gtcactgtca cgcacctgat tgcatgtcca aactaatcaa aactctctgt 27846 gctttaggtg atatttttaa ggtgtaaatc aataataaac ttaccttaaa tttgacaaca 27906 aatttctggt gacatcattc agcagcacca ctttaccctc ttcccagctc tcgtatggga 27966 tgcgatagtg ggtggcaaac ttcctccaaa ccctaaaaga aatattggta tccacttcct 28026 tgtcctcacc cacaattttc atcttttcat ag atg aaa aga acc aga gtt gat 28079
Met Lys Arg Thr Arg Val Asp 1435 1440
gaa gac ttc aac ccc gtc tac ccc tat gac acc aca acc act cct 28124 Glu Asp Phe Asn Pro Val Tyr Pro Tyr Asp Thr Thr Thr Thr Pro 1445 1450 1455
gca gtt ccc ttt ata tea ccc ccc ttt gta aac age gat ggt ett 28169 Ala Val Pro Phe Ile Ser Pro Pro Phe Val Asn Ser Asp Gly Leu 1460 1465 1470
cag gaa aac ccc cca ggt gtt tta agt etg ega ata get aaa ccc 28214 Gin Glu Asn Pro Pro Gly Val Leu Ser Leu Arg Ile Ala Lys Pro 1475 1480 1485
eta tat ttc gac atg gag aga aaa eta gcc ett tea ett gga aga 28259 Leu Tyr Phe Asp Met Glu Arg Lys Leu Ala Leu Ser Leu Gly Arg 1490 1495 1500
ggg ttg aca att acc gcc gcc gga caa tta gaa agt acg cag age 28304 Gly Leu Thr Ile Thr Ala Ala Gly Gin Leu Glu Ser Thr Gin Ser 1505 1510 1515
gta caa acc aac cca ccg ttg ata att acc aac aac aac aca etg 28349 Val Gin Thr Asn Pro Pro Leu Ile Ile Thr Asn Asn Asn Thr Leu 1520 1525 1530
acc eta cgt eat tet ccc ccc tta aac eta act gac aat age tta 28394 Thr Leu Arg His Ser Pro Pro Leu Asn Leu Thr Asp Asn Ser Leu 1535 1540 1545
gtg eta ggc tac tcg agt cct etc cgc gtc aca gac aac aaa ett 28439 Val Leu Gly Tyr Ser Ser Pro Leu Arg Val Thr Asp Asn Lys Leu 1550 1555 1560 aca ttt aac ttc aca tea cca etc cgt tat gaa aat gaa aac ett 28484 Thr Phe Asn Phe Thr Ser Pro Leu Arg Tyr Glu Asn Glu Asn Leu 1565 1570 1575
act ttt aac tat aca gag cct ett aaa ett ata aat aac age ett 28529 Thr Phe Asn Tyr Thr Glu Pro Leu Lys Leu Ile Asn Asn Ser Leu 1580 1585 1590
gcc att gac atc aat tcc tea aaa ggc ett agt age gtc gga ggc 28574 Ala Ile Asp Ile Asn Ser Ser Lys Gly Leu Ser Ser Val Gly Gly 1595 1600 1605
tea eta get gta aac etg agt tea gac tta aag ttt gac age aac 28619 Ser Leu Ala Val Asn Leu Ser Ser Asp Leu Lys Phe Asp Ser Asn 1610 1615 1620
gga tcc ata get ttt ggc ata caa acc etg tgg acc get ccg acc 28664 Gly Ser Ile Ala Phe Gly Ile Gin Thr Leu Trp Thr Ala Pro Thr 1625 1630 1635
tcg act ggc aac tgc acc gtc tac age gag ggc gat tcc eta ett 28709 Ser Thr Gly Asn Cys Thr Val Tyr Ser Glu Gly Asp Ser Leu Leu 1640 1645 1650
agt etc tgt tta acc aaa tgc gga get cac gtc tta gga agt gta 28754 Ser Leu Cys Leu Thr Lys Cys Gly Ala His Val Leu Gly Ser Val 1655 1660 1665
agt tta acc ggt tta aca gga acc ata acc caa atg act gat att 28799 Ser Leu Thr Gly Leu Thr Gly Thr Ile Thr Gin Met Thr Asp Ile 1670 1675 1680
tet gtc acc att caa ttt aca ttt gac aac aat ggt aag eta eta 28844 Ser Val Thr Ile Gin Phe Thr Phe Asp Asn Asn Gly Lys Leu Leu 1685 1690 1695
age tet cca ett ata aac aac gcc ttt agt att ega cag aat gac 28889 Ser Ser Pro Leu Ile Asn Asn Ala Phe Ser Ile Arg Gin Asn Asp 1700 1705 1710
agt acg gcc tea aac cct acc tac aac gcc etg gcg ttt atg cct 28934 Ser Thr Ala Ser Asn Pro Thr Tyr Asn Ala Leu Ala Phe Met Pro 1715 1720 1725
aac agt acc ata tat gca aga ggg gga ggt ggt gaa cca ega aac 28979 Asn Ser Thr Ile Tyr Ala Arg Gly Gly Gly Gly Glu Pro Arg Asn 1730 1735 1740
aac tac tac gtc caa acg tat ett agg gga aat gtt caa aaa cca 29024 Asn Tyr Tyr Val Gin Thr Tyr Leu Arg Gly Asn Val Gin Lys Pro 1745 1750 1755
atc att ett act gta acc tac aac tea gtc gcc aca gga tat tcc 29069 Ile Ile Leu Thr Val Thr Tyr Asn Ser Val Ala Thr Gly Tyr Ser 1760 1765 1770
tta tet ttt aag tgg act get ett gca cgt gaa aag ttt gca acc 29114 Leu Ser Phe Lys Trp Thr Ala Leu Ala Arg Glu Lys Phe Ala Thr 1775 1780 1785
cca aca acc tcg ttt tgc tac att aca gaa caa taa aacegtgtae 29160 Pro Thr Thr Ser Phe Cys Tyr Ile Thr Glu Gin
1790 1795
cccaccgttt cgtttttttc ag atg aaa egg gcg aga gtt gat gaa gac 29209
Met Lys Arg Ala Arg Val Asp Glu Asp 1800 1805
ttc aac cca gtg tac cct tat gac ccc cca eat get cct gtt atg 29254 Phe Asn Pro Val Tyr Pro Tyr Asp Pro Pro His Ala Pro Val Met 1810 1815 1820
ccc ttc att act cca cct ttt acc tcc tcg gat ggg ttg cag gaa 29299 Pro Phe Ile Thr Pro Pro Phe Thr Ser Ser Asp Gly Leu Gin Glu 1825 1830 1835
aaa cca ett gga gtg tta agt tta aac tac aga gat ccc att act 29344 Lys Pro Leu Gly Val Leu Ser Leu Asn Tyr Arg Asp Pro Ile Thr 1840 1845 1850
acg caa aat gag tet ett aca att aaa eta gga aac ggc etc act 29389 Thr Gin Asn Glu Ser Leu Thr Ile Lys Leu Gly Asn Gly Leu Thr 1855 1860 1865
eta gac aac cag gga caa eta aca tea acc get ggc gaa gta gaa 29434 Leu Asp Asn Gin Gly Gin Leu Thr Ser Thr Ala Gly Glu Val Glu 1870 1875 1880
cct cca etc act aac get aac aac aaa ett gca etg gtc tat age 29479 Pro Pro Leu Thr Asn Ala Asn Asn Lys Leu Ala Leu Val Tyr Ser 1885 1890 1895
gat cct tta gca gta aag cgc aac age eta acc tta tcg cac acc 29524 Asp Pro Leu Ala Val Lys Arg Asn Ser Leu Thr Leu Ser His Thr 1900 1905 1910
get ccc ett gtt att get gat aac tet tta gca ttg caa gtt tea 29569 Ala Pro Leu Val Ile Ala Asp Asn Ser Leu Ala Leu Gin Val Ser 1915 1920 1925
gag cct att ttt ata aat gac aag gac aaa eta gcc etg caa aca 29614 Glu Pro Ile Phe Ile Asn Asp Lys Asp Lys Leu Ala Leu Gin Thr 1930 1935 1940
gcc gcg ccc ett gta act aac get ggc acc ett cgc tta caa age 29659 Ala Ala Pro Leu Val Thr Asn Ala Gly Thr Leu Arg Leu Gin Ser 1945 1950 1955
gcc gcc cct tta ggc att gca gac caa acc eta aaa etc etg ttt 29704 Ala Ala Pro Leu Gly Ile Ala Asp Gin Thr Leu Lys Leu Leu Phe 1960 1965 1970
acc aac cct ttg tac ttg cag aat aac ttt etc acg tta gcc att 29749 Thr Asn Pro Leu Tyr Leu Gin Asn Asn Phe Leu Thr Leu Ala Ile 1975 1980 1985
gaa ega ccc ett gcc att acc aat act gga aag etg get eta cag 29794 Glu Arg Pro Leu Ala Ile Thr Asn Thr Gly Lys Leu Ala Leu Gin 1990 1995 2000
etc tcc cca ccg eta caa aca gca gac aca ggc ttg act ttg caa 29839 Leu Ser Pro Pro Leu Gin Thr Ala Asp Thr Gly Leu Thr Leu Gin 2005 2010 2015
acc aac gtg cca tta act gta age aac ggg acc eta ggc tta gcc 29884 Thr Asn Val Pro Leu Thr Val Ser Asn Gly Thr Leu Gly Leu Ala 2020 2025 2030 ata aag cgc cca ett att att cag gac aac aac ttg ttt ttg gac 29929 Ile Lys Arg Pro Leu Ile Ile Gin Asp Asn Asn Leu Phe Leu Asp 2035 2040 2045
ttc aga get ccc etg cgt ett ttc aac age gac cca gta eta ggg 29974 Phe Arg Ala Pro Leu Arg Leu Phe Asn Ser Asp Pro Val Leu Gly 2050 2055 2060
ett aac ttt tac acc cct ett gcg gta cgc gat gag gcg etc act 30019 Leu Asn Phe Tyr Thr Pro Leu Ala Val Arg Asp Glu Ala Leu Thr 2065 2070 2075
gtt aac aca ggc cgc ggc etc aca gtg agt tac gat ggt tta att 30064 Val Asn Thr Gly Arg Gly Leu Thr Val Ser Tyr Asp Gly Leu Ile 2080 2085 2090
tta aat ett ggt aag gat ett cgc ttt gac aac aac acc gtt tet 30109 Leu Asn Leu Gly Lys Asp Leu Arg Phe Asp Asn Asn Thr Val Ser 2095 2100 2105
gtc get ett agt get get ttg cct tta caa tac act gat cag ett 30154 Val Ala Leu Ser Ala Ala Leu Pro Leu Gin Tyr Thr Asp Gin Leu 2110 2115 2120
cgc ett aac gtg ggc get ggg etg cgt tac aat cca gtg agt aag 30199 Arg Leu Asn Val Gly Ala Gly Leu Arg Tyr Asn Pro Val Ser Lys 2125 2130 2135
aaa ttg gac gtg aac ccc aat caa aac aag ggt tta acc tgg gaa 30244 Lys Leu Asp Val Asn Pro Asn Gin Asn Lys Gly Leu Thr Trp Glu 2140 2145 2150
aat gac tac etc att gta aag eta gga aat gga tta ggt ttt gat 30289 Asn Asp Tyr Leu Ile Val Lys Leu Gly Asn Gly Leu Gly Phe Asp 2155 2160 2165
ggc gat gga aac ata get gtt tet cct caa gtt aca tcg cct gac 30334 Gly Asp Gly Asn Ile Ala Val Ser Pro Gin Val Thr Ser Pro Asp 2170 2175 2180
acc tta tgg acc act gcc gac cca tcc ccc aat tgt tcc atc tac 30379 Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys Ser Ile Tyr 2185 2190 2195
act gat tta gat gcc aaa atg tgg etc tcg ttg gta aaa caa ggg 30424 Thr Asp Leu Asp Ala Lys Met Trp Leu Ser Leu Val Lys Gin Gly 2200 2205 2210
ggt gtg gtt cac ggt tet gtt get tta aaa gca ttg aaa gga acc 30469 Gly Val Val His Gly Ser Val Ala Leu Lys Ala Leu Lys Gly Thr 2215 2220 2225
eta ttg agt cct acg gaa age gcc att gtt att ata eta eat ttt 30514 Leu Leu Ser Pro Thr Glu Ser Ala Ile Val Ile Ile Leu His Phe 2230 2235 2240
gac aat tat gga gtg ega att etc aat tat ccc act ttg ggc act 30559 Asp Asn Tyr Gly Val Arg Ile Leu Asn Tyr Pro Thr Leu Gly Thr 2245 2250 2255
caa ggc acg ttg gga aat aat gca act tgg ggt tat agg cag gga 30604 Gin Gly Thr Leu Gly Asn Asn Ala Thr Trp Gly Tyr Arg Gin Gly 2260 2265 2270 gaa tet gca gac act aat gta etc aat gca eta gca ttt atg ccc 30649 Glu Ser Ala Asp Thr Asn Val Leu Asn Ala Leu Ala Phe Met Pro 2275 2280 2285
agt tea aaa agg tac cca aga ggg cgt gga age gaa gtt cag aat 30694 Ser Ser Lys Arg Tyr Pro Arg Gly Arg Gly Ser Glu Val Gin Asn 2290 2295 2300
caa act gtg ggc tac act tgt ata cag ggt gac ttt tet atg ccc 30739 Gin Thr Val Gly Tyr Thr Cys Ile Gin Gly Asp Phe Ser Met Pro 2305 2310 2315
gta ccg tac caa ata cag tac aac tat gga cca act ggc tac tcc 30784 Val Pro Tyr Gin Ile Gin Tyr Asn Tyr Gly Pro Thr Gly Tyr Ser 2320 2325 2330
ttt aaa ttt att tgg aga act gtt tea aga caa cca ttt gac atc 30829 Phe Lys Phe Ile Trp Arg Thr Val Ser Arg Gin Pro Phe Asp Ile 2335 2340 2345
cca tgc tgt ttt ttc tet tac att acg gaa gaa taa aacaactttt 30875 Pro Cys Cys Phe Phe Ser Tyr Ile Thr Glu Glu
2350 2355
tctttttatt ttctttttat tttacacgca cagtaagget tcctccaccc ttccatctca 30935 cagcatacac cagcctctcc cccttcatgg cagtaaactg ttgtgagtca gtceggtatt 30995 tgggagttaa gatccaaaca gtctctttgg tgatgaaaca tggatccgtg atggacacaa 31055 atccctggga caggttctcc aaegtttegg taaaaaactg catgccgccc tacaaaacaa 31115 acaggttcag gctctccacg ggttatctcc ccgatcaaac tcagacagag taaaggtgcg 31175 atgatgttcc actaaaccac geaggtggeg ctgtctgaac ctctcggtgc gactcctgtg 31235 aggctggtaa gaagttagat tgtccagcag cctcacagca tggatcatca gtctacgagt 31295 gcgtctggcg cagcagcgca tctgaatctc actgagattc eggcaagaat cgcacaccat 31355 cacaatcagg ttgttcatga tcccatagct gaacacgctc cagccaaagc tcattcgctc 31415 caacagcgcc accgcgtgtc cgtccaacct tactttaaca taaatcaggt gtctgccgcg 31475 tacaaacatg ctacccgcat acagaacctc eeggggcaaa cccctgttca ccacctgcct 31535 gtaccaggga aacctcacat ttatcaggga gccatagata gccattttaa accaattagc 31595 taacaccgcc ccaccagctc tacactgaag agaaceggga gagttacaat gacagtgaat 31655 aatccatctc tcataacccc taatggtctg atggaaatcc agatctaacg tggcacagca 31715 gatacacact ttcatataca ttttcatcac atgtttttcc caggcegtta aaatacaatc 31775 ccaatacacg ggccactcct gcagtacaat aaagctaata caagatggta tactcctcac 31835 ctcactaaca ttgtgcatgt tcatattttc acattctaag taecgagagt tctcctctac 31895 aacagcactg ccgcggtcct cacaaggtgg tagctggtga cgattgtaag gagccagtct 31955 geagegatae cgtctgtcgc gttgcategt agaccaggga ccgacgcact tcctcgtact 32015 tgtagtagca gaaccacgtc cgctgccagc acgtctccaa gtaaegcegg tccctgcgtc 32075 gctcacgctc cctcctcaac gcaaagtgca accactcttg taatccacac agatccctct 32135 cggcctccgg ggcgatgcac acctcaaacc tacagatgtc tcggtacagt tccaaacacg 32195 tagtgagggc gagttccaac caagacagac agcctgatct atcccgacac actggaggtg 32255 gaggaagaca cggaagaggc atgttattcc aagcgattca ccaacgggtc gaaatgaaga 32315 tcccgaagat gacaacggtc gcctccggag ccctgatgga atttaacagc cagatcaaac 32375 attatgcgat tttccaggct atcaatcgcg gcctccaaaa gagcctggac ccgcacttcc 32435 acaaacacca gcaaagcaaa agcgttatta tcaaactctt cgatcatcaa gctgcaggac 32495 tgtacaatgc ccaagtaatt ttcatttctc cactcgcgaa tgatgtcgcg gcaaatagtc 32555 tgaaggttca tgccgtgcat attaaaaagc tccgaaaggg cgccctctat agccatgcgt 32615 agacacacca tcatgactgc aagatatcgg gctcctgaga cacctgcagc agatttaaca 32675 gacccaggtc aggttgctct ccgcgatcgc gaatctccat ccgcaaagtc atttgcaaat 32735 aattaaatag atctgcgccg actaaatctg ttaactccgc gctaggaact aaatcaggtg 32795 tggctacgca gcacaaaagt tccagggatg gcgccaaact cactagaacc gctcccgagt 32855 agcaaaactg atgaatggga gtaacacagt gtaaaatgtt cagccaaaaa tcactaagct 32915 gctcctttaa aaagtccagt acttctatat tcagttcgtg caagtactga agcaactgtg 32975 cgggaatatg cacagcaaaa aaaatagggc ggctcagata catgttgacc taaaataaaa 33035 agaatcatta aactaaagaa gcctggcgaa cggtgggata tatgacacgc tccagcagca 33095 ggcaagcaac cggctgtccc cgggaaccgc ggtaaaattc atccgaatga ttaaaaagaa 33155 caacagagac ttcccaccat gtactcggtt ggatctcctg agcacagagc aatacccccc 33215 tcacattcat atccgctaca gaaaaaaaac gtcccagata cccagcggga atatccaacg 33275 acagctgcaa agacagcaaa acaatccctc tgggagcaat cacaaaatcc tccggtgaaa 33335 aaagcacata catattagaa taaccctgtt gctggggcaa aaaggcccgt cgtcccagca 33395 aatgcacata aatatgttca tcagccattg ccccgtctta ccgcgtaaac agccacgaaa 33455 aaatcgagct aaaatccacc caacagccta tagctatata tacactccac ccaatgacgc 33515 taataccgca ccacccacga ccaaagttca cccacaccca caaaacccgc gaaaatccag 33575 cgccgtcagc acttccgcaa tttcagtctc acaacgtcac ttccgcgcgc cttttcactt 33635 tcccacacac gcccttcgcc cgcccgccct cgcgccaccc cgcgtcaccc cacgtcaccg 33695 cacgtcaccc cggccccgcc tcgctcctcc ccgctcatta tcatattggc acgtttccag 33755 aataaggtat attattgatg cagcaaaaca atccctctgg gagcaatcac aaaatcctcc 33815 ggtgaaaaaa gcacatacat attagaataa ccctgttgct ggggcaaaaa ggcccgtcgt 33875 cccagcaaat gcacataaat atgttcatca gccattgccc cgtcttaccg cgtaaacagc 33935 cacgaaaaaa tcgagctaaa atccacccaa cagcctatag ctatatatac actccaccca 33995 atgacgctaa taccgcacca cccacgacca aagttcaccc acacccacaa aacccgcgaa 34055 aatccagcgc cgtcagcact tccgcaattt cagtctcaca acgtcacttc cgcgcgcctt 34115 ttcactttcc cacacacgcc cttcgcccgc ccgccctcgc gccaccccgc gtcaccccac 34175 gtcaccgcac gtcaccccgg ccccgcctcg ctcctccccg ctcattatca tattggcacg 34235 tttccagaat aaggtatatt attgatgca 34264
<210> 25
<211> 503
<212> PRT
<213> simian adenovirus SV-1
<400> 25
Met Arg Arg Ala Val Arg Val Thr Pro Ala Ala Tyr Glu Gly Pro Pro 15 10 15
Pro Ser Tyr Glu Ser Val Met Gly Ser Ala Asn Val Pro Ala Thr Leu 20 25 30
Glu Ala Pro Tyr Val Pro Pro Arg Tyr Leu Gly Pro Thr Glu Gly Arg 35 40 45
Asn Ser Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Lys 50 55 60
Val Tyr Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu Asn Tyr 65 70 75 80
Gin Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gin Asn Asn Asp 85 90 95
Phe Thr Pro Thr Glu Ala Gly Thr Gin Thr Ile Asn Phe Asp Glu Arg 100 105 110
Ser Arg Trp Gly Gly Gin Leu Lys Thr Ile Leu His Thr Asn Met Pro 115 120 125
Asn Ile Asn Glu Phe Met Ser Thr Asn Lys Phe Arg Ala Arg Leu Met 130 135 140
Val Lys Lys Ala Glu Asn Gin Pro Pro Glu Tyr Glu Trp Phe Glu Phe 145 150 155 160 Thr Ile Pro Glu Gly Asn Tyr Ser Glu Thr Met Thr Ile Asp Leu Met 165 170 175
Asn Asn Ala Ile Val Asp Asn Tyr Leu Gin Val Gly Arg Gin Asn Gly 180 185 190
Val Leu Glu Ser Asp Ile Gly Val Lys Phe Asp Thr Arg Asn Phe Arg 195 200 205
Leu Gly Trp Asp Pro Val Thr Lys Leu Val Met Pro Gly Val Tyr Thr 210 215 220
Asn Glu Ala Phe His Pro Asp Ile Val Leu Leu Pro Gly Cys Gly Val 225 230 235 240
Asp Phe Thr Gin Ser Arg Leu Ser Asn Leu Leu Gly Ile Arg Lys Arg 245 250 255
Arg Pro Phe Gin Glu Gly Phe Gin Ile Met Tyr Glu Asp Leu Glu Gly 260 265 270
Gly Asn Ile Pro Gly Leu Leu Asp Val Pro Ala Tyr Glu Glu Ser Val 275 280 285
Lys Gin Ala Glu Ala Gin Gly Arg Glu Ile Arg Gly Asp Thr Phe Ala 290 295 300
Thr Glu Pro His Glu Leu Val Ile Lys Pro Leu Glu Gin Asp Ser Lys 305 310 315 320
Lys Arg Ser Tyr Asn Ile Ile Ser Gly Thr Met Asn Thr Leu Tyr Arg 325 330 335
Ser Trp Phe Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg 340 345 350
Ser Trp Thr Ile Leu Thr Thr Thr Asp Val Thr Cys Gly Ser Gin Gin 355 360 365
Val Tyr Trp Ser Leu Pro Asp Met Met Gin Asp Pro Val Thr Phe Arg 370 375 380
Pro Ser Thr Gin Val Ser Asn Phe Pro Val Val Gly Thr Glu Leu Leu 385 390 395 400
Pro Val His Ala Lys Ser Phe Tyr Asn Glu Gin Ala Val Tyr Ser Gin 405 410 415 Leu Ile Arg Gin Ser Thr Ala Leu Thr His Val Phe Asn Arg Phe Pro 420 425 430
Glu Asn Gin Ile Leu Val Arg Pro Pro Ala Pro Thr Ile Thr Thr Val 435 440 445
Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg 450 455 460
Ser Ser Ile Ser Gly Val Gin Arg Val Thr Ile Thr Asp Ala Arg Arg 465 470 475 480
Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly Val Val Ala Pro Lys 485 490 495
Val Leu Ser Ser Arg Thr Phe
500
<210> 26
<211> 931
<212> PRT
<213> simian adenovirus SV-1
<400> 26
Met Ala Thr Pro Ser Met Met Pro Gin Trp Ser Tyr Met His Ile Ala 15 10 15
Gly Gin Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gin Phe Ala 20 25 30
Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro 35 40 45
Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gin Arg Leu 50 55 60
Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Thr Ala Tyr Ser Tyr 65 70 75 80
Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met 85 90 95
Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Ser 100 105 110 Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly 115 120 125
Ala Pro Asn Pro Ala Glu Trp Thr Asn Ser Asp Ser Lys Val Lys Val 130 135 140
Arg Ala Gin Ala Pro Phe Val Ser Ser Tyr Gly Ala Thr Ala Ile Thr 145 150 155 160
Lys Glu Gly Ile Gin Val Gly Val Thr Leu Thr Asp Ser Gly Ser Thr 165 170 175
Pro Gin Tyr Ala Asp Lys Thr Tyr Gin Pro Glu Pro Gin Ile Gly Glu 180 185 190
Leu Gin Trp Asn Ser Asp Val Gly Thr Asp Asp Lys Ile Ala Gly Arg 195 200 205
Val Leu Lys Lys Thr Thr Pro Met Phe Pro Cys Tyr Gly Ser Tyr Ala 210 215 220
Arg Pro Thr Asn Glu Lys Gly Gly Gin Ala Thr Pro Ser Ala Ser Gin 225 230 235 240
Asp Val Gin Asn Pro Glu Leu Gin Phe Phe Ala Ser Thr Asn Val Ala 245 250 255
Asn Thr Pro Lys Ala Val Leu Tyr Ala Glu Asp Val Ser Ile Glu Ala 260 265 270
Pro Asp Thr His Leu Val Phe Lys Pro Thr Val Thr Glu Gly Ile Thr 275 280 285
Ser Ser Glu Ala Leu Leu Thr Gin Gin Ala Ala Pro Asn Arg Pro Asn 290 295 300
Tyr Ile Ala Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser 305 310 315 320
Thr Gly Asn Met Gly Val Leu Ala Gly Gin Ala Ser Gin Leu Asn Ala 325 330 335
Val Val Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser Tyr Gin Leu Met 340 345 350
Leu Asp Ala Leu Gly Asp Arg Ser Arg Tyr Phe Ser Met Trp Asn Gin 355 360 365 Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly 370 375 380
Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Gly Gly Met Ala 385 390 395 400
Val Thr Asp Thr Tyr Ser Pro Ile Lys Val Asn Gly Gly Gly Asn Gly 405 410 415
Trp Glu Ala Asn Asn Gly Val Phe Thr Glu Arg Gly Val Glu Ile Gly 420 425 430
Ser Gly Asn Met Phe Ala Met Glu Ile Asn Leu Gin Ala Asn Leu Trp 435 440 445
Arg Ser Phe Leu Tyr Ser Asn Ile Gly Leu Tyr Leu Pro Asp Ser Leu 450 455 460
Lys Ile Thr Pro Asp Asn Ile Thr Leu Pro Glu Asn Lys Asn Thr Tyr 465 470 475 480
Gin Tyr Met Asn Gly Arg Val Thr Pro Pro Gly Leu Val Asp Thr Tyr 485 490 495
Val Asn Val Gly Ala Arg Trp Ser Pro Asp Val Met Asp Ser Ile Asn 500 505 510
Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met Leu 515 520 525
Leu Gly Asn Gly Arg Tyr Val Pro Phe His Ile Gin Val Pro Gin Lys 530 535 540
Phe Phe Ala Ile Lys Asn Leu Leu Leu Leu Pro Gly Ser Tyr Thr Tyr 545 550 555 560
Glu Trp Asn Phe Arg Lys Asp Val Asn Met Ile Leu Gin Ser Ser Leu 565 570 575
Gly Asn Asp Leu Arg Val Asp Gly Ala Ser Ile Arg Phe Asp Ser Ile 580 585 590
Asn Leu Tyr Ala Asn Phe Phe Pro Met Ala His Asn Thr Ala Ser Thr 595 600 605
Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp Gin Ser Phe Asn Asp 610 615 620 Tyr Leu Cys Ala Ala Asn Met Leu Tyr Pro Ile Pro Ala Asn Ala Thr 625 630 635 640
Ser Val Pro Ile Ser Ile Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly 645 650 655
Trp Ser Phe Thr Arg Leu Lys Thr Lys Glu Thr Pro Ser Leu Gly Ser 660 665 670
Gly Phe Asp Pro Tyr Phe Val Tyr Ser Gly Ser Ile Pro Tyr Leu Asp 675 680 685
Gly Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser Ile Met Phe 690 695 700
Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro Asn 705 710 715 720
Glu Phe Glu Ile Lys Arg Ser Val Asp Gly Glu Gly Tyr Asn Val Ala 725 730 735
Gin Ser Asn Met Thr Lys Asp Trp Phe Leu Ile Gin Met Leu Ser His 740 745 750
Tyr Asn Ile Gly Tyr Gin Gly Phe Tyr Val Pro Glu Asn Tyr Lys Asp 755 760 765
Arg Met Tyr Ser Phe Phe Arg Asn Phe Gin Pro Met Ser Arg Gin Ile 770 775 780
Val Asp Ser Thr Ala Tyr Thr Asn Tyr Gin Asp Val Lys Leu Pro Tyr 785 790 795 800
Gin His Asn Asn Ser Gly Phe Val Gly Tyr Met Gly Pro Thr Met Arg 805 810 815
Glu Gly Gin Ala Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu Ile Gly Ala 820 825 830
Thr Ala Val Pro Ser Leu Thr Gin Lys Lys Phe Leu Cys Asp Arg Val 835 840 845
Met Trp Arg Ile Pro Phe Ser Ser Asn Phe Met Ser Met Gly Ser Leu 850 855 860
Thr Asp Leu Gly Gin Asn Met Leu Tyr Ala Asn Ser Ala His Ala Leu 865 870 875 880 Asp Met Thr Phe Glu Val Asp Pro Met Asp Glu Pro Thr Leu Leu Tyr 885 890 895
Val Leu Phe Glu Val Phe Asp Val Val Arg Ile His Gin Pro His Arg 900 905 910
Gly Val Ile Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly Asn 915 920 925
Ala Thr Thr
930
<210> 27
<211> 363
<212> PRT
<213> simian adenovirus SV-1
<400> 27
Met Lys Arg Thr Arg Val Asp Glu Asp Phe Asn Pro Val Tyr Pro Tyr 15 10 15
Asp Thr Thr Thr Thr Pro Ala Val Pro Phe Ile Ser Pro Pro Phe Val 20 25 30
Asn Ser Asp Gly Leu Gin Glu Asn Pro Pro Gly Val Leu Ser Leu Arg 35 40 45
Ile Ala Lys Pro Leu Tyr Phe Asp Met Glu Arg Lys Leu Ala Leu Ser 50 55 60
Leu Gly Arg Gly Leu Thr Ile Thr Ala Ala Gly Gin Leu Glu Ser Thr 65 70 75 80
Gin Ser Val Gin Thr Asn Pro Pro Leu Ile Ile Thr Asn Asn Asn Thr 85 90 95
Leu Thr Leu Arg His Ser Pro Pro Leu Asn Leu Thr Asp Asn Ser Leu 100 105 110
Val Leu Gly Tyr Ser Ser Pro Leu Arg Val Thr Asp Asn Lys Leu Thr 115 120 125
Phe Asn Phe Thr Ser Pro Leu Arg Tyr Glu Asn Glu Asn Leu Thr Phe 130 135 140 Asn Tyr Thr Glu Pro Leu Lys Leu Ile Asn Asn Ser Leu Ala Ile Asp 145 150 155 160
Ile Asn Ser Ser Lys Gly Leu Ser Ser Val Gly Gly Ser Leu Ala Val 165 170 175
Asn Leu Ser Ser Asp Leu Lys Phe Asp Ser Asn Gly Ser Ile Ala Phe 180 185 190
Gly Ile Gin Thr Leu Trp Thr Ala Pro Thr Ser Thr Gly Asn Cys Thr 195 200 205
Val Tyr Ser Glu Gly Asp Ser Leu Leu Ser Leu Cys Leu Thr Lys Cys 210 215 220
Gly Ala His Val Leu Gly Ser Val Ser Leu Thr Gly Leu Thr Gly Thr 225 230 235 240
Ile Thr Gin Met Thr Asp Ile Ser Val Thr Ile Gin Phe Thr Phe Asp 245 250 255
Asn Asn Gly Lys Leu Leu Ser Ser Pro Leu Ile Asn Asn Ala Phe Ser 260 265 270
Ile Arg Gin Asn Asp Ser Thr Ala Ser Asn Pro Thr Tyr Asn Ala Leu 275 280 285
Ala Phe Met Pro Asn Ser Thr Ile Tyr Ala Arg Gly Gly Gly Gly Glu 290 295 300
Pro Arg Asn Asn Tyr Tyr Val Gin Thr Tyr Leu Arg Gly Asn Val Gin 305 310 315 320
Lys Pro Ile Ile Leu Thr Val Thr Tyr Asn Ser Val Ala Thr Gly Tyr 325 330 335
Ser Leu Ser Phe Lys Trp Thr Ala Leu Ala Arg Glu Lys Phe Ala Thr 340 345 350
Pro Thr Thr Ser Phe Cys Tyr Ile Thr Glu Gin 355 360
<210> 28
<211> 560
<212> PRT
<213> simian adenovirus SV-1
<400> 28
Met Lys Arg Ala Arg Val Asp Glu Asp Phe Asn Pro Val Tyr Pro Tyr 15 10 15
Asp Pro Pro His Ala Pro Val Met Pro Phe Ile Thr Pro Pro Phe Thr 20 25 30
Ser Ser Asp Gly Leu Gin Glu Lys Pro Leu Gly Val Leu Ser Leu Asn 35 40 45
Tyr Arg Asp Pro Ile Thr Thr Gin Asn Glu Ser Leu Thr Ile Lys Leu 50 55 60
Gly Asn Gly Leu Thr Leu Asp Asn Gin Gly Gin Leu Thr Ser Thr Ala 65 70 75 80
Gly Glu Val Glu Pro Pro Leu Thr Asn Ala Asn Asn Lys Leu Ala Leu 85 90 95
Val Tyr Ser Asp Pro Leu Ala Val Lys Arg Asn Ser Leu Thr Leu Ser 100 105 110
His Thr Ala Pro Leu Val Ile Ala Asp Asn Ser Leu Ala Leu Gin Val 115 120 125
Ser Glu Pro Ile Phe Ile Asn Asp Lys Asp Lys Leu Ala Leu Gin Thr 130 135 140
Ala Ala Pro Leu Val Thr Asn Ala Gly Thr Leu Arg Leu Gin Ser Ala 145 150 155 160
Ala Pro Leu Gly Ile Ala Asp Gin Thr Leu Lys Leu Leu Phe Thr Asn 165 170 175
Pro Leu Tyr Leu Gin Asn Asn Phe Leu Thr Leu Ala Ile Glu Arg Pro 180 185 190
Leu Ala Ile Thr Asn Thr Gly Lys Leu Ala Leu Gin Leu Ser Pro Pro 195 200 205
Leu Gin Thr Ala Asp Thr Gly Leu Thr Leu Gin Thr Asn Val Pro Leu 210 215 220
Thr Val Ser Asn Gly Thr Leu Gly Leu Ala Ile Lys Arg Pro Leu Ile 225 230 235 240 Ile Gin Asp Asn Asn Leu Phe Leu Asp Phe Arg Ala Pro Leu Arg Leu 245 250 255
Phe Asn Ser Asp Pro Val Leu Gly Leu Asn Phe Tyr Thr Pro Leu Ala 260 265 270
Val Arg Asp Glu Ala Leu Thr Val Asn Thr Gly Arg Gly Leu Thr Val 275 280 285
Ser Tyr Asp Gly Leu Ile Leu Asn Leu Gly Lys Asp Leu Arg Phe Asp 290 295 300
Asn Asn Thr Val Ser Val Ala Leu Ser Ala Ala Leu Pro Leu Gin Tyr 305 310 315 320
Thr Asp Gin Leu Arg Leu Asn Val Gly Ala Gly Leu Arg Tyr Asn Pro 325 330 335
Val Ser Lys Lys Leu Asp Val Asn Pro Asn Gin Asn Lys Gly Leu Thr 340 345 350
Trp Glu Asn Asp Tyr Leu Ile Val Lys Leu Gly Asn Gly Leu Gly Phe 355 360 365
Asp Gly Asp Gly Asn Ile Ala Val Ser Pro Gin Val Thr Ser Pro Asp 370 375 380
Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys Ser Ile Tyr Thr 385 390 395 400
Asp Leu Asp Ala Lys Met Trp Leu Ser Leu Val Lys Gin Gly Gly Val 405 410 415
Val His Gly Ser Val Ala Leu Lys Ala Leu Lys Gly Thr Leu Leu Ser 420 425 430
Pro Thr Glu Ser Ala Ile Val Ile Ile Leu His Phe Asp Asn Tyr Gly 435 440 445
Val Arg Ile Leu Asn Tyr Pro Thr Leu Gly Thr Gin Gly Thr Leu Gly 450 455 460
Asn Asn Ala Thr Trp Gly Tyr Arg Gin Gly Glu Ser Ala Asp Thr Asn 465 470 475 480
Val Leu Asn Ala Leu Ala Phe Met Pro Ser Ser Lys Arg Tyr Pro Arg 485 490 495 Gly Arg Gly Ser Glu Val Gin Asn Gin Thr Val Gly Tyr Thr Cys Ile 500 505 510
Gin Gly Asp Phe Ser Met Pro Val Pro Tyr Gin Ile Gin Tyr Asn Tyr 515 520 525
Gly Pro Thr Gly Tyr Ser Phe Lys Phe Ile Trp Arg Thr Val Ser Arg 530 535 540
Gin Pro Phe Asp Ile Pro Cys Cys Phe Phe Ser Tyr Ile Thr Glu Glu 545 550 555 560
<210> 29
<211> 31044
<212> DNA
<213> simian adenovirus SV-25
<220>
<221> CDS
<222> (12284)..(13801)
<223> Penton
<220>
<221> CDS
<222> (16681)..(19446)
<223> Hexon
<220>
<221> CDS
<222> (25380)..(26423)
<223> Fiber #2
<220>
<221> CDS
<222> (26457)..(28136)
<223> Fiber #1
<400> 29
catcatcaat aatatacctt attctggaaa cgtgccaata tgataatgag cggggaggag 60 cgaggcgggg ccggggtgac gtgcggtgac gcggggtggc gcgagggcgg ggcgaagggc 120 gcgggtgtgt gtgtgggagg cgcttagttt ttacgtatgc ggaaggaggt tttataccgg 180 aagatgggta atttgggcgt atacttgtaa gttttgtgta atttggcgcg aaaactgggt 240 aatgaggaag ttgaggttaa tatgtacttt ttatgactgg gcggaatttc tgctgatcag 300 cagtgaactt tgggcgctga cggggaggtt tcgctacgtg acagtaccac gagaaggctc 360 aaaggtccca tttattgtac tcttcagcgt tttcgctggg tatttaaacg ctgtcagatc 420 atcaagaggc cactcttgag tgctggcgag aagagttttc tcctccgtgc tgccacgatg 480 aggctggtcc ccgagatgta cggtgttttt agcgacgaga cggtgcgtaa ctcagatgac 540 ctgctgaatt cagacgcgct ggaaatttcc aattcgcctg tgctttcgcc gccgtcactt 600 cacgacctgt ttgtgttttg gctcaacgct tagcaacgtg ttatataggg tcaagaagga 660 gcaggagacg cagtttgcta ggctgttggc cgatactcct ggagtttttg tggctctgga 720 tctaggccat cactctcttt tccaagagaa aattatcaaa aacttaactt ttacgtctcc 780 tggtcgcacg gttgcttccg ctgcctttat tacctatatt ttggatcaat ggagcaacag 840 cgacagccac ctgtcgtggg agtacatgct ggattacatg tcgatggcgc tgtggagggc 900 catgctgcgg aggagggttt gcatttactt gcgggcgcag cctccgcggc tggaccgagt 960 ggaggaggag gacgagccgg gggagaccga gaacctgagg gccgggctgg accctccaac 1020 ggaggactag gtgctgagga tgatcccgaa gaggggacta gtggggctag gaagaagcaa 1080 aagactgagt ctgaacctcg aaactttttg aatgagttga ctgtgagttt gatgaatcgt 1140 cagcgtccgg agacaatttt ctggtctgaa ttggaggagg aattcaggag gggggaactg 1200 aacctgctat acaagtatgg gtttgaacag ttaaaaactc actggttgga gccgtgggag 1260 gattttgaaa ccgccttgga cacttttgct aaagtggctc tgcggccgga taaggtttac 1320 actatccgcc gcactgttaa cataaagaag agtgtttatg ttataggcca tggagctctg 1380 gtgcaggtgc aaaccgtcga ccgggtggcc tttagttgcg gtatgcaaaa tctgggcccc 1440 ggggtgatag gcttaaatgg tgtaacattt cacaatgtaa ggtttactgg tgaaagtttt 1500 aacggctctg tgtttgcaaa taacacacag ctgacgctcc acggcgttta cttttttaac 1560 tttaataaca catgtgtgga gtcgtggggc agggtgtctt tgaggggctg ctgttttcac 1620 ggctgctgga aggcggtggt gggaagactt aaaagtgtaa catctgtaaa aaaatgcgtg 1680 tttgagcggt gtgtgttggc tttaactgtg gagggctgtg gacgcattag gaataatgcg 1740 gcgtctgaga atggatgttt tcttttgcta aaaggcacgg ctagtattaa gcataacatg 1800 atatgcggca gcggtctgta cccttcacag ctgttaactt gcgcggatgg aaactgtcag 1860 accttgcgca ccgtgcacat agcgtcccac cagcgccgcg cctggccaac attcgagcac 1920 aatatgctta tgcgttgtgc cgtccacttg ggccctaggc gaggcgtgtt tgtgccttac 1980 cagtgtaact ttagccatac caagatttta ctagaacctg ataccttctc tcgagtgtgt 2040 ttcaatgggg tgtttgacat gtcaatggaa ctgtttaaag tgataagata tgatgaatcc 2100 aagtctcgtt gtcgcccatg tgaatgcgga gctaatcatc tgaggttgta tcctgtaacc 2160 ctaaacgtta ccgaggagct gaggacggat caccacatgt tgtcctgcct gcgcaccgac 2220 tatgaatcca gcgacgagga gtgaggtgag gggcggagcc acaaagggta taaaggggcg 2280 tgaggggtgg gtgtgatgat tcaaaatgag cgggacgacg gacggcaacg cgtttgaggg 2340 tggagtgttc agcccttatc tgacatctcg tcttccttcc tgggcaggag tgcgtcagaa 2400 tgtagtgggc tccaccgtgg acggacgacc ggtcgcccct gcaaattccg ccaccctcac 2460 ctatgccacc gtgggatcat cgttggacac tgccgcggca gctgccgctt ctgctgccgc 2520 ttctactgct cgcggcatgg cggctgattt tggactgtat aaccaactgg ccactgcagc 2580 tgtggcgtct cggtctctgg ttcaagaaga tgccctgaat gtgatcctga ctcgcctgga 2640 gatcatgtca cgtcgcttgg acgaactggc tgcgcagata tcccaagcta accccgatac 2700 cacttcagaa tcctaaaata aagacaaaca aatatgttga aaagtaaaat ggctttattt 2760 gttttttttg gctcggtagg ctcgggtcca cctgtctcgg tcgttaagaa ctttgtgtat 2820 gttttccaaa acacggtaca gatgggcttg gatgttcaag tacatgggca tgaggccatc 2880 tttggggtga agataggacc attgaagagc gtcatgctcc ggggtggtgt tgtaaattac 2940 ccagtcgtag cagggtttct gggcgtggaa ctggaagatg tcctttagga gtaggctgat 3000 ggccaagggc aggcccttag tgtaggtgtt tacaaagcgg ttaagctggg agggatgcat 3060 gcggggggag atgatatgca tcttggcttg gatcttgagg ttagctatgt taccacccag 3120 gtctctgcgg gggttcatgt tatgaaggac caccagcacg gtgtagccgg tgcatttggg 3180 gaacttgtca tgcagtttgg aggggaaggc gtggaagaat ttagagaccc ccttgtggcc 3240 ccctaggttt tccatgcact catccataat gatggcaatg ggacccctgg cggccgcttt 3300 ggcaaacacg ttttgggggt tggaaacatc atagttttgc tctagagtga gctcatcata 3360 ggccatctta acaaagcggg gtaggagggt gcccgactgg gggatgatag ttccatctgg 3420 gcctggggcg tagttaccct cacagatctg catctcccag gccttaattt ccgagggggg 3480 tatcatgtcc acctgggggg caataaagaa cacggtttct ggcgggggat tgatgagctg 3540 ggtggaaagc aagttacgca gcagttgaga tttgccacag ccggtggggc cgtagatgac 3600 cccgatgacg ggttgcagct ggtagttgag agaggaacag ctgccgtcgg ggcgcaggag 3660 gggggctacc tcattcatca tgcttctaac atgtttattt tcactcacta agttttgcaa 3720 gagcctctcc ccacccaggg ataagagttc ttccaggctg ttgaagtgtt tcagcggttt 3780 taggccgtcg gccatgggca tcttttcgag cgactgacga agcaagtaca gtcggtccca 3840 gagctcggtg acgtgctcta tggaatctcg atccagcaga cttcttggtt gcgggggttg 3900 ggtcgacttt cgctgtaggg caccagccgg tgggcgtcca gggccgcgag ggttctgtcc 3960 ttccagggtc tcagcgtccg ggtgagggtg gtctcggtga cggtgaaggg atgagccccg 4020 ggctgggcgc ttgcgagggt gcgcttcagg ctcatcctgc tggtgctgaa gcggacgtcg 4080 tctccctgtg agtcggccag atagcaacga agcatgaggt cgtagctgag ggactcggcc 4140 gcgtgtccct tggcgcgcag ctttcccttg gaaacgtgct gacatttggt gcagtgcaga 4200 cattggaggg cgtagagttt gggggccagg aagaccgact cgggcgagta ggcgtcggct 4260 ccgcactgag cgcagacggt ctcgcactcc actagccacg tgagctcggg tttagcggga 4320 tcaaaaacca agttgcctcc attttttttg atgcgtttct taccttgcgt ttccatgagt 4380 ttgtggcccg cttccgtgac aaaaaggctg tcggtgtctc cgtagacaga cttgaggggg 4440 cgatcttcca aaggtgttcc gaggtcttcc gcgtacagga actgggacca ctccgagacg 4500 aaggctctgg tccaggctaa cacgaaggag gcaatctgcg aggggtatct gtcgttttca 4560 atgagggggt ccaccttttc cagggtgtgc agacacaggt cgtcctcctc cgcgtccacg 4620 aaggtgattg gcttgtaagt gtaggtcacg tgatctgcac cccccaaagg ggtataaaag 4680 ggggcgtgcc caccctctcc gtcactttct tccgcatcgc tgtggaccag agccagctgt 4740 tcgggtgagt aggccctctc aaaagccggc atgatctcgg cgctcaagtt gtcagtttct 4800 acaaacgagg tggatttgat attcacgtgc cccgcggcga tgcttttgat ggtggagggg 4860 tccatctgat cagaaaacac gatctttttg ttgtcaagtt tggtggcgaa agacccgtag 4920 agggcgttgg aaagcaactt ggcgatggag cgcagggtct gatttttctc ccgatcggcc 4980 ctctccttgg cggcgatgtt gagttgcacg tactcccggg ccgcgcaccg ccactcgggg 5040 aacacggcgg tgcgctcgtc gggcaggatg cgcacgcgcc agccgcgatt gtgcagggtg 5100 atgaggtcca cgctggtagc cacctccccg cggaggggct cgttggtcca acacaatcgc 5160 cccccttttc tggagcagaa cggaggcagg ggatctagca agttggcggg cggggggtcg 5220 gcgtcgatgg tgaagatacc gggtagcagg atcttattaa aataatcgat ttcggtgtcc 5280 gtgtcttgca acgcgtcttc ccacttcttc accgccaggg ccctttcgta gggattcagg 5340 ggcggtcccc agggcatggg gtgggtcagg gccgaggcgt acatgccgca gatgtcatac 5400 acgtacaggg gttccctcaa caccccgatg taagtggggt aacagcgccc cccgcggatg 5460 ctggctcgca cgtagtcgta catctcgcgc gagggagcca tgaggccgtc tcccaagtgg 5520 gtcttgtggg gtttttcggc ccggtagagg atctgtctga agatggcgtg ggagttggaa 5580 gagatggtgg ggcgttggaa gacgttaaag ttggccccgg gtagtcccac ggagtcttgg 5640 atgaactggg cgtaggattc ccggagtttg tccaccaggg cggcggtcac cagcacgtcg 5700 agagcgcagt agtccaacgt ctcgcggacc aggttgtagg ccgtctcttg ttttttctcc 5760 cacagttcgc ggttgaggag gtattcctcg cggtctttcc agtactcttc ggcgggaaat 5820 cctttttcgt ccgctcggta agaacctaac atgtaaaatt cgttcaccgc tttgtatgga 5880 caacagcctt tttctaccgg cagggcgtac gcttgagcgg cctttctgag agaggtgtgg 5940 gtgagggcga aggtgtcccg caccatcact ttcaggtact gatgtttgaa gtccgtgtcg 6000 tcgcaggcgc cctgttccca cagcgtgaag tcggtgcgct ttttctgcct gggattgggg 6060 agggcgaagg tgacatcgtt aaagagtatt ttcccggcgc ggggcatgaa gttgcgagag 6120 atcctgaagg gcccgggcac gtccgagcgg ttgttgatga cctgcgccgc caggacgatc 6180 tcgtcgaagc cgttgatgtt gtgacccacg atgtaaagtt cgatgaagcg cggctgtccc 6240 ttgagggccg gcgctttttt caactcctcg taggtgagac agtccggcga ggagagaccc 6300 agctcagccc gggcccagtc ggagagttga ggattagccg caaggaagga gctccataga 6360 tccaaggcca ggagagtttg caagcggtcg cggaactcgc ggaacttttt ccccacggcc 6420 attttctccg gtgtcactac gtaaaaggtg ttggggcggt tgttccacac gtcccatcgg 6480 agctctaggg ccagctcgca ggcttggcga acgagggtct cctcgccaga gacgtgcatg 6540 accagcataa agggtaccaa ctgtttcccg aacgagccca tccatgtgta ggtttctacg 6600 tcgtaggtga caaagagccg ctgggtgcgc gcgtgggagc cgatcggaaa gaagctgatc 6660 tcctgccacc agctggagga atgggtgtta atgtggtgga agtagaagtc ccgccggcgc 6720 acagagcatt cgtgctgatg tttgtaaaag cgaccgcagt agtcgcagcg ctgcacgctc 6780 tgtatctcct gaacgagatg cgcttttcgc ccgcgcacca gaaaccggag ggggaagttg 6840 agacgggggg ctggtggggc gacatcccct tcgccttggc ggtgggagtc tgcgtctgcg 6900 tcctccttct ctgggtggac gacggtgggg acgacgacgc cccgggtgcc gcaagtccag 6960 atctccgcca cggaggggtg caggcgctgc aggaggggac gcagctgccc gctgtccagg 7020 gagtcgaggg aagtcgcgct gaggtcggcg ggaagcgttt gcaagttcac tttcagaaga 7080 ccggtaagag cgtgagccag gtgcagatgg tacttgattt ccaggggggt gttggatgaa 7140 gcgtccacgg cgtagaggag tccgtgtccg cgcggggcca ccaccgtgcc ccgaggaggt 7200 tttatctcac tcgtcgaggg cgagcgccgg ggggtagagg cggctctgcg ccggggggca 7260 gcggaggcag aggcacgttt tcgtgaggat tcggcagcgg ttgatgacga gcccggagac 7320 tgctggcgtg ggcgacgacg cggcggttga ggtcctggat gtgccgtctc tgcgtgaaga 7380 ccaccggccc ccgggtcctg aacctaaaga gagttccaca gaatcaatgt ctgcatcgtt 7440 aacggcggcc tgcctgagga tctcctgcac gtcgcccgag ttgtcctgat aggcgatctc 7500 ggccatgaac tgttccactt cttcctcgcg gaggtcaccg tggcccgctc gctccacggt 7560 ggcggccagg tcgttggaga tgcggcgcat gagttgagag aaggcgttga ggccgttctc 7620 gttccacacg cggctgtaca ccacgtttcc gaaggagtcg cgcgctcgca tgaccacctg 7680 ggccacgttg agttccacgt ggcgggcgaa gacggcgtag tttctgaggc gctggaagag 7740 gtagttgagc gtggtggcga tgtgctcgca gacgaagaag tacataatcc agcgccgcag 7800 ggtcatctcg ttgatgtctc cgatggcttc gagacgctcc atggcctcgt agaagtcgac 7860 ggcgaagttg aaaaattggg agttgcgggc ggccaccgtg agttcttctt gcaggaggcg 7920 gatgagatcg gcgaccgtgt cgcgcacctc ctgttcgaaa gcgccccgag gcgcctctgc 7980 ttcttcctcc ggctcctcct cttccagggg ctcgggttcc tccggcagct ctgcgacggg 8040 gacggggcgg cgacgtcgtc gtctgaccgg caggcggtcc acgaagcgct cgatcatttc 8100 gccgcgccgg cgacgcatgg tctcggtgac ggcgcgtccg ttttcgcgag gtcgcagttc 8160 gaagacgccg ccgcgcagag cgcccccgtg cagggagggt aagtggttag ggccgtcggg 8220 cagggacacg gcgctgacga tgcattttat caattgctgc gtaggcactc cgtgcaggga 8280 tctgagaacg tcgaggtcga cgggatccga gaacttctct aggaaagcgt ctatccaatc 8340 gcaatcgcaa ggtaagctga gaacggtggg tcgctggggg gcgttcgcgg gcagttggga 8400 ggtgatgctg ctgatgatgt aattaaagta ggcggtcttc aggcggcgga tggtggcgag 8460 gaggaccacg tctttgggcc cggcctgttg aatgcgcagg cgctcggcca tgccccaggc 8520 ctcgctctga cagcgacgca ggtctttgta gaagtcttgc atcagtctct ccaccggaac 8580 ctctgcttct cccctgtctg ccatgcgagt cgagccgaac ccccgcaggg gctgcagcaa 8640 cgctaggtcg gccacgaccc tttcggccag cacggcctgt tgaatctgcg tgagggtggc 8700 ctggaagtcg tccaggtcca cgaagcggtg ataggccccc gtgttgatgg tgtaggtgca 8760 gttggccatg acggaccagt tgacgacttg catgccgggt tgggtgatct ccgtgtactt 8820 gaggcgcgag taggccctgg actcgaacac gtagtcgttg catgtgcgca ccagatactg 8880 gtagccgacc aggaagtgag gaggcggctc tcggtacagg ggccagccaa cggtggcggg 8940 ggcgccgggg gacaggtcgt ccagcatgag gcggtggtag tggtagatgt agcgggagag 9000 ccaggtgatg ccggccgagg tggttgcggc cctggtgaat tcgcggacgc ggttccagat 9060 gttgcgcagg ggaccaaagc gctccatggt gggcacgctc tgccccgtga ggcgggcgca 9120 atcttgtacg ctctagatgg aaaaaagaca gggcggtcat cgactccttt ccgtagcttg 9180 gggggtaaag tcgcaagggt gcggcggcgg ggaaccccgg ttcgagaccg gccggatccg 9240 ccgctcccga tgcgcctggc cccgcatcca cgacgtccgc gccgagaccc agccgcgacg 9300 ctccgcccca atacggaggg gagtcttttg gtgttttttc gtagatgcat ccggtgctgc 9360 ggcagatgcg accccagacg cccactacca ccgccgtggc ggcagtaaac ctgagcggag 9420 gcggtgacag ggaggaggaa gagctggctt tagacctgga agagggagag gggctggccc 9480 ggctgggagc gccatcccca gagagacacc ctagggttca gctcgtgagg gacgccaggc 9540 aggcttttgt gccgaagcag aacctgttta gggaccgcag cggtcaggag gcggaggaga 9600 tgcgcgattg caggtttcgg gcgggcagag agctcagggc gggcttcgat cgggagcggc 9660 tcctgagggc ggaggatttc gagcccgacg agcgttctgg ggtgagcccg gcccgcgctc 9720 acgtatcggc ggccaacctg gtgagcgcgt acgagcagac ggtgaacgag gagcgcaact 9780 tccaaaagag ctttaacaat cacgtgagga ccctgatcgc gagggaggag gtgaccatcg 9840 ggctgatgca tctgtgggac ttcgtggagg cctacgtgca gaacccggct agcaaacccc 9900 tgacggccca gctgttcctg atcgtgcagc acagccgcga caacgagacg ttccgcgacg 9960 ccatgttgaa catcgcggag cccgagggtc gctggctctt ggatctgatt aacatcctgc 10020 agagcatcgt ggtgcaggag aggggcctga gtttagcgga caaggtggcg gccattaact 10080 attcgatgca gagcctgggg aagttctacg ctcgcaagat ctacaagagc ccttacgtgc 10140 ccatagacaa ggaggtgaag atagacagct tttacatgcg catggcgctg aaggtgctga 10200 cgctgagcga cgacctcggc gtgtaccgta acgacaagat ccacaaggcg gtgagcgcca 10260 gccgccggcg ggagctgagc gacagggagc tgatgcacag cctgcagagg gcgctggcgg 10320 gcgccgggga cgaggagcgc gaggcttact tcgacatggg agccgatctg cagtggcgtc 10380 ccagcgcgcg cgccttggag gcggcgggtt atcccgacga ggaggatcgg gacgatttgg 10440 aggaggcagg cgagtacgag gacgaagcct gaccgggcag gtgttgtttt agatgcagcg 10500 gccggcggac gggaccaccg cggatcccgc acttttggca tccatgcaga gtcaaccttc 10560 gggcgtgacc gcctccgatg actgggcggc ggccatggac cgcatcatgg cgctgaccac 10620 ccgcaacccc gaggctttta ggcagcaacc ccaggccaac cgtttttcgg ccatcttgga 10680 agcggtggtg ccgtcgcgca ccaacccgac gcacgagaaa gtcctgacta tcgtgaacgc 10740 cctggtagac agcaaggcca tccgccgtga cgaggcgggc ttgatttaca acgctctttt 10800 ggaacgcgtg gcgcgctaca acagcactaa cgtgcagacc aatctggacc gcctcaccac 10860 cgacgtgaag gaggcgctgg cgcagaagga gcggtttctg agggacagta atctgggctc 10920 tctggtggca ctgaacgcct tcctgagctc acagccggcc aacgtgcccc gcgggcagga 10980 ggattacgtg agcttcatca gcgctctgag actgctggtg tccgaggtgc cccagagcga 11040 ggtgtaccag tctgggccgg attacttttt ccagacgtcc cgacagggct tgcaaacggt 11100 gaacctgact caggccttta aaaacttgca aggcatgtgg ggggtcaagg ccccggtggg 11160 cgatcgcgcc actatctcca gtctgctgac ccccaacact cgcctgctgc tgctcttgat 11220 cgcaccgttt accaacagta gcactatcag ccgtgactcg tacctgggtc atctcatcac 11280 tctgtaccgc gaggccatcg gccaggctca gatcgacgag catacgtatc aggagattac 11340 taacgtgagc cgtgccctgg gtcaggaaga taccggcagc ctggaagcca cgttgaactt 11400 tttgctaacc aaccggaggc aaaaaatacc ctcccagttc acgttaagcg ccgaggagga 11460 gaggattctg cgatacgtgc agcagtccgt gagcctgtac ttgatgcgcg agggcgccac 11520 cgcttccacg gctttagaca tgacggctcg gaacatggaa ccgtcctttt actccgccca 11580 ccggccgttc attaaccgtc tgatggacta cttccatcgc gcggccgcca tgaacgggga 11640 gtacttcacc aatgccatcc tgaatccgca ttggatgccc ccgtccggct tctacaccgg 11700 ggagtttgac ctgcccgaag ccgacgacgg ctttctgtgg gacgacgtgt ccgatagcat 11760 tttcacgccg gctaatcgcc gattccagaa gaaggagggc ggagacgagc tccccctctc 11820 cagcgtggaa gcggcctcaa ggggagagag tccctttcca agtctgtctt ccgccagtag 11880 cggtcgggta acgcgtccac ggttgccggg ggagagcgac tacctgaacg accccttgct 11940 gcgaccggct agaaagaaaa attttcccaa taacggggtg gaaagcttgg tggataaaat 12000 gaatcgttgg aagacgtacg cccaggagca gcgggagtgg gaggacagtc agccgcggcc 12060 gctggtaccg ccgcattggc gtcgccagag agaagacccg gacgactccg cagacgatag 12120 tagcgtgttg gacctgggag ggagcggagc caaccccttt gctcacttgc aacccaaggg 12180 gcgctcgagt cgcctgtatt aataaaaaag acgcggaaac ttaccagagc catggccaca 12240 gcgtgtgtgc tttcttcctc tctttcttcc tcggcgcggc aga atg aga aga gcg 12295
Met Arg Arg Ala 1
gtg aga gtc acg ccg gcg gcg tat gag ggc ccg ccc cct tet tac gaa 12343 Val Arg Val Thr Pro Ala Ala Tyr Glu Gly Pro Pro Pro Ser Tyr Glu 5 10 15 20
age gtg atg gga tea gcg aac gtg ccg gcc acg etg gag gcg cct tac 12391 Ser Val Met Gly Ser Ala Asn Val Pro Ala Thr Leu Glu Ala Pro Tyr 25 30 35
gtt cct ccc aga tac etg gga cct acg gag ggc aga aac age atc cgt 12439 Val Pro Pro Arg Tyr Leu Gly Pro Thr Glu Gly Arg Asn Ser Ile Arg 40 45 50
tac tcc gag etg gcg ccc etg tac gat acc acc aag gtg tac etg gtg 12487 Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Lys Val Tyr Leu Val 55 60 65
gac aac aag tcg gcg gac atc gcc tcc etg aat tac caa aac gat cac 12535 Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu Asn Tyr Gin Asn Asp His 70 75 80
agt aac ttt etg act acc gtg gtg cag aac aat gac ttc acc ccg acg 12583 Ser Asn Phe Leu Thr Thr Val Val Gin Asn Asn Asp Phe Thr Pro Thr 85 90 95 100
gag gcg ggc acg cag acc att aac ttt gac gag cgt tcc cgc tgg ggc 12631 Glu Ala Gly Thr Gin Thr Ile Asn Phe Asp Glu Arg Ser Arg Trp Gly 105 110 115
ggt cag etg aaa acc atc etg cac acc aac atg ccc aac atc aac gag 12679 Gly Gin Leu Lys Thr Ile Leu His Thr Asn Met Pro Asn Ile Asn Glu 120 125 130
ttc atg tcc acc aac aag ttc agg get aag etg atg gta gaa aaa agt 12727 Phe Met Ser Thr Asn Lys Phe Arg Ala Lys Leu Met Val Glu Lys Ser 135 140 145
aat gcg gaa act egg cag ccc ega tac gag tgg ttc gag ttt acc att 12775 Asn Ala Glu Thr Arg Gin Pro Arg Tyr Glu Trp Phe Glu Phe Thr Ile 150 155 160
cca gag ggc aac tat tcc gaa act atg act atc gat etc atg aat aac 12823 Pro Glu Gly Asn Tyr Ser Glu Thr Met Thr Ile Asp Leu Met Asn Asn 165 170 175 180
gcg atc gtg gac aat tac etg caa gtg ggg aga cag aac ggg gtg etg 12871 Ala Ile Val Asp Asn Tyr Leu Gin Val Gly Arg Gin Asn Gly Val Leu 185 190 195
gaa age gat atc ggc gtg aaa ttc gat acc aga aac ttc ega etg ggg 12919 Glu Ser Asp Ile Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly 200 205 210
tgg gat ccc gtg acc aag etg gtg atg cca ggc gtg tac acc aac gag 12967 Trp Asp Pro Val Thr Lys Leu Val Met Pro Gly Val Tyr Thr Asn Glu 215 220 225
get ttt cac ccg gac atc gtg etg etg ccg ggg tgc ggt gtg gac ttc 13015 Ala Phe His Pro Asp Ile Val Leu Leu Pro Gly Cys Gly Val Asp Phe 230 235 240
act cag age cgt ttg agt aac etg tta gga att aga aag cgc cgc ccc 13063 Thr Gin Ser Arg Leu Ser Asn Leu Leu Gly Ile Arg Lys Arg Arg Pro 245 250 255 260
ttc caa gag ggc ttt caa atc atg tat gag gac etg gag gga ggt aat 13111 Phe Gin Glu Gly Phe Gin Ile Met Tyr Glu Asp Leu Glu Gly Gly Asn 265 270 275
ata ccc gcc tta etg gac gtg tcg aag tac gaa get age ata caa cgc 13159 Ile Pro Ala Leu Leu Asp Val Ser Lys Tyr Glu Ala Ser Ile Gin Arg 280 285 290
gcc aaa gcg gag ggt aga gag att egg gga gac acc ttt gcg gta get 13207 Ala Lys Ala Glu Gly Arg Glu Ile Arg Gly Asp Thr Phe Ala Val Ala 295 300 305
ccc cag gac etg gaa ata gtg cct tta act aaa gac age aaa gac aga 13255 Pro Gin Asp Leu Glu Ile Val Pro Leu Thr Lys Asp Ser Lys Asp Arg 310 315 320
age tac aat att ata aac aac acg acg gac acc etg tat egg age tgg 13303 Ser Tyr Asn Ile Ile Asn Asn Thr Thr Asp Thr Leu Tyr Arg Ser Trp 325 330 335 340
ttt etg get tac aac tac gga gac ccc gag aaa gga gtg aga tea tgg 13351 Phe Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser Trp 345 350 355
acc ata etc acc acc acg gac gtg acc tgt ggc tcg cag caa gtg tac 13399 Thr Ile Leu Thr Thr Thr Asp Val Thr Cys Gly Ser Gin Gin Val Tyr 360 365 370
tgg tcc etg ccg gat atg atg caa gac ccg gtc acc ttc cgc ccc tcc 13447 Trp Ser Leu Pro Asp Met Met Gin Asp Pro Val Thr Phe Arg Pro Ser 375 380 385
acc caa gtc age aac ttc ccg gtg gtg ggc acc gag etg etg ccc gtc 13495 Thr Gin Val Ser Asn Phe Pro Val Val Gly Thr Glu Leu Leu Pro Val 390 395 400
eat gcc aag age ttc tac aac gag cag gcc gtc tac tcg caa ett att 13543 His Ala Lys Ser Phe Tyr Asn Glu Gin Ala Val Tyr Ser Gin Leu Ile 405 410 415 420
cgc cag tcc acc gcg ett acc cac gtg ttc aat cgc ttt ccc gag aac 13591 Arg Gin Ser Thr Ala Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn 425 430 435
cag att etg gtg cgc cct ccc get cct acc att acc acc gtc agt gaa 13639 Gin Ile Leu Val Arg Pro Pro Ala Pro Thr Ile Thr Thr Val Ser Glu 440 445 450
aac gtt ccc gcc etc aca gat cac gga acc etg ccg etg cgc age agt 13687 Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser Ser 455 460 465
atc agt gga gtt cag cgc gtg acc atc acc gac gcc aga cgt ega acc 13735 Ile Ser Gly Val Gin Arg Val Thr Ile Thr Asp Ala Arg Arg Arg Thr 470 475 480
tgc ccc tac gtt tac aaa gcg ett ggc gtg gtg get cct aaa gtt ett 13783 Cys Pro Tyr Val Tyr Lys Ala Leu Gly Val Val Ala Pro Lys Val Leu 485 490 495 500
tet agt cgc acc ttc taa aaacatgtcc atcctcatct ctcccgataa 13831 Ser Ser Arg Thr Phe
505
caacaccggc tggggactgg gctccggcaa gatgtaegge ggagccaaaa ggcgctccag 13891 tcagcaccca gttcgagttc ggggccactt ccgcgctcct tggggagett aeaagegagg 13951 actctegggt egaaeggetg tagacgatac catagatgcc gtgattgeeg acgcccgccg 14011 gtacaacccc ggaceggteg ctagcgccgc ctccaccgtg gattcegtga tegacagegt 14071 ggtagcegge gctcgggcct atgctcgccg caagaggegg etgcategga gacgtcgccc 14131 caccgccgcc atgctggcag ccagggccgt getgaggegg geeeggaggg caggcagaag 14191 ggctatgege cgcgctgccg ccaacgccgc egeegggagg gcccgccgac aggctgcccg 14251 ccaggctgcc gctgccatcg ctagcatggc cagacccagg agagggaacg tgtactgggt 14311 gegtgattet gtgaegggag tccgagtgcc ggtgegcage cgacctcccc gaagttagaa 14371 gatccaaget gegaagaegg eggtactgag tctccctgtt gttatcagcc caacatgagc 14431 aagegcaagt ttaaagaaga actgctgcag aegctggtgc ctgagatcta tggccctccg 14491 gaegtgaage cagacattaa gccccgcgat ateaagegtg ttaaaaagcg ggaaaagaaa 14551 gaggaactcg eggtggtaga egatggegga gtggaattta ttaggagttt cgccccgcga 14611 egeagggtte aatggaaagg geggegggta caacgegttt tgaggccggg caccgcggta 14671 gtttttaccc egggagageg gtcggccgtt aggggtttca aaaggcagta cgacgaggtg 14731 taeggegaeg aggacatatt ggaacaggcg getcaacaga teggagaatt tgcctaegga 14791 aagegttege gtcgegaaga cctggccatc geettagaca gcggcaaccc cacgcccagc 14851 ctcaaacccg tgacgctgca gcaggtgctt cccgtgagcg ccagcacgga cagcaagagg 14911 gggattaaga gagaaatgga agatctgcat cccaccatcc aactcatggt ccctaaacgg 14971 cagaggctgg aagaggtcct ggagaagatg aaagtggacc ccagcataga gccggatgta 15031 aaagtcagac ctattaagga agtggccccc ggtcttgggg tgcaaacggt ggacattcaa 15091 atccccgtca ccaccgcttc aaccgccgtg gaagctatgg aaacgcaaac ggagacccct 15151 gccgcgatcg gtaccaggga agtggcgttg caaacggagc cttggtacga atacgcagcc 15211 cctcggcgtc agaggcgttc cgctcgttac ggccccgcca acgccatcat gccagaatat 15271 gcgctgcatc cgtctattct gcccactccc ggataccggg gtgtgacgta tcgcccgtct 15331 ggaacccgcc gccgaacccg tcgccgccgc cgctcccgtc gcgctctggc ccccgtgtcg 15391 gtgcggcgtg tgacccgccg gggaaagaca gtcgtcattc ccaacccgcg ttaccaccct 15451 agcatccttt aataactctg ccgttttgca gatggctctg acttgccgcg tgcgccttcc 15511 cgttccgcac tatcgaggaa gatctcgtcg taggagaggc atgacgggca gtggtcgccg 15571 gcgggctttg cgcaggcgca tgaaaggcgg aattttaccc gccctgatac ccataattgc 15631 cgccgccatc ggtgccatac ccggcgttgc ttcagtggcg ttgcaagcag ctcgtaataa 15691 ataaacaaag gcttttgcac ttatgacctg gtcctgacta ttttatgcag aaagagcatg 15751 gaagacatca attttacgtc gctggctccg cggcacggct cgcggccgct catgggcacc 15811 tggaacgaca tcggcaccag tcagctcaac gggggcgctt tcaattgggg gagcctttgg 15871 agcggcatta aaaactttgg ctccacgatt aaatcctacg gcagcaaagc ctggaacagt 15931 agtgctggtc agatgctccg agataaactg aaggacacca acttccaaga aaaagtggtc 15991 aatggggtgg tgaccggcat ccacggcgcg gtagatctcg ccaaccaagc ggtgcagaaa 16051 gagattgaca ggcgtttgga aagctcgcgg gtgccgccgc agagagggga tgaggtggag 16111 gtcgaggaag tagaagtaga ggaaaagctg cccccgctgg agaaagttcc cggtgcgcct 16171 ccgagaccgc agaagcggcc caggccagaa ctagaagaga ctctggtgac ggagagcaag 16231 gagcctccct cgtacgagca agccttgaaa gagggcgcct ctccaccctc ctacccgatg 16291 actaagccga tcgcacccat ggctcgaccg gtgtacggca aggattacaa gcccgtcacg 16351 ctagagctgc ccccaccgcc ccccacgcgc ccgaccgtcc cccccctgcc gactccgtcg 16411 gcggccgcgg cgggacccgt gtccgcacca tccgctgtgc ctctgccagc cgcccgtcca 16471 gtggccgtgg ccactgccag aaaccccaga ggccagagag gagccaactg gcaaagcacg 16531 ctgaacagca tcgtgggcct gggagtgaaa agcctgaaac gccgccgttg ctattattaa 16591 aaaagtgtag ctaaaaagtc tcccgttgta tacgcctcct atgttaccgc cagagacgag 16651 tgactgtcgc cgcgagcgcc gctttcaag atg gcc acc cca tcg atg atg ccg 16704
Met Ala Thr Pro Ser Met Met Pro 510
cag tgg tet tac atg cac atc gcc ggc cag gac gcc tcg gag tac etg 16752 Gin Trp Ser Tyr Met His Ile Ala Gly Gin Asp Ala Ser Glu Tyr Leu 515 520 525
agt ccc ggc etc gtg cag ttt gcc cgc gcc acc gac acc tac ttc age 16800 Ser Pro Gly Leu Val Gin Phe Ala Arg Ala Thr Asp Thr Tyr Phe Ser 530 535 540 545
ttg gga aac aag ttt aga aac ccc acc gtg gcc ccc acc cac gat gtg 16848 Leu Gly Asn Lys Phe Arg Asn Pro Thr Val Ala Pro Thr His Asp Val 550 555 560
acc acg gac cgc tcg cag agg etg acc etg cgc ttt gtg ccc gta gac 16896 Thr Thr Asp Arg Ser Gin Arg Leu Thr Leu Arg Phe Val Pro Val Asp 565 570 575
egg gag gac acc gcg tac tet tac aaa gtg cgc tac acg ttg gcc gta 16944 Arg Glu Asp Thr Ala Tyr Ser Tyr Lys Val Arg Tyr Thr Leu Ala Val 580 585 590
ggg gac aac ega gtg etg gac atg gcc age acc tac ttt gac atc egg 16992 Gly Asp Asn Arg Val Leu Asp Met Ala Ser Thr Tyr Phe Asp Ile Arg 595 600 605
ggg gtg etg gat egg ggt ccc age ttc aag ccc tat tcc ggc acc get 17040 Gly Val Leu Asp Arg Gly Pro Ser Phe Lys Pro Tyr Ser Gly Thr Ala 610 615 620 625
tac aac tcc etg gcc ccc aag gga get ccc aac ccc tcg gaa tgg acg 17088 Tyr Asn Ser Leu Ala Pro Lys Gly Ala Pro Asn Pro Ser Glu Trp Thr 630 635 640
gac act tcc gac aac aaa ett aaa gca tat get cag get ccc tac cag 17136 Asp Thr Ser Asp Asn Lys Leu Lys Ala Tyr Ala Gin Ala Pro Tyr Gin 645 650 655
agt caa gga ett aca aag gat ggt att cag gtt ggg eta gtt gtg aca 17184 Ser Gin Gly Leu Thr Lys Asp Gly Ile Gin Val Gly Leu Val Val Thr 660 665 670
gag tea gga caa aca ccc caa tat gca aac aaa gtg tac caa ccc gag 17232 Glu Ser Gly Gin Thr Pro Gin Tyr Ala Asn Lys Val Tyr Gin Pro Glu 675 680 685
cca caa att ggg gaa aac caa tgg aat tta gaa caa gaa gat aaa gcg 17280 Pro Gin Ile Gly Glu Asn Gin Trp Asn Leu Glu Gin Glu Asp Lys Ala 690 695 700 705
gcg gga aga gtc eta aag aaa gat acc cct atg ttt ccc tgc tat ggg 17328 Ala Gly Arg Val Leu Lys Lys Asp Thr Pro Met Phe Pro Cys Tyr Gly 710 715 720
tea tat gcc agg ccc aca aac gaa caa gga ggg cag gca aaa aac caa 17376 Ser Tyr Ala Arg Pro Thr Asn Glu Gin Gly Gly Gin Ala Lys Asn Gin 725 730 735
gaa gta gat tta cag ttt ttt gcc act ccg ggc gac acc cag aac acg 17424 Glu Val Asp Leu Gin Phe Phe Ala Thr Pro Gly Asp Thr Gin Asn Thr 740 745 750
get aaa gtg gta ett tat get gaa aat gtc aac etg gaa act cca gat 17472 Ala Lys Val Val Leu Tyr Ala Glu Asn Val Asn Leu Glu Thr Pro Asp 755 760 765
act cac tta gtg ttt aaa ccc gat gac gac age acc agt tea aaa ett 17520 Thr His Leu Val Phe Lys Pro Asp Asp Asp Ser Thr Ser Ser Lys Leu 770 775 780 785
ett ett ggg cag cag get gca cct aac aga ccc aac tac ata ggt ttt 17568 Leu Leu Gly Gin Gin Ala Ala Pro Asn Arg Pro Asn Tyr Ile Gly Phe 790 795 800
aga gat aat ttt att ggt tta atg tac tac aat age act gga aac atg 17616 Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met 805 810 815
ggc gtg etg gcc gga cag get tet caa ttg aat gcc gta gtc gac ttg 17664 Gly Val Leu Ala Gly Gin Ala Ser Gin Leu Asn Ala Val Val Asp Leu 820 825 830
cag gac aga aac acc gag ttg tcc tac cag etg atg etg gac gca etg 17712 Gin Asp Arg Asn Thr Glu Leu Ser Tyr Gin Leu Met Leu Asp Ala Leu 835 840 845
ggg gat cgc age ega tat ttt tea atg tgg aat cag gca gta gac age 17760 Gly Asp Arg Ser Arg Tyr Phe Ser Met Trp Asn Gin Ala Val Asp Ser 850 855 860 865
tat gac cca gac gtt aga att ata gaa aac cac gga gtg gaa gac gaa 17808 Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly Val Glu Asp Glu 870 875 880
etg cca aac tat tgt ttt cct etg gga gga atg gtg gtg act gac aat 17856 Leu Pro Asn Tyr Cys Phe Pro Leu Gly Gly Met Val Val Thr Asp Asn 885 890 895
tac aac tet gtg acg cct caa aat gga ggc agt gga aat aca tgg cag 17904 Tyr Asn Ser Val Thr Pro Gin Asn Gly Gly Ser Gly Asn Thr Trp Gin 900 905 910
gca gac aat act aca ttt agt caa aga gga gcg cag att ggc tcc gga 17952 Ala Asp Asn Thr Thr Phe Ser Gin Arg Gly Ala Gin Ile Gly Ser Gly 915 920 925
aac atg ttt gcc etg gaa att aac eta cag gcc aac etc tgg cgc ggc 18000 Asn Met Phe Ala Leu Glu Ile Asn Leu Gin Ala Asn Leu Trp Arg Gly 930 935 940 945
ttc ttg tat tcc aat att ggg ttg tat ett cca gac tet etg aaa atc 18048 Phe Leu Tyr Ser Asn Ile Gly Leu Tyr Leu Pro Asp Ser Leu Lys Ile 950 955 960
acc ccc gac aac atc acg etg cca gaa aac aaa aac act tat cag tac 18096 Thr Pro Asp Asn Ile Thr Leu Pro Glu Asn Lys Asn Thr Tyr Gin Tyr 965 970 975
atg aac ggt cgc gta acg cca ccc ggg etc ata gac acc tat gta aac 18144 Met Asn Gly Arg Val Thr Pro Pro Gly Leu Ile Asp Thr Tyr Val Asn 980 985 990
gtg ggc gcg cgc tgg tcc ccc gat gtc atg gac age att aac ccc ttc 18192 Val Gly Ala Arg Trp Ser Pro Asp Val Met Asp Ser Ile Asn Pro Phe 995 1000 1005
aac cac cac cgt aac gcg ggc ttg cgc tac cgc tcc atg etc ttg 18237 Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met Leu Leu 1010 1015 1020
ggc aac ggc cgt tat gtg cct ttt cac att cag gtg ccc caa aaa 18282 Gly Asn Gly Arg Tyr Val Pro Phe His Ile Gin Val Pro Gin Lys 1025 1030 1035
ttc ttt gcc att aaa aac etg etg ett etc ccc ggt tcc tat acc 18327 Phe Phe Ala Ile Lys Asn Leu Leu Leu Leu Pro Gly Ser Tyr Thr 1040 1045 1050
tat gag tgg aac ttc cgc aag gat gtc aac atg atc etg cag age 18372 Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met Ile Leu Gin Ser 1055 1060 1065
tcg etg ggt aat gac etg ega gtg gac ggg gcc age ata cgc ttt 18417 Ser Leu Gly Asn Asp Leu Arg Val Asp Gly Ala Ser Ile Arg Phe 1070 1075 1080
gac age att aac etg tat gcc aac ttt ttt ccc atg gcc cac aac 18462 Asp Ser Ile Asn Leu Tyr Ala Asn Phe Phe Pro Met Ala His Asn 1085 1090 1095
acg gcc tet acc etg gaa gcc atg etg cgc aac gac acc aat gac 18507 Thr Ala Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp 1100 1105 1110
cag tcc ttc aac gac tac etg tgc gcg get aac atg etg tac ccc 18552 Gin Ser Phe Asn Asp Tyr Leu Cys Ala Ala Asn Met Leu Tyr Pro 1115 1120 1125
atc ccc gcc aac gcc acc age gtg ccc att tet att cct tet egg 18597 Ile Pro Ala Asn Ala Thr Ser Val Pro Ile Ser Ile Pro Ser Arg 1130 1135 1140
aac tgg get gcc ttc agg ggc tgg agt ttt act cgc etc aaa acc 18642 Asn Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg Leu Lys Thr 1145 1150 1155
aag gag act ccc tcg etg ggc tcc ggt ttt gac ccc tac ttt gtt 18687 Lys Glu Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr Phe Val 1160 1165 1170
tac tcc ggc tcc att ccc tac eta gat ggc acc ttt tac etc aac 18732 Tyr Ser Gly Ser Ile Pro Tyr Leu Asp Gly Thr Phe Tyr Leu Asn 1175 1180 1185
cac act ttc aaa aag gtg tet att atg ttt gac tcc tcg gtt age 18777 His Thr Phe Lys Lys Val Ser Ile Met Phe Asp Ser Ser Val Ser 1190 1195 1200
tgg ccc ggc aac gac cgc etg eta acg ccc aac gag ttc gaa att 18822 Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro Asn Glu Phe Glu Ile 1205 1210 1215
aag cgt tcc gtg gac ggt gaa ggg tac aac gtg gcc cag age aac 18867 Lys Arg Ser Val Asp Gly Glu Gly Tyr Asn Val Ala Gin Ser Asn 1220 1225 1230
atg acc aag gac tgg ttt eta att caa atg etc agt cac tat aat 18912 Met Thr Lys Asp Trp Phe Leu Ile Gin Met Leu Ser His Tyr Asn 1235 1240 1245
ata ggt tac cag ggc ttc tat gtg ccc gag aac tac aag gac cgc 18957 Ile Gly Tyr Gin Gly Phe Tyr Val Pro Glu Asn Tyr Lys Asp Arg 1250 1255 1260
atg tac tcc ttc ttc cgc aac ttc caa cca atg age egg cag gtg 19002 Met Tyr Ser Phe Phe Arg Asn Phe Gin Pro Met Ser Arg Gin Val 1265 1270 1275
gta gat acc gtg act tat aca gac tac aaa gat gtc aag etc ccc 19047 Val Asp Thr Val Thr Tyr Thr Asp Tyr Lys Asp Val Lys Leu Pro 1280 1285 1290
tac caa cac aac aac tea ggg ttc gtg ggc tac atg gga ccc acc 19092 Tyr Gin His Asn Asn Ser Gly Phe Val Gly Tyr Met Gly Pro Thr 1295 1300 1305
atg ega gag gga cag gcc tac ccg gcc aac tat ccc tac ccc etg 19137 Met Arg Glu Gly Gin Ala Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu 1310 1315 1320
atc gga gag act gcc gta ccc age etc acg cag aaa aag ttc etc 19182 Ile Gly Glu Thr Ala Val Pro Ser Leu Thr Gin Lys Lys Phe Leu 1325 1330 1335
tgc gac egg gtg atg tgg agg ata ccc ttc tet age aac ttt atg 19227 Cys Asp Arg Val Met Trp Arg Ile Pro Phe Ser Ser Asn Phe Met 1340 1345 1350
tcg atg ggc tcc etc acc gac etg ggg cag aac atg etg tac gcc 19272 Ser Met Gly Ser Leu Thr Asp Leu Gly Gin Asn Met Leu Tyr Ala 1355 1360 1365
aac tcc get cac gcc ttg gac atg act ttt gag gtg gat ccc atg 19317 Asn Ser Ala His Ala Leu Asp Met Thr Phe Glu Val Asp Pro Met 1370 1375 1380
gat gag ccc acg ett etc tat gtt etg ttt gaa gtc ttc gac gtg 19362 Asp Glu Pro Thr Leu Leu Tyr Val Leu Phe Glu Val Phe Asp Val 1385 1390 1395
gtg cgc atc cac cag ccg cac cgc ggc gtc atc gag gcc gtc tac 19407 Val Arg Ile His Gin Pro His Arg Gly Val Ile Glu Ala Val Tyr 1400 1405 1410
etg cgc aca cct ttc tet gcc ggt aac gcc acc acc taa agaagctgat 19456 Leu Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr
1415 1420 1425
gggttccagc gaacaggagt tgeaggceat tgttcgegae etgggctgeg ggccctgctt 19516 tttgggcacc ttcgacaagc gttttccegg attcatgtcc ccccacaagc cggcctgcgc 19576 categttaae acggccggac gggagacagg gggggtgeae tggctcgcct tegcctggaa 19636 cccgcgcaac cgcacctgct acctgttcga cccttttggt ttctccgacg aaaggetgaa 19696 gcagatctac caattcgagt aegagggget cctcaagcgc agegctctgg cctccacgcc 19756 cgaccactgc gtcaccctgg aaaagtccac ccagacggtc caggggcccc tctcggccgc 19816 etgegggett ttctgttgca tgtttttgca cgccttcgtg cactggcctc acacccccat 19876 ggagegcaae cccaccatgg atctgctcac cggagtgccc aacagcatgc ttcacagtcc 19936 ccaggtcgcc cccaccctgc gtegcaatea ggaccacctg tategcttte tggggaaaca 19996 ctctgcctat ttccgccgcc accggcagcg catcgaacag gccacggcct tcgaaagcat 20056 gagccaaaga gtgtaatcaa taaaaaccgt ttttatttga catgatacgc gcttctggcg 20116 tttttattaa aaatcgaagg gttcgaggga ggggtcctcg tgcccgctgg ggagggacac 20176 gttgcggtac tggaatcggg cgctccaacg aaactcgggg atcaccagcc gcggcagggc 20236 cacgtcttcc atgttctgct tccaaaactg tcgcaccagc tgcagggctc ccatcacgtc 20296 gggcgctgag atcttgaagt cgcagttagg gccggagccc ccgcggctgt tgcggaacac 20356 ggggttggca cactggaaca ccaacacgct ggggttgtgg atactagcca gggccgtcgg 20416 gtcggtcacc tccgatgcat ccagatcctc ggcattgctc agggcgaacg gggtcagctt 20476 gcacatctgc cgcccgatct ggggtaccag gtcgcgcttg ttgaggcagt cgcagcgcag 20536 agggatgagg atgcgacgct gcccgcgttg catgatgggg taactcgccg ccaggaactc 20596 ctctatctga cggaaggcca tctgggcctt gacgccctcg gtgaaaaata gcccacagga 20656 cttgctggaa aacacgttat tgccacagtt gatgtcttcc gcgcagcagc gcgcatcttc 20716 gttcttcagc tgaaccacgt tgcgacccca gcggttctga accaccttgg ctttcgtggg 20776 atgctccttc agcgcccgct gtccgttctc gctggtcaca tccatttcca ccacgtgctc 20836 cttgcagacc atctccactc cgtggaaaca gaacagaatg ccctcctgtt gggtattgcg 20896 atgctcccac acggcgcacc cggtggactc ccagctcttg tgtttcaccc ccgcgtaggc 20956 ttccatgtaa gccattagaa atctgcccat cagctcagtg aaggtcttct ggttggtgaa 21016 ggttagcggc aggccgcggt gttcctcgtt caaccaagtt tgacagatct tgcggtacac 21076 ggctccctgg tcgggcagaa acttaaaagt cgttctgctc tcgttgtcca cgtggaactt 21136 ctccatcaac atcgtcatga cttccatgcc cttctcccag gcagtcacca gcggcgcgct 21196 ctcggggttc ttcaccaaca cggcggtgga ggggccctcg ccggccccga cgtccttcat 21256 ggacattttt tgaaactcca cggtgccgtc cgcgcggcgt actctgcgca tcggagggta 21316 gctgaagccc acctccatga cggtgctttc gccctcgctg tcggagacga tctccgggga 21376 gggcggcgga acgggggcag acttgcgagc cttcttcttg ggagggagcg gaggcacctc 21436 ctgctcgcgc tcgggactca tctcccgcaa gtagggggtg atggagcttc ctggttggtt 21496 ctgacggttg gccattgtat cctaggcaga aagacatgga gcttatgcgc gaggaaactt 21556 taaccgcccc gtcccccgtc agcgacgaag aggtcatcgt cgaacaggac ccgggctacg 21616 ttacgccgcc cgaggatctg gaggggccct tagacgaccg gcgcgacgct agtgagcggc 21676 aggaaaatga gaaagaggag gaggagggct gctacctcct ggaaggcgac gttttgctaa 21736 agcatttcgc caggcagagc accatactca aggaggcctt gcaagaccgc tccgaggtgc 21796 ccttggacgt cgccgcgctc tcccaggcct acgaggcgaa ccttttctcg ccccgagtgc 21856 ctccgaagag acagcccaac ggcacctgcg agcccaaccc gcgactcaac ttctaccccg 21916 tgttcgccgt gcccgaggcg ctggccacct accacatctt tttcaaaaac cagcgcattc 21976 ccctttcctg ccgggccaac cgcaccgcgg ccgataggaa gctaacactc agaaacggag 22036 tcagcatacc tgatatcacg tcactggagg aagtgcctaa gatcttcgag ggtctgggtc 22096 gagatgagaa gcgggcggcg aacgctctgc agaaagaaca gaaagagagt cagaacgtgc 22156 tggtggagct ggagggggac aacgcgcgtc tgaccgtcct caaacgttgc atagaagttt 22216 cccacttcgc ctacccggcc ctcaacctgc cgcccaaagt tatgaaatcg gtcatggacc 22276 agctactcat caagagagct gagcccctga atcccgacca ccctgaggcg gaaaactcag 22336 aggacggaaa gcccgtcgtc agcgacgagg agctcgagcg gtggctggaa accagggacc 22396 cccagcagtt gcaagagagg cgcaagatga tgatggcggc cgtgctggtc acggtggagc 22456 tagaatgcct gcaacggttt ttcagcgacg tggagacgct acgcaaaatc ggggagtccc 22516 tgcactacac cttccgccag ggctacgttc gccaggcctg caaaatctcc aacgtagagc 22576 tcagcaacct ggtttcctac atgggcatcc tccacgagaa ccggctgggg cagagcgtgc 22636 tgcactgcac cttgcaaggc gaggcgcgaa gggactacgt ccgagactgc gtctacctct 22696 tcctcaccct cacctggcag accgccatgg gcgtgtggca gcagtgcttg gaagagagaa 22756 acctcaaaga gctggacaaa ctcctctgcc gccagcggcg ggccctctgg accggcttca 22816 gcgagcgcac ggtcgcctgc gccctggcag acatcatttt cccagaacgc ctgatgaaaa 22876 ccttgcagaa cggcctgccg gatttcatca gtcagagcat cttgcaaaac ttccgctcct 22936 tcgtcctgga gcgctccggg atcttgcccg ccatgagctg cgcgctgcct tctgactttg 22996 tccccctttc ctaccgcgag tgccctcccc cactgtggag ccactgctac ctcttccaac 23056 tggccaactt tctggcctac cactccgacc tcatggaaga cgtgagcgga gaggggctgc 23116 tcgagtgcca ctgccgctgc aacctctgca ccccccacag atcgctggcc tgcaacaccg 23176 agctgctcag cgaaacccag gtcataggta ccttcgagat ccaggggccc cagcagcaag 23236 agggtgcttc cggcttgaag ctcactccgg cgctgtggac ctcggcttac ttacgcaaat 23296 ttgtagccga ggactaccac gcccacaaaa ttcagtttta cgaagaccaa tctcgaccac 23356 cgaaagcccc cctcacggcc tgcgtcatca cccagagcaa aatcctggcc caattgcaat 23416 ccatcaacca agcgcgccga gatttccttt tgaaaaaggg tcggggggtg tacctggacc 23476 cccagaccgg cgaggaactc aacccgtcca cactttccgt cgaagcagcc cccccgagac 23536 atgccaccca agggaaccgc caagcagctg atcgctcggc agagagcgaa gaagcaagag 23596 ctgctccagc agcaggtgga ggacgaggaa gagctgtggg acagccaggc agaggaggtg 23656 tcagaggacg aggaggagat ggaaagctgg gacagcctag acgaggagga cgagctttca 23716 gaggaagagg cgaccgaaga aaaaccacct gcatccagcg cgccttctct gagccgacag 23776 ccgaagcccc ggcccccgac gcccccggcc ggctcactca aagccagccg taggtgggac 23836 gccaccggat ctccagcggc agcggcaacg gcagcgggta aggccaaacg cgagcggcgg 23896 gggtattgct cctggcggac ccacaaaagc agtatcgtga actgcttgca acactgcggg 23956 ggaaacatct cctttgcccg acgctacctc ctcttccatc acggtgtggc cttccctcgc 24016 aacgttctct attattaccg tcatctctac agcccctacg aaacgctcgg agaaaaaagc 24076 taaggcctcc tctgccgcga ggaaaaactc cgccgccgct gccgccaagg atccgccggc 24136 caccgaggag ctgagaaagc gcatctttcc cactctgtat gctatctttc agcaaagccg 24196 cgggcagcac cctcagcgcg aactgaaaat aaaaaaccgc tccttccgct cactcacccg 24256 cagctgtctg taccacaaga gagaagacca gctgcagcgc accctggacg acgccgaagc 24316 actgttcagc aaatactgct cagcgtctct taaagactaa aagacccgcg ctttttcccc 24376 ctcgggcgcc aaaacccacg tcatcgccag catgagcaag gagattccca ccccttacat 24436 gtggagctat cagccccaga tgggcctggc cgcgggggcc gcccaggact actccagcaa 24496 aatgaactgg ctcagcgccg gcccccacat gatctcacga gttaacggca tccgagccca 24556 ccgaaaccag atcctcttag aacaggcggc aatcaccgcc acaccccggc gccaactcaa 24616 cccgcccagt tggcccgccg cccaggtgta tcaggaaact ccccgcccga ccacagtcct 24676 cctgccacgc gacgcggagg ccgaagtcct catgactaac tctggggtac aattagcggg 24736 cgggtccagg tacgccaggt acagaggtcg ggccgctcct tactctcccg ggagtataaa 24796 gagggtgatc attcgaggcc gaggtatcca gctcaacgac gaggcggtga gctcctcaac 24856 cggtctcaga cctgacggag tcttccagct cggaggagcg ggccgctctt ccttcaccac 24916 tcgccaggcc tacctgaccc tgcagagctc ttcctcgcag ccgcgctccg ggggaatcgg 24976 cactctccag ttcgtggaag agttcgtccc ctccgtctac ttcaacccgt tttccggctc 25036 acctggacgc tacccggacg ccttcattcc caactttgac gcagtgagtg aatccgtgga 25096 cggctacgac tgatgacaga tggtgcggcc gtgagagctc ggctgcgaca tctgcatcac 25156 tgccgccagc ctcgctgcta cgctcgggag gcgatcgtgt tcagctactt tgagctgccg 25216 gacgagcacc ctcagggacc ggctcacggg ttgaaactcg agattgagaa cgcgcttgag 25276 tctcacctca tcgacgcctt caccgcccgg cctctcctgg tagaaaccga acgcgggatc 25336 actaccatca ccctgttctg catctgcccc acgcccggat tac atg aag atc tgt 25391
Met Lys Ile Cys 1430
gtt gtc atc ttt gcg etc agt tta ata aaa act gaa ett ttt gcc 25436 Val Val Ile Phe Ala Leu Ser Leu Ile Lys Thr Glu Leu Phe Ala 1435 1440 1445
gta cct tea acg cca cgc gtt gtt tet cct tgt gaa aaa acc cca 25481 Val Pro Ser Thr Pro Arg Val Val Ser Pro Cys Glu Lys Thr Pro 1450 1455 1460
gga gtc ett aac tta cac ata gca aaa ccc ttg tat ttt acc ata 25526 Gly Val Leu Asn Leu His Ile Ala Lys Pro Leu Tyr Phe Thr Ile 1465 1470 1475 gaa aaa caa eta gcc ett tea att gga aaa ggg tta aca att tet 25571 Glu Lys Gin Leu Ala Leu Ser Ile Gly Lys Gly Leu Thr Ile Ser 1480 1485 1490
get aca gga cag ttg gaa age aca gca age gta cag gac age get 25616 Ala Thr Gly Gin Leu Glu Ser Thr Ala Ser Val Gin Asp Ser Ala 1495 1500 1505
aca cca ccc eta cgt ggt att tcc cct tta aag etg aca gac aac 25661 Thr Pro Pro Leu Arg Gly Ile Ser Pro Leu Lys Leu Thr Asp Asn 1510 1515 1520
ggt tta aca tta age tat tea gat ccc etg cgt gtg gta ggt gac 25706 Gly Leu Thr Leu Ser Tyr Ser Asp Pro Leu Arg Val Val Gly Asp 1525 1530 1535
caa ett acg ttt aat ttt act tet cca eta cgt tac gaa aat ggc 25751 Gin Leu Thr Phe Asn Phe Thr Ser Pro Leu Arg Tyr Glu Asn Gly 1540 1545 1550
agt ett aca ttc aac tac act tet ccc atg aca eta ata aac aac 25796 Ser Leu Thr Phe Asn Tyr Thr Ser Pro Met Thr Leu Ile Asn Asn 1555 1560 1565
agt ett get att aac gtc aat acc tcc aaa ggc etc agt agt gac 25841 Ser Leu Ala Ile Asn Val Asn Thr Ser Lys Gly Leu Ser Ser Asp 1570 1575 1580
aac ggc aca etc get gta aat gtt act cca gat ttt aga ttt aac 25886 Asn Gly Thr Leu Ala Val Asn Val Thr Pro Asp Phe Arg Phe Asn 1585 1590 1595
age tet ggt gcc tta act ttt ggc ata caa agt eta tgg act ttt 25931 Ser Ser Gly Ala Leu Thr Phe Gly Ile Gin Ser Leu Trp Thr Phe 1600 1605 1610
cca acc aaa act cct aac tgt acc gtg ttt acc gaa agt gac tcc 25976 Pro Thr Lys Thr Pro Asn Cys Thr Val Phe Thr Glu Ser Asp Ser 1615 1620 1625
etg etg agt ett tgc ttg act aaa tgc gga get cac gta ett gga 26021 Leu Leu Ser Leu Cys Leu Thr Lys Cys Gly Ala His Val Leu Gly 1630 1635 1640
age gtg agt tta age gga gtg gca gga acc atg eta aaa atg acc 26066 Ser Val Ser Leu Ser Gly Val Ala Gly Thr Met Leu Lys Met Thr 1645 1650 1655
cac act tet gtt acc gtt cag ttt tcg ttt gat gac agt ggt aaa 26111 His Thr Ser Val Thr Val Gin Phe Ser Phe Asp Asp Ser Gly Lys 1660 1665 1670
eta ata ttc tet cca ett gcg aac aac act tgg ggt gtt ega caa 26156 Leu Ile Phe Ser Pro Leu Ala Asn Asn Thr Trp Gly Val Arg Gin 1675 1680 1685
age gag agt ccg ttg ccc aac cca tcc ttc aac get etc acg ttt 26201 Ser Glu Ser Pro Leu Pro Asn Pro Ser Phe Asn Ala Leu Thr Phe 1690 1695 1700
atg cca aac agt acc att tat tet aga gga gca agt aac gaa cct 26246 Met Pro Asn Ser Thr Ile Tyr Ser Arg Gly Ala Ser Asn Glu Pro 1705 1710 1715 caa aac aat tat tat gtc cag acg tat ett aga ggc aac gtg ega 26291 Gin Asn Asn Tyr Tyr Val Gin Thr Tyr Leu Arg Gly Asn Val Arg 1720 1725 1730
aag cca att eta eta act gtt acc tac aac tea gtt aat tea gga 26336 Lys Pro Ile Leu Leu Thr Val Thr Tyr Asn Ser Val Asn Ser Gly 1735 1740 1745
tat tcc tta act ttt aaa tgg gat get gtc gcc aat gaa aaa ttt 26381 Tyr Ser Leu Thr Phe Lys Trp Asp Ala Val Ala Asn Glu Lys Phe 1750 1755 1760
gcc act cct aca tet tcg ttt tgc tat gtt gca gag caa taa 26423 Ala Thr Pro Thr Ser Ser Phe Cys Tyr Val Ala Glu Gin 1765 1770
aaccctgtta ccccaccgtc tegttttttt cag atg aaa ega gcg aga gtt 26474
Met Lys Arg Ala Arg Val 1775
gat gaa gac ttc aac cca gtg tac cct tat gac ccc cca tac get 26519 Asp Glu Asp Phe Asn Pro Val Tyr Pro Tyr Asp Pro Pro Tyr Ala 1780 1785 1790
ccc gtc atg ccc ttc att act ccg cct ttt acc tcc tcg gat ggg 26564 Pro Val Met Pro Phe Ile Thr Pro Pro Phe Thr Ser Ser Asp Gly 1795 1800 1805
ttg cag gaa aaa cca ett gga gtg tta agt tta aac tac agg gat 26609 Leu Gin Glu Lys Pro Leu Gly Val Leu Ser Leu Asn Tyr Arg Asp 1810 1815 1820
ccc att act aca caa aat ggg tet etc acg tta aaa eta gga aac 26654 Pro Ile Thr Thr Gin Asn Gly Ser Leu Thr Leu Lys Leu Gly Asn 1825 1830 1835
ggc etc act eta aac aac cag gga cag tta aca tea act get ggc 26699 Gly Leu Thr Leu Asn Asn Gin Gly Gin Leu Thr Ser Thr Ala Gly 1840 1845 1850
gaa gtg gag cct ccg etc act aat get aac aac aaa ett gca eta 26744 Glu Val Glu Pro Pro Leu Thr Asn Ala Asn Asn Lys Leu Ala Leu 1855 1860 1865
gcc tat age gaa cca tta gca gta aaa age aac cgc eta act eta 26789 Ala Tyr Ser Glu Pro Leu Ala Val Lys Ser Asn Arg Leu Thr Leu 1870 1875 1880
tea cac acc get ccc ett gtc atc get aat aat tet tta gcg ttg 26834 Ser His Thr Ala Pro Leu Val Ile Ala Asn Asn Ser Leu Ala Leu 1885 1890 1895
caa gtt tea gag cct att ttt gta aat gac gat gac aag eta gcc 26879 Gin Val Ser Glu Pro Ile Phe Val Asn Asp Asp Asp Lys Leu Ala 1900 1905 1910
etg cag aca gcc gcc ccc ett gta acc aac get ggc acc ett cgc 26924 Leu Gin Thr Ala Ala Pro Leu Val Thr Asn Ala Gly Thr Leu Arg 1915 1920 1925
tta cag age get gcc cct tta gga ttg gtt gaa aat act ett aaa 26969 Leu Gin Ser Ala Ala Pro Leu Gly Leu Val Glu Asn Thr Leu Lys 1930 1935 1940
etg etg ttt tet aaa ccc ttg tat ttg caa aat gat ttt ett gca 27014 Leu Leu Phe Ser Lys Pro Leu Tyr Leu Gin Asn Asp Phe Leu Ala 1945 1950 1955
tta gcc att gaa cgc ccc etg get gta gca gcc gca ggt act etg 27059 Leu Ala Ile Glu Arg Pro Leu Ala Val Ala Ala Ala Gly Thr Leu 1960 1965 1970
acc eta caa ett act cct cca tta aag act aac gat gac ggg eta 27104 Thr Leu Gin Leu Thr Pro Pro Leu Lys Thr Asn Asp Asp Gly Leu 1975 1980 1985
aca eta tcc aca gtc gag cca tta act gta aaa aac gga aac eta 27149 Thr Leu Ser Thr Val Glu Pro Leu Thr Val Lys Asn Gly Asn Leu 1990 1995 2000
ggc ttg caa ata tcg cgc cct tta gtt gtt caa aac aac ggc ett 27194 Gly Leu Gin Ile Ser Arg Pro Leu Val Val Gin Asn Asn Gly Leu 2005 2010 2015
tcg ett get att acc ccc ccg etg cgt ttg ttt aac age gac ccc 27239 Ser Leu Ala Ile Thr Pro Pro Leu Arg Leu Phe Asn Ser Asp Pro 2020 2025 2030
gtt ett ggt ttg ggc ttc act ttt ccc eta get gtc aca aac aac 27284 Val Leu Gly Leu Gly Phe Thr Phe Pro Leu Ala Val Thr Asn Asn 2035 2040 2045
etc etc tcc tta aac atg gga gac gga gtt aaa ett acc tat aat 27329 Leu Leu Ser Leu Asn Met Gly Asp Gly Val Lys Leu Thr Tyr Asn 2050 2055 2060
aaa eta aca gcc aat ttg ggt agg gat tta caa ttt gaa aac ggt 27374 Lys Leu Thr Ala Asn Leu Gly Arg Asp Leu Gin Phe Glu Asn Gly 2065 2070 2075
gcg att gcc gta acg ett act gcc gaa tta cct ttg caa tac act 27419 Ala Ile Ala Val Thr Leu Thr Ala Glu Leu Pro Leu Gin Tyr Thr 2080 2085 2090
aac aaa ett caa etg aat att gga get ggc ett cgt tac aat gga 27464 Asn Lys Leu Gin Leu Asn Ile Gly Ala Gly Leu Arg Tyr Asn Gly 2095 2100 2105
gcc age aga aaa eta gat gta aac att aac caa aat aaa ggc tta 27509 Ala Ser Arg Lys Leu Asp Val Asn Ile Asn Gin Asn Lys Gly Leu 2110 2115 2120
act tgg gac aac gat gca gtt att ccc aaa eta gga tcg ggc tta 27554 Thr Trp Asp Asn Asp Ala Val Ile Pro Lys Leu Gly Ser Gly Leu 2125 2130 2135
caa ttt gac cct aat ggc aac atc get gtt atc cct gaa acc gtg 27599 Gin Phe Asp Pro Asn Gly Asn Ile Ala Val Ile Pro Glu Thr Val 2140 2145 2150
aag ccg caa acg tta tgg acg act gca gat ccc tcg cct aac tgc 27644 Lys Pro Gin Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys 2155 2160 2165
tea gtg tac cag gac ttg gat gcc agg etg tgg etc get ett gtt 27689 Ser Val Tyr Gin Asp Leu Asp Ala Arg Leu Trp Leu Ala Leu Val 2170 2175 2180
aaa agt ggc gac atg gtg eat gga age att gcc eta aaa gcc eta 27734 Lys Ser Gly Asp Met Val His Gly Ser Ile Ala Leu Lys Ala Leu 2185 2190 2195
aaa ggg acg ttg eta aat cct aca gcc age tac att tcc att gtg 27779 Lys Gly Thr Leu Leu Asn Pro Thr Ala Ser Tyr Ile Ser Ile Val 2200 2205 2210
ata tat ttt tac age aac gga gtc agg cgt acc aac tat cca acg 27824 Ile Tyr Phe Tyr Ser Asn Gly Val Arg Arg Thr Asn Tyr Pro Thr 2215 2220 2225
ttt gac aac gaa ggc acc tta get aac age gcc act tgg gga tac 27869 Phe Asp Asn Glu Gly Thr Leu Ala Asn Ser Ala Thr Trp Gly Tyr 2230 2235 2240
ega cag ggg caa tet get aac act aat gtg acc aat gcc act gaa 27914 Arg Gin Gly Gin Ser Ala Asn Thr Asn Val Thr Asn Ala Thr Glu 2245 2250 2255
ttt atg ccc age tea age agg tac ccc gtg aat aaa gga gac aac 27959 Phe Met Pro Ser Ser Ser Arg Tyr Pro Val Asn Lys Gly Asp Asn 2260 2265 2270
att caa aat caa tet ttt tea tac acc tgt att aaa gga gat ttt 28004 Ile Gin Asn Gin Ser Phe Ser Tyr Thr Cys Ile Lys Gly Asp Phe 2275 2280 2285
get atg cct gtc ccg ttc cgt gta aca tat aat cac gcc etg gaa 28049 Ala Met Pro Val Pro Phe Arg Val Thr Tyr Asn His Ala Leu Glu 2290 2295 2300
ggg tat tcc ett aag ttc acc tgg cgc gtt gta gcc aat cag gcc 28094 Gly Tyr Ser Leu Lys Phe Thr Trp Arg Val Val Ala Asn Gin Ala 2305 2310 2315
ttt gat att cct tgc tgt tea ttt tea tac atc aca gaa taa 28136 Phe Asp Ile Pro Cys Cys Ser Phe Ser Tyr Ile Thr Glu 2320 2325 2330
aaaaccactt tttcatttta atttcttttt attttacacg aacagtgaga cttcctccac 28196 ccttccattt gacagcatac accagcctct cccccttcat agcagtaaac tgttgtgaat 28256 cagtceggta tttgggagtt aaaatccaaa cagtctcttt ggtgatgaaa cgtcgatcag 28316 taatggacac aaatccctgg gacaggtttt ccaacgtttc ggtgaaaaac tgcacaccgc 28376 cctacaaaac aaacaggttc aggctctcca egggttatet ccccgatcaa actcagacag 28436 ggtaaaggtg eggtggtgtt ccactaaacc aegcaggtgg egetgtctga acctctcggt 28496 gcgactcctg tgaggctggt aagaagttag attgtccagt agcctcacag catgtatcat 28556 cagtctacga gtgcgtctgg cgcagcagcg catctgaatc tcactgagat teeggcaaga 28616 atcgcacacc atcacaatca ggttgttcat gatcccatag ctgaacacgc tccagccaaa 28676 gctcattcgc tccaacagcg ccaccgcgtg tccgtccaac cttactttaa cataaatcag 28736 gtgtctgeeg cgtacaaaca tgctacccac atacagaact tcccggggca ggcccctgtt 28796 caccacctgt ctgtaccagg gaaacctcac atttatcagg gagccataga tggccatttt 28856 aaaccaatta gctaataccg ccccaccagc tctacactga agagaaccgg gagagttaca 28916 atgacagtga ataatccatc tctcataacc cctgatggtc tgatgaaaat ctagatctaa 28976 cgtggcacaa caaatacaca ctttcatata cattttcata acatgttttt cccaggccgt 29036 taaaatacaa tcccaataca cgggccactc ctgcagtaca ataaagctaa tacaagatgg 29096 tatactcctc acctcactga cactgtgcat gttcatattt tcacattcta agtaccgaga 29156 gttctcctct acagcagcac tgctgcggtc ctcacaaggt ggtagctggt gatgattgta 29216 gggggccagt ctgcagcgat accgtctgtc gcgttgcatc gtagaccagg aaccgacgca 29276 cctcctcgta cttgtggtag cagaaccacg tccgctgcca gcacgtctcc acgtaacgcc 29336 ggtccctgcg tcgctcacgc tccctcctca atgcaaagtg caaccactct tgtaatccac 29396 acagatccct ctcggcctcc ggggtgatgc acacctcaaa cctacagatg tctcggtaca 29456 gttccaaaca cgtagtgagg gcgagttcca accaagacag acagcctgat ctatcccgac 29516 acactggagg tggaggaaga cacggaagag gcatgttatt ccaagcgatt caccaacggg 29576 tcgaaatgaa gatcccgaag atgacaacgg tcgcctccgg agccctgatg gaatttaaca 29636 gccagatcaa acgttatgcg attctccaag ctatcgatcg ccgcttccaa aagagcctgg 29696 acccgcactt ccacaaacac cagcaaagca aaagcactat tatcaaactc ttcaatcatc 29756 aagctgcagg actgtacaat gcctaagtaa ttttcgtttc tccactcgcg aatgatgtcg 29816 cggcagatag tctgaaggtt catcccgtgc agggtaaaaa gctccgaaag ggcgccctct 29876 acagccatgc gtagacacac catcatgact gcaagatatc gggctcctga gacacctgca 29936 gcagatttaa cagatcaagg tcaggttgct ctccgcgatc acgaatctcc atccgcaagg 29996 tcatttgcaa aaaattaaat aaatctatgc cgactagatc tgtcaactcc gcattaggaa 30056 ccaaatcagg tgtggctacg cagcacaaaa gttccaggga tggtgccaaa ctcactagaa 30116 ccgctcccga gtaacaaaac tgatgaatgg gagtaacaca gtgtaaaatg tgcaaccaaa 30176 aatcactaag gtgctccttt aaaaagtcca gtacttctat attcagtccg tgcaagtact 30236 gaagcaactg tgcgggaata tgcacaacaa aaaaaatagg gcggctcaga tacatgttga 30296 cctaaaataa aaagaatcat taaactaaag aagcttggcg aacggtggga taaatgacac 30356 gctccagcag cagacaggca accggctgtc cccgggaacc gcggtaaaat tcatccgaat 30416 gattaaaaag aacaacagaa acttcccacc atgtactcgg ttggatctcc tgagcacaca 30476 gcaatacccc cctcacattc atgtccgcca cagaaaaaaa acgtcccaga tacccagcgg 30536 ggatatccaa cgacagctgc aaagacagca aaacaatccc tctgggagcg atcacaaaat 30596 cctccggtga aaaaagcaca tacatattag aataaccctg ttgctggggc aaaaaggccc 30656 ggcgtcccag caaatgcaca taaatatgtt catcagccat tgccccgtct taccgcgtaa 30716 tcagccacga aaaaatcgag ctaaaattca cccaacagcc tatagctata tatacactcc 30776 gcccaatgac gctaataccg caccacccac gaccaaagtt cacccacacc cacaaaaccc 30836 gcgaaaatcc agcgccgtca gcacttccgc aatttcagtc tcacaacgtc acttccgcgc 30896 gccttttcac attcccacac acacccgcgc ccttcgcccc gccctcgcgc caccccgcgt 30956 caccgcacgt caccccggcc ccgcctcgct cctccccgct cattatcata ttggcacgtt 31016 tccagaataa ggtatattat tgatgatg 31044
<210> 30
<211> 505
<212> PRT
<213> simian adenovirus SV-25
<400> 30
Met Arg Arg Ala Val Arg Val Thr Pro Ala Ala Tyr Glu Gly Pro Pro 15 10 15
Pro Ser Tyr Glu Ser Val Met Gly Ser Ala Asn Val Pro Ala Thr Leu 20 25 30
Glu Ala Pro Tyr Val Pro Pro Arg Tyr Leu Gly Pro Thr Glu Gly Arg 35 40 45
Asn Ser Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Lys 50 55 60
Val Tyr Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu Asn Tyr 65 70 75 80
Gin Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gin Asn Asn Asp 85 90 95
Phe Thr Pro Thr Glu Ala Gly Thr Gin Thr Ile Asn Phe Asp Glu Arg 100 105 110
Ser Arg Trp Gly Gly Gin Leu Lys Thr Ile Leu His Thr Asn Met Pro 115 120 125
Asn Ile Asn Glu Phe Met Ser Thr Asn Lys Phe Arg Ala Lys Leu Met 130 135 140
Val Glu Lys Ser Asn Ala Glu Thr Arg Gin Pro Arg Tyr Glu Trp Phe 145 150 155 160 Glu Phe Thr Ile Pro Glu Gly Asn Tyr Ser Glu Thr Met Thr Ile Asp 165 170 175
Leu Met Asn Asn Ala Ile Val Asp Asn Tyr Leu Gin Val Gly Arg Gin 180 185 190
Asn Gly Val Leu Glu Ser Asp Ile Gly Val Lys Phe Asp Thr Arg Asn 195 200 205
Phe Arg Leu Gly Trp Asp Pro Val Thr Lys Leu Val Met Pro Gly Val 210 215 220
Tyr Thr Asn Glu Ala Phe His Pro Asp Ile Val Leu Leu Pro Gly Cys 225 230 235 240
Gly Val Asp Phe Thr Gin Ser Arg Leu Ser Asn Leu Leu Gly Ile Arg 245 250 255
Lys Arg Arg Pro Phe Gin Glu Gly Phe Gin Ile Met Tyr Glu Asp Leu 260 265 270
Glu Gly Gly Asn Ile Pro Ala Leu Leu Asp Val Ser Lys Tyr Glu Ala 275 280 285
Ser Ile Gin Arg Ala Lys Ala Glu Gly Arg Glu Ile Arg Gly Asp Thr 290 295 300
Phe Ala Val Ala Pro Gin Asp Leu Glu Ile Val Pro Leu Thr Lys Asp 305 310 315 320
Ser Lys Asp Arg Ser Tyr Asn Ile Ile Asn Asn Thr Thr Asp Thr Leu 325 330 335
Tyr Arg Ser Trp Phe Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly 340 345 350
Val Arg Ser Trp Thr Ile Leu Thr Thr Thr Asp Val Thr Cys Gly Ser 355 360 365
Gin Gin Val Tyr Trp Ser Leu Pro Asp Met Met Gin Asp Pro Val Thr 370 375 380
Phe Arg P ro Ser Thr Gin Val Ser Asn Phe Pro Val Val Gly Thr Glu 385 390 395 400
Leu Leu Pro Val His Ala Lys Ser Phe Tyr Asn Glu Gin Ala Val Tyr 405 410 415 Ser Gin Leu Ile Arg Gin Ser Thr Ala Leu Thr His Val Phe Asn Arg 420 425 430
Phe Pro Glu Asn Gin Ile Leu Val Arg Pro Pro Ala Pro Thr Ile Thr 435 440 445
Thr Val Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro 450 455 460
Leu Arg Ser Ser Ile Ser Gly Val Gin Arg Val Thr Ile Thr Asp Ala 465 470 475 480
Arg Arg Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly Val Val Ala 485 490 495
Pro Lys Val Leu Ser Ser Arg Thr Phe
500 505
<210> 31
<211> 921
<212> PRT
<213> simian adenovirus SV-25
<400> 31
Met Ala Thr Pro Ser Met Met Pro Gin Trp Ser Tyr Met His Ile Ala 15 10 15
Gly Gin Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gin Phe Ala 20 25 30
Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro 35 40 45
Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gin Arg Leu 50 55 60
Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Thr Ala Tyr Ser Tyr 65 70 75 80
Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met 85 90 95
Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Ser 100 105 110 Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly 115 120 125
Ala Pro Asn Pro Ser Glu Trp Thr Asp Thr Ser Asp Asn Lys Leu Lys 130 135 140
Ala Tyr Ala Gin Ala Pro Tyr Gin Ser Gin Gly Leu Thr Lys Asp Gly 145 150 155 160
Ile Gin Val Gly Leu Val Val Thr Glu Ser Gly Gin Thr Pro Gin Tyr 165 170 175
Ala Asn Lys Val Tyr Gin Pro Glu Pro Gin Ile Gly Glu Asn Gin Trp 180 185 190
Asn Leu Glu Gin Glu Asp Lys Ala Ala Gly Arg Val Leu Lys Lys Asp 195 200 205
Thr Pro Met Phe Pro Cys Tyr Gly Ser Tyr Ala Arg Pro Thr Asn Glu 210 215 220
Gin Gly Gly Gin Ala Lys Asn Gin Glu Val Asp Leu Gin Phe Phe Ala 225 230 235 240
Thr Pro Gly Asp Thr Gin Asn Thr Ala Lys Val Val Leu Tyr Ala Glu 245 250 255
Asn Val Asn Leu Glu Thr Pro Asp Thr His Leu Val Phe Lys Pro Asp 260 265 270
Asp Asp Ser Thr Ser Ser Lys Leu Leu Leu Gly Gin Gin Ala Ala Pro 275 280 285
Asn Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met 290 295 300
Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gin Ala Ser 305 310 315 320
Gin Leu Asn Ala Val Val Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser 325 330 335
Tyr Gin Leu Met Leu Asp Ala Leu Gly Asp Arg Ser Arg Tyr Phe Ser 340 345 350
Met Trp Asn Gin Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile 355 360 365 Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu 370 375 380
Gly Gly Met Val Val Thr Asp Asn Tyr Asn Ser Val Thr Pro Gin Asn 385 390 395 400
Gly Gly Ser Gly Asn Thr Trp Gin Ala Asp Asn Thr Thr Phe Ser Gin 405 410 415
Arg Gly Ala Gin Ile Gly Ser Gly Asn Met Phe Ala Leu Glu Ile Asn 420 425 430
Leu Gin Ala Asn Leu Trp Arg Gly Phe Leu Tyr Ser Asn Ile Gly Leu 435 440 445
Tyr Leu Pro Asp Ser Leu Lys Ile Thr Pro Asp Asn Ile Thr Leu Pro 450 455 460
Glu Asn Lys Asn Thr Tyr Gin Tyr Met Asn Gly Arg Val Thr Pro Pro 465 470 475 480
Gly Leu Ile Asp Thr Tyr Val Asn Val Gly Ala Arg Trp Ser Pro Asp 485 490 495
Val Met Asp Ser Ile Asn Pro Phe Asn His His Arg Asn Ala Gly Leu 500 505 510
Arg Tyr Arg Ser Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His 515 520 525
Ile Gin Val Pro Gin Lys Phe Phe Ala Ile Lys Asn Leu Leu Leu Leu 530 535 540
Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met 545 550 555 560
Ile Leu Gin Ser Ser Leu Gly Asn Asp Leu Arg Val Asp Gly Ala Ser 565 570 575
Ile Arg Phe Asp Ser Ile Asn Leu Tyr Ala Asn Phe Phe Pro Met Ala 580 585 590
His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn 595 600 605
Asp Gin Ser Phe Asn Asp Tyr Leu Cys Ala Ala Asn Met Leu Tyr Pro 610 615 620 Ile Pro Ala Asn Ala Thr Ser Val Pro Ile Ser Ile Pro Ser Arg Asn 625 630 635 640
Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg Leu Lys Thr Lys Glu 645 650 655
Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr Phe Val Tyr Ser Gly 660 665 670
Ser Ile Pro Tyr Leu Asp Gly Thr Phe Tyr Leu Asn His Thr Phe Lys 675 680 685
Lys Val Ser Ile Met Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp 690 695 700
Arg Leu Leu Thr Pro Asn Glu Phe Glu Ile Lys Arg Ser Val Asp Gly 705 710 715 720
Glu Gly Tyr Asn Val Ala Gin Ser Asn Met Thr Lys Asp Trp Phe Leu 725 730 735
Ile Gin Met Leu Ser His Tyr Asn Ile Gly Tyr Gin Gly Phe Tyr Val 740 745 750
Pro Glu Asn Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gin 755 760 765
Pro Met Ser Arg Gin Val Val Asp Thr Val Thr Tyr Thr Asp Tyr Lys 770 775 780
Asp Val Lys Leu Pro Tyr Gin His Asn Asn Ser Gly Phe Val Gly Tyr 785 790 795 800
Met Gly Pro Thr Met Arg Glu Gly Gin Ala Tyr Pro Ala Asn Tyr Pro 805 810 815
Tyr Pro Leu Ile Gly Glu Thr Ala Val Pro Ser Leu Thr Gin Lys Lys 820 825 830
Phe Leu Cys Asp Arg Val Met Trp Arg Ile Pro Phe Ser Ser Asn Phe 835 840 845
Met Ser Met Gly Ser Leu Thr Asp Leu Gly Gin Asn Met Leu Tyr Ala 850 855 860
Asn Ser Ala His Ala Leu Asp Met Thr Phe Glu Val Asp Pro Met Asp 865 870 875 880 Glu Pro Thr Leu Leu Tyr Val Leu Phe Glu Val Phe Asp Val Val Arg 885 890 895
Ile His Gin Pro His Arg Gly Val Ile Glu Ala Val Tyr Leu Arg Thr 900 905 910
Pro Phe Ser Ala Gly Asn Ala Thr Thr
915 920
<210> 32
<211> 347
<212> PRT
<213> simian adenovirus SV-25
<400> 32
Met Lys Ile Cys Val Val Ile Phe Ala Leu Ser Leu Ile Lys Thr Glu 15 10 15
Leu Phe Ala Val Pro Ser Thr Pro Arg Val Val Ser Pro Cys Glu Lys 20 25 30
Thr Pro Gly Val Leu Asn Leu His Ile Ala Lys Pro Leu Tyr Phe Thr 35 40 45
Ile Glu Lys Gin Leu Ala Leu Ser Ile Gly Lys Gly Leu Thr Ile Ser 50 55 60
Ala Thr Gly Gin Leu Glu Ser Thr Ala Ser Val Gin Asp Ser Ala Thr 65 70 75 80
Pro Pro Leu Arg Gly Ile Ser Pro Leu Lys Leu Thr Asp Asn Gly Leu 85 90 95
Thr Leu Ser Tyr Ser Asp Pro Leu Arg Val Val Gly Asp Gin Leu Thr 100 105 110
Phe Asn Phe Thr Ser Pro Leu Arg Tyr Glu Asn Gly Ser Leu Thr Phe 115 120 125
Asn Tyr Thr Ser Pro Met Thr Leu Ile Asn Asn Ser Leu Ala Ile Asn 130 135 140
Val Asn Thr Ser Lys Gly Leu Ser Ser Asp Asn Gly Thr Leu Ala Val 145 150 155 160 Asn Val Thr Pro Asp Phe Arg Phe Asn Ser Ser Gly Ala Leu Thr Phe 165 170 175
Gly Ile Gin Ser Leu Trp Thr Phe Pro Thr Lys Thr Pro Asn Cys Thr 180 185 190
Val Phe Thr Glu Ser Asp Ser Leu Leu Ser Leu Cys Leu Thr Lys Cys 195 200 205
Gly Ala His Val Leu Gly Ser Val Ser Leu Ser Gly Val Ala Gly Thr 210 215 220
Met Leu Lys Met Thr His Thr Ser Val Thr Val Gin Phe Ser Phe Asp 225 230 235 240
Asp Ser Gly Lys Leu Ile Phe Ser Pro Leu Ala Asn Asn Thr Trp Gly 245 250 255
Val Arg Gin Ser Glu Ser Pro Leu Pro Asn Pro Ser Phe Asn Ala Leu 260 265 270
Thr Phe Met Pro Asn Ser Thr Ile Tyr Ser Arg Gly Ala Ser Asn Glu 275 280 285
Pro Gin Asn Asn Tyr Tyr Val Gin Thr Tyr Leu Arg Gly Asn Val Arg 290 295 300
Lys Pro Ile Leu Leu Thr Val Thr Tyr Asn Ser Val Asn Ser Gly Tyr 305 310 315 320
Ser Leu Thr Phe Lys Trp Asp Ala Val Ala Asn Glu Lys Phe Ala Thr 325 330 335
Pro Thr Ser Ser Phe Cys Tyr Val Ala Glu Gin
340 345
<210> 33
<211> 559
<212> PRT
<213> simian adenovirus SV-25
<400> 33
Met Lys Arg Ala Arg Val Asp Glu Asp Phe Asn Pro Val Tyr Pro Tyr 15 10 15 Asp Pro Pro Tyr Ala Pro Val Met Pro Phe Ile Thr Pro Pro Phe Thr 20 25 30
Ser Ser Asp Gly Leu Gin Glu Lys Pro Leu Gly Val Leu Ser Leu Asn 35 40 45
Tyr Arg Asp Pro Ile Thr Thr Gin Asn Gly Ser Leu Thr Leu Lys Leu 50 55 60
Gly Asn Gly Leu Thr Leu Asn Asn Gin Gly Gin Leu Thr Ser Thr Ala 65 70 75 80
Gly Glu Val Glu Pro Pro Leu Thr Asn Ala Asn Asn Lys Leu Ala Leu 85 90 95
Ala Tyr Ser Glu Pro Leu Ala Val Lys Ser Asn Arg Leu Thr Leu Ser 100 105 110
His Thr Ala Pro Leu Val Ile Ala Asn Asn Ser Leu Ala Leu Gin Val 115 120 125
Ser Glu Pro Ile Phe Val Asn Asp Asp Asp Lys Leu Ala Leu Gin Thr 130 135 140
Ala Ala Pro Leu Val Thr Asn Ala Gly Thr Leu Arg Leu Gin Ser Ala 145 150 155 160
Ala Pro Leu Gly Leu Val Glu Asn Thr Leu Lys Leu Leu Phe Ser Lys 165 170 175
Pro Leu Tyr Leu Gin Asn Asp Phe Leu Ala Leu Ala Ile Glu Arg Pro 180 185 190
Leu Ala Val Ala Ala Ala Gly Thr Leu Thr Leu Gin Leu Thr Pro Pro 195 200 205
Leu Lys Thr Asn Asp Asp Gly Leu Thr Leu Ser Thr Val Glu Pro Leu 210 215 220
Thr Val Lys Asn Gly Asn Leu Gly Leu Gin Ile Ser Arg Pro Leu Val 225 230 235 240
Val Gin Asn Asn Gly Leu Ser Leu Ala Ile Thr Pro Pro Leu Arg Leu 245 250 255
Phe Asn Ser Asp Pro Val Leu Gly Leu Gly Phe Thr Phe Pro Leu Ala 260 265 270 Val Thr Asn Asn Leu Leu Ser Leu Asn Met Gly Asp Gly Val Lys Leu 275 280 285
Thr Tyr Asn Lys Leu Thr Ala Asn Leu Gly Arg Asp Leu Gin Phe Glu 290 295 300
Asn Gly Ala Ile Ala Val Thr Leu Thr Ala Glu Leu Pro Leu Gin Tyr 305 310 315 320
Thr Asn Lys Leu Gin Leu Asn Ile Gly Ala Gly Leu Arg Tyr Asn Gly 325 330 335
Ala Ser Arg Lys Leu Asp Val Asn Ile Asn Gin Asn Lys Gly Leu Thr 340 345 350
Trp Asp Asn Asp Ala Val Ile Pro Lys Leu Gly Ser Gly Leu Gin Phe 355 360 365
Asp Pro Asn Gly Asn Ile Ala Val Ile Pro Glu Thr Val Lys Pro Gin 370 375 380
Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys Ser Val Tyr Gin 385 390 395 400
Asp Leu Asp Ala Arg Leu Trp Leu Ala Leu Val Lys Ser Gly Asp Met 405 410 415
Val His Gly Ser Ile Ala Leu Lys Ala Leu Lys Gly Thr Leu Leu Asn 420 425 430
Pro Thr Ala Ser Tyr Ile Ser Ile Val Ile Tyr Phe Tyr Ser Asn Gly 435 440 445
Val Arg Arg Thr Asn Tyr Pro Thr Phe Asp Asn Glu Gly Thr Leu Ala 450 455 460
Asn Ser Ala Thr Trp Gly Tyr Arg Gin Gly Gin Ser Ala Asn Thr Asn 465 470 475 480
Val Thr Asn Ala Thr Glu Phe Met Pro Ser Ser Ser Arg Tyr Pro Val 485 490 495
Asn Lys Gly Asp Asn Ile Gin Asn Gin Ser Phe Ser Tyr Thr Cys Ile 500 505 510
Lys Gly Asp Phe Ala Met Pro Val Pro Phe Arg Val Thr Tyr Asn His 515 520 525 Ala Leu Glu Gly Tyr Ser Leu Lys Phe Thr Trp Arg Val Val Ala Asn 530 535 540
Gin Ala Phe Asp Ile Pro Cys Cys Ser Phe Ser Tyr Ile Thr Glu 545 550 555
<210> 34
<211> 34115
<212> DNA
<213> simian adenovirus SV-39
<220>
<221> CDS
<222> (13448)..(14959)
<223> L2 Penton
<220>
<221> CDS
<222> (17785)..(20538)
<223> L3 Hexon
<220>
<221> CDS
<222> (29515)..(31116)
<223> L5 Fiber #1
<400> 34
catcatcaat ataacaccgc aagatggcga ccgagttaac atgcaaatga ggtgggcgga 60 gttacgcgac ctttgtcttg ggaacgcgga agtgggcgcg gcgggtttcg gggaggagcg 120 c3333 c3333 cgggcgtgtc gcgcggcggt gacgcgccgg ggacccggaa attgagtagt 180 ttttattcat tttgcaagtt tttctgtaca ttttggcgcg aaaactgaaa cgaggaagtg 240 aaaagtgaaa aatgccgagg tagtcaccgg gtggagatct gacctttgcc gtgtggagtt 300 tacccgctga cgtgtgggtt tcggtctcta ttttttcact gtggttttcc gggtacggtc 360 aaaggtcccc attttatgac tccacgtcag ctgatcgcta gggtatttaa tgcgcctcag 420 accgtcaaga ggccactctt gagtgccggc gagaagagtt ttctcctccg cgttccgcca 480 actgtgaaaa aatgaggaac ttcttgctat ctccggggct gccagcgacc gtagccgccg 540 agctgttgga ggacattgtt accggagctc tgggagacga tcctcaggtg atttctcact 600 tttgtgaaga ttttagtctt catgatctct atgatattga tccgggtgtt gaggggcaag 660 aggatgaatg gctggagtct gtggatgggt tttttccgga cgctatgctg ctagaggctg 720 atttgccacc acctcacaac tctcacactg agcccgagtc agctgctatt cctgaattgt 780 catcaggtga acttgacttg gcttgttacg agactatgcc tccggagtcg gatgaggagg 840 acagcgggat cagcgatccc acggctttta tggtctctaa ggcgattgct atactaaaag 900 aagatgatga tggcgatgat ggatttcgac tggacgctcc ggcggtgccg gggagagact 960 gtaagtcctg tgaataccac cgggatcgta ccggagaccc gtctatgttg tgttctctgt 1020 gttatctccg tcttaacgct gcttttgtct acagtaagtg ttttgtgctt ttttaccctg 1080 tggctttgtt gagtttattt ttttctgtgt ctcatagggt gttgtttatt ataggtcctg 1140 tttcagatgt ggaggaacct gatagtacta ctggaaatga ggaggaaaag ccctccccgc 1200 cgaaactaac tcagcgctgc agacctaata ttttgagacc ctcggcccag cgtgtgtcat 1260 cccggaaacg tgctgctgtt aattgcatag aagatttatt ggaagagccc actgaacctt 1320 tggacttgtc cttaaagcga ccccgcccgc agtagggcgc ggtgccagtt ttttctctct 1380 agcttccggg tgactcagtg caataaaaat tttcttggca acaggtgtat gtgtttactt 1440 tacgggcggg aagggattag gggagtataa agctggaggg gaaaaatctg aggctgtcag 1500 atcgagtgag aagttccatg gacttgtacg agagcctaga gaatctaagt tctttgcgac 1560 gtttgctgga ggaggcctcc gacagaacct cttacatttg gaggtttctg ttcggttccc 1620 ctctgagtcg ctttttgcac cgggtgaagc gagagcacct gacggaattt gatgggcttt 1680 tagagcagct gcctggactg tttgattctt tgaatctcgg ccaccggacg ctgctagagg 1740 agaggctttt tccacaattg gacttttcct ctccaggccg tctgtgttca gcgcttgctt 1800 ttgctgtaca tctgttggac agatggaacg agcagacgca gctcagcccg ggttacactc 1860 tggacttcct gacgctatgc ctatggaagt tcggaatcag gagggggagg aagctgtacg 1920 ggcgcttggt ggagaggcat ccgtctctgc gccagcagcg tctgcaagct caagtgctgc 1980 tgaggcggga ggatctggaa gccatttcgg aggaggagag cggcatggaa gagaagaatc 2040 cgagagcggg gctggaccct ccggcggagg agtagggggg ataccggacc cttttcctga 2100 gttggctttg ggggcggtgg ggggcgcttc tgtggtacgt gaggatgaag aggggcgcca 2160 acgcggtcag aagagggagc attttgagtc ctcgactttc ttggctgatg taaccgtggc 2220 cctgatggcg aaaaacaggc tggaggtggt gtggtacccg gaagtatggg aggactttga 2280 gaagggggac ttgcacctgc tggaaaaata taactttgag caggtgaaaa catactggat 2340 gaacccggat gaggactggg aggtggtttt gaaccgatac ggcaaggtag ctctgcgtcc 2400 cgactgtcgc taccaggttc gcgacaaggt ggtcctgcga cgcaacgtgt acctgttggg 2460 caacggcgcc accgtggaga tggtggaccc cagaaggggt ggttttgtgg ccaatatgca 2520 agaaatgtgc cctggggtgg tgggcttgtc tggggtgact tttcatagtg tgaggtttag 2580 cggtagcaat tttgggggtg tggttattac cgcgaacact cctgtggtcc tgcataattg 2640 ctactttttt ggcttcagca acacctgtgt ggaaatgagg gtgggaggca aagtgcgcgg 2700 gtgttccttt tacgcttgct ggaagggggt ggtgagccag ggtaaggcta aagtgtctgt 2760 tcacaagtgt atgttggaga gatgcacctt gggcatttcc agtgagggct tcctccacgc 2820 cagcgacaac gtggcttctg acaacggctg cgcctttctt atcaagggag ggggtcgcat 2880 ctgtcacaac atgatatgcg gccctgggga tgtcccccca aagccttacc agatggttac 2940 ctgcacagat ggcaaggtgc gcatgctcaa gcctgtgcac attgtgggcc accggcgcca 3000 ccgctggcca gagtttgaac acaatgtgat gacccgctgt agcttgtacc tgggaggcag 3060 gcgaggagtt ttcttgccca gacagtgtaa cctggcccac tgcaacgtga tcatggaaca 3120 atccgccgct acccaggttt gctttggagg aatatttgat ataagcatgg tggtgtataa 3180 gatcctgcgc tacgacgact gtcgggctcg tactcgaacc tgcgactgcg gagcctctca 3240 cctgtgtaac ctgactgtga tggggatggt gactgaggag gtgcgactgg accactgtca 3300 gcactcttgc ctgcgggagg agttttcttc ctcggacgag gaggactagg taggtggttg 3360 gggcgtggcc agcgagaggg tgggctataa aggggaggtg tcggctgacg ctgtcttctg 3420 tttttcaggt accatgagcg gatcaagcag ccagaccgcg ctgagcttcg acggggccgt 3480 gtacagcccc tttctgacgg ggcgcttgcc tgcctgggcc ggagtgcgtc agaatgttac 3540 cggttcgacc gtggacggac gtcccgtgga tccatctaac gctgcttcta tgcgctacgc 3600 tactatcagc acatctactc tggacagcgc cgctgccgcc gcagccgcca cctcagccgc 3660 tctctccgcc gccaagatca tggctattaa cccaagcctt tacagccctg tatccgtgga 3720 cacctcagcc ctggagcttt accggcgaga tctagctcaa gtggtggacc aactcgcagc 3780 cgtgagccaa cagttgcagc tggtgtcgac ccgagtggag caactttccc gccctcccca 3840 gtaaccgcaa aaattcaata aacagaattt aataaacagc acttgagaaa agtttaaact 3900 tgtggttgac tttattcctg gatagctggg gggagggaac ggcgggaacg gtaagacctg 3960 gtccatcgtt cccggtcgtt gagaacacgg tggatttttt ccaagacccg atagaggtgg 4020 gtctgaacgt tgagatacat gggcatgagc ccgtctcggg ggtggaggta ggcccactgc 4080 agggcctcgt tttcaggggt ggtgttgtaa atgatccagt cgtaggcccc ccgctgggcg 4140 tggtgctgga agatgtcctt cagcagcaag ctgatggcaa cgggaagacc cttggtgtag 4200 gtgttgacaa agcggttgag ttgggagggg tgcatgcggg gactgatgag gtgcattttg 4260 gcctggatct tgaggttggc tatgttgccg cccagatcgc gcctgggatt catgttatgc 4320 aagaccacca gcaccgagta accggtgcag cgggggaatt tgtcgtgcag cttggaaggg 4380 aaagcgtgga agaatttgga gacccctcgg tgcccgccta ggttttccat gcactcatcc 4440 atgatgatgg cgatgggccc ccgggaggca gcctgggcaa aaacgttgcg ggggtccgtg 4500 acatcgtagt tgtggtcctg ggtgagttca tcataggaca ttttgacaaa gcgcgggcag 4560 agggtcccag actggggaat gatggttcca tccggtccgg gggcgtagtt gccctcgcag 4620 atttgcattt cccaggcttt gatttcagag ggagggatca tgtcaacctg gggggcgatg 4680 aaaaaaatgg tctctggggc gggggtgatg agctgggtgg aaagcaggtt gcgcaagagc 4740 tgtgacttgc cgcagccggt gggcccgtag atgacagcta tgacgggttg cagggtgtag 4800 tttagagagc tacaactgcc atcatccttc aaaagcgggg ccacactgtt taaaagttct 4860 ctaacatgta agttttcccg cactaagtcc tgcaggagac gtgaccctcc tagggagaga 4920 agttcaggaa gcgaagcaaa gtttttaagt ggcttgaggc catcggccaa gggcaagttc 4980 ctgagagttt gactgagcag ttccagccgg tcccagagct cggttacgtg ctctacggca 5040 tctcgatcca gcagacctcc tcgtttcggg ggttggggcg gctctggctg tagggaatga 5100 ggcggtgggc gtccagctgg gccatggtgc ggtccctcca tgggcgcagg gttctcttca 5160 gggtggtctc ggtcacggtg aatgggtggg ccccgggctg ggcgctggcc agggtgcgct 5220 tgaggctgag gcggctggtg gcgaaccgtt gcttttcgtc tccctgcaag tcagccaaat 5280 agcaacggac catgagctca tagtccaggc tctctgcggc atgtcctttg gcgcgaagct 5340 tgcctttgga aacgtgcccg cagtttgagc agagcaagca ttttagcgcg tagagttttg 5400 gcgccaagaa cacggattcc ggggaataag catccccacc gcagttggag caaacggttt 5460 cgcattccac cagccaggtc agctgaggat cttttgggtc aaaaaccaag cgcccgccgt 5520 tttttttgat gcgcttccta cctcgggtct ccatgaggcg gtgcccgcgt tcggtgacga 5580 agaggctgtc ggtgtctccg tagacggagg tcagggcgcg ctcctccagg ggggtcccgc 5640 ggtcctcggc gtagagaaac tcgcaccact ctgacataaa cgcccgggtc caggctagga 5700 cgaatgaggc gatgtgggaa gggtaccggt cgttatcgat gagggggtcg gttttttcca 5760 aggtgtgcag gcacatgtcc ccctcgtccg cttccaaaaa tgtgattggc ttgtaggtgt 5820 aagtcacgtg atcctgtcct tccgcggggg tataaaaggg ggcgtttccc ccctcctcgt 5880 cactctcttc cggttcgctg tcgccaaagg ccagctgttg gggtacgtaa acgcgggtga 5940 aggcgggcat gacctgtgcg ctgaggttgt cagtttctat atacgaggaa gatttgatgg 6000 cgagcgcccc cgtggagatg cccttgaggt gctcggggcc catttggtca gaaaacacaa 6060 tctgtcggtt atcaagcttg gtggcaaaag acccgtagag ggcgttggag agcaacttgg 6120 cgatggagcg ctgggtttgg tttttttccc ggtcggcttt ttccttggcc gcgatgttga 6180 gctggacgta ctccctggcc acgcacttcc agccgggaaa aacggccgtg cgctcgtccg 6240 gcaccagcct cacgctccat ccgcggttgt gcagggtgat gacgtcgatg ctggtggcca 6300 cctctccgcg caggggctcg ttggtccagc agaggcgacc gcccttgcga gagcagaagg 6360 ggggcagggg gtcaagcagg cgctcgtccg gggggtcggc gtcgatggta aagatggcgg 6420 gcagcaggtg tttgtcaaag taatcgatct gatgcccggg gcaacgcagg gcggtttccc 6480 agtcccgcac cgccaaggcg cgctcgtatg gactgagggg ggcgccccag ggcatgggat 6540 gcgtcagggc cgaggcgtac atgccgcaga tgtcatagac gtaaaggggc tcctccagga 6600 cgccgaggta ggtggggtag cagcgccccc cgcggatgct ggcccgtacg tagtcgtaga 6660 gctcgtgcga gggggccaga aggtggcggc tgaggtgagc gcgctggggc ttttcatctc 6720 ggaagaggat ctgcctgaag atggcgtggg agttggagga gatggtgggc cgctgaaaaa 6780 tgttgaagcg ggcgtcgggc agacccacgg cctcgccgat aaagtgggcg taggactctt 6840 gcagcttttc caccagggag gcggtgacca gcacgtccag agcgcagtag tccagggttt 6900 cccgcacgat gtcataatgc tcttcctttt tttccttcca gaggtctcgg ttgaagagat 6960 actcttcgcg gtctttccag tactcttgga gaggaaaccc gttttcgtct ccacggtaag 7020 agcccaacat gtaaaactgg ttgacggcct gatagggaca gcatcccttc tccacgggca 7080 gcgagtaggc cagggcggcc ttgcgcaggg aggtgtgagt cagggcaaag gtgtcgcgga 7140 ccataacttt tacaaactgg tacttaaagt cccggtcgtc gcacatgcct cgctcccagt 7200 ctgagtagtc tgtgcgcttt ttgtgcttgg ggttaggcag ggagtaggtg acgtcgttaa 7260 agaggatttt gccacatctg ggcataaagt tgcgagagat tctgaagggg ccgggcacct 7320 ccgagcggtt gttgatgact tgggcagcca ggagaatttc gtcgaagccg ttgatgttgt 7380 gccccacgac gtagaactct atgaaacgcg gagcgccgcg cagcaggggg cacttttcaa 7440 gttgctggaa agtaagttcc cgcggctcga cgccgtgttc cgtgcggctc cagtcctcca 7500 ccgggtttcg ctccacaaaa tcctgccaga tgtggtcgac tagcaagagc tgcagtcggt 7560 cgcgaaattc gcggaatttt ctgccgatgg cttgcttctg ggggttcaag caaaaaaagg 7620 tgtctgcgtg gtcgcgccag gcgtcccagc cgagctcgcg agccagattc agggccagca 7680 gcaccagagc cggctcaccg gtgattttca tgacgaggag aaagggcacc agctgttttc 7740 cgaacgcgcc catccaggtg taggtctcca cgtcgtaggt gagaaacaga cgttcggtcc 7800 gcgggtgcga tcccaggggg aaaaacttga tgggctgcca ccattgggag ctctgggcgt 7860 ggatgtgatg gaagtaaaag tcccggcggc gcgtggaaca ttcgtgctgg tttttgtaaa 7920 agcggccgca gtggtcgcag cgcgagacgg agtgaaggct gtgaatcagg tgaatcttgc 7980 gtcgctgagg gggccccaga gccaaaaagc ggagcgggaa cgaccgcgcg gccacttcgg 8040 cgtccgcagg caagatggat gagggttcca ccgttccccg cccgcggacc gaccagactt 8100 ccgccagctg cggcttcagt tcttgcacca gctctcgcag cgtttcgtcg ctgggcgaat 8160 cgtgaatacg gaagttgtcg ggtagaggcg ggaggcggtg gacttccagg aggtgtgtga 8220 gggccggcag gagatgcagg tggtacttga tttcccacgg atgacggtcg cgggcgtcca 8280 aggcgaagag atgaccgtgg ggccgcggcg ccaccagcgt tccgcggggg gtctttatcg 8340 gcggcgggga cgggctcccg gcggcagcgg cggctcggga cccgcgggca agtcgggcag 8400 cggcacgtcg gcgtggagct cgggcagggg ctggtgctgc gcgcggagct gactggcaaa 8460 ggctatcacc cggcgattga cgtcctggat ccggcggcgc tgcgtgaaga ccaccggacc 8520 cgtggtcttg aacctgaaag agagttcgac agaatcaatc tcggcatcgt taaccgcggc 8580 ctggcgcagg atttcggcca cgtccccgga gttgtcttga tacgcgattt ctgccatgaa 8640 ctggtcgatt tcctcttcct gcaagtctcc gtgaccggcg cgttcgacgg tggccgcgag 8700 atcgttggag atgcggccca tgagctggga aaaggcattg atgccgacct cgttccacac 8760 tcggctgtac accacctctc cgtgaacgtc gcgggcgcgc atcaccacct gggcgagatt 8820 gagttccacg tggcgggcga aaaccggata gtttcggagg cgctgataca gatagttgag 8880 ggtggtggcg gcgtgctcgg ccacaaaaaa atacatgatc cagcggcgga gggtcagctc 8940 gttgatgtcg cccagcgcct ccaggcgttc catggcctcg taaaagtcca cggcaaagtt 9000 gaaaaattgg ctgttcctgg ccgagaccgt gagctcttct tccaagagcc gaatgagatc 9060 cgccacggtg gccctgactt cgcgttcgaa agccccgggt gcctcctcca cctcttcctc 9120 ctcgacttct tcgaccgctt cgggcacctc ctcttcctcg accaccacct caggcggggc 9180 tcggcggcgc cggcggcgga cgggcaggcg gtcgacgaaa cgctcgatca tttcccccct 9240 ccgtcgacgc atggtctcgg tgacggcgcg accctgttcg cgaggacgca gggtgaaggc 9300 gccgccgccg agcggaggta acagggagat cggggggcgg tcgtggggga gactgacggc 9360 gctaactatg catctgatca atgtttgcgt agtgacctcg ggtcggagcg agctcagcgc 9420 ttgaaaatcc acgggatcgg aaaaccgttc caggaacgcg tctagccaat cacagtcgca 9480 aggtaagctg aggaccgtct cgggggcttg tctgttctgt cttcccgcgg tggtgctgct 9540 gatgaggtag ttgaagtagg cgctcttgag gcggcggatg gtggacagga gaaccacgtc 9600 tttgcgccca gcttgctgta tccgcaggcg gtcggccatg ccccacactt ctccttgaca 9660 gcggcggagg tccttgtagt attcttgcat cagcctttcc acgggcacct cgtcttcttc 9720 ttccgctcgg ccggacgaga gccgcgtcag gccgtacccg cgctgcccct gtggttggag 9780 cagggccagg tcggccacga cgcgctcggc cagcacggcc tgctggatgc gggtgagggt 9840 gtcctgaaag tcgtcgagat ccacaaagcg gtggtacgcg ccagtgttga tggtgtaggt 9900 gcagttgctc atgacggacc agtttacggt ctgggtgcca tggcccacgg tttccaggta 9960 gcggagacgc gagtaggccc gcgtctcgaa gatgtagtcg ttgcaggtcc gcagcaggta 10020 ctggtagccc accagcagat gcggcggcgg ctggcggtag aggggccacc gctgggtggc 10080 gggggcgttg ggggcgagat cttccaacat gaggcggtga tagccgtaga tgtagcgcga 10140 catccaagtg atgccgctgg ccgtggtgct ggcgcgggcg tagtcgcgaa cgcggttcca 10200 gatgtttcgc agcggctgga agtactcgat ggtggggcga ctctgccccg tgaggcgggc 10260 gcagtcggcg atgctctacg gggaaaaaga agggccagtg aacaaccgcc ttccgtagcc 10320 ggaggagaac gcaagggggt caaagaccac cgaggctcgg gttcgaaacc cgggtggcgg 10380 cccgaatacg gagggcggtt ttttgctttt ttctcagatg catcccgtgc tgcggcagat 10440 gcgtccgaac gcggggtccc agtccccggc ggtgcctgcg gccgtgacgg cggcttctac 10500 ggccacgtcg cgctccaccc cgcctaccac ggcccaggcg gcggtggctc tgcgcggcgc 10560 aggggaaccc gaagcagagg cggtgttgga cgtggaggag ggccaggggt tggctcggct 10620 gggggccctg agtcccgagc ggcacccgcg cgtggctctg aagcgcgacg cggcggaggc 10680 gtacgtgccg cggagcaatc tgtttcgcga ccgcagcggc gaggaggccg aggagatgcg 10740 agacttgcgt tttcgggcgg ggagggagtt gcgtcacggg ctggaccggc agagggttct 10800 gagagaggag gactttgagg cggacgagcg cacgggggtg agtcccgcgc gggctcacgt 10860 ggcggccgcc aacctggtga gcgcgtacga gcagacggtc aaggaggaga tgaacttcca 10920 gaagagcttc aatcatcacg tgcgcacgct gattgcgcgc gaagaggtgg ccatcggcct 10980 catgcatctg tgggattttg tggaggcgta cgttcagaac cccagcagca agccgctgac 11040 ggctcagctg ttcctcatcg tgcaacatag tcgagacaac gaaacgttca gggaggccat 11100 gctgaacatt gcagagcctg aggggcgctg gctcttggat ctcattaaca tcttgcagag 11160 tatcgtagtg caggagcgct cgctgagcct ggccgacaag gtggctgcca tcaactacag 11220 catgctgtcg ctgggcaaat tttacgcccg caagatctac aagtctccgt tcgtccccat 11280 agacaaggag gtgaagatag acagctttta catgcgcatg gcgctcaagg tgctgactct 11340 aagcgacgac ctgggggtgt accgcaacga ccgcatacac aaggcggtga gcgccagccg 11400 ccggcgcgag ctgagcgacc gcgagctttt gcacagcctg catcgggcgt tgactggtgc 11460 cggcagcgcc gaggcggccg agtactttga cgccggagcg gacttgcgct ggcagccatc 11520 ccgacgcgcg ctggaggcgg ctggcgtcgg ggagtacggg gtcgaggacg acgatgaagc 11580 ggacgacgag ttgggcattg acttgtagcc gtttttcgtt agatatgtcg gcgaacgagc 11640 cgtctgcggc cgccatggtg acggcggcgg gcgcgcccca ggacccggcc acgcgcgcgg 11700 cgctgcagag tcagccttcc ggagtgacgc ccgcggacga ctggtccgag gccatgcgtc 11760 gcatcctggc gctgacggcg cgcaaccccg aggcttttcg gcagcagccg caggcaaacc 11820 ggtttgcggc cattttggaa gcggtggtgc cctccagacc caaccccacc cacgaaaagg 11880 tgctggccat cgtcaacgcc ctggcggaga ccaaggccat ccgcccagac gaggccgggc 11940 aggtttacaa cgcgctgcta gaaagggtgg gacgctacaa cagctccaac gtgcagacca 12000 atctggaccg cttggtgacg gacgtgaagg aggccgtagc ccagcgagag cggtttttca 12060 aggaagccaa tctgggctcg ctggtggccc tcaacgcctt cctgagcacg ctgccggcga 12120 acgtgccccg cggtcaggag gactacgtga actttctgag cgccctccgc ctgatggtgg 12180 ccgaggtgcc gcagagcgag gtgtaccagt ctggccccaa ctactacttc cagacctccc 12240 ggcagggcct gcagacggta aacctgacgc aggcctttca gaacctgcag ggcctttggg 12300 gggtgcgcgc tccgctgggc gaccgcagca cggtgtccag cctgctgacc cccaatgccc 12360 ggctgctctt gcttctcatt gctccgttca ccgacagcgg ttccatcagc cgcgactctt 12420 acctgggaca cctgctcacc ctgtaccggg aggccatcgg gcaggcgcgg gtggacgagc 12480 agacgtacca ggaaatcacc agcgtgagcc gcgcgctggg gcaggaggac acgggcagct 12540 tggaggcgac tctgaacttc ctgctgacca accggcggca gcgcctacct ccccagtacg 12600 cgctgaacgc ggaggaggag cgcatcctgc gtttcgtgca gcagagcacc gcgctgtact 12660 tgatgcggga aggcgcctct cccagcgctt cgctggacat gacggcggcc aacatggagc 12720 catcgttcta cgccgccaac cgtcccttcg tcaaccggct aatggactat ttgcatcggg 12780 cggcggccct gaacccggaa tactttacta acgtcatcct gaacgaccgt tggctgccac 12840 ctcccggctt ctacacgggg gagttcgacc tcccggaggc caacgacggt ttcatgtggg 12900 acgacgtgga cagcgtgttc ctgcccggca agaaggaggc gggtgactct cagagccacc 12960 gcgcgagcct cgcagacctg ggggcgaccg ggcccgcgtc tccgctgcct cgcctgccga 13020 gcgccagcag cgccagcgtg gggcgggtga gccgtccgcg cctcagcggt gaggaggact 13080 ggtggaacga tccgctgctc cgtccggccc gcaacaaaaa cttccccaac aacgggatag 13140 aggatttggt agacaaaatg aaccgttgga agacgtatgc ccaggagcat cgggagtggc 13200 aggcgaggca acccatgggc cctgttctgc cgccctctcg gcgcccgcgc agggacgaag 13260 acgccgacga ttcagccgat gacagcagcg tgttggatct gggcgggagc gggaacccct 13320 ttgcccacct gcaacctcgc ggcgtgggtc ggcggtggcg ctaggaaaaa aaattattaa 13380 aagcacttac cagagccatg gtaagaagag caacaaaggt gtgtcctgct ttcttcccgg 13440 tagcaaa atg cgt egg gcg gtg gca gtt ccc tcc gcg gca atg gcg tta 13489
Met Arg Arg Ala Val Ala Val Pro Ser Ala Ala Met Ala Leu 15 10
ggc ccg ccc cct tet tac gaa age gtg atg gca gcg gcc acc etg caa 13537 Gly Pro Pro Pro Ser Tyr Glu Ser Val Met Ala Ala Ala Thr Leu Gin 15 20 25 30
gcg ccg ttg gag aat cct tac gtg ccg ccg ega tac etg gag cct acg 13585 Ala Pro Leu Glu Asn Pro Tyr Val Pro Pro Arg Tyr Leu Glu Pro Thr 35 40 45
ggc ggg aga aac age att cgt tac tcg gag etg acg ccc etg tac gac 13633 Gly Gly Arg Asn Ser Ile Arg Tyr Ser Glu Leu Thr Pro Leu Tyr Asp 50 55 60 acc acc cgc etg tac etg gtg gac aac aag tea gca gat atc gcc acc 13681 Thr Thr Arg Leu Tyr Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Thr 65 70 75
ttg aac tac cag aac gac cac age aac ttt etc acg tcc gtg gtg cag 13729 Leu Asn Tyr Gin Asn Asp His Ser Asn Phe Leu Thr Ser Val Val Gin 80 85 90
aac age gac tac acg ccc gcc gaa gcg age acg cag acc att aac ttg 13777 Asn Ser Asp Tyr Thr Pro Ala Glu Ala Ser Thr Gin Thr Ile Asn Leu 95 100 105 110
gac gac cgc tcg cgc tgg ggc ggg gac ttg aaa acc att etg cac act 13825 Asp Asp Arg Ser Arg Trp Gly Gly Asp Leu Lys Thr Ile Leu His Thr 115 120 125
aac atg ccc aac gtg aac gag ttc atg ttt acc aac tcg ttc agg get 13873 Asn Met Pro Asn Val Asn Glu Phe Met Phe Thr Asn Ser Phe Arg Ala 130 135 140
aaa ett atg gtg gcg cac gag gcc gac aag gac ccg gtt tat gag tgg 13921 Lys Leu Met Val Ala His Glu Ala Asp Lys Asp Pro Val Tyr Glu Trp 145 150 155
gtg cag etg acg etg ccg gag ggg aac ttt tea gag att atg acc ata 13969 Val Gin Leu Thr Leu Pro Glu Gly Asn Phe Ser Glu Ile Met Thr Ile 160 165 170
gac etg atg aac aac gcc att atc gac cac tac etg gcg gta gcc aga 14017 Asp Leu Met Asn Asn Ala Ile Ile Asp His Tyr Leu Ala Val Ala Arg 175 180 185 190
cag cag ggg gtg aaa gaa age gag atc ggc gtc aag ttt gac acg cgc 14065 Gin Gin Gly Val Lys Glu Ser Glu Ile Gly Val Lys Phe Asp Thr Arg 195 200 205
aac ttt cgt etg ggc tgg gac ccg gag acg ggg ett gtg atg ccg ggg 14113 Asn Phe Arg Leu Gly Trp Asp Pro Glu Thr Gly Leu Val Met Pro Gly 210 215 220
gtg tac acg aac gaa get ttc eat ccc gac gtg gtc etc ttg ccg ggc 14161 Val Tyr Thr Asn Glu Ala Phe His Pro Asp Val Val Leu Leu Pro Gly 225 230 235
tgc ggg gtg gac ttt acc tac age egg tta aac aac etg eta ggc ata 14209 Cys Gly Val Asp Phe Thr Tyr Ser Arg Leu Asn Asn Leu Leu Gly Ile 240 245 250
cgc aag aga atg ccc ttt cag gaa ggg ttt cag atc etg tac gag gac 14257 Arg Lys Arg Met Pro Phe Gin Glu Gly Phe Gin Ile Leu Tyr Glu Asp 255 260 265 270
etg gag ggc ggt aac atc ccg gcc etg etg gac gtg ccg gcg tac gag 14305 Leu Glu Gly Gly Asn Ile Pro Ala Leu Leu Asp Val Pro Ala Tyr Glu 275 280 285
gag age atc gcc aac gca agg gag gcg gcg atc agg ggc gat aat ttc 14353 Glu Ser Ile Ala Asn Ala Arg Glu Ala Ala Ile Arg Gly Asp Asn Phe 290 295 300
gcg gcg cag ccc cag gcg get cca acc ata aaa ccc gtt ttg gaa gac 14401 Ala Ala Gin Pro Gin Ala Ala Pro Thr Ile Lys Pro Val Leu Glu Asp 305 310 315
tcc aaa ggg egg age tac aac gta ata gcc aac acc aac aac acg get 14449 Ser Lys Gly Arg Ser Tyr Asn Val Ile Ala Asn Thr Asn Asn Thr Ala 320 325 330
tac agg age tgg tat etg get tat aac tac ggc gac ccg gag aag ggg 14497 Tyr Arg Ser Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly 335 340 345 350
gtt agg gcc tgg acc etg etc acc act ccg gac gtg acg tgc ggt tea 14545 Val Arg Ala Trp Thr Leu Leu Thr Thr Pro Asp Val Thr Cys Gly Ser 355 360 365
gag cag gtc tac tgg tcg etg cct gac atg tac gtg gac cct gtg acg 14593 Glu Gin Val Tyr Trp Ser Leu Pro Asp Met Tyr Val Asp Pro Val Thr 370 375 380
ttt cgc tcc acg cag caa gtt age aac tac cca gtg gtg gga gcg gag 14641 Phe Arg Ser Thr Gin Gin Val Ser Asn Tyr Pro Val Val Gly Ala Glu 385 390 395
ett atg ccg att cac age aag age ttt tac aac gag cag gcc gtc tac 14689 Leu Met Pro Ile His Ser Lys Ser Phe Tyr Asn Glu Gin Ala Val Tyr 400 405 410
tea cag etc att cgt cag acc acc gcc eta acg cac gtt ttc aac cgc 14737 Ser Gin Leu Ile Arg Gin Thr Thr Ala Leu Thr His Val Phe Asn Arg 415 420 425 430
ttc ccc gag aac caa atc eta gtg ega cct cca gcg ccc acc atc acc 14785 Phe Pro Glu Asn Gin Ile Leu Val Arg Pro Pro Ala Pro Thr Ile Thr 435 440 445
acc gtc age gag aac gtg ccc get eta acc gat cac ggg acg etg cct 14833 Thr Val Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro 450 455 460
ttg cag aac age atc cgc gga gtt cag ega gtt acc atc acg gac gcc 14881 Leu Gin Asn Ser Ile Arg Gly Val Gin Arg Val Thr Ile Thr Asp Ala 465 470 475
cgt cgt egg acc tgt ccc tac gtc tac aaa gcc ttg gga atc gtg gcc 14929 Arg Arg Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly Ile Val Ala 480 485 490
ccg cgc gtc etg tcg agt cgc act ttc tag atgtccatcc tcatctctcc 14979 Pro Arg Val Leu Ser Ser Arg Thr Phe
495 500
cagcaacaat aceggttggg gtctgggegt gaccaaaatg taeggaggeg ccaaacgacg 15039 gtccccacaa catcccgtgc gagtgegegg gcactttaga gccccatggg ggtcgcacac 15099 gcgcgggcgc accggccgaa ccaccgtcga egaegtgate gatagegtgg tggccgacgc 15159 ccgcaactac cagcccgctc gatccaeggt ggacgaagtc ategaeggeg tggtggcega 15219 cgccagggcc tacgcccgca gaaagtctcg tctgcgccgc cgccgttcgc taaagcgccc 15279 cacggccgcc atgaaageeg ctcgctctct gctgcgtcgc geaegtateg tgggtcgeeg 15339 cgccgccaga cgcgcagccg ccaacgccgc cgccggccga gtgcgccgcc gggccgccca 15399 gcaggccgcc gccgccatct ccagtctatc cgccccccga egegggaatg tgtactgggt 15459 cagggactcg gccaccggcg tgcgagttcc cgtgagaacc cgtcctcctc gtccctgaat 15519 aaaaagttct aagcccaatc ggtgttccgt tgtgtgttca gctcgtcatg accaaacgca 15579 agtttaaaga ggagctgctg caagcgctgg tccccgaaat ctatgcgccg gcgccggacg 15639 tgaaaccgcg tcgcgtgaaa cgcgtgaaga agcaggaaaa gctagagaca aaagaggagg 15699 cggtggcgtt gggagacggg gaggtggagt ttgtgcgctc gttcgcgccg cgtcggcgag 15759 tgaattggaa ggggcgcaag gtgcaacggg tgctgcgtcc cggcacggtg gtgtctttca 15819 ccccgggtga aaaatccgcc tggaagggca taaagcgcgt gtacgatgag gtgtacgggg 15879 acgaagacat tctggagcag gcgctggata gaagcgggga gtttgcttac ggcaagaggg 15939 cgaggacggg cgagatcgcc atcccgctgg acacttccaa ccccaccccc agtctgaaac 15999 ccgtgacgct gcaacaggtg ttgccggtga gcgccccctc gcgacgcggc ataaaacgcg 16059 agggcggcga gctgcagccc accatgcagc tcctggttcc caagaggcag aaactagagg 16119 acgtactgga catgataaaa atggagcccg acgtgcagcc cgatattaaa atccgtccca 16179 tcaaagaagt ggcgccggga atgggcgtgc agaccgtgga catccagatt cccatgacca 16239 gcgccgcaca ggcggtagag gccatgcaga ccgacgtggg gatgatgacg gacctgcccg 16299 cagctgctgc cgccgtggcc agcgccgcga cgcaaacgga agccggcatg cagaccgacc 16359 cgtggacgga ggcgcccgtg cagccggcca gaagacgcgt cagacggacg tacggccccg 16419 tttctggcat aatgccggag tacgcgctgc atccttccat catccccacc cccggctacc 16479 gggggcgcac ctaccgtccg cgacgcagca ccactcgccg ccgtcgccgc acggcacgag 16539 tcgccaccgc cagagtgaga cgcgtaacga cacgtcgcgg ccgccgcttg accctgcccg 16599 tggtgcgcta ccatcccagc attctttaaa aaaccgctcc tacgttgcag atgggcaagc 16659 ttacttgtcg actccgtatg gccgtgcccg gctaccgagg aagatcccgc cgacgacgga 16719 ctttgggagg cagcggtttg cgccgccgtc gggcggttca ccggcgcctc aagggaggca 16779 ttctgccggc cctgatcccc ataatcgccg cagccatcgg ggccattccc ggaatcgcca 16839 gcgtagcggt gcaggctagc cagcgccact gattttacta accctgtcgg tcgcgccgtc 16899 tctttcggca gactcaacgc ccagcatgga agacatcaat ttctcctctc tggccccgcg 16959 gcacggcacg cggccgtata tggggacgtg gagcgagatc ggcacgaacc agatgaacgg 17019 gggcgctttc aattggagcg gtgtgtggag cggcttgaaa aatttcggtt ccactctgaa 17079 aacttacggc aaccgggtgt ggaactccag cacggggcag atgctgaggg acaagctaaa 17139 ggacacgcag tttcagcaaa aggtggtgga cggcatcgct tcgggcctca acggcgccgt 17199 cgacctggcc aaccaggcca ttcaaaagga aattaacagc cgcctggagc cgcggccgca 17259 ggtggaggag aacctgcccc ctctggaggc gctgcccccc aagggagaga agcgcccgcg 17319 gcccgacatg gaggagacgc tagttactaa gagcgaggag ccgccatcat acgaggaggc 17379 ggtgggtagc tcgcagctgc cgtccctcac gctgaagccc accacctatc ccatgaccaa 17439 gcccatcgcc tccatggcgc gccccgtggg agtcgacccg cccatcgacg cggtggccac 17499 tttggacctg ccgcgccccg aacccggcaa ccgcgtgcct cccgtcccca tcgctccgcc 17559 ggtttctcgc cccgccatcc gccccgtcgc cgtggccact ccccgctatc cgagccgcaa 17619 cgccaactgg cagaccaccc tcaacagtat tgtcggactg ggggtgaagt ctctgaagcg 17679 ccgtcgctgt ttttaaagca caatttatta aacgagtagc cctgtcttaa tccatcgttg 17739 tatgtgtgcc tatatcacgc gttcagagcc tgaccgtccg tcaag atg gcc act ccg 17796
Met Ala Thr Pro 505
tcg atg atg ccg cag tgg tcg tac atg cac atc gcc ggg cag gac gcc 17844 Ser Met Met Pro Gin Trp Ser Tyr Met His Ile Ala Gly Gin Asp Ala 510 515 520
tcg gag tac etg age ccg ggt etg gtg cag ttt gcc cgt gcg acg gaa 17892 Ser Glu Tyr Leu Ser Pro Gly Leu Val Gin Phe Ala Arg Ala Thr Glu 525 530 535
acc tac ttc tea etg ggc aac aag ttc agg aac ccc acc gtg gcg ccc 17940 Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro Thr Val Ala Pro 540 545 550 555
acc cac gac gtc acc acc gat egg tcc cag ega etg aca atc cgc ttc 17988 Thr His Asp Val Thr Thr Asp Arg Ser Gin Arg Leu Thr Ile Arg Phe 560 565 570
gtc ccc gtg gac aag gaa gac acc get tac tcc tac aaa acc cgc ttc 18036 Val Pro Val Asp Lys Glu Asp Thr Ala Tyr Ser Tyr Lys Thr Arg Phe 575 580 585
acg etg gcc gtg ggc gac aac egg gtg eta gac atg gcc agt acc tac 18084 Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met Ala Ser Thr Tyr 590 595 600
ttt gac atc cgc ggc gtg atc gac cgc gga cct age ttc aag cct tac 18132 Phe Asp Ile Arg Gly Val Ile Asp Arg Gly Pro Ser Phe Lys Pro Tyr 605 610 615
tcc ggc acg get tac aac tea etg get ccc aaa ggg gcg ccc aac aac 18180 Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly Ala Pro Asn Asn 620 625 630 635
age caa tgg aac gcc aca gat aac ggg aac aag cca gtg tgt ttt get 18228 Ser Gin Trp Asn Ala Thr Asp Asn Gly Asn Lys Pro Val Cys Phe Ala 640 645 650
cag gca get ttt ata ggt caa age att aca aaa gac gga gtg caa ata 18276 Gin Ala Ala Phe Ile Gly Gin Ser Ile Thr Lys Asp Gly Val Gin Ile 655 660 665
cag aac tea gaa aat caa cag get get gcc gac aaa act tac caa cca 18324 Gin Asn Ser Glu Asn Gin Gin Ala Ala Ala Asp Lys Thr Tyr Gin Pro 670 675 680
gag cct caa att gga gtt tcc acc tgg gat acc aac gtt acc agt aac 18372 Glu Pro Gin Ile Gly Val Ser Thr Trp Asp Thr Asn Val Thr Ser Asn 685 690 695
get gcc gga ega gtg tta aaa gcc acc act ccc atg etg cca tgt tac 18420 Ala Ala Gly Arg Val Leu Lys Ala Thr Thr Pro Met Leu Pro Cys Tyr 700 705 710 715
ggt tea tat gcc aat ccc act aat cca aac ggg ggt cag gca aaa aca 18468 Gly Ser Tyr Ala Asn Pro Thr Asn Pro Asn Gly Gly Gin Ala Lys Thr 720 725 730
gaa gga gac att tcg eta aac ttt ttc aca aca act gcg gca gca gac 18516 Glu Gly Asp Ile Ser Leu Asn Phe Phe Thr Thr Thr Ala Ala Ala Asp 735 740 745
aat aat ccc aaa gtg gtt ett tac age gaa gat gta aac ett caa gcc 18564 Asn Asn Pro Lys Val Val Leu Tyr Ser Glu Asp Val Asn Leu Gin Ala 750 755 760
ccc gat act cac tta gta tat aag cca acg gtg gga gaa aac gtt atc 18612 Pro Asp Thr His Leu Val Tyr Lys Pro Thr Val Gly Glu Asn Val Ile 765 770 775
gcc gca gaa gcc etg eta acg cag cag gcg tgt ccc aac aga gca aac 18660 Ala Ala Glu Ala Leu Leu Thr Gin Gin Ala Cys Pro Asn Arg Ala Asn 780 785 790 795
tac ata ggt ttc ega gat aac ttt atc ggt tta atg tat tat aac age 18708 Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser 800 805 810
aca ggg aac atg gga gtt etg gca ggt cag gcc tcg cag tta aac gca 18756 Thr Gly Asn Met Gly Val Leu Ala Gly Gin Ala Ser Gin Leu Asn Ala 815 820 825
gtt gta gac etg caa gat ega aac acg gaa etg tcc tat cag eta atg 18804 Val Val Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser Tyr Gin Leu Met 830 835 840
eta gat get etg ggt gac aga act ega tat ttc tea atg tgg aat cag 18852 Leu Asp Ala Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gin 845 850 855
gcc gtg gac age tac gat cca gac gtt agg att atc gag aac eat ggg 18900 Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly 860 865 870 875
gtg gaa gac gag etg ccc aat tac tgt ttt cca etc cca ggc atg ggt 18948 Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Pro Gly Met Gly 880 885 890
att ttt aac tcc tac aag ggg gta aaa cca caa aat ggc ggt aat ggt 18996 Ile Phe Asn Ser Tyr Lys Gly Val Lys Pro Gin Asn Gly Gly Asn Gly 895 900 905
aac tgg gaa gca aac ggg gac eta tea aat gcc aat gag atc get tta 19044 Asn Trp Glu Ala Asn Gly Asp Leu Ser Asn Ala Asn Glu Ile Ala Leu 910 915 920
gga aac att ttt gcc atg gaa att aac etc cac gca aac etg tgg cgc 19092 Gly Asn Ile Phe Ala Met Glu Ile Asn Leu His Ala Asn Leu Trp Arg 925 930 935
age ttc ttg tac age aat gtg gcg etg tac etg cca gac age tat aaa 19140 Ser Phe Leu Tyr Ser Asn Val Ala Leu Tyr Leu Pro Asp Ser Tyr Lys 940 945 950 955 ttc act ccc get aac atc act etg ccc gcc aac caa aac acc tac gag 19188 Phe Thr Pro Ala Asn Ile Thr Leu Pro Ala Asn Gin Asn Thr Tyr Glu 960 965 970
tat atc aac ggg cgc gtc act tet cca acc etg gtg gac acc ttt gtt 19236 Tyr Ile Asn Gly Arg Val Thr Ser Pro Thr Leu Val Asp Thr Phe Val 975 980 985
aac att gga gcc ega tgg tcg ccg gat ccc atg gac aac gtc aac ccc 19284 Asn Ile Gly Ala Arg Trp Ser Pro Asp Pro Met Asp Asn Val Asn Pro 990 995 1000
ttt aac eat cac egg aac gcg ggc etc cgt tac cgc tcc atg etg 19329 Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met Leu 1005 1010 1015
etg gga aat gga cgc gtg gtg cct ttc cac ata caa gtg ccg caa 19374 Leu Gly Asn Gly Arg Val Val Pro Phe His Ile Gin Val Pro Gin 1020 1025 1030
aaa ttt ttc gcg att aag aac etc etg ett ttg ccc ggc tcc tac 19419 Lys Phe Phe Ala Ile Lys Asn Leu Leu Leu Leu Pro Gly Ser Tyr 1035 1040 1045
act tac gag tgg age ttc aga aaa gac gtg aac atg att etg cag 19464 Thr Tyr Glu Trp Ser Phe Arg Lys Asp Val Asn Met Ile Leu Gin 1050 1055 1060
age acc etg ggc aat gat ett ega gtg gac ggg gcc age gtc cgc 19509 Ser Thr Leu Gly Asn Asp Leu Arg Val Asp Gly Ala Ser Val Arg 1065 1070 1075
att gac age gtc aac ttg tac gcc aac ttt ttc ccc atg gcg cac 19554 Ile Asp Ser Val Asn Leu Tyr Ala Asn Phe Phe Pro Met Ala His 1080 1085 1090
aac acc get tet acc ttg gaa gcc atg etg ega aac gac acc aac 19599 Asn Thr Ala Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn 1095 1100 1105
gac cag tcg ttt aac gac tac etc age gcg gcc aac atg ett tat 19644 Asp Gin Ser Phe Asn Asp Tyr Leu Ser Ala Ala Asn Met Leu Tyr 1110 1115 1120
ccc att ccg gcc aac gcc acc aac gtt ccc att tcc att ccc tcc 19689 Pro Ile Pro Ala Asn Ala Thr Asn Val Pro Ile Ser Ile Pro Ser 1125 1130 1135
cgc aac tgg gcg gcc ttc egg gga tgg age ttc acc cgc ett aaa 19734 Arg Asn Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg Leu Lys 1140 1145 1150
gcc aag gaa acg cct tcc ttg ggc tcc ggc ttt gac ccc tac ttt 19779 Ala Lys Glu Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr Phe 1155 1160 1165
gtg tac tea ggc acc att cct tac etg gac ggc age ttt tac etc 19824 Val Tyr Ser Gly Thr Ile Pro Tyr Leu Asp Gly Ser Phe Tyr Leu 1170 1175 1180
aac cac act ttc aaa cgt etg tcc atc atg ttc gat tet tcc gta 19869 Asn His Thr Phe Lys Arg Leu Ser Ile Met Phe Asp Ser Ser Val 1185 1190 1195
agt tgg ccg ggc aac gac cgc etc etg acg ccg aac gag ttc gaa 19914 Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro Asn Glu Phe Glu 1200 1205 1210
att aag cgc att gtg gac ggg gaa ggc tac aac gtg get caa agt 19959 Ile Lys Arg Ile Val Asp Gly Glu Gly Tyr Asn Val Ala Gin Ser 1215 1220 1225
aac atg acc aaa gac tgg ttt tta att caa atg etc age cac tac 20004 Asn Met Thr Lys Asp Trp Phe Leu Ile Gin Met Leu Ser His Tyr 1230 1235 1240
aac atc ggc tac caa ggc ttc tat gtt ccc gag ggc tac aag gat 20049 Asn Ile Gly Tyr Gin Gly Phe Tyr Val Pro Glu Gly Tyr Lys Asp 1245 1250 1255
egg atg tat tet ttc ttc ega aac ttt cag ccc atg age cgc cag 20094 Arg Met Tyr Ser Phe Phe Arg Asn Phe Gin Pro Met Ser Arg Gin 1260 1265 1270
gtg ccg gat ccc acc get gcc ggc tat caa gcc gtt ccc etg ccc 20139 Val Pro Asp Pro Thr Ala Ala Gly Tyr Gin Ala Val Pro Leu Pro 1275 1280 1285
aga caa cac aac aac tcg ggc ttt gtg ggg tac atg ggc ccg acc 20184 Arg Gin His Asn Asn Ser Gly Phe Val Gly Tyr Met Gly Pro Thr 1290 1295 1300
atg cgc gaa gga cag cca tac ccg gcc aac tac ccc tat ccc etg 20229 Met Arg Glu Gly Gin Pro Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu 1305 1310 1315
atc ggc get acc gcc gtc ccc gcc att acc cag aaa aag ttt ttg 20274 Ile Gly Ala Thr Ala Val Pro Ala Ile Thr Gin Lys Lys Phe Leu 1320 1325 1330
tgc gac cgc gtc atg tgg cgc ata cct ttt tcc age aac ttt atg 20319 Cys Asp Arg Val Met Trp Arg Ile Pro Phe Ser Ser Asn Phe Met 1335 1340 1345
tea atg ggg gcc etg acc gac etc gga cag aac atg ett tac get 20364 Ser Met Gly Ala Leu Thr Asp Leu Gly Gin Asn Met Leu Tyr Ala 1350 1355 1360
aac tcc gcc eat gcc etg gat atg act ttt gag gtg gac ccc atg 20409 Asn Ser Ala His Ala Leu Asp Met Thr Phe Glu Val Asp Pro Met 1365 1370 1375
aac gag ccc acg ttg etg tac atg ett ttt gag gtg ttc gac gtg 20454 Asn Glu Pro Thr Leu Leu Tyr Met Leu Phe Glu Val Phe Asp Val 1380 1385 1390
gtc aga gtg cac cag ccg cac cgc ggt att atc gag gcc gtg tac 20499 Val Arg Val His Gin Pro His Arg Gly Ile Ile Glu Ala Val Tyr 1395 1400 1405
etg cgc acc ccc ttc tet gcg ggc aat gcc acc aca taa gccgctgaac 20548 Leu Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr
1410 1415 1420
tagctggttt ttaccccaga tcccatgggc tecaeggaag aegaactgeg ggccattgtg 20608 egagaectgg gctgcggacc ctacttcctg ggcacctttg aeaageggtt tcccgggttc 20668 gtgtctcctc gcaaactcgc gtgcgcgatc gtgaataccg ccggccgaga gaccggagga 20728 gagcattggc tagctctggg ctggaacccc cgctcgtcca cgtttttcct gttcgacccc 20788 tttggctttt cagaccaacg cttgaagcag atctatgcat ttgaatatga gggtctactc 20848 aagcgaagcg cgctggcctc ctccgccgat cactgtctaa ccctggtaaa gagcactcag 20908 acggttcagg gccctcacag cgccgcctgt ggcctttttt gttgcatgtt tttgcacgcc 20968 tttgtgaact ggccggacac ccccatggaa aacaacccca ccatggacct cctgactggc 21028 gttcccaact ccatgctcca aagccccagc gtgcagacca ccctcctcca aaaccagaaa 21088 aatctgtacg cctttctgca caagcactct ccctactttc gccgccatcg ggaacaaata 21148 gaaaatgcaa ccgcgtttaa caaaactctg taacgtttaa taaatgaact ttttattgaa 21208 ctggaaaacg ggtttgtgat ttttaaaaat caaaggggtt gagctggaca tccatgtggg 21268 aggccggaag ggtggtgttc ttgtactggt acttgggcag ccacttaaac tctggaatca 21328 caaacttggg cagcggtatt tctgggaagt tgtcgtgcca cagctggcgg gtcagctgaa 21388 gtgcctgcag aacatcgggg gcggagatct tgaagtcgca gtttatctgg ttcacggcac 21448 gcgcgttgcg gtacatggga ttggcacact gaaacaccag caggctggga ttcttgatgc 21508 tagccagggc cacggcgtcg gtcacgtcac cggtgtcttc tatgttggac agcgaaaaag 21568 gcgtgacttt gcaaagctgg cgtcccgcgc gaggcacgca atctcccagg tagttgcact 21628 cacagcggat gggcagaaga agatgcttgt ggccgcgggt catgtaggga taggccgctg 21688 ccataaaagc ttcgatctgc ctgaaagcct gcttggcctt gtgcccttcg gtataaaaaa 21748 caccgcagga cttgttggaa aaggtattac tggcgcaagc ggcatcgtga aagcaagcgc 21808 gtgcgtcttc gtttcgtaac tgcaccacgc tgcggcccca ccggttctga atcaccttgg 21868 ccctgccggg gttttccttg agagcgcgct ggccggcttc gctgcccaca tccatttcca 21928 cgacatgctc cttgttaatc atggccagac cgtggaggca gcgcagctcc tcgtcatcgt 21988 cggtgcagtg atgctcccac acgacgcagc cagtgggctc ccacttgggc ttggaggcct 22048 cggcaatgcc agaatacagg agaacgtagt ggtgcagaaa acgtcccatc atggtgccaa 22108 aggttttctg gctgctgaag gtcatcgggc agtacctcca gtcctcgtta agccaagtgt 22168 tgcagatctt cctgaagacc gtgtactgat cgggcataaa gtggaactca ttgcgctcgg 22228 tcttgtcgat cttatacttt tccatcagac tatgcataat ctccatgccc ttttcccagg 22288 cgcaaacaat cttggtgcta cacgggttag gtatggccaa agtggttggc ctctgaggcg 22348 gcgcttgttc ttcctcttga gccctctccc gactgacggg ggttgaaaga gggtgcccct 22408 tggggaacgg cttgaacacg gtctggcccg aggcgtcccg aagaatctgc atcgggggat 22468 tgctggccgt catggcgatg atctgacccc ggggctcctc cacttcgtcc tcctcgggac 22528 tttcctcgtg cttttcgggg gacggtacgg gagtaggggg aagagcgcgg cgcgccttct 22588 tcttgggcgg cagttccgga gcctgctctt gacgactggc cattgtcttc tcctaggcaa 22648 gaaaaacaag atggaagact ctttctcctc ctcctcgtca acgtcagaaa gcgagtcttc 22708 caccttaagc gccgagaact cccagcgcat agaatccgat gtgggctacg agactccccc 22768 cgcgaacttt tcgccgcccc ccataaacac taacgggtgg acggactacc tggccctagg 22828 agacgtactg ctgaagcaca tcaggcggca gagcgttatc gtgcaagatg ctctcaccga 22888 gcgactcgcg gttccgctgg aagtggcgga acttagcgcc gcctacgagc gaaccctctt 22948 ctccccaaag actcccccca agaggcaggc taacggcacc tgcgagccta accctcgact 23008 caacttctac cctgcctttg ccgtgccaga ggtactggct acgtaccaca tttttttcca 23068 aaaccacaaa atccctctct cgtgccgcgc caaccgcacc aaagccgatc gcgtgctgcg 23128 actggaggaa ggggctcgca tacctgagat tgcgtgtctg gaggaagtcc caaaaatctt 23188 tgaaggtctg ggccgcgacg aaaagcgagc agcaaacgct ctggaagaga acgcagagag 23248 tcacaacagc gccttggtag aactcgaggg cgacaacgcc agactggccg tcctcaaacg 23308 gtccatagaa gtcacgcact tcgcctaccc cgccgttaac ctccctccaa aagttatgac 23368 agcggtcatg gactcgctgc tcataaagcg cgctcagccc ttagacccag agcacgaaaa 23428 caacagtgac gaaggaaaac cggtggtttc tgatgaggag ttgagcaagt ggctgtcctc 23488 caacgacccc gccacgttgg aggaacgaag aaaaaccatg atggccgtgg tgctagttac 23548 cgtgcaatta gaatgtctgc agaggttctt ttcccaccca gagaccctga gaaaagtgga 23608 ggaaacgctg cactacacat ttaggcacgg ctacgtgaag caagcctgca agatttccaa 23668 cgtagaactt agcaacctca tctcctacct ggggatcttg cacgaaaacc gcctcggaca 23728 aaacgtgctg cacagcacac tgaaaggaga agcccgccga gactatgtgc gagactgcgt 23788 gttcctagcg ctagtgtaca cctggcagag cggaatggga gtctggcagc agtgcctgga 23848 ggacgaaaac ctcaaagagc ttgaaaagct gctggtgcgc tccagaaggg cactgtggac 23908 cagttttgac gagcgcaccg ccgcgcgaga cctagctgat attatttttc ctcccaagct 23968 ggtgcagact ctccgggaag gactgccaga ttttatgagt caaagcatct tgcaaaactt 24028 ccgctctttc atcttggaac gctcgggaat cttgcccgcc actagctgcg ccctacccac 24088 agattttgtg cctctccact accgcgaatg cccaccgccg ctgtggccgt acacttactt 24148 gcttaaactg gccaactttc taatgttcca ctctgacctg gcagaagacg ttagcggcga 24208 ggggctgcta gaatgccact gccgctgcaa cctgtgcacc ccccaccgct ctctagtatg 24268 caacactccc ctgctcaatg agacccagat catcggtacc tttgaaatcc agggaccctc 24328 cgacgcggaa aacggcaagc aggggtctgg gctaaaactc acagccggac tgtggacctc 24388 cgcctacttg cgcaaatttg taccagaaga ctatcacgcc caccaaatta aattttacga 24448 aaaccaatca aaaccaccca aaagcgagtt aacggcttgc gtcattacgc agagcagcat 24508 agttgggcag ttgcaagcca ttaacaaagc gcggcaagag tttctcctaa aaaaaggaaa 24568 aggggtctac ttggaccccc agaccggcga ggaactcaac ggaccctcct cagtcgcagg 24628 ttgtgtgccc catgccgccc aaaaagaaca cctcgcagtg gaacatgcca gagacggagg 24688 aagaggagtg gagcagtgtg agcaacagcg aaacggagga agagccgtgg cccgaggggt 24748 gcaacgggga agaggacacg gagggacggc gaagtcttcg ccgaagaact ctcgccgctg 24808 cccccgaagt cccagccggc cgcctcggcc caagatcccg cacacacccg tagatgggat 24868 agcaagacca aaaagccggg taagagaaac gctcgccccc gccagggcta ccgctcgtgg 24928 agaaagcaca aaaactgcat cttatcgtgc ttgctccagt gcggcggaga cgtttcgttc 24988 acccgtagat acttgctttt taacaaaggg gtggccgtcc cccgtaacgt cctccactac 25048 taccgtcact cttacagctc cgaagcggac ggctaagaaa acgcagcagt tgccggcggg 25108 aggactgcgt ctcagcgccc gagaaccccc agccaccagg gagctccgaa accgcatatt 25168 tcccaccctc tacgctatct ttcagcaaag ccgggggcag cagcaagaac tgaaaataaa 25228 aaaccgcacg ctgaggtcgc ttacccgaag ctgcctctat cacaagagcg aagagcagct 25288 gcagcgaacc ctggaggacg cagaagcgct gttccagaag tactgcgcga ccaccctaaa 25348 taactaaaaa agcccgcgcg cgggacttca aaccgtctga cgtcaccagc cgcgcgccaa 25408 aatgagcaaa gagattccca cgccttacat gtggagttac cagccgcaga tgggattagc 25468 cgccggcgcc gcccaggatt actccacgaa aatgaactgg ctcagcgccg ggccccacat 25528 gatttcccgc gtaaacgaca ttcgcgccca ccgcaatcag ctattgttag aacaggctgc 25588 tctgaccgcc acgccccgta ataacctgaa ccctcccagc tggccagctg ccctggtgta 25648 ccaggaaacg cctccaccca ccagcgtact tttgccccgt gacgcccagg cggaagtcca 25708 gatgactaac gcgggcgcgc aattagcggg cggatcccgg tttcggtaca gagttcacgg 25768 cgccgcaccc tatagcccag gtataaagag gctgatcatt cgaggcagag gtgtccagct 25828 caacgacgag acagtgagct cttcgcttgg tctacgacca gacggagtgt tccagctcgc 25888 gggctcgggc cgctcttcgt tcacgcctcg ccaggcatac ctgactctgc agagctctgc 25948 ctctcagcct cgctcgggag gaatcggacc ccttcagttt gtggaggagt ttgtgccctc 26008 ggtctacttt cagcctttct ccggatcgcc cggccagtac ccggacgagt tcatccccaa 26068 cttcgacgcg gtgagtgact ctgtggacgg ttatgactga tgtcgagccc gcttcagtgc 26128 tagtggaaca agcgcggctc aatcacctgg ttcgttgccg ccgccgctgc tgcgtggctc 26188 gcgacttgag cttagctctc aagtttgtaa aaaacccgtc cgaaaccggg agcgctgtgc 26248 acgggttgga gctagtgggt cctgagaagg ccaccatcca cgttctcaga aactttgtgg 26308 aaaaacccat tttggttaaa cgagatcagg ggccttttgt aatcagctta ctctgcacct 26368 gtaaccatgt tgaccttcac gactatttta tggatcattt gtgcgctgaa ttcaataagt 26428 aaagcgaatt cttaccaaga ttatgatgtc catgactgtt cctcgccact atacgatgtt 26488 gtgccagtaa actctcttgt cgacatctat ctgaactgtt ccttttggtc cgcacagctt 26548 acttggtact acggtgacac cgtcctttct ggctcactgg gcagctcaca cggaataaca 26608 cttcacctct tttcgccgtt tcgatacgga aactacagct gtcgtgccgg tacctgcctc 26668 cacgttttca atcttcagcc ctgtccaccg accaaacttg tatttgtcga ctctaagcac 26728 ttacagctca actgcagcat tctaggcccc agtatcttgt ggacatacaa taaaatcagg 26788 ttggtggaat ttgtctacta cccacccagc gcccgcggtt ttggggaaat tcctttccag 26848 atctactaca actatcttgc cacacattat gcaagtcaac agcaactaaa cttgcaagca 26908 cccttcacgc caggagagta ctcctgtcac gtaggctcct gcacagaaac ttttattctc 26968 ttcaacagat cttctgccat tgaacgcttc actactaact actttagaaa ccaagttgtg 27028 cttttcactg acgaaacccc taacgtcacc ctggactgtg catgtttttc tcatgacacc 27088 gtaacttgga ctcttaacaa tactctctgg ctcgcgttcg ataaccaaag cttgattgtt 27148 aaaaattttg atttaacctt tactaaaccc tctcctcgcg aaatagttat ctttgctcct 27208 tttaatccaa aaactacctt agcctgtcag gttttgttta agccttgcca aacaaacttt 27268 aagtttgttt atttgcctcc gcaatctgtc aaactcatag aaaaatacaa caaagcgccc 27328 gtcttggctc ctaaaacctt ctaccactgg ctaacctaca cggggctgtt tgcactaatt 27388 gtttttttcc taattaacat ttttatatgt ttcttgcctt cctccttctt ttcgcgaaca 27448 ccgttgccgc agaaagacct ctccttatta ctgtagcgct tgctatacaa aaccaagagt 27508 ggtcaaccgt gctctcaatc tattttcaat ttttcatttt gtccttaata ctttctctta 27568 ttgtcgttaa caatgatctg gagcattggt ctcgcctttt tttggctgct tagtgcaaaa 27628 gccactattt ttcacaggta tgtggaagaa ggaactagca ccctctttac gatacctgaa 27688 acaattaagg cggctgatga agtttcttgg tacaaaggct cgctctcaga cggcaaccac 27748 tcattctcag gacagaccct ttgcatccaa gaaacttatt ttaaatcaga actacaatac 27808 agctgcataa aaaacttttt ccatctctac aacatctcaa aaccctatga gggtatttac 27868 aatgccaagg tttcagacaa ctccagcaca cggaactttt actttaatct gacagttatt 27928 aaagcaattt ccattcctat ctgtgagttt agctcccagt ttctttctga aacctactgt 27988 ttaattacta taaactgcac taaaaatcgc cttcacacca ccataatcta caatcacaca 28048 caatcacctt gggttttaaa cctaaaattt tctccacaca tgccttcgca atttctcacg 28108 caagttaccg tctctaacat aagcaagcag tttggctttt actatccttt ccacgaactg 28168 tgcgaaataa ttgaagccga atatgaacca gactacttta cttacattgc cattggtgta 28228 atcgttgttt gcctttgctt tgttattggg gggtgtgttt atttgtacat tcagagaaaa 28288 atattgctct cgctgtgctc ctgcggttac aaagcagaag aaagaattaa aatctctaca 28348 ctttattaat gttttccaga aatggcaaaa ctaacgctcc tacttttgct tctcacgccg 28408 gtgacgcttt ttaccatcac tttttctgcc gccgccacac tcgaacctca atgtttgcca 28468 ccggttgaag tctactttgt ctacgtgttg ctgtgctgcg ttagcgtttg cagtataaca 28528 tgttttacct ttgtttttct tcagtgcatt gactacttct gggtcagact ctactaccgc 28588 agacacgcgc ctcagtatca aaatcaacaa attgccagac tactcggtct gccatgattg 28648 tcttgtattt taccctgatt ttttttcacc ttacttgcgc ttgtgatttt cacttcactc 28708 aattttggaa aacgcaatgc ttcgacccgc gcctctccaa cgactggatg atggctcttg 28768 caattgccac gcttggggcg tttggacttt ttagtggttt tgctttgcat tacaaattta 28828 agactccatg gacacatggc tttctttcag attttccagt tacacctact ccgccgcctc 28888 ccccggccat cgacgtgcct caggttccct caccttctcc atctgtctgc agctactttc 28948 atctgtaatg gccgacctag aatttgacgg agtgcaatct gagcaaaggg ctatacactt 29008 ccaacgccag tcggaccgcg aacgcaaaaa cagagagctg caaaccatac aaaacaccca 29068 ccaatgtaaa cgcgggatat tttgtattgt aaaacaagct aagctccact acgagcttct 29128 atctggcaac gaccacgagc tccaatacgt ggtcgatcag cagcgtcaaa cctgtgtatt 29188 cttaattgga gtttccccca ttaaagttac tcaaaccaag ggtgaaacca agggaaccat 29248 aaggtgctca tgtcacctgt cagaatgcct ttacactcta gttaaaaccc tatgtggctt 29308 acatgattct atccccttta attaaataaa cttactttaa atctgcaatc acttcttcgt 29368 ccttgttttt gtcgccatcc agcagcacca ccttcccctc ttcccaactt tcatagcata 29428 ttttccgaaa agaggcgtac tttcgccaca ccttaaaggg aacgtttact tcgctttcaa 29488 gctctcccac gattttcatt gcagat atg aaa cgc gcc aaa gtg gaa gaa gga 29541
Met Lys Arg Ala Lys Val Glu Glu Gly 1425
ttt aac ccc gtt tat ccc tat gga tat tet act ccg act gac gtg 29586 Phe Asn Pro Val Tyr Pro Tyr Gly Tyr Ser Thr Pro Thr Asp Val 1430 1435 1440
get cct ccc ttt gta gcc tet gac ggt ett caa gaa aac cca cct 29631 Ala Pro Pro Phe Val Ala Ser Asp Gly Leu Gin Glu Asn Pro Pro 1445 1450 1455
ggg gtc ttg tcc eta aaa ata tcc aaa cct tta act ttt aat gcc 29676 Gly Val Leu Ser Leu Lys Ile Ser Lys Pro Leu Thr Phe Asn Ala 1460 1465 1470
tcc aag get eta age etg get att ggt cca gga tta aaa att caa 29721 Ser Lys Ala Leu Ser Leu Ala Ile Gly Pro Gly Leu Lys Ile Gin 1475 1480 1485
gat ggt aaa eta gtg ggg gag gga caa gca att ett gca aac etg 29766 Asp Gly Lys Leu Val Gly Glu Gly Gin Ala Ile Leu Ala Asn Leu 1490 1495 1500
ccg ett caa atc acc aac aac aca att tea eta cgt ttt ggg aac 29811 Pro Leu Gin Ile Thr Asn Asn Thr Ile Ser Leu Arg Phe Gly Asn 1505 1510 1515
aca ett gcc ttg aat gac aat aat gaa etc caa acc aca eta aaa 29856 Thr Leu Ala Leu Asn Asp Asn Asn Glu Leu Gin Thr Thr Leu Lys 1520 1525 1530
tet tea tcg ccc ett aaa atc aca gac cag act etg tcc ett aac 29901 Ser Ser Ser Pro Leu Lys Ile Thr Asp Gin Thr Leu Ser Leu Asn 1535 1540 1545
ata ggg gac age ett gca att aaa gat gac aaa eta gaa age get 29946 Ile Gly Asp Ser Leu Ala Ile Lys Asp Asp Lys Leu Glu Ser Ala 1550 1555 1560
ett caa gcg acc etc cca etc tcc att age aac aac acc atc age 29991 Leu Gin Ala Thr Leu Pro Leu Ser Ile Ser Asn Asn Thr Ile Ser 1565 1570 1575
etc aac gtg ggc acc gga etc acc ata aat gga aac gtt tta caa 30036 Leu Asn Val Gly Thr Gly Leu Thr Ile Asn Gly Asn Val Leu Gin 1580 1585 1590
get gtt ccc tta aat get eta agt ccc eta act att tcc aac aat 30081 Ala Val Pro Leu Asn Ala Leu Ser Pro Leu Thr Ile Ser Asn Asn 1595 1600 1605
aac atc age etg cgc tat ggc agt tcc etg acg gtg ett aac aat 30126 Asn Ile Ser Leu Arg Tyr Gly Ser Ser Leu Thr Val Leu Asn Asn 1610 1615 1620
gaa etg caa age aac etc aca gtt cac tcc cct tta aaa etc aac 30171 Glu Leu Gin Ser Asn Leu Thr Val His Ser Pro Leu Lys Leu Asn 1625 1630 1635
tcc aac aac tea att tet etc aac act eta tet ccg ttt aga atc 30216 Ser Asn Asn Ser Ile Ser Leu Asn Thr Leu Ser Pro Phe Arg Ile 1640 1645 1650
gag aat ggt ttc etc acg etc tat ttg gga aca aaa tet ggc ttg 30261 Glu Asn Gly Phe Leu Thr Leu Tyr Leu Gly Thr Lys Ser Gly Leu 1655 1660 1665
eta gtt caa aac agt ggc tta aaa gtt caa gcg ggc tac ggc etg 30306 Leu Val Gin Asn Ser Gly Leu Lys Val Gin Ala Gly Tyr Gly Leu 1670 1675 1680
caa gta aca gac acc aat get etc aca tta aga tat etc get cca 30351 Gin Val Thr Asp Thr Asn Ala Leu Thr Leu Arg Tyr Leu Ala Pro 1685 1690 1695
etg acc att cca gac tcg ggc tea gaa caa ggc att ett aaa gta 30396 Leu Thr Ile Pro Asp Ser Gly Ser Glu Gin Gly Ile Leu Lys Val 1700 1705 1710
aac act gga cag ggc eta agt gtg aac caa get gga gcg ett gaa 30441 Asn Thr Gly Gin Gly Leu Ser Val Asn Gin Ala Gly Ala Leu Glu 1715 1720 1725
aca tcc eta gga ggt gga tta aaa tat get gat aac aaa ata acc 30486 Thr Ser Leu Gly Gly Gly Leu Lys Tyr Ala Asp Asn Lys Ile Thr 1730 1735 1740
ttt gat aca gga aac gga etg aca tta tet gaa aat aaa ett gca 30531 Phe Asp Thr Gly Asn Gly Leu Thr Leu Ser Glu Asn Lys Leu Ala 1745 1750 1755
gta get gca ggt agt ggt eta act ttt aga gat ggt gcc ttg gta 30576 Val Ala Ala Gly Ser Gly Leu Thr Phe Arg Asp Gly Ala Leu Val 1760 1765 1770
gcc acg gga acc gca ttt acg caa aca etg tgg act acg get gat 30621 Ala Thr Gly Thr Ala Phe Thr Gin Thr Leu Trp Thr Thr Ala Asp 1775 1780 1785
ccg tet ccc aac tgc aca att ata cag gac cgc gac aca aaa ttt 30666 Pro Ser Pro Asn Cys Thr Ile Ile Gin Asp Arg Asp Thr Lys Phe 1790 1795 1800
act ttg gcg ett acc att agt ggg age caa gtg etg ggg acg gtt 30711 Thr Leu Ala Leu Thr Ile Ser Gly Ser Gin Val Leu Gly Thr Val 1805 1810 1815
tcc att att gga gta aaa ggc ccc ett tea agt age ata ccg tea 30756 Ser Ile Ile Gly Val Lys Gly Pro Leu Ser Ser Ser Ile Pro Ser 1820 1825 1830
get acc gtt aca gta caa ett aac ttt gat tcc aac gga gcc eta 30801 Ala Thr Val Thr Val Gin Leu Asn Phe Asp Ser Asn Gly Ala Leu 1835 1840 1845
ttg age tcc tet tea ett aaa ggt tac tgg ggg tat cgc caa ggt 30846 Leu Ser Ser Ser Ser Leu Lys Gly Tyr Trp Gly Tyr Arg Gin Gly 1850 1855 1860
ccc tea att gac cct tac ccc ata att aat gcc tta aac ttt atg 30891 Pro Ser Ile Asp Pro Tyr Pro Ile Ile Asn Ala Leu Asn Phe Met 1865 1870 1875
cca aac tea etg get tat ccc ccg gga caa gaa atc caa gca aaa 30936 Pro Asn Ser Leu Ala Tyr Pro Pro Gly Gin Glu Ile Gin Ala Lys 1880 1885 1890
tgt aac atg tac gtt tet act ttt tta ega gga aat cca caa aga 30981 Cys Asn Met Tyr Val Ser Thr Phe Leu Arg Gly Asn Pro Gin Arg 1895 1900 1905
cca ata gtt tta aac atc act ttt aat aat caa acc age ggg ttt 31026 Pro Ile Val Leu Asn Ile Thr Phe Asn Asn Gin Thr Ser Gly Phe 1910 1915 1920
tcc att aga ttt aca tgg aca aat tta acc aca gga gaa gca ttt 31071 Ser Ile Arg Phe Thr Trp Thr Asn Leu Thr Thr Gly Glu Ala Phe 1925 1930 1935
gca atg ccc cca tgc act ttt tcc tac att get gaa caa caa taa 31116 Ala Met Pro Pro Cys Thr Phe Ser Tyr Ile Ala Glu Gin Gin 1940 1945 1950
actatgtaac cctcaccgtt aacccgcctc cgcccttcca ttttatttta taaaccaccc 31176 gatccacctt ttcagcagta aacaattgca tgtcagtagg ggcagtaaaa cttttgggag 31236 ttaaaatcca cacaggttct teacaageta agegaaaate agttacactt ataaaaccat 31296 cgctaacatc ggacaaagac aagcatgagt ccaaagcttc eggttctgga tcagattttt 31356 gttcattaac agegggagaa aeagettctg gaggattttc catctccatc tccttcatca 31416 gttccaccat gtccaccgtg gtcatctggg acgagaacga cagttgtcat acacctcata 31476 agtcaccggt cgatgacgaa cgtacagatc tcgaagaatg tcctgtcgcc gcctttcggc 31536 agcactgggc cgaaggcgaa agcgcccatg tttaacaatg gccagcaccg cccgcttcat 31596 caggcgccta gttcttttag cgcaacagcg catgcgcagc tcgctaagac tggcgcaaga 31656 aacacagcac agaaccacca gattgttcat gatcccataa gcgtgctgac accagcccat 31716 actaacaaat tgtttcacta ttctagcatg aatgtcatat ctgatgttca agtaaattaa 31776 atggcgcccc cttatgtaaa cacttcccac gtacaacacc tcctttggca tctgataatt 31836 aaccacctcc cgataccaaa tacatctctg attaatagtc gccccgtaca ctacccgatt 31896 aaaccaagtt gccaacataa tcccccctgc catacactgc aaagaacctg gacggctaca 31956 atgacagtgc aaagtccaca cctcgttgcc atggataact gaggaacgcc ttaagtcaat 32016 agtggcacaa ctaatacaaa catgtaaata gtgtttcaac aagtgccact cgtatgaggt 32076 gagtatcatg tcccagggaa cgggccactc cataaacact gcaaaaccaa cacatcctac 32136 catcccccgc acggcactca catcgtgcat ggtgttcata tcacagtccg gaagctgagg 32196 acaaggaaaa gtctcgggag cattttcata gggcggtagt gggtactcct tgtaggggtt 32256 cagtcggcac cggtatctcc tcaccttctg ggccataaca cacaagttga gatctgattt 32316 caaggtactt tctgaatgaa aaccaagtgc tttcccaaca atgtatccga tgtcttcggt 32376 ccccgcgtcg gtagcgctcc ttgcagtaca cacggaacaa ccactcacgc aggcccagaa 32436 gacagttttc cgcggacggt gacaagttaa tccccctcag tctcagagcc aatatagttt 32496 cttccacagt agcataggcc aaacccaacc aggaaacaca agctggcacg tcccgttcaa 32556 cgggaggaca aggaagcaga ggcagaggca taggcaaagc aacagaattt ttattccaac 32616 tggtcacgta gcacttcaaa caccaggtca cgtaaatggc agcgatcttg ggtttcctga 32676 tggaacataa cagcaagatc aaacatgaga cgattctcaa ggtgattaac cacagctgga 32736 attaaatcct ccacgcgcac atttagaaac accagcaata caaaagcccg gttttctccg 32796 ggatctatca tagcagcaca gtcatcaatt agtcccaagt aattttcccg tttccaatct 32856 gttataattt gcagaataat gccctgtaaa tccaagccgg ccatggcgaa aagctcagat 32916 aatgcacttt ccacgtgcat tcgtaaacac accctcatct tgtcaatcca aaaagtcttc 32976 ttcttgagaa acctgtagta aattaagaat cgccaggtta ggctcgatgc ctacatcccg 33036 gagcttcatt ctcagcatgc actgcaaatg atccagcaga tcagaacagc aattagcagc 33096 cagctcatcc ccggtttcca gttccggagt tcccacggca attatcactc gaaacgtggg 33156 acaaatcgaa ataacatgag ctcccacgtg agcaaaagcc gtagggccag tgcaataatc 33216 acagaaccag cggaaaaaag attgcagctc atgtttcaaa aagctctgca gatcaaaatt 33276 cagctcatgc aaataacaca gtaaagtttg cggtatagta accgaaaacc acacgggtcg 33336 acgttcaaac atctcggctt acctaaaaaa gaagcacatt tttaaaccac agtcgcttcc 33396 tgaacaggag gaaatatggt gcggcgtaaa accagacgcg ccaccggatc tccggcagag 33456 ccctgataat acagccagct gtggttaaac agcaaaacct ttaattcggc aacggttgag 33516 gtctccacat aatcagcgcc cacaaaaatc ccatctcgaa cttgctcgcg tagggagcta 33576 aaatggccag tatagcccca tggcacccga acgctaatct gcaagtatat gagagccacc 33636 ccattcggcg ggatcacaaa atcagtcgga gaaaacaacg tatacacccc ggactgcaaa 33696 agctgttcag gcaaacgccc ctgcggtccc tctcggtaca ccagcaaagc ctcgggtaaa 33756 gcagccatgc caagcgctta ccgtgccaag agcgactcag acgaaaaagt gtactgaggc 33816 gctcagagca gcggctatat actctacctg tgacgtcaag aaccgaaagt caaaagttca 33876 cccggcgcgc ccgaaaaaac ccgcgaaaat ccacccaaaa agcccgcgaa aaacacttcc 33936 gtataaaatt tccgggttac cggcgcgtca ccgccgcgcg acacgcccgc cccgccccgc 33996 gctcctcccc gaaacccgcc gcgcccactt ccgcgttccc aagacaaagg tcgcgtaact 34056 ccgcccacct catttgcatg ttaactcggt cgccatcttg cggtgttata ttgatgatg 34115
<210> 35
<211> 503
<212> PRT
<213> simian adenovirus SV-39
<400> 35
Met Arg Arg Ala Val Ala Val Pro Ser Ala Ala Met Ala Leu Gly Pro 15 10 15
Pro Pro Ser Tyr Glu Ser Val Met Ala Ala Ala Thr Leu Gin Ala Pro 20 25 30
Leu Glu Asn Pro Tyr Val Pro Pro Arg Tyr Leu Glu Pro Thr Gly Gly 35 40 45
Arg Asn Ser Ile Arg Tyr Ser Glu Leu Thr Pro Leu Tyr Asp Thr Thr 50 55 60
Arg Leu Tyr Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Thr Leu Asn 65 70 75 80
Tyr Gin Asn Asp His Ser Asn Phe Leu Thr Ser Val Val Gin Asn Ser 85 90 95 Asp Tyr Thr Pro Ala Glu Ala Ser Thr Gin Thr Ile Asn Leu Asp Asp 100 105 110
Arg Ser Arg Trp Gly Gly Asp Leu Lys Thr Ile Leu His Thr Asn Met 115 120 125
Pro Asn Val Asn Glu Phe Met Phe Thr Asn Ser Phe Arg Ala Lys Leu 130 135 140
Met Val Ala His Glu Ala Asp Lys Asp Pro Val Tyr Glu Trp Val Gin 145 150 155 160
Leu Thr Leu Pro Glu Gly Asn Phe Ser Glu Ile Met Thr Ile Asp Leu 165 170 175
Met Asn Asn Ala Ile Ile Asp His Tyr Leu Ala Val Ala Arg Gin Gin 180 185 190
Gly Val Lys Glu Ser Glu Ile Gly Val Lys Phe Asp Thr Arg Asn Phe 195 200 205
Arg Leu Gly Trp Asp Pro Glu Thr Gly Leu Val Met Pro Gly Val Tyr 210 215 220
Thr Asn Glu Ala Phe His Pro Asp Val Val Leu Leu Pro Gly Cys Gly 225 230 235 240
Val Asp Phe Thr Tyr Ser Arg Leu Asn Asn Leu Leu Gly Ile Arg Lys 245 250 255
Arg Met Pro Phe Gin Glu Gly Phe Gin Ile Leu Tyr Glu Asp Leu Glu 260 265 270
Gly Gly Asn Ile Pro Ala Leu Leu Asp Val Pro Ala Tyr Glu Glu Ser 275 280 285
Ile Ala Asn Ala Arg Glu Ala Ala Ile Arg Gly Asp Asn Phe Ala Ala 290 295 300
Gin Pro Gin Ala Ala Pro Thr Ile Lys Pro Val Leu Glu Asp Ser Lys 305 310 315 320
Gly Arg Ser Tyr Asn Val Ile Ala Asn Thr Asn Asn Thr Ala Tyr Arg 325 330 335
Ser Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg 340 345 350 Ala Trp Thr Leu Leu Thr Thr Pro Asp Val Thr Cys Gly Ser Glu Gin 355 360 365
Val Tyr Trp Ser Leu Pro Asp Met Tyr Val Asp Pro Val Thr Phe Arg 370 375 380
Ser Thr Gin Gin Val Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Met 385 390 395 400
Pro Ile His Ser Lys Ser Phe Tyr Asn Glu Gin Ala Val Tyr Ser Gin 405 410 415
Leu Ile Arg Gin Thr Thr Ala Leu Thr His Val Phe Asn Arg Phe Pro 420 425 430
Glu Asn Gin Ile Leu Val Arg Pro Pro Ala Pro Thr Ile Thr Thr Val 435 440 445
Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Gin 450 455 460
Asn Ser Ile Arg Gly Val Gin Arg Val Thr Ile Thr Asp Ala Arg Arg 465 470 475 480
Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly Ile Val Ala Pro Arg 485 490 495
Val Leu Ser Ser Arg Thr Phe
500
<210> 36
<211> 917
<212> PRT
<213> simian adenovirus SV-39
<400> 36
Met Ala Thr Pro Ser Met Met Pro Gin Trp Ser Tyr Met His Ile Ala 15 10 15
Gly Gin Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gin Phe Ala 20 25 30
Arg Ala Thr Glu Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro 35 40 45 Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gin Arg Leu 50 55 60
Thr Ile Arg Phe Val Pro Val Asp Lys Glu Asp Thr Ala Tyr Ser Tyr 65 70 75 80
Lys Thr Arg Phe Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met 85 90 95
Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Ile Asp Arg Gly Pro Ser 100 105 110
Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly 115 120 125
Ala Pro Asn Asn Ser Gin Trp Asn Ala Thr Asp Asn Gly Asn Lys Pro 130 135 140
Val Cys Phe Ala Gin Ala Ala Phe Ile Gly Gin Ser Ile Thr Lys Asp 145 150 155 160
Gly Val Gin Ile Gin Asn Ser Glu Asn Gin Gin Ala Ala Ala Asp Lys 165 170 175
Thr Tyr Gin Pro Glu Pro Gin Ile Gly Val Ser Thr Trp Asp Thr Asn 180 185 190
Val Thr Ser Asn Ala Ala Gly Arg Val Leu Lys Ala Thr Thr Pro Met 195 200 205
Leu Pro Cys Tyr Gly Ser Tyr Ala Asn Pro Thr Asn Pro Asn Gly Gly 210 215 220
Gin Ala Lys Thr Glu Gly Asp Ile Ser Leu Asn Phe Phe Thr Thr Thr 225 230 235 240
Ala Ala Ala Asp Asn Asn Pro Lys Val Val Leu Tyr Ser Glu Asp Val 245 250 255
Asn Leu Gin Ala Pro Asp Thr His Leu Val Tyr Lys Pro Thr Val Gly 260 265 270
Glu Asn Val Ile Ala Ala Glu Ala Leu Leu Thr Gin Gin Ala Cys Pro 275 280 285
Asn Arg Ala Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met 290 295 300 Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gin Ala Ser 305 310 315 320
Gin Leu Asn Ala Val Val Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser 325 330 335
Tyr Gin Leu Met Leu Asp Ala Leu Gly Asp Arg Thr Arg Tyr Phe Ser 340 345 350
Met Trp Asn Gin Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile 355 360 365
Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu 370 375 380
Pro Gly Met Gly Ile Phe Asn Ser Tyr Lys Gly Val Lys Pro Gin Asn 385 390 395 400
Gly Gly Asn Gly Asn Trp Glu Ala Asn Gly Asp Leu Ser Asn Ala Asn 405 410 415
Glu Ile Ala Leu Gly Asn Ile Phe Ala Met Glu Ile Asn Leu His Ala 420 425 430
Asn Leu Trp Arg Ser Phe Leu Tyr Ser Asn Val Ala Leu Tyr Leu Pro 435 440 445
Asp Ser Tyr Lys Phe Thr Pro Ala Asn Ile Thr Leu Pro Ala Asn Gin 450 455 460
Asn Thr Tyr Glu Tyr Ile Asn Gly Arg Val Thr Ser Pro Thr Leu Val 465 470 475 480
Asp Thr Phe Val Asn Ile Gly Ala Arg Trp Ser Pro Asp Pro Met Asp 485 490 495
Asn Val Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg 500 505 510
Ser Met Leu Leu Gly Asn Gly Arg Val Val Pro Phe His Ile Gin Val 515 520 525
Pro Gin Lys Phe Phe Ala Ile Lys Asn Leu Leu Leu Leu Pro Gly Ser 530 535 540
Tyr Thr Tyr Glu Trp Ser Phe Arg Lys Asp Val Asn Met Ile Leu Gin 545 550 555 560 Ser Thr Leu Gly Asn Asp Leu Arg Val Asp Gly Ala Ser Val Arg Ile 565 570 575
Asp Ser Val Asn Leu Tyr Ala Asn Phe Phe Pro Met Ala His Asn Thr 580 585 590
Ala Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp Gin Ser 595 600 605
Phe Asn Asp Tyr Leu Ser Ala Ala Asn Met Leu Tyr Pro Ile Pro Ala 610 615 620
Asn Ala Thr Asn Val Pro Ile Ser Ile Pro Ser Arg Asn Trp Ala Ala 625 630 635 640
Phe Arg Gly Trp Ser Phe Thr Arg Leu Lys Ala Lys Glu Thr Pro Ser 645 650 655
Leu Gly Ser Gly Phe Asp Pro Tyr Phe Val Tyr Ser Gly Thr Ile Pro 660 665 670
Tyr Leu Asp Gly Ser Phe Tyr Leu Asn His Thr Phe Lys Arg Leu Ser 675 680 685
Ile Met Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu 690 695 700
Thr Pro Asn Glu Phe Glu Ile Lys Arg Ile Val Asp Gly Glu Gly Tyr 705 710 715 720
Asn Val Ala Gin Ser Asn Met Thr Lys Asp Trp Phe Leu Ile Gin Met 725 730 735
Leu Ser His Tyr Asn Ile Gly Tyr Gin Gly Phe Tyr Val Pro Glu Gly 740 745 750
Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gin Pro Met Ser 755 760 765
Arg Gin Val Pro Asp Pro Thr Ala Ala Gly Tyr Gin Ala Val Pro Leu 770 775 780
Pro Arg Gin His Asn Asn Ser Gly Phe Val Gly Tyr Met Gly Pro Thr 785 790 795 800
Met Arg Glu Gly Gin Pro Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu Ile 805 810 815 Gly Ala Thr Ala Val Pro Ala Ile Thr Gin Lys Lys Phe Leu Cys Asp 820 825 830
Arg Val Met Trp Arg Ile Pro Phe Ser Ser Asn Phe Met Ser Met Gly 835 840 845
Ala Leu Thr Asp Leu Gly Gin Asn Met Leu Tyr Ala Asn Ser Ala His 850 855 860
Ala Leu Asp Met Thr Phe Glu Val Asp Pro Met Asn Glu Pro Thr Leu 865 870 875 880
Leu Tyr Met Leu Phe Glu Val Phe Asp Val Val Arg Val His Gin Pro 885 890 895
His Arg Gly Ile Ile Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala 900 905 910
Gly Asn Ala Thr Thr
915
<210> 37
<211> 533
<212> PRT
<213> simian adenovirus SV-39
<400> 37
Met Lys Arg Ala Lys Val Glu Glu Gly Phe Asn Pro Val Tyr Pro Tyr 15 10 15
Gly Tyr Ser Thr Pro Thr Asp Val Ala Pro Pro Phe Val Ala Ser Asp 20 25 30
Gly Leu Gin Glu Asn Pro Pro Gly Val Leu Ser Leu Lys Ile Ser Lys 35 40 45
Pro Leu Thr Phe Asn Ala Ser Lys Ala Leu Ser Leu Ala Ile Gly Pro 50 55 60
Gly Leu Lys Ile Gin Asp Gly Lys Leu Val Gly Glu Gly Gin Ala Ile 65 70 75 80
Leu Ala Asn Leu Pro Leu Gin Ile Thr Asn Asn Thr Ile Ser Leu Arg 85 90 95 Phe Gly Asn Thr Leu Ala Leu Asn Asp Asn Asn Glu Leu Gin Thr Thr 100 105 110
Leu Lys Ser Ser Ser Pro Leu Lys Ile Thr Asp Gin Thr Leu Ser Leu 115 120 125
Asn Ile Gly Asp Ser Leu Ala Ile Lys Asp Asp Lys Leu Glu Ser Ala 130 135 140
Leu Gin Ala Thr Leu Pro Leu Ser Ile Ser Asn Asn Thr Ile Ser Leu 145 150 155 160
Asn Val Gly Thr Gly Leu Thr Ile Asn Gly Asn Val Leu Gin Ala Val 165 170 175
Pro Leu Asn Ala Leu Ser Pro Leu Thr Ile Ser Asn Asn Asn Ile Ser 180 185 190
Leu Arg Tyr Gly Ser Ser Leu Thr Val Leu Asn Asn Glu Leu Gin Ser 195 200 205
Asn Leu Thr Val His Ser Pro Leu Lys Leu Asn Ser Asn Asn Ser Ile 210 215 220
Ser Leu Asn Thr Leu Ser Pro Phe Arg Ile Glu Asn Gly Phe Leu Thr 225 230 235 240
Leu Tyr Leu Gly Thr Lys Ser Gly Leu Leu Val Gin Asn Ser Gly Leu 245 250 255
Lys Val Gin Ala Gly Tyr Gly Leu Gin Val Thr Asp Thr Asn Ala Leu 260 265 270
Thr Leu Arg Tyr Leu Ala Pro Leu Thr Ile Pro Asp Ser Gly Ser Glu 275 280 285
Gin Gly Ile Leu Lys Val Asn Thr Gly Gin Gly Leu Ser Val Asn Gin 290 295 300
Ala Gly Ala Leu Glu Thr Ser Leu Gly Gly Gly Leu Lys Tyr Ala Asp 305 310 315 320
Asn Lys Ile Thr Phe Asp Thr Gly Asn Gly Leu Thr Leu Ser Glu Asn 325 330 335
Lys Leu Ala Val Ala Ala Gly Ser Gly Leu Thr Phe Arg Asp Gly Ala 340 345 350 Leu Val Ala Thr Gly Thr Ala Phe Thr Gin Thr Leu Trp Thr Thr Ala 355 360 365
Asp Pro Ser Pro Asn Cys Thr Ile Ile Gin Asp Arg Asp Thr Lys Phe 370 375 380
Thr Leu Ala Leu Thr Ile Ser Gly Ser Gin Val Leu Gly Thr Val Ser 385 390 395 400
Ile Ile Gly Val Lys Gly Pro Leu Ser Ser Ser Ile Pro Ser Ala Thr 405 410 415
Val Thr Val Gin Leu Asn Phe Asp Ser Asn Gly Ala Leu Leu Ser Ser 420 425 430
Ser Ser Leu Lys Gly Tyr Trp Gly Tyr Arg Gin Gly Pro Ser Ile Asp 435 440 445
Pro Tyr Pro Ile Ile Asn Ala Leu Asn Phe Met Pro Asn Ser Leu Ala 450 455 460
Tyr Pro Pro Gly Gin Glu Ile Gin Ala Lys Cys Asn Met Tyr Val Ser 465 470 475 480
Thr Phe Leu Arg Gly Asn Pro Gin Arg Pro Ile Val Leu Asn Ile Thr 485 490 495
Phe Asn Asn Gin Thr Ser Gly Phe Ser Ile Arg Phe Thr Trp Thr Asn 500 505 510
Leu Thr Thr Gly Glu Ala Phe Ala Met Pro Pro Cys Thr Phe Ser Tyr 515 520 525
Ile Ala Glu Gin Gin
530
<210> 38
<211> 50
<212> DNA
<213> Artificial sequence
<220>
<223> oligomer SV25T
<400> 38
aatttaaata cgtagcgcac tagtcgcgct aagcgcggat atcatttaaa 50 <210> 39
<211> 49
<212> DNA
<213> Artificial sequence
<220>
<223> oligomer SV25B
<400> 39
tatttaaatg atatccgcgc ttaagcgcga ctagtgcgct acgtattta 49
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US33195101P | 2001-11-21 | 2001-11-21 | |
US36679802P | 2002-03-22 | 2002-03-22 | |
PCT/US2002/033645 WO2003046124A2 (en) | 2001-11-21 | 2002-11-20 | Simian adenovirus nucleic acid and amino acid sequences, vectors containing same, and methods of use |
Publications (2)
Publication Number | Publication Date |
---|---|
NO20130590L true NO20130590L (no) | 2004-05-26 |
NO335438B1 NO335438B1 (no) | 2014-12-15 |
Family
ID=26987985
Family Applications (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
NO20042191A NO332692B1 (no) | 2001-11-21 | 2004-05-26 | Rekombinant adenovirus, sammensetning inneholdende denne, isolert vertscelle samt anvendelse for fremstilling av et medikament |
NO20120337A NO334512B1 (no) | 2001-11-21 | 2012-03-21 | Rekombinant adenovirus, isolert vertscelle inneholdende denne, sammensetning som omfatter det rekombinante adenoviruset og en anvendelse |
NO20130590A NO335438B1 (no) | 2001-11-21 | 2013-04-30 | Rekombinant adenovirus, isolert vertscelle inneholdende denne, sammensetning som omfatter det rekombinante adenoviruset og avendelse. |
Family Applications Before (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
NO20042191A NO332692B1 (no) | 2001-11-21 | 2004-05-26 | Rekombinant adenovirus, sammensetning inneholdende denne, isolert vertscelle samt anvendelse for fremstilling av et medikament |
NO20120337A NO334512B1 (no) | 2001-11-21 | 2012-03-21 | Rekombinant adenovirus, isolert vertscelle inneholdende denne, sammensetning som omfatter det rekombinante adenoviruset og en anvendelse |
Country Status (19)
Country | Link |
---|---|
US (5) | US7247472B2 (no) |
EP (4) | EP3108899A1 (no) |
JP (9) | JP2005511035A (no) |
KR (1) | KR100987360B1 (no) |
CN (1) | CN1578678B (no) |
AU (1) | AU2002365366B2 (no) |
BR (1) | BR0214350A (no) |
CA (3) | CA2466431C (no) |
CO (1) | CO5590973A2 (no) |
HU (3) | HU230364B1 (no) |
IL (5) | IL161584A0 (no) |
MX (2) | MXPA04004876A (no) |
NO (3) | NO332692B1 (no) |
NZ (3) | NZ564586A (no) |
PH (1) | PH12016500338A1 (no) |
PL (1) | PL209133B1 (no) |
SG (2) | SG2013034475A (no) |
WO (1) | WO2003046124A2 (no) |
ZA (1) | ZA200403117B (no) |
Families Citing this family (113)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040136963A1 (en) * | 2001-06-22 | 2004-07-15 | The Trustees Of The University Of Pennsylvania | Simian adenovirus vectors and methods of use |
EP1409748B1 (en) | 2001-06-22 | 2011-10-26 | The Trustees of The University of Pennsylvania | Recombinant Adenoviruses comprising simian adenovirus proteins and uses thereof. |
MXPA04004876A (es) * | 2001-11-21 | 2004-07-30 | Univ Pennsylvania | Secuencias de acido nucleico y de aminoacido de adenovirus de simio, vectores que contienen los mismos y metodos de uso. |
JP5404990B2 (ja) | 2002-10-23 | 2014-02-05 | グラクソスミスクライン・バイオロジカルス・ソシエテ・アノニム | マラリアに対するワクチン接種の方法 |
US7491508B2 (en) * | 2003-06-20 | 2009-02-17 | The Trustees Of The University Of Pennsylvania | Methods of generating chimeric adenoviruses and uses for such chimeric adenoviruses |
US7291498B2 (en) | 2003-06-20 | 2007-11-06 | The Trustees Of The University Of Pennsylvania | Methods of generating chimeric adenoviruses and uses for such chimeric adenoviruses |
ES2337374T3 (es) * | 2004-01-23 | 2010-04-23 | Istituto Di Richerche Di Biologia Molecolare P. Angeletti S.P.A. | Portadores de vacunas de adenovirus de chimpance. |
US9592284B2 (en) | 2004-04-28 | 2017-03-14 | The Trustees Of The University Of Pennsylvania | Immunization regimen with E4-deleted adenovirus prime and E1-deleted adenovirus boost |
DE602005026269D1 (de) * | 2004-04-28 | 2011-03-24 | Univ Pennsylvania | Sequenzielle freisetzung immunogener moleküle über adenoviren und adeno-assoziierte viren vermittelte abgaben |
US7964196B2 (en) * | 2004-05-25 | 2011-06-21 | Chimeros, Inc. | Self-assembling nanoparticle drug delivery system |
GB0417494D0 (en) | 2004-08-05 | 2004-09-08 | Glaxosmithkline Biolog Sa | Vaccine |
WO2006053871A2 (en) | 2004-11-16 | 2006-05-26 | Crucell Holland B.V. | Multivalent vaccines comprising recombinant viral vectors |
CA2599754C (en) * | 2005-03-08 | 2012-06-19 | Genesense Technologies Inc. | Use of interleukin 17e for the treatment of cancer |
GB0513421D0 (en) | 2005-06-30 | 2005-08-03 | Glaxosmithkline Biolog Sa | Vaccines |
DE602007005366D1 (de) * | 2006-04-07 | 2010-04-29 | Chimeros Inc | Zusammensetzungen und verfahren zur behandlung von b-zellen-malignomen |
CN101437538A (zh) * | 2006-04-28 | 2009-05-20 | 宾夕法尼亚州立大学托管会 | 修饰的腺病毒六邻体蛋白和其应用 |
US20090246220A1 (en) | 2006-08-28 | 2009-10-01 | Ertl Hildegund C J | Constructs for enhancing immune responses |
CN101675068A (zh) | 2007-03-02 | 2010-03-17 | 葛兰素史密丝克莱恩生物有限公司 | 新的方法和组合物 |
US20090226525A1 (en) * | 2007-04-09 | 2009-09-10 | Chimeros Inc. | Self-assembling nanoparticle drug delivery system |
LT2220242T (lt) | 2007-11-28 | 2017-04-10 | The Trustees Of The University Of Pennsylvania | Simian pošeimio b adenovirusai sadv-28,27,-29,-32,-33 ir -35 ir jų panaudojimas |
MX2010005858A (es) | 2007-11-28 | 2010-06-11 | Univ Pennsylvania | Adenovirus e simianos sadv-39, sadv-25.2, sadv-26, sadv-30, sadv-37 y sadv-38. |
US8231880B2 (en) * | 2007-11-28 | 2012-07-31 | The Trustess Of The University Of Pennsylvania | Simian subfamily C adenoviruses SAdV-40, -31, and -34 and uses thereof |
AU2014203073B2 (en) * | 2007-11-28 | 2016-07-07 | The Trustees Of The University Of Pennsylvania | Simian E adenovirus SAdV-30 |
US8470310B2 (en) * | 2008-03-04 | 2013-06-25 | The Trustees Of The University Of Pennsylvania | Simian adenoviruses SAdV-36, -42.1, -42.2, and -44 and uses thereof |
US9217155B2 (en) | 2008-05-28 | 2015-12-22 | University Of Massachusetts | Isolation of novel AAV'S and uses thereof |
JP5809978B2 (ja) * | 2008-10-31 | 2015-11-11 | ザ・トラステイーズ・オブ・ザ・ユニバーシテイ・オブ・ペンシルベニア | サルアデノウイルスSAdV−43、−45、−46、−47、−48、−49および−50ならびにそれらの用途 |
CN102300872A (zh) * | 2009-02-02 | 2011-12-28 | 奥凯罗斯股份公司 | 猿腺病毒核酸和氨基酸序列,包含其的载体及其用途 |
ES2687294T3 (es) * | 2009-02-02 | 2018-10-24 | Glaxosmithkline Biologicals Sa | Secuencias de aminoácidos y de ácidos nucleicos de adenovirus de simio, vectores que las contienen, y sus usos |
WO2010085984A1 (en) * | 2009-02-02 | 2010-08-05 | Okairos Ag | Simian adenovirus nucleic acid- and amino acid-sequences, vectors containing same, and uses thereof |
BRPI1009873A2 (pt) | 2009-03-17 | 2016-03-08 | Glaxosmithkline Biolog Sa | detecção aperfeiçoada de expressão gênica |
WO2010120874A2 (en) | 2009-04-14 | 2010-10-21 | Chimeros, Inc. | Chimeric therapeutics, compositions, and methods for using same |
US8734809B2 (en) | 2009-05-28 | 2014-05-27 | University Of Massachusetts | AAV's and uses thereof |
US8846031B2 (en) | 2009-05-29 | 2014-09-30 | The Trustees Of The University Of Pennsylvania | Simian adenovirus 41 and uses thereof |
WO2011057254A2 (en) * | 2009-11-09 | 2011-05-12 | The United States Of America, As Represented By The Secretary, Department Of Health And Human Services | Simian adenoviral vector-based vaccines |
CA2779632C (en) * | 2009-11-09 | 2019-08-20 | Jason Gall | Simian adenovirus and methods of use |
KR101458367B1 (ko) * | 2010-04-14 | 2014-11-05 | 재단법인 목암생명공학연구소 | 원숭이 아데노바이러스 혈청형 19로부터 분리된 헥손, 그의 초가변 영역 및 그를 이용한 키메라 아데노바이러스 |
US9546369B2 (en) | 2010-04-23 | 2017-01-17 | University Of Massachusetts | Multicistronic expression constructs |
WO2011133890A1 (en) | 2010-04-23 | 2011-10-27 | University Of Massachusetts | Cns targeting aav vectors and methods of use thereof |
CA2833912C (en) | 2010-04-23 | 2021-09-21 | University Of Massachusetts | Aav-based treatment of cholesterol-related disorders |
ES2660615T3 (es) * | 2010-05-14 | 2018-03-23 | Oregon Health & Science University | Vectores de CMVH y CMVRh recombinantes y usos de los mismos |
WO2012021730A2 (en) | 2010-08-11 | 2012-02-16 | Genvec, Inc. | Respiratory syncytial virus (rsv) vaccine |
CN103118702A (zh) | 2010-09-20 | 2013-05-22 | 克鲁塞尔荷兰公司 | 活动性结核病的治疗性接种 |
AU2011332025B2 (en) | 2010-11-23 | 2015-06-25 | The Trustees Of The University Of Pennsylvania | Subfamily E simian adenoviruses A1321, A1325, A1295, A1309 and A1322 and uses thereof |
AU2011350066A1 (en) | 2010-12-27 | 2013-07-11 | Alexion Pharma International Sarl | Compositions comprising natriuretic peptides and methods of use thereof |
WO2012089231A1 (en) * | 2010-12-30 | 2012-07-05 | Okairòs Ag | Paramyxovirus vaccines |
US9267112B2 (en) | 2011-05-10 | 2016-02-23 | The Regents Of The University Of California | Adenovirus isolated from Titi Monkeys |
US10221218B2 (en) | 2011-05-10 | 2019-03-05 | The Regents Of The University Of California | Adenovirus isolated from titi monkeys |
GB201108879D0 (en) * | 2011-05-25 | 2011-07-06 | Isis Innovation | Vector |
TWI623618B (zh) | 2011-07-12 | 2018-05-11 | 傳斯堅公司 | Hbv聚合酶突變體 |
US20140348791A1 (en) * | 2011-09-09 | 2014-11-27 | Beth Israel Deaconess Medical Center, Inc. | Modified adenoviral vectors and methods of treatment using same |
EP2747774A4 (en) | 2011-09-09 | 2015-02-11 | Biomed Realty L P | METHOD AND COMPOSITIONS FOR CONTROLLING VIRUS PROTECTION |
WO2013045668A2 (en) | 2011-09-29 | 2013-04-04 | Transgene Sa | Immunotherapy composition and regimen for treating hepatitis c virus infection |
WO2013045658A1 (en) | 2011-09-29 | 2013-04-04 | Transgene Sa | Immunotherapy composition and regimen for treating hepatitis c virus infection |
CA2850627C (en) * | 2011-10-05 | 2024-05-21 | Genvec, Inc. | Affenadenovirus (gorilla) or adenoviral vectors and methods of use |
WO2013058833A1 (en) | 2011-10-19 | 2013-04-25 | Enobia Canada Limited Partnership | Compositions comprising alkaline phosphatase and/or natriuretic peptide and methods of use thereof |
WO2013063019A1 (en) | 2011-10-28 | 2013-05-02 | The Wistar Institute Of Anatomy And Biology | Methods and compositions for enhancing the therapeutic effect of anti-tumor t cells |
US10238755B2 (en) | 2011-11-30 | 2019-03-26 | The Wistar Institute Of Anatomy And Biology | Methods and compositions for regulation of cell aging, carcinogenesis and reprogramming |
US9217159B2 (en) | 2012-05-18 | 2015-12-22 | The Trustees Of The University Of Pennsylvania | Subfamily E simian adenoviruses A1302, A1320, A1331 and A1337 and uses thereof |
EP2892554B1 (en) | 2012-09-07 | 2020-08-26 | Emory University | Hiv immune stimulating compositions comprising recombinantly expressed pili on bacteria and methods related thereto |
US9683268B2 (en) * | 2012-09-19 | 2017-06-20 | Beth Israel Deaconess | Viruses associated with immunodeficiency and enteropathy and methods using same |
DK2920313T3 (da) * | 2012-11-16 | 2019-09-02 | Beth Israel Deaconess Medical Ct Inc | Rekombinante adenovira og anvendelse deraf |
US9222142B2 (en) * | 2013-01-15 | 2015-12-29 | The Regents Of The University Of California | Adenoviruses and their use |
US9624510B2 (en) | 2013-03-01 | 2017-04-18 | The Wistar Institute | Adenoviral vectors comprising partial deletions of E3 |
CA3115891A1 (en) | 2013-03-14 | 2014-09-25 | Salk Institute For Biological Studies | Oncolytic adenovirus compositions |
US9402888B2 (en) | 2013-03-14 | 2016-08-02 | The Wistar Institute Of Anatomy And Biology | Methods and compositions for treating cancer |
KR102006527B1 (ko) * | 2013-11-01 | 2019-08-02 | 화이자 인코포레이티드 | 전립선-연관 항원의 발현을 위한 벡터 |
WO2015191508A1 (en) | 2014-06-09 | 2015-12-17 | Voyager Therapeutics, Inc. | Chimeric capsids |
PL3200815T3 (pl) | 2014-10-02 | 2021-10-18 | The Wistar Institute Of Anatomy And Biology | Sposoby i kompozycje do leczenia nowotworu |
CA2963293A1 (en) | 2014-10-06 | 2016-04-14 | The Trustees Of The University Of Pennsylvania | Compositions and methods for isolation of circulating tumor cells (ctc) |
US10480011B2 (en) | 2014-10-21 | 2019-11-19 | University Of Massachusetts | Recombinant AAV variants and uses thereof |
CA2966620A1 (en) | 2014-11-05 | 2016-05-12 | Voyager Therapeutics, Inc. | Aadc polynucleotides for the treatment of parkinson's disease |
CN107109407A (zh) | 2014-11-14 | 2017-08-29 | 沃雅戈治疗公司 | 治疗肌萎缩性侧索硬化(als)的组合物和方法 |
IL284949B2 (en) | 2014-11-14 | 2024-06-01 | Voyager Therapeutics Inc | modulatory polynucleotides |
WO2016094783A1 (en) | 2014-12-12 | 2016-06-16 | Voyager Therapeutics, Inc. | Compositions and methods for the production of scaav |
WO2016131945A1 (en) | 2015-02-20 | 2016-08-25 | Transgene Sa | Combination product with autophagy modulator |
JP6983665B2 (ja) * | 2015-06-12 | 2021-12-17 | グラクソスミスクライン バイオロジカルズ ソシエテ アノニム | アデノウイルスポリヌクレオチド及びポリペプチド |
WO2017096162A1 (en) | 2015-12-02 | 2017-06-08 | Voyager Therapeutics, Inc. | Assays for the detection of aav neutralizing antibodies |
US11208468B2 (en) | 2016-02-18 | 2021-12-28 | The Wistar Institute Of Anatomy And Biology | Methods and compositions for treating melanoma |
KR20180112024A (ko) | 2016-02-23 | 2018-10-11 | 더 솔크 인스티튜트 포 바이올로지칼 스터디즈 | 바이러스 동역학에 미치는 영향 최소화를 위한 치료용 아데노바이러스의 외인성 유전자 발현 |
WO2017147265A1 (en) | 2016-02-23 | 2017-08-31 | Salk Institute For Biological Studies | High throughput assay for measuring adenovirus replication kinetics |
US11299751B2 (en) | 2016-04-29 | 2022-04-12 | Voyager Therapeutics, Inc. | Compositions for the treatment of disease |
US11326182B2 (en) | 2016-04-29 | 2022-05-10 | Voyager Therapeutics, Inc. | Compositions for the treatment of disease |
US20190134190A1 (en) | 2016-05-04 | 2019-05-09 | Transgene Sa | Combination therapy with cpg tlr9 ligand |
JP7220080B2 (ja) | 2016-05-18 | 2023-02-09 | ボイジャー セラピューティクス インコーポレイテッド | ハンチントン病治療組成物及び方法 |
AU2017267665C1 (en) | 2016-05-18 | 2023-10-05 | Voyager Therapeutics, Inc. | Modulatory polynucleotides |
SG11201900808SA (en) * | 2016-08-01 | 2019-02-27 | Wistar Inst | Compositions and methods of replication deficient adenoviral vectors for vaccine applications |
JP2019524162A (ja) | 2016-08-18 | 2019-09-05 | ザ リージェンツ オブ ザ ユニバーシティ オブ カリフォルニア | モジュラーAAV送達システムによるCRISPR−Casゲノム編集 |
AU2017321488B2 (en) | 2016-08-30 | 2022-10-20 | The Regents Of The University Of California | Methods for biomedical targeting and delivery and devices and systems for practicing the same |
WO2018069316A2 (en) | 2016-10-10 | 2018-04-19 | Transgene Sa | Immunotherapeutic product and mdsc modulator combination therapy |
WO2018071831A1 (en) | 2016-10-13 | 2018-04-19 | University Of Massachusetts | Aav capsid designs |
WO2018111767A1 (en) | 2016-12-12 | 2018-06-21 | Salk Institute For Biological Studies | Tumor-targeting synthetic adenoviruses and uses thereof |
SG11201909870SA (en) | 2017-05-05 | 2019-11-28 | Voyager Therapeutics Inc | Compositions and methods of treating amyotrophic lateral sclerosis (als) |
AU2018261003A1 (en) | 2017-05-05 | 2019-11-14 | Voyager Therapeutics, Inc. | Compositions and methods of treating Huntington's Disease |
JOP20190269A1 (ar) | 2017-06-15 | 2019-11-20 | Voyager Therapeutics Inc | بولي نوكليوتيدات aadc لعلاج مرض باركنسون |
CN111132626B (zh) | 2017-07-17 | 2024-01-30 | 沃雅戈治疗公司 | 轨迹阵列引导系统 |
WO2019028306A2 (en) | 2017-08-03 | 2019-02-07 | Voyager Therapeutics, Inc. | COMPOSITIONS AND METHODS FOR ADMINISTRATION OF ADENO-ASSOCIATED VIRUSES |
AU2018352236A1 (en) | 2017-10-16 | 2020-04-23 | The Curators Of The University Of Missouri | Treatment of amyotrophic lateral sclerosis (ALS) |
WO2019079242A1 (en) | 2017-10-16 | 2019-04-25 | Voyager Therapeutics, Inc. | TREATMENT OF AMYOTROPHIC LATERAL SCLEROSIS (ALS) |
US11773142B2 (en) | 2017-12-11 | 2023-10-03 | Beth Israel Deaconess Medical Center, Inc. | Recombinant adenoviruses and uses thereof |
EP3807404A1 (en) | 2018-06-13 | 2021-04-21 | Voyager Therapeutics, Inc. | Engineered 5' untranslated regions (5' utr) for aav production |
WO2020016655A2 (en) | 2018-07-17 | 2020-01-23 | Helixmith Co., Ltd. | Treatment of neuropathy with dna constructs expressing igf-1 isoforms |
AU2019310459A1 (en) | 2018-07-24 | 2021-02-18 | Voyager Therapeutics, Inc. | Systems and methods for producing gene therapy formulations |
TW202035689A (zh) | 2018-10-04 | 2020-10-01 | 美商航海家醫療公司 | 測量病毒載體粒子的效價及強度之方法 |
EP3861107A1 (en) | 2018-10-05 | 2021-08-11 | Voyager Therapeutics, Inc. | Engineered nucleic acid constructs encoding aav production proteins |
EP3867389A1 (en) | 2018-10-15 | 2021-08-25 | Voyager Therapeutics, Inc. | Expression vectors for large-scale production of raav in the baculovirus/sf9 system |
CN113924115A (zh) | 2019-01-31 | 2022-01-11 | 俄勒冈健康与科学大学 | 用于aav衣壳的使用转录依赖性定向进化的方法 |
US20220211835A1 (en) * | 2019-04-17 | 2022-07-07 | The Wistar Institute | Replication Deficient Adenoviral Vectors for HIV Vaccine Applications |
EP4178605A1 (en) | 2020-07-13 | 2023-05-17 | Transgene | Treatment of immune depression |
US20240091380A1 (en) | 2021-02-01 | 2024-03-21 | Regenxbio Inc. | Gene therapy for neuronal ceroid lipofuscinoses |
CN112831524B (zh) * | 2021-02-20 | 2023-06-13 | 苏州相奕生物技术有限公司 | 人工改造的重组腺病毒载体、由其包装的病毒及其应用 |
AU2022250521A1 (en) | 2021-03-29 | 2023-09-28 | Genematrix Inc. | Recombinant chimeric adenoviral vector substituted by knob gene of chimpanzee adenovirus serotype 6, and application thereof |
WO2022218997A1 (en) | 2021-04-12 | 2022-10-20 | Centre National De La Recherche Scientifique (Cnrs) | Novel universal vaccine presenting system |
WO2023213764A1 (en) | 2022-05-02 | 2023-11-09 | Transgene | Fusion polypeptide comprising an anti-pd-l1 sdab and a member of the tnfsf |
Family Cites Families (40)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB8607679D0 (en) | 1986-03-27 | 1986-04-30 | Winter G P | Recombinant dna product |
IL162181A (en) | 1988-12-28 | 2006-04-10 | Pdl Biopharma Inc | A method of producing humanized immunoglubulin, and polynucleotides encoding the same |
US5240846A (en) | 1989-08-22 | 1993-08-31 | The Regents Of The University Of Michigan | Gene therapy vector for cystic fibrosis |
US6174666B1 (en) | 1992-03-27 | 2001-01-16 | The United States Of America As Represented By The Department Of Health And Human Services | Method of eliminating inhibitory/instability regions from mRNA |
EP0804076A4 (en) | 1994-10-19 | 1998-10-21 | Genetic Therapy Inc | GENTHERAPY THROUGH SIMULTANEOUS AND REPEATED ADMINISTRATION OF ADENOVIRUS AND IMMUNE SUPPRESSIVES |
DE69534166T2 (de) * | 1994-10-28 | 2006-03-09 | Trustees Of The University Of Pennsylvania | Rekombinanter adenovirus und methoden zu dessen verwendung |
US5856152A (en) | 1994-10-28 | 1999-01-05 | The Trustees Of The University Of Pennsylvania | Hybrid adenovirus-AAV vector and methods of use therefor |
US5770442A (en) * | 1995-02-21 | 1998-06-23 | Cornell Research Foundation, Inc. | Chimeric adenoviral fiber protein and methods of using same |
US6127525A (en) * | 1995-02-21 | 2000-10-03 | Cornell Research Foundation, Inc. | Chimeric adenoviral coat protein and methods of using same |
US5698202A (en) * | 1995-06-05 | 1997-12-16 | The Wistar Institute Of Anatomy & Biology | Replication-defective adenovirus human type 5 recombinant as a rabies vaccine carrier |
AU6261696A (en) * | 1995-06-05 | 1996-12-24 | Trustees Of The University Of Pennsylvania, The | A replication-defective adenovirus human type 5 recombinant as a vaccine carrier |
WO1998010088A1 (en) | 1996-09-06 | 1998-03-12 | Trustees Of The University Of Pennsylvania | An inducible method for production of recombinant adeno-associated viruses utilizing t7 polymerase |
AU4255397A (en) * | 1996-09-06 | 1998-03-26 | Trustees Of The University Of Pennsylvania, The | Chimpanzee adenovirus vectors |
US5922315A (en) * | 1997-01-24 | 1999-07-13 | Genetic Therapy, Inc. | Adenoviruses having altered hexon proteins |
US5891994A (en) | 1997-07-11 | 1999-04-06 | Thymon L.L.C. | Methods and compositions for impairing multiplication of HIV-1 |
AU9319198A (en) | 1997-09-19 | 1999-04-05 | Trustees Of The University Of Pennsylvania, The | Methods and vector constructs useful for production of recombinant aav |
WO1999015685A1 (en) | 1997-09-19 | 1999-04-01 | The Trustees Of The University Of Pennsylvania | Methods and cell line useful for production of recombinant adeno-associated viruses |
GB9720585D0 (en) | 1997-09-26 | 1997-11-26 | Smithkline Beecham Biolog | Vaccine |
JP2002506652A (ja) | 1998-03-20 | 2002-03-05 | トラステイーズ・オブ・ザ・ユニバーシテイ・オブ・ペンシルベニア | 組換えアデノ随伴ウイルスのヘルパーを含まない製造のための組成物及び方法 |
US20030017138A1 (en) * | 1998-07-08 | 2003-01-23 | Menzo Havenga | Chimeric adenoviruses |
AU5677399A (en) * | 1998-08-20 | 2000-03-14 | Wistar Institute Of Anatomy And Biology, The | Methods of augmenting mucosal immunity through systemic priming and mucosal boosting |
US6258595B1 (en) | 1999-03-18 | 2001-07-10 | The Trustees Of The University Of Pennsylvania | Compositions and methods for helper-free production of recombinant adeno-associated viruses |
WO2001002607A1 (en) | 1999-07-06 | 2001-01-11 | Merck & Co., Inc. | Adenovirus carrying gag gene hiv vaccine |
PL211762B1 (pl) | 2000-01-31 | 2012-06-29 | Smithkline Beecham Biolog | Zastosowanie białka fuzyjnego zawierającego białka HIV Tat i HIV Nef lub polinukleotyd kodujący takie białko, białka lub polinukleotydu HIV gp120, białka lub polinukleotydu SIV Nef, adiuwanta indukującego TH1 zawierającego monofosforylolipid A lub jego pochodną i adiuwanta saponinowego oraz kompozycja szczepionki |
US6740525B2 (en) | 2000-02-09 | 2004-05-25 | Genvec, Inc. | Adenoviral capsid containing chimeric protein IX |
US20040136963A1 (en) | 2001-06-22 | 2004-07-15 | The Trustees Of The University Of Pennsylvania | Simian adenovirus vectors and methods of use |
EP1409748B1 (en) * | 2001-06-22 | 2011-10-26 | The Trustees of The University of Pennsylvania | Recombinant Adenoviruses comprising simian adenovirus proteins and uses thereof. |
US20040241181A1 (en) * | 2001-06-22 | 2004-12-02 | Ertl Hildeghund C. J. | Methods of inducing a cytotoxic immune response and recormbinant simian adenovirus compositions useful therein |
MXPA04004876A (es) | 2001-11-21 | 2004-07-30 | Univ Pennsylvania | Secuencias de acido nucleico y de aminoacido de adenovirus de simio, vectores que contienen los mismos y metodos de uso. |
JP5404990B2 (ja) | 2002-10-23 | 2014-02-05 | グラクソスミスクライン・バイオロジカルス・ソシエテ・アノニム | マラリアに対するワクチン接種の方法 |
US7291498B2 (en) | 2003-06-20 | 2007-11-06 | The Trustees Of The University Of Pennsylvania | Methods of generating chimeric adenoviruses and uses for such chimeric adenoviruses |
US7491508B2 (en) | 2003-06-20 | 2009-02-17 | The Trustees Of The University Of Pennsylvania | Methods of generating chimeric adenoviruses and uses for such chimeric adenoviruses |
DE602005026269D1 (de) * | 2004-04-28 | 2011-03-24 | Univ Pennsylvania | Sequenzielle freisetzung immunogener moleküle über adenoviren und adeno-assoziierte viren vermittelte abgaben |
US9592284B2 (en) * | 2004-04-28 | 2017-03-14 | The Trustees Of The University Of Pennsylvania | Immunization regimen with E4-deleted adenovirus prime and E1-deleted adenovirus boost |
JP5690038B2 (ja) * | 2004-04-28 | 2015-03-25 | ザ・トラステイーズ・オブ・ザ・ユニバーシテイ・オブ・ペンシルベニア | 多価ウイルスベクターおよびその作製のためのシステム |
EP1880012B1 (en) | 2005-05-12 | 2015-07-08 | Glaxo Group Limited | Vaccine composition |
CN101437538A (zh) * | 2006-04-28 | 2009-05-20 | 宾夕法尼亚州立大学托管会 | 修饰的腺病毒六邻体蛋白和其应用 |
EP2040742B1 (en) | 2006-07-18 | 2014-10-29 | GlaxoSmithKline Biologicals S.A. | Vaccines for malaria |
CN101675068A (zh) | 2007-03-02 | 2010-03-17 | 葛兰素史密丝克莱恩生物有限公司 | 新的方法和组合物 |
US9603035B2 (en) | 2011-10-13 | 2017-03-21 | Telefonaktiebolaget L M Ericsson | Method and node related to channel estimation |
-
2002
- 2002-11-20 MX MXPA04004876A patent/MXPA04004876A/es active IP Right Grant
- 2002-11-20 HU HU0500987A patent/HU230364B1/hu not_active IP Right Cessation
- 2002-11-20 EP EP16181533.7A patent/EP3108899A1/en not_active Withdrawn
- 2002-11-20 PL PL373602A patent/PL209133B1/pl unknown
- 2002-11-20 EP EP10178464.3A patent/EP2301582B1/en not_active Expired - Lifetime
- 2002-11-20 NZ NZ564586A patent/NZ564586A/en not_active IP Right Cessation
- 2002-11-20 CN CN02823023XA patent/CN1578678B/zh not_active Expired - Fee Related
- 2002-11-20 EP EP02803963.4A patent/EP1453543B1/en not_active Expired - Lifetime
- 2002-11-20 HU HU1400619A patent/HU230488B1/hu not_active IP Right Cessation
- 2002-11-20 KR KR1020047007792A patent/KR100987360B1/ko not_active IP Right Cessation
- 2002-11-20 IL IL16158402A patent/IL161584A0/xx unknown
- 2002-11-20 SG SG2013034475A patent/SG2013034475A/en unknown
- 2002-11-20 CA CA2466431A patent/CA2466431C/en not_active Expired - Fee Related
- 2002-11-20 NZ NZ532383A patent/NZ532383A/en not_active IP Right Cessation
- 2002-11-20 BR BR0214350-0A patent/BR0214350A/pt not_active Application Discontinuation
- 2002-11-20 HU HU1300578A patent/HU230365B1/hu not_active IP Right Cessation
- 2002-11-20 AU AU2002365366A patent/AU2002365366B2/en not_active Ceased
- 2002-11-20 CA CA2990322A patent/CA2990322A1/en not_active Abandoned
- 2002-11-20 MX MX2012000696A patent/MX351516B/es unknown
- 2002-11-20 SG SG200605559-4A patent/SG165153A1/en unknown
- 2002-11-20 EP EP20100178483 patent/EP2286841A1/en not_active Withdrawn
- 2002-11-20 WO PCT/US2002/033645 patent/WO2003046124A2/en active Application Filing
- 2002-11-20 JP JP2003547559A patent/JP2005511035A/ja not_active Withdrawn
- 2002-11-20 US US10/494,364 patent/US7247472B2/en not_active Expired - Fee Related
- 2002-11-20 NZ NZ550416A patent/NZ550416A/en not_active IP Right Cessation
- 2002-11-20 CA CA2852277A patent/CA2852277C/en not_active Expired - Fee Related
-
2004
- 2004-04-22 IL IL161584A patent/IL161584A/en active IP Right Grant
- 2004-04-23 ZA ZA2004/03117A patent/ZA200403117B/en unknown
- 2004-05-26 NO NO20042191A patent/NO332692B1/no not_active IP Right Cessation
- 2004-06-17 CO CO04056841A patent/CO5590973A2/es active IP Right Grant
-
2007
- 2007-06-19 US US11/820,439 patent/US8105574B2/en not_active Expired - Fee Related
-
2009
- 2009-01-29 JP JP2009018231A patent/JP5715749B2/ja not_active Expired - Fee Related
-
2010
- 2010-10-04 JP JP2010225018A patent/JP2011055835A/ja active Pending
-
2011
- 2011-12-27 US US13/337,608 patent/US8603459B2/en not_active Expired - Lifetime
-
2012
- 2012-03-21 NO NO20120337A patent/NO334512B1/no not_active IP Right Cessation
- 2012-11-29 IL IL223344A patent/IL223344A/en active IP Right Grant
-
2013
- 2013-04-30 NO NO20130590A patent/NO335438B1/no not_active IP Right Cessation
- 2013-08-09 JP JP2013167091A patent/JP2013252144A/ja active Pending
- 2013-11-07 US US14/073,979 patent/US9133483B2/en not_active Expired - Fee Related
-
2014
- 2014-01-02 IL IL230292A patent/IL230292A/en active IP Right Grant
- 2014-03-13 IL IL231502A patent/IL231502A/en not_active IP Right Cessation
- 2014-09-10 JP JP2014184006A patent/JP2015057052A/ja not_active Ceased
- 2014-09-10 JP JP2014184003A patent/JP2015057051A/ja not_active Withdrawn
-
2015
- 2015-08-12 US US14/824,336 patent/US20170119873A9/en not_active Abandoned
-
2016
- 2016-02-19 PH PH12016500338A patent/PH12016500338A1/en unknown
- 2016-11-11 JP JP2016220444A patent/JP2017035111A/ja active Pending
- 2016-11-11 JP JP2016220443A patent/JP2017070292A/ja active Pending
- 2016-11-11 JP JP2016220440A patent/JP2017035110A/ja active Pending
Also Published As
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR100987360B1 (ko) | 원숭이 아데노바이러스 핵산과 아미노산 서열, 이를함유하는 벡터 및 사용방법 | |
US20040136963A1 (en) | Simian adenovirus vectors and methods of use | |
CN1833027B (zh) | 产生嵌合腺病毒的方法及这种嵌合腺病毒的用途 | |
AU2011332025B2 (en) | Subfamily E simian adenoviruses A1321, A1325, A1295, A1309 and A1322 and uses thereof | |
ES2627288T3 (es) | Portadores de vacuna de adenovirus de chimpancé | |
AU2008331906B2 (en) | Simian E adenovirus SAdV-39 | |
KR102535670B1 (ko) | 아데노바이러스 폴리뉴클레오티드 및 폴리펩티드 | |
RU2762854C2 (ru) | Последовательности нуклеиновых кислот и аминокислотные последовательности аденовирусов человекообразных обезьян, исключая человека, содержащие их векторы, и их применения | |
CA2528511A1 (en) | Methods of generating chimeric adenoviruses and uses for such chimeric adenoviruses | |
JP2015083007A (ja) | サルサブファミリーCアデノウイルスSAdV−40、−31および−34ならびにそれらの用途 | |
CN113897388A (zh) | 一种新型黑猩猩腺病毒载体及其构建方法和应用 | |
NL2023464B1 (en) | Oncolytic Non-human adenoviruses and uses thereof | |
CN114269363A (zh) | 用于hiv疫苗应用的复制缺陷型腺病毒载体 | |
AU2007201747A1 (en) | Simian adenovirus nucleic acid and amino acid sequences, vectors containing same, and methods of use |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
MM1K | Lapsed by not paying the annual fees |