TWI858159B - Human papillomavirus polyvalent immunogenic composition - Google Patents
Human papillomavirus polyvalent immunogenic composition Download PDFInfo
- Publication number
- TWI858159B TWI858159B TW109135395A TW109135395A TWI858159B TW I858159 B TWI858159 B TW I858159B TW 109135395 A TW109135395 A TW 109135395A TW 109135395 A TW109135395 A TW 109135395A TW I858159 B TWI858159 B TW I858159B
- Authority
- TW
- Taiwan
- Prior art keywords
- hpv
- seq
- type
- cdata
- protein
- Prior art date
Links
- 239000000203 mixture Substances 0.000 title claims abstract description 27
- 230000002163 immunogen Effects 0.000 title claims abstract description 18
- 241000701806 Human papillomavirus Species 0.000 title description 481
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 554
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 387
- 239000002245 particle Substances 0.000 claims abstract description 154
- 241001631646 Papillomaviridae Species 0.000 claims abstract description 75
- 230000005847 immunogenicity Effects 0.000 claims abstract description 48
- 210000004900 c-terminal fragment Anatomy 0.000 claims abstract description 36
- 210000004898 n-terminal fragment Anatomy 0.000 claims abstract description 35
- 229960005486 vaccine Drugs 0.000 claims abstract description 32
- 108091033319 polynucleotide Proteins 0.000 claims abstract description 22
- 102000040430 polynucleotide Human genes 0.000 claims abstract description 22
- 239000002157 polynucleotide Substances 0.000 claims abstract description 22
- 210000004027 cell Anatomy 0.000 claims description 78
- 239000013598 vector Substances 0.000 claims description 47
- 241000700605 Viruses Species 0.000 claims description 46
- 208000015181 infectious disease Diseases 0.000 claims description 18
- 239000002671 adjuvant Substances 0.000 claims description 15
- 238000002360 preparation method Methods 0.000 claims description 15
- 244000052616 bacterial pathogen Species 0.000 claims description 14
- 241000238631 Hexapoda Species 0.000 claims description 13
- 201000010099 disease Diseases 0.000 claims description 13
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims description 13
- ILRRQNADMUWWFW-UHFFFAOYSA-K aluminium phosphate Chemical group O1[Al]2OP1(=O)O2 ILRRQNADMUWWFW-UHFFFAOYSA-K 0.000 claims description 4
- 239000003814 drug Substances 0.000 claims description 3
- 230000001717 pathogenic effect Effects 0.000 claims description 2
- 229940079593 drug Drugs 0.000 claims 1
- 230000014509 gene expression Effects 0.000 abstract description 78
- 238000011031 large-scale manufacturing process Methods 0.000 abstract description 22
- 101001065501 Escherichia phage MS2 Lysis protein Proteins 0.000 abstract description 7
- 235000018102 proteins Nutrition 0.000 description 355
- 150000001413 amino acids Chemical class 0.000 description 143
- 235000001014 amino acid Nutrition 0.000 description 120
- 229940024606 amino acid Drugs 0.000 description 119
- 108020004414 DNA Proteins 0.000 description 113
- 239000012634 fragment Substances 0.000 description 112
- 125000003275 alpha amino acid group Chemical group 0.000 description 107
- 238000002474 experimental method Methods 0.000 description 79
- 210000002966 serum Anatomy 0.000 description 54
- 239000013592 cell lysate Substances 0.000 description 53
- 238000006386 neutralization reaction Methods 0.000 description 45
- 238000012360 testing method Methods 0.000 description 45
- 241001465754 Metazoa Species 0.000 description 39
- 108010050848 glycylleucine Proteins 0.000 description 39
- 241000282414 Homo sapiens Species 0.000 description 38
- 239000003550 marker Substances 0.000 description 37
- 241001112090 Pseudovirus Species 0.000 description 35
- 239000002773 nucleotide Substances 0.000 description 35
- 125000003729 nucleotide group Chemical group 0.000 description 35
- 108010077245 asparaginyl-proline Proteins 0.000 description 30
- 230000003472 neutralizing effect Effects 0.000 description 28
- 241000699666 Mus <mouse, genus> Species 0.000 description 27
- 108010034529 leucyl-lysine Proteins 0.000 description 27
- 238000000034 method Methods 0.000 description 27
- 108010065920 Insulin Lispro Proteins 0.000 description 26
- 239000006228 supernatant Substances 0.000 description 25
- 210000004899 c-terminal region Anatomy 0.000 description 24
- 238000004519 manufacturing process Methods 0.000 description 24
- 239000013612 plasmid Substances 0.000 description 24
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 23
- 238000010276 construction Methods 0.000 description 23
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 23
- HHQCBFGKQDMWSP-GUBZILKMSA-N Gln-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HHQCBFGKQDMWSP-GUBZILKMSA-N 0.000 description 22
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 22
- 108010017391 lysylvaline Proteins 0.000 description 22
- 108010031719 prolyl-serine Proteins 0.000 description 22
- 108010090894 prolylleucine Proteins 0.000 description 22
- SHAUZYVSXAMYAZ-JYJNAYRXSA-N Gln-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SHAUZYVSXAMYAZ-JYJNAYRXSA-N 0.000 description 21
- 108010020688 glycylhistidine Proteins 0.000 description 21
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 20
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 20
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 20
- 238000001514 detection method Methods 0.000 description 20
- 208000009608 Papillomavirus Infections Diseases 0.000 description 19
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 19
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 18
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 18
- 108010068265 aspartyltyrosine Proteins 0.000 description 18
- 238000005119 centrifugation Methods 0.000 description 18
- 108010054813 diprotin B Proteins 0.000 description 18
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 18
- 108010054155 lysyllysine Proteins 0.000 description 18
- 230000030648 nucleus localization Effects 0.000 description 18
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 17
- WXJKFRMKJORORD-DCAQKATOSA-N Lys-Arg-Ala Chemical compound NC(=N)NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CCCCN WXJKFRMKJORORD-DCAQKATOSA-N 0.000 description 17
- 108010027338 isoleucylcysteine Proteins 0.000 description 17
- YFNOUBWUIIJQHF-LPEHRKFASA-N Pro-Asp-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O YFNOUBWUIIJQHF-LPEHRKFASA-N 0.000 description 16
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 16
- 238000011156 evaluation Methods 0.000 description 16
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 16
- 239000006166 lysate Substances 0.000 description 16
- 238000004806 packaging method and process Methods 0.000 description 16
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 15
- 108010062796 arginyllysine Proteins 0.000 description 15
- 108010060035 arginylproline Proteins 0.000 description 15
- 230000000877 morphologic effect Effects 0.000 description 15
- 108010061238 threonyl-glycine Proteins 0.000 description 15
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 14
- WXVGISRWSYGEDK-KKUMJFAQSA-N Asn-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N WXVGISRWSYGEDK-KKUMJFAQSA-N 0.000 description 14
- CURLTUGMZLYLDI-UHFFFAOYSA-N Carbon dioxide Chemical compound O=C=O CURLTUGMZLYLDI-UHFFFAOYSA-N 0.000 description 14
- YFGONBOFGGWKKY-VHSXEESVSA-N Gly-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)CN)C(=O)O YFGONBOFGGWKKY-VHSXEESVSA-N 0.000 description 14
- NXRNRBOKDBIVKQ-CXTHYWKRSA-N Ile-Tyr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N NXRNRBOKDBIVKQ-CXTHYWKRSA-N 0.000 description 14
- 241000880493 Leptailurus serval Species 0.000 description 14
- NTXYXFDMIHXTHE-WDSOQIARSA-N Leu-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 NTXYXFDMIHXTHE-WDSOQIARSA-N 0.000 description 14
- LNMKRJJLEFASGA-BZSNNMDCSA-N Lys-Phe-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LNMKRJJLEFASGA-BZSNNMDCSA-N 0.000 description 14
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 14
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 14
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 14
- BPGDJSUFQKWUBK-KJEVXHAQSA-N Thr-Val-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BPGDJSUFQKWUBK-KJEVXHAQSA-N 0.000 description 14
- VNGKMNPAENRGDC-JYJNAYRXSA-N Val-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 VNGKMNPAENRGDC-JYJNAYRXSA-N 0.000 description 14
- 108010047495 alanylglycine Proteins 0.000 description 14
- 238000001493 electron microscopy Methods 0.000 description 14
- 108020001507 fusion proteins Proteins 0.000 description 14
- 102000037865 fusion proteins Human genes 0.000 description 14
- 108010053037 kyotorphin Proteins 0.000 description 14
- 108010038320 lysylphenylalanine Proteins 0.000 description 14
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 13
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 13
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 13
- 239000002609 medium Substances 0.000 description 13
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 12
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 12
- XLILXFRAKOYEJX-GUBZILKMSA-N Asp-Leu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLILXFRAKOYEJX-GUBZILKMSA-N 0.000 description 12
- DCRODRAURLJOFY-XPUUQOCRSA-N His-Ala-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)NCC(O)=O DCRODRAURLJOFY-XPUUQOCRSA-N 0.000 description 12
- SJNZALDHDUYDBU-IHRRRGAJSA-N Lys-Arg-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(O)=O SJNZALDHDUYDBU-IHRRRGAJSA-N 0.000 description 12
- LUAJJLPHUXPQLH-KKUMJFAQSA-N Lys-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N LUAJJLPHUXPQLH-KKUMJFAQSA-N 0.000 description 12
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 12
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 12
- BWTKUQPNOMMKMA-FIRPJDEBSA-N Phe-Ile-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BWTKUQPNOMMKMA-FIRPJDEBSA-N 0.000 description 12
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 12
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 12
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 12
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 12
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 12
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 12
- VIWQOOBRKCGSDK-RYQLBKOJSA-N Trp-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O VIWQOOBRKCGSDK-RYQLBKOJSA-N 0.000 description 12
- 108010093581 aspartyl-proline Proteins 0.000 description 12
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 12
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 12
- 108010057821 leucylproline Proteins 0.000 description 12
- 108010064235 lysylglycine Proteins 0.000 description 12
- 108010077112 prolyl-proline Proteins 0.000 description 12
- 239000000523 sample Substances 0.000 description 12
- 238000004627 transmission electron microscopy Methods 0.000 description 12
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 11
- WOZDCBHUGJVJPL-AVGNSLFASA-N Arg-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WOZDCBHUGJVJPL-AVGNSLFASA-N 0.000 description 11
- GIQCDTKOIPUDSG-GARJFASQSA-N Asn-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N)C(=O)O GIQCDTKOIPUDSG-GARJFASQSA-N 0.000 description 11
- DJCAHYVLMSRBFR-QXEWZRGKSA-N Asp-Met-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(O)=O DJCAHYVLMSRBFR-QXEWZRGKSA-N 0.000 description 11
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 description 11
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 11
- 241000341655 Human papillomavirus type 16 Species 0.000 description 11
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 11
- 241000699670 Mus sp. Species 0.000 description 11
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 11
- QSKCKTUQPICLSO-AVGNSLFASA-N Pro-Arg-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O QSKCKTUQPICLSO-AVGNSLFASA-N 0.000 description 11
- ULWBBFKQBDNGOY-RWMBFGLXSA-N Pro-Lys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N2CCC[C@@H]2C(=O)O ULWBBFKQBDNGOY-RWMBFGLXSA-N 0.000 description 11
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 11
- NMZXJDSKEGFDLJ-DCAQKATOSA-N Ser-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CCCCN)C(=O)O NMZXJDSKEGFDLJ-DCAQKATOSA-N 0.000 description 11
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 11
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 11
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 11
- BXJQKVDPRMLGKN-PMVMPFDFSA-N Tyr-Trp-Leu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(O)=O)C1=CC=C(O)C=C1 BXJQKVDPRMLGKN-PMVMPFDFSA-N 0.000 description 11
- 108010044940 alanylglutamine Proteins 0.000 description 11
- 108010060199 cysteinylproline Proteins 0.000 description 11
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 11
- INOIAEUXVVNJKA-XGEHTFHBSA-N Arg-Thr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O INOIAEUXVVNJKA-XGEHTFHBSA-N 0.000 description 10
- KVPHTGVUMJGMCX-BIIVOSGPSA-N Asp-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N)C(=O)O KVPHTGVUMJGMCX-BIIVOSGPSA-N 0.000 description 10
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 10
- DFRYZTUPVZNRLG-KKUMJFAQSA-N Gln-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DFRYZTUPVZNRLG-KKUMJFAQSA-N 0.000 description 10
- HBMRTXJZQDVRFT-DZKIICNBSA-N Glu-Tyr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HBMRTXJZQDVRFT-DZKIICNBSA-N 0.000 description 10
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 10
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 10
- 208000022361 Human papillomavirus infectious disease Diseases 0.000 description 10
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 10
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 10
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 10
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 10
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 10
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 10
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 10
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 10
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 10
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 10
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 10
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 10
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 10
- DLZKEQQWXODGGZ-KWQFWETISA-N Tyr-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DLZKEQQWXODGGZ-KWQFWETISA-N 0.000 description 10
- DZKFGCNKEVMXFA-JUKXBJQTSA-N Tyr-Ile-His Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O DZKFGCNKEVMXFA-JUKXBJQTSA-N 0.000 description 10
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 10
- 108010038633 aspartylglutamate Proteins 0.000 description 10
- 108010092114 histidylphenylalanine Proteins 0.000 description 10
- 230000003053 immunization Effects 0.000 description 10
- 238000002649 immunization Methods 0.000 description 10
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 10
- 108010071207 serylmethionine Proteins 0.000 description 10
- 108010020532 tyrosyl-proline Proteins 0.000 description 10
- 108010003137 tyrosyltyrosine Proteins 0.000 description 10
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 9
- 108010083946 Asp-Tyr-Leu-Lys Proteins 0.000 description 9
- LVSYIKGMLRHKME-IUCAKERBSA-N Gln-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N LVSYIKGMLRHKME-IUCAKERBSA-N 0.000 description 9
- PHONXOACARQMPM-BQBZGAKWSA-N Gly-Ala-Met Chemical compound [H]NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O PHONXOACARQMPM-BQBZGAKWSA-N 0.000 description 9
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 9
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 9
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 9
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 9
- LZHHZYDPMZEMRX-STQMWFEESA-N Pro-Tyr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O LZHHZYDPMZEMRX-STQMWFEESA-N 0.000 description 9
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 9
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 9
- WSFXJLFSJSXGMQ-MGHWNKPDSA-N Tyr-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WSFXJLFSJSXGMQ-MGHWNKPDSA-N 0.000 description 9
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 9
- 108010012058 leucyltyrosine Proteins 0.000 description 9
- 150000007523 nucleic acids Chemical class 0.000 description 9
- 229960002566 papillomavirus vaccine Drugs 0.000 description 9
- KVWLTGNCJYDJET-LSJOCFKGSA-N Ala-Arg-His Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KVWLTGNCJYDJET-LSJOCFKGSA-N 0.000 description 8
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 8
- SLQQPJBDBVPVQV-JYJNAYRXSA-N Arg-Phe-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O SLQQPJBDBVPVQV-JYJNAYRXSA-N 0.000 description 8
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 8
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 8
- PPCORQFLAZWUNO-QWRGUYRKSA-N Asn-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N PPCORQFLAZWUNO-QWRGUYRKSA-N 0.000 description 8
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 8
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 8
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 8
- 206010008342 Cervix carcinoma Diseases 0.000 description 8
- JUUMIGUJJRFQQR-KKUMJFAQSA-N Cys-Lys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N)O JUUMIGUJJRFQQR-KKUMJFAQSA-N 0.000 description 8
- WQWMZOIPXWSZNE-WDSKDSINSA-N Gln-Asp-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O WQWMZOIPXWSZNE-WDSKDSINSA-N 0.000 description 8
- ZGKXAUIVGIBISK-SZMVWBNQSA-N Glu-His-Trp Chemical compound N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O ZGKXAUIVGIBISK-SZMVWBNQSA-N 0.000 description 8
- KFMBRBPXHVMDFN-UWVGGRQHSA-N Gly-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCNC(N)=N KFMBRBPXHVMDFN-UWVGGRQHSA-N 0.000 description 8
- AYBKPDHHVADEDA-YUMQZZPRSA-N Gly-His-Asn Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O AYBKPDHHVADEDA-YUMQZZPRSA-N 0.000 description 8
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 8
- NLZVTPYXYXMCIP-XUXIUFHCSA-N Ile-Pro-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O NLZVTPYXYXMCIP-XUXIUFHCSA-N 0.000 description 8
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 8
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 8
- HEWWNLVEWBJBKA-WDCWCFNPSA-N Lys-Gln-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN HEWWNLVEWBJBKA-WDCWCFNPSA-N 0.000 description 8
- ODUQLUADRKMHOZ-JYJNAYRXSA-N Lys-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)O ODUQLUADRKMHOZ-JYJNAYRXSA-N 0.000 description 8
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 8
- MIROMRNASYKZNL-ULQDDVLXSA-N Lys-Pro-Tyr Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MIROMRNASYKZNL-ULQDDVLXSA-N 0.000 description 8
- GGXZOTSDJJTDGB-GUBZILKMSA-N Met-Ser-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O GGXZOTSDJJTDGB-GUBZILKMSA-N 0.000 description 8
- DEDANIDYQAPTFI-IHRRRGAJSA-N Pro-Asp-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DEDANIDYQAPTFI-IHRRRGAJSA-N 0.000 description 8
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 description 8
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 8
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 8
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 8
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 8
- FJKXUIJOMUWCDD-FHWLQOOXSA-N Tyr-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N)O FJKXUIJOMUWCDD-FHWLQOOXSA-N 0.000 description 8
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 8
- BYAKMYBZADCNMN-JYJNAYRXSA-N Tyr-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYAKMYBZADCNMN-JYJNAYRXSA-N 0.000 description 8
- 208000006105 Uterine Cervical Neoplasms Diseases 0.000 description 8
- HNWQUBBOBKSFQV-AVGNSLFASA-N Val-Arg-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HNWQUBBOBKSFQV-AVGNSLFASA-N 0.000 description 8
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 8
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 8
- 108010005233 alanylglutamic acid Proteins 0.000 description 8
- 238000000137 annealing Methods 0.000 description 8
- 201000010881 cervical cancer Diseases 0.000 description 8
- 238000004925 denaturation Methods 0.000 description 8
- 230000036425 denaturation Effects 0.000 description 8
- 108010089804 glycyl-threonine Proteins 0.000 description 8
- 108010037850 glycylvaline Proteins 0.000 description 8
- 108010045383 histidyl-glycyl-glutamic acid Proteins 0.000 description 8
- 108010003700 lysyl aspartic acid Proteins 0.000 description 8
- 108090000765 processed proteins & peptides Proteins 0.000 description 8
- 108010015796 prolylisoleucine Proteins 0.000 description 8
- 108010038745 tryptophylglycine Proteins 0.000 description 8
- CHFFHQUVXHEGBY-GARJFASQSA-N Ala-Lys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CHFFHQUVXHEGBY-GARJFASQSA-N 0.000 description 7
- 206010059313 Anogenital warts Diseases 0.000 description 7
- OEUQMKNNOWJREN-AVGNSLFASA-N Asp-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N OEUQMKNNOWJREN-AVGNSLFASA-N 0.000 description 7
- VKAWJBQTFCBHQY-GUBZILKMSA-N Cys-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N VKAWJBQTFCBHQY-GUBZILKMSA-N 0.000 description 7
- 101000641175 Human papillomavirus type 18 Major capsid protein L1 Proteins 0.000 description 7
- 108700025391 Human papillomavirus type 6 L1 Proteins 0.000 description 7
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 7
- HXSUFWQYLPKEHF-IHRRRGAJSA-N Phe-Asn-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HXSUFWQYLPKEHF-IHRRRGAJSA-N 0.000 description 7
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 7
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 7
- WURLIFOWSMBUAR-SLFFLAALSA-N Tyr-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O WURLIFOWSMBUAR-SLFFLAALSA-N 0.000 description 7
- 229910002092 carbon dioxide Inorganic materials 0.000 description 7
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 7
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 7
- 102000039446 nucleic acids Human genes 0.000 description 7
- 108020004707 nucleic acids Proteins 0.000 description 7
- MIPWEZAIMPYQST-FXQIFTODSA-N Ala-Cys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O MIPWEZAIMPYQST-FXQIFTODSA-N 0.000 description 6
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 6
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 6
- MSILNNHVVMMTHZ-UWVGGRQHSA-N Arg-His-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 MSILNNHVVMMTHZ-UWVGGRQHSA-N 0.000 description 6
- VITDJIPIJZAVGC-VEVYYDQMSA-N Asn-Met-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VITDJIPIJZAVGC-VEVYYDQMSA-N 0.000 description 6
- HPASIOLTWSNMFB-OLHMAJIHSA-N Asn-Thr-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O HPASIOLTWSNMFB-OLHMAJIHSA-N 0.000 description 6
- GGRSYTUJHAZTFN-IHRRRGAJSA-N Asp-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O GGRSYTUJHAZTFN-IHRRRGAJSA-N 0.000 description 6
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 6
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 6
- AZDQAZRURQMSQD-XPUUQOCRSA-N Cys-Val-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AZDQAZRURQMSQD-XPUUQOCRSA-N 0.000 description 6
- MWERYIXRDZDXOA-QEWYBTABSA-N Gln-Ile-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MWERYIXRDZDXOA-QEWYBTABSA-N 0.000 description 6
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 6
- PBFGQTGPSKWHJA-QEJZJMRPSA-N Glu-Asp-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O PBFGQTGPSKWHJA-QEJZJMRPSA-N 0.000 description 6
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 6
- SUDUYJOBLHQAMI-WHFBIAKZSA-N Gly-Asp-Cys Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O SUDUYJOBLHQAMI-WHFBIAKZSA-N 0.000 description 6
- LUJVWKKYHSLULQ-ZKWXMUAHSA-N Gly-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN LUJVWKKYHSLULQ-ZKWXMUAHSA-N 0.000 description 6
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 6
- IEGFSKKANYKBDU-QWHCGFSZSA-N Gly-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)CN)C(=O)O IEGFSKKANYKBDU-QWHCGFSZSA-N 0.000 description 6
- JVEKQAYXFGIISZ-HOCLYGCPSA-N His-Trp-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(O)=O)C1=CN=CN1 JVEKQAYXFGIISZ-HOCLYGCPSA-N 0.000 description 6
- AWTDTFXPVCTHAK-BJDJZHNGSA-N Ile-Cys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N AWTDTFXPVCTHAK-BJDJZHNGSA-N 0.000 description 6
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 6
- VOCZPDONPURUHV-QEWYBTABSA-N Ile-Phe-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VOCZPDONPURUHV-QEWYBTABSA-N 0.000 description 6
- VZSDQFZFTCVEGF-ZEWNOJEFSA-N Ile-Phe-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O VZSDQFZFTCVEGF-ZEWNOJEFSA-N 0.000 description 6
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 6
- FJUKMPUELVROGK-IHRRRGAJSA-N Leu-Arg-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N FJUKMPUELVROGK-IHRRRGAJSA-N 0.000 description 6
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 6
- GLBNEGIOFRVRHO-JYJNAYRXSA-N Leu-Gln-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLBNEGIOFRVRHO-JYJNAYRXSA-N 0.000 description 6
- KUEVMUXNILMJTK-JYJNAYRXSA-N Leu-Gln-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KUEVMUXNILMJTK-JYJNAYRXSA-N 0.000 description 6
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 6
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 6
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 6
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 6
- NQSFIPWBPXNJII-PMVMPFDFSA-N Lys-Phe-Trp Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 NQSFIPWBPXNJII-PMVMPFDFSA-N 0.000 description 6
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 6
- JHDNAOVJJQSMMM-GMOBBJLQSA-N Met-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N JHDNAOVJJQSMMM-GMOBBJLQSA-N 0.000 description 6
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 6
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 6
- 108010065395 Neuropep-1 Proteins 0.000 description 6
- LJUUGSWZPQOJKD-JYJNAYRXSA-N Phe-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O LJUUGSWZPQOJKD-JYJNAYRXSA-N 0.000 description 6
- KAHUBGWSIQNZQQ-KKUMJFAQSA-N Phe-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KAHUBGWSIQNZQQ-KKUMJFAQSA-N 0.000 description 6
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 6
- OWSLLRKCHLTUND-BZSNNMDCSA-N Phe-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OWSLLRKCHLTUND-BZSNNMDCSA-N 0.000 description 6
- GTMSCDVFQLNEOY-BZSNNMDCSA-N Phe-Tyr-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N GTMSCDVFQLNEOY-BZSNNMDCSA-N 0.000 description 6
- MTHRMUXESFIAMS-DCAQKATOSA-N Pro-Asn-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O MTHRMUXESFIAMS-DCAQKATOSA-N 0.000 description 6
- SFECXGVELZFBFJ-VEVYYDQMSA-N Pro-Asp-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFECXGVELZFBFJ-VEVYYDQMSA-N 0.000 description 6
- KWMUAKQOVYCQJQ-ZPFDUUQYSA-N Pro-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 KWMUAKQOVYCQJQ-ZPFDUUQYSA-N 0.000 description 6
- GFHOSBYCLACKEK-GUBZILKMSA-N Pro-Pro-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GFHOSBYCLACKEK-GUBZILKMSA-N 0.000 description 6
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 6
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 6
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 6
- IXCHOHLPHNGFTJ-YUMQZZPRSA-N Ser-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N IXCHOHLPHNGFTJ-YUMQZZPRSA-N 0.000 description 6
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 6
- AMRRYKHCILPAKD-FXQIFTODSA-N Ser-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N AMRRYKHCILPAKD-FXQIFTODSA-N 0.000 description 6
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 6
- CTONFVDJYCAMQM-IUKAMOBKSA-N Thr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H]([C@@H](C)O)N CTONFVDJYCAMQM-IUKAMOBKSA-N 0.000 description 6
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 6
- CQNFRKAKGDSJFR-NUMRIWBASA-N Thr-Glu-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CQNFRKAKGDSJFR-NUMRIWBASA-N 0.000 description 6
- ZTPXSEUVYNNZRB-CDMKHQONSA-N Thr-Gly-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZTPXSEUVYNNZRB-CDMKHQONSA-N 0.000 description 6
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 description 6
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 6
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 6
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 6
- BEZTUFWTPVOROW-KJEVXHAQSA-N Thr-Tyr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O BEZTUFWTPVOROW-KJEVXHAQSA-N 0.000 description 6
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 6
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 6
- FNOQJVHFVLVMOS-AAEUAGOBSA-N Trp-Gly-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N FNOQJVHFVLVMOS-AAEUAGOBSA-N 0.000 description 6
- YVXIAOOYAKBAAI-SZMVWBNQSA-N Trp-Leu-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 YVXIAOOYAKBAAI-SZMVWBNQSA-N 0.000 description 6
- QYSBJAUCUKHSLU-JYJNAYRXSA-N Tyr-Arg-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O QYSBJAUCUKHSLU-JYJNAYRXSA-N 0.000 description 6
- AKLNEFNQWLHIGY-QWRGUYRKSA-N Tyr-Gly-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N)O AKLNEFNQWLHIGY-QWRGUYRKSA-N 0.000 description 6
- GITNQBVCEQBDQC-KKUMJFAQSA-N Tyr-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O GITNQBVCEQBDQC-KKUMJFAQSA-N 0.000 description 6
- XIFAHCUNWWKUDE-DCAQKATOSA-N Val-Cys-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N XIFAHCUNWWKUDE-DCAQKATOSA-N 0.000 description 6
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 6
- GMOLURHJBLOBFW-ONGXEEELSA-N Val-Gly-His Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GMOLURHJBLOBFW-ONGXEEELSA-N 0.000 description 6
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 6
- IRAUYEAFPFPVND-UVBJJODRSA-N Val-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 IRAUYEAFPFPVND-UVBJJODRSA-N 0.000 description 6
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 6
- 238000010790 dilution Methods 0.000 description 6
- 239000012895 dilution Substances 0.000 description 6
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 6
- 108010010147 glycylglutamine Proteins 0.000 description 6
- 108010018006 histidylserine Proteins 0.000 description 6
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 6
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 6
- 108010009298 lysylglutamic acid Proteins 0.000 description 6
- 229920001184 polypeptide Polymers 0.000 description 6
- 238000012257 pre-denaturation Methods 0.000 description 6
- 102000004196 processed proteins & peptides Human genes 0.000 description 6
- 238000000746 purification Methods 0.000 description 6
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 5
- BTJVOUQWFXABOI-IHRRRGAJSA-N Arg-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCNC(N)=N BTJVOUQWFXABOI-IHRRRGAJSA-N 0.000 description 5
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 5
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 5
- BJDHEININLSZOT-KKUMJFAQSA-N Asp-Tyr-Lys Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(O)=O BJDHEININLSZOT-KKUMJFAQSA-N 0.000 description 5
- BUAUGQJXGNRTQE-AAEUAGOBSA-N Cys-Trp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N BUAUGQJXGNRTQE-AAEUAGOBSA-N 0.000 description 5
- 229940124897 Gardasil Drugs 0.000 description 5
- YRHZWVKUFWCEPW-GLLZPBPUSA-N Gln-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O YRHZWVKUFWCEPW-GLLZPBPUSA-N 0.000 description 5
- JUBDONGMHASUCN-IUCAKERBSA-N Gly-Glu-His Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O JUBDONGMHASUCN-IUCAKERBSA-N 0.000 description 5
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 5
- 101000642125 Human papillomavirus 11 Major capsid protein L1 Proteins 0.000 description 5
- 101100209954 Human papillomavirus type 16 L1 gene Proteins 0.000 description 5
- 101000641177 Human papillomavirus type 16 Major capsid protein L1 Proteins 0.000 description 5
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 5
- IEIHKHYMBIYQTH-YESZJQIVSA-N Lys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCCN)N)C(=O)O IEIHKHYMBIYQTH-YESZJQIVSA-N 0.000 description 5
- 108010077850 Nuclear Localization Signals Proteins 0.000 description 5
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 5
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 5
- QNJZOAHSYPXTAB-VEVYYDQMSA-N Thr-Asn-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O QNJZOAHSYPXTAB-VEVYYDQMSA-N 0.000 description 5
- ODXKUIGEPAGKKV-KATARQTJSA-N Thr-Leu-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N)O ODXKUIGEPAGKKV-KATARQTJSA-N 0.000 description 5
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 5
- 108010087924 alanylproline Proteins 0.000 description 5
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 5
- 125000000539 amino acid group Chemical group 0.000 description 5
- 208000025009 anogenital human papillomavirus infection Diseases 0.000 description 5
- 201000004201 anogenital venereal wart Diseases 0.000 description 5
- 230000005540 biological transmission Effects 0.000 description 5
- 239000000872 buffer Substances 0.000 description 5
- 238000004587 chromatography analysis Methods 0.000 description 5
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 5
- 238000000338 in vitro Methods 0.000 description 5
- 108010026333 seryl-proline Proteins 0.000 description 5
- 239000011780 sodium chloride Substances 0.000 description 5
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 4
- UQJUGHFKNKGHFQ-VZFHVOOUSA-N Ala-Cys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UQJUGHFKNKGHFQ-VZFHVOOUSA-N 0.000 description 4
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 4
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 4
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 4
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 4
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 4
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 4
- 208000007860 Anus Neoplasms Diseases 0.000 description 4
- YNSGXDWWPCGGQS-YUMQZZPRSA-N Arg-Gly-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O YNSGXDWWPCGGQS-YUMQZZPRSA-N 0.000 description 4
- ITHMWNNUDPJJER-ULQDDVLXSA-N Arg-His-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ITHMWNNUDPJJER-ULQDDVLXSA-N 0.000 description 4
- DGFXIWKPTDKBLF-AVGNSLFASA-N Arg-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N DGFXIWKPTDKBLF-AVGNSLFASA-N 0.000 description 4
- IIAXFBUTKIDDIP-ULQDDVLXSA-N Arg-Leu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IIAXFBUTKIDDIP-ULQDDVLXSA-N 0.000 description 4
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 4
- CVXXSWQORBZAAA-SRVKXCTJSA-N Arg-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N CVXXSWQORBZAAA-SRVKXCTJSA-N 0.000 description 4
- XMZZGVGKGXRIGJ-JYJNAYRXSA-N Arg-Tyr-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O XMZZGVGKGXRIGJ-JYJNAYRXSA-N 0.000 description 4
- CGXQUULXFWRJOI-SRVKXCTJSA-N Arg-Val-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O CGXQUULXFWRJOI-SRVKXCTJSA-N 0.000 description 4
- PCKRJVZAQZWNKM-WHFBIAKZSA-N Asn-Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O PCKRJVZAQZWNKM-WHFBIAKZSA-N 0.000 description 4
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 4
- JWKDQOORUCYUIW-ZPFDUUQYSA-N Asn-Lys-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JWKDQOORUCYUIW-ZPFDUUQYSA-N 0.000 description 4
- AYOAHKWVQLNPDM-HJGDQZAQSA-N Asn-Lys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AYOAHKWVQLNPDM-HJGDQZAQSA-N 0.000 description 4
- RVHGJNGNKGDCPX-KKUMJFAQSA-N Asn-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N RVHGJNGNKGDCPX-KKUMJFAQSA-N 0.000 description 4
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 4
- WUQXMTITJLFXAU-JIOCBJNQSA-N Asn-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N)O WUQXMTITJLFXAU-JIOCBJNQSA-N 0.000 description 4
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 4
- BLQBMRNMBAYREH-UWJYBYFXSA-N Asp-Ala-Tyr Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O BLQBMRNMBAYREH-UWJYBYFXSA-N 0.000 description 4
- CASGONAXMZPHCK-FXQIFTODSA-N Asp-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N CASGONAXMZPHCK-FXQIFTODSA-N 0.000 description 4
- UFAQGGZUXVLONR-AVGNSLFASA-N Asp-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)O UFAQGGZUXVLONR-AVGNSLFASA-N 0.000 description 4
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 4
- CMBDUPIBCOEWNE-BJDJZHNGSA-N Asp-Leu-Asp-Gln Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CMBDUPIBCOEWNE-BJDJZHNGSA-N 0.000 description 4
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 4
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 4
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 description 4
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 4
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 4
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 4
- NVXLFIPTHPKSKL-UBHSHLNASA-N Asp-Trp-Asn Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(O)=O)N)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 NVXLFIPTHPKSKL-UBHSHLNASA-N 0.000 description 4
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 4
- QPDUWAUSSWGJSB-NGZCFLSTSA-N Asp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N QPDUWAUSSWGJSB-NGZCFLSTSA-N 0.000 description 4
- 208000000907 Condylomata Acuminata Diseases 0.000 description 4
- PKNIZMPLMSKROD-BIIVOSGPSA-N Cys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N PKNIZMPLMSKROD-BIIVOSGPSA-N 0.000 description 4
- KSMSFCBQBQPFAD-GUBZILKMSA-N Cys-Pro-Pro Chemical compound SC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 KSMSFCBQBQPFAD-GUBZILKMSA-N 0.000 description 4
- YNJBLTDKTMKEET-ZLUOBGJFSA-N Cys-Ser-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O YNJBLTDKTMKEET-ZLUOBGJFSA-N 0.000 description 4
- PNEAWXSKCKCHDK-XIRDDKMYSA-N Cys-Trp-His Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CS)N)C(O)=O)C1=CN=CN1 PNEAWXSKCKCHDK-XIRDDKMYSA-N 0.000 description 4
- WVWRADGCZPIJJR-IHRRRGAJSA-N Cys-Val-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CS)N WVWRADGCZPIJJR-IHRRRGAJSA-N 0.000 description 4
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 4
- JFSNBQJNDMXMQF-XHNCKOQMSA-N Gln-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JFSNBQJNDMXMQF-XHNCKOQMSA-N 0.000 description 4
- UICOTGULOUGGLC-NUMRIWBASA-N Gln-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UICOTGULOUGGLC-NUMRIWBASA-N 0.000 description 4
- TWIAMTNJOMRDAK-GUBZILKMSA-N Gln-Lys-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O TWIAMTNJOMRDAK-GUBZILKMSA-N 0.000 description 4
- OZEQPCDLCDRCGY-SOUVJXGZSA-N Gln-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O OZEQPCDLCDRCGY-SOUVJXGZSA-N 0.000 description 4
- GTBXHETZPUURJE-KKUMJFAQSA-N Gln-Tyr-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GTBXHETZPUURJE-KKUMJFAQSA-N 0.000 description 4
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 4
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 4
- YLJHCWNDBKKOEB-IHRRRGAJSA-N Glu-Glu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YLJHCWNDBKKOEB-IHRRRGAJSA-N 0.000 description 4
- BUAKRRKDHSSIKK-IHRRRGAJSA-N Glu-Glu-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BUAKRRKDHSSIKK-IHRRRGAJSA-N 0.000 description 4
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 4
- HHSKZJZWQFPSKN-AVGNSLFASA-N Glu-Tyr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O HHSKZJZWQFPSKN-AVGNSLFASA-N 0.000 description 4
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 4
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 4
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 4
- ZRZILYKEJBMFHY-BQBZGAKWSA-N Gly-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN ZRZILYKEJBMFHY-BQBZGAKWSA-N 0.000 description 4
- PEZZSFLFXXFUQD-XPUUQOCRSA-N Gly-Cys-Val Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O PEZZSFLFXXFUQD-XPUUQOCRSA-N 0.000 description 4
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 4
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 4
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 4
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 4
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 4
- 101710121996 Hexon protein p72 Proteins 0.000 description 4
- AVQOSMRPITVTRB-CIUDSAMLSA-N His-Asn-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AVQOSMRPITVTRB-CIUDSAMLSA-N 0.000 description 4
- PGRPSOUCWRBWKZ-DLOVCJGASA-N His-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 PGRPSOUCWRBWKZ-DLOVCJGASA-N 0.000 description 4
- VCBWXASUBZIFLQ-IHRRRGAJSA-N His-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O VCBWXASUBZIFLQ-IHRRRGAJSA-N 0.000 description 4
- CHIAUHSHDARFBD-ULQDDVLXSA-N His-Pro-Tyr Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 CHIAUHSHDARFBD-ULQDDVLXSA-N 0.000 description 4
- 241000701828 Human papillomavirus type 11 Species 0.000 description 4
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 4
- ZIPOVLBRVPXWJQ-SPOWBLRKSA-N Ile-Cys-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N ZIPOVLBRVPXWJQ-SPOWBLRKSA-N 0.000 description 4
- NHJKZMDIMMTVCK-QXEWZRGKSA-N Ile-Gly-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N NHJKZMDIMMTVCK-QXEWZRGKSA-N 0.000 description 4
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 description 4
- KEKTTYCXKGBAAL-VGDYDELISA-N Ile-His-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N KEKTTYCXKGBAAL-VGDYDELISA-N 0.000 description 4
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 4
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 4
- PWWVAXIEGOYWEE-UHFFFAOYSA-N Isophenergan Chemical compound C1=CC=C2N(CC(C)N(C)C)C3=CC=CC=C3SC2=C1 PWWVAXIEGOYWEE-UHFFFAOYSA-N 0.000 description 4
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 4
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 4
- IIKJNQWOQIWWMR-CIUDSAMLSA-N Leu-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(C)C)N IIKJNQWOQIWWMR-CIUDSAMLSA-N 0.000 description 4
- NHHKSOGJYNQENP-SRVKXCTJSA-N Leu-Cys-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N NHHKSOGJYNQENP-SRVKXCTJSA-N 0.000 description 4
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 4
- AXZGZMGRBDQTEY-SRVKXCTJSA-N Leu-Gln-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O AXZGZMGRBDQTEY-SRVKXCTJSA-N 0.000 description 4
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 4
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 4
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 4
- VVQJGYPTIYOFBR-IHRRRGAJSA-N Leu-Lys-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N VVQJGYPTIYOFBR-IHRRRGAJSA-N 0.000 description 4
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 4
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 4
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 4
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 4
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 4
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 4
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 4
- WUHBLPVELFTPQK-KKUMJFAQSA-N Leu-Tyr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O WUHBLPVELFTPQK-KKUMJFAQSA-N 0.000 description 4
- ARNIBBOXIAWUOP-MGHWNKPDSA-N Leu-Tyr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ARNIBBOXIAWUOP-MGHWNKPDSA-N 0.000 description 4
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 4
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 4
- JCFYLFOCALSNLQ-GUBZILKMSA-N Lys-Ala-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JCFYLFOCALSNLQ-GUBZILKMSA-N 0.000 description 4
- GGNOBVSOZPHLCE-GUBZILKMSA-N Lys-Gln-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GGNOBVSOZPHLCE-GUBZILKMSA-N 0.000 description 4
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 4
- MSSJJDVQTFTLIF-KBPBESRZSA-N Lys-Phe-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O MSSJJDVQTFTLIF-KBPBESRZSA-N 0.000 description 4
- RMKJOQSYLQQRFN-KKUMJFAQSA-N Lys-Tyr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O RMKJOQSYLQQRFN-KKUMJFAQSA-N 0.000 description 4
- HMZPYMSEAALNAE-ULQDDVLXSA-N Lys-Val-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMZPYMSEAALNAE-ULQDDVLXSA-N 0.000 description 4
- 101710125418 Major capsid protein Proteins 0.000 description 4
- CAODKDAPYGUMLK-FXQIFTODSA-N Met-Asn-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CAODKDAPYGUMLK-FXQIFTODSA-N 0.000 description 4
- XOMXAVJBLRROMC-IHRRRGAJSA-N Met-Asp-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOMXAVJBLRROMC-IHRRRGAJSA-N 0.000 description 4
- NSMXRFMGZYTFEX-KJEVXHAQSA-N Met-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCSC)N)O NSMXRFMGZYTFEX-KJEVXHAQSA-N 0.000 description 4
- UZBQXELAFPCGRV-SZMVWBNQSA-N Met-Trp-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UZBQXELAFPCGRV-SZMVWBNQSA-N 0.000 description 4
- LBSWWNKMVPAXOI-GUBZILKMSA-N Met-Val-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O LBSWWNKMVPAXOI-GUBZILKMSA-N 0.000 description 4
- 108010079364 N-glycylalanine Proteins 0.000 description 4
- 108010066427 N-valyltryptophan Proteins 0.000 description 4
- 206010057444 Oropharyngeal neoplasm Diseases 0.000 description 4
- 238000012408 PCR amplification Methods 0.000 description 4
- NHCKESBLOMHIIE-IRXDYDNUSA-N Phe-Gly-Phe Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 NHCKESBLOMHIIE-IRXDYDNUSA-N 0.000 description 4
- DSXPMZMSJHOKKK-HJOGWXRNSA-N Phe-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DSXPMZMSJHOKKK-HJOGWXRNSA-N 0.000 description 4
- WWPAHTZOWURIMR-ULQDDVLXSA-N Phe-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 WWPAHTZOWURIMR-ULQDDVLXSA-N 0.000 description 4
- XOHJOMKCRLHGCY-UNQGMJICSA-N Phe-Pro-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOHJOMKCRLHGCY-UNQGMJICSA-N 0.000 description 4
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 4
- NJONQBYLTANINY-IHPCNDPISA-N Phe-Trp-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CC(N)=O)C(O)=O NJONQBYLTANINY-IHPCNDPISA-N 0.000 description 4
- NHHZWPNMYQUNEH-ACRUOGEOSA-N Phe-Tyr-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N NHHZWPNMYQUNEH-ACRUOGEOSA-N 0.000 description 4
- ZYNBEWGJFXTBDU-ACRUOGEOSA-N Phe-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N ZYNBEWGJFXTBDU-ACRUOGEOSA-N 0.000 description 4
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 4
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 4
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 4
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 4
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 4
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 4
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 4
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 4
- QUBVFEANYYWBTM-VEVYYDQMSA-N Pro-Thr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUBVFEANYYWBTM-VEVYYDQMSA-N 0.000 description 4
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 4
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 4
- QMABBZHZMDXHKU-FKBYEOEOSA-N Pro-Tyr-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O QMABBZHZMDXHKU-FKBYEOEOSA-N 0.000 description 4
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 4
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 4
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 4
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 4
- DLPXTCTVNDTYGJ-JBDRJPRFSA-N Ser-Ile-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(O)=O DLPXTCTVNDTYGJ-JBDRJPRFSA-N 0.000 description 4
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 4
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 4
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 4
- KJKQUQXDEKMPDK-FXQIFTODSA-N Ser-Met-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O KJKQUQXDEKMPDK-FXQIFTODSA-N 0.000 description 4
- HEYZPTCCEIWHRO-IHRRRGAJSA-N Ser-Met-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HEYZPTCCEIWHRO-IHRRRGAJSA-N 0.000 description 4
- ASGYVPAVFNDZMA-GUBZILKMSA-N Ser-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CO)N ASGYVPAVFNDZMA-GUBZILKMSA-N 0.000 description 4
- ZKBKUWQVDWWSRI-BZSNNMDCSA-N Ser-Phe-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKBKUWQVDWWSRI-BZSNNMDCSA-N 0.000 description 4
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 4
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 4
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 4
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 4
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 4
- JTEICXDKGWKRRV-HJGDQZAQSA-N Thr-Asn-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JTEICXDKGWKRRV-HJGDQZAQSA-N 0.000 description 4
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 4
- QNCFWHZVRNXAKW-OEAJRASXSA-N Thr-Lys-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNCFWHZVRNXAKW-OEAJRASXSA-N 0.000 description 4
- XSEPSRUDSPHMPX-KATARQTJSA-N Thr-Lys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O XSEPSRUDSPHMPX-KATARQTJSA-N 0.000 description 4
- DOBIBIXIHJKVJF-XKBZYTNZSA-N Thr-Ser-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DOBIBIXIHJKVJF-XKBZYTNZSA-N 0.000 description 4
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 4
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 4
- GHXXDFDIDHIEIL-WFBYXXMGSA-N Trp-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N GHXXDFDIDHIEIL-WFBYXXMGSA-N 0.000 description 4
- TWJDQTTXXZDJKV-BPUTZDHNSA-N Trp-Arg-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O TWJDQTTXXZDJKV-BPUTZDHNSA-N 0.000 description 4
- DVWAIHZOPSYMSJ-ZVZYQTTQSA-N Trp-Glu-Val Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O)=CNC2=C1 DVWAIHZOPSYMSJ-ZVZYQTTQSA-N 0.000 description 4
- VTFWAGGJDRSQFG-MELADBBJSA-N Tyr-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O VTFWAGGJDRSQFG-MELADBBJSA-N 0.000 description 4
- GAYLGYUVTDMLKC-UWJYBYFXSA-N Tyr-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GAYLGYUVTDMLKC-UWJYBYFXSA-N 0.000 description 4
- JFDGVHXRCKEBAU-KKUMJFAQSA-N Tyr-Asp-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JFDGVHXRCKEBAU-KKUMJFAQSA-N 0.000 description 4
- IGXLNVIYDYONFB-UFYCRDLUSA-N Tyr-Phe-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 IGXLNVIYDYONFB-UFYCRDLUSA-N 0.000 description 4
- NVZVJIUDICCMHZ-BZSNNMDCSA-N Tyr-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O NVZVJIUDICCMHZ-BZSNNMDCSA-N 0.000 description 4
- AGDDLOQMXUQPDY-BZSNNMDCSA-N Tyr-Tyr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O AGDDLOQMXUQPDY-BZSNNMDCSA-N 0.000 description 4
- OBKOPLHSRDATFO-XHSDSOJGSA-N Tyr-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OBKOPLHSRDATFO-XHSDSOJGSA-N 0.000 description 4
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 4
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 4
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 4
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 4
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 4
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 4
- YDVDTCJGBBJGRT-GUBZILKMSA-N Val-Met-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N YDVDTCJGBBJGRT-GUBZILKMSA-N 0.000 description 4
- QPPZEDOTPZOSEC-RCWTZXSCSA-N Val-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N)O QPPZEDOTPZOSEC-RCWTZXSCSA-N 0.000 description 4
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 description 4
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 4
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 4
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 4
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 4
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 4
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 4
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 4
- 239000000427 antigen Substances 0.000 description 4
- 108091007433 antigens Proteins 0.000 description 4
- 102000036639 antigens Human genes 0.000 description 4
- 108010013835 arginine glutamate Proteins 0.000 description 4
- 239000001569 carbon dioxide Substances 0.000 description 4
- 238000010828 elution Methods 0.000 description 4
- 108010078144 glutaminyl-glycine Proteins 0.000 description 4
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 4
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 4
- 108010040030 histidinoalanine Proteins 0.000 description 4
- 108010025306 histidylleucine Proteins 0.000 description 4
- 108700042300 human papillomavirus type 52 L1 Proteins 0.000 description 4
- 230000005764 inhibitory process Effects 0.000 description 4
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 4
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 4
- 239000008188 pellet Substances 0.000 description 4
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 4
- 230000009465 prokaryotic expression Effects 0.000 description 4
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 4
- 150000003839 salts Chemical class 0.000 description 4
- 108010007375 seryl-seryl-seryl-arginine Proteins 0.000 description 4
- 108010005652 splenotritin Proteins 0.000 description 4
- 108010005834 tyrosyl-alanyl-glycine Proteins 0.000 description 4
- 108700042752 tyrosyl-prolyl-leucyl-glycine Proteins 0.000 description 4
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 4
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 4
- 108010073969 valyllysine Proteins 0.000 description 4
- 108010009962 valyltyrosine Proteins 0.000 description 4
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 3
- NYZGVTGOMPHSJW-CIUDSAMLSA-N Arg-Glu-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N NYZGVTGOMPHSJW-CIUDSAMLSA-N 0.000 description 3
- IRRMIGDCPOPZJW-ULQDDVLXSA-N Arg-His-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IRRMIGDCPOPZJW-ULQDDVLXSA-N 0.000 description 3
- YRTOMUMWSTUQAX-FXQIFTODSA-N Asn-Pro-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O YRTOMUMWSTUQAX-FXQIFTODSA-N 0.000 description 3
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 3
- 102000053602 DNA Human genes 0.000 description 3
- LKUWAWGNJYJODH-KBIXCLLPSA-N Gln-Ala-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKUWAWGNJYJODH-KBIXCLLPSA-N 0.000 description 3
- RGXXLQWXBFNXTG-CIUDSAMLSA-N Gln-Arg-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O RGXXLQWXBFNXTG-CIUDSAMLSA-N 0.000 description 3
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 3
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 3
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 description 3
- UZWUBBRJWFTHTD-LAEOZQHASA-N Glu-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O UZWUBBRJWFTHTD-LAEOZQHASA-N 0.000 description 3
- NTNUEBVGKMVANB-NHCYSSNCSA-N Glu-Val-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O NTNUEBVGKMVANB-NHCYSSNCSA-N 0.000 description 3
- LGQZOQRDEUIZJY-YUMQZZPRSA-N Gly-Cys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CS)NC(=O)CN)C(O)=O LGQZOQRDEUIZJY-YUMQZZPRSA-N 0.000 description 3
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 3
- YABRDIBSPZONIY-BQBZGAKWSA-N Gly-Ser-Met Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O YABRDIBSPZONIY-BQBZGAKWSA-N 0.000 description 3
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 3
- SYPULFZAGBBIOM-GVXVVHGQSA-N His-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N SYPULFZAGBBIOM-GVXVVHGQSA-N 0.000 description 3
- 241000282412 Homo Species 0.000 description 3
- 101000781698 Human papillomavirus 35 Major capsid protein L1 Proteins 0.000 description 3
- 108700005307 Human papillomavirus HPV L1 Proteins 0.000 description 3
- OVDKXUDMKXAZIV-ZPFDUUQYSA-N Ile-Lys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OVDKXUDMKXAZIV-ZPFDUUQYSA-N 0.000 description 3
- 108060003951 Immunoglobulin Proteins 0.000 description 3
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 3
- PPBKJAQJAUHZKX-SRVKXCTJSA-N Leu-Cys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(C)C PPBKJAQJAUHZKX-SRVKXCTJSA-N 0.000 description 3
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 3
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 3
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 3
- LOGFVTREOLYCPF-RHYQMDGZSA-N Lys-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-RHYQMDGZSA-N 0.000 description 3
- 101710157639 Minor capsid protein Proteins 0.000 description 3
- 206010028980 Neoplasm Diseases 0.000 description 3
- 108700026244 Open Reading Frames Proteins 0.000 description 3
- KAGCQPSEVAETCA-JYJNAYRXSA-N Phe-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N KAGCQPSEVAETCA-JYJNAYRXSA-N 0.000 description 3
- CBENHWCORLVGEQ-HJOGWXRNSA-N Phe-Phe-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 CBENHWCORLVGEQ-HJOGWXRNSA-N 0.000 description 3
- WWAQEUOYCYMGHB-FXQIFTODSA-N Pro-Asn-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 WWAQEUOYCYMGHB-FXQIFTODSA-N 0.000 description 3
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 3
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 3
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 3
- 101710136297 Protein VP2 Proteins 0.000 description 3
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 3
- IDQFQFVEWMWRQQ-DLOVCJGASA-N Ser-Ala-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IDQFQFVEWMWRQQ-DLOVCJGASA-N 0.000 description 3
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 3
- 101710172711 Structural protein Proteins 0.000 description 3
- UTCFSBBXPWKLTG-XKBZYTNZSA-N Thr-Cys-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O UTCFSBBXPWKLTG-XKBZYTNZSA-N 0.000 description 3
- YAAPRMFURSENOZ-KATARQTJSA-N Thr-Cys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N)O YAAPRMFURSENOZ-KATARQTJSA-N 0.000 description 3
- DKDHTRVDOUZZTP-IFFSRLJSSA-N Thr-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DKDHTRVDOUZZTP-IFFSRLJSSA-N 0.000 description 3
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 3
- WLBZWXXGSOLJBA-HOCLYGCPSA-N Trp-Gly-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 WLBZWXXGSOLJBA-HOCLYGCPSA-N 0.000 description 3
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 3
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 3
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 description 3
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 201000011510 cancer Diseases 0.000 description 3
- 238000012217 deletion Methods 0.000 description 3
- 230000037430 deletion Effects 0.000 description 3
- 230000009547 development abnormality Effects 0.000 description 3
- 230000028993 immune response Effects 0.000 description 3
- 102000018358 immunoglobulin Human genes 0.000 description 3
- 239000007788 liquid Substances 0.000 description 3
- 230000035772 mutation Effects 0.000 description 3
- 108010029020 prolylglycine Proteins 0.000 description 3
- 230000001105 regulatory effect Effects 0.000 description 3
- 239000012096 transfection reagent Substances 0.000 description 3
- 229910052721 tungsten Inorganic materials 0.000 description 3
- 230000003612 virological effect Effects 0.000 description 3
- FSNVAJOPUDVQAR-UHFFFAOYSA-N 2-[[6-amino-2-[[2-amino-5-(diaminomethylideneamino)pentanoyl]amino]hexanoyl]amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound NC(N)=NCCCC(N)C(=O)NC(CCCCN)C(=O)NC(CCCN=C(N)N)C(O)=O FSNVAJOPUDVQAR-UHFFFAOYSA-N 0.000 description 2
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 2
- ZEXDYVGDZJBRMO-ACZMJKKPSA-N Ala-Asn-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZEXDYVGDZJBRMO-ACZMJKKPSA-N 0.000 description 2
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 2
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 2
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 2
- DECCMEWNXSNSDO-ZLUOBGJFSA-N Ala-Cys-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DECCMEWNXSNSDO-ZLUOBGJFSA-N 0.000 description 2
- XAGIMRPOEJSYER-CIUDSAMLSA-N Ala-Cys-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N XAGIMRPOEJSYER-CIUDSAMLSA-N 0.000 description 2
- RXTBLQVXNIECFP-FXQIFTODSA-N Ala-Gln-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RXTBLQVXNIECFP-FXQIFTODSA-N 0.000 description 2
- FVSOUJZKYWEFOB-KBIXCLLPSA-N Ala-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)N FVSOUJZKYWEFOB-KBIXCLLPSA-N 0.000 description 2
- NJPMYXWVWQWCSR-ACZMJKKPSA-N Ala-Glu-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NJPMYXWVWQWCSR-ACZMJKKPSA-N 0.000 description 2
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 2
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 2
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 2
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 2
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 2
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 2
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 2
- KQESEZXHYOUIIM-CQDKDKBSSA-N Ala-Lys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KQESEZXHYOUIIM-CQDKDKBSSA-N 0.000 description 2
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 2
- JAQNUEWEJWBVAY-WBAXXEDZSA-N Ala-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 JAQNUEWEJWBVAY-WBAXXEDZSA-N 0.000 description 2
- FEGOCLZUJUFCHP-CIUDSAMLSA-N Ala-Pro-Gln Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FEGOCLZUJUFCHP-CIUDSAMLSA-N 0.000 description 2
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 2
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 2
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 2
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 2
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 2
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 2
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 2
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 2
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 2
- CREYEAPXISDKSB-FQPOAREZSA-N Ala-Thr-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CREYEAPXISDKSB-FQPOAREZSA-N 0.000 description 2
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 2
- MTDDMSUUXNQMKK-BPNCWPANSA-N Ala-Tyr-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N MTDDMSUUXNQMKK-BPNCWPANSA-N 0.000 description 2
- BHFOJPDOQPWJRN-XDTLVQLUSA-N Ala-Tyr-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCC(N)=O)C(O)=O BHFOJPDOQPWJRN-XDTLVQLUSA-N 0.000 description 2
- GCTANJIJJROSLH-GVARAGBVSA-N Ala-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C)N GCTANJIJJROSLH-GVARAGBVSA-N 0.000 description 2
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 2
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 2
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 2
- 206010061424 Anal cancer Diseases 0.000 description 2
- DFCIPNHFKOQAME-FXQIFTODSA-N Arg-Ala-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFCIPNHFKOQAME-FXQIFTODSA-N 0.000 description 2
- GXCSUJQOECMKPV-CIUDSAMLSA-N Arg-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GXCSUJQOECMKPV-CIUDSAMLSA-N 0.000 description 2
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 2
- PTVGLOCPAVYPFG-CIUDSAMLSA-N Arg-Gln-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PTVGLOCPAVYPFG-CIUDSAMLSA-N 0.000 description 2
- MZRBYBIQTIKERR-GUBZILKMSA-N Arg-Glu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MZRBYBIQTIKERR-GUBZILKMSA-N 0.000 description 2
- OGUPCHKBOKJFMA-SRVKXCTJSA-N Arg-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N OGUPCHKBOKJFMA-SRVKXCTJSA-N 0.000 description 2
- HPSVTWMFWCHKFN-GARJFASQSA-N Arg-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O HPSVTWMFWCHKFN-GARJFASQSA-N 0.000 description 2
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 2
- LLUGJARLJCGLAR-CYDGBPFRSA-N Arg-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LLUGJARLJCGLAR-CYDGBPFRSA-N 0.000 description 2
- BNYNOWJESJJIOI-XUXIUFHCSA-N Arg-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N BNYNOWJESJJIOI-XUXIUFHCSA-N 0.000 description 2
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 2
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 2
- NVPHRWNWTKYIST-BPNCWPANSA-N Arg-Tyr-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 NVPHRWNWTKYIST-BPNCWPANSA-N 0.000 description 2
- 239000004475 Arginine Substances 0.000 description 2
- SWLOHUMCUDRTCL-ZLUOBGJFSA-N Asn-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N SWLOHUMCUDRTCL-ZLUOBGJFSA-N 0.000 description 2
- POOCJCRBHHMAOS-FXQIFTODSA-N Asn-Arg-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O POOCJCRBHHMAOS-FXQIFTODSA-N 0.000 description 2
- JEPNYDRDYNSFIU-QXEWZRGKSA-N Asn-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(N)=O)C(O)=O JEPNYDRDYNSFIU-QXEWZRGKSA-N 0.000 description 2
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 2
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 2
- QHBMKQWOIYJYMI-BYULHYEWSA-N Asn-Asn-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QHBMKQWOIYJYMI-BYULHYEWSA-N 0.000 description 2
- VYLVOMUVLMGCRF-ZLUOBGJFSA-N Asn-Asp-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VYLVOMUVLMGCRF-ZLUOBGJFSA-N 0.000 description 2
- KWQPAXYXVMHJJR-AVGNSLFASA-N Asn-Gln-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KWQPAXYXVMHJJR-AVGNSLFASA-N 0.000 description 2
- ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 description 2
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 2
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 2
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 2
- IKLAUGBIDCDFOY-SRVKXCTJSA-N Asn-His-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O IKLAUGBIDCDFOY-SRVKXCTJSA-N 0.000 description 2
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 description 2
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 2
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 2
- LZLCLRQMUQWUHJ-GUBZILKMSA-N Asn-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N LZLCLRQMUQWUHJ-GUBZILKMSA-N 0.000 description 2
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 2
- COWITDLVHMZSIW-CIUDSAMLSA-N Asn-Lys-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O COWITDLVHMZSIW-CIUDSAMLSA-N 0.000 description 2
- KNENKKKUYGEZIO-FXQIFTODSA-N Asn-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N KNENKKKUYGEZIO-FXQIFTODSA-N 0.000 description 2
- CDGHMJJJHYKMPA-DLOVCJGASA-N Asn-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC(=O)N)N CDGHMJJJHYKMPA-DLOVCJGASA-N 0.000 description 2
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 2
- GMUOCGCDOYYWPD-FXQIFTODSA-N Asn-Pro-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O GMUOCGCDOYYWPD-FXQIFTODSA-N 0.000 description 2
- VHQSGALUSWIYOD-QXEWZRGKSA-N Asn-Pro-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O VHQSGALUSWIYOD-QXEWZRGKSA-N 0.000 description 2
- REQUGIWGOGSOEZ-ZLUOBGJFSA-N Asn-Ser-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N REQUGIWGOGSOEZ-ZLUOBGJFSA-N 0.000 description 2
- VWADICJNCPFKJS-ZLUOBGJFSA-N Asn-Ser-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O VWADICJNCPFKJS-ZLUOBGJFSA-N 0.000 description 2
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 2
- NCXTYSVDWLAQGZ-ZKWXMUAHSA-N Asn-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O NCXTYSVDWLAQGZ-ZKWXMUAHSA-N 0.000 description 2
- ZUFPUBYQYWCMDB-NUMRIWBASA-N Asn-Thr-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZUFPUBYQYWCMDB-NUMRIWBASA-N 0.000 description 2
- HCZQKHSRYHCPSD-IUKAMOBKSA-N Asn-Thr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HCZQKHSRYHCPSD-IUKAMOBKSA-N 0.000 description 2
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 2
- QTKYFZCMSQLYHI-UBHSHLNASA-N Asn-Trp-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O QTKYFZCMSQLYHI-UBHSHLNASA-N 0.000 description 2
- JZLFYAAGGYMRIK-BYULHYEWSA-N Asn-Val-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O JZLFYAAGGYMRIK-BYULHYEWSA-N 0.000 description 2
- VTYQAQFKMQTKQD-ACZMJKKPSA-N Asp-Ala-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O VTYQAQFKMQTKQD-ACZMJKKPSA-N 0.000 description 2
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 2
- HOQGTAIGQSDCHR-SRVKXCTJSA-N Asp-Asn-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HOQGTAIGQSDCHR-SRVKXCTJSA-N 0.000 description 2
- XACXDSRQIXRMNS-OLHMAJIHSA-N Asp-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)O XACXDSRQIXRMNS-OLHMAJIHSA-N 0.000 description 2
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 2
- LKIYSIYBKYLKPU-BIIVOSGPSA-N Asp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O LKIYSIYBKYLKPU-BIIVOSGPSA-N 0.000 description 2
- RYKWOUUZJFSJOH-FXQIFTODSA-N Asp-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N RYKWOUUZJFSJOH-FXQIFTODSA-N 0.000 description 2
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 2
- HXVILZUZXFLVEN-DCAQKATOSA-N Asp-Met-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O HXVILZUZXFLVEN-DCAQKATOSA-N 0.000 description 2
- IDDMGSKZQDEDGA-SRVKXCTJSA-N Asp-Phe-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 IDDMGSKZQDEDGA-SRVKXCTJSA-N 0.000 description 2
- RPUYTJJZXQBWDT-SRVKXCTJSA-N Asp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N RPUYTJJZXQBWDT-SRVKXCTJSA-N 0.000 description 2
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 2
- OFYVKOXTTDCUIL-FXQIFTODSA-N Asp-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N OFYVKOXTTDCUIL-FXQIFTODSA-N 0.000 description 2
- IWLZBRTUIVXZJD-OLHMAJIHSA-N Asp-Thr-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O IWLZBRTUIVXZJD-OLHMAJIHSA-N 0.000 description 2
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 2
- NAAAPCLFJPURAM-HJGDQZAQSA-N Asp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O NAAAPCLFJPURAM-HJGDQZAQSA-N 0.000 description 2
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 2
- ALMIMUZAWTUNIO-BZSNNMDCSA-N Asp-Tyr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ALMIMUZAWTUNIO-BZSNNMDCSA-N 0.000 description 2
- GFYOIYJJMSHLSN-QXEWZRGKSA-N Asp-Val-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GFYOIYJJMSHLSN-QXEWZRGKSA-N 0.000 description 2
- 206010008263 Cervical dysplasia Diseases 0.000 description 2
- 108020004638 Circular DNA Proteins 0.000 description 2
- TVYMKYUSZSVOAG-ZLUOBGJFSA-N Cys-Ala-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O TVYMKYUSZSVOAG-ZLUOBGJFSA-N 0.000 description 2
- QLCPDGRAEJSYQM-LPEHRKFASA-N Cys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N)C(=O)O QLCPDGRAEJSYQM-LPEHRKFASA-N 0.000 description 2
- ZVNFONSZVUBRAV-CIUDSAMLSA-N Cys-Gln-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N)CN=C(N)N ZVNFONSZVUBRAV-CIUDSAMLSA-N 0.000 description 2
- HHABWQIFXZPZCK-ACZMJKKPSA-N Cys-Gln-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N HHABWQIFXZPZCK-ACZMJKKPSA-N 0.000 description 2
- VIRYODQIWJNWNU-NRPADANISA-N Cys-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N VIRYODQIWJNWNU-NRPADANISA-N 0.000 description 2
- OXOQBEVULIBOSH-ZDLURKLDSA-N Cys-Gly-Thr Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O OXOQBEVULIBOSH-ZDLURKLDSA-N 0.000 description 2
- OXFOKRAFNYSREH-BJDJZHNGSA-N Cys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N OXFOKRAFNYSREH-BJDJZHNGSA-N 0.000 description 2
- ABLJDBFJPUWQQB-DCAQKATOSA-N Cys-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N ABLJDBFJPUWQQB-DCAQKATOSA-N 0.000 description 2
- VPQZSNQICFCCSO-BJDJZHNGSA-N Cys-Leu-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VPQZSNQICFCCSO-BJDJZHNGSA-N 0.000 description 2
- XMVZMBGFIOQONW-GARJFASQSA-N Cys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N)C(=O)O XMVZMBGFIOQONW-GARJFASQSA-N 0.000 description 2
- RWVBNRYBHAGYSG-GUBZILKMSA-N Cys-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CS)N RWVBNRYBHAGYSG-GUBZILKMSA-N 0.000 description 2
- CMYVIUWVYHOLRD-ZLUOBGJFSA-N Cys-Ser-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CMYVIUWVYHOLRD-ZLUOBGJFSA-N 0.000 description 2
- SRZZZTMJARUVPI-JBDRJPRFSA-N Cys-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N SRZZZTMJARUVPI-JBDRJPRFSA-N 0.000 description 2
- WTXCNOPZMQRTNN-BWBBJGPYSA-N Cys-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)O WTXCNOPZMQRTNN-BWBBJGPYSA-N 0.000 description 2
- DXSBGVKEPHDOTD-UBHSHLNASA-N Cys-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N DXSBGVKEPHDOTD-UBHSHLNASA-N 0.000 description 2
- DGQJGBDBFVGLGL-ZKWXMUAHSA-N Cys-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N DGQJGBDBFVGLGL-ZKWXMUAHSA-N 0.000 description 2
- 108090000790 Enzymes Proteins 0.000 description 2
- 102000004190 Enzymes Human genes 0.000 description 2
- JSYULGSPLTZDHM-NRPADANISA-N Gln-Ala-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O JSYULGSPLTZDHM-NRPADANISA-N 0.000 description 2
- WLODHVXYKYHLJD-ACZMJKKPSA-N Gln-Asp-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N WLODHVXYKYHLJD-ACZMJKKPSA-N 0.000 description 2
- COYGBRTZEVWZBW-XKBZYTNZSA-N Gln-Cys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(N)=O COYGBRTZEVWZBW-XKBZYTNZSA-N 0.000 description 2
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 2
- HSHCEAUPUPJPTE-JYJNAYRXSA-N Gln-Leu-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HSHCEAUPUPJPTE-JYJNAYRXSA-N 0.000 description 2
- IHSGESFHTMFHRB-GUBZILKMSA-N Gln-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O IHSGESFHTMFHRB-GUBZILKMSA-N 0.000 description 2
- JNENSVNAUWONEZ-GUBZILKMSA-N Gln-Lys-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JNENSVNAUWONEZ-GUBZILKMSA-N 0.000 description 2
- KSKFIECUYMYWNS-AVGNSLFASA-N Gln-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N KSKFIECUYMYWNS-AVGNSLFASA-N 0.000 description 2
- LHMWTCWZARHLPV-CIUDSAMLSA-N Gln-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LHMWTCWZARHLPV-CIUDSAMLSA-N 0.000 description 2
- QBEWLBKBGXVVPD-RYUDHWBXSA-N Gln-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N QBEWLBKBGXVVPD-RYUDHWBXSA-N 0.000 description 2
- WHVLABLIJYGVEK-QEWYBTABSA-N Gln-Phe-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WHVLABLIJYGVEK-QEWYBTABSA-N 0.000 description 2
- FQCILXROGNOZON-YUMQZZPRSA-N Gln-Pro-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O FQCILXROGNOZON-YUMQZZPRSA-N 0.000 description 2
- WLRYGVYQFXRJDA-DCAQKATOSA-N Gln-Pro-Pro Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 WLRYGVYQFXRJDA-DCAQKATOSA-N 0.000 description 2
- LGWNISYVKDNJRP-FXQIFTODSA-N Gln-Ser-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGWNISYVKDNJRP-FXQIFTODSA-N 0.000 description 2
- SXFPZRRVWSUYII-KBIXCLLPSA-N Gln-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N SXFPZRRVWSUYII-KBIXCLLPSA-N 0.000 description 2
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 2
- SYZZMPFLOLSMHL-XHNCKOQMSA-N Gln-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SYZZMPFLOLSMHL-XHNCKOQMSA-N 0.000 description 2
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 2
- BYKZWDGMJLNFJY-XKBZYTNZSA-N Gln-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)O BYKZWDGMJLNFJY-XKBZYTNZSA-N 0.000 description 2
- RNPGPFAVRLERPP-QEJZJMRPSA-N Gln-Trp-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O RNPGPFAVRLERPP-QEJZJMRPSA-N 0.000 description 2
- AKDOUBMVLRCHBD-SIUGBPQLSA-N Gln-Tyr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AKDOUBMVLRCHBD-SIUGBPQLSA-N 0.000 description 2
- JTWZNMUVQWWGOX-SOUVJXGZSA-N Gln-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JTWZNMUVQWWGOX-SOUVJXGZSA-N 0.000 description 2
- QGWXAMDECCKGRU-XVKPBYJWSA-N Gln-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(N)=O)C(=O)NCC(O)=O QGWXAMDECCKGRU-XVKPBYJWSA-N 0.000 description 2
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 description 2
- SBYVDRJAXWSXQL-AVGNSLFASA-N Glu-Asn-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SBYVDRJAXWSXQL-AVGNSLFASA-N 0.000 description 2
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 2
- BUVMZWZNWMKASN-QEJZJMRPSA-N Glu-Asn-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCC(O)=O)N)C(O)=O)=CNC2=C1 BUVMZWZNWMKASN-QEJZJMRPSA-N 0.000 description 2
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 2
- HJIFPJUEOGZWRI-GUBZILKMSA-N Glu-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N HJIFPJUEOGZWRI-GUBZILKMSA-N 0.000 description 2
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 2
- XKPOCESCRTVRPL-KBIXCLLPSA-N Glu-Cys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XKPOCESCRTVRPL-KBIXCLLPSA-N 0.000 description 2
- RQNYYRHRKSVKAB-GUBZILKMSA-N Glu-Cys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O RQNYYRHRKSVKAB-GUBZILKMSA-N 0.000 description 2
- WLIPTFCZLHCNFD-LPEHRKFASA-N Glu-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O WLIPTFCZLHCNFD-LPEHRKFASA-N 0.000 description 2
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 2
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 2
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 2
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 2
- IOUQWHIEQYQVFD-JYJNAYRXSA-N Glu-Leu-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IOUQWHIEQYQVFD-JYJNAYRXSA-N 0.000 description 2
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 2
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 2
- LHIPZASLKPYDPI-AVGNSLFASA-N Glu-Phe-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LHIPZASLKPYDPI-AVGNSLFASA-N 0.000 description 2
- ARIORLIIMJACKZ-KKUMJFAQSA-N Glu-Pro-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ARIORLIIMJACKZ-KKUMJFAQSA-N 0.000 description 2
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 2
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 2
- JWNZHMSRZXXGTM-XKBZYTNZSA-N Glu-Ser-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWNZHMSRZXXGTM-XKBZYTNZSA-N 0.000 description 2
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 2
- MWTGQXBHVRTCOR-GLLZPBPUSA-N Glu-Thr-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MWTGQXBHVRTCOR-GLLZPBPUSA-N 0.000 description 2
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 2
- RXJFSLQVMGYQEL-IHRRRGAJSA-N Glu-Tyr-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 RXJFSLQVMGYQEL-IHRRRGAJSA-N 0.000 description 2
- HJTSRYLPAYGEEC-SIUGBPQLSA-N Glu-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N HJTSRYLPAYGEEC-SIUGBPQLSA-N 0.000 description 2
- UUTGYDAKPISJAO-JYJNAYRXSA-N Glu-Tyr-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 UUTGYDAKPISJAO-JYJNAYRXSA-N 0.000 description 2
- BKMOHWJHXQLFEX-IRIUXVKKSA-N Glu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N)O BKMOHWJHXQLFEX-IRIUXVKKSA-N 0.000 description 2
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 2
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 2
- CLODWIOAKCSBAN-BQBZGAKWSA-N Gly-Arg-Asp Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O CLODWIOAKCSBAN-BQBZGAKWSA-N 0.000 description 2
- WJZLEENECIOOSA-WDSKDSINSA-N Gly-Asn-Gln Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)O WJZLEENECIOOSA-WDSKDSINSA-N 0.000 description 2
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 2
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 2
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 2
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 2
- GVVKYKCOFMMTKZ-WHFBIAKZSA-N Gly-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)CN GVVKYKCOFMMTKZ-WHFBIAKZSA-N 0.000 description 2
- IANBSEOVTQNGBZ-BQBZGAKWSA-N Gly-Cys-Met Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CCSC)C(O)=O IANBSEOVTQNGBZ-BQBZGAKWSA-N 0.000 description 2
- JMQFHZWESBGPFC-WDSKDSINSA-N Gly-Gln-Asp Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JMQFHZWESBGPFC-WDSKDSINSA-N 0.000 description 2
- BIRKKBCSAIHDDF-WDSKDSINSA-N Gly-Glu-Cys Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BIRKKBCSAIHDDF-WDSKDSINSA-N 0.000 description 2
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 2
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 2
- QPCVIQJVRGXUSA-LURJTMIESA-N Gly-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QPCVIQJVRGXUSA-LURJTMIESA-N 0.000 description 2
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 2
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 2
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 2
- OJNZVYSGVYLQIN-BQBZGAKWSA-N Gly-Met-Asp Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O OJNZVYSGVYLQIN-BQBZGAKWSA-N 0.000 description 2
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 2
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 2
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 2
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 2
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 2
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 2
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 2
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 2
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 2
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 description 2
- PNUFMLXHOLFRLD-KBPBESRZSA-N Gly-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 PNUFMLXHOLFRLD-KBPBESRZSA-N 0.000 description 2
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 2
- TVQGUFGDVODUIF-LSJOCFKGSA-N His-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N TVQGUFGDVODUIF-LSJOCFKGSA-N 0.000 description 2
- AHEBIAHEZWQVHB-QTKMDUPCSA-N His-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O AHEBIAHEZWQVHB-QTKMDUPCSA-N 0.000 description 2
- YERBCFWVWITTEJ-NAZCDGGXSA-N His-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CN=CN3)N)O YERBCFWVWITTEJ-NAZCDGGXSA-N 0.000 description 2
- LQSBBHNVAVNZSX-GHCJXIJMSA-N Ile-Ala-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LQSBBHNVAVNZSX-GHCJXIJMSA-N 0.000 description 2
- CISBRYJZMFWOHJ-JBDRJPRFSA-N Ile-Ala-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O)N CISBRYJZMFWOHJ-JBDRJPRFSA-N 0.000 description 2
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 2
- FADXGVVLSPPEQY-GHCJXIJMSA-N Ile-Cys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FADXGVVLSPPEQY-GHCJXIJMSA-N 0.000 description 2
- FHCNLXMTQJNJNH-KBIXCLLPSA-N Ile-Cys-Gln Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)O FHCNLXMTQJNJNH-KBIXCLLPSA-N 0.000 description 2
- PFTFEWHJSAXGED-ZKWXMUAHSA-N Ile-Cys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N PFTFEWHJSAXGED-ZKWXMUAHSA-N 0.000 description 2
- GECLQMBTZCPAFY-PEFMBERDSA-N Ile-Gln-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GECLQMBTZCPAFY-PEFMBERDSA-N 0.000 description 2
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 2
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 2
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 2
- CMNMPCTVCWWYHY-MXAVVETBSA-N Ile-His-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(C)C)C(=O)O)N CMNMPCTVCWWYHY-MXAVVETBSA-N 0.000 description 2
- URWXDJAEEGBADB-TUBUOCAGSA-N Ile-His-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N URWXDJAEEGBADB-TUBUOCAGSA-N 0.000 description 2
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 2
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 2
- QQVXERGIFIRCGW-NAKRPEOUSA-N Ile-Ser-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N QQVXERGIFIRCGW-NAKRPEOUSA-N 0.000 description 2
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 2
- RKQAYOWLSFLJEE-SVSWQMSJSA-N Ile-Thr-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)O)N RKQAYOWLSFLJEE-SVSWQMSJSA-N 0.000 description 2
- FXJLRZFMKGHYJP-CFMVVWHZSA-N Ile-Tyr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FXJLRZFMKGHYJP-CFMVVWHZSA-N 0.000 description 2
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 2
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 2
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 2
- ODKSFYDXXFIFQN-BYPYZUCNSA-N L-arginine Chemical compound OC(=O)[C@@H](N)CCCN=C(N)N ODKSFYDXXFIFQN-BYPYZUCNSA-N 0.000 description 2
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 2
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 2
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 2
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 2
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 2
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 2
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 2
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 2
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 2
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 2
- PPTAQBNUFKTJKA-BJDJZHNGSA-N Leu-Cys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PPTAQBNUFKTJKA-BJDJZHNGSA-N 0.000 description 2
- FOEHRHOBWFQSNW-KATARQTJSA-N Leu-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(C)C)N)O FOEHRHOBWFQSNW-KATARQTJSA-N 0.000 description 2
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 2
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 2
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 2
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 2
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 2
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 2
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 2
- VBZOAGIPCULURB-QWRGUYRKSA-N Leu-Gly-His Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VBZOAGIPCULURB-QWRGUYRKSA-N 0.000 description 2
- OHZIZVWQXJPBJS-IXOXFDKPSA-N Leu-His-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OHZIZVWQXJPBJS-IXOXFDKPSA-N 0.000 description 2
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 2
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 2
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 2
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 2
- MJWVXZABPOKJJF-ACRUOGEOSA-N Leu-Phe-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MJWVXZABPOKJJF-ACRUOGEOSA-N 0.000 description 2
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 2
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 2
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 2
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 2
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 2
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 2
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 2
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 2
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 2
- SWWCDAGDQHTKIE-RHYQMDGZSA-N Lys-Arg-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWWCDAGDQHTKIE-RHYQMDGZSA-N 0.000 description 2
- QUCDKEKDPYISNX-HJGDQZAQSA-N Lys-Asn-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QUCDKEKDPYISNX-HJGDQZAQSA-N 0.000 description 2
- YEIYAQQKADPIBJ-GARJFASQSA-N Lys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O YEIYAQQKADPIBJ-GARJFASQSA-N 0.000 description 2
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 2
- ZAENPHCEQXALHO-GUBZILKMSA-N Lys-Cys-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZAENPHCEQXALHO-GUBZILKMSA-N 0.000 description 2
- IRRZDAIFYHNIIN-JYJNAYRXSA-N Lys-Gln-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IRRZDAIFYHNIIN-JYJNAYRXSA-N 0.000 description 2
- DRCILAJNUJKAHC-SRVKXCTJSA-N Lys-Glu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DRCILAJNUJKAHC-SRVKXCTJSA-N 0.000 description 2
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 2
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 2
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 2
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 2
- OIYWBDBHEGAVST-BZSNNMDCSA-N Lys-His-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OIYWBDBHEGAVST-BZSNNMDCSA-N 0.000 description 2
- NVGBPTNZLWRQSY-UWVGGRQHSA-N Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN NVGBPTNZLWRQSY-UWVGGRQHSA-N 0.000 description 2
- BXPHMHQHYHILBB-BZSNNMDCSA-N Lys-Lys-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BXPHMHQHYHILBB-BZSNNMDCSA-N 0.000 description 2
- GZGWILAQHOVXTD-DCAQKATOSA-N Lys-Met-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O GZGWILAQHOVXTD-DCAQKATOSA-N 0.000 description 2
- KFSALEZVQJYHCE-AVGNSLFASA-N Lys-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)N KFSALEZVQJYHCE-AVGNSLFASA-N 0.000 description 2
- AZOFEHCPMBRNFD-BZSNNMDCSA-N Lys-Phe-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 AZOFEHCPMBRNFD-BZSNNMDCSA-N 0.000 description 2
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 2
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 2
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 2
- CUHGAUZONORRIC-HJGDQZAQSA-N Lys-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O CUHGAUZONORRIC-HJGDQZAQSA-N 0.000 description 2
- 239000004472 Lysine Substances 0.000 description 2
- 101710135729 Major capsid protein L1 Proteins 0.000 description 2
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 2
- VTKPSXWRUGCOAC-GUBZILKMSA-N Met-Ala-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCSC VTKPSXWRUGCOAC-GUBZILKMSA-N 0.000 description 2
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 2
- BVXXDMUMHMXFER-BPNCWPANSA-N Met-Ala-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVXXDMUMHMXFER-BPNCWPANSA-N 0.000 description 2
- DLAFCQWUMFMZSN-GUBZILKMSA-N Met-Arg-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N DLAFCQWUMFMZSN-GUBZILKMSA-N 0.000 description 2
- CTVJSFRHUOSCQQ-DCAQKATOSA-N Met-Arg-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CTVJSFRHUOSCQQ-DCAQKATOSA-N 0.000 description 2
- VIZLHGTVGKBBKO-AVGNSLFASA-N Met-Arg-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VIZLHGTVGKBBKO-AVGNSLFASA-N 0.000 description 2
- UAPZLLPGGOOCRO-IHRRRGAJSA-N Met-Asn-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N UAPZLLPGGOOCRO-IHRRRGAJSA-N 0.000 description 2
- IHITVQKJXQQGLJ-LPEHRKFASA-N Met-Asn-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N IHITVQKJXQQGLJ-LPEHRKFASA-N 0.000 description 2
- DNDVVILEHVMWIS-LPEHRKFASA-N Met-Asp-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DNDVVILEHVMWIS-LPEHRKFASA-N 0.000 description 2
- PNDCUTDWYVKBHX-IHRRRGAJSA-N Met-Asp-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PNDCUTDWYVKBHX-IHRRRGAJSA-N 0.000 description 2
- OXHSZBRPUGNMKW-DCAQKATOSA-N Met-Gln-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OXHSZBRPUGNMKW-DCAQKATOSA-N 0.000 description 2
- CHQWUYSNAOABIP-ZPFDUUQYSA-N Met-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N CHQWUYSNAOABIP-ZPFDUUQYSA-N 0.000 description 2
- YCUSPBPZVJDMII-YUMQZZPRSA-N Met-Gly-Glu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O YCUSPBPZVJDMII-YUMQZZPRSA-N 0.000 description 2
- HWROAFGWPQUPTE-OSUNSFLBSA-N Met-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCSC)N HWROAFGWPQUPTE-OSUNSFLBSA-N 0.000 description 2
- WNJXJJSGUXAIQU-UFYCRDLUSA-N Met-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 WNJXJJSGUXAIQU-UFYCRDLUSA-N 0.000 description 2
- ZWBCVBHKXHPCEI-BVSLBCMMSA-N Met-Phe-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N ZWBCVBHKXHPCEI-BVSLBCMMSA-N 0.000 description 2
- CIDICGYKRUTYLE-FXQIFTODSA-N Met-Ser-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CIDICGYKRUTYLE-FXQIFTODSA-N 0.000 description 2
- SOAYQFDWEIWPPR-IHRRRGAJSA-N Met-Ser-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O SOAYQFDWEIWPPR-IHRRRGAJSA-N 0.000 description 2
- OVTOTTGZBWXLFU-QXEWZRGKSA-N Met-Val-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O OVTOTTGZBWXLFU-QXEWZRGKSA-N 0.000 description 2
- CQRGINSEMFBACV-WPRPVWTQSA-N Met-Val-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O CQRGINSEMFBACV-WPRPVWTQSA-N 0.000 description 2
- IIHMNTBFPMRJCN-RCWTZXSCSA-N Met-Val-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IIHMNTBFPMRJCN-RCWTZXSCSA-N 0.000 description 2
- PVSPJQWHEIQTEH-JYJNAYRXSA-N Met-Val-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PVSPJQWHEIQTEH-JYJNAYRXSA-N 0.000 description 2
- BAVYZALUXZFZLV-UHFFFAOYSA-N Methylamine Chemical compound NC BAVYZALUXZFZLV-UHFFFAOYSA-N 0.000 description 2
- 101710163801 Minor capsid protein L2 Proteins 0.000 description 2
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 2
- 108010047562 NGR peptide Proteins 0.000 description 2
- 208000009869 Neu-Laxova syndrome Diseases 0.000 description 2
- 108091028043 Nucleic acid sequence Proteins 0.000 description 2
- 108010038807 Oligopeptides Proteins 0.000 description 2
- 102000015636 Oligopeptides Human genes 0.000 description 2
- 229910019142 PO4 Inorganic materials 0.000 description 2
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 2
- BBDSZDHUCPSYAC-QEJZJMRPSA-N Phe-Ala-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BBDSZDHUCPSYAC-QEJZJMRPSA-N 0.000 description 2
- SEPNOAFMZLLCEW-UBHSHLNASA-N Phe-Ala-Val Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O SEPNOAFMZLLCEW-UBHSHLNASA-N 0.000 description 2
- ZENDEDYRYVHBEG-SRVKXCTJSA-N Phe-Asp-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZENDEDYRYVHBEG-SRVKXCTJSA-N 0.000 description 2
- LXUJDHOKVUYHRC-KKUMJFAQSA-N Phe-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N LXUJDHOKVUYHRC-KKUMJFAQSA-N 0.000 description 2
- VADLTGVIOIOKGM-BZSNNMDCSA-N Phe-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CN=CN1 VADLTGVIOIOKGM-BZSNNMDCSA-N 0.000 description 2
- BYAIIACBWBOJCU-URLPEUOOSA-N Phe-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BYAIIACBWBOJCU-URLPEUOOSA-N 0.000 description 2
- KBVJZCVLQWCJQN-KKUMJFAQSA-N Phe-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KBVJZCVLQWCJQN-KKUMJFAQSA-N 0.000 description 2
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 2
- SCKXGHWQPPURGT-KKUMJFAQSA-N Phe-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O SCKXGHWQPPURGT-KKUMJFAQSA-N 0.000 description 2
- GPSMLZQVIIYLDK-ULQDDVLXSA-N Phe-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O GPSMLZQVIIYLDK-ULQDDVLXSA-N 0.000 description 2
- PBWNICYZGJQKJV-BZSNNMDCSA-N Phe-Phe-Cys Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CS)C(O)=O PBWNICYZGJQKJV-BZSNNMDCSA-N 0.000 description 2
- AXIOGMQCDYVTNY-ACRUOGEOSA-N Phe-Phe-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 AXIOGMQCDYVTNY-ACRUOGEOSA-N 0.000 description 2
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 2
- JXQVYPWVGUOIDV-MXAVVETBSA-N Phe-Ser-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JXQVYPWVGUOIDV-MXAVVETBSA-N 0.000 description 2
- YTGGLKWSVIRECD-JBACZVJFSA-N Phe-Trp-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 YTGGLKWSVIRECD-JBACZVJFSA-N 0.000 description 2
- BAONJAHBAUDJKA-BZSNNMDCSA-N Phe-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 BAONJAHBAUDJKA-BZSNNMDCSA-N 0.000 description 2
- VDTYRPWRWRCROL-UFYCRDLUSA-N Phe-Val-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VDTYRPWRWRCROL-UFYCRDLUSA-N 0.000 description 2
- FZHBZMDRDASUHN-NAKRPEOUSA-N Pro-Ala-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1)C(O)=O FZHBZMDRDASUHN-NAKRPEOUSA-N 0.000 description 2
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 2
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 2
- AMBLXEMWFARNNQ-DCAQKATOSA-N Pro-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 AMBLXEMWFARNNQ-DCAQKATOSA-N 0.000 description 2
- SWXSLPHTJVAWDF-VEVYYDQMSA-N Pro-Asn-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWXSLPHTJVAWDF-VEVYYDQMSA-N 0.000 description 2
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 2
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 2
- LUGOKRWYNMDGTD-FXQIFTODSA-N Pro-Cys-Asn Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O LUGOKRWYNMDGTD-FXQIFTODSA-N 0.000 description 2
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 2
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 2
- DWGFLKQSGRUQTI-IHRRRGAJSA-N Pro-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 DWGFLKQSGRUQTI-IHRRRGAJSA-N 0.000 description 2
- HOTVCUAVDQHUDB-UFYCRDLUSA-N Pro-Phe-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 HOTVCUAVDQHUDB-UFYCRDLUSA-N 0.000 description 2
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 2
- FYKUEXMZYFIZKA-DCAQKATOSA-N Pro-Pro-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FYKUEXMZYFIZKA-DCAQKATOSA-N 0.000 description 2
- DWPXHLIBFQLKLK-CYDGBPFRSA-N Pro-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 DWPXHLIBFQLKLK-CYDGBPFRSA-N 0.000 description 2
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 2
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 2
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 2
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 2
- JDJMFMVVJHLWDP-UNQGMJICSA-N Pro-Thr-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JDJMFMVVJHLWDP-UNQGMJICSA-N 0.000 description 2
- XDKKMRPRRCOELJ-GUBZILKMSA-N Pro-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 XDKKMRPRRCOELJ-GUBZILKMSA-N 0.000 description 2
- 231100000645 Reed–Muench method Toxicity 0.000 description 2
- 101150071661 SLC25A20 gene Proteins 0.000 description 2
- 101100309606 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) SCD6 gene Proteins 0.000 description 2
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 2
- FIXILCYTSAUERA-FXQIFTODSA-N Ser-Ala-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIXILCYTSAUERA-FXQIFTODSA-N 0.000 description 2
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 2
- DWUIECHTAMYEFL-XVYDVKMFSA-N Ser-Ala-His Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DWUIECHTAMYEFL-XVYDVKMFSA-N 0.000 description 2
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 2
- PZZJMBYSYAKYPK-UWJYBYFXSA-N Ser-Ala-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PZZJMBYSYAKYPK-UWJYBYFXSA-N 0.000 description 2
- VQBLHWSPVYYZTB-DCAQKATOSA-N Ser-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N VQBLHWSPVYYZTB-DCAQKATOSA-N 0.000 description 2
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 2
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 2
- BCKYYTVFBXHPOG-ACZMJKKPSA-N Ser-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N BCKYYTVFBXHPOG-ACZMJKKPSA-N 0.000 description 2
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 2
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 description 2
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 2
- ICHZYBVODUVUKN-SRVKXCTJSA-N Ser-Asn-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ICHZYBVODUVUKN-SRVKXCTJSA-N 0.000 description 2
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 2
- HEQPKICPPDOSIN-SRVKXCTJSA-N Ser-Asp-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HEQPKICPPDOSIN-SRVKXCTJSA-N 0.000 description 2
- TUYBIWUZWJUZDD-ACZMJKKPSA-N Ser-Cys-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCC(N)=O TUYBIWUZWJUZDD-ACZMJKKPSA-N 0.000 description 2
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 2
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 2
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 2
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 2
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 2
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 2
- OQPNSDWGAMFJNU-QWRGUYRKSA-N Ser-Gly-Tyr Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OQPNSDWGAMFJNU-QWRGUYRKSA-N 0.000 description 2
- FYUIFUJFNCLUIX-XVYDVKMFSA-N Ser-His-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O FYUIFUJFNCLUIX-XVYDVKMFSA-N 0.000 description 2
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 2
- MQQBBLVOUUJKLH-HJPIBITLSA-N Ser-Ile-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQQBBLVOUUJKLH-HJPIBITLSA-N 0.000 description 2
- IAORETPTUDBBGV-CIUDSAMLSA-N Ser-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N IAORETPTUDBBGV-CIUDSAMLSA-N 0.000 description 2
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 2
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 2
- JUTGONBTALQWMK-NAKRPEOUSA-N Ser-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CO)N JUTGONBTALQWMK-NAKRPEOUSA-N 0.000 description 2
- JJUNLJTUIKFPRF-BPUTZDHNSA-N Ser-Met-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CO)N JJUNLJTUIKFPRF-BPUTZDHNSA-N 0.000 description 2
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 2
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 2
- QUGRFWPMPVIAPW-IHRRRGAJSA-N Ser-Pro-Phe Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QUGRFWPMPVIAPW-IHRRRGAJSA-N 0.000 description 2
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 2
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 2
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 2
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 2
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 2
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 2
- HAYADTTXNZFUDM-IHRRRGAJSA-N Ser-Tyr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HAYADTTXNZFUDM-IHRRRGAJSA-N 0.000 description 2
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 2
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 2
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 2
- YRNBANYVJJBGDI-VZFHVOOUSA-N Thr-Ala-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O)N)O YRNBANYVJJBGDI-VZFHVOOUSA-N 0.000 description 2
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 2
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 2
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 2
- MFEBUIFJVPNZLO-OLHMAJIHSA-N Thr-Asp-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MFEBUIFJVPNZLO-OLHMAJIHSA-N 0.000 description 2
- GKMYGVQDGVYCPC-IUKAMOBKSA-N Thr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H]([C@@H](C)O)N GKMYGVQDGVYCPC-IUKAMOBKSA-N 0.000 description 2
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 2
- ZUUDNCOCILSYAM-KKHAAJSZSA-N Thr-Asp-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZUUDNCOCILSYAM-KKHAAJSZSA-N 0.000 description 2
- RJBFAHKSFNNHAI-XKBZYTNZSA-N Thr-Gln-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)O RJBFAHKSFNNHAI-XKBZYTNZSA-N 0.000 description 2
- HJOSVGCWOTYJFG-WDCWCFNPSA-N Thr-Glu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O HJOSVGCWOTYJFG-WDCWCFNPSA-N 0.000 description 2
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 2
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 2
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 2
- JQAWYCUUFIMTHE-WLTAIBSBSA-N Thr-Gly-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JQAWYCUUFIMTHE-WLTAIBSBSA-N 0.000 description 2
- DDDLIMCZFKOERC-SVSWQMSJSA-N Thr-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N DDDLIMCZFKOERC-SVSWQMSJSA-N 0.000 description 2
- YJCVECXVYHZOBK-KNZXXDILSA-N Thr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H]([C@@H](C)O)N YJCVECXVYHZOBK-KNZXXDILSA-N 0.000 description 2
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 2
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 2
- IHAPJUHCZXBPHR-WZLNRYEVSA-N Thr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N IHAPJUHCZXBPHR-WZLNRYEVSA-N 0.000 description 2
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 2
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 2
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 2
- PCMDGXKXVMBIFP-VEVYYDQMSA-N Thr-Met-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMDGXKXVMBIFP-VEVYYDQMSA-N 0.000 description 2
- QHUWWSQZTFLXPQ-FJXKBIBVSA-N Thr-Met-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QHUWWSQZTFLXPQ-FJXKBIBVSA-N 0.000 description 2
- XIHGJKFSIDTDKV-LYARXQMPSA-N Thr-Phe-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XIHGJKFSIDTDKV-LYARXQMPSA-N 0.000 description 2
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 2
- DNCUODYZAMHLCV-XGEHTFHBSA-N Thr-Pro-Cys Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N)O DNCUODYZAMHLCV-XGEHTFHBSA-N 0.000 description 2
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 2
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 2
- PRTHQBSMXILLPC-XGEHTFHBSA-N Thr-Ser-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PRTHQBSMXILLPC-XGEHTFHBSA-N 0.000 description 2
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 2
- QYDKSNXSBXZPFK-ZJDVBMNYSA-N Thr-Thr-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYDKSNXSBXZPFK-ZJDVBMNYSA-N 0.000 description 2
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 2
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 2
- GRIUMVXCJDKVPI-IZPVPAKOSA-N Thr-Thr-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GRIUMVXCJDKVPI-IZPVPAKOSA-N 0.000 description 2
- LVRFMARKDGGZMX-IZPVPAKOSA-N Thr-Tyr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=C(O)C=C1 LVRFMARKDGGZMX-IZPVPAKOSA-N 0.000 description 2
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 2
- ILUOMMDDGREELW-OSUNSFLBSA-N Thr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O ILUOMMDDGREELW-OSUNSFLBSA-N 0.000 description 2
- HYVLNORXQGKONN-NUTKFTJISA-N Trp-Ala-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 HYVLNORXQGKONN-NUTKFTJISA-N 0.000 description 2
- PNHABSVRPFBUJY-UMPQAUOISA-N Trp-Arg-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O PNHABSVRPFBUJY-UMPQAUOISA-N 0.000 description 2
- IBBBOLAPFHRDHW-BPUTZDHNSA-N Trp-Asn-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N IBBBOLAPFHRDHW-BPUTZDHNSA-N 0.000 description 2
- UTQBQJNSNXJNIH-IHPCNDPISA-N Trp-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N UTQBQJNSNXJNIH-IHPCNDPISA-N 0.000 description 2
- RERIQEJUYCLJQI-QRTARXTBSA-N Trp-Asp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RERIQEJUYCLJQI-QRTARXTBSA-N 0.000 description 2
- AWEGFIJXYWGBCA-XIRDDKMYSA-N Trp-His-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AWEGFIJXYWGBCA-XIRDDKMYSA-N 0.000 description 2
- IQXWAJUIAQLZNX-IHPCNDPISA-N Trp-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N IQXWAJUIAQLZNX-IHPCNDPISA-N 0.000 description 2
- GWBWCGITOYODER-YTQUADARSA-N Trp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GWBWCGITOYODER-YTQUADARSA-N 0.000 description 2
- VCXWRWYFJLXITF-AUTRQRHGSA-N Tyr-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VCXWRWYFJLXITF-AUTRQRHGSA-N 0.000 description 2
- DYEGCOJHFNJBKB-UFYCRDLUSA-N Tyr-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 DYEGCOJHFNJBKB-UFYCRDLUSA-N 0.000 description 2
- AYPAIRCDLARHLM-KKUMJFAQSA-N Tyr-Asn-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O AYPAIRCDLARHLM-KKUMJFAQSA-N 0.000 description 2
- BEIGSKUPTIFYRZ-SRVKXCTJSA-N Tyr-Asp-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O BEIGSKUPTIFYRZ-SRVKXCTJSA-N 0.000 description 2
- NRFTYDWKWGJLAR-MELADBBJSA-N Tyr-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O NRFTYDWKWGJLAR-MELADBBJSA-N 0.000 description 2
- CNLKDWSAORJEMW-KWQFWETISA-N Tyr-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O CNLKDWSAORJEMW-KWQFWETISA-N 0.000 description 2
- CDHQEOXPWBDFPL-QWRGUYRKSA-N Tyr-Gly-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDHQEOXPWBDFPL-QWRGUYRKSA-N 0.000 description 2
- HIINQLBHPIQYHN-JTQLQIEISA-N Tyr-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HIINQLBHPIQYHN-JTQLQIEISA-N 0.000 description 2
- AXWBYOVVDRBOGU-SIUGBPQLSA-N Tyr-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N AXWBYOVVDRBOGU-SIUGBPQLSA-N 0.000 description 2
- YMUQBRQQCPQEQN-CXTHYWKRSA-N Tyr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YMUQBRQQCPQEQN-CXTHYWKRSA-N 0.000 description 2
- YKCXQOBTISTQJD-BZSNNMDCSA-N Tyr-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YKCXQOBTISTQJD-BZSNNMDCSA-N 0.000 description 2
- JLKVWTICWVWGSK-JYJNAYRXSA-N Tyr-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JLKVWTICWVWGSK-JYJNAYRXSA-N 0.000 description 2
- CWVHKVVKAQIJKY-ACRUOGEOSA-N Tyr-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=C(C=C2)O)N CWVHKVVKAQIJKY-ACRUOGEOSA-N 0.000 description 2
- BBSPTGPYIPGTKH-JYJNAYRXSA-N Tyr-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N BBSPTGPYIPGTKH-JYJNAYRXSA-N 0.000 description 2
- PSALWJCUIAQKFW-ACRUOGEOSA-N Tyr-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N PSALWJCUIAQKFW-ACRUOGEOSA-N 0.000 description 2
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 2
- LDKDSFQSEUOCOO-RPTUDFQQSA-N Tyr-Thr-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LDKDSFQSEUOCOO-RPTUDFQQSA-N 0.000 description 2
- MWUYSCVVPVITMW-IGNZVWTISA-N Tyr-Tyr-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 MWUYSCVVPVITMW-IGNZVWTISA-N 0.000 description 2
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 2
- NVJCMGGZHOJNBU-UFYCRDLUSA-N Tyr-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N NVJCMGGZHOJNBU-UFYCRDLUSA-N 0.000 description 2
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 2
- UUYCNAXCCDNULB-QXEWZRGKSA-N Val-Arg-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O UUYCNAXCCDNULB-QXEWZRGKSA-N 0.000 description 2
- JYVKKBDANPZIAW-AVGNSLFASA-N Val-Arg-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N JYVKKBDANPZIAW-AVGNSLFASA-N 0.000 description 2
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 description 2
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 2
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 2
- COSLEEOIYRPTHD-YDHLFZDLSA-N Val-Asp-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 COSLEEOIYRPTHD-YDHLFZDLSA-N 0.000 description 2
- CFSSLXZJEMERJY-NRPADANISA-N Val-Gln-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CFSSLXZJEMERJY-NRPADANISA-N 0.000 description 2
- OUUBKKIJQIAPRI-LAEOZQHASA-N Val-Gln-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OUUBKKIJQIAPRI-LAEOZQHASA-N 0.000 description 2
- PWRITNSESKQTPW-NRPADANISA-N Val-Gln-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N PWRITNSESKQTPW-NRPADANISA-N 0.000 description 2
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 2
- FXVDGDZRYLFQKY-WPRPVWTQSA-N Val-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C FXVDGDZRYLFQKY-WPRPVWTQSA-N 0.000 description 2
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 2
- ZIGZPYJXIWLQFC-QTKMDUPCSA-N Val-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C(C)C)N)O ZIGZPYJXIWLQFC-QTKMDUPCSA-N 0.000 description 2
- WNZSAUMKZQXHNC-UKJIMTQDSA-N Val-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N WNZSAUMKZQXHNC-UKJIMTQDSA-N 0.000 description 2
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 2
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 2
- GVJUTBOZZBTBIG-AVGNSLFASA-N Val-Lys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N GVJUTBOZZBTBIG-AVGNSLFASA-N 0.000 description 2
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 2
- MBGFDZDWMDLXHQ-GUBZILKMSA-N Val-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MBGFDZDWMDLXHQ-GUBZILKMSA-N 0.000 description 2
- JVGHIFMSFBZDHH-WPRPVWTQSA-N Val-Met-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N JVGHIFMSFBZDHH-WPRPVWTQSA-N 0.000 description 2
- BGXVHVMJZCSOCA-AVGNSLFASA-N Val-Pro-Lys Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N BGXVHVMJZCSOCA-AVGNSLFASA-N 0.000 description 2
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 2
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 2
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 2
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 2
- CFIBZQOLUDURST-IHRRRGAJSA-N Val-Tyr-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CS)C(=O)O)N CFIBZQOLUDURST-IHRRRGAJSA-N 0.000 description 2
- ZNGPROMGGGFOAA-JYJNAYRXSA-N Val-Tyr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 ZNGPROMGGGFOAA-JYJNAYRXSA-N 0.000 description 2
- 208000004354 Vulvar Neoplasms Diseases 0.000 description 2
- 230000005856 abnormality Effects 0.000 description 2
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 2
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 2
- 108010041407 alanylaspartic acid Proteins 0.000 description 2
- 108010070783 alanyltyrosine Proteins 0.000 description 2
- 238000010171 animal model Methods 0.000 description 2
- 201000011165 anus cancer Diseases 0.000 description 2
- 229960003121 arginine Drugs 0.000 description 2
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 2
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 2
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 2
- 108010047857 aspartylglycine Proteins 0.000 description 2
- 238000003556 assay Methods 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 238000007622 bioinformatic analysis Methods 0.000 description 2
- 229960000074 biopharmaceutical Drugs 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 210000004369 blood Anatomy 0.000 description 2
- 239000008280 blood Substances 0.000 description 2
- 210000005252 bulbus oculi Anatomy 0.000 description 2
- 101150102633 cact gene Proteins 0.000 description 2
- 210000003855 cell nucleus Anatomy 0.000 description 2
- 208000007951 cervical intraepithelial neoplasia Diseases 0.000 description 2
- 230000000052 comparative effect Effects 0.000 description 2
- 108091036078 conserved sequence Proteins 0.000 description 2
- 239000013078 crystal Substances 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 239000012470 diluted sample Substances 0.000 description 2
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 2
- 239000006167 equilibration buffer Substances 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 2
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 2
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 2
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 2
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 2
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 2
- 108010059898 glycyl-tyrosyl-lysine Proteins 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 108010085325 histidylproline Proteins 0.000 description 2
- 229940072221 immunoglobulins Drugs 0.000 description 2
- 239000012535 impurity Substances 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 238000011534 incubation Methods 0.000 description 2
- 230000002458 infectious effect Effects 0.000 description 2
- 238000011068 loading method Methods 0.000 description 2
- 108700023046 methionyl-leucyl-phenylalanine Proteins 0.000 description 2
- 108010034507 methionyltryptophan Proteins 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 229940031348 multivalent vaccine Drugs 0.000 description 2
- 239000013642 negative control Substances 0.000 description 2
- 230000012223 nuclear import Effects 0.000 description 2
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 2
- 108010051242 phenylalanylserine Proteins 0.000 description 2
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 2
- 239000010452 phosphate Substances 0.000 description 2
- 239000013641 positive control Substances 0.000 description 2
- 230000002265 prevention Effects 0.000 description 2
- 230000002062 proliferating effect Effects 0.000 description 2
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 2
- 108010079317 prolyl-tyrosine Proteins 0.000 description 2
- 108010070643 prolylglutamic acid Proteins 0.000 description 2
- 108010053725 prolylvaline Proteins 0.000 description 2
- 230000000717 retained effect Effects 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 238000013518 transcription Methods 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 108010080629 tryptophan-leucine Proteins 0.000 description 2
- 108010051110 tyrosyl-lysine Proteins 0.000 description 2
- 208000013139 vaginal neoplasm Diseases 0.000 description 2
- NQPDZGIKBAWPEJ-UHFFFAOYSA-N valeric acid Chemical compound CCCCC(O)=O NQPDZGIKBAWPEJ-UHFFFAOYSA-N 0.000 description 2
- 230000029812 viral genome replication Effects 0.000 description 2
- 230000006656 viral protein synthesis Effects 0.000 description 2
- 230000006490 viral transcription Effects 0.000 description 2
- 239000011534 wash buffer Substances 0.000 description 2
- 101150084750 1 gene Proteins 0.000 description 1
- 241000143437 Aciculosporium take Species 0.000 description 1
- 201000007490 Adenocarcinoma in Situ Diseases 0.000 description 1
- UCIYCBSJBQGDGM-LPEHRKFASA-N Ala-Arg-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N UCIYCBSJBQGDGM-LPEHRKFASA-N 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 1
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- XUGATJVGQUGQKY-ULQDDVLXSA-N Arg-Lys-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XUGATJVGQUGQKY-ULQDDVLXSA-N 0.000 description 1
- GMCOADLDNLGOFE-ZLUOBGJFSA-N Asn-Asp-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)N GMCOADLDNLGOFE-ZLUOBGJFSA-N 0.000 description 1
- SRUUBQBAVNQZGJ-LAEOZQHASA-N Asn-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SRUUBQBAVNQZGJ-LAEOZQHASA-N 0.000 description 1
- XVAPVJNJGLWGCS-ACZMJKKPSA-N Asn-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVAPVJNJGLWGCS-ACZMJKKPSA-N 0.000 description 1
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 1
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 1
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 1
- IOXWDLNHXZOXQP-FXQIFTODSA-N Asp-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N IOXWDLNHXZOXQP-FXQIFTODSA-N 0.000 description 1
- JUWISGAGWSDGDH-KKUMJFAQSA-N Asp-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=CC=C1 JUWISGAGWSDGDH-KKUMJFAQSA-N 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 241000282472 Canis lupus familiaris Species 0.000 description 1
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- 229940124957 Cervarix Drugs 0.000 description 1
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 description 1
- XTHUKRLJRUVVBF-WHFBIAKZSA-N Cys-Gly-Ser Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O XTHUKRLJRUVVBF-WHFBIAKZSA-N 0.000 description 1
- ZXCAQANTQWBICD-DCAQKATOSA-N Cys-Lys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N ZXCAQANTQWBICD-DCAQKATOSA-N 0.000 description 1
- 230000004543 DNA replication Effects 0.000 description 1
- 241000450599 DNA viruses Species 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- IXFVOPOHSRKJNG-LAEOZQHASA-N Gln-Asp-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IXFVOPOHSRKJNG-LAEOZQHASA-N 0.000 description 1
- CAXXTYYGFYTBPV-IUCAKERBSA-N Gln-Leu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CAXXTYYGFYTBPV-IUCAKERBSA-N 0.000 description 1
- ZEEPYMXTJWIMSN-GUBZILKMSA-N Gln-Lys-Ser Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CO)C(O)=O)NC(=O)[C@@H](N)CCC(N)=O ZEEPYMXTJWIMSN-GUBZILKMSA-N 0.000 description 1
- QFXNFFZTMFHPST-DZKIICNBSA-N Gln-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)N)N QFXNFFZTMFHPST-DZKIICNBSA-N 0.000 description 1
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 1
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 1
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 1
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 1
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- MBSSHYPAEHPSGY-LSJOCFKGSA-N His-Ala-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O MBSSHYPAEHPSGY-LSJOCFKGSA-N 0.000 description 1
- 241000701830 Human papillomavirus type 31 Species 0.000 description 1
- 241000701826 Human papillomavirus type 33 Species 0.000 description 1
- 241000701824 Human papillomavirus type 39 Species 0.000 description 1
- 241000701790 Human papillomavirus type 45 Species 0.000 description 1
- 241000701788 Human papillomavirus type 51 Species 0.000 description 1
- 241000701789 Human papillomavirus type 56 Species 0.000 description 1
- 241001502466 Human papillomavirus type 59 Species 0.000 description 1
- 241000722343 Human papillomavirus types Species 0.000 description 1
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 1
- AKOYRLRUFBZOSP-BJDJZHNGSA-N Ile-Lys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N AKOYRLRUFBZOSP-BJDJZHNGSA-N 0.000 description 1
- NAFIFZNBSPWYOO-RWRJDSDZSA-N Ile-Thr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NAFIFZNBSPWYOO-RWRJDSDZSA-N 0.000 description 1
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- LEVWYRKDKASIDU-IMJSIDKUSA-N L-cystine Chemical compound [O-]C(=O)[C@@H]([NH3+])CSSC[C@H]([NH3+])C([O-])=O LEVWYRKDKASIDU-IMJSIDKUSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 1
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- GOFJOGXGMPHOGL-DCAQKATOSA-N Leu-Ser-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(C)C GOFJOGXGMPHOGL-DCAQKATOSA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- CLBGMWIYPYAZPR-AVGNSLFASA-N Lys-Arg-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O CLBGMWIYPYAZPR-AVGNSLFASA-N 0.000 description 1
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 1
- SSJBMGCZZXCGJJ-DCAQKATOSA-N Lys-Asp-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O SSJBMGCZZXCGJJ-DCAQKATOSA-N 0.000 description 1
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 1
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 1
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 1
- PPNCMJARTHYNEC-MEYUZBJRSA-N Lys-Tyr-Thr Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)CC1=CC=C(O)C=C1 PPNCMJARTHYNEC-MEYUZBJRSA-N 0.000 description 1
- 239000007993 MOPS buffer Substances 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- HUURTRNKPBHHKZ-JYJNAYRXSA-N Met-Phe-Val Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 HUURTRNKPBHHKZ-JYJNAYRXSA-N 0.000 description 1
- GWADARYJIJDYRC-XGEHTFHBSA-N Met-Thr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GWADARYJIJDYRC-XGEHTFHBSA-N 0.000 description 1
- 108010089610 Nuclear Proteins Proteins 0.000 description 1
- 102000007999 Nuclear Proteins Human genes 0.000 description 1
- 206010031096 Oropharyngeal cancer Diseases 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- MZUSCVCCMHDHDF-UHFFFAOYSA-N P(=O)(=O)[W] Chemical compound P(=O)(=O)[W] MZUSCVCCMHDHDF-UHFFFAOYSA-N 0.000 description 1
- 208000002471 Penile Neoplasms Diseases 0.000 description 1
- 101000722180 Petunia hybrida Floral defensin-like protein 2 Proteins 0.000 description 1
- MMYUOSCXBJFUNV-QWRGUYRKSA-N Phe-Gly-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N MMYUOSCXBJFUNV-QWRGUYRKSA-N 0.000 description 1
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 1
- AQGUSRZKDZYGGV-GMOBBJLQSA-N Pro-Ile-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O AQGUSRZKDZYGGV-GMOBBJLQSA-N 0.000 description 1
- WVXQQUWOKUZIEG-VEVYYDQMSA-N Pro-Thr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O WVXQQUWOKUZIEG-VEVYYDQMSA-N 0.000 description 1
- YHUBAXGAAYULJY-ULQDDVLXSA-N Pro-Tyr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O YHUBAXGAAYULJY-ULQDDVLXSA-N 0.000 description 1
- UIUWGMRJTWHIJZ-ULQDDVLXSA-N Pro-Tyr-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O UIUWGMRJTWHIJZ-ULQDDVLXSA-N 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 206010038707 Respiratory papilloma Diseases 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- CJINPXGSKSZQNE-KBIXCLLPSA-N Ser-Ile-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O CJINPXGSKSZQNE-KBIXCLLPSA-N 0.000 description 1
- MHVXPTAMDHLTHB-IHPCNDPISA-N Ser-Phe-Trp Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 MHVXPTAMDHLTHB-IHPCNDPISA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 108010008038 Synthetic Vaccines Proteins 0.000 description 1
- 230000024932 T cell mediated immunity Effects 0.000 description 1
- 108091008874 T cell receptors Proteins 0.000 description 1
- 102000016266 T-Cell Antigen Receptors Human genes 0.000 description 1
- SWIKDOUVROTZCW-GCJQMDKQSA-N Thr-Asn-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O SWIKDOUVROTZCW-GCJQMDKQSA-N 0.000 description 1
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 1
- GMXIJHCBTZDAPD-QPHKQPEJSA-N Thr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N GMXIJHCBTZDAPD-QPHKQPEJSA-N 0.000 description 1
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 1
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 1
- RPECVQBNONKZAT-WZLNRYEVSA-N Thr-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H]([C@@H](C)O)N RPECVQBNONKZAT-WZLNRYEVSA-N 0.000 description 1
- CJEHCEOXPLASCK-MEYUZBJRSA-N Thr-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=C(O)C=C1 CJEHCEOXPLASCK-MEYUZBJRSA-N 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- 241000255993 Trichoplusia ni Species 0.000 description 1
- AFSYEUHJBVCPEL-JBACZVJFSA-N Trp-Gln-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=CC=C1 AFSYEUHJBVCPEL-JBACZVJFSA-N 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- XHALUUQSNXSPLP-UFYCRDLUSA-N Tyr-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 XHALUUQSNXSPLP-UFYCRDLUSA-N 0.000 description 1
- PJWCWGXAVIVXQC-STECZYCISA-N Tyr-Ile-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PJWCWGXAVIVXQC-STECZYCISA-N 0.000 description 1
- 241000700618 Vaccinia virus Species 0.000 description 1
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 1
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 1
- PCBOWMZAEDDKNH-HOTGVXAUSA-N [4-(trifluoromethoxy)phenyl]methyl (3as,6as)-2-(3-fluoro-4-sulfamoylbenzoyl)-1,3,3a,4,6,6a-hexahydropyrrolo[3,4-c]pyrrole-5-carboxylate Chemical compound C1=C(F)C(S(=O)(=O)N)=CC=C1C(=O)N1C[C@H]2CN(C(=O)OCC=3C=CC(OC(F)(F)F)=CC=3)C[C@@H]2C1 PCBOWMZAEDDKNH-HOTGVXAUSA-N 0.000 description 1
- CUJRVFIICFDLGR-UHFFFAOYSA-N acetylacetonate Chemical compound CC(=O)[CH-]C(C)=O CUJRVFIICFDLGR-UHFFFAOYSA-N 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 230000001464 adherent effect Effects 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 125000001931 aliphatic group Chemical group 0.000 description 1
- AZDRQVAHHNSJOQ-UHFFFAOYSA-N alumane Chemical class [AlH3] AZDRQVAHHNSJOQ-UHFFFAOYSA-N 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 230000003698 anagen phase Effects 0.000 description 1
- 239000003963 antioxidant agent Substances 0.000 description 1
- 239000007864 aqueous solution Substances 0.000 description 1
- 229960003589 arginine hydrochloride Drugs 0.000 description 1
- 125000003118 aryl group Chemical group 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 230000027455 binding Effects 0.000 description 1
- 230000000711 cancerogenic effect Effects 0.000 description 1
- 210000000234 capsid Anatomy 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 229910052799 carbon Inorganic materials 0.000 description 1
- 229910002091 carbon monoxide Inorganic materials 0.000 description 1
- 231100000315 carcinogenic Toxicity 0.000 description 1
- 125000002091 cationic group Chemical group 0.000 description 1
- 239000006143 cell culture medium Substances 0.000 description 1
- 239000002738 chelating agent Substances 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 229910052802 copper Inorganic materials 0.000 description 1
- 239000010949 copper Substances 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 229960003067 cystine Drugs 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 210000000805 cytoplasm Anatomy 0.000 description 1
- 238000012938 design process Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 150000002016 disaccharides Chemical class 0.000 description 1
- 238000001035 drying Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 239000012149 elution buffer Substances 0.000 description 1
- 239000000839 emulsion Substances 0.000 description 1
- 210000001808 exosome Anatomy 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000000855 fermentation Methods 0.000 description 1
- 230000004151 fermentation Effects 0.000 description 1
- 229940102767 gardasil 9 Drugs 0.000 description 1
- 230000007614 genetic variation Effects 0.000 description 1
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 1
- 150000002337 glycosamines Chemical group 0.000 description 1
- 108010087823 glycyltyrosine Proteins 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 229940039096 human papillomavirus type 18 l1 protein Drugs 0.000 description 1
- 229940064528 human papillomavirus type 6 l1 protein Drugs 0.000 description 1
- 229940124866 human papillomavirus vaccine Drugs 0.000 description 1
- 230000028996 humoral immune response Effects 0.000 description 1
- 229920001477 hydrophilic polymer Polymers 0.000 description 1
- 230000008696 hypoxemic pulmonary vasoconstriction Effects 0.000 description 1
- 210000000987 immune system Anatomy 0.000 description 1
- 210000003000 inclusion body Anatomy 0.000 description 1
- 229910052500 inorganic mineral Inorganic materials 0.000 description 1
- 238000005342 ion exchange Methods 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 210000002510 keratinocyte Anatomy 0.000 description 1
- GZQKNULLWNGMCW-PWQABINMSA-N lipid A (E. coli) Chemical class O1[C@H](CO)[C@@H](OP(O)(O)=O)[C@H](OC(=O)C[C@@H](CCCCCCCCCCC)OC(=O)CCCCCCCCCCCCC)[C@@H](NC(=O)C[C@@H](CCCCCCCCCCC)OC(=O)CCCCCCCCCCC)[C@@H]1OC[C@@H]1[C@@H](O)[C@H](OC(=O)C[C@H](O)CCCCCCCCCCC)[C@@H](NC(=O)C[C@H](O)CCCCCCCCCCC)[C@@H](OP(O)(O)=O)O1 GZQKNULLWNGMCW-PWQABINMSA-N 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 239000011707 mineral Substances 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 150000002772 monosaccharides Chemical class 0.000 description 1
- 239000002736 nonionic surfactant Substances 0.000 description 1
- 231100000252 nontoxic Toxicity 0.000 description 1
- 230000003000 nontoxic effect Effects 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 210000004940 nucleus Anatomy 0.000 description 1
- 201000006958 oropharynx cancer Diseases 0.000 description 1
- 208000003154 papilloma Diseases 0.000 description 1
- 231100000915 pathological change Toxicity 0.000 description 1
- 230000036285 pathological change Effects 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 108010073101 phenylalanylleucine Proteins 0.000 description 1
- 239000008363 phosphate buffer Substances 0.000 description 1
- 235000010482 polyoxyethylene sorbitan monooleate Nutrition 0.000 description 1
- 229920000053 polysorbate 80 Polymers 0.000 description 1
- 230000003449 preventive effect Effects 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 108010065320 prolyl-lysyl-glutamyl-lysine Proteins 0.000 description 1
- 230000001902 propagating effect Effects 0.000 description 1
- 230000026447 protein localization Effects 0.000 description 1
- 239000001397 quillaja saponaria molina bark Substances 0.000 description 1
- 230000009257 reactivity Effects 0.000 description 1
- 229940124551 recombinant vaccine Drugs 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 229930182490 saponin Natural products 0.000 description 1
- 150000007949 saponins Chemical class 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 239000012090 serum-supplement Substances 0.000 description 1
- 229910052708 sodium Inorganic materials 0.000 description 1
- 239000011734 sodium Substances 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 125000006850 spacer group Chemical group 0.000 description 1
- 230000009870 specific binding Effects 0.000 description 1
- 239000007921 spray Substances 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 238000012799 strong cation exchange Methods 0.000 description 1
- 150000005846 sugar alcohols Chemical class 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 238000002525 ultrasonication Methods 0.000 description 1
- 229940005605 valeric acid Drugs 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 210000002845 virion Anatomy 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Abstract
本發明關於一種嵌合的乳頭瘤病毒L1蛋白和編碼其的多核苷酸,HPV類病毒顆粒以及包含至少一種上述類病毒顆粒的多價HPV免疫原性組合物及其用途。所述嵌合的乳頭瘤病毒L1蛋白包含衍生於第一型別乳頭瘤病毒L1蛋白的N端片段,所述N端片段保持HPV相應型別L1蛋白的免疫原性;和衍生於第二型別乳頭瘤病毒L1蛋白的C端片段,所述第二型別乳頭狀瘤病毒L1蛋白具有相較於其他型別的L1蛋白表達量和可溶性較好的特性;其中所述嵌合的乳頭瘤病毒L1蛋白具有HPV相應型別L1蛋白的免疫原性。所述嵌合的乳頭瘤病毒L蛋白具有較高的表達量和可溶性,可用於疫苗的大規模生產。在一個實施方式中,至少一種所述HPV類病毒顆粒為嵌合的HPV類病毒顆粒,所述嵌合的HPV類病毒顆粒包含一種或多種嵌合HPV L1蛋白。The present invention relates to a chimeric papillomavirus L1 protein and a polynucleotide encoding the same, an HPV virus-like particle, and a multivalent HPV immunogenic composition comprising at least one of the virus-like particles, and uses thereof. The chimeric papillomavirus L1 protein comprises an N-terminal fragment derived from a first type of papillomavirus L1 protein, the N-terminal fragment retains the immunogenicity of the HPV corresponding type of L1 protein; and a C-terminal fragment derived from a second type of papillomavirus L1 protein, the second type of papillomavirus L1 protein has better expression and solubility than other types of L1 proteins; wherein the chimeric papillomavirus L1 protein has the immunogenicity of the HPV corresponding type of L1 protein. The chimeric papillomavirus L protein has a relatively high expression and solubility, and can be used for large-scale production of vaccines. In one embodiment, at least one of the HPV virus-like particles is a chimeric HPV virus-like particle, and the chimeric HPV virus-like particle comprises one or more chimeric HPV L1 proteins.
Description
本發明關於乳頭瘤病毒(HPV)L1蛋白和編碼該蛋白的多核苷酸,還關於HPV類病毒顆粒及含有HPV類病毒顆粒的預防人乳頭瘤病毒(HPV)相關疾病或感染的多價免疫原性組合物及其用途。The present invention relates to human papillomavirus (HPV) L1 protein and polynucleotide encoding the protein, as well as HPV-like virus particles and a multivalent immunogenic composition containing the HPV-like virus particles for preventing human papillomavirus (HPV)-related diseases or infections and uses thereof.
乳頭瘤病毒(papilloma virus,PV)屬於乳頭瘤病毒科(Papillomaviridae),能引起人、牛、狗、兔等的乳頭瘤。其成員人乳頭瘤病毒(Human Papillomavirus, HPV)為無包膜DNA病毒。該病毒的基因組為雙鏈閉環DNA,大小約7.2-8 kb,具有8個開放閱讀框,按照功能可分為三個區域:(1)早期區(E),約4.5 kb,編碼E1、E2、E4-E7共6個與病毒複製、轉錄及轉譯有關的非結構蛋白;(2)晚期區(L),約2.5kb,編碼主要衣殼蛋白L1和次要衣殼蛋白L2;(3)長調控區(LCR),其位於L區末端與E區起始端之間,長約800-900 bp,不編碼任何蛋白,但具有DNA複製和表達調控元件。Papilloma virus (PV) belongs to the Papillomaviridae family and can cause papilloma in humans, cattle, dogs, rabbits, etc. Its member, human papillomavirus (HPV), is a non-enveloped DNA virus. The genome of the virus is a double-stranded closed circular DNA with a size of approximately 7.2-8 kb and 8 open reading frames. It can be divided into three regions according to its function: (1) the early region (E), approximately 4.5 kb, encoding 6 non-structural proteins E1, E2, E4-E7 related to viral replication, transcription and translation; (2) the late region (L), approximately 2.5 kb, encoding the major capsid protein L1 and the minor capsid protein L2; (3) the long regulatory region (LCR), which is located between the end of the L region and the beginning of the E region, is approximately 800-900 bp long, does not encode any protein, but has DNA replication and expression regulatory elements.
L1和L2蛋白在HPV感染週期的中晚期合成。L1蛋白是主要衣殼蛋白並且具有55-60 kDa的分子量。L2蛋白是次要衣殼蛋白。72個L1蛋白五聚體構成二十面體HPV病毒粒子的外殼(直徑為45-55 nm),其包裹閉環雙鏈DNA。L2蛋白質位於L1蛋白質內側(Structure of Small Virus-like Particles Assembled from the L1 Protein of Human Papillomavirus 16 Chen, X.S., R. L.Garcea, Mol.Cell. 5(3):557-567, 2000)。L1 and L2 proteins are synthesized in the middle and late stages of the HPV infection cycle. L1 protein is the major capsid protein and has a molecular weight of 55-60 kDa. L2 protein is the minor capsid protein. 72 L1 protein pentamers constitute the outer shell of the icosahedral HPV virus particle (diameter 45-55 nm), which encapsulates the closed-circular double-stranded DNA. L2 protein is located on the inner side of L1 protein (Structure of Small Virus-like Particles Assembled from the L1 Protein of Human Papillomavirus 16 Chen, X.S., R. L.Garcea, Mol.Cell. 5(3):557-567, 2000).
L1蛋白的ORF是PV基因組中最保守的基因,可以用於鑒別新的PV型。如果選殖了完整的基因組,並且L1 ORF的DNA序列與最接近的已知PV型相差超過10%,則被認定為分離出新的PV型。差異在2%和10%同源性被定義為不同的亞型,差異小於2%被定義為同一亞型的不同變種(E.-M. de Villiers et al. / Virology 324 (2004) 17–27)。The ORF of the L1 protein is the most conserved gene in the PV genome and can be used to identify new PV types. If the complete genome is cloned and the DNA sequence of the L1 ORF differs by more than 10% from the closest known PV type, a new PV type is considered to have been isolated. Differences between 2% and 10% homology are defined as different subtypes, and differences less than 2% are defined as different variants of the same subtype (E.-M. de Villiers et al. / Virology 324 (2004) 17–27).
在HPV感染的後期,細胞質中新合成的L1蛋白被輸送到終端分化的角蛋白細胞核中,與L2蛋白一起,包裝複製的HPV基因組DNA形成傳染性病毒(Nelson, L.M, et al. 2002. Nuclear import strategies of high risk HPV16 L1 major capsid protein. J. Biol. Chem. 277: 23958-23964)。這表明L1蛋白的核導入在HPV感染和生產中起著非常重要的作用。病毒進入細胞核的能力由HPV L1蛋白C端的核定位信號(NLS)決定,核定位信號的一個特徵是富含鹼性胺基酸(Garcia-Bustos, J., et al. 1991. Nuclear protein localization. Biochimica et Biophysica Acta 1071: 83-101)。In the late stage of HPV infection, newly synthesized L1 protein in the cytoplasm is transported to the nucleus of terminally differentiated keratinocytes, and together with L2 protein, it packages the replicated HPV genomic DNA to form infectious viruses (Nelson, L.M, et al. 2002. Nuclear import strategies of high risk HPV16 L1 major capsid protein. J. Biol. Chem. 277: 23958-23964). This shows that nuclear import of L1 protein plays a very important role in HPV infection and production. The ability of the virus to enter the cell nucleus is determined by the nuclear localization signal (NLS) at the C-terminus of the HPV L1 protein. One of the characteristics of the nuclear localization signal is that it is rich in basic amino acids (Garcia-Bustos, J., et al. 1991. Nuclear protein localization. Biochimica et Biophysica Acta 1071: 83-101).
15種高風險(HR) HPV型可導致子宮頸、肛門、陰莖、陰道、外陰和口咽癌。其中,HPV-16和HPV-18型是迄今最為常見的癌症起因,約佔子宮頸癌的70%,其餘為其他HR-HPV型(31、33、35、39、45、51、52、56、58、59、68、73和82)引起。HPV-16約占HPV陽性口咽癌(OPCs)的95%。持續低風險基因型HPV-6和HPV-11導致大多數肛門生殖器疣和呼吸道乳頭狀瘤,但很少與癌症相關(Human Papillomavirus in Cervical Cancer and Oropharyngeal Cancer: One Cause, Two Diseases Tara A. Bermanand John T. Schiller, PhD2 Cancer 2017;123:2219-29)。Fifteen high-risk (HR) HPV types can cause cervical, anal, penile, vaginal, vulvar, and oropharyngeal cancers. Of these, HPV-16 and HPV-18 are by far the most common causes of cancer, accounting for about 70% of cervical cancers, with the remainder caused by other HR-HPV types (31, 33, 35, 39, 45, 51, 52, 56, 58, 59, 68, 73, and 82). HPV-16 accounts for about 95% of HPV-positive oropharyngeal cancers (OPCs). The persistently low-risk genotypes HPV-6 and HPV-11 cause most anogenital warts and respiratory papillomas but are rarely associated with cancer (Human Papillomavirus in Cervical Cancer and Oropharyngeal Cancer: One Cause, Two Diseases Tara A. Bermanand John T. Schiller, PhD2 Cancer 2017;123:2219-29).
使用痘瘡病毒、桿狀病毒或酵母系統重組表達L1蛋白,L1蛋白可自我裝配形成類病毒顆粒(VLP),大約含有72個L1蛋白,與病毒體外殼相似。VLP沒有適應症。VLP可以在接種動物中誘導中和抗體,保護實驗動物免受感染性病毒的隨後攻擊。因此,VLP似乎是乳頭瘤病毒疫苗的優秀候選者(Structure of Small Virus-like Particles Assembled from the L1 Protein of Human Papillomavirus 16 Chen, X.S., R. L.Garcea, Mol.Cell. 5(3):557-567, 2000)。L1 protein is recombinantly expressed using vaccinia virus, bacilli or yeast systems. L1 protein can self-assemble to form virus-like particles (VLPs) containing approximately 72 L1 proteins, similar to the viral exosome. VLPs have no indications. VLPs can induce neutralizing antibodies in vaccinated animals and protect experimental animals from subsequent attacks by infectious viruses. Therefore, VLPs appear to be excellent candidates for papillomavirus vaccines (Structure of Small Virus-like Particles Assembled from the L1 Protein of Human Papillomavirus 16 Chen, X.S., R. L.Garcea, Mol.Cell. 5(3):557-567, 2000).
葛蘭素公司的CERVARIX ®是雙價重組HPV疫苗。其中含有由重組桿狀病毒表達載體系統在夜蛾(Trichoplusia ni)昆蟲細胞中表達獲得的HPV 16型重組L1蛋白和HPV 18型重組L1蛋白。L1蛋白自組裝成類病毒顆粒,用於預防9-25歲的婦女由16和18型HPV引起的子宮頸癌,2級或3級子宮頸上皮內瘤樣變和原位腺癌,和1級子宮頸上皮內瘤樣病變(致癌) (https://www.fda.gov/downloads/BiologicsBloodVaccines/Vaccines/ApprovedProducts/UCM186981.pdf)。 Glaxo's CERVARIX® is a bivalent recombinant HPV vaccine. It contains recombinant L1 proteins of HPV type 16 and HPV type 18 expressed in Trichoplusia ni insect cells by a recombinant bacilliform virus expression vector system. The L1 proteins self-assemble into virus-like particles and are used to prevent cervical cancer, grade 2 or 3 cervical intraepithelial neoplasia and adenocarcinoma in situ, and grade 1 cervical intraepithelial neoplasia (carcinogenic) caused by HPV types 16 and 18 in women aged 9-25 years (https://www.fda.gov/downloads/BiologicsBloodVaccines/Vaccines/ApprovedProducts/UCM186981.pdf).
GARDASIL ®是默克公司生產的人乳頭狀瘤病毒四價(6、11、16和18型)重組疫苗,用於9-26歲的女孩和婦女用於預防子宮頸癌生殖器疣(尖銳濕疣)和由HPV 6、11、16、18型引起癌前或增生異常病變;以及9-26歲的男孩和男人用於預防肛門癌、生殖器疣(尖銳濕疣)和由HPV 6、11、16、18型引起的癌前期或發育異常病變 (https://www.fda.gov/vaccines-blood-biologics/vaccines/gardasil)。 GARDASIL ® is a quadrivalent human papillomavirus (HPV) recombinant vaccine (types 6, 11, 16, and 18) manufactured by Merck for use in girls and women aged 9-26 years to prevent cervical cancer, genital warts (condylomata), and precancerous or developmental abnormalities caused by HPV types 6, 11, 16, and 18; and in boys and men aged 9-26 years to prevent anal cancer, genital warts (condylomata), and precancerous or developmental abnormalities caused by HPV types 6, 11, 16, and 18 (https://www.fda.gov/vaccines-blood-biologics/vaccines/gardasil).
GARDASIL ®9是默克公司生產的人乳頭狀瘤病毒九價重組疫苗,包含HPV 6、11、16、18、31、33、45、52和58型L1蛋白的類病毒顆粒,該L1蛋白由釀酒酵母發酵生產,自組裝為VLP。用於9-45歲的女孩和婦女用於預防HPV16、18、31、33、45、52和58型引起的子宮頸癌、外陰癌、陰道癌和肛門癌,HPV6和11引起的生殖器疣 (尖銳濕疣)和由HPV 6、11、16、18、31、33、45、52和58型引起癌前或或增生異常病變;以及9-45歲的男孩和男人用於預防16、18、31、33、45、52和58型引起的肛門癌,HPV 6和11引起的生殖器疣(尖銳濕疣)和由HPV 6、11、16、18、31、33、45、52和58型引起的癌前期或發育異常病變(https://www.fda.gov/vaccines-blood-biologics/vaccines/gardasil-9)。 GARDASIL ® 9 is a nine-valent recombinant human papillomavirus vaccine produced by Merck. It contains virus-like particles of L1 protein of HPV types 6, 11, 16, 18, 31, 33, 45, 52 and 58. The L1 protein is produced by fermentation of brewing yeast and self-assembled into VLPs. For use in girls and women aged 9-45 years to prevent cervical, vulvar, vaginal and anal cancers caused by HPV types 16, 18, 31, 33, 45, 52 and 58, genital warts (condyloma acuminatum) caused by HPV types 6 and 11, and precancerous or proliferative abnormalities caused by HPV types 6, 11, 16, 18, 31, 33, 45, 52 and 58; and in boys and men aged 9-45 years to prevent anal cancer caused by HPV types 16, 18, 31, 33, 45, 52 and 58, genital warts (condyloma acuminatum) caused by HPV types 6 and 11, and precancerous or proliferative abnormalities caused by HPV types 6, 11, 16, 18, 31, 33, 45, 52 and 58. Precancerous or developmental abnormalities caused by types 6, 11, 16, 18, 31, 33, 45, 52, and 58 (https://www.fda.gov/vaccines-blood-biologics/vaccines/gardasil-9).
GARDASIL ®9的說明書中聲稱HPV16和18型是約70%的子宮頸癌的發病緣由,其餘的20%病例歸責於31、33、45、52和58型,由是GARDASIL ®9可以預防90%的子宮頸癌的發生(https://www.fda.gov/BiologicsBloodVaccines/Vaccines/ApprovedProducts/ucm426445.htm)。 The instructions for GARDASIL ® 9 claim that HPV types 16 and 18 are the cause of approximately 70% of cervical cancer, and the remaining 20% of cases are attributed to types 31, 33, 45, 52 and 58. Therefore, GARDASIL ® 9 can prevent 90% of cervical cancer (https://www.fda.gov/BiologicsBloodVaccines/Vaccines/ApprovedProducts/ucm426445.htm).
HPV疫苗研製的關鍵因素是類病毒顆粒可進行大量生產。目前較為普遍的生產類病毒顆粒的系統主要分為真核表達系統和原核表達系統。The key factor in the development of HPV vaccines is that virus-like particles can be mass-produced. Currently, the more common systems for producing virus-like particles are mainly divided into eukaryotic expression systems and prokaryotic expression systems.
常用的真核表達系統有痘病毒表達系統、昆蟲桿狀病毒表達系統、酵母表達系統。在真核表達系統中表達的HPV L1蛋白的天然構型破壞較少,可自發裝配形成類病毒顆粒,但產量較低。原核表達系統主要大腸桿菌表達系統,產量高但大多以包涵體形式存在,不利於純化,生產程序複雜。Commonly used eukaryotic expression systems include poxvirus expression system, insect bacilli virus expression system, and yeast expression system. The natural conformation of HPV L1 protein expressed in eukaryotic expression system is less damaged and can spontaneously assemble to form virus-like particles, but the yield is low. The prokaryotic expression system is mainly the Escherichia coli expression system, which has high yield but mostly exists in the form of inclusion bodies, which is not conducive to purification and has a complicated production process.
因此,在本領域仍然存在獲得高產量的HPV類病毒顆粒,從而獲得HPV多價疫苗以廣效地預防HPV相關疾病或感染,包括目前商業化疫苗尚未涵蓋的HPV型別引起的HPV相關疾病或感染。Therefore, there is still a need in the art to obtain high-yield HPV virus-like particles, thereby obtaining HPV multivalent vaccines to effectively prevent HPV-related diseases or infections, including HPV-related diseases or infections caused by HPV types that are not covered by current commercial vaccines.
在一個方面,本發明提供一種嵌合的乳頭瘤病毒L1蛋白,自其N末端至C末端方向包含a. 衍生於第一型別乳頭瘤病毒L1蛋白的N端片段,所述N端片段保持HPV相應型別L1蛋白的免疫原性;和b. 衍生於第二型別乳頭瘤病毒L1蛋白的C端片段,所述第二型別乳頭狀瘤病毒L1蛋白具有相較於其他型別的L1蛋白表達量和可溶性較好的特性;其中所述嵌合的乳頭瘤病毒L1蛋白具有第一型別乳頭瘤病毒L1蛋白的免疫原性。In one aspect, the present invention provides a chimeric papillomavirus L1 protein, comprising, from its N-terminus to the C-terminus, a. an N-terminal fragment derived from a first type of papillomavirus L1 protein, wherein the N-terminal fragment retains the immunogenicity of the HPV corresponding type L1 protein; and b. a C-terminal fragment derived from a second type of papillomavirus L1 protein, wherein the second type of papillomavirus L1 protein has better expression and solubility than other types of L1 proteins; wherein the chimeric papillomavirus L1 protein has the immunogenicity of the first type of papillomavirus L1 protein.
在另一個方面,本發明提供一種乳頭瘤類病毒顆粒,其包含嵌合的乳頭瘤病毒L1蛋白。In another aspect, the present invention provides a papillomavirus-like particle comprising a chimeric papillomavirus L1 protein.
在另一個方面,本發明提供一種分離的多核苷酸,其編碼嵌合的乳頭瘤病毒L1蛋白。In another aspect, the present invention provides an isolated polynucleotide encoding a chimeric papillomavirus L1 protein.
在另一個方面,本發明提供一種載體,其包含編碼嵌合的乳頭瘤病毒L1蛋白的多核苷酸。In another aspect, the present invention provides a vector comprising a polynucleotide encoding a chimeric papillomavirus L1 protein.
在另一個方面,本發明提供一種桿狀病毒,其包含編碼嵌合的乳頭瘤病毒L1蛋白的多核苷酸。In another aspect, the present invention provides a bacillivirus comprising a polynucleotide encoding a chimeric papillomavirus L1 protein.
在另一個方面,本發明提供一種宿主細胞,其包含如前所述的多核苷酸、載體、或桿狀病毒。In another aspect, the present invention provides a host cell comprising the polynucleotide, vector, or bacilli as described above.
在一個方面,本發明提供一種預防HPV相關疾病或感染的HPV免疫原性組合物,其包含以上方面的嵌合的乳頭瘤病毒L1蛋白、HPV類病毒顆粒、多核苷酸、載體、桿狀病毒、細胞;In one aspect, the present invention provides an HPV immunogenic composition for preventing HPV-related diseases or infections, comprising the chimeric papillomavirus L1 protein, HPV virus-like particles, polynucleotides, vectors, bacilli, and cells of the above aspects;
在一個具體的實施方案中,其包含:由HPV 6型、11型、16型、18型、31型、33型、45型、52型和58型的嵌合的L1蛋白組裝而成的HPV類病毒顆粒;和HPV33型和59型的L1蛋白組裝而成的HPV類病毒顆粒。In a specific embodiment, it comprises: HPV virus-like particles assembled from chimeric L1 proteins of HPV types 6, 11, 16, 18, 31, 33, 45, 52 and 58; and HPV virus-like particles assembled from L1 proteins of HPV types 33 and 59.
在另一個方面,本發明提供一種預防HPV相關疾病或感染的方法,其包括:向受試者施用多價HPV免疫原性組合物。In another aspect, the present invention provides a method for preventing HPV-related diseases or infections, comprising: administering a multivalent HPV immunogenic composition to a subject.
在另一個方面,本發明提供多價HPV免疫原性組合物在用於製備用於預防HPV相關疾病或感染的疫苗或藥物中的用途。In another aspect, the present invention provides use of a multivalent HPV immunogenic composition for preparing a vaccine or a medicament for preventing HPV-related diseases or infections.
在一個方面,本發明提供一種嵌合的乳頭瘤病毒L1蛋白,自其N末端自C末端方向包含:a. 衍生於第一型別乳頭瘤病毒L1蛋白的N端片段,所述N端片段保持HPV相應型別L1蛋白的免疫原性;和b. 衍生於第二型別乳頭瘤病毒L1蛋白的C端片段,所述第二型別乳頭狀瘤病毒L1蛋白具有相較於其他型別的L1蛋白表達量和可溶性較好的特性;其中所述嵌合的乳頭瘤病毒L1蛋白具有第一型別乳頭瘤病毒L1蛋白的免疫原性。In one aspect, the present invention provides a chimeric papillomavirus L1 protein, comprising, from its N-terminus to the C-terminus: a. an N-terminal fragment derived from a first type of papillomavirus L1 protein, wherein the N-terminal fragment retains the immunogenicity of the HPV corresponding type L1 protein; and b. a C-terminal fragment derived from a second type of papillomavirus L1 protein, wherein the second type of papillomavirus L1 protein has better expression and solubility than other types of L1 proteins; wherein the chimeric papillomavirus L1 protein has the immunogenicity of the first type of papillomavirus L1 protein.
在一個實施方式中,所述N端片段為將所述第一型別乳頭瘤病毒L1蛋白的天然序列的C末端截短於其α5區內的任一胺基酸位點而得到的片段,以及與其具有至少98%的同一性的片段;所述C端片段為將第二型別乳頭狀瘤病毒L1蛋白的天然序列的N末端截短於其α5區內的任一胺基酸位點而得到的片段,以及該片段進一步突變、缺失和/或添加而產生的功能性變體。In one embodiment, the N-terminal fragment is a fragment obtained by truncating the C-terminus of the native sequence of the first type of papillomavirus L1 protein at any amino acid site in its α5 region, and a fragment having at least 98% identity therewith; the C-terminal fragment is a fragment obtained by truncating the N-terminus of the native sequence of the second type of papillomavirus L1 protein at any amino acid site in its α5 region, and a functional variant produced by further mutation, deletion and/or addition of the fragment.
在另一個實施方式中,所述N端片段與將所述第一型別乳頭瘤病毒L1蛋白天然序列的C端截短於其α5區內的任一胺基酸位點而得到的片段具有至少98.5%、99%、99.5%或100%的同一性。In another embodiment, the N-terminal fragment is at least 98.5%, 99%, 99.5% or 100% identical to the fragment obtained by truncating the C-terminus of the native sequence of the first type of papillomavirus L1 protein at any amino acid site in its α5 region.
在一個實施方式中,所述C端片段含有一個或多個核定位序列。In one embodiment, the C-terminal fragment contains one or more nuclear localization sequences.
在一個實施方式中,所述乳頭瘤病毒L1蛋白是HPV L1蛋白。In one embodiment, the papillomavirus L1 protein is HPV L1 protein.
在一個實施方式中,所述第二型別乳頭瘤病毒L1蛋白選自HPV 1型、2型、3型、4型、6型、7型、10型、11型、13型、16型、18型、22型、26型、28型、31型、32型、33型、35型、39型、42型、44型、45型、51型、52型、53型、56型、58型、59型、60型、63型、66型、68型、73型或82型L1蛋白; 較佳地,所述第二型別乳頭瘤病毒L1蛋白選自HPV 16型、28型、33型、59型、或68型L1蛋白; 更較佳地,所述第二型別乳頭瘤病毒L1蛋白選自HPV 33型或HPV 59型L1蛋白。 In one embodiment, the second type of papillomavirus L1 protein is selected from HPV type 1, 2, 3, 4, 6, 7, 10, 11, 13, 16, 18, 22, 26, 28, 31, 32, 33, 35, 39, 42, 44, 45, 51, 52, 53, 56, 58, 59, 60, 63, 66, 68, 73 or 82 L1 protein; Preferably, the second type of papillomavirus L1 protein is selected from HPV type 16, 28, 33, 59, or 68 L1 protein; More preferably, the second type of papillomavirus L1 protein is selected from HPV type 33 or HPV type 59 L1 protein.
在一個實施方式中,所述第二型別乳頭瘤病毒L1蛋白為HPV 33型L1蛋白,,所述C端片段為SEQ ID No: 2;或其長度為m1個胺基酸的片段,較佳涵蓋SEQ ID No: 2的第1-m1位胺基酸的片段;其中m1為8-26的整數;或所述C端片段為SEQ ID No: 135;或其長度為m2個胺基酸的片段,較佳涵蓋SEQ ID No: 135的第1-m2位胺基酸的片段;其中m2為13-31的整數。In one embodiment, the second type of papillomavirus L1 protein is HPV 33 type L1 protein, the C-terminal fragment is SEQ ID No: 2; or a fragment with a length of m1 amino acids, preferably covering the fragment of amino acids 1-m1 of SEQ ID No: 2; wherein m1 is an integer of 8-26; or the C-terminal fragment is SEQ ID No: 135; or a fragment with a length of m2 amino acids, preferably covering the fragment of amino acids 1-m2 of SEQ ID No: 135; wherein m2 is an integer of 13-31.
在一個實施方式中,HPV 33型L1蛋白的C端片段具有一個核定位序列。在另一個實施方式中,HPV 33型L1蛋白的C端片段具有兩個核定位序列。在一些實施方式中,嵌合的乳頭瘤病毒L1蛋白包含一個或多個HPV 33型L1蛋白的C端片段。所述多個HPV 33型L1蛋白的C端片段可以相同也可以不同。在一個實施方式中,SEQ ID No: 2的胺基酸編號7-8的胺基酸序列(KR)和胺基酸序列編號20-23的胺基酸序列(KRKK)為HPV 33型L1蛋白的C端片段的核定位序列。In one embodiment, the C-terminal fragment of the HPV 33 type L1 protein has one nuclear localization sequence. In another embodiment, the C-terminal fragment of the HPV 33 type L1 protein has two nuclear localization sequences. In some embodiments, the chimeric papillomavirus L1 protein comprises one or more C-terminal fragments of the HPV 33 type L1 protein. The C-terminal fragments of the multiple HPV 33 type L1 proteins may be the same or different. In one embodiment, the amino acid sequence (KR) of amino acid sequence numbers 7-8 of SEQ ID No: 2 and the amino acid sequence (KRKK) of amino acid sequence numbers 20-23 are the nuclear localization sequences of the C-terminal fragment of the HPV 33 type L1 protein.
在另一個實施方式中,所述第二型別乳頭瘤病毒L1蛋白為HPV 59型L1蛋白,所述C端片段為SEQ ID No: 13;或其長度為n個胺基酸的片段,優選涵蓋SEQ ID No: 13的第1-n 位胺基酸的片段;其中n為16-38的整數。In another embodiment, the second type of papillomavirus L1 protein is HPV type 59 L1 protein, and the C-terminal fragment is SEQ ID No: 13; or a fragment thereof having a length of n amino acids, preferably a fragment covering amino acids 1 to n of SEQ ID No: 13; wherein n is an integer of 16-38.
在一個實施方式中,HPV 59型L1蛋白的C端片段具有一個核定位序列。在另一個實施方式中,HPV 59型L1蛋白的C端片段具有兩個核定位序列。在一些實施方式中,嵌合的乳頭瘤病毒L1蛋白包含一個或多個HPV 59型L1蛋白的C端片段。所述多個HPV 59型L1蛋白的C端片段可以相同也可以不同。In one embodiment, the C-terminal fragment of the HPV 59 L1 protein has one nuclear localization sequence. In another embodiment, the C-terminal fragment of the HPV 59 L1 protein has two nuclear localization sequences. In some embodiments, the chimeric papillomavirus L1 protein comprises one or more C-terminal fragments of the HPV 59 L1 protein. The C-terminal fragments of the multiple HPV 59 L1 proteins may be the same or different.
在一個實施方式中,所述第一型別乳頭瘤病毒L1蛋白選自HPV 6型、11型、16型、18型、31型、35型、39型、45型、51型、52型、56型或58型L1蛋白。In one embodiment, the first type of papillomavirus L1 protein is selected from HPV type 6, type 11, type 16, type 18, type 31, type 35, type 39, type 45, type 51, type 52, type 56 or type 58 L1 protein.
在一個實施方式中,所述HPV 6型L1蛋白的N端片段與將SEQ ID No: 1所示序列的C末端截短於其α5區內的任一胺基酸位點而得到的片段具有98%、98.5%、99%、99.5%、99%或100%的同一性; 所述HPV 11型L1蛋白的N端片段與將SEQ ID No: 14所示序列的C末端截短於其α5區內的任一胺基酸位點而得到的片段具有98%、98.5%、99%、99.5%、99%或100%的同一性; 所述HPV 16型L1蛋白的N端片段與將SEQ ID No: 27所示序列的C末端截短於其α5區內的任一胺基酸位點而得到的片段具有98%、98.5%、99%、99.5%、99%或100%的同一性; 所述HPV 18型L1蛋白的N端片段與將SEQ ID No: 40所示序列的C末端截短於其α5區內的任一胺基酸位點而得到的片段具有98%、98.5%、99%、99.5%、99%或100%的同一性; 所述HPV 31型L1蛋白的N端片段與將SEQ ID No: 53所示序列的C末端截短於其α5區內的任一胺基酸位點而得到的片段具有98%、98.5%、99%、99.5%、99%或100%的同一性; 所述HPV 35型L1蛋白的N端片段與將SEQ ID No: 69所示序列的C末端截短於其α5區內的任一胺基酸位點而得到的片段具有98%、98.5%、99%、99.5%、99%或100%的同一性; 所述HPV 39型L1蛋白的N端片段與將SEQ ID No: 82所示序列的C末端截短於其α5區內的任一胺基酸位點而得到的片段具有98%、98.5%、99%、99.5%、99%或100%的同一性; 所述HPV 45型L1蛋白的N端片段與將SEQ ID No: 95所示序列的C末端截短於其α5區內的任一胺基酸位點而得到的片段具有98%、98.5%、99%、99.5%、99%或100%的同一性; 所述HPV 51型L1蛋白的N端片段與將SEQ ID No: 108所示序列的C末端截短於其α5區內的任一胺基酸位點而得到的片段具有98%、98.5%、99%、99.5%、99%或100%的同一性; 所述HPV 52型L1蛋白的N端片段與將SEQ ID No: 121所示序列的C末端截短於其α5區內的任一胺基酸位點而得到的片段具有98%、98.5%、99%、99.5%、99%或100%的同一性; 所述HPV 56型L1蛋白的N端片段與將SEQ ID No: 134所示序列的C末端截短於其α5區內的任一胺基酸位點而得到的片段具有98%、98.5%、99%、99.5%、99%或100%的同一性;和 所述HPV 58型L1蛋白的N端片段與將SEQ ID No: 147所示序列的C末端截短於其α5區內的任一胺基酸位點而得到的片段具有98%、98.5%、99%、99.5%、99%或100%的同一性。 In one embodiment, the N-terminal fragment of the HPV 6 L1 protein has 98%, 98.5%, 99%, 99.5%, 99% or 100% identity with the fragment obtained by truncating the C-terminus of the sequence shown in SEQ ID No: 1 at any amino acid site in its α5 region; The N-terminal fragment of the HPV 11 L1 protein has 98%, 98.5%, 99%, 99.5%, 99% or 100% identity with the fragment obtained by truncating the C-terminus of the sequence shown in SEQ ID No: 14 at any amino acid site in its α5 region; The N-terminal fragment of the HPV 16 L1 protein has 98%, 98.5%, 99%, 99.5%, 99% or 100% identity with the fragment obtained by truncating the C-terminus of the sequence shown in SEQ ID No: 27 at any amino acid site in its α5 region; The HPV The N-terminal fragment of the HPV 18 L1 protein has 98%, 98.5%, 99%, 99.5%, 99% or 100% identity with the fragment obtained by truncating the C-terminus of the sequence shown in SEQ ID No: 40 at any amino acid site in its α5 region; The N-terminal fragment of the HPV 31 L1 protein has 98%, 98.5%, 99%, 99.5%, 99% or 100% identity with the fragment obtained by truncating the C-terminus of the sequence shown in SEQ ID No: 53 at any amino acid site in its α5 region; The N-terminal fragment of the HPV 35 L1 protein has 98%, 98.5%, 99%, 99.5%, 99% or 100% identity with the fragment obtained by truncating the C-terminus of the sequence shown in SEQ ID No: 69 at any amino acid site in its α5 region; The N-terminal fragment of the HPV 39 L1 protein has 98%, 98.5%, 99%, 99.5%, 99% or 100% identity with the fragment obtained by truncating the C-terminus of the sequence shown in SEQ ID No: 69 at any amino acid site in its α5 region; 82, the C-terminal truncation of the sequence shown in SEQ ID No: 82 at any amino acid site in its α5 region has 98%, 98.5%, 99%, 99.5%, 99% or 100% identity; The N-terminal fragment of the HPV 45 L1 protein has 98%, 98.5%, 99%, 99.5%, 99% or 100% identity with the fragment obtained by truncating the C-terminal of the sequence shown in SEQ ID No: 95 at any amino acid site in its α5 region; The N-terminal fragment of the HPV 51 L1 protein has 98%, 98.5%, 99%, 99.5%, 99% or 100% identity with the fragment obtained by truncating the C-terminal of the sequence shown in SEQ ID No: 108 at any amino acid site in its α5 region; The N-terminal fragment of the HPV 52 L1 protein has 98%, 98.5%, 99%, 99.5%, 99% or 100% identity with the fragment obtained by truncating the C-terminal of the sequence shown in SEQ ID No: 108 at any amino acid site in its α5 region; The fragment obtained by truncating the C-terminus of the sequence shown in SEQ ID No: 121 at any amino acid site in its α5 region has 98%, 98.5%, 99%, 99.5%, 99% or 100% identity; The N-terminal fragment of the HPV 56 type L1 protein has 98%, 98.5%, 99%, 99.5%, 99% or 100% identity with the fragment obtained by truncating the C-terminus of the sequence shown in SEQ ID No: 134 at any amino acid site in its α5 region; and The N-terminal fragment of the HPV 58 type L1 protein has 98%, 98.5%, 99%, 99.5%, 99% or 100% identity with the fragment obtained by truncating the C-terminus of the sequence shown in SEQ ID No: 147 at any amino acid site in its α5 region.
在一個實施方式中,所述N端片段的C末端與所述C端片段的N末端直接連接或透過連接子連接。In one embodiment, the C-terminus of the N-terminal fragment is directly linked to the N-terminus of the C-terminal fragment or via a linker.
連接子不影響所述N端片段的免疫原性,且不影響蛋白的表達量或可溶性。在一個實施方式中,所述N端片段和所述C端片段透過由1、2、3、4、5、6、7、8、9或10個胺基酸組成的連接子連接。在一個實施方式中,連接子是人工序列。在另一個實施方式中,連接子是HPV L1蛋白中天然存在的序列。在另一個實施方式中,連接子可以是HPV 33型L1蛋白的部分序列。在另一個實施方式中,連接子可以是HPV 59型L1蛋白的部分序列。The linker does not affect the immunogenicity of the N-terminal fragment and does not affect the expression or solubility of the protein. In one embodiment, the N-terminal fragment and the C-terminal fragment are connected by a linker consisting of 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10 amino acids. In one embodiment, the linker is an artificial sequence. In another embodiment, the linker is a sequence naturally occurring in the HPV L1 protein. In another embodiment, the linker can be a partial sequence of the HPV 33 type L1 protein. In another embodiment, the linker can be a partial sequence of the HPV 59 type L1 protein.
在一個實施方式中,所述N端片段的C末端與所述C端片段的N末端連接時,在連接點的正負4個胺基酸位點的範圍內存在以下連續胺基酸序列:RKFL;較佳地,在連接點的正負6個胺基酸位點的範圍內存在以下連續胺基酸序列:LGRKFL。In one embodiment, when the C-terminus of the N-terminal fragment is connected to the N-terminus of the C-terminal fragment, the following continuous amino acid sequence exists within the range of positive and negative 4 amino acid sites of the connection point: RKFL; preferably, the following continuous amino acid sequence exists within the range of positive and negative 6 amino acid sites of the connection point: LGRKFL.
在一些實施方式中,所述嵌合的HPV 6型、11型、16型、18型、31型、35型、39型、45型、51型、52型、56型和58型嵌合HPV L1蛋白分別與SEQ ID No: 3、SEQ ID No: 16、SEQ ID No: 29、SEQ ID No: 42、SEQ ID No: 55、SEQ ID No: 71、SEQ ID No: 84、SEQ ID No: 97、SEQ ID No: 110、SEQ ID No: 123、SEQ ID No: 136和SEQ ID No: 149具有98%、98.5%、99%、99.5%或100%的同一性。以及HPV 33型L1蛋白和HPV 59型L1蛋白分別與SEQ ID No: 66和SEQ ID No: 160具有98%、98.5%、99%、99.5%或100%的同一性。In some embodiments, the chimeric HPV type 6, type 11, type 16, type 18, type 31, type 35, type 39, type 45, type 51, type 52, type 56 and type 58 chimeric HPV L1 protein has 98%, 98.5%, 99%, 99.5% or 100% identity with SEQ ID No: 3, SEQ ID No: 16, SEQ ID No: 29, SEQ ID No: 42, SEQ ID No: 55, SEQ ID No: 71, SEQ ID No: 84, SEQ ID No: 97, SEQ ID No: 110, SEQ ID No: 123, SEQ ID No: 136 and SEQ ID No: 149, respectively. and HPV type 33 L1 protein and HPV type 59 L1 protein have 98%, 98.5%, 99%, 99.5% or 100% identity with SEQ ID No: 66 and SEQ ID No: 160, respectively.
在一個方面,本發明提供一種乳頭瘤類病毒顆粒,其包含如前所述的嵌合的乳頭瘤病毒L1蛋白。在一個實施方式中,乳頭瘤類病毒顆粒是HPV類病毒顆粒,在一個實施方式中,HPV類病毒顆粒為由72個所述嵌合的HPV L1蛋白的五聚體構成的二十面體。在一個實施方式中,HPV類病毒顆粒具有正確形成的雙硫鍵,因而具有良好的天然構型。在一個實施方式中,HPV類病毒顆粒在體內表達系統中自行裝配。In one aspect, the present invention provides a papillomavirus-like particle comprising a chimeric papillomavirus L1 protein as described above. In one embodiment, the papillomavirus-like particle is an HPV virus-like particle, and in one embodiment, the HPV virus-like particle is an icosahedron composed of 72 pentamers of the chimeric HPV L1 protein. In one embodiment, the HPV virus-like particle has correctly formed disulfide bonds and thus has a good natural configuration. In one embodiment, the HPV virus-like particle self-assembles in an in vivo expression system.
在一個方面,本發明提供一種分離的多核苷酸,其編碼如前所述的嵌合的乳頭瘤病毒L1蛋白。在一個實施方式中,多核苷酸為針對不同表達系統進行密碼子優化後的多核苷酸。在一個實施方式中,多核苷酸為針對昆蟲桿狀病毒表達系統進行密碼子優化後的多核苷酸。In one aspect, the present invention provides an isolated polynucleotide encoding a chimeric papillomavirus L1 protein as described above. In one embodiment, the polynucleotide is a polynucleotide that has been codon-optimized for a different expression system. In one embodiment, the polynucleotide is a polynucleotide that has been codon-optimized for an insect bacilliform virus expression system.
在一個方面,本發明提供一種載體,其包含如前所述的多核苷酸。在一個實施方式中,載體為桿狀病毒載體。在一個實施方式中,載體可以是用於桿狀病毒表達系統的轉移載體。在另一個實施方式中,載體可以是用於桿狀病毒表達系統的表達載體。在另一個實施方式中,載體可以是用於桿狀病毒表達系統的重組後的載體。In one aspect, the present invention provides a vector comprising a polynucleotide as described above. In one embodiment, the vector is a bacilli virus vector. In one embodiment, the vector may be a transfer vector for a bacilli virus expression system. In another embodiment, the vector may be an expression vector for a bacilli virus expression system. In another embodiment, the vector may be a recombinant vector for a bacilli virus expression system.
在一個方面,本發明提供一種桿狀病毒,其包含如前所述的多核苷酸。In one aspect, the present invention provides a bacillivirus comprising the polynucleotide as described above.
在一個方面,本發明提供一種宿主細胞,其包含如前所述的多核苷酸、載體、或桿狀病毒。在一個實施方式中,宿主細胞為昆蟲細胞,較佳地,所述昆蟲細胞選自Sf9細胞、Sf21細胞、Hi5細胞和S2細胞。In one aspect, the present invention provides a host cell comprising the polynucleotide, vector, or bacillivirus as described above. In one embodiment, the host cell is an insect cell, preferably, the insect cell is selected from Sf9 cells, Sf21 cells, Hi5 cells, and S2 cells.
在一個方面,本發明提供一種預防乳頭瘤病毒相關的疾病或感染的多價免疫原性組合物。其包含以上方面的HPV類病毒顆粒、多核苷酸、載體、桿狀病毒、和細胞。In one aspect, the present invention provides a multivalent immunogenic composition for preventing papillomavirus-related diseases or infections, comprising the HPV-like virus particles, polynucleotides, vectors, bacilli, and cells of the above aspects.
在一個實施方式中,其包含:由嵌合的L1蛋白,如HPV 6型、11型、16型、18型、31型、35型、39型、45型、51型、52型、56型和58型的L1蛋白組裝而成的HPV類病毒顆粒;和一種或多種由其他致病的HPV型別,如HPV33型和59型的L1蛋白組裝而成的HPV類病毒顆粒。In one embodiment, it comprises: HPV virus-like particles assembled from chimeric L1 proteins, such as L1 proteins of HPV types 6, 11, 16, 18, 31, 35, 39, 45, 51, 52, 56 and 58; and one or more HPV virus-like particles assembled from L1 proteins of other pathogenic HPV types, such as HPV33 and 59.
在一個實施方式中,所述HPV各型別的L1蛋白可以為天然存在的L1蛋白,或非天然存在的L1蛋白,或嵌合HPV L1蛋白。在一個實施方式中,所述HPV類病毒顆粒可以由單一型別的HPV L1蛋白裝配形成單價的HPV類病毒顆粒,也可以由多個型別的HPV L1蛋白共同裝配形成多價的HPV類病毒顆粒。In one embodiment, the L1 protein of each HPV type can be a naturally occurring L1 protein, a non-naturally occurring L1 protein, or a chimeric HPV L1 protein. In one embodiment, the HPV virus-like particle can be assembled from a single type of HPV L1 protein to form a monovalent HPV virus-like particle, or can be assembled from multiple types of HPV L1 proteins to form a multivalent HPV virus-like particle.
在一個實施方式中,所述嵌合的HPV類病毒顆粒可以由單一型別的嵌合HPV L1蛋白裝配形成單價的HPV類病毒顆粒,也可以由多個型別的嵌合HPV L1蛋白共同裝配形成多價的HPV類病毒顆粒。In one embodiment, the chimeric HPV virus-like particles can be assembled from a single type of chimeric HPV L1 protein to form monovalent HPV virus-like particles, or can be assembled from multiple types of chimeric HPV L1 proteins to form multivalent HPV virus-like particles.
在一些實施方式中,所述嵌合的HPV 6型、11型、16型、18型、31型、35型、39型、45型、51型、52型、56型和58型嵌合HPV L1蛋白分別與SEQ ID No: 3、SEQ ID No: 16、SEQ ID No: 29、SEQ ID No: 42、SEQ ID No: 55、SEQ ID No: 71、SEQ ID No: 84、SEQ ID No: 97、SEQ ID No: 110、SEQ ID No: 123、SEQ ID No: 136和SEQ ID No: 149具有98%、98.5%、99%、99.5%或100%的同一性;以及HPV 33型L1蛋白和HPV 59型L1蛋白分別與SEQ ID No: 66和SEQ ID No: 160具有98%、98.5%、99%、99.5%或100%的同一性。In some embodiments, the chimeric HPV type 6, type 11, type 16, type 18, type 31, type 35, type 39, type 45, type 51, type 52, type 56 and type 58 chimeric HPV L1 proteins are 98%, 98.5%, 99%, 99.5% or 100% identical to SEQ ID No: 3, SEQ ID No: 16, SEQ ID No: 29, SEQ ID No: 42, SEQ ID No: 55, SEQ ID No: 71, SEQ ID No: 84, SEQ ID No: 97, SEQ ID No: 110, SEQ ID No: 123, SEQ ID No: 136 and SEQ ID No: 149, respectively; and the HPV type 33 L1 protein and the HPV type 59 L1 protein are 98%, 98.5%, 99%, 99.5% or 100% identical to SEQ ID No: 66 and SEQ ID No: 160 has 98%, 98.5%, 99%, 99.5% or 100% identity.
在一個實施方式中,至少一種所述HPV類病毒顆粒由單一型別的嵌合HPV L1蛋白組成,較佳地,由具有相同的胺基酸序列的所述單一型別的嵌合HPV L1蛋白組成。In one embodiment, at least one of the HPV-like virus particles is composed of a single type of chimeric HPV L1 protein, preferably, is composed of a single type of chimeric HPV L1 protein having the same amino acid sequence.
在一個實施方式中,嵌合的HPV類病毒顆粒為由72個所述嵌合HPV L1蛋白的五聚體構成的二十面體。在一個實施方式中,嵌合的HPV類病毒顆粒具有正確形成的雙硫鍵,因而具有良好的天然構型。在一個實施方式中,嵌合的HPV類病毒顆粒在體內表達系統中自行裝配。In one embodiment, the chimeric HPV virus-like particle is an icosahedron composed of 72 pentamers of the chimeric HPV L1 protein. In one embodiment, the chimeric HPV virus-like particle has correctly formed disulfide bonds and thus has a good natural configuration. In one embodiment, the chimeric HPV virus-like particle self-assembles in an in vivo expression system.
在一個實施方式中,所述多價HPV免疫原性組合物還包含生理學上可接受的載體以及任選地,還包含佐劑。在一個實施方式中,佐劑包含鋁鹽、脂質A衍生物和ISCOM中的一種或多種。In one embodiment, the multivalent HPV immunogenic composition further comprises a physiologically acceptable carrier and optionally, an adjuvant. In one embodiment, the adjuvant comprises one or more of an aluminum salt, a lipid A derivative, and an ISCOM.
在一個實施方式中,所述佐劑為磷酸鋁佐劑。In one embodiment, the adjuvant is an aluminum phosphate adjuvant.
在一個方面,本發明提供一種預防HPV相關疾病或感染的方法,其包括:向受試者施用多價HPV免疫原性組合物。所述預防可被認為是治療,兩者可互換使用。在一個實施方式中,受試者是人。In one aspect, the present invention provides a method for preventing HPV-related diseases or infections, comprising: administering a polyvalent HPV immunogenic composition to a subject. The prevention can be considered as treatment, and the two can be used interchangeably. In one embodiment, the subject is a human.
在一個方面,本發明提供如前所述的多價HPV免疫原性組合物在用於製備用於預防HPV相關疾病或感染的疫苗或藥物中的用途。In one aspect, the present invention provides use of the multivalent HPV immunogenic composition as described above for preparing a vaccine or a medicament for preventing HPV-related diseases or infections.
真核表達系統表達的乳頭瘤病毒L1蛋白能自發裝配成類病毒顆粒,但具有表達量低不易規模化生產的缺點。The papillomavirus L1 protein expressed by the eukaryotic expression system can spontaneously assemble into virus-like particles, but has the disadvantages of low expression levels and difficulty in scalable production.
各型別HPV 的L1蛋白的序列可以從https://www.uniprot.org 方便地獲得。每一型別的HPV L1可以來源於不同的病毒株,因而其胺基酸序列有多個版本,其中任何一個版本的天然序列都可以用於本發明,本發明的構思和設計過程中,所用某一給定型別的HPV L1蛋白序列有可能不同於實施例中使用的序列,但是這種差異不影響發明人的判斷和結論。The sequence of L1 protein of each type of HPV can be conveniently obtained from https://www.uniprot.org. Each type of HPV L1 can be derived from different virus strains, so its amino acid sequence has multiple versions, any of which can be used in the present invention. In the conception and design process of the present invention, the sequence of a given type of HPV L1 protein used may be different from the sequence used in the embodiments, but this difference does not affect the judgment and conclusion of the inventor.
本發明所屬技術領域中具有通常知識者普遍認為L1蛋白的C端不含有主要中和抗原表位,因此試圖透過截短HPV L1蛋白的C端提高表達量,例如葛蘭素公司的美國專利US6361778B1中,HPV16 L1蛋白C端截短1-34個胺基酸,較佳26個胺基酸,聲明VLP的產量增加許多倍,最好至少增加10倍,特別是大約10到100倍。受此啟發,發明人嘗試將HPV 16型L1的C端截短31個胺基酸,命名為HPV16 L1(1-474)。但其蛋白表達量高但蛋白可溶性差,難以提取純化(見對比例)。It is generally believed by those skilled in the art that the C-terminus of the L1 protein does not contain a major neutralizing antigen epitope, and therefore attempts have been made to increase the expression level by truncating the C-terminus of the HPV L1 protein. For example, in GlaxoSmithKline's U.S. patent US6361778B1, the HPV16 L1 protein C-terminus is truncated by 1-34 amino acids, preferably 26 amino acids, and the VLP yield is claimed to be increased by many times, preferably at least 10 times, and especially about 10 to 100 times. Inspired by this, the inventors tried to truncate the C-terminus of HPV 16 L1 by 31 amino acids, and named it HPV16 L1 (1-474). However, the protein expression level was high, but the protein solubility was poor, and it was difficult to extract and purify (see comparative example).
這種截短引起的蛋白可溶性差有可能是C端的核定位序列的缺失造成的,本發明並不受限於此推測。發明人在研究和生產過程中發現HPV 16型L1蛋白、HPV 28型L1蛋白、HPV 33型L1蛋白、HPV 59型L1蛋白和HPV 68型L1蛋白相較於其他型別的L1蛋白表達量和可溶性較好,受此啟發,發明人用表達量和可溶性較好型別的L1 蛋白C端替換不易提取或表達量低的HPV型別的C端。即發明人構建了這樣一種嵌合蛋白:自其N末端至C末端方向包含衍生於第一型別乳頭瘤病毒L1蛋白(例如HPV L1蛋白)的N端片段和衍生於第二型別乳頭瘤病毒L1蛋白(例如HPV L1蛋白)的C端片段,前者提供第一型別乳頭瘤病毒(例如HPV)的免疫原性,後者提供表達量和可溶性較好的特性。兩者可以直接連接也可以透過連接子連接。The poor protein solubility caused by this truncation may be caused by the lack of the nuclear localization sequence at the C-terminus, and the present invention is not limited to this speculation. The inventors found in the research and production process that the HPV 16 L1 protein, HPV 28 L1 protein, HPV 33 L1 protein, HPV 59 L1 protein and HPV 68 L1 protein have better expression and solubility than other types of L1 proteins. Inspired by this, the inventors replaced the C-terminus of HPV types that are difficult to extract or have low expression with the C-terminus of L1 proteins with better expression and solubility. That is, the inventors have constructed such a chimeric protein: from its N-terminus to its C-terminus, it includes an N-terminal fragment derived from the first type of papillomavirus L1 protein (such as HPV L1 protein) and a C-terminal fragment derived from the second type of papillomavirus L1 protein (such as HPV L1 protein), the former providing the immunogenicity of the first type of papillomavirus (such as HPV), and the latter providing the characteristics of better expression and solubility. The two can be directly connected or connected through a linker.
為保持第一型別HPV L1蛋白的免疫原性,以及保證其能夠形成VLP,發明人確定了合適的HPV L1蛋白的N端片段的長度。以下報導關於常見HPV亞型的表位研究:In order to maintain the immunogenicity of the first type HPV L1 protein and ensure that it can form VLPs, the inventors determined the length of the appropriate N-terminal fragment of the HPV L1 protein. The following reports on epitope studies of common HPV subtypes:
Sunanda Baidya等人報導,L1蛋白的表位 48EEYDLQFIFQLCKITLTA65, 45RHGEEYDLQFIFQLCKITLTA65, 63LPDPNKF69, 79PETQRLVWAC88, 36PVPGQYDA43, 77YNPETQRLVWAC88, 188DTGYGAMD195, 36PVPGQYDATK45, 45KQDIPKVSAYQYRVFRV61, 130RDNVSVDYKQTQLCI144 and 49YSRHVEEY DLQFIF62 可以用作設計HPV16和18型疫苗的工具(參見Epitope design of L1 protein for vaccine production against Human Papilloma Virus types 16 and 18,Bioinformation 13(3):86-93 March 2017,透過引用全部併入本文)。Sunanda Baidya et al. reported that the epitopes of L1 protein 48EEYDLQFIFQLCKITLTA65, 45RHGEEYDLQFIFQLCKITLTA65, 63LPDPNKF69, 79PETQRLVWAC88, 36PVPGQYDA43, 77YNPETQRLVWAC88, 188DTGYGAMD195, 36PVPGQYDATK45, 45KQDIPKVSAYQYRVFRV61, 130RDNVSVDYKQTQLCI144 and 49YSRHVEEY DLQFIF62 can be used as tools for designing HPV16 and 18 vaccines (see Epitope design of L1 protein for vaccine production against Human Papilloma Virus types 16 and 18, Bioinformation 13(3):86-93 March 2014). 2017, which is incorporated herein by reference in its entirety).
Katharina Slupetzky等人報導HPV-16的 aa 282–286及351–355 附近的區域對於中和表位有貢獻,而且後者是免疫優勢位點(參見Chimeric papillomavirus-like particles expressing a foreign epitope on capsid surface loops,Journal of General Virology (2001), 82, 2799–2804,透過引用全部併入本文)。Katharina Slupetzky et al. reported that the regions around aa 282–286 and 351–355 of HPV-16 contribute to neutralizing epitopes and that the latter are immunodominant sites (see Chimeric papillomavirus-like particles expressing a foreign epitope on capsid surface loops, Journal of General Virology (2001), 82, 2799–2804, incorporated herein by reference in its entirety).
Brooke Bishop等人製備了HPV11、16、18和35 L1蛋白的以下3種變體:其N端9個胺基酸缺失、α4(對應於HPV16的404–436位胺基酸殘基)缺失、其C端31個胺基酸缺失,報導前兩者不能組裝成VLP,但是未報導後者有此現象 Brooke Bishop et al. prepared the following three variants of HPV11, 16, 18, and 35 L1 proteins: 9 amino acids missing at the N-terminus, α4 (corresponding to amino acid residues 404–436 of HPV16), and 31 amino acids missing at the C-terminus, and reported that the first two could not assemble into VLPs, but did not report this phenomenon for the latter.
(Crystal Structures of Four Types of Human Papillomavirus L1 Capsid Proteins UNDERSTANDING THE SPECIFICITY OF NEUTRALIZING MONOCLONAL ANTIBODIES,The Journal of Biological Chemistry,282,31803-31811。透過引用全部併入本文)。各型別的HPV L1蛋白的α螺旋、β折疊片各個Loop區都可以透過本領域常用的序列分析軟體方便地確定。其中α螺旋區包含α1區、α2區、α3區、α4區和α5區。(Crystal Structures of Four Types of Human Papillomavirus L1 Capsid Proteins UNDERSTANDING THE SPECIFICITY OF NEUTRALIZING MONOCLONAL ANTIBODIES, The Journal of Biological Chemistry, 282, 31803-31811. The entire contents are incorporated herein by reference). The α-helix and β-fold loop regions of each type of HPV L1 protein can be easily determined using sequence analysis software commonly used in the field. The α-helix region includes the α1 region, the α2 region, the α3 region, the α4 region, and the α5 region.
發明人對14種型別(6型、11型、16型、18型、31型、33型、35型、39型、45型、51型、52型、56型、58型和59型)的HPV L1蛋白進行序列比對,然後根據如上引用的文獻(Crystal Structures of Four Types of Human Papillomavirus L1 Capsid Proteins UNDERSTANDING THE SPECIFICITY OF NEUTRALIZING MONOCLONAL ANTIBODIES,The Journal of Biological Chemistry,282,31803-31811)進行二級結構預測,結果如下所示,其中向下的箭頭之間的部分對應於該文獻中關於的為製備變體而缺失的區域。 The inventors aligned the sequences of 14 types of HPV L1 proteins (type 6, type 11, type 16, type 18, type 31, type 33, type 35, type 39, type 45, type 51, type 52, type 56, type 58 and type 59), and then predicted the secondary structures according to the above-cited literature (Crystal Structures of Four Types of Human Papillomavirus L1 Capsid Proteins UNDERSTANDING THE SPECIFICITY OF NEUTRALIZING MONOCLONAL ANTIBODIES, The Journal of Biological Chemistry, 282, 31803-31811). The results are shown below, in which the portion between the downward arrows corresponds to the region deleted for preparing variants in the literature.
除發明人所用的序列對比的方法之外,可用於預測的蛋白質二級結構預測軟體包括但不限於: 1. JPred: http://www.compbio.dundee.ac.uk/jpred/index.html 2. ProtPredicct: http://predictprotein.org 3. PsiPred: http://bioinf.cs.ucl.ac.uk/psipred 4. SCRATCH-1D: http://download.igb.uci.edu 5. Nnpredict: http://www.cmpharm.ucsf.edu/~nomi/nnpredict 6. Predictprotein: http://www.embl-heidelberg.de/predictprotein/SOPMA http://www.ibcp.fr/predict.html 7. SSPRED: http://www.embl-heidelberg.de/sspred/ssprd_info.html。 In addition to the sequence alignment method used by the inventors, protein secondary structure prediction software that can be used for prediction includes but is not limited to: 1. JPred: http://www.compbio.dundee.ac.uk/jpred/index.html 2. ProtPredicct: http://predictprotein.org 3. PsiPred: http://bioinf.cs.ucl.ac.uk/psipred 4. SCRATCH-1D: http://download.igb.uci.edu 5. Nnpredict: http://www.cmpharm.ucsf.edu/~nomi/nnpredict 6. Predictprotein: http://www.embl-heidelberg.de/predictprotein/SOPMA http://www.ibcp.fr/predict.html 7. SSPRED: http://www.embl-heidelberg.de/sspred/ssprd_info.html.
在本發明的一個實施方式中,發明人以以下方式確定衍生於第一型別的HPV L1蛋白的N端片段的長度:將L1蛋白天然序列在其α5區及其附近區域截短,保留從其N末端至α5區域新產生的C末端的序列。如此截短的序列可以保證其具有本型別的免疫原性,且能夠形成VLP。In one embodiment of the present invention, the inventors determined the length of the N-terminal fragment derived from the first type of HPV L1 protein in the following manner: the native sequence of the L1 protein was truncated in its α5 region and its vicinity, and the sequence from its N-terminus to the newly generated C-terminus of the α5 region was retained. Such a truncated sequence can ensure that it has the immunogenicity of this type and can form VLPs.
衍生於第一型別的HPV L1蛋白的N端片段還可以進一步改造,以保證其具有本型別的免疫原性,且能夠形成VLP為限。The N-terminal fragment derived from the HPV L1 protein of the first type can be further modified to ensure that it has the immunogenicity of this type and is able to form VLP.
發明人以以下方式確定了衍生於第二型別的HPV L1蛋白的C端片段的長度。將L1蛋白天然序列在其α5區及其附近區域截短、保留從其α5區域新產生的N末端至C末端序列。如此截短的序列不具有主要中和抗原表位,不干擾形成的嵌合蛋白的免疫原性。The inventors determined the length of the C-terminal fragment derived from the second type of HPV L1 protein in the following manner. The native sequence of the L1 protein was truncated in its α5 region and its vicinity, and the newly generated N-terminal to C-terminal sequence from its α5 region was retained. Such a truncated sequence does not have a major neutralizing antigen epitope and does not interfere with the immunogenicity of the formed chimeric protein.
衍生於第二型別的HPV L1蛋白的C端片段還可以進一步突變、缺失和/或添加,較佳保留其至少一個核定位序列。Yang 等人預測了107種HPV亞型的核定位序列(Yang et al. Predicting the nuclear localization signals of 107 types of HPV L1 proteins by bioinformatic analysis. Geno. Prot. Bioinfo. Vol. 4 No. 1 2006,透過引用全部併入本文),各型別的HPV L1蛋白的核定位序列可以透過本領域常用的序列分析軟體方便地確定。The C-terminal fragment derived from the second type of HPV L1 protein can be further mutated, deleted and/or added, preferably retaining at least one nuclear localization sequence. Yang et al. predicted the nuclear localization sequences of 107 types of HPV subtypes (Yang et al. Predicting the nuclear localization signals of 107 types of HPV L1 proteins by bioinformatic analysis. Geno. Prot. Bioinfo. Vol. 4 No. 1 2006, incorporated herein by reference in its entirety), and the nuclear localization sequences of each type of HPV L1 protein can be conveniently determined by sequence analysis software commonly used in the art.
上述N端片段和C端片段的連接發生在前者的新產生的C末端和後者的新產生的N末端。可以是直接連接也可以是透過連接子連接。將連接點視為原點,則在原點的N端側為負,而其C端側為正。The connection between the N-terminal fragment and the C-terminal fragment occurs at the newly generated C-terminus of the former and the newly generated N-terminus of the latter. It can be directly connected or connected through a linker. If the connection point is regarded as the origin, the N-terminal side of the origin is negative, and its C-terminal side is positive.
如下示出HPV6 L1蛋白的453-469位胺基酸序列、以及多個型別HPV L1蛋白的與之相對應的一段序列。可以看出這些序列高度相似。這段序列和α5區有重合。括號內數字表示所列出序列的最後一位胺基酸的位置,其中對於HPV 45型,一些HPV 45型毒株的L1蛋白的N端存在額外的26個胺基酸,而在另一些HPV 45型毒株的L1蛋白的N端不存在所述額外的26個胺基酸,所以以(478)+26表示。 HPV6 ELDQYPLGRKFLLQSGY(469) HPV11 ELDQFPLGRKFLLQSGY(470) HPV16 DLDQFPLGRKFLLQAGL(474) HPV18 DLDQYPLGRKFLVQAGL(475) HPV31 DLDQFPLGRKFLLQAGY(475) HPV35 DLDQFPLGRKFLLQAGL(472) HPV39 ELDQFPLGRKFLLQARV (474) HPV45 DLDQYPLGRKFLVQAGL(478)+26 HPV51 DLDQFALGRKFLLQVGV(474) HPV52 DLDQFPLGRKFLLQAGL(478) HPV56 DLDQFPLGRKFLMQLGTRS(474) HPV58 DLDQFPLGRKFLLQSGL(473) HPV33 DLDQFPLGRKFLLQAGL(473) KAKPKLKRAAPTSTRTSSAKRKKVKK,其中,480-481的KR和493-496位的KRKK是核定位序列。 HPV59 DLDQFPLGRKFLLQLGA(475) RPKPTIGPRKRAAPAPTSTPSPKRVKRRKSSRK,其中,484-486的RKR和498-504的KRVKRRK是核定位序列。 The following shows the amino acid sequence of the 453-469 position of the HPV6 L1 protein and the corresponding sequence of multiple types of HPV L1 proteins. It can be seen that these sequences are highly similar. This sequence overlaps with the α5 region. The numbers in brackets indicate the position of the last amino acid in the listed sequence. For HPV 45, some HPV 45 strains have an additional 26 amino acids at the N-terminus of the L1 protein, while other HPV 45 strains do not have the additional 26 amino acids at the N-terminus of the L1 protein, so it is represented by (478)+26. HPV6 ELDQYPLGRKFLLQSGY(469) HPV11 ELDQFPLGRKFLLQSGY(470) HPV16 DLDQFPLGRKFLLQAGL(474) HPV18 DLDQYPLGRKFLVQAGL(475) HPV31 DLDQFPLGRKFLLQAGY(475) HPV35 DLDQFPLGRKFLLQAGL(472) HPV39 ELDQFPLGRKFLLQARV (474) HPV45 DLDQYPLGRKFLVQAGL(478)+26 HPV51 DLDQFALGRKFLLQVGV(474) HPV52 DLDQFPLGRKFLLQAGL(478) HPV56 DLDQFPLGRKFLMQLGTRS(474) HPV58 DLDQFPLGRKFLLQSGL(473) HPV33 DLDQFPLGRKFLLQAGL(473) KAKPKLKRAAPTSTRTSSAKRKKVKK, where KR at 480-481 and KRKK at 493-496 are nuclear localization sequences. HPV59 DLDQFPLGRKFLLQLGA(475) RPKPTIGPRKRAAPAPTSTPSPKRVKRRKSSRK, where RKR at 484-486 and KRVKRRK at 498-504 are nuclear localization sequences.
在本發明的一個實施方式中,發明人借助多個HPV型別之間的α5區及其附近區域的序列相似性,便利地完成了不同型別之間的L1蛋白的C端替換。In one embodiment of the present invention, the inventors conveniently completed the C-terminal replacement of L1 protein between different types by taking advantage of the sequence similarity of the α5 region and its surrounding regions between multiple HPV types.
在本發明的最佳的實施方式中,發明人注意到各個型別的HPV L1蛋白都在相似的位置具有一段四肽RKFL,更有利的情形是一段六肽LGRKFL。發明人巧妙地利用這一高度保守的序列,將嵌合蛋白的連接點設計在這一段寡肽的任一胺基酸位點。自一個方面看來,自嵌合蛋白N末端起至RKFL或LGRKFL止與衍生於第一型別的HPV L1 蛋白的N端片段的序列相同,而從另一方面看來自RKFL或LGRKFL起至嵌合蛋白的C末端止,與衍生於第二型別的L1蛋白的C端片段的序列相同。In the best embodiment of the present invention, the inventors noticed that each type of HPV L1 protein has a tetrapeptide RKFL at a similar position, and more preferably a hexapeptide LGRKFL. The inventors cleverly used this highly conserved sequence to design the connection point of the chimeric protein at any amino acid position of this oligopeptide. From one aspect, the sequence from the N-terminus of the chimeric protein to RKFL or LGRKFL is the same as the sequence of the N-terminal fragment derived from the first type of HPV L1 protein, and from another aspect, the sequence from RKFL or LGRKFL to the C-terminus of the chimeric protein is the same as the sequence of the C-terminal fragment derived from the second type of L1 protein.
如此產生的嵌合蛋白保持與天然HPV L1蛋白高度相似性, 可以預期在生產乃至此後的醫療或預防過程中,都會有良好的表現。The chimeric protein thus produced maintains a high degree of similarity to the natural HPV L1 protein and can be expected to perform well in production and in subsequent medical or preventive processes.
本發明所屬技術領域中具有通常知識者會理解,因為同一型別的HPV有不同的毒株,因此其天然序列不同,利用不同毒株構建而成的嵌合蛋白亦落入本發明。Those skilled in the art will understand that, because the same type of HPV has different strains and thus different natural sequences, chimeric proteins constructed using different strains also fall within the scope of the present invention.
本發明所屬技術領域中具有通常知識者會理解,因為不同型別HPV L1的高度相似性,如果在嵌合蛋白構建過程中,將衍生於第一型別HPV L1蛋白的N端片段向C末端延伸更多的胺基酸殘基,或者是將衍生於第二型別的HPV L1蛋白的C端片段向N末端延伸更多的胺基酸殘基,亦有可能因相應位點上胺基酸的相同或相似,形成與本發明結構一致的嵌合蛋白。如此形成的嵌合蛋白亦落入本發明。A person skilled in the art to which the present invention belongs will understand that, due to the high similarity of different types of HPV L1, if, during the construction of the chimeric protein, the N-terminal fragment derived from the first type of HPV L1 protein is extended to the C-terminal with more amino acid residues, or the C-terminal fragment derived from the second type of HPV L1 protein is extended to the N-terminal with more amino acid residues, it is also possible to form a chimeric protein consistent with the structure of the present invention due to the same or similar amino acids at the corresponding positions. The chimeric protein thus formed also falls within the scope of the present invention.
本發明所屬技術領域中具有通常知識者會理解,在以上描述的實施方式的嵌合蛋白的基礎上,會透過胺基酸殘基的突變、缺失和/或添加形成嵌合蛋白的變體。這些變體有可能具有第一型別的HPV L1蛋白的免疫原性、可以形成VLP,且具有良好的產量和可溶性。如此形成的嵌合蛋白亦落入本發明。Those skilled in the art will appreciate that, based on the chimeric protein of the above-described embodiments, variants of the chimeric protein may be formed by mutation, deletion and/or addition of amino acid residues. These variants may have the immunogenicity of the first type of HPV L1 protein, may form VLPs, and have good yield and solubility. The chimeric protein thus formed also falls within the scope of the present invention.
發明的有益效果Beneficial effects of the invention
目前普遍用於生產類病毒顆粒的表達系統分為真核表達系統和原核表達系統。真核表達系統表達的乳頭瘤病毒L蛋白能自發裝配成類病毒顆粒,但具有表達量低不易規模化生產的缺點。原核表達系統表達的乳頭瘤病毒L蛋白的天然構型往往被破壞,需要後期進行體外處理才能得到類病毒顆粒,而且產量較低,很難進行產業化。Currently, the expression systems commonly used to produce virus-like particles are divided into eukaryotic expression systems and prokaryotic expression systems. The papillomavirus L protein expressed by the eukaryotic expression system can spontaneously assemble into virus-like particles, but has the disadvantages of low expression levels and difficulty in large-scale production. The natural configuration of the papillomavirus L protein expressed by the prokaryotic expression system is often destroyed, requiring subsequent in vitro treatment to obtain virus-like particles, and the yield is low, making it difficult to industrialize.
本發明將乳頭瘤病毒(例如人乳頭瘤病毒)L蛋白的C端進行改造,例如替換為HPV 16型L1蛋白、HPV 28型L1蛋白、HPV 33型L1蛋白、HPV 59型L1蛋白或HPV 68型L1蛋白中的C端片段,可以在表達系統(例如宿主細胞,例如昆蟲細胞)中提高乳頭瘤病毒L蛋白的表達量和可溶性。這可用於疫苗例如HPV疫苗的大規模生產。The present invention modifies the C-terminus of the L protein of a papillomavirus (e.g., human papillomavirus), for example, by replacing it with a C-terminal fragment of the L1 protein of HPV 16, HPV 28, HPV 33, HPV 59, or HPV 68, thereby increasing the expression amount and solubility of the L protein of the papillomavirus in an expression system (e.g., a host cell, such as an insect cell). This can be used for the mass production of vaccines, such as HPV vaccines.
發明人自行發現HPV 16型L1蛋白、HPV 28型L1蛋白、HPV 33型L1蛋白、HPV 59型L1蛋白和HPV 68型L1蛋白相較於其他型的L1蛋白表達量和可溶性較好,且發現所述增加的蛋白表達量和可溶性取決於所述HPV L1蛋白的C端序列。在107型HPV L1蛋白中,大部分在C端具有核定位序列,且C端序列具有一定的相似性。The inventors found that the expression and solubility of HPV 16, HPV 28, HPV 33, HPV 59 and HPV 68 L1 proteins were better than those of other types of L1 proteins, and that the increased protein expression and solubility depended on the C-terminal sequence of the HPV L1 protein. Among the 107 types of HPV L1 proteins, most of them had a nuclear localization sequence at the C-terminus, and the C-terminal sequences had certain similarities.
對於目前無法表達、表達量非常低或表達後不可溶的乳頭瘤病毒L蛋白,將其C端片段替換為HPV 16型L1蛋白、HPV 28型L1蛋白、HPV 33型L1蛋白、HPV 59型L1蛋白或HPV 68型L1蛋白中的C端片段,使得原本表達量極低或不可溶的乳頭瘤L蛋白的可溶性表達和後續純化成為可能。這可以用於更多價疫苗(例如HPV疫苗)的大規模生產,使得更全面地預防多種乳頭瘤病毒,特別是HPV的感染成為可能。For papillomavirus L proteins that are currently unable to be expressed, expressed at very low levels, or insoluble after expression, their C-terminal fragments are replaced with the C-terminal fragments of HPV 16 L1 proteins, HPV 28 L1 proteins, HPV 33 L1 proteins, HPV 59 L1 proteins, or HPV 68 L1 proteins, making it possible to express soluble papillomavirus L proteins that were originally expressed at very low levels or were insoluble, and subsequently purify them. This can be used for the large-scale production of more valent vaccines (such as HPV vaccines), making it possible to more comprehensively prevent infections caused by multiple papillomaviruses, especially HPV.
為了實現疫苗的大規模生產,還存在提高HPV L1蛋白在昆蟲細胞中的表達量和可溶性的需求。此外,在酵母細胞中,因為無法正確形成雙硫鍵,HPV L1蛋白裝配成的類病毒顆粒缺乏良好的構型。In order to achieve large-scale production of vaccines, there is also a need to improve the expression and solubility of HPV L1 protein in insect cells. In addition, in yeast cells, HPV L1 protein assembles into virus-like particles that lack a good conformation because disulfide bonds cannot be formed correctly.
對於在昆蟲細胞中表達量低且可溶性差的HPV L1蛋白,將其C端片段改造為HPV 33型或59型L1蛋白的C端片段後,表達量和可溶性顯著提高,可用於HPV疫苗的大規模生產。For HPV L1 protein with low expression level and poor solubility in insect cells, the expression level and solubility are significantly improved after its C-terminal fragment is transformed into the C-terminal fragment of HPV type 33 or type 59 L1 protein, which can be used for large-scale production of HPV vaccine.
對於相較於其他型的L1蛋白在昆蟲細胞中表達量和可溶性較好的HPV L1蛋白,例如HPV 16型、HPV 28型L1蛋白、HPV 68型L1蛋白等,為了實現疫苗的大規模生產,還存在進一步提高表達量和可溶性的需求。在本發明中,例如,將HPV 16型L1蛋白的C端片段改造為HPV 33型L1蛋白的C端片段後,改造後的嵌合的HPV 16型L1蛋白表達量和可溶性均得到改善,有利於HPV疫苗的大規模生產。For HPV L1 proteins that have better expression and solubility in insect cells than other types of L1 proteins, such as HPV 16, HPV 28, HPV 68, etc., there is a need to further improve the expression and solubility in order to achieve large-scale production of vaccines. In the present invention, for example, after the C-terminal fragment of the HPV 16 L1 protein is transformed into the C-terminal fragment of the HPV 33 L1 protein, the expression and solubility of the transformed chimeric HPV 16 L1 protein are improved, which is conducive to the large-scale production of HPV vaccines.
總之,嵌合HPV L1蛋白相比於未改造之前的HPV L1蛋白在昆蟲細胞中的表達量和可溶性大大提高。可用於HPV疫苗的大規模生產。此外,嵌合HPV L1蛋白在昆蟲細胞中可以正確形成雙硫鍵而裝配為具有良好構型的HPV類病毒顆粒。這可以提高HPV類病毒顆粒的免疫原性,產生更好的免疫反應。In summary, the expression amount and solubility of the chimeric HPV L1 protein in insect cells are greatly improved compared to the HPV L1 protein before modification. It can be used for large-scale production of HPV vaccines. In addition, the chimeric HPV L1 protein can correctly form disulfide bonds in insect cells and assemble into HPV virus-like particles with good configuration. This can improve the immunogenicity of HPV virus-like particles and produce better immune responses.
本發明的多價疫苗可以用於預防多種HPV相關疾病或感染,包括目前尚無法預防的HPV型別。The multivalent vaccine of the present invention can be used to prevent a variety of HPV-related diseases or infections, including HPV types that are currently unpreventable.
定義Definition
除非另有說明,本文使用的所有技術和科學術語具有本發明所屬技術領域的通常知識者通常理解的含義。為方便地理解本發明,以下引述下列術語的通常含義。Unless otherwise specified, all technical and scientific terms used herein have the meanings commonly understood by those of ordinary skill in the art to which the present invention belongs. For the convenience of understanding the present invention, the common meanings of the following terms are quoted below.
當用於本文和所附申請專利範圍中時,單數形式「一個/種」、「另一個/種」和「所述/該」包括複數指代對象,除非上下文明確地另有指示。除非另有明確說明,否則術語「包括/包含/具有」、「例如」等旨在傳達包含而非限制。As used herein and in the appended claims, the singular forms "a/an," "another," and "the" include plural referents unless the context clearly indicates otherwise. Unless expressly stated otherwise, the terms "including/comprising/having," "such as," and the like are intended to convey inclusion rather than limitation.
術語「免疫原性」是指某種物質,例如蛋白質或多肽刺激免疫反應的能力,即刺激產生抗體,尤其是產生體液或者刺激細胞介導的反應的能力。The term "immunogenicity" refers to the ability of a substance, such as a protein or peptide, to stimulate an immune response, that is, the ability to stimulate the production of antibodies, especially the production of humoral or cell-mediated responses.
術語「抗體」指能結合抗原的免疫球蛋白分子。抗體可以是多株混合物或單株。抗體可以是源於天然來源或源於重組來源的完整的免疫球蛋白或可以是完整的免疫球蛋白的免疫反應性部分。抗體可以存在於多種形式,包括例如Fv、Fab’、F(ab’)2以及以單鏈存在。The term "antibody" refers to an immunoglobulin molecule that can bind to an antigen. Antibodies can be a mixture of multiple strains or a single strain. Antibodies can be complete immunoglobulins derived from natural sources or from recombinant sources or can be immunoreactive portions of complete immunoglobulins. Antibodies can exist in a variety of forms, including, for example, Fv, Fab', F(ab')2, and as single chains.
術語「抗原性」是指某種物質,例如蛋白質或多肽產生與其特異性結合的抗體的能力。The term "antigenicity" refers to the ability of a substance, such as a protein or peptide, to generate antibodies that specifically bind to it.
術語「表位」包括能夠特異性結合至抗體或T細胞受體的任何蛋白質決定位。表位決定位通常由分子的化學活性表面基團(例如胺基酸或糖側鏈,或其組合)組成,並且通常具有特定三維結構特徵以及特定的電荷特徵。The term "epitope" includes any protein determinant capable of specific binding to an antibody or T-cell receptor. Epitope determinants usually consist of chemically active surface groups of a molecule (such as amino acids or sugar side chains, or a combination thereof) and usually have specific three-dimensional structural characteristics as well as specific charge characteristics.
術語「亞型」或「型別」可在本文中互換使用,表示所述病毒抗原的遺傳變體以使得一個亞型區別於一個不同亞型地被免疫系統識別。例如,HPV 16在免疫學上可區別於HPV 33。The terms "subtype" or "type" are used interchangeably herein to refer to the genetic variation of the viral antigens that allows one subtype to be distinguished from a different subtype by the immune system. For example, HPV 16 can be distinguished immunologically from HPV 33.
術語「HPV L1蛋白」如本文所用,術語「HPV」和「人乳頭狀瘤病毒」是指乳頭狀瘤病毒科的無包膜雙鏈DNA病毒。它們的基因組是圓形的,並且大小約為8千鹼基對。大多數HPV編碼八種主要蛋白,六種位於「早期」區域(E1-E2),並且兩種位於「晚期」區域(L1(主要衣殼蛋白)和L2(次要衣殼蛋白))。已經鑒定了超過120種HPV類型,並且它們由數字標出(例如,HPV-16、HPV-18等)。Term "HPV L1 protein" As used herein, the terms "HPV" and "human papillomavirus" refer to non-enveloped double-stranded DNA viruses of the Papillomaviridae family. Their genome is round and about 8 kilobase pairs in size. Most HPVs encode eight major proteins, six located in the "early" region (E1-E2) and two located in the "late" region (L1 (major capsid protein) and L2 (minor capsid protein)). More than 120 HPV types have been identified, and they are designated by numbers (e.g., HPV-16, HPV-18, etc.).
術語「HPV」或「HPV病毒」指乳頭狀瘤病毒科的乳頭狀瘤病毒,為無包膜DNA病毒,該病毒基因組為雙鏈閉環DNA,大小約為8 kb, 通常可以分為三個區域:①早期區(E),含有編碼E1、E2、E4~E7病毒複製,轉錄及轉譯有關的非結構蛋白的6個開放閱讀框,以及E3和E8開放閱讀框;②晚期區(L)含有編碼主要衣殼蛋白L1和次要衣殼蛋白L2的閱讀框;③長調控區(LCR)不編碼任何蛋白,但具有複製的起源以及多個轉錄因子結合位點。The term "HPV" or "HPV virus" refers to the papillomavirus of the Papillomaviridae family, which is a non-enveloped DNA virus with a double-stranded closed circular DNA genome of approximately 8 kb in size, which can usually be divided into three regions: ① The early region (E), which contains six open reading frames encoding E1, E2, E4-E7 non-structural proteins related to viral replication, transcription and translation, as well as E3 and E8 open reading frames; ② The late region (L) contains reading frames encoding the major capsid protein L1 and the minor capsid protein L2; ③ The long regulatory region (LCR) does not encode any protein, but has the origin of replication and multiple transcription factor binding sites.
術語「HPV L1蛋白」及「HPV L2蛋白」指由HPV基因的晚期區(L)編碼,在HPV感染週期中晚期合成的蛋白。L1蛋白質是主要的衣殼蛋白並且具有55‑60 kDa的分子量。L2蛋白質是次要的衣殼蛋白質。72個L1五聚體構成二十面體HPV病毒粒子的外殼,包裹閉環雙鏈DNA微染色體。L2蛋白質位於L1蛋白質內側。The terms "HPV L1 protein" and "HPV L2 protein" refer to proteins encoded by the late region (L) of the HPV gene and synthesized in the late phase of the HPV infection cycle. L1 protein is the major capsid protein and has a molecular weight of 55‑60 kDa. L2 protein is the minor capsid protein. 72 L1 pentamers form the outer shell of the icosahedral HPV virion, enclosing the closed double-stranded DNA minichromosome. L2 protein is located on the inner side of L1 protein.
術語「類病毒顆粒」是含有某種病毒的一個或多個結構蛋白的空心顆粒,沒有病毒核酸。The term "virus-like particle" refers to a hollow particle containing one or more structural proteins of a virus but without viral nucleic acid.
「HPV假病毒」係利用HPV VLP的非特異包裹核酸的特性,透過細胞內表達的HPV L1和L2組成的VLP包裹游離的DNA或導入外源質體形成。是理想的HPV體外中和實驗模型。HPV pseudoviruses utilize the non-specific nucleic acid-encapsulating properties of HPV VLPs, and are formed by encapsulating free DNA or introducing exogenous plasmids with VLPs composed of HPV L1 and L2 expressed intracellularly. It is an ideal HPV in vitro neutralization experimental model.
「假病毒中和法」是評價抗體的中和活性的一種方法,將免疫後的動物血清與一定量的假病毒孵育後再侵染細胞,細胞會隨著血清中中和抗體的增加而減少,在一定的範圍內可存在線性負相關,因此可以透過檢測表達細胞數的變化來評價血清中抗體的中和活性。The "pseudovirus neutralization method" is a method for evaluating the neutralizing activity of antibodies. The immunized animal serum is incubated with a certain amount of pseudovirus and then infects cells. The number of cells will decrease as the neutralizing antibodies in the serum increase. Within a certain range, there can be a linear negative correlation. Therefore, the neutralizing activity of antibodies in the serum can be evaluated by detecting changes in the number of expressing cells.
術語「其片段」或「其變體」指根據本發明的部分核苷酸或胺基酸序列被缺失、插入和/或取代。較佳地,本發明提供的多肽的片段或變體能在動物或人體中引發體液和/或細胞免疫反應。The term "fragment thereof" or "variant thereof" refers to a portion of the nucleotide or amino acid sequence according to the present invention that is deleted, inserted and/or substituted. Preferably, the fragment or variant of the polypeptide provided by the present invention can induce humoral and/or cellular immune response in animals or humans.
術語「嵌合」意指,源自不同的親本分子的多肽或核苷酸序列分別經由醯胺鍵或3’,5’-磷酸二酯鍵連接在一起。較佳的,不被額外的連接子序列分隔開,而是直接彼此相鄰。The term "chimeric" means that polypeptide or nucleotide sequences derived from different parent molecules are linked together via amide bonds or 3', 5'-phosphodiester bonds, respectively. Preferably, they are not separated by additional linker sequences but are directly adjacent to each other.
術語「截短」意指透過從多肽的N和/ 或C-末端除去一個或多個胺基酸或者缺失一個或多個多肽內部的胺基酸。The term "truncation" refers to the removal of one or more amino acids from the N- and/or C-terminus of a polypeptide or the deletion of one or more amino acids within a polypeptide.
術語「核定位序列」為可引導蛋白質進入細胞核的胺基酸序列。在一些HPV L1蛋白中,兩個緊密的鹼性殘基位(即核定位序列)(例如一個是KRKR、KRKK、KRKRK、KRKKRK、KRVKRRK等,另一個是KR、RKR、KRK等)之間具有10-14個胺基酸的間隔區。上述鹼性殘基位屬於核定位序列。在另一些HPV L1蛋白中,核定位序列為精胺酸和/或賴胺酸形成的緊密的鹼性殘基位。核定位序列包括但不限於如上所述鹼性殘基位的實例。參見Jun Yang等,Predicting the Nuclear Localization Signals of 107 Types of HPV L1 Proteins by Bioinformatic Analysis,Genomics, Proteomics & Bioinformatics Volume 4, Issue 1, 2006, Pages 34-41,其全部內容透過引用併入本文。The term "nuclear localization sequence" is an amino acid sequence that can guide a protein into the cell nucleus. In some HPV L1 proteins, there is a spacer of 10-14 amino acids between two close basic residues (i.e., nuclear localization sequences) (e.g., one is KRKR, KRKK, KRKRK, KRKKRK, KRVKRRK, etc., and the other is KR, RKR, KRK, etc.). The above basic residues belong to the nuclear localization sequence. In other HPV L1 proteins, the nuclear localization sequence is a close basic residue formed by arginine and/or lysine. The nuclear localization sequence includes but is not limited to the examples of basic residues as described above. See Jun Yang et al., Predicting the Nuclear Localization Signals of 107 Types of HPV L1 Proteins by Bioinformatic Analysis, Genomics, Proteomics & Bioinformatics Volume 4, Issue 1, 2006, Pages 34-41, the entire contents of which are incorporated herein by reference.
術語「功能性變體」為某一多肽或蛋白經截短、突變、缺失和/或添加後仍然保持所需要的活性或特徵的版本。The term "functional variant" refers to a version of a polypeptide or protein that has been truncated, mutated, deleted and/or added but still retains the desired activity or characteristic.
兩條多肽或核酸序列之間的「序列同一性」表示所述序列之間相同的殘基的數目占殘基總數的百分比,且基於比較的分子中較小者的大小來計算。在計算同一性百分數時,將正在比較的序列以產生序列之間最大匹配的方式比對,透過特定算法解決比對中的空位(如果存在的話)。確定兩個序列之間同一性的較佳電腦程式方法包括,但不限於,GCG程序包,包括GAP、BLASTP、BLASTN和FASTA(Altschul等人,1990,J.Mol.Biol.215:403-410)。上述程式可以公開地從國際生物技術信息中心(NCBI)和其他來源得到。習知的Smith Waterman算法也可用於確定同一性。"Sequence identity" between two polypeptide or nucleic acid sequences refers to the percentage of the total number of residues that are identical between the sequences and is calculated based on the size of the smaller of the molecules being compared. In calculating percent identity, the sequences being compared are aligned in a manner that produces the largest match between the sequences, and gaps in the alignment, if any, are resolved by a particular algorithm. Preferred computer program methods for determining the identity between two sequences include, but are not limited to, the GCG program package, including GAP, BLASTP, BLASTN, and FASTA (Altschul et al., 1990, J. Mol. Biol. 215: 403-410). The above programs are publicly available from the National Center for Biotechnology Information (NCBI) and other sources. The well-known Smith Waterman algorithm can also be used to determine identity.
可以保守性置換非關鍵的胺基酸而不影響蛋白質的正常功能。保守性置換意指用化學或功能相似的胺基酸置換胺基酸。提供相似胺基酸的保守性置換表是本領域熟知的。舉例來說,在一些實施方式中,表1-3中提供的胺基酸組被認為是相互的保守性置換。Non-critical amino acids can be conservatively replaced without affecting the normal function of the protein. Conservative replacement means replacing an amino acid with an amino acid that is chemically or functionally similar. Conservative replacement tables that provide similar amino acids are well known in the art. For example, in some embodiments, the amino acid groups provided in Tables 1-3 are considered to be mutually conservative replacements.
表1 在某些實施方式中,被認為是相互保守性置換的胺基酸的所選組
表2 在某些實施方式中,被認為是相互的保守性置換的胺基酸的其他所選組
表3 在某些實施方式中,被認為是相互的保守性置換的胺基酸的其他所選組
術語「胺基酸」意指二十種常見的天然存在的胺基酸。天然存在的胺基酸包括丙胺酸(Ala;A)、精胺酸(Arg;R)、天冬醯胺酸(Asn;N)、天冬胺酸(Asp;D)、胱胺酸(Cys;C);麩胺酸(Glu;E)、麩醯胺酸(Gln;Q)、甘胺酸(Gly;G)、組胺酸(His;H)、異白胺酸(Ile;I)、白胺酸(Leu;L)、離胺酸(Lys;K)、甲基胺酸(Met;M)、苯丙胺酸(Phe;F)、脯胺酸(Pro;P)、絲胺酸(Ser;S)、蘇胺酸(Thr;T)、色胺酸(Trp;W)、酪胺酸(Tyr;Y)和纈胺酸(Val;V)。The term "amino acid" refers to the twenty common naturally occurring amino acids. Naturally occurring amino acids include alanine (Ala; A), arginine (Arg; R), asparagine (Asn; N), aspartic acid (Asp; D), cystine (Cys; C); glutamine (Glu; E), glutamine (Gln; Q), glycine (Gly; G), histidine (His; H), isoleucine (Ile; I), leucine (Leu; L), lysine (Lys; K), methylamine (Met; M), phenylalanine (Phe; F), proline (Pro; P), serine (Ser; S), threonine (Thr; T), tryptophan (Trp; W), tyrosine (Tyr; Y), and valeric acid (Val; V).
如本文所用的「生理學上可接受的載體」,其對所用劑量和濃度的細胞或哺乳動物是無毒的。通常為pH緩衝含水溶液,其非限制性實例包括緩衝劑;抗氧化劑;寡肽;蛋白質;親水性聚合物;胺基酸;單糖、二糖和其它碳水化合物;螯合劑;糖醇;成鹽的抗衡離子,例如鈉;以及/或非離子表面活性劑。As used herein, a "physiologically acceptable carrier" is non-toxic to cells or mammals at the dosage and concentration used. It is usually a pH-buffered aqueous solution, and non-limiting examples include buffers; antioxidants; oligopeptides; proteins; hydrophilic polymers; amino acids; monosaccharides, disaccharides and other carbohydrates; chelating agents; sugar alcohols; salt-forming counterions, such as sodium; and/or non-ionic surfactants.
術語「佐劑」指一種增強免疫反應的化合物或混合物。特別的,疫苗可以包含佐劑。用於本發明的佐劑可以包括但不限於以下的一種或多種:含礦物佐劑組合物、油-乳佐劑、皂素佐劑製劑、細菌或微生物衍生物。The term "adjuvant" refers to a compound or mixture that enhances an immune response. In particular, a vaccine may contain an adjuvant. The adjuvant used in the present invention may include, but is not limited to, one or more of the following: a mineral adjuvant composition, an oil-emulsion adjuvant, a saponin adjuvant preparation, a bacterial or microbial derivative.
術語「載體」意指能夠增殖與其連接的另一核酸的核酸分子。該術語包括作為自我複製核酸結構的載體以及作為整合至已引入載體的宿主細胞基因組中的載體。某些載體能夠引導該類載體以可操作方式連接的核酸的表達。The term "vector" means a nucleic acid molecule capable of propagating another nucleic acid to which it is linked. The term includes vectors as self-replicating nucleic acid structures as well as vectors that integrate into the genome of a host cell into which the vector has been introduced. Certain vectors are capable of directing the expression of a nucleic acid to which such vector is operably linked.
術語「宿主細胞」意指已引入外源核酸的細胞,以及這樣的細胞的後代。宿主細胞包括「轉化體」(或「轉化細胞」)、「轉染體」(或「轉染細胞」)或「感染體」(或「感染細胞」),其各自包括初代轉化、轉染或感染的細胞和由其衍生的後代。這樣的後代在核酸含量上可能不與親本細胞完全相同,並且可能含有突變。The term "host cell" means a cell into which an exogenous nucleic acid has been introduced, as well as the progeny of such a cell. Host cells include "transformants" (or "transformed cells"), "transfectants" (or "transfected cells"), or "infected bodies" (or "infected cells"), each of which includes the primary transformed, transfected or infected cell and the progeny derived therefrom. Such progeny may not be completely identical to the parent cell in terms of nucleic acid content, and may contain mutations.
施用量較佳的為「預防性有效量」(本文預防可以被認為是治療,兩者可互換使用),其足以對個體顯示出益處。The dosage is preferably a "prophylactically effective amount" (prevention herein may be considered treatment, and the two terms are used interchangeably), which is sufficient to show benefit to an individual.
實施例Embodiment
實施例1 基因的構建Example 1 Gene construction
實施例1.1:HPV6L1 C端替換為HPV33L1 C端的嵌合基因的構建Example 1.1: Construction of a chimeric gene in which the HPV6 L1 C-terminus is replaced by the HPV33 L1 C-terminus
1.1.1 用作模板的pFB-HPV6L1的構建1.1.1 Construction of pFB-HPV6L1 as template
委託Thermo Fisher公司(原英濰捷基(上海)貿易有限公司)基因合成HPV6L1基因,且合成的序列兩端分別具有KpnⅠ和XbaI酶切位點,其序列見SEQ ID NO: 5。透過KpnⅠ和XbaI酶切位點將合成的基因片段與pcDNA3載體(銷售商Thermo Fisher)連接,得到含有編碼HPV6L1 1-500個胺基酸的核苷酸序列的質體pcDNA3-HPV6-L1。Thermo Fisher (formerly Invitrogen (Shanghai) Trading Co., Ltd.) was commissioned to synthesize the HPV6L1 gene, and the synthesized sequence had KpnⅠ and XbaI restriction sites at both ends, and its sequence is shown in SEQ ID NO: 5. The synthesized gene fragment was connected to the pcDNA3 vector (seller Thermo Fisher) through the KpnⅠ and XbaI restriction sites to obtain the plasmid pcDNA3-HPV6-L1 containing the nucleotide sequence encoding the 1-500 amino acids of HPV6L1.
將得到的pcDNA3-HPV6-L1質體進行KpnⅠ和XbaI雙酶切得到HPV6L1(1-500)的基因的片段。再將該片段與KpnⅠ和XbaI雙酶切的pFastBacTM1載體(銷售商Thermo Fisher)進行連接,得到含HPV6L1(1-500)基因片段的桿粒載體,命名為pFB-HPV6L1。The obtained pcDNA3-HPV6-L1 plasmid was double-digested with KpnⅠ and XbaI to obtain the HPV6L1 (1-500) gene fragment. The fragment was then ligated with the pFastBacTM1 vector (seller: Thermo Fisher) double-digested with KpnⅠ and XbaI to obtain a bacmid vector containing the HPV6L1 (1-500) gene fragment, named pFB-HPV6L1.
1.1.2 用作模板的pFB-HPV33L1的構建1.1.2 Construction of pFB-HPV33L1 as template
委託Thermo Fisher公司(原英濰捷基(上海)貿易有限公司)基因合成HPV33L1基因,且合成序列兩端分別具有KpnI和XbaI酶切位點,基因片段序列見SEQ ID NO: 6。透過KpnI和XbaI酶切位點將合成的基因片段與pcDNA3載體(銷售商Thermo Fisher)連接,得到含有編碼HPV33L1 1-499位胺基酸的核苷酸序列的質體pcDNA3-HPV33-L1。Thermo Fisher (formerly Invitrogen (Shanghai) Trading Co., Ltd.) was commissioned to synthesize the HPV33L1 gene, and the two ends of the synthetic sequence had KpnI and XbaI restriction sites, respectively. The gene fragment sequence is shown in SEQ ID NO: 6. The synthesized gene fragment was connected to the pcDNA3 vector (seller Thermo Fisher) through the KpnI and XbaI restriction sites to obtain the plasmid pcDNA3-HPV33-L1 containing the nucleotide sequence encoding the 1-499 amino acids of HPV33L1.
將得到的pcDNA3-HPV33-L1質體進行KpnI和XbaI雙酶切得到HPV33L1(1-499)的基因的片段。再將該片段與KpnI和XbaI雙酶切的pFastBacTM1載體(銷售商Thermo Fisher)進行連接,得到含HPV33L1(1-499)基因片段的桿粒載體,命名為pFB-HPV33L1。The obtained pcDNA3-HPV33-L1 plasmid was double-digested with KpnI and XbaI to obtain the gene fragment of HPV33L1 (1-499). The fragment was then ligated with the pFastBacTM1 vector (seller: Thermo Fisher) double-digested with KpnI and XbaI to obtain a bacmid vector containing the HPV33L1 (1-499) gene fragment, named pFB-HPV33L1.
1.1.3 pFB-HPV6L1:33C的構建1.1.3 Construction of pFB-HPV6L1:33C
HPV6L1 C端替換為HPV33L1 C端的嵌合基因:以構建成功的重組質體pFB-HPV6L1為基因模板,用引子F1和R1擴增長度為1426bp基因片段,引子序列F1如SEQ ID No: 7所示,R1如SEQ ID No: 8所示。The chimeric gene in which the C-terminus of HPV6L1 is replaced by the C-terminus of HPV33L1: the successfully constructed recombinant plasmid pFB-HPV6L1 is used as a gene template, and primers F1 and R1 are used to amplify a gene fragment with a length of 1426 bp. The primer sequence F1 is shown in SEQ ID No: 7, and the primer sequence R1 is shown in SEQ ID No: 8.
該基因片段包含編碼HPV6L1的1-469胺基酸的基因片段、與HPV33L1的474-499胺基酸的基因片段重疊的10個鹼基以及KpnI酶切位點(GGTAC^C)段,擴增的序列如SEQ ID No: 9所示:The gene fragment includes a gene fragment encoding amino acids 1-469 of HPV6L1, 10 bases overlapping with the gene fragment of amino acids 474-499 of HPV33L1, and a KpnI restriction site (GGTAC^C). The expanded sequence is shown in SEQ ID No: 9:
PCR擴增參數:94℃預變性 5min;98℃變性 10s、69℃退火 15s、72℃ 1kb/1min、進行30個循環;72℃延伸 5min;16℃結束。PCR amplification parameters: 94°C pre-denaturation for 5 min; 98°C denaturation for 10 s, 69°C annealing for 15 s, 72°C 1 kb/1 min, 30 cycles; 72°C extension for 5 min; 16°C termination.
以重組質體pFB-HPV33L1為基因模板,用引子F2和R2,擴增長度101 bp的基因片段,引子序列F2如SEQ ID No: 10所示,R2如SEQ ID No: 11所示。The recombinant plasmid pFB-HPV33L1 was used as a gene template and primers F2 and R2 were used to amplify a gene fragment of 101 bp in length. The primer sequence F2 is shown in SEQ ID No: 10, and the primer sequence R2 is shown in SEQ ID No: 11.
該基因片段含HPV33L1 C端的26個(474-499)胺基酸的基因片段、與HPV6L1的1-469胺基酸C端基因片段重疊的10bp鹼基以及XbaI(T^CTAGA)酶切位點,擴增的序列如SEQ ID No: 12所示。The gene fragment contains a gene fragment of 26 (474-499) amino acids at the C-terminus of HPV33L1, a 10 bp base overlapping with the 1-469 amino acids of the C-terminal gene fragment of HPV6L1, and an XbaI (T^CTAGA) restriction site. The amplified sequence is shown in SEQ ID No: 12.
PCR擴增參數:94℃預變性 5min;98℃變性 10s、69℃退火 15s、72℃ 1kb/1min、進行 30個循環;72℃延伸 5min;16℃結束。PCR amplification parameters: 94°C pre-denaturation for 5 min; 98°C denaturation for 10 s, 69°C annealing for 15 s, 72°C 1 kb/1 min, 30 cycles; 72°C extension for 5 min; 16°C termination.
PCR拼接序列:PCR splicing sequence:
拼接引子分別為F1和R2,以上述引子擴增得到的片段(F1和R1擴增得到的片段,F2和R2擴增得到的片段)為模板。The splicing primers are F1 and R2, and the fragments amplified by the above primers (fragments amplified by F1 and R1, and fragments amplified by F2 and R2) are used as templates.
PCR拼接參數:94℃預變性 5min;98℃變性 10s、52℃退火 15s、72℃ 1kb/1min、進行 5個循環;98℃變性 10s、68℃退火 15s、72℃ 1kb/1min、進行 25個循環;72℃延伸 5min;16℃結束。PCR splicing parameters: 94°C pre-denaturation for 5 min; 98°C denaturation for 10 s, 52°C annealing for 15 s, 72°C 1 kb/1 min, for 5 cycles; 98°C denaturation for 10 s, 68°C annealing for 15 s, 72°C 1 kb/1 min, for 25 cycles; 72°C extension for 5 min; 16°C end.
最終得到SEQ ID NO: 4,編碼由HPV6L1的1-469胺基酸和HPV33L1 C端的26個(474-499)胺基酸組成的核苷酸序列,兩端帶有KpnI和XbaI酶切位點(下稱拼接序列)。Finally, SEQ ID NO: 4 was obtained, encoding a nucleotide sequence consisting of amino acids 1-469 of HPV6L1 and 26 amino acids (474-499) of the C-terminus of HPV33L1, with KpnI and XbaI restriction sites at both ends (hereinafter referred to as the splicing sequence).
用KpnI+XbaI雙酶切pFastBacTM1載體和拼接序列片段,將拼接序列選殖到pFastBacTM1載體上,獲得重組質體pFB-HPV6L1:33C。即為HPV6L1 C端替換為HPV33L1 C端的嵌合基因。The pFastBacTM1 vector and the splicing sequence fragment were double-digested with KpnI+XbaI, and the splicing sequence was cloned into the pFastBacTM1 vector to obtain the recombinant plasmid pFB-HPV6L1:33C, which is a chimeric gene in which the C-terminus of HPV6L1 is replaced by the C-terminus of HPV33L1.
實施例1.2 HPV11L1 C端替換為HPV33L1 C端的嵌合基因的構建Example 1.2 Construction of a chimeric gene in which the HPV11L1 C-terminus is replaced by the HPV33L1 C-terminus
實驗方法和步驟與實施例1.1相同,相關序列參見附錄2。The experimental methods and steps are the same as those in Example 1.1, and the relevant sequences are shown in Appendix 2.
實施例1.3 HPV16L1 C端替換為HPV33L1 C端的嵌合基因的構建Example 1.3 Construction of a chimeric gene in which the HPV16L1 C-terminus is replaced by the HPV33L1 C-terminus
實驗方法和步驟與實施例1.1相同,相關序列參見附錄3。The experimental methods and steps are the same as those in Example 1.1, and the relevant sequences are shown in Appendix 3.
實施例1.4 HPV18L1 C端替換為HPV33L1 C端的嵌合基因的構建Example 1.4 Construction of a chimeric gene in which the HPV18 L1 C-terminus is replaced by the HPV33 L1 C-terminus
實驗方法和步驟與實施例1.1相同,相關序列參見附錄4。The experimental methods and steps are the same as those in Example 1.1, and the relevant sequences are shown in Appendix 4.
實施例1.5 HPV31L1 C端替換為HPV33L1 C端的嵌合基因的構建Example 1.5 Construction of a chimeric gene in which the HPV31L1 C-terminus is replaced by the HPV33L1 C-terminus
實驗方法和步驟與實施例1.1相同,相關序列參見附錄5。The experimental methods and steps are the same as those in Example 1.1, and the relevant sequences are shown in Appendix 5.
實施例1.6 HPV33L1基因的構建Example 1.6 Construction of HPV33L1 gene
1.6.1 pFB-HPV33L1基因的製備1.6.1 Preparation of pFB-HPV33L1 gene
HPV33L1基因透過基因合成的方法構建而成,委託Thermo Fisher公司(原英濰捷基(上海)貿易有限公司)基因合成,合成序列兩端分別具有KpnⅠ和XbaI酶切位點,基因片段序列見SEQ ID No: 68。透過KpnⅠ和XbaI酶切位點將合成的基因片段與pcDNA3載體(銷售商Thermofisher)連接,得到含有編碼HPV33L1 1-499個胺基酸的核苷酸序列的質體pcDNA3-HPV33-L1。The HPV33L1 gene was constructed by gene synthesis and commissioned to Thermo Fisher (formerly Invitrogen (Shanghai) Trading Co., Ltd.). The two ends of the synthetic sequence have KpnⅠ and XbaI restriction sites, respectively. The gene fragment sequence is shown in SEQ ID No: 68. The synthesized gene fragment was connected to the pcDNA3 vector (seller Thermofisher) through the KpnⅠ and XbaI restriction sites to obtain the plasmid pcDNA3-HPV33-L1 containing the nucleotide sequence encoding the 1-499 amino acids of HPV33L1.
將得到的pcDNA3-HPV33-L1質體進行KpnⅠ和XbaI雙酶切得到HPV33L1 (1-499)的基因的片段。再將該片段與KpnⅠ和XbaI雙酶切的pFastBacTM1載體(銷售商Thermofisher)進行連接,得到含HPV33L1 (1-499)基因片段的桿粒載體,命名為pFB-HPV33L1。The obtained pcDNA3-HPV33-L1 plasmid was double-digested with KpnⅠ and XbaI to obtain the gene fragment of HPV33L1 (1-499). The fragment was then ligated with the pFastBacTM1 vector (seller: Thermofisher) double-digested with KpnⅠ and XbaI to obtain a bacmid vector containing the HPV33L1 (1-499) gene fragment, named pFB-HPV33L1.
實施例1.7 HPV35L1 C端替換為HPV33L1 C端的嵌合基因的構建Example 1.7 Construction of a chimeric gene in which the HPV35L1 C-terminus is replaced by the HPV33L1 C-terminus
實驗方法和步驟與實施例1.1相同,相關序列參見附錄7。The experimental methods and steps are the same as those in Example 1.1, and the relevant sequences are shown in Appendix 7.
實施例1.8 HPV39L1 C端替換為HPV59L1 C端的嵌合基因的構建Example 1.8 Construction of a chimeric gene in which the HPV39L1 C-terminus is replaced by the HPV59L1 C-terminus
1.8.1 用作模板的pFB-HPV39L1的構建1.8.1 Construction of pFB-HPV39L1 as template
委託Thermo Fisher公司(原英濰捷基(上海)貿易有限公司)基因合成HPV39L1基因,且合成的序列兩端分別具有KpnⅠ和XbaI酶切位點,其序列見SEQ ID NO: 86。透過KpnⅠ和XbaI酶切位點將合成的基因片段與pcDNA3載體(銷售商Thermo Fisher)連接,得到含有編碼HPV39L1 1-505個胺基酸的核苷酸序列的質體pcDNA3-HPV39-L1。Thermo Fisher (formerly Invitrogen (Shanghai) Trading Co., Ltd.) was commissioned to synthesize the HPV39L1 gene, and the synthesized sequence had KpnⅠ and XbaI restriction sites at both ends, and its sequence is shown in SEQ ID NO: 86. The synthesized gene fragment was connected to the pcDNA3 vector (seller Thermo Fisher) through the KpnⅠ and XbaI restriction sites to obtain the plasmid pcDNA3-HPV39-L1 containing the nucleotide sequence encoding the 1-505 amino acids of HPV39L1.
將得到的pcDNA3-HPV39-L1質體進行KpnⅠ和XbaI雙酶切得到HPV39L1(1-505)的基因的片段。再將該片段與KpnⅠ和XbaI雙酶切的pFastBacTM1載體(銷售商Thermo Fisher)進行連接,得到含HPV39L1(1-505)基因片段的桿粒載體,命名為pFB-HPV39L1。The obtained pcDNA3-HPV39-L1 plasmid was double-digested with KpnⅠ and XbaI to obtain the gene fragment of HPV39L1 (1-505). The fragment was then ligated with the pFastBacTM1 vector (seller: Thermo Fisher) double-digested with KpnⅠ and XbaI to obtain a bacmid vector containing the HPV39L1 (1-505) gene fragment, named pFB-HPV39L1.
1.8.2 用作模板的pFB-HPV59L1的構建1.8.2 Construction of pFB-HPV59L1 as template
委託Thermo Fisher公司(原英濰捷基(上海)貿易有限公司)基因合成HPV59L1基因,且合成序列兩端分別具有KpnI和XbaI酶切位點,基因片段序列見SEQ ID NO: 87。透過KpnI和XbaI酶切位點將合成的基因片段與pcDNA3載體(銷售商Thermo Fisher)連接,得到含有編碼HPV59L1 1-508位胺基酸的核苷酸序列的質體pcDNA3-HPV59-L1。Thermo Fisher (formerly Invitrogen (Shanghai) Trading Co., Ltd.) was commissioned to synthesize the HPV59L1 gene, and the two ends of the synthetic sequence had KpnI and XbaI restriction sites, respectively. The gene fragment sequence is shown in SEQ ID NO: 87. The synthesized gene fragment was connected to the pcDNA3 vector (seller Thermo Fisher) through the KpnI and XbaI restriction sites to obtain the plasmid pcDNA3-HPV59-L1 containing the nucleotide sequence encoding the 1-508 amino acids of HPV59L1.
將得到的pcDNA3-HPV59-L1質體進行KpnI和XbaI雙酶切得到HPV59L1(1-508)的基因的片段。再將該片段與KpnI和XbaI雙酶切的pFastBacTM1載體(銷售商Thermo Fisher)進行連接,得到含HPV59L1(1-508)基因片段的桿粒載體,命名為pFB-HPV59L1。The obtained pcDNA3-HPV59-L1 plasmid was double-digested with KpnI and XbaI to obtain the gene fragment of HPV59L1 (1-508). The fragment was then ligated with the pFastBacTM1 vector (seller: Thermo Fisher) double-digested with KpnI and XbaI to obtain a bacmid vector containing the HPV59L1 (1-508) gene fragment, named pFB-HPV59L1.
1.8.3 pFB-HPV39L1:59C的構建1.8.3 Construction of pFB-HPV39L1:59C
HPV39L1 C端替換為HPV59L1 C端的嵌合基因:以構建成功的重組質體pFB-HPV39L1為基因模板,用引子F1和R1擴增長度為1428 bp基因片段,引子序列F1如SEQ ID No: 88所示,R1如SEQ ID No: 89所示。The chimeric gene in which the C-terminus of HPV39L1 is replaced by the C-terminus of HPV59L1: the successfully constructed recombinant plasmid pFB-HPV39L1 is used as the gene template, and the primers F1 and R1 are used to amplify the gene fragment with a length of 1428 bp. The primer sequence F1 is shown in SEQ ID No: 88, and the primer sequence R1 is shown in SEQ ID No: 89.
該基因片段包含編碼HPV39L1的1-469胺基酸的基因片段、與HPV59L1的471-508胺基酸的基因片段重疊的12個鹼基以及KpnI酶切位點(GGTAC^C)段,擴增的序列如SEQ ID No: 90所示:The gene fragment includes a gene fragment encoding amino acids 1-469 of HPV39L1, 12 bases overlapping with the gene fragment of amino acids 471-508 of HPV59L1, and a KpnI restriction site (GGTAC^C), and the expanded sequence is shown in SEQ ID No: 90:
PCR擴增參數:94℃預變性 5min;98℃變性 10s、69℃退火 15s、72℃ 1kb/1min、進行30個循環;72℃延伸 5min;16℃結束。PCR amplification parameters: 94°C pre-denaturation for 5 min; 98°C denaturation for 10 s, 69°C annealing for 15 s, 72°C 1 kb/1 min, 30 cycles; 72°C extension for 5 min; 16°C termination.
以重組質體pFB-HPV59L1為基因模板,用引子F2和R2,擴增長度139bp的基因片段,引子序列F2如SEQ ID No: 91所示,R2如SEQ ID No: 92所示。The recombinant plasmid pFB-HPV59L1 was used as a gene template and primers F2 and R2 were used to amplify a gene fragment of 139 bp in length. The primer sequence F2 is shown in SEQ ID No: 91, and the primer sequence R2 is shown in SEQ ID No: 92.
該基因片段含HPV59L1 C端的38個(471-508)胺基酸的基因片段、與HPV39L1的1-469胺基酸C端基因片段重疊的12bp鹼基以及XbaI(T^CTAGA)酶切位點,擴增的序列如SEQ ID No: 93所示。The gene fragment contains a gene fragment of 38 (471-508) amino acids at the C-terminus of HPV59L1, a 12 bp base overlapping with the 1-469 amino acids of the C-terminal gene fragment of HPV39L1, and an XbaI (T^CTAGA) restriction site. The amplified sequence is shown in SEQ ID No: 93.
PCR擴增參數:94℃預變性 5min;98℃變性 10s、69℃退火 15s、72℃ 1kb/1min、進行 30個循環;72℃延伸 5min;16℃結束。PCR amplification parameters: 94°C pre-denaturation for 5 min; 98°C denaturation for 10 s, 69°C annealing for 15 s, 72°C 1 kb/1 min, 30 cycles; 72°C extension for 5 min; 16°C termination.
PCR拼接序列:PCR splicing sequence:
拼接引子分別為F1和R2,以上述引子擴增得到的片段(F1和R1擴增得到的片段,F2和R2擴增得到的片段)為模板。The splicing primers are F1 and R2, and the fragments amplified by the above primers (fragments amplified by F1 and R1, and fragments amplified by F2 and R2) are used as templates.
PCR拼接參數:94℃預變性 5min;98℃變性 10s、52℃退火 15s、72℃ 1kb/1min、進行 5個循環;98℃變性 10s、68℃退火 15s、72℃ 1kb/1min、進行 25個循環;72℃延伸 5min;16℃結束。PCR splicing parameters: 94°C pre-denaturation for 5 min; 98°C denaturation for 10 s, 52°C annealing for 15 s, 72°C 1 kb/1 min, for 5 cycles; 98°C denaturation for 10 s, 68°C annealing for 15 s, 72°C 1 kb/1 min, for 25 cycles; 72°C extension for 5 min; 16°C end.
最終得到SEQ ID NO: 85,編碼由HPV39L1的1-469胺基酸和HPV59L1 C端的38個(471-508)胺基酸組成的核苷酸序列,兩端帶有KpnI和XbaI酶切位點(下稱拼接序列)。Finally, SEQ ID NO: 85 was obtained, encoding a nucleotide sequence consisting of amino acids 1-469 of HPV39L1 and 38 amino acids (471-508) at the C-terminus of HPV59L1, with KpnI and XbaI restriction sites at both ends (hereinafter referred to as splicing sequence).
用KpnI+XbaI雙酶切pFastBacTM1載體和拼接序列片段,將拼接序列克隆到pFastBacTM1載體上,獲得重組質體pFB-HPV39L1:59C。即為HPV39L1 C端替換為HPV59L1 C端的嵌合基因。The pFastBacTM1 vector and the splicing sequence fragment were double-digested with KpnI+XbaI, and the splicing sequence was cloned into the pFastBacTM1 vector to obtain the recombinant plasmid pFB-HPV39L1:59C, which is a chimeric gene in which the C-terminus of HPV39L1 is replaced by the C-terminus of HPV59L1.
實施例1.9 HPV45L1 C端替換為HPV33L1 C端的嵌合基因的構建Example 1.9 Construction of a chimeric gene in which the HPV45L1 C-terminus is replaced by the HPV33L1 C-terminus
實驗方法和步驟與實施例1.1相同,相關序列參見附錄9。The experimental methods and steps are the same as those in Example 1.1, and the relevant sequences are shown in Appendix 9.
實施例1.10 HPV51L1 C端替換為HPV33L1 C端的嵌合基因的構建Example 1.10 Construction of a chimeric gene in which the HPV51 L1 C-terminus is replaced by the HPV33 L1 C-terminus
實驗方法和步驟與實施例1.1相同,相關序列參見附錄10。The experimental methods and steps are the same as those in Example 1.1, and the relevant sequences are shown in Appendix 10.
實施例1.11 HPV52L1 C端替換為HPV33L1 C端的嵌合基因的構建Example 1.11 Construction of a chimeric gene in which the HPV52 L1 C-terminus is replaced by the HPV33 L1 C-terminus
實驗方法和步驟與實施例1.1相同,相關序列參見附錄11。The experimental methods and steps are the same as those in Example 1.1, and the relevant sequences are shown in Appendix 11.
實施例1.12 HPV56L1 C端替換為HPV33L1 C端的嵌合基因的構建Example 1.12 Construction of a chimeric gene in which the HPV56 L1 C-terminus is replaced by the HPV33 L1 C-terminus
實驗方法和步驟與實施例1.1相同,相關序列參見附錄12。The experimental methods and steps are the same as those in Example 1.1, and the relevant sequences are shown in Appendix 12.
實施例1.13 HPV58L1 C端替換為HPV33L1 C端的嵌合基因的構建Example 1.13 Construction of a chimeric gene in which the HPV58 L1 C-terminus is replaced by the HPV33 L1 C-terminus
實驗方法和步驟與實施例1.1相同,相關序列參見附錄13。The experimental methods and steps are the same as those in Example 1.1, and the relevant sequences are shown in Appendix 13.
實施例1.14 HPV59L1基因的構建Example 1.14 Construction of HPV59L1 gene
實施例1.14.1 pFB-HPV59L1基因的製備Example 1.14.1 Preparation of pFB-HPV59L1 gene
HPV59L1基因透過基因合成的方法構建而成,委託Thermo Fisher公司(原英濰捷基(上海)貿易有限公司)基因合成,合成序列兩端分別具有KpnⅠ和XbaI酶切位點,基因片段序列見SEQ ID No: 162。透過KpnⅠ和XbaI酶切位點將合成的基因片段與pcDNA3載體(銷售商Thermofisher)連接,得到含有編碼HPV59L1 1-508個胺基酸的核苷酸序列的質體pcDNA3-HPV59-L1。The HPV59L1 gene was constructed by gene synthesis and commissioned to Thermo Fisher (formerly Invitrogen (Shanghai) Trading Co., Ltd.). The two ends of the synthetic sequence have KpnⅠ and XbaI restriction sites, respectively. The gene fragment sequence is shown in SEQ ID No: 162. The synthesized gene fragment was connected to the pcDNA3 vector (seller Thermofisher) through the KpnⅠ and XbaI restriction sites to obtain the plasmid pcDNA3-HPV59-L1 containing the nucleotide sequence encoding the 1-508 amino acids of HPV59L1.
將得到的pcDNA3-HPV59-L1質體進行KpnⅠ和XbaI雙酶切得到HPV59L1 (1-508)的基因的片段。再將該片段與KpnⅠ和XbaI雙酶切的pFastBacTM1載體(銷售商Thermofisher)進行連接,得到含HPV59L1 (1-508)基因片段的桿粒載體,命名為pFB-HPV59L1。The obtained pcDNA3-HPV59-L1 plasmid was double-digested with KpnⅠ and XbaI to obtain the gene fragment of HPV59L1 (1-508). The fragment was then ligated with the pFastBacTM1 vector (seller: Thermofisher) double-digested with KpnⅠ and XbaI to obtain a bacmid vector containing the HPV59L1 (1-508) gene fragment, named pFB-HPV59L1.
實施例2 重組桿狀病毒的包裝Example 2 Packaging of recombinant bacilli
實施例2.1:HPV 6L1:33C重組桿狀病毒包裝Example 2.1: HPV 6L1:33C recombinant rod-shaped virus packaging
實施例1構建的pFB-HPV 6L1:33C的重組質體經鑒定和測序正確後,將其轉化至DH10Bac細菌感受態細胞(Bac-to-Bac®試劑盒,購於Thermo Fisher)中,37℃培養擴增,並進行平盤劃線培養,挑選白色菌斑並擴增,培養過夜後收集菌液,使用鹼裂解法提取重組桿粒DNA。After the recombinant plasmid of pFB-HPV 6L1:33C constructed in Example 1 was correctly identified and sequenced, it was transformed into DH10Bac bacterial competent cells (Bac-to-Bac® kit, purchased from Thermo Fisher), cultured and expanded at 37°C, and plate streak culture was performed. White plaques were selected and expanded. After culturing overnight, the bacterial liquid was collected and the recombinant bacmid DNA was extracted using the alkaline lysis method.
用陽離子轉染試劑(購於Sino Biological)將其轉染至昆蟲細胞SF9中進行重組桿狀病毒毒種包裝。具體操作如下: a. 取對數生長期的SF9細胞按照0.6×10 6cell/dish的密度接種dish,將接種有SF9細胞的dish室溫放置2h,貼壁。 b. 提取的質體20μL Bacmid DNA加至200μL Grace’s Medium(無血清,無添加物,購於Gibico)混合顛倒5次。 c. 25 μL 0.2x TF1(轉染試劑,購於Sino Biological)滴加至200 μL Grace’s Meduim輕輕混勻。 d. 將b和c混合。室溫孵育15-45min。 e. 當DNA與cellfectin(購於Sino Biological)孵育時,棄細胞上清,添加無血清添加物的Grace Medium 0.8mL/dish。 f. 將d中孵育好的DNA與轉染試劑複合物滴加到dish中。 g. 27℃孵育2hr。 h. 丟棄細胞培養液,加2.5 mL/dish完全生長培養基(SCD6 SF +10%FBS)(SCD6 SF購於Sino Biological,FBS購於Gibico)。 i. 27℃培養7天觀察是否有病毒感染。 Use cationic transfection reagent (purchased from Sino Biological) to transfect it into insect cells SF9 for recombinant bacilli virus seed packaging. The specific operation is as follows: a. Take SF9 cells in logarithmic growth phase and inoculate the dish at a density of 0.6×10 6 cells/dish. Place the dish inoculated with SF9 cells at room temperature for 2 hours to adhere to the wall. b. Add 20μL Bacmid DNA of the extracted plasmid to 200μL Grace's Medium (serum-free, no additives, purchased from Gibico) and mix it by inverting 5 times. c. Add 25 μL 0.2x TF1 (transfection reagent, purchased from Sino Biological) dropwise to 200 μL Grace's Medium and mix gently. d. Mix b and c. Incubate at room temperature for 15-45 minutes. e. When the DNA is incubated with cellfectin (purchased from Sino Biological), discard the cell supernatant and add Grace Medium without serum supplement 0.8mL/dish. f. Add the DNA and transfection reagent complex incubated in step d to the dish. g. Incubate at 27℃ for 2 hours. h. Discard the cell culture medium and add 2.5 mL/dish of complete growth medium (SCD6 SF +10% FBS) (SCD6 SF purchased from Sino Biological, FBS purchased from Gibico). i. Incubate at 27℃ for 7 days and observe whether there is virus infection.
轉染後待細胞產生明顯的病變後收集病毒上清,一般培養7-11天。用移液器無菌收取病毒上清液,即為HPV6L1:33C P1代毒種。使用HPV6L1:33C P1代毒種按照1:50(V/V)比例感染SF9細胞,SF9細胞的感染密度為2×10 6cells/mL,27℃培養擴增3天,1000g±200g室溫離心10min,收集的病毒上清液即為P2代病毒,可用於感染生產。 After transfection, the virus supernatant is collected after the cells produce obvious pathological changes, and the culture is generally 7-11 days. The virus supernatant is collected aseptically with a pipette, which is the HPV6L1:33C P1 virus strain. The HPV6L1:33C P1 virus strain is used to infect SF9 cells at a ratio of 1:50 (V/V). The infection density of SF9 cells is 2×10 6 cells/mL. The cells are cultured and expanded at 27℃ for 3 days, and centrifuged at 1000g±200g at room temperature for 10 minutes. The collected virus supernatant is the P2 virus, which can be used for infection production.
實施例2.2:HPV 11L1:33C重組桿狀病毒包裝Example 2.2: HPV 11L1:33C recombinant rod-shaped virus packaging
實驗方法和步驟與實施例2.1相同。The experimental methods and steps are the same as those in Example 2.1.
實施例2.3:HPV 16L1:33C重組桿狀病毒包裝Example 2.3: HPV 16L1:33C recombinant rod-shaped virus packaging
實驗方法和步驟與實施例2.1相同。The experimental methods and steps are the same as those in Example 2.1.
實施例2.4:HPV 18L1:33C重組桿狀病毒包裝Example 2.4: HPV 18L1:33C recombinant rod-shaped virus packaging
實驗方法和步驟與實施例2.1相同。The experimental methods and steps are the same as those in Example 2.1.
實施例2.5:HPV 31L1:33C重組桿狀病毒包裝Example 2.5: HPV 31L1:33C recombinant rod-shaped virus packaging
實驗方法和步驟與實施例2.1相同。The experimental methods and steps are the same as those in Example 2.1.
實施例2.6:HPV 33L1重組桿狀病毒包裝Example 2.6: HPV 33L1 recombinant rod-shaped virus packaging
實驗方法和步驟與實施例2.1相同。The experimental methods and steps are the same as those in Example 2.1.
實施例2.7:HPV 35L1:33C重組桿狀病毒包裝Example 2.7: HPV 35L1:33C recombinant rod-shaped virus packaging
實驗方法和步驟與實施例2.1相同。The experimental methods and steps are the same as those in Example 2.1.
實施例2.8:HPV 39L1:59C重組桿狀病毒包裝Example 2.8: HPV 39L1:59C recombinant rod-shaped virus packaging
實驗方法和步驟與實施例2.1相同。The experimental methods and steps are the same as those in Example 2.1.
實施例2.9:HPV 45L1:33C重組桿狀病毒包裝Example 2.9: HPV 45L1:33C recombinant rod-shaped virus packaging
實驗方法和步驟與實施例2.1相同。The experimental methods and steps are the same as those in Example 2.1.
實施例2.10:HPV 51L1:33C重組桿狀病毒包裝Example 2.10: HPV 51L1:33C recombinant rod-shaped virus packaging
實驗方法和步驟與實施例2.1相同。The experimental methods and steps are the same as those in Example 2.1.
實施例2.11:HPV 52L1:33C重組桿狀病毒包裝Example 2.11: HPV 52L1:33C recombinant rod-shaped virus packaging
實驗方法和步驟與實施例2.1相同。The experimental methods and steps are the same as those in Example 2.1.
實施例2.12:HPV 56L1:33C重組桿狀病毒包裝Example 2.12: HPV 56L1:33C recombinant rod-shaped virus packaging
實驗方法和步驟與實施例2.1相同。The experimental methods and steps are the same as those in Example 2.1.
實施例2.13:HPV 58L1:33C重組桿狀病毒包裝Example 2.13: HPV 58L1:33C recombinant rod-shaped virus packaging
實驗方法和步驟與實施例2.1相同。The experimental methods and steps are the same as those in Example 2.1.
實施例2.14:HPV 59L1重組桿狀病毒包裝Example 2.14: HPV 59L1 recombinant bacilli virus packaging
實驗方法和步驟與實施例2.1相同。The experimental methods and steps are the same as those in Example 2.1.
實施例3 嵌合蛋白或蛋白的表達Example 3 Expression of chimeric proteins or proteins
實施例3.1:HPV 6L1:33C表達生產Example 3.1: HPV 6L1:33C expression production
用實施例2中獲得的含有HPV 6L1:33C 重組基因的桿狀病毒感染High Five細胞,感染比例1:200(V/V),1000g±100g室溫離心收集細胞沉澱,使用PBS或MOPS 緩衝液(pH6.0-7.0,鹽濃度100 mM-1M)超音波裂解細胞沉澱,低溫超音波破碎3min,大於10000g的離心力離心10分鐘,收集離心後上清液,SDS-PAGE電泳檢測。泳道1: Marker(Marker為7種純化後的蛋白,分子量大小包含 14.4 至116kDa,生產商為Thermo Scientific);泳道2:細胞裂解液;泳道3:裂解液離心後收集的上清液。High Five cells were infected with the bacilli containing the HPV 6L1:33C recombinant gene obtained in Example 2 at an infection ratio of 1:200 (V/V). The cell pellet was collected by centrifugation at 1000g±100g at room temperature. The cell pellet was ultrasonically lysed using PBS or MOPS buffer (pH 6.0-7.0, salt concentration 100 mM-1M), and the cell pellet was broken by low-temperature ultrasonication for 3 minutes. The cell pellet was centrifuged at a centrifugal force greater than 10000g for 10 minutes, and the supernatant was collected after centrifugation and detected by SDS-PAGE electrophoresis. Lane 1: Marker (Marker is 7 purified proteins with molecular weight ranging from 14.4 to 116 kDa, produced by Thermo Scientific); Lane 2: Cell lysate; Lane 3: Supernatant collected after centrifugation of the lysate.
結果如圖1A所示,該方法製備的HPV 6L1:33C L1蛋白產量大於100 mg/L,蛋白大小約56 KD,可以用於大規模生產。The results are shown in Figure 1A. The yield of HPV 6L1:33C L1 protein prepared by this method is greater than 100 mg/L, and the protein size is about 56 KD, which can be used for large-scale production.
實施例3.2:HPV 11L1:33C表達生產Example 3.2: HPV 11L1:33C expression production
實驗方法和步驟與實施例3.1相同。The experimental methods and steps are the same as those in Example 3.1.
結果如圖1B所示,該方法製備的HPV 11L1:33C L1蛋白產量大於100 mg/L,蛋白大小約56 KD,可以用於大規模生產。The results are shown in Figure 1B. The yield of HPV 11L1:33C L1 protein prepared by this method is greater than 100 mg/L, and the protein size is about 56 KD, which can be used for large-scale production.
實施例3.3:HPV 16L1:33C表達生產Example 3.3: HPV 16L1:33C expression production
實驗方法和步驟與實施例3.1相同。The experimental methods and steps are the same as those in Example 3.1.
結果如圖1C所示,該方法製備的HPV 16L1:33C L1蛋白產量大於100 mg/L,蛋白大小約56 KD,可以用於大規模生產。The results are shown in Figure 1C. The yield of HPV 16L1:33C L1 protein prepared by this method is greater than 100 mg/L, and the protein size is about 56 KD, which can be used for large-scale production.
實施例3.4:HPV 18L1:33C表達生產Example 3.4: HPV 18L1:33C expression production
實驗方法和步驟與實施例3.1相同。The experimental methods and steps are the same as those in Example 3.1.
結果如圖1D所示,該方法製備的HPV 18L1:33C L1蛋白產量大於100 mg/L,蛋白大小約56 KD,可以用於大規模生產。The results are shown in Figure 1D. The yield of HPV 18L1:33C L1 protein prepared by this method is greater than 100 mg/L, and the protein size is approximately 56 KD, which can be used for large-scale production.
實施例3.5:HPV 31L1:33C表達生產Example 3.5: HPV 31L1:33C expression production
實驗方法和步驟與實施例3.1相同。The experimental methods and steps are the same as those in Example 3.1.
結果如圖1E所示,該方法製備的HPV 31L1:33C L1蛋白產量大於100 mg/L,蛋白大小約56 KD,可以用於大規模生產。The results are shown in Figure 1E. The yield of HPV 31L1:33C L1 protein prepared by this method is greater than 100 mg/L, and the protein size is about 56 KD, which can be used for large-scale production.
實施例3.6:HPV 33L1表達生產Example 3.6: HPV 33L1 expression production
實驗方法和步驟與實施例3.1相同。The experimental methods and steps are the same as those in Example 3.1.
結果如圖1F所示,該方法製備的HPV 33L1 L1蛋白產量大於100 mg/L,蛋白大小約56 KD,可以用於大規模生產。The results are shown in Figure 1F. The yield of HPV 33L1 L1 protein prepared by this method is greater than 100 mg/L, and the protein size is about 56 KD, which can be used for large-scale production.
實施例3.7:HPV 35L1:33C表達生產Example 3.7: HPV 35L1:33C expression production
實驗方法和步驟與實施例3.1相同。The experimental methods and steps are the same as those in Example 3.1.
結果如圖1G所示,該方法製備的HPV 35L1:33C L1蛋白產量大於100 mg/L,蛋白大小約56 KD,可以用於大規模生產。The results are shown in Figure 1G. The yield of HPV 35L1:33C L1 protein prepared by this method is greater than 100 mg/L, and the protein size is approximately 56 KD, which can be used for large-scale production.
實施例3.8:HPV 39L1:59C表達生產Example 3.8: HPV 39L1:59C expression production
實驗方法和步驟與實施例3.1相同。The experimental methods and steps are the same as those in Example 3.1.
結果如圖1H所示,該方法製備的HPV 39L1:59C L1蛋白產量大於100 mg/L,蛋白大小約56 KD,可以用於大規模生產。The results are shown in Figure 1H. The yield of HPV 39L1:59C L1 protein prepared by this method is greater than 100 mg/L, and the protein size is approximately 56 KD, which can be used for large-scale production.
實施例3.9:HPV 45L1:33C表達生產Example 3.9: HPV 45L1:33C expression production
實驗方法和步驟與實施例3.1相同。The experimental methods and steps are the same as those in Example 3.1.
結果如圖1I所示,該方法製備的HPV 45L1:33C L1蛋白產量大於100 mg/L,蛋白大小約56 KD,可以用於大規模生產。The results are shown in Figure 1I. The yield of HPV 45L1:33C L1 protein prepared by this method is greater than 100 mg/L, and the protein size is about 56 KD, which can be used for large-scale production.
實施例3.10:HPV 51L1:33C表達生產Example 3.10: HPV 51L1:33C expression production
實驗方法和步驟與實施例3.1相同。The experimental methods and steps are the same as those in Example 3.1.
結果如圖1J所示,該方法製備的HPV 51L1:33C L1蛋白產量大於100 mg/L,蛋白大小約56 KD,可以用於大規模生產。The results are shown in Figure 1J. The yield of HPV 51L1:33C L1 protein prepared by this method is greater than 100 mg/L, and the protein size is approximately 56 KD, which can be used for large-scale production.
實施例3.11:HPV 52L1:33C表達生產Example 3.11: HPV 52L1:33C expression production
實驗方法和步驟與實施例3.1相同。The experimental methods and steps are the same as those in Example 3.1.
結果如圖1K所示,該方法製備的HPV 52L1:33C L1蛋白產量大於100 mg/L,蛋白大小約56 KD,可以用於大規模生產。The results are shown in Figure 1K. The yield of HPV 52L1:33C L1 protein prepared by this method is greater than 100 mg/L, and the protein size is approximately 56 KD, which can be used for large-scale production.
實施例3.12:HPV 56L1:33C表達生產Example 3.12: HPV 56L1:33C expression production
實驗方法和步驟與實施例3.1相同。The experimental methods and steps are the same as those in Example 3.1.
結果如圖1L所示,該方法製備的HPV 56L1:33C L1蛋白產量大於100 mg/L,蛋白大小約56 KD,可以用於大規模生產。The results are shown in Figure 1L. The yield of HPV 56L1:33C L1 protein prepared by this method is greater than 100 mg/L, and the protein size is about 56 KD, which can be used for large-scale production.
實施例3.13:HPV 58L1:33C表達生產Example 3.13: HPV 58L1:33C expression production
實驗方法和步驟與實施例3.1相同。The experimental methods and steps are the same as those in Example 3.1.
結果如圖1M所示,該方法製備的HPV 58L1:33C L1蛋白產量大於100 mg/L,蛋白大小約56 KD,可以用於大規模生產。The results are shown in Figure 1M. The yield of HPV 58L1:33C L1 protein prepared by this method is greater than 100 mg/L, and the protein size is approximately 56 KD, which can be used for large-scale production.
實施例3.14:HPV 59L1表達生產Example 3.14: HPV 59L1 expression production
實驗方法和步驟與實施例3.1相同。The experimental methods and steps are the same as those in Example 3.1.
結果如圖1N所示,該方法製備的HPV 59L1 L1蛋白產量大於100 mg/L,蛋白大小約56 KD,可以用於大規模生產。The results are shown in Figure 1N. The yield of HPV 59L1 L1 protein prepared by this method is greater than 100 mg/L, and the protein size is about 56 KD, which can be used for large-scale production.
實施例4 類病毒顆粒的純化製備Example 4 Preparation of Purified Virus-Like Particles
實施例4.1:HPV 6L1:33C類病毒顆粒的純化製備Example 4.1: Preparation of purified HPV 6L1:33C virus-like particles
HPV 6L1:33C類病毒顆粒純化方法為兩步層析法,即HS-MMA法,純化實施例3中收集的上清液,最終可得到高純度的類病毒顆粒。The HPV 6L1:33C virus-like particles were purified by a two-step chromatography method, namely, the HS-MMA method. The supernatant collected in Example 3 was purified to obtain highly pure virus-like particles.
第一步層析: 介質:採用Thermo Fisher公司生產的POROS ®50 HS強陽離子交換介質。 介質體積:介質體積150mL,線性流速30mL/min。 層析條件:平衡緩衝液(pH6.2,鹽濃度為50mM磷酸鹽,0.5M氯化鈉);清洗緩衝液(鹽濃度為50mM磷酸鹽,0.75M氯化鈉,pH6.2;) 層析柱先用 5 CV平衡緩衝液,然後上樣。上樣結束後,之後分別用5 CV的平衡緩衝液和清洗緩衝液洗脫雜蛋白。 洗脫條件:pH6.2,洗脫鹽濃度為1.25M 氯化鈉採用含有50mM 鹽酸精胺酸的50mM磷酸鹽緩衝液進行洗脫。 The first step of chromatography: Medium: POROS ® 50 HS strong cation exchange medium produced by Thermo Fisher was used. Medium volume: medium volume 150mL, linear flow rate 30mL/min. Chromatographic conditions: equilibration buffer (pH6.2, salt concentration of 50mM phosphate, 0.5M sodium chloride); washing buffer (salt concentration of 50mM phosphate, 0.75M sodium chloride, pH6.2;) The chromatography column was first equilibrated with 5 CV of buffer and then loaded with sample. After loading, the impurities were eluted with 5 CV of equilibration buffer and washing buffer respectively. Elution conditions: pH 6.2, elution salt concentration is 1.25M sodium chloride, and elution is performed using 50mM phosphate buffer containing 50mM arginine hydrochloride.
第二步層析: 介質:採用上海博格隆公司生產的MMA離子交換介質。 介質體積:介質體積150mL,線性流速30mL/min。 層析條件:平衡緩衝液50mM PB,1.25M NaCl, pH6.2。層析柱先用4CV平衡緩衝液平衡,然後上樣。上樣結束後,用5 CV 平衡緩衝液沖洗雜蛋白後,然後用洗脫緩衝液洗脫目標蛋白收集蛋白。 洗脫條件:100mM NaAC,150mM NaCl,0.01% Tween 80, pH4.5。 Second step of chromatography: Medium: MMA ion exchange medium produced by Shanghai Boglon Company was used. Medium volume: medium volume 150mL, linear flow rate 30mL/min. Chromatographic conditions: equilibrium buffer 50mM PB, 1.25M NaCl, pH6.2. The chromatography column was first equilibrated with 4CV equilibrium buffer, and then the sample was loaded. After the loading was completed, the impurities were washed with 5 CV equilibrium buffer, and then the target protein was eluted with elution buffer to collect the protein. Elution conditions: 100mM NaAC, 150mM NaCl, 0.01% Tween 80, pH4.5.
實施例4.2:HPV 11L1:33C類病毒顆粒的純化製備Example 4.2: Preparation of purified HPV 11L1:33C virus-like particles
實驗方法和步驟與實施例4.1相同。The experimental methods and steps are the same as those in Example 4.1.
實施例4.3:HPV 16L1:33C類病毒顆粒的純化製備Example 4.3: Preparation of purified HPV 16L1:33C virus-like particles
實驗方法和步驟與實施例4.1相同。The experimental methods and steps are the same as those in Example 4.1.
實施例4.4:HPV 18L1:33C類病毒顆粒的純化製備Example 4.4: Purification of HPV 18L1:33C virus-like particles
實驗方法和步驟與實施例4.1相同。The experimental methods and steps are the same as those in Example 4.1.
實施例4.5:HPV 31L1:33C類病毒顆粒的純化製備Example 4.5: Purification of HPV 31L1:33C virus-like particles
實驗方法和步驟與實施例4.1相同。The experimental methods and steps are the same as those in Example 4.1.
實施例4.6:HPV 33L1類病毒顆粒的純化製備Example 4.6: Purification of HPV 33L1 virus-like particles
實驗方法和步驟與實施例4.1相同。The experimental methods and steps are the same as those in Example 4.1.
實施例4.7:HPV 35L1:33C類病毒顆粒的純化製備Example 4.7: Purification of HPV 35L1:33C virus-like particles
實驗方法和步驟與實施例4.1相同。The experimental methods and steps are the same as those in Example 4.1.
實施例4.8:HPV 39L1:59C類病毒顆粒的純化製備Example 4.8: Purification of HPV 39L1:59C virus-like particles
實驗方法和步驟與實施例4.1相同。The experimental methods and steps are the same as those in Example 4.1.
實施例4.9:HPV 45L1:33C類病毒顆粒的純化製備Example 4.9: Preparation of purified HPV 45L1:33C virus-like particles
實驗方法和步驟與實施例4.1相同。The experimental methods and steps are the same as those in Example 4.1.
實施例4.10:HPV 51L1:33C類病毒顆粒的純化製備Example 4.10: Preparation of purified HPV 51L1:33C virus-like particles
實驗方法和步驟與實施例4.1相同。The experimental methods and steps are the same as those in Example 4.1.
實施例4.11:HPV 52L1:33C類病毒顆粒的純化製備Example 4.11: Preparation of purified HPV 52L1:33C virus-like particles
實驗方法和步驟與實施例4.1相同。The experimental methods and steps are the same as those in Example 4.1.
實施例4.12:HPV 56L1:33C類病毒顆粒的純化製備Example 4.12: Preparation of purified HPV 56L1:33C virus-like particles
實驗方法和步驟與實施例4.1相同。The experimental methods and steps are the same as those in Example 4.1.
實施例4.13:HPV 58L1:33C類病毒顆粒的純化製備Example 4.13: Preparation of purified HPV 58L1:33C virus-like particles
實驗方法和步驟與實施例4.1相同。The experimental methods and steps are the same as those in Example 4.1.
實施例4.14:HPV 59L1類病毒顆粒的純化製備Example 4.14: Preparation of purified HPV 59L1 virus-like particles
實驗方法和步驟與實施例4.1相同。The experimental methods and steps are the same as those in Example 4.1.
實施例5 類病毒顆粒的形態學檢測Example 5 Morphological Detection of Virus-like Particles
實施例5.1:HPV 6L1:33C類病毒顆粒的形態學檢測Example 5.1: Morphological Detection of HPV 6L1:33C Virus-like Particles
取10μL樣品用於穿透式電子顯微鏡觀察。將樣品固定到碳噴銅網上吸附2min,殘餘液體用濾紙吸掉,再使用磷鎢酸(北京中鏡科儀技術有限公司,濃度2%,pH6.5)染色兩次,每次30秒,殘餘染色液用濾紙吸掉,晾乾後即可在穿透式電子顯微鏡下觀察。穿透式電子顯微鏡(品牌:日立,型號:H-7650)為80KV,放大倍數為80,000倍。Take 10μL sample for transmission electron microscope observation. Fix the sample on the carbon spray copper mesh for 2 minutes, remove the residual liquid with filter paper, and then use phospho-tungsten acid (Beijing Zhongjing Instrument Technology Co., Ltd., concentration 2%, pH 6.5) to stain twice, each time for 30 seconds, remove the residual staining liquid with filter paper, and observe under the transmission electron microscope after drying. The transmission electron microscope (brand: Hitachi, model: H-7650) is 80KV and the magnification is 80,000 times.
電子顯微鏡觀察結果見圖2A,由圖2A可見,C端改造的HPV 6L1:33C可以形成大小均一的類病毒顆粒,平均直徑在60nm左右。The results of electron microscopy observation are shown in Figure 2A. As shown in Figure 2A, the C-terminally modified HPV 6L1:33C can form virus-like particles of uniform size with an average diameter of about 60 nm.
實施例5.2:HPV 11L1:33C類病毒顆粒的形態學檢測Example 5.2: Morphological Detection of HPV 11L1:33C Virus-like Particles
實驗方法和步驟與實施例5.1相同。The experimental methods and steps are the same as those in Example 5.1.
電子顯微鏡觀察結果見圖2B,由圖2B可見,C端改造的HPV 11L1:33C可以形成大小均一的類病毒顆粒,平均直徑在60nm左右。The results of electron microscopy observation are shown in Figure 2B. As shown in Figure 2B, the C-terminally modified HPV 11L1:33C can form virus-like particles of uniform size with an average diameter of about 60 nm.
實施例5.3:HPV 16L1:33C類病毒顆粒的形態學檢測Example 5.3: Morphological Detection of HPV 16L1:33C Virus-like Particles
實驗方法和步驟與實施例5.1相同。The experimental methods and steps are the same as those in Example 5.1.
電子顯微鏡觀察結果見圖2C,由圖2C可見,C端改造的HPV 16L1:33C可以形成大小均一的類病毒顆粒,平均直徑在60nm左右。The results of electron microscopy observation are shown in Figure 2C. As shown in Figure 2C, the C-terminally modified HPV 16L1:33C can form virus-like particles of uniform size with an average diameter of about 60 nm.
實施例5.4:HPV 18L1:33C類病毒顆粒的形態學檢測Example 5.4: Morphological Detection of HPV 18L1:33C Virus-like Particles
實驗方法和步驟與實施例5.1相同。The experimental methods and steps are the same as those in Example 5.1.
電子顯微鏡觀察結果見圖2D,由圖2D可見,C端改造的HPV 18L1:33C可以形成大小均一的類病毒顆粒,平均直徑在60nm左右。The results of electron microscopy observation are shown in Figure 2D. As shown in Figure 2D, the C-terminally modified HPV 18L1:33C can form virus-like particles of uniform size with an average diameter of about 60 nm.
實施例5.5:HPV 31L1:33C類病毒顆粒的形態學檢測Example 5.5: Morphological Detection of HPV 31L1:33C Virus-like Particles
實驗方法和步驟與實施例5.1相同。The experimental methods and steps are the same as those in Example 5.1.
電子顯微鏡觀察結果見圖2E,由圖2E可見,C端改造的HPV 31L1:33C可以形成大小均一的類病毒顆粒,平均直徑在60nm左右。The results of electron microscopy observation are shown in Figure 2E. As shown in Figure 2E, the C-terminally modified HPV 31L1:33C can form virus-like particles of uniform size with an average diameter of about 60 nm.
實施例5.6:HPV 33L1類病毒顆粒的形態學檢測Example 5.6: Morphological Detection of HPV 33L1 Virus-like Particles
實驗方法和步驟與實施例5.1相同。The experimental methods and steps are the same as those in Example 5.1.
電子顯微鏡觀察結果見圖2F,由圖2F可見,C端改造的HPV 33L1可以形成大小均一的類病毒顆粒,平均直徑在60nm左右。The results of electron microscopy observation are shown in Figure 2F. As shown in Figure 2F, the C-terminally modified HPV 33L1 can form virus-like particles of uniform size with an average diameter of about 60 nm.
實施例5.7:HPV 35L1:33C類病毒顆粒的形態學檢測Example 5.7: Morphological Detection of HPV 35L1:33C Virus-like Particles
實驗方法和步驟與實施例5.1相同。The experimental methods and steps are the same as those in Example 5.1.
電子顯微鏡觀察結果見圖2G,由圖2G可見,C端改造的HPV 35L1:33C可以形成大小均一的類病毒顆粒,平均直徑在60nm左右。The results of electron microscopy observation are shown in Figure 2G. As shown in Figure 2G, the C-terminally modified HPV 35L1:33C can form virus-like particles of uniform size with an average diameter of about 60 nm.
實施例5.8:HPV 39L1:59C類病毒顆粒的形態學檢測Example 5.8: Morphological Detection of HPV 39L1:59C Virus-like Particles
實驗方法和步驟與實施例5.1相同。The experimental methods and steps are the same as those in Example 5.1.
電子顯微鏡觀察結果見圖2H,由圖2H可見,C端改造的HPV 39L1:59C可以形成大小均一的類病毒顆粒,平均直徑在60nm左右。The results of electron microscopy observation are shown in Figure 2H. As shown in Figure 2H, the C-terminally modified HPV 39L1:59C can form virus-like particles of uniform size with an average diameter of about 60 nm.
實施例5.9:HPV 45L1:33C類病毒顆粒的形態學檢測Example 5.9: Morphological Detection of HPV 45L1:33C Virus-like Particles
實驗方法和步驟與實施例5.1相同。The experimental methods and steps are the same as those in Example 5.1.
電子顯微鏡觀察結果見圖2I,由圖2I可見,C端改造的HPV 45L1:33C可以形成大小均一的類病毒顆粒,平均直徑在60nm左右。The results of electron microscopy observation are shown in Figure 2I. As shown in Figure 2I, the C-terminally modified HPV 45L1:33C can form virus-like particles of uniform size with an average diameter of about 60 nm.
實施例5.10:HPV 51L1:33C類病毒顆粒的形態學檢測Example 5.10: Morphological Detection of HPV 51L1:33C Virus-like Particles
實驗方法和步驟與實施例5.1相同。The experimental methods and steps are the same as those in Example 5.1.
電子顯微鏡觀察結果見圖2J,由圖2J可見,C端改造的HPV 51L1:33C可以形成大小均一的類病毒顆粒,平均直徑在60nm左右。The results of electron microscopy observation are shown in Figure 2J. As can be seen from Figure 2J, the C-terminally modified HPV 51L1:33C can form virus-like particles of uniform size with an average diameter of about 60nm.
實施例5.11:HPV 52L1:33C類病毒顆粒的形態學檢測Example 5.11: Morphological Detection of HPV 52L1:33C Virus-like Particles
實驗方法和步驟與實施例5.1相同。The experimental methods and steps are the same as those in Example 5.1.
電子顯微鏡觀察結果見圖2K,由圖2K可見,C端改造的HPV 52L1:33C可以形成大小均一的類病毒顆粒,平均直徑在60nm左右。The results of electron microscopy observation are shown in Figure 2K. As shown in Figure 2K, the C-terminally modified HPV 52L1:33C can form virus-like particles of uniform size with an average diameter of about 60 nm.
實施例5.12:HPV 56L1:33C類病毒顆粒的形態學檢測Example 5.12: Morphological Detection of HPV 56L1:33C Virus-like Particles
實驗方法和步驟與實施例5.1相同。The experimental methods and steps are the same as those in Example 5.1.
電子顯微鏡觀察結果見圖2L,由圖2L可見,C端改造的HPV 56L1:33C可以形成大小均一的類病毒顆粒,平均直徑在60nm左右。The results of electron microscopy observation are shown in Figure 2L. As shown in Figure 2L, the C-terminally modified HPV 56L1:33C can form virus-like particles of uniform size with an average diameter of about 60 nm.
實施例5.13:HPV 58L1:33C類病毒顆粒的形態學檢測Example 5.13: Morphological Detection of HPV 58L1:33C Virus-like Particles
實驗方法和步驟與實施例5.1相同。The experimental methods and steps are the same as those in Example 5.1.
電子顯微鏡觀察結果見圖2M,由圖2M可見,C端改造的HPV 58L1:33C可以形成大小均一的類病毒顆粒,平均直徑在60nm左右。The results of electron microscopy observation are shown in Figure 2M. As shown in Figure 2M, the C-terminally modified HPV 58L1:33C can form virus-like particles of uniform size with an average diameter of about 60 nm.
實施例5.14:HPV 59L1類病毒顆粒的形態學檢測Example 5.14: Morphological Detection of HPV 59L1 Virus-like Particles
實驗方法和步驟與實施例5.1相同。The experimental methods and steps are the same as those in Example 5.1.
電子顯微鏡觀察結果見圖2N,由圖2N可見,C端改造的HPV 59L1可以形成大小均一的類病毒顆粒,平均直徑在60nm左右。The results of electron microscopy observation are shown in Figure 2N. As shown in Figure 2N, the C-terminally modified HPV 59L1 can form virus-like particles of uniform size with an average diameter of about 60 nm.
實施例6 類病毒顆粒動物免疫原性評價Example 6 Evaluation of the immunogenicity of virus-like particles in animals
實施例6.1:HPV 6L1:33C類病毒顆粒動物免疫原性評價Example 6.1: Evaluation of the immunogenicity of HPV 6L1:33C virus-like particles in animals
6.1.1 假病毒中和細胞的模型建立6.1.1 Model establishment of pseudovirus neutralization cells
由於HPV很難進行體外培養,又具有較強的宿主特異性,很難在除人體以外的生物體進行繁殖,缺乏合適的動物模型。所以需要建立合適有效的體外中和實驗模型,用於疫苗免疫保護性的評估。Since HPV is difficult to culture in vitro and has strong host specificity, it is difficult to reproduce in organisms other than humans and there is a lack of suitable animal models. Therefore, it is necessary to establish a suitable and effective in vitro neutralization experimental model for evaluating the immune protection of vaccines.
HPV假病毒是理想的HPV體外中和實驗模型:利用HPV VLP具有非特異包裹核酸的特性,細胞內表達的HPV L1和L2組成的VLP包裹游離的DNA或導入外源質體形成HPV假病毒。HPV pseudovirus is an ideal HPV in vitro neutralization experimental model: HPV VLP has the property of non-specifically encapsulating nucleic acids, and VLPs composed of HPV L1 and L2 expressed intracellularly encapsulate free DNA or are introduced into exogenous plasmids to form HPV pseudoviruses.
採用假病毒中和法對樣品免疫後動物血清樣品進行免疫原性分析。HPV6類病毒顆粒樣品免疫動物後能產生針對HPV6的中和抗體,能中和HPV6型的假病毒。將免疫後的動物血清與一定量的假病毒孵育後再侵染細胞,可表達GFP螢光的細胞會隨著血清中中和抗體的增加而減少,在一定的範圍內可存在線性負相關,因此可以透過檢測表達GFP的細胞數的變化來評價血清中抗體的中和活性。The pseudovirus neutralization method is used to analyze the immunogenicity of animal serum samples after immunization. HPV6 virus-like particle samples can produce neutralizing antibodies against HPV6 after immunization of animals, which can neutralize HPV6 pseudoviruses. After incubating the immunized animal serum with a certain amount of pseudovirus and then infecting cells, the cells that can express GFP fluorescence will decrease as the neutralizing antibodies in the serum increase. There can be a linear negative correlation within a certain range. Therefore, the neutralizing activity of antibodies in the serum can be evaluated by detecting changes in the number of cells expressing GFP.
假病毒構建方法:將HPV6型的 pCMV3-3-HPV6L1+L2(L1序列來源於Uniprot P69898,L2序列來源Uniprot Q84297) 質體(購於Sino Biological)以及螢光質體(PSEU-GFP Spark,購於Sino Biological)共轉染至293FT貼壁細胞(購於Thermo Fisher)。具體方法參考文獻(Pastrana D V, Buck C B, Pang Y S, Thompson C D, Castle P E, FitzGerald P C, Kjaer S K, Lowy D R, Schiller J T.Reactivity of human sera in a sensitive, high-throughput pseudovirus-based papillomavirus neutralization assay for HPV16 and HPV18. J Virology 2004,321:205-216.)。收集假病毒上清液並進行分裝,置於-80℃冰箱中保存備用。Pseudovirus construction method: HPV6 type pCMV3-3-HPV6L1+L2 (L1 sequence from Uniprot P69898, L2 sequence from Uniprot Q84297) plasmid (purchased from Sino Biological) and fluorescent plasmid (PSEU-GFP Spark, purchased from Sino Biological) were co-transfected into 293FT adherent cells (purchased from Thermo Fisher). For specific methods, please refer to the literature (Pastrana D V, Buck CB, Pang Y S, Thompson C D, Castle P E, FitzGerald P C, Kjaer S K, Lowy D R, Schiller J T.Reactivity of human sera in a sensitive, high-throughput pseudovirus-based papillomavirus neutralization assay for HPV16 and HPV18. J Virology 2004,321:205-216.). The pseudovirus supernatant was collected and packaged, and stored in a -80°C refrigerator for later use.
6.1.2 HPV 6L1:33C類病毒顆粒動物免疫保護性評價6.1.2 HPV 6L1:33C virus-like particles for animal immune protection
小鼠免疫程序: HPV 6L1:33C類病毒顆粒吸附於磷酸鋁佐劑上,經混合後取200µL用於免疫小鼠,每隻小鼠免疫劑量0.15µg,免疫10隻小鼠,於實驗第0天、第7天、第21天分別用稀釋後樣品對小鼠進行免疫,同時設立空白血清對照組,於實驗第28天摘取小鼠眼球取血,分離出血清進行假病毒中和效價檢測。 Mouse immunization procedure: HPV 6L1:33C virus-like particles were adsorbed on aluminum phosphate adjuvant. After mixing, 200µL was taken for immunization of mice. The immunization dose for each mouse was 0.15µg. Ten mice were immunized. Mice were immunized with diluted samples on the 0th, 7th and 21st days of the experiment. A blank serum control group was set up at the same time. On the 28th day of the experiment, the eyeballs of mice were removed to collect blood, and the serum was separated for pseudovirus neutralization titer detection.
小鼠EC 50檢測: 小鼠血清在56℃滅活30分鐘後,離心6000g,5分鐘後取上清進行檢測。檢測前4-8小時,將293FT細胞以15000細胞/孔的密度鋪板於96孔板中,培養於37℃,5%CO 2的二氧化碳培養箱中。免疫後小鼠血清、空白對照血清均用中和培養基系列稀釋後按照體積比1:1分別與6.1中製備的HPV6假病毒混合。2~8℃冰箱中孵育1小時後按照100μL/孔加入到提前4-8小時鋪板的293FT細胞上,每個樣品2個複孔,同時設立空白血清對照孔、假病毒陽性對照孔和陰性對照孔。加樣後的細胞繼續在37℃,5%CO 2的二氧化碳培養箱中培養62-96小時後,在酶聯斑點分析儀中(型號:S6 Universal-V Analyzer,廠家:CTL)進行螢光掃描拍照以及計數。透過計算每個小鼠血清樣品的中和抑制率,依據Reed-Muench法計算得到血清中和抑制率為50%時血清最大稀釋倍數,即半數有效稀釋倍數EC 50。 Mouse EC 50 test: Mouse serum was inactivated at 56°C for 30 minutes, centrifuged at 6000g, and the supernatant was taken for detection after 5 minutes. 4-8 hours before the test, 293FT cells were plated in a 96-well plate at a density of 15,000 cells/well and cultured in a carbon dioxide incubator at 37°C and 5% CO 2. The immunized mouse serum and blank control serum were serially diluted with neutralizing medium and mixed with the HPV6 pseudovirus prepared in 6.1 at a volume ratio of 1:1. After incubation in a refrigerator at 2~8°C for 1 hour, 100μL/well was added to the 293FT cells plated 4-8 hours in advance, with 2 replicates for each sample, and blank serum control wells, pseudovirus positive control wells, and negative control wells were set up at the same time. After adding the sample, the cells were cultured in a carbon dioxide incubator at 37°C and 5% CO2 for 62-96 hours, and then the cells were scanned and photographed and counted in an enzyme-linked spot analyzer (model: S6 Universal-V Analyzer, manufacturer: CTL). The neutralization inhibition rate of each mouse serum sample was calculated, and the maximum serum dilution factor when the serum neutralization inhibition rate was 50% was calculated according to the Reed-Muench method, that is, the half effective dilution factor EC50 .
HPV6血清假病毒中和效價檢測結果詳見表4。The results of HPV6 serum pseudovirus neutralization titer test are shown in Table 4.
表 4 小鼠血清中和效價檢測結果 EC
50 ( GMT±SEM )
上述檢測結果顯示,本發明製備的HPV 6L1:33C類病毒顆粒具有較好的免疫原性,可在動物體內產生高效價的中和抗體,可以用於製備成預防HPV感染的疫苗。The above test results show that the HPV 6L1:33C virus-like particles prepared by the present invention have good immunogenicity, can produce high-titer neutralizing antibodies in animals, and can be used to prepare vaccines for preventing HPV infection.
實施例6.2:HPV 11L1:33C類病毒顆粒動物免疫原性評價Example 6.2: Evaluation of the immunogenicity of HPV 11L1:33C virus-like particles in animals
實驗方法和步驟與實施例6.1相同。L1序列來源於Uniprot P04012,L2序列來源Uniprot P04013。The experimental methods and steps were the same as those in Example 6.1. The L1 sequence was derived from Uniprot P04012, and the L2 sequence was derived from Uniprot P04013.
HPV11血清假病毒中和效價檢測結果詳見表5。The results of HPV11 serum pseudovirus neutralization titer test are shown in Table 5.
表surface
55
小鼠血清中和效價檢測結果Mouse serum neutralization titer test results
EC
50 EC 50
((
GMT±SEMGMT±SEM
))
上述檢測結果顯示,本發明製備的HPV 11L1:33C類病毒顆粒具有較好的免疫原性,可在動物體內產生高效價的中和抗體,可以用於製備成預防HPV感染的疫苗。The above test results show that the HPV 11L1:33C virus-like particles prepared by the present invention have good immunogenicity, can produce high-titer neutralizing antibodies in animals, and can be used to prepare vaccines for preventing HPV infection.
實施例6.3:HPV 16L1:33C類病毒顆粒動物免疫原性評價Example 6.3: Evaluation of the immunogenicity of HPV 16L1:33C virus-like particles in animals
實驗方法和步驟與實施例6.1相同。L1序列來源於Uniprot P03101,L2序列來源Uniprot P03107。The experimental methods and steps were the same as those in Example 6.1. The L1 sequence was derived from Uniprot P03101, and the L2 sequence was derived from Uniprot P03107.
HPV16血清假病毒中和效價檢測結果詳見表6。The results of HPV16 serum pseudovirus neutralization titer test are shown in Table 6.
表surface
66
小鼠血清中和效價檢測結果Mouse serum neutralization titer test results
EC
50 EC 50
((
GMT±SEMGMT±SEM
))
上述檢測結果顯示,本發明製備的HPV 16L1:33C類病毒顆粒具有較好的免疫原性,可在動物體內產生高效價的中和抗體,可以用於製備成預防HPV感染的疫苗。The above test results show that the HPV 16L1:33C virus-like particles prepared by the present invention have good immunogenicity, can produce high-titer neutralizing antibodies in animals, and can be used to prepare vaccines for preventing HPV infection.
實施例6.4:HPV 18L1:33C類病毒顆粒動物免疫原性評價Example 6.4: Evaluation of the immunogenicity of HPV 18L1:33C virus-like particles in animals
實驗方法和步驟與實施例6.1相同。L1序列來源於Uniprot Q80B70,L2序列來源Uniprot P06793。The experimental methods and steps were the same as those in Example 6.1. The L1 sequence was derived from Uniprot Q80B70, and the L2 sequence was derived from Uniprot P06793.
HPV18血清假病毒中和效價檢測結果詳見表7。The results of HPV18 serum pseudovirus neutralization titer test are shown in Table 7.
表surface
77
小鼠血清中和效價檢測結果Mouse serum neutralization titer test results
EC
50 EC 50
((
GMT±SEMGMT±SEM
))
上述檢測結果顯示,本發明製備的HPV 18L1:33C類病毒顆粒具有較好的免疫原性,可在動物體內產生高效價的中和抗體,可以用於製備成預防HPV感染的疫苗。The above test results show that the HPV 18L1:33C virus-like particles prepared by the present invention have good immunogenicity, can produce high-titer neutralizing antibodies in animals, and can be used to prepare vaccines for preventing HPV infection.
實施例6.5:HPV 31L1:33C類病毒顆粒動物免疫原性評價Example 6.5: Evaluation of the immunogenicity of HPV 31L1:33C virus-like particles in animals
實驗方法和步驟與實施例6.1相同。L1序列來源於Uniprot P17388,L2序列來源Uniprot P17389。The experimental methods and steps were the same as those in Example 6.1. The L1 sequence was derived from Uniprot P17388, and the L2 sequence was derived from Uniprot P17389.
HPV31血清假病毒中和效價檢測結果詳見表8。The results of HPV31 serum pseudovirus neutralization titer test are shown in Table 8.
表surface
88
小鼠血清中和效價檢測結果Mouse serum neutralization titer test results
EC
50 EC 50
((
GMT±SEMGMT±SEM
))
上述檢測結果顯示,本發明製備的HPV 31L1:33C類病毒顆粒具有較好的免疫原性,可在動物體內產生高效價的中和抗體,可以用於製備成預防HPV感染的疫苗。The above test results show that the HPV 31L1:33C virus-like particles prepared by the present invention have good immunogenicity, can produce high-titer neutralizing antibodies in animals, and can be used to prepare vaccines for preventing HPV infection.
實施例6.6:HPV 33L1類病毒顆粒動物免疫原性評價Example 6.6: Evaluation of the immunogenicity of HPV 33L1-like virus particles in animals
實驗方法和步驟與實施例6.1相同。L1序列來源於Uniprot P06416,L2序列來源Uniprot P06418。The experimental methods and steps were the same as those in Example 6.1. The L1 sequence was derived from Uniprot P06416, and the L2 sequence was derived from Uniprot P06418.
HPV33血清假病毒中和效價檢測結果詳見表9。The results of HPV33 serum pseudovirus neutralization titer test are shown in Table 9.
表surface 99 小鼠血清中和效價檢測結果Mouse serum neutralization titer test results EC 50 EC 50 (( GMT±SEMGMT±SEM ))
上述檢測結果顯示,本發明製備的HPV 33L1類病毒顆粒具有較好的免疫原性,可在動物體內產生高效價的中和抗體,可以用於製備成預防HPV感染的疫苗。
實施例6.7:HPV 35L1:33C類病毒顆粒動物免疫原性評價Example 6.7: Evaluation of the immunogenicity of HPV 35L1:33C virus-like particles in animals
實驗方法和步驟與實施例6.1相同。L1序列來源於Uniprot P27232,L2序列來源Uniprot P27234。The experimental methods and steps were the same as those in Example 6.1. The L1 sequence was derived from Uniprot P27232, and the L2 sequence was derived from Uniprot P27234.
HPV35血清假病毒中和效價檢測結果詳見表10。The results of HPV35 serum pseudovirus neutralization titer test are shown in Table 10.
表surface
1010
小鼠血清中和效價檢測結果Mouse serum neutralization titer test results
EC
50 EC 50
((
GMT±SEMGMT±SEM
))
上述檢測結果顯示,本發明製備的HPV 35L1:33C類病毒顆粒具有較好的免疫原性,可在動物體內產生高效價的中和抗體,可以用於製備成預防HPV感染的疫苗。The above test results show that the HPV 35L1:33C virus-like particles prepared by the present invention have good immunogenicity, can produce high-titer neutralizing antibodies in animals, and can be used to prepare vaccines for preventing HPV infection.
實施例6.8:HPV 39L1:59C類病毒顆粒動物免疫原性評價Example 6.8: Evaluation of the immunogenicity of HPV 39L1:59C virus-like particles in animals
實驗方法和步驟與實施例6.1相同。L1序列來源於Uniprot P24838,L2序列來源Uniprot P24839。The experimental methods and steps were the same as those in Example 6.1. The L1 sequence was derived from Uniprot P24838, and the L2 sequence was derived from Uniprot P24839.
HPV39血清假病毒中和效價檢測結果詳見表11。The results of HPV39 serum pseudovirus neutralization titer test are shown in Table 11.
表surface
1111
小鼠血清中和效價檢測結果Mouse serum neutralization titer test results
EC
50 EC 50
((
GMT±SEMGMT±SEM
))
上述檢測結果顯示,本發明製備的HPV 39L1:59C類病毒顆粒具有較好的免疫原性,可在動物體內產生高效價的中和抗體,可以用於製備成預防HPV感染的疫苗。The above test results show that the HPV 39L1:59C virus-like particles prepared by the present invention have good immunogenicity, can produce high-titer neutralizing antibodies in animals, and can be used to prepare vaccines for preventing HPV infection.
實施例6.9:HPV 45L1:33C類病毒顆粒動物免疫原性評價Example 6.9: Evaluation of the immunogenicity of HPV 45L1:33C virus-like particles in animals
實驗方法和步驟與實施例6.1相同。L1序列來源於Uniprot P36741,L2序列來源Uniprot P36761。The experimental methods and steps were the same as those in Example 6.1. The L1 sequence was derived from Uniprot P36741, and the L2 sequence was derived from Uniprot P36761.
HPV45血清假病毒中和效價檢測結果詳見表12。The results of HPV45 serum pseudovirus neutralization titer test are shown in Table 12.
表surface
1212
小鼠血清中和效價檢測結果Mouse serum neutralization titer test results
EC
50 EC 50
((
GMT±SEMGMT±SEM
))
上述檢測結果顯示,本發明製備的HPV 45L1:33C類病毒顆粒具有較好的免疫原性,可在動物體內產生高效價的中和抗體,可以用於製備成預防HPV感染的疫苗。The above test results show that the HPV 45L1:33C virus-like particles prepared by the present invention have good immunogenicity, can produce high-titer neutralizing antibodies in animals, and can be used to prepare vaccines for preventing HPV infection.
實施例6.10:HPV 51L1:33C類病毒顆粒動物免疫原性評價Example 6.10: Evaluation of the immunogenicity of HPV 51L1:33C virus-like particles in animals
實驗方法和步驟與實施例6.1相同。L1序列來源於Uniprot P26536,L2序列來源Uniprot P26539 。The experimental methods and steps were the same as those in Example 6.1. The L1 sequence was from Uniprot P26536, and the L2 sequence was from Uniprot P26539.
HPV51血清假病毒中和效價檢測結果詳見表13。The results of HPV51 serum pseudovirus neutralization titer test are shown in Table 13.
表surface
1313
小鼠血清中和效價檢測結果Mouse serum neutralization titer test results
EC
50 EC 50
((
GMT±SEMGMT±SEM
))
上述檢測結果顯示,本發明製備的HPV 51L1:33C類病毒顆粒具有較好的免疫原性,可在動物體內產生高效價的中和抗體,可以用於製備成預防HPV感染的疫苗。The above test results show that the HPV 51L1:33C virus-like particles prepared by the present invention have good immunogenicity, can produce high-titer neutralizing antibodies in animals, and can be used to prepare vaccines for preventing HPV infection.
實施例6.11:HPV 52L1:33C類病毒顆粒動物免疫原性評價Example 6.11: Evaluation of the immunogenicity of HPV 52L1:33C virus-like particles in animals
實驗方法和步驟與實施例6.1相同。L1序列來源於Uniprot Q05138,L2序列來源Uniprot F8S4U2。The experimental methods and steps were the same as those in Example 6.1. The L1 sequence was from Uniprot Q05138, and the L2 sequence was from Uniprot F8S4U2.
HPV52血清假病毒中和效價檢測結果詳見表14。The results of HPV52 serum pseudovirus neutralization titer test are shown in Table 14.
表surface
1414
小鼠血清中和效價檢測結果Mouse serum neutralization titer test results
EC
50 EC 50
((
GMT±SEMGMT±SEM
))
上述檢測結果顯示,本發明製備的HPV 52L1:33C類病毒顆粒具有較好的免疫原性,可在動物體內產生高效價的中和抗體,可以用於製備成預防HPV感染的疫苗。The above test results show that the HPV 52L1:33C virus-like particles prepared by the present invention have good immunogenicity, can produce high-titer neutralizing antibodies in animals, and can be used to prepare vaccines for preventing HPV infection.
實施例6.12:HPV 56L1:33C類病毒顆粒動物免疫原性評價Example 6.12: Evaluation of the immunogenicity of HPV 56L1:33C virus-like particles in animals
實驗方法和步驟與實施例6.1相同。L1序列來源於Uniprot P36743,L2序列來源Uniprot P36765。The experimental methods and steps were the same as those in Example 6.1. The L1 sequence was from Uniprot P36743, and the L2 sequence was from Uniprot P36765.
HPV56血清假病毒中和效價檢測結果詳見表15。The results of HPV56 serum pseudovirus neutralization titer test are shown in Table 15.
表surface
1515
小鼠血清中和效價檢測結果Mouse serum neutralization titer test results
EC
50 EC 50
((
GMT±SEMGMT±SEM
))
上述檢測結果顯示,本發明製備的HPV 56L1:33C類病毒顆粒具有較好的免疫原性,可在動物體內產生高效價的中和抗體,可以用於製備成預防HPV感染的疫苗。The above test results show that the HPV 56L1:33C virus-like particles prepared by the present invention have good immunogenicity, can produce high-titer neutralizing antibodies in animals, and can be used to prepare vaccines for preventing HPV infection.
實施例6.13:HPV 58L1:33C類病毒顆粒動物免疫原性評價Example 6.13: Evaluation of the immunogenicity of HPV 58L1:33C virus-like particles in animals
實驗方法和步驟與實施例6.1相同。L1序列來源於Uniprot P26535,L2序列來源Uniprot B6ZB12。The experimental methods and steps were the same as those in Example 6.1. The L1 sequence was derived from Uniprot P26535, and the L2 sequence was derived from Uniprot B6ZB12.
HPV58血清假病毒中和效價檢測結果詳見表16。The results of HPV58 serum pseudovirus neutralization titer test are shown in Table 16.
表surface
1616
小鼠血清中和效價檢測結果Mouse serum neutralization titer test results
EC
50 EC 50
((
GMT±SEMGMT±SEM
))
上述檢測結果顯示,本發明製備的HPV 58L1:33C類病毒顆粒具有較好的免疫原性,可在動物體內產生高效價的中和抗體,可以用於製備成預防HPV感染的疫苗。The above test results show that the HPV 58L1:33C virus-like particles prepared by the present invention have good immunogenicity, can produce high-titer neutralizing antibodies in animals, and can be used to prepare vaccines for preventing HPV infection.
實施例6.14:HPV 59L1類病毒顆粒動物免疫原性評價Example 6.14: Evaluation of the immunogenicity of HPV 59L1-like virus particles in animals
實驗方法和步驟與實施例6.1相同。L1序列來源於Uniprot Q81971,L2序列來源Uniprot Q81970。The experimental methods and steps were the same as those in Example 6.1. The L1 sequence was from Uniprot Q81971, and the L2 sequence was from Uniprot Q81970.
HPV59血清假病毒中和效價檢測結果詳見表17。The results of HPV59 serum pseudovirus neutralization titer test are shown in Table 17.
表surface
1717
小鼠血清中和效價檢測結果Mouse serum neutralization titer test results
EC
50 EC 50
((
GMT±SEMGMT±SEM
))
上述檢測結果顯示,本發明製備的HPV 59L1類病毒顆粒具有較好的免疫原性,可在動物體內產生高效價的中和抗體,可以用於製備成預防HPV感染的疫苗。The above test results show that the HPV 59L1 virus-like particles prepared by the present invention have good immunogenicity, can produce high-titer neutralizing antibodies in animals, and can be used to prepare vaccines for preventing HPV infection.
實施例6.15:14價類病毒顆粒免疫組合物動物免疫原性評價Example 6.15: Evaluation of the immunogenicity of 14-valent virus-like particle immune composition in animals
小鼠免疫程序Mouse immunization procedure
將如上所述的14型(HPV 6、11、16、18、31、33、35、39、45、51、52、56、58、59)類病毒顆粒用磷酸鋁空白佐劑在無菌條件下系列稀釋後(以樣品中單個劑量為20 μg的HPV型為標準進行稀釋,將該型樣品稀釋80倍至0.5 μg/mL,其他型樣品的稀釋則隨著各型配比組分的變化而變化),取200 μL用於免疫小鼠,每隻小鼠免疫劑量見表18。將6~8週齡 SPF級雌性Balb/c小鼠按照每組10隻小鼠進行分組(實驗組和對照),於實驗第0天、第7天、第21天分別用稀釋後樣品對各組小鼠進行免疫,樣品免疫一組小鼠,同時設立空白血清對照組,於實驗第28天摘取小鼠眼球取血,分離出血清進行假病毒中和效價檢測。The 14 types (HPV 6, 11, 16, 18, 31, 33, 35, 39, 45, 51, 52, 56, 58, 59) of virus-like particles as described above were serially diluted with aluminum phosphate blank adjuvant under sterile conditions (the HPV type with a single dose of 20 μg in the sample was used as the standard for dilution, and the sample of this type was diluted 80 times to 0.5 μg/mL, and the dilution of other types of samples varied with the changes in the proportions of each type), and 200 μL was taken for immunization of mice. The immunization dose for each mouse is shown in Table 18. SPF female Balb/c mice aged 6-8 weeks were divided into experimental group and control group with 10 mice in each group. The mice in each group were immunized with diluted samples on the 0th, 7th and 21st days of the experiment. One group of mice was immunized with the sample, and a blank serum control group was set up at the same time. On the 28th day of the experiment, the eyeballs of the mice were removed to collect blood, and the serum was separated for pseudovirus neutralization titer detection.
表18 各型HPV疫苗每隻小鼠免疫劑量(µg)
假病毒中和法檢測免疫後小鼠血清中和效價Pseudovirus neutralization assay to detect the neutralization titer of mouse serum after immunization
小鼠血清在56度滅活30分鐘後,離心6000g,5分鐘後取上清進行檢測。檢測前4-8小時,將293FT細胞以15000細胞/孔的密度鋪板於96孔板中,培養於37℃,5%CO 2的二氧化碳培養箱中。免疫後小鼠血清、空白對照血清均用中和培養基系列稀釋後按照體積比1:1分別與預稀釋好的14種假病毒(14型HPV假病毒包括6、11、16、18、31、33、35、39、45、51、52、56、58、59型,其製備分別見實施例6.1-6.14)混合。2~8℃冰箱中孵育1小時後按照100μL/孔加入到提前4-8小時鋪板的293FT細胞上,每個樣品2個複孔,同時設立空白血清對照孔、假病毒陽性對照孔和陰性對照孔。加樣後的細胞繼續在37℃,5%CO 2的二氧化碳培養箱中培養62-96小時後,在酶聯斑點分析儀(型號:S6 Universal-V Analyzer,廠家:CTL)中進行螢光掃描拍照以及計數。透過計算每個小鼠血清樣品的中和抑制率,依據Reed-Muench法計算得到血清中和抑制率為50%時血清最大稀釋倍數,即半數有效稀釋倍數EC 50。檢測結果見圖3。 Mouse serum was inactivated at 56 degrees for 30 minutes, centrifuged at 6000g, and the supernatant was taken for detection after 5 minutes. 4-8 hours before the detection, 293FT cells were plated in a 96-well plate at a density of 15,000 cells/well and cultured in a carbon dioxide incubator at 37°C and 5% CO2 . The immunized mouse serum and blank control serum were diluted in series with neutralization medium and mixed with 14 pre-diluted pseudoviruses (14 HPV pseudoviruses include 6, 11, 16, 18, 31, 33, 35, 39, 45, 51, 52, 56, 58, and 59 types, and their preparation is shown in Examples 6.1-6.14, respectively) at a volume ratio of 1:1. After incubation in a refrigerator at 2~8℃ for 1 hour, 100μL/well was added to the 293FT cells plated 4-8 hours in advance. Each sample had 2 replicate wells, and blank serum control wells, pseudovirus positive control wells and negative control wells were set up at the same time. After the cells were added, they were cultured in a carbon dioxide incubator at 37℃ and 5% CO2 for 62-96 hours, and then scanned and counted in an enzyme-linked spot analyzer (model: S6 Universal-V Analyzer, manufacturer: CTL). By calculating the neutralization inhibition rate of each mouse serum sample, the maximum serum dilution multiple when the serum neutralization inhibition rate was 50% was calculated according to the Reed-Muench method, that is, the half effective dilution multiple EC50 . The test results are shown in Figure 3.
如圖3所示,14價類病毒顆粒免疫組合物能夠產生很好的中和抗體,可以用於製備成預防HPV感染的疫苗。As shown in Figure 3, the 14-valent virus-like particle immune composition can produce good neutralizing antibodies and can be used to prepare a vaccine to prevent HPV infection.
比較例1:C端截短的HPV16L1(1-474)的表達Comparative Example 1: Expression of C-terminally truncated HPV16L1 (1-474)
發明人嘗試將HPV16L1的C端截短31個胺基酸,命名為HPV16L1(1-474)(SEQ ID NO: 27)。但在研究中發現,截短的HPV16L1(1-474)蛋白表達量高但蛋白可溶性差,難以提取純化,具體表達和提取結果見圖4。The inventors tried to truncate the C-terminus of HPV16L1 by 31 amino acids and named it HPV16L1 (1-474) (SEQ ID NO: 27). However, the study found that the truncated HPV16L1 (1-474) protein had high expression levels but poor protein solubility and was difficult to extract and purify. The specific expression and extraction results are shown in Figure 4.
雖然前述已經用說明和實施例的方式對本發明進行了細節描述,但其目的在於理解方便,本發明所屬技術領域中具有通常知識者顯然可以對本發明的技術方案作出的各種變形和改進,而不會偏離附加的申請專利範圍的精神或範圍。Although the present invention has been described in detail above by way of illustration and examples, the purpose is to facilitate understanding. A person having ordinary knowledge in the technical field to which the present invention belongs can obviously make various modifications and improvements to the technical solution of the present invention without deviating from the spirit or scope of the attached patent application.
附錄1:序列表-嵌合的人乳頭瘤病毒6型L1蛋白
附錄2:序列表-嵌合的人乳頭瘤病毒11型L1蛋白
附錄3:序列表-嵌合的人乳頭瘤病毒16型L1蛋白
附錄4:序列表-嵌合的人乳頭瘤病毒18型L1蛋白
附錄5:序列表-嵌合的人乳頭瘤病毒31型L1蛋白
附錄6:序列表-人乳頭瘤病毒33型L1蛋白
附錄7:序列表-嵌合的人乳頭瘤病毒35型L1蛋白
附錄8:序列表-嵌合的人乳頭瘤病毒39型L1蛋白
附錄9:序列表-嵌合的人乳頭瘤病毒45型L1蛋白
附錄10:序列表-嵌合的人乳頭瘤病毒51型L1蛋白
附錄11:序列表-嵌合的人乳頭瘤病毒52型L1蛋白
附錄12:序列表-嵌合的人乳頭瘤病毒56型L1蛋白
附錄13:序列表-嵌合的人乳頭瘤病毒58型L1蛋白
附錄14:序列表-人乳頭瘤病毒59型L1蛋白
無without
圖1A HPV 6 L1:33C的L1蛋白的表達。M:Marker;L:細胞裂解液;E-S:裂解液離心後收集的上清液。Figure 1A Expression of L1 protein of HPV 6 L1:33C. M: Marker; L: Cell lysate; E-S: Supernatant collected after centrifugation of the lysate.
圖1B HPV 11 L1:33C的L1蛋白的表達。M:Marker;L:細胞裂解液;E-S:裂解液離心後收集的上清液。Figure 1B Expression of L1 protein of HPV 11 L1:33C. M: Marker; L: Cell lysate; E-S: Supernatant collected after centrifugation of the lysate.
圖1C HPV 16 L1:33C的L1蛋白的表達。M:Marker;L:細胞裂解液;E-S:裂解液離心後收集的上清液。Fig. 1C Expression of L1 protein of HPV 16 L1:33C. M: Marker; L: Cell lysate; E-S: Supernatant collected after centrifugation of the lysate.
圖1D HPV 18 L1:33C的L1蛋白的表達。M:Marker;L:細胞裂解液;E-S:裂解液離心後收集的上清液。Fig. 1D Expression of L1 protein of HPV 18 L1:33C. M: Marker; L: Cell lysate; E-S: Supernatant collected after centrifugation of the lysate.
圖1E HPV 31 L1:33C的L1蛋白的表達。M:Marker;L:細胞裂解液;E-S:裂解液離心後收集的上清液。Fig. 1E Expression of L1 protein of HPV 31 L1:33C. M: Marker; L: Cell lysate; E-S: Supernatant collected after centrifugation of the lysate.
圖1F HPV 33 L1的L1蛋白的表達。M:Marker;L:細胞裂解液;E-S:裂解液離心後收集的上清液。Fig. 1F Expression of L1 protein of HPV 33 L1. M: Marker; L: Cell lysate; E-S: Supernatant collected after centrifugation of the lysate.
圖1G HPV 35 L1:33C的L1蛋白的表達。M:Marker;L:細胞裂解液;E-S:裂解液離心後收集的上清液。Fig. 1G Expression of L1 protein of HPV 35 L1:33C. M: Marker; L: Cell lysate; E-S: Supernatant collected after centrifugation of the lysate.
圖1H HPV 39 L1:59C的L1蛋白的表達。M:Marker;L:細胞裂解液;E-S:裂解液離心後收集的上清液。Fig. 1H Expression of L1 protein of HPV 39 L1:59C. M: Marker; L: Cell lysate; E-S: Supernatant collected after centrifugation of the lysate.
圖1I HPV 45 L1:33C的L1蛋白的表達。M:Marker;L:細胞裂解液;E-S:裂解液離心後收集的上清液。Figure 1I Expression of L1 protein of HPV 45 L1:33C. M: Marker; L: Cell lysate; E-S: Supernatant collected after centrifugation of the lysate.
圖1J HPV 51 L1:33C的L1蛋白的表達。M:Marker;L:細胞裂解液;E-S:裂解液離心後收集的上清液。Fig. 1J Expression of L1 protein of HPV 51 L1:33C. M: Marker; L: Cell lysate; E-S: Supernatant collected after centrifugation of the lysate.
圖1K HPV 52 L1:33C的L1蛋白的表達。M:Marker;L:細胞裂解液;E-S:裂解液離心後收集的上清液。Figure 1K Expression of L1 protein of HPV 52 L1:33C. M: Marker; L: Cell lysate; E-S: Supernatant collected after centrifugation of the lysate.
圖1L HPV 56 L1:33C的L1蛋白的表達。M:Marker;L:細胞裂解液;E-S:裂解液離心後收集的上清液。Fig. 1L Expression of L1 protein of HPV 56 L1:33C. M: Marker; L: Cell lysate; E-S: Supernatant collected after centrifugation of the lysate.
圖1M HPV 58 L1:33C的L1蛋白的表達。M:Marker;L:細胞裂解液;E-S:裂解液離心後收集的上清液。Fig. 1M Expression of L1 protein of HPV 58 L1:33C. M: Marker; L: Cell lysate; E-S: Supernatant collected after centrifugation of the lysate.
圖1N HPV 59 L1的L1蛋白的表達。M:Marker;L:細胞裂解液;E-S:裂解液離心後收集的上清液。Figure 1N Expression of L1 protein of HPV 59 L1. M: Marker; L: Cell lysate; E-S: Supernatant collected after centrifugation of the lysate.
圖2A 穿透式電子顯微鏡觀察HPV 6 L1:33C類病毒顆粒。Figure 2A Transmission electron microscopy observation of HPV 6 L1:33C virus-like particles.
圖2B 穿透式電子顯微鏡觀察HPV 11 L1:33C類病毒顆粒。Figure 2B: HPV 11 L1:33C virus-like particles observed under transmission electron microscopy.
圖2C 穿透式電子顯微鏡觀察HPV 16 L1:33C類病毒顆粒。Figure 2C Transmission electron microscopy observation of HPV 16 L1:33C virus-like particles.
圖2D 穿透式電子顯微鏡觀察HPV 18 L1:33C類病毒顆粒。Figure 2D: HPV 18 L1:33C virus-like particles observed under transmission electron microscopy.
圖2E 穿透式電子顯微鏡觀察HPV 31 L1:33C類病毒顆粒。Figure 2E HPV 31 L1:33C virus-like particles observed under transmission electron microscopy.
圖2F 穿透式電子顯微鏡觀察HPV 33 L1類病毒顆粒。Figure 2F HPV 33 L1 virus-like particles observed under transmission electron microscopy.
圖2G 穿透式電子顯微鏡觀察HPV 35 L1:33C類病毒顆粒。Figure 2G HPV 35 L1:33C virus-like particles observed under transmission electron microscopy.
圖2H 穿透式電子顯微鏡觀察HPV 39 L1:59C類病毒顆粒。Figure 2H HPV 39 L1:59C virus-like particles observed under transmission electron microscopy.
圖2I 穿透式電子顯微鏡觀察HPV 45 L1:33C類病毒顆粒。Figure 2I HPV 45 L1:33C virus-like particles observed under transmission electron microscopy.
圖2J 穿透式電子顯微鏡觀察HPV 51 L1:33C類病毒顆粒。Figure 2J HPV 51 L1:33C virus-like particles observed under transmission electron microscopy.
圖2K 穿透式電子顯微鏡觀察HPV 52 L1:33C類病毒顆粒。Figure 2K. HPV 52 L1:33C virus-like particles observed under a transmission electron microscope.
圖2L 穿透式電子顯微鏡觀察HPV 56 L1:33C類病毒顆粒。Figure 2L HPV 56 L1:33C virus-like particles observed under transmission electron microscopy.
圖2M 穿透式電子顯微鏡觀察HPV 58 L1:33C類病毒顆粒。Figure 2M Transmission electron microscopy observation of HPV 58 L1:33C virus-like particles.
圖2N 穿透式電子顯微鏡觀察HPV 59 L1類病毒顆粒。Figure 2N HPV 59 L1 virus-like particles observed under a transmission electron microscope.
圖3 14型類病毒顆粒組合物免疫小鼠後的假病毒中和效價。動物數,N=10。GMT(Geometric Mean Titer):幾何平均效價。Fig. 3 Pseudovirus neutralization titer of mice immunized with type 14 virus-like particle composition. Number of animals, N=10. GMT (Geometric Mean Titer): Geometric mean titer.
圖4 C端截短的HPV16L1(1-474)的表達。M:Marker;L:細胞裂解液;E-S:裂解液離心後收集的上清液。Fig. 4 Expression of C-terminally truncated HPV16L1 (1-474). M: Marker; L: Cell lysate; E-S: Supernatant collected after centrifugation of the lysate.
無without
序列表
<![CDATA[<110> 大陸商神州細胞工程有限公司]]>
<![CDATA[<120> 人乳頭瘤病毒多價免疫原性組合物]]>
<![CDATA[<140> TW109135395]]>
<![CDATA[<141> 2020-10-13]]>
<![CDATA[<160> 162]]>
<![CDATA[<170> BiSSAP 1.3.6]]>
<![CDATA[<210> 1]]>
<![CDATA[<211> 469]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 0601 HPV 6型L1蛋白的1-469胺基酸序列]]>
<![CDATA[<400> 1]]>
Met Trp Arg Pro Ser Asp Ser Thr Val Tyr Val Pro Pro Pro Asn Pro
1 5 10 15
Val Ser Lys Val Val Ala Thr Asp Ala Tyr Val Thr Arg Thr Asn Ile
20 25 30
Phe Tyr His Ala Ser Ser Ser Arg Leu Leu Ala Val Gly His Pro Tyr
35 40 45
Phe Ser Ile Lys Arg Ala Asn Lys Thr Val Val Pro Lys Val Ser Gly
50 55 60
Tyr Gln Tyr Arg Val Phe Lys Val Val Leu Pro Asp Pro Asn Lys Phe
65 70 75 80
Ala Leu Pro Asp Ser Ser Leu Phe Asp Pro Thr Thr Gln Arg Leu Val
85 90 95
Trp Ala Cys Thr Gly Leu Glu Val Gly Arg Gly Gln Pro Leu Gly Val
100 105 110
Gly Val Ser Gly His Pro Phe Leu Asn Lys Tyr Asp Asp Val Glu Asn
115 120 125
Ser Gly Ser Gly Gly Asn Pro Gly Gln Asp Asn Arg Val Asn Val Gly
130 135 140
Met Asp Tyr Lys Gln Thr Gln Leu Cys Met Val Gly Cys Ala Pro Pro
145 150 155 160
Leu Gly Glu His Trp Gly Lys Gly Lys Gln Cys Thr Asn Thr Pro Val
165 170 175
Gln Ala Gly Asp Cys Pro Pro Leu Glu Leu Ile Thr Ser Val Ile Gln
180 185 190
Asp Gly Asp Met Val Asp Thr Gly Phe Gly Ala Met Asn Phe Ala Asp
195 200 205
Leu Gln Thr Asn Lys Ser Asp Val Pro Ile Asp Ile Cys Gly Thr Thr
210 215 220
Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ala Ala Asp Pro Tyr Gly Asp
225 230 235 240
Arg Leu Phe Phe Phe Leu Arg Lys Glu Gln Met Phe Ala Arg His Phe
245 250 255
Phe Asn Arg Ala Gly Glu Val Gly Glu Pro Val Pro Asp Thr Leu Ile
260 265 270
Ile Lys Gly Ser Gly Asn Arg Thr Ser Val Gly Ser Ser Ile Tyr Val
275 280 285
Asn Thr Pro Ser Gly Ser Leu Val Ser Ser Glu Ala Gln Leu Phe Asn
290 295 300
Lys Pro Tyr Trp Leu Gln Lys Ala Gln Gly His Asn Asn Gly Ile Cys
305 310 315 320
Trp Gly Asn Gln Leu Phe Val Thr Val Val Asp Thr Thr Arg Ser Thr
325 330 335
Asn Met Thr Leu Cys Ala Ser Val Thr Thr Ser Ser Thr Tyr Thr Asn
340 345 350
Ser Asp Tyr Lys Glu Tyr Met Arg His Val Glu Glu Tyr Asp Leu Gln
355 360 365
Phe Ile Phe Gln Leu Cys Ser Ile Thr Leu Ser Ala Glu Val Met Ala
370 375 380
Tyr Ile His Thr Met Asn Pro Ser Val Leu Glu Asp Trp Asn Phe Gly
385 390 395 400
Leu Ser Pro Pro Pro Asn Gly Thr Leu Glu Asp Thr Tyr Arg Tyr Val
405 410 415
Gln Ser Gln Ala Ile Thr Cys Gln Lys Pro Thr Pro Glu Lys Glu Lys
420 425 430
Pro Asp Pro Tyr Lys Asn Leu Ser Phe Trp Glu Val Asn Leu Lys Glu
435 440 445
Lys Phe Ser Ser Glu Leu Asp Gln Tyr Pro Leu Gly Arg Lys Phe Leu
450 455 460
Leu Gln Ser Gly Tyr
465
<![CDATA[<210> 2]]>
<![CDATA[<211> 26]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 0602 HPV 33型L1蛋白的474-499胺基酸序列]]>
<![CDATA[<400> 2]]>
Lys Ala Lys Pro Lys Leu Lys Arg Ala Ala Pro Thr Ser Thr Arg Thr
1 5 10 15
Ser Ser Ala Lys Arg Lys Lys Val Lys Lys
20 25
<![CDATA[<210> 3]]>
<![CDATA[<211> 495]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 0603 嵌合的HPV 6型L1蛋白的胺基酸序列]]>
<![CDATA[<400> 3]]>
Met Trp Arg Pro Ser Asp Ser Thr Val Tyr Val Pro Pro Pro Asn Pro
1 5 10 15
Val Ser Lys Val Val Ala Thr Asp Ala Tyr Val Thr Arg Thr Asn Ile
20 25 30
Phe Tyr His Ala Ser Ser Ser Arg Leu Leu Ala Val Gly His Pro Tyr
35 40 45
Phe Ser Ile Lys Arg Ala Asn Lys Thr Val Val Pro Lys Val Ser Gly
50 55 60
Tyr Gln Tyr Arg Val Phe Lys Val Val Leu Pro Asp Pro Asn Lys Phe
65 70 75 80
Ala Leu Pro Asp Ser Ser Leu Phe Asp Pro Thr Thr Gln Arg Leu Val
85 90 95
Trp Ala Cys Thr Gly Leu Glu Val Gly Arg Gly Gln Pro Leu Gly Val
100 105 110
Gly Val Ser Gly His Pro Phe Leu Asn Lys Tyr Asp Asp Val Glu Asn
115 120 125
Ser Gly Ser Gly Gly Asn Pro Gly Gln Asp Asn Arg Val Asn Val Gly
130 135 140
Met Asp Tyr Lys Gln Thr Gln Leu Cys Met Val Gly Cys Ala Pro Pro
145 150 155 160
Leu Gly Glu His Trp Gly Lys Gly Lys Gln Cys Thr Asn Thr Pro Val
165 170 175
Gln Ala Gly Asp Cys Pro Pro Leu Glu Leu Ile Thr Ser Val Ile Gln
180 185 190
Asp Gly Asp Met Val Asp Thr Gly Phe Gly Ala Met Asn Phe Ala Asp
195 200 205
Leu Gln Thr Asn Lys Ser Asp Val Pro Ile Asp Ile Cys Gly Thr Thr
210 215 220
Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ala Ala Asp Pro Tyr Gly Asp
225 230 235 240
Arg Leu Phe Phe Phe Leu Arg Lys Glu Gln Met Phe Ala Arg His Phe
245 250 255
Phe Asn Arg Ala Gly Glu Val Gly Glu Pro Val Pro Asp Thr Leu Ile
260 265 270
Ile Lys Gly Ser Gly Asn Arg Thr Ser Val Gly Ser Ser Ile Tyr Val
275 280 285
Asn Thr Pro Ser Gly Ser Leu Val Ser Ser Glu Ala Gln Leu Phe Asn
290 295 300
Lys Pro Tyr Trp Leu Gln Lys Ala Gln Gly His Asn Asn Gly Ile Cys
305 310 315 320
Trp Gly Asn Gln Leu Phe Val Thr Val Val Asp Thr Thr Arg Ser Thr
325 330 335
Asn Met Thr Leu Cys Ala Ser Val Thr Thr Ser Ser Thr Tyr Thr Asn
340 345 350
Ser Asp Tyr Lys Glu Tyr Met Arg His Val Glu Glu Tyr Asp Leu Gln
355 360 365
Phe Ile Phe Gln Leu Cys Ser Ile Thr Leu Ser Ala Glu Val Met Ala
370 375 380
Tyr Ile His Thr Met Asn Pro Ser Val Leu Glu Asp Trp Asn Phe Gly
385 390 395 400
Leu Ser Pro Pro Pro Asn Gly Thr Leu Glu Asp Thr Tyr Arg Tyr Val
405 410 415
Gln Ser Gln Ala Ile Thr Cys Gln Lys Pro Thr Pro Glu Lys Glu Lys
420 425 430
Pro Asp Pro Tyr Lys Asn Leu Ser Phe Trp Glu Val Asn Leu Lys Glu
435 440 445
Lys Phe Ser Ser Glu Leu Asp Gln Tyr Pro Leu Gly Arg Lys Phe Leu
450 455 460
Leu Gln Ser Gly Tyr Lys Ala Lys Pro Lys Leu Lys Arg Ala Ala Pro
465 470 475 480
Thr Ser Thr Arg Thr Ser Ser Ala Lys Arg Lys Lys Val Lys Lys
485 490 495
<![CDATA[<210> 4]]>
<![CDATA[<211> 1488]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 0604 嵌合的HPV 6型L1蛋白的核苷酸序列]]>
<![CDATA[<400> 4]]>
atgtggagac catctgacag cacagtctat gtgcctcctc caaaccctgt gagcaaggtg 60
gtggctacag atgcctatgt gaccaggacc aacatcttct accatgcctc ctccagcaga 120
ctgctggctg tgggacaccc atacttcagc atcaagaggg ctaacaagac agtggtgcca 180
aaggtgtctg gctaccaata cagggtgttc aaggtggtgc tgcctgaccc aaacaagttt 240
gccctgcctg actcctccct gtttgaccca accacccaga gactggtgtg ggcttgtact 300
ggattggagg tgggcagggg acaaccactg ggagtgggag tgtctggaca cccattcctg 360
aacaaatatg atgatgtgga gaactctggc tctggaggca accctggaca agacaacagg 420
gtgaatgtgg ggatggacta caagcagacc caactttgta tggtgggctg tgcccctcca 480
ctgggagaac actggggcaa gggcaagcag tgtaccaaca cacctgtcca ggctggagac 540
tgtcctccat tggaactgat tacctctgtg attcaggatg gagatatggt ggacacaggc 600
tttggagcta tgaactttgc tgacctccaa accaacaagt ctgatgtgcc aattgacatc 660
tgtggcacca cttgtaaata ccctgactac ctccaaatgg ctgctgaccc atatggagac 720
agactgttct tcttcctgag gaaggaacag atgtttgcca gacacttctt caacagggct 780
ggagaggtgg gagaacctgt gcctgacacc ctgattatca agggctctgg caacaggacc 840
tctgtgggct ccagcatcta tgtgaacaca ccatctggct ccctggtgtc ctctgaggct 900
caacttttca acaagccata ctggctccaa aaggctcaag gacacaacaa tggcatctgt 960
tggggcaacc aactttttgt gacagtggtg gacaccacca ggagcaccaa tatgaccctg 1020
tgtgcctctg tgaccacctc cagcacctac accaactctg actacaagga atatatgagg 1080
catgtggagg aatatgacct ccaattcatc ttccaacttt gtagcatcac cctgtctgct 1140
gaggtgatgg cttacatcca cacaatgaac ccatctgtgt tggaggactg gaactttgga 1200
ctgagccctc ctccaaatgg caccttggag gacacctaca gatatgtcca gagccaggct 1260
atcacttgtc agaagccaac acctgagaag gagaagcctg acccatacaa gaacctgtcc 1320
ttctgggagg tgaacctgaa agagaagttc tcctctgaac tggaccaata cccactgggc 1380
aggaagttcc tgctccaatc tggctacaaa gccaagccaa aactgaaaag ggctgcccca 1440
accagcacca ggacctcctc tgccaagagg aagaaggtga agaagtaa 1488
<![CDATA[<210> 5]]>
<![CDATA[<211> 1522]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 0605 合成的HPV6L1基因]]>
<![CDATA[<400> 5]]>
ctgggtacca tgtggagacc atctgacagc acagtctatg tgcctcctcc aaaccctgtg 60
agcaaggtgg tggctacaga tgcctatgtg accaggacca acatcttcta ccatgcctcc 120
tccagcagac tgctggctgt gggacaccca tacttcagca tcaagagggc taacaagaca 180
gtggtgccaa aggtgtctgg ctaccaatac agggtgttca aggtggtgct gcctgaccca 240
aacaagtttg ccctgcctga ctcctccctg tttgacccaa ccacccagag actggtgtgg 300
gcttgtactg gattggaggt gggcagggga caaccactgg gagtgggagt gtctggacac 360
ccattcctga acaaatatga tgatgtggag aactctggct ctggaggcaa ccctggacaa 420
gacaacaggg tgaatgtggg gatggactac aagcagaccc aactttgtat ggtgggctgt 480
gcccctccac tgggagaaca ctggggcaag ggcaagcagt gtaccaacac acctgtccag 540
gctggagact gtcctccatt ggaactgatt acctctgtga ttcaggatgg agatatggtg 600
gacacaggct ttggagctat gaactttgct gacctccaaa ccaacaagtc tgatgtgcca 660
attgacatct gtggcaccac ttgtaaatac cctgactacc tccaaatggc tgctgaccca 720
tatggagaca gactgttctt cttcctgagg aaggaacaga tgtttgccag acacttcttc 780
aacagggctg gagaggtggg agaacctgtg cctgacaccc tgattatcaa gggctctggc 840
aacaggacct ctgtgggctc cagcatctat gtgaacacac catctggctc cctggtgtcc 900
tctgaggctc aacttttcaa caagccatac tggctccaaa aggctcaagg acacaacaat 960
ggcatctgtt ggggcaacca actttttgtg acagtggtgg acaccaccag gagcaccaat 1020
atgaccctgt gtgcctctgt gaccacctcc agcacctaca ccaactctga ctacaaggaa 1080
tatatgaggc atgtggagga atatgacctc caattcatct tccaactttg tagcatcacc 1140
ctgtctgctg aggtgatggc ttacatccac acaatgaacc catctgtgtt ggaggactgg 1200
aactttggac tgagccctcc tccaaatggc accttggagg acacctacag atatgtccag 1260
agccaggcta tcacttgtca gaagccaaca cctgagaagg agaagcctga cccatacaag 1320
aacctgtcct tctgggaggt gaacctgaaa gagaagttct cctctgaact ggaccaatac 1380
ccactgggca ggaagttcct gctccaatct ggctacaggg gcaggtccag catcaggaca 1440
ggagtgaaga gacctgctgt gagcaaggca tctgctgccc caaagaggaa gagggctaag 1500
accaagaggt aaactcgagc tc 1522
<![CDATA[<210> 6]]>
<![CDATA[<211> 1519]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<22]]>3> 0606 合成的HPV33L1基因]]>
<br/>
<br/><![CDATA[<400> 6]]>
<br/><![CDATA[ctgggtacca tgagtgtgtg gagaccatct gaggctacag tctacctgcc tcctgtgcct 60
gtgagcaagg tggtgagcac agatgaatat gtgagcagga ccagcatcta ctactatgct 120
ggctccagca gactgctggc tgtgggacac ccatacttca gcatcaagaa cccaaccaat 180
gccaagaaac tgctggtgcc aaaggtgtct ggactccaat acagggtgtt cagggtgaga 240
ctgcctgacc caaacaagtt tggctttcct gacacctcct tctacaaccc tgacacccag 300
agactggtgt gggcttgtgt gggattggag attggcaggg gacaaccact gggagtgggc 360
atctctggac acccactgct gaacaagttt gatgacacag agacaggcaa caaataccct 420
ggacaacctg gagcagacaa cagggagtgt ctgagtatgg actacaagca gacccaactt 480
tgtctgctgg gctgtaagcc tccaacagga gaacactggg gcaagggagt ggcttgtacc 540
aatgctgccc ctgccaatga ctgtcctcca ttggaactga taaacaccat cattgaggat 600
ggagatatgg tggacacagg ctttggctgt atggacttca agaccctcca agccaacaag 660
tctgatgtgc caattgacat ctgtggcagc acttgtaaat accctgacta cctgaaaatg 720
acctctgaac catatggaga ctccctgttc ttcttcctga ggagggaaca gatgtttgtg 780
agacacttct tcaacagggc tggcaccctg ggagaggctg tgcctgatga cctctacatc 840
aagggctctg gcaccacagc cagcatccag tcctctgcct tctttccaac accatctggc 900
agtatggtga cctctgagag ccaacttttc aacaagccat actggctcca aagggctcaa 960
ggacacaaca atggcatctg ttggggcaac caggtgtttg tgacagtggt ggacaccacc 1020
aggagcacca atatgaccct gtgtacccag gtgacctctg acagcaccta caagaatgag 1080
aacttcaagg aatacatcag gcatgtggag gaatatgacc tccaatttgt gttccaactt 1140
tgtaaggtga ccctgacagc agaggtgatg acctacatcc atgctatgaa ccctgacatc 1200
ttggaggact ggcagtttgg actgacacct cctccatctg cctccctcca agacacctac 1260
aggtttgtga ccagccaggc tatcacttgt cagaagacag tgcctccaaa ggagaaggag 1320
gacccactgg gcaaatacac cttctgggag gtggacctga aagagaagtt ctctgctgac 1380
ctggaccagt ttccactggg caggaagttc ctgctccaag caggactgaa agccaagcca 1440
aaactgaaaa gggctgcccc aaccagcacc aggacctcct ctgccaagag gaagaaggtg 1500
aagaagtaaa ctcgagctc 1519
<![CDATA[<210> 7]]>
<![CDATA[<211> 35]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 0607 HPV6L1 F1]]>
<![CDATA[<400> 7]]>
cttggtacca tgtggagacc atctgacagc acagt 35
<![CDATA[<210> 8]]>
<![CDATA[<211> 36]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 0608 HPV6L1 R1]]>
<![CDATA[<400> 8]]>
gcttggcttt gtagccagat tggagcagga acttcc 36
<![CDATA[<210> 9]]>
<![CDATA[<211> 1426]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 0609 HPV6L1擴增序列1]]>
<![CDATA[<400> 9]]>
cttggtacca tgtggagacc atctgacagc acagtctatg tgcctcctcc aaaccctgtg 60
agcaaggtgg tggctacaga tgcctatgtg accaggacca acatcttcta ccatgcctcc 120
tccagcagac tgctggctgt gggacaccca tacttcagca tcaagagggc taacaagaca 180
gtggtgccaa aggtgtctgg ctaccaatac agggtgttca aggtggtgct gcctgaccca 240
aacaagtttg ccctgcctga ctcctccctg tttgacccaa ccacccagag actggtgtgg 300
gcttgtactg gattggaggt gggcagggga caaccactgg gagtgggagt gtctggacac 360
ccattcctga acaaatatga tgatgtggag aactctggct ctggaggcaa ccctggacaa 420
gacaacaggg tgaatgtggg gatggactac aagcagaccc aactttgtat ggtgggctgt 480
gcccctccac tgggagaaca ctggggcaag ggcaagcagt gtaccaacac acctgtccag 540
gctggagact gtcctccatt ggaactgatt acctctgtga ttcaggatgg agatatggtg 600
gacacaggct ttggagctat gaactttgct gacctccaaa ccaacaagtc tgatgtgcca 660
attgacatct gtggcaccac ttgtaaatac cctgactacc tccaaatggc tgctgaccca 720
tatggagaca gactgttctt cttcctgagg aaggaacaga tgtttgccag acacttcttc 780
aacagggctg gagaggtggg agaacctgtg cctgacaccc tgattatcaa gggctctggc 840
aacaggacct ctgtgggctc cagcatctat gtgaacacac catctggctc cctggtgtcc 900
tctgaggctc aacttttcaa caagccatac tggctccaaa aggctcaagg acacaacaat 960
ggcatctgtt ggggcaacca actttttgtg acagtggtgg acaccaccag gagcaccaat 1020
atgaccctgt gtgcctctgt gaccacctcc agcacctaca ccaactctga ctacaaggaa 1080
tatatgaggc atgtggagga atatgacctc caattcatct tccaactttg tagcatcacc 1140
ctgtctgctg aggtgatggc ttacatccac acaatgaacc catctgtgtt ggaggactgg 1200
aactttggac tgagccctcc tccaaatggc accttggagg acacctacag atatgtccag 1260
agccaggcta tcacttgtca gaagccaaca cctgagaagg agaagcctga cccatacaag 1320
aacctgtcct tctgggaggt gaacctgaaa gagaagttct cctctgaact ggaccaatac 1380
ccactgggca ggaagttcct gctccaatct ggctacaaag ccaagc 1426
<![CDATA[<210> 10]]>
<![CDATA[<211> 35]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 0610 HPV6L1 F2]]>
<![CDATA[<400> 10]]>
atctggctac aaagccaagc caaaactgaa aaggg 35
<![CDATA[<210> 11]]>
<![CDATA[<211> 37]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 0611 HPV6L1 R2]]>
<![CDATA[<400> 11]]>
ctgtctagat ttacttcttc accttcttcc tcttggc 37
<![CDATA[<210> 12]]>
<![CDATA[<211> 101]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 0612 HPV6L1擴增序列2]]>
<![CDATA[<400> 12]]>
atctggctac aaagccaagc caaaactgaa aagggctgcc ccaaccagca ccaggacctc 60
ctctgccaag aggaagaagg tgaagaagta aatctagaca g 101
<![CDATA[<210> 13]]>
<![CDATA[<211> 38]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 0613 HPV 59型L1蛋白的471-508胺基酸序列]]>
<![CDATA[<400> 13]]>
Leu Gln Leu Gly Ala Arg Pro Lys Pro Thr Ile Gly Pro Arg Lys Arg
1 5 10 15
Ala Ala Pro Ala Pro Thr Ser Thr Pro Ser Pro Lys Arg Val Lys Arg
20 25 30
Arg Lys Ser Ser Arg Lys
35
<![CDATA[<210> 14]]>
<![CDATA[<211> 470]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1101 HPV 11型L1蛋白的1-470胺基酸序列]]>
<![CDATA[<400> 14]]>
Met Trp Arg Pro Ser Asp Ser Thr Val Tyr Val Pro Pro Pro Asn Pro
1 5 10 15
Val Ser Lys Val Val Ala Thr Asp Ala Tyr Val Lys Arg Thr Asn Ile
20 25 30
Phe Tyr His Ala Ser Ser Ser Arg Leu Leu Ala Val Gly His Pro Tyr
35 40 45
Tyr Ser Ile Lys Lys Val Asn Lys Thr Val Val Pro Lys Val Ser Gly
50 55 60
Tyr Gln Tyr Arg Val Phe Lys Val Val Leu Pro Asp Pro Asn Lys Phe
65 70 75 80
Ala Leu Pro Asp Ser Ser Leu Phe Asp Pro Thr Thr Gln Arg Leu Val
85 90 95
Trp Ala Cys Thr Gly Leu Glu Val Gly Arg Gly Gln Pro Leu Gly Val
100 105 110
Gly Val Ser Gly His Pro Leu Leu Asn Lys Tyr Asp Asp Val Glu Asn
115 120 125
Ser Gly Gly Tyr Gly Gly Asn Pro Gly Gln Asp Asn Arg Val Asn Val
130 135 140
Gly Met Asp Tyr Lys Gln Thr Gln Leu Cys Met Val Gly Cys Ala Pro
145 150 155 160
Pro Leu Gly Glu His Trp Gly Lys Gly Thr Gln Cys Ser Asn Thr Ser
165 170 175
Val Gln Asn Gly Asp Cys Pro Pro Leu Glu Leu Ile Thr Ser Val Ile
180 185 190
Gln Asp Gly Asp Met Val Asp Thr Gly Phe Gly Ala Met Asn Phe Ala
195 200 205
Asp Leu Gln Thr Asn Lys Ser Asp Val Pro Leu Asp Ile Cys Gly Thr
210 215 220
Val Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ala Ala Asp Pro Tyr Gly
225 230 235 240
Asp Arg Leu Phe Phe Tyr Leu Arg Lys Glu Gln Met Phe Ala Arg His
245 250 255
Phe Phe Asn Arg Ala Gly Thr Val Gly Glu Pro Val Pro Asp Asp Leu
260 265 270
Leu Val Lys Gly Gly Asn Asn Arg Ser Ser Val Ala Ser Ser Ile Tyr
275 280 285
Val His Thr Pro Ser Gly Ser Leu Val Ser Ser Glu Ala Gln Leu Phe
290 295 300
Asn Lys Pro Tyr Trp Leu Gln Lys Ala Gln Gly His Asn Asn Gly Ile
305 310 315 320
Cys Trp Gly Asn His Leu Phe Val Thr Val Val Asp Thr Thr Arg Ser
325 330 335
Thr Asn Met Thr Leu Cys Ala Ser Val Ser Lys Ser Ala Thr Tyr Thr
340 345 350
Asn Ser Asp Tyr Lys Glu Tyr Met Arg His Val Glu Glu Phe Asp Leu
355 360 365
Gln Phe Ile Phe Gln Leu Cys Ser Ile Thr Leu Ser Ala Glu Val Met
370 375 380
Ala Tyr Ile His Thr Met Asn Pro Ser Val Leu Glu Asp Trp Asn Phe
385 390 395 400
Gly Leu Ser Pro Pro Pro Asn Gly Thr Leu Glu Asp Thr Tyr Arg Tyr
405 410 415
Val Gln Ser Gln Ala Ile Thr Cys Gln Lys Pro Thr Pro Glu Lys Glu
420 425 430
Lys Gln Asp Pro Tyr Lys Asp Met Ser Phe Trp Glu Val Asn Leu Lys
435 440 445
Glu Lys Phe Ser Ser Glu Leu Asp Gln Phe Pro Leu Gly Arg Lys Phe
450 455 460
Leu Leu Gln Ser Gly Tyr
465 470
<![CDATA[<210> 15]]>
<![CDATA[<211> 26]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1102 HPV 33型L1蛋白的474-499胺基酸序列]]>
<![CDATA[<400> 15]]>
Lys Ala Lys Pro Lys Leu Lys Arg Ala Ala Pro Thr Ser Thr Arg Thr
1 5 10 15
Ser Ser Ala Lys Arg Lys Lys Val Lys Lys
20 25
<![CDATA[<210> 16]]>
<![CDATA[<211> 496]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1103 嵌合的HPV 11型L1蛋白的胺基酸序列]]>
<![CDATA[<400> 16]]>
Met Trp Arg Pro Ser Asp Ser Thr Val Tyr Val Pro Pro Pro Asn Pro
1 5 10 15
Val Ser Lys Val Val Ala Thr Asp Ala Tyr Val Lys Arg Thr Asn Ile
20 25 30
Phe Tyr His Ala Ser Ser Ser Arg Leu Leu Ala Val Gly His Pro Tyr
35 40 45
Tyr Ser Ile Lys Lys Val Asn Lys Thr Val Val Pro Lys Val Ser Gly
50 55 60
Tyr Gln Tyr Arg Val Phe Lys Val Val Leu Pro Asp Pro Asn Lys Phe
65 70 75 80
Ala Leu Pro Asp Ser Ser Leu Phe Asp Pro Thr Thr Gln Arg Leu Val
85 90 95
Trp Ala Cys Thr Gly Leu Glu Val Gly Arg Gly Gln Pro Leu Gly Val
100 105 110
Gly Val Ser Gly His Pro Leu Leu Asn Lys Tyr Asp Asp Val Glu Asn
115 120 125
Ser Gly Gly Tyr Gly Gly Asn Pro Gly Gln Asp Asn Arg Val Asn Val
130 135 140
Gly Met Asp Tyr Lys Gln Thr Gln Leu Cys Met Val Gly Cys Ala Pro
145 150 155 160
Pro Leu Gly Glu His Trp Gly Lys Gly Thr Gln Cys Ser Asn Thr Ser
165 170 175
Val Gln Asn Gly Asp Cys Pro Pro Leu Glu Leu Ile Thr Ser Val Ile
180 185 190
Gln Asp Gly Asp Met Val Asp Thr Gly Phe Gly Ala Met Asn Phe Ala
195 200 205
Asp Leu Gln Thr Asn Lys Ser Asp Val Pro Leu Asp Ile Cys Gly Thr
210 215 220
Val Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ala Ala Asp Pro Tyr Gly
225 230 235 240
Asp Arg Leu Phe Phe Tyr Leu Arg Lys Glu Gln Met Phe Ala Arg His
245 250 255
Phe Phe Asn Arg Ala Gly Thr Val Gly Glu Pro Val Pro Asp Asp Leu
260 265 270
Leu Val Lys Gly Gly Asn Asn Arg Ser Ser Val Ala Ser Ser Ile Tyr
275 280 285
Val His Thr Pro Ser Gly Ser Leu Val Ser Ser Glu Ala Gln Leu Phe
290 295 300
Asn Lys Pro Tyr Trp Leu Gln Lys Ala Gln Gly His Asn Asn Gly Ile
305 310 315 320
Cys Trp Gly Asn His Leu Phe Val Thr Val Val Asp Thr Thr Arg Ser
325 330 335
Thr Asn Met Thr Leu Cys Ala Ser Val Ser Lys Ser Ala Thr Tyr Thr
340 345 350
Asn Ser Asp Tyr Lys Glu Tyr Met Arg His Val Glu Glu Phe Asp Leu
355 360 365
Gln Phe Ile Phe Gln Leu Cys Ser Ile Thr Leu Ser Ala Glu Val Met
370 375 380
Ala Tyr Ile His Thr Met Asn Pro Ser Val Leu Glu Asp Trp Asn Phe
385 390 395 400
Gly Leu Ser Pro Pro Pro Asn Gly Thr Leu Glu Asp Thr Tyr Arg Tyr
405 410 415
Val Gln Ser Gln Ala Ile Thr Cys Gln Lys Pro Thr Pro Glu Lys Glu
420 425 430
Lys Gln Asp Pro Tyr Lys Asp Met Ser Phe Trp Glu Val Asn Leu Lys
435 440 445
Glu Lys Phe Ser Ser Glu Leu Asp Gln Phe Pro Leu Gly Arg Lys Phe
450 455 460
Leu Leu Gln Ser Gly Tyr Lys Ala Lys Pro Lys Leu Lys Arg Ala Ala
465 470 475 480
Pro Thr Ser Thr Arg Thr Ser Ser Ala Lys Arg Lys Lys Val Lys Lys
485 490 495
<![CDATA[<210> 17]]>
<![CDATA[<211> 1492]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1104 嵌合的HPV 11型L1蛋白的核苷酸序列]]>
<![CDATA[<400> 17]]>
atgtggagac catctgacag cacagtctat gtgcctcctc caaaccctgt gagcaaggtg 60
gtggctacag atgcctatgt gaagaggacc aacatcttct accatgcctc ctccagcaga 120
ctgctggctg tgggacaccc atactacagc atcaagaagg tgaacaagac agtggtgcca 180
aaggtgtctg gctaccaata cagggtgttc aaggtggtgc tgcctgaccc aaacaagttt 240
gccctgcctg actcctccct gtttgaccca accacccaga gactggtgtg ggcttgtact 300
ggattggagg tgggcagggg acaaccactg ggagtgggag tgtctggaca cccactgctg 360
aacaaatatg atgatgtgga gaactctgga ggctatggag gcaaccctgg acaagacaac 420
agggtgaatg tggggatgga ctacaagcag acccaacttt gtatggtggg ctgtgcccct 480
ccactgggag aacactgggg caagggcacc cagtgtagca acacctctgt ccagaatgga 540
gactgtcctc cattggaact gattacctct gtgattcagg atggagatat ggtggacaca 600
ggctttggag ctatgaactt tgctgacctc caaaccaaca agtctgatgt gccactggac 660
atctgtggca cagtgtgtaa ataccctgac tacctccaaa tggctgctga cccatatgga 720
gacagactgt tcttctacct gaggaaggaa cagatgtttg ccagacactt cttcaacagg 780
gctggcacag tgggagaacc tgtgcctgat gacctgctgg tgaagggagg caacaacagg 840
tcctctgtgg catccagcat ctatgtgcat acaccatctg gctccctggt gtcctctgag 900
gctcaacttt tcaacaagcc atactggctc caaaaggctc aaggacacaa caatggcatc 960
tgttggggca accacctgtt tgtgacagtg gtggacacca ccaggagcac caatatgacc 1020
ctgtgtgcct ctgtgagcaa gtctgccacc tacaccaact ctgactacaa ggaatatatg 1080
aggcatgtgg aggagtttga cctccaattc atcttccaac tttgtagcat caccctgtct 1140
gctgaggtga tggcttacat ccacacaatg aacccatctg tgttggagga ctggaacttt 1200
ggactgagcc ctcctccaaa tggcaccttg gaggacacct acagatatgt ccagagccag 1260
gctatcactt gtcagaagcc aacacctgag aaggagaagc aggacccata caaggatatg 1320
agtttctggg aggtgaacct gaaagagaag ttctcctctg aactggacca gtttccactg 1380
ggcaggaagt tcctgctcca atctggctac aaagccaagc caaaactgaa aagggctgcc 1440
ccaaccagca ccaggacctc ctctgccaag aggaagaagg tgaagaagta aa 1492
<![CDATA[<210> 18]]>
<![CDATA[<211> 1525]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1105 合成的HPV11L1基因]]>
<![CDATA[<400> 18]]>
ctgggtacca tgtggagacc atctgacagc acagtctatg tgcctcctcc aaaccctgtg 60
agcaaggtgg tggctacaga tgcctatgtg aagaggacca acatcttcta ccatgcctcc 120
tccagcagac tgctggctgt gggacaccca tactacagca tcaagaaggt gaacaagaca 180
gtggtgccaa aggtgtctgg ctaccaatac agggtgttca aggtggtgct gcctgaccca 240
aacaagtttg ccctgcctga ctcctccctg tttgacccaa ccacccagag actggtgtgg 300
gcttgtactg gattggaggt gggcagggga caaccactgg gagtgggagt gtctggacac 360
ccactgctga acaaatatga tgatgtggag aactctggag gctatggagg caaccctgga 420
caagacaaca gggtgaatgt ggggatggac tacaagcaga cccaactttg tatggtgggc 480
tgtgcccctc cactgggaga acactggggc aagggcaccc agtgtagcaa cacctctgtc 540
cagaatggag actgtcctcc attggaactg attacctctg tgattcagga tggagatatg 600
gtggacacag gctttggagc tatgaacttt gctgacctcc aaaccaacaa gtctgatgtg 660
ccactggaca tctgtggcac agtgtgtaaa taccctgact acctccaaat ggctgctgac 720
ccatatggag acagactgtt cttctacctg aggaaggaac agatgtttgc cagacacttc 780
ttcaacaggg ctggcacagt gggagaacct gtgcctgatg acctgctggt gaagggaggc 840
aacaacaggt cctctgtggc atccagcatc tatgtgcata caccatctgg ctccctggtg 900
tcctctgagg ctcaactttt caacaagcca tactggctcc aaaaggctca aggacacaac 960
aatggcatct gttggggcaa ccacctgttt gtgacagtgg tggacaccac caggagcacc 1020
aatatgaccc tgtgtgcctc tgtgagcaag tctgccacct acaccaactc tgactacaag 1080
gaatatatga ggcatgtgga ggagtttgac ctccaattca tcttccaact ttgtagcatc 1140
accctgtctg ctgaggtgat ggcttacatc cacacaatga acccatctgt gttggaggac 1200
tggaactttg gactgagccc tcctccaaat ggcaccttgg aggacaccta cagatatgtc 1260
cagagccagg ctatcacttg tcagaagcca acacctgaga aggagaagca ggacccatac 1320
aaggatatga gtttctggga ggtgaacctg aaagagaagt tctcctctga actggaccag 1380
tttccactgg gcaggaagtt cctgctccaa tctggctaca ggggcaggac ctctgccagg 1440
acaggcatca agagacctgc tgtgagcaag ccaagcacag ccccaaagag gaagaggacc 1500
aagaccaaga agtaaactcg agctc 1525
<![CDATA[<210> 19]]>
<![CDATA[<211> 1519]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1106 合成的HPV33L1基因]]>
<![CDATA[<400> 19]]>
ctgggtacca tgagtgtgtg gagaccatct gaggctacag tctacctgcc tcctgtgcct 60
gtgagcaagg tggtgagcac agatgaatat gtgagcagga ccagcatcta ctactatgct 120
ggctccagca gactgctggc tgtgggacac ccatacttca gcatcaagaa cccaaccaat 180
gccaagaaac tgctggtgcc aaaggtgtct ggactccaat acagggtgtt cagggtgaga 240
ctgcctgacc caaacaagtt tggctttcct gacacctcct tctacaaccc tgacacccag 300
agactggtgt gggcttgtgt gggattggag attggcaggg gacaaccact gggagtgggc 360
atctctggac acccactgct gaacaagttt gatgacacag agacaggcaa caaataccct 420
ggacaacctg gagcagacaa cagggagtgt ctgagtatgg actacaagca gacccaactt 480
tgtctgctgg gctgtaagcc tccaacagga gaacactggg gcaagggagt ggcttgtacc 540
aatgctgccc ctgccaatga ctgtcctcca ttggaactga taaacaccat cattgaggat 600
ggagatatgg tggacacagg ctttggctgt atggacttca agaccctcca agccaacaag 660
tctgatgtgc caattgacat ctgtggcagc acttgtaaat accctgacta cctgaaaatg 720
acctctgaac catatggaga ctccctgttc ttcttcctga ggagggaaca gatgtttgtg 780
agacacttct tcaacagggc tggcaccctg ggagaggctg tgcctgatga cctctacatc 840
aagggctctg gcaccacagc cagcatccag tcctctgcct tctttccaac accatctggc 900
agtatggtga cctctgagag ccaacttttc aacaagccat actggctcca aagggctcaa 960
ggacacaaca atggcatctg ttggggcaac caggtgtttg tgacagtggt ggacaccacc 1020
aggagcacca atatgaccct gtgtacccag gtgacctctg acagcaccta caagaatgag 1080
aacttcaagg aatacatcag gcatgtggag gaatatgacc tccaatttgt gttccaactt 1140
tgtaaggtga ccctgacagc agaggtgatg acctacatcc atgctatgaa ccctgacatc 1200
ttggaggact ggcagtttgg actgacacct cctccatctg cctccctcca agacacctac 1260
aggtttgtga ccagccaggc tatcacttgt cagaagacag tgcctccaaa ggagaaggag 1320
gacccactgg gcaaatacac cttctgggag gtggacctga aagagaagtt ctctgctgac 1380
ctggaccagt ttccactggg caggaagttc ctgctccaag caggactgaa agccaagcca 1440
aaactgaaaa gggctgcccc aaccagcacc aggacctcct ctgccaagag gaagaaggtg 1500
aagaagtaaa ctcgagctc 1519
<![CDATA[<210> 20]]>
<![CDATA[<211> 35]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1107 HPV11L1 F1]]>
<![CDATA[<400> 20]]>
cttggtacca tgtggagacc atctgacagc acagt 35
<![CDATA[<21]]>0> 21]]>
<br/><![CDATA[<211> 36]]>
<br/><![CDATA[<212> DNA]]>
<br/><![CDATA[<213> 人工序列]]>
<br/>
<br/>
<br/><![CDATA[<220> ]]>
<br/><![CDATA[<223> 1108 HPV11L1 R1]]>
<br/>
<br/><![CDATA[<400> 21]]>
<br/><![CDATA[gcttggcttt gtagccagat tggagcagga acttcc 36
<![CDATA[<210> 22]]>
<![CDATA[<211> 1429]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1109 HPV11L1擴增序列1]]>
<![CDATA[<400> 22]]>
cttggtacca tgtggagacc atctgacagc acagtctatg tgcctcctcc aaaccctgtg 60
agcaaggtgg tggctacaga tgcctatgtg aagaggacca acatcttcta ccatgcctcc 120
tccagcagac tgctggctgt gggacaccca tactacagca tcaagaaggt gaacaagaca 180
gtggtgccaa aggtgtctgg ctaccaatac agggtgttca aggtggtgct gcctgaccca 240
aacaagtttg ccctgcctga ctcctccctg tttgacccaa ccacccagag actggtgtgg 300
gcttgtactg gattggaggt gggcagggga caaccactgg gagtgggagt gtctggacac 360
ccactgctga acaaatatga tgatgtggag aactctggag gctatggagg caaccctgga 420
caagacaaca gggtgaatgt ggggatggac tacaagcaga cccaactttg tatggtgggc 480
tgtgcccctc cactgggaga acactggggc aagggcaccc agtgtagcaa cacctctgtc 540
cagaatggag actgtcctcc attggaactg attacctctg tgattcagga tggagatatg 600
gtggacacag gctttggagc tatgaacttt gctgacctcc aaaccaacaa gtctgatgtg 660
ccactggaca tctgtggcac agtgtgtaaa taccctgact acctccaaat ggctgctgac 720
ccatatggag acagactgtt cttctacctg aggaaggaac agatgtttgc cagacacttc 780
ttcaacaggg ctggcacagt gggagaacct gtgcctgatg acctgctggt gaagggaggc 840
aacaacaggt cctctgtggc atccagcatc tatgtgcata caccatctgg ctccctggtg 900
tcctctgagg ctcaactttt caacaagcca tactggctcc aaaaggctca aggacacaac 960
aatggcatct gttggggcaa ccacctgttt gtgacagtgg tggacaccac caggagcacc 1020
aatatgaccc tgtgtgcctc tgtgagcaag tctgccacct acaccaactc tgactacaag 1080
gaatatatga ggcatgtgga ggagtttgac ctccaattca tcttccaact ttgtagcatc 1140
accctgtctg ctgaggtgat ggcttacatc cacacaatga acccatctgt gttggaggac 1200
tggaactttg gactgagccc tcctccaaat ggcaccttgg aggacaccta cagatatgtc 1260
cagagccagg ctatcacttg tcagaagcca acacctgaga aggagaagca ggacccatac 1320
aaggatatga gtttctggga ggtgaacctg aaagagaagt tctcctctga actggaccag 1380
tttccactgg gcaggaagtt cctgctccaa tctggctaca aagccaagc 1429
<![CDATA[<210> 23]]>
<![CDATA[<211> 35]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1110 HPV11L1 F2]]>
<![CDATA[<400> 23]]>
atctggctac aaagccaagc caaaactgaa aaggg 35
<![CDATA[<210> 24]]>
<![CDATA[<211]]>> 37]]>
<br/><![CDATA[<212> DNA]]>
<br/><![CDATA[<213> 人工序列]]>
<br/>
<br/>
<br/><![CDATA[<220> ]]>
<br/><![CDATA[<223> 1111 HPV11L1 R2]]>
<br/>
<br/><![CDATA[<400> 24]]>
<br/><![CDATA[ctgtctagat ttacttcttc accttcttcc tcttggc 37
<![CDATA[<210> 25]]>
<![CDATA[<211> 101]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1112 HPV11L1擴增序列2]]>
<![CDATA[<400> 25]]>
atctggctac aaagccaagc caaaactgaa aagggctgcc ccaaccagca ccaggacctc 60
ctctgccaag aggaagaagg tgaagaagta aatctagaca g 101
<![CDATA[<210> 26]]>
<![CDATA[<211> 38]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1113 HPV 59型L1蛋白的471-508胺基酸序列]]>
<![CDATA[<400> 26]]>
Leu Gln Leu Gly Ala Arg Pro Lys Pro Thr Ile Gly Pro Arg Lys Arg
1 5 10 15
Ala Ala Pro Ala Pro Thr Ser Thr Pro Ser Pro Lys Arg Val Lys Arg
20 25 30
Arg Lys Ser Ser Arg Lys
35
<![CDATA[<210> 27]]>
<![CDATA[<211> 474]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1601 HPV 16型L1蛋白的1-474胺基酸序列]]>
<![CDATA[<400> 27]]>
Met Ser Leu Trp Leu Pro Ser Glu Ala Thr Val Tyr Leu Pro Pro Val
1 5 10 15
Pro Val Ser Lys Val Val Ser Thr Asp Glu Tyr Val Ala Arg Thr Asn
20 25 30
Ile Tyr Tyr His Ala Gly Thr Ser Arg Leu Leu Ala Val Gly His Pro
35 40 45
Tyr Phe Pro Ile Lys Lys Pro Asn Asn Asn Lys Ile Leu Val Pro Lys
50 55 60
Val Ser Gly Leu Gln Tyr Arg Val Phe Arg Ile His Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Phe Pro Asp Thr Ser Phe Tyr Asn Pro Asp Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Val Gly Val Glu Val Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Ile Ser Gly His Pro Leu Leu Asn Lys Leu Asp Asp
115 120 125
Thr Glu Asn Ala Ser Ala Tyr Ala Ala Asn Ala Gly Val Asp Asn Arg
130 135 140
Glu Cys Ile Ser Met Asp Tyr Lys Gln Thr Gln Leu Cys Leu Ile Gly
145 150 155 160
Cys Lys Pro Pro Ile Gly Glu His Trp Gly Lys Gly Ser Pro Cys Thr
165 170 175
Asn Val Ala Val Asn Pro Gly Asp Cys Pro Pro Leu Glu Leu Ile Asn
180 185 190
Thr Val Ile Gln Asp Gly Asp Met Val Asp Thr Gly Phe Gly Ala Met
195 200 205
Asp Phe Thr Thr Leu Gln Ala Asn Lys Ser Glu Val Pro Leu Asp Ile
210 215 220
Cys Thr Ser Ile Cys Lys Tyr Pro Asp Tyr Ile Lys Met Val Ser Glu
225 230 235 240
Pro Tyr Gly Asp Ser Leu Phe Phe Tyr Leu Arg Arg Glu Gln Met Phe
245 250 255
Val Arg His Leu Phe Asn Arg Ala Gly Ala Val Gly Glu Asn Val Pro
260 265 270
Asp Asp Leu Tyr Ile Lys Gly Ser Gly Ser Thr Ala Asn Leu Ala Ser
275 280 285
Ser Asn Tyr Phe Pro Thr Pro Ser Gly Ser Met Val Thr Ser Asp Ala
290 295 300
Gln Ile Phe Asn Lys Pro Tyr Trp Leu Gln Arg Ala Gln Gly His Asn
305 310 315 320
Asn Gly Ile Cys Trp Gly Asn Gln Leu Phe Val Thr Val Val Asp Thr
325 330 335
Thr Arg Ser Thr Asn Met Ser Leu Cys Ala Ala Ile Ser Thr Ser Glu
340 345 350
Thr Thr Tyr Lys Asn Thr Asn Phe Lys Glu Tyr Leu Arg His Gly Glu
355 360 365
Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Lys Ile Thr Leu Thr
370 375 380
Ala Asp Val Met Thr Tyr Ile His Ser Met Asn Ser Thr Ile Leu Glu
385 390 395 400
Asp Trp Asn Phe Gly Leu Gln Pro Pro Pro Gly Gly Thr Leu Glu Asp
405 410 415
Thr Tyr Arg Phe Val Thr Ser Gln Ala Ile Ala Cys Gln Lys His Thr
420 425 430
Pro Pro Ala Pro Lys Glu Asp Pro Leu Lys Lys Tyr Thr Phe Trp Glu
435 440 445
Val Asn Leu Lys Glu Lys Phe Ser Ala Asp Leu Asp Gln Phe Pro Leu
450 455 460
Gly Arg Lys Phe Leu Leu Gln Ala Gly Leu
465 470
<![CDATA[<210> 28]]>
<![CDATA[<211> 26]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1602 HPV 33型L1蛋白的474-499胺基酸序列]]>
<![CDATA[<400> 28]]>
Lys Ala Lys Pro Lys Leu Lys Arg Ala Ala Pro Thr Ser Thr Arg Thr
1 5 10 15
Ser Ser Ala Lys Arg Lys Lys Val Lys Lys
20 25
<![CDATA[<210> 29]]>
<![CDATA[<211> 500]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1603 嵌合的HPV 16型L1蛋白的胺基酸序列]]>
<![CDATA[<400> 29]]>
Met Ser Leu Trp Leu Pro Ser Glu Ala Thr Val Tyr Leu Pro Pro Val
1 5 10 15
Pro Val Ser Lys Val Val Ser Thr Asp Glu Tyr Val Ala Arg Thr Asn
20 25 30
Ile Tyr Tyr His Ala Gly Thr Ser Arg Leu Leu Ala Val Gly His Pro
35 40 45
Tyr Phe Pro Ile Lys Lys Pro Asn Asn Asn Lys Ile Leu Val Pro Lys
50 55 60
Val Ser Gly Leu Gln Tyr Arg Val Phe Arg Ile His Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Phe Pro Asp Thr Ser Phe Tyr Asn Pro Asp Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Val Gly Val Glu Val Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Ile Ser Gly His Pro Leu Leu Asn Lys Leu Asp Asp
115 120 125
Thr Glu Asn Ala Ser Ala Tyr Ala Ala Asn Ala Gly Val Asp Asn Arg
130 135 140
Glu Cys Ile Ser Met Asp Tyr Lys Gln Thr Gln Leu Cys Leu Ile Gly
145 150 155 160
Cys Lys Pro Pro Ile Gly Glu His Trp Gly Lys Gly Ser Pro Cys Thr
165 170 175
Asn Val Ala Val Asn Pro Gly Asp Cys Pro Pro Leu Glu Leu Ile Asn
180 185 190
Thr Val Ile Gln Asp Gly Asp Met Val Asp Thr Gly Phe Gly Ala Met
195 200 205
Asp Phe Thr Thr Leu Gln Ala Asn Lys Ser Glu Val Pro Leu Asp Ile
210 215 220
Cys Thr Ser Ile Cys Lys Tyr Pro Asp Tyr Ile Lys Met Val Ser Glu
225 230 235 240
Pro Tyr Gly Asp Ser Leu Phe Phe Tyr Leu Arg Arg Glu Gln Met Phe
245 250 255
Val Arg His Leu Phe Asn Arg Ala Gly Ala Val Gly Glu Asn Val Pro
260 265 270
Asp Asp Leu Tyr Ile Lys Gly Ser Gly Ser Thr Ala Asn Leu Ala Ser
275 280 285
Ser Asn Tyr Phe Pro Thr Pro Ser Gly Ser Met Val Thr Ser Asp Ala
290 295 300
Gln Ile Phe Asn Lys Pro Tyr Trp Leu Gln Arg Ala Gln Gly His Asn
305 310 315 320
Asn Gly Ile Cys Trp Gly Asn Gln Leu Phe Val Thr Val Val Asp Thr
325 330 335
Thr Arg Ser Thr Asn Met Ser Leu Cys Ala Ala Ile Ser Thr Ser Glu
340 345 350
Thr Thr Tyr Lys Asn Thr Asn Phe Lys Glu Tyr Leu Arg His Gly Glu
355 360 365
Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Lys Ile Thr Leu Thr
370 375 380
Ala Asp Val Met Thr Tyr Ile His Ser Met Asn Ser Thr Ile Leu Glu
385 390 395 400
Asp Trp Asn Phe Gly Leu Gln Pro Pro Pro Gly Gly Thr Leu Glu Asp
405 410 415
Thr Tyr Arg Phe Val Thr Ser Gln Ala Ile Ala Cys Gln Lys His Thr
420 425 430
Pro Pro Ala Pro Lys Glu Asp Pro Leu Lys Lys Tyr Thr Phe Trp Glu
435 440 445
Val Asn Leu Lys Glu Lys Phe Ser Ala Asp Leu Asp Gln Phe Pro Leu
450 455 460
Gly Arg Lys Phe Leu Leu Gln Ala Gly Leu Lys Ala Lys Pro Lys Leu
465 470 475 480
Lys Arg Ala Ala Pro Thr Ser Thr Arg Thr Ser Ser Ala Lys Arg Lys
485 490 495
Lys Val Lys Lys
500
<![CDATA[<210> 30]]>
<![CDATA[<211>]]> 1504
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1604 嵌合的HPV 16型L1蛋白的核苷酸序列]]>
<![CDATA[<400> 30]]>
atgagtctgt ggctgccatc tgaggctaca gtctacctgc ctcctgtgcc tgtgagcaag 60
gtggtgagca cagatgaata tgtggcaagg accaacatct actaccatgc tggcaccagc 120
agactgctgg ctgtgggaca cccatacttt ccaatcaaga agccaaacaa caacaagatt 180
ctggtgccaa aggtgtctgg actccaatac agggtgttca ggattcacct gcctgaccca 240
aacaagtttg gctttcctga cacctccttc tacaaccctg acacccagag actggtgtgg 300
gcttgtgtgg gagtggaggt gggcagggga caaccactgg gagtgggcat ctctggacac 360
ccactgctga acaaactgga tgacacagag aatgcctctg cctatgctgc caatgctgga 420
gtggacaaca gggagtgtat cagtatggac tacaagcaga cccaactttg tctgattggc 480
tgtaagcctc caattggaga acactggggc aagggcagcc catgtaccaa tgtggctgtg 540
aaccctggag actgtcctcc attggaactg ataaacacag tgattcagga tggagatatg 600
gtggacacag gctttggagc tatggacttc accaccctcc aagccaacaa gtctgaggtg 660
ccactggaca tctgtaccag catctgtaaa taccctgact acatcaagat ggtgtctgaa 720
ccatatggag actccctgtt cttctacctg aggagggaac agatgtttgt gagacacctg 780
ttcaacaggg ctggagcagt gggagagaat gtgcctgatg acctctacat caagggctct 840
ggcagcacag ccaacctggc atccagcaac tactttccaa caccatctgg cagtatggtg 900
acctctgatg cccagatttt caacaagcca tactggctcc aaagggctca aggacacaac 960
aatggcatct gttggggcaa ccaacttttt gtgacagtgg tggacaccac caggagcacc 1020
aatatgagtc tgtgtgctgc catcagcacc tctgagacca cctacaagaa caccaacttc 1080
aaggaatacc tgagacatgg agaggaatat gacctccaat tcatcttcca actttgtaag 1140
attaccctga cagcagatgt gatgacctac atccacagta tgaacagcac catcttggag 1200
gactggaact ttggactcca acctcctcct ggaggcacct tggaggacac ctacaggttt 1260
gtgaccagcc aggctattgc ctgtcagaaa cacacacctc ctgccccaaa ggaggaccca 1320
ctgaaaaaat acaccttctg ggaggtgaac ctgaaagaga agttctctgc tgacctggac 1380
cagtttccac tgggcaggaa gttcctgctc caagcaggac tgaaagccaa gccaaaactg 1440
aaaagggctg ccccaaccag caccaggacc tcctctgcca agaggaagaa ggtgaagaag 1500
taaa 1504
<![CDATA[<210> 31]]>
<![CDATA[<211> 1537]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1605 合成的HPV16L1基因]]>
<![CDATA[<400> 31]]>
ctgggtacca tgagtctgtg gctgccatct gaggctacag tctacctgcc tcctgtgcct 60
gtgagcaagg tggtgagcac agatgaatat gtggcaagga ccaacatcta ctaccatgct 120
ggcaccagca gactgctggc tgtgggacac ccatactttc caatcaagaa gccaaacaac 180
aacaagattc tggtgccaaa ggtgtctgga ctccaataca gggtgttcag gattcacctg 240
cctgacccaa acaagtttgg ctttcctgac acctccttct acaaccctga cacccagaga 300
ctggtgtggg cttgtgtggg agtggaggtg ggcaggggac aaccactggg agtgggcatc 360
tctggacacc cactgctgaa caaactggat gacacagaga atgcctctgc ctatgctgcc 420
aatgctggag tggacaacag ggagtgtatc agtatggact acaagcagac ccaactttgt 480
ctgattggct gtaagcctcc aattggagaa cactggggca agggcagccc atgtaccaat 540
gtggctgtga accctggaga ctgtcctcca ttggaactga taaacacagt gattcaggat 600
ggagatatgg tggacacagg ctttggagct atggacttca ccaccctcca agccaacaag 660
tctgaggtgc cactggacat ctgtaccagc atctgtaaat accctgacta catcaagatg 720
gtgtctgaac catatggaga ctccctgttc ttctacctga ggagggaaca gatgtttgtg 780
agacacctgt tcaacagggc tggagcagtg ggagagaatg tgcctgatga cctctacatc 840
aagggctctg gcagcacagc caacctggca tccagcaact actttccaac accatctggc 900
agtatggtga cctctgatgc ccagattttc aacaagccat actggctcca aagggctcaa 960
ggacacaaca atggcatctg ttggggcaac caactttttg tgacagtggt ggacaccacc 1020
aggagcacca atatgagtct gtgtgctgcc atcagcacct ctgagaccac ctacaagaac 1080
accaacttca aggaatacct gagacatgga gaggaatatg acctccaatt catcttccaa 1140
ctttgtaaga ttaccctgac agcagatgtg atgacctaca tccacagtat gaacagcacc 1200
atcttggagg actggaactt tggactccaa cctcctcctg gaggcacctt ggaggacacc 1260
tacaggtttg tgaccagcca ggctattgcc tgtcagaaac acacacctcc tgccccaaag 1320
gaggacccac tgaaaaaata caccttctgg gaggtgaacc tgaaagagaa gttctctgct 1380
gacctggacc agtttccact gggcaggaag ttcctgctcc aagcaggact gaaagccaag 1440
ccaaagttca ccctgggcaa gaggaaggct acaccaacca cctccagcac cagcaccaca 1500
gccaagagga agaagaggaa actgtaaact cgagctc 1537
<![CDATA[<210> 32]]>
<![CDATA[<211> 1519]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1606 合成的HPV33L1基因]]>
<![CDATA[<400> 32]]>
ctgggtacca tgagtgtgtg gagaccatct gaggctacag tctacctgcc tcctgtgcct 60
gtgagcaagg tggtgagcac agatgaatat gtgagcagga ccagcatcta ctactatgct 120
ggctccagca gactgctggc tgtgggacac ccatacttca gcatcaagaa cccaaccaat 180
gccaagaaac tgctggtgcc aaaggtgtct ggactccaat acagggtgtt cagggtgaga 240
ctgcctgacc caaacaagtt tggctttcct gacacctcct tctacaaccc tgacacccag 300
agactggtgt gggcttgtgt gggattggag attggcaggg gacaaccact gggagtgggc 360
atctctggac acccactgct gaacaagttt gatgacacag agacaggcaa caaataccct 420
ggacaacctg gagcagacaa cagggagtgt ctgagtatgg actacaagca gacccaactt 480
tgtctgctgg gctgtaagcc tccaacagga gaacactggg gcaagggagt ggcttgtacc 540
aatgctgccc ctgccaatga ctgtcctcca ttggaactga taaacaccat cattgaggat 600
ggagatatgg tggacacagg ctttggctgt atggacttca agaccctcca agccaacaag 660
tctgatgtgc caattgacat ctgtggcagc acttgtaaat accctgacta cctgaaaatg 720
acctctgaac catatggaga ctccctgttc ttcttcctga ggagggaaca gatgtttgtg 780
agacacttct tcaacagggc tggcaccctg ggagaggctg tgcctgatga cctctacatc 840
aagggctctg gcaccacagc cagcatccag tcctctgcct tctttccaac accatctggc 900
agtatggtga cctctgagag ccaacttttc aacaagccat actggctcca aagggctcaa 960
ggacacaaca atggcatctg ttggggcaac caggtgtttg tgacagtggt ggacaccacc 1020
aggagcacca atatgaccct gtgtacccag gtgacctctg acagcaccta caagaatgag 1080
aacttcaagg aatacatcag gcatgtggag gaatatgacc tccaatttgt gttccaactt 1140
tgtaaggtga ccctgacagc agaggtgatg acctacatcc atgctatgaa ccctgacatc 1200
ttggaggact ggcagtttgg actgacacct cctccatctg cctccctcca agacacctac 1260
aggtttgtga ccagccaggc tatcacttgt cagaagacag tgcctccaaa ggagaaggag 1320
gacccactgg gcaaatacac cttctgggag gtggacctga aagagaagtt ctctgctgac 1380
ctggaccagt ttccactggg caggaagttc ctgctccaag caggactgaa agccaagcca 1440
aaactgaaaa gggctgcccc aaccagcacc aggacctcct ctgccaagag gaagaaggtg 1500
aagaagtaaa ctcgagctc 1519
<![CDATA[<210> 33]]>
<![CDATA[<211> 34]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1607 HPV16L1 F1]]>
<![CDATA[<400> 33]]>
cttggtacca tgagtctgtg gctgccatct gagg 34
<![CDATA[<210> 34]]>
<![CDATA[<211> 36]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1608 HPV16L1 R1]]>
<![CDATA[<400> 34]]>
gcttggcttt cagtcctgct tggagcagga acttcc 36
<![CDATA[<210> 35]]>
<![CDATA[<211> 1441]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1609 HPV16L1擴增序列1]]>
<![CDATA[<400> 35]]>
cttggtacca tgagtctgtg gctgccatct gaggctacag tctacctgcc tcctgtgcct 60
gtgagcaagg tggtgagcac agatgaatat gtggcaagga ccaacatcta ctaccatgct 120
ggcaccagca gactgctggc tgtgggacac ccatactttc caatcaagaa gccaaacaac 180
aacaagattc tggtgccaaa ggtgtctgga ctccaataca gggtgttcag gattcacctg 240
cctgacccaa acaagtttgg ctttcctgac acctccttct acaaccctga cacccagaga 300
ctggtgtggg cttgtgtggg agtggaggtg ggcaggggac aaccactggg agtgggcatc 360
tctggacacc cactgctgaa caaactggat gacacagaga atgcctctgc ctatgctgcc 420
aatgctggag tggacaacag ggagtgtatc agtatggact acaagcagac ccaactttgt 480
ctgattggct gtaagcctcc aattggagaa cactggggca agggcagccc atgtaccaat 540
gtggctgtga accctggaga ctgtcctcca ttggaactga taaacacagt gattcaggat 600
ggagatatgg tggacacagg ctttggagct atggacttca ccaccctcca agccaacaag 660
tctgaggtgc cactggacat ctgtaccagc atctgtaaat accctgacta catcaagatg 720
gtgtctgaac catatggaga ctccctgttc ttctacctga ggagggaaca gatgtttgtg 780
agacacctgt tcaacagggc tggagcagtg ggagagaatg tgcctgatga cctctacatc 840
aagggctctg gcagcacagc caacctggca tccagcaact actttccaac accatctggc 900
agtatggtga cctctgatgc ccagattttc aacaagccat actggctcca aagggctcaa 960
ggacacaaca atggcatctg ttggggcaac caactttttg tgacagtggt ggacaccacc 1020
aggagcacca atatgagtct gtgtgctgcc atcagcacct ctgagaccac ctacaagaac 1080
accaacttca aggaatacct gagacatgga gaggaatatg acctccaatt catcttccaa 1140
ctttgtaaga ttaccctgac agcagatgtg atgacctaca tccacagtat gaacagcacc 1200
atcttggagg actggaactt tggactccaa cctcctcctg gaggcacctt ggaggacacc 1260
tacaggtttg tgaccagcca ggctattgcc tgtcagaaac acacacctcc tgccccaaag 1320
gaggacccac tgaaaaaata caccttctgg gaggtgaacc tgaaagagaa gttctctgct 1380
gacctggacc agtttccact gggcaggaag ttcctgctcc aagcaggact gaaagccaag 1440
c 1441
<![CDATA[<210> 36]]>
<![CDATA[<211> 35]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1610 HPV16L1 F2]]>
<![CDATA[<400> 36]]>
agcaggactg aaagccaagc caaaactgaa aaggg 35
<![CDATA[<210> 37]]>
<![CDATA[<211> 36]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1611 HPV16L1 R2]]>
<![CDATA[<400> 37]]>
ctgtctagat ttacttcttc accttcttcc tcttgg 36
<![CDATA[<210> 38]]>
<![CDATA[<211> 101]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1612 HPV16L1擴增序列2]]>
<![CDATA[<400> 38]]>
agcaggactg aaagccaagc caaaactgaa aagggctgcc ccaaccagca ccaggacctc 60
ctctgccaag aggaagaagg tgaagaagta aatctagaca g 101
<![CDATA[<210> 39]]>
<![CDATA[<211> 38]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1613 HPV 59型L1蛋白的471-508胺基酸序列]]>
<![CDATA[<400> 39]]>
Leu Gln Leu Gly Ala Arg Pro Lys Pro Thr Ile Gly Pro Arg Lys Arg
1 5 10 15
Ala Ala Pro Ala Pro Thr Ser Thr Pro Ser Pro Lys Arg Val Lys Arg
20 25 30
Arg Lys Ser Ser Arg Lys
35
<![CDATA[<210> 40]]>
<![CDATA[<211> 470]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1801 HPV 18型L1蛋白的1-470胺基酸序列]]>
<![CDATA[<400> 40]]>
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Asn Val Ser Glu Asp Val Arg
130 135 140
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
145 150 155 160
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
165 170 175
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
180 185 190
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
195 200 205
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
210 215 220
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
225 230 235 240
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
245 250 255
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
260 265 270
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
275 280 285
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
290 295 300
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
305 310 315 320
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
325 330 335
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
340 345 350
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
355 360 365
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
370 375 380
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
385 390 395 400
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
405 410 415
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Asp
420 425 430
Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu Lys Phe Trp
435 440 445
Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp Gln Tyr Pro
450 455 460
Leu Gly Arg Lys Phe Leu
465 470
<![CDATA[<210> 41]]>
<![CDATA[<211> 26]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1802 HPV 33型L1蛋白的474-499胺基酸序列]]>
<![CDATA[<400> 41]]>
Lys Ala Lys Pro Lys Leu Lys Arg Ala Ala Pro Thr Ser Thr Arg Thr
1 5 10 15
Ser Ser Ala Lys Arg Lys Lys Val Lys Lys
20 25
<![CDATA[<210> 42]]>
<![CDATA[<211> 496]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> ]]>人工序列
<![CDATA[<220> ]]>
<![CDATA[<223> 1803 嵌合的HPV 18型L1蛋白的胺基酸序列]]>
<![CDATA[<400> 42]]>
Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Ser Ser His Ala Ala Thr Ser Asn Val Ser Glu Asp Val Arg
130 135 140
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly
145 150 155 160
Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys
165 170 175
Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn
180 185 190
Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
195 200 205
Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile
210 215 220
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
225 230 235 240
Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe
245 250 255
Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro
260 265 270
Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser
275 280 285
Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Val Thr Ser Asp Ser
290 295 300
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn
305 310 315 320
Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr
325 330 335
Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val
340 345 350
Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val
355 360 365
Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu
370 375 380
Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu
385 390 395 400
Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val
405 410 415
Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Asp
420 425 430
Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu Lys Phe Trp
435 440 445
Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp Gln Tyr Pro
450 455 460
Leu Gly Arg Lys Phe Leu Lys Ala Lys Pro Lys Leu Lys Arg Ala Ala
465 470 475 480
Pro Thr Ser Thr Arg Thr Ser Ser Ala Lys Arg Lys Lys Val Lys Lys
485 490 495
<![CDATA[<210> 43]]>
<![CDATA[<211> 1492]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1804 嵌合的HPV 18型L1蛋白的核苷酸序列]]>
<![CDATA[<400> 43]]>
atggccctct ggagaccatc cgataacaca gtgtacttgc ccccacccag cgtcgcccgg 60
gtggtgaaca cagacgacta cgtcaccaga acctcaatct tctaccacgc cgggtccagc 120
cggctgctga ccgtgggcaa cccctacttc cgcgtgcccg ccggcggcgg aaacaaacaa 180
gacatcccca aagtcagcgc ctatcagtac cgggtgttcc gcgtccaact gcccgatccc 240
aacaagttcg gcctgcccga cacctccatc tacaaccccg agacccagag gctggtctgg 300
gcttgcgccg gcgtcgagat cgggaggggc caacccctgg gcgtggggtt gtccggccac 360
cccttctaca acaagctgga cgataccgag tccagccacg cagcaaccag caacgtctcc 420
gaagatgtgc gcgataacgt cagcgtggac tacaaacaaa cccaactgtg catcctggga 480
tgcgcacccg ccatcggcga gcattgggcc aaggggaccg cctgcaagag caggcccctg 540
agccaagggg actgtccacc cctggagttg aagaataccg tgctcgagga cggcgacatg 600
gtggacaccg gctacggcgc tatggatttc tccaccctcc aggacaccaa gtgcgaagtg 660
cccctcgaca tctgccaaag catctgcaag taccccgact acctccagat gagcgccgac 720
ccctacggcg acagcatgtt cttctgtctc agaagggaac aattgttcgc ccgccacttc 780
tggaaccggg ccggcacaat gggagataca gtcccccaga gcctgtacat caaggggacc 840
ggaatgaggg ccagccccgg gtcctgcgtc tacagcccaa gcccctccgg gagcatcgtc 900
acaagcgata gccaactctt caacaagccc tactggctcc acaaagccca aggccacaat 960
aacggggtgt gttggcacaa ccagctgttc gtgaccgtcg tggacacaac caggtccaca 1020
aacctgacca tctgcgccag cacccaaagc cccgtgcccg gccagtacga cgccacaaag 1080
ttcaaacaat actctcggca cgtggaagag tacgacctcc aattcatctt ccaactctgc 1140
accatcaccc tcaccgccga cgtgatgagc tacatccact ccatgaactc ctccatcctg 1200
gaagactgga atttcggcgt gccaccaccc cctaccacct ccctcgtcga cacctacaga 1260
ttcgtgcaga gcgtggccat cacatgccag aaagacgccg cccccgccga gaacaaagac 1320
ccatacgaca aactgaaatt ctggaacgtc gacctgaaag agaaattcag cctggatctg 1380
gaccagtacc cattgggcag gaagttcctc aaagccaagc caaaactgaa aagggctgcc 1440
ccaaccagca ccaggacctc ctctgccaag aggaagaagg tgaagaagta aa 1492
<![CDATA[<210> 44]]>
<![CDATA[<211> 1543]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1805 合成的HPV18L1基因]]>
<![CDATA[<400> 44]]>
ctgggtacca tggccctctg gagaccatcc gataacacag tgtacttgcc cccacccagc 60
gtcgcccggg tggtgaacac agacgactac gtcaccagaa cctcaatctt ctaccacgcc 120
gggtccagcc ggctgctgac cgtgggcaac ccctacttcc gcgtgcccgc cggcggcgga 180
aacaaacaag acatccccaa agtcagcgcc tatcagtacc gggtgttccg cgtccaactg 240
cccgatccca acaagttcgg cctgcccgac acctccatct acaaccccga gacccagagg 300
ctggtctggg cttgcgccgg cgtcgagatc gggaggggcc aacccctggg cgtggggttg 360
tccggccacc ccttctacaa caagctggac gataccgagt ccagccacgc agcaaccagc 420
aacgtctccg aagatgtgcg cgataacgtc agcgtggact acaaacaaac ccaactgtgc 480
atcctgggat gcgcacccgc catcggcgag cattgggcca aggggaccgc ctgcaagagc 540
aggcccctga gccaagggga ctgtccaccc ctggagttga agaataccgt gctcgaggac 600
ggcgacatgg tggacaccgg ctacggcgct atggatttct ccaccctcca ggacaccaag 660
tgcgaagtgc ccctcgacat ctgccaaagc atctgcaagt accccgacta cctccagatg 720
agcgccgacc cctacggcga cagcatgttc ttctgtctca gaagggaaca attgttcgcc 780
cgccacttct ggaaccgggc cggcacaatg ggagatacag tcccccagag cctgtacatc 840
aaggggaccg gaatgagggc cagccccggg tcctgcgtct acagcccaag cccctccggg 900
agcatcgtca caagcgatag ccaactcttc aacaagccct actggctcca caaagcccaa 960
ggccacaata acggggtgtg ttggcacaac cagctgttcg tgaccgtcgt ggacacaacc 1020
aggtccacaa acctgaccat ctgcgccagc acccaaagcc ccgtgcccgg ccagtacgac 1080
gccacaaagt tcaaacaata ctctcggcac gtggaagagt acgacctcca attcatcttc 1140
caactctgca ccatcaccct caccgccgac gtgatgagct acatccactc catgaactcc 1200
tccatcctgg aagactggaa tttcggcgtg ccaccacccc ctaccacctc cctcgtcgac 1260
acctacagat tcgtgcagag cgtggccatc acatgccaga aagacgccgc ccccgccgag 1320
aacaaagacc catacgacaa actgaaattc tggaacgtcg acctgaaaga gaaattcagc 1380
ctggatctgg accagtaccc attgggcagg aagttcctcg tgcaagccgg cctcaggaga 1440
aaaccaacaa tcgggcccag gaagaggagc gcccccagcg caaccaccag cagcaagccc 1500
gcaaaaaggg tcagagtgag ggcacgcaaa taaactcgag ctc 1543
<![CDATA[<210> 45]]>
<![CDATA[<211> 1519]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1806 合成的HPV33L1基因]]>
<![CDATA[<400> 45]]>
ctgggtacca tgagtgtgtg gagaccatct gaggctacag tctacctgcc tcctgtgcct 60
gtgagcaagg tggtgagcac agatgaatat gtgagcagga ccagcatcta ctactatgct 120
ggctccagca gactgctggc tgtgggacac ccatacttca gcatcaagaa cccaaccaat 180
gccaagaaac tgctggtgcc aaaggtgtct ggactccaat acagggtgtt cagggtgaga 240
ctgcctgacc caaacaagtt tggctttcct gacacctcct tctacaaccc tgacacccag 300
agactggtgt gggcttgtgt gggattggag attggcaggg gacaaccact gggagtgggc 360
atctctggac acccactgct gaacaagttt gatgacacag agacaggcaa caaataccct 420
ggacaacctg gagcagacaa cagggagtgt ctgagtatgg actacaagca gacccaactt 480
tgtctgctgg gctgtaagcc tccaacagga gaacactggg gcaagggagt ggcttgtacc 540
aatgctgccc ctgccaatga ctgtcctcca ttggaactga taaacaccat cattgaggat 600
ggagatatgg tggacacagg ctttggctgt atggacttca agaccctcca agccaacaag 660
tctgatgtgc caattgacat ctgtggcagc acttgtaaat accctgacta cctgaaaatg 720
acctctgaac catatggaga ctccctgttc ttcttcctga ggagggaaca gatgtttgtg 780
agacacttct tcaacagggc tggcaccctg ggagaggctg tgcctgatga cctctacatc 840
aagggctctg gcaccacagc cagcatccag tcctctgcct tctttccaac accatctggc 900
agtatggtga cctctgagag ccaacttttc aacaagccat actggctcca aagggctcaa 960
ggacacaaca atggcatctg ttggggcaac caggtgtttg tgacagtggt ggacaccacc 1020
aggagcacca atatgaccct gtgtacccag gtgacctctg acagcaccta caagaatgag 1080
aacttcaagg aatacatcag gcatgtggag gaatatgacc tccaatttgt gttccaactt 1140
tgtaaggtga ccctgacagc agaggtgatg acctacatcc atgctatgaa ccctgacatc 1200
ttggaggact ggcagtttgg actgacacct cctccatctg cctccctcca agacacctac 1260
aggtttgtga ccagccaggc tatcacttgt cagaagacag tgcctccaaa ggagaaggag 1320
gacccactgg gcaaatacac cttctgggag gtggacctga aagagaagtt ctctgctgac 1380
ctggaccagt ttccactggg caggaagttc ctgctccaag caggactgaa agccaagcca 1440
aaactgaaaa gggctgcccc aaccagcacc aggacctcct ctgccaagag gaagaaggtg 1500
aagaagtaaa ctcgagctc 1519
<![CDATA[<210> 46]]>
<![CDATA[<211> 34]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1807 HPV18L1 F1]]>
<![CDATA[<400> 46]]>
cttggtacca tggccctctg gagaccatcc gata 34
<![CDATA[<210> 47]]>
<![CDATA[<211> 35]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1808 HPV18L1 R1]]>
<![CDATA[<400> 47]]>
gcttggcttt gaggaacttc ctgcccaatg ggtac 35
<![CDATA[<210> 48]]>
<![CDATA[<211> 1429]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1809 HPV18L1擴增序列1]]>
<![CDATA[<400> 48]]>
cttggtacca tggccctctg gagaccatcc gataacacag tgtacttgcc cccacccagc 60
gtcgcccggg tggtgaacac agacgactac gtcaccagaa cctcaatctt ctaccacgcc 120
gggtccagcc ggctgctgac cgtgggcaac ccctacttcc gcgtgcccgc cggcggcgga 180
aacaaacaag acatccccaa agtcagcgcc tatcagtacc gggtgttccg cgtccaactg 240
cccgatccca acaagttcgg cctgcccgac acctccatct acaaccccga gacccagagg 300
ctggtctggg cttgcgccgg cgtcgagatc gggaggggcc aacccctggg cgtggggttg 360
tccggccacc ccttctacaa caagctggac gataccgagt ccagccacgc agcaaccagc 420
aacgtctccg aagatgtgcg cgataacgtc agcgtggact acaaacaaac ccaactgtgc 480
atcctgggat gcgcacccgc catcggcgag cattgggcca aggggaccgc ctgcaagagc 540
aggcccctga gccaagggga ctgtccaccc ctggagttga agaataccgt gctcgaggac 600
ggcgacatgg tggacaccgg ctacggcgct atggatttct ccaccctcca ggacaccaag 660
tgcgaagtgc ccctcgacat ctgccaaagc atctgcaagt accccgacta cctccagatg 720
agcgccgacc cctacggcga cagcatgttc ttctgtctca gaagggaaca attgttcgcc 780
cgccacttct ggaaccgggc cggcacaatg ggagatacag tcccccagag cctgtacatc 840
aaggggaccg gaatgagggc cagccccggg tcctgcgtct acagcccaag cccctccggg 900
agcatcgtca caagcgatag ccaactcttc aacaagccct actggctcca caaagcccaa 960
ggccacaata acggggtgtg ttggcacaac cagctgttcg tgaccgtcgt ggacacaacc 1020
aggtccacaa acctgaccat ctgcgccagc acccaaagcc ccgtgcccgg ccagtacgac 1080
gccacaaagt tcaaacaata ctctcggcac gtggaagagt acgacctcca attcatcttc 1140
caactctgca ccatcaccct caccgccgac gtgatgagct acatccactc catgaactcc 1200
tccatcctgg aagactggaa tttcggcgtg ccaccacccc ctaccacctc cctcgtcgac 1260
acctacagat tcgtgcagag cgtggccatc acatgccaga aagacgccgc ccccgccgag 1320
aacaaagacc catacgacaa actgaaattc tggaacgtcg acctgaaaga gaaattcagc 1380
ctggatctgg accagtaccc attgggcagg aagttcctca aagccaagc 1429
<![CDATA[<210> 49]]>
<![CDATA[<211> 35]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1810 HPV18L1 F2]]>
<![CDATA[<400> 49]]>
gaagttcctc aaagccaagc caaaactgaa aaggg 35
<![CDATA[<210> 50]]>
<![CDATA[<211> 36]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1811 HPV18L1 R2]]>
<![CDATA[<400> 50]]>
ctgtctagat ttacttcttc accttcttcc tcttgg 36
<![CDATA[<210> 51]]>
<![CDATA[<211> 101]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1812 HPV18L1擴增序列2]]>
<![CDATA[<400> 51]]>
gaagttcctc aaagccaagc caaaactgaa aagggctgcc ccaaccagca ccaggacctc 60
ctctgccaag aggaagaagg tgaagaagta aatctagaca g 101
<![CDATA[<210> 52]]>
<![CDATA[<211> 38]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 1813 HPV 59型L1蛋白的471-508胺基酸序列]]>
<![CDATA[<400> 52]]>
Leu Gln Leu Gly Ala Arg Pro Lys Pro Thr Ile Gly Pro Arg Lys Arg
1 5 10 15
Ala Ala Pro Ala Pro Thr Ser Thr Pro Ser Pro Lys Arg Val Lys Arg
20 25 30
Arg Lys Ser Ser Arg Lys
35
<![CDATA[<210> 53]]>
<![CDATA[<211> 475]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3101 HPV 31型L1蛋白的1-475胺基酸序列]]>
<![CDATA[<400> 53]]>
Met Ser Leu Trp Arg Pro Ser Glu Ala Thr Val Tyr Leu Pro Pro Val
1 5 10 15
Pro Val Ser Lys Val Val Ser Thr Asp Glu Tyr Val Thr Arg Thr Asn
20 25 30
Ile Tyr Tyr His Ala Gly Ser Ala Arg Leu Leu Thr Val Gly His Pro
35 40 45
Tyr Tyr Ser Ile Pro Lys Ser Asp Asn Pro Lys Lys Ile Val Val Pro
50 55 60
Lys Val Ser Gly Leu Gln Tyr Arg Val Phe Arg Val Arg Leu Pro Asp
65 70 75 80
Pro Asn Lys Phe Gly Phe Pro Asp Thr Ser Phe Tyr Asn Pro Glu Thr
85 90 95
Gln Arg Leu Val Trp Ala Cys Val Gly Leu Glu Val Gly Arg Gly Gln
100 105 110
Pro Leu Gly Val Gly Ile Ser Gly His Pro Leu Leu Asn Lys Phe Asp
115 120 125
Asp Thr Glu Asn Ser Asn Arg Tyr Ala Gly Gly Pro Gly Thr Asp Asn
130 135 140
Arg Glu Cys Ile Ser Met Asp Tyr Lys Gln Thr Gln Leu Cys Leu Leu
145 150 155 160
Gly Cys Lys Pro Pro Ile Gly Glu His Trp Gly Lys Gly Ser Pro Cys
165 170 175
Ser Asn Asn Ala Ile Thr Pro Gly Asp Cys Pro Pro Leu Glu Leu Lys
180 185 190
Asn Ser Val Ile Gln Asp Gly Asp Met Val Asp Thr Gly Phe Gly Ala
195 200 205
Met Asp Phe Thr Ala Leu Gln Asp Thr Lys Ser Asn Val Pro Leu Asp
210 215 220
Ile Cys Asn Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Lys Met Val Ala
225 230 235 240
Glu Pro Tyr Gly Asp Thr Leu Phe Phe Tyr Leu Arg Arg Glu Gln Met
245 250 255
Phe Val Arg His Phe Phe Asn Arg Ser Gly Thr Val Gly Glu Ser Val
260 265 270
Pro Thr Asp Leu Tyr Ile Lys Gly Ser Gly Ser Thr Ala Thr Leu Ala
275 280 285
Asn Ser Thr Tyr Phe Pro Thr Pro Ser Gly Ser Met Val Thr Ser Asp
290 295 300
Ala Gln Ile Phe Asn Lys Pro Tyr Trp Met Gln Arg Ala Gln Gly His
305 310 315 320
Asn Asn Gly Ile Cys Trp Gly Asn Gln Leu Phe Val Thr Val Val Asp
325 330 335
Thr Thr Arg Ser Thr Asn Met Ser Val Cys Ala Ala Ile Ala Asn Ser
340 345 350
Asp Thr Thr Phe Lys Ser Ser Asn Phe Lys Glu Tyr Leu Arg His Gly
355 360 365
Glu Glu Phe Asp Leu Gln Phe Ile Phe Gln Leu Cys Lys Ile Thr Leu
370 375 380
Ser Ala Asp Ile Met Thr Tyr Ile His Ser Met Asn Pro Ala Ile Leu
385 390 395 400
Glu Asp Trp Asn Phe Gly Leu Thr Thr Pro Pro Ser Gly Ser Leu Glu
405 410 415
Asp Thr Tyr Arg Phe Val Thr Ser Gln Ala Ile Thr Cys Gln Lys Ser
420 425 430
Ala Pro Gln Lys Pro Lys Glu Asp Pro Phe Lys Asp Tyr Val Phe Trp
435 440 445
Glu Val Asn Leu Lys Glu Lys Phe Ser Ala Asp Leu Asp Gln Phe Pro
450 455 460
Leu Gly Arg Lys Phe Leu Leu Gln Ala Gly Tyr
465 470 475
<![CDATA[<210> 54]]>
<![CDATA[<211> 26]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3102 HPV 33型L1蛋白的474-499胺基酸序列]]>
<![CDATA[<400> 54]]>
Lys Ala Lys Pro Lys Leu Lys Arg Ala Ala Pro Thr Ser Thr Arg Thr
1 5 10 15
Ser Ser Ala Lys Arg Lys Lys Val Lys Lys
20 25
<![CDATA[<210> 55]]>
<![CDATA[<211> 501]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3103 嵌合的HPV 31型L1蛋白的胺基酸序列]]>
<![CDATA[<400> 55]]>
Met Ser Leu Trp Arg Pro Ser Glu Ala Thr Val Tyr Leu Pro Pro Val
1 5 10 15
Pro Val Ser Lys Val Val Ser Thr Asp Glu Tyr Val Thr Arg Thr Asn
20 25 30
Ile Tyr Tyr His Ala Gly Ser Ala Arg Leu Leu Thr Val Gly His Pro
35 40 45
Tyr Tyr Ser Ile Pro Lys Ser Asp Asn Pro Lys Lys Ile Val Val Pro
50 55 60
Lys Val Ser Gly Leu Gln Tyr Arg Val Phe Arg Val Arg Leu Pro Asp
65 70 75 80
Pro Asn Lys Phe Gly Phe Pro Asp Thr Ser Phe Tyr Asn Pro Glu Thr
85 90 95
Gln Arg Leu Val Trp Ala Cys Val Gly Leu Glu Val Gly Arg Gly Gln
100 105 110
Pro Leu Gly Val Gly Ile Ser Gly His Pro Leu Leu Asn Lys Phe Asp
115 120 125
Asp Thr Glu Asn Ser Asn Arg Tyr Ala Gly Gly Pro Gly Thr Asp Asn
130 135 140
Arg Glu Cys Ile Ser Met Asp Tyr Lys Gln Thr Gln Leu Cys Leu Leu
145 150 155 160
Gly Cys Lys Pro Pro Ile Gly Glu His Trp Gly Lys Gly Ser Pro Cys
165 170 175
Ser Asn Asn Ala Ile Thr Pro Gly Asp Cys Pro Pro Leu Glu Leu Lys
180 185 190
Asn Ser Val Ile Gln Asp Gly Asp Met Val Asp Thr Gly Phe Gly Ala
195 200 205
Met Asp Phe Thr Ala Leu Gln Asp Thr Lys Ser Asn Val Pro Leu Asp
210 215 220
Ile Cys Asn Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Lys Met Val Ala
225 230 235 240
Glu Pro Tyr Gly Asp Thr Leu Phe Phe Tyr Leu Arg Arg Glu Gln Met
245 250 255
Phe Val Arg His Phe Phe Asn Arg Ser Gly Thr Val Gly Glu Ser Val
260 265 270
Pro Thr Asp Leu Tyr Ile Lys Gly Ser Gly Ser Thr Ala Thr Leu Ala
275 280 285
Asn Ser Thr Tyr Phe Pro Thr Pro Ser Gly Ser Met Val Thr Ser Asp
290 295 300
Ala Gln Ile Phe Asn Lys Pro Tyr Trp Met Gln Arg Ala Gln Gly His
305 310 315 320
Asn Asn Gly Ile Cys Trp Gly Asn Gln Leu Phe Val Thr Val Val Asp
325 330 335
Thr Thr Arg Ser Thr Asn Met Ser Val Cys Ala Ala Ile Ala Asn Ser
340 345 350
Asp Thr Thr Phe Lys Ser Ser Asn Phe Lys Glu Tyr Leu Arg His Gly
355 360 365
Glu Glu Phe Asp Leu Gln Phe Ile Phe Gln Leu Cys Lys Ile Thr Leu
370 375 380
Ser Ala Asp Ile Met Thr Tyr Ile His Ser Met Asn Pro Ala Ile Leu
385 390 395 400
Glu Asp Trp Asn Phe Gly Leu Thr Thr Pro Pro Ser Gly Ser Leu Glu
405 410 415
Asp Thr Tyr Arg Phe Val Thr Ser Gln Ala Ile Thr Cys Gln Lys Ser
420 425 430
Ala Pro Gln Lys Pro Lys Glu Asp Pro Phe Lys Asp Tyr Val Phe Trp
435 440 445
Glu Val Asn Leu Lys Glu Lys Phe Ser Ala Asp Leu Asp Gln Phe Pro
450 455 460
Leu Gly Arg Lys Phe Leu Leu Gln Ala Gly Tyr Lys Ala Lys Pro Lys
465 470 475 480
Leu Lys Arg Ala Ala Pro Thr Ser Thr Arg Thr Ser Ser Ala Lys Arg
485 490 495
Lys Lys Val Lys Lys
500
<![CDATA[<210> 56]]>
<![CDATA[<211> 1507]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3104 嵌合的HPV 31型L1蛋白的核苷酸序列]]>
<![CDATA[<400> 56]]>
atgagcctgt ggaggcccag cgaggccacc gtgtacctgc cccccgtgcc cgtgagcaag 60
gtggtgagca ccgacgagta cgtgaccagg accaacatct actaccacgc cggcagcgcc 120
aggctgctga ccgtgggcca cccctactac agcatcccca agagcgacaa ccccaagaag 180
atcgtggtgc ccaaggtgag cggcctgcag tacagggtgt tcagggtgag gctgcccgac 240
cccaacaagt tcggcttccc cgacaccagc ttctacaacc ccgagaccca gaggctggtg 300
tgggcctgcg tgggcctgga ggtgggcagg ggccagcccc tgggcgtggg catcagcggc 360
caccccctgc tgaacaagtt cgacgacacc gagaacagca acaggtacgc cggcggcccc 420
ggcaccgaca acagggagtg catcagcatg gactacaagc agacccagct gtgcctgctg 480
ggctgcaagc cccccatcgg cgagcactgg ggcaagggca gcccctgcag caacaacgcc 540
atcacccccg gcgactgccc ccccctggag ctgaagaaca gcgtgatcca ggacggcgac 600
atggtggaca ccggcttcgg cgccatggac ttcaccgccc tgcaggacac caagagcaac 660
gtgcccctgg acatctgcaa cagcatctgc aagtaccccg actacctgaa gatggtggcc 720
gagccctacg gcgacaccct gttcttctac ctgaggaggg agcagatgtt cgtgaggcac 780
ttcttcaaca ggagcggcac cgtgggcgag agcgtgccca ccgacctgta catcaagggc 840
agcggcagca ccgccaccct ggccaacagc acctacttcc ccacccccag cggcagcatg 900
gtgaccagcg acgcccagat cttcaacaag ccctactgga tgcagagggc ccagggccac 960
aacaacggca tctgctgggg caaccagctg ttcgtgaccg tggtggacac caccaggagc 1020
accaacatga gcgtgtgcgc cgccatcgcc aacagcgaca ccaccttcaa gagcagcaac 1080
ttcaaggagt acctgaggca cggcgaggag ttcgacctgc agttcatctt ccagctgtgc 1140
aagatcaccc tgagcgccga catcatgacc tacatccaca gcatgaaccc cgccatcctg 1200
gaggactgga acttcggcct gaccaccccc cccagcggca gcctggagga cacctacagg 1260
ttcgtgacca gccaggccat cacctgccag aagtccgccc cccagaagcc caaggaggac 1320
cccttcaagg actacgtgtt ctgggaggtg aacctgaagg agaagttcag cgccgacctg 1380
gaccagttcc ccctgggcag gaagttcctg ctgcaggccg gctacaaagc caagccaaaa 1440
ctgaaaaggg ctgccccaac cagcaccagg acctcctctg ccaagaggaa gaaggtgaag 1500
aagtaaa 1507
<![CDATA[<210> 57]]>
<![CDATA[<211> 1534]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3105 合成的HPV31L1基因]]>
<![CDATA[<400> 57]]>
ctgggtacca tgagcctgtg gaggcccagc gaggccaccg tgtacctgcc ccccgtgccc 60
gtgagcaagg tggtgagcac cgacgagtac gtgaccagga ccaacatcta ctaccacgcc 120
ggcagcgcca ggctgctgac cgtgggccac ccctactaca gcatccccaa gagcgacaac 180
cccaagaaga tcgtggtgcc caaggtgagc ggcctgcagt acagggtgtt cagggtgagg 240
ctgcccgacc ccaacaagtt cggcttcccc gacaccagct tctacaaccc cgagacccag 300
aggctggtgt gggcctgcgt gggcctggag gtgggcaggg gccagcccct gggcgtgggc 360
atcagcggcc accccctgct gaacaagttc gacgacaccg agaacagcaa caggtacgcc 420
ggcggccccg gcaccgacaa cagggagtgc atcagcatgg actacaagca gacccagctg 480
tgcctgctgg gctgcaagcc ccccatcggc gagcactggg gcaagggcag cccctgcagc 540
aacaacgcca tcacccccgg cgactgcccc cccctggagc tgaagaacag cgtgatccag 600
gacggcgaca tggtggacac cggcttcggc gccatggact tcaccgccct gcaggacacc 660
aagagcaacg tgcccctgga catctgcaac agcatctgca agtaccccga ctacctgaag 720
atggtggccg agccctacgg cgacaccctg ttcttctacc tgaggaggga gcagatgttc 780
gtgaggcact tcttcaacag gagcggcacc gtgggcgaga gcgtgcccac cgacctgtac 840
atcaagggca gcggcagcac cgccaccctg gccaacagca cctacttccc cacccccagc 900
ggcagcatgg tgaccagcga cgcccagatc ttcaacaagc cctactggat gcagagggcc 960
cagggccaca acaacggcat ctgctggggc aaccagctgt tcgtgaccgt ggtggacacc 1020
accaggagca ccaacatgag cgtgtgcgcc gccatcgcca acagcgacac caccttcaag 1080
agcagcaact tcaaggagta cctgaggcac ggcgaggagt tcgacctgca gttcatcttc 1140
cagctgtgca agatcaccct gagcgccgac atcatgacct acatccacag catgaacccc 1200
gccatcctgg aggactggaa cttcggcctg accacccccc ccagcggcag cctggaggac 1260
acctacaggt tcgtgaccag ccaggccatc acctgccaga agtccgcccc ccagaagccc 1320
aaggaggacc ccttcaagga ctacgtgttc tgggaggtga acctgaagga gaagttcagc 1380
gccgacctgg accagttccc cctgggcagg aagttcctgc tgcaggccgg ctacagggcc 1440
aggcccaagt tcaaggccgg caagaggagc gcccccagcg ccagcaccac cacccccgcc 1500
aagaggaaga agaccaagaa gtaaactcga gctc 1534
<![CDATA[<210> 58]]>
<![CDATA[<211> 1519]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3106 合成的HPV33L1基因]]>
<![CDATA[<400> 58]]>
ctgggtacca tgagtgtgtg gagaccatct gaggctacag tctacctgcc tcctgtgcct 60
gtgagcaagg tggtgagcac agatgaatat gtgagcagga ccagcatcta ctactatgct 120
ggctccagca gactgctggc tgtgggacac ccatacttca gcatcaagaa cccaaccaat 180
gccaagaaac tgctggtgcc aaaggtgtct ggactccaat acagggtgtt cagggtgaga 240
ctgcctgacc caaacaagtt tggctttcct gacacctcct tctacaaccc tgacacccag 300
agactggtgt gggcttgtgt gggattggag attggcaggg gacaaccact gggagtgggc 360
atctctggac acccactgct gaacaagttt gatgacacag agacaggcaa caaataccct 420
ggacaacctg gagcagacaa cagggagtgt ctgagtatgg actacaagca gacccaactt 480
tgtctgctgg gctgtaagcc tccaacagga gaacactggg gcaagggagt ggcttgtacc 540
aatgctgccc ctgccaatga ctgtcctcca ttggaactga taaacaccat cattgaggat 600
ggagatatgg tggacacagg ctttggctgt atggacttca agaccctcca agccaacaag 660
tctgatgtgc caattgacat ctgtggcagc acttgtaaat accctgacta cctgaaaatg 720
acctctgaac catatggaga ctccctgttc ttcttcctga ggagggaaca gatgtttgtg 780
agacacttct tcaacagggc tggcaccctg ggagaggctg tgcctgatga cctctacatc 840
aagggctctg gcaccacagc cagcatccag tcctctgcct tctttccaac accatctggc 900
agtatggtga cctctgagag ccaacttttc aacaagccat actggctcca aagggctcaa 960
ggacacaaca atggcatctg ttggggcaac caggtgtttg tgacagtggt ggacaccacc 1020
aggagcacca atatgaccct gtgtacccag gtgacctctg acagcaccta caagaatgag 1080
aacttcaagg aatacatcag gcatgtggag gaatatgacc tccaatttgt gttccaactt 1140
tgtaaggtga ccctgacagc agaggtgatg acctacatcc atgctatgaa ccctgacatc 1200
ttggaggact ggcagtttgg actgacacct cctccatctg cctccctcca agacacctac 1260
aggtttgtga ccagccaggc tatcacttgt cagaagacag tgcctccaaa ggagaaggag 1320
gacccactgg gcaaatacac cttctgggag gtggacctga aagagaagtt ctctgctgac 1380
ctggaccagt ttccactggg caggaagttc ctgctccaag caggactgaa agccaagcca 1440
aaactgaaaa gggctgcccc aaccagcacc aggacctcct ctgccaagag gaagaaggtg 1500
aagaagtaaa ctcgagctc 1519
<![CDATA[<210> 59]]>
<![CDATA[<211> 33]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3107 HPV31L1 F1]]>
<![CDATA[<400> 59]]>
cttggtacca tgagcctgtg gaggcccagc gag 33
<![CDATA[<210> 60]]>
<![CDATA[<211> 38]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3108 HPV31L1 R1]]>
<![CDATA[<400> 60]]>
gcttggcttt gtagccggcc tgcagcagga acttcctg 38
<![CDATA[<210> 61]]>
<![CDATA[<211> 1444]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3109 HPV31L1擴增序列1]]>
<![CDATA[<400> 61]]>
cttggtacca tgagcctgtg gaggcccagc gaggccaccg tgtacctgcc ccccgtgccc 60
gtgagcaagg tggtgagcac cgacgagtac gtgaccagga ccaacatcta ctaccacgcc 120
ggcagcgcca ggctgctgac cgtgggccac ccctactaca gcatccccaa gagcgacaac 180
cccaagaaga tcgtggtgcc caaggtgagc ggcctgcagt acagggtgtt cagggtgagg 240
ctgcccgacc ccaacaagtt cggcttcccc gacaccagct tctacaaccc cgagacccag 300
aggctggtgt gggcctgcgt gggcctggag gtgggcaggg gccagcccct gggcgtgggc 360
atcagcggcc accccctgct gaacaagttc gacgacaccg agaacagcaa caggtacgcc 420
ggcggccccg gcaccgacaa cagggagtgc atcagcatgg actacaagca gacccagctg 480
tgcctgctgg gctgcaagcc ccccatcggc gagcactggg gcaagggcag cccctgcagc 540
aacaacgcca tcacccccgg cgactgcccc cccctggagc tgaagaacag cgtgatccag 600
gacggcgaca tggtggacac cggcttcggc gccatggact tcaccgccct gcaggacacc 660
aagagcaacg tgcccctgga catctgcaac agcatctgca agtaccccga ctacctgaag 720
atggtggccg agccctacgg cgacaccctg ttcttctacc tgaggaggga gcagatgttc 780
gtgaggcact tcttcaacag gagcggcacc gtgggcgaga gcgtgcccac cgacctgtac 840
atcaagggca gcggcagcac cgccaccctg gccaacagca cctacttccc cacccccagc 900
ggcagcatgg tgaccagcga cgcccagatc ttcaacaagc cctactggat gcagagggcc 960
cagggccaca acaacggcat ctgctggggc aaccagctgt tcgtgaccgt ggtggacacc 1020
accaggagca ccaacatgag cgtgtgcgcc gccatcgcca acagcgacac caccttcaag 1080
agcagcaact tcaaggagta cctgaggcac ggcgaggagt tcgacctgca gttcatcttc 1140
cagctgtgca agatcaccct gagcgccgac atcatgacct acatccacag catgaacccc 1200
gccatcctgg aggactggaa cttcggcctg accacccccc ccagcggcag cctggaggac 1260
acctacaggt tcgtgaccag ccaggccatc acctgccaga agtccgcccc ccagaagccc 1320
aaggaggacc ccttcaagga ctacgtgttc tgggaggtga acctgaagga gaagttcagc 1380
gccgacctgg accagttccc cctgggcagg aagttcctgc tgcaggccgg ctacaaagcc 1440
aagc 1444
<![CDATA[<210> 62]]>
<![CDATA[<211> 35]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3110 HPV31L1 F2]]>
<![CDATA[<400> 62]]>
ggccggctac aaagccaagc caaaactgaa aaggg 35
<![CDATA[<210> 63]]>
<![CDATA[<211> 41]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3111 HPV31L1 R2]]>
<![CDATA[<400> 63]]>
ctgtctagat ttacttcttc accttcttcc tcttggcaga g 41
<![CDATA[<210> 64]]>
<![CDATA[<211> 101]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3112 HPV31L1擴增序列2]]>
<![CDATA[<400> 64]]>
ggccggctac aaagccaagc caaaactgaa aagggctgcc ccaaccagca ccaggacctc 60
ctctgccaag aggaagaagg tgaagaagta aatctagaca g 101
<![CDATA[<210> 65]]>
<![CDATA[<211> 38]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3113 HPV 59型L1蛋白的471-508胺基酸序列]]>
<![CDATA[<400> 65]]>
Leu Gln Leu Gly Ala Arg Pro Lys Pro Thr Ile Gly Pro Arg Lys Arg
1 5 10 15
Ala Ala Pro Ala Pro Thr Ser Thr Pro Ser Pro Lys Arg Val Lys Arg
20 25 30
Arg Lys Ser Ser Arg Lys
35
<![CDATA[<210> 66]]>
<![CDATA[<211> 499]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3301 HPV 33型L1蛋白的胺基酸序列]]>
<![CDATA[<400> 66]]>
Met Ser Val Trp Arg Pro Ser Glu Ala Thr Val Tyr Leu Pro Pro Val
1 5 10 15
Pro Val Ser Lys Val Val Ser Thr Asp Glu Tyr Val Ser Arg Thr Ser
20 25 30
Ile Tyr Tyr Tyr Ala Gly Ser Ser Arg Leu Leu Ala Val Gly His Pro
35 40 45
Tyr Phe Ser Ile Lys Asn Pro Thr Asn Ala Lys Lys Leu Leu Val Pro
50 55 60
Lys Val Ser Gly Leu Gln Tyr Arg Val Phe Arg Val Arg Leu Pro Asp
65 70 75 80
Pro Asn Lys Phe Gly Phe Pro Asp Thr Ser Phe Tyr Asn Pro Asp Thr
85 90 95
Gln Arg Leu Val Trp Ala Cys Val Gly Leu Glu Ile Gly Arg Gly Gln
100 105 110
Pro Leu Gly Val Gly Ile Ser Gly His Pro Leu Leu Asn Lys Phe Asp
115 120 125
Asp Thr Glu Thr Gly Asn Lys Tyr Pro Gly Gln Pro Gly Ala Asp Asn
130 135 140
Arg Glu Cys Leu Ser Met Asp Tyr Lys Gln Thr Gln Leu Cys Leu Leu
145 150 155 160
Gly Cys Lys Pro Pro Thr Gly Glu His Trp Gly Lys Gly Val Ala Cys
165 170 175
Thr Asn Ala Ala Pro Ala Asn Asp Cys Pro Pro Leu Glu Leu Ile Asn
180 185 190
Thr Ile Ile Glu Asp Gly Asp Met Val Asp Thr Gly Phe Gly Cys Met
195 200 205
Asp Phe Lys Thr Leu Gln Ala Asn Lys Ser Asp Val Pro Ile Asp Ile
210 215 220
Cys Gly Ser Thr Cys Lys Tyr Pro Asp Tyr Leu Lys Met Thr Ser Glu
225 230 235 240
Pro Tyr Gly Asp Ser Leu Phe Phe Phe Leu Arg Arg Glu Gln Met Phe
245 250 255
Val Arg His Phe Phe Asn Arg Ala Gly Thr Leu Gly Glu Ala Val Pro
260 265 270
Asp Asp Leu Tyr Ile Lys Gly Ser Gly Thr Thr Ala Ser Ile Gln Ser
275 280 285
Ser Ala Phe Phe Pro Thr Pro Ser Gly Ser Met Val Thr Ser Glu Ser
290 295 300
Gln Leu Phe Asn Lys Pro Tyr Trp Leu Gln Arg Ala Gln Gly His Asn
305 310 315 320
Asn Gly Ile Cys Trp Gly Asn Gln Val Phe Val Thr Val Val Asp Thr
325 330 335
Thr Arg Ser Thr Asn Met Thr Leu Cys Thr Gln Val Thr Ser Asp Ser
340 345 350
Thr Tyr Lys Asn Glu Asn Phe Lys Glu Tyr Ile Arg His Val Glu Glu
355 360 365
Tyr Asp Leu Gln Phe Val Phe Gln Leu Cys Lys Val Thr Leu Thr Ala
370 375 380
Glu Val Met Thr Tyr Ile His Ala Met Asn Pro Asp Ile Leu Glu Asp
385 390 395 400
Trp Gln Phe Gly Leu Thr Pro Pro Pro Ser Ala Ser Leu Gln Asp Thr
405 410 415
Tyr Arg Phe Val Thr Ser Gln Ala Ile Thr Cys Gln Lys Thr Val Pro
420 425 430
Pro Lys Glu Lys Glu Asp Pro Leu Gly Lys Tyr Thr Phe Trp Glu Val
435 440 445
Asp Leu Lys Glu Lys Phe Ser Ala Asp Leu Asp Gln Phe Pro Leu Gly
450 455 460
Arg Lys Phe Leu Leu Gln Ala Gly Leu Lys Ala Lys Pro Lys Leu Lys
465 470 475 480
Arg Ala Ala Pro Thr Ser Thr Arg Thr Ser Ser Ala Lys Arg Lys Lys
485 490 495
Val Lys Lys
<![CDATA[<210> 67]]>
<![CDATA[<211> 1501]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3302 HPV 33型L1蛋白的核苷酸序列]]>
<![CDATA[<400> 67]]>
atgagtgtgt ggagaccatc tgaggctaca gtctacctgc ctcctgtgcc tgtgagcaag 60
gtggtgagca cagatgaata tgtgagcagg accagcatct actactatgc tggctccagc 120
agactgctgg ctgtgggaca cccatacttc agcatcaaga acccaaccaa tgccaagaaa 180
ctgctggtgc caaaggtgtc tggactccaa tacagggtgt tcagggtgag actgcctgac 240
ccaaacaagt ttggctttcc tgacacctcc ttctacaacc ctgacaccca gagactggtg 300
tgggcttgtg tgggattgga gattggcagg ggacaaccac tgggagtggg catctctgga 360
cacccactgc tgaacaagtt tgatgacaca gagacaggca acaaataccc tggacaacct 420
ggagcagaca acagggagtg tctgagtatg gactacaagc agacccaact ttgtctgctg 480
ggctgtaagc ctccaacagg agaacactgg ggcaagggag tggcttgtac caatgctgcc 540
cctgccaatg actgtcctcc attggaactg ataaacacca tcattgagga tggagatatg 600
gtggacacag gctttggctg tatggacttc aagaccctcc aagccaacaa gtctgatgtg 660
ccaattgaca tctgtggcag cacttgtaaa taccctgact acctgaaaat gacctctgaa 720
ccatatggag actccctgtt cttcttcctg aggagggaac agatgtttgt gagacacttc 780
ttcaacaggg ctggcaccct gggagaggct gtgcctgatg acctctacat caagggctct 840
ggcaccacag ccagcatcca gtcctctgcc ttctttccaa caccatctgg cagtatggtg 900
acctctgaga gccaactttt caacaagcca tactggctcc aaagggctca aggacacaac 960
aatggcatct gttggggcaa ccaggtgttt gtgacagtgg tggacaccac caggagcacc 1020
aatatgaccc tgtgtaccca ggtgacctct gacagcacct acaagaatga gaacttcaag 1080
gaatacatca ggcatgtgga ggaatatgac ctccaatttg tgttccaact ttgtaaggtg 1140
accctgacag cagaggtgat gacctacatc catgctatga accctgacat cttggaggac 1200
tggcagtttg gactgacacc tcctccatct gcctccctcc aagacaccta caggtttgtg 1260
accagccagg ctatcacttg tcagaagaca gtgcctccaa aggagaagga ggacccactg 1320
ggcaaataca ccttctggga ggtggacctg aaagagaagt tctctgctga cctggaccag 1380
tttccactgg gcaggaagtt cctgctccaa gcaggactga aagccaagcc aaaactgaaa 1440
agggctgccc caaccagcac caggacctcc tctgccaaga ggaagaaggt gaagaagtaa 1500
a 1501
<![CDATA[<210> 68]]>
<![CDATA[<211> 1519]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3303 合成的HPV33L1基因]]>
<![CDATA[<400> 68]]>
ctgggtacca tgagtgtgtg gagaccatct gaggctacag tctacctgcc tcctgtgcct 60
gtgagcaagg tggtgagcac agatgaatat gtgagcagga ccagcatcta ctactatgct 120
ggctccagca gactgctggc tgtgggacac ccatacttca gcatcaagaa cccaaccaat 180
gccaagaaac tgctggtgcc aaaggtgtct ggactccaat acagggtgtt cagggtgaga 240
ctgcctgacc caaacaagtt tggctttcct gacacctcct tctacaaccc tgacacccag 300
agactggtgt gggcttgtgt gggattggag attggcaggg gacaaccact gggagtgggc 360
atctctggac acccactgct gaacaagttt gatgacacag agacaggcaa caaataccct 420
ggacaacctg gagcagacaa cagggagtgt ctgagtatgg actacaagca gacccaactt 480
tgtctgctgg gctgtaagcc tccaacagga gaacactggg gcaagggagt ggcttgtacc 540
aatgctgccc ctgccaatga ctgtcctcca ttggaactga taaacaccat cattgaggat 600
ggagatatgg tggacacagg ctttggctgt atggacttca agaccctcca agccaacaag 660
tctgatgtgc caattgacat ctgtggcagc acttgtaaat accctgacta cctgaaaatg 720
acctctgaac catatggaga ctccctgttc ttcttcctga ggagggaaca gatgtttgtg 780
agacacttct tcaacagggc tggcaccctg ggagaggctg tgcctgatga cctctacatc 840
aagggctctg gcaccacagc cagcatccag tcctctgcct tctttccaac accatctggc 900
agtatggtga cctctgagag ccaacttttc aacaagccat actggctcca aagggctcaa 960
ggacacaaca atggcatctg ttggggcaac caggtgtttg tgacagtggt ggacaccacc 1020
aggagcacca atatgaccct gtgtacccag gtgacctctg acagcaccta caagaatgag 1080
aacttcaagg aatacatcag gcatgtggag gaatatgacc tccaatttgt gttccaactt 1140
tgtaaggtga ccctgacagc agaggtgatg acctacatcc atgctatgaa ccctgacatc 1200
ttggaggact ggcagtttgg actgacacct cctccatctg cctccctcca agacacctac 1260
aggtttgtga ccagccaggc tatcacttgt cagaagacag tgcctccaaa ggagaaggag 1320
gacccactgg gcaaatacac cttctgggag gtggacctga aagagaagtt ctctgctgac 1380
ctggaccagt ttccactggg caggaagttc ctgctccaag caggactgaa agccaagcca 1440
aaactgaaaa gggctgcccc aaccagcacc aggacctcct ctgccaagag gaagaaggtg 1500
aagaagtaaa ctcgagctc 1519
<![CDATA[<210> 69]]>
<![CDATA[<211> 472]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3501 HPV 35型L1蛋白的1-472胺基酸序列]]>
<![CDATA[<400> 69]]>
Met Ser Leu Trp Arg Ser Asn Glu Ala Thr Val Tyr Leu Pro Pro Val
1 5 10 15
Ser Val Ser Lys Val Val Ser Thr Asp Glu Tyr Val Thr Arg Thr Asn
20 25 30
Ile Tyr Tyr His Ala Gly Ser Ser Arg Leu Leu Ala Val Gly His Pro
35 40 45
Tyr Tyr Ala Ile Lys Lys Gln Asp Ser Asn Lys Ile Ala Val Pro Lys
50 55 60
Val Ser Gly Leu Gln Tyr Arg Val Phe Arg Val Lys Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Phe Pro Asp Thr Ser Phe Tyr Asp Pro Ala Ser Gln
85 90 95
Arg Leu Val Trp Ala Cys Thr Gly Val Glu Val Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Ile Ser Gly His Pro Leu Leu Asn Lys Leu Asp Asp
115 120 125
Thr Glu Asn Ser Asn Lys Tyr Val Gly Asn Ser Gly Thr Asp Asn Arg
130 135 140
Glu Cys Ile Ser Met Asp Tyr Lys Gln Thr Gln Leu Cys Leu Ile Gly
145 150 155 160
Cys Arg Pro Pro Ile Gly Glu His Trp Gly Lys Gly Thr Pro Cys Asn
165 170 175
Ala Asn Gln Val Lys Ala Gly Glu Cys Pro Pro Leu Glu Leu Leu Asn
180 185 190
Thr Val Leu Gln Asp Gly Asp Met Val Asp Thr Gly Phe Gly Ala Met
195 200 205
Asp Phe Thr Thr Leu Gln Ala Asn Lys Ser Asp Val Pro Leu Asp Ile
210 215 220
Cys Ser Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Lys Met Val Ser Glu
225 230 235 240
Pro Tyr Gly Asp Met Leu Phe Phe Tyr Leu Arg Arg Glu Gln Met Phe
245 250 255
Val Arg His Leu Phe Asn Arg Ala Gly Thr Val Gly Glu Thr Val Pro
260 265 270
Ala Asp Leu Tyr Ile Lys Gly Thr Thr Gly Thr Leu Pro Ser Thr Ser
275 280 285
Tyr Phe Pro Thr Pro Ser Gly Ser Met Val Thr Ser Asp Ala Gln Ile
290 295 300
Phe Asn Lys Pro Tyr Trp Leu Gln Arg Ala Gln Gly His Asn Asn Gly
305 310 315 320
Ile Cys Trp Ser Asn Gln Leu Phe Val Thr Val Val Asp Thr Thr Arg
325 330 335
Ser Thr Asn Met Ser Val Cys Ser Ala Val Ser Ser Ser Asp Ser Thr
340 345 350
Tyr Lys Asn Asp Asn Phe Lys Glu Tyr Leu Arg His Gly Glu Glu Tyr
355 360 365
Asp Leu Gln Phe Ile Phe Gln Leu Cys Lys Ile Thr Leu Thr Ala Asp
370 375 380
Val Met Thr Tyr Ile His Ser Met Asn Pro Ser Ile Leu Glu Asp Trp
385 390 395 400
Asn Phe Gly Leu Thr Pro Pro Pro Ser Gly Thr Leu Glu Asp Thr Tyr
405 410 415
Arg Tyr Val Thr Ser Gln Ala Val Thr Cys Gln Lys Pro Ser Ala Pro
420 425 430
Lys Pro Lys Asp Asp Pro Leu Lys Asn Tyr Thr Phe Trp Glu Val Asp
435 440 445
Leu Lys Glu Lys Phe Ser Ala Asp Leu Asp Gln Phe Pro Leu Gly Arg
450 455 460
Lys Phe Leu Leu Gln Ala Gly Leu
465 470
<![CDATA[<210> 70]]>
<![CDATA[<211> 26]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> ]]>智人
<![CDATA[<220> ]]>
<![CDATA[<223> 3502 HPV 33型L1蛋白的474-499胺基酸序列]]>
<![CDATA[<400> 70]]>
Lys Ala Lys Pro Lys Leu Lys Arg Ala Ala Pro Thr Ser Thr Arg Thr
1 5 10 15
Ser Ser Ala Lys Arg Lys Lys Val Lys Lys
20 25
<![CDATA[<210> 71]]>
<![CDATA[<211> 498]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3503 嵌合的HPV 35型L1蛋白的胺基酸序列]]>
<![CDATA[<400]]>> 71]]>
<br/><![CDATA[Met Ser Leu Trp Arg Ser Asn Glu Ala Thr Val Tyr Leu Pro Pro Val
1 5 10 15
Ser Val Ser Lys Val Val Ser Thr Asp Glu Tyr Val Thr Arg Thr Asn
20 25 30
Ile Tyr Tyr His Ala Gly Ser Ser Arg Leu Leu Ala Val Gly His Pro
35 40 45
Tyr Tyr Ala Ile Lys Lys Gln Asp Ser Asn Lys Ile Ala Val Pro Lys
50 55 60
Val Ser Gly Leu Gln Tyr Arg Val Phe Arg Val Lys Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Phe Pro Asp Thr Ser Phe Tyr Asp Pro Ala Ser Gln
85 90 95
Arg Leu Val Trp Ala Cys Thr Gly Val Glu Val Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Ile Ser Gly His Pro Leu Leu Asn Lys Leu Asp Asp
115 120 125
Thr Glu Asn Ser Asn Lys Tyr Val Gly Asn Ser Gly Thr Asp Asn Arg
130 135 140
Glu Cys Ile Ser Met Asp Tyr Lys Gln Thr Gln Leu Cys Leu Ile Gly
145 150 155 160
Cys Arg Pro Pro Ile Gly Glu His Trp Gly Lys Gly Thr Pro Cys Asn
165 170 175
Ala Asn Gln Val Lys Ala Gly Glu Cys Pro Pro Leu Glu Leu Leu Asn
180 185 190
Thr Val Leu Gln Asp Gly Asp Met Val Asp Thr Gly Phe Gly Ala Met
195 200 205
Asp Phe Thr Thr Leu Gln Ala Asn Lys Ser Asp Val Pro Leu Asp Ile
210 215 220
Cys Ser Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Lys Met Val Ser Glu
225 230 235 240
Pro Tyr Gly Asp Met Leu Phe Phe Tyr Leu Arg Arg Glu Gln Met Phe
245 250 255
Val Arg His Leu Phe Asn Arg Ala Gly Thr Val Gly Glu Thr Val Pro
260 265 270
Ala Asp Leu Tyr Ile Lys Gly Thr Thr Gly Thr Leu Pro Ser Thr Ser
275 280 285
Tyr Phe Pro Thr Pro Ser Gly Ser Met Val Thr Ser Asp Ala Gln Ile
290 295 300
Phe Asn Lys Pro Tyr Trp Leu Gln Arg Ala Gln Gly His Asn Asn Gly
305 310 315 320
Ile Cys Trp Ser Asn Gln Leu Phe Val Thr Val Val Asp Thr Thr Arg
325 330 335
Ser Thr Asn Met Ser Val Cys Ser Ala Val Ser Ser Ser Asp Ser Thr
340 345 350
Tyr Lys Asn Asp Asn Phe Lys Glu Tyr Leu Arg His Gly Glu Glu Tyr
355 360 365
Asp Leu Gln Phe Ile Phe Gln Leu Cys Lys Ile Thr Leu Thr Ala Asp
370 375 380
Val Met Thr Tyr Ile His Ser Met Asn Pro Ser Ile Leu Glu Asp Trp
385 390 395 400
Asn Phe Gly Leu Thr Pro Pro Pro Ser Gly Thr Leu Glu Asp Thr Tyr
405 410 415
Arg Tyr Val Thr Ser Gln Ala Val Thr Cys Gln Lys Pro Ser Ala Pro
420 425 430
Lys Pro Lys Asp Asp Pro Leu Lys Asn Tyr Thr Phe Trp Glu Val Asp
435 440 445
Leu Lys Glu Lys Phe Ser Ala Asp Leu Asp Gln Phe Pro Leu Gly Arg
450 455 460
Lys Phe Leu Leu Gln Ala Gly Leu Lys Ala Lys Pro Lys Leu Lys Arg
465 470 475 480
Ala Ala Pro Thr Ser Thr Arg Thr Ser Ser Ala Lys Arg Lys Lys Val
485 490 495
Lys Lys
<![CDATA[<210> 72]]>
<![CDATA[<211> 1498]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3504 嵌合的HPV 35型L1蛋白的核苷酸序列]]>
<![CDATA[<400> 72]]>
atgagtctgt ggaggagcaa tgaggctaca gtctacctgc ctcctgtgtc tgtgagcaag 60
gtggtgagca cagatgaata tgtgaccagg accaacatct actaccatgc tggctccagc 120
agactgctgg ctgtgggaca cccatactat gccatcaaga agcaggacag caacaagatt 180
gctgtgccaa aggtgtctgg actccaatac agggtgttca gggtgaaact gcctgaccca 240
aacaagtttg gctttcctga cacctccttc tatgaccctg ccagccagag actggtgtgg 300
gcttgtactg gagtggaggt gggcagggga caaccactgg gagtgggcat ctctggacac 360
ccactgctga acaaactgga tgacacagag aacagcaaca aatatgtggg caactctggc 420
acagacaaca gggagtgtat cagtatggac tacaagcaga cccaactttg tctgattggc 480
tgtagacctc caattggaga acactggggc aagggcacac catgtaatgc caaccaggtg 540
aaggctggag agtgtcctcc attggaactg ctgaacacag tgctccaaga tggagatatg 600
gtggacacag gctttggagc tatggacttc accaccctcc aagccaacaa gtctgatgtg 660
ccactggaca tctgttccag catctgtaaa taccctgact acctgaaaat ggtgtctgaa 720
ccatatggag atatgctgtt cttctacctg aggagggaac agatgtttgt gagacacctg 780
ttcaacaggg ctggcacagt gggagagaca gtgcctgctg acctctacat caagggcacc 840
acaggcaccc tgccaagcac ctcctacttt ccaacaccat ctggcagtat ggtgacctct 900
gatgcccaga ttttcaacaa gccatactgg ctccaaaggg ctcaaggaca caacaatggc 960
atctgttgga gcaaccaact ttttgtgaca gtggtggaca ccaccaggag caccaatatg 1020
agtgtgtgtt ctgctgtgtc ctcctctgac agcacctaca agaatgacaa cttcaaggaa 1080
tacctgagac atggagagga atatgacctc caattcatct tccaactttg taagattacc 1140
ctgacagcag atgtgatgac ctacatccac agtatgaacc caagcatctt ggaggactgg 1200
aactttggac tgacacctcc tccatctggc accttggagg acacctacag atatgtgacc 1260
agccaggctg tgacttgtca gaagccatct gccccaaagc caaaggatga cccactgaaa 1320
aactacacct tctgggaggt ggacctgaaa gagaagttct ctgctgacct ggaccagttt 1380
ccactgggca ggaagttcct gctccaagca ggactgaaag ccaagccaaa actgaaaagg 1440
gctgccccaa ccagcaccag gacctcctct gccaagagga agaaggtgaa gaagtaaa 1498
<![CDATA[<210> 73]]>
<![CDATA[<211> 1528]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3505 合成的HPV35L1基因]]>
<![CDATA[<400> 73]]>
ctgggtacca tgagtctgtg gaggagcaat gaggctacag tctacctgcc tcctgtgtct 60
gtgagcaagg tggtgagcac agatgaatat gtgaccagga ccaacatcta ctaccatgct 120
ggctccagca gactgctggc tgtgggacac ccatactatg ccatcaagaa gcaggacagc 180
aacaagattg ctgtgccaaa ggtgtctgga ctccaataca gggtgttcag ggtgaaactg 240
cctgacccaa acaagtttgg ctttcctgac acctccttct atgaccctgc cagccagaga 300
ctggtgtggg cttgtactgg agtggaggtg ggcaggggac aaccactggg agtgggcatc 360
tctggacacc cactgctgaa caaactggat gacacagaga acagcaacaa atatgtgggc 420
aactctggca cagacaacag ggagtgtatc agtatggact acaagcagac ccaactttgt 480
ctgattggct gtagacctcc aattggagaa cactggggca agggcacacc atgtaatgcc 540
aaccaggtga aggctggaga gtgtcctcca ttggaactgc tgaacacagt gctccaagat 600
ggagatatgg tggacacagg ctttggagct atggacttca ccaccctcca agccaacaag 660
tctgatgtgc cactggacat ctgttccagc atctgtaaat accctgacta cctgaaaatg 720
gtgtctgaac catatggaga tatgctgttc ttctacctga ggagggaaca gatgtttgtg 780
agacacctgt tcaacagggc tggcacagtg ggagagacag tgcctgctga cctctacatc 840
aagggcacca caggcaccct gccaagcacc tcctactttc caacaccatc tggcagtatg 900
gtgacctctg atgcccagat tttcaacaag ccatactggc tccaaagggc tcaaggacac 960
aacaatggca tctgttggag caaccaactt tttgtgacag tggtggacac caccaggagc 1020
accaatatga gtgtgtgttc tgctgtgtcc tcctctgaca gcacctacaa gaatgacaac 1080
ttcaaggaat acctgagaca tggagaggaa tatgacctcc aattcatctt ccaactttgt 1140
aagattaccc tgacagcaga tgtgatgacc tacatccaca gtatgaaccc aagcatcttg 1200
gaggactgga actttggact gacacctcct ccatctggca ccttggagga cacctacaga 1260
tatgtgacca gccaggctgt gacttgtcag aagccatctg ccccaaagcc aaaggatgac 1320
ccactgaaaa actacacctt ctgggaggtg gacctgaaag agaagttctc tgctgacctg 1380
gaccagtttc cactgggcag gaagttcctg ctccaagcag gactgaaagc cagaccaaac 1440
ttcagactgg gcaagagggc tgcccctgcc agcaccagca agaagtccag caccaagagg 1500
aggaaggtga agagctaaac tcgagctc 1528
<![CDATA[<210> 74]]>
<![CDATA[<211> 1519]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3506 合成的HPV33L1基因]]>
<![CDATA[<400> 74]]>
ctgggtacca tgagtgtgtg gagaccatct gaggctacag tctacctgcc tcctgtgcct 60
gtgagcaagg tggtgagcac agatgaatat gtgagcagga ccagcatcta ctactatgct 120
ggctccagca gactgctggc tgtgggacac ccatacttca gcatcaagaa cccaaccaat 180
gccaagaaac tgctggtgcc aaaggtgtct ggactccaat acagggtgtt cagggtgaga 240
ctgcctgacc caaacaagtt tggctttcct gacacctcct tctacaaccc tgacacccag 300
agactggtgt gggcttgtgt gggattggag attggcaggg gacaaccact gggagtgggc 360
atctctggac acccactgct gaacaagttt gatgacacag agacaggcaa caaataccct 420
ggacaacctg gagcagacaa cagggagtgt ctgagtatgg actacaagca gacccaactt 480
tgtctgctgg gctgtaagcc tccaacagga gaacactggg gcaagggagt ggcttgtacc 540
aatgctgccc ctgccaatga ctgtcctcca ttggaactga taaacaccat cattgaggat 600
ggagatatgg tggacacagg ctttggctgt atggacttca agaccctcca agccaacaag 660
tctgatgtgc caattgacat ctgtggcagc acttgtaaat accctgacta cctgaaaatg 720
acctctgaac catatggaga ctccctgttc ttcttcctga ggagggaaca gatgtttgtg 780
agacacttct tcaacagggc tggcaccctg ggagaggctg tgcctgatga cctctacatc 840
aagggctctg gcaccacagc cagcatccag tcctctgcct tctttccaac accatctggc 900
agtatggtga cctctgagag ccaacttttc aacaagccat actggctcca aagggctcaa 960
ggacacaaca atggcatctg ttggggcaac caggtgtttg tgacagtggt ggacaccacc 1020
aggagcacca atatgaccct gtgtacccag gtgacctctg acagcaccta caagaatgag 1080
aacttcaagg aatacatcag gcatgtggag gaatatgacc tccaatttgt gttccaactt 1140
tgtaaggtga ccctgacagc agaggtgatg acctacatcc atgctatgaa ccctgacatc 1200
ttggaggact ggcagtttgg actgacacct cctccatctg cctccctcca agacacctac 1260
aggtttgtga ccagccaggc tatcacttgt cagaagacag tgcctccaaa ggagaaggag 1320
gacccactgg gcaaatacac cttctgggag gtggacctga aagagaagtt ctctgctgac 1380
ctggaccagt ttccactggg caggaagttc ctgctccaag caggactgaa agccaagcca 1440
aaactgaaaa gggctgcccc aaccagcacc aggacctcct ctgccaagag gaagaaggtg 1500
aagaagtaaa ctcgagctc 1519
<![CDATA[<210> 75]]>
<![CDATA[<211> 34]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3507 HPV35L1 F1]]>
<![CDATA[<400> 75]]>
cttggtacca tgagtctgtg gaggagcaat gagg 34
<![CDATA[<210> 76]]>
<![CDATA[<211> 36]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3508 HPV35L1 R1]]>
<![CDATA[<400> 76]]>
gcttggcttt cagtcctgct tggagcagga acttcc 36
<![CDATA[<210> 77]]>
<![CDATA[<211> 1435]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3509 HPV35L1擴增序列1]]>
<![CDATA[<400> 77]]>
cttggtacca tgagtctgtg gaggagcaat gaggctacag tctacctgcc tcctgtgtct 60
gtgagcaagg tggtgagcac agatgaatat gtgaccagga ccaacatcta ctaccatgct 120
ggctccagca gactgctggc tgtgggacac ccatactatg ccatcaagaa gcaggacagc 180
aacaagattg ctgtgccaaa ggtgtctgga ctccaataca gggtgttcag ggtgaaactg 240
cctgacccaa acaagtttgg ctttcctgac acctccttct atgaccctgc cagccagaga 300
ctggtgtggg cttgtactgg agtggaggtg ggcaggggac aaccactggg agtgggcatc 360
tctggacacc cactgctgaa caaactggat gacacagaga acagcaacaa atatgtgggc 420
aactctggca cagacaacag ggagtgtatc agtatggact acaagcagac ccaactttgt 480
ctgattggct gtagacctcc aattggagaa cactggggca agggcacacc atgtaatgcc 540
aaccaggtga aggctggaga gtgtcctcca ttggaactgc tgaacacagt gctccaagat 600
ggagatatgg tggacacagg ctttggagct atggacttca ccaccctcca agccaacaag 660
tctgatgtgc cactggacat ctgttccagc atctgtaaat accctgacta cctgaaaatg 720
gtgtctgaac catatggaga tatgctgttc ttctacctga ggagggaaca gatgtttgtg 780
agacacctgt tcaacagggc tggcacagtg ggagagacag tgcctgctga cctctacatc 840
aagggcacca caggcaccct gccaagcacc tcctactttc caacaccatc tggcagtatg 900
gtgacctctg atgcccagat tttcaacaag ccatactggc tccaaagggc tcaaggacac 960
aacaatggca tctgttggag caaccaactt tttgtgacag tggtggacac caccaggagc 1020
accaatatga gtgtgtgttc tgctgtgtcc tcctctgaca gcacctacaa gaatgacaac 1080
ttcaaggaat acctgagaca tggagaggaa tatgacctcc aattcatctt ccaactttgt 1140
aagattaccc tgacagcaga tgtgatgacc tacatccaca gtatgaaccc aagcatcttg 1200
gaggactgga actttggact gacacctcct ccatctggca ccttggagga cacctacaga 1260
tatgtgacca gccaggctgt gacttgtcag aagccatctg ccccaaagcc aaaggatgac 1320
ccactgaaaa actacacctt ctgggaggtg gacctgaaag agaagttctc tgctgacctg 1380
gaccagtttc cactgggcag gaagttcctg ctccaagcag gactgaaagc caagc 1435
<![CDATA[<210> 78]]>
<![CDATA[<211> 35]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<22]]>3> 3510 HPV35L1 F2]]>
<br/>
<br/><![CDATA[<400> 78]]>
<br/><![CDATA[agcaggactg aaagccaagc caaaactgaa aaggg 35
<![CDATA[<210> 79]]>
<![CDATA[<211> 36]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3511 HPV35L1 R2]]>
<![CDATA[<400> 79]]>
ctgtctagat ttacttcttc accttcttcc tcttgg 36
<![CDATA[<210> 80]]>
<![CDATA[<211> 101]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3512 HPV35L1擴增序列2]]>
<![CDATA[<400> 80]]>
agcaggactg aaagccaagc caaaactgaa aagggctgcc ccaaccagca ccaggacctc 60
ctctgccaag aggaagaagg tgaagaagta aatctagaca g 101
<![CDATA[<210> 81]]>
<![CDATA[<211> 38]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3513 HPV 59型L1蛋白的471-508胺基酸序列]]>
<![CDATA[<400> 81]]>
Leu Gln Leu Gly Ala Arg Pro Lys Pro Thr Ile Gly Pro Arg Lys Arg
1 5 10 15
Ala Ala Pro Ala Pro Thr Ser Thr Pro Ser Pro Lys Arg Val Lys Arg
20 25 30
Arg Lys Ser Ser Arg Lys
35
<![CDATA[<210> 82]]>
<![CDATA[<211> 469]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3901 HPV 39型L1蛋白的1-469胺基酸序列]]>
<![CDATA[<400> 82]]>
Met Ala Met Trp Arg Ser Ser Asp Ser Met Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Lys Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Gly
20 25 30
Ile Tyr Tyr Tyr Ala Gly Ser Ser Arg Leu Leu Thr Val Gly His Pro
35 40 45
Tyr Phe Lys Val Gly Met Asn Gly Gly Arg Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Thr Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Ser Ile Pro Asp Ala Ser Leu Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Val Gly Val Glu Val Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Ile Ser Gly His Pro Leu Tyr Asn Arg Gln Asp Asp
115 120 125
Thr Glu Asn Ser Pro Phe Ser Ser Thr Thr Asn Lys Asp Ser Arg Asp
130 135 140
Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Ile Gly Cys
145 150 155 160
Val Pro Ala Ile Gly Glu His Trp Gly Lys Gly Lys Ala Cys Lys Pro
165 170 175
Asn Asn Val Ser Thr Gly Asp Cys Pro Pro Leu Glu Leu Val Asn Thr
180 185 190
Pro Ile Glu Asp Gly Asp Met Ile Asp Thr Gly Tyr Gly Ala Met Asp
195 200 205
Phe Gly Ala Leu Gln Glu Thr Lys Ser Glu Val Pro Leu Asp Ile Cys
210 215 220
Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp Val
225 230 235 240
Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe Ala
245 250 255
Arg His Phe Trp Asn Arg Gly Gly Met Val Gly Asp Ala Ile Pro Ala
260 265 270
Gln Leu Tyr Ile Lys Gly Thr Asp Ile Arg Ala Asn Pro Gly Ser Ser
275 280 285
Val Tyr Cys Pro Ser Pro Ser Gly Ser Met Val Thr Ser Asp Ser Gln
290 295 300
Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn Asn
305 310 315 320
Gly Ile Cys Trp His Asn Gln Leu Phe Leu Thr Val Val Asp Thr Thr
325 330 335
Arg Ser Thr Asn Phe Thr Leu Ser Thr Ser Ile Glu Ser Ser Ile Pro
340 345 350
Ser Thr Tyr Asp Pro Ser Lys Phe Lys Glu Tyr Thr Arg His Val Glu
355 360 365
Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Val Thr Leu Thr
370 375 380
Thr Asp Val Met Ser Tyr Ile His Thr Met Asn Ser Ser Ile Leu Asp
385 390 395 400
Asn Trp Asn Phe Ala Val Ala Pro Pro Pro Ser Ala Ser Leu Val Asp
405 410 415
Thr Tyr Arg Tyr Leu Gln Ser Ala Ala Ile Thr Cys Gln Lys Asp Ala
420 425 430
Pro Ala Pro Glu Lys Lys Asp Pro Tyr Asp Gly Leu Lys Phe Trp Asn
435 440 445
Val Asp Leu Arg Glu Lys Phe Ser Leu Glu Leu Asp Gln Phe Pro Leu
450 455 460
Gly Arg Lys Phe Leu
465
<![CDATA[<210> 83]]>
<![CDATA[<211> 38]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3902 HPV 59型L1蛋白的471-508胺基酸序列]]>
<![CDATA[<400> 83]]>
Leu Gln Leu Gly Ala Arg Pro Lys Pro Thr Ile Gly Pro Arg Lys Arg
1 5 10 15
Ala Ala Pro Ala Pro Thr Ser Thr Pro Ser Pro Lys Arg Val Lys Arg
20 25 30
Arg Lys Ser Ser Arg Lys
35
<![CDATA[<210> 84]]>
<![CDATA[<211> 507]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3903 嵌合的HPV 39型L1蛋白的胺基酸序列]]>
<![CDATA[<400> 84]]>
Met Ala Met Trp Arg Ser Ser Asp Ser Met Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Lys Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Gly
20 25 30
Ile Tyr Tyr Tyr Ala Gly Ser Ser Arg Leu Leu Thr Val Gly His Pro
35 40 45
Tyr Phe Lys Val Gly Met Asn Gly Gly Arg Lys Gln Asp Ile Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Thr Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Ser Ile Pro Asp Ala Ser Leu Tyr Asn Pro Glu Thr Gln
85 90 95
Arg Leu Val Trp Ala Cys Val Gly Val Glu Val Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Ile Ser Gly His Pro Leu Tyr Asn Arg Gln Asp Asp
115 120 125
Thr Glu Asn Ser Pro Phe Ser Ser Thr Thr Asn Lys Asp Ser Arg Asp
130 135 140
Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Ile Gly Cys
145 150 155 160
Val Pro Ala Ile Gly Glu His Trp Gly Lys Gly Lys Ala Cys Lys Pro
165 170 175
Asn Asn Val Ser Thr Gly Asp Cys Pro Pro Leu Glu Leu Val Asn Thr
180 185 190
Pro Ile Glu Asp Gly Asp Met Ile Asp Thr Gly Tyr Gly Ala Met Asp
195 200 205
Phe Gly Ala Leu Gln Glu Thr Lys Ser Glu Val Pro Leu Asp Ile Cys
210 215 220
Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp Val
225 230 235 240
Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe Ala
245 250 255
Arg His Phe Trp Asn Arg Gly Gly Met Val Gly Asp Ala Ile Pro Ala
260 265 270
Gln Leu Tyr Ile Lys Gly Thr Asp Ile Arg Ala Asn Pro Gly Ser Ser
275 280 285
Val Tyr Cys Pro Ser Pro Ser Gly Ser Met Val Thr Ser Asp Ser Gln
290 295 300
Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn Asn
305 310 315 320
Gly Ile Cys Trp His Asn Gln Leu Phe Leu Thr Val Val Asp Thr Thr
325 330 335
Arg Ser Thr Asn Phe Thr Leu Ser Thr Ser Ile Glu Ser Ser Ile Pro
340 345 350
Ser Thr Tyr Asp Pro Ser Lys Phe Lys Glu Tyr Thr Arg His Val Glu
355 360 365
Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Val Thr Leu Thr
370 375 380
Thr Asp Val Met Ser Tyr Ile His Thr Met Asn Ser Ser Ile Leu Asp
385 390 395 400
Asn Trp Asn Phe Ala Val Ala Pro Pro Pro Ser Ala Ser Leu Val Asp
405 410 415
Thr Tyr Arg Tyr Leu Gln Ser Ala Ala Ile Thr Cys Gln Lys Asp Ala
420 425 430
Pro Ala Pro Glu Lys Lys Asp Pro Tyr Asp Gly Leu Lys Phe Trp Asn
435 440 445
Val Asp Leu Arg Glu Lys Phe Ser Leu Glu Leu Asp Gln Phe Pro Leu
450 455 460
Gly Arg Lys Phe Leu Leu Gln Leu Gly Ala Arg Pro Lys Pro Thr Ile
465 470 475 480
Gly Pro Arg Lys Arg Ala Ala Pro Ala Pro Thr Ser Thr Pro Ser Pro
485 490 495
Lys Arg Val Lys Arg Arg Lys Ser Ser Arg Lys
500 505
<![CDATA[<210> 85]]>
<![CDATA[<211> 1525]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3904 嵌合的HPV 39型L1蛋白的核苷酸序列]]>
<![CDATA[<400> 85]]>
atggctatgt ggaggtcctc tgacagtatg gtctacctgc ctcctccatc tgtggctaag 60
gtggtgaaca cagatgacta tgtgaccagg acaggcatct actactatgc tggctccagc 120
agactgctga cagtgggaca cccatacttc aaggtgggga tgaatggagg caggaagcag 180
gacatcccaa aggtgtctgc ctaccaatac agggtgttca gggtgaccct gcctgaccca 240
aacaagttca gcatccctga tgcctccctc tacaaccctg agacccagag actggtgtgg 300
gcttgtgtgg gagtggaggt gggcagggga caaccactgg gagtgggcat ctctggacac 360
ccactctaca acagacagga tgacacagag aacagcccat tctccagcac caccaacaag 420
gacagcaggg acaatgtgtc tgtggactac aagcagaccc aactttgtat cattggctgt 480
gtgcctgcca ttggagaaca ctggggcaag ggcaaggctt gtaagccaaa caatgtgagc 540
acaggagact gtcctccatt ggaactggtg aacacaccaa ttgaggatgg agatatgatt 600
gacacaggct atggagctat ggactttgga gccctccaag agaccaagtc tgaggtgcca 660
ctggacatct gtcagagcat ctgtaaatac cctgactacc tccaaatgag tgctgatgtc 720
tatggagaca gtatgttctt ctgtctgagg agggaacaac tttttgccag acacttctgg 780
aacaggggag ggatggtggg agatgccatc cctgcccaac tctacatcaa gggcacagac 840
atcagggcta accctggctc ctctgtctac tgtccaagcc catctggcag tatggtgacc 900
tctgacagcc aacttttcaa caagccatac tggctgcaca aggctcaagg acacaacaat 960
ggcatctgtt ggcacaacca acttttcctg acagtggtgg acaccaccag gagcaccaac 1020
ttcaccctga gcaccagcat tgagtccagc atcccaagca cctatgaccc aagcaagttc 1080
aaggaataca ccaggcatgt ggaggaatat gacctccaat tcatcttcca actttgtact 1140
gtgaccctga ccacagatgt gatgagttac atccacacaa tgaactccag catcctggac 1200
aactggaact ttgctgtggc tcctcctcca tctgcctccc tggtggacac ctacagatac 1260
ctccaatctg ctgccatcac ttgtcagaag gatgcccctg cccctgagaa gaaggaccca 1320
tatgatggac tgaagttctg gaatgtggac ctgagggaga agttctcctt ggaactggac 1380
cagtttccac tgggcaggaa gttcctgctc caacttggag ccagaccaaa gccaaccatt 1440
ggaccaagga agagggctgc ccctgcccca accagcacac caagcccaaa gagggtgaag 1500
aggaggaagt ccagcaggaa gtaaa 1525
<![CDATA[<210> 86]]>
<![CDATA[<211> 1537]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3905 合成的HPV39L1基因]]>
<![CDATA[<400> 86]]>
ctgggtacca tggctatgtg gaggtcctct gacagtatgg tctacctgcc tcctccatct 60
gtggctaagg tggtgaacac agatgactat gtgaccagga caggcatcta ctactatgct 120
ggctccagca gactgctgac agtgggacac ccatacttca aggtggggat gaatggaggc 180
aggaagcagg acatcccaaa ggtgtctgcc taccaataca gggtgttcag ggtgaccctg 240
cctgacccaa acaagttcag catccctgat gcctccctct acaaccctga gacccagaga 300
ctggtgtggg cttgtgtggg agtggaggtg ggcaggggac aaccactggg agtgggcatc 360
tctggacacc cactctacaa cagacaggat gacacagaga acagcccatt ctccagcacc 420
accaacaagg acagcaggga caatgtgtct gtggactaca agcagaccca actttgtatc 480
attggctgtg tgcctgccat tggagaacac tggggcaagg gcaaggcttg taagccaaac 540
aatgtgagca caggagactg tcctccattg gaactggtga acacaccaat tgaggatgga 600
gatatgattg acacaggcta tggagctatg gactttggag ccctccaaga gaccaagtct 660
gaggtgccac tggacatctg tcagagcatc tgtaaatacc ctgactacct ccaaatgagt 720
gctgatgtct atggagacag tatgttcttc tgtctgagga gggaacaact ttttgccaga 780
cacttctgga acaggggagg gatggtggga gatgccatcc ctgcccaact ctacatcaag 840
ggcacagaca tcagggctaa ccctggctcc tctgtctact gtccaagccc atctggcagt 900
atggtgacct ctgacagcca acttttcaac aagccatact ggctgcacaa ggctcaagga 960
cacaacaatg gcatctgttg gcacaaccaa cttttcctga cagtggtgga caccaccagg 1020
agcaccaact tcaccctgag caccagcatt gagtccagca tcccaagcac ctatgaccca 1080
agcaagttca aggaatacac caggcatgtg gaggaatatg acctccaatt catcttccaa 1140
ctttgtactg tgaccctgac cacagatgtg atgagttaca tccacacaat gaactccagc 1200
atcctggaca actggaactt tgctgtggct cctcctccat ctgcctccct ggtggacacc 1260
tacagatacc tccaatctgc tgccatcact tgtcagaagg atgcccctgc ccctgagaag 1320
aaggacccat atgatggact gaagttctgg aatgtggacc tgagggagaa gttctccttg 1380
gaactggacc agtttccact gggcaggaag ttcctgctcc aagccagggt gaggaggaga 1440
ccaaccattg gaccaaggaa gagacctgct gccagcacct cctcctcctc tgccaccaaa 1500
cacaagagga agagggtgag caagtaaact cgagctc 1537
<![CDATA[<210> 87]]>
<![CDATA[<211> 1546]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3906 合成的HPV59L1基因]]>
<![CDATA[<400> 87]]>
ctgggtacca tggctctgtg gaggtcctct gacaacaagg tctacctgcc tcctccatct 60
gtggctaagg tggtgagcac agatgaatat gtgaccagga ccagcatctt ctaccatgct 120
ggctccagca gactgctgac agtgggacac ccatacttca aggtgccaaa gggaggcaat 180
ggcagacagg atgtgccaaa ggtgtctgcc taccaataca gggtgttcag ggtgaaactg 240
cctgacccaa acaagtttgg actgcctgac aacacagtct atgacccaaa cagccagaga 300
ctggtgtggg cttgtgtggg agtggagatt ggcaggggac aaccactggg agtgggactg 360
tctggacacc cactctacaa caaactggat gacacagaga actctcatgt ggcatctgct 420
gtggacacca aggacaccag ggacaatgtg tctgtggact acaagcagac ccaactttgt 480
atcattggct gtgtgcctgc cattggagaa cactggacca agggcacagc ctgtaagcca 540
accacagtgg tccagggaga ctgtcctcca ttggaactga taaacacacc aattgaggat 600
ggagatatgg tggacacagg ctatggagct atggacttca aactgctcca agacaacaag 660
tctgaggtgc cactggacat ctgtcagagc atctgtaaat accctgacta cctccaaatg 720
agtgctgatg cctatggaga cagtatgttc ttctgtctga ggagggaaca ggtgtttgcc 780
agacacttct ggaacaggtc tggcacaatg ggagaccaac ttcctgagtc cctctacatc 840
aagggcacag acatcagggc taaccctggc tcctacctct acagcccaag cccatctggc 900
tctgtggtga cctctgacag ccaacttttc aacaagccat actggctgca caaggctcaa 960
ggactgaaca atggcatctg ttggcacaac caacttttcc tgacagtggt ggacaccacc 1020
aggagcacca acctgtctgt gtgtgccagc accacctcca gcatcccaaa tgtctacaca 1080
ccaacctcct tcaaggaata tgccaggcat gtggaggagt ttgacctcca attcatcttc 1140
caactttgta agattaccct gaccacagag gtgatgagtt acatccacaa tatgaacacc 1200
accatcttgg aggactggaa ctttggagtg acacctcctc caacagcctc cctggtggac 1260
acctacaggt ttgtccagtc tgctgctgtg acttgtcaga aggacacagc ccctcctgtg 1320
aagcaggacc catatgacaa actgaagttc tggcctgtgg acctgaaaga gaggttctct 1380
gctgacctgg accagtttcc actgggcagg aagttcctgc tccaacttgg agccagacca 1440
aagccaacca ttggaccaag gaagagggct gcccctgccc caaccagcac accaagccca 1500
aagagggtga agaggaggaa gtccagcagg aagtaaactc gagctc 1546
<![CDATA[<210> 88]]>
<![CDATA[<211> 38]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3907 HPV39L1 F1]]>
<![CDATA[<400> 88]]>
cttggtacca tggctatgtg gaggtcctct gacagtat 38
<![CDATA[<210> 89]]>
<![CDATA[<211> 38]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3908 HPV39L1 R1]]>
<![CDATA[<400> 89]]>
tccaagttgg agcaggaact tcctgcccag tggaaact 38
<![CDATA[<210> 90]]>
<![CDATA[<211> 1428]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3909 HPV39L1擴增序列1]]>
<![CDATA[<400> 90]]>
cttggtacca tggctatgtg gaggtcctct gacagtatgg tctacctgcc tcctccatct 60
gtggctaagg tggtgaacac agatgactat gtgaccagga caggcatcta ctactatgct 120
ggctccagca gactgctgac agtgggacac ccatacttca aggtggggat gaatggaggc 180
aggaagcagg acatcccaaa ggtgtctgcc taccaataca gggtgttcag ggtgaccctg 240
cctgacccaa acaagttcag catccctgat gcctccctct acaaccctga gacccagaga 300
ctggtgtggg cttgtgtggg agtggaggtg ggcaggggac aaccactggg agtgggcatc 360
tctggacacc cactctacaa cagacaggat gacacagaga acagcccatt ctccagcacc 420
accaacaagg acagcaggga caatgtgtct gtggactaca agcagaccca actttgtatc 480
attggctgtg tgcctgccat tggagaacac tggggcaagg gcaaggcttg taagccaaac 540
aatgtgagca caggagactg tcctccattg gaactggtga acacaccaat tgaggatgga 600
gatatgattg acacaggcta tggagctatg gactttggag ccctccaaga gaccaagtct 660
gaggtgccac tggacatctg tcagagcatc tgtaaatacc ctgactacct ccaaatgagt 720
gctgatgtct atggagacag tatgttcttc tgtctgagga gggaacaact ttttgccaga 780
cacttctgga acaggggagg gatggtggga gatgccatcc ctgcccaact ctacatcaag 840
ggcacagaca tcagggctaa ccctggctcc tctgtctact gtccaagccc atctggcagt 900
atggtgacct ctgacagcca acttttcaac aagccatact ggctgcacaa ggctcaagga 960
cacaacaatg gcatctgttg gcacaaccaa cttttcctga cagtggtgga caccaccagg 1020
agcaccaact tcaccctgag caccagcatt gagtccagca tcccaagcac ctatgaccca 1080
agcaagttca aggaatacac caggcatgtg gaggaatatg acctccaatt catcttccaa 1140
ctttgtactg tgaccctgac cacagatgtg atgagttaca tccacacaat gaactccagc 1200
atcctggaca actggaactt tgctgtggct cctcctccat ctgcctccct ggtggacacc 1260
tacagatacc tccaatctgc tgccatcact tgtcagaagg atgcccctgc ccctgagaag 1320
aaggacccat atgatggact gaagttctgg aatgtggacc tgagggagaa gttctccttg 1380
gaactggacc agtttccact gggcaggaag ttcctgctcc aacttgga 1428
<![CDATA[<210> 91]]>
<![CDATA[<211> 37]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3910 HPV39L1 F2]]>
<![CDATA[<400> 91]]>
aggaagttcc tgctccaact tggagccaga ccaaagc 37
<![CDATA[<210> 92]]>
<![CDATA[<211> 40]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3911 HPV39L1 R2]]>
<![CDATA[<400> 92]]>
ctgtctagat ttacttcctg ctggacttcc tcctcttcac 40
<![CDATA[<210> 93]]>
<![CDATA[<211> 139]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3912 HPV39L1擴增序列2]]>
<![CDATA[<400> 93]]>
aggaagttcc tgctccaact tggagccaga ccaaagccaa ccattggacc aaggaagagg 60
gctgcccctg ccccaaccag cacaccaagc ccaaagaggg tgaagaggag gaagtccagc 120
aggaagtaaa tctagacag 139
<![CDATA[<210> 94]]>
<![CDATA[<211> 26]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 3913 HPV 33型L1蛋白的474-499胺基酸序列]]>
<![CDATA[<400> 94]]>
Lys Ala Lys Pro Lys Leu Lys Arg Ala Ala Pro Thr Ser Thr Arg Thr
1 5 10 15
Ser Ser Ala Lys Arg Lys Lys Val Lys Lys
20 25
<![CDATA[<210> 95]]>
<![CDATA[<211> 478]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 450]]>1 HPV 45型L1蛋白的1-478胺基酸序列
<![CDATA[<400> 95]]>
Met Ala Leu Trp Arg Pro Ser Asp Ser Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Ser Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Val Pro Ser Gly Ala Gly Asn Lys Gln Ala Val Pro
50 55 60
Lys Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Ala Leu Pro Asp
65 70 75 80
Pro Asn Lys Phe Gly Leu Pro Asp Ser Thr Ile Tyr Asn Pro Glu Thr
85 90 95
Gln Arg Leu Val Trp Ala Cys Val Gly Met Glu Ile Gly Arg Gly Gln
100 105 110
Pro Leu Gly Ile Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp
115 120 125
Asp Thr Glu Ser Ala His Ala Ala Thr Ala Val Ile Thr Gln Asp Val
130 135 140
Arg Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu
145 150 155 160
Gly Cys Val Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Leu Cys
165 170 175
Lys Pro Ala Gln Leu Gln Pro Gly Asp Cys Pro Pro Leu Glu Leu Lys
180 185 190
Asn Thr Ile Ile Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala
195 200 205
Met Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp
210 215 220
Ile Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala
225 230 235 240
Asp Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu
245 250 255
Phe Ala Arg His Phe Trp Asn Arg Ala Gly Val Met Gly Asp Thr Val
260 265 270
Pro Thr Asp Leu Tyr Ile Lys Gly Thr Ser Ala Asn Met Arg Glu Thr
275 280 285
Pro Gly Ser Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Thr Thr
290 295 300
Ser Asp Ser Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln
305 310 315 320
Gly His Asn Asn Gly Ile Cys Trp His Asn Gln Leu Phe Val Thr Val
325 330 335
Val Asp Thr Thr Arg Ser Thr Asn Leu Thr Leu Cys Ala Ser Thr Gln
340 345 350
Asn Pro Val Pro Asn Thr Tyr Asp Pro Thr Lys Phe Lys His Tyr Ser
355 360 365
Arg His Val Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr
370 375 380
Ile Thr Leu Thr Ala Glu Val Met Ser Tyr Ile His Ser Met Asn Ser
385 390 395 400
Ser Ile Leu Glu Asn Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr
405 410 415
Ser Leu Val Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Val Thr Cys
420 425 430
Gln Lys Asp Thr Thr Pro Pro Glu Lys Gln Asp Pro Tyr Asp Lys Leu
435 440 445
Lys Phe Trp Thr Val Asp Leu Lys Glu Lys Phe Ser Ser Asp Leu Asp
450 455 460
Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu
465 470 475
<![CDATA[<210> 96]]>
<![CDATA[<211> 26]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 4502 HPV 33型L1蛋白的474-499胺基酸序列]]>
<![CDATA[<400> 96]]>
Lys Ala Lys Pro Lys Leu Lys Arg Ala Ala Pro Thr Ser Thr Arg Thr
1 5 10 15
Ser Ser Ala Lys Arg Lys Lys Val Lys Lys
20 25
<![CDATA[<210> 97]]>
<![CDATA[<211> 504]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 4503 嵌合的HPV 45型L1蛋白的胺基酸序列]]>
<![CDATA[<400> 97]]>
Met Ala Leu Trp Arg Pro Ser Asp Ser Thr Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Ser Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro
35 40 45
Tyr Phe Arg Val Val Pro Ser Gly Ala Gly Asn Lys Gln Ala Val Pro
50 55 60
Lys Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Ala Leu Pro Asp
65 70 75 80
Pro Asn Lys Phe Gly Leu Pro Asp Ser Thr Ile Tyr Asn Pro Glu Thr
85 90 95
Gln Arg Leu Val Trp Ala Cys Val Gly Met Glu Ile Gly Arg Gly Gln
100 105 110
Pro Leu Gly Ile Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp
115 120 125
Asp Thr Glu Ser Ala His Ala Ala Thr Ala Val Ile Thr Gln Asp Val
130 135 140
Arg Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu
145 150 155 160
Gly Cys Val Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Leu Cys
165 170 175
Lys Pro Ala Gln Leu Gln Pro Gly Asp Cys Pro Pro Leu Glu Leu Lys
180 185 190
Asn Thr Ile Ile Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala
195 200 205
Met Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp
210 215 220
Ile Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala
225 230 235 240
Asp Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu
245 250 255
Phe Ala Arg His Phe Trp Asn Arg Ala Gly Val Met Gly Asp Thr Val
260 265 270
Pro Thr Asp Leu Tyr Ile Lys Gly Thr Ser Ala Asn Met Arg Glu Thr
275 280 285
Pro Gly Ser Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Thr Thr
290 295 300
Ser Asp Ser Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln
305 310 315 320
Gly His Asn Asn Gly Ile Cys Trp His Asn Gln Leu Phe Val Thr Val
325 330 335
Val Asp Thr Thr Arg Ser Thr Asn Leu Thr Leu Cys Ala Ser Thr Gln
340 345 350
Asn Pro Val Pro Asn Thr Tyr Asp Pro Thr Lys Phe Lys His Tyr Ser
355 360 365
Arg His Val Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr
370 375 380
Ile Thr Leu Thr Ala Glu Val Met Ser Tyr Ile His Ser Met Asn Ser
385 390 395 400
Ser Ile Leu Glu Asn Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr
405 410 415
Ser Leu Val Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Val Thr Cys
420 425 430
Gln Lys Asp Thr Thr Pro Pro Glu Lys Gln Asp Pro Tyr Asp Lys Leu
435 440 445
Lys Phe Trp Thr Val Asp Leu Lys Glu Lys Phe Ser Ser Asp Leu Asp
450 455 460
Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu Lys Ala
465 470 475 480
Lys Pro Lys Leu Lys Arg Ala Ala Pro Thr Ser Thr Arg Thr Ser Ser
485 490 495
Ala Lys Arg Lys Lys Val Lys Lys
500
<![CDATA[<210> 98]]>
<![CDATA[<211> 1516]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 4504]]> 嵌合的HPV 45型L1蛋白的核苷酸序列
<![CDATA[<400> 98]]>
atggctctgt ggagaccatc tgacagcaca gtctacctgc ctcctccatc tgtggcaagg 60
gtggtgaaca cagatgacta tgtgagcagg accagcatct tctaccatgc tggctccagc 120
agactgctga cagtgggcaa cccatacttc agggtggtgc caagtggagc aggcaacaag 180
caggctgtgc caaaggtgtc tgcctaccaa tacagggtgt tcagggtggc tctgcctgac 240
ccaaacaagt ttggactgcc tgacagcacc atctacaacc ctgagaccca gagactggtg 300
tgggcttgtg tggggatgga gattggcagg ggacaaccac tgggcattgg actgtctgga 360
cacccattct acaacaaact ggatgacaca gagtctgccc atgctgccac agcagtgatt 420
acccaggatg tgagggacaa tgtgtctgtg gactacaagc agacccaact ttgtatcctg 480
ggctgtgtgc ctgccattgg agaacactgg gctaagggca ccctgtgtaa gcctgcccaa 540
ctccaacctg gagactgtcc tccattggaa ctgaaaaaca ccatcattga ggatggagat 600
atggtggaca caggctatgg agctatggac ttcagcaccc tccaagacac caagtgtgag 660
gtgccactgg acatctgtca gagcatctgt aaataccctg actacctcca aatgagtgct 720
gacccatatg gagacagtat gttcttctgt ctgaggaggg aacaactttt tgccagacac 780
ttctggaaca gggctggagt gatgggagac acagtgccaa cagacctcta catcaagggc 840
acctctgcca atatgaggga gacacctggc tcctgtgtct acagcccaag cccatctggc 900
agcatcacca cctctgacag ccaacttttc aacaagccat actggctgca caaggctcaa 960
ggacacaaca atggcatctg ttggcacaac caactttttg tgacagtggt ggacaccacc 1020
aggagcacca acctgaccct gtgtgccagc acccagaacc ctgtgccaaa cacctatgac 1080
ccaaccaagt tcaagcacta cagcaggcat gtggaggaat atgacctcca attcatcttc 1140
caactttgta ccatcaccct gacagcagag gtgatgagtt acatccacag tatgaactcc 1200
agcatcttgg agaactggaa ctttggagtg cctcctcctc caaccacctc cctggtggac 1260
acctacaggt ttgtccagtc tgtggctgtg acttgtcaga aggacaccac acctcctgag 1320
aagcaggacc catatgacaa actgaagttc tggacagtgg acctgaaaga gaagttctcc 1380
tctgacctgg accaataccc actgggcagg aagttcctgg tccaggctgg actgaaagcc 1440
aagccaaaac tgaaaagggc tgccccaacc agcaccagga cctcctctgc caagaggaag 1500
aaggtgaaga agtaaa 1516
<![CDATA[<210> 99]]>
<![CDATA[<211> 1552]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 4505 合成的HPV45L1基因]]>
<![CDATA[<400> 99]]>
ctgggtacca tggctctgtg gagaccatct gacagcacag tctacctgcc tcctccatct 60
gtggcaaggg tggtgaacac agatgactat gtgagcagga ccagcatctt ctaccatgct 120
ggctccagca gactgctgac agtgggcaac ccatacttca gggtggtgcc aagtggagca 180
ggcaacaagc aggctgtgcc aaaggtgtct gcctaccaat acagggtgtt cagggtggct 240
ctgcctgacc caaacaagtt tggactgcct gacagcacca tctacaaccc tgagacccag 300
agactggtgt gggcttgtgt ggggatggag attggcaggg gacaaccact gggcattgga 360
ctgtctggac acccattcta caacaaactg gatgacacag agtctgccca tgctgccaca 420
gcagtgatta cccaggatgt gagggacaat gtgtctgtgg actacaagca gacccaactt 480
tgtatcctgg gctgtgtgcc tgccattgga gaacactggg ctaagggcac cctgtgtaag 540
cctgcccaac tccaacctgg agactgtcct ccattggaac tgaaaaacac catcattgag 600
gatggagata tggtggacac aggctatgga gctatggact tcagcaccct ccaagacacc 660
aagtgtgagg tgccactgga catctgtcag agcatctgta aataccctga ctacctccaa 720
atgagtgctg acccatatgg agacagtatg ttcttctgtc tgaggaggga acaacttttt 780
gccagacact tctggaacag ggctggagtg atgggagaca cagtgccaac agacctctac 840
atcaagggca cctctgccaa tatgagggag acacctggct cctgtgtcta cagcccaagc 900
ccatctggca gcatcaccac ctctgacagc caacttttca acaagccata ctggctgcac 960
aaggctcaag gacacaacaa tggcatctgt tggcacaacc aactttttgt gacagtggtg 1020
gacaccacca ggagcaccaa cctgaccctg tgtgccagca cccagaaccc tgtgccaaac 1080
acctatgacc caaccaagtt caagcactac agcaggcatg tggaggaata tgacctccaa 1140
ttcatcttcc aactttgtac catcaccctg acagcagagg tgatgagtta catccacagt 1200
atgaactcca gcatcttgga gaactggaac tttggagtgc ctcctcctcc aaccacctcc 1260
ctggtggaca cctacaggtt tgtccagtct gtggctgtga cttgtcagaa ggacaccaca 1320
cctcctgaga agcaggaccc atatgacaaa ctgaagttct ggacagtgga cctgaaagag 1380
aagttctcct ctgacctgga ccaataccca ctgggcagga agttcctggt ccaggctgga 1440
ctgaggagga gaccaaccat tggaccaagg aagagacctg ctgccagcac cagcacagcc 1500
agcagacctg ccaagagggt gaggattagg agcaagaagt aaactcgagc tc 1552
<![CDATA[<210> 100]]>
<![CDATA[<211> 1519]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 4506 合成的HPV33L1基因]]>
<![CDATA[<400> 100]]>
ctgggtacca tgagtgtgtg gagaccatct gaggctacag tctacctgcc tcctgtgcct 60
gtgagcaagg tggtgagcac agatgaatat gtgagcagga ccagcatcta ctactatgct 120
ggctccagca gactgctggc tgtgggacac ccatacttca gcatcaagaa cccaaccaat 180
gccaagaaac tgctggtgcc aaaggtgtct ggactccaat acagggtgtt cagggtgaga 240
ctgcctgacc caaacaagtt tggctttcct gacacctcct tctacaaccc tgacacccag 300
agactggtgt gggcttgtgt gggattggag attggcaggg gacaaccact gggagtgggc 360
atctctggac acccactgct gaacaagttt gatgacacag agacaggcaa caaataccct 420
ggacaacctg gagcagacaa cagggagtgt ctgagtatgg actacaagca gacccaactt 480
tgtctgctgg gctgtaagcc tccaacagga gaacactggg gcaagggagt ggcttgtacc 540
aatgctgccc ctgccaatga ctgtcctcca ttggaactga taaacaccat cattgaggat 600
ggagatatgg tggacacagg ctttggctgt atggacttca agaccctcca agccaacaag 660
tctgatgtgc caattgacat ctgtggcagc acttgtaaat accctgacta cctgaaaatg 720
acctctgaac catatggaga ctccctgttc ttcttcctga ggagggaaca gatgtttgtg 780
agacacttct tcaacagggc tggcaccctg ggagaggctg tgcctgatga cctctacatc 840
aagggctctg gcaccacagc cagcatccag tcctctgcct tctttccaac accatctggc 900
agtatggtga cctctgagag ccaacttttc aacaagccat actggctcca aagggctcaa 960
ggacacaaca atggcatctg ttggggcaac caggtgtttg tgacagtggt ggacaccacc 1020
aggagcacca atatgaccct gtgtacccag gtgacctctg acagcaccta caagaatgag 1080
aacttcaagg aatacatcag gcatgtggag gaatatgacc tccaatttgt gttccaactt 1140
tgtaaggtga ccctgacagc agaggtgatg acctacatcc atgctatgaa ccctgacatc 1200
ttggaggact ggcagtttgg actgacacct cctccatctg cctccctcca agacacctac 1260
aggtttgtga ccagccaggc tatcacttgt cagaagacag tgcctccaaa ggagaaggag 1320
gacccactgg gcaaatacac cttctgggag gtggacctga aagagaagtt ctctgctgac 1380
ctggaccagt ttccactggg caggaagttc ctgctccaag caggactgaa agccaagcca 1440
aaactgaaaa gggctgcccc aaccagcacc aggacctcct ctgccaagag gaagaaggtg 1500
aagaagtaaa ctcgagctc 1519
<![CDATA[<210> 101]]>
<![CDATA[<211> 33]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 4507 HPV45L1 F1]]>
<![CDATA[<400> 101]]>
cttggtacca tggctctgtg gagaccatct gac 33
<![CDATA[<210> 102]]>
<![CDATA[<211> 32]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 4508 HPV45L1 R1]]>
<![CDATA[<400> 102]]>
gcttggcttt cagtccagcc tggaccagga ac 32
<![CDATA[<210> 103]]>
<![CDATA[<211> 1453]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<22]]>0> ]]>
<br/><![CDATA[<223> 4509 HPV45L1擴增序列1]]>
<br/>
<br/><![CDATA[<400> 103]]>
<br/><![CDATA[cttggtacca tggctctgtg gagaccatct gacagcacag tctacctgcc tcctccatct 60
gtggcaaggg tggtgaacac agatgactat gtgagcagga ccagcatctt ctaccatgct 120
ggctccagca gactgctgac agtgggcaac ccatacttca gggtggtgcc aagtggagca 180
ggcaacaagc aggctgtgcc aaaggtgtct gcctaccaat acagggtgtt cagggtggct 240
ctgcctgacc caaacaagtt tggactgcct gacagcacca tctacaaccc tgagacccag 300
agactggtgt gggcttgtgt ggggatggag attggcaggg gacaaccact gggcattgga 360
ctgtctggac acccattcta caacaaactg gatgacacag agtctgccca tgctgccaca 420
gcagtgatta cccaggatgt gagggacaat gtgtctgtgg actacaagca gacccaactt 480
tgtatcctgg gctgtgtgcc tgccattgga gaacactggg ctaagggcac cctgtgtaag 540
cctgcccaac tccaacctgg agactgtcct ccattggaac tgaaaaacac catcattgag 600
gatggagata tggtggacac aggctatgga gctatggact tcagcaccct ccaagacacc 660
aagtgtgagg tgccactgga catctgtcag agcatctgta aataccctga ctacctccaa 720
atgagtgctg acccatatgg agacagtatg ttcttctgtc tgaggaggga acaacttttt 780
gccagacact tctggaacag ggctggagtg atgggagaca cagtgccaac agacctctac 840
atcaagggca cctctgccaa tatgagggag acacctggct cctgtgtcta cagcccaagc 900
ccatctggca gcatcaccac ctctgacagc caacttttca acaagccata ctggctgcac 960
aaggctcaag gacacaacaa tggcatctgt tggcacaacc aactttttgt gacagtggtg 1020
gacaccacca ggagcaccaa cctgaccctg tgtgccagca cccagaaccc tgtgccaaac 1080
acctatgacc caaccaagtt caagcactac agcaggcatg tggaggaata tgacctccaa 1140
ttcatcttcc aactttgtac catcaccctg acagcagagg tgatgagtta catccacagt 1200
atgaactcca gcatcttgga gaactggaac tttggagtgc ctcctcctcc aaccacctcc 1260
ctggtggaca cctacaggtt tgtccagtct gtggctgtga cttgtcagaa ggacaccaca 1320
cctcctgaga agcaggaccc atatgacaaa ctgaagttct ggacagtgga cctgaaagag 1380
aagttctcct ctgacctgga ccaataccca ctgggcagga agttcctggt ccaggctgga 1440
ctgaaagcca agc 1453
<![CDATA[<210> 104]]>
<![CDATA[<211> 35]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<2]]>20> ]]>
<br/><![CDATA[<223> 4510 HPV45L1]]><![CDATA[ F2
<![CDATA[<400> 104]]>
ggctggactg aaagccaagc caaaactgaa aaggg 35
<![CDATA[<210> 105]]>
<![CDATA[<211> 37]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 4511 HPV45L1 R2]]>
<![CDATA[<400> 105]]>
ctgtctagat ttacttcttc accttcttcc tcttggc 37
<![CDATA[<210> 106]]>
<![CDATA[<211> 101]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 4512 HPV45L1擴增序列2]]>
<![CDATA[<400> 106]]>
ggctggactg aaagccaagc caaaactgaa aagggctgcc ccaaccagca ccaggacctc 60
ctctgccaag aggaagaagg tgaagaagta aatctagaca g 101
<![CDATA[<210> 107]]>
<![CDATA[<211> 38]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 4513 HPV 59型L1蛋白的471-508胺基酸序列]]>
<![CDATA[<400> 107]]>
Leu Gln Leu Gly Ala Arg Pro Lys Pro Thr Ile Gly Pro Arg Lys Arg
1 5 10 15
Ala Ala Pro Ala Pro Thr Ser Thr Pro Ser Pro Lys Arg Val Lys Arg
20 25 30
Arg Lys Ser Ser Arg Lys
35
<![CDATA[<210> 108]]>
<![CDATA[<211> 474]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5101 HPV 51型L1蛋白的1-474胺基酸序列]]>
<![CDATA[<400> 108]]>
Met Ala Leu Trp Arg Thr Asn Asp Ser Lys Val Tyr Leu Pro Pro Ala
1 5 10 15
Pro Val Ser Arg Ile Val Asn Thr Glu Glu Tyr Ile Thr Arg Thr Gly
20 25 30
Ile Tyr Tyr Tyr Ala Gly Ser Ser Arg Leu Ile Thr Leu Gly His Pro
35 40 45
Tyr Phe Pro Ile Pro Lys Thr Ser Thr Arg Ala Ala Ile Pro Lys Val
50 55 60
Ser Ala Phe Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro Asn
65 70 75 80
Lys Phe Gly Leu Pro Asp Pro Asn Leu Tyr Asn Pro Asp Thr Asp Arg
85 90 95
Leu Val Trp Gly Cys Val Gly Val Glu Val Gly Arg Gly Gln Pro Leu
100 105 110
Gly Val Gly Leu Ser Gly His Pro Leu Phe Asn Lys Tyr Asp Asp Thr
115 120 125
Glu Asn Ser Arg Ile Ala Asn Gly Asn Ala Gln Gln Asp Val Arg Asp
130 135 140
Asn Thr Ser Val Asp Asn Lys Gln Thr Gln Leu Cys Ile Ile Gly Cys
145 150 155 160
Ala Pro Pro Ile Gly Glu His Trp Gly Ile Gly Thr Thr Cys Lys Asn
165 170 175
Thr Pro Val Pro Pro Gly Asp Cys Pro Pro Leu Glu Leu Val Ser Ser
180 185 190
Val Ile Gln Asp Gly Asp Met Ile Asp Thr Gly Phe Gly Ala Met Asp
195 200 205
Phe Ala Ala Leu Gln Ala Thr Lys Ser Asp Val Pro Leu Asp Ile Ser
210 215 220
Gln Ser Val Cys Lys Tyr Pro Asp Tyr Leu Lys Met Ser Ala Asp Thr
225 230 235 240
Tyr Gly Asn Ser Met Phe Phe His Leu Arg Arg Glu Gln Ile Phe Ala
245 250 255
Arg His Tyr Tyr Asn Lys Leu Val Gly Val Gly Glu Asp Ile Pro Asn
260 265 270
Asp Tyr Tyr Ile Lys Gly Ser Gly Asn Gly Arg Asp Pro Ile Glu Ser
275 280 285
Tyr Ile Tyr Ser Ala Thr Pro Ser Gly Ser Met Ile Thr Ser Asp Ser
290 295 300
Gln Ile Phe Asn Lys Pro Tyr Trp Leu His Arg Ala Gln Gly His Asn
305 310 315 320
Asn Gly Ile Cys Trp Asn Asn Gln Leu Phe Ile Thr Cys Val Asp Thr
325 330 335
Thr Arg Ser Thr Asn Leu Thr Ile Ser Thr Ala Thr Ala Ala Val Ser
340 345 350
Pro Thr Phe Thr Pro Ser Asn Phe Lys Gln Tyr Ile Arg His Gly Glu
355 360 365
Glu Tyr Glu Leu Gln Phe Ile Phe Gln Leu Cys Lys Ile Thr Leu Thr
370 375 380
Thr Glu Val Met Ala Tyr Leu His Thr Met Asp Pro Thr Ile Leu Glu
385 390 395 400
Gln Trp Asn Phe Gly Leu Thr Leu Pro Pro Ser Ala Ser Leu Glu Asp
405 410 415
Ala Tyr Arg Phe Val Arg Asn Ala Ala Thr Ser Cys Gln Lys Asp Thr
420 425 430
Pro Pro Gln Ala Lys Pro Asp Pro Leu Ala Lys Tyr Lys Phe Trp Asp
435 440 445
Val Asp Leu Lys Glu Arg Phe Ser Leu Asp Leu Asp Gln Phe Ala Leu
450 455 460
Gly Arg Lys Phe Leu Leu Gln Val Gly Val
465 470
<![CDATA[<210> 109]]>
<![CDATA[<211> 26]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5102 HPV 33型]]>L1蛋白的474-499胺基酸序列
<![CDATA[<400> 109]]>
Lys Ala Lys Pro Lys Leu Lys Arg Ala Ala Pro Thr Ser Thr Arg Thr
1 5 10 15
Ser Ser Ala Lys Arg Lys Lys Val Lys Lys
20 25
<![CDATA[<210> 110]]>
<![CDATA[<211> 500]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5103 嵌合的HPV 51型L1蛋白的胺基酸序列]]>
<![CDATA[<400> 110]]>
Met Ala Leu Trp Arg Thr Asn Asp Ser Lys Val Tyr Leu Pro Pro Ala
1 5 10 15
Pro Val Ser Arg Ile Val Asn Thr Glu Glu Tyr Ile Thr Arg Thr Gly
20 25 30
Ile Tyr Tyr Tyr Ala Gly Ser Ser Arg Leu Ile Thr Leu Gly His Pro
35 40 45
Tyr Phe Pro Ile Pro Lys Thr Ser Thr Arg Ala Ala Ile Pro Lys Val
50 55 60
Ser Ala Phe Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro Asn
65 70 75 80
Lys Phe Gly Leu Pro Asp Pro Asn Leu Tyr Asn Pro Asp Thr Asp Arg
85 90 95
Leu Val Trp Gly Cys Val Gly Val Glu Val Gly Arg Gly Gln Pro Leu
100 105 110
Gly Val Gly Leu Ser Gly His Pro Leu Phe Asn Lys Tyr Asp Asp Thr
115 120 125
Glu Asn Ser Arg Ile Ala Asn Gly Asn Ala Gln Gln Asp Val Arg Asp
130 135 140
Asn Thr Ser Val Asp Asn Lys Gln Thr Gln Leu Cys Ile Ile Gly Cys
145 150 155 160
Ala Pro Pro Ile Gly Glu His Trp Gly Ile Gly Thr Thr Cys Lys Asn
165 170 175
Thr Pro Val Pro Pro Gly Asp Cys Pro Pro Leu Glu Leu Val Ser Ser
180 185 190
Val Ile Gln Asp Gly Asp Met Ile Asp Thr Gly Phe Gly Ala Met Asp
195 200 205
Phe Ala Ala Leu Gln Ala Thr Lys Ser Asp Val Pro Leu Asp Ile Ser
210 215 220
Gln Ser Val Cys Lys Tyr Pro Asp Tyr Leu Lys Met Ser Ala Asp Thr
225 230 235 240
Tyr Gly Asn Ser Met Phe Phe His Leu Arg Arg Glu Gln Ile Phe Ala
245 250 255
Arg His Tyr Tyr Asn Lys Leu Val Gly Val Gly Glu Asp Ile Pro Asn
260 265 270
Asp Tyr Tyr Ile Lys Gly Ser Gly Asn Gly Arg Asp Pro Ile Glu Ser
275 280 285
Tyr Ile Tyr Ser Ala Thr Pro Ser Gly Ser Met Ile Thr Ser Asp Ser
290 295 300
Gln Ile Phe Asn Lys Pro Tyr Trp Leu His Arg Ala Gln Gly His Asn
305 310 315 320
Asn Gly Ile Cys Trp Asn Asn Gln Leu Phe Ile Thr Cys Val Asp Thr
325 330 335
Thr Arg Ser Thr Asn Leu Thr Ile Ser Thr Ala Thr Ala Ala Val Ser
340 345 350
Pro Thr Phe Thr Pro Ser Asn Phe Lys Gln Tyr Ile Arg His Gly Glu
355 360 365
Glu Tyr Glu Leu Gln Phe Ile Phe Gln Leu Cys Lys Ile Thr Leu Thr
370 375 380
Thr Glu Val Met Ala Tyr Leu His Thr Met Asp Pro Thr Ile Leu Glu
385 390 395 400
Gln Trp Asn Phe Gly Leu Thr Leu Pro Pro Ser Ala Ser Leu Glu Asp
405 410 415
Ala Tyr Arg Phe Val Arg Asn Ala Ala Thr Ser Cys Gln Lys Asp Thr
420 425 430
Pro Pro Gln Ala Lys Pro Asp Pro Leu Ala Lys Tyr Lys Phe Trp Asp
435 440 445
Val Asp Leu Lys Glu Arg Phe Ser Leu Asp Leu Asp Gln Phe Ala Leu
450 455 460
Gly Arg Lys Phe Leu Leu Gln Val Gly Val Lys Ala Lys Pro Lys Leu
465 470 475 480
Lys Arg Ala Ala Pro Thr Ser Thr Arg Thr Ser Ser Ala Lys Arg Lys
485 490 495
Lys Val Lys Lys
500
<![CDATA[<210> 111]]>
<![CDATA[<211> 1504]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5104 嵌合的HPV 51型L1蛋白的核苷酸序列]]>
<![CDATA[<400> 111]]>
atggctctgt ggaggaccaa tgacagcaag gtctacctgc ctcctgcccc tgtgagcagg 60
attgtgaaca cagaggaata catcaccagg acaggcatct actactatgc tggctccagc 120
agactgatta ccctgggaca cccatacttt ccaatcccaa agaccagcac cagggctgcc 180
atcccaaagg tgtctgcctt ccaatacagg gtgttcaggg tccaacttcc tgacccaaac 240
aagtttggac tgcctgaccc aaacctctac aaccctgaca cagacagact ggtgtggggc 300
tgtgtgggag tggaggtggg caggggacaa ccactgggag tgggactgtc tggacaccca 360
ctgttcaaca aatatgatga cacagagaac agcaggattg ccaatggcaa tgcccaacag 420
gatgtgaggg acaacacctc tgtggacaac aagcagaccc aactttgtat cattggctgt 480
gcccctccaa ttggagaaca ctggggcatt ggcaccactt gtaagaacac acctgtgcct 540
cctggagact gtcctccatt ggaactggtg tcctctgtga ttcaggatgg agatatgatt 600
gacacaggct ttggagctat ggactttgct gccctccaag ccaccaagtc tgatgtgcca 660
ctggacatca gccagtctgt gtgtaaatac cctgactacc tgaaaatgag tgctgacacc 720
tatggcaaca gtatgttctt ccacctgagg agggaacaga tttttgccag acactactac 780
aacaaactgg tgggagtggg agaggacatc ccaaatgact actacatcaa gggctctggc 840
aatggcaggg acccaattga gtcctacatc tactctgcca caccatctgg cagtatgatt 900
acctctgaca gccagatttt caacaagcca tactggctgc acagggctca aggacacaac 960
aatggcatct gttggaacaa ccaacttttc atcacttgtg tggacaccac caggagcacc 1020
aacctgacca tcagcacagc cacagcagca gtgagcccaa ccttcacacc aagcaacttc 1080
aagcaataca tcagacatgg agaggaatat gaactccaat tcatcttcca actttgtaag 1140
attaccctga ccacagaggt gatggcttac ctgcacacaa tggacccaac catcttggaa 1200
cagtggaact ttggactgac cctgcctcca tctgcctcct tggaggatgc ctacaggttt 1260
gtgaggaatg ctgccacctc ctgtcagaag gacacacctc cacaggctaa gcctgaccca 1320
ctggctaaat acaagttctg ggatgtggac ctgaaagaga ggttctccct ggacctggac 1380
cagtttgccc tgggcaggaa gttcctgctc caagtgggag tcaaagccaa gccaaaactg 1440
aaaagggctg ccccaaccag caccaggacc tcctctgcca agaggaagaa ggtgaagaag 1500
taaa 1504
<![CDATA[<210> 112]]>
<![CDATA[<211> 1534]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<22]]>3> 5105 合成的HPV51L1基因]]>
<br/>
<br/><![CDATA[<400> 112]]>
<br/><![CDATA[ctgggtacca tggctctgtg gaggaccaat gacagcaagg tctacctgcc tcctgcccct 60
gtgagcagga ttgtgaacac agaggaatac atcaccagga caggcatcta ctactatgct 120
ggctccagca gactgattac cctgggacac ccatactttc caatcccaaa gaccagcacc 180
agggctgcca tcccaaaggt gtctgccttc caatacaggg tgttcagggt ccaacttcct 240
gacccaaaca agtttggact gcctgaccca aacctctaca accctgacac agacagactg 300
gtgtggggct gtgtgggagt ggaggtgggc aggggacaac cactgggagt gggactgtct 360
ggacacccac tgttcaacaa atatgatgac acagagaaca gcaggattgc caatggcaat 420
gcccaacagg atgtgaggga caacacctct gtggacaaca agcagaccca actttgtatc 480
attggctgtg cccctccaat tggagaacac tggggcattg gcaccacttg taagaacaca 540
cctgtgcctc ctggagactg tcctccattg gaactggtgt cctctgtgat tcaggatgga 600
gatatgattg acacaggctt tggagctatg gactttgctg ccctccaagc caccaagtct 660
gatgtgccac tggacatcag ccagtctgtg tgtaaatacc ctgactacct gaaaatgagt 720
gctgacacct atggcaacag tatgttcttc cacctgagga gggaacagat ttttgccaga 780
cactactaca acaaactggt gggagtggga gaggacatcc caaatgacta ctacatcaag 840
ggctctggca atggcaggga cccaattgag tcctacatct actctgccac accatctggc 900
agtatgatta cctctgacag ccagattttc aacaagccat actggctgca cagggctcaa 960
ggacacaaca atggcatctg ttggaacaac caacttttca tcacttgtgt ggacaccacc 1020
aggagcacca acctgaccat cagcacagcc acagcagcag tgagcccaac cttcacacca 1080
agcaacttca agcaatacat cagacatgga gaggaatatg aactccaatt catcttccaa 1140
ctttgtaaga ttaccctgac cacagaggtg atggcttacc tgcacacaat ggacccaacc 1200
atcttggaac agtggaactt tggactgacc ctgcctccat ctgcctcctt ggaggatgcc 1260
tacaggtttg tgaggaatgc tgccacctcc tgtcagaagg acacacctcc acaggctaag 1320
cctgacccac tggctaaata caagttctgg gatgtggacc tgaaagagag gttctccctg 1380
gacctggacc agtttgccct gggcaggaag ttcctgctcc aagtgggagt ccagaggaag 1440
ccaagacctg gactgaaaag acctgcctcc tctgcctcct cctcctcctc ctcctctgcc 1500
aagaggaaga gggtgaagaa gtaaactcga gctc 1534
<![CDATA[<210> 113]]>
<![CDATA[<211> 1519]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5106 合成的HPV33L1基因]]>
<![CDATA[<400> 113]]>
ctgggtacca tgagtgtgtg gagaccatct gaggctacag tctacctgcc tcctgtgcct 60
gtgagcaagg tggtgagcac agatgaatat gtgagcagga ccagcatcta ctactatgct 120
ggctccagca gactgctggc tgtgggacac ccatacttca gcatcaagaa cccaaccaat 180
gccaagaaac tgctggtgcc aaaggtgtct ggactccaat acagggtgtt cagggtgaga 240
ctgcctgacc caaacaagtt tggctttcct gacacctcct tctacaaccc tgacacccag 300
agactggtgt gggcttgtgt gggattggag attggcaggg gacaaccact gggagtgggc 360
atctctggac acccactgct gaacaagttt gatgacacag agacaggcaa caaataccct 420
ggacaacctg gagcagacaa cagggagtgt ctgagtatgg actacaagca gacccaactt 480
tgtctgctgg gctgtaagcc tccaacagga gaacactggg gcaagggagt ggcttgtacc 540
aatgctgccc ctgccaatga ctgtcctcca ttggaactga taaacaccat cattgaggat 600
ggagatatgg tggacacagg ctttggctgt atggacttca agaccctcca agccaacaag 660
tctgatgtgc caattgacat ctgtggcagc acttgtaaat accctgacta cctgaaaatg 720
acctctgaac catatggaga ctccctgttc ttcttcctga ggagggaaca gatgtttgtg 780
agacacttct tcaacagggc tggcaccctg ggagaggctg tgcctgatga cctctacatc 840
aagggctctg gcaccacagc cagcatccag tcctctgcct tctttccaac accatctggc 900
agtatggtga cctctgagag ccaacttttc aacaagccat actggctcca aagggctcaa 960
ggacacaaca atggcatctg ttggggcaac caggtgtttg tgacagtggt ggacaccacc 1020
aggagcacca atatgaccct gtgtacccag gtgacctctg acagcaccta caagaatgag 1080
aacttcaagg aatacatcag gcatgtggag gaatatgacc tccaatttgt gttccaactt 1140
tgtaaggtga ccctgacagc agaggtgatg acctacatcc atgctatgaa ccctgacatc 1200
ttggaggact ggcagtttgg actgacacct cctccatctg cctccctcca agacacctac 1260
aggtttgtga ccagccaggc tatcacttgt cagaagacag tgcctccaaa ggagaaggag 1320
gacccactgg gcaaatacac cttctgggag gtggacctga aagagaagtt ctctgctgac 1380
ctggaccagt ttccactggg caggaagttc ctgctccaag caggactgaa agccaagcca 1440
aaactgaaaa gggctgcccc aaccagcacc aggacctcct ctgccaagag gaagaaggtg 1500
aagaagtaaa ctcgagctc 1519
<![CDATA[<210> 114]]>
<![CDATA[<211> 34]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5107 HPV51L1 F1]]>
<![CDATA[<400> 114]]>
cttggtacca tggctctgtg gaggaccaat gaca 34
<![CDATA[<210> 115]]>
<![CDATA[<211> 32]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5108 HPV51L1 R1]]>
<![CDATA[<400> 115]]>
gcttggcttt gactcccact tggagcagga ac 32
<![CDATA[<210> 116]]>
<![CDATA[<211> 1441]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5109 HPV51L1擴增序列1]]>
<![CDATA[<400> 116]]>
cttggtacca tggctctgtg gaggaccaat gacagcaagg tctacctgcc tcctgcccct 60
gtgagcagga ttgtgaacac agaggaatac atcaccagga caggcatcta ctactatgct 120
ggctccagca gactgattac cctgggacac ccatactttc caatcccaaa gaccagcacc 180
agggctgcca tcccaaaggt gtctgccttc caatacaggg tgttcagggt ccaacttcct 240
gacccaaaca agtttggact gcctgaccca aacctctaca accctgacac agacagactg 300
gtgtggggct gtgtgggagt ggaggtgggc aggggacaac cactgggagt gggactgtct 360
ggacacccac tgttcaacaa atatgatgac acagagaaca gcaggattgc caatggcaat 420
gcccaacagg atgtgaggga caacacctct gtggacaaca agcagaccca actttgtatc 480
attggctgtg cccctccaat tggagaacac tggggcattg gcaccacttg taagaacaca 540
cctgtgcctc ctggagactg tcctccattg gaactggtgt cctctgtgat tcaggatgga 600
gatatgattg acacaggctt tggagctatg gactttgctg ccctccaagc caccaagtct 660
gatgtgccac tggacatcag ccagtctgtg tgtaaatacc ctgactacct gaaaatgagt 720
gctgacacct atggcaacag tatgttcttc cacctgagga gggaacagat ttttgccaga 780
cactactaca acaaactggt gggagtggga gaggacatcc caaatgacta ctacatcaag 840
ggctctggca atggcaggga cccaattgag tcctacatct actctgccac accatctggc 900
agtatgatta cctctgacag ccagattttc aacaagccat actggctgca cagggctcaa 960
ggacacaaca atggcatctg ttggaacaac caacttttca tcacttgtgt ggacaccacc 1020
aggagcacca acctgaccat cagcacagcc acagcagcag tgagcccaac cttcacacca 1080
agcaacttca agcaatacat cagacatgga gaggaatatg aactccaatt catcttccaa 1140
ctttgtaaga ttaccctgac cacagaggtg atggcttacc tgcacacaat ggacccaacc 1200
atcttggaac agtggaactt tggactgacc ctgcctccat ctgcctcctt ggaggatgcc 1260
tacaggtttg tgaggaatgc tgccacctcc tgtcagaagg acacacctcc acaggctaag 1320
cctgacccac tggctaaata caagttctgg gatgtggacc tgaaagagag gttctccctg 1380
gacctggacc agtttgccct gggcaggaag ttcctgctcc aagtgggagt caaagccaag 1440
c 1441
<![CDATA[<210> 117]]>
<![CDATA[<211> 35]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5110 HPV51L1 F2]]>
<![CDATA[<400> 117]]>
agtgggagtc aaagccaagc caaaactgaa aaggg 35
<![CDATA[<210> 118]]>
<![CDATA[<211> 36]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5111 HPV51L1 R2]]>
<![CDATA[<400> 118]]>
ctgtctagat ttacttcttc accttcttcc tcttgg 36
<![CDATA[<210> 119]]>
<![CDATA[<211> 101]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5112 HPV51L1擴增序列2]]>
<![CDATA[<400> 119]]>
agtgggagtc aaagccaagc caaaactgaa aagggctgcc ccaaccagca ccaggacctc 60
ctctgccaag aggaagaagg tgaagaagta aatctagaca g 101
<![CDATA[<210> 120]]>
<![CDATA[<211> 38]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5113 HPV 59型L1蛋白的471-508胺基酸序列]]>
<![CDATA[<400> 120]]>
Leu Gln Leu Gly Ala Arg Pro Lys Pro Thr Ile Gly Pro Arg Lys Arg
1 5 10 15
Ala Ala Pro Ala Pro Thr Ser Thr Pro Ser Pro Lys Arg Val Lys Arg
20 25 30
Arg Lys Ser Ser Arg Lys
35
<![CDATA[<210> 121]]>
<![CDATA[<211> 478]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5201 HPV 52型L1蛋白的1-478胺基酸序列]]>
<![CDATA[<400> 121]]>
Met Ser Val Trp Arg Pro Ser Glu Ala Thr Val Tyr Leu Pro Pro Val
1 5 10 15
Pro Val Ser Lys Val Val Ser Thr Asp Glu Tyr Val Ser Arg Thr Ser
20 25 30
Ile Tyr Tyr Tyr Ala Gly Ser Ser Arg Leu Leu Thr Val Gly His Pro
35 40 45
Tyr Phe Ser Ile Lys Asn Thr Ser Ser Gly Asn Gly Lys Lys Val Leu
50 55 60
Val Pro Lys Val Ser Gly Leu Gln Tyr Arg Val Phe Arg Ile Lys Leu
65 70 75 80
Pro Asp Pro Asn Lys Phe Gly Phe Pro Asp Thr Ser Phe Tyr Asn Pro
85 90 95
Glu Thr Gln Arg Leu Val Trp Ala Cys Thr Gly Leu Glu Ile Gly Arg
100 105 110
Gly Gln Pro Leu Gly Val Gly Ile Ser Gly His Pro Leu Leu Asn Lys
115 120 125
Phe Asp Asp Thr Glu Thr Ser Asn Lys Tyr Ala Gly Lys Pro Gly Ile
130 135 140
Asp Asn Arg Glu Cys Leu Ser Met Asp Tyr Lys Gln Thr Gln Leu Cys
145 150 155 160
Ile Leu Gly Cys Lys Pro Pro Ile Gly Glu His Trp Gly Lys Gly Thr
165 170 175
Pro Cys Asn Asn Asn Ser Gly Asn Pro Gly Asp Cys Pro Pro Leu Gln
180 185 190
Leu Ile Asn Ser Val Ile Gln Asp Gly Asp Met Val Asp Thr Gly Phe
195 200 205
Gly Cys Met Asp Phe Asn Thr Leu Gln Ala Ser Lys Ser Asp Val Pro
210 215 220
Ile Asp Ile Cys Ser Ser Val Cys Lys Tyr Pro Asp Tyr Leu Gln Met
225 230 235 240
Ala Ser Glu Pro Tyr Gly Asp Ser Leu Phe Phe Phe Leu Arg Arg Glu
245 250 255
Gln Met Phe Val Arg His Phe Phe Asn Arg Ala Gly Thr Leu Gly Asp
260 265 270
Pro Val Pro Gly Asp Leu Tyr Ile Gln Gly Ser Asn Ser Gly Asn Thr
275 280 285
Ala Thr Val Gln Ser Ser Ala Phe Phe Pro Thr Pro Ser Gly Ser Met
290 295 300
Val Thr Ser Glu Ser Gln Leu Phe Asn Lys Pro Tyr Trp Leu Gln Arg
305 310 315 320
Ala Gln Gly His Asn Asn Gly Ile Cys Trp Gly Asn Gln Leu Phe Val
325 330 335
Thr Val Val Asp Thr Thr Arg Ser Thr Asn Met Thr Leu Cys Ala Glu
340 345 350
Val Lys Lys Glu Ser Thr Tyr Lys Asn Glu Asn Phe Lys Glu Tyr Leu
355 360 365
Arg His Gly Glu Glu Phe Asp Leu Gln Phe Ile Phe Gln Leu Cys Lys
370 375 380
Ile Thr Leu Thr Ala Asp Val Met Thr Tyr Ile His Lys Met Asp Ala
385 390 395 400
Thr Ile Leu Glu Asp Trp Gln Phe Gly Leu Thr Pro Pro Pro Ser Ala
405 410 415
Ser Leu Glu Asp Thr Tyr Arg Phe Val Thr Ser Thr Ala Ile Thr Cys
420 425 430
Gln Lys Asn Thr Pro Pro Lys Gly Lys Glu Asp Pro Leu Lys Asp Tyr
435 440 445
Met Phe Trp Glu Val Asp Leu Lys Glu Lys Phe Ser Ala Asp Leu Asp
450 455 460
Gln Phe Pro Leu Gly Arg Lys Phe Leu Leu Gln Ala Gly Leu
465 470 475
<![CDATA[<210> 122]]>
<![CDATA[<211> 26]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5202 HPV 33型L1蛋白的474-499胺基酸序列]]>
<![CDATA[<400> 122]]>
Lys Ala Lys Pro Lys Leu Lys Arg Ala Ala Pro Thr Ser Thr Arg Thr
1 5 10 15
Ser Ser Ala Lys Arg Lys Lys Val Lys Lys
20 25
<![CDATA[<210> 123]]>
<![CDATA[<211> 504]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5203 嵌合的HPV 52型L1蛋白的胺基酸序列]]>
<![CDATA[<400> 123]]>
Met Ser Val Trp Arg Pro Ser Glu Ala Thr Val Tyr Leu Pro Pro Val
1 5 10 15
Pro Val Ser Lys Val Val Ser Thr Asp Glu Tyr Val Ser Arg Thr Ser
20 25 30
Ile Tyr Tyr Tyr Ala Gly Ser Ser Arg Leu Leu Thr Val Gly His Pro
35 40 45
Tyr Phe Ser Ile Lys Asn Thr Ser Ser Gly Asn Gly Lys Lys Val Leu
50 55 60
Val Pro Lys Val Ser Gly Leu Gln Tyr Arg Val Phe Arg Ile Lys Leu
65 70 75 80
Pro Asp Pro Asn Lys Phe Gly Phe Pro Asp Thr Ser Phe Tyr Asn Pro
85 90 95
Glu Thr Gln Arg Leu Val Trp Ala Cys Thr Gly Leu Glu Ile Gly Arg
100 105 110
Gly Gln Pro Leu Gly Val Gly Ile Ser Gly His Pro Leu Leu Asn Lys
115 120 125
Phe Asp Asp Thr Glu Thr Ser Asn Lys Tyr Ala Gly Lys Pro Gly Ile
130 135 140
Asp Asn Arg Glu Cys Leu Ser Met Asp Tyr Lys Gln Thr Gln Leu Cys
145 150 155 160
Ile Leu Gly Cys Lys Pro Pro Ile Gly Glu His Trp Gly Lys Gly Thr
165 170 175
Pro Cys Asn Asn Asn Ser Gly Asn Pro Gly Asp Cys Pro Pro Leu Gln
180 185 190
Leu Ile Asn Ser Val Ile Gln Asp Gly Asp Met Val Asp Thr Gly Phe
195 200 205
Gly Cys Met Asp Phe Asn Thr Leu Gln Ala Ser Lys Ser Asp Val Pro
210 215 220
Ile Asp Ile Cys Ser Ser Val Cys Lys Tyr Pro Asp Tyr Leu Gln Met
225 230 235 240
Ala Ser Glu Pro Tyr Gly Asp Ser Leu Phe Phe Phe Leu Arg Arg Glu
245 250 255
Gln Met Phe Val Arg His Phe Phe Asn Arg Ala Gly Thr Leu Gly Asp
260 265 270
Pro Val Pro Gly Asp Leu Tyr Ile Gln Gly Ser Asn Ser Gly Asn Thr
275 280 285
Ala Thr Val Gln Ser Ser Ala Phe Phe Pro Thr Pro Ser Gly Ser Met
290 295 300
Val Thr Ser Glu Ser Gln Leu Phe Asn Lys Pro Tyr Trp Leu Gln Arg
305 310 315 320
Ala Gln Gly His Asn Asn Gly Ile Cys Trp Gly Asn Gln Leu Phe Val
325 330 335
Thr Val Val Asp Thr Thr Arg Ser Thr Asn Met Thr Leu Cys Ala Glu
340 345 350
Val Lys Lys Glu Ser Thr Tyr Lys Asn Glu Asn Phe Lys Glu Tyr Leu
355 360 365
Arg His Gly Glu Glu Phe Asp Leu Gln Phe Ile Phe Gln Leu Cys Lys
370 375 380
Ile Thr Leu Thr Ala Asp Val Met Thr Tyr Ile His Lys Met Asp Ala
385 390 395 400
Thr Ile Leu Glu Asp Trp Gln Phe Gly Leu Thr Pro Pro Pro Ser Ala
405 410 415
Ser Leu Glu Asp Thr Tyr Arg Phe Val Thr Ser Thr Ala Ile Thr Cys
420 425 430
Gln Lys Asn Thr Pro Pro Lys Gly Lys Glu Asp Pro Leu Lys Asp Tyr
435 440 445
Met Phe Trp Glu Val Asp Leu Lys Glu Lys Phe Ser Ala Asp Leu Asp
450 455 460
Gln Phe Pro Leu Gly Arg Lys Phe Leu Leu Gln Ala Gly Leu Lys Ala
465 470 475 480
Lys Pro Lys Leu Lys Arg Ala Ala Pro Thr Ser Thr Arg Thr Ser Ser
485 490 495
Ala Lys Arg Lys Lys Val Lys Lys
500
<![CDATA[<210> 124]]>
<![CDATA[<211> 1516]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5204 嵌合的HPV 52型L1蛋白的核苷酸序列]]>
<![CDATA[<400> 124]]>
atgagcgtgt ggaggcccag cgaggccacc gtgtacctgc cccccgtgcc cgtgagcaag 60
gtggtgagca ccgacgagta cgtgagcagg accagcatct actactacgc cggcagcagc 120
aggctgctga ccgtgggcca cccctacttc agcatcaaga acaccagcag cggcaacggc 180
aagaaggtgc tggtgcccaa ggtgagcggc ctgcagtaca gggtgttcag gatcaagctg 240
cccgacccca acaagttcgg cttccccgac accagcttct acaaccccga gacccagagg 300
ctggtgtggg cctgcaccgg cctggagatc ggcaggggcc agcccctggg cgtgggcatc 360
agcggccacc ccctgctgaa caagttcgac gacaccgaga ccagcaacaa gtacgccggc 420
aagcccggca tcgacaacag ggagtgcctg agcatggact acaagcagac ccagctgtgc 480
atcctgggct gcaagccccc catcggcgag cactggggca agggcacccc ctgcaacaac 540
aacagcggca accccggcga ctgccccccc ctgcagctga tcaacagcgt gatccaggac 600
ggcgacatgg tggacaccgg cttcggctgc atggacttca acaccctgca ggccagcaag 660
agcgacgtgc ccatcgacat ctgcagcagc gtgtgcaagt accccgacta cctgcagatg 720
gccagcgagc cctacggcga cagcctgttc ttcttcctga ggagggagca gatgttcgtg 780
aggcacttct tcaacagggc cggcaccctg ggcgaccccg tgcccggcga cctgtacatc 840
cagggcagca acagcggcaa caccgccacc gtgcagagca gcgccttctt ccccaccccc 900
agcggcagca tggtgaccag cgagagccag ctgttcaaca agccctactg gctgcagagg 960
gcccagggcc acaacaacgg catctgctgg ggcaaccagc tgttcgtgac cgtggtggac 1020
accaccagga gcaccaacat gaccctgtgc gccgaggtga agaaggagag cacctacaag 1080
aacgagaact tcaaggagta cctgaggcac ggcgaggagt tcgacctgca gttcatcttc 1140
cagctgtgca agatcaccct gaccgccgac gtgatgacct acatccacaa gatggacgcc 1200
accatcctgg aggactggca gttcggcctg accccccccc ccagcgccag cctggaggac 1260
acctacaggt tcgtgaccag caccgccatc acctgccaga agaacacccc ccccaagggc 1320
aaggaggacc ccctgaagga ctacatgttc tgggaggtgg acctgaagga gaagttcagc 1380
gccgacctgg accagttccc cctgggcagg aagttcctgc tgcaggccgg cctgaaagcc 1440
aagccaaaac tgaaaagggc tgccccaacc agcaccagga cctcctctgc caagaggaag 1500
aaggtgaaga agtaaa 1516
<![CDATA[<210> 125]]>
<![CDATA[<211> 1531]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5205 合成的HPV52L1基因]]>
<![CDATA[<400> 125]]>
ctgggtacca tgagcgtgtg gaggcccagc gaggccaccg tgtacctgcc ccccgtgccc 60
gtgagcaagg tggtgagcac cgacgagtac gtgagcagga ccagcatcta ctactacgcc 120
ggcagcagca ggctgctgac cgtgggccac ccctacttca gcatcaagaa caccagcagc 180
ggcaacggca agaaggtgct ggtgcccaag gtgagcggcc tgcagtacag ggtgttcagg 240
atcaagctgc ccgaccccaa caagttcggc ttccccgaca ccagcttcta caaccccgag 300
acccagaggc tggtgtgggc ctgcaccggc ctggagatcg gcaggggcca gcccctgggc 360
gtgggcatca gcggccaccc cctgctgaac aagttcgacg acaccgagac cagcaacaag 420
tacgccggca agcccggcat cgacaacagg gagtgcctga gcatggacta caagcagacc 480
cagctgtgca tcctgggctg caagcccccc atcggcgagc actggggcaa gggcaccccc 540
tgcaacaaca acagcggcaa ccccggcgac tgcccccccc tgcagctgat caacagcgtg 600
atccaggacg gcgacatggt ggacaccggc ttcggctgca tggacttcaa caccctgcag 660
gccagcaaga gcgacgtgcc catcgacatc tgcagcagcg tgtgcaagta ccccgactac 720
ctgcagatgg ccagcgagcc ctacggcgac agcctgttct tcttcctgag gagggagcag 780
atgttcgtga ggcacttctt caacagggcc ggcaccctgg gcgaccccgt gcccggcgac 840
ctgtacatcc agggcagcaa cagcggcaac accgccaccg tgcagagcag cgccttcttc 900
cccaccccca gcggcagcat ggtgaccagc gagagccagc tgttcaacaa gccctactgg 960
ctgcagaggg cccagggcca caacaacggc atctgctggg gcaaccagct gttcgtgacc 1020
gtggtggaca ccaccaggag caccaacatg accctgtgcg ccgaggtgaa gaaggagagc 1080
acctacaaga acgagaactt caaggagtac ctgaggcacg gcgaggagtt cgacctgcag 1140
ttcatcttcc agctgtgcaa gatcaccctg accgccgacg tgatgaccta catccacaag 1200
atggacgcca ccatcctgga ggactggcag ttcggcctga cccccccccc cagcgccagc 1260
ctggaggaca cctacaggtt cgtgaccagc accgccatca cctgccagaa gaacaccccc 1320
cccaagggca aggaggaccc cctgaaggac tacatgttct gggaggtgga cctgaaggag 1380
aagttcagcg ccgacctgga ccagttcccc ctgggcagga agttcctgct gcaggccggc 1440
ctgcaggcca ggcccaagct gaagaggccc gccagcagcg cccccaggac cagcaccaag 1500
aagaagaagg tgaagaggta aactcgagct c 1531
<![CDATA[<210> 126]]>
<![CDATA[<211> 1519]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<22]]>0> ]]>
<br/><![CDATA[<223> 5206 合成的HPV33L1基因]]>
<br/>
<br/><![CDATA[<400> 126]]>
<br/><![CDATA[ctgggtacca tgagtgtgtg gagaccatct gaggctacag tctacctgcc tcctgtgcct 60
gtgagcaagg tggtgagcac agatgaatat gtgagcagga ccagcatcta ctactatgct 120
ggctccagca gactgctggc tgtgggacac ccatacttca gcatcaagaa cccaaccaat 180
gccaagaaac tgctggtgcc aaaggtgtct ggactccaat acagggtgtt cagggtgaga 240
ctgcctgacc caaacaagtt tggctttcct gacacctcct tctacaaccc tgacacccag 300
agactggtgt gggcttgtgt gggattggag attggcaggg gacaaccact gggagtgggc 360
atctctggac acccactgct gaacaagttt gatgacacag agacaggcaa caaataccct 420
ggacaacctg gagcagacaa cagggagtgt ctgagtatgg actacaagca gacccaactt 480
tgtctgctgg gctgtaagcc tccaacagga gaacactggg gcaagggagt ggcttgtacc 540
aatgctgccc ctgccaatga ctgtcctcca ttggaactga taaacaccat cattgaggat 600
ggagatatgg tggacacagg ctttggctgt atggacttca agaccctcca agccaacaag 660
tctgatgtgc caattgacat ctgtggcagc acttgtaaat accctgacta cctgaaaatg 720
acctctgaac catatggaga ctccctgttc ttcttcctga ggagggaaca gatgtttgtg 780
agacacttct tcaacagggc tggcaccctg ggagaggctg tgcctgatga cctctacatc 840
aagggctctg gcaccacagc cagcatccag tcctctgcct tctttccaac accatctggc 900
agtatggtga cctctgagag ccaacttttc aacaagccat actggctcca aagggctcaa 960
ggacacaaca atggcatctg ttggggcaac caggtgtttg tgacagtggt ggacaccacc 1020
aggagcacca atatgaccct gtgtacccag gtgacctctg acagcaccta caagaatgag 1080
aacttcaagg aatacatcag gcatgtggag gaatatgacc tccaatttgt gttccaactt 1140
tgtaaggtga ccctgacagc agaggtgatg acctacatcc atgctatgaa ccctgacatc 1200
ttggaggact ggcagtttgg actgacacct cctccatctg cctccctcca agacacctac 1260
aggtttgtga ccagccaggc tatcacttgt cagaagacag tgcctccaaa ggagaaggag 1320
gacccactgg gcaaatacac cttctgggag gtggacctga aagagaagtt ctctgctgac 1380
ctggaccagt ttccactggg caggaagttc ctgctccaag caggactgaa agccaagcca 1440
aaactgaaaa gggctgcccc aaccagcacc aggacctcct ctgccaagag gaagaaggtg 1500
aagaagtaaa ctcgagctc 1519
<![CDATA[<210> 127]]>
<![CDATA[<211> 34]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5207 HPV52L1 F1]]>
<![CDATA[<400> 127]]>
cttggtacca tgagcgtgtg gaggcccagc gagg 34
<![CDATA[<210> 128]]>
<![CDATA[<211> 35]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5208 HPV52L1 R1]]>
<![CDATA[<400> 128]]>
gcttggcttt caggccggcc tgcagcagga acttc 35
<![CDATA[<210> 129]]>
<![CDATA[<211> 1453]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5209 HPV52L1擴增序列1]]>
<![CDATA[<400> 129]]>
cttggtacca tgagcgtgtg gaggcccagc gaggccaccg tgtacctgcc ccccgtgccc 60
gtgagcaagg tggtgagcac cgacgagtac gtgagcagga ccagcatcta ctactacgcc 120
ggcagcagca ggctgctgac cgtgggccac ccctacttca gcatcaagaa caccagcagc 180
ggcaacggca agaaggtgct ggtgcccaag gtgagcggcc tgcagtacag ggtgttcagg 240
atcaagctgc ccgaccccaa caagttcggc ttccccgaca ccagcttcta caaccccgag 300
acccagaggc tggtgtgggc ctgcaccggc ctggagatcg gcaggggcca gcccctgggc 360
gtgggcatca gcggccaccc cctgctgaac aagttcgacg acaccgagac cagcaacaag 420
tacgccggca agcccggcat cgacaacagg gagtgcctga gcatggacta caagcagacc 480
cagctgtgca tcctgggctg caagcccccc atcggcgagc actggggcaa gggcaccccc 540
tgcaacaaca acagcggcaa ccccggcgac tgcccccccc tgcagctgat caacagcgtg 600
atccaggacg gcgacatggt ggacaccggc ttcggctgca tggacttcaa caccctgcag 660
gccagcaaga gcgacgtgcc catcgacatc tgcagcagcg tgtgcaagta ccccgactac 720
ctgcagatgg ccagcgagcc ctacggcgac agcctgttct tcttcctgag gagggagcag 780
atgttcgtga ggcacttctt caacagggcc ggcaccctgg gcgaccccgt gcccggcgac 840
ctgtacatcc agggcagcaa cagcggcaac accgccaccg tgcagagcag cgccttcttc 900
cccaccccca gcggcagcat ggtgaccagc gagagccagc tgttcaacaa gccctactgg 960
ctgcagaggg cccagggcca caacaacggc atctgctggg gcaaccagct gttcgtgacc 1020
gtggtggaca ccaccaggag caccaacatg accctgtgcg ccgaggtgaa gaaggagagc 1080
acctacaaga acgagaactt caaggagtac ctgaggcacg gcgaggagtt cgacctgcag 1140
ttcatcttcc agctgtgcaa gatcaccctg accgccgacg tgatgaccta catccacaag 1200
atggacgcca ccatcctgga ggactggcag ttcggcctga cccccccccc cagcgccagc 1260
ctggaggaca cctacaggtt cgtgaccagc accgccatca cctgccagaa gaacaccccc 1320
cccaagggca aggaggaccc cctgaaggac tacatgttct gggaggtgga cctgaaggag 1380
aagttcagcg ccgacctgga ccagttcccc ctgggcagga agttcctgct gcaggccggc 1440
ctgaaagcca agc 1453
<![CDATA[<210> 130]]>
<![CDATA[<211> 35]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5210 HPV52L1 F2]]>
<![CDATA[<400> 130]]>
ggccggcctg aaagccaagc caaaactgaa aaggg 35
<![CDATA[<210> 131]]>
<![CDATA[<211> 36]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5211 HPV52L1 R2]]>
<![CDATA[<400> 131]]>
ctgtctagat ttacttcttc accttcttcc tcttgg 36
<![CDATA[<210> 132]]>
<![CDATA[<211> 101]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5212 HPV52L1擴增序列2]]>
<![CDATA[<400> 132]]>
ggccggcctg aaagccaagc caaaactgaa aagggctgcc ccaaccagca ccaggacctc 60
ctctgccaag aggaagaagg tgaagaagta aatctagaca g 101
<![CDATA[<210> 133]]>
<![CDATA[<211> 38]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5213 HPV 59型L1蛋白的471-508胺基酸序列]]>
<![CDATA[<400> 133]]>
Leu Gln Leu Gly Ala Arg Pro Lys Pro Thr Ile Gly Pro Arg Lys Arg
1 5 10 15
Ala Ala Pro Ala Pro Thr Ser Thr Pro Ser Pro Lys Arg Val Lys Arg
20 25 30
Arg Lys Ser Ser Arg Lys
35
<![CDATA[<210> 134]]>
<![CDATA[<211> 467]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5601 HPV 56型L1蛋白的1-467胺基酸序列]]>
<![CDATA[<400> 134]]>
Met Ala Thr Trp Arg Pro Ser Glu Asn Lys Val Tyr Leu Pro Pro Thr
1 5 10 15
Pro Val Ser Lys Val Val Ala Thr Asp Ser Tyr Val Lys Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Ala Val Gly His Pro
35 40 45
Tyr Tyr Ser Val Thr Lys Asp Asn Thr Lys Thr Asn Ile Pro Lys Val
50 55 60
Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Arg Leu Pro Asp Pro Asn
65 70 75 80
Lys Phe Gly Leu Pro Asp Thr Asn Ile Tyr Asn Pro Asp Gln Glu Arg
85 90 95
Leu Val Trp Ala Cys Val Gly Leu Glu Val Gly Arg Gly Gln Pro Leu
100 105 110
Gly Ala Gly Leu Ser Gly His Pro Leu Phe Asn Arg Leu Asp Asp Thr
115 120 125
Glu Ser Ser Asn Leu Ala Asn Asn Asn Val Ile Glu Asp Ser Arg Asp
130 135 140
Asn Ile Ser Val Asp Gly Lys Gln Thr Gln Leu Cys Ile Val Gly Cys
145 150 155 160
Thr Pro Ala Met Gly Glu His Trp Thr Lys Gly Ala Val Cys Lys Ser
165 170 175
Thr Gln Val Thr Thr Gly Asp Cys Pro Pro Leu Ala Leu Ile Asn Thr
180 185 190
Pro Ile Glu Asp Gly Asp Met Ile Asp Thr Gly Phe Gly Ala Met Asp
195 200 205
Phe Lys Val Leu Gln Glu Ser Lys Ala Glu Val Pro Leu Asp Ile Val
210 215 220
Gln Ser Thr Cys Lys Tyr Pro Asp Tyr Leu Lys Met Ser Ala Asp Ala
225 230 235 240
Tyr Gly Asp Ser Met Trp Phe Tyr Leu Arg Arg Glu Gln Leu Phe Ala
245 250 255
Arg His Tyr Phe Asn Arg Ala Gly Lys Val Gly Glu Thr Ile Pro Ala
260 265 270
Glu Leu Tyr Leu Lys Gly Ser Asn Gly Arg Glu Pro Pro Pro Ser Ser
275 280 285
Val Tyr Val Ala Thr Pro Ser Gly Ser Met Ile Thr Ser Glu Ala Gln
290 295 300
Leu Phe Asn Lys Pro Tyr Trp Leu Gln Arg Ala Gln Gly His Asn Asn
305 310 315 320
Gly Ile Cys Trp Gly Asn Gln Leu Phe Val Thr Val Val Asp Thr Thr
325 330 335
Arg Ser Thr Asn Met Thr Ile Ser Thr Ala Thr Glu Gln Leu Ser Lys
340 345 350
Tyr Asp Ala Arg Lys Ile Asn Gln Tyr Leu Arg His Val Glu Glu Tyr
355 360 365
Glu Leu Gln Phe Val Phe Gln Leu Cys Lys Ile Thr Leu Ser Ala Glu
370 375 380
Val Met Ala Tyr Leu His Asn Met Asn Ala Asn Leu Leu Glu Asp Trp
385 390 395 400
Asn Ile Gly Leu Ser Pro Pro Val Ala Thr Ser Leu Glu Asp Lys Tyr
405 410 415
Arg Tyr Val Arg Ser Thr Ala Ile Thr Cys Gln Arg Glu Gln Pro Pro
420 425 430
Thr Glu Lys Gln Asp Pro Leu Ala Lys Tyr Lys Phe Trp Asp Val Asn
435 440 445
Leu Gln Asp Ser Phe Ser Thr Asp Leu Asp Gln Phe Pro Leu Gly Arg
450 455 460
Lys Phe Leu
465
<![CDATA[<210> 135]]>
<![CDATA[<211> 31]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5602 HPV 33型L1蛋白的469-499胺基酸序列]]>
<![CDATA[<400> 135]]>
Leu Gln Ala Gly Leu Lys Ala Lys Pro Lys Leu Lys Arg Ala Ala Pro
1 5 10 15
Thr Ser Thr Arg Thr Ser Ser Ala Lys Arg Lys Lys Val Lys Lys
20 25 30
<![CDATA[<210> 136]]>
<![CDATA[<211> 498]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5603 嵌合的HPV 56型L1蛋白的胺基酸序列]]>
<![CDATA[<400> 136]]>
Met Ala Thr Trp Arg Pro Ser Glu Asn Lys Val Tyr Leu Pro Pro Thr
1 5 10 15
Pro Val Ser Lys Val Val Ala Thr Asp Ser Tyr Val Lys Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Ala Val Gly His Pro
35 40 45
Tyr Tyr Ser Val Thr Lys Asp Asn Thr Lys Thr Asn Ile Pro Lys Val
50 55 60
Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Arg Leu Pro Asp Pro Asn
65 70 75 80
Lys Phe Gly Leu Pro Asp Thr Asn Ile Tyr Asn Pro Asp Gln Glu Arg
85 90 95
Leu Val Trp Ala Cys Val Gly Leu Glu Val Gly Arg Gly Gln Pro Leu
100 105 110
Gly Ala Gly Leu Ser Gly His Pro Leu Phe Asn Arg Leu Asp Asp Thr
115 120 125
Glu Ser Ser Asn Leu Ala Asn Asn Asn Val Ile Glu Asp Ser Arg Asp
130 135 140
Asn Ile Ser Val Asp Gly Lys Gln Thr Gln Leu Cys Ile Val Gly Cys
145 150 155 160
Thr Pro Ala Met Gly Glu His Trp Thr Lys Gly Ala Val Cys Lys Ser
165 170 175
Thr Gln Val Thr Thr Gly Asp Cys Pro Pro Leu Ala Leu Ile Asn Thr
180 185 190
Pro Ile Glu Asp Gly Asp Met Ile Asp Thr Gly Phe Gly Ala Met Asp
195 200 205
Phe Lys Val Leu Gln Glu Ser Lys Ala Glu Val Pro Leu Asp Ile Val
210 215 220
Gln Ser Thr Cys Lys Tyr Pro Asp Tyr Leu Lys Met Ser Ala Asp Ala
225 230 235 240
Tyr Gly Asp Ser Met Trp Phe Tyr Leu Arg Arg Glu Gln Leu Phe Ala
245 250 255
Arg His Tyr Phe Asn Arg Ala Gly Lys Val Gly Glu Thr Ile Pro Ala
260 265 270
Glu Leu Tyr Leu Lys Gly Ser Asn Gly Arg Glu Pro Pro Pro Ser Ser
275 280 285
Val Tyr Val Ala Thr Pro Ser Gly Ser Met Ile Thr Ser Glu Ala Gln
290 295 300
Leu Phe Asn Lys Pro Tyr Trp Leu Gln Arg Ala Gln Gly His Asn Asn
305 310 315 320
Gly Ile Cys Trp Gly Asn Gln Leu Phe Val Thr Val Val Asp Thr Thr
325 330 335
Arg Ser Thr Asn Met Thr Ile Ser Thr Ala Thr Glu Gln Leu Ser Lys
340 345 350
Tyr Asp Ala Arg Lys Ile Asn Gln Tyr Leu Arg His Val Glu Glu Tyr
355 360 365
Glu Leu Gln Phe Val Phe Gln Leu Cys Lys Ile Thr Leu Ser Ala Glu
370 375 380
Val Met Ala Tyr Leu His Asn Met Asn Ala Asn Leu Leu Glu Asp Trp
385 390 395 400
Asn Ile Gly Leu Ser Pro Pro Val Ala Thr Ser Leu Glu Asp Lys Tyr
405 410 415
Arg Tyr Val Arg Ser Thr Ala Ile Thr Cys Gln Arg Glu Gln Pro Pro
420 425 430
Thr Glu Lys Gln Asp Pro Leu Ala Lys Tyr Lys Phe Trp Asp Val Asn
435 440 445
Leu Gln Asp Ser Phe Ser Thr Asp Leu Asp Gln Phe Pro Leu Gly Arg
450 455 460
Lys Phe Leu Leu Gln Ala Gly Leu Lys Ala Lys Pro Lys Leu Lys Arg
465 470 475 480
Ala Ala Pro Thr Ser Thr Arg Thr Ser Ser Ala Lys Arg Lys Lys Val
485 490 495
Lys Lys
<![CDATA[<210> 137]]>
<![CDATA[<211> 1498]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5604 嵌合的HPV 56型L1蛋白的核苷酸序列]]>
<![CDATA[<400> 137]]>
atggctacct ggagaccatc tgagaacaag gtctacctgc ctccaacacc tgtgagcaag 60
gtggtggcta cagactccta tgtgaagagg accagcatct tctaccatgc tggctccagc 120
agactgctgg ctgtgggaca cccatactac tctgtgacca aggacaacac caagaccaac 180
atcccaaagg tgtctgccta ccaatacagg gtgttcaggg tgagactgcc tgacccaaac 240
aagtttggac tgcctgacac caacatctac aaccctgacc aggagagact ggtgtgggct 300
tgtgtgggat tggaggtggg caggggacaa ccactgggag caggactgtc tggacaccca 360
ctgttcaaca gactggatga cacagagtcc agcaacctgg ctaacaacaa tgtgattgag 420
gacagcaggg acaacatctc tgtggatggc aagcagaccc aactttgtat tgtgggctgt 480
actcctgcta tgggagaaca ctggaccaag ggagcagtgt gtaagagcac ccaggtgacc 540
acaggagact gtcctccact ggctctgata aacacaccaa ttgaggatgg agatatgatt 600
gacacaggct ttggagctat ggacttcaag gtgctccaag agagcaaggc tgaggtgcca 660
ctggacattg tccagagcac ttgtaaatac cctgactacc tgaaaatgag tgctgatgcc 720
tatggagaca gtatgtggtt ctacctgagg agggaacaac tttttgccag acactacttc 780
aacagggctg gcaaggtggg agagaccatc cctgctgaac tctacctgaa aggcagcaat 840
ggcagggaac ctcctccatc ctctgtctat gtggctacac catctggcag tatgattacc 900
tctgaggctc aacttttcaa caagccatac tggctccaaa gggctcaagg acacaacaat 960
ggcatctgtt ggggcaacca actttttgtg acagtggtgg acaccaccag gagcaccaat 1020
atgaccatca gcacagccac agaacaactt agcaaatatg atgccaggaa gataaaccaa 1080
tacctgaggc atgtggagga atatgaactc caatttgtgt tccaactttg taagattacc 1140
ctgtctgctg aggtgatggc ttacctgcac aatatgaatg ccaacctgtt ggaggactgg 1200
aacattggac tgagccctcc tgtggctacc tccttggagg acaaatacag atatgtgagg 1260
agcacagcca tcacttgtca gagggaacaa cctccaacag agaagcagga cccactggct 1320
aaatacaagt tctgggatgt gaacctccaa gactccttca gcacagacct ggaccagttt 1380
ccactgggca ggaagttcct gctccaagca ggactgaaag ccaagccaaa actgaaaagg 1440
gctgccccaa ccagcaccag gacctcctct gccaagagga agaaggtgaa gaagtaaa 1498
<![CDATA[<210> 138]]>
<![CDATA[<211> 1519]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5605 合成的HPV56L1基因]]>
<![CDATA[<400> 138]]>
ctgggtacca tggctacctg gagaccatct gagaacaagg tctacctgcc tccaacacct 60
gtgagcaagg tggtggctac agactcctat gtgaagagga ccagcatctt ctaccatgct 120
ggctccagca gactgctggc tgtgggacac ccatactact ctgtgaccaa ggacaacacc 180
aagaccaaca tcccaaaggt gtctgcctac caatacaggg tgttcagggt gagactgcct 240
gacccaaaca agtttggact gcctgacacc aacatctaca accctgacca ggagagactg 300
gtgtgggctt gtgtgggatt ggaggtgggc aggggacaac cactgggagc aggactgtct 360
ggacacccac tgttcaacag actggatgac acagagtcca gcaacctggc taacaacaat 420
gtgattgagg acagcaggga caacatctct gtggatggca agcagaccca actttgtatt 480
gtgggctgta ctcctgctat gggagaacac tggaccaagg gagcagtgtg taagagcacc 540
caggtgacca caggagactg tcctccactg gctctgataa acacaccaat tgaggatgga 600
gatatgattg acacaggctt tggagctatg gacttcaagg tgctccaaga gagcaaggct 660
gaggtgccac tggacattgt ccagagcact tgtaaatacc ctgactacct gaaaatgagt 720
gctgatgcct atggagacag tatgtggttc tacctgagga gggaacaact ttttgccaga 780
cactacttca acagggctgg caaggtggga gagaccatcc ctgctgaact ctacctgaaa 840
ggcagcaatg gcagggaacc tcctccatcc tctgtctatg tggctacacc atctggcagt 900
atgattacct ctgaggctca acttttcaac aagccatact ggctccaaag ggctcaagga 960
cacaacaatg gcatctgttg gggcaaccaa ctttttgtga cagtggtgga caccaccagg 1020
agcaccaata tgaccatcag cacagccaca gaacaactta gcaaatatga tgccaggaag 1080
ataaaccaat acctgaggca tgtggaggaa tatgaactcc aatttgtgtt ccaactttgt 1140
aagattaccc tgtctgctga ggtgatggct tacctgcaca atatgaatgc caacctgttg 1200
gaggactgga acattggact gagccctcct gtggctacct ccttggagga caaatacaga 1260
tatgtgagga gcacagccat cacttgtcag agggaacaac ctccaacaga gaagcaggac 1320
ccactggcta aatacaagtt ctgggatgtg aacctccaag actccttcag cacagacctg 1380
gaccagtttc cactgggcag gaagttcctg atgcaacttg gcaccaggag caagcctgct 1440
gtggctacca gcaagaagag gtctgcccca accagcacca gcacacctgc caagaggaag 1500
aggaggtaaa ctcgagctc 1519
<![CDATA[<210> 139]]>
<![CDATA[<211> 1519]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5606 合成的HPV33L1基因]]>
<![CDATA[<400> 139]]>
ctgggtacca tgagtgtgtg gagaccatct gaggctacag tctacctgcc tcctgtgcct 60
gtgagcaagg tggtgagcac agatgaatat gtgagcagga ccagcatcta ctactatgct 120
ggctccagca gactgctggc tgtgggacac ccatacttca gcatcaagaa cccaaccaat 180
gccaagaaac tgctggtgcc aaaggtgtct ggactccaat acagggtgtt cagggtgaga 240
ctgcctgacc caaacaagtt tggctttcct gacacctcct tctacaaccc tgacacccag 300
agactggtgt gggcttgtgt gggattggag attggcaggg gacaaccact gggagtgggc 360
atctctggac acccactgct gaacaagttt gatgacacag agacaggcaa caaataccct 420
ggacaacctg gagcagacaa cagggagtgt ctgagtatgg actacaagca gacccaactt 480
tgtctgctgg gctgtaagcc tccaacagga gaacactggg gcaagggagt ggcttgtacc 540
aatgctgccc ctgccaatga ctgtcctcca ttggaactga taaacaccat cattgaggat 600
ggagatatgg tggacacagg ctttggctgt atggacttca agaccctcca agccaacaag 660
tctgatgtgc caattgacat ctgtggcagc acttgtaaat accctgacta cctgaaaatg 720
acctctgaac catatggaga ctccctgttc ttcttcctga ggagggaaca gatgtttgtg 780
agacacttct tcaacagggc tggcaccctg ggagaggctg tgcctgatga cctctacatc 840
aagggctctg gcaccacagc cagcatccag tcctctgcct tctttccaac accatctggc 900
agtatggtga cctctgagag ccaacttttc aacaagccat actggctcca aagggctcaa 960
ggacacaaca atggcatctg ttggggcaac caggtgtttg tgacagtggt ggacaccacc 1020
aggagcacca atatgaccct gtgtacccag gtgacctctg acagcaccta caagaatgag 1080
aacttcaagg aatacatcag gcatgtggag gaatatgacc tccaatttgt gttccaactt 1140
tgtaaggtga ccctgacagc agaggtgatg acctacatcc atgctatgaa ccctgacatc 1200
ttggaggact ggcagtttgg actgacacct cctccatctg cctccctcca agacacctac 1260
aggtttgtga ccagccaggc tatcacttgt cagaagacag tgcctccaaa ggagaaggag 1320
gacccactgg gcaaatacac cttctgggag gtggacctga aagagaagtt ctctgctgac 1380
ctggaccagt ttccactggg caggaagttc ctgctccaag caggactgaa agccaagcca 1440
aaactgaaaa gggctgcccc aaccagcacc aggacctcct ctgccaagag gaagaaggtg 1500
aagaagtaaa ctcgagctc 1519
<![CDATA[<210> 140]]>
<![CDATA[<211> 33]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5607 HPV56L1 F1]]>
<![CDATA[<400> 140]]>
cttggtacca tggctacctg gagaccatct gag 33
<![CDATA[<210> 141]]>
<![CDATA[<211> 31]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5608 HPV56L1 R1]]>
<![CDATA[<400> 141]]>
ctgcttggag caggaacttc ctgcccagtg g 31
<![CDATA[<210> 142]]>
<![CDATA[<211> 1420]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5609 HPV56L1擴增序列1]]>
<![CDATA[<400> 142]]>
cttggtacca tggctacctg gagaccatct gagaacaagg tctacctgcc tccaacacct 60
gtgagcaagg tggtggctac agactcctat gtgaagagga ccagcatctt ctaccatgct 120
ggctccagca gactgctggc tgtgggacac ccatactact ctgtgaccaa ggacaacacc 180
aagaccaaca tcccaaaggt gtctgcctac caatacaggg tgttcagggt gagactgcct 240
gacccaaaca agtttggact gcctgacacc aacatctaca accctgacca ggagagactg 300
gtgtgggctt gtgtgggatt ggaggtgggc aggggacaac cactgggagc aggactgtct 360
ggacacccac tgttcaacag actggatgac acagagtcca gcaacctggc taacaacaat 420
gtgattgagg acagcaggga caacatctct gtggatggca agcagaccca actttgtatt 480
gtgggctgta ctcctgctat gggagaacac tggaccaagg gagcagtgtg taagagcacc 540
caggtgacca caggagactg tcctccactg gctctgataa acacaccaat tgaggatgga 600
gatatgattg acacaggctt tggagctatg gacttcaagg tgctccaaga gagcaaggct 660
gaggtgccac tggacattgt ccagagcact tgtaaatacc ctgactacct gaaaatgagt 720
gctgatgcct atggagacag tatgtggttc tacctgagga gggaacaact ttttgccaga 780
cactacttca acagggctgg caaggtggga gagaccatcc ctgctgaact ctacctgaaa 840
ggcagcaatg gcagggaacc tcctccatcc tctgtctatg tggctacacc atctggcagt 900
atgattacct ctgaggctca acttttcaac aagccatact ggctccaaag ggctcaagga 960
cacaacaatg gcatctgttg gggcaaccaa ctttttgtga cagtggtgga caccaccagg 1020
agcaccaata tgaccatcag cacagccaca gaacaactta gcaaatatga tgccaggaag 1080
ataaaccaat acctgaggca tgtggaggaa tatgaactcc aatttgtgtt ccaactttgt 1140
aagattaccc tgtctgctga ggtgatggct tacctgcaca atatgaatgc caacctgttg 1200
gaggactgga acattggact gagccctcct gtggctacct ccttggagga caaatacaga 1260
tatgtgagga gcacagccat cacttgtcag agggaacaac ctccaacaga gaagcaggac 1320
ccactggcta aatacaagtt ctgggatgtg aacctccaag actccttcag cacagacctg 1380
gaccagtttc cactgggcag gaagttcctg ctccaagcag 1420
<![CDATA[<210> 143]]>
<![CDATA[<211> 36]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5610 HPV56L1 F2]]>
<![CDATA[<400> 143]]>
gaagttcctg ctccaagcag gactgaaagc caagcc 36
<![CDATA[<210> 144]]>
<![CDATA[<211> 36]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5611 HPV56L1 R2]]>
<![CDATA[<400> 144]]>
ctgtctagat ttacttcttc accttcttcc tcttgg 36
<![CDATA[<210> 145]]>
<![CDATA[<211> 116]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5612 HPV56L1擴增序列2]]>
<![CDATA[<400> 145]]>
gaagttcctg ctccaagcag gactgaaagc caagccaaaa ctgaaaaggg ctgccccaac 60
cagcaccagg acctcctctg ccaagaggaa gaaggtgaag aagtaaatct agacag 116
<![CDATA[<210> 146]]>
<![CDATA[<211> 38]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5613 HPV 59型L1蛋白的471-508胺基酸序列]]>
<![CDATA[<400> 146]]>
Leu Gln Leu Gly Ala Arg Pro Lys Pro Thr Ile Gly Pro Arg Lys Arg
1 5 10 15
Ala Ala Pro Ala Pro Thr Ser Thr Pro Ser Pro Lys Arg Val Lys Arg
20 25 30
Arg Lys Ser Ser Arg Lys
35
<![CDATA[<210> 147]]>
<![CDATA[<211> 473]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5801 HPV 58型L1蛋白的1-473胺基酸序列]]>
<![CDATA[<400> 147]]>
Met Ser Val Trp Arg Pro Ser Glu Ala Thr Val Tyr Leu Pro Pro Val
1 5 10 15
Pro Val Ser Lys Val Val Ser Thr Asp Glu Tyr Val Ser Arg Thr Ser
20 25 30
Ile Tyr Tyr Tyr Ala Gly Ser Ser Arg Leu Leu Ala Val Gly Asn Pro
35 40 45
Tyr Phe Ser Ile Lys Ser Pro Asn Asn Asn Lys Lys Val Leu Val Pro
50 55 60
Lys Val Ser Gly Leu Gln Tyr Arg Val Phe Arg Val Arg Leu Pro Asp
65 70 75 80
Pro Asn Lys Phe Gly Phe Pro Asp Thr Ser Phe Tyr Asn Pro Asp Thr
85 90 95
Gln Arg Leu Val Trp Ala Cys Val Gly Leu Glu Ile Gly Arg Gly Gln
100 105 110
Pro Leu Gly Val Gly Val Ser Gly His Pro Tyr Leu Asn Lys Phe Asp
115 120 125
Asp Thr Glu Thr Ser Asn Arg Tyr Pro Ala Gln Pro Gly Ser Asp Asn
130 135 140
Arg Glu Cys Leu Ser Met Asp Tyr Lys Gln Thr Gln Leu Cys Leu Ile
145 150 155 160
Gly Cys Lys Pro Pro Thr Gly Glu His Trp Gly Lys Gly Val Ala Cys
165 170 175
Asn Asn Asn Ala Ala Ala Thr Asp Cys Pro Pro Leu Glu Leu Phe Asn
180 185 190
Ser Ile Ile Glu Asp Gly Asp Met Val Asp Thr Gly Phe Gly Cys Met
195 200 205
Asp Phe Gly Thr Leu Gln Ala Asn Lys Ser Asp Val Pro Ile Asp Ile
210 215 220
Cys Asn Ser Thr Cys Lys Tyr Pro Asp Tyr Leu Lys Met Ala Ser Glu
225 230 235 240
Pro Tyr Gly Asp Ser Leu Phe Phe Phe Leu Arg Arg Glu Gln Met Phe
245 250 255
Val Arg His Phe Phe Asn Arg Ala Gly Lys Leu Gly Glu Ala Val Pro
260 265 270
Asp Asp Leu Tyr Ile Lys Gly Ser Gly Asn Thr Ala Val Ile Gln Ser
275 280 285
Ser Ala Phe Phe Pro Thr Pro Ser Gly Ser Ile Val Thr Ser Glu Ser
290 295 300
Gln Leu Phe Asn Lys Pro Tyr Trp Leu Gln Arg Ala Gln Gly His Asn
305 310 315 320
Asn Gly Ile Cys Trp Gly Asn Gln Leu Phe Val Thr Val Val Asp Thr
325 330 335
Thr Arg Ser Thr Asn Met Thr Leu Cys Thr Glu Val Thr Lys Glu Gly
340 345 350
Thr Tyr Lys Asn Asp Asn Phe Lys Glu Tyr Val Arg His Val Glu Glu
355 360 365
Tyr Asp Leu Gln Phe Val Phe Gln Leu Cys Lys Ile Thr Leu Thr Ala
370 375 380
Glu Ile Met Thr Tyr Ile His Thr Met Asp Ser Asn Ile Leu Glu Asp
385 390 395 400
Trp Gln Phe Gly Leu Thr Pro Pro Pro Ser Ala Ser Leu Gln Asp Thr
405 410 415
Tyr Arg Phe Val Thr Ser Gln Ala Ile Thr Cys Gln Lys Thr Ala Pro
420 425 430
Pro Lys Glu Lys Glu Asp Pro Leu Asn Lys Tyr Thr Phe Trp Glu Val
435 440 445
Asn Leu Lys Glu Lys Phe Ser Ala Asp Leu Asp Gln Phe Pro Leu Gly
450 455 460
Arg Lys Phe Leu Leu Gln Ser Gly Leu
465 470
<![CDATA[<210> 148]]>
<![CDATA[<211> 26]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5802 HPV 33型L1蛋白的474-499胺基酸序列]]>
<![CDATA[<400> 148]]>
Lys Ala Lys Pro Lys Leu Lys Arg Ala Ala Pro Thr Ser Thr Arg Thr
1 5 10 15
Ser Ser Ala Lys Arg Lys Lys Val Lys Lys
20 25
<![CDATA[<210> 149]]>
<![CDATA[<211> 499]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5803 嵌合的HPV 58型L1蛋白的胺基酸序列]]>
<![CDATA[<400> 149]]>
Met Ser Val Trp Arg Pro Ser Glu Ala Thr Val Tyr Leu Pro Pro Val
1 5 10 15
Pro Val Ser Lys Val Val Ser Thr Asp Glu Tyr Val Ser Arg Thr Ser
20 25 30
Ile Tyr Tyr Tyr Ala Gly Ser Ser Arg Leu Leu Ala Val Gly Asn Pro
35 40 45
Tyr Phe Ser Ile Lys Ser Pro Asn Asn Asn Lys Lys Val Leu Val Pro
50 55 60
Lys Val Ser Gly Leu Gln Tyr Arg Val Phe Arg Val Arg Leu Pro Asp
65 70 75 80
Pro Asn Lys Phe Gly Phe Pro Asp Thr Ser Phe Tyr Asn Pro Asp Thr
85 90 95
Gln Arg Leu Val Trp Ala Cys Val Gly Leu Glu Ile Gly Arg Gly Gln
100 105 110
Pro Leu Gly Val Gly Val Ser Gly His Pro Tyr Leu Asn Lys Phe Asp
115 120 125
Asp Thr Glu Thr Ser Asn Arg Tyr Pro Ala Gln Pro Gly Ser Asp Asn
130 135 140
Arg Glu Cys Leu Ser Met Asp Tyr Lys Gln Thr Gln Leu Cys Leu Ile
145 150 155 160
Gly Cys Lys Pro Pro Thr Gly Glu His Trp Gly Lys Gly Val Ala Cys
165 170 175
Asn Asn Asn Ala Ala Ala Thr Asp Cys Pro Pro Leu Glu Leu Phe Asn
180 185 190
Ser Ile Ile Glu Asp Gly Asp Met Val Asp Thr Gly Phe Gly Cys Met
195 200 205
Asp Phe Gly Thr Leu Gln Ala Asn Lys Ser Asp Val Pro Ile Asp Ile
210 215 220
Cys Asn Ser Thr Cys Lys Tyr Pro Asp Tyr Leu Lys Met Ala Ser Glu
225 230 235 240
Pro Tyr Gly Asp Ser Leu Phe Phe Phe Leu Arg Arg Glu Gln Met Phe
245 250 255
Val Arg His Phe Phe Asn Arg Ala Gly Lys Leu Gly Glu Ala Val Pro
260 265 270
Asp Asp Leu Tyr Ile Lys Gly Ser Gly Asn Thr Ala Val Ile Gln Ser
275 280 285
Ser Ala Phe Phe Pro Thr Pro Ser Gly Ser Ile Val Thr Ser Glu Ser
290 295 300
Gln Leu Phe Asn Lys Pro Tyr Trp Leu Gln Arg Ala Gln Gly His Asn
305 310 315 320
Asn Gly Ile Cys Trp Gly Asn Gln Leu Phe Val Thr Val Val Asp Thr
325 330 335
Thr Arg Ser Thr Asn Met Thr Leu Cys Thr Glu Val Thr Lys Glu Gly
340 345 350
Thr Tyr Lys Asn Asp Asn Phe Lys Glu Tyr Val Arg His Val Glu Glu
355 360 365
Tyr Asp Leu Gln Phe Val Phe Gln Leu Cys Lys Ile Thr Leu Thr Ala
370 375 380
Glu Ile Met Thr Tyr Ile His Thr Met Asp Ser Asn Ile Leu Glu Asp
385 390 395 400
Trp Gln Phe Gly Leu Thr Pro Pro Pro Ser Ala Ser Leu Gln Asp Thr
405 410 415
Tyr Arg Phe Val Thr Ser Gln Ala Ile Thr Cys Gln Lys Thr Ala Pro
420 425 430
Pro Lys Glu Lys Glu Asp Pro Leu Asn Lys Tyr Thr Phe Trp Glu Val
435 440 445
Asn Leu Lys Glu Lys Phe Ser Ala Asp Leu Asp Gln Phe Pro Leu Gly
450 455 460
Arg Lys Phe Leu Leu Gln Ser Gly Leu Lys Ala Lys Pro Lys Leu Lys
465 470 475 480
Arg Ala Ala Pro Thr Ser Thr Arg Thr Ser Ser Ala Lys Arg Lys Lys
485 490 495
Val Lys Lys
<![CDATA[<210> 150]]>
<![CDATA[<211> 1501]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5804 嵌合的HPV 58型L1蛋白的核苷酸序列]]>
<![CDATA[<400> 150]]>
atgagcgtgt ggaggcccag cgaggccacc gtgtacctgc cccccgtgcc cgtgagcaag 60
gtggtgagca ccgacgagta cgtgagcagg accagcatct actactacgc cggcagcagc 120
aggctgctgg ccgtgggcaa cccctacttc agcatcaaga gccccaacaa caacaagaag 180
gtgctggtgc ccaaggtgag cggcctgcag tacagggtgt tcagggtgag gctgcccgac 240
cccaacaagt tcggcttccc cgacaccagc ttctacaacc ccgacaccca gaggctggtg 300
tgggcctgcg tgggcctgga gatcggcagg ggccagcccc tgggcgtggg cgtgagcggc 360
cacccctacc tgaacaagtt cgacgacacc gagaccagca acaggtaccc cgcccagccc 420
ggcagcgaca acagggagtg cctgagcatg gactacaagc agacccagct gtgcctgatc 480
ggctgcaagc cccccaccgg cgagcactgg ggcaagggcg tggcctgcaa caacaacgcc 540
gccgccaccg actgcccccc cctggagctg ttcaacagca tcatcgagga cggcgacatg 600
gtggacaccg gcttcggctg catggacttc ggcaccctgc aggccaacaa gagcgacgtg 660
cccatcgaca tctgcaacag cacctgcaag taccccgact acctgaagat ggccagcgag 720
ccctacggcg acagcctgtt cttcttcctg aggagggagc agatgttcgt gaggcacttc 780
ttcaacaggg ccggcaagct gggcgaggcc gtgcccgacg acctgtacat caagggcagc 840
ggcaacaccg ccgtgatcca gagcagcgcc ttcttcccca cccccagcgg cagcatcgtg 900
accagcgaga gccagctgtt caacaagccc tactggctgc agagggccca gggccacaac 960
aacggcatct gctggggcaa ccagctgttc gtgaccgtgg tggacaccac caggagcacc 1020
aacatgaccc tgtgcaccga ggtgaccaag gagggcacct acaagaacga caacttcaag 1080
gagtacgtga ggcacgtgga ggagtacgac ctgcagttcg tgttccagct gtgcaagatc 1140
accctgaccg ccgagatcat gacctacatc cacaccatgg acagcaacat cctggaggac 1200
tggcagttcg gcctgacccc cccccccagc gccagcctgc aggacaccta caggttcgtg 1260
accagccagg ccatcacctg ccagaagacc gcccccccca aggagaagga ggaccccctg 1320
aacaagtaca ccttctggga ggtgaacctg aaggagaagt tcagcgccga cctggaccag 1380
ttccccctgg gcaggaagtt cctgctgcag agcggcctga aagccaagcc aaaactgaaa 1440
agggctgccc caaccagcac caggacctcc tctgccaaga ggaagaaggt gaagaagtaa 1500
a 1501
<![CDATA[<210> 151]]>
<![CDATA[<211> 1516]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5805 合成的HPV58L1基因]]>
<![CDATA[<400> 151]]>
ctgggtacca tgagcgtgtg gaggcccagc gaggccaccg tgtacctgcc ccccgtgccc 60
gtgagcaagg tggtgagcac cgacgagtac gtgagcagga ccagcatcta ctactacgcc 120
ggcagcagca ggctgctggc cgtgggcaac ccctacttca gcatcaagag ccccaacaac 180
aacaagaagg tgctggtgcc caaggtgagc ggcctgcagt acagggtgtt cagggtgagg 240
ctgcccgacc ccaacaagtt cggcttcccc gacaccagct tctacaaccc cgacacccag 300
aggctggtgt gggcctgcgt gggcctggag atcggcaggg gccagcccct gggcgtgggc 360
gtgagcggcc acccctacct gaacaagttc gacgacaccg agaccagcaa caggtacccc 420
gcccagcccg gcagcgacaa cagggagtgc ctgagcatgg actacaagca gacccagctg 480
tgcctgatcg gctgcaagcc ccccaccggc gagcactggg gcaagggcgt ggcctgcaac 540
aacaacgccg ccgccaccga ctgccccccc ctggagctgt tcaacagcat catcgaggac 600
ggcgacatgg tggacaccgg cttcggctgc atggacttcg gcaccctgca ggccaacaag 660
agcgacgtgc ccatcgacat ctgcaacagc acctgcaagt accccgacta cctgaagatg 720
gccagcgagc cctacggcga cagcctgttc ttcttcctga ggagggagca gatgttcgtg 780
aggcacttct tcaacagggc cggcaagctg ggcgaggccg tgcccgacga cctgtacatc 840
aagggcagcg gcaacaccgc cgtgatccag agcagcgcct tcttccccac ccccagcggc 900
agcatcgtga ccagcgagag ccagctgttc aacaagccct actggctgca gagggcccag 960
ggccacaaca acggcatctg ctggggcaac cagctgttcg tgaccgtggt ggacaccacc 1020
aggagcacca acatgaccct gtgcaccgag gtgaccaagg agggcaccta caagaacgac 1080
aacttcaagg agtacgtgag gcacgtggag gagtacgacc tgcagttcgt gttccagctg 1140
tgcaagatca ccctgaccgc cgagatcatg acctacatcc acaccatgga cagcaacatc 1200
ctggaggact ggcagttcgg cctgaccccc ccccccagcg ccagcctgca ggacacctac 1260
aggttcgtga ccagccaggc catcacctgc cagaagaccg ccccccccaa ggagaaggag 1320
gaccccctga acaagtacac cttctgggag gtgaacctga aggagaagtt cagcgccgac 1380
ctggaccagt tccccctggg caggaagttc ctgctgcaga gcggcctgaa ggccaagccc 1440
aggctgaaga ggagcgcccc caccaccagg gcccccagca ccaagaggaa gaaggtgaag 1500
aagtaaactc gagctc 1516
<![CDATA[<210> 152]]>
<![CDATA[<211> 1519]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5806 合成的HPV33L1基因]]>
<![CDATA[<400> 152]]>
ctgggtacca tgagtgtgtg gagaccatct gaggctacag tctacctgcc tcctgtgcct 60
gtgagcaagg tggtgagcac agatgaatat gtgagcagga ccagcatcta ctactatgct 120
ggctccagca gactgctggc tgtgggacac ccatacttca gcatcaagaa cccaaccaat 180
gccaagaaac tgctggtgcc aaaggtgtct ggactccaat acagggtgtt cagggtgaga 240
ctgcctgacc caaacaagtt tggctttcct gacacctcct tctacaaccc tgacacccag 300
agactggtgt gggcttgtgt gggattggag attggcaggg gacaaccact gggagtgggc 360
atctctggac acccactgct gaacaagttt gatgacacag agacaggcaa caaataccct 420
ggacaacctg gagcagacaa cagggagtgt ctgagtatgg actacaagca gacccaactt 480
tgtctgctgg gctgtaagcc tccaacagga gaacactggg gcaagggagt ggcttgtacc 540
aatgctgccc ctgccaatga ctgtcctcca ttggaactga taaacaccat cattgaggat 600
ggagatatgg tggacacagg ctttggctgt atggacttca agaccctcca agccaacaag 660
tctgatgtgc caattgacat ctgtggcagc acttgtaaat accctgacta cctgaaaatg 720
acctctgaac catatggaga ctccctgttc ttcttcctga ggagggaaca gatgtttgtg 780
agacacttct tcaacagggc tggcaccctg ggagaggctg tgcctgatga cctctacatc 840
aagggctctg gcaccacagc cagcatccag tcctctgcct tctttccaac accatctggc 900
agtatggtga cctctgagag ccaacttttc aacaagccat actggctcca aagggctcaa 960
ggacacaaca atggcatctg ttggggcaac caggtgtttg tgacagtggt ggacaccacc 1020
aggagcacca atatgaccct gtgtacccag gtgacctctg acagcaccta caagaatgag 1080
aacttcaagg aatacatcag gcatgtggag gaatatgacc tccaatttgt gttccaactt 1140
tgtaaggtga ccctgacagc agaggtgatg acctacatcc atgctatgaa ccctgacatc 1200
ttggaggact ggcagtttgg actgacacct cctccatctg cctccctcca agacacctac 1260
aggtttgtga ccagccaggc tatcacttgt cagaagacag tgcctccaaa ggagaaggag 1320
gacccactgg gcaaatacac cttctgggag gtggacctga aagagaagtt ctctgctgac 1380
ctggaccagt ttccactggg caggaagttc ctgctccaag caggactgaa agccaagcca 1440
aaactgaaaa gggctgcccc aaccagcacc aggacctcct ctgccaagag gaagaaggtg 1500
aagaagtaaa ctcgagctc 1519
<![CDATA[<210> 153]]>
<![CDATA[<211> 34]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5807 HPV58L1 F1]]>
<![CDATA[<400> 153]]>
cttggtacca tgagcgtgtg gaggcccagc gagg 34
<![CDATA[<210> 154]]>
<![CDATA[<211> 36]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5808 HPV58L1 R1]]>
<![CDATA[<400> 154]]>
gcttggcttt caggccgctc tgcagcagga acttcc 36
<![CDATA[<210> 155]]>
<![CDATA[<211> 1438]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5809 HPV58L1擴增序列1]]>
<![CDATA[<400> 155]]>
cttggtacca tgagcgtgtg gaggcccagc gaggccaccg tgtacctgcc ccccgtgccc 60
gtgagcaagg tggtgagcac cgacgagtac gtgagcagga ccagcatcta ctactacgcc 120
ggcagcagca ggctgctggc cgtgggcaac ccctacttca gcatcaagag ccccaacaac 180
aacaagaagg tgctggtgcc caaggtgagc ggcctgcagt acagggtgtt cagggtgagg 240
ctgcccgacc ccaacaagtt cggcttcccc gacaccagct tctacaaccc cgacacccag 300
aggctggtgt gggcctgcgt gggcctggag atcggcaggg gccagcccct gggcgtgggc 360
gtgagcggcc acccctacct gaacaagttc gacgacaccg agaccagcaa caggtacccc 420
gcccagcccg gcagcgacaa cagggagtgc ctgagcatgg actacaagca gacccagctg 480
tgcctgatcg gctgcaagcc ccccaccggc gagcactggg gcaagggcgt ggcctgcaac 540
aacaacgccg ccgccaccga ctgccccccc ctggagctgt tcaacagcat catcgaggac 600
ggcgacatgg tggacaccgg cttcggctgc atggacttcg gcaccctgca ggccaacaag 660
agcgacgtgc ccatcgacat ctgcaacagc acctgcaagt accccgacta cctgaagatg 720
gccagcgagc cctacggcga cagcctgttc ttcttcctga ggagggagca gatgttcgtg 780
aggcacttct tcaacagggc cggcaagctg ggcgaggccg tgcccgacga cctgtacatc 840
aagggcagcg gcaacaccgc cgtgatccag agcagcgcct tcttccccac ccccagcggc 900
agcatcgtga ccagcgagag ccagctgttc aacaagccct actggctgca gagggcccag 960
ggccacaaca acggcatctg ctggggcaac cagctgttcg tgaccgtggt ggacaccacc 1020
aggagcacca acatgaccct gtgcaccgag gtgaccaagg agggcaccta caagaacgac 1080
aacttcaagg agtacgtgag gcacgtggag gagtacgacc tgcagttcgt gttccagctg 1140
tgcaagatca ccctgaccgc cgagatcatg acctacatcc acaccatgga cagcaacatc 1200
ctggaggact ggcagttcgg cctgaccccc ccccccagcg ccagcctgca ggacacctac 1260
aggttcgtga ccagccaggc catcacctgc cagaagaccg ccccccccaa ggagaaggag 1320
gaccccctga acaagtacac cttctgggag gtgaacctga aggagaagtt cagcgccgac 1380
ctggaccagt tccccctggg caggaagttc ctgctgcaga gcggcctgaa agccaagc 1438
<![CDATA[<210> 156]]>
<![CDATA[<211> 35]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5810 HPV58L1 F2]]>
<![CDATA[<400> 156]]>
gagcggcctg aaagccaagc caaaactgaa aaggg 35
<![CDATA[<210> 157]]>
<![CDATA[<211> 36]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5811 HPV58L1 R2]]>
<![CDATA[<400> 157]]>
ctgtctagat ttacttcttc accttcttcc tcttgg 36
<![CDATA[<210> ]]>158
<![CDATA[<211> 101]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5812 HPV58L1擴增序列2]]>
<![CDATA[<400> 158]]>
gagcggcctg aaagccaagc caaaactgaa aagggctgcc ccaaccagca ccaggacctc 60
ctctgccaag aggaagaagg tgaagaagta aatctagaca g 101
<![CDATA[<210> 159]]>
<![CDATA[<211> 38]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5813 HPV 59型L1蛋白的471-508胺基酸序列]]>
<![CDATA[<400> 159]]>
Leu Gln Leu Gly Ala Arg Pro Lys Pro Thr Ile Gly Pro Arg Lys Arg
1 5 10 15
Ala Ala Pro Ala Pro Thr Ser Thr Pro Ser Pro Lys Arg Val Lys Arg
20 25 30
Arg Lys Ser Ser Arg Lys
35
<![CDATA[<210> 160]]>
<![CDATA[<211> 508]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5901 HPV 59型L1蛋白的胺基酸序列]]>
<![CDATA[<400> 160]]>
Met Ala Leu Trp Arg Ser Ser Asp Asn Lys Val Tyr Leu Pro Pro Pro
1 5 10 15
Ser Val Ala Lys Val Val Ser Thr Asp Glu Tyr Val Thr Arg Thr Ser
20 25 30
Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly His Pro
35 40 45
Tyr Phe Lys Val Pro Lys Gly Gly Asn Gly Arg Gln Asp Val Pro Lys
50 55 60
Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Lys Leu Pro Asp Pro
65 70 75 80
Asn Lys Phe Gly Leu Pro Asp Asn Thr Val Tyr Asp Pro Asn Ser Gln
85 90 95
Arg Leu Val Trp Ala Cys Val Gly Val Glu Ile Gly Arg Gly Gln Pro
100 105 110
Leu Gly Val Gly Leu Ser Gly His Pro Leu Tyr Asn Lys Leu Asp Asp
115 120 125
Thr Glu Asn Ser His Val Ala Ser Ala Val Asp Thr Lys Asp Thr Arg
130 135 140
Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Ile Gly
145 150 155 160
Cys Val Pro Ala Ile Gly Glu His Trp Thr Lys Gly Thr Ala Cys Lys
165 170 175
Pro Thr Thr Val Val Gln Gly Asp Cys Pro Pro Leu Glu Leu Ile Asn
180 185 190
Thr Pro Ile Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met
195 200 205
Asp Phe Lys Leu Leu Gln Asp Asn Lys Ser Glu Val Pro Leu Asp Ile
210 215 220
Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp
225 230 235 240
Ala Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Val Phe
245 250 255
Ala Arg His Phe Trp Asn Arg Ser Gly Thr Met Gly Asp Gln Leu Pro
260 265 270
Glu Ser Leu Tyr Ile Lys Gly Thr Asp Ile Arg Ala Asn Pro Gly Ser
275 280 285
Tyr Leu Tyr Ser Pro Ser Pro Ser Gly Ser Val Val Thr Ser Asp Ser
290 295 300
Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly Leu Asn
305 310 315 320
Asn Gly Ile Cys Trp His Asn Gln Leu Phe Leu Thr Val Val Asp Thr
325 330 335
Thr Arg Ser Thr Asn Leu Ser Val Cys Ala Ser Thr Thr Ser Ser Ile
340 345 350
Pro Asn Val Tyr Thr Pro Thr Ser Phe Lys Glu Tyr Ala Arg His Val
355 360 365
Glu Glu Phe Asp Leu Gln Phe Ile Phe Gln Leu Cys Lys Ile Thr Leu
370 375 380
Thr Thr Glu Val Met Ser Tyr Ile His Asn Met Asn Thr Thr Ile Leu
385 390 395 400
Glu Asp Trp Asn Phe Gly Val Thr Pro Pro Pro Thr Ala Ser Leu Val
405 410 415
Asp Thr Tyr Arg Phe Val Gln Ser Ala Ala Val Thr Cys Gln Lys Asp
420 425 430
Thr Ala Pro Pro Val Lys Gln Asp Pro Tyr Asp Lys Leu Lys Phe Trp
435 440 445
Pro Val Asp Leu Lys Glu Arg Phe Ser Ala Asp Leu Asp Gln Phe Pro
450 455 460
Leu Gly Arg Lys Phe Leu Leu Gln Leu Gly Ala Arg Pro Lys Pro Thr
465 470 475 480
Ile Gly Pro Arg Lys Arg Ala Ala Pro Ala Pro Thr Ser Thr Pro Ser
485 490 495
Pro Lys Arg Val Lys Arg Arg Lys Ser Ser Arg Lys
500 505
<![CDATA[<210> 161]]>
<![CDATA[<211> 1528]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5902 HPV 59型L1蛋白的核苷酸序列]]>
<![CDATA[<400> 161]]>
atggctctgt ggaggtcctc tgacaacaag gtctacctgc ctcctccatc tgtggctaag 60
gtggtgagca cagatgaata tgtgaccagg accagcatct tctaccatgc tggctccagc 120
agactgctga cagtgggaca cccatacttc aaggtgccaa agggaggcaa tggcagacag 180
gatgtgccaa aggtgtctgc ctaccaatac agggtgttca gggtgaaact gcctgaccca 240
aacaagtttg gactgcctga caacacagtc tatgacccaa acagccagag actggtgtgg 300
gcttgtgtgg gagtggagat tggcagggga caaccactgg gagtgggact gtctggacac 360
ccactctaca acaaactgga tgacacagag aactctcatg tggcatctgc tgtggacacc 420
aaggacacca gggacaatgt gtctgtggac tacaagcaga cccaactttg tatcattggc 480
tgtgtgcctg ccattggaga acactggacc aagggcacag cctgtaagcc aaccacagtg 540
gtccagggag actgtcctcc attggaactg ataaacacac caattgagga tggagatatg 600
gtggacacag gctatggagc tatggacttc aaactgctcc aagacaacaa gtctgaggtg 660
ccactggaca tctgtcagag catctgtaaa taccctgact acctccaaat gagtgctgat 720
gcctatggag acagtatgtt cttctgtctg aggagggaac aggtgtttgc cagacacttc 780
tggaacaggt ctggcacaat gggagaccaa cttcctgagt ccctctacat caagggcaca 840
gacatcaggg ctaaccctgg ctcctacctc tacagcccaa gcccatctgg ctctgtggtg 900
acctctgaca gccaactttt caacaagcca tactggctgc acaaggctca aggactgaac 960
aatggcatct gttggcacaa ccaacttttc ctgacagtgg tggacaccac caggagcacc 1020
aacctgtctg tgtgtgccag caccacctcc agcatcccaa atgtctacac accaacctcc 1080
ttcaaggaat atgccaggca tgtggaggag tttgacctcc aattcatctt ccaactttgt 1140
aagattaccc tgaccacaga ggtgatgagt tacatccaca atatgaacac caccatcttg 1200
gaggactgga actttggagt gacacctcct ccaacagcct ccctggtgga cacctacagg 1260
tttgtccagt ctgctgctgt gacttgtcag aaggacacag cccctcctgt gaagcaggac 1320
ccatatgaca aactgaagtt ctggcctgtg gacctgaaag agaggttctc tgctgacctg 1380
gaccagtttc cactgggcag gaagttcctg ctccaacttg gagccagacc aaagccaacc 1440
attggaccaa ggaagagggc tgcccctgcc ccaaccagca caccaagccc aaagagggtg 1500
aagaggagga agtccagcag gaagtaaa 1528
<![CDATA[<210> 162]]>
<![CDATA[<211> 1546]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220> ]]>
<![CDATA[<223> 5903 合成的HPV59L1基因]]>
<![CDATA[<400> 162]]>
ctgggtacca tggctctgtg gaggtcctct gacaacaagg tctacctgcc tcctccatct 60
gtggctaagg tggtgagcac agatgaatat gtgaccagga ccagcatctt ctaccatgct 120
ggctccagca gactgctgac agtgggacac ccatacttca aggtgccaaa gggaggcaat 180
ggcagacagg atgtgccaaa ggtgtctgcc taccaataca gggtgttcag ggtgaaactg 240
cctgacccaa acaagtttgg actgcctgac aacacagtct atgacccaaa cagccagaga 300
ctggtgtggg cttgtgtggg agtggagatt ggcaggggac aaccactggg agtgggactg 360
tctggacacc cactctacaa caaactggat gacacagaga actctcatgt ggcatctgct 420
gtggacacca aggacaccag ggacaatgtg tctgtggact acaagcagac ccaactttgt 480
atcattggct gtgtgcctgc cattggagaa cactggacca agggcacagc ctgtaagcca 540
accacagtgg tccagggaga ctgtcctcca ttggaactga taaacacacc aattgaggat 600
ggagatatgg tggacacagg ctatggagct atggacttca aactgctcca agacaacaag 660
tctgaggtgc cactggacat ctgtcagagc atctgtaaat accctgacta cctccaaatg 720
agtgctgatg cctatggaga cagtatgttc ttctgtctga ggagggaaca ggtgtttgcc 780
agacacttct ggaacaggtc tggcacaatg ggagaccaac ttcctgagtc cctctacatc 840
aagggcacag acatcagggc taaccctggc tcctacctct acagcccaag cccatctggc 900
tctgtggtga cctctgacag ccaacttttc aacaagccat actggctgca caaggctcaa 960
ggactgaaca atggcatctg ttggcacaac caacttttcc tgacagtggt ggacaccacc 1020
aggagcacca acctgtctgt gtgtgccagc accacctcca gcatcccaaa tgtctacaca 1080
ccaacctcct tcaaggaata tgccaggcat gtggaggagt ttgacctcca attcatcttc 1140
caactttgta agattaccct gaccacagag gtgatgagtt acatccacaa tatgaacacc 1200
accatcttgg aggactggaa ctttggagtg acacctcctc caacagcctc cctggtggac 1260
acctacaggt ttgtccagtc tgctgctgtg acttgtcaga aggacacagc ccctcctgtg 1320
aagcaggacc catatgacaa actgaagttc tggcctgtgg acctgaaaga gaggttctct 1380
gctgacctgg accagtttcc actgggcagg aagttcctgc tccaacttgg agccagacca 1440
aagccaacca ttggaccaag gaagagggct gcccctgccc caaccagcac accaagccca 1500
aagagggtga agaggaggaa gtccagcagg aagtaaactc gagctc 1546
Sequence Listing
<![CDATA[ <110> China-based company Shenzhou Cell Engineering Co., Ltd.]]>
<![CDATA[ <120> Human papillomavirus multivalent immunogenic composition]]>
<![CDATA[ <140> TW109135395]]>
<![CDATA[ <141> 2020-10-13]]>
<![CDATA[ <160> 162]]>
<![CDATA[ <170> BiSSAP 1.3.6]]>
<![CDATA[ <210> 1]]>
<![CDATA[ <211> 469]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 0601 HPV type 6 L1 protein amino acid sequence 1-469]]>
<![CDATA[ <400> 1]]> Met Trp Arg Pro Ser Asp Ser Thr Val Tyr Val Pro Pro Pro Asn Pro 1 5 10 15 Val Ser Lys Val Val Ala Thr Asp Ala Tyr Val Thr Arg Thr Asn Ile 20 25 30 Phe Tyr His Ala Ser Ser Ser Arg Leu Leu Ala Val Gly His Pro Tyr 35 40 45 Phe Ser Ile Lys Arg Ala Asn Lys Thr Val Val Pro Lys Val Ser Gly 50 55 60 Tyr Gln Tyr Arg Val Phe Lys Val Val Leu Pro Asp Pro Asn Lys Phe 65 70 75 80 Ala Leu Pro Asp Ser Ser Leu Phe Asp Pro Thr Thr Gln Arg Leu Val 85 90 95 Trp Ala Cys Thr Gly Leu Glu Val Gly Arg Gly Gln Pro Leu Gly Val 100 105 110 Gly Val Ser Gly His Pro Phe Leu Asn Lys Tyr Asp Asp Val Glu Asn 115 120 125 Ser Gly Ser Gly Gly Asn Pro Gly Gln Asp Asn Arg Val Asn Val Gly 130 135 140 Met Asp Tyr Lys Gln Thr Gln Leu Cys Met Val Gly Cys Ala Pro Pro 145 150 155 160 Leu Gly Glu His Trp Gly Lys Gly Lys Gln Cys Thr Asn Thr Pro Val 165 170 175 Gln Ala Gly Asp Cys Pro Pro Leu Glu Leu Ile Thr Ser Val Ile Gln 180 185 190 Asp Gly Asp Met Val Asp Thr Gly Phe Gly Ala Met Asn Phe Ala Asp 195 200 205 Leu Gln Thr Asn Lys Ser Asp Val Pro Ile Asp Ile Cys Gly Thr Thr 210 215 220 Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ala Ala Asp Pro Tyr Gly Asp 225 230 235 240 Arg Leu Phe Phe Phe Leu Arg Lys Glu Gln Met Phe Ala Arg His Phe 245 250 255 Phe Asn Arg Ala Gly Glu Val Gly Glu Pro Val Pro Asp Thr Leu Ile 260 265 270 Ile Lys Gly Ser Gly Asn Arg Thr Ser Val Gly Ser Ser Ile Tyr Val 275 280 285 Asn Thr Pro Ser Gly Ser Leu Val Ser Ser Glu Ala Gln Leu Phe Asn 290 295 300 Lys Pro Tyr Trp Leu Gln Lys Ala Gln Gly His Asn Asn Gly Ile Cys 305 310 315 320 Trp Gly Asn Gln Leu Phe Val Thr Val Val Asp Thr Thr Arg Ser Thr 325 330 335 Asn Met Thr Leu Cys Ala Ser Val Thr Thr Ser Ser Thr Tyr Thr Asn 340 345 350 Ser Asp Tyr Lys Glu Tyr Met Arg His Val Glu Glu Tyr Asp Leu Gln 355 360 365 Phe Ile Phe Gln Leu Cys Ser Ile Thr Leu Ser Ala Glu Val Met Ala 370 375 380 Tyr Ile His Thr Met Asn Pro Ser Val Leu Glu Asp Trp Asn Phe Gly 385 390 395 400 Leu Ser Pro Pro Pro Asn Gly Thr Leu Glu Asp Thr Tyr Arg Tyr Val 405 410 415 Gln Ser Gln Ala Ile Thr Cys Gln Lys Pro Thr Pro Glu Lys Glu Lys 420 425 430 Pro Asp Pro Tyr Lys Asn Leu Ser Phe Trp Glu Val Asn Leu Lys Glu 435 440 445 Lys Phe Ser Ser Glu Leu Asp Gln Tyr Pro Leu Gly Arg Lys Phe Leu 450 455 460 Leu Gln Ser Gly Tyr 465
<![CDATA[ <210> 2]]>
<![CDATA[ <211> 26]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 0602 HPV 33 L1 protein amino acid sequence 474-499]]>
<![CDATA[ <400> 2]]> Lys Ala Lys Pro Lys Leu Lys Arg Ala Ala Pro Thr Ser Thr Arg Thr 1 5 10 15 Ser Ser Ala Lys Arg Lys Lys Val Lys Lys 20 25
<![CDATA[ <210> 3]]>
<![CDATA[ <211> 495]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 0603 Amino acid sequence of chimeric HPV type 6 L1 protein]]>
<![CDATA[ <400> 3]]> Met Trp Arg Pro Ser Asp Ser Thr Val Tyr Val Pro Pro Pro Asn Pro 1 5 10 15 Val Ser Lys Val Val Ala Thr Asp Ala Tyr Val Thr Arg Thr Asn Ile 20 25 30 Phe Tyr His Ala Ser Ser Ser Arg Leu Leu Ala Val Gly His Pro Tyr 35 40 45 Phe Ser Ile Lys Arg Ala Asn Lys Thr Val Val Pro Lys Val Ser Gly 50 55 60 Tyr Gln Tyr Arg Val Phe Lys Val Val Leu Pro Asp Pro Asn Lys Phe 65 70 75 80 Ala Leu Pro Asp Ser Ser Leu Phe Asp Pro Thr Thr Gln Arg Leu Val 85 90 95 Trp Ala Cys Thr Gly Leu Glu Val Gly Arg Gly Gln Pro Leu Gly Val 100 105 110 Gly Val Ser Gly His Pro Phe Leu Asn Lys Tyr Asp Asp Val Glu Asn 115 120 125 Ser Gly Ser Gly Gly Asn Pro Gly Gln Asp Asn Arg Val Asn Val Gly 130 135 140 Met Asp Tyr Lys Gln Thr Gln Leu Cys Met Val Gly Cys Ala Pro Pro 145 150 155 160 Leu Gly Glu His Trp Gly Lys Gly Lys Gln Cys Thr Asn Thr Pro Val 165 170 175 Gln Ala Gly Asp Cys Pro Pro Leu Glu Leu Ile Thr Ser Val Ile Gln 180 185 190 Asp Gly Asp Met Val Asp Thr Gly Phe Gly Ala Met Asn Phe Ala Asp 195 200 205 Leu Gln Thr Asn Lys Ser Asp Val Pro Ile Asp Ile Cys Gly Thr Thr 210 215 220 Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ala Ala Asp Pro Tyr Gly Asp 225 230 235 240 Arg Leu Phe Phe Phe Leu Arg Lys Glu Gln Met Phe Ala Arg His Phe 245 250 255 Phe Asn Arg Ala Gly Glu Val Gly Glu Pro Val Pro Asp Thr Leu Ile 260 265 270 Ile Lys Gly Ser Gly Asn Arg Thr Ser Val Gly Ser Ser Ile Tyr Val 275 280 285 Asn Thr Pro Ser Gly Ser Leu Val Ser Ser Glu Ala Gln Leu Phe Asn 290 295 300 Lys Pro Tyr Trp Leu Gln Lys Ala Gln Gly His Asn Asn Gly Ile Cys 305 310 315 320 Trp Gly Asn Gln Leu Phe Val Thr Val Val Asp Thr Thr Arg Ser Thr 325 330 335 Asn Met Thr Leu Cys Ala Ser Val Thr Thr Ser Ser Thr Tyr Thr Asn 340 345 350 Ser Asp Tyr Lys Glu Tyr Met Arg His Val Glu Glu Tyr Asp Leu Gln 355 360 365 Phe Ile Phe Gln Leu Cys Ser Ile Thr Leu Ser Ala Glu Val Met Ala 370 375 380 Tyr Ile His Thr Met Asn Pro Ser Val Leu Glu Asp Trp Asn Phe Gly 385 390 395 400 Leu Ser Pro Pro Pro Asn Gly Thr Leu Glu Asp Thr Tyr Arg Tyr Val 405 410 415 Gln Ser Gln Ala Ile Thr Cys Gln Lys Pro Thr Pro Glu Lys Glu Lys 420 425 430 Pro Asp Pro Tyr Lys Asn Leu Ser Phe Trp Glu Val Asn Leu Lys Glu 435 440 445 Lys Phe Ser Ser Glu Leu Asp Gln Tyr Pro Leu Gly Arg Lys Phe Leu 450 455 460 Leu Gln Ser Gly Tyr Lys Ala Lys Pro Lys Leu Lys Arg Ala Ala Pro 465 470 475 480 Thr Ser Thr Arg Thr Ser Ser Ala Lys Arg Lys Lys Val Lys Lys 485 490 495
<![CDATA[ <210> 4]]>
<![CDATA[ <211> 1488]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 0604 Nucleotide sequence of chimeric HPV type 6 L1 protein]]>
<![CDATA[ <400> 4]]> atgtggagac catctgacag cacagtctat gtgcctcctc caaaccctgt gagcaaggtg 60 gtggctacag atgcctatgt gaccaggacc aacatcttct accatgcctc ctccagcaga 120 ctgctggctg tgggacaccc atacttcagc atcaagaggg ctaacaagac agtggtgcca 180 aaggtgtctg gctaccaata cagggtgttc aaggtggtgc tgcctgaccc aaacaagttt 240 gccctgcctg actcctccct gtttgaccca accacccaga gactggtgtg ggcttgtact 300 ggattggagg tgggcagggg acaaccactg ggagtggggag tgtctggacaccattcctg 360 aacaaatatg atgatgtgga gaactctggc tctggaggca accctggaca agacaacagg 420 gtgaatgtgg ggatggacta caagcagacc caactttgta tggtgggctg tgcccctcca 480 ctgggagaac actggggcaa gggcaagcag tgtaccaaca cacctgtcca ggctggagac 540 tgtcctccat tggaactgat tacctctgtg attcaggatg gagatatggt ggacacaggc 600 tttggagcta tgaactttgc tgacctccaa accaacaagt ctgatgtgcc aattgacatc 660 tgtggcacca cttgtaaata ccctgactac ctccaaatgg ctgctgaccc atatggagac 720 agactgttct tcttcctgag gaaggaacag atgtttgcca gacacttctt caacagggct 780 ggagaggtgg gagaacctgt gcctgacacc ctgattatca agggctctgg caacaggacc 840 tctgtgggct ccagcatcta tgtgaacaca ccatctggct ccctggtgtc ctctgaggct 900 caacttttca acaagccata ctggctccaa aaggctcaag gacacaacaa tggcatctgt 960 tggggcaacc aactttttgt gacagtggtg gacaccacca ggagcaccaa tatgaccctg 1020 tgtgcctctg tgaccacctc cagcacctac accaactctg actacaagga atatatgagg 1080 catgtggagg aatatgacct ccaattcatc ttccaacttt gtagcatcac cctgtctgct 1140 gaggtgatgg cttacatcca cacaatgaac ccatctgtgt tggaggactg gaactttgga 1200 ctgagccctc ctccaaatgg caccttggag gacacctaca gatatgtcca gagccaggct 1260 atcacttgtc agaagccaac acctgagaag gagaagcctg acccatacaa gaacctgtcc 1320 ttctgggagg tgaacctgaa agagaagttc tcctctgaac tggaccaata cccactgggc 1380 aggaagttcc tgctccaatc tggctacaaa gccaagccaa aactgaaaag ggctgcccca 1440 accagcacca ggacctcctc tgccaagagg aagaaggtga agaagtaa 1488
<![CDATA[ <210> 5]]>
<![CDATA[ <211> 1522]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 0605 Synthetic HPV6L1 gene]]>
<![CDATA[ <400> 5]]> ctgggtacca tgtggagacc atctgacagc acagtctatg tgcctcctcc aaaccctgtg 60 agcaaggtgg tggctacaga tgcctatgtg accaggacca acatcttcta ccatgcctcc 120 tccagcagac tgctggctgt gggacaccca tacttcagca tcaagagggc taacaagaca 180 gtggtgccaa aggtgtctgg ctaccaatac agggtgttca aggtggtgct gcctgaccca 240 aacaagtttg ccctgcctga ctcctccctg tttgacccaa ccacccagag actggtgtgg 300 gcttgtactg gattggaggt gggcagggga caaccactgg gagtggggagt gtctggacac 360 ccattcctga acaaatatga tgatgtggag aactctggct ctggaggcaa ccctggacaa 420 gacaacaggg tgaatgtggg gatggactac aagcagaccc aactttgtat ggtgggctgt 480 gcccctccac tgggagaaca ctggggcaag ggcaagcagt gtaccaacac acctgtccag 540 gctggagact gtcctccatt ggaactgatt acctctgtga ttcaggatgg agatatggtg 600 gacacaggct ttggagctat gaactttgct gacctccaaa ccaacaagtc tgatgtgcca 660 attgacatct gtggcaccac ttgtaaatac cctgactacc tccaaatggc tgctgaccca 720 tatggagaca gactgttctt cttcctgagg aaggaacaga tgtttgccag acacttcttc 780 aacagggctg gagaggtggg agaacctgtg cctgacaccc tgattatcaa gggctctggc 840 aacaggacct ctgtgggctc cagcatctat gtgaacacac catctggctc cctggtgtcc 900 tctgaggctc aacttttcaa caagccatac tggctccaaa aggctcaagg acacaacaat 960 ggcatctgtt ggggcaacca actttttgtg acagtggtgg acaccaccag gagcaccaat 1020 atgaccctgt gtgcctctgt gaccacctcc agcacctaca ccaactctga ctacaaggaa 1080 tatatgaggc atgtggagga atatgacctc caattcatct tccaactttg tagcatcacc 1140 ctgtctgctg aggtgatggc ttacatccac acaatgaacc catctgtgtt ggaggactgg 1200 aactttggac tgagccctcc tccaaatggc accttggagg acacctacag atatgtccag 1260 agccaggcta tcacttgtca gaagccaaca cctgagaagg agaagcctga cccatacaag 1320 aacctgtcct tctgggaggt gaacctgaaa gagaagttct cctctgaact ggaccaatac 1380 ccactgggca ggaagttcct gctccaatct ggctacaggg gcaggtccag catcaggaca 1440 ggagtgaaga gacctgctgt gagcaaggca tctgctgccc caaagaggaa gagggctaag 1500 accaagaggt aaactcgagc tc 1522
<![CDATA[ <210> 6]]>
<![CDATA[ <211> 1519]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <22]]>3> 0606 Synthetic HPV33L1 gene]]>
<br/>
<br/> <![CDATA[ <400>6]]>
<br/> <![CDATA[ctgggtacca tgagtgtgtg gagaccatct gaggctacag tctacctgcc tcctgtgcct 60 gtgagcaagg tggtgagcac agatgaatat gtgagcagga ccagcatcta ctactatgct 120 ggctccagca gactgctggc tgtgggacac ccatacttca gcatcaagaa cccaaccaat 180 gccaagaaac tgctggtgcc aaaggtgtct ggactccaat acagggtgtt cagggtgaga 240 ctgcctgacc caaacaagtt tggctttcct gacacctcct tctacaaccc tgacacccag 300 agactggtgt gggcttgtgt gggattggag attggcaggg gacaaccact gggagtgggc 360 atctctggac acccactgct gaacaagttt gatgacacag agacaggcaa caaataccct 420 ggacaacctg gagcagacaa cagggagtgt ctgagtatgg actacaagca gacccaactt 480 tgtctgctgg gctgtaagcc tccaacagga gaacactggg gcaagggagt ggcttgtacc 540 aatgctgccc ctgccaatga ctgtcctcca ttggaactga taaacaccat cattgaggat 600 ggagatatgg tggacacagg ctttggctgt atggacttca agaccctcca agccaacaag 660 tctgatgtgc caattgacat ctgtggcagc acttgtaaat accctgacta cctgaaaatg 720 acctctgaac catatggaga ctccctgttc ttcttcctga ggagggaaca gatgtttgtg 780 agacacttct tcaacagggc tggcaccctg ggagaggctg tgcctgatga cctctacatc 840 aagggctctg gcaccacagc cagcatccag tcctctgcct tctttccaac accatctggc 900 agtatggtga cctctgagag ccaacttttc aacaagccat actggctcca aagggctcaa 960 ggacacaaca atggcatctg ttggggcaac caggtgtttg tgacagtggt ggacaccacc 1020 aggagcacca atatgaccct gtgtacccag gtgacctctg acagcaccta caagaatgag 1080 aacttcaagg aatacatcag gcatgtggag gaatatgacc tccaatttgt gttccaactt 1140 tgtaaggtga ccctgacagc agaggtgatg acctacatcc atgctatgaa ccctgacatc 1200 ttggaggact ggcagtttgg actgacacct cctccatctg cctccctcca agacacctac 1260 aggtttgtga ccagccaggc tatcacttgt cagaagacag tgcctccaaa ggagaaggag 1320 gacccactgg gcaaatacac cttctggggag gtggacctga aagagaagtt ctctgctgac 1380 ctggaccagt ttccactggg caggaagttc ctgctccaag caggactgaa agccaagcca 1440 aaactgaaaa gggctgcccc aaccagcacc aggacctcct ctgccaagag gaagaaggtg 1500 aagaagtaaa ctcgagctc 1519
<![CDATA[ <210> 7]]>
<![CDATA[ <211> 35]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 0607 HPV6L1 F1]]>
<![CDATA[ <400> 7]]> cttggtacca tgtggagacc atctgacagc acagt 35
<![CDATA[ <210> 8]]>
<![CDATA[ <211> 36]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 0608 HPV6L1 R1]]>
<![CDATA[ <400> 8]]> gcttggcttt gtagccagat tggagcagga acttcc 36 <![CDATA[ <210> 9]]>
<![CDATA[ <211> 1426]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 0609 HPV6L1 expansion sequence 1]]>
<![CDATA[ <400> 9]]> cttggtacca tgtggagacc atctgacagc acagtctatg tgcctcctcc aaaccctgtg 60 agcaaggtgg tggctacaga tgcctatgtg accaggacca acatcttcta ccatgcctcc 120 tccagcagac tgctggctgt gggacaccca t acttcagca tcaagagggc taacaagaca 180 gtggtgccaa aggtgtctgg ctaccaatac agggtgttca aggtggtgct gcctgaccca 240 aacaagtttg ccctgcctga ctcctccctg tttgacccaa ccacccagag actggtgtgg 300 gcttgtactg gattgg aggt gggcagggga caaccactgg gagtgggagt gtctggacac 360 ccattcctga acaaatatga tgatgtggag aactctggct ctggaggcaa ccctggacaa 420 gacaacaggg tgaatgtggg gatggactac aagcagaccc aactttgtat ggtgggctgt 480 gcccctccac tgggagaaca ct ggggcaag ggcaagcagt gtaccaacac acctgtccag 540 gctggagact gtcctccatt ggaactgatt acctctgtga ttcaggatgg agatatggtg 600 gacacaggct ttggagctat gaactttgct gacctccaaa ccaacaagtc tgatgtgcca 660 attgacatct gt ggcaccac ttgtaaatac cctgactacc tccaaatggc tgctgaccca 720 tatggagaca gactgttctt cttcctgagg aaggaacaga tgtttgccag acacttcttc 780 aacagggctg gagaggtggg agaacctgtg cctgacaccc tgattatcaa gggctctggc 840 a acaggacct ctgtgggctc cagcatctat gtgaacacac catctggctc cctggtgtcc 900 tctgaggctc aacttttcaa caagccatac tggctccaaa aggctcaagg acacaacaat 960 ggcatctgtt ggggcaacca actttttgtg acagtggtgg acaccaccag gagcaccaat 1020 atgaccctgt gtgcctctgt gaccacctcc agcacctaca ccaactctga ctacaaggaa 1080 tatatgaggc atgtggagga atatgacctc caattcatct tccaactttg tagcatcacc 1140 ctgtctgctg aggtgatggc ttacatccac acaatgaacc catctgtgtt ggaggactgg 12 00 aactttggac tgagccctcc tccaaatggc accttggagg acacctacag atatgtccag 1260 agccaggcta tcacttgtca gaagccaaca cctgagaagg agaagcctga cccatacaag 1320 aacctgtcct tctgggaggt gaacctgaaa gagaagttct cctctga act ggaccaatac 1380 ccactgggca ggaagttcct gctccaatct ggctacaaag ccaagc 1426 <![CDATA[ <210> 10]]>
<![CDATA[ <211> 35]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 0610 HPV6L1 F2]]> <![CDATA[ <400> 10]]> atctggctac aaagccaagc caaaactgaa aaggg 35 <![CDATA[ <210> 11]]>
<![CDATA[ <211> 37]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 0611 HPV6L1 R2]]> <![CDATA[ <400> 11]]> ctgtctagat ttacttcttc accttcttcc tcttggc 37 <![CDATA[ <210> 12]]>
<![CDATA[ <211> 101]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 0612 HPV6L1 expansion sequence 2]]>
<![CDATA[ <400> 12]]> atctggctac aaagccaagc caaaactgaa aagggctgcc ccaaccagca ccaggacctc 60 ctctgccaag aggaagaagg tgaagaagta aatctagaca g 101 <![CDATA[ <210> 13]]>
<![CDATA[ <211> 38]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 0613 HPV 59 L1 protein amino acid sequence 471-508]]>
<![CDATA[ <400> 13]]> Leu Gln Leu Gly Ala Arg Pro Lys Pro Thr Ile Gly Pro Arg Lys Arg 1 5 10 15 Ala Ala Pro Ala Pro Thr Ser Thr Pro Ser Pro Lys Arg Val Lys Arg 20 25 30 Arg Lys Ser Ser Arg Lys 35 <![CDATA[ <210> 14]]>
<![CDATA[ <211> 470]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1101 HPV type 11 L1 protein amino acid sequence 1-470]]>
<![CDATA[ <400> 14]]> Met Trp Arg Pro Ser Asp Ser Thr Val Tyr Val Pro Pro Pro Asn Pro 1 5 10 15 Val Ser Lys Val Val Ala Thr Asp Ala Tyr Val Lys Arg Thr Asn Ile 20 25 30 Phe Tyr His Ala Ser Ser Ser Arg Leu Leu Ala Val Gly His Pro Tyr 35 40 45 Tyr Ser Ile Lys Lys Val Asn Lys Thr Val Val Pro Lys Val Ser Gly 50 55 60 Tyr Gln Tyr Arg Val Phe Lys Val Val Leu Pro Asp Pro Asn Lys Phe 65 70 75 80 Ala Leu Pro Asp Ser Ser Leu Phe Asp Pro Thr Thr Gln Arg Leu Val 85 90 95 Trp Ala Cys Thr Gly Leu Glu Val Gly Arg Gly Gln Pro Leu Gly Val 100 105 110 Gly Val Ser Gly His Pro Leu Leu Asn Lys Tyr Asp Asp Val Glu Asn 115 120 125 Ser Gly Gly Tyr Gly Gly Gly Asn Pro Gly Gln Asp Asn Arg Val Asn Val 130 135 140 Gly Met Asp Tyr Lys Gln Thr Gln Leu Cys Met Val Gly Cys Ala Pro 145 150 155 160 Pro Leu Gly Glu His Trp Gly Lys Gly Thr Gln Cys Ser Asn Thr Ser 165 170 175 Val Gln Asn Gly Asp Cys Pro Pro Leu Glu Leu Ile Thr Ser Val Ile 180 185 190 Gln Asp Gly Asp Met Val Asp Thr Gly Phe Gly Ala Met Asn Phe Ala 195 200 205 Asp Leu Gln Thr Asn Lys Ser Asp Val Pro Leu Asp Ile Cys Gly Thr 210 215 220 Val Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ala Ala Asp Pro Tyr Gly 225 230 235 240 Asp Arg Leu Phe Phe Tyr Leu Arg Lys Glu Gln Met Phe Ala Arg His 245 250 255 Phe Phe Asn Arg Ala Gly Thr Val Gly Glu Pro Val Pro Asp Asp Leu 260 265 270 Leu Val Lys Gly Gly Asn Asn Arg Ser Ser Val Ala Ser Ser Ile Tyr 275 280 285 Val His Thr Pro Ser Gly Ser Leu Val Ser Ser Ser Glu Ala Gln Leu Phe 290 295 300 Asn Lys Pro Tyr Trp Leu Gln Lys Ala Gln Gly His Asn Asn Gly Ile 305 310 315 320 Cys Trp Gly Asn His Leu Phe Val Thr Val Val Asp Thr Thr Arg Ser 325 330 335 Thr Asn Met Thr Leu Cys Ala Ser Val Ser Lys Ser Ala Thr Tyr Thr 340 345 350 Asn Ser Asp Tyr Lys Glu Tyr Met Arg His Val Glu Glu Phe Asp Leu 355 360 365 Gln Phe Ile Phe Gln Leu Cys Ser Ile Thr Leu Ser Ala Glu Val Met 370 375 380 Ala Tyr Ile His Thr Met Asn Pro Ser Val Leu Glu Asp Trp Asn Phe 385 390 395 400 Gly Leu Ser Pro Pro Pro Asn Gly Thr Leu Glu Asp Thr Tyr Arg Tyr 405 410 415 Val Gln Ser Gln Ala Ile Thr Cys Gln Lys Pro Thr Pro Glu Lys Glu 420 425 430 Lys Gln Asp Pro Tyr Lys Asp Met Ser Phe Trp Glu Val Asn Leu Lys 435 440 445 Glu Lys Phe Ser Ser Glu Leu Asp Gln Phe Pro Leu Gly Arg Lys Phe 450 455 460 Leu Leu Gln Ser Gly Tyr 465 470 <![CDATA[ <210> 15]]>
<![CDATA[ <211> 26]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1102 HPV 33 L1 protein amino acid sequence 474-499]]>
<![CDATA[ <400> 15]]> Lys Ala Lys Pro Lys Leu Lys Arg Ala Ala Pro Thr Ser Thr Arg Thr 1 5 10 15 Ser Ser Ala Lys Arg Lys Lys Val Lys Lys 20 25 <![CDATA[ <210> 16]]>
<![CDATA[ <211> 496]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1103 Amino acid sequence of chimeric HPV type 11 L1 protein]]>
<![CDATA[ <400> 16]]> Met Trp Arg Pro Ser Asp Ser Thr Val Tyr Val Pro Pro Pro Asn Pro 1 5 10 15 Val Ser Lys Val Val Ala Thr Asp Ala Tyr Val Lys Arg Thr Asn Ile 20 25 30 Phe Tyr His Ala Ser Ser Ser Arg Leu Leu Ala Val Gly His Pro Tyr 35 40 45 Tyr Ser Ile Lys Lys Val Asn Lys Thr Val Val Pro Lys Val Ser Gly 50 55 60 Tyr Gln Tyr Arg Val Phe Lys Val Val Leu Pro Asp Pro Asn Lys Phe 65 70 75 80 Ala Leu Pro Asp Ser Ser Leu Phe Asp Pro Thr Thr Gln Arg Leu Val 85 90 95 Trp Ala Cys Thr Gly Leu Glu Val Gly Arg Gly Gln Pro Leu Gly Val 100 105 110 Gly Val Ser Gly His Pro Leu Leu Asn Lys Tyr Asp Asp Val Glu Asn 115 120 125 Ser Gly Gly Tyr Gly Gly Gly Asn Pro Gly Gln Asp Asn Arg Val Asn Val 130 135 140 Gly Met Asp Tyr Lys Gln Thr Gln Leu Cys Met Val Gly Cys Ala Pro 145 150 155 160 Pro Leu Gly Glu His Trp Gly Lys Gly Thr Gln Cys Ser Asn Thr Ser 165 170 175 Val Gln Asn Gly Asp Cys Pro Pro Leu Glu Leu Ile Thr Ser Val Ile 180 185 190 Gln Asp Gly Asp Met Val Asp Thr Gly Phe Gly Ala Met Asn Phe Ala 195 200 205 Asp Leu Gln Thr Asn Lys Ser Asp Val Pro Leu Asp Ile Cys Gly Thr 210 215 220 Val Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ala Ala Asp Pro Tyr Gly 225 230 235 240 Asp Arg Leu Phe Phe Tyr Leu Arg Lys Glu Gln Met Phe Ala Arg His 245 250 255 Phe Phe Asn Arg Ala Gly Thr Val Gly Glu Pro Val Pro Asp Asp Leu 260 265 270 Leu Val Lys Gly Gly Asn Asn Arg Ser Ser Val Ala Ser Ser Ile Tyr 275 280 285 Val His Thr Pro Ser Gly Ser Leu Val Ser Ser Ser Glu Ala Gln Leu Phe 290 295 300 Asn Lys Pro Tyr Trp Leu Gln Lys Ala Gln Gly His Asn Asn Gly Ile 305 310 315 320 Cys Trp Gly Asn His Leu Phe Val Thr Val Val Asp Thr Thr Arg Ser 325 330 335 Thr Asn Met Thr Leu Cys Ala Ser Val Ser Lys Ser Ala Thr Tyr Thr 340 345 350 Asn Ser Asp Tyr Lys Glu Tyr Met Arg His Val Glu Glu Phe Asp Leu 355 360 365 Gln Phe Ile Phe Gln Leu Cys Ser Ile Thr Leu Ser Ala Glu Val Met 370 375 380 Ala Tyr Ile His Thr Met Asn Pro Ser Val Leu Glu Asp Trp Asn Phe 385 390 395 400 Gly Leu Ser Pro Pro Pro Asn Gly Thr Leu Glu Asp Thr Tyr Arg Tyr 405 410 415 Val Gln Ser Gln Ala Ile Thr Cys Gln Lys Pro Thr Pro Glu Lys Glu 420 425 430 Lys Gln Asp Pro Tyr Lys Asp Met Ser Phe Trp Glu Val Asn Leu Lys 435 440 445 Glu Lys Phe Ser Ser Glu Leu Asp Gln Phe Pro Leu Gly Arg Lys Phe 450 455 460 Leu Leu Gln Ser Gly Tyr Lys Ala Lys Pro Lys Leu Lys Arg Ala Ala 465 470 475 480 Pro Thr Ser Thr Arg Thr Ser Ser Ala Lys Arg Lys Lys Val Lys Lys 485 490 495 <![CDATA[ <210> 17]]>
<![CDATA[ <211> 1492]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1104 Nucleotide sequence of chimeric HPV type 11 L1 protein]]>
<![CDATA[ <400> 17]]>
atgtggagac catctgacag cacagtctat gtgcctcctc caaaccctgt gagcaaggtg 60
gtggctacag atgcctatgt gaagaggacc aacatcttct accatgcctc ctccagcaga 120
ctgctggctg tgggacaccc atactacagc atcaagaagg tgaacaagac agtggtgcca 180
aaggtgtctg gctaccaata cagggtgttc aaggtggtgc tgcctgaccc aaacaagttt 240
gccctgcctg actcctccct gtttgaccca accacccaga gactggtgtg ggcttgtact 300
ggattggagg tgggcagggg acaaccactg ggagtgggag tgtctggaca cccactgctg 360
aacaaatatg atgatgtgga gaactctgga ggctatggag gcaaccctgg acaagacaac 420
agggtgaatg tggggatgga ctacaagcag acccaacttt gtatggtggg ctgtgcccct 480
ccactgggag aacactgggg caagggcacc cagtgtagca acacctctgt ccagaatgga 540
gactgtcctc cattggaact gattacctct gtgattcagg atggagatat ggtggacaca 600
ggctttggag ctatgaactt tgctgacctc caaaccaaca agtctgatgt gccactggac 660
atctgtggca cagtgtgtaa ataccctgac tacctccaaa tggctgctga cccatatgga 720 gacagactgt tcttctacct gaggaaggaa cagatgtttg ccagacactt cttcaacagg 780 gctggcacag tgggagaacc tgtgcctgat gacctgctgg tgaagggagg caacaacagg 840 tcc tctgtgg catccagcat ctatgtgcat acaccatctg gctccctggt gtcctctgag 900 gctcaacttt tcaacaagcc atactggctc caaaaggctc aaggacacaa caatggcatc 960 tgttggggca accacctgtt tgtgacagtg gtggacacca ccaggagcac caatatgacc 102 0 ctgtgtgcct ctgtgagcaa gtctgccacc taccaact ctgactacaa ggaatatatg 1080 aggcatgtgg aggagtttga cctccaattc atcttccaac tttgtagcat caccctgtct 1140 gctgaggtga tggcttacat ccacacaatg aacccatctg tgttggag ga ctggaacttt 1200 ggactgagcc ctcctccaaa tggcaccttg gaggacacct acagatatgt ccagagccag 1260 gctatcactt gtcagaagcc aacacctgag aaggagaagc aggacccata caaggatatg 1320 agtttctggg aggtgaacct gaaagagaag ttctcct ctg aactggacca gtttccactg 1380 ggcaggaagt tcctgctcca atctggctac aaagccaagc caaaactgaa aagggctgcc 1440 ccaaccagca ccaggacctc ctctgccaag aggaagaagg tgaagaagta aa 1492 <![CDATA[ <210> 18]]>
<![CDATA[ <211> 1525]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1105 Synthetic HPV11L1 gene]]>
<![CDATA[ <400> 18]]> ctgggtacca tgtggagacc atctgacagc acagtctatg tgcctcctcc aaaccctgtg 60 agcaaggtgg tggctacaga tgcctatgtg aagaggacca acatcttcta ccatgcctcc 120 tccagcagac tgctggctgt gggacaccca t actacagca tcaagaaggt gaacaagaca 180 gtggtgccaa aggtgtctgg ctaccaatac agggtgttca aggtggtgct gcctgaccca 240 aacaagtttg ccctgcctga ctcctccctg tttgacccaa ccacccagag actggtgtgg 300 gcttgtactg gattgga ggt gggcagggga caaccactgg gagtgggagt gtctggacac 360 ccactgctga acaaatatga tgatgtggag aactctggag gctatggagg caaccctgga 420 caagacaaca gggtgaatgt ggggatggac tacaagcaga cccaactttg tatggtgggc 480 tgtgcccctc cactgggaga acactgg ggc aagggcaccc agtgtagcaa cacctctgtc 540 cagaatggag actgtcctcc attggaactg attacctctg tgattcagga tggagatatg 600 gtggacacag gctttggagc tatgaacttt gctgacctcc aaaccaacaa gtctgatgtg 660 ccactggaca tctgtggcac agtgtgtaaa taccctgact acctccaaat ggctgctgac 720 ccatatggag acagactgtt cttctacctg aggaaggaac agatgtttgc cagacacttc 780 ttcaacaggg ctggcacagt gggagaacct gtgcctgatg acctgctggt gaagg gaggc 840 aacaacaggt cctctgtggc atccagcatc tatgtgcata caccatctgg ctccctggtg 900 tcctctgagg ctcaactttt caacaagcca tactggctcc aaaaggctca aggacacaac 960 aatggcatct gttggggcaa ccacctgttt gtgacagtgg tggac accac caggagcacc 1020 aatatgaccc tgtgtgcctc tgtgagcaag tctgccacct acaccaactc tgactacaag 1080 gaatatatga ggcatgtgga ggagtttgac ctccaattca tcttccaact ttgtagcatc 1140 accctgtctg ctgaggtgat ggctta catc cacacaatga acccatctgt gttggaggac 1200 tggaactttg gactgagccc tcctccaaat ggcaccttgg aggacaccta cagatatgtc 1260 cagagccagg ctatcacttg tcagaagcca acacctgaga aggagaagca ggacccatac 1320 aaggatatga gtttct ggga ggtgaacctg aaagagaagt tctcctctga actggaccag 1380 tttccactgg gcaggaagtt cctgctccaa tctggctaca ggggcaggac ctctgccagg 1440 acaggcatca agagacctgc tgtgagcaag ccaagcacag ccccaaagag gaagaggacc 1500 aagac caaga agtaaactcg agctc 1525 <![CDATA[ <210> 19]]>
<![CDATA[ <211> 1519]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1106 Synthetic HPV33L1 gene]]>
<![CDATA[ <400> 19]]> ctgggtacca tgagtgtgtg gagaccatct gaggctacag tctacctgcc tcctgtgcct 60 gtgagcaagg tggtgagcac agatgaatat gtgagcagga ccagcatcta ctactatgct 120 ggctccagca gactgctggc tgtgggacac ccatact tca gcatcaagaa cccaaccaat 180 gccaagaaac tgctggtgcc aaaggtgtct ggactccaat acagggtgtt cagggtgaga 240 ctgcctgacc caaacaagtt tggctttcct gacacctcct tctacaaccc tgacacccag 300 agactggtgt gggcttgtg t gggattggag attggcaggg gacaaccact gggagtgggc 360
atctctggac acccactgct gaacaagttt gatgacacag agacaggcaa caaataccct 420
ggacaacctg gagcagacaa cagggagtgt ctgagtatgg actacaagca gacccaactt 480
tgtctgctgg gctgtaagcc tccaacagga gaacactggg gcaagggagt ggcttgtacc 540
aatgctgccc ctgccaatga ctgtcctcca ttggaactga taaacaccat cattgaggat 600
ggagatatgg tggacacagg ctttggctgt atggacttca agaccctcca agccaacaag 660
tctgatgtgc caattgacat ctgtggcagc acttgtaaat accctgacta cctgaaaatg 720 acctctgaac catatggaga ctccctgttc ttcttcctga gggagggaaca gatgtttgtg 780 agacacttct tcaacagggc tggcaccctg ggagaggctg tgcctgatga cctctacatc 840 a agggctctg gcaccacagc cagcatccag tcctctgcct tctttccaac accatctggc 900 agtatggtga cctctgagag ccaacttttc aacaagccat actggctcca aagggctcaa 960 ggacacaaca atggcatctg ttggggcaac caggtgtttg tgacagtggt ggacaccacc 1020 aggagcacca atatgaccct gtgtacccag gtgacctctg acagcaccta caagaatgag 1080 aacttcaagg aatacatcag gcatgtggag gaatatgacc tccaatttgt gttccaactt 1140 tgtaaggtga ccctgacagc agaggtgatg acctacatcc atgctatgaa ccctgacatc 1200 ttggaggact ggcagtttgg actgacacct cctccatctg cctccctcca agacacctac 1260 aggtttgtga ccagccaggc tatcacttgt cagaagacag tgcctccaaa ggagaaggag 1320 gacccactgg gcaaatacac cttctgggag gtggacctga aaga gaagtt ctctgctgac 1380 ctggaccagt ttccactggg caggaagttc ctgctccaag caggactgaa agccaagcca 1440 aaactgaaaa gggctgcccc aaccagcacc aggacctcct ctgccaagag gaagaaggtg 1500 aagaagtaaa ctcgagctc 1519 <![CDATA[ <210> 20]]>
<![CDATA[ <211> 35]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1107 HPV11L1 F1]]> <![CDATA[ <400> 20]]> cttggtacca tgtggagacc atctgacagc acagt 35 <![CDATA[ <21]]>0> 21]]>
<br/> <![CDATA[ <211>36]]> <br/> <![CDATA[ <212>DNA]]> <br/> <![CDATA[ <213> Artificial sequence]]>
<br/>
<br/>
<br/> <![CDATA[ <220>]]>
<br/> <![CDATA[ <223> 1108 HPV11L1 R1]]> <br/>
<br/> <![CDATA[ <400>21]]> <br/> <![CDATA[gcttggcttt gtagccagat tggagcagga acttcc 36 <![CDATA[ <210> 22]]>
<![CDATA[ <211> 1429]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1109 HPV11L1 expansion sequence 1]]>
<![CDATA[ <400> 22]]> cttggtacca tgtggagacc atctgacagc acagtctatg tgcctcctcc aaaccctgtg 60 agcaaggtgg tggctacaga tgcctatgtg aagaggacca acatcttcta ccatgcctcc 120 tccagcagac tgctggctgt gggacaccca t actacagca tcaagaaggt gaacaagaca 180 gtggtgccaa aggtgtctgg ctaccaatac agggtgttca aggtggtgct gcctgaccca 240 aacaagtttg ccctgcctga ctcctccctg tttgacccaa ccacccagag actggtgtgg 300 gcttgtactg gattgga ggt gggcagggga caaccactgg gagtgggagt gtctggacac 360 ccactgctga acaaatatga tgatgtggag aactctggag gctatggagg caaccctgga 420 caagacaaca gggtgaatgt ggggatggac tacaagcaga cccaactttg tatggtgggc 480 tgtgcccctc cactgggaga acactgg ggc aagggcaccc agtgtagcaa cacctctgtc 540 cagaatggag actgtcctcc attggaactg attacctctg tgattcagga tggagatatg 600 gtggacacag gctttggagc tatgaacttt gctgacctcc aaaccaacaa gtctgatgtg 660 ccactggaca tctgtggcac agtgtgtaaa taccctgact acctccaaat ggctgctgac 720 ccatatggag acagactgtt cttctacctg aggaaggaac agatgtttgc cagacacttc 780 ttcaacaggg ctggcacagt gggagaacct gtgcctgatg acctgctggt gaagg gaggc 840 aacaacaggt cctctgtggc atccagcatc tatgtgcata caccatctgg ctccctggtg 900 tcctctgagg ctcaactttt caacaagcca tactggctcc aaaaggctca aggacacaac 960 aatggcatct gttggggcaa ccacctgttt gtgacagtgg tggac accac caggagcacc 1020 aatatgaccc tgtgtgcctc tgtgagcaag tctgccacct acaccaactc tgactacaag 1080 gaatatatga ggcatgtgga ggagtttgac ctccaattca tcttccaact ttgtagcatc 1140 accctgtctg ctgaggtgat ggctta catc cacacaatga acccatctgt gttggaggac 1200 tggaactttg gactgagccc tcctccaaat ggcaccttgg aggacaccta cagatatgtc 1260 cagagccagg ctatcacttg tcagaagcca acacctgaga aggagaagca ggacccatac 1320 aaggatatga gtttct ggga ggtgaacctg aaagagaagt tctcctctga actggaccag 1380 tttccactgg gcaggaagtt cctgctccaa tctggctaca aagccaagc 1429 <![CDATA[ <210> 23]]>
<![CDATA[ <211> 35]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1110 HPV11L1 F2]]> <![CDATA[ <400> 23]]> atctggctac aaagccaagc caaaactgaa aaggg 35 <![CDATA[ <210> 24]]>
<![CDATA[ <211]]>> 37]]>
<br/> <![CDATA[ <212>DNA]]> <br/> <![CDATA[ <213> Artificial sequence]]>
<br/>
<br/>
<br/> <![CDATA[ <220>]]>
<br/> <![CDATA[ <223> 1111 HPV11L1 R2]]> <br/>
<br/> <![CDATA[ <400>24]]> <br/> <![CDATA[ctgtctagat ttacttcttc accttcttcc tcttggc 37 <![CDATA[ <210> 25]]>
<![CDATA[ <211> 101]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1112 HPV11L1 expansion sequence 2]]>
<![CDATA[ <400> 25]]> atctggctac aaagccaagc caaaactgaa aagggctgcc ccaaccagca ccaggacctc 60 ctctgccaag aggaagaagg tgaagaagta aatctagaca g 101 <![CDATA[ <210> 26]]>
<![CDATA[ <211> 38]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1113 HPV 59 L1 protein amino acid sequence 471-508]]>
<![CDATA[ <400> 26]]> Leu Gln Leu Gly Ala Arg Pro Lys Pro Thr Ile Gly Pro Arg Lys Arg 1 5 10 15 Ala Ala Pro Ala Pro Thr Ser Thr Pro Ser Pro Lys Arg Val Lys Arg 20 25 30 Arg Lys Ser Ser Arg Lys 35 <![CDATA[ <210> 27]]>
<![CDATA[ <211> 474]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1601 HPV type 16 L1 protein amino acid sequence 1-474]]>
<![CDATA[ <400> 27]]> Met Ser Leu Trp Leu Pro Ser Glu Ala Thr Val Tyr Leu Pro Pro Val 1 5 10 15 Pro Val Ser Lys Val Val Ser Thr Asp Glu Tyr Val Ala Arg Thr Asn 20 25 30 Ile Tyr Tyr His Ala Gly Thr Ser Arg Leu Leu Ala Val Gly His Pro 35 40 45 Tyr Phe Pro Ile Lys Lys Pro Asn Asn Asn Lys Ile Leu Val Pro Lys 50 55 60 Val Ser Gly Leu Gln Tyr Arg Val Phe Arg Ile His Leu Pro Asp Pro 65 70 75 80 Asn Lys Phe Gly Phe Pro Asp Thr Ser Phe Tyr Asn Pro Asp Thr Gln 85 90 95 Arg Leu Val Trp Ala Cys Val Gly Val Glu Val Gly Arg Gly Gln Pro 100 105 110 Leu Gly Val Gly Ile Ser Gly His Pro Leu Leu Asn Lys Leu Asp Asp 115 120 125 Thr Glu Asn Ala Ser Ala Tyr Ala Ala Asn Ala Asn Val Ala Val Asn Pro Gly Asp Cys Pro Pro Leu Glu Leu Ile Asn 180 185 190 Thr Val Ile Gln Asp Gly Asp Met Val Asp Thr Gly Phe Gly Ala Met 195 200 205 Asp Phe Thr Thr Leu Gln Ala Asn Lys Ser Glu Val Pro Leu Asp Ile 210 215 220 Cys Thr Ser Ile Cys Lys Tyr Pro Asp Tyr Ile Lys Met Val Ser Glu 225 230 235 240 Pro Tyr Gly Asp Ser Leu Phe Phe Tyr Leu Arg Arg Glu Gln Met Phe 245 250 255 Val Arg His Leu Phe Asn Arg Ala Gly Ala Val Gly Glu Asn Val Pro 260 265 270 Asp Asp Leu Tyr Ile Lys Gly Ser Gly Ser Thr Ala Asn Leu Ala Ser 275 280 285 Ser Asn Tyr Phe Pro Thr Pro Ser Gly Ser Met Val Thr Ser Asp Ala 290 295 300 Gln Ile Phe Asn Lys Pro Tyr Trp Leu Gln Arg Ala Gln Gly His Asn 305 310 315 320 Asn Gly Ile Cys Trp Gly Asn Gln Leu Phe Val Thr Val Val Asp Thr 325 330 335 Thr Arg Ser Thr Asn Met Ser Leu Cys Ala Ala Ile Ser Thr Ser Glu 340 345 350 Thr Thr Tyr Lys Asn Thr Asn Phe Lys Glu Tyr Leu Arg His Gly Glu 355 360 365 Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Lys Ile Thr Leu Thr 370 375 380 Ala Asp Val Met Thr Tyr Ile His Ser Met Asn Ser Thr Ile Leu Glu 385 390 395 400 Asp Trp Asn Phe Gly Leu Gln Pro Pro Pro Pro Gly Gly Thr Leu Glu Asp 405 410 415 Thr Tyr Arg Phe Val Thr Ser Gln Ala Ile Ala Cys Gln Lys His Thr 420 425 430 Pro Pro Ala Pro Lys Glu Asp Pro Leu Lys Lys Tyr Thr Phe Trp Glu 435 440 445 Val Asn Leu Lys Glu Lys Phe Ser Ala Asp Leu Asp Gln Phe Pro Leu 450 455 460 Gly Arg Lys Phe Leu Leu Gln Ala Gly Leu 465 470 <![CDATA[ <210> 28]]>
<![CDATA[ <211> 26]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1602 HPV type 33 L1 protein amino acid sequence 474-499]]>
<![CDATA[ <400> 28]]> Lys Ala Lys Pro Lys Leu Lys Arg Ala Ala Pro Thr Ser Thr Arg Thr 1 5 10 15 Ser Ser Ala Lys Arg Lys Lys Val Lys Lys 20 25 <![CDATA[ <210> 29]]>
<![CDATA[ <211> 500]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1603 Amino acid sequence of chimeric HPV type 16 L1 protein]]>
<![CDATA[ <400> 29]]> Met Ser Leu Trp Leu Pro Ser Glu Ala Thr Val Tyr Leu Pro Pro Val 1 5 10 15 Pro Val Ser Lys Val Val Ser Thr Asp Glu Tyr Val Ala Arg Thr Asn 20 25 30 Ile Tyr Tyr His Ala Gly Thr Ser Arg Leu Leu Ala Val Gly His Pro 35 40 45 Tyr Phe Pro Ile Lys Lys Pro Asn Asn Asn Lys Ile Leu Val Pro Lys 50 55 60 Val Ser Gly Leu Gln Tyr Arg Val Phe Arg Ile His Leu Pro Asp Pro 65 70 75 80 Asn Lys Phe Gly Phe Pro Asp Thr Ser Phe Tyr Asn Pro Asp Thr Gln 85 90 95 Arg Leu Val Trp Ala Cys Val Gly Val Glu Val Gly Arg Gly Gln Pro 100 105 110 Leu Gly Val Gly Ile Ser Gly His Pro Leu Leu Asn Lys Leu Asp Asp 115 120 125 Thr Glu Asn Ala Ser Ala Tyr Ala Ala Asn Ala Asn Val Ala Val Asn Pro Gly Asp Cys Pro Pro Leu Glu Leu Ile Asn 180 185 190 Thr Val Ile Gln Asp Gly Asp Met Val Asp Thr Gly Phe Gly Ala Met 195 200 205 Asp Phe Thr Thr Leu Gln Ala Asn Lys Ser Glu Val Pro Leu Asp Ile 210 215 220 Cys Thr Ser Ile Cys Lys Tyr Pro Asp Tyr Ile Lys Met Val Ser Glu 225 230 235 240 Pro Tyr Gly Asp Ser Leu Phe Phe Tyr Leu Arg Arg Glu Gln Met Phe 245 250 255 Val Arg His Leu Phe Asn Arg Ala Gly Ala Val Gly Glu Asn Val Pro 260 265 270 Asp Asp Leu Tyr Ile Lys Gly Ser Gly Ser Thr Ala Asn Leu Ala Ser 275 280 285 Ser Asn Tyr Phe Pro Thr Pro Ser Gly Ser Met Val Thr Ser Asp Ala 290 295 300 Gln Ile Phe Asn Lys Pro Tyr Trp Leu Gln Arg Ala Gln Gly His Asn 305 310 315 320 Asn Gly Ile Cys Trp Gly Asn Gln Leu Phe Val Thr Val Val Asp Thr 325 330 335 Thr Arg Ser Thr Asn Met Ser Leu Cys Ala Ala Ile Ser Thr Ser Glu 340 345 350 Thr Thr Tyr Lys Asn Thr Asn Phe Lys Glu Tyr Leu Arg His Gly Glu 355 360 365 Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Lys Ile Thr Leu Thr 370 375 380 Ala Asp Val Met Thr Tyr Ile His Ser Met Asn Ser Thr Ile Leu Glu 385 390 395 400 Asp Trp Asn Phe Gly Leu Gln Pro Pro Pro Pro Gly Gly Thr Leu Glu Asp 405 410 415 Thr Tyr Arg Phe Val Thr Ser Gln Ala Ile Ala Cys Gln Lys His Thr 420 425 430 Pro Pro Ala Pro Lys Glu Asp Pro Leu Lys Lys Tyr Thr Phe Trp Glu 435 440 445 Val Asn Leu Lys Glu Lys Phe Ser Ala Asp Leu Asp Gln Phe Pro Leu 450 455 460 Gly Arg Lys Phe Leu Leu Gln Ala Gly Leu Lys Ala Lys Pro Lys Leu 465 470 475 480 Lys Arg Ala Ala Pro Thr Ser Thr Arg Thr Ser Ser Ala Lys Arg Lys 485 490 495 Lys Val Lys Lys 500 <![CDATA[ <210> 30]]>
<![CDATA[ <211>]]> 1504
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1604 Nucleotide sequence of chimeric HPV type 16 L1 protein]]>
<![CDATA[ <400> 30]]> atgagtctgt ggctgccatc tgaggctaca gtctacctgc ctcctgtgcc tgtgagcaag 60 gtggtgagca cagatgaata tgtggcaagg accaacatct actaccatgc tggcaccagc 120 agactgctgg ctgtgggaca cccatacttt ccaatca aga agccaaacaa caacaagatt 180 ctggtgccaa aggtgtctgg actccaatac agggtgttca ggattcacct gcctgaccca 240 aacaagtttg gctttcctga cacctccttc tacaaccctg acacccag actggtgtgg 300 gcttgtgtgg gagtggaggt gggca gggga caaccactgg gagtgggcat ctctggacac 360 ccactgctga acaaactgga tgacacagag aatgcctctg cctatgctgc caatgctgga 420 gtggacaaca gggagtgtat cagtatggac tacaagcaga cccaactttg tctgattggc 480 tgtaagcctc caattggaga acactggggc aag ggcagcc catgtaccaa tgtggctgtg 540 aaccctggag actgtcctcc attggaactg ataaacacag tgattcagga tggagatatg 600 gtggacacag gctttggagc tatggacttc accaccctcc aagccaacaa gtctgaggtg 660 ccactggaca tctgtaccag catctgtaaa taccctgact acatcaagat ggtgtctgaa 720 ccatatggag actccctgtt cttctacctg aggagggaac agatgtttgt gagacacctg 780 ttcaacaggg ctggagcagt gggagagaat gtgcctgatg acctctacat caagggctct 840 ggcagcacag ccaac ctggc atccagcaac tactttccaa caccatctgg cagtatggtg 900 acctctgatg cccagatttt caacaagcca tactggctcc aaagggctca aggacacaac 960 aatggcatct gttggggcaa ccaacttttt gtgacagtgg tggacaccac caggagcacc 1020 aatatgagtc tgtgtgctgc catcagcacc tctgagacca cctacaagaa caccaacttc 1080 aaggaatacc tgagacatgg agaggaatat gacctccaat tcatcttcca actttgtaag 1140 attaccctga cagcagatgt gatgacctac atccacagta tgaacagcac catcttggag 1200 gact ggaact ttggactcca acctcctcct ggaggcacct tggaggacac ctacaggttt 1260 gtgaccagcc aggctattgc ctgtcagaaa cacacacctc ctgccccaaa ggaggaccca 1320 ctgaaaaaat acaccttctg ggaggtgaac ctgaaagaga agttctctgc t gacctggac 1380 cagtttccac tgggcaggaa gttcctgctc caagcaggac tgaaagccaa gccaaaactg 1440 aaaagggctg ccccaaccag caccaggacc tcctctgcca agaggaagaa ggtgaagaag 1500 taaa 1504 <![CDATA[ <210> 31]]>
<![CDATA[ <211> 1537]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1605 Synthetic HPV16L1 gene]]>
<![CDATA[ <400> 31]]> ctgggtacca tgagtctgtg gctgccatct gaggctacag tctacctgcc tcctgtgcct 60 gtgagcaagg tggtgagcac agatgaatat gtggcaagga ccaacatcta ctaccatgct 120 ggcaccagca gactgctggc tgtgggacac ccatact ttc caatcaagaa gccaaacaac 180 aacaagattc tggtgccaaa ggtgtctgga ctccaataca gggtgttcag gattcacctg 240 cctgacccaa acaagtttgg ctttcctgac acctccttct acaaccctga cacccagaga 300 ctggtgtggg cttgtg tggg agtggaggtg ggcaggggac aaccactggg agtgggcatc 360 tctggacacc cactgctgaa caaactggat gacacagaga atgcctctgc ctatgctgcc 420 aatgctggag tggacaacag ggagtgtatc agtatggact acaagcagac ccaactttgt 480 ctgattggct g taagcctcc aattggagaa cactggggca agggcagccc atgtaccaat 540 gtggctgtga accctggaga ctgtcctcca ttggaactga taaacacagt gattcaggat 600 ggagatatgg tggacacagg ctttggagct atggacttca ccaccctcca agccaacaag 660 tctgaggt gc cactggacat ctgtaccagc atctgtaaat accctgacta catcaagatg 720 gtgtctgaac catatggaga ctccctgttc ttctacctga gggagggaaca gatgtttgtg 780 agacacctgt tcaacagggc tggagcagtg ggagagaatg tgcctgatga cctctacatc 840 aaggg ctctg gcagcacagc caacctggca tccagcaact actttccaac accatctggc 900 agtatggtga cctctgatgc ccagattttc aacaagccat actggctcca aagggctcaa 960 ggacacaaca atggcatctg ttggggcaac caactttttg tgacagtggt ggacaccacc 1020 aggagcacca atatgagtct gtgtgctgcc atcagcacct ctgagaccac ctacaagaac 1080 accaacttca aggaatacct gagacatgga gaggaatatg acctccaatt catcttccaa 1140 ctttgtaaga ttaccctgac agcagatgtg atgacctaca tccacagtat gaacagcacc 1200 atcttgg agg actggaactt tggactccaa cctcctcctg gaggcacctt ggaggacacc 1260 tacaggtttg tgaccagcca ggctattgcc tgtcagaaac acacacctcc tgccccaaag 1320 gaggacccac tgaaaaaata caccttctgg gaggtgaacc tgaaagagaa gttctct gct 1380 gacctggacc agtttccact gggcaggaag ttcctgctcc aagcaggact gaaagccaag 1440 ccaaagttca ccctgggcaa gaggaaggct acaccaacca cctccagcac cagcaccaca 1500 gccaagagga agaagaggaa actgtaaact cgagctc 1537 <![CDATA[ <210> 32]]>
<![CDATA[ <211> 1519]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1606 Synthetic HPV33L1 gene]]>
<![CDATA[ <400> 32]]> ctgggtacca tgagtgtgtg gagaccatct gaggctacag tctacctgcc tcctgtgcct 60 gtgagcaagg tggtgagcac agatgaatat gtgagcagga ccagcatcta ctactatgct 120 ggctccagca gactgctggc tgtgggacac ccatact tca gcatcaagaa cccaaccaat 180 gccaagaaac tgctggtgcc aaaggtgtct ggactccaat acagggtgtt cagggtgaga 240 ctgcctgacc caaacaagtt tggctttcct gacacctcct tctacaaccc tgacacccag 300 agactggtgt gggcttgtg t gggattggag attggcaggg gacaaccact gggagtgggc 360
atctctggac acccactgct gaacaagttt gatgacacag agacaggcaa caaataccct 420
ggacaacctg gagcagacaa cagggagtgt ctgagtatgg actacaagca gacccaactt 480
tgtctgctgg gctgtaagcc tccaacagga gaacactggg gcaagggagt ggcttgtacc 540
aatgctgccc ctgccaatga ctgtcctcca ttggaactga taaacaccat cattgaggat 600
ggagatatgg tggacacagg ctttggctgt atggacttca agaccctcca agccaacaag 660
tctgatgtgc caattgacat ctgtggcagc acttgtaaat accctgacta cctgaaaatg 720 acctctgaac catatggaga ctccctgttc ttcttcctga gggagggaaca gatgtttgtg 780 agacacttct tcaacagggc tggcaccctg ggagaggctg tgcctgatga cctctacatc 840 a agggctctg gcaccacagc cagcatccag tcctctgcct tctttccaac accatctggc 900 agtatggtga cctctgagag ccaacttttc aacaagccat actggctcca aagggctcaa 960 ggacacaaca atggcatctg ttggggcaac caggtgtttg tgacagtggt ggacaccacc 1020 aggagcacca atatgaccct gtgtacccag gtgacctctg acagcaccta caagaatgag 1080 aacttcaagg aatacatcag gcatgtggag gaatatgacc tccaatttgt gttccaactt 1140 tgtaaggtga ccctgacagc agaggtgatg acctacatcc atgctatgaa ccctgacatc 1200 ttggaggact ggcagtttgg actgacacct cctccatctg cctccctcca agacacctac 1260 aggtttgtga ccagccaggc tatcacttgt cagaagacag tgcctccaaa ggagaaggag 1320 gacccactgg gcaaatacac cttctgggag gtggacctga aaga gaagtt ctctgctgac 1380 ctggaccagt ttccactggg caggaagttc ctgctccaag caggactgaa agccaagcca 1440 aaactgaaaa gggctgcccc aaccagcacc aggacctcct ctgccaagag gaagaaggtg 1500 aagaagtaaa ctcgagctc 1519 <![CDATA[ <210> 33]]>
<![CDATA[ <211> 34]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1607 HPV16L1 F1]]> <![CDATA[ <400> 33]]> cttggtacca tgagtctgtg gctgccatct gagg 34 <![CDATA[ <210> 34]]>
<![CDATA[ <211> 36]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1608 HPV16L1 R1]]> <![CDATA[ <400> 34]]> gcttggcttt cagtcctgct tggagcagga acttcc 36 <![CDATA[ <210> 35]]>
<![CDATA[ <211> 1441]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1609 HPV16L1 expansion sequence 1]]>
<![CDATA[ <400> 35]]> cttggtacca tgagtctgtg gctgccatct gaggctacag tctacctgcc tcctgtgcct 60 gtgagcaagg tggtgagcac agatgaatat gtggcaagga ccaacatcta ctaccatgct 120 ggcaccagca gactgctggc tgtgggacac ccatact ttc caatcaagaa gccaaacaac 180 aacaagattc tggtgccaaa ggtgtctgga ctccaataca gggtgttcag gattcacctg 240 cctgacccaa acaagtttgg ctttcctgac acctccttct acaaccctga cacccagaga 300 ctggtgtggg cttgtg tggg agtggaggtg ggcaggggac aaccactggg agtgggcatc 360 tctggacacc cactgctgaa caaactggat gacacagaga atgcctctgc ctatgctgcc 420 aatgctggag tggacaacag ggagtgtatc agtatggact acaagcagac ccaactttgt 480 ctgattggct g taagcctcc aattggagaa cactggggca agggcagccc atgtaccaat 540 gtggctgtga accctggaga ctgtcctcca ttggaactga taaacacagt gattcaggat 600 ggagatatgg tggacacagg ctttggagct atggacttca ccaccctcca agccaacaag 660 tctgaggt gc cactggacat ctgtaccagc atctgtaaat accctgacta catcaagatg 720 gtgtctgaac catatggaga ctccctgttc ttctacctga gggagggaaca gatgtttgtg 780 agacacctgt tcaacagggc tggagcagtg ggagagaatg tgcctgatga cctctacatc 840 aaggg ctctg gcagcacagc caacctggca tccagcaact actttccaac accatctggc 900 agtatggtga cctctgatgc ccagattttc aacaagccat actggctcca aagggctcaa 960 ggacacaaca atggcatctg ttggggcaac caactttttg tgacagtggt ggacaccacc 1020 aggagcacca atatgagtct gtgtgctgcc atcagcacct ctgagaccac ctacaagaac 1080 accaacttca aggaatacct gagacatgga gaggaatatg acctccaatt catcttccaa 1140 ctttgtaaga ttaccctgac agcagatgtg atgacctaca tccacagtat gaacagcacc 1200 atcttgg agg actggaactt tggactccaa cctcctcctg gaggcacctt ggaggacacc 1260 tacaggtttg tgaccagcca ggctattgcc tgtcagaaac acacacctcc tgccccaaag 1320 gaggacccac tgaaaaaata caccttctgg gaggtgaacc tgaaagagaa gttctct gct 1380 gacctggacc agtttccact gggcaggaag ttcctgctcc aagcaggact gaaagccaag 1440 c 1441 <![CDATA[ <210> 36]]>
<![CDATA[ <211> 35]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1610 HPV16L1 F2]]> <![CDATA[ <400> 36]]> agcaggactg aaagccaagc caaaactgaa aaggg 35 <![CDATA[ <210> 37]]>
<![CDATA[ <211> 36]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1611 HPV16L1 R2]]> <![CDATA[ <400> 37]]> ctgtctagat ttacttcttc accttcttcc tcttgg 36 <![CDATA[ <210> 38]]>
<![CDATA[ <211> 101]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1612 HPV16L1 expansion sequence 2]]>
<![CDATA[ <400> 38]]> agcaggactg aaagccaagc caaaactgaa aagggctgcc ccaaccagca ccaggacctc 60 ctctgccaag aggaagaagg tgaagaagta aatctagaca g 101 <![CDATA[ <210> 39]]>
<![CDATA[ <211> 38]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1613 HPV 59 L1 protein amino acid sequence 471-508]]>
<![CDATA[ <400> 39]]> Leu Gln Leu Gly Ala Arg Pro Lys Pro Thr Ile Gly Pro Arg Lys Arg 1 5 10 15 Ala Ala Pro Ala Pro Thr Ser Thr Pro Ser Pro Lys Arg Val Lys Arg 20 25 30 Arg Lys Ser Ser Arg Lys 35 <![CDATA[ <210> 40]]>
<![CDATA[ <211> 470]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1801 HPV type 18 L1 protein amino acid sequence 1-470]]>
<![CDATA[ <400> 40]]> Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro 1 5 10 15 Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser 20 25 30 Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro 35 40 45 Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys 50 55 60 Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro 65 70 75 80 Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln 85 90 95 Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro 100 105 110 Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp 115 120 125 Thr Glu Ser Ser His Ala Ala Thr Ser Asn Val Ser Glu Asp Val Arg 130 135 140 Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly 145 150 155 160 Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys 165 170 175 Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn 180 185 190 Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met 195 200 205 Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile 210 215 220 Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp 225 230 235 240 Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe 245 250 255 Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro 260 265 270 Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser 275 280 285 Cys Val Tyr Ser Pro Ser Pro Pro Ser Gly Ser Ile Val Thr Ser Asp Ser 290 295 300 Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn 305 310 315 320 Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr 325 330 335 Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val 340 345 350 Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val 355 360 365 Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu 370 375 380 Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu 385 390 395 400 Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val 405 410 415 Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Asp 420 425 430 Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu Lys Phe Trp 435 440 445 Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp Gln Tyr Pro 450 455 460 Leu Gly Arg Lys Phe Leu 465 470 <![CDATA[ <210> 41]]>
<![CDATA[ <211> 26]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1802 HPV 33 L1 protein amino acid sequence 474-499]]>
<![CDATA[ <400> 41]]> Lys Ala Lys Pro Lys Leu Lys Arg Ala Ala Pro Thr Ser Thr Arg Thr 1 5 10 15 Ser Ser Ala Lys Arg Lys Lys Val Lys Lys 20 25 <![CDATA[ <210> 42]]>
<![CDATA[ <211> 496]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> ]]>Artificial sequence
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1803 Amino acid sequence of chimeric HPV type 18 L1 protein]]>
<![CDATA[ <400> 42]]> Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro 1 5 10 15 Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser 20 25 30 Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro 35 40 45 Tyr Phe Arg Val Pro Ala Gly Gly Gly Asn Lys Gln Asp Ile Pro Lys 50 55 60 Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro 65 70 75 80 Asn Lys Phe Gly Leu Pro Asp Thr Ser Ile Tyr Asn Pro Glu Thr Gln 85 90 95 Arg Leu Val Trp Ala Cys Ala Gly Val Glu Ile Gly Arg Gly Gln Pro 100 105 110 Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp 115 120 125 Thr Glu Ser Ser His Ala Ala Thr Ser Asn Val Ser Glu Asp Val Arg 130 135 140 Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu Gly 145 150 155 160 Cys Ala Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys 165 170 175 Ser Arg Pro Leu Ser Gln Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn 180 185 190 Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met 195 200 205 Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp Ile 210 215 220 Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp 225 230 235 240 Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe 245 250 255 Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro 260 265 270 Gln Ser Leu Tyr Ile Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser 275 280 285 Cys Val Tyr Ser Pro Ser Pro Pro Ser Gly Ser Ile Val Thr Ser Asp Ser 290 295 300 Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn 305 310 315 320 Asn Gly Val Cys Trp His Asn Gln Leu Phe Val Thr Val Val Asp Thr 325 330 335 Thr Arg Ser Thr Asn Leu Thr Ile Cys Ala Ser Thr Gln Ser Pro Val 340 345 350 Pro Gly Gln Tyr Asp Ala Thr Lys Phe Lys Gln Tyr Ser Arg His Val 355 360 365 Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Ile Thr Leu 370 375 380 Thr Ala Asp Val Met Ser Tyr Ile His Ser Met Asn Ser Ser Ile Leu 385 390 395 400 Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val 405 410 415 Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Ile Thr Cys Gln Lys Asp 420 425 430 Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu Lys Phe Trp 435 440 445 Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp Gln Tyr Pro 450 455 460 Leu Gly Arg Lys Phe Leu Lys Ala Lys Pro Lys Leu Lys Arg Ala Ala 465 470 475 480 Pro Thr Ser Thr Arg Thr Ser Ser Ala Lys Arg Lys Lys Val Lys Lys 485 490 495 <![CDATA[ <210> 43]]>
<![CDATA[ <211> 1492]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1804 Nucleotide sequence of chimeric HPV type 18 L1 protein]]>
<![CDATA[ <400> 43]]> atggccctct ggagaccatc cgataacaca gtgtacttgc ccccacccag cgtcgcccgg 60 gtggtgaaca cagacgacta cgtcaccaga acctcaatct tctaccacgc cgggtccagc 120 cggctgctga ccgtgggcaa cccctacttc cgcgt gcccg ccggcggcgg aaacaaacaa 180 gacatcccca aagtcagcgc ctatcagtac cgggtgttcc gcgtccaact gcccgatccc 240 aacaagttcg gcctgcccga cacctccatc tacaacccg agacccag gctggtctgg 300 gcttgcgccg gc gtcgagat cgggaggggc caacccctgg gcgtggggtt gtccggccac 360 cccttctaca acaagctgga cgataccgag tccagccacg cagcaaccag caacgtctcc 420 gaagatgtgc gcgataacgt cagcgtggac tacaaacaaa cccaactgtg catcctggga 480 tgcgc acccg ccatcggcga gcattgggcc aaggggaccg cctgcaagag caggcccctg 540 agccaagggg actgtccacc cctggagttg aagaataccg tgctcgagga cggcgacatg 600 gtggacaccg gctacggcgc tatggatttc tccaccctcc aggacaccaa gtgcgaagtg 660 cccctcgaca tctgccaaag catctgcaag taccccgact acctccagat gagcgccgac 720 ccctacggcg acagcatgtt cttctgtctc agaagggaac aattgttcgc ccgccacttc 780 tggaaccggg ccggcacaat gggagataca gtcccccaga gcctgtacat caaggg gacc 840 ggaatgaggg ccagccccgg gtcctgcgtc tacagcccaa gcccctccgg gagcatcgtc 900 acaagcgata gccaactctt caacaagccc tactggctcc acaaagccca aggccacaat 960 aacggggtgt gttggcacaa ccagctgttc gtgaccgtcg tggac acaac caggtccaca 1020 aacctgacca tctgcgccag cacccaaagc cccgtgcccg gccagtacga cgccacaaag 1080 ttcaaacaat actctcggca cgtggaagag tacgacctcc aattcatctt ccaactctgc 1140 accatcaccc tcaccgccga cgtgatgagc ta catccact ccatgaactc ctccatcctg 1200 gaagactgga atttcggcgt gccaccaccc cctaccacct ccctcgtcga cacctacaga 1260 ttcgtgcaga gcgtggccat cacatgccag aaagacgccg cccccgccga gaacaaagac 1320 ccatacgaca aactgaaatt ctgga acgtc gacctgaaag agaaattcag cctggatctg 1380 gaccagtacc cattgggcag gaagttcctc aaagccaagc caaaactgaa aagggctgcc 1440 ccaaccagca ccaggacctc ctctgccaag aggaagaagg tgaagaagta aa 1492 <![CDATA[ <210> 44]]>
<![CDATA[ <211> 1543]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1805 Synthetic HPV18L1 gene]]>
<![CDATA[ [400] gtgcccgc cggcggcgga 180 aacaaacaag acatccccaa agtcagcgcc tatcagtacc gggtgttccg cgtccaactg 240 cccgatccca acaagttcgg cctgcccgac acctccatct acaacccgga gacccagagg 300 ctggtctggg cttgcgccgg cg tcgagatc gggaggggcc aacccctggg cgtggggttg 360 tccggccacc ccttctacaa caagctggac gataccgagt ccagccacgc agcaaccagc 420 aacgtctccg aagatgtgcg cgataacgtc agcgtggact acaaacaaac ccaactgtgc 480 atcctggggat gc gcacccgc catcggcgag cattgggcca aggggaccgc ctgcaagagc 540 aggcccctga gccaagggga ctgtccaccc ctggagttga agaataccgt gctcgaggac 600 ggcgacatgg tggacaccgg ctacggcgct atggatttct ccaccctcca ggacaccaag 660 tgc gaagtgcccctcgacat ctgccaaagc atctgcaagt accccgacta cctccagatg 720 agcgccgacc cctacggcga cagcatgttc ttctgtctca gaagggaaca attgttcgcc 780 cgccacttct ggaaccgggc cggcacaatg ggagatacag tcccccagag cctgtacatc 840 a aggggaccg gaatgagggc cagccccggg tcctgcgtct acagcccaag cccctccggg 900 agcatcgtca caagcgatag ccaactcttc aacaagccct actggctcca caaagcccaa 960 ggccacaata acggggtgtg ttggcacaac cagctgttcg tgaccgtcgt ggaca caacc 1020 aggtccacaa acctgaccat ctgcgccagc acccaaagcc ccgtgcccgg ccagtacgac 1080 gccacaaagt tcaaacaata ctctcggcac gtggaagagt acgacctcca attcatcttc 1140 caactctgca ccatcaccct caccgccgac gtgatgagct acatccactc catga actcc 1200 tccatcctgg aagactggaa tttcggcgtg ccaccacccc ctaccacctc cctcgtcgac 1260 acctacagat tcgtgcagag cgtggccatc acatgccaga aagacgccgc ccccgccgag 1320 aacaaagacc catacgacaa actgaaattc tggaacg tcg acctgaaaga gaaattcagc 1380 ctggatctgg accagtaccc attgggcagg aagttcctcg tgcaagccgg cctcaggaga 1440 aaaccaacaa tcgggcccag gaagaggagc gcccccagcg caaccaccag cagcaagccc 1500 gcaaaaaggg tcagagtgag gg cacgcaaa taaactcgag ctc 1543 <![CDATA[ <210> 45]]>
<![CDATA[ <211> 1519]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1806 Synthetic HPV33L1 gene]]>
<![CDATA[ <400> 45]]> ctgggtacca tgagtgtgtg gagaccatct gaggctacag tctacctgcc tcctgtgcct 60 gtgagcaagg tggtgagcac agatgaatat gtgagcagga ccagcatcta ctactatgct 120 ggctccagca gactgctggc tgtgggacac ccatact tca gcatcaagaa cccaaccaat 180 gccaagaaac tgctggtgcc aaaggtgtct ggactccaat acagggtgtt cagggtgaga 240 ctgcctgacc caaacaagtt tggctttcct gacacctcct tctacaaccc tgacacccag 300 agactggtgt gggcttgtg t gggattggag attggcaggg gacaaccact gggagtgggc 360
atctctggac acccactgct gaacaagttt gatgacacag agacaggcaa caaataccct 420
ggacaacctg gagcagacaa cagggagtgt ctgagtatgg actacaagca gacccaactt 480
tgtctgctgg gctgtaagcc tccaacagga gaacactggg gcaagggagt ggcttgtacc 540
aatgctgccc ctgccaatga ctgtcctcca ttggaactga taaacaccat cattgaggat 600
ggagatatgg tggacacagg ctttggctgt atggacttca agaccctcca agccaacaag 660
tctgatgtgc caattgacat ctgtggcagc acttgtaaat accctgacta cctgaaaatg 720 acctctgaac catatggaga ctccctgttc ttcttcctga gggagggaaca gatgtttgtg 780 agacacttct tcaacagggc tggcaccctg ggagaggctg tgcctgatga cctctacatc 840 a agggctctg gcaccacagc cagcatccag tcctctgcct tctttccaac accatctggc 900 agtatggtga cctctgagag ccaacttttc aacaagccat actggctcca aagggctcaa 960 ggacacaaca atggcatctg ttggggcaac caggtgtttg tgacagtggt ggacaccacc 1020 aggagcacca atatgaccct gtgtacccag gtgacctctg acagcaccta caagaatgag 1080 aacttcaagg aatacatcag gcatgtggag gaatatgacc tccaatttgt gttccaactt 1140 tgtaaggtga ccctgacagc agaggtgatg acctacatcc atgctatgaa ccctgacatc 1200 ttggaggact ggcagtttgg actgacacct cctccatctg cctccctcca agacacctac 1260 aggtttgtga ccagccaggc tatcacttgt cagaagacag tgcctccaaa ggagaaggag 1320 gacccactgg gcaaatacac cttctgggag gtggacctga aaga gaagtt ctctgctgac 1380 ctggaccagt ttccactggg caggaagttc ctgctccaag caggactgaa agccaagcca 1440 aaactgaaaa gggctgcccc aaccagcacc aggacctcct ctgccaagag gaagaaggtg 1500 aagaagtaaa ctcgagctc 1519 <![CDATA[ <210> 46]]>
<![CDATA[ <211> 34]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1807 HPV18L1 F1]]> <![CDATA[ <400> 46]]> cttggtacca tggccctctg gagaccatcc gata 34 <![CDATA[ <210> 47]]>
<![CDATA[ <211> 35]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1808 HPV18L1 R1]]> <![CDATA[ <400> 47]]> gcttggcttt gaggaacttc ctgcccaatg ggtac 35 <![CDATA[ <210> 48]]>
<![CDATA[ <211> 1429]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1809 HPV18L1 expansion sequence 1]]>
<![CDATA[ <400> 48]]> cttggtacca tggccctctg gagaccatcc gataacacag tgtacttgcc cccacccagc 60 gtcgcccggg tggtgaacac agacgactac gtcaccagaa cctcaatctt ctaccacgcc 120 gggtccagcc ggctgctgac cgtgggcaac ccctacttcc gc gtgcccgc cggcggcgga 180 aacaaacaag acatccccaa agtcagcgcc tatcagtacc gggtgttccg cgtccaactg 240 cccgatccca acaagttcgg cctgcccgac acctccatct acaacccgga gacccagagg 300 ctggtctggg cttgcgccgg cg tcgagatc gggaggggcc aacccctggg cgtggggttg 360 tccggccacc ccttctacaa caagctggac gataccgagt ccagccacgc agcaaccagc 420 aacgtctccg aagatgtgcg cgataacgtc agcgtggact acaaacaaac ccaactgtgc 480 atcctggggat gc gcacccgc catcggcgag cattgggcca aggggaccgc ctgcaagagc 540 aggcccctga gccaagggga ctgtccaccc ctggagttga agaataccgt gctcgaggac 600 ggcgacatgg tggacaccgg ctacggcgct atggatttct ccaccctcca ggacaccaag 660 tgc gaagtgcccctcgacat ctgccaaagc atctgcaagt accccgacta cctccagatg 720 agcgccgacc cctacggcga cagcatgttc ttctgtctca gaagggaaca attgttcgcc 780 cgccacttct ggaaccgggc cggcacaatg ggagatacag tcccccagag cctgtacatc 840 a aggggaccg gaatgagggc cagccccggg tcctgcgtct acagcccaag cccctccggg 900 agcatcgtca caagcgatag ccaactcttc aacaagccct actggctcca caaagcccaa 960 ggccacaata acggggtgtg ttggcacaac cagctgttcg tgaccgtcgt ggaca caacc 1020 aggtccacaa acctgaccat ctgcgccagc acccaaagcc ccgtgcccgg ccagtacgac 1080 gccacaaagt tcaaacaata ctctcggcac gtggaagagt acgacctcca attcatcttc 1140 caactctgca ccatcaccct caccgccgac gtgatgagct acatccactc catga actcc 1200 tccatcctgg aagactggaa tttcggcgtg ccaccacccc ctaccacctc cctcgtcgac 1260 acctacagat tcgtgcagag cgtggccatc acatgccaga aagacgccgc ccccgccgag 1320 aacaaagacc catacgacaa actgaaattc tggaacg tcg acctgaaaga gaaattcagc 1380 ctggatctgg accagtaccc attgggcagg aagttcctca aagccaagc 1429 <![CDATA[ <210> 49]]>
<![CDATA[ <211> 35]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1810 HPV18L1 F2]]> <![CDATA[ <400> 49]]> gaagttcctc aaagccaagc caaaactgaa aaggg 35 <![CDATA[ <210> 50]]>
<![CDATA[ <211> 36]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1811 HPV18L1 R2]]> <![CDATA[ <400> 50]]> ctgtctagat ttacttcttc accttcttcc tcttgg 36 <![CDATA[ <210> 51]]>
<![CDATA[ <211> 101]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1812 HPV18L1 expansion sequence 2]]>
<![CDATA[ <400> 51]]> gaagttcctc aaagccaagc caaaactgaa aagggctgcc ccaaccagca ccaggacctc 60 ctctgccaag aggaagaagg tgaagaagta aatctagaca g 101 <![CDATA[ <210> 52]]>
<![CDATA[ <211> 38]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 1813 HPV 59 L1 protein amino acid sequence 471-508]]>
<![CDATA[ <400> 52]]> Leu Gln Leu Gly Ala Arg Pro Lys Pro Thr Ile Gly Pro Arg Lys Arg 1 5 10 15 Ala Ala Pro Ala Pro Thr Ser Thr Pro Ser Pro Lys Arg Val Lys Arg 20 25 30 Arg Lys Ser Ser Arg Lys 35 <![CDATA[ <210> 53]]>
<![CDATA[ <211> 475]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3101 HPV 31 L1 protein amino acid sequence 1-475]]>
<![CDATA[ <400> 53]]> Met Ser Leu Trp Arg Pro Ser Glu Ala Thr Val Tyr Leu Pro Pro Val 1 5 10 15 Pro Val Ser Lys Val Val Ser Thr Asp Glu Tyr Val Thr Arg Thr Asn 20 25 30 Ile Tyr Tyr His Ala Gly Ser Ala Arg Leu Leu Thr Val Gly His Pro 35 40 45 Tyr Tyr Ser Ile Pro Lys Ser Asp Asn Pro Lys Lys Ile Val Val Pro 50 55 60 Lys Val Ser Gly Leu Gln Tyr Arg Val Phe Arg Val Arg Leu Pro Asp 65 70 75 80 Pro Asn Lys Phe Gly Phe Pro Asp Thr Ser Phe Tyr Asn Pro Glu Thr 85 90 95 Gln Arg Leu Val Trp Ala Cys Val Gly Leu Glu Val Gly Arg Gly Gln 100 105 110 Pro Leu Gly Val Gly Ile Ser Gly His Pro Leu Leu Asn Lys Phe Asp 115 120 125 Asp Thr Glu Asn Ser Asn Arg Tyr Ala Gly Gly Pro Gly Thr Asp Asn 130 135 140 Arg Glu Cys Ile Ser Met Asp Tyr Lys Gln Thr Gln Leu Cys Leu Leu 145 150 155 160 Gly Cys Lys Pro Pro Ile Gly Glu His Trp Gly Lys Gly Ser Pro Cys 165 170 175 Ser Asn Asn Ala Ile Thr Pro Gly Asp Cys Pro Pro Leu Glu Leu Lys 180 185 190 Asn Ser Val Ile Gln Asp Gly Asp Met Val Asp Thr Gly Phe Gly Ala 195 200 205 Met Asp Phe Thr Ala Leu Gln Asp Thr Lys Ser Asn Val Pro Leu Asp 210 215 220 Ile Cys Asn Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Lys Met Val Ala 225 230 235 240 Glu Pro Tyr Gly Asp Thr Leu Phe Phe Tyr Leu Arg Arg Glu Gln Met 245 250 255 Phe Val Arg His Phe Phe Asn Arg Ser Gly Thr Val Gly Glu Ser Val 260 265 270 Pro Thr Asp Leu Tyr Ile Lys Gly Ser Gly Ser Thr Ala Thr Leu Ala 275 280 285 Asn Ser Thr Tyr Phe Pro Thr Pro Ser Gly Ser Met Val Thr Ser Asp 290 295 300 Ala Gln Ile Phe Asn Lys Pro Tyr Trp Met Gln Arg Ala Gln Gly His 305 310 315 320 Asn Asn Gly Ile Cys Trp Gly Asn Gln Leu Phe Val Thr Val Val Asp 325 330 335 Thr Thr Arg Ser Thr Asn Met Ser Val Cys Ala Ala Ile Ala Asn Ser 340 345 350 Asp Thr Thr Phe Lys Ser Ser Asn Phe Lys Glu Tyr Leu Arg His Gly 355 360 365 Glu Glu Phe Asp Leu Gln Phe Ile Phe Gln Leu Cys Lys Ile Thr Leu 370 375 380 Ser Ala Asp Ile Met Thr Tyr Ile His Ser Met Asn Pro Ala Ile Leu 385 390 395 400 Glu Asp Trp Asn Phe Gly Leu Thr Thr Pro Pro Ser Gly Ser Leu Glu 405 410 415 Asp Thr Tyr Arg Phe Val Thr Ser Gln Ala Ile Thr Cys Gln Lys Ser 420 425 430 Ala Pro Gln Lys Pro Lys Glu Asp Pro Phe Lys Asp Tyr Val Phe Trp 435 440 445 Glu Val Asn Leu Lys Glu Lys Phe Ser Ala Asp Leu Asp Gln Phe Pro 450 455 460 Leu Gly Arg Lys Phe Leu Leu Gln Ala Gly Tyr 465 470 475 <![CDATA[ <210> 54]]>
<![CDATA[ <211> 26]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3102 HPV 33 L1 protein amino acid sequence 474-499]]>
<![CDATA[ <400> 54]]> Lys Ala Lys Pro Lys Leu Lys Arg Ala Ala Pro Thr Ser Thr Arg Thr 1 5 10 15 Ser Ser Ala Lys Arg Lys Lys Val Lys Lys 20 25 <![CDATA[ <210> 55]]>
<![CDATA[ <211> 501]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3103 Amino acid sequence of chimeric HPV type 31 L1 protein]]>
<![CDATA[ <400> 55]]> Met Ser Leu Trp Arg Pro Ser Glu Ala Thr Val Tyr Leu Pro Pro Val 1 5 10 15 Pro Val Ser Lys Val Val Ser Thr Asp Glu Tyr Val Thr Arg Thr Asn 20 25 30 Ile Tyr Tyr His Ala Gly Ser Ala Arg Leu Leu Thr Val Gly His Pro 35 40 45 Tyr Tyr Ser Ile Pro Lys Ser Asp Asn Pro Lys Lys Ile Val Val Pro 50 55 60 Lys Val Ser Gly Leu Gln Tyr Arg Val Phe Arg Val Arg Leu Pro Asp 65 70 75 80 Pro Asn Lys Phe Gly Phe Pro Asp Thr Ser Phe Tyr Asn Pro Glu Thr 85 90 95 Gln Arg Leu Val Trp Ala Cys Val Gly Leu Glu Val Gly Arg Gly Gln 100 105 110 Pro Leu Gly Val Gly Ile Ser Gly His Pro Leu Leu Asn Lys Phe Asp 115 120 125 Asp Thr Glu Asn Ser Asn Arg Tyr Ala Gly Gly Pro Gly Thr Asp Asn 130 135 140 Arg Glu Cys Ile Ser Met Asp Tyr Lys Gln Thr Gln Leu Cys Leu Leu 145 150 155 160 Gly Cys Lys Pro Pro Ile Gly Glu His Trp Gly Lys Gly Ser Pro Cys 165 170 175 Ser Asn Asn Ala Ile Thr Pro Gly Asp Cys Pro Pro Leu Glu Leu Lys 180 185 190 Asn Ser Val Ile Gln Asp Gly Asp Met Val Asp Thr Gly Phe Gly Ala 195 200 205 Met Asp Phe Thr Ala Leu Gln Asp Thr Lys Ser Asn Val Pro Leu Asp 210 215 220 Ile Cys Asn Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Lys Met Val Ala 225 230 235 240 Glu Pro Tyr Gly Asp Thr Leu Phe Phe Tyr Leu Arg Arg Glu Gln Met 245 250 255 Phe Val Arg His Phe Phe Asn Arg Ser Gly Thr Val Gly Glu Ser Val 260 265 270 Pro Thr Asp Leu Tyr Ile Lys Gly Ser Gly Ser Thr Ala Thr Leu Ala 275 280 285 Asn Ser Thr Tyr Phe Pro Thr Pro Ser Gly Ser Met Val Thr Ser Asp 290 295 300 Ala Gln Ile Phe Asn Lys Pro Tyr Trp Met Gln Arg Ala Gln Gly His 305 310 315 320 Asn Asn Gly Ile Cys Trp Gly Asn Gln Leu Phe Val Thr Val Val Asp 325 330 335 Thr Thr Arg Ser Thr Asn Met Ser Val Cys Ala Ala Ile Ala Asn Ser 340 345 350 Asp Thr Thr Phe Lys Ser Ser Asn Phe Lys Glu Tyr Leu Arg His Gly 355 360 365 Glu Glu Phe Asp Leu Gln Phe Ile Phe Gln Leu Cys Lys Ile Thr Leu 370 375 380 Ser Ala Asp Ile Met Thr Tyr Ile His Ser Met Asn Pro Ala Ile Leu 385 390 395 400 Glu Asp Trp Asn Phe Gly Leu Thr Thr Pro Pro Ser Gly Ser Leu Glu 405 410 415 Asp Thr Tyr Arg Phe Val Thr Ser Gln Ala Ile Thr Cys Gln Lys Ser 420 425 430 Ala Pro Gln Lys Pro Lys Glu Asp Pro Phe Lys Asp Tyr Val Phe Trp 435 440 445 Glu Val Asn Leu Lys Glu Lys Phe Ser Ala Asp Leu Asp Gln Phe Pro 450 455 460 Leu Gly Arg Lys Phe Leu Leu Gln Ala Gly Tyr Lys Ala Lys Pro Lys 465 470 475 480 Leu Lys Arg Ala Ala Pro Thr Ser Thr Arg Thr Ser Ser Ala Lys Arg 485 490 495 Lys Lys Val Lys Lys 500 <![CDATA[ <210> 56]]>
<![CDATA[ <211> 1507]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3104 Nucleotide sequence of chimeric HPV type 31 L1 protein]]>
<![CDATA[ <400> 56]]> atgagcctgt ggaggcccag cgaggccacc gtgtacctgc cccccgtgcc cgtgagcaag 60 gtggtgagca ccgacgagta cgtgaccagg accaacatct actaccacgc cggcagcgcc 120 aggctgctga ccgtgggcca cccctactac agcatcc cca agagcgacaa ccccaagaag 180 atcgtggtgc ccaaggtgag cggcctgcag tacagggtgt tcagggtgag gctgcccgac 240 cccaacaagt tcggcttccc cgacaccagc ttctacaacc ccgagaccca gaggctggtg 300 tgggcctgcg tg ggcctgga ggtgggcagg ggccagcccc tgggcgtggg catcagcggc 360 caccccctgc tgaacaagtt cgacgacacc gagaacagca acaggtacgc cggcggcccc 420 ggcaccgaca acagggagtg catcagcatg gactacaagc agacccagct gtgcctgctg 480 ggctgcaagc cccccatcgg c gagcactgg ggcaagggca gcccctgcag caacaacgcc 540 atcaccccg gcgactgccc ccccctggag ctgaagaaca gcgtgatcca ggacggcgac 600 atggtggaca ccggcttcgg cgccatggac ttcaccgccc tgcaggacac caagagcaac 660 gtgcccctgg acatctgcaa cagcatctgc aagtaccccg actacctgaa gatggtggcc 720 gagccctacg gcgacaccct gttcttctac ctgaggaggg agcagatgtt cgtgaggcac 780 ttcttcaaca ggagcggcac cgtgggcgag agcgtgccca ccgacctgta catca agggc 840 agcggcagca ccgccaccct ggccaacagc acctacttcc ccacccccag cggcagcatg 900 gtgaccagcg acgcccagat cttcaacaag ccctactgga tgcagagggc ccagggccac 960 aacaacggca tctgctgggg caaccagctg ttcgtgaccg tggtggacac caccaggagc 1020 accaacatga gcgtgtgcgc cgccatcgcc aacagcgaca ccaccttcaa gagcagcaac 1080 ttcaaggagt acctgaggca cggcgaggag ttcgacctgc agttcatctt ccagctgtgc 1140 aagatcaccc tgag cgccga catcatgacc tacatccaca gcatgaaccc cgccatcctg 1200 gaggactgga acttcggcct gaccaccccc cccagcggca gcctggagga cacctacagg 1260 ttcgtgacca gccaggccat cacctgccag aagtccgccc cccagaagcc caaggaggac 1320 cccttcaagg actacgt gtt ctgggaggtg aacctgaagg agaagttcag cgccgacctg 1380 gaccagttcc ccctgggcag gaagttcctg ctgcaggccg gctacaaagc caagccaaaa 1440 ctgaaaaggg ctgccccaac cagcaccagg acctcctctg ccaagaggaa gaaggtgaag 15 00 aagtaaa 1507 <![CDATA[ <210> 57]]>
<![CDATA[ <211> 1534]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3105 Synthetic HPV31L1 gene]]>
<![CDATA[ <400> 57]]> ctgggtacca tgagcctgtg gaggcccagc gaggccaccg tgtacctgcc ccccgtgccc 60 gtgagcaagg tggtgagcac cgacgagtac gtgaccagga ccaacatcta ctaccacgcc 120 ggcagcgcca ggctgctgac cgtgggcc ac ccctactaca gcatccccaa gagcgacaac 180 cccaagaaga tcgtggtgcc caaggtgagc ggcctgcagt acagggtgtt cagggtgagg 240 ctgcccgacc ccaacaagtt cggcttcccc gacaccagct tctacaaccc cgagacccag 300 aggctggtgt gggcctg cgt gggcctggag gtgggcaggg gccagcccct gggcgtgggc 360 atcagcggcc accccctgct gaacaagttc gacgacaccg agaacagcaa caggtacgcc 420 ggcggccccg gcaccgacaa cagggagtgc atcagcatgg actacaagca gacccagctg 480 tgcctgctgg gct gcaagcc ccccatcggc gagcactggg gcaagggcag cccctgcagc 540 aacaacgcca tcacccccgg cgactgcccc cccctggagc tgaagaacag cgtgatccag 600 gacggcgaca tggtggacac cggcttcggc gccatggact tcaccgccct gcaggacacc 660 aagagcaacg tgcccctgga catctgcaac agcatctgca agtaccccga ctacctgaag 720 atggtggccg agccctacgg cgacaccctg ttcttctacc tgaggaggga gcagatgttc 780 gtgaggcact tcttcaacag gagcggcacc gtgggcgaga gcgtgcccac cgacct gtac 840 atcaagggca gcggcagcac cgccaccctg gccaacagca cctacttccc cacccccagc 900 ggcagcatgg tgaccagcga cgcccagatc ttcaacaagc cctactggat gcagagggcc 960 cagggccaca acaacggcat ctgctggggc aaccagctgt tcgtga ccgt ggtggacacc 1020 accaggagca ccaacatgag cgtgtgcgcc gccatcgcca acagcgacac caccttcaag 1080 agcagcaact tcaaggagta cctgaggcac ggcgaggagt tcgacctgca gttcatcttc 1140 cagctgtgca agatcaccct gagcgccg ac atcatgacct acatccacag catgaaccc 1200 gccatcctgg aggactggaa cttcggcctg accaccccccc ccagcggcag cctggaggac 1260 acctacaggt tcgtgaccag ccaggccatc acctgccaga agtccgcccc ccagaagccc 1320 aaggaggacc ccttcaagga ctacgtgttc tgggaggtga acctgaagga gaagttcagc 1380 gccgacctgg accagttccc cctgggcagg aagttcctgc tgcaggccgg ctacagggcc 1440 aggcccaagt tcaaggccgg caagaggagc gcccccagcg ccagcaccac cacccccgcc 1500 aagaggaaga agacca agaa gtaaactcga gctc 1534 <![CDATA[ <210> 58]]>
<![CDATA[ <211> 1519]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3106 Synthetic HPV33L1 gene]]>
<![CDATA[ <400> 58]]> ctgggtacca tgagtgtgtg gagaccatct gaggctacag tctacctgcc tcctgtgcct 60 gtgagcaagg tggtgagcac agatgaatat gtgagcagga ccagcatcta ctactatgct 120 ggctccagca gactgctggc tgtgggacac ccatact tca gcatcaagaa cccaaccaat 180 gccaagaaac tgctggtgcc aaaggtgtct ggactccaat acagggtgtt cagggtgaga 240 ctgcctgacc caaacaagtt tggctttcct gacacctcct tctacaaccc tgacacccag 300 agactggtgt gggcttgtg t gggattggag attggcaggg gacaaccact gggagtgggc 360
atctctggac acccactgct gaacaagttt gatgacacag agacaggcaa caaataccct 420
ggacaacctg gagcagacaa cagggagtgt ctgagtatgg actacaagca gacccaactt 480
tgtctgctgg gctgtaagcc tccaacagga gaacactggg gcaagggagt ggcttgtacc 540
aatgctgccc ctgccaatga ctgtcctcca ttggaactga taaacaccat cattgaggat 600
ggagatatgg tggacacagg ctttggctgt atggacttca agaccctcca agccaacaag 660
tctgatgtgc caattgacat ctgtggcagc acttgtaaat accctgacta cctgaaaatg 720 acctctgaac catatggaga ctccctgttc ttcttcctga gggagggaaca gatgtttgtg 780 agacacttct tcaacagggc tggcaccctg ggagaggctg tgcctgatga cctctacatc 840 a agggctctg gcaccacagc cagcatccag tcctctgcct tctttccaac accatctggc 900 agtatggtga cctctgagag ccaacttttc aacaagccat actggctcca aagggctcaa 960 ggacacaaca atggcatctg ttggggcaac caggtgtttg tgacagtggt ggacaccacc 1020 aggagcacca atatgaccct gtgtacccag gtgacctctg acagcaccta caagaatgag 1080 aacttcaagg aatacatcag gcatgtggag gaatatgacc tccaatttgt gttccaactt 1140 tgtaaggtga ccctgacagc agaggtgatg acctacatcc atgctatgaa ccctgacatc 1200 ttggaggact ggcagtttgg actgacacct cctccatctg cctccctcca agacacctac 1260 aggtttgtga ccagccaggc tatcacttgt cagaagacag tgcctccaaa ggagaaggag 1320 gacccactgg gcaaatacac cttctgggag gtggacctga aaga gaagtt ctctgctgac 1380 ctggaccagt ttccactggg caggaagttc ctgctccaag caggactgaa agccaagcca 1440 aaactgaaaa gggctgcccc aaccagcacc aggacctcct ctgccaagag gaagaaggtg 1500 aagaagtaaa ctcgagctc 1519 <![CDATA[ <210> 59]]>
<![CDATA[ <211> 33]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3107 HPV31L1 F1]]> <![CDATA[ <400> 59]]> cttggtacca tgagcctgtg gaggcccagc gag 33 <![CDATA[ <210> 60]]>
<![CDATA[ <211> 38]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3108 HPV31L1 R1]]> <![CDATA[ <400> 60]]> gcttggcttt gtagccggcc tgcagcagga acttcctg 38 <![CDATA[ <210> 61]]>
<![CDATA[ <211> 1444]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3109 HPV31L1 expansion sequence 1]]>
<![CDATA[ <400> 61]]> cttggtacca tgagcctgtg gaggcccagc gaggccaccg tgtacctgcc ccccgtgccc 60 gtgagcaagg tggtgagcac cgacgagtac gtgaccagga ccaacatcta ctaccacgcc 120 ggcagcgcca ggctgctgac cgtgggcc ac ccctactaca gcatccccaa gagcgacaac 180 cccaagaaga tcgtggtgcc caaggtgcagc ggcctgcagt acagggtgtt cagggtgagg 240 ctgcccgacc ccaacaagtt cggcttcccc gacaccagct tctacaaccc cgagacccag 300 aggctggtgt gggcctg cgt gggcctggag gtgggcaggg gccagcccct gggcgtgggc 360 atcagcggcc accccctgct gaacaagttc gacgacaccg agaacagcaa caggtacgcc 420 ggcggccccg gcaccgacaa cagggagtgc atcagcatgg actacaagca gacccagctg 480 tgcctgctgg gct gcaagcc ccccatcggc gagcactggg gcaagggcag cccctgcagc 540 aacaacgcca tcacccccgg cgactgcccc cccctggagc tgaagaacag cgtgatccag 600 gacggcgaca tggtggacac cggcttcggc gccatggact tcaccgccct gcaggacacc 660 aagagcaacg tgcccctgga catctgcaac agcatctgca agtaccccga ctacctgaag 720 atggtggccg agccctacgg cgacaccctg ttcttctacc tgaggaggga gcagatgttc 780 gtgaggcact tcttcaacag gagcggcacc gtgggcgaga gcgtgcccac cgacct gtac 840 atcaagggca gcggcagcac cgccaccctg gccaacagca cctacttccc cacccccagc 900 ggcagcatgg tgaccagcga cgcccagatc ttcaacaagc cctactggat gcagagggcc 960 cagggccaca acaacggcat ctgctggggc aaccagctgt tcgtga ccgt ggtggacacc 1020 accaggagca ccaacatgag cgtgtgcgcc gccatcgcca acagcgacac caccttcaag 1080 agcagcaact tcaaggagta cctgaggcac ggcgaggagt tcgacctgca gttcatcttc 1140 cagctgtgca agatcaccct gagcgccg ac atcatgacct acatccacag catgaaccc 1200 gccatcctgg aggactggaa cttcggcctg accaccccccc ccagcggcag cctggaggac 1260 acctacaggt tcgtgaccag ccaggccatc acctgccaga agtccgcccc ccagaagccc 1320 aaggaggacc ccttcaagga ctacgtgttc tgggaggtga acctgaagga gaagttcagc 1380 gccgacctgg accagttccc cctgggcagg aagttcctgc tgcaggccgg ctacaaagcc 1440 aagc 1444 <![CDATA[ <210> 62]]>
<![CDATA[ <211> 35]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3110 HPV31L1 F2]]> <![CDATA[ <400> 62]]> ggccggctac aaagccaagc caaaactgaa aaggg 35 <![CDATA[ <210> 63]]>
<![CDATA[ <211> 41]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3111 HPV31L1 R2]]> <![CDATA[ <400> 63]]> ctgtctagat ttacttcttc accttcttcc tcttggcaga g 41 <![CDATA[ <210> 64]]>
<![CDATA[ <211> 101]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3112 HPV31L1 expansion sequence 2]]>
<![CDATA[ <400> 64]]> ggccggctac aaagccaagc caaaactgaa aagggctgcc ccaaccagca ccaggacctc 60 ctctgccaag aggaagaagg tgaagaagta aatctagaca g 101 <![CDATA[ <210> 65]]>
<![CDATA[ <211> 38]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3113 HPV 59 L1 protein amino acid sequence 471-508]]>
<![CDATA[ <400> 65]]> Leu Gln Leu Gly Ala Arg Pro Lys Pro Thr Ile Gly Pro Arg Lys Arg 1 5 10 15 Ala Ala Pro Ala Pro Thr Ser Thr Pro Ser Pro Lys Arg Val Lys Arg 20 25 30 Arg Lys Ser Ser Arg Lys 35 <![CDATA[ <210> 66]]>
<![CDATA[ <211> 499]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3301 Amino acid sequence of HPV type 33 L1 protein]]>
<![CDATA[ <400> 66]]> Met Ser Val Trp Arg Pro Ser Glu Ala Thr Val Tyr Leu Pro Pro Val 1 5 10 15 Pro Val Ser Lys Val Val Ser Thr Asp Glu Tyr Val Ser Arg Thr Ser 20 25 30 Ile Tyr Tyr Tyr Ala Gly Ser Ser Arg Leu Leu Ala Val Gly His Pro 35 40 45 Tyr Phe Ser Ile Lys Asn Pro Thr Asn Ala Lys Lys Leu Leu Val Pro 50 55 60 Lys Val Ser Gly Leu Gln Tyr Arg Val Phe Arg Val Arg Leu Pro Asp 65 70 75 80 Pro Asn Lys Phe Gly Phe Pro Asp Thr Ser Phe Tyr Asn Pro Asp Thr 85 90 95 Gln Arg Leu Val Trp Ala Cys Val Gly Leu Glu Ile Gly Arg Gly Gln 100 105 110 Pro Leu Gly Val Gly Ile Ser Gly His Pro Leu Leu Asn Lys Phe Asp 115 120 125 Asp Thr Glu Thr Gly Asn Lys Tyr Pro Gly Gln Pro Gly Ala Asp Asn 130 135 140 Arg Glu Cys Leu Ser Met Asp Tyr Lys Gln Thr Gln Leu Cys Leu Leu 145 150 155 160 Gly Cys Lys Pro Pro Thr Gly Glu His Trp Gly Lys Gly Val Ala Cys 165 170 175 Thr Asn Ala Ala Pro Ala Asn Asp Cys Pro Pro Leu Glu Leu Ile Asn 180 185 190 Thr Ile Ile Glu Asp Gly Asp Met Val Asp Thr Gly Phe Gly Cys Met 195 200 205 Asp Phe Lys Thr Leu Gln Ala Asn Lys Ser Asp Val Pro Ile Asp Ile 210 215 220 Cys Gly Ser Thr Cys Lys Tyr Pro Asp Tyr Leu Lys Met Thr Ser Glu 225 230 235 240 Pro Tyr Gly Asp Ser Leu Phe Phe Phe Leu Arg Arg Glu Gln Met Phe 245 250 255 Val Arg His Phe Phe Asn Arg Ala Gly Thr Leu Gly Glu Ala Val Pro 260 265 270 Asp Asp Leu Tyr Ile Lys Gly Ser Gly Thr Thr Ala Ser Ile Gln Ser 275 280 285 Ser Ala Phe Phe Pro Thr Pro Ser Gly Ser Met Val Thr Ser Glu Ser 290 295 300 Gln Leu Phe Asn Lys Pro Tyr Trp Leu Gln Arg Ala Gln Gly His Asn 305 310 315 320 Asn Gly Ile Cys Trp Gly Asn Gln Val Phe Val Thr Val Val Asp Thr 325 330 335 Thr Arg Ser Thr Asn Met Thr Leu Cys Thr Gln Val Thr Ser Asp Ser 340 345 350 Thr Tyr Lys Asn Glu Asn Phe Lys Glu Tyr Ile Arg His Val Glu Glu 355 360 365 Tyr Asp Leu Gln Phe Val Phe Gln Leu Cys Lys Val Thr Leu Thr Ala 370 375 380 Glu Val Met Thr Tyr Ile His Ala Met Asn Pro Asp Ile Leu Glu Asp 385 390 395 400 Trp Gln Phe Gly Leu Thr Pro Pro Pro Ser Ala Ser Leu Gln Asp Thr 405 410 415 Tyr Arg Phe Val Thr Ser Gln Ala Ile Thr Cys Gln Lys Thr Val Pro 420 425 430 Pro Lys Glu Lys Glu Asp Pro Leu Gly Lys Tyr Thr Phe Trp Glu Val 435 440 445 Asp Leu Lys Glu Lys Phe Ser Ala Asp Leu Asp Gln Phe Pro Leu Gly 450 455 460 Arg Lys Phe Leu Leu Gln Ala Gly Leu Lys Ala Lys Pro Lys Leu Lys 465 470 475 480 Arg Ala Ala Pro Thr Ser Thr Arg Thr Ser Ser Ala Lys Arg Lys Lys 485 490 495 Val Lys Lys <![CDATA[ <210> 67]]>
<![CDATA[ <211> 1501]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3302 Nucleotide sequence of HPV type 33 L1 protein]]>
<![CDATA[ <400> 67]]> atgagtgtgt ggagaccatc tgaggctaca gtctacctgc ctcctgtgcc tgtgagcaag 60 gtggtgagca cagatgaata tgtgagcagg accagcatct actactatgc tggctccagc 120 agactgctgg ctgtgggaca cccatacttc agcatcaaga acccaaccaa tgccaagaaa 180 ctgctggtgc caaaggtgtc tggactccaa tacagggtgt tcagggtgag actgcctgac 240 ccaaacaagt ttggctttcc tgacacctcc ttctacaacc ctgacaccca gagactggtg 300 tgggcttgtg tg ggattgga gattggcagg ggacaaccac tgggagtggg catctctgga 360
cacccactgc tgaacaagtt tgatgacaca gagacaggca acaaataccc tggacaacct 420
ggagcagaca acagggagtg tctgagtatg gactacaagc agacccaact ttgtctgctg 480
ggctgtaagc ctccaacagg agaacactgg ggcaagggag tggcttgtac caatgctgcc 540
cctgccaatg actgtcctcc attggaactg ataaacacca tcattgagga tggagatatg 600
gtggacacag gctttggctg tatggacttc aagaccctcc aagccaacaa gtctgatgtg 660
ccaattgaca tctgtggcag cacttgtaaa taccctgact acctgaaaat gacctctgaa 720
ccatatggag actccctgtt cttcttcctg aggagggaac agatgtttgt gagacacttc 780
ttcaacaggg ctggcaccct gggagaggct gtgcctgatg acctctacat caagggctct 840
ggcaccacag ccagcatcca gtcctctgcc ttctttccaa caccatctgg cagtatggtg 900
acctctgaga gccaactttt caacaagcca tactggctcc aaagggctca aggacacaac 960
aatggcatct gttggggcaa ccaggtgttt gtgacagtgg tggacaccac caggagcacc 1020 aatatgaccc tgtgtaccca ggtgacctct gacagcacct acaagaatga gaacttcaag 1080 gaatacatca ggcatgtgga ggaatatgac ctccaatttg tgttccaact ttgtaaggtg 1140 accctgacag cagaggtgat gacctacatc catgctatga accctgacat cttggaggac 1200 tggcagtttg gactgacacc tcctccatct gcctccctcc aagacaccta caggtttgtg 1260 accagccagg ctatcacttg tcagaagaca gtgcctccaa aggagaagga ggacccactg 1320 ggcaaataca ccttctggga ggtggacct g aaagagaagt tctctgctga cctggaccag 1380 tttccactgg gcaggaagtt cctgctccaa gcaggactga aagccaagcc aaaactgaaa 1440 agggctgccc caaccagcac caggacctcc tctgccaaga ggaagaaggt gaagaagtaa 1500 a 1501 <![CDATA[ <210> 68]]>
<![CDATA[ <211> 1519]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3303 Synthetic HPV33L1 gene]]>
<![CDATA[ <400> 68]]> ctgggtacca tgagtgtgtg gagaccatct gaggctacag tctacctgcc tcctgtgcct 60 gtgagcaagg tggtgagcac agatgaatat gtgagcagga ccagcatcta ctactatgct 120 ggctccagca gactgctggc tgtgggacac ccatact tca gcatcaagaa cccaaccaat 180 gccaagaaac tgctggtgcc aaaggtgtct ggactccaat acagggtgtt cagggtgaga 240 ctgcctgacc caaacaagtt tggctttcct gacacctcct tctacaaccc tgacacccag 300 agactggtgt gggcttgtg t gggattggag attggcaggg gacaaccact gggagtgggc 360
atctctggac acccactgct gaacaagttt gatgacacag agacaggcaa caaataccct 420
ggacaacctg gagcagacaa cagggagtgt ctgagtatgg actacaagca gacccaactt 480
tgtctgctgg gctgtaagcc tccaacagga gaacactggg gcaagggagt ggcttgtacc 540
aatgctgccc ctgccaatga ctgtcctcca ttggaactga taaacaccat cattgaggat 600
ggagatatgg tggacacagg ctttggctgt atggacttca agaccctcca agccaacaag 660
tctgatgtgc caattgacat ctgtggcagc acttgtaaat accctgacta cctgaaaatg 720 acctctgaac catatggaga ctccctgttc ttcttcctga gggagggaaca gatgtttgtg 780 agacacttct tcaacagggc tggcaccctg ggagaggctg tgcctgatga cctctacatc 840 a agggctctg gcaccacagc cagcatccag tcctctgcct tctttccaac accatctggc 900 agtatggtga cctctgagag ccaacttttc aacaagccat actggctcca aagggctcaa 960 ggacacaaca atggcatctg ttggggcaac caggtgtttg tgacagtggt ggacaccacc 1020 aggagcacca atatgaccct gtgtacccag gtgacctctg acagcaccta caagaatgag 1080 aacttcaagg aatacatcag gcatgtggag gaatatgacc tccaatttgt gttccaactt 1140 tgtaaggtga ccctgacagc agaggtgatg acctacatcc atgctatgaa ccctgacatc 1200 ttggaggact ggcagtttgg actgacacct cctccatctg cctccctcca agacacctac 1260 aggtttgtga ccagccaggc tatcacttgt cagaagacag tgcctccaaa ggagaaggag 1320 gacccactgg gcaaatacac cttctgggag gtggacctga aaga gaagtt ctctgctgac 1380 ctggaccagt ttccactggg caggaagttc ctgctccaag caggactgaa agccaagcca 1440 aaactgaaaa gggctgcccc aaccagcacc aggacctcct ctgccaagag gaagaaggtg 1500 aagaagtaaa ctcgagctc 1519 <![CDATA[ <210> 69]]>
<![CDATA[ <211> 472]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3501 HPV 35 L1 protein amino acid sequence 1-472]]>
<![CDATA[ <400> 69]]> Met Ser Leu Trp Arg Ser Asn Glu Ala Thr Val Tyr Leu Pro Pro Val 1 5 10 15 Ser Val Ser Lys Val Val Ser Thr Asp Glu Tyr Val Thr Arg Thr Asn 20 25 30 Ile Tyr Tyr His Ala Gly Ser Ser Arg Leu Leu Ala Val Gly His Pro 35 40 45 Tyr Tyr Ala Ile Lys Lys Gln Asp Ser Asn Lys Ile Ala Val Pro Lys 50 55 60 Val Ser Gly Leu Gln Tyr Arg Val Phe Arg Val Lys Leu Pro Asp Pro 65 70 75 80 Asn Lys Phe Gly Phe Pro Asp Thr Ser Phe Tyr Asp Pro Ala Ser Gln 85 90 95 Arg Leu Val Trp Ala Cys Thr Gly Val Glu Val Gly Arg Gly Gln Pro 100 105 110 Leu Gly Val Gly Ile Ser Gly His Pro Leu Leu Asn Lys Leu Asp Asp 115 120 125 Thr Glu Asn Ser Asn Lys Tyr Val Gly Asn Ser Gly Thr Asp Asn Arg 130 135 140 Glu Cys Ile Ser Met Asp Tyr Lys Gln Thr Gln Leu Cys Leu Ile Gly 145 150 155 160 Cys Arg Pro Pro Ile Gly Glu His Trp Gly Lys Gly Thr Pro Cys Asn 165 170 175 Ala Asn Gln Val Lys Ala Gly Glu Cys Pro Pro Leu Glu Leu Leu Asn 180 185 190 Thr Val Leu Gln Asp Gly Asp Met Val Asp Thr Gly Phe Gly Ala Met 195 200 205 Asp Phe Thr Thr Leu Gln Ala Asn Lys Ser Asp Val Pro Leu Asp Ile 210 215 220 Cys Ser Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Lys Met Val Ser Glu 225 230 235 240 Pro Tyr Gly Asp Met Leu Phe Phe Tyr Leu Arg Arg Glu Gln Met Phe 245 250 255 Val Arg His Leu Phe Asn Arg Ala Gly Thr Val Gly Glu Thr Val Pro 260 265 270 Ala Asp Leu Tyr Ile Lys Gly Thr Thr Gly Thr Leu Pro Ser Thr Ser 275 280 285 Tyr Phe Pro Thr Pro Ser Gly Ser Met Val Thr Ser Asp Ala Gln Ile 290 295 300 Phe Asn Lys Pro Tyr Trp Leu Gln Arg Ala Gln Gly His Asn Asn Gly 305 310 315 320 Ile Cys Trp Ser Asn Gln Leu Phe Val Thr Val Val Asp Thr Thr Thr Arg 325 330 335 Ser Thr Asn Met Ser Val Cys Ser Ala Val Ser Ser Ser Asp Ser Thr 340 345 350 Tyr Lys Asn Asp Asn Phe Lys Glu Tyr Leu Arg His Gly Glu Glu Tyr 355 360 365 Asp Leu Gln Phe Ile Phe Gln Leu Cys Lys Ile Thr Leu Thr Ala Asp 370 375 380 Val Met Thr Tyr Ile His Ser Met Asn Pro Ser Ile Leu Glu Asp Trp 385 390 395 400 Asn Phe Gly Leu Thr Pro Pro Pro Ser Gly Thr Leu Glu Asp Thr Tyr 405 410 415 Arg Tyr Val Thr Ser Gln Ala Val Thr Cys Gln Lys Pro Ser Ala Pro 420 425 430 Lys Pro Lys Asp Asp Pro Leu Lys Asn Tyr Thr Phe Trp Glu Val Asp 435 440 445 Leu Lys Glu Lys Phe Ser Ala Asp Leu Asp Gln Phe Pro Leu Gly Arg 450 455 460 Lys Phe Leu Leu Gln Ala Gly Leu 465 470 <![CDATA[ <210> 70]]>
<![CDATA[ <211> 26]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> ]]>Homo sapiens
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3502 HPV 33 L1 protein amino acid sequence 474-499]]>
<![CDATA[ <400> 70]]> Lys Ala Lys Pro Lys Leu Lys Arg Ala Ala Pro Thr Ser Thr Arg Thr 1 5 10 15 Ser Ser Ala Lys Arg Lys Lys Val Lys Lys 20 25 <![CDATA[ <210> 71]]>
<![CDATA[ <211> 498]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3503 Amino acid sequence of chimeric HPV type 35 L1 protein]]>
<![CDATA[ <400]]>> 71]]>
<br/> <![CDATA[Met Ser Leu Trp Arg Ser Asn Glu Ala Thr Val Tyr Leu Pro Pro Val 1 5 10 15 Ser Val Ser Lys Val Val Ser Thr Asp Glu Tyr Val Thr Arg Thr Asn 20 25 30 Ile Tyr Tyr His Ala Gly Ser Ser Arg Leu Leu Ala Val Gly His Pro 35 40 45 Tyr Tyr Ala Ile Lys Lys Gln Asp Ser Asn Lys Ile Ala Val Pro Lys 50 55 60 Val Ser Gly Leu Gln Tyr Arg Val Phe Arg Val Lys Leu Pro Asp Pro 65 70 75 80 Asn Lys Phe Gly Phe Pro Asp Thr Ser Phe Tyr Asp Pro Ala Ser Gln 85 90 95 Arg Leu Val Trp Ala Cys Thr Gly Val Glu Val Gly Arg Gly Gln Pro 100 105 110 Leu Gly Val Gly Ile Ser Gly His Pro Leu Leu Asn Lys Leu Asp Asp 115 120 125 Thr Glu Asn Ser Asn Lys Tyr Val Gly Asn Ser Gly Thr Asp Asn Arg 130 135 140 Glu Cys Ile Ser Met Asp Tyr Lys Gln Thr Gln Leu Cys Leu Ile Gly 145 150 155 160 Cys Arg Pro Pro Ile Gly Glu His Trp Gly Lys Gly Thr Pro Cys Asn 165 170 175 Ala Asn Gln Val Lys Ala Gly Glu Cys Pro Pro Leu Glu Leu Leu Asn 180 185 190 Thr Val Leu Gln Asp Gly Asp Met Val Asp Thr Gly Phe Gly Ala Met 195 200 205 Asp Phe Thr Thr Leu Gln Ala Asn Lys Ser Asp Val Pro Leu Asp Ile 210 215 220 Cys Ser Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Lys Met Val Ser Glu 225 230 235 240 Pro Tyr Gly Asp Met Leu Phe Phe Tyr Leu Arg Arg Glu Gln Met Phe 245 250 255 Val Arg His Leu Phe Asn Arg Ala Gly Thr Val Gly Glu Thr Val Pro 260 265 270 Ala Asp Leu Tyr Ile Lys Gly Thr Thr Gly Thr Leu Pro Ser Thr Ser 275 280 285 Tyr Phe Pro Thr Pro Ser Gly Ser Met Val Thr Ser Asp Ala Gln Ile 290 295 300 Phe Asn Lys Pro Tyr Trp Leu Gln Arg Ala Gln Gly His Asn Asn Gly 305 310 315 320 Ile Cys Trp Ser Asn Gln Leu Phe Val Thr Val Val Asp Thr Thr Thr Arg 325 330 335 Ser Thr Asn Met Ser Val Cys Ser Ala Val Ser Ser Ser Asp Ser Thr 340 345 350 Tyr Lys Asn Asp Asn Phe Lys Glu Tyr Leu Arg His Gly Glu Glu Tyr 355 360 365 Asp Leu Gln Phe Ile Phe Gln Leu Cys Lys Ile Thr Leu Thr Ala Asp 370 375 380 Val Met Thr Tyr Ile His Ser Met Asn Pro Ser Ile Leu Glu Asp Trp 385 390 395 400 Asn Phe Gly Leu Thr Pro Pro Pro Ser Gly Thr Leu Glu Asp Thr Tyr 405 410 415 Arg Tyr Val Thr Ser Gln Ala Val Thr Cys Gln Lys Pro Ser Ala Pro 420 425 430 Lys Pro Lys Asp Asp Pro Leu Lys Asn Tyr Thr Phe Trp Glu Val Asp 435 440 445 Leu Lys Glu Lys Phe Ser Ala Asp Leu Asp Gln Phe Pro Leu Gly Arg 450 455 460 Lys Phe Leu Leu Gln Ala Gly Leu Lys Ala Lys Pro Lys Leu Lys Arg 465 470 475 480 Ala Ala Pro Thr Ser Thr Arg Thr Ser Ser Ala Lys Arg Lys Lys Val 485 490 495 Lys Lys <![CDATA[ <210> 72]]>
<![CDATA[ <211> 1498]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3504 Nucleotide sequence of chimeric HPV type 35 L1 protein]]>
<![CDATA[ <400> 72]]> atgagtctgt ggaggagcaa tgaggctaca gtctacctgc ctcctgtgtc tgtgagcaag 60 gtggtgagca cagatgaata tgtgaccagg accaacatct actaccatgc tggctccagc 120 agactgctgg ctgtgggaca cccatactat gccatcaaga agcaggacag caacaagatt 180 gctgtgccaa aggtgtctgg actccaatac agggtgttca gggtgaaact gcctgaccca 240 aacaagtttg gctttcctga cacctccttc tatgaccctg ccagccagag actggtgtgg 300 gcttgtactg gagtggaggt gggcagggga ca accactgg gagtgggcat ctctggacac 360
ccactgctga acaaactgga tgacacagag aacagcaaca aatatgtggg caactctggc 420
acagacaaca gggagtgtat cagtatggac tacaagcaga cccaactttg tctgattggc 480
tgtagacctc caattggaga acactggggc aagggcacac catgtaatgc caaccaggtg 540
aaggctggag agtgtcctcc attggaactg ctgaacacag tgctccaaga tggagatatg 600
gtggacacag gctttggagc tatggacttc accaccctcc aagccaacaa gtctgatgtg 660
ccactggaca tctgttccag catctgtaaa taccctgact acctgaaaat ggtgtctgaa 720 ccatatggag atatgctgtt cttctacctg aggagggaac agatgtttgt gagacacctg 780 ttcaacaggg ctggcacagt gggagagaca gtgcctgctg acctctacat caagggcacc 840 acaggcaccc tgccaagc ac ctcctacttt ccaacaccat ctggcagtat ggtgacctct 900 gatgcccaga ttttcaacaa gccatactgg ctccaaaggg ctcaaggaca caacaatggc 960 atctgttgga gcaaccaact ttttgtgaca gtggtggaca ccaccaggag caccaatatg 1020 agtgt gtgtt ctgctgtgtc ctcctctgac agcacctaca agaatgacaa cttcaaggaa 1080 tacctgagac atggagagga atatgacctc caattcatct tccaactttg taagattacc 1140 ctgacagcag atgtgatgac ctacatccac agtatgaacc caagcatctt ggaggactgg 1200 aact ttggac tgacacctcc tccatctggc accttggagg acacctacag atatgtgacc 1260 agccaggctg tgacttgtca gaagccatct gccccaaagc caaaggatga cccactgaaa 1320 aactacacct tctgggaggt ggacctgaaa gagaagttct ctgctgacct ggaccagttt 1380 ccactgggca ggaagttcct gctccaagca ggactgaaag ccaagccaaa actgaaaagg 1440 gctgccccaa ccagcaccag gacctcctct gccaagagga agaaggtgaa gaagtaaa 1498 <![CDATA[ <210> 73]]>
<![CDATA[ <211> 1528]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3505 Synthetic HPV35L1 gene]]>
<![CDATA[ <400> 73]]> ctgggtacca tgagtctgtg gaggagcaat gaggctacag tctacctgcc tcctgtgtct 60 gtgagcaagg tggtgagcac agatgaatat gtgaccagga ccaacatcta ctaccatgct 120 ggctccagca gactgctggc tgtgggacac ccatactat g ccatcaagaa gcaggacagc 180 aacaagattg ctgtgccaaa ggtgtctgga ctccaataca gggtgttcag ggtgaaactg 240 cctgacccaa acaagtttgg ctttcctgac acctccttct atgaccctgc cagccagaga 300 ctggtgtggg cttgtactgg a gtggaggtg ggcaggggac aaccactggg agtgggcatc 360 tctggacacc cactgctgaa caaactggat gacacagaga acagcaacaa atatgtgggc 420 aactctggca cagacaacag ggagtgtatc agtatggact acaagcagac ccaactttgt 480 ctgattggct gtagacctcc aattggagaa cact ggggca agggcacacc atgtaatgcc 540 aaccaggtga aggctggaga gtgtcctcca ttggaactgc tgaacacagt gctccaagat 600 ggagatatgg tggacacagg ctttggagct atggacttca ccaccctcca agccaacaag 660 tctgatgtgc cactggacat ctgt tccagc atctgtaaat accctgacta cctgaaaatg 720 gtgtctgaac catatggaga tatgctgttc ttctacctga gggagggaaca gatgtttgtg 780 agacacctgt tcaacagggc tggcacagtg ggagagacag tgcctgctga cctctacatc 840 aagggcacca caggcaccct gccaagcacc tcctactttc caacaccatc tggcagtatg 900 gtgacctctg atgcccagat tttcaacaag ccatactggc tccaaagggc tcaaggacac 960 aacaatggca tctgttggag caaccaactt tttgtgacag tggtggacac caccaggagc 1 020 accaatatga gtgtgtgttc tgctgtgtcc tcctctgaca gcacctacaa gaatgacaac 1080 ttcaaggaat acctgagaca tggagaggaa tatgacctcc aattcatctt ccaactttgt 1140 aagattaccc tgacagcaga tgtgatgacc tacatccaca gtatga accc aagcatcttg 1200 gaggactgga actttggact gacacctcct ccatctggca ccttggagga cacctacaga 1260 tatgtgacca gccaggctgt gacttgtcag aagccatctg ccccaaagcc aaaggatgac 1320 ccactgaaaa actacacctt ctgggaggtg gacctgaa agaagttctc tgctgacctg 1380 gaccagtttc cactgggcag gaagttcctg ctccaagcag gactgaaagc cagaccaaac 1440 ttcagactgg gcaagagggc tgcccctgcc agcaccagca agaagtccag caccaagagg 1500 aggaaggtga agagctaaac tcgagctc 1528 <![CDATA[ <210> 74]]>
<![CDATA[ <211> 1519]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3506 Synthetic HPV33L1 gene]]>
<![CDATA[ <400> 74]]> ctgggtacca tgagtgtgtg gagaccatct gaggctacag tctacctgcc tcctgtgcct 60 gtgagcaagg tggtgagcac agatgaatat gtgagcagga ccagcatcta ctactatgct 120 ggctccagca gactgctggc tgtgggacac ccatact tca gcatcaagaa cccaaccaat 180 gccaagaaac tgctggtgcc aaaggtgtct ggactccaat acagggtgtt cagggtgaga 240 ctgcctgacc caaacaagtt tggctttcct gacacctcct tctacaaccc tgacacccag 300 agactggtgt gggcttgtg t gggattggag attggcaggg gacaaccact gggagtgggc 360
atctctggac acccactgct gaacaagttt gatgacacag agacaggcaa caaataccct 420
ggacaacctg gagcagacaa cagggagtgt ctgagtatgg actacaagca gacccaactt 480
tgtctgctgg gctgtaagcc tccaacagga gaacactggg gcaagggagt ggcttgtacc 540
aatgctgccc ctgccaatga ctgtcctcca ttggaactga taaacaccat cattgaggat 600
ggagatatgg tggacacagg ctttggctgt atggacttca agaccctcca agccaacaag 660
tctgatgtgc caattgacat ctgtggcagc acttgtaaat accctgacta cctgaaaatg 720 acctctgaac catatggaga ctccctgttc ttcttcctga gggagggaaca gatgtttgtg 780 agacacttct tcaacagggc tggcaccctg ggagaggctg tgcctgatga cctctacatc 840 a agggctctg gcaccacagc cagcatccag tcctctgcct tctttccaac accatctggc 900 agtatggtga cctctgagag ccaacttttc aacaagccat actggctcca aagggctcaa 960 ggacacaaca atggcatctg ttggggcaac caggtgtttg tgacagtggt ggacaccacc 1020 aggagcacca atatgaccct gtgtacccag gtgacctctg acagcaccta caagaatgag 1080 aacttcaagg aatacatcag gcatgtggag gaatatgacc tccaatttgt gttccaactt 1140 tgtaaggtga ccctgacagc agaggtgatg acctacatcc atgctatgaa ccctgacatc 1200 ttggaggact ggcagtttgg actgacacct cctccatctg cctccctcca agacacctac 1260 aggtttgtga ccagccaggc tatcacttgt cagaagacag tgcctccaaa ggagaaggag 1320 gacccactgg gcaaatacac cttctgggag gtggacctga aaga gaagtt ctctgctgac 1380 ctggaccagt ttccactggg caggaagttc ctgctccaag caggactgaa agccaagcca 1440 aaactgaaaa gggctgcccc aaccagcacc aggacctcct ctgccaagag gaagaaggtg 1500 aagaagtaaa ctcgagctc 1519 <![CDATA[ <210> 75]]>
<![CDATA[ <211> 34]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3507 HPV35L1 F1]]> <![CDATA[ <400> 75]]> cttggtacca tgagtctgtg gaggagcaat gagg 34 <![CDATA[ <210> 76]]>
<![CDATA[ <211> 36]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3508 HPV35L1 R1]]> <![CDATA[ <400> 76]]> gcttggcttt cagtcctgct tggagcagga acttcc 36 <![CDATA[ <210> 77]]>
<![CDATA[ <211> 1435]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3509 HPV35L1 expansion sequence 1]]>
<![CDATA[ <400> 77]]> cttggtacca tgagtctgtg gaggagcaat gaggctacag tctacctgcc tcctgtgtct 60 gtgagcaagg tggtgagcac agatgaatat gtgaccagga ccaacatcta ctaccatgct 120 ggctccagca gactgctggc tgtgggacac ccatactat g ccatcaagaa gcaggacagc 180 aacaagattg ctgtgccaaa ggtgtctgga ctccaataca gggtgttcag ggtgaaactg 240 cctgacccaa acaagtttgg ctttcctgac acctccttct atgaccctgc cagccagaga 300 ctggtgtggg cttgtactgg a gtggaggtg ggcaggggac aaccactggg agtgggcatc 360 tctggacacc cactgctgaa caaactggat gacacagaga acagcaacaa atatgtgggc 420 aactctggca cagacaacag ggagtgtatc agtatggact acaagcagac ccaactttgt 480 ctgattggct gtagacctcc aattggagaa cact ggggca agggcacacc atgtaatgcc 540 aaccaggtga aggctggaga gtgtcctcca ttggaactgc tgaacacagt gctccaagat 600 ggagatatgg tggacacagg ctttggagct atggacttca ccaccctcca agccaacaag 660 tctgatgtgc cactggacat ctgt tccagc atctgtaaat accctgacta cctgaaaatg 720 gtgtctgaac catatggaga tatgctgttc ttctacctga gggagggaaca gatgtttgtg 780 agacacctgt tcaacagggc tggcacagtg ggagagacag tgcctgctga cctctacatc 840 aagggcacca caggcaccct gccaagcacc tcctactttc caacaccatc tggcagtatg 900 gtgacctctg atgcccagat tttcaacaag ccatactggc tccaaagggc tcaaggacac 960 aacaatggca tctgttggag caaccaactt tttgtgacag tggtggacac caccaggagc 1 020 accaatatga gtgtgtgttc tgctgtgtcc tcctctgaca gcacctacaa gaatgacaac 1080 ttcaaggaat acctgagaca tggagaggaa tatgacctcc aattcatctt ccaactttgt 1140 aagattaccc tgacagcaga tgtgatgacc tacatccaca gtatga accc aagcatcttg 1200 gaggactgga actttggact gacacctcct ccatctggca ccttggagga cacctacaga 1260 tatgtgacca gccaggctgt gacttgtcag aagccatctg ccccaaagcc aaaggatgac 1320 ccactgaaaa actacacctt ctgggaggtg gacctgaa agaagttctc tgctgacctg 1380 gaccagtttc cactgggcag gaagttcctg ctccaagcag gactgaaagc caagc 1435 <![CDATA[ <210> 78]]>
<![CDATA[ <211> 35]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <22]]>3> 3510 HPV35L1 F2]]> <br/>
<br/> <![CDATA[ <400>78]]> <br/> <![CDATA[agcaggactg aaagccaagc caaaactgaa aaggg 35 <![CDATA[ <210> 79]]>
<![CDATA[ <211> 36]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3511 HPV35L1 R2]]> <![CDATA[ <400> 79]]> ctgtctagat ttacttcttc accttcttcc tcttgg 36 <![CDATA[ <210> 80]]>
<![CDATA[ <211> 101]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3512 HPV35L1 extended sequence 2]]>
<![CDATA[ <400> 80]]> agcaggactg aaagccaagc caaaactgaa aagggctgcc ccaaccagca ccaggacctc 60 ctctgccaag aggaagaagg tgaagaagta aatctagaca g 101 <![CDATA[ <210> 81]]>
<![CDATA[ <211> 38]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3513 HPV 59 L1 protein amino acid sequence 471-508]]>
<![CDATA[ <400> 81]]> Leu Gln Leu Gly Ala Arg Pro Lys Pro Thr Ile Gly Pro Arg Lys Arg 1 5 10 15 Ala Ala Pro Ala Pro Thr Ser Thr Pro Ser Pro Lys Arg Val Lys Arg 20 25 30 Arg Lys Ser Ser Arg Lys 35 <![CDATA[ <210> 82]]>
<![CDATA[ <211> 469]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3901 HPV 39 type L1 protein 1-469 amino acid sequence]]>
<![CDATA[ <400> 82]]> Met Ala Met Trp Arg Ser Ser Asp Ser Met Val Tyr Leu Pro Pro Pro 1 5 10 15 Ser Val Ala Lys Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Gly 20 25 30 Ile Tyr Tyr Tyr Ala Gly Ser Ser Arg Leu Leu Thr Val Gly His Pro 35 40 45 Tyr Phe Lys Val Gly Met Asn Gly Gly Arg Lys Gln Asp Ile Pro Lys 50 55 60 Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Thr Leu Pro Asp Pro 65 70 75 80 Asn Lys Phe Ser Ile Pro Asp Ala Ser Leu Tyr Asn Pro Glu Thr Gln 85 90 95 Arg Leu Val Trp Ala Cys Val Gly Val Glu Val Gly Arg Gly Gln Pro 100 105 110 Leu Gly Val Gly Ile Ser Gly His Pro Leu Tyr Asn Arg Gln Asp Asp 115 120 125 Thr Glu Asn Ser Pro Phe Ser Ser Thr Thr Asn Lys Asp Ser Arg Asp 130 135 140 Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Ile Gly Cys 145 150 155 160 Val Pro Ala Ile Gly Glu His Trp Gly Lys Gly Lys Ala Cys Lys Pro 165 170 175 Asn Asn Val Ser Thr Gly Asp Cys Pro Pro Leu Glu Leu Val Asn Thr 180 185 190 Pro Ile Glu Asp Gly Asp Met Ile Asp Thr Gly Tyr Gly Ala Met Asp 195 200 205 Phe Gly Ala Leu Gln Glu Thr Lys Ser Glu Val Pro Leu Asp Ile Cys 210 215 220 Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp Val 225 230 235 240 Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe Ala 245 250 255 Arg His Phe Trp Asn Arg Gly Gly Met Val Gly Asp Ala Ile Pro Ala 260 265 270 Gln Leu Tyr Ile Lys Gly Thr Asp Ile Arg Ala Asn Pro Gly Ser Ser 275 280 285 Val Tyr Cys Pro Ser Pro Ser Gly Ser Met Val Thr Ser Asp Ser Gln 290 295 300 Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn Asn 305 310 315 320 Gly Ile Cys Trp His Asn Gln Leu Phe Leu Thr Val Val Asp Thr Thr 325 330 335 Arg Ser Thr Asn Phe Thr Leu Ser Thr Ser Ser Ile Glu Ser Ser Ile Pro 340 345 350 Ser Thr Tyr Asp Pro Ser Lys Phe Lys Glu Tyr Thr Arg His Val Glu 355 360 365 Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Val Thr Leu Thr 370 375 380 Thr Asp Val Met Ser Tyr Ile His Thr Met Asn Ser Ser Ile Leu Asp 385 390 395 400 Asn Trp Asn Phe Ala Val Ala Pro Pro Pro Ser Ala Ser Leu Val Asp 405 410 415 Thr Tyr Arg Tyr Leu Gln Ser Ala Ala Ile Thr Cys Gln Lys Asp Ala 420 425 430 Pro Ala Pro Glu Lys Lys Asp Pro Tyr Asp Gly Leu Lys Phe Trp Asn 435 440 445 Val Asp Leu Arg Glu Lys Phe Ser Leu Glu Leu Asp Gln Phe Pro Leu 450 455 460 Gly Arg Lys Phe Leu 465 <![CDATA[ <210> 83]]>
<![CDATA[ <211> 38]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3902 HPV 59 type L1 protein amino acid sequence 471-508]]>
<![CDATA[ <400> 83]]> Leu Gln Leu Gly Ala Arg Pro Lys Pro Thr Ile Gly Pro Arg Lys Arg 1 5 10 15 Ala Ala Pro Ala Pro Thr Ser Thr Pro Ser Pro Lys Arg Val Lys Arg 20 25 30 Arg Lys Ser Ser Arg Lys 35 <![CDATA[ <210> 84]]>
<![CDATA[ <211> 507]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3903 Amino acid sequence of chimeric HPV type 39 L1 protein]]>
<![CDATA[ <400> 84]]> Met Ala Met Trp Arg Ser Ser Asp Ser Met Val Tyr Leu Pro Pro Pro 1 5 10 15 Ser Val Ala Lys Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Gly 20 25 30 Ile Tyr Tyr Tyr Ala Gly Ser Ser Arg Leu Leu Thr Val Gly His Pro 35 40 45 Tyr Phe Lys Val Gly Met Asn Gly Gly Arg Lys Gln Asp Ile Pro Lys 50 55 60 Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Thr Leu Pro Asp Pro 65 70 75 80 Asn Lys Phe Ser Ile Pro Asp Ala Ser Leu Tyr Asn Pro Glu Thr Gln 85 90 95 Arg Leu Val Trp Ala Cys Val Gly Val Glu Val Gly Arg Gly Gln Pro 100 105 110 Leu Gly Val Gly Ile Ser Gly His Pro Leu Tyr Asn Arg Gln Asp Asp 115 120 125 Thr Glu Asn Ser Pro Phe Ser Ser Thr Thr Asn Lys Asp Ser Arg Asp 130 135 140 Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Ile Gly Cys 145 150 155 160 Val Pro Ala Ile Gly Glu His Trp Gly Lys Gly Lys Ala Cys Lys Pro 165 170 175 Asn Asn Val Ser Thr Gly Asp Cys Pro Pro Leu Glu Leu Val Asn Thr 180 185 190 Pro Ile Glu Asp Gly Asp Met Ile Asp Thr Gly Tyr Gly Ala Met Asp 195 200 205 Phe Gly Ala Leu Gln Glu Thr Lys Ser Glu Val Pro Leu Asp Ile Cys 210 215 220 Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp Val 225 230 235 240 Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu Phe Ala 245 250 255 Arg His Phe Trp Asn Arg Gly Gly Met Val Gly Asp Ala Ile Pro Ala 260 265 270 Gln Leu Tyr Ile Lys Gly Thr Asp Ile Arg Ala Asn Pro Gly Ser Ser 275 280 285 Val Tyr Cys Pro Ser Pro Ser Gly Ser Met Val Thr Ser Asp Ser Gln 290 295 300 Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly His Asn Asn 305 310 315 320 Gly Ile Cys Trp His Asn Gln Leu Phe Leu Thr Val Val Asp Thr Thr 325 330 335 Arg Ser Thr Asn Phe Thr Leu Ser Thr Ser Ser Ile Glu Ser Ser Ile Pro 340 345 350 Ser Thr Tyr Asp Pro Ser Lys Phe Lys Glu Tyr Thr Arg His Val Glu 355 360 365 Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr Val Thr Leu Thr 370 375 380 Thr Asp Val Met Ser Tyr Ile His Thr Met Asn Ser Ser Ile Leu Asp 385 390 395 400 Asn Trp Asn Phe Ala Val Ala Pro Pro Pro Ser Ala Ser Leu Val Asp 405 410 415 Thr Tyr Arg Tyr Leu Gln Ser Ala Ala Ile Thr Cys Gln Lys Asp Ala 420 425 430 Pro Ala Pro Glu Lys Lys Asp Pro Tyr Asp Gly Leu Lys Phe Trp Asn 435 440 445 Val Asp Leu Arg Glu Lys Phe Ser Leu Glu Leu Asp Gln Phe Pro Leu 450 455 460 Gly Arg Lys Phe Leu Leu Gln Leu Gly Ala Arg Pro Lys Pro Thr Ile 465 470 475 480 Gly Pro Arg Lys Arg Ala Ala Pro Ala Pro Thr Ser Thr Pro Ser Pro 485 490 495 Lys Arg Val Lys Arg Arg Lys Ser Ser Arg Lys 500 505 <![CDATA[ <210> 85]]>
<![CDATA[ <211> 1525]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3904 Nucleotide sequence of chimeric HPV type 39 L1 protein]]>
<![CDATA[ <400> 85]]> atggctatgt ggaggtcctc tgacagtatg gtctacctgc ctcctccatc tgtggctaag 60 gtggtgaaca cagatgacta tgtgaccagg acaggcatct actactatgc tggctccagc 120 agactgctga cagtgggaca cccatacttc aaggtgggg a tgaatggagg caggaagcag 180 gacatcccaa aggtgtctgc ctaccaatac agggtgttca gggtgaccct gcctgaccca 240 aacaagttca gcatccctga tgcctccctc tacaaccctg agacccag actggtgtgg 300 gcttgtgtgg gagtggaggt gggcagggg a caaccactgg gagtgggcat ctctggacac 360 ccactctaca acagacagga tgacacagag aacagcccat tctccagcac caccaacaag 420 gacagcaggg acaatgtgtc tgtggactac aagcagaccc aactttgtat cattggctgt 480 gtgcctgcca ttggagaaca ctggggcaag gg caaggctt gtaagccaaa caatgtgagc 540 acaggagact gtcctccatt ggaactggtg aacacaccaa ttgaggatgg agatatgatt 600 gacacaggct atggagctat ggactttgga gccctccaag agaccaagtc tgaggtgcca 660 ctggacatct gtcagagcat ctgtaaat ac cctgactacc tccaaatgag tgctgatgtc 720 tatggagaca gtatgttctt ctgtctgagg agggaacaac tttttgccag acacttctgg 780 aacaggggag ggatggtggg agatgccatc cctgcccaac tctacatcaa gggcacagac 840 atcagggc ta accctggctc ctctgtctac tgtccaagcc catctggcag tatggtgacc 900 tctgacagcc aacttttcaa caagccatac tggctgcaca aggctcaagg acacaacaat 960 ggcatctgtt ggcacaacca acttttcctg acagtggtgg acaccaccag gagcaccaac 1020 ttcaccctga gcaccagcat tgagtccagc atcccaagca cctatgaccc aagcaagttc 1080 aaggaataca ccaggcatgt ggaggaatat gacctccaat tcatcttcca actttgtact 1140 gtgaccctga ccacagatgt gatgagttac atccacacaa tgaactccag catcctggac 1200 aactggaact ttgctgtggc tcctcctcca tctgcctccc tggtggacac ctacagatac 1260 ctccaatctg ctgccatcac ttgtcagaag gatgcccctg cccctgagaa gaaggaccca 1320 tatgatggac tgaagttctg gaatgtggac ctga gggaga agttctcctt ggaactggac 1380 cagtttccac tgggcaggaa gttcctgctc caacttggag ccagaccaaa gccaaccatt 1440 ggaccaagga agagggctgc ccctgcccca accagcacac caagcccaaa gagggtgaag 1500 aggaggaagt ccagcaggaa gtaaa 1525 <![CDATA[ <210> 86]]>
<![CDATA[ <211> 1537]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3905 Synthetic HPV39L1 gene]]>
<![CDATA[ <400> 86]]>
ctgggtacca tggctatgtg gaggtcctct gacagtatgg tctacctgcc tcctccatct 60
gtggctaagg tggtgaacac agatgactat gtgaccagga caggcatcta ctactatgct 120
ggctccagca gactgctgac agtgggacac ccatacttca aggtggggat gaatggaggc 180
aggaagcagg acatcccaaa ggtgtctgcc taccaataca gggtgttcag ggtgaccctg 240
cctgacccaa acaagttcag catccctgat gcctccctct acaaccctga gacccagaga 300
ctggtgtggg cttgtgtggg agtggaggtg ggcaggggac aaccactggg agtgggcatc 360
tctggacacc cactctacaa cagacaggat gacacagaga acagcccatt ctccagcacc 420
accaacaagg acagcaggga caatgtgtct gtggactaca agcagaccca actttgtatc 480
attggctgtg tgcctgccat tggagaacac tggggcaagg gcaaggcttg taagccaaac 540
aatgtgagca caggagactg tcctccattg gaactggtga acacaccaat tgaggatgga 600
gatatgattg acacaggcta tggagctatg gactttggag ccctccaaga gaccaagtct 660
gaggtgccac tggacatctg tcagagcatc tgtaaatacc ctgactacct ccaaatgagt 720 gctgatgtct atggagacag tatgttcttc tgtctgagga gggaacaact ttttgccaga 780 cacttctgga acaggggagg gatggtggga gatgccatcc ctgcccaact ctacatcaag 840 ggcacagaca tcagggctaa ccctggctcc tctgtctact gtccaagccc atctggcagt 900 atggtgacct ctgacagcca acttttcaac aagccatact ggctgcacaa ggctcaagga 960 cacaacaatg gcatctgttg gcacaaccaa cttttcctga cagtggtgga caccaccagg 1020 agcaccaact tcaccctgag caccagcatt gagtccagca tcccaagcac ctatgaccca 1080 agcaagttca aggaatacac caggcatgtg gaggaatatg acctccaatt catcttccaa 1140 ctttgtactg tgaccctgac cacagatgtg atgagttaca tccacacaat gaactccagc 1200 atcctggaca actggaactt tgctgtggct cctcctccat ctgcctccct ggtggacacc 1260 tacagatacc tccaatctgc tgccatcact tgtcagaagg atgcccctgc ccctgagaag 1320 aaggacccat atgatggact gaagttctgg aatgtggacc tgag ggagaa gttctccttg 1380 gaactggacc agtttccact gggcaggaag ttcctgctcc aagccagggt gaggaggaga 1440 ccaaccattg gaccaaggaa gagacctgct gccagcacct cctcctcctc tgccaccaaa 1500 cacaagagga agagggtgag caagtaaact cgagctc 1 537 <![CDATA[ <210> 87]]>
<![CDATA[ <211> 1546]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3906 Synthetic HPV59L1 gene]]>
<![CDATA[ <400> 87]]> ctgggtacca tggctctgtg gaggtcctct gacaacaagg tctacctgcc tcctccatct 60 gtggctaagg tggtgagcac agatgaatat gtgaccagga ccagcatctt ctaccatgct 120 ggctccagca gactgctgac agtgggacac ccatacttca aggtgccaaa gggaggcaat 180 ggcagacagg atgtgccaaa ggtgtctgcc taccaataca gggtgttcag ggtgaaactg 240 cctgacccaa acaagtttgg actgcctgac aacacagtct atgacccaaa cagccagaga 300 ctggtgtggg cttgtgtggg a gtggagatt ggcaggggac aaccactggg agtgggactg 360 tctggacacc cactctacaa caaactggat gacacagaga actctcatgt ggcatctgct 420 gtggacacca aggacaccag ggacaatgtg tctgtggact acaagcagac ccaactttgt 480 atcattggct gtgtgcctgc cattggagaa cactggacca agggcacagc ctgtaagcca 540 accacagtgg tccagggaga ctgtcctcca ttggaactga taaacacacc aattgaggat 600 ggagatatgg tggacacagg ctatggagct atggacttca aactgctcca agacaacaag 660 tctgaggtgc cactggacat ctgtc agagc atctgtaaat accctgacta cctccaaatg 720 agtgctgatg cctatggaga cagtatgttc ttctgtctga ggagggaaca ggtgtttgcc 780 agacacttct ggaacaggtc tggcacaatg ggagaccaac ttcctgagtc cctctacatc 840 aagggca cag acatcagggc taaccctggc tcctacctct acagcccaag cccatctggc 900 tctgtggtga cctctgacag ccaacttttc aacaagccat actggctgca caaggctcaa 960 ggactgaaca atggcatctg ttggcacaac caacttttcc tgacagtggt ggacaccacc 1020 aggagcacca acctgtctgt gtgtgccagc accacctcca gcatcccaaa tgtctacaca 1080 ccaacctcct tcaaggaata tgccaggcat gtggaggagt ttgacctcca attcatcttc 1140 caactttgta agattaccct gaccacagag gtgatgagtt acatccacaa tatgaacac c 1200 accatcttgg aggactggaa ctttggagtg acacctcctc caacagcctc cctggtggac 1260 acctacaggt ttgtccagtc tgctgctgtg acttgtcaga aggacacagc ccctcctgtg 1320 aagcaggacc catatgacaa actgaagttc tggcctgt gg acctgaaaga gaggttctct 1380 gctgacctgg accagtttcc actgggcagg aagttcctgc tccaacttgg agccagacca 1440 aagccaacca ttggaccaag gaagagggct gcccctgccc caaccagcac accaagccca 1500 aagagggtga agaggaggaa gtccagcagg aagtaaactc ctc 1546 <![CDATA[ <210> 88]]>
<![CDATA[ <211> 38]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3907 HPV39L1 F1]]> <![CDATA[ <400> 88]]> cttggtacca tggctatgtg gaggtcctct gacagtat 38 <![CDATA[ <210> 89]]>
<![CDATA[ <211> 38]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3908 HPV39L1 R1]]> <![CDATA[ <400> 89]]> tccaagttgg agcaggaact tcctgcccag tggaaact 38 <![CDATA[ <210> 90]]>
<![CDATA[ <211> 1428]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3909 HPV39L1 expansion sequence 1]]>
<![CDATA[ <400> 90]]>
cttggtacca tggctatgtg gaggtcctct gacagtatgg tctacctgcc tcctccatct 60
gtggctaagg tggtgaacac agatgactat gtgaccagga caggcatcta ctactatgct 120
ggctccagca gactgctgac agtgggacac ccatacttca aggtggggat gaatggaggc 180
aggaagcagg acatcccaaa ggtgtctgcc taccaataca gggtgttcag ggtgaccctg 240
cctgacccaa acaagttcag catccctgat gcctccctct acaaccctga gacccagaga 300
ctggtgtggg cttgtgtggg agtggaggtg ggcaggggac aaccactggg agtgggcatc 360
tctggacacc cactctacaa cagacaggat gacacagaga acagcccatt ctccagcacc 420
accaacaagg acagcaggga caatgtgtct gtggactaca agcagaccca actttgtatc 480
attggctgtg tgcctgccat tggagaacac tggggcaagg gcaaggcttg taagccaaac 540
aatgtgagca caggagactg tcctccattg gaactggtga acacaccaat tgaggatgga 600
gatatgattg acacaggcta tggagctatg gactttggag ccctccaaga gaccaagtct 660
gaggtgccac tggacatctg tcagagcatc tgtaaatacc ctgactacct ccaaatgagt 720 gctgatgtct atggagacag tatgttcttc tgtctgagga gggaacaact ttttgccaga 780 cacttctgga acaggggagg gatggtggga gatgccatcc ctgcccaact ctacatcaag 840 ggcacagaca tcagggctaa ccctggctcc tctgtctact gtccaagccc atctggcagt 900 atggtgacct ctgacagcca acttttcaac aagccatact ggctgcacaa ggctcaagga 960 cacaacaatg gcatctgttg gcacaaccaa cttttcctga cagtggtgga caccaccagg 1020 agcaccaact tcaccctgag caccagcatt gagtccagca tcccaagcac ctatgaccca 1080 agcaagttca aggaatacac caggcatgtg gaggaatatg acctccaatt catcttccaa 1140 ctttgtactg tgaccctgac cacagatgtg atgagttaca tccacacaat gaactccagc 1200 atcctggaca actggaactt tgctgtggct cctcctccat ctgcctccct ggtggacacc 1260 tacagatacc tccaatctgc tgccatcact tgtcagaagg atgcccctgc ccctgagaag 1320 aaggacccat atgatggact gaagttctgg aatgtggacc tgag ggagaa gttctccttg 1380 gaactggacc agtttccact gggcaggaag ttcctgctcc aacttgga 1428 <![CDATA[ <210> 91]]>
<![CDATA[ <211> 37]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3910 HPV39L1 F2]]> <![CDATA[ <400> 91]]> aggaagttcc tgctccaact tggagccaga ccaaagc 37 <![CDATA[ <210> 92]]>
<![CDATA[ <211> 40]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3911 HPV39L1 R2]]> <![CDATA[ <400> 92]]> ctgtctagat ttacttcctg ctggacttcc tcctcttcac 40 <![CDATA[ <210> 93]]>
<![CDATA[ <211> 139]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3912 HPV39L1 expansion sequence 2]]>
<![CDATA[ <400> 93]]> aggaagttcc tgctccaact tggagccaga ccaaagccaa ccattggacc aaggaagagg 60 gctgcccctg ccccaaccag cacaccaagc ccaaagaggg tgaagaggag gaagtccagc 120 aggaagtaaa tctagacag 139 <![CDATA[ <210> 94]]>
<![CDATA[ <211> 26]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 3913 HPV 33 L1 protein amino acid sequence 474-499]]>
<![CDATA[ <400> 94]]> Lys Ala Lys Pro Lys Leu Lys Arg Ala Ala Pro Thr Ser Thr Arg Thr 1 5 10 15 Ser Ser Ala Lys Arg Lys Lys Val Lys Lys 20 25 <![CDATA[ <210> 95]]>
<![CDATA[ <211> 478]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 450]]>1 Amino acid sequence 1-478 of HPV type 45 L1 protein
<![CDATA[ <400> 95]]> Met Ala Leu Trp Arg Pro Ser Asp Ser Thr Val Tyr Leu Pro Pro Pro 1 5 10 15 Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Ser Arg Thr Ser 20 25 30 Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro 35 40 45 Tyr Phe Arg Val Val Pro Ser Gly Ala Gly Asn Lys Gln Ala Val Pro 50 55 60 Lys Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Ala Leu Pro Asp 65 70 75 80 Pro Asn Lys Phe Gly Leu Pro Asp Ser Thr Ile Tyr Asn Pro Glu Thr 85 90 95 Gln Arg Leu Val Trp Ala Cys Val Gly Met Glu Ile Gly Arg Gly Gln 100 105 110 Pro Leu Gly Ile Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp 115 120 125 Asp Thr Glu Ser Ala His Ala Ala Thr Ala Val Ile Thr Gln Asp Val 130 135 140 Arg Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu 145 150 155 160 Gly Cys Val Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Leu Cys 165 170 175 Lys Pro Ala nnJL Gln Pro Gly Asp Cys Pro Pro Leu Glu Leu Lys 180 185 190 Asn Thr Ile Ile Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala 195 200 205 Met Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp 210 215 220 Ile Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala 225 230 235 240 Asp Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu 245 250 255 Phe Ala Arg His Phe Trp Asn Arg Ala Gly Val Met Gly Asp Thr Val 260 265 270 Pro Thr Asp Leu Tyr Ile Lys Gly Thr Ser Ala Asn Met Arg Glu Thr 275 280 285 Pro Gly Ser Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Thr Thr 290 295 300 Ser Asp Ser Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln 305 310 315 320 Gly His Asn Asn Gly Ile Cys Trp His Asn Gln Leu Phe Val Thr Val 325 330 335 Val Asp Thr Thr Arg Ser Thr Asn Leu Thr Leu Cys Ala Ser Thr Gln 340 345 350 Asn Pro Val Pro Asn Thr Tyr Asp Pro Thr Lys Phe Lys His Tyr Ser 355 360 365 Arg His Val Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr 370 375 380 Ile Thr Leu Thr Ala Glu Val Met Ser Tyr Ile His Ser Met Asn Ser 385 390 395 400 Ser Ile Leu Glu Asn Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr 405 410 415 Ser Leu Val Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Val Thr Cys 420 425 430 Gln Lys Asp Thr Thr Pro Pro Glu Lys Gln Asp Pro Tyr Asp Lys Leu 435 440 445 Lys Phe Trp Thr Val Asp Leu Lys Glu Lys Phe Ser Ser Asp Leu Asp 450 455 460 Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu 465 470 475 <![CDATA[ <210> 96]]>
<![CDATA[ <211> 26]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 4502 HPV type 33 L1 protein amino acid sequence 474-499]]>
<![CDATA[ <400> 96]]> Lys Ala Lys Pro Lys Leu Lys Arg Ala Ala Pro Thr Ser Thr Arg Thr 1 5 10 15 Ser Ser Ala Lys Arg Lys Lys Val Lys Lys 20 25 <![CDATA[ <210> 97]]>
<![CDATA[ <211> 504]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 4503 Amino acid sequence of chimeric HPV type 45 L1 protein]]>
<![CDATA[ <400> 97]]> Met Ala Leu Trp Arg Pro Ser Asp Ser Thr Val Tyr Leu Pro Pro Pro 1 5 10 15 Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Ser Arg Thr Ser 20 25 30 Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro 35 40 45 Tyr Phe Arg Val Val Pro Ser Gly Ala Gly Asn Lys Gln Ala Val Pro 50 55 60 Lys Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Ala Leu Pro Asp 65 70 75 80 Pro Asn Lys Phe Gly Leu Pro Asp Ser Thr Ile Tyr Asn Pro Glu Thr 85 90 95 Gln Arg Leu Val Trp Ala Cys Val Gly Met Glu Ile Gly Arg Gly Gln 100 105 110 Pro Leu Gly Ile Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp 115 120 125 Asp Thr Glu Ser Ala His Ala Ala Thr Ala Val Ile Thr Gln Asp Val 130 135 140 Arg Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Leu 145 150 155 160 Gly Cys Val Pro Ala Ile Gly Glu His Trp Ala Lys Gly Thr Leu Cys 165 170 175 Lys Pro Ala nnJL Gln Pro Gly Asp Cys Pro Pro Leu Glu Leu Lys 180 185 190 Asn Thr Ile Ile Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala 195 200 205 Met Asp Phe Ser Thr Leu Gln Asp Thr Lys Cys Glu Val Pro Leu Asp 210 215 220 Ile Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala 225 230 235 240 Asp Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Leu 245 250 255 Phe Ala Arg His Phe Trp Asn Arg Ala Gly Val Met Gly Asp Thr Val 260 265 270 Pro Thr Asp Leu Tyr Ile Lys Gly Thr Ser Ala Asn Met Arg Glu Thr 275 280 285 Pro Gly Ser Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser Ile Thr Thr 290 295 300 Ser Asp Ser Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln 305 310 315 320 Gly His Asn Asn Gly Ile Cys Trp His Asn Gln Leu Phe Val Thr Val 325 330 335 Val Asp Thr Thr Arg Ser Thr Asn Leu Thr Leu Cys Ala Ser Thr Gln 340 345 350 Asn Pro Val Pro Asn Thr Tyr Asp Pro Thr Lys Phe Lys His Tyr Ser 355 360 365 Arg His Val Glu Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Thr 370 375 380 Ile Thr Leu Thr Ala Glu Val Met Ser Tyr Ile His Ser Met Asn Ser 385 390 395 400 Ser Ile Leu Glu Asn Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr 405 410 415 Ser Leu Val Asp Thr Tyr Arg Phe Val Gln Ser Val Ala Val Thr Cys 420 425 430 Gln Lys Asp Thr Thr Pro Pro Glu Lys Gln Asp Pro Tyr Asp Lys Leu 435 440 445 Lys Phe Trp Thr Val Asp Leu Lys Glu Lys Phe Ser Ser Asp Leu Asp 450 455 460 Gln Tyr Pro Leu Gly Arg Lys Phe Leu Val Gln Ala Gly Leu Lys Ala 465 470 475 480 Lys Pro Lys Leu Lys Arg Ala Ala Pro Thr Ser Thr Arg Thr Ser Ser 485 490 495 Ala Lys Arg Lys Lys Val Lys Lys 500 <![CDATA[ <210> 98]]>
<![CDATA[ <211> 1516]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 4504]]> Nucleotide sequence of chimeric HPV type 45 L1 protein
<![CDATA[ <400> 98]]> atggctctgt ggagaccatc tgacagcaca gtctacctgc ctcctccatc tgtggcaagg 60 gtggtgaaca cagatgacta tgtgagcagg accagcatct tctaccatgc tggctccagc 120 agactgctga cagtgggcaa cccatacttc agggtgg tgc caagtggagc aggcaacaag 180 caggctgtgc caaaggtgtc tgcctaccaa tacagggtgt tcagggtggc tctgcctgac 240 ccaaacaagt ttggactgcc tgacagcacc atctacaacc ctgagaccca gagactggtg 300 tgggcttgtg tggggatgga gattggcagg ggacaaccac tgggcattgg actgtctgga 360 cacccattct acaacaaact ggatgacaca gagtctgccc atgctgccac agcagtgatt 420 acccaggatg tgagggacaa tgtgtctgtg gactacaagc agacccaact ttgtatcctg 480 ggctgtgtgc ctgccattgg agaacactgg gctaagggca ccctgtgtaa gcctgcccaa 540 ctccaacctg gagactgtcc tccattggaa ctgaaaaaca ccatcattga ggatggagat 600 atggtggaca caggctatgg agctatggac ttcagcaccc tccaagacac caagtgtga g 660 gtgccactgg acatctgtca gagcatctgt aaataccctg actacctcca aatgagtgct 720 gacccatatg gagacagtat gttcttctgt ctgaggaggg aacaactttt tgccagacac 780 ttctggaaca gggctggagt gatgggagac acagtgccaa cagacctcta catcaag ggc 840 acctctgcca atatgaggga gacacctggc tcctgtgtct acagcccaag cccatctggc 900 agcatcacca cctctgacag ccaacttttc aacaagccat actggctgca caaggctcaa 960 ggacacaaca atggcatctg ttggcacaac caactttttg tgacagtggt ggacaccacc 1020 aggagcacca acctgaccct gtgtgccagc accgaacc ctgtgccaaa cacctatgac 1080 ccaaccaagt tcaagcacta cagcaggcat gtggaggaat atgacctcca attcatcttc 1140 caactttgta ccatcaccct gacagcagag gtgatgagtt acatccacag tatgaactcc 1200 agcatcttgg agaactggaa ctttggagtg cctcctcctc caaccacctc cctggtggac 1260 acctacaggt ttgtccagtc tgtggctgtg acttgtcaga aggacaccac acctcctgag 1320 aagcaggacc catatgacaa actgaagttc tgg acagtgg acctgaaaga gaagttctcc 1380
tctgacctgg accaataccc actgggcagg aagttcctgg tccaggctgg actgaaagcc 1440
aagccaaaac tgaaaagggc tgccccaacc agcaccagga cctcctctgc caagaggaag 1500
aaggtgaaga agtaaa 1516
<![CDATA[ <210> 99]]>
<![CDATA[ <211> 1552]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 4505 Synthetic HPV45L1 gene]]>
<![CDATA[ <400> 99]]> ctgggtacca tggctctgtg gagaccatct gacagcacag tctacctgcc tcctccatct 60 gtggcaaggg tggtgaacac agatgactat gtgagcagga ccagcatctt ctaccatgct 120 ggctccagca gactgctgac agtgggcaac ccatacttca gggtgg tgcc aagtggagca 180 ggcaacaagc aggctgtgcc aaaggtgtct gcctaccaat acagggtgtt cagggtggct 240 ctgcctgacc caaacaagtt tggactgcct gacagcacca tctacaaccc tgagacccag 300 agactggtgt gggcttgtgt ggggatgg ag attggcaggg gacaaccact gggcattgga 360 ctgtctggac acccattcta caacaaactg gatgacacag agtctgccca tgctgccaca 420 gcagtgatta cccaggatgt gagggacaat gtgtctgtgg actacaagca gacccaactt 480 tgtatcctgg gctgtgtgcc tgccattgga gaacactggg ctaagggcac cctgtgtaag 540 cctgcccaac tccaacctgg agactgtcct ccattggaac tgaaaaacac catcattgag 600 gatggagata tggtggacac aggctatgga gctatggact tcagcaccct ccaagacacc 660 aagtgtgagg tg ccactgga catctgtcag agcatctgta aataccctga ctacctccaa 720 atgagtgctg acccatatgg agacagtatg ttcttctgtc tgaggaggga acaacttttt 780 gccagacact tctggaacag ggctggagtg atgggagaca cagtgccaac agacctctac 840 atcaagggca cctctgccaa tatgaggggag acacctggct cctgtgtcta cagcccaagc 900 ccatctggca gcatcaccac ctctgacagc caacttttca acaagccata ctggctgcac 960 aaggctcaag gacacaacaa tggcatctgt tggcacaacc aactttttgt gacagtggtg 1020 gacaccacca ggagcaccaa cctgaccctg tgtgccagca cccagaaccc tgtgccaaac 1080 acctatgacc caaccaagtt caagcactac agcaggcatg tggaggaata tgacctccaa 1140 ttcatcttcc aactttgtac catcaccctg acagcagagg tgatgagtta catccacagt 1200 atgaactcca gcatcttgga gaactggaac tttggagtgc ctcctcctcc aaccacctcc 1260 ctggtggaca cctacaggtt tgtccagtct gtggctgtga cttgtcagaa ggacaccaca 1320 cctcctgaga agcaggaccc atatgacaaa ctgaagttct ggacagtgga cctga aagag 1380 aagttctcct ctgacctgga ccaataccca ctgggcagga agttcctggt ccaggctgga 1440 ctgaggagga gaccaaccat tggaccaagg aagagacctg ctgccagcac cagcacagcc 1500 agcagacctg ccaagagggt gaggattagg agcaagaagt aaactcgagc tc 1552 <![CDATA[ <210> 100]]>
<![CDATA[ <211> 1519]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 4506 Synthetic HPV33L1 gene]]>
<![CDATA[ [400] acttca gcatcaagaa cccaaccaat 180 gccaagaaac tgctggtgcc aaaggtgtct ggactccaat acagggtgtt cagggtgaga 240 ctgcctgacc caaacaagtt tggctttcct gacacctcct tctacaaccc tgacacccag 300 agactggtgt gggcttgt gt gggattggag attggcaggg gacaaccact gggagtgggc 360
atctctggac acccactgct gaacaagttt gatgacacag agacaggcaa caaataccct 420
ggacaacctg gagcagacaa cagggagtgt ctgagtatgg actacaagca gacccaactt 480
tgtctgctgg gctgtaagcc tccaacagga gaacactggg gcaagggagt ggcttgtacc 540
aatgctgccc ctgccaatga ctgtcctcca ttggaactga taaacaccat cattgaggat 600
ggagatatgg tggacacagg ctttggctgt atggacttca agaccctcca agccaacaag 660
tctgatgtgc caattgacat ctgtggcagc acttgtaaat accctgacta cctgaaaatg 720 acctctgaac catatggaga ctccctgttc ttcttcctga gggagggaaca gatgtttgtg 780 agacacttct tcaacagggc tggcaccctg ggagaggctg tgcctgatga cctctacatc 840 a agggctctg gcaccacagc cagcatccag tcctctgcct tctttccaac accatctggc 900 agtatggtga cctctgagag ccaacttttc aacaagccat actggctcca aagggctcaa 960 ggacacaaca atggcatctg ttggggcaac caggtgtttg tgacagtggt ggacaccacc 1020 aggagcacca atatgaccct gtgtacccag gtgacctctg acagcaccta caagaatgag 1080 aacttcaagg aatacatcag gcatgtggag gaatatgacc tccaatttgt gttccaactt 1140 tgtaaggtga ccctgacagc agaggtgatg acctacatcc atgctatgaa ccctgacatc 1200 ttggaggact ggcagtttgg actgacacct cctccatctg cctccctcca agacacctac 1260 aggtttgtga ccagccaggc tatcacttgt cagaagacag tgcctccaaa ggagaaggag 1320 gacccactgg gcaaatacac cttctgggag gtggacctga aaga gaagtt ctctgctgac 1380 ctggaccagt ttccactggg caggaagttc ctgctccaag caggactgaa agccaagcca 1440 aaactgaaaa gggctgcccc aaccagcacc aggacctcct ctgccaagag gaagaaggtg 1500 aagaagtaaa ctcgagctc 1519 <![CDATA[ <210> 101]]>
<![CDATA[ <211> 33]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 4507 HPV45L1 F1]]> <![CDATA[ <400> 101]]> cttggtacca tggctctgtg gagaccatct gac 33 <![CDATA[ <210> 102]]>
<![CDATA[ <211> 32]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 4508 HPV45L1 R1]]> <![CDATA[ <400> 102]]> gcttggcttt cagtccagcc tggaccagga ac 32 <![CDATA[ <210> 103]]>
<![CDATA[ <211> 1453]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <22]]>0> ]]>
<br/> <![CDATA[ <223> 4509 HPV45L1 expansion sequence 1]]>
<br/>
<br/> <![CDATA[ <400>103]]> <br/> <![CDATA[cttggtacca tggctctgtg gagaccatct gacagcacag tctacctgcc tcctccatct 60 gtggcaaggg tggtgaacac agatgactat gtgagcagga ccagcatctt ctaccatgct 120 ggctccagca gactgctgac agtgggcaac ccatacttca gggtggtgcc aagt ggagca 180 ggcaacaagc aggctgtgcc aaaggtgtct gcctaccaat acagggtgtt cagggtggct 240 ctgcctgacc caaacaagtt tggactgcct gacagcacca tctacaaccc tgagacccag 300 agactggtgt gggcttgtgt ggggatggag gg gacaaccact gggcattgga 360 ctgtctggac acccattcta caacaaactg gatgacacag agtctgccca tgctgccaca 420 gcagtgatta cccaggatgt gagggacaat gtgtctgtgg actacaagca gacccaactt 480 tgtatcctgg gctgtgtgcc tgccattgg a gaacactggg ctaagggcac cctgtgtaag 540 cctgcccaac tccaacctgg agactgtcct ccattggaac tgaaaaacac catcattgag 600 gatggagata tggtggacac aggctatgga gctatggact tcagcaccct ccaagacacc 660 aagtgtgagg tgccactgga catct gtcag agcatctgta aataccctga ctacctccaa 720 atgagtgctg acccatatgg agacagtatg ttcttctgtc tgaggaggga acaacttttt 780 gccagacact tctggaacag ggctggagtg atgggagaca cagtgccaac agacctctac 840 atcaagggca cctctgccaa tatgaggggag acacctggct cctgtgtcta cagcccaagc 900 ccatctggca gcatcaccac ctctgacagc caacttttca acaagccata ctggctgcac 960 aaggctcaag gacacaacaa tggcatctgt tggcacaacc aactttttgt gacagtggtg 1020 gacaccacca ggagcaccaa cctgaccctg tgtgccagca cccagaaccc tgtgccaaac 1080 acctatgacc caaccaagtt caagcactac agcaggcatg tggaggaata tgacctccaa 1140 ttcatcttcc aactttgtac catcaccctg acagcagagg tgatgagtta catccacagt 1200 atgaactcca gcatcttgga gaactggaac tttggagtgc ctcctcctcc aaccacctcc 1260 ctggtggaca cctacaggtt tgtccagtct gtggctgtga cttgtcagaa ggacaccaca 1320 cctcctgaga agcaggaccc atatgacaaa ctgaagttct ggacagtgga cctgaaagag 138 0 aagttctcct ctgacctgga ccaataccca ctgggcagga agttcctggt ccaggctgga 1440 ctgaaagcca agc 1453 <![CDATA[ <210> 104]]>
<![CDATA[ <211> 35]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <2]]>20> ]]>
<br/> <![CDATA[ <223> 4510 HPV45L1]]> <![CDATA[ F2
<![CDATA[ <400> 104]]> ggctggactg aaagccaagc caaaactgaa aaggg 35 <![CDATA[ <210> 105]]>
<![CDATA[ <211> 37]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 4511 HPV45L1 R2]]> <![CDATA[ <400> 105]]> ctgtctagat ttacttcttc accttcttcc tcttggc 37 <![CDATA[ <210> 106]]>
<![CDATA[ <211> 101]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 4512 HPV45L1 extended sequence 2]]>
<![CDATA[ <400> 106]]> ggctggactg aaagccaagc caaaactgaa aagggctgcc ccaaccagca ccaggacctc 60 ctctgccaag aggaagaagg tgaagaagta aatctagaca g 101 <![CDATA[ <210> 107]]>
<![CDATA[ <211> 38]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 4513 Amino acid sequence 471-508 of HPV type 59 L1 protein]]>
<![CDATA[ <400> 107]]> Leu Gln Leu Gly Ala Arg Pro Lys Pro Thr Ile Gly Pro Arg Lys Arg 1 5 10 15 Ala Ala Pro Ala Pro Thr Ser Thr Pro Ser Pro Lys Arg Val Lys Arg 20 25 30 Arg Lys Ser Ser Arg Lys 35 <![CDATA[ <210> 108]]>
<![CDATA[ <211> 474]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5101 HPV 51 L1 protein amino acid sequence 1-474]]>
<![CDATA[ <400> 108]]> Met Ala Leu Trp Arg Thr Asn Asp Ser Lys Val Tyr Leu Pro Pro Ala 1 5 10 15 Pro Val Ser Arg Ile Val Asn Thr Glu Glu Tyr Ile Thr Arg Thr Gly 20 25 30 Ile Tyr Tyr Tyr Ala Gly Ser Ser Arg Leu Ile Thr Leu Gly His Pro 35 40 45 Tyr Phe Pro Ile Pro Lys Thr Ser Thr Arg Ala Ala Ile Pro Lys Val 50 55 60 Ser Ala Phe Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro Asn 65 70 75 80 Lys Phe Gly Leu Pro Asp Pro Asn Leu Tyr Asn Pro Asp Thr Asp Arg 85 90 95 Leu Val Trp Gly Cys Val Gly Val Glu Val Gly Arg Gly Gln Pro Leu 100 105 110 Gly Val Gly Leu Ser Gly His Pro Leu Phe Asn Lys Tyr Asp Asp Thr 115 120 125 Glu Asn Ser Arg Ile Ala Asn Gly Asn Ala Gln Gln Asp Val Arg Asp 130 135 140 Asn Thr Ser Val Asp Asn Lys Gln Thr Gln Leu Cys Ile Ile Gly Cys 145 150 155 160 Ala Pro Pro Ile Gly Glu His Trp Gly Ile Gly Thr Thr Cys Lys Asn 165 170 175 Thr Pro Val Pro Pro Gly Asp Cys Pro Pro Leu Glu Leu Val Ser Ser 180 185 190 Val Ile Gln Asp Gly Asp Met Ile Asp Thr Gly Phe Gly Ala Met Asp 195 200 205 Phe Ala Ala Leu Gln Ala Thr Lys Ser Asp Val Pro Leu Asp Ile Ser 210 215 220 Gln Ser Val Cys Lys Tyr Pro Asp Tyr Leu Lys Met Ser Ala Asp Thr 225 230 235 240 Tyr Gly Asn Ser Met Phe Phe His Leu Arg Arg Glu Gln Ile Phe Ala 245 250 255 Arg His Tyr Tyr Asn Lys Leu Val Gly Val Gly Glu Asp Ile Pro Asn 260 265 270 Asp Tyr Tyr Ile Lys Gly Ser Gly Asn Gly Arg Asp Pro Ile Glu Ser 275 280 285 Tyr Ile Tyr Ser Ala Thr Pro Ser Gly Ser Met Ile Thr Ser Asp Ser 290 295 300 Gln Ile Phe Asn Lys Pro Tyr Trp Leu His Arg Ala Gln Gly His Asn 305 310 315 320 Asn Gly Ile Cys Trp Asn Asn Gln Leu Phe Ile Thr Cys Val Asp Thr 325 330 335 Thr Arg Ser Thr Asn Leu Thr Ile Ser Thr Ala Thr Ala Ala Val Ser 340 345 350 Pro Thr Phe Thr Pro Ser Asn Phe Lys Gln Tyr Ile Arg His Gly Glu 355 360 365 Glu Tyr Glu Leu Gln Phe Ile Phe Gln Leu Cys Lys Ile Thr Leu Thr 370 375 380 Thr Glu Val Met Ala Tyr Leu His Thr Met Asp Pro Thr Ile Leu Glu 385 390 395 400 Gln Trp Asn Phe Gly Leu Thr Leu Pro Pro Ser Ala Ser Leu Glu Asp 405 410 415 Ala Tyr Arg Phe Val Arg Asn Ala Ala Thr Ser Cys Gln Lys Asp Thr 420 425 430 Pro Pro Gln Ala Lys Pro Asp Pro Leu Ala Lys Tyr Lys Phe Trp Asp 435 440 445 Val Asp Leu Lys Glu Arg Phe Ser Leu Asp Leu Asp Gln Phe Ala Leu 450 455 460 Gly Arg Lys Phe Leu Leu Gln Val Gly Val 465 470 <![CDATA[ <210> 109]]>
<![CDATA[ <211> 26]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5102 HPV type 33]]>Amino acid sequence 474-499 of L1 protein
<![CDATA[ <400> 109]]> Lys Ala Lys Pro Lys Leu Lys Arg Ala Ala Pro Thr Ser Thr Arg Thr 1 5 10 15 Ser Ser Ala Lys Arg Lys Lys Val Lys Lys 20 25 <![CDATA[ <210> 110]]>
<![CDATA[ <211> 500]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5103 Amino acid sequence of chimeric HPV type 51 L1 protein]]>
<![CDATA[ <400> 110]]> Met Ala Leu Trp Arg Thr Asn Asp Ser Lys Val Tyr Leu Pro Pro Ala 1 5 10 15 Pro Val Ser Arg Ile Val Asn Thr Glu Glu Tyr Ile Thr Arg Thr Gly 20 25 30 Ile Tyr Tyr Tyr Ala Gly Ser Ser Arg Leu Ile Thr Leu Gly His Pro 35 40 45 Tyr Phe Pro Ile Pro Lys Thr Ser Thr Arg Ala Ala Ile Pro Lys Val 50 55 60 Ser Ala Phe Gln Tyr Arg Val Phe Arg Val Gln Leu Pro Asp Pro Asn 65 70 75 80 Lys Phe Gly Leu Pro Asp Pro Asn Leu Tyr Asn Pro Asp Thr Asp Arg 85 90 95 Leu Val Trp Gly Cys Val Gly Val Glu Val Gly Arg Gly Gln Pro Leu 100 105 110 Gly Val Gly Leu Ser Gly His Pro Leu Phe Asn Lys Tyr Asp Asp Thr 115 120 125 Glu Asn Ser Arg Ile Ala Asn Gly Asn Ala Gln Gln Asp Val Arg Asp 130 135 140 Asn Thr Ser Val Asp Asn Lys Gln Thr Gln Leu Cys Ile Ile Gly Cys 145 150 155 160 Ala Pro Pro Ile Gly Glu His Trp Gly Ile Gly Thr Thr Cys Lys Asn 165 170 175 Thr Pro Val Pro Pro Gly Asp Cys Pro Pro Leu Glu Leu Val Ser Ser 180 185 190 Val Ile Gln Asp Gly Asp Met Ile Asp Thr Gly Phe Gly Ala Met Asp 195 200 205 Phe Ala Ala Leu Gln Ala Thr Lys Ser Asp Val Pro Leu Asp Ile Ser 210 215 220 Gln Ser Val Cys Lys Tyr Pro Asp Tyr Leu Lys Met Ser Ala Asp Thr 225 230 235 240 Tyr Gly Asn Ser Met Phe Phe His Leu Arg Arg Glu Gln Ile Phe Ala 245 250 255 Arg His Tyr Tyr Asn Lys Leu Val Gly Val Gly Glu Asp Ile Pro Asn 260 265 270 Asp Tyr Tyr Ile Lys Gly Ser Gly Asn Gly Arg Asp Pro Ile Glu Ser 275 280 285 Tyr Ile Tyr Ser Ala Thr Pro Ser Gly Ser Met Ile Thr Ser Asp Ser 290 295 300 Gln Ile Phe Asn Lys Pro Tyr Trp Leu His Arg Ala Gln Gly His Asn 305 310 315 320 Asn Gly Ile Cys Trp Asn Asn Gln Leu Phe Ile Thr Cys Val Asp Thr 325 330 335 Thr Arg Ser Thr Asn Leu Thr Ile Ser Thr Ala Thr Ala Ala Val Ser 340 345 350 Pro Thr Phe Thr Pro Ser Asn Phe Lys Gln Tyr Ile Arg His Gly Glu 355 360 365 Glu Tyr Glu Leu Gln Phe Ile Phe Gln Leu Cys Lys Ile Thr Leu Thr 370 375 380 Thr Glu Val Met Ala Tyr Leu His Thr Met Asp Pro Thr Ile Leu Glu 385 390 395 400 Gln Trp Asn Phe Gly Leu Thr Leu Pro Pro Ser Ala Ser Leu Glu Asp 405 410 415 Ala Tyr Arg Phe Val Arg Asn Ala Ala Thr Ser Cys Gln Lys Asp Thr 420 425 430 Pro Pro Gln Ala Lys Pro Asp Pro Leu Ala Lys Tyr Lys Phe Trp Asp 435 440 445 Val Asp Leu Lys Glu Arg Phe Ser Leu Asp Leu Asp Gln Phe Ala Leu 450 455 460 Gly Arg Lys Phe Leu Leu Gln Val Gly Val Lys Ala Lys Pro Lys Leu 465 470 475 480 Lys Arg Ala Ala Pro Thr Ser Thr Arg Thr Ser Ser Ala Lys Arg Lys 485 490 495 Lys Val Lys Lys 500 <![CDATA[ <210> 111]]>
<![CDATA[ <211> 1504]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5104 Nucleotide sequence of chimeric HPV type 51 L1 protein]]>
<![CDATA[ <400> 111]]> atggctctgt ggaggaccaa tgacagcaag gtctacctgc ctcctgcccc tgtgagcagg 60 attgtgaaca cagaggaata catcaccagg acaggcatct actactatgc tggctccagc 120 agactgatta ccctgggaca cccatacttt ccaatcccaa agaccagcac cagggctgcc 180 atcccaaagg tgtctgcctt ccaatacagg gtgttcaggg tccaacttcc tgacccaaac 240 aagtttggac tgcctgaccc aaacctctac aaccctgaca cagacagact ggtgtggggc 300 tgtgtgggag tggaggtggg caggggacaa ccactggggag tgggactgtc tggacacccca 360 ctgttcaaca aatatgatga cacagagaac agcaggattg ccaatggcaa tgcccaacag 420 gatgtgaggg acaacacctc tgtggacaac aagcagaccc aactttgtat cattggctgt 480 gcccctccaa ttggagaaca ctgggg catt ggcaccactt gtaagaacac acctgtgcct 540 cctggagact gtcctccatt ggaactggtg tcctctgtga ttcaggatgg agatatgatt 600 gacacaggct ttggagctat ggactttgct gccctccaag ccaccaagtc tgatgtgcca 660 ctggacatca gccagt ctgt gtgtaaatac cctgactacc tgaaaatgag tgctgacacc 720 tatggcaaca gtatgttctt ccacctgagg agggaacaga tttttgccag acactactac 780 aacaaactgg tgggagtggg agaggacatc ccaaatgact actacatcaa gggctctggc 840 aatggcaggg acccaattga g tcctacatc tactctgcca caccatctgg cagtatgatt 900 acctctgaca gccagatttt caacaagcca tactggctgc acagggctca aggacacaac 960 aatggcatct gttggaacaa ccaacttttc atcacttgtg tggacaccac caggagcacc 1020 aacctgacca tcagcacagc cacagcagca gtgagcccaa ccttcacacc aagcaacttc 1080 aagcaataca tcagacatgg agaggaatat gaactccaat tcatcttcca actttgtaag 1140 attaccctga ccacagaggt gatggcttac ctgcacacaa tggacccaac catcttggaa 1200 ca gtggaact ttggactgac cctgcctcca tctgcctcct tggaggatgc ctacaggttt 1260 gtgaggaatg ctgccacctc ctgtcagaag gacacacctc cacaggctaa gcctgaccca 1320 ctggctaaat acaagttctg ggatgtggac ctgaaagaga ggttctccct ggacctggac 1380 cagtttgccc tgggcaggaa gttcctgctc caagtggggag tcaaagccaa gccaaaactg 1440 aaaagggctg ccccaaccag caccaggacc tcctctgcca agaggaagaa ggtgaagaag 1500 taaa 1504 <![CDATA[ <210> 112]]>
<![CDATA[ <211> 1534]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <22]]>3> 5105 Synthetic HPV51L1 gene]]>
<br/>
<br/> <![CDATA[ <400>112]]> <br/> <![CDATA[ctgggtacca tggctctgtg gaggaccaat gacagcaagg tctacctgcc tcctgcccct 60
gtgagcagga ttgtgaacac agaggaatac atcaccagga caggcatcta ctactatgct 120
ggctccagca gactgattac cctgggacac ccatactttc caatcccaaa gaccagcacc 180
agggctgcca tcccaaaggt gtctgccttc caatacaggg tgttcagggt ccaacttcct 240
gacccaaaca agtttggact gcctgaccca aacctctaca accctgacac agacagactg 300
gtgtggggct gtgtgggagt ggaggtgggc aggggacaac cactgggagt gggactgtct 360 ggacacccac tgttcaacaa atatgatgac acagagaaca gcaggattgc caatggcaat 420 gcccaacagg atgtgaggga caacacctct gtggacaaca agcagaccca actttgtatc 480 attggctgtg cccctccaat tggagaacac tgggg cattg gcaccacttg taagaacaca 540 cctgtgcctc ctggagactg tcctccattg gaactggtgt cctctgtgat tcaggatgga 600 gatatgattg acacaggctt tggagctatg gactttgctg ccctccaagc caccaagtct 660 gatgtgccac tggacatcag ccagtctgtg tgtaaatacc ctgactacct gaaaatgagt 720 gctgacacct atggcaacag tatgttcttc cacctgagga gggaacagat ttttgccaga 780 cactactaca acaaactggt gggagtggga gaggacatcc caaatgacta ctacatcaag 840 ggctctggca atggcaggga cccaattgag tcctacatct actctgccac accatctggc 900 agtatgatta cctctgacag ccagattttc aacaagccat actggctgca cagggctcaa 960 ggacacaaca atggcatctg ttggaacaac caacttttca tcacttgtgt ggacaccacc 1020 aggagcacca acctgaccat cagcacagcc acagcagcag tgagcccaac cttcacca 1080 agcaacttca agcaatacat cagacatgga gaggaatatg aactccaatt catcttccaa 1140 ctttgtaaga ttaccctgac cacagaggtg atggcttacc tgcacacaat ggacccaacc 1200 atcttggaac agtggaact t tggactgacc ctgcctccat ctgcctcctt ggaggatgcc 1260 tacaggtttg tgaggaatgc tgccacctcc tgtcagaagg acacacctcc acaggctaag 1320 cctgacccac tggctaaata caagttctgg gatgtggacc tgaaagagag gttctccctg 1380 gacctggacc agtttgcct gggcaggaag ttcctgctcc aagtggggagt ccagaggaag 1440 ccaagacctg gactgaaaag acctgcctcc tctgcctcct cctcctcctc ctcctctgcc 1500 aagaggaaga gggtgaagaa gtaaactcga gctc 153 4 <![CDATA[ <210> 113]]>
<![CDATA[ <211> 1519]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5106 Synthetic HPV33L1 gene]]>
<![CDATA[ <400> 113]]> ctgggtacca tgagtgtgtg gagaccatct gaggctacag tctacctgcc tcctgtgcct 60 gtgagcaagg tggtgagcac agatgaatat gtgagcagga ccagcatcta ctactatgct 120 ggctccagca gactgctggc tgtgggacac ccat acttca gcatcaagaa cccaaccaat 180 gccaagaaac tgctggtgcc aaaggtgtct ggactccaat acagggtgtt cagggtgaga 240 ctgcctgacc caaacaagtt tggctttcct gacacctcct tctacaaccc tgacacccag 300 agactggtgt gggcttgt gt gggattggag attggcaggg gacaaccact gggagtgggc 360
atctctggac acccactgct gaacaagttt gatgacacag agacaggcaa caaataccct 420
ggacaacctg gagcagacaa cagggagtgt ctgagtatgg actacaagca gacccaactt 480
tgtctgctgg gctgtaagcc tccaacagga gaacactggg gcaagggagt ggcttgtacc 540
aatgctgccc ctgccaatga ctgtcctcca ttggaactga taaacaccat cattgaggat 600
ggagatatgg tggacacagg ctttggctgt atggacttca agaccctcca agccaacaag 660
tctgatgtgc caattgacat ctgtggcagc acttgtaaat accctgacta cctgaaaatg 720 acctctgaac catatggaga ctccctgttc ttcttcctga gggagggaaca gatgtttgtg 780 agacacttct tcaacagggc tggcaccctg ggagaggctg tgcctgatga cctctacatc 840 a agggctctg gcaccacagc cagcatccag tcctctgcct tctttccaac accatctggc 900 agtatggtga cctctgagag ccaacttttc aacaagccat actggctcca aagggctcaa 960 ggacacaaca atggcatctg ttggggcaac caggtgtttg tgacagtggt ggacaccacc 1020 aggagcacca atatgaccct gtgtacccag gtgacctctg acagcaccta caagaatgag 1080 aacttcaagg aatacatcag gcatgtggag gaatatgacc tccaatttgt gttccaactt 1140 tgtaaggtga ccctgacagc agaggtgatg acctacatcc atgctatgaa ccctgacatc 1200 ttggaggact ggcagtttgg actgacacct cctccatctg cctccctcca agacacctac 1260 aggtttgtga ccagccaggc tatcacttgt cagaagacag tgcctccaaa ggagaaggag 1320 gacccactgg gcaaatacac cttctgggag gtggacctga aaga gaagtt ctctgctgac 1380 ctggaccagt ttccactggg caggaagttc ctgctccaag caggactgaa agccaagcca 1440 aaactgaaaa gggctgcccc aaccagcacc aggacctcct ctgccaagag gaagaaggtg 1500 aagaagtaaa ctcgagctc 1519 <![CDATA[ <210> 114]]>
<![CDATA[ <211> 34]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5107 HPV51L1 F1]]> <![CDATA[ <400> 114]]> cttggtacca tggctctgtg gaggaccaat gaca 34 <![CDATA[ <210> 115]]>
<![CDATA[ <211> 32]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5108 HPV51L1 R1]]> <![CDATA[ <400> 115]]> gcttggcttt gactcccact tggagcagga ac 32 <![CDATA[ <210> 116]]>
<![CDATA[ <211> 1441]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5109 HPV51L1 expansion sequence 1]]>
<![CDATA[ [400] cccaaa gaccagcacc 180 agggctgcca tcccaaaggt gtctgccttc caatacaggg tgttcagggt ccaacttcct 240 gacccaaaca agtttggact gcctgaccca aacctctaca accctgacac agacagactg 300 gtgtggggct gtgtggggt ggaggtggg c aggggacaac cactgggagt gggactgtct 360 ggacacccac tgttcaacaa atatgatgac acagagaaca gcaggattgc caatggcaat 420 gcccaacagg atgtgaggga caacacctct gtggacaaca agcagaccca actttgtatc 480 attggctgtg cccctccaat tggaga acac tggggcattg gcaccacttg taagaacaca 540 cctgtgcctc ctggagactg tcctccattg gaactggtgt cctctgtgat tcaggatgga 600 gatatgattg acacaggctt tggagctatg gactttgctg ccctccaagc caccaagtct 660 gatgtgccac tggacatcag ccagtctgtg tgtaaatacc ctgactacct gaaaatgagt 720 gctgacacct atggcaacag tatgttcttc cacctgagga gggaacagat ttttgccaga 780 cactactaca acaaactggt gggagtggga gaggacatcc caaatgacta ctacatcaag 840 ggctctggca atggcagg ga cccaattgag tcctacatct actctgccac accatctggc 900 agtatgatta cctctgacag ccagattttc aacaagccat actggctgca cagggctcaa 960 ggacacaaca atggcatctg ttggaacaac caacttttca tcacttgtgt ggacaccacc 1020 aggagcacca acctgaccat cagcacagcc acagcagcag tgagcccaac cttcacacca 1080 agcaacttca agcaatacat cagacatgga gaggaatatg aactccaatt catcttccaa 1140 ctttgtaaga ttaccctgac cacagaggtg atggcttacc tgcacacaat ggacccaacc 1200 atcttggaac agtggaactt tggactgacc ctgcctccat ctgcctcctt ggaggatgcc 1260 tacaggtttg tgaggaatgc tgccacctcc tgtcagaagg acacacctcc acaggctaag 1320 cctgacccac tggctaaata caagttctgg gatgtggacc tgaaagagag gttctccctg 1380 gacctggacc agtttgccct gggcaggaag ttcctgctcc aagtggggagt caaagccaag 1440 c 1441 <![CDATA[ <210> 117]]>
<![CDATA[ <211> 35]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5110 HPV51L1 F2]]> <![CDATA[ <400> 117]]> agtggggagtc aaagccaagc caaaactgaa aaggg 35 <![CDATA[ <210> 118]]>
<![CDATA[ <211> 36]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5111 HPV51L1 R2]]> <![CDATA[ <400> 118]]> ctgtctagat ttacttcttc accttcttcc tcttgg 36 <![CDATA[ <210> 119]]>
<![CDATA[ <211> 101]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5112 HPV51L1 extended sequence 2]]>
<![CDATA[ <400> 119]]> agtggggagtc aaagccaagc caaaactgaa aagggctgcc ccaaccagca ccaggacctc 60 ctctgccaag aggaagaagg tgaagaagta aatctagaca g 101 <![CDATA[ <210> 120]]>
<![CDATA[ <211> 38]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5113 Amino acid sequence 471-508 of HPV type 59 L1 protein]]>
<![CDATA[ <400> 120]]> Leu Gln Leu Gly Ala Arg Pro Lys Pro Thr Ile Gly Pro Arg Lys Arg 1 5 10 15 Ala Ala Pro Ala Pro Thr Ser Thr Pro Ser Pro Lys Arg Val Lys Arg 20 25 30 Arg Lys Ser Ser Arg Lys 35 <![CDATA[ <210> 121]]>
<![CDATA[ <211> 478]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5201 HPV 52 L1 protein amino acid sequence 1-478]]>
<![CDATA[ <400> 121]]> Met Ser Val Trp Arg Pro Ser Glu Ala Thr Val Tyr Leu Pro Pro Val 1 5 10 15 Pro Val Ser Lys Val Val Ser Thr Asp Glu Tyr Val Ser Arg Thr Ser 20 25 30 Ile Tyr Tyr Tyr Ala Gly Ser Ser Arg Leu Leu Thr Val Gly His Pro 35 40 45 Tyr Phe Ser Ile Lys Asn Thr Ser Ser Ser Gly Asn Gly Lys Lys Val Leu 50 55 60 Val Pro Lys Val Ser Gly Leu Gln Tyr Arg Val Phe Arg Ile Lys Leu 65 70 75 80 Pro Asp Pro Asn Lys Phe Gly Phe Pro Asp Thr Ser Phe Tyr Asn Pro 85 90 95 Glu Thr Gln Arg Leu Val Trp Ala Cys Thr Gly Leu Glu Ile Gly Arg 100 105 110 Gly Gln Pro Leu Gly Val Gly Ile Ser Gly His Pro Leu Leu Asn Lys 115 120 125 Phe Asp Asp Thr Glu Thr Ser Asn Lys Tyr Ala Gly Lys Pro Gly Ile 130 135 140 Asp Asn Arg Glu Cys Leu Ser Met Asp Tyr Lys Gln Thr Gln Leu Cys 145 150 155 160 Ile Leu Gly Cys Lys Pro Pro Ile Gly Glu His Trp Gly Lys Gly Thr 165 170 175 Pro Cys Asn Asn Asn Ser Gly Asn Pro Gly Asp Cys Pro Pro Leu Gln 180 185 190 Leu Ile Asn Ser Val Ile Gln Asp Gly Asp Met Val Asp Thr Gly Phe 195 200 205 Gly Cys Met Asp Phe Asn Thr Leu Gln Ala Ser Lys Ser Asp Val Pro 210 215 220 Ile Asp Ile Cys Ser Ser Val Cys Lys Tyr Pro Asp Tyr Leu Gln Met 225 230 235 240 Ala Ser Glu Pro Tyr Gly Asp Ser Leu Phe Phe Phe Leu Arg Arg Glu 245 250 255 Gln Met Phe Val Arg His Phe Phe Asn Arg Ala Gly Thr Leu Gly Asp 260 265 270 Pro Val Pro Gly Asp Leu Tyr Ile Gln Gly Ser Asn Ser Gly Asn Thr 275 280 285 Ala Thr Val Gln Ser Ser Ala Phe Phe Pro Thr Pro Ser Gly Ser Met 290 295 300 Val Thr Ser Glu Ser Gln Leu Phe Asn Lys Pro Tyr Trp Leu Gln Arg 305 310 315 320 Ala Gln Gly His Asn Asn Gly Ile Cys Trp Gly Asn Gln Leu Phe Val 325 330 335 Thr Val Val Asp Thr Thr Arg Ser Thr Asn Met Thr Leu Cys Ala Glu 340 345 350 Val Lys Lys Glu Ser Thr Tyr Lys Asn Glu Asn Phe Lys Glu Tyr Leu 355 360 365 Arg His Gly Glu Glu Phe Asp Leu Gln Phe Ile Phe Gln Leu Cys Lys 370 375 380 Ile Thr Leu Thr Ala Asp Val Met Thr Tyr Ile His Lys Met Asp Ala 385 390 395 400 Thr Ile Leu Glu Asp Trp Gln Phe Gly Leu Thr Pro Pro Pro Ser Ala 405 410 415 Ser Leu Glu Asp Thr Tyr Arg Phe Val Thr Ser Thr Ala Ile Thr Cys 420 425 430 Gln Lys Asn Thr Pro Pro Lys Gly Lys Glu Asp Pro Leu Lys Asp Tyr 435 440 445 Met Phe Trp Glu Val Asp Leu Lys Glu Lys Phe Ser Ala Asp Leu Asp 450 455 460 Gln Phe Pro Leu Gly Arg Lys Phe Leu Leu Gln Ala Gly Leu 465 470 475 <![CDATA[ <210> 122]]>
<![CDATA[ <211> 26]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5202 HPV 33 L1 protein amino acid sequence 474-499]]>
<![CDATA[ <400> 122]]> Lys Ala Lys Pro Lys Leu Lys Arg Ala Ala Pro Thr Ser Thr Arg Thr 1 5 10 15 Ser Ser Ala Lys Arg Lys Lys Val Lys Lys 20 25 <![CDATA[ <210> 123]]>
<![CDATA[ <211> 504]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5203 Amino acid sequence of chimeric HPV type 52 L1 protein]]>
<![CDATA[ <400> 123]]> Met Ser Val Trp Arg Pro Ser Glu Ala Thr Val Tyr Leu Pro Pro Val 1 5 10 15 Pro Val Ser Lys Val Val Ser Thr Asp Glu Tyr Val Ser Arg Thr Ser 20 25 30 Ile Tyr Tyr Tyr Ala Gly Ser Ser Arg Leu Leu Thr Val Gly His Pro 35 40 45 Tyr Phe Ser Ile Lys Asn Thr Ser Ser Ser Gly Asn Gly Lys Lys Val Leu 50 55 60 Val Pro Lys Val Ser Gly Leu Gln Tyr Arg Val Phe Arg Ile Lys Leu 65 70 75 80 Pro Asp Pro Asn Lys Phe Gly Phe Pro Asp Thr Ser Phe Tyr Asn Pro 85 90 95 Glu Thr Gln Arg Leu Val Trp Ala Cys Thr Gly Leu Glu Ile Gly Arg 100 105 110 Gly Gln Pro Leu Gly Val Gly Ile Ser Gly His Pro Leu Leu Asn Lys 115 120 125 Phe Asp Asp Thr Glu Thr Ser Asn Lys Tyr Ala Gly Lys Pro Gly Ile 130 135 140 Asp Asn Arg Glu Cys Leu Ser Met Asp Tyr Lys Gln Thr Gln Leu Cys 145 150 155 160 Ile Leu Gly Cys Lys Pro Pro Ile Gly Glu His Trp Gly Lys Gly Thr 165 170 175 Pro Cys Asn Asn Asn Ser Gly Asn Pro Gly Asp Cys Pro Pro Leu Gln 180 185 190 Leu Ile Asn Ser Val Ile Gln Asp Gly Asp Met Val Asp Thr Gly Phe 195 200 205 Gly Cys Met Asp Phe Asn Thr Leu Gln Ala Ser Lys Ser Asp Val Pro 210 215 220 Ile Asp Ile Cys Ser Ser Val Cys Lys Tyr Pro Asp Tyr Leu Gln Met 225 230 235 240 Ala Ser Glu Pro Tyr Gly Asp Ser Leu Phe Phe Phe Leu Arg Arg Glu 245 250 255 Gln Met Phe Val Arg His Phe Phe Asn Arg Ala Gly Thr Leu Gly Asp 260 265 270 Pro Val Pro Gly Asp Leu Tyr Ile Gln Gly Ser Asn Ser Gly Asn Thr 275 280 285 Ala Thr Val Gln Ser Ser Ala Phe Phe Pro Thr Pro Ser Gly Ser Met 290 295 300 Val Thr Ser Glu Ser Gln Leu Phe Asn Lys Pro Tyr Trp Leu Gln Arg 305 310 315 320 Ala Gln Gly His Asn Asn Gly Ile Cys Trp Gly Asn Gln Leu Phe Val 325 330 335 Thr Val Val Asp Thr Thr Arg Ser Thr Asn Met Thr Leu Cys Ala Glu 340 345 350 Val Lys Lys Glu Ser Thr Tyr Lys Asn Glu Asn Phe Lys Glu Tyr Leu 355 360 365 Arg His Gly Glu Glu Phe Asp Leu Gln Phe Ile Phe Gln Leu Cys Lys 370 375 380 Ile Thr Leu Thr Ala Asp Val Met Thr Tyr Ile His Lys Met Asp Ala 385 390 395 400 Thr Ile Leu Glu Asp Trp Gln Phe Gly Leu Thr Pro Pro Pro Ser Ala 405 410 415 Ser Leu Glu Asp Thr Tyr Arg Phe Val Thr Ser Thr Ala Ile Thr Cys 420 425 430 Gln Lys Asn Thr Pro Pro Lys Gly Lys Glu Asp Pro Leu Lys Asp Tyr 435 440 445 Met Phe Trp Glu Val Asp Leu Lys Glu Lys Phe Ser Ala Asp Leu Asp 450 455 460 Gln Phe Pro Leu Gly Arg Lys Phe Leu Leu Gln Ala Gly Leu Lys Ala 465 470 475 480 Lys Pro Lys Leu Lys Arg Ala Ala Pro Thr Ser Thr Arg Thr Ser Ser 485 490 495 Ala Lys Arg Lys Lys Val Lys Lys 500 <![CDATA[ <210> 124]]>
<![CDATA[ <211> 1516]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5204 Nucleotide sequence of chimeric HPV type 52 L1 protein]]>
<![CDATA[ <400> 124]]> atgagcgtgt ggaggcccag cgaggccacc gtgtacctgc cccccgtgcc cgtgagcaag 60 gtggtgagca ccgacgagta cgtgagcagg accagcatct actactacgc cggcagcagc 120 aggctgctga ccgtgggcca cccctacttc agcat caaga acaccagcag cggcaacggc 180 aagaaggtgc tggtgcccaa ggtgagcggc ctgcagtaca gggtgttcag gatcaagctg 240 cccgacccca acaagttcgg cttccccgac accagcttct acaacccga gacccagagg 300 ctggtgtggg cctgcaccgg cc tggagatc ggcaggggcc agcccctggg cgtgggcatc 360 agcggccacc ccctgctgaa caagttcgac gacaccgaga ccagcaacaa gtacgccggc 420 aagcccggca tcgacaacag ggagtgcctg agcatggact acaagcagac ccagctgtgc 480 atcctgggct gcaagc cccc catcggcgag cactggggca agggcacccc ctgcaacaac 540 aacagcggca accccggcga ctgcccccccc ctgcagctga tcaacagcgt gatccaggac 600 ggcgacatgg tggacaccgg cttcggctgc atggacttca acaccctgca ggccagcaag 660 agcgacgtgc ccatcgacat ctgcagcagc gtgtgcaagt accccgacta cctgcagatg 720
gccagcgagc cctacggcga cagcctgttc ttcttcctga ggagggagca gatgttcgtg 780
aggcacttct tcaacagggc cggcaccctg ggcgaccccg tgcccggcga cctgtacatc 840
cagggcagca acagcggcaa caccgccacc gtgcagagca gcgccttctt ccccaccccc 900
agcggcagca tggtgaccag cgagagccag ctgttcaaca agccctactg gctgcagagg 960
gcccagggcc acaacaacgg catctgctgg ggcaaccagc tgttcgtgac cgtggtggac 1020 accaccagga gcaccaacat gaccctgtgc gccgaggtga agaaggagag cacctacaag 1080 aacgagaact tcaaggagta cctgaggcac ggcgaggagt tcgacctgca gttcatcttc 1140 cagctgtg ca agatcaccct gaccgccgac gtgatgacct acatccacaa gatggacgcc 1200 accatcctgg aggactggca gttcggcctg acccccccc ccagcgccag cctggaggac 1260 acctacaggt tcgtgaccag caccgccatc acctgccaga agaacaccccc ccccaagggc 132 0 aaggaggacc ccctgaagga ctacatgttc tgggaggtgg acctgaagga gaagttcagc 1380 gccgacctgg accagttccc cctgggcagg aagttcctgc tgcaggccgg cctgaaagcc 1440 aagccaaaac tgaaaagggc tgccccaacc agcaccagga cctcctctgc caagaggaag 1500 aaggtgaaga agtaaa 1516 <![CDATA[ <210> 125]]>
<![CDATA[ <211> 1531]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5205 Synthetic HPV52L1 gene]]>
<![CDATA[ <400> 125]]> ctgggtacca tgagcgtgtg gaggcccagc gaggccaccg tgtacctgcc ccccgtgccc 60 gtgagcaagg tggtgagcac cgacgagtac gtgagcagga ccagcatcta ctactacgcc 120 ggcagcagca ggctgctgac cgtgg gccac ccctacttca gcatcaagaa caccagcagc 180 ggcaacggca agaaggtgct ggtgcccaag gtgagcggcc tgcagtacag ggtgttcagg 240 atcaagctgc ccgaccccaa caagttcggc ttccccgaca ccagcttcta caaccccgag 300 acccagaggc tggtgtgggc ctgcaccggc ctggagatcg gcaggggcca gcccctgggc 360 gtgggcatca gcggccaccc cctgctgaac aagttcgacg acaccgagac cagcaacaag 420 tacgccggca agcccggcat cgacaacagg gagtgcctga gcatggacta caagcagacc 480 ca gctgtgca tcctgggctg caagcccccc atcggcgagc actggggcaa gggcaccccc 540 tgcaacaaca acagcggcaa ccccggcgac tgcccccccc tgcagctgat caacagcgtg 600 atccaggacg gcgacatggt ggacaccggc ttcggctgca tggacttcaa caccctgcag 660 gccagcaaga gcgacgtgcc catcgacatc tgcagcagcg tgtgcaagta ccccgactac 720 ctgcagatgg ccagcgagcc ctacggcgac agcctgttct tcttcctgag gagggagcag 780 atgttcgtga ggcacttctt caacagggcc ggcaccctgg gcgaccccgt gcccggcgac 840 ctgtacatcc agggcagcaa cagcggcaac accgccaccg tgcagagcag cgccttcttc 900 cccaccccca gcggcagcat ggtgaccagc gagagccagc tgttcaacaa gccctactgg 960 ctgcagaggg cccagggcca caacaacggc atctgctggg gcaaccagct gttcgtgacc 1020 gtggtggaca ccaccaggag caccaacatg accctgtgcg ccgaggtgaa gaaggagagc 1080 acctacaaga acgagaactt caaggagtac ctgaggcacg gcgaggagtt cga cctgcag 1140 ttcatcttcc agctgtgcaa gatcaccctg accgccgacg tgatgaccta catccacaag 1200 atggacgcca ccatcctgga ggactggcag ttcggcctga cccccccccc cagcgccagc 1260 ctggaggaca cctacaggtt cgtgaccagc accgccat ca cctgccagaa gaacaccccc 1320 cccaagggca aggaggaccc cctgaaggac tacatgttct gggaggtgga cctgaaggag 1380 aagttcagcg ccgacctgga ccagttcccc ctgggcagga agttcctgct gcaggccggc 1440 ctgcaggcca ggcccaag ct gaagaggccc gccagcagcg cccccaggac cagcaccaag 1500 aagaagaagg tgaagaggta aactcgagct c 1531 <![CDATA[ <210> 126]]>
<![CDATA[ <211> 1519]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <22]]>0> ]]>
<br/> <![CDATA[ <223> 5206 Synthetic HPV33L1 gene]]>
<br/>
<br/> <![CDATA[ <400>126]]> <br/> <![CDATA[ctgggtacca tgagtgtgtg gagaccatct gaggctacag tctacctgcc tcctgtgcct 60 gtgagcaagg tggtgagcac agatgaatat gtgagcagga ccagcatcta ctactatgct 120 ggctccagca gactgctggc tgtgggacac ccatacttca gcatcaaga a cccaaccaat 180 gccaagaaac tgctggtgcc aaaggtgtct ggactccaat acagggtgtt cagggtgaga 240 ctgcctgacc caaacaagtt tggctttcct gacacctcct tctacaaccc tgacacccag 300 agactggtgt gggcttgtgt gggattggag attggcaggg gacaaccact gggagtgggc 360 atctctggac acccactgct gaacaagttt gatgacacag agacaggcaa caaataccct 420 ggacaacctg gagcagacaa cagggagtgt ctgagtatgg actacaagca gacccaactt 480 tgtctgctgg gctgtaagcc tccaacagga gaacactgg g gcaagggagt ggcttgtacc 540 aatgctgccc ctgccaatga ctgtcctcca ttggaactga taaacaccat cattgaggat 600 ggagatatgg tggacacagg ctttggctgt atggacttca agaccctcca agccaacaag 660 tctgatgtgc caattgacat ctgtggca gc acttgtaaat accctgacta cctgaaaatg 720 acctctgaac catatggaga ctccctgttc ttcttcctga ggagggaaca gatgtttgtg 780 agacacttct tcaacagggc tggcaccctg ggagaggctg tgcctgatga cctctacatc 840 aagggctctg gcacc acagc cagcatccag tcctctgcct tctttccaac accatctggc 900 agtatggtga cctctgagag ccaacttttc aacaagccat actggctcca aagggctcaa 960 ggacacaaca atggcatctg ttggggcaac caggtgtttg tgacagtggt ggacaccacc 1020 aggag cacca atatgaccct gtgtacccag gtgacctctg acagcaccta caagaatgag 1080 aacttcaagg aatacatcag gcatgtggag gaatatgacc tccaatttgt gttccaactt 1140 tgtaaggtga ccctgacagc agaggtgatg acctacatcc atgctatgaa ccctgacatc 1200 ttggaggact ggcagtttgg actgacacct cctccatctg cctccctcca agacacctac 1260 aggtttgtga ccagccaggc tatcacttgt cagaagacag tgcctccaaa ggagaaggag 1320 gacccactgg gcaaatacac cttctggggag gtggacctga aagagaagtt ct ctgctgac 1380 ctggaccagt ttccactggg caggaagttc ctgctccaag caggactgaa agccaagcca 1440 aaactgaaaa gggctgcccc aaccagcacc aggacctcct ctgccaagag gaagaaggtg 1500 aagaagtaaa ctcgagctc 1519 <![CDATA[ <210> 127]]>
<![CDATA[ <211> 34]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5207 HPV52L1 F1]]> <![CDATA[ <400> 127]]> cttggtacca tgagcgtgtg gaggcccagc gagg 34 <![CDATA[ <210> 128]]>
<![CDATA[ <211> 35]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5208 HPV52L1 R1]]> <![CDATA[ <400> 128]]> gcttggcttt caggccggcc tgcagcagga acttc 35 <![CDATA[ <210> 129]]>
<![CDATA[ <211> 1453]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5209 HPV52L1 expansion sequence 1]]>
<![CDATA[ <400> 129]]> cttggtacca tgagcgtgtg gaggcccagc gaggccaccg tgtacctgcc ccccgtgccc 60 gtgagcaagg tggtgagcac cgacgagtac gtgagcagga ccagcatcta ctactacgcc 120 ggcagcagca ggctgctgac cgtgg gccac ccctacttca gcatcaagaa caccagcagc 180 ggcaacggca agaaggtgct ggtgcccaag gtgagcggcc tgcagtacag ggtgttcagg 240 atcaagctgc ccgaccccaa caagttcggc ttccccgaca ccagcttcta caaccccgag 300 acccagaggc tggtgtgggc ctgcaccggc ctggagatcg gcaggggcca gcccctgggc 360 gtgggcatca gcggccaccc cctgctgaac aagttcgacg acaccgagac cagcaacaag 420 tacgccggca agcccggcat cgacaacagg gagtgcctga gcatggacta caagcagacc 480 ca gctgtgca tcctgggctg caagcccccc atcggcgagc actggggcaa gggcaccccc 540 tgcaacaaca acagcggcaa ccccggcgac tgcccccccc tgcagctgat caacagcgtg 600 atccaggacg gcgacatggt ggacaccggc ttcggctgca tggacttcaa caccctgcag 660 gccagcaaga gcgacgtgcc catcgacatc tgcagcagcg tgtgcaagta ccccgactac 720 ctgcagatgg ccagcgagcc ctacggcgac agcctgttct tcttcctgag gagggagcag 780 atgttcgtga ggcacttctt caacagggcc ggcaccctgg gcgaccccgt gcccggcgac 840 ctgtacatcc agggcagcaa cagcggcaac accgccaccg tgcagagcag cgccttcttc 900 cccaccccca gcggcagcat ggtgaccagc gagagccagc tgttcaacaa gccctactgg 960 ctgcagaggg cccagggcca caacaacggc atctgctggg gcaaccagct gttcgtgacc 1020 gtggtggaca ccaccaggag caccaacatg accctgtgcg ccgaggtgaa gaaggagagc 1080 acctacaaga acgagaactt caaggagtac ctgaggcacg gcgaggagtt cga cctgcag 1140 ttcatcttcc agctgtgcaa gatcaccctg accgccgacg tgatgaccta catccacaag 1200 atggacgcca ccatcctgga ggactggcag ttcggcctga cccccccccc cagcgccagc 1260 ctggaggaca cctacaggtt cgtgaccagc accgccat ca cctgccagaa gaacaccccc 1320 cccaagggca aggaggaccc cctgaaggac tacatgttct gggaggtgga cctgaaggag 1380 aagttcagcg ccgacctgga ccagttcccc ctgggcagga agttcctgct gcaggccggc 1440 ctgaaagcca agc 1 453 <![CDATA[ <210> 130]]>
<![CDATA[ <211> 35]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5210 HPV52L1 F2]]> <![CDATA[ <400> 130]]> ggccggcctg aaagccaagc caaaactgaa aaggg 35 <![CDATA[ <210> 131]]>
<![CDATA[ <211> 36]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5211 HPV52L1 R2]]> <![CDATA[ <400> 131]]> ctgtctagat ttacttcttc accttcttcc tcttgg 36 <![CDATA[ <210> 132]]>
<![CDATA[ <211> 101]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5212 HPV52L1 expansion sequence 2]]>
<![CDATA[ <400> 132]]> ggccggcctg aaagccaagc caaaactgaa aagggctgcc ccaaccagca ccaggacctc 60 ctctgccaag aggaagaagg tgaagaagta aatctagaca g 101 <![CDATA[ <210> 133]]>
<![CDATA[ <211> 38]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5213 Amino acid sequence 471-508 of HPV type 59 L1 protein]]>
<![CDATA[ <400> 133]]> Leu Gln Leu Gly Ala Arg Pro Lys Pro Thr Ile Gly Pro Arg Lys Arg 1 5 10 15 Ala Pro Ala Ala Pro Thr Ser Thr Pro Ser Pro Lys Arg Val Lys Arg 20 25 30 Arg Lys Ser Ser Arg Lys 35
<![CDATA[ <210> 134]]>
<![CDATA[ <211> 467]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5601 HPV 56 L1 protein amino acid sequence 1-467]]>
<![CDATA[ <400> 134]]> Met Ala Thr Trp Arg Pro Ser Glu Asn Lys Val Tyr Leu Pro Pro Thr 1 5 10 15 Pro Val Ser Lys Val Val Ala Thr Asp Ser Tyr Val Lys Arg Thr Ser 20 25 30 Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Ala Val Gly His Pro 35 40 45 Tyr Tyr Ser Val Thr Lys Asp Asn Thr Lys Thr Asn Ile Pro Lys Val 50 55 60 Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Arg Leu Pro Asp Pro Asn 65 70 75 80 Lys Phe Gly Leu Pro Asp Thr Asn Ile Tyr Asn Pro Asp Gln Glu Arg 85 90 95 Leu Val Trp Ala Cys Val Gly Leu Glu Val Gly Arg Gly Gln Pro Leu 100 105 110 Gly Ala Gly Leu Ser Gly His Pro Leu Phe Asn Arg Leu Asp Asp Thr 115 120 125 Glu Ser Ser Asn Leu Ala Asn Asn Asn Val Ile Glu Asp Ser Arg Asp 130 135 140 Asn Ile Ser Val Asp Gly Lys Gln Thr Gln Leu Cys Ile Val Gly Cys 145 150 155 160 Thr Pro Ala Met Gly Glu His Trp Thr Lys Gly Ala Val Cys Lys Ser 165 170 175 Thr Gln Val Thr Thr Gly Asp Cys Pro Pro Leu Ala Leu Ile Asn Thr 180 185 190 Pro Ile Glu Asp Gly Asp Met Ile Asp Thr Gly Phe Gly Ala Met Asp 195 200 205 Phe Lys Val Leu Gln Glu Ser Lys Ala Glu Val Pro Leu Asp Ile Val 210 215 220 Gln Ser Thr Cys Lys Tyr Pro Asp Tyr Leu Lys Met Ser Ala Asp Ala 225 230 235 240 Tyr Gly Asp Ser Met Trp Phe Tyr Leu Arg Arg Glu Gln Leu Phe Ala 245 250 255 Arg His Tyr Phe Asn Arg Ala Gly Lys Val Gly Glu Thr Ile Pro Ala 260 265 270 Glu Leu Tyr Leu Lys Gly Ser Asn Gly Arg Glu Pro Pro Pro Ser Ser 275 280 285 Val Tyr Val Ala Thr Pro Ser Gly Ser Met Ile Thr Ser Glu Ala Gln 290 295 300 Leu Phe Asn Lys Pro Tyr Trp Leu Gln Arg Ala Gln Gly His Asn Asn 305 310 315 320 Gly Ile Cys Trp Gly Asn Gln Leu Phe Val Thr Val Val Asp Thr Thr 325 330 335 Arg Ser Thr Asn Met Thr Ile Ser Thr Ala Thr Glu Gln Leu Ser Lys 340 345 350 Tyr Asp Ala Arg Lys Ile Asn Gln Tyr Leu Arg His Val Glu Glu Tyr 355 360 365 Glu Leu Gln Phe Val Phe Gln Leu Cys Lys Ile Thr Leu Ser Ala Glu 370 375 380 Val Met Ala Tyr Leu His Asn Met Asn Ala Asn Leu Leu Glu Asp Trp 385 390 395 400 Asn Ile Gly Leu Ser Pro Pro Val Ala Thr Ser Leu Glu Asp Lys Tyr 405 410 415 Arg Tyr Val Arg Ser Thr Ala Ile Thr Cys Gln Arg Glu Gln Pro Pro 420 425 430 Thr Glu Lys Gln Asp Pro Leu Ala Lys Tyr Lys Phe Trp Asp Val Asn 435 440 445 Leu Gln Asp Ser Phe Ser Thr Asp Leu Asp Gln Phe Pro Leu Gly Arg 450 455 460 Lys Phe Leu 465
<![CDATA[ <210> 135]]>
<![CDATA[ <211> 31]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5602 HPV 33 L1 protein amino acid sequence 469-499]]>
<![CDATA[ <400> 135]]> Leu Gln Ala Gly Leu Lys Ala Lys Pro Lys Leu Lys Arg Ala Ala Pro 1 5 10 15 Thr Ser Thr Arg Thr Ser Ser Ala Lys Arg Lys Lys Val Lys Lys 20 25 30
<![CDATA[ <210> 136]]>
<![CDATA[ <211> 498]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5603 Amino acid sequence of chimeric HPV type 56 L1 protein]]>
<![CDATA[ <400> 136]]> Met Ala Thr Trp Arg Pro Ser Glu Asn Lys Val Tyr Leu Pro Pro Thr 1 5 10 15 Pro Val Ser Lys Val Val Ala Thr Asp Ser Tyr Val Lys Arg Thr Ser 20 25 30 Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Ala Val Gly His Pro 35 40 45 Tyr Tyr Ser Val Thr Lys Asp Asn Thr Lys Thr Asn Ile Pro Lys Val 50 55 60 Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Arg Leu Pro Asp Pro Asn 65 70 75 80 Lys Phe Gly Leu Pro Asp Thr Asn Ile Tyr Asn Pro Asp Gln Glu Arg 85 90 95 Leu Val Trp Ala Cys Val Gly Leu Glu Val Gly Arg Gly Gln Pro Leu 100 105 110 Gly Ala Gly Leu Ser Gly His Pro Leu Phe Asn Arg Leu Asp Asp Thr 115 120 125 Glu Ser Ser Asn Leu Ala Asn Asn Asn Val Ile Glu Asp Ser Arg Asp 130 135 140 Asn Ile Ser Val Asp Gly Lys Gln Thr Gln Leu Cys Ile Val Gly Cys 145 150 155 160 Thr Pro Ala Met Gly Glu His Trp Thr Lys Gly Ala Val Cys Lys Ser 165 170 175 Thr Gln Val Thr Thr Gly Asp Cys Pro Pro Leu Ala Leu Ile Asn Thr 180 185 190 Pro Ile Glu Asp Gly Asp Met Ile Asp Thr Gly Phe Gly Ala Met Asp 195 200 205 Phe Lys Val Leu Gln Glu Ser Lys Ala Glu Val Pro Leu Asp Ile Val 210 215 220 Gln Ser Thr Cys Lys Tyr Pro Asp Tyr Leu Lys Met Ser Ala Asp Ala 225 230 235 240 Tyr Gly Asp Ser Met Trp Phe Tyr Leu Arg Arg Glu Gln Leu Phe Ala 245 250 255 Arg His Tyr Phe Asn Arg Ala Gly Lys Val Gly Glu Thr Ile Pro Ala 260 265 270 Glu Leu Tyr Leu Lys Gly Ser Asn Gly Arg Glu Pro Pro Pro Ser Ser 275 280 285 Val Tyr Val Ala Thr Pro Ser Gly Ser Met Ile Thr Ser Glu Ala Gln 290 295 300 Leu Phe Asn Lys Pro Tyr Trp Leu Gln Arg Ala Gln Gly His Asn Asn 305 310 315 320 Gly Ile Cys Trp Gly Asn Gln Leu Phe Val Thr Val Val Asp Thr Thr 325 330 335 Arg Ser Thr Asn Met Thr Ile Ser Thr Ala Thr Glu Gln Leu Ser Lys 340 345 350 Tyr Asp Ala Arg Lys Ile Asn Gln Tyr Leu Arg His Val Glu Glu Tyr 355 360 365 Glu Leu Gln Phe Val Phe Gln Leu Cys Lys Ile Thr Leu Ser Ala Glu 370 375 380 Val Met Ala Tyr Leu His Asn Met Asn Ala Asn Leu Leu Glu Asp Trp 385 390 395 400 Asn Ile Gly Leu Ser Pro Pro Val Ala Thr Ser Leu Glu Asp Lys Tyr 405 410 415 Arg Tyr Val Arg Ser Thr Ala Ile Thr Cys Gln Arg Glu Gln Pro Pro 420 425 430 Thr Glu Lys Gln Asp Pro Leu Ala Lys Tyr Lys Phe Trp Asp Val Asn 435 440 445 Leu Gln Asp Ser Phe Ser Thr Asp Leu Asp Gln Phe Pro Leu Gly Arg 450 455 460 Lys Phe Leu Leu Gln Ala Gly Leu Lys Ala Lys Pro Lys Leu Lys Arg 465 470 475 480 Ala Ala Pro Thr Ser Thr Arg Thr Ser Ser Ala Lys Arg Lys Lys Val 485 490 495 Lys Lys
<![CDATA[ <210> 137]]>
<![CDATA[ <211> 1498]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5604 Nucleotide sequence of chimeric HPV type 56 L1 protein]]>
<![CDATA[ <400> 137]]> atggctacct ggagaccatc tgagaacaag gtctacctgc ctccaacacc tgtgagcaag 60 gtggtggcta cagactccta tgtgaagagg accagcatct tctaccatgc tggctccagc 120 agactgctgg ctgtgggaca cccatactac tctgtgacca aggacaacac caagaccaac 180 atcccaaagg tgtctgccta ccaatacagg gtgttcaggg tgagactgcc tgacccaaac 240 aagtttggac tgcctgacac caacatctac aaccctgacc aggagagact ggtgtgggct 300 tgtgtgggat tggaggtggg caggggacaa ccactgggag caggactgtc tggacacccca 360 ctgttcaaca gactggatga cacagagtcc agcaacctgg ctaacaacaa tgtgattgag 420 gacagcaggg acaacatctc tgtggatggc aagcagaccc aactttgtat tgtgggctgt 480 actcctgcta tggggagaaca ctggaccaag ggagcagtgt gtaagagcac ccaggtgacc 540 acaggagact gtcctccact ggctctgata aacacaccaa ttgaggatgg agatatgatt 600 gacacaggct ttggagctat ggacttcaag gtgctccaag agagcaaggc tgaggtgcca 660 ctggacattg tccagagcac ttgtaaatac cctgactacc tgaaaatgag tgctgatgcc 720 tatggagaca gtatgtggtt ctacctgagg agggaacaac tttttgccag acactacttc 780 aacagggctg gcaaggtggg agagaccatc cctgctgaac tctacctgaa aggcagcaat 840 ggcagggaac ctcctccatc ctctgtctat gtggctacac catctggcag tatgattacc 900 tctgaggctc aacttttcaa caagccatac tggctccaaa gggctcaagg acacaacaat 960 ggcatctgtt ggggcaacca actttttgtg acagtggtgg acaccaccag gagcaccaat 1020 atgaccatca gcacagccac agaacaactt agcaaatatg atgccaggaa gataaaccaa 1080 tacctgaggc atgtggagga atatgaactc caatttgtgt tccaactttg taagattacc 1140 ctgtctgctg aggtgatggc ttacctgcac aatatgaatg ccaacctgtt ggaggactgg 1200 aacattggac tgagccctcc tgtggctacc tccttggagg acaaatacag atatgtgagg 1260 agcacagcca tcacttgtca gagggaacaa cctccaacag agaagcagga cccactggct 1320 aaatacaagt tctggggatgt gaacctccaa gactccttca gcacagacct ggaccagttt 1380 ccactgggca ggaagttcct gctccaagca ggactgaaag ccaagccaaa actgaaaagg 1440 gctgccccaa ccagcaccag gacctcctct gccaagagga agaaggtgaa gaagtaaa 1498
<![CDATA[ <210> 138]]>
<![CDATA[ <211> 1519]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5605 Synthetic HPV56L1 gene]]>
<![CDATA[ <400> 138]]> ctgggtacca tggctacctg gagaccatct gagaacaagg tctacctgcc tccaacacct 60 gtgagcaagg tggtggctac agactcctat gtgaagagga ccagcatctt ctaccatgct 120 ggctccagca gactgctggc tgtgggacac ccatactact ctgtgaccaa ggacaacacc 180 aagaccaaca tcccaaaggt gtctgcctac caatacaggg tgttcagggt gagactgcct 240 gacccaaaca agtttggact gcctgacacc aacatctaca accctgacca ggagagactg 300 gtgtgggctt gtgtgggatt ggaggtgggc aggggacaac cactgggagc aggactgtct 360 ggacacccac tgttcaacag actggatgac acagagtcca gcaacctggc taacaacaat 420 gtgattgagg acagcaggga caacatctct gtggatggca agcagaccca actttgtatt 480 gtgggctgta ctcctgctat gggagaacac tggaccaagg gagcagtgtg taagagcacc 540 caggtgacca caggagactg tcctccactg gctctgataa acacaccaat tgaggatgga 600 gatatgattg acacaggctt tggagctatg gacttcaagg tgctccaaga gagcaaggct 660 gaggtgccac tggacattgtccagagcact tgtaaatacc ctgactacct gaaaatgagt 720 gctgatgcct atggagacag tatgtggttc tacctgagga gggaacaact ttttgccaga 780 cactacttca acagggctgg caaggtggga gagaccatcc ctgctgaact ctacctgaaa 840 ggcagcaatg gcagggaacc tcctccatcc tctgtctatg tggctacacc atctggcagt 900 atgattacct ctgaggctca acttttcaac aagccatact ggctccaaag ggctcaagga 960 cacaacaatg gcatctgttg gggcaaccaa ctttttgtga cagtggtgga caccaccagg 1020 agcaccaata tgaccatcag cacagccaca gaacaactta gcaaatatga tgccaggaag 1080 ataaaccaat acctgaggca tgtggaggaa tatgaactcc aatttgtgtt ccaactttgt 1140 aagattaccc tgtctgctga ggtgatggct tacctgcaca atatgaatgc caacctgttg 1200 gaggactgga acattggact gagccctcct gtggctacct ccttggagga caaatacaga 1260 tatgtgagga gcacagccat cacttgtcag agggaacaac ctccaacaga gaagcaggac 1320 ccactggcta aatacaagtt ctggggatgtg aacctccaag actccttcag cacagacctg 1380 gaccagtttc cactgggcag gaagttcctg atgcaacttg gcaccaggag caagcctgct 1440 gtggctacca gcaagaagag gtctgcccca accagcacca gcacacctgc caagaggaag 1500 aggaggtaaa ctcgagctc 1519
<![CDATA[ <210> 139]]>
<![CDATA[ <211> 1519]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5606 Synthetic HPV33L1 gene]]>
<![CDATA[ <400> 139]]> ctgggtacca tgagtgtgtg gagaccatct gaggctacag tctacctgcc tcctgtgcct 60 gtgagcaagg tggtgagcac agatgaatat gtgagcagga ccagcatcta ctactatgct 120 ggctccagca gactgctggc tgtgggacac ccatacttca gcatcaagaa cccaaccaat 180 gccaagaaac tgctggtgcc aaaggtgtct ggactccaat acagggtgtt cagggtgaga 240 ctgcctgacc caaacaagtt tggctttcct gacacctcct tctacaaccc tgacacccag 300 agactggtgt gggcttgtgt gggattggag attggcaggg gacaaccact gggagtgggc 360 atctctggac acccactgct gaacaagttt gatgacacag agacaggcaa caaataccct 420 ggacaacctg gagcagacaa cagggagtgt ctgagtatgg actacaagca gacccaactt 480 tgtctgctgg gctgtaagcc tccaacagga gaacactggg gcaagggagt ggcttgtacc 540 aatgctgccc ctgccaatga ctgtcctcca ttggaactga taaacaccat cattgaggat 600 ggagatatgg tggacacagg ctttggctgt atggacttca agaccctcca agccaacaag 660 tctgatgtgc caattgacat ctgtggcagc acttgtaaat accctgacta cctgaaaatg 720 acctctgaac catatggaga ctccctgttc ttcttcctga ggagggaaca gatgtttgtg 780 agacacttct tcaacagggc tggcaccctg ggagaggctg tgcctgatga cctctacatc 840 aagggctctg gcaccacagc cagcatccag tcctctgcct tctttccaac accatctggc 900 agtatggtga cctctgagag ccaacttttc aacaagccat actggctcca aagggctcaa 960 ggacacaaca atggcatctg ttggggcaac caggtgtttg tgacagtggt ggacaccacc 1020 aggagcacca atatgaccct gtgtacccag gtgacctctg acagcaccta caagaatgag 1080 aacttcaagg aatacatcag gcatgtggag gaatatgacc tccaatttgt gttccaactt 1140 tgtaaggtga ccctgacagc agaggtgatg acctacatcc atgctatgaa ccctgacatc 1200 ttggaggact ggcagtttgg actgacacct cctccatctg cctccctcca agacacctac 1260 aggtttgtga ccagccaggc tatcacttgt cagaagacag tgcctccaaa ggagaaggag 1320 gacccactgg gcaaatacac cttctggggag gtggacctga aagagaagtt ctctgctgac 1380 ctggaccagt ttccactggg caggaagttc ctgctccaag caggactgaa agccaagcca 1440 aaactgaaaa gggctgcccc aaccagcacc aggacctcct ctgccaagag gaagaaggtg 1500 aagaagtaaa ctcgagctc 1519
<![CDATA[ <210> 140]]>
<![CDATA[ <211> 33]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5607 HPV56L1 F1]]>
<![CDATA[ <400> 140]]> cttggtacca tggctacctg gagaccatct gag 33
<![CDATA[ <210> 141]]>
<![CDATA[ <211> 31]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5608 HPV56L1 R1]]>
<![CDATA[ <400> 141]]> ctgcttggag caggaacttc ctgcccagtg g 31
<![CDATA[ <210> 142]]>
<![CDATA[ <211> 1420]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5609 HPV56L1 expansion sequence 1]]>
<![CDATA[ <400> 142]]> cttggtacca tggctacctg gagaccatct gagaacaagg tctacctgcc tccaacacct 60 gtgagcaagg tggtggctac agactcctat gtgaagagga ccagcatctt ctaccatgct 120 ggctccagca gactgctggc tgtgggacac ccatactact ctgtgaccaa ggacaacacc 180 aagaccaaca tcccaaaggt gtctgcctac caatacaggg tgttcagggt gagactgcct 240 gacccaaaca agtttggact gcctgacacc aacatctaca accctgacca ggagagactg 300 gtgtgggctt gtgtgggatt ggaggtgggc aggggacaac cactgggagc aggactgtct 360 ggacacccac tgttcaacag actggatgac acagagtcca gcaacctggc taacaacaat 420 gtgattgagg acagcaggga caacatctct gtggatggca agcagaccca actttgtatt 480 gtgggctgta ctcctgctat gggagaacac tggaccaagg gagcagtgtg taagagcacc 540 caggtgacca caggagactg tcctccactg gctctgataa acacaccaat tgaggatgga 600 gatatgattg acacaggctt tggagctatg gacttcaagg tgctccaaga gagcaaggct 660 gaggtgccac tggacattgtccagagcact tgtaaatacc ctgactacct gaaaatgagt 720 gctgatgcct atggagacag tatgtggttc tacctgagga gggaacaact ttttgccaga 780 cactacttca acagggctgg caaggtggga gagaccatcc ctgctgaact ctacctgaaa 840 ggcagcaatg gcagggaacc tcctccatcc tctgtctatg tggctacacc atctggcagt 900 atgattacct ctgaggctca acttttcaac aagccatact ggctccaaag ggctcaagga 960 cacaacaatg gcatctgttg gggcaaccaa ctttttgtga cagtggtgga caccaccagg 1020 agcaccaata tgaccatcag cacagccaca gaacaactta gcaaatatga tgccaggaag 1080 ataaaccaat acctgaggca tgtggaggaa tatgaactcc aatttgtgtt ccaactttgt 1140 aagattaccc tgtctgctga ggtgatggct tacctgcaca atatgaatgc caacctgttg 1200 gaggactgga acattggact gagccctcct gtggctacct ccttggagga caaatacaga 1260 tatgtgagga gcacagccat cacttgtcag agggaacaac ctccaacaga gaagcaggac 1320 ccactggcta aatacaagtt ctggggatgtg aacctccaag actccttcag cacagacctg 1380 gaccagtttc cactgggcag gaagttcctg ctccaagcag 1420
<![CDATA[ <210> 143]]>
<![CDATA[ <211> 36]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5610 HPV56L1 F2]]> <![CDATA[ <400> 143]]> gaagttcctg ctccaagcag gactgaaagc caagcc 36 <![CDATA[ <210> 144]]>
<![CDATA[ <211> 36]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5611 HPV56L1 R2]]> <![CDATA[ <400> 144]]> ctgtctagat ttacttcttc accttcttcc tcttgg 36 <![CDATA[ <210> 145]]>
<![CDATA[ <211> 116]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5612 HPV56L1 expansion sequence 2]]>
<![CDATA[ <400> 145]]> gaagttcctg ctccaagcag gactgaaagc caagccaaaa ctgaaaaggg ctgccccaac 60 cagcaccagg acctcctctg ccaagaggaa gaaggtgaag aagtaaatct agacag 116 <![CDATA[ <210> 146]]>
<![CDATA[ <211> 38]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5613 HPV 59 L1 protein amino acid sequence 471-508]]>
<![CDATA[ <400> 146]]> Leu Gln Leu Gly Ala Arg Pro Lys Pro Thr Ile Gly Pro Arg Lys Arg 1 5 10 15 Ala Ala Pro Ala Pro Thr Ser Thr Pro Ser Pro Lys Arg Val Lys Arg 20 25 30 Arg Lys Ser Ser Arg Lys 35 <![CDATA[ <210> 147]]>
<![CDATA[ <211> 473]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5801 HPV 58 L1 protein amino acid sequence 1-473]]>
<![CDATA[ <400> 147]]> Met Ser Val Trp Arg Pro Ser Glu Ala Thr Val Tyr Leu Pro Pro Val 1 5 10 15 Pro Val Ser Lys Val Val Ser Thr Asp Glu Tyr Val Ser Arg Thr Ser 20 25 30 Ile Tyr Tyr Tyr Ala Gly Ser Ser Arg Leu Leu Ala Val Gly Asn Pro 35 40 45 Tyr Phe Ser Ile Lys Ser Pro Asn Asn Asn Lys Lys Val Leu Val Pro 50 55 60 Lys Val Ser Gly Leu Gln Tyr Arg Val Phe Arg Val Arg Leu Pro Asp 65 70 75 80 Pro Asn Lys Phe Gly Phe Pro Asp Thr Ser Phe Tyr Asn Pro Asp Thr 85 90 95 Gln Arg Leu Val Trp Ala Cys Val Gly Leu Glu Ile Gly Arg Gly Gln 100 105 110 Pro Leu Gly Val Gly Val Ser Gly His Pro Tyr Leu Asn Lys Phe Asp 115 120 125 Asp Thr Glu Thr Ser Asn Arg Tyr Pro Ala Gln Pro Gly Ser Asp Asn 130 135 140 Arg Glu Cys Leu Ser Met Asp Tyr Lys Gln Thr Gln Leu Cys Leu Ile 145 150 155 160 Gly Cys Lys Pro Pro Thr Gly Glu His Trp Gly Lys Gly Val Ala Cys 165 170 175 Asn Asn Asn Ala Ala Ala Thr Asp Cys Pro Pro Leu Glu Leu Phe Asn 180 185 190 Ser Ile Ile Glu Asp Gly Asp Met Val Asp Thr Gly Phe Gly Cys Met 195 200 205 Asp Phe Gly Thr Leu Gln Ala Asn Lys Ser Asp Val Pro Ile Asp Ile 210 215 220 Cys Asn Ser Thr Cys Lys Tyr Pro Asp Tyr Leu Lys Met Ala Ser Glu 225 230 235 240 Pro Tyr Gly Asp Ser Leu Phe Phe Phe Leu Arg Arg Glu Gln Met Phe 245 250 255 Val Arg His Phe Phe Asn Arg Ala Gly Lys Leu Gly Glu Ala Val Pro 260 265 270 Asp Asp Leu Tyr Ile Lys Gly Ser Gly Asn Thr Ala Val Ile Gln Ser 275 280 285 Ser Ala Phe Phe Pro Thr Pro Ser Gly Ser Ile Val Thr Ser Glu Ser 290 295 300 Gln Leu Phe Asn Lys Pro Tyr Trp Leu Gln Arg Ala Gln Gly His Asn 305 310 315 320 Asn Gly Ile Cys Trp Gly Asn Gln Leu Phe Val Thr Val Val Asp Thr 325 330 335 Thr Arg Ser Thr Asn Met Thr Leu Cys Thr Glu Val Thr Lys Glu Gly 340 345 350 Thr Tyr Lys Asn Asp Asn Phe Lys Glu Tyr Val Arg His Val Glu Glu 355 360 365 Tyr Asp Leu Gln Phe Val Phe Gln Leu Cys Lys Ile Thr Leu Thr Ala 370 375 380 Glu Ile Met Thr Tyr Ile His Thr Met Asp Ser Asn Ile Leu Glu Asp 385 390 395 400 Trp Gln Phe Gly Leu Thr Pro Pro Pro Ser Ala Ser Leu Gln Asp Thr 405 410 415 Tyr Arg Phe Val Thr Ser Gln Ala Ile Thr Cys Gln Lys Thr Ala Pro 420 425 430 Pro Lys Glu Lys Glu Asp Pro Leu Asn Lys Tyr Thr Phe Trp Glu Val 435 440 445 Asn Leu Lys Glu Lys Phe Ser Ala Asp Leu Asp Gln Phe Pro Leu Gly 450 455 460 Arg Lys Phe Leu Leu Gln Ser Gly Leu 465 470 <![CDATA[ <210> 148]]>
<![CDATA[ <211> 26]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5802 HPV 33 L1 protein amino acid sequence 474-499]]>
<![CDATA[ <400> 148]]> Lys Ala Lys Pro Lys Leu Lys Arg Ala Ala Pro Thr Ser Thr Arg Thr 1 5 10 15 Ser Ser Ala Lys Arg Lys Lys Val Lys Lys 20 25 <![CDATA[ <210> 149]]>
<![CDATA[ <211> 499]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5803 Amino acid sequence of chimeric HPV type 58 L1 protein]]>
<![CDATA[ <400> 149]]> Met Ser Val Trp Arg Pro Ser Glu Ala Thr Val Tyr Leu Pro Pro Val 1 5 10 15 Pro Val Ser Lys Val Val Ser Thr Asp Glu Tyr Val Ser Arg Thr Ser 20 25 30 Ile Tyr Tyr Tyr Ala Gly Ser Ser Arg Leu Leu Ala Val Gly Asn Pro 35 40 45 Tyr Phe Ser Ile Lys Ser Pro Asn Asn Asn Lys Lys Val Leu Val Pro 50 55 60 Lys Val Ser Gly Leu Gln Tyr Arg Val Phe Arg Val Arg Leu Pro Asp 65 70 75 80 Pro Asn Lys Phe Gly Phe Pro Asp Thr Ser Phe Tyr Asn Pro Asp Thr 85 90 95 Gln Arg Leu Val Trp Ala Cys Val Gly Leu Glu Ile Gly Arg Gly Gln 100 105 110 Pro Leu Gly Val Gly Val Ser Gly His Pro Tyr Leu Asn Lys Phe Asp 115 120 125 Asp Thr Glu Thr Ser Asn Arg Tyr Pro Ala Gln Pro Gly Ser Asp Asn 130 135 140 Arg Glu Cys Leu Ser Met Asp Tyr Lys Gln Thr Gln Leu Cys Leu Ile 145 150 155 160 Gly Cys Lys Pro Pro Thr Gly Glu His Trp Gly Lys Gly Val Ala Cys 165 170 175 Asn Asn Asn Ala Ala Ala Thr Asp Cys Pro Pro Leu Glu Leu Phe Asn 180 185 190 Ser Ile Ile Glu Asp Gly Asp Met Val Asp Thr Gly Phe Gly Cys Met 195 200 205 Asp Phe Gly Thr Leu Gln Ala Asn Lys Ser Asp Val Pro Ile Asp Ile 210 215 220 Cys Asn Ser Thr Cys Lys Tyr Pro Asp Tyr Leu Lys Met Ala Ser Glu 225 230 235 240 Pro Tyr Gly Asp Ser Leu Phe Phe Phe Leu Arg Arg Glu Gln Met Phe 245 250 255 Val Arg His Phe Phe Asn Arg Ala Gly Lys Leu Gly Glu Ala Val Pro 260 265 270 Asp Asp Leu Tyr Ile Lys Gly Ser Gly Asn Thr Ala Val Ile Gln Ser 275 280 285 Ser Ala Phe Phe Pro Thr Pro Ser Gly Ser Ile Val Thr Ser Glu Ser 290 295 300 Gln Leu Phe Asn Lys Pro Tyr Trp Leu Gln Arg Ala Gln Gly His Asn 305 310 315 320 Asn Gly Ile Cys Trp Gly Asn Gln Leu Phe Val Thr Val Val Asp Thr 325 330 335 Thr Arg Ser Thr Asn Met Thr Leu Cys Thr Glu Val Thr Lys Glu Gly 340 345 350 Thr Tyr Lys Asn Asp Asn Phe Lys Glu Tyr Val Arg His Val Glu Glu 355 360 365 Tyr Asp Leu Gln Phe Val Phe Gln Leu Cys Lys Ile Thr Leu Thr Ala 370 375 380 Glu Ile Met Thr Tyr Ile His Thr Met Asp Ser Asn Ile Leu Glu Asp 385 390 395 400 Trp Gln Phe Gly Leu Thr Pro Pro Pro Ser Ala Ser Leu Gln Asp Thr 405 410 415 Tyr Arg Phe Val Thr Ser Gln Ala Ile Thr Cys Gln Lys Thr Ala Pro 420 425 430 Pro Lys Glu Lys Glu Asp Pro Leu Asn Lys Tyr Thr Phe Trp Glu Val 435 440 445 Asn Leu Lys Glu Lys Phe Ser Ala Asp Leu Asp Gln Phe Pro Leu Gly 450 455 460 Arg Lys Phe Leu Leu Gln Ser Gly Leu Lys Ala Lys Pro Lys Leu Lys 465 470 475 480 Arg Ala Ala Pro Thr Ser Thr Arg Thr Ser Ser Ala Lys Arg Lys Lys 485 490 495 Val Lys Lys <![CDATA[ <210> 150]]>
<![CDATA[ <211> 1501]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5804 Nucleotide sequence of chimeric HPV type 58 L1 protein]]>
<![CDATA[ <400> 150]]> atgagcgtgt ggaggcccag cgaggccacc gtgtacctgc cccccgtgcc cgtgagcaag 60 gtggtgagca ccgacgagta cgtgagcagg accagcatct actactacgc cggcagcagc 120 aggctgctgg ccgtgggcaa cccctacttc agcat caaga gccccaacaa caacaagaag 180 gtgctggtgc ccaaggtgag cggcctgcag tacagggtgt tcagggtgag gctgcccgac 240 cccaacaagt tcggcttccc cgacaccagc ttctacaacc ccgacaccca gaggctggtg 300 tgggcctgcg tg ggcctgga gatcggcagg ggccagcccc tgggcgtggg cgtgagcggc 360 cacccctacc tgaacaagtt cgacgacacc gagaccagca acaggtaccc cgcccagccc 420 ggcagcgaca acagggagtg cctgagcatg gactacaagc agacccagct gtgcctgatc 480 ggctg caagc cccccaccgg cgagcactgg ggcaagggcg tggcctgcaa caacaacgcc 540 gccgccaccg actgcccccc cctggagctg ttcaacagca tcatcgagga cggcgacatg 600 gtggacaccg gcttcggctg catggacttc ggcaccctgc aggccaacaa gagcgacgtg 660 cccatcgaca tctgcaacag cacctgcaag taccccgact acctgaagat ggccagcgag 720 ccctacggcg acagcctgtt cttcttcctg aggagggagc agatgttcgt gaggcacttc 780 ttcaacaggg ccggcaagct gggcgaggcc gtgcccgacg acc tgtacat caagggcagc 840 ggcaacaccg ccgtgatcca gagcagcgcc ttcttcccca cccccagcgg cagcatcgtg 900 accagcgaga gccagctgtt caacaagccc tactggctgc agagggccca gggccacaac 960 aacggcatct gctggggcaa ccagctgttc gtgaccgtgg tggacacccac caggagcacc 1020 aacatgaccc tgtgcaccga ggtgaccaag gagggcacct acaagaacga caacttcaag 1080 gagtacgtga ggcacgtgga ggagtacgac ctgcagttcg tgttccagct gtgcaagatc 1140 accctgaccg cc gagatcat gacctacatc cacaccatgg acagcaacat cctggaggac 1200 tggcagttcg gcctgacccc cccccccagc gccagcctgc aggacaccta caggttcgtg 1260 accagccagg ccatcacctg ccagaagacc gcccccccca aggagaagga ggaccccctg 1320 aacaag taca ccttctggga ggtgaacctg aaggagaagt tcagcgccga cctggaccag 1380 ttccccctgg gcaggaagtt cctgctgcag agcggcctga aagccaagcc aaaactgaaa 1440 agggctgccc caaccagcac caggacctcc tctgccaaga ggaagaaggt gaagaagta a 1500 a 1501 <![CDATA[ <210> 151]]>
<![CDATA[ <211> 1516]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5805 Synthetic HPV58L1 gene]]>
<![CDATA[ <400> 151]]> ctgggtacca tgagcgtgtg gaggcccagc gaggccaccg tgtacctgcc ccccgtgccc 60 gtgagcaagg tggtgagcac cgacgagtac gtgagcagga ccagcatcta ctactacgcc 120 ggcagcagca ggctgctggc cgtgg gcaac ccctacttca gcatcaagag ccccaacaac 180 aacaagaagg tgctggtgcc caaggtgagc ggcctgcagt acagggtgtt cagggtgagg 240 ctgcccgacc ccaacaagtt cggcttcccc gacaccagct tctacaaccc cgacacccag 300 aggctggtgt gggcc tgcgt gggcctggag atcggcaggg gccagcccct gggcgtgggc 360 gtgagcggcc acccctacct gaacaagttc gacgacaccg agaccagcaa caggtacccc 420 gcccagcccg gcagcgacaa cagggagtgc ctgagcatgg actacaagca gacccagctg 480 tgcctgatcg gctgcaagcc ccccaccggc gagcactggg gcaagggcgt ggcctgcaac 540 aacaacgccg ccgccaccga ctgcccccccc ctggagctgt tcaacagcat catcgaggac 600 ggcgacatgg tggacaccgg cttcggctgc atggacttcg gcaccctgca ggccaacaag 660 agcgacgtgc ccatcgacat ctgcaacagc acctgcaagt accccgacta cctgaagatg 720 gccagcgagc cctacggcga cagcctgttc ttcttcctga ggagggagca gatgttcgtg 780 aggcacttct tcaacagggc cggcaagctg ggcgaggcc g tgcccgacga cctgtacatc 840 aagggcagcg gcaacaccgc cgtgatccag agcagcgcct tcttccccac ccccagcggc 900 agcatcgtga ccagcgagag ccagctgttc aacaagccct actggctgca gagggcccag 960 ggccacaaca acggcatctg ct gggcaac cagctgttcg tgaccgtggt ggacaccacc 1020 aggagcacca acatgaccct gtgcaccgag gtgaccaagg agggcaccta caagaacgac 1080 aacttcaagg agtacgtgag gcacgtggag gagtacgacc tgcagttcgt gttccagctg 1140 tgcaagatca ccctgaccgc cgagatcatg acctacatcc acaccatgga cagcaacatc 1200 ctggaggact ggcagttcgg cctgaccccc ccccccagcg ccagcctgca ggacacctac 1260 aggttcgtga ccagccaggc catcacctgc cagaagaccg ccccccccaa ggagaaggag 1320 gaccccctga acaagtacac cttctggggag gtgaacctga aggagaagtt cagcgccgac 1380 ctggaccagt tccccctggg caggaagttc ctgctgcaga gcggcctgaa ggccaagccc 1440 aggctgaaga ggagcgcccc caccaccagg gcccccagca ccaagaggaa gaaggtgaag 15 00 aagtaaactc gagctc 1516 <![CDATA[ <210> 152]]>
<![CDATA[ <211> 1519]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5806 Synthetic HPV33L1 gene]]>
<![CDATA[ <400> 152]]> ctgggtacca tgagtgtgtg gagaccatct gaggctacag tctacctgcc tcctgtgcct 60 gtgagcaagg tggtgagcac agatgaatat gtgagcagga ccagcatcta ctactatgct 120 ggctccagca gactgctggc tgtgggacac ccat acttca gcatcaagaa cccaaccaat 180 gccaagaaac tgctggtgcc aaaggtgtct ggactccaat acagggtgtt cagggtgaga 240 ctgcctgacc caaacaagtt tggctttcct gacacctcct tctacaaccc tgacacccag 300 agactggtgt gggcttgt gt gggattggag attggcaggg gacaaccact gggagtgggc 360
atctctggac acccactgct gaacaagttt gatgacacag agacaggcaa caaataccct 420
ggacaacctg gagcagacaa cagggagtgt ctgagtatgg actacaagca gacccaactt 480
tgtctgctgg gctgtaagcc tccaacagga gaacactggg gcaagggagt ggcttgtacc 540
aatgctgccc ctgccaatga ctgtcctcca ttggaactga taaacaccat cattgaggat 600
ggagatatgg tggacacagg ctttggctgt atggacttca agaccctcca agccaacaag 660
tctgatgtgc caattgacat ctgtggcagc acttgtaaat accctgacta cctgaaaatg 720 acctctgaac catatggaga ctccctgttc ttcttcctga gggagggaaca gatgtttgtg 780 agacacttct tcaacagggc tggcaccctg ggagaggctg tgcctgatga cctctacatc 840 a agggctctg gcaccacagc cagcatccag tcctctgcct tctttccaac accatctggc 900 agtatggtga cctctgagag ccaacttttc aacaagccat actggctcca aagggctcaa 960 ggacacaaca atggcatctg ttggggcaac caggtgtttg tgacagtggt ggacaccacc 1020 aggagcacca atatgaccct gtgtacccag gtgacctctg acagcaccta caagaatgag 1080 aacttcaagg aatacatcag gcatgtggag gaatatgacc tccaatttgt gttccaactt 1140 tgtaaggtga ccctgacagc agaggtgatg acctacatcc atgctatgaa ccctgacatc 1200 ttggaggact ggcagtttgg actgacacct cctccatctg cctccctcca agacacctac 1260 aggtttgtga ccagccaggc tatcacttgt cagaagacag tgcctccaaa ggagaaggag 1320 gacccactgg gcaaatacac cttctgggag gtggacctga aaga gaagtt ctctgctgac 1380 ctggaccagt ttccactggg caggaagttc ctgctccaag caggactgaa agccaagcca 1440 aaactgaaaa gggctgcccc aaccagcacc aggacctcct ctgccaagag gaagaaggtg 1500 aagaagtaaa ctcgagctc 1519 <![CDATA[ <210> 153]]>
<![CDATA[ <211> 34]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5807 HPV58L1 F1]]> <![CDATA[ <400> 153]]> cttggtacca tgagcgtgtg gaggcccagc gagg 34 <![CDATA[ <210> 154]]>
<![CDATA[ <211> 36]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5808 HPV58L1 R1]]> <![CDATA[ <400> 154]]> gcttggcttt caggccgctc tgcagcagga acttcc 36 <![CDATA[ <210> 155]]>
<![CDATA[ <211> 1438]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5809 HPV58L1 expansion sequence 1]]>
<![CDATA[ <400> 155]]> cttggtacca tgagcgtgtg gaggcccagc gaggccaccg tgtacctgcc ccccgtgccc 60 gtgagcaagg tggtgagcac cgacgagtac gtgagcagga ccagcatcta ctactacgcc 120 ggcagcagca ggctgctggc cgtgg gcaac ccctacttca gcatcaagag ccccaacaac 180 aacaagaagg tgctggtgcc caaggtgagc ggcctgcagt acagggtgtt cagggtgagg 240 ctgcccgacc ccaacaagtt cggcttcccc gacaccagct tctacaaccc cgacacccag 300 aggctggtgt gggcc tgcgt gggcctggag atcggcaggg gccagcccct gggcgtgggc 360 gtgagcggcc acccctacct gaacaagttc gacgacaccg agaccagcaa caggtacccc 420 gcccagcccg gcagcgacaa cagggagtgc ctgagcatgg actacaagca gacccagctg 480 tgcctgatcg gctgcaagcc ccccaccggc gagcactggg gcaagggcgt ggcctgcaac 540 aacaacgccg ccgccaccga ctgcccccccc ctggagctgt tcaacagcat catcgaggac 600 ggcgacatgg tggacaccgg cttcggctgc atggacttcg gcaccctgca ggccaacaag 660 agcgacgtgc ccatcgacat ctgcaacagc acctgcaagt accccgacta cctgaagatg 720 gccagcgagc cctacggcga cagcctgttc ttcttcctga ggagggagca gatgttcgtg 780 aggcacttct tcaacagggc cggcaagctg ggcgaggcc g tgcccgacga cctgtacatc 840 aagggcagcg gcaacaccgc cgtgatccag agcagcgcct tcttccccac ccccagcggc 900 agcatcgtga ccagcgagag ccagctgttc aacaagccct actggctgca gagggcccag 960 ggccacaaca acggcatctg ct gggcaac cagctgttcg tgaccgtggt ggacaccacc 1020 aggagcacca acatgaccct gtgcaccgag gtgaccaagg agggcaccta caagaacgac 1080 aacttcaagg agtacgtgag gcacgtggag gagtacgacc tgcagttcgt gttccagctg 1140 tgcaagatca ccctgaccgc cgagatcatg acctacatcc acaccatgga cagcaacatc 1200 ctggaggact ggcagttcgg cctgaccccc ccccccagcg ccagcctgca ggacacctac 1260 aggttcgtga ccagccaggc catcacctgc cagaagaccg ccccccccaa ggagaaggag 1320 gaccccctga acaagtacac cttctggggag gtgaacctga aggagaagtt cagcgccgac 1380 ctggaccagt tccccctggg caggaagttc ctgctgcaga gcggcctgaa agccaagc 1438 <![CDATA[ <210> 156]]>
<![CDATA[ <211> 35]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5810 HPV58L1 F2]]> <![CDATA[ <400> 156]]> gagcggcctg aaagccaagc caaaactgaa aaggg 35 <![CDATA[ <210> 157]]>
<![CDATA[ <211> 36]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5811 HPV58L1 R2]]> <![CDATA[ <400> 157]]> ctgtctagat ttacttcttc accttcttcc tcttgg 36 <![CDATA[ <210> ]]>158
<![CDATA[ <211> 101]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5812 HPV58L1 expansion sequence 2]]>
<![CDATA[ <400> 158]]> gagcggcctg aaagccaagc caaaactgaa aagggctgcc ccaaccagca ccaggacctc 60 ctctgccaag aggaagaagg tgaagaagta aatctagaca g 101 <![CDATA[ <210> 159]]>
<![CDATA[ <211> 38]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5813 HPV 59 L1 protein amino acid sequence 471-508]]>
<![CDATA[ <400> 159]]> Leu Gln Leu Gly Ala Arg Pro Lys Pro Thr Ile Gly Pro Arg Lys Arg 1 5 10 15 Ala Ala Pro Ala Pro Thr Ser Thr Pro Ser Pro Lys Arg Val Lys Arg 20 25 30 Arg Lys Ser Ser Arg Lys 35 <![CDATA[ <210> 160]]>
<![CDATA[ <211> 508]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5901 Amino acid sequence of HPV type 59 L1 protein]]>
<![CDATA[ <400> 160]]> Met Ala Leu Trp Arg Ser Ser Asp Asn Lys Val Tyr Leu Pro Pro Pro 1 5 10 15 Ser Val Ala Lys Val Val Ser Thr Asp Glu Tyr Val Thr Arg Thr Ser 20 25 30 Ile Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly His Pro 35 40 45 Tyr Phe Lys Val Pro Lys Gly Gly Asn Gly Arg Gln Asp Val Pro Lys 50 55 60 Val Ser Ala Tyr Gln Tyr Arg Val Phe Arg Val Lys Leu Pro Asp Pro 65 70 75 80 Asn Lys Phe Gly Leu Pro Asp Asn Thr Val Tyr Asp Pro Asn Ser Gln 85 90 95 Arg Leu Val Trp Ala Cys Val Gly Val Glu Ile Gly Arg Gly Gln Pro 100 105 110 Leu Gly Val Gly Leu Ser Gly His Pro Leu Tyr Asn Lys Leu Asp Asp 115 120 125 Thr Glu Asn Ser His Val Ala Ser Ala Val Asp Thr Lys Asp Thr Arg 130 135 140 Asp Asn Val Ser Val Asp Tyr Lys Gln Thr Gln Leu Cys Ile Ile Gly 145 150 155 160 Cys Val Pro Ala Ile Gly Glu His Trp Thr Lys Gly Thr Ala Cys Lys 165 170 175 Pro Thr Thr Val Val Gln Gly Asp Cys Pro Pro Leu Glu Leu Ile Asn 180 185 190 Thr Pro Ile Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met 195 200 205 Asp Phe Lys Leu Leu Gln Asp Asn Lys Ser Glu Val Pro Leu Asp Ile 210 215 220 Cys Gln Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Gln Met Ser Ala Asp 225 230 235 240 Ala Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gln Val Phe 245 250 255 Ala Arg His Phe Trp Asn Arg Ser Gly Thr Met Gly Asp Gln Leu Pro 260 265 270 Glu Ser Leu Tyr Ile Lys Gly Thr Asp Ile Arg Ala Asn Pro Gly Ser 275 280 285 Tyr Leu Tyr Ser Pro Ser Pro Ser Gly Ser Val Val Thr Ser Asp Ser 290 295 300 Gln Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gln Gly Leu Asn 305 310 315 320 Asn Gly Ile Cys Trp His Asn Gln Leu Phe Leu Thr Val Val Asp Thr 325 330 335 Thr Arg Ser Thr Asn Leu Ser Val Cys Ala Ser Thr Thr Ser Ser Ile 340 345 350 Pro Asn Val Tyr Thr Pro Thr Ser Phe Lys Glu Tyr Ala Arg His Val 355 360 365 Glu Glu Phe Asp Leu Gln Phe Ile Phe Gln Leu Cys Lys Ile Thr Leu 370 375 380 Thr Thr Glu Val Met Ser Tyr Ile His Asn Met Asn Thr Thr Ile Leu 385 390 395 400 Glu Asp Trp Asn Phe Gly Val Thr Pro Pro Pro Thr Ala Ser Leu Val 405 410 415 Asp Thr Tyr Arg Phe Val Gln Ser Ala Ala Val Thr Cys Gln Lys Asp 420 425 430 Thr Ala Pro Pro Val Lys Gln Asp Pro Tyr Asp Lys Leu Lys Phe Trp 435 440 445 Pro Val Asp Leu Lys Glu Arg Phe Ser Ala Asp Leu Asp Gln Phe Pro 450 455 460 Leu Gly Arg Lys Phe Leu Leu Gln Leu Gly Ala Arg Pro Lys Pro Thr 465 470 475 480 Ile Gly Pro Arg Lys Arg Ala Ala Pro Ala Pro Thr Ser Thr Pro Ser 485 490 495 Pro Lys Arg Val Lys Arg Arg Lys Ser Ser Arg Lys 500 505 <![CDATA[ <210> 161]]>
<![CDATA[ <211> 1528]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Homo sapiens]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5902 Nucleotide sequence of HPV type 59 L1 protein]]>
<![CDATA[ <400> 161]]> atggctctgt ggaggtcctc tgacaacaag gtctacctgc ctcctccatc tgtggctaag 60 gtggtgagca cagatgaata tgtgaccagg accagcatct tctaccatgc tggctccagc 120 agactgctga cagtgggaca cccatacttc aaggtg ccaa agggaggcaa tggcagacag 180 gatgtgccaa aggtgtctgc ctaccaatac agggtgttca gggtgaaact gcctgaccca 240 aacaagtttg gactgcctga caacacagtc tatgacccaa acagccagag actggtgtgg 300 gcttgtgtgg gagtggagat t ggcagggga caaccactgg gagtgggact gtctggacac 360 ccactctaca acaaactgga tgacacagag aactctcatg tggcatctgc tgtggacacc 420 aaggacacca gggacaatgt gtctgtggac tacaagcaga cccaactttg tatcattggc 480 tgtgtgcctg ccattggaga a cactggacc aagggcacag cctgtaagcc aaccacagtg 540 gtccagggag actgtcctcc attggaactg ataaacacac caattgagga tggagatatg 600 gtggacacag gctatggagc tatggacttc aaactgctcc aagacaacaa gtctgaggtg 660 ccactggaca tctgtcaga g catctgtaaa taccctgact acctccaaat gagtgctgat 720 gcctatggag acagtatgtt cttctgtctg aggagggaac aggtgtttgc cagacacttc 780 tggaacaggt ctggcacaat gggagaccaa cttcctgagt ccctctacat caagggcaca 840 gacatca ggg ctaaccctgg ctcctacctc tacagcccaa gcccatctgg ctctgtggtg 900 acctctgaca gccaactttt caacaagcca tactggctgc acaaggctca aggactgaac 960 aatggcatct gttggcacaa ccaacttttc ctgacagtgg tggacaccac caggagcacc 1020 aacctgtctg tgtgtgccag caccacctcc agcatcccaa atgtctacac accaacctcc 1080 ttcaaggaat atgccaggca tgtggaggag tttgacctcc aattcatctt ccaactttgt 1140 aagattaccc tgaccacaga ggtgatgagt tacatccaca atatgaacac caccat cttg 1200 gaggactgga actttggagt gacacctcct ccaacagcct ccctggtgga cacctacagg 1260 tttgtccagt ctgctgctgt gacttgtcag aaggacacag cccctcctgt gaagcaggac 1320 ccatatgaca aactgaagtt ctggcctgtg gacctgaa agaggttctc tgctgacctg 1380 gaccagtttc cactgggcag gaagttcctg ctccaacttg gagccagacc aaagccaacc 1440 attggaccaa ggaagagggc tgcccctgcc ccaaccagca caccaagccc aaagagggtg 1500 aagaggagga agtccagcag gaagtaaa 15 28 <![CDATA[ <210> 162]]>
<![CDATA[ <211> 1546]]>
<![CDATA[ <212> DNA]]>
<![CDATA[ <213> Artificial sequence]]>
<![CDATA[ <220> ]]>
<![CDATA[ <223> 5903 Synthetic HPV59L1 gene]]>
<![CDATA[ [400] ca aggtgccaaa gggaggcaat 180 ggcagacagg atgtgccaaa ggtgtctgcc taccaataca gggtgttcag ggtgaaactg 240 cctgacccaa acaagtttgg actgcctgac aacacagtct atgacccaaa cagccagaga 300 ctggtgtggg cttgtgtggg agtggagatt ggcaggggac aaccactggg agtgggactg 360 tctggacacc cactctacaa caaactggat gacacagaga actctcatgt ggcatctgct 420 gtggacacca aggacaccag ggacaatgtg tctgtggact acaagcagac ccaactttgt 480 atcattggct gtgtgcctgc cattggagaa cactggacca agggcacagc ctgtaagcca 540 accacagtgg tccagggaga ctgtcctcca ttggaactga taaacacacc aattgaggat 600 ggagatatgg tggacacagg ctatggagct atggacttca aactgctcca agacaacaag 660 tctgaggtgc cactggacat ctgtc agagc atctgtaaat accctgacta cctccaaatg 720 agtgctgatg cctatggaga cagtatgttc ttctgtctga ggagggaaca ggtgtttgcc 780 agacacttct ggaacaggtc tggcacaatg ggagaccaac ttcctgagtc cctctacatc 840 aagggca cag acatcagggc taaccctggc tcctacctct acagcccaag cccatctggc 900 tctgtggtga cctctgacag ccaacttttc aacaagccat actggctgca caaggctcaa 960 ggactgaaca atggcatctg ttggcacaac caacttttcc tgacagtggt ggacaccacc 1020 aggagcacca acctgtctgt gtgtgccagc accacctcca gcatcccaaa tgtctacaca 1080 ccaacctcct tcaaggaata tgccaggcat gtggaggagt ttgacctcca attcatcttc 1140 caactttgta agattaccct gaccacagag gtgatgagtt acatccacaa tatgaacac c 1200 accatcttgg aggactggaa ctttggagtg acacctcctc caacagcctc cctggtggac 1260 acctacaggt ttgtccagtc tgctgctgtg acttgtcaga aggacacagc ccctcctgtg 1320 aagcaggacc catatgacaa actgaagttc tggcctgt gg acctgaaaga gaggttctct 1380 gctgacctgg accagtttcc actgggcagg aagttcctgc tccaacttgg agccagacca 1440 aagccaacca ttggaccaag gaagagggct gcccctgccc caaccagcac accaagccca 1500 aagagggtga agaggaggaa gtccagcagg aagtaaactc ctc 1546
Claims (17)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
TW109135395A TWI858159B (en) | 2020-10-13 | Human papillomavirus polyvalent immunogenic composition |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
TW109135395A TWI858159B (en) | 2020-10-13 | Human papillomavirus polyvalent immunogenic composition |
Publications (2)
Publication Number | Publication Date |
---|---|
TW202214671A TW202214671A (en) | 2022-04-16 |
TWI858159B true TWI858159B (en) | 2024-10-11 |
Family
ID=
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20060127979A1 (en) | 2000-06-21 | 2006-06-15 | Wilson Susan D | Chimeric human papillomavirus (HPV) L1 molecules and uses therefor |
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20060127979A1 (en) | 2000-06-21 | 2006-06-15 | Wilson Susan D | Chimeric human papillomavirus (HPV) L1 molecules and uses therefor |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2020317321B2 (en) | Polyvalent immunogenicity composition for human papillomavirus | |
CN114127098B (en) | Chimeric human papillomavirus 51 type L1 protein | |
KR102640490B1 (en) | Chimeric papillomavirus L1 protein | |
CN114127100B (en) | Chimeric human papillomavirus 39 type L1 protein | |
CN114127127B (en) | Chimeric human papillomavirus 35 type L1 protein | |
CN114127097B (en) | Chimeric human papillomavirus 56-type L1 protein | |
CN114174319B (en) | Chimeric human papillomavirus 52 type L1 protein | |
CN114127095B (en) | Chimeric human papillomavirus 11-type L1 protein | |
CN114127096B (en) | Chimeric human papillomavirus 31-type L1 protein | |
TWI858159B (en) | Human papillomavirus polyvalent immunogenic composition | |
CN114127093B (en) | Chimeric human papilloma virus 45-type L1 protein | |
CN114174320B (en) | Chimeric human papillomavirus 18-type L1 protein | |
CN114127295B (en) | Chimeric human papillomavirus 16-type L1 protein | |
CN114127094B (en) | Chimeric human papillomavirus 58 type L1 protein | |
WO2021013067A1 (en) | Chimeric human papillomavirus type 6 l1 protein | |
RU2808002C2 (en) | Chimeric protein l1 of papillomavirus | |
RU2806424C2 (en) | Polyvalent immunogenic composition against human papillomavirus | |
TW202214671A (en) | Human papillomavirus multivalent immunogenic composition wherein the chimeric papillomavirus L1 protein comprises an N-terminal fragment derived from the first type papillomavirus L1 protein and a C-terminal fragment derived from the second type papillomavirus L1 protein |