CN101300353A - 用于制备抗虫转基因植物的杀虫组合物和方法 - Google Patents
用于制备抗虫转基因植物的杀虫组合物和方法 Download PDFInfo
- Publication number
- CN101300353A CN101300353A CNA2006800404600A CN200680040460A CN101300353A CN 101300353 A CN101300353 A CN 101300353A CN A2006800404600 A CNA2006800404600 A CN A2006800404600A CN 200680040460 A CN200680040460 A CN 200680040460A CN 101300353 A CN101300353 A CN 101300353A
- Authority
- CN
- China
- Prior art keywords
- val
- ile
- thr
- leu
- lys
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 241000238631 Hexapoda Species 0.000 title claims abstract description 78
- 230000009261 transgenic effect Effects 0.000 title claims abstract description 60
- 238000000034 method Methods 0.000 title claims abstract description 50
- 230000000749 insecticidal effect Effects 0.000 title claims description 90
- 239000000203 mixture Substances 0.000 title claims description 87
- 241000196324 Embryophyta Species 0.000 claims abstract description 198
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 145
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 110
- 235000002017 Zea mays subsp mays Nutrition 0.000 claims abstract description 62
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 claims abstract description 60
- 235000005822 corn Nutrition 0.000 claims abstract description 60
- 241000258937 Hemiptera Species 0.000 claims abstract description 50
- 241000489975 Diabrotica Species 0.000 claims abstract description 6
- 241000489972 Diabrotica barberi Species 0.000 claims abstract description 6
- 241000381325 Diabrotica virgifera zeae Species 0.000 claims abstract description 6
- 241000489976 Diabrotica undecimpunctata howardi Species 0.000 claims abstract description 5
- 241001529600 Diabrotica balteata Species 0.000 claims abstract description 4
- 241000916731 Diabrotica speciosa Species 0.000 claims abstract description 4
- 241000916730 Diabrotica viridula Species 0.000 claims abstract description 4
- 230000004927 fusion Effects 0.000 claims abstract description 4
- 210000004027 cell Anatomy 0.000 claims description 192
- 230000014509 gene expression Effects 0.000 claims description 147
- 235000013311 vegetables Nutrition 0.000 claims description 78
- 239000002773 nucleotide Substances 0.000 claims description 75
- 125000003729 nucleotide group Chemical group 0.000 claims description 75
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 74
- 230000001580 bacterial effect Effects 0.000 claims description 50
- 241000254173 Coleoptera Species 0.000 claims description 40
- 238000009825 accumulation Methods 0.000 claims description 37
- 241000209140 Triticum Species 0.000 claims description 36
- 235000021307 Triticum Nutrition 0.000 claims description 35
- 240000007594 Oryza sativa Species 0.000 claims description 29
- 235000007164 Oryza sativa Nutrition 0.000 claims description 28
- 210000003763 chloroplast Anatomy 0.000 claims description 28
- 235000009566 rice Nutrition 0.000 claims description 28
- 244000078534 Vaccinium myrtillus Species 0.000 claims description 24
- 241000607479 Yersinia pestis Species 0.000 claims description 23
- 239000003053 toxin Substances 0.000 claims description 23
- 231100000765 toxin Toxicity 0.000 claims description 23
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 22
- 235000013305 food Nutrition 0.000 claims description 21
- 244000025254 Cannabis sativa Species 0.000 claims description 19
- 241000193830 Bacillus <bacterium> Species 0.000 claims description 17
- 235000003095 Vaccinium corymbosum Nutrition 0.000 claims description 16
- 235000017537 Vaccinium myrtillus Nutrition 0.000 claims description 16
- 235000021014 blueberries Nutrition 0.000 claims description 16
- 230000009545 invasion Effects 0.000 claims description 16
- 238000001514 detection method Methods 0.000 claims description 15
- 241001233957 eudicotyledons Species 0.000 claims description 15
- 102000040650 (ribonucleotides)n+m Human genes 0.000 claims description 14
- 244000068988 Glycine max Species 0.000 claims description 14
- 235000010469 Glycine max Nutrition 0.000 claims description 13
- 238000006243 chemical reaction Methods 0.000 claims description 13
- 241000209510 Liliopsida Species 0.000 claims description 12
- 244000046109 Sorghum vulgare var. nervosum Species 0.000 claims description 12
- 238000002360 preparation method Methods 0.000 claims description 11
- 210000000582 semen Anatomy 0.000 claims description 11
- 241001124134 Chrysomelidae Species 0.000 claims description 10
- 239000000843 powder Substances 0.000 claims description 10
- 235000007319 Avena orientalis Nutrition 0.000 claims description 9
- 235000007558 Avena sp Nutrition 0.000 claims description 9
- 241000894006 Bacteria Species 0.000 claims description 9
- 229920000742 Cotton Polymers 0.000 claims description 9
- 240000005979 Hordeum vulgare Species 0.000 claims description 9
- 241001414823 Lygus hesperus Species 0.000 claims description 9
- 244000061456 Solanum tuberosum Species 0.000 claims description 9
- 244000144730 Amygdalus persica Species 0.000 claims description 8
- 240000007087 Apium graveolens Species 0.000 claims description 8
- 235000015849 Apium graveolens Dulce Group Nutrition 0.000 claims description 8
- 235000010591 Appio Nutrition 0.000 claims description 8
- 244000003416 Asparagus officinalis Species 0.000 claims description 8
- 235000005340 Asparagus officinalis Nutrition 0.000 claims description 8
- 235000000832 Ayote Nutrition 0.000 claims description 8
- 241000219310 Beta vulgaris subsp. vulgaris Species 0.000 claims description 8
- 244000178993 Brassica juncea Species 0.000 claims description 8
- 235000005855 Brassica juncea var. subintegrifolia Nutrition 0.000 claims description 8
- 240000002791 Brassica napus Species 0.000 claims description 8
- 235000011293 Brassica napus Nutrition 0.000 claims description 8
- 235000011299 Brassica oleracea var botrytis Nutrition 0.000 claims description 8
- 240000003259 Brassica oleracea var. botrytis Species 0.000 claims description 8
- 235000000540 Brassica rapa subsp rapa Nutrition 0.000 claims description 8
- 235000005979 Citrus limon Nutrition 0.000 claims description 8
- 244000131522 Citrus pyriformis Species 0.000 claims description 8
- 240000008067 Cucumis sativus Species 0.000 claims description 8
- 240000004244 Cucurbita moschata Species 0.000 claims description 8
- 235000009854 Cucurbita moschata Nutrition 0.000 claims description 8
- 235000009804 Cucurbita pepo subsp pepo Nutrition 0.000 claims description 8
- 240000008620 Fagopyrum esculentum Species 0.000 claims description 8
- 235000016623 Fragaria vesca Nutrition 0.000 claims description 8
- 240000009088 Fragaria x ananassa Species 0.000 claims description 8
- 235000011363 Fragaria x ananassa Nutrition 0.000 claims description 8
- 244000020551 Helianthus annuus Species 0.000 claims description 8
- 244000017020 Ipomoea batatas Species 0.000 claims description 8
- 235000002678 Ipomoea batatas Nutrition 0.000 claims description 8
- 240000008415 Lactuca sativa Species 0.000 claims description 8
- 235000011430 Malus pumila Nutrition 0.000 claims description 8
- 235000015103 Malus silvestris Nutrition 0.000 claims description 8
- 235000002637 Nicotiana tabacum Nutrition 0.000 claims description 8
- 244000061176 Nicotiana tabacum Species 0.000 claims description 8
- 244000046052 Phaseolus vulgaris Species 0.000 claims description 8
- 235000010582 Pisum sativum Nutrition 0.000 claims description 8
- 240000004713 Pisum sativum Species 0.000 claims description 8
- 235000009827 Prunus armeniaca Nutrition 0.000 claims description 8
- 244000018633 Prunus armeniaca Species 0.000 claims description 8
- 235000006040 Prunus persica var persica Nutrition 0.000 claims description 8
- 241000220324 Pyrus Species 0.000 claims description 8
- 235000017848 Rubus fruticosus Nutrition 0.000 claims description 8
- 235000007238 Secale cereale Nutrition 0.000 claims description 8
- 240000003768 Solanum lycopersicum Species 0.000 claims description 8
- 235000002597 Solanum melongena Nutrition 0.000 claims description 8
- 244000061458 Solanum melongena Species 0.000 claims description 8
- 235000002595 Solanum tuberosum Nutrition 0.000 claims description 8
- 235000021536 Sugar beet Nutrition 0.000 claims description 8
- 240000001717 Vaccinium macrocarpon Species 0.000 claims description 8
- 235000021029 blackberry Nutrition 0.000 claims description 8
- 235000021019 cranberries Nutrition 0.000 claims description 8
- 230000001186 cumulative effect Effects 0.000 claims description 8
- 235000013399 edible fruits Nutrition 0.000 claims description 8
- 235000021017 pears Nutrition 0.000 claims description 8
- 235000015136 pumpkin Nutrition 0.000 claims description 8
- 244000241257 Cucumis melo Species 0.000 claims description 7
- 235000010799 Cucumis sativus var sativus Nutrition 0.000 claims description 7
- 235000009419 Fagopyrum esculentum Nutrition 0.000 claims description 7
- 235000003222 Helianthus annuus Nutrition 0.000 claims description 7
- 235000003228 Lactuca sativa Nutrition 0.000 claims description 7
- 235000007688 Lycopersicon esculentum Nutrition 0.000 claims description 7
- 235000010726 Vigna sinensis Nutrition 0.000 claims description 7
- 235000021028 berry Nutrition 0.000 claims description 7
- 230000008488 polyadenylation Effects 0.000 claims description 7
- 238000000926 separation method Methods 0.000 claims description 7
- 231100000419 toxicity Toxicity 0.000 claims description 7
- 230000001988 toxicity Effects 0.000 claims description 7
- 235000009847 Cucumis melo var cantalupensis Nutrition 0.000 claims description 6
- 235000007340 Hordeum vulgare Nutrition 0.000 claims description 6
- 230000005030 transcription termination Effects 0.000 claims description 6
- 240000007241 Agrostis stolonifera Species 0.000 claims description 5
- 235000017060 Arachis glabrata Nutrition 0.000 claims description 5
- 244000105624 Arachis hypogaea Species 0.000 claims description 5
- 235000010777 Arachis hypogaea Nutrition 0.000 claims description 5
- 235000018262 Arachis monticola Nutrition 0.000 claims description 5
- 241000209202 Bromus secalinus Species 0.000 claims description 5
- 244000052363 Cynodon dactylon Species 0.000 claims description 5
- 240000004585 Dactylis glomerata Species 0.000 claims description 5
- 241000234642 Festuca Species 0.000 claims description 5
- 235000020232 peanut Nutrition 0.000 claims description 5
- 241001466007 Heteroptera Species 0.000 claims description 4
- 150000001875 compounds Chemical group 0.000 claims description 4
- 235000012343 cottonseed oil Nutrition 0.000 claims description 4
- 230000002538 fungal effect Effects 0.000 claims description 4
- 108020001507 fusion proteins Proteins 0.000 claims description 4
- 102000037865 fusion proteins Human genes 0.000 claims description 4
- 238000000746 purification Methods 0.000 claims description 4
- 241000907223 Bruchinae Species 0.000 claims description 3
- 241000254171 Curculionidae Species 0.000 claims description 3
- 241001427543 Elateridae Species 0.000 claims description 3
- 235000007039 Euchlaena mexicana Nutrition 0.000 claims description 3
- 206010061217 Infestation Diseases 0.000 claims description 3
- 241000254062 Scarabaeidae Species 0.000 claims description 3
- 241000607757 Xenorhabdus Species 0.000 claims description 3
- 235000002485 Zea mays ssp. mexicana Nutrition 0.000 claims description 3
- 241000209152 Zea mays subsp. mexicana Species 0.000 claims description 3
- 239000006227 byproduct Substances 0.000 claims description 3
- 241000489977 Diabrotica virgifera Species 0.000 claims description 2
- 235000013339 cereals Nutrition 0.000 claims description 2
- 238000004821 distillation Methods 0.000 claims description 2
- 238000004519 manufacturing process Methods 0.000 claims description 2
- 239000003921 oil Substances 0.000 claims description 2
- 235000019198 oils Nutrition 0.000 claims description 2
- 239000007787 solid Substances 0.000 claims description 2
- 235000012424 soybean oil Nutrition 0.000 claims description 2
- 239000003549 soybean oil Substances 0.000 claims description 2
- 241000209149 Zea Species 0.000 claims 13
- 241001672694 Citrus reticulata Species 0.000 claims 8
- 235000003934 Abelmoschus esculentus Nutrition 0.000 claims 4
- 240000004507 Abelmoschus esculentus Species 0.000 claims 4
- QLRRUWXMMVXORS-UHFFFAOYSA-N Augustine Natural products C12=CC=3OCOC=3C=C2CN2C3CC(OC)C4OC4C31CC2 QLRRUWXMMVXORS-UHFFFAOYSA-N 0.000 claims 4
- 241000209763 Avena sativa Species 0.000 claims 4
- 241000167854 Bourreria succulenta Species 0.000 claims 4
- 235000010523 Cicer arietinum Nutrition 0.000 claims 4
- 244000045195 Cicer arietinum Species 0.000 claims 4
- 241000219146 Gossypium Species 0.000 claims 4
- 241000220225 Malus Species 0.000 claims 4
- 241000209056 Secale Species 0.000 claims 4
- 241000219793 Trifolium Species 0.000 claims 4
- 241000219977 Vigna Species 0.000 claims 4
- 235000009754 Vitis X bourquina Nutrition 0.000 claims 4
- 235000012333 Vitis X labruscana Nutrition 0.000 claims 4
- 240000006365 Vitis vinifera Species 0.000 claims 4
- 235000014787 Vitis vinifera Nutrition 0.000 claims 4
- 235000019693 cherries Nutrition 0.000 claims 4
- 244000037671 genetically modified crops Species 0.000 claims 3
- 241000193403 Clostridium Species 0.000 claims 1
- 241000305071 Enterobacterales Species 0.000 claims 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 claims 1
- 235000011868 grain product Nutrition 0.000 claims 1
- 210000004962 mammalian cell Anatomy 0.000 claims 1
- 102000040430 polynucleotide Human genes 0.000 abstract description 87
- 108091033319 polynucleotide Proteins 0.000 abstract description 87
- 239000002157 polynucleotide Substances 0.000 abstract description 87
- 229940097012 bacillus thuringiensis Drugs 0.000 abstract description 82
- 241000193388 Bacillus thuringiensis Species 0.000 abstract description 81
- 240000008042 Zea mays Species 0.000 abstract description 50
- 108091028043 Nucleic acid sequence Proteins 0.000 abstract description 9
- 238000011161 development Methods 0.000 abstract description 3
- 241000489947 Diabrotica virgifera virgifera Species 0.000 abstract description 2
- 241001414826 Lygus Species 0.000 abstract description 2
- 230000001747 exhibiting effect Effects 0.000 abstract 1
- 235000018102 proteins Nutrition 0.000 description 85
- 108010009298 lysylglutamic acid Proteins 0.000 description 60
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 58
- 108010073969 valyllysine Proteins 0.000 description 43
- 108020004414 DNA Proteins 0.000 description 42
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 38
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 36
- 108090000765 processed proteins & peptides Proteins 0.000 description 32
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 28
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 27
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 26
- 108010092854 aspartyllysine Proteins 0.000 description 26
- 108010034529 leucyl-lysine Proteins 0.000 description 26
- 108010071324 Livagen Proteins 0.000 description 24
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 24
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 24
- 239000000463 material Substances 0.000 description 24
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 22
- 108700026244 Open Reading Frames Proteins 0.000 description 21
- HUPLKEHTTQBXSC-YJRXYDGGSA-N Thr-Ser-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUPLKEHTTQBXSC-YJRXYDGGSA-N 0.000 description 21
- 108010003137 tyrosyltyrosine Proteins 0.000 description 21
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 20
- NVEASDQHBRZPSU-BQBZGAKWSA-N Gln-Gln-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O NVEASDQHBRZPSU-BQBZGAKWSA-N 0.000 description 20
- MTEQZJFSEMXXRK-CFMVVWHZSA-N Tyr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N MTEQZJFSEMXXRK-CFMVVWHZSA-N 0.000 description 20
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 20
- 238000012360 testing method Methods 0.000 description 20
- ZOHGLPQGEHSLPD-FXQIFTODSA-N Ser-Gln-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZOHGLPQGEHSLPD-FXQIFTODSA-N 0.000 description 19
- 241000282326 Felis catus Species 0.000 description 18
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 18
- HXSUFWQYLPKEHF-IHRRRGAJSA-N Phe-Asn-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HXSUFWQYLPKEHF-IHRRRGAJSA-N 0.000 description 18
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 18
- 108010013835 arginine glutamate Proteins 0.000 description 18
- 210000001519 tissue Anatomy 0.000 description 18
- 238000010276 construction Methods 0.000 description 17
- 102000004196 processed proteins & peptides Human genes 0.000 description 17
- 239000000523 sample Substances 0.000 description 17
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 16
- QIQABBIDHGQXGA-ZPFDUUQYSA-N Glu-Ile-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QIQABBIDHGQXGA-ZPFDUUQYSA-N 0.000 description 16
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 16
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 16
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 16
- WYEXWKAWMNJKPN-UBHSHLNASA-N Met-Ala-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCSC)N WYEXWKAWMNJKPN-UBHSHLNASA-N 0.000 description 16
- 108010003700 lysyl aspartic acid Proteins 0.000 description 16
- 239000013612 plasmid Substances 0.000 description 16
- GFBLJMHGHAXGNY-ZLUOBGJFSA-N Ala-Asn-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GFBLJMHGHAXGNY-ZLUOBGJFSA-N 0.000 description 15
- JPPLRQVZMZFOSX-UWJYBYFXSA-N Asn-Tyr-Ala Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 JPPLRQVZMZFOSX-UWJYBYFXSA-N 0.000 description 15
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 15
- LDLZOAJRXXBVGF-GMOBBJLQSA-N Asp-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N LDLZOAJRXXBVGF-GMOBBJLQSA-N 0.000 description 15
- 108091026890 Coding region Proteins 0.000 description 15
- SJMJMEWQMBJYPR-DZKIICNBSA-N Gln-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)N)N SJMJMEWQMBJYPR-DZKIICNBSA-N 0.000 description 15
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 15
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 15
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 15
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 15
- YUSRGTQIPCJNHQ-CIUDSAMLSA-N Ser-Arg-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YUSRGTQIPCJNHQ-CIUDSAMLSA-N 0.000 description 15
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 15
- VKMOGXREKGVZAF-QEJZJMRPSA-N Trp-Asp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VKMOGXREKGVZAF-QEJZJMRPSA-N 0.000 description 15
- 108010037850 glycylvaline Proteins 0.000 description 15
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 14
- GSHKMNKPMLXSQW-KBIXCLLPSA-N Ala-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C)N GSHKMNKPMLXSQW-KBIXCLLPSA-N 0.000 description 14
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 14
- SLHOOKXYTYAJGQ-XVYDVKMFSA-N Asp-Ala-His Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 SLHOOKXYTYAJGQ-XVYDVKMFSA-N 0.000 description 14
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 14
- RVMXMLSYBTXCAV-VEVYYDQMSA-N Asp-Pro-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMXMLSYBTXCAV-VEVYYDQMSA-N 0.000 description 14
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 14
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 14
- VSXBYIJUAXPAAL-WDSKDSINSA-N Gln-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O VSXBYIJUAXPAAL-WDSKDSINSA-N 0.000 description 14
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 14
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 14
- SGLXGEDPYJPGIQ-ACRUOGEOSA-N His-Phe-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N SGLXGEDPYJPGIQ-ACRUOGEOSA-N 0.000 description 14
- NBJAAWYRLGCJOF-UGYAYLCHSA-N Ile-Asp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NBJAAWYRLGCJOF-UGYAYLCHSA-N 0.000 description 14
- DURWCDDDAWVPOP-JBDRJPRFSA-N Ile-Cys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N DURWCDDDAWVPOP-JBDRJPRFSA-N 0.000 description 14
- VZSDQFZFTCVEGF-ZEWNOJEFSA-N Ile-Phe-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O VZSDQFZFTCVEGF-ZEWNOJEFSA-N 0.000 description 14
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 14
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 description 14
- HJDZMPFEXINXLO-QPHKQPEJSA-N Ile-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N HJDZMPFEXINXLO-QPHKQPEJSA-N 0.000 description 14
- GNXGAVNTVNOCLL-SIUGBPQLSA-N Ile-Tyr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GNXGAVNTVNOCLL-SIUGBPQLSA-N 0.000 description 14
- RSFGIMMPWAXNML-MNXVOIDGSA-N Leu-Gln-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RSFGIMMPWAXNML-MNXVOIDGSA-N 0.000 description 14
- YUTNOGOMBNYPFH-XUXIUFHCSA-N Leu-Pro-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YUTNOGOMBNYPFH-XUXIUFHCSA-N 0.000 description 14
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 14
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 14
- MGKFCQFVPKOWOL-CIUDSAMLSA-N Lys-Ser-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N MGKFCQFVPKOWOL-CIUDSAMLSA-N 0.000 description 14
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 14
- LMMBAXJRYSXCOQ-ACRUOGEOSA-N Lys-Tyr-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O LMMBAXJRYSXCOQ-ACRUOGEOSA-N 0.000 description 14
- JHDNAOVJJQSMMM-GMOBBJLQSA-N Met-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N JHDNAOVJJQSMMM-GMOBBJLQSA-N 0.000 description 14
- DJPXNKUDJKGQEE-BZSNNMDCSA-N Phe-Asp-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DJPXNKUDJKGQEE-BZSNNMDCSA-N 0.000 description 14
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 14
- SCKXGHWQPPURGT-KKUMJFAQSA-N Phe-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O SCKXGHWQPPURGT-KKUMJFAQSA-N 0.000 description 14
- QBFONMUYNSNKIX-AVGNSLFASA-N Pro-Arg-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QBFONMUYNSNKIX-AVGNSLFASA-N 0.000 description 14
- YPUSXTWURJANKF-KBIXCLLPSA-N Ser-Gln-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YPUSXTWURJANKF-KBIXCLLPSA-N 0.000 description 14
- HVKMTOIAYDOJPL-NRPADANISA-N Ser-Gln-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVKMTOIAYDOJPL-NRPADANISA-N 0.000 description 14
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 14
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 14
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 14
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 14
- QGVBFDIREUUSHX-IFFSRLJSSA-N Thr-Val-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O QGVBFDIREUUSHX-IFFSRLJSSA-N 0.000 description 14
- JLKVWTICWVWGSK-JYJNAYRXSA-N Tyr-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JLKVWTICWVWGSK-JYJNAYRXSA-N 0.000 description 14
- QFXVAFIHVWXXBJ-AVGNSLFASA-N Tyr-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O QFXVAFIHVWXXBJ-AVGNSLFASA-N 0.000 description 14
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 14
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 14
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 14
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 14
- 230000004071 biological effect Effects 0.000 description 14
- 230000000694 effects Effects 0.000 description 14
- 108010025306 histidylleucine Proteins 0.000 description 14
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 14
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 14
- 108010026333 seryl-proline Proteins 0.000 description 14
- 108700012359 toxins Proteins 0.000 description 14
- GORKKVHIBWAQHM-GCJQMDKQSA-N Ala-Asn-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GORKKVHIBWAQHM-GCJQMDKQSA-N 0.000 description 13
- QGWXAMDECCKGRU-XVKPBYJWSA-N Gln-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(N)=O)C(=O)NCC(O)=O QGWXAMDECCKGRU-XVKPBYJWSA-N 0.000 description 13
- HJIFPJUEOGZWRI-GUBZILKMSA-N Glu-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N HJIFPJUEOGZWRI-GUBZILKMSA-N 0.000 description 13
- UJMNFCAHLYKWOZ-DCAQKATOSA-N Glu-Lys-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UJMNFCAHLYKWOZ-DCAQKATOSA-N 0.000 description 13
- BXSZPACYCMNKLS-AVGNSLFASA-N Glu-Ser-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BXSZPACYCMNKLS-AVGNSLFASA-N 0.000 description 13
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 13
- 108020004511 Recombinant DNA Proteins 0.000 description 13
- JTEICXDKGWKRRV-HJGDQZAQSA-N Thr-Asn-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JTEICXDKGWKRRV-HJGDQZAQSA-N 0.000 description 13
- XYFISNXATOERFZ-OSUNSFLBSA-N Thr-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XYFISNXATOERFZ-OSUNSFLBSA-N 0.000 description 13
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 13
- IJUTXXAXQODRMW-KBPBESRZSA-N Tyr-Gly-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O IJUTXXAXQODRMW-KBPBESRZSA-N 0.000 description 13
- 230000003321 amplification Effects 0.000 description 13
- 239000013078 crystal Substances 0.000 description 13
- 238000003199 nucleic acid amplification method Methods 0.000 description 13
- 108020003589 5' Untranslated Regions Proteins 0.000 description 12
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 12
- IHMCQESUJVZTKW-UBHSHLNASA-N Ala-Phe-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 IHMCQESUJVZTKW-UBHSHLNASA-N 0.000 description 12
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 12
- RKQRHMKFNBYOTN-IHRRRGAJSA-N Arg-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N RKQRHMKFNBYOTN-IHRRRGAJSA-N 0.000 description 12
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 12
- TZOZNVLBTAFJRW-UGYAYLCHSA-N Asp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N TZOZNVLBTAFJRW-UGYAYLCHSA-N 0.000 description 12
- WOPJVEMFXYHZEE-SRVKXCTJSA-N Asp-Phe-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WOPJVEMFXYHZEE-SRVKXCTJSA-N 0.000 description 12
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 12
- RSUVOPBMWMTVDI-XEGUGMAKSA-N Glu-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCC(O)=O)C)C(O)=O)=CNC2=C1 RSUVOPBMWMTVDI-XEGUGMAKSA-N 0.000 description 12
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 12
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 12
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 12
- JDUKCSSHWNIQQZ-IHRRRGAJSA-N Glu-Phe-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JDUKCSSHWNIQQZ-IHRRRGAJSA-N 0.000 description 12
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 12
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 12
- RIVKTKFVWXRNSJ-GRLWGSQLSA-N Ile-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RIVKTKFVWXRNSJ-GRLWGSQLSA-N 0.000 description 12
- NFHJQETXTSDZSI-DCAQKATOSA-N Leu-Cys-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NFHJQETXTSDZSI-DCAQKATOSA-N 0.000 description 12
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 12
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 12
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 12
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 12
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 12
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 12
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 12
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 12
- DSZFTPCSFVWMKP-DCAQKATOSA-N Met-Ser-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN DSZFTPCSFVWMKP-DCAQKATOSA-N 0.000 description 12
- FRPVPGRXUKFEQE-YDHLFZDLSA-N Phe-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FRPVPGRXUKFEQE-YDHLFZDLSA-N 0.000 description 12
- JFNPBBOGGNMSRX-CIUDSAMLSA-N Pro-Gln-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O JFNPBBOGGNMSRX-CIUDSAMLSA-N 0.000 description 12
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 12
- WIPAMEKBSHNFQE-IUCAKERBSA-N Pro-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@@H]1CCCN1 WIPAMEKBSHNFQE-IUCAKERBSA-N 0.000 description 12
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 12
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 12
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 12
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 12
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 12
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 12
- PCMDGXKXVMBIFP-VEVYYDQMSA-N Thr-Met-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMDGXKXVMBIFP-VEVYYDQMSA-N 0.000 description 12
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 12
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 12
- PHKQVWWHRYUCJL-HJOGWXRNSA-N Tyr-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PHKQVWWHRYUCJL-HJOGWXRNSA-N 0.000 description 12
- GZWPQZDVTBZVEP-BZSNNMDCSA-N Tyr-Tyr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O GZWPQZDVTBZVEP-BZSNNMDCSA-N 0.000 description 12
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 12
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 12
- 238000004166 bioassay Methods 0.000 description 12
- 108010089804 glycyl-threonine Proteins 0.000 description 12
- 108010050848 glycylleucine Proteins 0.000 description 12
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 description 11
- 239000012634 fragment Substances 0.000 description 11
- 210000002706 plastid Anatomy 0.000 description 11
- 102000007469 Actins Human genes 0.000 description 10
- 108010085238 Actins Proteins 0.000 description 10
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 10
- JWQWPRCDYWNVNM-ACZMJKKPSA-N Asn-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N JWQWPRCDYWNVNM-ACZMJKKPSA-N 0.000 description 10
- PIABYSIYPGLLDQ-XVSYOHENSA-N Asn-Thr-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PIABYSIYPGLLDQ-XVSYOHENSA-N 0.000 description 10
- PKVWNYGXMNWJSI-CIUDSAMLSA-N Gln-Gln-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKVWNYGXMNWJSI-CIUDSAMLSA-N 0.000 description 10
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 10
- AWAYOWOUGVZXOB-BZSNNMDCSA-N Phe-Asn-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 AWAYOWOUGVZXOB-BZSNNMDCSA-N 0.000 description 10
- 108091028664 Ribonucleotide Proteins 0.000 description 10
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 10
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 10
- 239000013604 expression vector Substances 0.000 description 10
- 108010078144 glutaminyl-glycine Proteins 0.000 description 10
- 108010017391 lysylvaline Proteins 0.000 description 10
- 239000000047 product Substances 0.000 description 10
- 239000002336 ribonucleotide Substances 0.000 description 10
- 125000002652 ribonucleotide group Chemical group 0.000 description 10
- 239000005562 Glyphosate Substances 0.000 description 9
- XDDAORKBJWWYJS-UHFFFAOYSA-N glyphosate Chemical compound OC(=O)CNCP(O)(O)=O XDDAORKBJWWYJS-UHFFFAOYSA-N 0.000 description 9
- 229940097068 glyphosate Drugs 0.000 description 9
- 108010038320 lysylphenylalanine Proteins 0.000 description 9
- 239000003550 marker Substances 0.000 description 9
- 238000011160 research Methods 0.000 description 9
- 230000002194 synthesizing effect Effects 0.000 description 9
- SHKGHIFSEAGTNL-DLOVCJGASA-N Ala-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 SHKGHIFSEAGTNL-DLOVCJGASA-N 0.000 description 8
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 8
- FEGOCLZUJUFCHP-CIUDSAMLSA-N Ala-Pro-Gln Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FEGOCLZUJUFCHP-CIUDSAMLSA-N 0.000 description 8
- WZGZDOXCDLLTHE-SYWGBEHUSA-N Ala-Trp-Ile Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 WZGZDOXCDLLTHE-SYWGBEHUSA-N 0.000 description 8
- UFBURHXMKFQVLM-CIUDSAMLSA-N Arg-Glu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UFBURHXMKFQVLM-CIUDSAMLSA-N 0.000 description 8
- NVCIXQYNWYTLDO-IHRRRGAJSA-N Arg-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N NVCIXQYNWYTLDO-IHRRRGAJSA-N 0.000 description 8
- NECWUSYTYSIFNC-DLOVCJGASA-N Asp-Ala-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 NECWUSYTYSIFNC-DLOVCJGASA-N 0.000 description 8
- JDHOJQJMWBKHDB-CIUDSAMLSA-N Asp-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N JDHOJQJMWBKHDB-CIUDSAMLSA-N 0.000 description 8
- GWIJZUVQVDJHDI-AVGNSLFASA-N Asp-Phe-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GWIJZUVQVDJHDI-AVGNSLFASA-N 0.000 description 8
- KVCJEMHFLGVINV-ZLUOBGJFSA-N Cys-Ser-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KVCJEMHFLGVINV-ZLUOBGJFSA-N 0.000 description 8
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 8
- HDUDGCZEOZEFOA-KBIXCLLPSA-N Gln-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HDUDGCZEOZEFOA-KBIXCLLPSA-N 0.000 description 8
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 8
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 8
- HJTSRYLPAYGEEC-SIUGBPQLSA-N Glu-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N HJTSRYLPAYGEEC-SIUGBPQLSA-N 0.000 description 8
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 8
- QIHJTGSVGIPHIW-QSFUFRPTSA-N Ile-Asn-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N QIHJTGSVGIPHIW-QSFUFRPTSA-N 0.000 description 8
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 8
- GECLQMBTZCPAFY-PEFMBERDSA-N Ile-Gln-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GECLQMBTZCPAFY-PEFMBERDSA-N 0.000 description 8
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 8
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 8
- DLEBSGAVWRPTIX-PEDHHIEDSA-N Ile-Val-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)[C@@H](C)CC DLEBSGAVWRPTIX-PEDHHIEDSA-N 0.000 description 8
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 8
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 8
- VKOAHIRLIUESLU-ULQDDVLXSA-N Leu-Arg-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VKOAHIRLIUESLU-ULQDDVLXSA-N 0.000 description 8
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 8
- ZDBMWELMUCLUPL-QEJZJMRPSA-N Leu-Phe-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ZDBMWELMUCLUPL-QEJZJMRPSA-N 0.000 description 8
- NQSFIPWBPXNJII-PMVMPFDFSA-N Lys-Phe-Trp Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 NQSFIPWBPXNJII-PMVMPFDFSA-N 0.000 description 8
- WZVSHTFTCYOFPL-GARJFASQSA-N Lys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N)C(=O)O WZVSHTFTCYOFPL-GARJFASQSA-N 0.000 description 8
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 8
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 8
- IUVYJBMTHARMIP-PCBIJLKTSA-N Phe-Asp-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IUVYJBMTHARMIP-PCBIJLKTSA-N 0.000 description 8
- KIQUCMUULDXTAZ-HJOGWXRNSA-N Phe-Tyr-Tyr Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O KIQUCMUULDXTAZ-HJOGWXRNSA-N 0.000 description 8
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 8
- IBGCFJDLCYTKPW-NAKRPEOUSA-N Pro-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 IBGCFJDLCYTKPW-NAKRPEOUSA-N 0.000 description 8
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 8
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 8
- RCOUFINCYASMDN-GUBZILKMSA-N Ser-Val-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O RCOUFINCYASMDN-GUBZILKMSA-N 0.000 description 8
- KEHKBBUYZWAMHL-DZKIICNBSA-N Tyr-Gln-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O KEHKBBUYZWAMHL-DZKIICNBSA-N 0.000 description 8
- FDKDGFGTHGJKNV-FHWLQOOXSA-N Tyr-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N FDKDGFGTHGJKNV-FHWLQOOXSA-N 0.000 description 8
- IQQYYFPCWKWUHW-YDHLFZDLSA-N Val-Asn-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N IQQYYFPCWKWUHW-YDHLFZDLSA-N 0.000 description 8
- CFSSLXZJEMERJY-NRPADANISA-N Val-Gln-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CFSSLXZJEMERJY-NRPADANISA-N 0.000 description 8
- UZDHNIJRRTUKKC-DLOVCJGASA-N Val-Gln-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UZDHNIJRRTUKKC-DLOVCJGASA-N 0.000 description 8
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 8
- 108010011559 alanylphenylalanine Proteins 0.000 description 8
- 108010038633 aspartylglutamate Proteins 0.000 description 8
- 230000006872 improvement Effects 0.000 description 8
- 108010027338 isoleucylcysteine Proteins 0.000 description 8
- 108010057821 leucylproline Proteins 0.000 description 8
- 108010054155 lysyllysine Proteins 0.000 description 8
- 102000039446 nucleic acids Human genes 0.000 description 8
- 108020004707 nucleic acids Proteins 0.000 description 8
- 150000007523 nucleic acids Chemical class 0.000 description 8
- 108010031719 prolyl-serine Proteins 0.000 description 8
- 230000028070 sporulation Effects 0.000 description 8
- 108010061238 threonyl-glycine Proteins 0.000 description 8
- ZWASIOHRQWRWAS-UGYAYLCHSA-N Asn-Asp-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZWASIOHRQWRWAS-UGYAYLCHSA-N 0.000 description 7
- ZSJFGGSPCCHMNE-LAEOZQHASA-N Asp-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N ZSJFGGSPCCHMNE-LAEOZQHASA-N 0.000 description 7
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 7
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 7
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 7
- ZUPJCJINYQISSN-XUXIUFHCSA-N Ile-Met-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUPJCJINYQISSN-XUXIUFHCSA-N 0.000 description 7
- 108060001084 Luciferase Proteins 0.000 description 7
- 239000005089 Luciferase Substances 0.000 description 7
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 7
- QSKCKTUQPICLSO-AVGNSLFASA-N Pro-Arg-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O QSKCKTUQPICLSO-AVGNSLFASA-N 0.000 description 7
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 7
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 7
- CQNFRKAKGDSJFR-NUMRIWBASA-N Thr-Glu-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CQNFRKAKGDSJFR-NUMRIWBASA-N 0.000 description 7
- DXYWRYQRKPIGGU-BPNCWPANSA-N Tyr-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DXYWRYQRKPIGGU-BPNCWPANSA-N 0.000 description 7
- AEOFMCAKYIQQFY-YDHLFZDLSA-N Tyr-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AEOFMCAKYIQQFY-YDHLFZDLSA-N 0.000 description 7
- JEPNYDRDYNSFIU-QXEWZRGKSA-N Asn-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(N)=O)C(O)=O JEPNYDRDYNSFIU-QXEWZRGKSA-N 0.000 description 6
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 6
- 238000002965 ELISA Methods 0.000 description 6
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 6
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 6
- 241000880493 Leptailurus serval Species 0.000 description 6
- WNJXJJSGUXAIQU-UFYCRDLUSA-N Met-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 WNJXJJSGUXAIQU-UFYCRDLUSA-N 0.000 description 6
- BYAIIACBWBOJCU-URLPEUOOSA-N Phe-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BYAIIACBWBOJCU-URLPEUOOSA-N 0.000 description 6
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 6
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 6
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 6
- 230000006870 function Effects 0.000 description 6
- 108010012581 phenylalanylglutamate Proteins 0.000 description 6
- 230000008521 reorganization Effects 0.000 description 6
- 239000000126 substance Substances 0.000 description 6
- 231100000331 toxic Toxicity 0.000 description 6
- 230000002588 toxic effect Effects 0.000 description 6
- 238000011144 upstream manufacturing Methods 0.000 description 6
- 244000075850 Avena orientalis Species 0.000 description 5
- 101710163595 Chaperone protein DnaK Proteins 0.000 description 5
- 244000299507 Gossypium hirsutum Species 0.000 description 5
- 101710178376 Heat shock 70 kDa protein Proteins 0.000 description 5
- 101710152018 Heat shock cognate 70 kDa protein Proteins 0.000 description 5
- 235000001014 amino acid Nutrition 0.000 description 5
- 238000013459 approach Methods 0.000 description 5
- 230000000875 corresponding effect Effects 0.000 description 5
- 238000004321 preservation Methods 0.000 description 5
- 238000012545 processing Methods 0.000 description 5
- 108091008146 restriction endonucleases Proteins 0.000 description 5
- 238000012216 screening Methods 0.000 description 5
- 239000000725 suspension Substances 0.000 description 5
- 230000014616 translation Effects 0.000 description 5
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 4
- 241000234282 Allium Species 0.000 description 4
- 235000002732 Allium cepa var. cepa Nutrition 0.000 description 4
- 108091093088 Amplicon Proteins 0.000 description 4
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 4
- SKTGPBFTMNLIHQ-KKUMJFAQSA-N Arg-Glu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SKTGPBFTMNLIHQ-KKUMJFAQSA-N 0.000 description 4
- FAEFJTCTNZTPHX-ACZMJKKPSA-N Asn-Gln-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FAEFJTCTNZTPHX-ACZMJKKPSA-N 0.000 description 4
- JEEFEQCRXKPQHC-KKUMJFAQSA-N Asn-Leu-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JEEFEQCRXKPQHC-KKUMJFAQSA-N 0.000 description 4
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 4
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 description 4
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 4
- VNXQRBXEQXLERQ-CIUDSAMLSA-N Asp-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N VNXQRBXEQXLERQ-CIUDSAMLSA-N 0.000 description 4
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 4
- OTKUAVXGMREHRX-CFMVVWHZSA-N Asp-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 OTKUAVXGMREHRX-CFMVVWHZSA-N 0.000 description 4
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 4
- 240000007124 Brassica oleracea Species 0.000 description 4
- 241000675108 Citrus tangerina Species 0.000 description 4
- 108020004705 Codon Proteins 0.000 description 4
- NKCZYEDZTKOFBG-GUBZILKMSA-N Gln-Gln-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NKCZYEDZTKOFBG-GUBZILKMSA-N 0.000 description 4
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 4
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 4
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 4
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 4
- BXDLTKLPPKBVEL-FJXKBIBVSA-N Gly-Thr-Met Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O BXDLTKLPPKBVEL-FJXKBIBVSA-N 0.000 description 4
- LDFWDDVELNOGII-MXAVVETBSA-N His-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N LDFWDDVELNOGII-MXAVVETBSA-N 0.000 description 4
- XVZJRZQIHJMUBG-TUBUOCAGSA-N His-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CN=CN1)N XVZJRZQIHJMUBG-TUBUOCAGSA-N 0.000 description 4
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 4
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 4
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 4
- DSDPLOODKXISDT-XUXIUFHCSA-N Ile-Leu-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DSDPLOODKXISDT-XUXIUFHCSA-N 0.000 description 4
- BATWGBRIZANGPN-ZPFDUUQYSA-N Ile-Pro-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BATWGBRIZANGPN-ZPFDUUQYSA-N 0.000 description 4
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 description 4
- MDVZJYGNAGLPGJ-KKUMJFAQSA-N Leu-Asn-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MDVZJYGNAGLPGJ-KKUMJFAQSA-N 0.000 description 4
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 4
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 4
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 4
- BRSGXFITDXFMFF-IHRRRGAJSA-N Lys-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N BRSGXFITDXFMFF-IHRRRGAJSA-N 0.000 description 4
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 4
- 244000070406 Malus silvestris Species 0.000 description 4
- LQMHZERGCQJKAH-STQMWFEESA-N Met-Gly-Phe Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LQMHZERGCQJKAH-STQMWFEESA-N 0.000 description 4
- 108010006519 Molecular Chaperones Proteins 0.000 description 4
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 4
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 4
- VLZGUAUYZGQKPM-DRZSPHRISA-N Phe-Gln-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VLZGUAUYZGQKPM-DRZSPHRISA-N 0.000 description 4
- KXUZHWXENMYOHC-QEJZJMRPSA-N Phe-Leu-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUZHWXENMYOHC-QEJZJMRPSA-N 0.000 description 4
- 231100000674 Phytotoxicity Toxicity 0.000 description 4
- 244000082988 Secale cereale Species 0.000 description 4
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 4
- 229930006000 Sucrose Natural products 0.000 description 4
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 4
- GKMYGVQDGVYCPC-IUKAMOBKSA-N Thr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H]([C@@H](C)O)N GKMYGVQDGVYCPC-IUKAMOBKSA-N 0.000 description 4
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 4
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 4
- QYDKSNXSBXZPFK-ZJDVBMNYSA-N Thr-Thr-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYDKSNXSBXZPFK-ZJDVBMNYSA-N 0.000 description 4
- DYEGCOJHFNJBKB-UFYCRDLUSA-N Tyr-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 DYEGCOJHFNJBKB-UFYCRDLUSA-N 0.000 description 4
- WVRUKYLYMFGKAN-IHRRRGAJSA-N Tyr-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 WVRUKYLYMFGKAN-IHRRRGAJSA-N 0.000 description 4
- YMUQBRQQCPQEQN-CXTHYWKRSA-N Tyr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YMUQBRQQCPQEQN-CXTHYWKRSA-N 0.000 description 4
- KHCSOLAHNLOXJR-BZSNNMDCSA-N Tyr-Leu-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHCSOLAHNLOXJR-BZSNNMDCSA-N 0.000 description 4
- RMRFSFXLFWWAJZ-HJOGWXRNSA-N Tyr-Tyr-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 RMRFSFXLFWWAJZ-HJOGWXRNSA-N 0.000 description 4
- QHFQQRKNGCXTHL-AUTRQRHGSA-N Val-Gln-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QHFQQRKNGCXTHL-AUTRQRHGSA-N 0.000 description 4
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 4
- MBGFDZDWMDLXHQ-GUBZILKMSA-N Val-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MBGFDZDWMDLXHQ-GUBZILKMSA-N 0.000 description 4
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 4
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 4
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 4
- 230000002159 abnormal effect Effects 0.000 description 4
- 108010070783 alanyltyrosine Proteins 0.000 description 4
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 4
- 230000008901 benefit Effects 0.000 description 4
- 239000003153 chemical reaction reagent Substances 0.000 description 4
- 238000013461 design Methods 0.000 description 4
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 4
- 238000002474 experimental method Methods 0.000 description 4
- 108010049041 glutamylalanine Proteins 0.000 description 4
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 4
- 230000002363 herbicidal effect Effects 0.000 description 4
- 231100000614 poison Toxicity 0.000 description 4
- 230000007096 poisonous effect Effects 0.000 description 4
- 229920001184 polypeptide Polymers 0.000 description 4
- 239000005720 sucrose Substances 0.000 description 4
- 241000589158 Agrobacterium Species 0.000 description 3
- 240000002234 Allium sativum Species 0.000 description 3
- 241000722824 Ardisia crenata Species 0.000 description 3
- QYRMBFWDSFGSFC-OLHMAJIHSA-N Asn-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QYRMBFWDSFGSFC-OLHMAJIHSA-N 0.000 description 3
- PUUPMDXIHCOPJU-HJGDQZAQSA-N Asn-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O PUUPMDXIHCOPJU-HJGDQZAQSA-N 0.000 description 3
- KDFQZBWWPYQBEN-ZLUOBGJFSA-N Asp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N KDFQZBWWPYQBEN-ZLUOBGJFSA-N 0.000 description 3
- 206010003694 Atrophy Diseases 0.000 description 3
- 241000194107 Bacillus megaterium Species 0.000 description 3
- 235000003899 Brassica oleracea var acephala Nutrition 0.000 description 3
- 235000011301 Brassica oleracea var capitata Nutrition 0.000 description 3
- 235000001169 Brassica oleracea var oleracea Nutrition 0.000 description 3
- 241000388393 Caesalpinia violacea Species 0.000 description 3
- 235000010773 Cajanus indicus Nutrition 0.000 description 3
- 244000105627 Cajanus indicus Species 0.000 description 3
- 208000003643 Callosities Diseases 0.000 description 3
- 235000002566 Capsicum Nutrition 0.000 description 3
- 241000723418 Carya Species 0.000 description 3
- 235000009025 Carya illinoensis Nutrition 0.000 description 3
- 244000068645 Carya illinoensis Species 0.000 description 3
- 241000522254 Cassia Species 0.000 description 3
- 108050001186 Chaperonin Cpn60 Proteins 0.000 description 3
- 102000052603 Chaperonins Human genes 0.000 description 3
- 244000183685 Citrus aurantium Species 0.000 description 3
- 235000007716 Citrus aurantium Nutrition 0.000 description 3
- 240000007154 Coffea arabica Species 0.000 description 3
- 235000002787 Coriandrum sativum Nutrition 0.000 description 3
- 244000018436 Coriandrum sativum Species 0.000 description 3
- 240000009226 Corylus americana Species 0.000 description 3
- 235000001543 Corylus americana Nutrition 0.000 description 3
- 235000007466 Corylus avellana Nutrition 0.000 description 3
- 235000007129 Cuminum cyminum Nutrition 0.000 description 3
- 244000304337 Cuminum cyminum Species 0.000 description 3
- 235000003392 Curcuma domestica Nutrition 0.000 description 3
- 244000008991 Curcuma longa Species 0.000 description 3
- 241001057636 Dracaena deremensis Species 0.000 description 3
- 241000588724 Escherichia coli Species 0.000 description 3
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 3
- 241000578422 Graphosoma lineatum Species 0.000 description 3
- 108010004889 Heat-Shock Proteins Proteins 0.000 description 3
- 206010020649 Hyperkeratosis Diseases 0.000 description 3
- 240000007049 Juglans regia Species 0.000 description 3
- 235000009496 Juglans regia Nutrition 0.000 description 3
- 241000255777 Lepidoptera Species 0.000 description 3
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 3
- 241000501345 Lygus lineolaris Species 0.000 description 3
- VEGLGAOVLFODGC-GUBZILKMSA-N Lys-Glu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VEGLGAOVLFODGC-GUBZILKMSA-N 0.000 description 3
- 241000218922 Magnoliophyta Species 0.000 description 3
- 241001465754 Metazoa Species 0.000 description 3
- 239000006002 Pepper Substances 0.000 description 3
- 244000062780 Petroselinum sativum Species 0.000 description 3
- 235000016761 Piper aduncum Nutrition 0.000 description 3
- 240000003889 Piper guineense Species 0.000 description 3
- 235000017804 Piper guineense Nutrition 0.000 description 3
- 235000008184 Piper nigrum Nutrition 0.000 description 3
- 240000000111 Saccharum officinarum Species 0.000 description 3
- 235000005794 Salvia japonica Nutrition 0.000 description 3
- 244000295490 Salvia japonica Species 0.000 description 3
- DKDHTRVDOUZZTP-IFFSRLJSSA-N Thr-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DKDHTRVDOUZZTP-IFFSRLJSSA-N 0.000 description 3
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 3
- PELIQFPESHBTMA-WLTAIBSBSA-N Thr-Tyr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 PELIQFPESHBTMA-WLTAIBSBSA-N 0.000 description 3
- 235000007303 Thymus vulgaris Nutrition 0.000 description 3
- 240000002657 Thymus vulgaris Species 0.000 description 3
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 3
- 244000042314 Vigna unguiculata Species 0.000 description 3
- 150000001413 amino acids Chemical group 0.000 description 3
- 230000037444 atrophy Effects 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 230000000295 complement effect Effects 0.000 description 3
- 235000003373 curcuma longa Nutrition 0.000 description 3
- 235000021438 curry Nutrition 0.000 description 3
- 230000034994 death Effects 0.000 description 3
- 230000029087 digestion Effects 0.000 description 3
- 238000004520 electroporation Methods 0.000 description 3
- 239000002158 endotoxin Substances 0.000 description 3
- 238000011156 evaluation Methods 0.000 description 3
- 238000013467 fragmentation Methods 0.000 description 3
- 238000006062 fragmentation reaction Methods 0.000 description 3
- 235000004611 garlic Nutrition 0.000 description 3
- 108010020688 glycylhistidine Proteins 0.000 description 3
- 230000012010 growth Effects 0.000 description 3
- 239000004009 herbicide Substances 0.000 description 3
- 230000008676 import Effects 0.000 description 3
- 230000005764 inhibitory process Effects 0.000 description 3
- 230000000968 intestinal effect Effects 0.000 description 3
- 238000005259 measurement Methods 0.000 description 3
- 230000001404 mediated effect Effects 0.000 description 3
- 230000000813 microbial effect Effects 0.000 description 3
- 235000011197 perejil Nutrition 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 230000017854 proteolysis Effects 0.000 description 3
- 230000008929 regeneration Effects 0.000 description 3
- 238000011069 regeneration method Methods 0.000 description 3
- 239000013605 shuttle vector Substances 0.000 description 3
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 3
- 241000894007 species Species 0.000 description 3
- 230000001954 sterilising effect Effects 0.000 description 3
- 238000004659 sterilization and disinfection Methods 0.000 description 3
- 230000004083 survival effect Effects 0.000 description 3
- 239000001585 thymus vulgaris Substances 0.000 description 3
- 230000010474 transient expression Effects 0.000 description 3
- 238000013519 translation Methods 0.000 description 3
- 235000013976 turmeric Nutrition 0.000 description 3
- 239000003981 vehicle Substances 0.000 description 3
- 235000020234 walnut Nutrition 0.000 description 3
- 239000002023 wood Substances 0.000 description 3
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 2
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 2
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 2
- KQESEZXHYOUIIM-CQDKDKBSSA-N Ala-Lys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KQESEZXHYOUIIM-CQDKDKBSSA-N 0.000 description 2
- WEZNQZHACPSMEF-QEJZJMRPSA-N Ala-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 WEZNQZHACPSMEF-QEJZJMRPSA-N 0.000 description 2
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 2
- 108010088751 Albumins Proteins 0.000 description 2
- 102000009027 Albumins Human genes 0.000 description 2
- HKRXJBBCQBAGIM-FXQIFTODSA-N Arg-Asp-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N HKRXJBBCQBAGIM-FXQIFTODSA-N 0.000 description 2
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 2
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 2
- MTYLORHAQXVQOW-AVGNSLFASA-N Arg-Lys-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O MTYLORHAQXVQOW-AVGNSLFASA-N 0.000 description 2
- FKQITMVNILRUCQ-IHRRRGAJSA-N Arg-Phe-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O FKQITMVNILRUCQ-IHRRRGAJSA-N 0.000 description 2
- BWMMKQPATDUYKB-IHRRRGAJSA-N Arg-Tyr-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=C(O)C=C1 BWMMKQPATDUYKB-IHRRRGAJSA-N 0.000 description 2
- YYSYDIYQTUPNQQ-SXTJYALSSA-N Asn-Ile-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YYSYDIYQTUPNQQ-SXTJYALSSA-N 0.000 description 2
- FBODFHMLALOPHP-GUBZILKMSA-N Asn-Lys-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O FBODFHMLALOPHP-GUBZILKMSA-N 0.000 description 2
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 2
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 2
- QUMKPKWYDVMGNT-NUMRIWBASA-N Asn-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QUMKPKWYDVMGNT-NUMRIWBASA-N 0.000 description 2
- DPWDPEVGACCWTC-SRVKXCTJSA-N Asn-Tyr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O DPWDPEVGACCWTC-SRVKXCTJSA-N 0.000 description 2
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 2
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 2
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 2
- HJCGDIGVVWETRO-ZPFDUUQYSA-N Asp-Lys-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O)C(O)=O HJCGDIGVVWETRO-ZPFDUUQYSA-N 0.000 description 2
- YWLDTBBUHZJQHW-KKUMJFAQSA-N Asp-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N YWLDTBBUHZJQHW-KKUMJFAQSA-N 0.000 description 2
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 2
- 101100474206 Chlorobium chlorochromatii (strain CaD3) rpsM gene Proteins 0.000 description 2
- CLDCTNHPILWQCW-CIUDSAMLSA-N Cys-Arg-Glu Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N CLDCTNHPILWQCW-CIUDSAMLSA-N 0.000 description 2
- 230000004544 DNA amplification Effects 0.000 description 2
- 241000255925 Diptera Species 0.000 description 2
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
- 102000002322 Egg Proteins Human genes 0.000 description 2
- 108010000912 Egg Proteins Proteins 0.000 description 2
- 241000588921 Enterobacteriaceae Species 0.000 description 2
- KZKBJEUWNMQTLV-XDTLVQLUSA-N Gln-Ala-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZKBJEUWNMQTLV-XDTLVQLUSA-N 0.000 description 2
- IKDOHQHEFPPGJG-FXQIFTODSA-N Gln-Asp-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IKDOHQHEFPPGJG-FXQIFTODSA-N 0.000 description 2
- KDXKFBSNIJYNNR-YVNDNENWSA-N Gln-Glu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDXKFBSNIJYNNR-YVNDNENWSA-N 0.000 description 2
- JHPFPROFOAJRFN-IHRRRGAJSA-N Gln-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O JHPFPROFOAJRFN-IHRRRGAJSA-N 0.000 description 2
- JXFLPKSDLDEOQK-JHEQGTHGSA-N Gln-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O JXFLPKSDLDEOQK-JHEQGTHGSA-N 0.000 description 2
- ARYKRXHBIPLULY-XKBZYTNZSA-N Gln-Thr-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ARYKRXHBIPLULY-XKBZYTNZSA-N 0.000 description 2
- ZZLDMBMFKZFQMU-NRPADANISA-N Gln-Val-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O ZZLDMBMFKZFQMU-NRPADANISA-N 0.000 description 2
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 2
- HNAUFGBKJLTWQE-IFFSRLJSSA-N Gln-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N)O HNAUFGBKJLTWQE-IFFSRLJSSA-N 0.000 description 2
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 2
- BIHMNDPWRUROFZ-JYJNAYRXSA-N Glu-His-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BIHMNDPWRUROFZ-JYJNAYRXSA-N 0.000 description 2
- KRRFFAHEAOCBCQ-SIUGBPQLSA-N Glu-Ile-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KRRFFAHEAOCBCQ-SIUGBPQLSA-N 0.000 description 2
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 2
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 2
- ALOBJFDJTMQQPW-ONGXEEELSA-N Gly-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN ALOBJFDJTMQQPW-ONGXEEELSA-N 0.000 description 2
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 2
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 2
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 2
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 2
- MUGLKCQHTUFLGF-WPRPVWTQSA-N Gly-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)CN MUGLKCQHTUFLGF-WPRPVWTQSA-N 0.000 description 2
- 102000002812 Heat-Shock Proteins Human genes 0.000 description 2
- XGBVLRJLHUVCNK-DCAQKATOSA-N His-Val-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O XGBVLRJLHUVCNK-DCAQKATOSA-N 0.000 description 2
- DPTBVFUDCPINIP-JURCDPSOSA-N Ile-Ala-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DPTBVFUDCPINIP-JURCDPSOSA-N 0.000 description 2
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 2
- KUHFPGIVBOCRMV-MNXVOIDGSA-N Ile-Gln-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N KUHFPGIVBOCRMV-MNXVOIDGSA-N 0.000 description 2
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 2
- NAFIFZNBSPWYOO-RWRJDSDZSA-N Ile-Thr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NAFIFZNBSPWYOO-RWRJDSDZSA-N 0.000 description 2
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 2
- DZMWFIRHFFVBHS-ZEWNOJEFSA-N Ile-Tyr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N DZMWFIRHFFVBHS-ZEWNOJEFSA-N 0.000 description 2
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 2
- 108010065920 Insulin Lispro Proteins 0.000 description 2
- XEEYBQQBJWHFJM-UHFFFAOYSA-N Iron Chemical compound [Fe] XEEYBQQBJWHFJM-UHFFFAOYSA-N 0.000 description 2
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 2
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 2
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 2
- XIRYQRLFHWWWTC-QEJZJMRPSA-N Leu-Ala-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XIRYQRLFHWWWTC-QEJZJMRPSA-N 0.000 description 2
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 2
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 2
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 2
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 2
- IFMPDNRWZZEZSL-SRVKXCTJSA-N Leu-Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O IFMPDNRWZZEZSL-SRVKXCTJSA-N 0.000 description 2
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 2
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 2
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 2
- QONKWXNJRRNTBV-AVGNSLFASA-N Leu-Pro-Met Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)O)N QONKWXNJRRNTBV-AVGNSLFASA-N 0.000 description 2
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 2
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 2
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 2
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 2
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 2
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 2
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 2
- 102000003960 Ligases Human genes 0.000 description 2
- 108090000364 Ligases Proteins 0.000 description 2
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 2
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 2
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 2
- HEWWNLVEWBJBKA-WDCWCFNPSA-N Lys-Gln-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN HEWWNLVEWBJBKA-WDCWCFNPSA-N 0.000 description 2
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 2
- XDPLZVNMYQOFQZ-BJDJZHNGSA-N Lys-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N XDPLZVNMYQOFQZ-BJDJZHNGSA-N 0.000 description 2
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 2
- ODTZHNZPINULEU-KKUMJFAQSA-N Lys-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N ODTZHNZPINULEU-KKUMJFAQSA-N 0.000 description 2
- HDNOQCZWJGGHSS-VEVYYDQMSA-N Met-Asn-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HDNOQCZWJGGHSS-VEVYYDQMSA-N 0.000 description 2
- JCMMNFZUKMMECJ-DCAQKATOSA-N Met-Lys-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JCMMNFZUKMMECJ-DCAQKATOSA-N 0.000 description 2
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 2
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 2
- 108010079364 N-glycylalanine Proteins 0.000 description 2
- 241000364057 Peoria Species 0.000 description 2
- DFEVBOYEUQJGER-JURCDPSOSA-N Phe-Ala-Ile Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O DFEVBOYEUQJGER-JURCDPSOSA-N 0.000 description 2
- WIVCOAKLPICYGY-KKUMJFAQSA-N Phe-Asp-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N WIVCOAKLPICYGY-KKUMJFAQSA-N 0.000 description 2
- LLGTYVHITPVGKR-RYUDHWBXSA-N Phe-Gln-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O LLGTYVHITPVGKR-RYUDHWBXSA-N 0.000 description 2
- JWQWPTLEOFNCGX-AVGNSLFASA-N Phe-Glu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JWQWPTLEOFNCGX-AVGNSLFASA-N 0.000 description 2
- AGTHXWTYCLLYMC-FHWLQOOXSA-N Phe-Tyr-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 AGTHXWTYCLLYMC-FHWLQOOXSA-N 0.000 description 2
- GOUWCZRDTWTODO-YDHLFZDLSA-N Phe-Val-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O GOUWCZRDTWTODO-YDHLFZDLSA-N 0.000 description 2
- VDTYRPWRWRCROL-UFYCRDLUSA-N Phe-Val-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VDTYRPWRWRCROL-UFYCRDLUSA-N 0.000 description 2
- NBIIXXVUZAFLBC-UHFFFAOYSA-N Phosphoric acid Chemical compound OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 description 2
- 241000178953 Photorhabdus sp. Species 0.000 description 2
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 2
- NHDVNAKDACFHPX-GUBZILKMSA-N Pro-Arg-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O NHDVNAKDACFHPX-GUBZILKMSA-N 0.000 description 2
- 241000589516 Pseudomonas Species 0.000 description 2
- 241000382353 Pupa Species 0.000 description 2
- 235000007201 Saccharum officinarum Nutrition 0.000 description 2
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 2
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 2
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 2
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 2
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 2
- KZPRPBLHYMZIMH-MXAVVETBSA-N Ser-Phe-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZPRPBLHYMZIMH-MXAVVETBSA-N 0.000 description 2
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 2
- OSFZCEQJLWCIBG-BZSNNMDCSA-N Ser-Tyr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OSFZCEQJLWCIBG-BZSNNMDCSA-N 0.000 description 2
- 108020004459 Small interfering RNA Proteins 0.000 description 2
- UKBSDLHIKIXJKH-HJGDQZAQSA-N Thr-Arg-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UKBSDLHIKIXJKH-HJGDQZAQSA-N 0.000 description 2
- ZQUKYJOKQBRBCS-GLLZPBPUSA-N Thr-Gln-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O ZQUKYJOKQBRBCS-GLLZPBPUSA-N 0.000 description 2
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 2
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 2
- GFRIEEKFXOVPIR-RHYQMDGZSA-N Thr-Pro-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O GFRIEEKFXOVPIR-RHYQMDGZSA-N 0.000 description 2
- BYSKNUASOAGJSS-NQCBNZPSSA-N Trp-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N BYSKNUASOAGJSS-NQCBNZPSSA-N 0.000 description 2
- PRONOHBTMLNXCZ-BZSNNMDCSA-N Tyr-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PRONOHBTMLNXCZ-BZSNNMDCSA-N 0.000 description 2
- QVYFTFIBKCDHIE-ACRUOGEOSA-N Tyr-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O QVYFTFIBKCDHIE-ACRUOGEOSA-N 0.000 description 2
- GXAZTLJYINLMJL-LAEOZQHASA-N Val-Asn-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GXAZTLJYINLMJL-LAEOZQHASA-N 0.000 description 2
- PVPAOIGJYHVWBT-KKHAAJSZSA-N Val-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N)O PVPAOIGJYHVWBT-KKHAAJSZSA-N 0.000 description 2
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 2
- HURRXSNHCCSJHA-AUTRQRHGSA-N Val-Gln-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HURRXSNHCCSJHA-AUTRQRHGSA-N 0.000 description 2
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 2
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 2
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 2
- ZZGPVSZDZQRJQY-ULQDDVLXSA-N Val-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](Cc1ccccc1)C(O)=O ZZGPVSZDZQRJQY-ULQDDVLXSA-N 0.000 description 2
- CXWJFWAZIVWBOS-XQQFMLRXSA-N Val-Lys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CXWJFWAZIVWBOS-XQQFMLRXSA-N 0.000 description 2
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 2
- MGVYZTPLGXPVQB-CYDGBPFRSA-N Val-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MGVYZTPLGXPVQB-CYDGBPFRSA-N 0.000 description 2
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 2
- YLBNZCJFSVJDRJ-KJEVXHAQSA-N Val-Thr-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O YLBNZCJFSVJDRJ-KJEVXHAQSA-N 0.000 description 2
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 2
- 241000500606 Xenorhabdus sp. Species 0.000 description 2
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 description 2
- 108010005233 alanylglutamic acid Proteins 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 230000001857 anti-mycotic effect Effects 0.000 description 2
- 230000000840 anti-viral effect Effects 0.000 description 2
- 239000002543 antimycotic Substances 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 238000003149 assay kit Methods 0.000 description 2
- 230000003115 biocidal effect Effects 0.000 description 2
- 239000012472 biological sample Substances 0.000 description 2
- 239000007853 buffer solution Substances 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 108010031100 chloroplast transit peptides Proteins 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 235000016213 coffee Nutrition 0.000 description 2
- 235000013353 coffee beverage Nutrition 0.000 description 2
- 230000006378 damage Effects 0.000 description 2
- 230000002939 deleterious effect Effects 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- 238000001035 drying Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 2
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 2
- 238000009396 hybridization Methods 0.000 description 2
- 210000003000 inclusion body Anatomy 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 108010078274 isoleucylvaline Proteins 0.000 description 2
- 108010053037 kyotorphin Proteins 0.000 description 2
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 2
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 2
- 108010012058 leucyltyrosine Proteins 0.000 description 2
- 235000009973 maize Nutrition 0.000 description 2
- 108010005942 methionylglycine Proteins 0.000 description 2
- 238000002156 mixing Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 239000005645 nematicide Substances 0.000 description 2
- 235000016709 nutrition Nutrition 0.000 description 2
- 230000035764 nutrition Effects 0.000 description 2
- 230000000361 pesticidal effect Effects 0.000 description 2
- 108010073101 phenylalanylleucine Proteins 0.000 description 2
- 108010070643 prolylglutamic acid Proteins 0.000 description 2
- 210000001938 protoplast Anatomy 0.000 description 2
- 238000012797 qualification Methods 0.000 description 2
- 230000006641 stabilisation Effects 0.000 description 2
- 238000011105 stabilization Methods 0.000 description 2
- 230000000699 topical effect Effects 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- ZBMRKNMTMPPMMK-UHFFFAOYSA-N 2-amino-4-[hydroxy(methyl)phosphoryl]butanoic acid;azane Chemical compound [NH4+].CP(O)(=O)CCC(N)C([O-])=O ZBMRKNMTMPPMMK-UHFFFAOYSA-N 0.000 description 1
- 108020005065 3' Flanking Region Proteins 0.000 description 1
- 108010020183 3-phosphoshikimate 1-carboxyvinyltransferase Proteins 0.000 description 1
- QCVGEOXPDFCNHA-UHFFFAOYSA-N 5,5-dimethyl-2,4-dioxo-1,3-oxazolidine-3-carboxamide Chemical compound CC1(C)OC(=O)N(C(N)=O)C1=O QCVGEOXPDFCNHA-UHFFFAOYSA-N 0.000 description 1
- 108010013043 Acetylesterase Proteins 0.000 description 1
- 101710197633 Actin-1 Proteins 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- 241001124076 Aphididae Species 0.000 description 1
- 241000219195 Arabidopsis thaliana Species 0.000 description 1
- XFTWUNOVBCHBJR-UHFFFAOYSA-N Aspergillomarasmine A Chemical compound OC(=O)C(N)CNC(C(O)=O)CNC(C(O)=O)CC(O)=O XFTWUNOVBCHBJR-UHFFFAOYSA-N 0.000 description 1
- 108700003918 Bacillus Thuringiensis insecticidal crystal Proteins 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 101000878902 Bacillus thuringiensis Pesticidal crystal protein Cry6Aa Proteins 0.000 description 1
- 101000878906 Bacillus thuringiensis Pesticidal crystal protein Cry6Ba Proteins 0.000 description 1
- 108700031685 Bacillus thuringiensis Vip3A Proteins 0.000 description 1
- 206010004194 Bed bug infestation Diseases 0.000 description 1
- 229920002799 BoPET Polymers 0.000 description 1
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 1
- 241000193417 Brevibacillus laterosporus Species 0.000 description 1
- 241000701489 Cauliflower mosaic virus Species 0.000 description 1
- 108010059892 Cellulase Proteins 0.000 description 1
- 241000931705 Cicada Species 0.000 description 1
- 241001414720 Cicadellidae Species 0.000 description 1
- 241001327638 Cimex lectularius Species 0.000 description 1
- 108020004394 Complementary RNA Proteins 0.000 description 1
- 101710151559 Crystal protein Proteins 0.000 description 1
- 241000258922 Ctenocephalides Species 0.000 description 1
- 241000131094 Cucujidae Species 0.000 description 1
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 1
- 244000000626 Daucus carota Species 0.000 description 1
- 239000005504 Dicamba Substances 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 1
- 108700024394 Exon Proteins 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- 240000004859 Gamochaeta purpurea Species 0.000 description 1
- ZNZPKVQURDQFFS-FXQIFTODSA-N Gln-Glu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZNZPKVQURDQFFS-FXQIFTODSA-N 0.000 description 1
- 108030006517 Glyphosate oxidoreductases Proteins 0.000 description 1
- 235000004341 Gossypium herbaceum Nutrition 0.000 description 1
- 240000002024 Gossypium herbaceum Species 0.000 description 1
- 101000963974 Hydrophis stokesii Alpha-elapitoxin-Ast2b Proteins 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- FBOZXECLQNJBKD-ZDUSSCGKSA-N L-methotrexate Chemical compound C=1N=C2N=C(N)N=C(N)C2=NC=1CN(C)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 FBOZXECLQNJBKD-ZDUSSCGKSA-N 0.000 description 1
- LAGPXKYZCCTSGQ-JYJNAYRXSA-N Leu-Glu-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LAGPXKYZCCTSGQ-JYJNAYRXSA-N 0.000 description 1
- 241001117312 Lygus elisus Species 0.000 description 1
- 229930195725 Mannitol Natural products 0.000 description 1
- JHKXZYLNVJRAAJ-WDSKDSINSA-N Met-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(O)=O JHKXZYLNVJRAAJ-WDSKDSINSA-N 0.000 description 1
- 239000005041 Mylar™ Substances 0.000 description 1
- 101000964025 Naja naja Long neurotoxin 3 Proteins 0.000 description 1
- 101100168995 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) cyt-1 gene Proteins 0.000 description 1
- 101100438748 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) cyt-2 gene Proteins 0.000 description 1
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 241000238814 Orthoptera Species 0.000 description 1
- 101710091688 Patatin Proteins 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 241001674048 Phthiraptera Species 0.000 description 1
- 108010003581 Ribulose-bisphosphate carboxylase Proteins 0.000 description 1
- 241000283984 Rodentia Species 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- QYOJSKGCWNAKGW-PBXRRBTRSA-N Shikimic acid 3-phosphate Natural products O[C@@H]1CC(C(O)=O)=C[C@@H](OP(O)(O)=O)[C@H]1O QYOJSKGCWNAKGW-PBXRRBTRSA-N 0.000 description 1
- 238000002105 Southern blotting Methods 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 241000254107 Tenebrionidae Species 0.000 description 1
- 108091036066 Three prime untranslated region Proteins 0.000 description 1
- 241001105109 Trogossitidae Species 0.000 description 1
- 208000036142 Viral infection Diseases 0.000 description 1
- 235000007244 Zea mays Nutrition 0.000 description 1
- 230000005856 abnormality Effects 0.000 description 1
- -1 acetonyl shikimic acid-3-phosphate Chemical compound 0.000 description 1
- 108020002494 acetyltransferase Proteins 0.000 description 1
- 102000005421 acetyltransferase Human genes 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 229910000147 aluminium phosphate Inorganic materials 0.000 description 1
- 238000012443 analytical study Methods 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- QLULGSLAHXLKSR-UHFFFAOYSA-N azane;phosphane Chemical compound N.P QLULGSLAHXLKSR-UHFFFAOYSA-N 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000000975 bioactive effect Effects 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 229940098773 bovine serum albumin Drugs 0.000 description 1
- 238000009395 breeding Methods 0.000 description 1
- 230000001488 breeding effect Effects 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 229940106157 cellulase Drugs 0.000 description 1
- 229930002868 chlorophyll a Natural products 0.000 description 1
- ATNHDLDRLWWWCB-AENOIHSZSA-M chlorophyll a Chemical compound C1([C@@H](C(=O)OC)C(=O)C2=C3C)=C2N2C3=CC(C(CC)=C3C)=[N+]4C3=CC3=C(C=C)C(C)=C5N3[Mg-2]42[N+]2=C1[C@@H](CCC(=O)OC\C=C(/C)CCC[C@H](C)CCC[C@H](C)CCCC(C)C)[C@H](C)C2=C5 ATNHDLDRLWWWCB-AENOIHSZSA-M 0.000 description 1
- 229930002869 chlorophyll b Natural products 0.000 description 1
- NSMUHPMZFPKNMZ-VBYMZDBQSA-M chlorophyll b Chemical compound C1([C@@H](C(=O)OC)C(=O)C2=C3C)=C2N2C3=CC(C(CC)=C3C=O)=[N+]4C3=CC3=C(C=C)C(C)=C5N3[Mg-2]42[N+]2=C1[C@@H](CCC(=O)OC\C=C(/C)CCC[C@H](C)CCC[C@H](C)CCCC(C)C)[C@H](C)C2=C5 NSMUHPMZFPKNMZ-VBYMZDBQSA-M 0.000 description 1
- 239000013599 cloning vector Substances 0.000 description 1
- 239000003245 coal Substances 0.000 description 1
- 239000011248 coating agent Substances 0.000 description 1
- 238000000576 coating method Methods 0.000 description 1
- 239000000084 colloidal system Substances 0.000 description 1
- 239000003184 complementary RNA Substances 0.000 description 1
- 239000002131 composite material Substances 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 239000013068 control sample Substances 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 238000001816 cooling Methods 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 238000005336 cracking Methods 0.000 description 1
- 231100000135 cytotoxicity Toxicity 0.000 description 1
- 230000003013 cytotoxicity Effects 0.000 description 1
- 238000013016 damping Methods 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 235000005911 diet Nutrition 0.000 description 1
- 230000037213 diet Effects 0.000 description 1
- 230000001079 digestive effect Effects 0.000 description 1
- 230000009365 direct transmission Effects 0.000 description 1
- 235000018927 edible plant Nutrition 0.000 description 1
- 235000014103 egg white Nutrition 0.000 description 1
- 210000000969 egg white Anatomy 0.000 description 1
- 235000013601 eggs Nutrition 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 229940088598 enzyme Drugs 0.000 description 1
- 238000001976 enzyme digestion Methods 0.000 description 1
- 238000010195 expression analysis Methods 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 239000012530 fluid Substances 0.000 description 1
- 235000012055 fruits and vegetables Nutrition 0.000 description 1
- 230000000855 fungicidal effect Effects 0.000 description 1
- 230000000799 fusogenic effect Effects 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 230000035784 germination Effects 0.000 description 1
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 1
- 230000012447 hatching Effects 0.000 description 1
- 239000012535 impurity Substances 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 239000004615 ingredient Substances 0.000 description 1
- 239000002054 inoculum Substances 0.000 description 1
- 239000002917 insecticide Substances 0.000 description 1
- 238000007852 inverse PCR Methods 0.000 description 1
- 229910052742 iron Inorganic materials 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 239000000594 mannitol Substances 0.000 description 1
- 235000010355 mannitol Nutrition 0.000 description 1
- 235000012054 meals Nutrition 0.000 description 1
- QSHDDOUJBYECFT-UHFFFAOYSA-N mercury Chemical compound [Hg] QSHDDOUJBYECFT-UHFFFAOYSA-N 0.000 description 1
- 229910052753 mercury Inorganic materials 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 229960000485 methotrexate Drugs 0.000 description 1
- 101150023613 mev-1 gene Proteins 0.000 description 1
- 108091070501 miRNA Proteins 0.000 description 1
- 239000002679 microRNA Substances 0.000 description 1
- 239000011859 microparticle Substances 0.000 description 1
- 238000005497 microtitration Methods 0.000 description 1
- 230000002438 mitochondrial effect Effects 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 231100000252 nontoxic Toxicity 0.000 description 1
- 230000003000 nontoxic effect Effects 0.000 description 1
- 239000002853 nucleic acid probe Substances 0.000 description 1
- 235000014571 nuts Nutrition 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 210000003463 organelle Anatomy 0.000 description 1
- 210000004681 ovum Anatomy 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 238000012856 packing Methods 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 101150056670 phnO gene Proteins 0.000 description 1
- 230000008654 plant damage Effects 0.000 description 1
- 239000003375 plant hormone Substances 0.000 description 1
- 239000004033 plastic Substances 0.000 description 1
- 229920003023 plastic Polymers 0.000 description 1
- 229920006267 polyester film Polymers 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 230000002787 reinforcement Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 238000007789 sealing Methods 0.000 description 1
- 230000001932 seasonal effect Effects 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 230000010153 self-pollination Effects 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 238000012882 sequential analysis Methods 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 125000006850 spacer group Chemical group 0.000 description 1
- 239000007921 spray Substances 0.000 description 1
- 230000000087 stabilizing effect Effects 0.000 description 1
- 238000003756 stirring Methods 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- 239000013598 vector Substances 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 239000003643 water by type Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8261—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
- C12N15/8271—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance
- C12N15/8279—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for biotic stress resistance, pathogen resistance, disease resistance
- C12N15/8286—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for biotic stress resistance, pathogen resistance, disease resistance for insect resistance
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01N—PRESERVATION OF BODIES OF HUMANS OR ANIMALS OR PLANTS OR PARTS THEREOF; BIOCIDES, e.g. AS DISINFECTANTS, AS PESTICIDES OR AS HERBICIDES; PEST REPELLANTS OR ATTRACTANTS; PLANT GROWTH REGULATORS
- A01N63/00—Biocides, pest repellants or attractants, or plant growth regulators containing microorganisms, viruses, microbial fungi, animals or substances produced by, or obtained from, microorganisms, viruses, microbial fungi or animals, e.g. enzymes or fermentates
- A01N63/50—Isolated enzymes; Isolated proteins
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P43/00—Drugs for specific purposes, not provided for in groups A61P1/00-A61P41/00
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/32—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Bacillus (G)
- C07K14/325—Bacillus thuringiensis crystal peptides, i.e. delta-endotoxins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8216—Methods for controlling, regulating or enhancing expression of transgenes in plant cells
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8242—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
- C12N15/8257—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits for the production of primary gene products, e.g. pharmaceutical products, interferon
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A40/00—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production
- Y02A40/10—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production in agriculture
- Y02A40/146—Genetically Modified [GMO] plants, e.g. transgenic plants
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Wood Science & Technology (AREA)
- Biotechnology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Cell Biology (AREA)
- Medicinal Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Pest Control & Pesticides (AREA)
- Crystallography & Structural Chemistry (AREA)
- Insects & Arthropods (AREA)
- Pharmacology & Pharmacy (AREA)
- Gastroenterology & Hepatology (AREA)
- Virology (AREA)
- Agronomy & Crop Science (AREA)
- Environmental Sciences (AREA)
- Dentistry (AREA)
- General Chemical & Material Sciences (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Animal Behavior & Ethology (AREA)
- Veterinary Medicine (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Public Health (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Agricultural Chemicals And Associated Chemicals (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
Abstract
本发明提供编码苏云金芽孢杆菌(Bacillus thuringiensis)的ET37、TIC810和TIC812蛋白的分离的多核苷酸序列,以及用于在植物中表达TIC809、ET37、TIC810和TIC812以及这些蛋白的多种杀虫有效组合的融合体如TIC127的核苷酸序列。公开了在转基因植物细胞和转基因植物的开发中制备和使用所述多核苷酸序列和蛋白的方法,所述转基因植物细胞和转基因植物针对以下害虫具有改善的抗虫性:(1)鞘翅目昆虫,包括西方玉米根虫(Diabrotica virgifera)、南方玉米根虫(Diabrotica undecempunctata)、北方玉米根虫(Diabrotica barbed)、墨西哥玉米根虫(Diabrotica virgifera zeae)、巴西玉米根虫(Diabroticabalteata)和巴西玉米根虫复合群(Diabrotica viridula和南美叶甲(Diabrotica speciosa)),和(2)半翅目昆虫,例如草盲蝽。
Description
发明背景
本发明总的来讲涉及植物分子生物学领域,更具体地讲,本发明涉及新的多核苷酸序列和由这些序列编码的蛋白,这些序列来源于苏云金芽孢杆菌(Bacillus thuringiensis)并且编码ET29、TIC809、ET37、TIC810和TIC812蛋白,这些蛋白对鞘翅目和该总目内被称为半翅目的昆虫有毒性。鞘翅目毒性蛋白包括ET29、TIC809(ET29的氨基酸序列变体)和ET37(ET29的同源物)。本文还提供TIC810和TIC812以及编码这些蛋白的核苷酸序列。当TIC810或TIC812与ET29、TIC809或ET37组合在一起时,提供杀虫组合物,与仅单独的ET29、TIC809或ET37的表现相比,该杀虫组合物表现出针对鞘翅目令人惊奇的更高效力,这两种蛋白的组合(TIC810与ET29、TIC809或ET37的组合,或者TIC812与ET29、TIC809或ET37的组合)令人惊奇地提供半翅目昆虫毒性组合物,尤其是在例如豆荚盲蝽(western tarnishedplant bug,WTPB)等物种的食物中提供时。还公开了在抵抗鞘翅目和半翅目昆虫侵袭的转基因植物和植物细胞的开发中制备和使用编码这些蛋白和相关蛋白的多核苷酸的方法。
用于防治或根除昆虫侵袭的环境敏感的方法和组合物在许多情况下是合乎需要的,因为商业用途的作物经常是昆虫攻击的目标,尤其是来自鞘翅目和鳞翅目害虫的攻击的目标。这对设法使用环境友好的方法和组合物防治昆虫种群的农民、园丁、栽培者以及商业区和居民区尤其如此。防治或根除半翅目昆虫对作物的侵袭还具有商业上的重要性,且重要性逐渐增加,因为用于鞘翅目和鳞翅目害虫防治方法的生物技术方法变得更广泛可用,尤其是因为很少有产生广谱杀虫活性的化学杀虫应用被利用。
长期以来,人们就已认识到细菌-苏云金芽孢杆菌的杀虫特性。众所周知,苏云金芽孢杆菌产生蛋白质伴孢晶体,即δ-内毒素,对多种鳞翅目、鞘翅目和双翅目幼虫具有特异性毒性(English等,美国专利第6,063,597号)。含产生杀虫蛋白的苏云金芽孢杆菌菌株的组合物已在商业上用作环境上可接受的杀虫剂,因为它们对特定目标昆虫表现出毒性,而对植物、动物和其它非目标生物没有表现出毒性。
业已分离和表征了250多种不同的δ-内毒素。这些δ-内毒素当中某些的编码序列已被用于构建基因工程苏云金芽孢杆菌产品,其中表达一种或多种杀虫蛋白,所述杀虫蛋白表现出对目标害虫的特异性杀虫活性,已被批准作为局部应用的杀虫组合物用于农业用途。表达一种或多种Bt杀虫δ-内毒素蛋白并且这些蛋白能用于防治特定类别中的一种或多种昆虫如鳞翅目或鞘翅目害虫的转基因植物已被批准商业化并已获得成功。然而,存在着以这些转基因植物为食的目标害虫种群将对该植物产生的一种或多种毒素出现抗性的风险,所以仍有需要鉴定可单独使用或与通过不同作用方式表现出它们的毒性作用的其它蛋白一起使用的新的杀虫蛋白。新的杀虫组合物是产生表达一种或多种对相同昆虫有毒性的苏云金芽孢杆菌杀虫蛋白的转基因植物、提供管理抗性的方法以及延迟或消除任何特定敏感昆虫对转基因植物中表达的一种或多种杀虫物质的任一种的抗性发展所需要的。
大多数Bt毒素对鳞翅目有毒性。几乎没有表现出对鞘翅目有效,与未表现出宿主范围特异性的溶细胞毒素不同,还没有表明Bt毒素对鳞翅目或鞘翅目和半翅目害虫表现出杀虫活性。因此,有需要鉴定新的鞘翅目和/或半翅目特异性杀虫组合物,以及防治鞘翅目和半翅目昆虫科成员侵袭的方法,所述成员对于鞘翅目而言具体地为叶甲科(Chrysomelidae)成员,更具体地为叶甲科(Chrysomelidae)叶甲属(Diabrotica),可包括来自叶甲属(Diabrotica)的那些,包括西方玉米根虫(Diabrotica virgifera)(western corn rootworm,WCR)、南方玉米根虫(Diabrotica undecempunctata)(southern corn rootworm,SCR)、北方玉米根虫(Diabrotica barberi)(Northern Corn Rootworm,NCR)、墨西哥玉米根虫(Diabrotica virgifera zeae)(Mexican Corn Rootworm,MCR)、巴西玉米根虫(Diabrotica balteata)(Brazilian Corn Rootworm,BZR)以及由Diabrotica viridula和南美叶甲(Diabrotica speciosa)组成的巴西玉米根虫复合群(Brazilian Corn Rootworm complex,BCR),所述成员具体地为半翅总目成员,包括异翅亚目和同翅亚目中的任何害虫,其中异翅亚目包括通常被称为蝽象、草盲蝽(包括豆荚盲蝽(Lygus Hesperus)、美国牧草盲蝽(Lygus lineolorus)和豆荚灰盲蝽(Lygus elisus))、猎蝽、臭虫和花蝽的昆虫,同翅亚目包括通常被称为蝉、蚜虫、叶蝉、介壳虫和粉虱的昆虫。
发明概述
本发明提供从苏云金芽孢杆菌分离的并且具有抗鞘翅目和半翅目害虫的杀虫活性的多肽组合物,并提供编码这些多肽的核苷酸序列。本发明通过在植物中共表达至少两种苏云金芽孢杆菌蛋白(或为独立蛋白或为两种蛋白的融合蛋白),产生令人惊奇的高水平的杀虫蛋白累积,用于提供对目标鞘翅目和半翅目害虫的有效防治,提供对鞘翅目和半翅目昆虫侵袭的防治,以及提供鞘翅目和半翅目昆虫侵袭的抗虫性管理的改善。另外,提供增加苏云金芽孢杆菌杀虫蛋白或其变体在植物体中(in planta)的累积水平的方法,该方法还为表达这些蛋白的转基因植物提供没有异常植物形态的额外益处。
在实现前述内容时,提供如SEQ ID NO:1所示的多核苷酸分子,该分子是从苏云金芽孢杆菌菌株EG5078分离出来的。该多核苷酸分子编码如SEQ ID NO:2所示的杀虫蛋白,在本文被称为ET37,具有鞘翅目害虫抑制性生物活性。
还提供如SEQ ID NO:3所示的多核苷酸分子,该分子从苏云金芽孢杆菌菌株EG4096分离出来,并编码如SEQ ID NO:4所示的TIC810氨基酸序列。
提供如SEQ ID NO:5所示的又一个多核苷酸序列,其编码如SEQID NO:6所示的TIC8 12氨基酸序列,并从苏云金芽孢杆菌菌株EG5078分离出来。TIC812与TIC810基本相同。
本文具体地提出了编码杀虫蛋白或其杀虫片段的分离多核苷酸分子,与示于SEQ ID NO:4和SEQ ID NO:6的多肽序列具有至少约70%至约99%或以上的序列相似性,或其间的任何百分率。杀虫蛋白或杀虫片段由来源于苏云金芽孢杆菌的分离多核苷酸分子编码,并包含与选自SEQ ID NO:3和SEQ ID NO:5的多核苷酸序列或其互补序列具有至少约70%、约80%、约90%或约99%或以上的序列同一性或其间的任何百分率的多核苷酸序列。
在又一个实施方案中,提供用于在植物中实现改良的杀虫蛋白表达的多核苷酸分子。杀虫蛋白优选具有防治鞘翅目或半翅目害虫或这二者的生物活性,但可对非鞘翅目或半翅目害虫的害虫有活性,并可由为在植物细胞中表达而改造的核苷酸序列编码,所述核苷酸序列例如示于SEQ ID NO:15(TIC810)或SEQ ID NO:19(ET37)。
为本文提供其它多核苷酸序列,用于获得表达本发明的一种或多种蛋白的稳定转化的植物细胞。这样的核苷酸序列包括但不限于SEQID NO:13、SEQ ID NO:15、SEQ ID NO:17、SEQ ID NO:19、SEQ IDNO:28、SEQ ID NO:31、SEQ ID NO:34、SEQ ID NO:36、SEQ ID NO:38、SEQ ID NO:40、SEQ ID NO:43和SEQ ID NO:46。
提供寡核苷酸序列,用于鉴定其它细菌、尤其是其它芽孢杆菌细菌菌株(包括苏云金芽孢杆菌(Bacillus thuringiensis)和侧孢芽孢杆菌(Bacillus laterosperous)和杀虫芽孢杆菌(Bacillus entomocidus))中的相关核苷酸序列。
提供以SEQ ID NO:2、SEQ ID NO:4、SEQ ID NO:6、SEQ ID NO:8、SEQ ID NO:14和SEQ ID NO:47(TIC127)的氨基酸序列例举的杀虫蛋白。在具体的实施方案中,提供SEQ ID NO:4(TIC810)和SEQ ID NO:6(TIC812),作为辅助蛋白、伴侣蛋白或另外稳定和增强连同这些辅助蛋白一起同时表达的第二种蛋白的表达和累积的蛋白。TIC809、ET29和ET37在各自均连同TIC810或TIC812氨基酸序列一起同时表达时,全部是具有较高稳定性水平并由此具有改善的累积水平的杀虫蛋白。这些序列可在相同的亚细胞区室中一起表达,即二者均在胞质中或二者均被靶向插入到植物细胞的叶绿体或质体中并累积,或者同时表达,以在不同的亚细胞区室中累积,例如TIC810在胞质中,ET29或TIC809被靶向插入到植物叶绿体或质体中并累积。另外,TIC809、ET29或ET37蛋白中的任一种连同TIC810或TIC812蛋白中的任一种的组合产生组合物,该组合物令人惊奇地对两种蛋白组分在植物细胞中的高水平表达和累积稳定,并具有涉及防治鞘翅目和半翅目中的植物害虫的杀虫活性。所述蛋白组合还可以作为肽融合体表达,例如SEQ ID NO:47。
在其它实施方案中,本发明涉及选自EG5078和EG4096的苏云金芽孢杆菌野生型菌株的生物学上纯的培养物,由EG5078分离示于SEQ ID NO:1的编码ET37蛋白的多核苷酸序列和示于SEQ ID NO:5的编码TIC812蛋白的多核苷酸序列,由EG4096分离编码TIC810蛋白的SEQ ID NO:3和编码ET29蛋白的SEQ ID NO:7。EG4096和EG5078已依照国际承认用于专利程序的微生物保藏的布达佩斯条约保藏于美国农业研究机构保藏中心(Agricultural Research ServiceCenter Collection,NRRL)的北方地区研究实验室(Northern RegionalResearch Laboratory,USDA,1815 North University Street,Peoria,IL61604),已登记的保藏号分别为NRRL B-21582和NRRL B-30841。EG4096的保藏日为1996年5月30日,EG5087的保藏日为2005年5月3日。
本发明还涉及用于在植物或植物细胞中表达的重组DNA构建体,其包含同时表达第一多核苷酸序列和第二多核苷酸序列的双基因表达盒,其中第一多核苷酸序列编码的多肽序列选自ET29、TIC809和ET37或其杀虫活性片段,其中第二多核苷酸序列编码的多肽序列选自TIC810和TIC812及其杀虫活性片段。第一多核苷酸序列选自SEQ ID NO:13和SEQ ID NO:17,第二多核苷酸序列选自SEQ IDNO:15和SEQ ID NO:19,其中第二多核苷酸序列与第一多核苷酸序列共表达,以增强或改善第一多核苷酸序列的表达,促进涉及防治鞘翅目和半翅目植物害虫侵袭的杀虫活性。
本发明还涉及被转化以包含本文公开的本发明表达载体或重组DNA构建体的宿主细胞。宿主细胞可选自细菌细胞、真菌细胞和植物细胞。在实施方案的一个方面,宿主细胞为细菌细胞,例如被转化以包含本发明表达载体的苏云金芽孢杆菌。在实施方案的另一方面,宿主细胞为被转化以包含本发明的重组DNA构建体的转基因植物细胞。重组DNA构建体可包含同时编码4种蛋白中的2种的组合的多核苷酸序列,这4种蛋白的多肽序列见SEQ ID NO:14(TIC809)、SEQ IDNO:16(TIC810)、SEQ ID NO:18(ET37)和SEQ ID NO:20(TIC812)。优选蛋白组合可包括TIC809和TIC810、TIC809和TIC812、ET37和TIC810以及ET37和TIC812,其中编码TIC810或TIC812的多核苷酸序列和编码TIC809(或ET29)或ET37的多核苷酸序列的共表达将(a)导致植物细胞中TIC809(或ET29)或ET37蛋白的累积增加,(b)产生正常的细胞生长,(c)产生由宿主植物细胞再生的转基因植物,(d)产生正常表型,和(e)导致鞘翅目和半翅目昆虫抗性水平增加。
本发明的转基因植物细胞可包含玉米植物细胞、小麦植物细胞、黑麦植物细胞、大麦植物细胞、燕麦植物细胞、荞麦植物细胞、高粱植物细胞、水稻植物细胞、甘蔗植物细胞、木豆植物细胞、花生植物细胞、洋葱植物细胞、大蒜植物细胞、草植物细胞(包括翦股颖、牛毛草、雀麦草、梯牧草、鸭茅、百慕大草、结缕草等)、拟南芥植物细胞、椰菜植物细胞、向日葵植物细胞、油菜植物细胞、豌豆植物细胞、豇豆植物细胞、菜豆植物细胞、咖啡植物细胞、大豆植物细胞、棉花植物细胞、亚麻子植物细胞、花椰菜植物细胞、芦笋植物细胞、莴苣植物细胞、甘蓝植物细胞、烟草植物细胞、香料植物细胞(包括咖哩、芥菜、鼠尾草、欧芹、胡椒、百里香、芫荽、月桂、孜然芹、姜黄、肉豆蔻、肉桂等)、糖甜菜植物细胞、马铃薯植物细胞、甘薯植物细胞、胡萝卜植物细胞、芜菁植物细胞、芹菜植物细胞、番茄植物细胞、茄子细胞、黄瓜植物细胞、南瓜或香瓜植物细胞等,果树植物细胞(包括苹果、杏、桃、梨、李、橘、柠檬、酸橙等)、坚果树植物细胞(包括橡树、山核桃木、巴西木、美洲山核桃木、胡桃木、榛木等)、葡萄植物细胞、浆果植物细胞(包括黑莓、蓝莓、草莓、酸果蔓等)和开花植物细胞。
在另一个实施方案中,本发明涉及被转化以包含本文公开的重组DNA构建体的转基因植物。转基因植物可由本发明的转基因植物细胞再生,或者可来自于由再生的转基因植物或其后代获得的转基因种子。转基因植物选自单子叶植物和双子叶植物,可包括诸如玉米、小麦、黑麦、大麦、燕麦、荞麦、高粱、水稻、甘蔗、洋葱、大蒜、草等单子叶植物,或诸如向日葵、油菜、豌豆、豇豆、木豆、菜豆、大豆、咖啡、椰菜、棉花、亚麻子、花椰菜、拟南芥、芦笋、莴苣、烟草、香料植物(包括咖哩、芥菜、鼠尾草、欧芹、胡椒、百里香、芫荽、月桂、孜然芹、姜黄、肉豆蔻、肉桂等)、糖甜菜、马铃薯、甘薯、胡萝卜、芜菁、芹菜、番茄、茄子、黄瓜、南瓜或香瓜,果树植物(包括苹果、杏、桃、梨、李、橘、柠檬、酸橙等)、浆果植物(包括黑莓、蓝莓、草莓、酸果蔓等)、坚果树植物(包括橡树、山核桃木、巴西木、美洲山核桃木、胡桃木、榛木等)、葡萄植物和开花植物等双子叶植物。
在另一个实施方案中,本发明涉及来自被转化以包含重组DNA构建体的转基因植物的转基因种子。转基因种子可来自于由本发明的转基因植物细胞再生的转基因植物,或者可来自于再生的转基因植物的子代,或者来自于由于转基因植物与非转基因植物杂交或育种而产生的杂种。在一个方面,转基因种子可被覆种皮,其中种皮包括除草组合物、杀真菌种皮、杀细菌种皮、杀虫种皮、植物激素种皮、营养物种皮、微生物接种物种皮、有色种皮、驱禽种皮、驱啮齿动物种皮、杀虫蛋白种皮、含杀虫蛋白的细菌种皮、单链RNA种皮、双链RNA种皮、小RNA种皮或小干扰RNA种皮。一种使种皮包含这些单链或双链RNA组合物并稳定化的方法是将这些RNA分子和互补RNA分子组合起来,使得稳定的DNA-RNA分子杂种存在于种皮组合物中,该种皮组合物能将dsRNA或单链RNA提供给以种子为食的害虫,或在包衣种子萌发时提供至发芽种子带中的微环境或生出的发芽枝条的小根。
按照本发明的一个实施方案,提供产生抗鞘翅目和/或半翅目昆虫侵袭的植物的方法,该方法包括以下步骤:
a)将用于在植物中编码第一种蛋白的第一种核酸分子和用于在植物中编码第二种蛋白的第二种核酸分子插入到植物细胞的基因组中,所述第一种蛋白选自SEQ ID NO:14(TIC809)和SEQ ID NO:18(ET37),所述第二种蛋白选自SEQ ID NO:16(TIC810)和SEQ IDNO:20(TIC812);
b)获得含步骤(a)的核酸分子的植物细胞;和
c)由植物细胞产生表达两种蛋白的转基因植物,其中转基因植物与没有所述分子的植物相比具有鞘翅目和/或半翅目害虫抗性。
在另一个实施方案中,本发明还提供用于防治植物的鞘翅目和/或半翅目害虫侵袭的方法,该方法包括在害虫食物中提供表达TIC809、ET2或ET37蛋白连同TIC810或TIC812蛋白的植物、植物组织或植物细胞。
本文公开的多核苷酸和多肽组合物和方法在抗鞘翅目和半翅目害虫使用时将获得特别的益处,所述害虫选自由叶甲科(Chrysomelidae)、扁甲科(Cucujidae)、金龟子科(Scarabaeidae)、谷盗科(Trogositidae)、拟步甲科(Tenebrionidae)、象虫科(Curculionidae)、叩甲科(Elateridae)和豆象科(Bruchidae)组成的鞘翅目(Coleopteran),以及选自半翅目成员,具体包括异翅亚目和同翅亚目成员。在本发明的一个方面,鞘翅目昆虫来自叶甲科。示例性的鞘翅目叶甲科昆虫可包括叶甲属(Diabrotica)的那些昆虫,包括西方玉米根虫(D.virgfera)(WCR)、南方玉米根虫(D.undecempunctata)(SCR)、北方玉米根虫(D.barberi)(NCR)、墨西哥玉米根虫(D.virgifera zeae)(MCR)、巴西玉米根虫(D.balteata)(BZR)以及由Diabrotica viridula和南美叶甲(D.speciosa)组成的巴西玉米根虫复合群(BCR)。
可构建核酸序列分子,以掺入编码第三种物质(dsRNA或蛋白)的第三种结构基因序列,作为手段用于在同一植物中提供具有涉及防治一种以上植物害虫的活性(例如具有鞘翅目和/或半翅目昆虫防治)的额外农艺性状以及抗鳞翅目昆虫、抗细菌、抗病毒或抗真菌侵袭、抗线虫的额外性状,或用于提供诸如抗除草剂性状、产量性状、胁迫性状、进食增强或导致进食加工增强的性状等的补充性状。该方法可由以下步骤组成:将编码选自SEQ ID NO:14(TIC809)和SEQ ID NO:18(ET37)的蛋白的第一种核酸分子插入到植物细胞基因组中。将第一种核酸分子接上编码选自SEQ ID NO:16(TIC810)和SEQ ID NO:20(TIC812)的蛋白的第二种核酸分子。导入到植物基因组中的第三种核酸分子编码的物质提供与第一种和第二种核酸分子提供的农艺性状不同的农艺性状,包括但不限于抗鳞翅目昆虫、抗细菌、抗病毒或抗真菌侵袭、抗线虫,或提供补充性状,例如抗除草剂性状、产量性状、胁迫性状、进食增强或导致进食加工增强的性状等。
除了在用于防治鞘翅目或半翅目害虫侵袭的组合物中提供本发明的蛋白以外,还具体提出了转录核糖核苷酸(RNA)分子的多核苷酸序列,所述分子在被无脊椎动物-害虫摄入时通过抑制害虫中的生物功能起到控制无脊椎动物-害虫侵袭的作用,作为组合物的第二种作用模式或抗虫性管理特征。RNA分子可包含dsRNA分子、siRNA分子、miRNA分子或ssRNA分子,并应当特异性抑制目标害虫的必需基因,所述目标害虫例如为本发明组合物所靶向的害虫。
本发明公开的组合物和方法提供了许多超越现有技术的优势,包括以上具体概述的那些。这些优势可包括:获得对敏感害虫(不仅仅包括侵袭植物的那些害虫)的改善的防治措施,获得大量商业上可行的抗虫植物系;获得抗昆虫病原体的季节性长期保护作用;以及增加形态学上正常的转化植物的发生率。
序列表描述
SEQ ID NO:1是编码ET37杀虫蛋白的苏云金芽孢杆菌多核苷酸序列。
SEQ ID NO:2是由示于SEQ ID NO:1的多核苷酸序列编码的ET37氨基酸序列。
SEQ ID NO:3是编码TIC810杀虫蛋白的多核苷酸序列。
SEQ ID NO:4是由示于SEQ ID NO:3的多核苷酸序列编码的TIC810氨基酸序列。
SEQ ID NO:5是编码TIC812蛋白的苏云金芽孢杆菌多核苷酸序列。
SEQ ID NO:6是由示于SEQ ID NO:5的多核苷酸序列编码的TIC812氨基酸序列。
SEQ ID NO:7是编码ET29蛋白的苏云金芽孢杆菌多核苷酸序列。
SEQ ID NO:8是由示于SEQ ID NO:7的多核苷酸序列编码的ET29氨基酸序列。
SEQ ID NO:9是编码TIC810的核苷酸1位至核苷酸657位并编码ET29的核苷酸716-1411位的苏云金芽孢杆菌多核苷酸序列。
SEQ ID NO:10是编码TIC812的核苷酸1-657位并编码ET37的核苷酸716-1411位的苏云金芽孢杆菌多核苷酸序列。
SEQ ID NO:11是编码TIC810的核苷酸1位至核苷酸657位并编码ET37的核苷酸716-1411位的多核苷酸序列。
SEQ ID NO:12是编码TIC812的核苷酸1-657位并编码ET29的核苷酸716-1411位的多核苷酸序列。
SEQ ID NO:13是构建用于在植物细胞中表达的、编码TIC809蛋白的多核苷酸序列。
SEQ ID NO:1 4是由示于SEQ ID NO:13的多核苷酸序列编码的TIC809氨基酸序列。
SEQ ID NO:15是构建用于在植物细胞中表达的、编码TIC810蛋白的多核苷酸序列。
SEQ ID NO:16是由示于SEQ ID NO:15的多核苷酸序列编码的TIC810氨基酸序列。
SEQ ID NO:17代表构建用于在植物细胞中表达的、编码ET37蛋白的多核苷酸序列。
SEQ ID NO:18是由示于SEQ ID NO:19的多核苷酸序列编码的ET37氨基酸序列。
SEQ ID NO:19是构建用于在植物细胞中表达的、编码TIC812蛋白的多核苷酸序列。
SEQ ID NO:20是由示于SEQ ID NO:17的多核苷酸序列编码的TIC812氨基酸序列。
SEQ ID NO:21是用于扩增编码TIC810氨基酸序列的核苷酸序列的热扩增引物,在本文被称为pr370。
SEQ ID NO:22是用于扩增编码TIC810氨基酸序列的核苷酸序列的热扩增引物,在本文被称为pr371。
SEQ ID NO:23是用于扩增编码TIC810氨基酸序列的核苷酸序列的热扩增引物,在本文被称为pr375。
SEQ ID NO:24是用于扩增编码TIC810氨基酸序列的核苷酸序列的热扩增引物,在本文被称为pr376。
SEQ ID NO:25是用于扩增编码ET29氨基酸序列的核苷酸序列的热扩增引物,在本文被称为pr365。
SEQ ID NO:26是用于扩增编码ET29氨基酸序列的核苷酸序列的热扩增引物,在本文被称为pr372。
SEQ ID NO:27是用于扩增编码TIC810_ET29或TIC812_ET37操纵子序列的核苷酸序列的热扩增引物,在本文被称为pr421。
SEQ ID NO:28是存在于转化载体pMON64138中的合成核苷酸序列,由编码TIC809蛋白的第一植物表达盒和编码TIC810蛋白的第二植物表达盒组成。
SEQ ID NO:29是由示于SEQ ID NO:28的第一植物表达盒编码的TIC809氨基酸序列。
SEQ ID NO:30是由示于SEQ ID NO:28的第二植物表达盒编码的TIC810氨基酸序列。
SEQ ID NO:31是存在于pMON64139中的合成核苷酸序列,由编码靶向叶绿体的TIC809的第一植物表达盒和编码靶向叶绿体的TIC810的第二植物表达盒组成。
SEQ ID NO:32是由示于SEQ ID NO:31的第一植物表达盒编码的TIC809氨基酸序列。
SEQ ID NO:33是由示于SEQ ID NO:31的第二植物表达盒编码的TIC810氨基酸序列。
SEQ ID NO:34是存在于pMON70513中的合成核苷酸序列,由编码TIC809氨基酸序列的植物表达盒组成。
SEQ ID NO:35是由示于SEQ ID NO:34的植物表达盒编码的TIC809氨基酸序列。
SEQ ID NO:36是存在于pMON70514中的合成核苷酸序列,由编码靶向叶绿体的TIC809氨基酸序列的植物表达盒组成。
SEQ ID NO:37是由示于SEQ ID NO:36的植物表达盒编码的TIC809氨基酸序列。
SEQ ID NO:38是存在于pMON64144中的合成核苷酸序列,由编码靶向叶绿体的TIC809氨基酸序列的植物表达盒组成。
SEQ ID NO:39是由示于SEQ ID NO:38的植物表达盒编码的TIC809氨基酸序列。
SEQ ID NO:40是存在于pMON64150中的合成核苷酸序列,由编码靶向叶绿体的TIC809氨基酸序列的第一植物表达盒和编码靶向叶绿体的TIC810氨基酸序列的第二植物表达盒组成。
SEQ ID NO:41是由示于SEQ ID NO:40的第一植物表达盒编码的TIC809氨基酸序列。
SEQ ID NO:42是由示于SEQ ID NO:40的第二植物表达盒编码的TIC810氨基酸序列。
SEQ ID NO:43是存在于pMON64151中的合成核苷酸序列,由编码TIC809氨基酸序列的第一植物表达盒和编码TIC810氨基酸序列的第二植物表达盒组成。
SEQ ID NO:44是由示于SEQ ID NO:43的第一植物表达盒编码的TIC809氨基酸序列。
SEQ ID NO:45是由示于SEQ ID NO:43的第二植物表达盒编码的TIC810氨基酸序列。
SEQ ID NO:46是编码TIC127肽的核苷酸序列,对应于TIC809(由第1-696位核苷酸编码)和TIC810(由第754-1407位核苷酸编码)之间的融合体,其中已导入短连接序列(由第697-753位核苷酸编码),以允许两种蛋白在植物细胞中表达后(或在摄入目标害虫的消化道时)通过蛋白水解被分离开。
SEQ ID NO:47是TIC127氨基酸序列。
发明详述
本文提供得自苏云金芽孢杆菌的多核苷酸序列,其编码ET29、ET37、TIC809、TIC810和TIC812蛋白以及ET29衍生物TIC809和TIC810之间的融合体。还提供构建用于在植物中表达的合成核苷酸序列,其编码ET29、TIC809、ET37、TIC810和TIC812氨基酸序列以及TIC809和TIC810之间的TIC127蛋白融合体。还公开了在抗鞘翅目和半翅目昆虫侵袭的转基因植物和植物细胞的开发中制备和使用多核苷酸序列的方法。还提供在多种农业或动物环境中的局部施用制剂中作为杀虫组合物的蛋白,或作为由优选宿主细胞如细菌细胞、植物细胞或酵母或真菌细胞生产的杀虫组合物的蛋白。关于术语“得自”,其意指序列可直接从特定来源分离得到,或在从特定来源分离后,对要编码蛋白的序列诸如核苷酸序列进行修饰,所述序列与从特定来源分离的序列基本相同。或者,氨基酸序列可与从特定来源分离的氨基酸序列基本相同,或由特定核苷酸序列编码。氨基酸序列可为各自已独立地从特定来源分离的众多不同氨基酸序列的嵌合体,但这些不同氨基酸序列的多种区段已乱拼凑在一起,产生嵌合体。在此意义上,嵌合体得自多种不同氨基酸序列的每一个。核苷酸序列可类似地得自其它核苷酸序列。核苷酸序列可由于其参照一种或多种其它核苷酸序列产生或获得而得自其它核苷酸序列。类似地,氨基酸序列可参照一种或多种其它氨基酸序列获得或产生,因此以该方式获得。
本发明的合成多核苷酸序列优选设计用于植物组织和植物细胞中在植物体中表达(in planta expression)杀虫蛋白。具体地说,本发明的杀虫蛋白被称为ET29、ET37、TIC809、TIC810、TIC812和TIC127蛋白。这些蛋白中任一种的氨基酸序列只要其具有的杀虫活性至少等同于其所来源的全长蛋白的杀虫活性,就都落入本发明的范围内。
在一个实施方案中,本发明涉及含编码本文公开的一种或多种蛋白的核苷酸序列的苏云金芽孢杆菌细菌的生物学上纯的培养物。具体地说,核苷酸序列为示于编码ET37(SEQ ID NO:2)的SEQ ID NO:1、编码TIC810(SEQ ID NO:4)的SEQ ID NO:3、编码TIC812(SEQ IDNO:6)的SEQ ID NO:5以及编码ET29(SEQ ID NO:8)的SEQ ID NO:7的那些序列。另外,本文提出并具体实施了融合体,例如TIC127(编码SEQ ID NO:47的SEQ ID NO:46)。生物学上纯的培养物还可包括用本发明的多核苷酸序列或两种以上多核苷酸序列转化的那些培养物,至少第一种多核苷酸序列选自ET37编码序列和ET29编码序列,至少第二种多核苷酸序列选自TIC810编码序列和TIC812编码序列。示例性细菌菌株,即EG4096和EG5078,已依照国际承认用于专利程序的微生物保藏的布达佩斯条约保藏于美国农业研究机构保藏中心(Agricultural Research Service Center Collection,NRRL)的北方地区研究实验室(Northern Regional Research Laboratory,USDA,1815 NorthUniversity Street,Peoria,IL 61604),并已给出保藏号,见表1。
表1.示例性苏云金芽孢杆菌(Bacillus thuringiensis)菌株
Bt菌株 | 菌株性质 | 含有的毒素 | NRRL保藏号 | 保藏日期 |
EG4096 | 野生型 | ET29,TIC810 | NRRL B-21582 | 1996年5月30日 |
EG5078 | 野生型 | ET37,TIC812 | NRRL B-30841 | 2005年5月30日 |
编码ET37的天然存在的(天然的)多核苷酸序列示于SEQ IDNO:1。该序列与编码在美国专利第6,093,695号中公开的ET29杀虫蛋白并在本文以SEQ ID NO:7公开的多核苷酸序列具有约99%的序列同一性。由SEQ ID NO:1编码的ET37氨基酸序列示于SEQ ID NO:2。ET37蛋白的杀虫活性在本文于使用鞘翅目叶甲属昆虫(包括WCR和SCR)和半翅目草盲蝽属昆虫的生物测定中被证实。在对ET37和ET29编码序列在它们对应菌株中位于其上的染色体外质粒进行序列分析的过程中,鉴定ET37和ET29可读框各自上游的单个可读框,这些可读框分别对应于编码蛋白TIC812和TIC810的序列。编码TIC810蛋白(SEQ ID NO:4)的天然多核苷酸分子(SEQ ID NO:3)在苏云金芽孢杆菌菌株EG4096中紧邻ET29编码序列上游定位。编码TIC812蛋白(SEQ ID NO:6)的天然多核苷酸分子(SEQ ID NO:5)在苏云金芽孢杆菌菌株EG5078中紧邻ET37编码序列上游定位。
ET29、ET37、TIC810和TIC812全都可以与Cyt杀虫毒素家族远缘相关,但是,从系统发生的观点看,ET29和37蛋白彼此比其它Cyt蛋白更接近,TIC810和812蛋白彼此也比其它Cyt蛋白更接近。ET37氨基酸序列与ET29的氨基酸序列共有约99%的序列相似性。TIC810和TIC812彼此具有约97%的氨基酸序列相似性。TIC810与ET29和ET37具有约33%的氨基酸序列相似性。类似地,TIC812蛋白与ET29和ET37具有约32%的氨基酸序列相似性。相似性对比基于使用Wisconsin Package 10.3版,Accelrys Inc.,San Diego,CA进行的蛋白之间的成对比对。
按照本发明,在宿主细胞中TIC810、TIC812、ET37和ET29蛋白表达的某些组合用于在宿主细胞中实现期望的提高水平的杀虫蛋白累积,产生涉及某些目标鞘翅目和半翅目害虫的改善的杀虫活性。编码ET29的多核苷酸序列可与编码TIC810蛋白的多核苷酸序列共表达,以在宿主细胞中实现ET29杀虫蛋白的增强的表达和或累积。类似地,编码ET37的多核苷酸序列可与编码TIC812的多核苷酸序列共表达,以在宿主细胞中实现ET37杀虫蛋白的增强的表达和或累积。预计TIC812与TIC810作为杀虫剂以及作为ET37或ET29的稳定、累积和改善的宿主范围生物活性所需的伴侣蛋白或辅助蛋白可互换。TIC810或TIC812连同ET37或ET29的共表达产生改善的ET37或ET29表达和/或累积。这些组合在本文被称为稳定的杀虫组合物。而且,至少就其针对鞘翅目和半翅目害虫的杀虫效力而言,稳定的组合物比组合物中的任何个体组分都具有更大的宿主范围。其中由ET37或ET29组成的第一种蛋白和由TIC810或TIC812组成的第二种蛋白具有增加水平的ET29/ET37累积的重组细胞提供了先前用杀虫蛋白ET37和ET29未获得的鞘翅目和半翅目昆虫抗性水平,并导致整体与在没有TIC810或TIC812的情况下表达ET29或ET37的细胞相比,细胞或由这些重组细胞组成的生物没有异常的形态和/或表型。
以上杀虫组合还可连同至少一种不同于组合中包含的任一种蛋白的额外杀虫蛋白一起在植物中表达,表现出不同于组合中包含的任一种蛋白的作用模式,以及和组合中的蛋白相同的昆虫毒性。此包含额外的杀虫蛋白的第二种组合提供了鞘翅目昆虫抗性管理方法。所述额外的蛋白包括但不限于鞘翅目毒素Cry3Bb和变体、Cry22A、TIC901、TIC1201、TIC407、TIC417、CryET70,二元毒素PS149B1、ET33/34和ET80/76,以及多种已表现出鞘翅目杀虫活性的其它蛋白,例如马铃薯块茎蛋白(patatin)、Cry3Aa变体和可从细菌如致病杆菌(Xenorhabdus sp.)和发光杆菌(Photorhabdus sp.)分离的非特异性杀虫组合物。
ET29或ET37蛋白各自可与TIC810或TIC812组合,并在植物中与具有针对非鞘翅目害虫的杀虫活性的物质共表达,实现了对一种以上的普通植物害虫的预期防治,所述植物害虫选自鳞翅目害虫和半翅目害虫。而且,这些组合可与另一些有效控制病毒害物、细菌害物、真菌害物等的物质组合。在这些组合中提出的物质可连同ET29/ET37和TIC810/TIC812组合一起表达,或通过在农业上可接受的制剂中应用杀虫或杀虫物质提供,可能采用载体,例如软化剂、胶体、喷雾剂、粉末剂、混合物或粉剂。在其中组合物应对防治动物害虫如蚤、蜱、虱、螨等有用的情况下,连同ET29/ET37和TIC810/TIC812组合一起包括有效防治与该组合针对的害虫相同或不同的害虫的物质是有用的,所以这样的应用应以药学上可接受的制剂提供。本发明的特定杀虫组合制剂可用于局部和/或系统应用于大田作物、草、水果和蔬菜以及观赏植物。在一个实施方案中,杀虫组合物包含表达本文公开的一种或多种新的杀虫蛋白的细菌细胞的油性可流动悬液。示例性的细胞可为苏云金芽孢杆菌菌株EG4096、EG5078、sIC8134或sIC8135,但是,任何这样的表达杀虫组合物的细菌宿主细胞,例如巨大芽孢杆菌(B.megaterium)、枯草芽孢杆菌(B.subtilis)、大肠杆菌(E.coli)或假单胞菌(Pseudomonas spp.),都应当是有用的。
本发明的杀虫组合物可与其它生物技术方法如双链RNA介导的基因抑制技术组合,以实现对一种或多种特定植物害虫的预期防治。以导致形成双链RNA乃至稳定的双链RNA的方式,在植物细胞中表达选自参与必需生物途径的特定害虫细胞的天然序列的特定核苷酸序列。以此方式,在害虫摄取杀虫有效量的RNA(即一种或多种表达得自这些害虫的细胞的双链RNA的植物细胞)时,害虫的一种或多种必需生物途径被抑制。连同dsRNA一起害虫还摄入杀虫有效量的本文所述杀虫组合物,导致提供有效的昆虫或害虫抗性管理系统,该系统由于两种杀虫剂通过不同作用模式起作用而避免了出现抗性的可能性。表达对应于本发明的鞘翅目和半翅目杀虫蛋白的组合物的特定重组植物细胞,也可以表达作为dsRNA分子的一种或多种得自目标鞘翅目害虫的基因组的序列和一种或多种得自目标半翅目害虫的基因组的序列,导致提供多种杀虫有效量的本发明蛋白和设计用于抑制目标鞘翅目和/或半翅目害虫中的一种或多种必需基因的dsRNA。由编码一种或多种本发明蛋白的基因单独地或与额外的防虫剂(例如dsRNA)组合地组成的植物,相比于没有这些杀虫剂的植物,表现出改善的产量和耐旱性状。这可能是因为相比于没有这些物质的根群,这些性状产生更均一、强壮和健康的稳定化根群,提供更多的营养物和集水能力。
嵌合蛋白可被合成,其中编码ET29或ET37的序列和编码TIC810或TIC812的序列融合在一起,供作为一种蛋白的稳定杀虫组合物表达用。预期嵌合蛋白可能不稳定或不具有杀虫生物活性,除非两个融合肽未连接在一起。此物理分离可通过在蛋白之间包含作为间隔物的独特肽序列实现,所述肽序列是本领域已知的众多蛋白酶的靶。嵌合蛋白还可以与影响嵌合体稳定性的其它序列连接,导致形成基本由嵌合-融合肽组成的晶体形式或包涵体,把杂质组合物、肽或分子排除在外。这样的嵌合体或融合体在本文的实施例11中以TIC127(通过短肽连接的TIC809和TIC810的顺序融合体)和TIC128(TIC810和TIC809的顺序融合体)举例说明,已表明具有鞘翅目和半翅目杀虫生物活性。
还提供在宿主细胞中使用的表达载体。在示例实施方案中提供了包含导致本文公开的至少两种多核苷酸序列组合表达的序列的表达载体。在一个实施方案中,表达载体是分离纯化的多核苷酸分子,包含两种不同多核苷酸序列的组合,每个序列均包含在期望的宿主细胞中有功能的启动子,所述启动子与编码TIC809、TIC810、TIC812、ET29或ET37的核苷酸区段有效连接。在某些实施方案中,编码这些蛋白之一的核苷酸区段的3’可包含转录终止和聚腺苷酸化序列。
提供用于细菌宿主细胞的表达载体,例如用于大肠杆菌(E.coli)细胞或芽孢杆菌细胞,包括来自苏云金芽孢杆菌、巨大芽孢杆菌、枯草芽孢杆菌或相关芽孢杆菌的细胞。细菌宿主细胞表达载体可包含串联表达本发明的一种或多种蛋白的一种核苷酸序列,以大体上相同的方式,蛋白可在大部分细菌细胞中表达,即在多顺反子表达盒中表达。或者,细菌表达载体可由仅编码其中一种本发明蛋白的核苷酸序列组成。
在细菌中有功能的启动子在本领域众所周知。示例性的芽孢杆菌晶体蛋白启动子可包括任一种已知的晶体蛋白基因启动子,包括ET29基因启动子(美国专利第6,093,695号)和苏云金芽孢杆菌σ因子特异性启动子(Baum和Malvar,Molec.Microbiol.,18(1):1-12,1995)。或者,编码诱变或重组晶体蛋白的基因启动子可由本领域技术人员工程化,并用于启动本文公开的新的多核苷酸序列的表达。对于本发明而言,本文用于多核苷酸序列表达的启动子为cry1A启动子。
提供用于在植物细胞中表达的重组DNA构建体。这样的构建体通常包含两种或多种植物功能性表达盒,所述表达盒能将两种或多种表达盒同时导入到植物基因组中的相同基因座的方式连接在一起,或者可包含两种或多种植物功能性表达盒,所述表达盒连接在构建体或载体中,但能够被独立地导入到植物基因组中的不同基因座内。这些表达盒可被称为第一表达盒和第二表达盒,分别表达编码第一种蛋白的第一种多核苷酸序列和编码第二种蛋白的第二种多核苷酸序列。预期第一种蛋白可为本文公开的任何蛋白,第二种蛋白可为非第一种蛋白的任何本文公开的其它蛋白。提供示于SEQ ID NO:13、SEQ IDNO:15、SEQ ID NO:17和SEQ ID NO:19的示例性多核苷酸序列。
用于本发明的重组DNA构建体的启动子在导入到植物细胞中时应具有驱动编码杀虫物质的多核苷酸序列表达的能力。可用于在植物中表达多肽序列的启动子可为用于单子叶或双子叶植物的诱导型、组成型、组织特异性或发育特异性启动子。在一个实施方案中,选择使用的启动子可为组成型启动子,对本发明而言,启动子可特别地包括增强型花椰菜花叶病毒(CaMV 35S)启动子。在另一个实施方案中,选定的启动子可为组织特异性启动子,对本发明而言,启动子可特别地包括从水稻分离的根特异性启动子Rcc3(美国专利申请序号11\075,113)。
除了一个或多个启动子以外,载体或构建体还可包含用于调节它们所连接的目标基因的表达水平和时间的元件。例如,所述构建体可包含内含子序列。用于本发明的内含子序列可包括水稻肌动蛋白内含子(美国专利第5,641,876号)。重组DNA构建体还可在启动子和编码序列之间具有翻译前导序列。载体或构建体还可在目标编码区中包含完全或部分地起终止该区转录作用的核酸序列。在一个实施方案中,本发明的多腺苷酸化序列可来自小麦热激蛋白基因(tahsp17)的3′非翻译区。
重组DNA构建体也可包含其它元件。例如,构建体可包含提供复制功能的DNA区段和一种或多种用于细菌细胞的选择标记。构建体还可包含筛选标记、选择标记和适合于选择具有本发明的重组DNA构建体的植物或细菌细胞的其它元件。重组DNA构建体设计得具有可赋予细胞抗生素或除草剂耐受性的适宜选择标记。耐抗生素的多核苷酸序列包括但不限于编码涉及耐卡那霉素、耐新霉素、耐潮霉素和耐本领域已知的其它抗生素的蛋白的多核苷酸序列。在此载体中的耐抗生素基因可被编码5-烯醇丙酮基莽草酸-3-磷酸合酶(EPSPS,参见美国专利第5,627,061和5,633,435号;Padgette等,Herbicide ResistantCrops,Lewis Publishers,53-85,1996)的耐除草剂基因或其它选择标记基因和它们的等同物(例如草铵膦(basta)耐受性、bar耐受性、氨甲喋呤抗性、草甘膦氧化还原酶、草甘膦乙酰转移酶、磷酸乙酰化酶(phnO和得自肠杆菌科(Enterobacteriaceae)的等位基因)和耐麦草畏基因等)替代。表达与一种或多种这样的选择标记关联的本发明耐虫性的植物对商业用途特别有用。
本发明的多核苷酸序列可用于转化植物细胞,所述植物细胞可被再生为转基因植物,该转基因植物与获得转基因植物的植物或植物细胞相比具有改善的抗虫性。本发明的多核苷酸序列可被修饰,以改善它们在植物宿主细胞中的表达。本发明的多核苷酸序列在植物细胞中的表达可实现杀虫蛋白在胞质中的累积,或者可以导致杀虫蛋白在诸如叶绿体、质体或线粒体的亚细胞器中累积。
在实施前述内容时,已改善了编码示于SEQ ID NO:14、SEQ IDNO:16、SEQ ID NO:18和SEQ ID NO:20以及SEQ ID NO:47的本发明蛋白的多核苷酸序列在植物中的表达。本文例举的这些多核苷酸序列示于SEQ ID NO:13(TIC809)、SEQ ID NO:15(TIC810)、SEQ IDNO:17(ET37)、SEQ ID NO:19(TIC812)和SEQ ID NO:46(TIC127),而且在示于SEQ ID NO:28、SEQ ID NO:31、SEQ ID NO:34、SEQ IDNO:36、SEQ ID NO:38、SEQ ID NO:40和SEQ ID NO:43的表达盒序列中。TIC810或TIC812蛋白中的至少一种连同ET29、TIC809或ET37杀虫蛋白中的一种或多种的共表达改善(或帮助、作为伴侣蛋白用于、稳定或可作为辅助蛋白另外与杀虫蛋白相互作用)杀虫蛋白的表达和/或累积。在转基因植物中的共表达导致没有低水平表达和/或累计以及没有在仅杀虫蛋白(例如ET29、ET37或TIC809)单独表达时观察到的植物毒性作用。
有许多将含多核苷酸序列组合的重组DNA构建体导入细胞中的方法,一般认为适宜的方法实际上包括可将DNA导入植物细胞中的任何方法,例如通过土壤杆菌感染,例如通过PEG介导的原生质体转化(Omirulleh等,Plant Mol.Biol.,21:415-428,1993)、通过干燥/抑制介导的DNA摄取、通过电穿孔或通过微粒轰击等直接传送DNA。
在考虑抗虫性管理(IRM)时,本发明的组合物和方法可用于生产转基因植物,其表达两种以上对相同昆虫有毒的苏云金芽孢杆菌蛋白,并提供一定水平的抗性管理,用于延迟任何特定敏感昆虫对转基因植物中表达的一种或多种杀虫物质的抗性的出现。或者,期望表达对特定目标害虫有毒的苏云金芽孢杆菌杀虫蛋白连同对相同害虫有毒但通过与苏云金芽孢杆菌毒素具有的途径不同的途径赋予毒性的不同蛋白类物质。这些其它的不同蛋白类物质可包含任何Cry杀虫蛋白、Cyt杀虫蛋白、来自致病杆菌(Xenorhabdus sp.)或发光杆菌(Photorhabdus sp.)的杀虫蛋白、苏云金芽孢杆菌植物性杀虫蛋白等。一种获得此结果的方法会产生两种不同的转基因事件,一个事件表达本发明的两种杀虫蛋白的组合,对鞘翅目和半翅目昆虫有活性,另一个事件表达第三种杀虫蛋白,将两种性状一起选育到杂种植物中。第三种杀虫蛋白可能是具有鞘翅目杀虫活性的蛋白、具有半翅目杀虫活性的蛋白、具有针对鞘翅目和半翅目这二者(可能还对其它昆虫目有毒性)的杀虫活性的蛋白或者具有针对非鞘翅目和半翅目害虫的一种或多种其它目昆虫(包括但不限于鳞翅目、直翅目、双翅目等)的杀虫活性的蛋白。
杀虫量的本发明杀虫蛋白可在害虫食物中提供。典型地,所述食物由一般为昆虫食用的植物部分组成,例如植物组织或植物细胞,但也可包括其它组合物,例如配制用于增强特定害虫的发育和存活的人工食物。杀虫蛋白可在应用于食物表面的组合物中提供,或者更优选地可通过细胞的蛋白合成机器产生,并如上所述在植物细胞中累积或分泌到植物细胞外部,只要提供的蛋白毒素量是足以抑制害虫进一步进食或抑制害虫的进一步生长和发育或引起害虫死亡的杀虫量。
转基因植物可自交(自花授粉),以产生具有对编码杀虫蛋白的转基因为纯合的基因型的种子。这些种子产生仅对编码杀虫蛋白的转基因为纯合的植物和种子。产生的转基因植物经常在不能表现出最期望的农艺品质的品系中产生。因此,纯合的重组植物可与具有特定农艺学主要品质的近交系杂交。
本发明尤其可用于产生商业用途的转基因植物,包括多种草坪草、小麦、玉米、水稻、大麦、燕麦、各种观赏植物和蔬菜,以及众多结坚果和水果的树和植物。具体地说,这些植物可包括玉米、小麦、黑麦、大麦、燕麦、荞麦、高粱、水稻、洋葱、草、向日葵、油菜、豌豆、菜豆、大豆、棉花、亚麻子、花椰菜、芦笋、莴苣、烟草、芥菜、糖甜菜、马铃薯、甘薯、胡萝卜、芜菁、芹菜、番茄、茄子、黄瓜、南瓜、苹果、杏、桃、梨、李、橘、黑莓、蓝莓、草莓、酸果蔓和柠檬。一般来说,本发明在单子叶和双子叶植物品系中有用。转基因植物选自单子叶和双子叶植物,可包括诸如玉米、小麦、黑麦、大麦、燕麦、荞麦、高粱、水稻、甘蔗、洋葱、大蒜、草等单子叶植物,或诸如向日葵、油菜、豌豆、豇豆、木豆、菜豆、大豆、咖啡、椰菜、棉花、亚麻子、花椰菜、拟南芥、芦笋、莴苣、烟草、香料植物(包括咖哩、芥菜、鼠尾草、欧芹、胡椒、百里香、芫荽、月桂、孜然芹、姜黄、肉豆蔻、肉桂等)、糖甜菜、马铃薯、甘薯、胡萝卜、芜菁、芹菜、番茄、茄子、黄瓜、南瓜或香瓜,果树植物(包括苹果、杏、桃、梨、李、橘、柠檬、酸橙等)、浆果植物(包括黑莓、蓝莓、草莓、酸果蔓等)、坚果树植物(包括橡树、山核桃木、巴西木、美洲山核桃木、胡桃木、榛木等)、葡萄植物和开花植物。
本文提供的DNA序列信息允许制备具有与本文公开的核苷酸序列或编码TIC810、TIC812、ET37和ET29相关蛋白的同源序列特异性杂交能力的核苷酸序列或探针和/或引物。这些核酸探针与相关毒素或辅助/伴侣素样蛋白编码序列特异性杂交的能力在多个实施方案中提供了特别用途。更重要的是,探针可用于检测给定样品中的互补序列存在情况的多种测定。含本发明基因的转基因植物在其下商品化的法规环境的性质提供了能够检测生物样品中的蛋白编码序列以及本发明蛋白的存在情况的特定用途,并提供了在关于其授权的任何专利期当中发现某些要求保护的实施方案的侵权行为的方法。
在某些实施方案中,使用单独的或成对的或其它引物组的寡核苷酸引物是有利的。这些引物的序列使用本发明的多核苷酸设计,用于使用热扩增法检测、扩增或突变来自苏云金芽孢杆菌或其它来源的毒素或辅助/伴侣素样蛋白编码序列的限定区段。还可以扩增来自其它种的相关毒素或辅助/伴侣素样蛋白编码序列的区段。
还提出了用于检测生物样品中的本发明多核苷酸或氨基酸序列的试剂盒。这些试剂盒包含一个或多个多核苷酸序列,所述多核苷酸序列各自均用作检测编码本发明的杀虫蛋白或其片段的多核苷酸序列存在情况的探针。这些试剂盒可另外或可选地包含特异性结合本发明蛋白的一种或多种多肽的抗体,以及与探针或抗体一起使用的试剂,试剂盒应当还含有对照样品,用于确保按照生产商的说明实施用探针和或抗体和试剂鉴定核苷酸或肽。实施核苷酸序列或肽鉴定方法必需的所有试剂都应和使用说明书一起包装在试剂盒中。示例性的试剂盒可包含TIC810或编码杀虫蛋白的相关多核苷酸序列,连同示于SEQ ID NO:23和SEQ ID NO:24的示例性核苷酸序列扩增引物pr375和pr376的样品,加上实施扩增反应必需的必需试剂,所有这些都一起包装在试剂盒中。
特异性结合仅由ET37、TIC810和TIC812蛋白中的任一种或其同源物提供的表位的抗体也可用于鉴定ET37、TIC810和TIC812蛋白中的任一种或其同源物的存在情况,用于纯化蛋白或同源物,用于鉴定由其表达ET37、TIC810或TIC812蛋白或同源物的核苷酸序列,用于设计得可检测ET37、TIC810和TIC812蛋白或同源物或检测表达所述蛋白或其同源物的核苷酸序列的试剂盒。技术人员容易认识到,这些抗体还提供对这些蛋白的融合体(例如TIC127等)的鉴定。
预期农业上和商业上重要的产品和/或组合物(包括但不限于动物饲料、商品和棉花或大豆或玉米产品以及拟用作动物饲料或用作人食用食品或用于预计人食用的组合物的副产品,包括但不限于棉籽、棉籽油、棉籽饼粉等,大豆饼、大豆油、大豆粉等,玉米面、玉米粉、玉米浆、玉米油、玉米淀粉、爆米花、玉米饼、含有玉米或大豆的谷物以及玉米副产品等)都落入本发明的范围内,前提条件是这些产品和组合物含有可检测量的、如本文所示的、用于诊断编码ET29、ET37、TIC809、TIC810、TIC812、TIC127的序列或其组合等的存在情况的核苷酸序列。还提出了为农业上和商业上重要产品的蒸馏谷物固体(distillers dry goods),尤其是在其包含可检测量的编码本发明的一种或多种蛋白的核苷酸序列或者可检测量的一种或多种本发明蛋白的情况下。
含编码这些蛋白的可检测量的核苷酸序列的种子,或可被加工成含可检测量的这些核苷酸序列或蛋白的产品的种子或植物部分,都落入本发明的范围内。
按照这些实施例,本领域技术人员应理解,可在不偏离所公开的本发明精神和范围的情况下,对前述公开内容实施多种改变。
业已阐释和描述了本发明的原则,对本领域技术人员应显而易见的是,可在不偏离这些原则的情况下,在排列和细节方面修改本发明。我们要求保护在随附权利要求书的精神和范围内的所有修改。
本说明书中提及的所有出版物和公布的专利文件都在此引入作为参考,其程度如同每个单独的出版物或专利申请具体地和单独地指明被引入作为参考。
实施例
实施例1
本实施例说明了用于在玉米植物细胞中实现ET29蛋白表达的核苷酸序列的构建。
ET29是得自苏云金芽孢杆菌的蛋白,先前已表明在食物中提供给玉米根虫幼虫时具有玉米根虫杀虫生物活性(美国专利第6,093,695、6,537,756、6,686,452号)。业已表明,天然Bt编码序列在植物细胞中表达时表现出无法接受的蛋白合成水平(美国专利第5,500,365号)。ET29蛋白在玉米植物细胞中的表达,特别是在玉米根细胞中的表达,可提供给玉米植物抗玉米根虫进食损伤的保护作用。因此,构建编码苏云金芽孢杆菌ET29杀虫蛋白的核苷酸序列,预期其在植物中更高地表达,避免了先前已表明有问题的某些不利核苷酸序列,同时保持除一个以外编码天然杀虫蛋白的核苷酸序列;将在编码序列(SEQ IDNO:13)2位的补加丙氨酸密码子纳入合成序列中,以利于克隆的简易性。示于SEQ ID NO:14的ALA2变体ET29氨基酸序列被称为TIC809,在生物测定中具有不低于天然ET29的生物活性。
将示于SEQ ID NO:13的TIC809编码区亚克隆入二元植物转化载体。TIC809编码区上游的元件包括增强的CaMV 35S启动子、小麦主要叶绿素a/b-结合蛋白5’非翻译前导序列、水稻肌动蛋白1基因第一内含子和侧翼非翻译前导区(UTL)外显子序列,以及任选地包括玉米核酮糖1,5-二磷酸羧化酶小亚基叶绿体转运肽编码序列。由与TIC809蛋白N端连接的叶绿体转运肽(ctp)组成的融合蛋白在植物体中的表达能够将TIC809蛋白靶入质体中。将小麦hsp17 3’非翻译区(UTR)掺入TIC809下游,以实现mRNA转录物的转录终止和聚腺苷酸化。
植物转化载体包含耐草甘膦选择标记。植物转化载体pMON705 13供胞质可溶性TIC809蛋白表达用,而pMON705 14供靶向质体的TIC809蛋白表达用。
实施例2
本实施例说明了在瞬时玉米原生质体测定中靶向质体的和非靶向的TIC809表达的对比,以及随后对被转化以表达靶向玉米质体的TIC809蛋白的转基因植物的分析。
将使用用pMON70513或pMON70514转化的玉米原生质体的瞬时表达测定彼此对比和与空载体对照对比。结果表明,非靶向的TIC809蛋白表达水平比靶向的TIC809蛋白低。因此,仅进一步分析pMON70514。在土壤杆菌介导的玉米原生质体转化后产生转基因玉米事件。筛选再生的玉米植株(“R0植株”)的草甘膦耐受性和小麦hsp17(tahsp17)3′UTR的拷贝数。使用ELISA法筛选各个转基因玉米事件的6叶期(V6)根和叶样品的ET29蛋白的存在情况和存在量。
通过ELISA分析的87个转基因事件中有1 9个表现出以褪绿茎秆为特征的独特异常表型,这些事件中的8个具有缨或穗异常。表现出异常表型的植株的叶和根组织中的TIC809平均表达水平分别为2.2ppm和2.0ppm。表型正常植株的叶和根中的TIC809平均水平分别为1.4ppm和1.1ppm。这些结果提示,较高水平的TIC809蛋白可能与观察到的异常表型相关。
实施例3
本实施例说明了TIC810编码基因的克隆以及TIC810的表达不具有玉米根虫杀虫生物活性的鉴定结果,TIC810是由苏云金芽孢杆菌的操纵子中表达的蛋白,在该操纵子中还存在ET29(tic809)基因。
et29最初克隆在得自苏云金芽孢杆菌菌株EG4096的DNA的7.1kb EcoRI片段上(美国专利第6,686,452号),并保留在质粒pEG1303中,质粒pEG1303是能够在苏云金芽孢杆菌和大肠杆菌中复制的穿梭载体。含pEG1303的重组苏云金芽孢杆菌菌株EG11502在C2培养基中生长时产生低水平的ET29晶体蛋白(Donovan等,Mol.Gen.Genet.214:365-372,1988)。
将ET29编码序列作为约1.5kb的KpnI-ClaI片段由pEGl303中的大7.1 EcoRI片段亚克隆入高拷贝数的穿梭载体pEG854.9(Baum等,(1996)Appl.Env.Microbiol.62:4367-4373)中,例外之处是会由较小片段观察到ET29表达水平的增加。产生的质粒pMON78402一般被认为包含ET29编码区的足够天然的DNA 5’和3’,以掺入任何必需的表达元件,例如孢子形成依赖性启动子。令人惊奇的是,当将pMON78402导入到无晶体苏云金芽孢杆菌宿主菌株EG10650中时未检测到蛋白晶体形成,提示存在于pMON78402上的5’区可能不包含天然ET29启动子,pEG1303中克隆的ET29转录由偶然性的虫煤启动子驱动。而且,pEG1303中的完整7.1 kb EcoRI片段的测序揭示,紧邻ET29编码区上游存在中断的可读框。中断的编码区含有用于在pEG1303中克隆7.1 kb EcoRI片段的其中一个末端EcoRI位点。现有的非冗余蛋白数据库以及苏云金芽孢杆菌晶体蛋白序列数据库的FASTX检索提示,该部分编码区编码的氨基酸序列与ET29蛋白的氨基酸序列具有约36%的序列同一性。此相关蛋白称为TIC810。这提示,ET29基因存在于包含至少上游TIC810基因的非特征性操纵子中,并因为TIC810可能与ET29共表达,所以其非常可能也具有玉米根虫杀虫活性。
天然ET29编码序列示于SEQ ID NO:9的716-1408位。在该编码序列中存在单个NheI位点(示于SEQ ID NO:9的核苷酸820-825)。pEG1303中的部分TIC810编码序列示于SEQ ID NO:9的核苷酸369-654位。分开TIC810编码序列的EcoRI位点示于SEQ ID NO:9的核苷酸369-374。NheI消化EG4096 DNA或用NheI和适合的限制酶消化结合连接和反向PCR使得可以鉴定TIC810编码区的5’末端的核苷酸序列。
用相适的多种组合的限制酶以50μL体积消化EG4096基因组DNA(5μg),所述组合包括NheI、NheI+BlnI、NheI+SpeI和NheI+XbaI。将10μl完全消化物与80μL无菌水、10μL 10X连接酶缓冲液(New England BioLabs,Beverly,MA)和2μL T4连接酶混合,并于4℃保温过夜。连接产物用作热扩增模板,使用Invitrogen(Carlsbad,CA)的Elongase试剂盒以及异向引物pr370和pr371(分别示于SEQ IDNO:21和SEQ ID NO.22)。SEQ ID NO:21对应于示于SEQ ID NO:9的核苷酸650-671的反向互补序列,SEQ ID NO:22对应于示于SEQ IDNO:9的核苷酸744-764。
用于各个反应的限制酶产生匹配末端。苏云金芽孢杆菌DNA为富含AT的,所以预期与BlnI和NheI限制酶相比限制酶SpeI和XbaI会产生较小的PCR产物。只有SpeI-NheI和XbaI-NheI组合产生扩增的DNA片段。对扩增的DNA片段克隆和测序,并与示于SEQ ID NO:9的核苷酸369-825组成的EcoRI-NheI区段组装。鉴定出如SEQ ID NO:3所示的预计编码TIC810的组装序列。随后使用高保真度热稳定性聚合酶直接由EG4096基因组DNA扩增TIC810基因。使用多个克隆证实该序列。预计TIC810基因编码如SEQ ID NO:4所示的约25,000道尔顿的蛋白。推断的TIC810氨基酸序列与ET29氨基酸序列具有约33%的氨基酸序列同一性,提示TIC810也可能具有杀虫生物活性。
检验苏云金芽孢杆菌无晶体菌株中的TIC810表达。TIC810基因使用引物pr375和pr376(分别示于SEQ ID NO:23和SEQ ID NO:24)扩增,产生编码TIC810氨基酸序列变体的扩增子,其包含取代天然GTG密码子的ATG翻译起始密码子。引物pr375还将SpeI位点5’掺入到TIC810编码区,而引物pr376将XhoI位点3’掺入到TIC810编码区扩增子,允许将该扩增子亚克隆到苏云金芽孢杆菌-大肠杆菌穿梭载体pMON47407中。将编码TIC810的扩增子紧邻载体内源性cry1A启动子下游插入到该载体中。位于cry1A启动子下游的序列在苏云金芽孢杆菌中具有孢子形成依赖性表达。将获得的含TIC810蛋白编码序列的重组质粒pMON78409通过电穿孔导入到无晶体苏云金芽孢杆菌宿主菌株EG10650中,以产生重组苏云金芽孢杆菌菌株SIC8116。菌株SIC8116在C2孢子形成培养基中生长时产生含TIC810蛋白的伴孢包涵体。当玉米根虫幼虫接触覆盖已形成孢子的SIC8116培养物的人工培养基时,未观察到杀虫生物活性,提示TIC810蛋白不具有针对CRW的杀虫生物活性。
实施例4
本实施例说明了处于cry1A孢子形成依赖性启动子控制下的ET29表达产生较差的表达水平和异常的生理性宿主细胞行为。
含质粒pEG1303的重组苏云金芽孢杆菌菌株EG11502产生少量的ET29晶体蛋白。在尝试增加苏云金芽孢杆菌中的ET29表达时,将ET29编码序列插入到苏云金芽孢杆菌-大肠杆菌穿梭载体pMON47407中。ET29编码序列使用引物pr365和pr372(分别为SEQID NO:25和SEQ ID NO:26)由pEG1303 DNA扩增。在验证序列后,将扩增的ET29基因片段以由载体内源性cry1A启动子有效表达ET29的方向插入到pMON47407载体骨架中。为评价ET29蛋白生产,将获得的重组质粒pIC17507导入到苏云金芽孢杆菌菌株EG10650中,以产生重组菌株SIC8114。菌株SIC8114产生的ET29蛋白量低于原始ET29重组菌株EG11502产生的ET29蛋白量,在C2孢子形成培养基中表现出较差的孢子形成。该结果提示,ET29的过表达可能对宿主细胞有害,或者天然菌株EG4096中存在的或重组菌株中不存在的某些其它因素是ET29的有效表达和/累积所需要的。
实施例5
本实施例说明了与TIC810/ET29操纵子具有同源性的操纵子的鉴定。
通过对EG4096中TIC810/ET29操纵子相关序列的存在情况进行DNA印迹分析检验其它Bt菌株。发现Bt菌株EG5078总DNA的5.4kb ClaI限制性片段与ET29特异性杂交探针杂交。将该5.4 kb ClaI片段克隆入苏云金芽孢杆菌-大肠杆菌穿梭载体pEG854中(Baum等,(1990)Appl.Env.Microbiol.56:3420-3428),并还克隆到大肠杆菌载体pBluescript IISK中,以分别产生重组质粒pEG1325和pEG1323。5.4 kb插入片段的序列分析揭示了在苏云金芽孢杆菌菌株EG4096中类似于ET29和TIC810基因的两个紧密连接的基因。5’邻近编码序列被称为tic812(示于SEQ ID NO:5),预计其编码TIC812多肽(示于SEQ IDNO:6)。TIC812与TIC810蛋白具有约97%的氨基酸序列同一性。3’邻近编码序列被称为et37(SEQ ID NO:1),预计其编码示于SEQ IDNO:2的ET37氨基酸序列。ET37与ET29蛋白具有超过约99%的氨基酸序列同一性,差别仅在于一个氨基酸位置。
使用电穿孔法将包含TIC812和ET37基因这二者的穿梭载体pEG1325导入到无晶体苏云金芽孢杆菌宿主菌株EG10650中。获得的重组菌株EG11541在C2孢子形成培养基中培养时产生高水平的ET37蛋白。然而,存在于孢子中的TIC812蛋白量大约仅为ET37蛋白量的25%。已发现菌株EG5078产生的ET37/TIC812蛋白混合物在生物测定中测试时对玉米根虫幼虫有毒。
因为TIC810/ET29和TIC812/ET37操纵子之间的相似性,以及因为ET37和ET29差异仅在于一个氨基酸位置,所以不可能两种蛋白都表现出明显不同的杀虫或细胞毒性特性。单独表达时低水平的ET29相比于由其天然操纵子表达时较高的ET37蛋白水平,连同TIC810和TIC812蛋白之间的大致同一性,提示TIC810和TIC812蛋白可在为它们对应的ET29或ET37蛋白增加水平或提供一定稳定作用方面起作用。TIC812连同ET37蛋白的共表达可为在菌株EG11541中观察到的ET37过表达的原因。另外,TIC810和ET29的共表达同样可导致ET29过表达。
为测试该假说,扩增单个ET29、ET37、TIC810和TIC812编码序列,并克隆入上述苏云金芽孢杆菌表达载体pMON47407中,使得每个的表达都处于载体内源性cry1A启动子控制之下。将TIC812编码序列位置1的天然GTG密码子改变为ATG密码子。TIC810-ET29串联编码序列(SEQ ID NO:9)和TIC812-ET37串联编码序列(SEQ IDNO:10)各自由基因组DNA扩增,并克隆入TOPO克隆载体pCR2.1-TOPO(Invitrogen,Carlsbad,CA)中,证实它们的序列。然后将扩增的DNA片段克隆入载体pMON47407中,使得串联编码序列处于载体内源性cry1A启动子控制之下。因此,将各个插入片段以相同方向克隆入pMON47407中,并使用相同的启动子。无晶体苏云金芽孢杆菌菌株EG10650用作所有表达研究的宿主菌株。质粒构建体和包含这些质粒的重组苏云金芽孢杆菌见表2。
表2.构建用于TIC810/ET29和TIC812/ET37的表达分析的质粒
1-用于扩增插入到pMON47407中的编码序列的引物对
重组菌株和EG10650各自在250ml挡板瓶中的30ml C2培养基于28℃在剧烈搅拌下培养3天。通过低速离心收集孢子和晶体,用30ml洗涤缓冲液(10mM Tris-HCl,0.1mM EDTA,0.005%TritonX-100,pH 6.8)洗涤1次,以3ml终体积重悬浮在洗涤缓冲液中。然后通过SDS-PAGE分析这些10X C2浓缩物。蛋白浓度通过使用BSA作为标准品的光密度分析法测定。
SDS-PAGE分析表明,1)ET29和ET37在单独表达时均表现出较差的产量;2)TIC810和TIC812在单独表达时累积至高水平;和3)ET29和ET37在分别连同TIC810或TIC812共表达时表现出急剧提高的表达水平。存在TIC810时的ET29蛋白产量是不存在TIC810时的约4.6倍高,存在TIC812时的ET37产量是不存在TIC812时的约6.6倍高。而且,含TIC810_ET29串联编码序列的菌株SIC8134和含TIC812_ET37串联编码序列的SIC8135具有正常的孢子形成和裂解。这些结果表明,TIC810和TIC812分别是在苏云金芽孢杆菌中高水平生产ET29和ET37所需要的,可能起辅助蛋白或伴侣蛋白的作用。
10X TIC812/ET37孢子-晶体悬液直接用于针对WCR的生物测定。悬液中的晶体蛋白通过SDS-聚丙烯酰胺凝胶电泳和使用牛血清白蛋白作为标准品的光密度分析法定量。以类似于Pleau等(Entomol.Exp.Appl.105:1-11,2002)所述的方式制备200mL WCR食物。每孔施加20μL测试样品,并使其干燥,然后用细硬毛刷涂抹于每孔一个新生昆虫幼虫。板用聚酯薄膜密封,并使用昆虫针通风。每个样品浓度测试24只幼虫。生物测定板于27℃、60%RH在完全黑暗中温育5-7天。根据实验在5-7天结束时记录每个处理的存活幼虫数。存活幼虫在微量天平(Cahn C-33)上称重。数据使用4统计学软件(SAS Institute,Cary,N.C.,USA)分析。生物测定数据见表3。
结果提示,发现含ET37和TIC812混合物的野生型苏云金芽孢杆菌菌株EG5078的已形成孢子的培养物对西方玉米根虫幼虫有毒,与对照相比引起显著的幼虫量减少。相对照的cry-苏云金芽孢杆菌宿主菌株EG10650的10X孢子悬液在生物测定中未表现出抗WCR幼虫的活性。纳入含500mg Cry3Bb根虫杀虫蛋白的样品作为阳性对照。
表3.西方玉米根虫生物测定对ET37/TIC812
1-mg/ml;2-标准偏差;3-未处理的检验;
变异数不相等,Levene法,P>F 0.0053;
存在由于处理产生的作用,SLS,P>F 0.0002;和
P值<0.05的平均值与UTC、对比计划(Planned Contrast)显著不同。
实施例6
本实施例说明了TIC810和TIC809蛋白在玉米原生质体中的共表达。
构建编码苏云金芽孢杆菌TIC810和ET29(TIC809)蛋白的合成核苷酸序列,用于在植物中表达。编码TIC809的合成序列示于SEQ IDNO:13,氨基酸序列翻译示于SEQ ID NO:14。编码TIC810的合成序列示于SEQ ID NO:15,氨基酸序列翻译示于SEQ ID NO:16。将TIC809和TIC810的合成编码序列克隆入表达载体中,用于采用玉米原生质体的瞬时表达研究。TIC809和TIC810编码序列的编码区周围的遗传元件是相同的,只是加入叶绿体转运肽(ctp),如实施例1所述。质粒构建体的部分描述见下表4。
表4.用于TIC810/TIC809表达的瞬时测定的质粒构建体
原生质体如下制备:在0.6 M甘露醇、10 mM MES pH 5.7、2%纤维素酶RS和0.3%混合酶R10中消化12日玉米叶组织达2小时。所有转化都使用50μg DNA和1.3×106个细胞进行。
原生质体中的TIC809表达使用ELISA检测,该ELISA采用针对ET29蛋白产生的多克隆抗体。结果见表5,代表3个重复样品的平均值。
表5.TIC809在玉米原生质体中的瞬时表达
1-便用Tukey-Kramer HSD比较所有对,α=0.05。具有相同字母的处理彼此之间没有显著不同。
2-(ngET29/mg总蛋白)
结果表明,靶向叶绿体的TIC809蛋白导致瞬时系统中的TIC809表达相比于非靶向的TIC809表达增加。通过比较pMON84203的表达和pMON84202的表达,观察到靶向的TIC809蛋白表达约为非靶向的TIC809蛋白表达的约20倍高。但是,非靶向的TIC810蛋白与非靶向的TIC809共表达也导致TIC809蛋白的表达显著增加(比较pMON641 34和pMON84202)。在6个实验中的5个当中,将TIC810和TIC809这二者靶向质体导致TIC809的表达和累积水平等同于或高于TIC809被单独地靶向质体时累积的量。无论如何,TIC809和TIC810一起在任何共同细胞部位的共表达都导致TIC809累积水平相比于没有TIC810的情况下在该区室中表达时的TIC809累积水平增加。
构建pMON64136和pMON64137,以测试TIC809在定位于与TIC810不同的亚细胞区室时的表达作用。TIC810靶向质体对非靶向的TIC809的表达没有显著影响(比较pMON64136和pMON84202)。类似地,非靶向的TIC810对靶向的TIC809的累积没有显著影响(比较pMON64137和pMON84203)。但是,非靶向的TIC810增加或稳定非靶向的TIC809的累积。这些结果提示,两种蛋白定位于细胞中的相同空间导致根虫杀虫蛋白TIC809的累积更大,这可能是由于蛋白之间的某些相互作用造成的,这些相互作用稳定TIC809蛋白的累积。这些结果与TIC810和ET29共表达或TIC812和ET37共表达分别在苏云金芽孢杆菌产生高水平的ET29或ET37表达的观察结果相一致。
在瞬时原生质体表达测定中纳入含萤光素酶(LUX)基因的质粒作为对照。尽管萤光素酶一般被纳入瞬时测定中作为转化功效的指示剂,但观察到萤光素酶表达水平随测试的质粒构建体而广泛变化,推测是由于累积的Bt蛋白的植物毒性作用造成的。TIC809的低累积与低萤光素酶水平相关。在所有情况下,除了TIC809定位于叶绿体时以外,非靶向的TIC810在相同区室中与TIC809共表达导致萤光素酶和TIC809表达水平均急剧增加(对比pMON64134和pMON84202)。
萤光素酶数据见表6。
表6.瞬时测定实验中的萤光素酶水平
1所有对的对比都使用Tukey-Kramer HSD,α=0.05。具有相同字母的处理彼此之间没有显著不同。
实施例7
本实施例说明了TIC810和TIC809在转基因玉米中共表达的结果。
将TIC809、ctp-TIC809、TIC809-TIC810和ctpTIC809-ctpTIC810表达盒导入到适用于玉米转化的二元植物转化载体中。与编码Bt蛋白的序列不同,构建体彼此差异仅在于有或没有靶向叶绿体的肽编码序列。赋予草甘膦耐受性的基因作为选择标记用于土壤杆菌介导的转化。
pMON64138包含用于在植物体中表达TIC809和TIC810的表达盒。pMON64139包含用于在植物体中表达靶向叶绿体的TIC809和靶向叶绿体的TIC810的表达盒。pMON70513包含用于在植物体中表达TIC809的表达盒。pMON70514包含用于在植物体中表达靶向叶绿体的TIC809的表达盒。使用这4种质粒转化和草甘膦选择后获得再生转基因玉米植株,使用TaqMan测定筛选这些植株的草甘膦选择标记基因存在情况和tahsp173’序列存在情况。对于每个事件在相关的情况下都使用终点PCR测定证实TIC809和/或TIC810编码序列的存在情况。预期用含TIC809和TIC810这二者的构建体转化的事件具有2个拷贝的tahsp17 3’序列,因为每个编码序列在其3’末端都以tahsp17 3’元件为界。使用ET29 ELISA测定含6片叶(V6期)和根的转基因和对照植株的TIC809累积水平。
在用仅含靶向胞质的TIC809编码序列的pMON70513转化后再生的植株表现如以上实施例1和2所述。具有显著异常的表型和特征的植株在视觉上通常以褪绿茎秆连同其它异常性为特征。根组织中的TIC809表达和/或累积水平平均不超过约2.0ppm。
测定用pMON64138转化的15株R0植株的草甘膦标记的存在情况以及tahsp173’元件的存在情况和拷贝数,以及用于检测任何载体骨架的OriV的存在情况,和编码靶向胞质的TIC809和TIC810蛋白的基因的存在情况和完整性。一个事件不包含全长TIC809编码序列,还被测定出具有不可检测水平的TIC809蛋白。余下的14个事件在叶组织或根组织中表现出平均约12ppm的TIC809(鲜重)。根组织中的TIC809水平由1个植株中的约0.2ppm至表现出最高表达和/或累积水平的植株中的约45ppm。该结果提示,TIC810连同TIC809在植物组织中的共表达提供了改善的TIC809蛋白表达和/或累积水平。更显著地,在胞质中表达TIC809和TIC810这二者的植物中未观察到异常表型。TIC809的表达水平在根组织中比在叶组织中更均一,大部分植株在根中具有的TIC809表达水平高于叶中的表达水平。
使用含靶向叶绿体的TIC809和TIC810编码序列的pMON64139由转化的植物细胞再生15株R0植株。在筛选分析中鉴定出1个事件,该事件不含全长TIC809表达盒,也不具有可检测的TIC809蛋白水平。余下的14个事件在根和叶组织中的TIC809蛋白分别为平均约4.4ppm和约8.6ppm。根中的TIC809水平在约0.2ppm至约45ppm的范围内。用pMON70514转化的、含ctpTIC809基因的玉米植株在叶组织中平均仅为约1.7ppm TIC809蛋白,在根组织中平均为约6.3ppmTIC809蛋白。因此,非靶向的TIC810连同靶向的TIC809的共表达产生的TIC809表达和/或累积水平高于用靶向叶绿体的TIC809蛋白达到的水平。而且,产生提高水平的TIC809蛋白的R0植株没有表现出茎秆褪绿或与在植物体中表达单独的TIC809蛋白相关的其它植物毒性表现。在与没有TIC810的情况下表达的靶向叶绿体的TIC809蛋白水平相比时,靶向叶绿体的TIC810连同靶向叶绿体的TIC809的共表达也导致TIC809累积水平增加。
转化18株R0植株,以共表达靶向的TIC809和靶向的TIC810蛋白,测定这些植株的耐草甘膦选择标记基因、tahsp17 3’拷贝数、OriV(骨架)的存在情况以及完整的TIC809和TIC810编码序列。一个事件不包含完整的TIC809编码序列,不能表现出可检测水平的TIC809蛋白。余下的17株植株在叶和根中的TIC809蛋白分别为平均8.6和4.4ppm。TIC809蛋白的根表达在约1ppm至约9ppm的范围内。具有靶向叶绿体的TIC809和TIC810蛋白表达的事件没有表现出茎秆褪绿或与在植物体中表达单独的TIC809蛋白相关的其它植物毒性表现。
实施例8
本实施例说明了靶向质体的TIC809在玉米根中的增强的表达。
构建pMON64144,以包含靶向叶绿体的TIC809,其处于RCc3根启动子的有效控制之下(美国专利申请序号11\075,113),CTP编码序列的5’侧接玉米热激蛋白HSP70内含子,TIC809编码序列的3’侧接小麦hsp17 3’转录终止和聚腺苷酸化序列。表达盒的序列示于SEQID NO:38。
在用载体pMON64144进行土壤杆菌介导的玉米组织转化后再生玉米植株。使用TaqMan测定筛选再生的玉米植株的草甘膦选择标记和小麦3’侧翼序列的存在情况。完整的TIC809编码序列的存在情况使用终点PCR测定证实。在6叶期的23株R0玉米植株的根和叶样品使用ET29 ELISA筛选。
在根组织中累积的TIC809平均水平为0.4ppm。在叶中未检测到TIC809蛋白,提示RCc3启动子活性在根细胞中被增强。23株R0植株中有8株表现出的TIC809蛋白浓度低于根中的检测水平。测试的植株没有一个表现出高于约1ppm TIC809的水平。相反,处于e35S启动子控制下的类似构建体在根组织中表现出平均约1.4ppm的TIC809蛋白,在叶组织中约1.7ppm(n=87)。
实施例9
本实施例说明了各自处于不同启动子控制下的TIC809和TIC810在植物体中的共表达。
在实施例7中,TIC809和TIC810基因各自由独立表达盒在植物体中表达,各个编码序列的表达由独立但相同的e35S启动子驱动。在本实施例中,设计表达盒,使得使用RCc3启动子将TIC809的表达基本上定位于根组织,而TIC810的表达处于e35S启动子控制之下。
pMON64150含有两个表达盒。一个表达盒(SEQ ID NO:40)包含靶向叶绿体的TIC809编码序列,该编码序列在其5’末端与水稻RCc3启动子和玉米热激蛋白HSP70内含子有效连接,在其3’末端与小麦hsp17 3’转录终止和聚腺苷酸化序列有效连接。另一个表达盒(SEQ IDNO:40)含有靶向叶绿体的TIC810编码序列,该编码序列在其5’末端与e35S启动子和水稻肌动蛋白内含子序列有效连接,在其3’末端与小麦hsp17 3’转录终止和聚腺苷酸化序列有效连接。
pMON64151与pMON64150相同,只是在两个表达盒中的编码序列没有靶向叶绿体的肽编码序列(分别为SEQ ID NO:43和SEQ IDNO:40)。
测试由用pMON64150或pMON64151转化的玉米组织再生的植株,以检验TIC809和TIC810编码序列的存在情况和完整性。在6叶期使用ET29 ELISA测试这些事件的叶和根,以测定TIC809蛋白累积水平。用pMON64150转化的植株具有平均约1.5ppm TIC809/株,而用pMON64151转化的植物具有平均约0.4pm TIC809/株。pMON64150植株具有约0.4ppm至约6ppm的TIC809根累积水平,2/3以上的事件具有至少约1ppm的TIC809水平。叶组织始终具有低于测定检出限的TIC809累积水平。
pMON64151事件中的TIC809根平均累积高于使用靶向叶绿体的pMON64150表达盒产生的事件(pMON64151,6.6ppm对pMON64150,1.4ppm)。这些结果与使用其中TIC809由e35S启动子表达的构建体获得的结果一致(pMON64138和pMON64139)。由RCc3对e35S启动子表达TIC809的植物之间的最大差异在于,当表达受控于RCc3启动子时,在叶中没有TIC809累积。pMON64150和pMON64151事件均具有正常表型。
实施例10
本实施例说明了含TIC 809和TIC 810及其如本文公开的同源物的组合物的半翅目昆虫杀虫生物活性。
本文业已公开,含ET29和/或TIC809或ET37的组合物对鞘翅目害虫有杀虫性。TIC810和TIC812还没有表现出对鞘翅目昆虫的杀虫生物活性,但正如本文所公开的,其可用于促进ET29和或TIC809和ET37的高水平表达和稳定性。ET29先前还没有表现出针对栉头蚤(Ctenocephalides sp.)的杀虫生物活性,预期ET37对该类昆虫也会表现出活性。推测TIC810和/或TIC812也具有杀虫生物活性,所以在抗其它植物害虫的生物测定中测试这些蛋白,所述其它植物害虫例如为半翅目害虫,例如豆荚盲蝽(Lygus hesperus)(西方牧草盲蝽(WesternTarnished Plant Bug;WTPB))。
WTPB是攻击众多杂草和作物的食植物的刺吸式昆虫。WTPB通过直接进食损伤来损害农作物,包括棉花。使用该类昆虫测试杀虫组合物的测定必须允许昆虫的天然进食行为。使用的进食测定基于使用药粉包系统(sachet system)的96孔微量滴定板模式,参见Habibi等,(Archives of Insect Biochem.and Phys.50:62-74(2002))。WTPB人工食物由Bio-(Bio-Diet F9644B,Frenchtown,NJ)提供。
使580ml高压蒸汽灭菌的煮沸水和156.3g Bio-DietF9644B在表面消毒的混合器中混合。加入4个表面消毒的鸡蛋的内容物,将混合物混匀,直至平滑,然后调整至1L总体积,并使其冷却。通过将需要浓度的20μl样品和200μl混合食物混合(1∶10)制备毒素样品。根据测试需要的单个样品数目,该量可放大或缩小。
将一片(Pechiney Plastic Packing,Chicago,IL)放置在设计用于96孔板模式(Analytical Research Systems,Gainesville,FL)的真空歧管上,应用约-20mm汞柱的真空度,该真空度足以使挤入到孔中。向孔加入40μl测试样品。然后将一片Mylar膜(Clear Lam Packaging,Inc.,Elk Grove Village,IL)放置在上,并用电烫斗(Bienfang Sealector II,Hunt Corporation,Philadelphia,PA)轻轻密封。然后将药粉包放置在含悬浮在琼脂糖中的WTPB卵的平底96孔板上。在孵化时,WTPB蛹将刺穿存在于它们之上的药粉包进食。由于将WTPB口分泌物挤入到药粉包中而引起的口外消化可导致食物内容物在昆虫摄入前蛋白水解和降解。为确保完整蛋白在昆虫食物中被提供给昆虫,每两天更换一次食物药粉包。这种强化允许在进食测定过程中更长时间地提供完整食物内容物。昆虫食物药粉包在第2、4和6天更换。在第8天确定萎缩和死亡得分,并与未处理的检查结果(UTC)对比。
单独地和组合例如TIC809加TIC810测试蛋白ET29(或TIC809)、TIC810、ET37和TIC812(美国专利申请号60\713,111)对WTPB的毒性。晶体蛋白在无晶体苏云金芽孢杆菌菌株EG10650中表达,并经蔗糖分级梯度纯化,以除去孢子和细胞碎片。蔗糖分级梯度(10mL各55%、70%和79%蔗糖在10mM Tris-HCl、0.1mM EDTA、0.005%TritonX-100中的溶液,pH 7.5)在25×89mm Ultra-Clear离心管(BeckmanInstruments,Inc.,Palo Alto,CA)中制备。孢子-晶体悬液处于梯度顶部,以18,000rpm(4℃)在配备SW28转子的超速离心机中离心4-18小时。蛋白晶体由55-70%或70-79%蔗糖界面回收,并悬浮在25mMTris-HCl pH 7.5中。初始生物测定包含终浓度为200 ppm的纯化的Bt杀虫蛋白。在进食测定中各包含鞘翅目特异性毒素Cry3Bbl(Donovan等,Appl.Environ.Microbiol.58:3921-3927(1992))、鳞翅目特异性毒素Cry1Ac(Baum等,Appl.Environ.Microbiol.56:3420-3428(1 990)和鳞翅目特异性毒素Cry1Bb1(美国专利第5,322,687号)作为阴性对照。令人惊奇的是,WTPB蛹在接触TIC810加ET29蛋白组合时表现出萎缩和死亡,并如所预期的,在仅接触ET29或任何其它BT蛋白Cry3Bb1、Cry1Ac或Cry1Bb1时都没有观察到萎缩和死亡,在与未处理对照相比时它们全都没有表现出明显差异。
扩展草盲蝽生物测定,以包含TIC810、TIC812的单个晶体制备物以及TIC812和ET37的混合物。与上述结果类似,只有两种蛋白的组合才表现出显著的杀虫活性。在与未处理的对照或仅使用单个蛋白的生物测定相比时,TIC810加ET29以及TIC812加ET37表现出显著的死亡率和数量下降。但是,TIC810加ET29(或TIC809)组合比TIC812加ET37组合表现出更大的死亡率和数量下降。
这些生物测定的结果表明,单独的TIC810或TIC812对WTPB均无毒,TIC810和ET29的混合物或者TIC812和ET37的混合物对WTPB有毒,这类似于针对玉米根虫幼虫测试时观察到的结果。
实施例11
本实施例说明了用于表达TIC809和/或TIC810及其同源物的表达盒的构建。
构建植物转化载体,以在植物中获得高水平的根虫毒性TIC809和/或ET37蛋白表达。含TIC812和ET37编码序列的载体可用于共表达TIC812和ET37蛋白,由此获得抗虫植物,该植物具有高水平的在植物体中的ET37蛋白生产。TIC810连同ET37的共表达足以在宿主细胞中实现ET37的稳定高水平累积。类似地,TIC812连同ET29乃至TIC809的共表达足以在宿主细胞中获得稳定的高水平的ET29或TIC809累积。如上文所述,
预期与ET29和/或ET37具有约15%至约100%的氨基酸序列相似性的Cyt1和Cyt2类蛋白在宿主细胞中连同TIC810、TIC812或具有的氨基酸序列与TIC810或TIC812具有约50%至约100%的氨基酸序列相似性的直系同源或同源蛋白一起表达时,将具有改善的表达和/或累积,由这些Cyt蛋白表达引起任何负面表型作用都将被这些Cyt蛋白与TIC810、TIC812或其变体的共表达改善。
以上说明书描述了本发明的优选实施方案。本领域技术人员会理解,在不偏离本发明的范围和精神以及无需过多实验的情况下,本发明可采用广泛的等价参数实施。尽管已结合其具体实施方案描述了本发明,但要理解的是,本发明能够进一步修改。本发明用于按照本发明一般原则覆盖本发明的任何用途、变化或修改。在随附的所有权利要求中提供的各项内容的多种排列和组合是可能的,都落入本发明的范围内。
在本说明书中提及的所有出版物、专利和公布的专利申请都在此引入作为参考,如同每个单独的出版物或专利具体地和单独地指明被引入作为参考一样。
序列表
<110>孟山都技术有限公司(Monsanto Technology LLC).
Anderson,Heather M
Baum,James A
Chay,Catherine A
Roberts,James K
Zhang,Bei
<120>用于制备抗虫转基因植物的杀虫组合物和方法
<130>38-21(53327)B
<150>60/713,111
<151>2005-08-31
<160>47
<170>PatentIn version 3.3
<210>1
<211>696
<212>DNA
<213>苏云金芽孢杆菌(Bacillus thuringiensis)
<220>
<221>CDS
<222>(1)..(696)
<223>ET37
<400>1
atg ttc ttt aat cgc gtt att aca tta aca gta cca tct tca gat gtg 48
Met Phe Phe Asn Arg Val Ile Thr Leu Thr Val Pro Ser Ser Asp Val
1 5 10 15
gtt aat tat agt gaa att tat cag gta gct cca caa tat gtg aat caa 96
Val Asn Tyr Ser Glu Ile Tyr Gln Val Ala Pro Gln Tyr Val Asn Gln
20 25 30
gct ctt acg cta gct aaa tat ttc caa gga gca att gat ggt tca aca 144
Ala Leu Thr Leu Ala Lys Tyr Phe Gln Gly Ala Ile Asp Gly Ser Thr
35 40 45
tta cgt ttt gat ttt gaa aaa gcc tta caa att gca aat gat att cca 192
Leu Arg Phe Asp Phe Glu Lys Ala Leu Gln Ile Ala Asn Asp Ile Pro
50 55 60
cag gca gca gtg gta aac act tta aat caa act gtg cag caa ggt aca 240
Gln Ala Ala Val Val Asn Thr Leu Asn Gln Thr Val Gln Gln Gly Thr
65 70 75 80
gtc caa gta tca gtg atg ata gac aag att gta gac att atg aag aat 288
Val Gln Val Ser Val Met Ile Asp Lys Ile Val Asp Ile Met Lys Asn
85 90 95
gta tta tct att gta att gat aac aaa aag ttt tgg gat cag gta aca 336
Val Leu Ser Ile Val Ile Asp Asn Lys Lys Phe Trp Asp Gln Val Thr
100 105 110
gct gct att aca aat aca ttc aca aat cta aat tcg caa gaa agc gaa 384
Ala Ala Ile Thr Asn Thr Phe Thr Asn Leu Asn Ser Gln Glu Ser Glu
115 120 125
gca tgg att ttt tat tac aaa gaa gat gca cat aaa act agt tac tat 432
Ala Trp Ile Phe Tyr Tyr Lys Glu Asp Ala His Lys Thr Ser Tyr Tyr
130 135 140
tat aat atc tta ttt gct ata cag gat gag gaa aca ggt ggg gta atg 480
Tyr Asn Ile Leu Phe Ala Ile Gln Asp Glu Glu Thr Gly Gly Val Met
145 150 155 160
gcg aca tta ccg att gca ttt gat att agt gta gat att gaa aaa gaa 528
Ala Thr Leu Pro Ile Ala Phe Asp Ile Ser Val Asp Ile Glu Lys Glu
165 170 175
aag gtt cta ttt gtt act atc aag gat act gaa aat tat gct gtt aca 576
Lys Val Leu Phe Val Thr Ile Lys Asp Thr Glu Asn Tyr Ala Val Thr
180 185 190
gta aaa gct att aat gta gta caa gca ctt caa tct tcc cga gat tca 624
Val Lys Ala Ile Asn Val Val Gln Ala Leu Gln Ser Ser Arg Asp Ser
195 200 205
aaa gtt gta gat gct ttt aaa tcg cca cgt cac tta cct aga aaa aga 672
Lys Val Val Asp Ala Phe Lys Ser Pro Arg His Leu Pro Arg Lys Arg
210 215 220
cat aca att tgt agt aac tct taa 696
His Thr Ile Cys Ser Asn Ser
225 230
<210>2
<211>231
<212>PRT
<213>苏云金芽孢杆菌(Bacillus thuringiensis)
<400>2
Met Phe Phe Asn Arg Val Ile Thr Leu Thr Val Pro Ser Ser Asp Val
1 5 10 15
Val Asn Tyr Ser Glu Ile Tyr Gln Val Ala Pro Gln Tyr Val Asn Gln
20 25 30
Ala Leu Thr Leu Ala Lys Tyr Phe Gln Gly Ala Ile Asp Gly Ser Thr
35 40 45
Leu Arg Phe Asp Phe Glu Lys Ala Leu Gln Ile Ala Asn Asp Ile Pro
50 55 60
Gln Ala Ala Val Val Asn Thr Leu Asn Gln Thr Val Gln Gln Gly Thr
65 70 75 80
Val Gln Val Ser Val Met Ile Asp Lys Ile Val Asp Ile Met Lys Asn
85 90 95
Val Leu Ser Ile Val Ile Asp Asn Lys Lys Phe Trp Asp Gln Val Thr
100 105 110
Ala Ala Ile Thr Asn Thr Phe Thr Asn Leu Asn Ser Gln Glu Ser Glu
115 120 125
Ala Trp Ile Phe Tyr Tyr Lys Glu Asp Ala His Lys Thr Ser Tyr Tyr
130 135 140
Tyr Asn Ile Leu Phe Ala Ile Gln Asp Glu Glu Thr Gly Gly Val Met
145 150 155 160
Ala Thr Leu Pro Ile Ala Phe Asp Ile Ser Val Asp Ile Glu Lys Glu
165 170 175
Lys Val Leu Phe Val Thr Ile Lys Asp Thr Glu Ash Tyr Ala Val Thr
180 185 190
Val Lys Ala Ile Asn Val Val Gln Ala Leu Gln Ser Ser Arg Asp Ser
195 200 205
Lys Val Val Asp Ala Phe Lys Ser Pro Arg His Leu Pro Arg Lys Arg
210 215 220
His Thr Ile Cys Ser Asn Ser
225 230
<210>3
<211>657
<212>DNA
<213>苏云金芽孢杆菌(Bacillus thuringiensis)
<220>
<221>CDS
<222>(1)..(657)
<223>TIC810
<400>3
gtg agt aaa gaa att cgt tta aat ttg agt aga gaa tca ggg gca gat 48
Val Ser Lys Glu Ile Arg Leu Asn Leu Ser Arg Glu Ser Gly Ala Asp
1 5 10 15
tta tat tta aaa ata ctt gct ttt gta aaa cct gag cat ttt ttt caa 96
Leu Tyr Leu Lys Ile Leu Ala Phe Val Lys Pro Glu His Phe Phe Gln
20 25 30
gca tat tta tta tgt aga gaa ttt gag tct atc gta gat cct aca aca 144
Ala Tyr Leu Leu Cys Arg Glu Phe Glu Ser Ile Val Asp Pro Thr Thr
35 40 45
aga gaa tcg gat ttt gac aaa aca ctt acc att gta aag agt gat tca 192
Arg Glu Ser Asp Phe Asp Lys Thr Leu Thr Ile Val Lys Ser Asp Ser
50 55 60
act tta gtt acg gtt ggt aca atg aat act aaa ctt gtg aat agt caa 240
Thr Leu Val Thr Val Gly Thr Met Asn Thr Lys Leu Val Asn Ser Gln
65 70 75 80
gaa att cta gtt agt gat ttg att acg caa gtt gga agt cag ata gct 288
Glu Ile Leu Val Ser Asp Leu Ile Thr Gln Val Gly Ser Gln Ile Ala
85 90 95
gat acc tta ggt att aca gac att gat gca aat aca cag caa caa tta 336
Asp Thr Leu Gly Ile Thr Asp Ile Asp Ala Asn Thr Gln Gln Gln Leu
100 105 110
aca gaa tta att gga aat tta ttt gtg aat ctg aat tct caa gtt caa 384
Thr Glu Leu Ile Gly Asn Leu Phe Val Asn Leu Asn Ser Gln Val Gln
115 120 125
gaa tat att tat ttt tat gag gaa aaa gaa aag caa aca agt tat cgc 432
Glu Tyr Ile Tyr Phe Tyr Glu Glu Lys Glu Lys Gln Thr Ser Tyr Arg
130 135 140
tat aac atc ctt ttc gtt ttt gaa aaa gag tct ttt atc acc att tta 480
Tyr Asn Ile Leu Phe Val Phe Glu Lys Glu Ser Phe Ile Thr Ile Leu
145 150 155 160
cca atg gga ttc gat gtg act gtg aac act aat aaa gaa gcg gtt ctt 528
Pro Met Gly Phe Asp Val Thr Val Asn Thr Asn Lys Glu Ala Val Leu
165 170 175
aag tta aca cct aaa gat aaa gtc act tat ggt cat gta tca gta aaa 576
Lys Leu Thr Pro Lys Asp Lys Val Thr Tyr Gly His Val Ser Val Lys
180 185 190
gct tta aat att att caa ctt atc aca gaa gat aaa ttt aac ttt ctt 624
Ala Leu Asn Ile Ile Gln Leu Ile Thr Glu Asp Lys Phe Asn Phe Leu
195 200 205
gct aca tta aaa aag gca cta aaa act cta taa 657
Ala Thr Leu Lys Lys Ala Leu Lys Thr Leu
210 215
<210>4
<211>218
<212>PRT
<213>苏云金芽孢杆菌(Bacillus thuringiensis)
<400>4
Val Ser Lys Glu Ile Arg Leu Asn Leu Ser Arg Glu Ser Gly Ala Asp
1 5 10 15
Leu Tyr Leu Lys Ile Leu Ala Phe Val Lys Pro Glu His Phe Phe Gln
20 25 30
Ala Tyr Leu Leu Cys Arg Glu Phe Glu Ser Ile Val Asp Pro Thr Thr
35 40 45
Arg Glu Ser Asp Phe Asp Lys Thr Leu Thr Ile Val Lys Ser Asp Ser
50 55 60
Thr Leu Val Thr Val Gly Thr Met Asn Thr Lys Leu Val Asn Ser Gln
65 70 75 80
Glu Ile Leu Val Ser Asp Leu Ile Thr Gln Val Gly Ser Gln Ile Ala
85 90 95
Asp Thr Leu Gly Ile Thr Asp Ile Asp Ala Asn Thr Gln Gln Gln Leu
100 105 110
Thr Glu Leu Ile Gly Asn Leu Phe Val Asn Leu Asn Ser Gln Val Gln
115 120 125
Glu Tyr Ile Tyr Phe Tyr Glu Glu Lys Glu Lys Gln Thr Ser Tyr Arg
130 135 140
Tyr Asn Ile Leu Phe Val Phe Glu Lys Glu Ser Phe Ile Thr Ile Leu
145 150 155 160
Pro Met Gly Phe Asp Val Thr Val Asn Thr Asn Lys Glu Ala Val Leu
165 170 175
Lys Leu Thr Pro Lys Asp Lys Val Thr Tyr Gly His Val Ser Val Lys
180 185 190
Ala Leu Asn Ile Ile Gln Leu Ile Thr Glu Asp Lys Phe Asn Phe Leu
195 200 205
Ala Thr Leu Lys Lys Ala Leu Lys Thr Leu
210 215
<210>5
<211>657
<212>DNA
<213>苏云金芽孢杆菌(Bacillus thuringiensis)
<220>
<221>CDS
<222>(1)..(657)
<223>TIC812
<400>5
gtg agt aaa gaa att cgt tta aat ttg agt aga gaa tca ggg gca gat 48
Val Ser Lys Glu Ile Arg Leu Asn Leu Ser Arg Glu Ser Gly Ala Asp
1 5 10 15
tta tat tta aaa ata ctt gct ttt gta aaa cct gag cat ttt ttt caa 96
Leu Tyr Leu Lys Ile Leu Ala Phe Val Lys Pro Glu His Phe Phe Gln
20 25 30
gca tat tta tta tgt aga gaa ttt gag tct atc gta gat cct aca aca 144
Ala Tyr Leu Leu Cys Arg Glu Phe Glu Ser Ile Val Asp Pro Thr Thr
35 40 45
aga gaa ttg gat ttt gac aaa acg ctt acc att gta aag agt gat tca 192
Arg Glu Leu Asp Phe Asp Lys Thr Leu Thr Ile Val Lys Ser Asp Ser
50 55 60
act tta gtt acg gtt ggt aca atg aat act aaa ctt gtg aat agt caa 240
Thr Leu Val Thr Val Gly Thr Met Asn Thr Lys Leu Val Asn Ser Gln
65 70 75 80
gaa att cta gtt agt gat ttg att aag caa gtt gga agt cag ata gct 288
Glu Ile Leu Val Ser Asp Leu Ile Lys Gln Val Gly Ser Gln Ile Ala
85 90 95
gat acc tta ggt att aca gac att gat gca aat aca cag caa cga tta 336
Asp Thr Leu Gly Ile Thr Asp Ile Asp Ala Asn Thr Gln Gln Arg Leu
100 105 110
acg gaa tta att gaa aat tta ttt gtg aat ctg aat tct caa gtt caa 384
Thr Glu Leu Ile Glu Asn Leu Phe Val Asn Leu Asn Ser Gln Val Gln
115 120 125
gac tat att tat ttt tat gag gaa aaa gaa aag caa aca agt tat cgc 432
Asp Tyr Ile Tyr Phe Tyr Glu Glu Lys Glu Lys Gln Thr Ser Tyr Arg
130 135 140
tat aac atc ctt ttc gtt ttt gaa aaa gag tct ttt atc acc att tta 480
Tyr Asn Ile Leu Phe Val Phe Glu Lys Glu Ser Phe Ile Thr Ile Leu
145 150 155 160
cca atg gga ttc gat gtg act gtg aac act aat aaa gaa gcg gtt ctt 528
Pro Met Gly Phe Asp Val Thr Val Asn Thr Asn Lys Glu Ala Val Leu
165 170 175
aag tta aca cct aaa gat aaa gtc act tat ggt cat gta tca gta aaa 576
Lys Leu Thr Pro Lys Asp Lys Val Thr Tyr Gly His Val Ser Val Lys
180 185 190
gct tta aat att att caa ttt atc aca gaa gat aaa ttg aac ttt ctt 624
Ala Leu Asn Ile Ile Gln Phe Ile Thr Glu Asp Lys Leu Asn Phe Leu
195 200 205
gct aca tta aaa aag gca cta aaa act cta taa 657
Ala Thr Leu Lys Lys Ala Leu Lys Thr Leu
210 215
<210>6
<211>218
<212>PRT
<213>苏云金芽孢杆菌(Bacillus thuringiensis)
<400>6
Val Ser Lys Glu Ile Arg Leu Asn Leu Ser Arg Glu Ser Gly Ala Asp
1 5 10 15
Leu Tyr Leu Lys Ile Leu Ala Phe Val Lys Pro Glu His Phe Phe Gln
20 25 30
Ala Tyr Leu Leu Cys Arg Glu Phe Glu Ser Ile Val Asp Pro Thr Thr
35 40 45
Arg Glu Leu Asp Phe Asp Lys Thr Leu Thr Ile Val Lys Ser Asp Ser
50 55 60
Thr Leu Val Thr Val Gly Thr Met Asn Thr Lys Leu Val Asn Ser Gln
65 70 75 80
Glu Ile Leu Val Ser Asp Leu Ile Lys Gln Val Gly Ser Gln Ile Ala
85 90 95
Asp Thr Leu Gly Ile Thr Asp Ile Asp Ala Asn Thr Gln Gln Arg Leu
100 105 110
Thr Glu Leu Ile Glu Asn Leu Phe Val Asn Leu Asn Ser Gln Val Gln
115 120 125
Asp Tyr Ile Tyr Phe Tyr Glu Glu Lys Glu Lys Gln Thr Ser Tyr Arg
130 135 140
Tyr Asn Ile Leu Phe Val Phe Glu Lys Glu Ser Phe Ile Thr Ile Leu
145 150 155 160
Pro Met Gly Phe Asp Val Thr Val Asn Thr Asn Lys Glu Ala Val Leu
165 170 175
Lys Leu Thr Pro Lys Asp Lys Val Thr Tyr Gly His Val Ser Val Lys
180 185 190
Ala Leu Asn Ile Ile Gln Phe Ile Thr Glu Asp Lys Leu Asn Phe Leu
195 200 205
Ala Thr Leu Lys Lys Ala Leu Lys Thr Leu
210 215
<210>7
<211>696
<212>DNA
<213>苏云金芽孢杆菌(Bacillus thuringiensis)
<220>
<221>CDS
<222>(1)..(696)
<223>ET29
<400>7
atg ttc ttt aat cgc gtt att aca tta aca gta cca tct tca gat gtg 48
Met Phe Phe Asn Arg Val Ile Thr Leu Thr Val Pro Ser Ser Asp Val
1 5 10 15
gtt aat tat agt gaa att tat cag gta gct cca caa tat gtg aat caa 96
Val Asn Tyr Ser Glu Ile Tyr Gln Val Ala Pro Gln Tyr Val Asn Gln
20 25 30
gct ctt acg cta gct aaa tat ttc caa gga gca att gat ggt tca aca 144
Ala Leu Thr Leu Ala Lys Tyr Phe Gln Gly Ala Ile Asp Gly Ser Thr
35 40 45
tta cgt ttt gat ttt gaa aaa gcc tta caa att gca aat gat att cca 192
Leu Arg Phe Asp Phe Glu Lys Ala Leu Gln Ile Ala Asn Asp Ile Pro
50 55 60
cag gca gca gtg gta aac act tta aat caa act gtg cag caa ggt aca 240
Gln Ala Ala Val Val Asn Thr Leu Asn Gln Thr Val Gln Gln Gly Thr
65 70 75 80
gtc caa gta tca gtg atg ata gac aag att gta gac att atg aag aat 288
Val Gln Val Ser Val Met Ile Asp Lys Ile Val Asp Ile Met Lys Asn
85 90 95
gta tta tct att gta att gat aac aaa aag ttt tgg gat cag gta aca 336
Val Leu Ser Ile Val Ile Asp Asn Lys Lys Phe Trp Asp Gln Val Thr
100 105 110
gct gct att aca aat aca ttc aca aat cta aat tcg caa gaa agc gaa 384
Ala Ala Ile Thr Asn Thr Phe Thr Asn Leu Asn Ser Gln Glu Ser Glu
115 120 125
gca tgg att ttt tat tac aaa gaa gat gca cat aaa act agt tac tat 432
Ala Trp Ile Phe Tyr Tyr Lys Glu Asp Ala His Lys Thr Ser Tyr Tyr
130 135 140
tat aat atc tta ttt gct ata cag gat gag gaa aca ggt ggg gta atg 480
Tyr Asn Ile Leu Phe Ala Ile Gln Asp Glu Glu Thr Gly Gly Val Met
145 150 155 160
gcg aca tta ccg att gca ttt gat att agt gta gat att gaa aaa gaa 528
Ala Thr Leu Pro Ile Ala Phe Asp Ile Ser Val Asp Ile Glu Lys Glu
165 170 175
aag gtt cta ttt gtt act atc aag gat act gaa aat tat gcg gtt aca 576
Lys Val Leu Phe Val Thr Ile Lys Asp Thr Glu Asn Tyr Ala Val Thr
180 185 190
gta aaa gct att aat gta gta caa gca ctt caa tct tcc cga gat tca 624
Val Lys Ala Ile Asn Val Val Gln Ala Leu Gln Ser Ser Arg Asp Ser
195 200 205
aaa gtt gta gat gct ttt aaa tcg cca cgt cac tta cct aga aaa aga 672
Lys Val Val Asp Ala Phe Lys Ser Pro Arg His Leu Pro Arg Lys Arg
210 215 220
cat aaa att tgt agt aac tct taa 696
His Lys Ile Cys Ser Asn Ser
225 230
<210>8
<211>231
<212>PRT
<213>苏云金芽孢杆菌(Bacillus thuringiensis)
<400>8
Met Phe Phe Asn Arg Val Ile Thr Leu Thr Val Pro Ser Ser Asp Val
1 5 10 15
Val Asn Tyr Ser Glu Ile Tyr Gln Val Ala Pro Gln Tyr Val Asn Gln
20 25 30
Ala Leu Thr Leu Ala Lys Tyr Phe Gln Gly Ala Ile Asp Gly Ser Thr
35 40 45
Leu Arg Phe Asp Phe Glu Lys Ala Leu Gln Ile Ala Asn Asp Ile Pro
50 55 60
Gln Ala Ala Val Val Asn Thr Leu Asn Gln Thr Val Gln Gln Gly Thr
65 70 75 80
Val Gln Val Ser Val Met Ile Asp Lys Ile Val Asp Ile Met Lys Asn
85 90 95
Val Leu Ser Ile Val Ile Asp Asn Lys Lys Phe Trp Asp Gln Val Thr
100 105 110
Ala Ala Ile Thr Asn Thr Phe Thr Asn Leu Asn Ser Gln Glu Ser Glu
115 120 125
Ala Trp Ile Phe Tyr Tyr Lys Glu Asp Ala His Lys Thr Ser Tyr Tyr
130 135 140
Tyr Asn Ile Leu Phe Ala Ile Gln Asp Glu Glu Thr Gly Gly Val Met
145 150 155 160
Ala Thr Leu Pro Ile Ala Phe Asp Ile Ser Val Asp Ile Glu Lys Glu
165 170 175
Lys Val Leu Phe Val Thr Ile Lys Asp Thr Glu Asn Tyr Ala Val Thr
180 185 190
Val Lys Ala Ile Asn Val Val Gln Ala Leu Gln Ser Ser Arg Asp Ser
195 200 205
Lys Val Val Asp Ala Phe Lys Ser Pro Arg His Leu Pro Arg Lys Arg
210 215 220
His Lys Ile Cys Ser Asn Ser
225 230
<210>9
<211>1411
<212>DNA
<213>苏云金芽孢杆菌(Bacillus thuringiensis)
<220>
<221>其他特征
<222>(1)..(1411)
<223>TIC810 ORF 1-657;ET29 ORF 716-1411
<400>9
atgagtaaag aaattcgttt aaatttgagt agagaatcag gggcagattt atatttaaaa 60
atacttgctt ttgtaaaacc tgagcatttt tttcaagcat atttattatg tagagaattt 120
gagtctatcg tagatcctac aacaagagaa tcggattttg acaaaacact taccattgta 180
aagagtgatt caactttagt tacggttggt acaatgaata ctaaacttgt gaatagtcaa 240
gaaattctag ttagtgattt gattacgcaa gttggaagtc agatagctga taccttaggt 300
attacagaca ttgatgcaaa tacacagcaa caattaacag aattaattgg aaatttattt 360
gtgaatctga attctcaagt tcaagaatat atttattttt atgaggaaaa agaaaagcaa 420
acaagttatc gctataacat ccttttcgtt tttgaaaaag agtcttttat caccatttta 480
ccaatgggat tcgatgtgac tgtgaacact aataaagaag cggttcttaa gttaacacct 540
aaagataaag tcacttatgg tcatgtatca gtaaaagctt taaatattat tcaacttatc 600
acagaagata aatttaactt tcttgctaca ttaaaaaagg cactaaaaac tctataagcg 660
ggttaagtag gtaaaataga attaaaatga aacagtatga aaggggtaat tttatatgtt 720
ctttaatcgc gttattacat taacagtacc atcttcagat gtggttaatt atagtgaaat 780
ttatcaggta gctccacaat atgtgaatca agctcttacg ctagctaaat atttccaagg 840
agcaattgat ggttcaacat tacgttttga ttttgaaaaa gccttacaaa ttgcaaatga 900
tattccacag gcagcagtgg taaacacttt aaatcaaact gtgcagcaag gtacagtcca 960
agtatcagtg atgatagaca agattgtaga cattatgaag aatgtattat ctattgtaat 1020
tgataacaaa aagttttggg atcaggtaac agctgctatt acaaatacat tcacaaatct 1080
aaattcgcaa gaaagcgaag catggatttt ttattacaaa gaagatgcac ataaaactag 1140
ttactattat aatatcttat ttgctataca ggatgaggaa acaggtgggg taatggcgac 1200
attaccgatt gcatttgata ttagtgtaga tattgaaaaa gaaaaggttc tatttgttac 1260
tatcaaggat actgaaaatt atgcggttac agtaaaagct attaatgtag tacaagcact 1320
tcaatcttcc cgagattcaa aagttgtaga tgcttttaaatcgccacgtc acttacctag 1380
aaaaagacat aaaatttgta gtaactctta a 1411
<210>10
<211>1411
<212>DNA
<213>苏云金芽孢杆菌(Bacillus thuringiensis)
<220>
<221>其他特征
<222>(1)..(1411)
<223>TIC812 ORF 1-657;ET37 ORF 716-1411
<400>10
atgagtaaag aaattcgttt aaatttgagt agagaatcag gggcagattt atatttaaaa 60
atacttgctt ttgtaaaacc tgagcatttt tttcaagcat atttattatg tagagaattt 120
gagtctatcg tagatcctac aacaagagaa ttggattttg acaaaacgct taccattgta 180
aagagtgatt caactttagt tacggttggt acaatgaata ctaaacttgt gaatagtcaa 240
gaaattctag ttagtgattt gattaagcaa gttggaagtc agatagctga taccttaggt 300
attacagaca ttgatgcaaa tacacagcaa cgattaacgg aattaattga aaatttattt 360
gtgaatctga attctcaagt tcaagactat atttattttt atgaggaaaa agaaaagcaa 420
acaagttatc gctataacat ccttttcgtt tttgaaaaag agtcttttat caccatttta 480
ccaatgggat tcgatgtgac tgtgaacact aataaagaag cggttcttaa gttaacacct 540
aaagataaag tcacttatgg tcatgtatca gtaaaagctt taaatattat tcaatttatc 600
acagaagata aattgaactt tcttgctaca ttaaaaaagg cactaaaaac tctataagtg 660
ggttaagtag gtaaaataga attaaaatga aacagtatga aaggggtaat tttatatgtt 720
ctttaatcgc gttattacat taacagtacc atcttcagat gtggttaatt atagtgaaat 780
ttatcaggta gctccacaat atgtgaatca agctcttacg ctagctaaat atttccaagg 840
agcaattgat ggttcaacat tacgttttga ttttgaaaaa gccttacaaa ttgcaaatga 900
tattccacag gcagcagtgg taaacacttt aaatcaaact gtgcagcaag gtacagtcca 960
agtatcagtg atgatagaca agattgtaga cattatgaag aatgtattat ctattgtaat 1020
tgataacaaa aagttttggg atcaggtaac agctgctatt acaaatacat tcacaaatct 1080
aaattcgcaa gaaagcgaag catggatttt ttattacaaa gaagatgcac ataaaactag 1140
ttactattat aatatcttat ttgctataca ggatgaggaa acaggtgggg taatggcgac 1200
attaccgatt gcatttgata ttagtgtaga tattgaaaaa gaaaaggttc tatttgttac 1260
tatcaaggat actgaaaatt atgctgttac agtaaaagct attaatgtag tacaagcact 1320
tcaatcttcc cgagattcaa aagttgtaga tgcttttaaa tcgccacgtc acttacctag 1380
aaaaagacat acaatttgta gtaactctta a 1411
<210>11
<211>1531
<212>DNA
<213>人工序列
<220>
<223>ORF TIC810 1-657;ORF ET37 716-1411
<220>
<221>其他特征
<222>(1)..(1531)
<223>TIC810 ORF 1-657;ET37 ORF 716-1411
<400>11
gaattcgccc ttgcctaggt atgagtaaag aaattcgttt aaatttgagt agagaatcag 60
gggcagattt atatttaaaa atacttgctt ttgtaaaacc tgagcatttt tttcaagcat 120
atttattatg tagagaattt gagtctatcg tagatcctac aacaagagaa tcggattttg 180
acaaaacact taccattgta aagagtgatt caactttagt tacggttggt acaatgaata 240
ctaaacttgt gaatagtcaa gaaattctag ttagtgattt gattacgcaa gttggaagtc 300
agatagctga taccttaggt attacagaca ttgatgcaaa tacacagcaa caattaacag 360
aattaattgg aaatttattt gtgaatctga attctcaagt tcaagaatat atttattttt 420
atgaggaaaa agaaaagcaa acaagttatc gctataacat ccttttcgtt tttgaaaaag 480
agtcttttat caccatttta ccaatgggat tcgatgtgac tgtgaacact aataaagaag 540
cggttcttaa gttaacacct aaagataaag tcacttatgg tcatgtatca gtaaaagctt 600
taaatattat tcaacttatc acagaagata aatttaactt tcttgctaca ttaaaaaagg 660
cactaaaaac tctataagcg ggttaagtag gtaaaataga attaaaatga aacagtatga 720
aaggggtaat tttatatgtt ctttaatcgc gttattacat taacagtacc atcttcagat 780
gtggttaatt atagtgaaat ttatcaggta gctccacaat atgtgaatca agctcttacg 840
ctagctaaat atttccaagg agcaattgat ggttcaacat tacgttttga ttttgaaaaa 900
gccttacaaa ttgcaaatga tattccacag gcagcagtgg taaacacttt aaatcaaact 960
gtgcagcaag gtacagtcca agtatcagtg atgatagaca agattgtaga cattatgaag 1020
aatgtattat ctattgtaat tgataacaaa aagttttggg atcaggtaac agctgctatt 1080
acaaatacat tcacaaatct aaattcgcaa gaaagcgaag catggatttt ttattacaaa 1140
gaagatgcac ataaaactag ttactattat aatatcttat ttgctataca ggatgaggaa 1200
acaggtgggg taatggcgac attaccgatt gcatttgata ttagtgtaga tattgaaaaa 1260
gaaaaggttc tatttgttac tatcaaggat actgaaaatt atgctgttac agtaaaagct 1320
attaatgtag tacaagcact tcaatcttcc cgagattcaa aagttgtaga tgcttttaaa 1380
tcgccacgtc acttacctag aaaaagacat acaatttgta gtaactctta agaagaccga 1440
caataagata aaatcttatt gcctatcttc ttagaataac aaatggctgt tatggggaag 1500
cactaaatgg actcgagtta agggcgaatt c 1531
<210>12
<211>1531
<212>DNA
<213>人工序列
<220>
<223>ORF TIC812 1-657;ORF ET29 716-1411
<220>
<221>其他特征
<222>(1)..(1531)
<223>TIC812 ORF 1-657;ET29 ORF 716-1411
<400>12
gaattcgccc ttgcctaggt atgagtaaag aaattcgttt aaatttgagt agagaatcag 60
gggcagattt atatttaaaa atacttgctt ttgtaaaacc tgagcatttt tttcaagcat 120
atttattatg tagagaattt gagtctatcg tagatcctac aacaagagaa ttggattttg 180
acaaaacgct taccattgta aagagtgatt caactttagt tacggttggt acaatgaata 240
ctaaacttgt gaatagtcaa gaaattctag ttagtgattt gattaagcaa gttggaagtc 300
agatagctga taccttaggt attacagaca ttgatgcaaa tacacagcaa cgattaacgg 360
aattaattga aaatttattt gtgaatctga attctcaagt tcaagactat atttattttt 420
atgaggaaaa agaaaagcaa acaagttatc gctataacat ccttttcgtt tttgaaaaag 480
agtcttttat caccatttta ccaatgggat tcgatgtgac tgtgaacact aataaagaag 540
cggttcttaa gttaacacct aaagataaag tcacttatgg tcatgtatca gtaaaagctt 600
taaatattat tcaatttatc acagaagata aattgaactt tcttgctaca ttaaaaaagg 660
cactaaaaac tctataagtg ggttaagtag gtaaaataga attaaaatga aacagtatga 720
aaggggtaat tttatatgtt ctttaatcgc gttattacat taacagtacc atcttcagat 780
gtggttaatt atagtgaaat ttatcaggta gctccacaat atgtgaatca agctcttacg 840
ctagctaaat atttccaagg agcaattgat ggttcaacat tacgttttga ttttgaaaaa 900
gccttacaaa ttgcaaatga tattccacag gcagcagtgg taaacacttt aaatcaaact 960
gtgcagcaag gtacagtcca agtatcagtg atgatagaca agattgtaga cattatgaag 1020
aatgtattat ctattgtaat tgataacaaa aagttttggg atcaggtaac agctgctatt 1080
acaaatacat tcacaaatct aaattcgcaa gaaagcgaag catggatttt ttattacaaa 1140
gaagatgcac ataaaactag ttactattat aatatcttat ttgctataca ggatgaggaa 1200
acaggtgggg taatggcgac attaccgatt gcatttgata ttagtgtaga tattgaaaaa 1260
gaaaaggttc tatttgttac tatcaaggat actgaaaatt atgcggttac agtaaaagct 1320
attaatgtag tacaagcact tcaatcttcc cgagattcaa aagttgtaga tgcttttaaa 1380
tcgccacgtc acttacctag aaaaagacat aaaatttgta gtaactctta agaagaccga 1440
caataagata aaatcttatt gtctatcttc ttagaataac aaatggctgt tatggggaag 1500
cactaaatgg actcgagtta agggcgaatt c 1531
<210>13
<211>702
<212>DNA
<213>人工序列
<220>
<223>用于植物体中表达TIC809(ET29 MET-ALA)的合成序列
<220>
<221>CDS
<222>(1)..(702)
<223>TIC809
<400>13
atg gcc ttc ttc aac cgg gtg atc acc ctc acg gtg ccg tcg tca gac 48
Met Ala Phe Phe Asn Arg Val Ile Thr Leu Thr Val Pro Ser Ser Asp
1 5 10 15
gtg gtc aac tac tcg gag atc tac cag gtg gct cct cag tat gtc aac 96
Val Val Asn Tyr Ser Glu Ile Tyr Gln Val Ala Pro Gln Tyr Val Asn
20 25 30
cag gcc ctg acc ctg gcc aag tac ttc cag ggc gcc atc gac ggc agc 144
Gln Ala Leu Thr Leu Ala Lys Tyr Phe Gln Gly Ala Ile Asp Gly Ser
35 40 45
acc ctg agg ttc gac ttc gag aag gcg tta cag atc gcc aac gac atc 192
Thr Leu Arg Phe Asp Phe Glu Lys Ala Leu Gln Ile Ala Asn Asp Ile
50 55 60
ccg cag gcc gcg gtg gtc aac acc ctg aac cag acc gtc cag cag ggg 240
Pro Gln Ala Ala Val Val Asn Thr Leu Asn Gln Thr Val Gln Gln Gly
65 70 75 80
acc gtc cag gtc agc gtc atg atc gac aag atc gtg gac atc atg aag 288
Thr Val Gln Val Ser Val Met Ile Asp Lys Ile Val Asp Ile Met Lys
85 90 95
aat gtc ctg tcc atc gtg ata gac aac aag aag ttt tgg gat cag gtc 336
Asn Val Leu Ser Ile Val Ile Asp Asn Lys Lys Phe Trp Asp Gln Val
100 105 110
acg gct gcc atc acc aac acc ttc acg aac ctg aac agc cag gag tcg 384
Thr Ala Ala Ile Thr Asn Thr Phe Thr Asn Leu Asn Ser Gln Glu Ser
115 120 125
gag gcc tgg atc ttc tat tac aag gag gac gcc cac aag acg tcc tac 432
Glu Ala Trp Ile Phe Tyr Tyr Lys Glu Asp Ala His Lys Thr Ser Tyr
130 135 140
tat tac aac atc ctc ttc gcc atc cag gac gaa gag acg ggt ggc gtg 480
Tyr Tyr Asn Ile Leu Phe Ala Ile Gln Asp Glu Glu Thr Gly Gly Val
145 150 155 160
atg gcc acg ctg ccc atc gcc ttc gac atc agt gtg gac atc gag aag 528
Met Ala Thr Leu Pro Ile Ala Phe Asp Ile Ser Val Asp Ile Glu Lys
165 170 175
gag aag gtc ctg ttc gtg acc atc aag gac act gag aat tac gcc gtc 576
Glu Lys Val Leu Phe Val Thr Ile Lys Asp Thr Glu Asn Tyr Ala Val
180 185 190
acc gtc aag gcg atc aac gtg gtc cag gca ctc cag tct agc agg gat 624
Thr Val Lys Ala Ile Asn Val Val Gln Ala Leu Gln Ser Ser Arg Asp
195 200 205
tct aag gtg gtt gat gcg ttc aaa tcg cca cgg cac tta ccc cgg aag 672
Ser Lys Val Val Asp Ala Phe Lys Ser Pro Arg His Leu Pro Arg Lys
210 215 220
agg cat aag att tgc tct aac tcg tga tga 702
Arg His Lys Ile Cys Ser Asn Ser
225 230
<210>14
<211>232
<212>PRT
<213>人工序列
<220>
<223>合成构建体
<400>14
Met Ala Phe Phe Asn Arg Val Ile Thr Leu Thr Val Pro Ser Ser Asp
1 5 10 15
Val Val Asn Tyr Ser Glu Ile Tyr Gln Val Ala Pro Gln Tyr Val Asn
20 25 30
Gln Ala Leu Thr Leu Ala Lys Tyr Phe Gln Gly Ala Ile Asp Gly Ser
35 40 45
Thr Leu Arg Phe Asp Phe Glu Lys Ala Leu Gln Ile Ala Asn Asp Ile
50 55 60
Pro Gln Ala Ala Val Val Asn Thr Leu Asn Gln Thr Val Gln Gln Gly
65 70 75 80
Thr Val Gln Val Ser Val Met Ile Asp Lys Ile Val Asp Ile Met Lys
85 90 95
Asn Val Leu Ser Ile Val Ile Asp Asn Lys Lys Phe Trp Asp Gln Val
100 105 110
Thr Ala Ala Ile Thr Asn Thr Phe Thr Asn Leu Asn Ser Gln Glu Ser
115 120 125
Glu Ala Trp Ile Phe Tyr Tyr Lys Glu Asp Ala His Lys Thr Ser Tyr
130 135 140
Tyr Tyr Asn Ile Leu Phe Ala Ile Gln Asp Glu Glu Thr Gly Gly Val
145 150 155 160
Met Ala Thr Leu Pro Ile Ala Phe Asp Ile Ser Val Asp Ile Glu Lys
165 170 175
Glu Lys Val Leu Phe Val Thr Ile Lys Asp Thr Glu Asn Tyr Ala Val
180 185 190
Thr Val Lys Ala Ile Asn Val Val Gln Ala Leu Gln Ser Set Arg Asp
195 200 205
Ser Lys Val Val Asp Ala Phe Lys Ser Pro Arg His Leu Pro Arg Lys
210 215 220
Arg His Lys Ile Cys Ser Asn Set
225 230
<210>15
<211>666
<212>DNA
<213>人工序列
<220>
<223>用于植物体中表达TIC810的合成序列
<220>
<221>CDS
<222>(1)..(666)
<223>TIC810
<400>15
atg agc aaa gaa atc agg ctc aac ctt tct cgt gag agc ggc gcc gac 48
Met Ser Lys Glu Ile Arg Leu Asn Leu Ser Arg Glu Ser Gly Ala Asp
1 5 10 15
ctg tac ctc aag atc ctc gcc ttc gtg aag ccc gag cac ttc ttt cag 96
Leu Tyr Leu Lys Ile Leu Ala Phe Val Lys Pro Glu His Phe Phe Gln
20 25 30
gcg tac ctc ctg tgc cgc gag ttc gag agc atc gtg gat cct aca acc 144
Ala Tyr Leu Leu Cys Arg Glu Phe Glu Ser Ile Val Asp Pro Thr Thr
35 40 45
cgc gag tct gac ttc gac aag acg ctg acc atc gtg aag tcg gac tcc 192
Arg Glu Ser Asp Phe Asp Lys Thr Leu Thr Ile Val Lys Ser Asp Ser
50 55 60
acc ctc gtg acc gtg ggc acg atg aac acc aag ctg gtc aat agc caa 240
Thr Leu Val Thr Val Gly Thr Met Asn Thr Lys Leu Val Asn Ser Gln
65 70 75 80
gag atc ctc gtg tcg gac ttg atc act caa gtc ggt tcc cag atc gcc 288
Glu Ile Leu Val Ser Asp Leu Ile Thr Gln Val Gly Ser Gln Ile Ala
85 90 95
gat acc ctc ggc atc acg gac atc gac gcc aac acc cag caa cag ctc 336
Asp Thr Leu Gly Ile Thr Asp Ile Asp Ala Asn Thr Gln Gln Gln Leu
100 105 110
acg gag ctg atc ggc aac ctc ttc gtg aac ctc aat tcc caa gtt cag 384
Thr Glu Leu Ile Gly Asn Leu Phe Val Asn Leu Asn Ser Gln Val Gln
115 120 125
gag tac atc tac ttc tac gag gag aag gag aag cag acc tcc tac cgc 432
Glu Tyr Ile Tyr Phe Tyr Glu Glu Lys Glu Lys Gln Thr Ser Tyr Arg
130 135 140
tac aac atc ctc ttc gtg ttc gaa aag gag tcg ttc atc acc att ctg 480
Tyr Asn Ile Leu Phe Val Phe Glu Lys Glu Ser Phe Ile Thr Ile Leu
145 150 155 160
cca atg ggc ttc gac gtg acc gtg aac acg aac aag gag gcc gtc ctg 528
Pro Met Gly Phe Asp Val Thr Val Asn Thr Asn Lys Glu Ala Val Leu
165 170 175
aag ctg acc ccg aag gac aag gtt acc tac ggc cac gtc agc gtc aag 576
Lys Leu Thr Pro Lys Asp Lys Val Thr Tyr Gly His Val Ser Val Lys
180 185 190
gcc ctc aac atc atc cag ctc att acg gag gac aag ttc aac ttc ctc 624
Ala Leu Asn Ile Ile Gln Leu Ile Thr Glu Asp Lys Phe Asn Phe Leu
195 200 205
gca acc ctc aag aag gct ctc aag acc ctg tga tga gaa ttc 666
Ala Thr Leu Lys Lys Ala Leu Lys Thr Leu Glu Phe
210 215 220
<210>16
<211>218
<212>PRT
<213>人工序列
<220>
<223>合成构建体
<400>16
Met Ser Lys Glu Ile Arg Leu Asn Leu Ser Arg Glu Ser Gly Ala Asp
1 5 10 15
Leu Tyr Leu Lys Ile Leu Ala Phe Val Lys Pro Glu His Phe Phe Gln
20 25 30
Ala Tyr Leu Leu Cys Arg Glu Phe Glu Ser Ile Val Asp Pro Thr Thr
35 40 45
Arg Glu Ser Asp Phe Asp Lys Thr Leu Thr Ile Val Lys Ser Asp Ser
50 55 60
Thr Leu Val Thr Val Gly Thr Met Asn Thr Lys Leu Val Asn Ser Gln
65 70 75 80
Glu Ile Leu Val Ser Asp Leu Ile Thr Gln Val Gly Ser Gln Ile Ala
85 90 95
Asp Thr Leu Gly Ile Thr Asp Ile Asp Ala Asn Thr Gln Gln Gln Leu
100 105 110
Thr Glu Leu Ile Gly Asn Leu Phe Val Asn Leu Asn Ser Gln Val Gln
115 120 125
Glu Tyr Ile Tyr Phe Tyr Glu Glu Lys Glu Lys Gln Thr Ser Tyr Arg
130 135 140
Tyr Asn Ile Leu Phe Val Phe Glu Lys Glu Ser Phe Ile Thr Ile Leu
145 150 155 160
Pro Met Gly Phe Asp Val Thr Val Asn Thr Asn Lys Glu Ala Val Leu
165 170 175
Lys Leu Thr Pro Lys Asp Lys Val Thr Tyr Gly His Val Ser Val Lys
180 185 190
Ala Leu Asn Ile Ile Gln Leu Ile Thr Glu Asp Lys Phe Asn Phe Leu
195 200 205
Ala Thr Leu Lys Lys Ala Leu Lys Thr Leu
210 215
<210>17
<211>699
<212>DNA
<213>人工序列
<220>
<223>用于植物体中表达ET37的合成序列
<220>
<221>CDS
<222>(1)..(699)
<223>ET37
<400>17
atg ttc ttc aac cgg gtg atc acc ctc acg gtg ccg tcg tca gac gtg 48
Met Phe Phe Asn Arg Val Ile Thr Leu Thr Val Pro Ser Ser Asp Val
1 5 10 15
gtc aac tac tcg gag atc tac cag gtg gct cct cag tat gtc aac cag 96
Val Asn Tyr Ser Glu Ile Tyr Gln Val Ala Pro Gln Tyr Val Asn Gln
20 25 30
gcc ctg acc ctg gcc aag tac ttc cag ggc gcc atc gac ggc agc acc 144
Ala Leu Thr Leu Ala Lys Tyr Phe Gln Gly Ala Ile Asp Gly Ser Thr
35 40 45
ctg agg ttc gac ttc gag aag gcg tta cag atc gcc aac gac atc ccg 192
Leu Arg Phe Asp Phe Glu Lys Ala Leu Gln Ile Ala Asn Asp Ile Pro
50 55 60
cag gcc gcg gtg gtc aac acc ctg aac cag acc gtc cag cag ggg acc 240
Gln Ala Ala Val Val Asn Thr Leu Asn Gln Thr Val Gln Gln Gly Thr
65 70 75 80
gtc cag gtc agc gtc atg atc gac aag atc gtg gac atc atg aag aat 288
Val Gln Val Ser Val Met Ile Asp Lys Ile Val Asp Ile Met Lys Asn
85 90 95
gtc ctg tcc atc gtg ata gac aac aag aag ttt tgg gat cag gtc acg 336
Val Leu Ser Ile Val Ile Asp Asn Lys Lys Phe Trp Asp Gln Val Thr
100 105 110
gct gcc atc acc aac acc ttc acg aac ctg aac agc cag gag tcg gag 384
Ala Ala Ile Thr Asn Thr Phe Thr Asn Leu Asn Ser Gln Glu Ser Glu
115 120 125
gcc tgg atc ttc tat tac aag gag gac gcc cac aag acg tcc tac tat 432
Ala Trp Ile Phe Tyr Tyr Lys Glu Asp Ala His Lys Thr Ser Tyr Tyr
130 135 140
tac aac atc ctc ttc gcc atc cag gac gaa gag acg ggt ggc gtg atg 480
Tyr Asn Ile Leu Phe Ala Ile Gln Asp Glu Glu Thr Gly Gly Val Met
145 150 155 160
gcc acg ctg ccc atc gcc ttc gac atc agt gtg gac atc gag aag gag 528
Ala Thr Leu Pro Ile Ala Phe Asp Ile Ser Val Asp Ile Glu Lys Glu
165 170 175
aag gtc ctg ttc gtg acc atc aag gac act gag aat tac gcc gtc acc 576
Lys Val Leu Phe Val Thr Ile Lys Asp Thr Glu Asn Tyr Ala Val Thr
180 185 190
gtc aag gcg atc aac gtg gtc cag gca ctc cag tct agc agg gat tct 624
Val Lys Ala Ile Asn Val Val Gln Ala Leu Gln Ser Ser Arg Asp Ser
195 200 205
aag gtg gtt gat gcg ttc aaa tcg cca cgg cac tta ccc cgg aag agg 672
Lys Val Val Asp Ala Phe Lys Ser Pro Arg His Leu Pro Arg Lys Arg
210 215 220
cat acc att tgc tct aac tcg tga tga 699
His Thr Ile Cys Ser Asn Ser
225 230
<210>18
<211>231
<212>PRT
<213>人工序列
<220>
<223>合成构建体
<400>18
Met Phe Phe Asn Arg Val Ile Thr Leu Thr Val Pro Ser Ser Asp Val
1 5 10 15
Val Asn Tyr Ser Glu Ile Tyr Gln Val Ala Pro Gln Tyr Val Asn Gln
20 25 30
Ala Leu Thr Leu Ala Lys Tyr Phe Gln Gly Ala Ile Asp Gly Ser Thr
35 40 45
Leu Arg Phe Asp Phe Glu Lys Ala Leu Gln Ile Ala Asn Asp Ile Pro
50 55 60
Gln Ala Ala Val Val Asn Thr Leu Asn Gln Thr Val Gln Gln Gly Thr
65 70 75 80
Val Gln Val Ser Val Met Ile Asp Lys Ile Val Asp Ile Met Lys Asn
85 90 95
Val Leu Ser Ile Val Ile Asp Asn Lys Lys Phe Trp Asp Gln Val Thr
100 105 110
Ala Ala Ile Thr Asn Thr Phe Thr Asn Leu Asn Ser Gln Glu Ser Glu
115 120 125
Ala Trp Ile Phe Tyr Tyr Lys Glu Asp Ala His Lys Thr Ser Tyr Tyr
130 135 140
Tyr Asn Ile Leu Phe Ala Ile Gln Asp Glu Glu Thr Gly Gly Val Met
145 150 155 160
Ala Thr Leu Pro Ile Ala Phe Asp Ile Ser Val Asp Ile Glu Lys Glu
165 170 175
Lys Val Leu Phe Val Thr Ile Lys Asp Thr Glu Asn Tyr Ala Val Thr
180 185 190
Val Lys Ala Ile Asn Val Val Gln Ala Leu Gln Ser Ser Arg Asp Ser
195 200 205
Lys Val Val Asp Ala Phe Lys Ser Pro Arg His Leu Pro Arg Lys Arg
210 215 220
His Thr Ile Cys Ser Asn Ser
225 230
<210>19
<211>657
<212>DNA
<213>人工序列
<220>
<223>用于植物体中表达TIC812的合成序列
<220>
<221>CDS
<222>(1)..(657)
<223>TIC812
<400>19
atg agc aaa gaa atc agg ctc aac ctt tct cgt gag agc ggc gcc gac 48
Met Ser Lys Glu Ile Arg Leu Asn Leu Ser Arg Glu Ser Gly Ala Asp
1 5 10 15
ctg tac ctc aag atc ctc gcc ttc gtg aag ccc gag cac ttc ttt cag 96
Leu Tyr Leu Lys Ile Leu Ala Phe Val Lys Pro Glu His Phe Phe Gln
20 25 30
gcg tac ctc ctg tgc cgc gag ttc gag agc atc gtg gat cct aca acc 144
Ala Tyr Leu Leu Cys Arg Glu Phe Glu Ser Ile Val Asp Pro Thr Thr
35 40 45
cgc gag ctg gac ttc gac aag acg ctg acc atc gtg aag tcg gac tcc 192
Arg Glu Leu Asp Phe Asp Lys Thr Leu Thr Ile Val Lys Ser Asp Ser
50 55 60
acc ctc gtg acc gtg ggc acg atg aac acc aag ctg gtc aat agc caa 240
Thr Leu Val Thr Val Gly Thr Met Asn Thr Lys Leu Val Asn Ser Gln
65 70 75 80
gag atc ctc gtg tcg gac ttg atc aag caa gtc ggt tcc cag atc gcc 288
Glu Ile Leu Val Ser Asp Leu Ile Lys Gln Val Gly Ser Gln Ile Ala
85 90 95
gat acc ctc ggc atc acg gac atc gac gcc aac acc cag caa agg ctc 336
Asp Thr Leu Gly Ile Thr Asp Ile Asp Ala Asn Thr Gln Gln Arg Leu
100 105 110
acg gag ctg atc gag aac ctc ttc gtg aac ctc aat tcc caa gtt cag 384
Thr Glu Leu Ile Glu Asn Leu Phe Val Asn Leu Asn Ser Gln Val Gln
115 120 125
gac tac atc tac ttc tac gag gag aag gag aag cag acc tcc tac cgc 432
Asp Tyr Ile Tyr Phe Tyr Glu Glu Lys Glu Lys Gln Thr Ser Tyr Arg
130 135 140
tac aac atc ctc ttc gtg ttc gaa aag gag tcg ttc atc acc att ctg 480
Tyr Asn Ile Leu Phe Val Phe Glu Lys Glu Ser Phe Ile Thr Ile Leu
145 150 155 160
cca atg ggc ttc gac gtg acc gtg aac acg aac aag gag gcc gtc ctg 528
Pro Met Gly Phe Asp Val Thr Val Asn Thr Asn Lys Glu Ala Val Leu
165 170 175
aag ctg acc ccg aag gac aag gtt acc tac ggc cac gtc agc gtc aag 576
Lys Leu Thr Pro Lys Asp Lys Val Thr Tyr Gly His Val Ser Val Lys
180 185 190
gcc ctc aac atc atc cag ttc att acg gag gac aag ctc aac ttc ctc 624
Ala Leu Asn Ile Ile Gln Phe Ile Thr Glu Asp Lys Leu Asn Phe Leu
195 200 205
gca acc ctc aag aag gct ctc aag acc ctg tga 657
Ala Thr Leu Lys Lys Ala Leu Lys Thr Leu
210 215
<210>20
<211>218
<212>PRT
<213>人工序列
<220>
<223>合成构建体
<400>20
Met Ser Lys Glu Ile Arg Leu Asn Leu Ser Arg Glu Ser Gly Ala Asp
1 5 10 15
Leu Tyr Leu Lys Ile Leu Ala Phe Val Lys Pro Glu His Phe Phe Gln
20 25 30
Ala Tyr Leu Leu Cys Arg Glu Phe Glu Ser Ile Val Asp Pro Thr Thr
35 40 45
Arg Glu Leu Asp Phe Asp Lys Thr Leu Thr Ile Val Lys Ser Asp Ser
50 55 60
Thr Leu Val Thr Val Gly Thr Met Asn Thr Lys Leu Val Asn Ser Gln
65 70 75 80
Glu Ile Leu Val Ser Asp Leu Ile Lys Gln Val Gly Ser Gln Ile Ala
85 90 95
Asp Thr Leu Gly Ile Thr Asp Ile Asp Ala Asn Thr Gln Gln Arg Leu
100 105 110
Thr Glu Leu Ile Glu Asn Leu Phe Val Asn Leu Asn Ser Gln Val Gln
115 120 125
Asp Tyr Ile Tyr Phe Tyr Glu Glu Lys Glu Lys Gln Thr Ser Tyr Arg
130 135 140
Tyr Asn Ile Leu Phe Val Phe Glu Lys Glu Ser Phe Ile Thr Ile Leu
145 150 155 160
Pro Met Gly Phe Asp Val Thr Val Asn Thr Asn Lys Glu Ala Val Leu
165 170 175
Lys Leu Thr Pro Lys Asp Lys Val Thr Tyr Gly His Val Ser Val Lys
180 185 190
Ala Leu Asn Ile Ile Gln Phe Ile Thr Glu Asp Lys Leu Asn Phe Leu
195 200 205
Ala Thr Leu Lys Lys Ala Leu Lys Thr Leu
210 215
<210>21
<211>22
<212>DNA
<213>人工序列
<220>
<223>合成序列;热扩增引物
<220>
<221>其他特征
<222>(1)..(22)
<223>热扩增引物;pr370
<400>21
cctacttaac ccgcttatag ag 22
<210>22
<211>21
<212>DNA
<213>人工序列
<220>
<223>合成序列;热扩增引物
<220>
<221>其他特征
<222>(1)..(21)
<223>热扩增引物;pr371
<400>22
cagtaccatc ttcagatgtg g 21
<210>23
<211>49
<212>DNA
<213>人工序列
<220>
<223>合成序列;热扩增引物
<220>
<221>其他特征
<222>(1)..(49)
<223>热扩增引物;pr375
<400>23
gactagtaat gagtaaagaa attcgtttaa atttgagtag agaatcagg 49
<210>24
<211>30
<212>DNA
<213>人工序列
<220>
<223>合成序列;热扩增引物
<220>
<221>其他特征
<222>(1)..(30)
<223>热扩增引物;pr376
<400>24
aactcgagcc tacttaaccc gcttatagag 30
<210>25
<211>35
<212>DNA
<213>人工序列
<220>
<223>合成序列;热扩增引物
<220>
<221>其他特征
<222>(1)..(35)
<223>热扩增引物;pr365
<400>25
aactcgagtc catttagtgc ttccccataa cagcc 35
<210>26
<211>43
<212>DNA
<213>人工序列
<220>
<223>合成序列;热扩增引物
<220>
<221>其他特征
<222>(1)..(43)
<223>热扩增引物;pr372
<400>26
aacctaggat gttctttaat cgcgttatta cattaacagt acc 43
<210>27
<211>49
<212>DNA
<213>人工序列
<220>
<223>合成序列;热扩增引物
<220>
<221>其他特征
<222>(1)..(49)
<223>热扩增引物;pr421
<400>27
gcctaggtat gagtaaagaa attcgtttaa atttgagtag agaatcagg 49
<210>28
<211>4257
<212>DNA
<213>人工序列
<220>
<223>合成序列;编码TIC89和TIC810的pMON64138表达盒
<220>
<221>其他特征
<223>pMON64138第一和第二植物表达盒
<220>
<221>启动子
<222>(1)..(614)
<223>e35S
<220>
<221>5’UTR
<222>(650)..(710)
<223>小麦CAB前导序列
<220>
<221>内含子
<222>(727)..(1206)
<223>水稻肌动蛋白
<220>
<221>CDS
<222>(1216)..(1917)
<223>TIC809
<220>
<221>终止子
<222>(1921)..(2130)
<223>小麦Hsp17
<220>
<221>启动子
<222>(2168)..(2781)
<223>CaMV 35S enh
<220>
<221>5’UTR
<222>(2817)..(2877)
<223>小麦CAB前导序列
<220>
<221>内含子
<222>(2894)..(3373)
<223>水稻肌动蛋白
<220>
<221>CDS
<222>(3383)..(4042)
<223>TIC810
<220>
<221>终止子
<222>(4048)..(4257)
<223>小麦Hsp17
<400>28
ggtccgatgt gagacttttc aacaaagggt aatatccgga aacctcctcg gattccattg 60
cccagctatc tgtcacttta ttgtgaagat agtggaaaag gaaggtggct cctacaaatg 120
ccatcattgc gataaaggaa aggccatcgt tgaagatgcc tctgccgaca gtggtcccaa 180
agatggaccc ccacccacga ggagcatcgt ggaaaaagaa gacgttccaa ccacgtcttc 240
aaagcaagtg gattgatgtg atggtccgat gtgagacttt tcaacaaagg gtaatatccg 300
gaaacctcct cggattccat tgcccagcta tctgtcactt tattgtgaag atagtggaaa 360
aggaaggtgg ctcctacaaa tgccatcatt gcgataaagg aaaggccatc gttgaagatg 420
cctctgccga cagtggtccc aaagatggac ccccacccac gaggagcatc gtggaaaaag 480
aagacgttcc aaccacgtct tcaaagcaag tggattgatg tgatatctcc actgacgtaa 540
gggatgacgc acaatcccac tatccttcgc aagacccttc ctctatataa ggaagttcat 600
ttcatttgga gaggacacgc tgacaagctg actctagcag atcctctaga accatcttcc 660
acacactcaa gccacactat tggagaacac acagggacaa cacaccataa gatccaaggg 720
aggcctccgc cgccgccggt aaccaccccg cccctctcct ctttctttct ccgttttttt 780
ttccgtctcg gtctcgatct ttggccttgg tagtttgggt gggcgagagg cggcttcgtg 840
cgcgcccaga tcggtgcgcg ggaggggcgg gatctcgcgg ctggggctct cgccggcgtg 900
gatccggccc ggatctcgcg gggaatgggg ctctcggatg tagatctgcg atccgccgtt 960
gttgggggag atgatggggg gtttaaaatt tccgccgtgc taaacaagat caggaagagg 1020
ggaaaagggc actatggttt atatttttat atatttctgc tgcttcgtca ggcttagatg 1080
tgctagatct ttctttcttc tttttgtggg tagaatttga atccctcagc attgttcatc 1140
ggtagttttt cttttcatga tttgtgacaa atgcagcctc gtgcggagct tttttgtagg 1200
tagaagtgat caacc atg gcc ttc ttc aac cgg gtg atc acc ctc acg gtg 1251
Met Ala Phe Phe Asn Arg Val Ile Thr Leu Thr Val
1 5 10
ccg tcg tca gac gtg gtc aac tac tcg gag atc tac cag gtg gct cct 1299
Pro Ser Ser Asp Val Val Asn Tyr Ser Glu Ile Tyr Gln Val Ala Pro
15 20 25
cag tat gtc aac cag gcc ctg acc ctg gcc aag tac ttc cag ggc gcc 1347
Gln Tyr Val Asn Gln Ala Leu Thr Leu Ala Lys Tyr Phe Gln Gly Ala
30 35 40
atc gac ggc agc acc ctg agg ttc gac ttc gag aag gcg tta cag atc 1395
Ile Asp Gly Ser Thr Leu Arg Phe Asp Phe Glu Lys Ala Leu Gln Ile
45 50 55 60
gcc aac gac atc ccg cag gcc gcg gtg gtc aac acc ctg aac cag acc 1443
Ala Asn Asp Ile Pro Gln Ala Ala Val Val Asn Thr Leu Asn Gln Thr
65 70 75
gtc cag cag ggg acc gtc cag gtc agc gtc atg atc gac aag atc gtg 1491
Val Gln Gln Gly Thr Val Gln Val Ser Val Met Ile Asp Lys Ile Val
80 85 90
gac atc atg aag aat gtc ctg tcc atc gtg ata gac aac aag aag ttt 1539
Asp Ile Met Lys Asn Val Leu Ser Ile Val Ile Asp Asn Lys Lys Phe
95 100 105
tgg gat cag gtc acg gct gcc atc acc aac acc ttc acg aac ctg aac 1587
Trp Asp Gln Val Thr Ala Ala Ile Thr Asn Thr Phe Thr Asn Leu Asn
110 115 120
agc cag gag tcg gag gcc tgg atc ttc tat tac aag gag gac gcc cac 1635
Ser Gln Glu Ser Glu Ala Trp Ile Phe Tyr Tyr Lys Glu Asp Ala His
125 130 135 140
aag acg tcc tac tat tac aac atc ctc ttc gcc atc cag gac gaa gag 1683
Lys Thr Ser Tyr Tyr Tyr Asn Ile Leu Phe Ala Ile Gln Asp Glu Glu
145 150 155
acg ggt ggc gtg atg gcc acg ctg ccc atc gcc ttc gac atc agt gtg 173l
Thr Gly Gly Val Met Ala Thr Leu Pro Ile Ala Phe Asp Ile Ser Val
160 165 170
gac atc gag aag gag aag gtc ctg ttc gtg acc atc aag gac act gag 1779
Asp Ile Glu Lys Glu Lys Val Leu Phe Val Thr Ile Lys Asp Thr Glu
175 180 185
aat tac gcc gtc acc gtc aag gcg atc aac gtg gtc cag gca ctc cag 1827
Asn Tyr Ala Val Thr Val Lys Ala Ile Asn Val Val Gln Ala Leu Gln
190 195 200
tct agc agg gat tct aag gtg gtt gat gcg ttc aaa tcg cca cgg cac 1875
Ser Ser Arg Asp Ser Lys Val Val Asp Ala Phe Lys Ser Pro Arg His
205 210 215 220
tta ccc cgg aag agg cat aag att tgc tct aac tcg tga tga 1917
Leu Pro Arg Lys Arg His Lys Ile Cys Ser Asn Ser
225 230
attctgcatg cgtttggacg tatgctcatt caggttggag ccaatttggt tgatgtgtgt 1977
gcgagttctt gcgagtctga tgagacatct ctgtattgtg tttctttccc cagtgttttc 2037
tgtacttgtg taatcggcta atcgccaaca gattcggcga tgaataaatg agaaataaat 2097
tgttctgatt ttgagtgcaa aaaaaaagga attagatctg tgtgtgtttt ttggatcccc 2157
agcttctgca ggtccgatgt gagacttttc aacaaagggt aatatccgga aacctcctcg 2217
gattccattg cccagctatc tgtcacttta ttgtgaagat agtggaaaag gaaggtggct 2277
cctacaaatg ccatcattgc gataaaggaa aggccatcgt tgaagatgcc tctgccgaca 2337
gtggtcccaa agatggaccc ccacccacga ggagcatcgt ggaaaaagaa gacgttccaa 2397
ccacgtcttc aaagcaagtg gattgatgtg atggtccgat gtgagacttt tcaacaaagg 2457
gtaatatccg gaaacctcct cggattccat tgcccagcta tctgtcactt tattgtgaag 2517
atagtggaaa aggaaggtgg ctcctacaaa tgccatcatt gcgataaagg aaaggccatc 2577
gttgaagatg cctctgccga cagtggtccc aaagatggac ccccacccac gaggagcatc 2637
gtggaaaaag aagacgttcc aaccacgtct tcaaagcaag tggattgatg tgatatctcc 2697
actgacgtaa gggatgacgc acaatcccac tatccttcgc aagacccttc ctctatataa 2757
ggaagttcat ttcatttgga gaggacacgc tgacaagctg actctagcag atcctctaga 2817
accatcttcc acacactcaa gccacactat tggagaacac acagggacaa cacaccataa 2877
gatccaaggg aggcctccgc cgccgccggt aaccaccccg cccctctcct ctttctttct 2937
ccgttttttt ttccgtctcg gtctcgatct ttggccttgg tagtttgggt gggcgagagg 2997
cggcttcgtg cgcgcccaga tcggtgcgcg ggaggggcgg gatctcgcgg ctggggctct 3057
cgccggcgtg gatccggccc ggatctcgcg gggaatgggg ctctcggatg tagatctgcg 3117
atccgccgtt gttgggggag atgatggggg gtttaaaatt tccgccgtgc taaacaagat 3177
caggaagagg ggaaaagggc actatggttt atatttttat atatttctgc tgcttcgtca 3237
ggcttagatg tgctagatct ttctttcttc tttttgtggg tagaatttga atccctcagc 3297
attgttcatc ggtagttttt cttttcatga tttgtgacaa atgcagcctc gtgcggagct 3357
tttttgtagg tagaagtgat caacc atg agc aaa gaa atc agg ctc aac ctt 3409
Met Ser Lys Glu Ile Arg Leu Asn Leu
235 240
tct cgt gag agc ggc gcc gac ctg tac ctc aag atc ctc gcc ttc gtg 3457
Ser Arg Glu Ser Gly Ala Asp Leu Tyr Leu Lys Ile Leu Ala Phe Val
245 250 255
aag ccc gag cac ttc ttt cag gcg tac ctc ctg tgc cgc gag ttc gag 3505
Lys Pro Glu His Phe Phe Gln Ala Tyr Leu Leu Cys Arg Glu Phe Glu
260 265 270
agc atc gtg gat cct aca acc cgc gag tct gac ttc gac aag acg ctg 3553
Ser Ile Val Asp Pro Thr Thr Arg Glu Ser Asp Phe Asp Lys Thr Leu
275 280 285
acc atc gtg aag tcg gac tcc acc ctc gtg acc gtg ggc acg atg aac 3601
Thr Ile Val Lys Ser Asp Ser Thr Leu Val Thr Val Gly Thr Met Asn
290 295 300 305
acc aag ctg gtc aat agc caa gag atc ctc gtg tcg gac ttg atc act 3649
Thr Lys Leu Val Asn Ser Gln Glu Ile Leu Val Ser Asp Leu Ile Thr
310 315 320
caa gtc ggt tcc cag atc gcc gat acc ctc ggc atc acg gac atc gac 3697
Gln Val Gly Ser Gln Ile Ala Asp Thr Leu Gly Ile Thr Asp Ile Asp
325 330 335
gcc aac acc cag caa cag ctc acg gag ctg atc ggc aac ctc ttc gtg 3745
Ala Asn Thr Gln Gln Gln Leu Thr Glu Leu Ile Gly Asn Leu Phe Val
340 345 350
aac ctc aat tcc caa gtt cag gag tac atc tac ttc tac gag gag aag 3793
Asn Leu Asn Ser Gln Val Gln Glu Tyr Ile Tyr Phe Tyr Glu Glu Lys
355 360 365
gag aag cag acc tcc tac cgc tac aac atc ctc ttc gtg ttc gaa aag 3841
Glu Lys Gln Thr Ser Tyr Arg Tyr Asn Ile Leu Phe Val Phe Glu Lys
370 375 380 385
gag tcg ttc atc acc att ctg cca atg ggc ttc gac gtg acc gtg aac 3889
Glu Ser Phe Ile Thr Ile Leu Pro Met Gly Phe Asp Val Thr Val Asn
390 395 400
acg aac aag gag gcc gtc ctg aag ctg acc ccg aag gac aag gtt acc 3937
Thr Asn Lys Glu Ala Val Leu Lys Leu Thr Pro Lys Asp Lys Val Thr
405 410 415
tac ggc cac gtc agc gtc aag gcc ctc aac atc atc cag ctc att acg 3985
Tyr Gly His Val Ser Val Lys Ala Leu Asn Ile Ile Gln Leu Ile Thr
420 425 430
gag gac aag ttc aac ttc ctc gca acc ctc aag aag gct ctc aag acc 4033
Glu Asp Lys Phe Asn Phe Leu Ala Thr Leu Lys Lys Ala Leu Lys Thr
435 440 445
ctg tga tga gaattctgca tgcgtttgga cgtatgctca ttcaggttgg 4082
Leu
450
agccaatttg gttgatgtgt gtgcgagttc ttgcgagtct gatgagacat ctctgtattg 4142
tgtttctttc cccagtgttt tctgtacttg tgtaatcggc taatcgccaa cagattcggc 4202
gatgaataaa tgagaaataa attgttctga ttttgagtgc aaaaaaaaag gaatt 4257
<210>29
<211>232
<212>PRT
<213>人工序列
<220>
<223>合成构建体
<400>29
Met Ala Phe Phe Asn Arg Val Ile Thr Leu Thr Val Pro Ser Ser Asp
1 5 10 15
Val Val Asn Tyr Ser Glu Ile Tyr Gln Val Ala Pro Gln Tyr Val Asn
20 25 30
Gln Ala Leu Thr Leu Ala Lys Tyr Phe Gln Gly Ala Ile Asp Gly Ser
35 40 45
Thr Leu Arg Phe Asp Phe Glu Lys Ala Leu Gln Ile Ala Asn Asp Ile
50 55 60
Pro Gln Ala Ala Val Val Asn Thr Leu Asn Gln Thr Val Gln Gln Gly
65 70 75 80
Thr Val Gln Val Ser Val Met Ile Asp Lys Ile Val Asp Ile Met Lys
85 90 95
Asn Val Leu Ser Ile Val Ile Asp Asn Lys Lys Phe Trp Asp Gln Val
100 105 110
Thr Ala Ala Ile Thr Asn Thr Phe Thr Asn Leu Asn Ser Gln Glu Ser
115 120 125
Glu Ala Trp Ile Phe Tyr Tyr Lys Glu Asp Ala His Lys Thr Ser Tyr
130 135 140
Tyr Tyr Asn Ile Leu Phe Ala Ile Gln Asp Glu Glu Thr Gly Gly Val
145 150 155 160
Met Ala Thr Leu Pro Ile Ala Phe Asp Ile Ser Val Asp Ile Glu Lys
165 170 175
Glu Lys Val Leu Phe Val Thr Ile Lys Asp Thr Glu Asn Tyr Ala Val
180 185 190
Thr Val Lys Ala Ile Asn Val Val Gln Ala Leu Gln Ser Ser Arg Asp
195 200 205
Ser Lys Val Val Asp Ala Phe Lys Ser Pro Arg His Leu Pro Arg Lys
210 215 220
Arg His Lys Ile Cys Ser Asn Ser
225 230
<210>30
<211>218
<212>PRT
<213>人工序列
<220>
<223>合成构建体
<400>30
Met Ser Lys Glu Ile Arg Leu Asn Leu Ser Arg Glu Ser Gly Ala Asp
1 5 10 15
Leu Tyr Leu Lys Ile Leu Ala Phe Val Lys Pro Glu His Phe Phe Gln
20 25 30
Ala Tyr Leu Leu Cys Arg Glu Phe Glu Ser Ile Val Asp Pro Thr Thr
35 40 45
Arg Glu Ser Asp Phe Asp Lys Thr Leu Thr Ile Val Lys Ser Asp Ser
50 55 60
Thr Leu Val Thr Val Gly Thr Met Asn Thr Lys Leu Val Asn Ser Gln
65 70 75 80
Glu Ile Leu Val Ser Asp Leu Ile Thr Gln Val Gly Ser Gln Ile Ala
85 90 95
Asp Thr Leu Gly Ile Thr Asp Ile Asp Ala Asn Thr Gln Gln Gln Leu
100 105 110
Thr Glu Leu Ile Gly Asn Leu Phe Val Asn Leu Ash Ser Gln Val Gln
115 120 125
Glu Tyr Ile Tyr Phe Tyr Glu Glu Lys Glu Lys Gln Thr Ser Tyr Arg
130 135 140
Tyr Asn Ile Leu Phe Val Phe Glu Lys Glu Ser Phe Ile Thr Ile Leu
145 150 155 160
Pro Met Gly Phe Asp Val Thr Val Ash Thr Asn Lys Glu Ala Val Leu
165 170 175
Lys Leu Thr Pro Lys Asp Lys Val Thr Tyr Gly His Val Ser Val Lys
180 185 190
Ala Leu Asn Ile Ile Gln Leu Ile Thr Glu Asp Lys Phe Asn Phe Leu
195 200 205
Ala Thr Leu Lys Lys Ala Leu Lys Thr Leu
210 215
<210>31
<211>5079
<212>DNA
<213>人工序列
<220>
<223>合成序列;在pMON64139中编码TIC809和TIC810的表达盒
<220>
<221>其他特征
<223>pMON64139
<220>
<221>启动子
<222>(1)..(614)
<223>e35S
<220>
<221>5’UTR
<222>(650)..(710)
<223>小麦CAB前导序列
<220>
<221>内含子
<222>(727)..(1206)
<223>水稻肌动蛋白(外显子727-738;内含子739-1199;外显子1200-1206)
<220>
<221>转运肽
<222>(1230)..(1370)
<223>玉米SSU信号
<220>
<221>内含子
<222>(1231)..(1539)
<223>Zm RbcS
<220>
<221>转运肽
<222>(1540)..(1626)
<223>Zm RbcS
<220>
<221>CDS
<222>(1627)..(2328)
<223>TIC809
<220>
<221>终止子
<222>(2332)..(2541)
<223>小麦Hsp17
<220>
<221>启动子
<222>(2579)..(3192)
<223>CaMV 35S enh
<220>
<221>5’UTR
<222>(3228)..(3288)
<223>小麦CAB前导序列
<220>
<221>内含子
<222>(3305)..(3784)
<223>水稻肌动蛋白(外显子3305-3316;内含子3317-3777;外显子3778-3784)
<220>
<221>转运肽
<222>(3808)..(3948)
<223>玉米SSU信号
<220>
<221>内含子
<222>(3949)..(4117)
<223>Zm RbcS
<220>
<221>转运肽
<222>(4118)..(4204)
<223>Zm RbcS
<220>
<221>CDS
<222>(4205)..(4864)
<223>TIC810
<220>
<221>终止子
<222>(4870)..(5079)
<223>小麦Hsp17
<400>31
ggtccgatgt gagacttttc aacaaagggt aatatccgga aacctcctcg gattccattg 60
cccagctatc tgtcacttta ttgtgaagat agtggaaaag gaaggtggct cctacaaatg 120
ccatcattgc gataaaggaa aggccatcgt tgaagatgcc tctgccgaca gtggtcccaa 180
agatggaccc ccacccacga ggagcatcgt ggaaaaagaa gacgttccaa ccacgtcttc 240
aaagcaagtg gattgatgtg atggtccgat gtgagacttt tcaacaaagg gtaatatccg 300
gaaacctcct cggattccat tgcccagcta tctgtcactt tattgtgaag atagtggaaa 360
aggaaggtgg ctcctacaaa tgccatcatt gcgataaagg aaaggccatc gttgaagatg 420
cctctgccga cagtggtccc aaagatggac ccccacccac gaggagcatc gtggaaaaag 480
aagacgttcc aaccacgtct tcaaagcaag tggattgatg tgatatctcc actgacgtaa 540
gggatgacgc acaatcccac tatccttcgc aagacccttc ctctatataa ggaagttcat 600
ttcatttgga gaggacacgc tgacaagctg actctagcag atcctctaga accatcttcc 660
acacactcaa gccacactat tggagaacac acagggacaa cacaccataa gatccaaggg 720
aggcctccgc cgccgccggt aaccaccccg cccctctcct ctttctttct ccgttttttt 780
ttccgtctcg gtctcgatct ttggccttgg tagtttgggt gggcgagagg cggcttcgtg 840
cgcgcccaga tcggtgcgcg ggaggggcgg gatctcgcgg ctggggctct cgccggcgtg 900
gatccggccc ggatctcgcg gggaatgggg ctctcggatg tagatctgcg atccgccgtt 960
gttgggggag atgatggggg gtttaaaatt tccgccgtgc taaacaagat caggaagagg 1020
ggaaaagggc actatggttt atatttttat atatttctgc tgcttcgtca ggcttagatg 1080
tgctagatct ttctttcttc tttttgtggg tagaatttga atccctcagc attgttcatc 1140
ggtagttttt cttttcatga tttgtgacaa atgcagcctc gtgcggagct tttttgtagg 1200
tagaagtgat caacctctag aggatcagca tggcgcccac cgtgatgatg gcctcgtcgg 1260
ccaccgccgt cgctccgttc ctggggctca agtccaccgc cagcctcccc gtcgcccgcc 1320
gctcctccag aagcctcggc aacgtcagca acggcggaag gatccggtgc atgcaggtaa 1380
caaatgcatc ctagctagta gttctttgca ttgcagcagc tgcagctagc gagttagtaa 1440
taggaaggga actgatgatc catgcatgga ctgatgtgtg ttgcccatcc catcccatcc 1500
catttcccaa acgaaccgaa aacaccgtac tacgtgcagg tgtggcccta cggcaacaag 1560
aagttcgaga cgctgtcgta cctgccgccg ctgtcgaccg gcgggcgcat ccgctgcatg 1620
caggcc atg gcc ttc ttc aac cgg gtg atc acc ctc acg gtg ccg tcg 1668
Met Ala Phe Phe Asn Arg Val Ile Thr Leu Thr Val Pro Ser
1 5 10
tca gac gtg gtc aac tac tcg gag atc tac cag gtg gct cct cag tat 1716
Ser Asp Val Val Asn Tyr Ser Glu Ile Tyr Gln Val Ala Pro Gln Tyr
15 20 25 30
gtc aac cag gcc ctg acc ctg gcc aag tac ttc cag ggc gcc atc gac 1764
Val Asn Gln Ala Leu Thr Leu Ala Lys Tyr Phe Gln Gly Ala Ile Asp
35 40 45
ggc agc acc ctg agg ttc gac ttc gag aag gcg tta cag atc gcc aac 1812
Gly Ser Thr Leu Arg Phe Asp Phe Glu Lys Ala Leu Gln Ile Ala Asn
50 55 60
gac atc ccg cag gcc gcg gtg gtc aac acc ctg aac cag acc gtc cag 1860
Asp Ile Pro Gln Ala Ala Val Val Asn Thr Leu Asn Gln Thr Val Gln
65 70 75
cag ggg acc gtc cag gtc agc gtc atg atc gac aag atc gtg gac atc 1908
Gln Gly Thr Val Gln Val Ser Val Met Ile Asp Lys Ile Val Asp Ile
80 85 90
atg aag aat gtc ctg tcc atc gtg ata gac aac aag aag ttt tgg gat 1956
Met Lys Asn Val Leu Ser Ile Val Ile Asp Asn Lys Lys Phe Trp Asp
95 100 105 110
cag gtc acg gct gcc atc acc aac acc ttc acg aac ctg aac agc cag 2004
Gln Val Thr Ala Ala Ile Thr Asn Thr Phe Thr Asn Leu Asn Ser Gln
115 120 125
gag tcg gag gcc tgg atc ttc tat tac aag gag gac gcc cac aag acg 2052
Glu Ser Glu Ala Trp Ile Phe Tyr Tyr Lys Glu Asp Ala His Lys Thr
130 135 140
tcc tac tat tac aac atc ctc ttc gcc atc cag gac gaa gag acg ggt 2100
Ser Tyr Tyr Tyr Asn Ile Leu Phe Ala Ile Gln Asp Glu Glu Thr Gly
145 150 155
ggc gtg atg gcc acg ctg ccc atc gcc ttc gac atc agt gtg gac atc 2148
Gly Val Met Ala Thr Leu Pro Ile Ala Phe Asp Ile Ser Val Asp Ile
160 165 170
gag aag gag aag gtc ctg ttc gtg acc atc aag gac act gag aat tac 2196
Glu Lys Glu Lys Val Leu Phe Val Thr Ile Lys Asp Thr Glu Asn Tyr
175 180 185 190
gcc gtc acc gtc aag gcg atc aac gtg gtc cag gca ctc cag tct agc 2244
Ala Val Thr Val Lys Ala Ile Asn Val Val Gln Ala Leu Gln Ser Ser
195 200 205
agg gat tct aag gtg gtt gat gcg ttc aaa tcg cca cgg cac tta ccc 2292
Arg Asp Ser Lys Val Val Asp Ala Phe Lys Ser Pro Arg His Leu Pro
210 215 220
cgg aag agg cat aag att tgc tct aac tcg tga tga attctgcatg 2338
Arg Lys Arg His Lys Ile Cys Ser Asn Ser
225 230
cgtttggacg tatgctcatt caggttggag ccaatttggt tgatgtgtgt gcgagttctt 2398
gcgagtctga tgagacatct ctgtattgtg tttctttccc cagtgttttc tgtacttgtg 2458
taatcggcta atcgccaaca gattcggcga tgaataaatg agaaataaat tgttctgatt 2518
ttgagtgcaa aaaaaaagga attagatctg tgtgtgtttt ttggatcccc agcttctgca 2578
ggtccgatgt gagacttttc aacaaagggt aatatccgga aacctcctcg gattccattg 2638
cccagctatc tgtcacttta ttgtgaagat agtggaaaag gaaggtggct cctacaaatg 2698
ccatcattgc gataaaggaa aggccatcgt tgaagatgcc tctgccgaca gtggtcccaa 2758
agatggaccc ccacccacga ggagcatcgt ggaaaaagaa gacgttccaa ccacgtcttc 2818
aaagcaagtg gattgatgtg atggtccgat gtgagacttt tcaacaaagg gtaatatccg 2878
gaaacctcct cggattccat tgcccagcta tctgtcactt tattgtgaag atagtggaaa 2938
aggaaggtgg ctcctacaaa tgccatcatt gcgataaagg aaaggccatc gttgaagatg 2998
cctctgccga cagtggtccc aaagatggac ccccacccac gaggagcatc gtggaaaaag 3058
aagacgttcc aaccacgtct tcaaagcaag tggattgatg tgatatctcc actgacgtaa 3118
gggatgacgc acaatcccac tatccttcgc aagacccttc ctctatataa ggaagttcat 3178
ttcatttgga gaggacacgc tgacaagctg actctagcag atcctctaga accatcttcc 3238
acacactcaa gccacactat tggagaacac acagggacaa cacaccataa gatccaaggg 3298
aggcctccgc cgccgccggt aaccaccccg cccctctcct ctttctttct ccgttttttt 3358
ttccgtctcg gtctcgatct ttggccttgg tagtttgggt gggcgagagg cggcttcgtg 3418
cgcgcccaga tcggtgcgcg ggaggggcgg gatctcgcgg ctggggctct cgccggcgtg 3478
gatccggccc ggatctcgcg gggaatgggg ctctcggatg tagatctgcg atccgccgtt 3538
gttgggggag atgatggggg gtttaaaatt tccgccgtgc taaacaagat caggaagagg 3598
ggaaaagggc actatggttt atatttttat atatttctgc tgcttcgtca ggcttagatg 3658
tgctagatct ttctttcttc tttttgtggg tagaatttga atccctcagc attgttcatc 3718
ggtagttttt cttttcatga tttgtgacaa atgcagcctc gtgcggagct tttttgtagg 3778
tagaagtgat caacctctag aggatcagca tggcgcccac cgtgatgatg gcctcgtcgg 3838
ccaccgccgt cgctccgttc ctggggctca agtccaccgc cagcctcccc gtcgcccgcc 3898
gctcctccag aagcctcggc aacgtcagca acggcggaag gatccggtgc atgcaggtaa 3958
caaatgcatc ctagctagta gttctttgca ttgcagcagc tgcagctagc gagttagtaa 4018
taggaaggga actgatgatc catgcatgga ctgatgtgtg ttgcccatcc catcccatcc 4078
catttcccaa acgaaccgaa aacaccgtac tacgtgcagg tgtggcccta cggcaacaag 4138
aagttcgaga cgctgtcgta cctgccgccg ctgtcgaccg gcgggcgcat ccgctgcatg 4198
caggcc atg agc aaa gaa atc agg ctc aac ctt tct cgt gag agc ggc 4246
Met Ser Lys Glu Ile Arg Leu Asn Leu Ser Arg Glu Ser Gly
235 240 245
gcc gac ctg tac ctc aag atc ctc gcc ttc gtg aag ccc gag cac ttc 4294
Ala Asp Leu Tyr Leu Lys Ile Leu Ala Phe Val Lys Pro Glu His Phe
250 255 260
ttt cag gcg tac ctc ctg tgc cgc gag ttc gag agc atc gtg gat cct 4342
Phe Gln Ala Tyr Leu Leu Cys Arg Glu Phe Glu Ser Ile Val Asp Pro
265 270 275
aca acc cgc gag tct gac ttc gac aag acg ctg acc atc gtg aag tcg 4390
Thr Thr Arg Glu Ser Asp Phe Asp Lys Thr Leu Thr Ile Val Lys Ser
280 285 290
gac tcc acc ctc gtg acc gtg ggc acg atg aac acc aag ctg gtc aat 4438
Asp Ser Thr Leu Val Thr Val Gly Thr Met Asn Thr Lys Leu Val Asn
295 300 305 310
agc caa gag atc ctc gtg tcg gac ttg atc act caa gtc ggt tcc cag 4486
Ser Gln Glu Ile Leu Val Ser Asp Leu Ile Thr Gln Val Gly Ser Gln
315 320 325
atc gcc gat acc ctc ggc atc acg gac atc gac gcc aac acc cag caa 4534
Ile Ala Asp Thr Leu Gly Ile Thr Asp Ile Asp Ala Asn Thr Gln Gln
330 335 340
cag ctc acg gag ctg atc ggc aac ctc ttc gtg aac ctc aat tcc caa 4582
Gln Leu Thr Glu Leu Ile Gly Asn Leu Phe Val Asn Leu Asn Ser Gln
345 350 355
gtt cag gag tac atc tac ttc tac gag gag aag gag aag cag acc tcc 4630
Val Gln Glu Tyr Ile Tyr Phe Tyr Glu Glu Lys Glu Lys Gln Thr Ser
360 365 370
tac cgc tac aac atc ctc ttc gtg ttc gaa aag gag tcg ttc atc acc 4678
Tyr Arg Tyr Asn Ile Leu Phe Val Phe Glu Lys Glu Ser Phe Ile Thr
375 380 385 390
att ctg cca atg ggc ttc gac gtg acc gtg aac acg aac aag gag gcc 4726
Ile Leu Pro Met Gly Phe Asp Val Thr Val Asn Thr Asn Lys Glu Ala
395 400 405
gtc ctg aag ctg acc ccg aag gac aag gtt acc tac ggc cac gtc agc 4774
Val Leu Lys Leu Thr Pro Lys Asp Lys Val Thr Tyr Gly His Val Ser
410 415 420
gtc aag gcc ctc aac atc atc cag ctc att acg gag gac aag ttc aac 4822
Val Lys Ala Leu Asn Ile Ile Gln Leu Ile Thr Glu Asp Lys Phe Asn
425 430 435
ttc ctc gca acc ctc aag aag gct ctc aag acc ctg tga tga 4864
Phe Leu Ala Thr Leu Lys Lys Ala Leu Lys Thr Leu
440 445 450
gaattctgca tgcgtttgga cgtatgctca ttcaggttgg agccaatttg gttgatgtgt 4924
gtgcgagttc ttgcgagtct gatgagacat ctctgtattg tgtttctttc cccagtgttt 4984
tctgtacttg tgtaatcggc taatcgccaa cagattcggc gatgaataaa tgagaaataa 5044
attgttctga ttttgagtgc aaaaaaaaag gaatt 5079
<210>32
<211>232
<212>PRT
<213>人工序列
<220>
<223>合成构建体
<400>32
Met Ala Phe Phe Asn Arg Val Ile Thr Leu Thr Val Pro Ser Ser Asp
1 5 10 15
Val Val Asn Tyr Ser Glu Ile Tyr Gln Val Ala Pro Gln Tyr Val Asn
20 25 30
Gln Ala Leu Thr Leu Ala Lys Tyr Phe Gln Gly Ala Ile Asp Gly Ser
35 40 45
Thr Leu Arg Phe Asp Phe Glu Lys Ala Leu Gln Ile Ala Asn Asp Ile
50 55 60
Pro Gln Ala Ala Val Val Asn Thr Leu Asn Gln Thr Val Gln Gln Gly
65 70 75 80
Thr Val Gln Val Ser Val Met Ile Asp Lys Ile Val Asp Ile Met Lys
85 90 95
Asn Val Leu Ser Ile Val Ile Asp Asn Lys Lys Phe Trp Asp Gln Val
100 105 110
Thr Ala Ala Ile Thr Asn Thr Phe Thr Asn Leu Asn Ser Gln Glu Ser
115 120 125
Glu Ala Trp Ile Phe Tyr Tyr Lys Glu Asp Ala His Lys Thr Ser Tyr
130 135 140
Tyr Tyr Asn Ile Leu Phe Ala Ile Gln Asp Glu Glu Thr Gly Gly Val
145 150 155 160
Met Ala Thr Leu Pro Ile Ala Phe Asp Ile Ser Val Asp Ile Glu Lys
165 170 175
Glu Lys Val Leu Phe Val Thr Ile Lys Asp Thr Glu Asn Tyr Ala Val
180 185 190
Thr Val Lys Ala Ile Asn Val Val Gln Ala Leu Gln Ser Ser Arg Asp
195 200 205
Ser Lys Val Val Asp Ala Phe Lys Ser Pro Arg His Leu Pro Arg Lys
210 215 220
Arg His Lys Ile Cys Ser Asn Ser
225 230
<210>33
<211>218
<212>PRT
<213>人工序列
<220>
<223>合成构建体
<400>33
Met Ser Lys Glu Ile Arg Leu Asn Leu Ser Arg Glu Ser Gly Ala Asp
1 5 10 15
Leu Tyr Leu Lys Ile Leu Ala Phe Val Lys Pro Glu His Phe Phe Gln
20 25 30
Ala Tyr Leu Leu Cys Arg Glu Phe Glu Ser Ile Val Asp Pro Thr Thr
35 40 45
Arg Glu Ser Asp Phe Asp Lys Thr Leu Thr Ile Val Lys Ser Asp Ser
50 55 60
Thr Leu Val Thr Val Gly Thr Met Asn Thr Lys Leu Val Asn Ser Gln
65 70 75 80
Glu Ile Leu Val Ser Asp Leu Ile Thr Gln Val Gly Ser Gln Ile Ala
85 90 95
Asp Thr Leu Gly Ile Thr Asp Ile Asp Ala Asn Thr Gln Gln Gln Leu
100 105 110
Thr Glu Leu Ile Gly Asn Leu Phe Val Asn Leu Asn Ser Gln Val Gln
115 120 125
Glu Tyr Ile Tyr Phe Tyr Glu Glu Lys Glu Lys Gln Thr Ser Tyr Arg
130 135 140
Tyr Asn Ile Leu Phe Val Phe Glu Lys Glu Ser Phe Ile Thr Ile Leu
145 150 155 160
Pro Met Gly Phe Asp Val Thr Val Asn Thr Asn Lys Glu Ala Val Leu
165 170 175
Lys Leu Thr Pro Lys Asp Lys Val Thr Tyr Gly His Val Ser Val Lys
180 185 190
Ala Leu Asn Ile Ile Gln Leu Ile Thr Glu Asp Lys Phe Asn Phe Leu
195 200 205
Ala Thr Leu Lys Lys Ala Leu Lys Thr Leu
210 215
<210>34
<211>2148
<212>DNA
<213>人工序列
<220>
<223>合成序列;在pMON70513中编码TIC809的表达盒
<220>
<221>其他特征
<223>pMON70513植物表达盒
<220>
<221>启动子
<222>(1)..(614)
<223>e35S
<220>
<221>5’UTR
<222>(650)..(710)
<223>小麦CAB前导序列
<220>
<221>内含子
<222>(727)..(1206)
<223>水稻肌动蛋白
<220>
<221>CDS
<222>(1216)..(1917)
<223>TIC809
<220>
<221>终止子
<222>(1939)..(2148)
<223>小麦Hsp17
<400>34
ggtccgatgt gagacttttc aacaaagggt aatatccgga aacctcctcg gattccattg 60
cccagctatc tgtcacttta ttgtgaagat agtggaaaag gaaggtggct cctacaaatg 120
ccatcattgc gataaaggaa aggccatcgt tgaagatgcc tctgccgaca gtggtcccaa 180
agatggaccc ccacccacga ggagcatcgt ggaaaaagaa gacgttccaa ccacgtcttc 240
aaagcaagtg gattgatgtg atggtccgat gtgagacttt tcaacaaagg gtaatatccg 300
gaaacctcct cggattccat tgcccagcta tctgtcactt tattgtgaag atagtggaaa 360
aggaaggtgg ctcctacaaa tgccatcatt gcgataaagg aaaggccatc gttgaagatg 420
cctctgccga cagtggtccc aaagatggac ccccacccac gaggagcatc gtggaaaaag 480
aagacgttcc aaccacgtct tcaaagcaag tggattgatg tgatatctcc actgacgtaa 540
gggatgacgc acaatcccac tatccttcgc aagacccttc ctctatataa ggaagttcat 600
ttcatttgga gaggacacgc tgacaagctg actctagcag atcctctaga accatcttcc 660
acacactcaa gccacactat tggagaacac acagggacaa cacaccataa gatccaaggg 720
aggcctccgc cgccgccggt aaccaccccg cccctctcct ctttctttct ccgttttttt 780
ttccgtctcg gtctcgatct ttggccttgg tagtttgggt gggcgagagg cggcttcgtg 840
cgcgcccaga tcggtgcgcg ggaggggcgg gatctcgcgg ctggggctct cgccggcgtg 900
gatccggccc ggatctcgcg gggaatgggg ctctcggatg tagatctgcg atccgccgtt 960
gttgggggag atgatggggg gtttaaaatt tccgccgtgc taaacaagat caggaagagg 1020
ggaaaagggc actatggttt atatttttat atatttctgc tgcttcgtca ggcttagatg 1080
tgctagatct ttctttcttc tttttgtggg tagaatttga atccctcagc attgttcatc 1140
ggtagttttt cttttcatga tttgtgacaa atgcagcctc gtgcggagct tttttgtagg 1200
tagaagtgat caacc atg gcc ttc ttc aac cgg gtg atc acc ctc acg gtg 1251
Met Ala Phe Phe Asn Arg Val Ile Thr Leu Thr Val
1 5 10
ccg tcg tca gac gtg gtc aac tac tcg gag atc tac cag gtg gct cct 1299
Pro Ser Ser Asp Val Val Asn Tyr Ser Glu Ile Tyr Gln Val Ala Pro
15 20 25
cag tat gtc aac cag gcc ctg acc ctg gcc aag tac ttc cag ggc gcc 1347
Gln Tyr Val Asn Gln Ala Leu Thr Leu Ala Lys Tyr Phe Gln Gly Ala
30 35 40
atc gac ggc agc acc ctg agg ttc gac ttc gag aag gcg tta cag atc 1395
Ile Asp Gly Ser Thr Leu Arg Phe Asp Phe Glu Lys Ala Leu Gln Ile
45 50 55 60
gcc aac gac atc ccg cag gcc gcg gtg gtc aac acc ctg aac cag acc 1443
Ala Asn Asp Ile Pro Gln Ala Ala Val Val Asn Thr Leu Asn Gln Thr
65 70 75
gtc cag cag ggg acc gtc cag gtc agc gtc atg atc gac aag atc gtg 1491
Val Gln Gln Gly Thr Val Gln Val Ser Val Met Ile Asp Lys Ile Val
80 85 90
gac atc atg aag aat gtc ctg tcc atc gtg ata gac aac aag aag ttt 1539
Asp Ile Met Lys Asn Val Leu Ser Ile Val Ile Asp Asn Lys Lys Phe
95 100 105
tgg gat cag gtc acg gct gcc atc acc aac acc ttc acg aac ctg aac 1587
Trp Asp Gln Val Thr Ala Ala Ile Thr Asn Thr Phe Thr Asn Leu Asn
110 115 120
agc cag gag tcg gag gcc tgg atc ttc tat tac aag gag gac gcc cac 1635
Ser Gln Glu Ser Glu Ala Trp Ile Phe Tyr Tyr Lys Glu Asp Ala His
125 130 135 140
aag acg tcc tac tat tac aac atc ctc ttc gcc atc cag gac gaa gag 1683
Lys Thr Ser Tyr Tyr Tyr Asn Ile Leu Phe Ala Ile Gln Asp Glu Glu
145 150 155
acg ggt ggc gtg atg gcc acg ctg ccc atc gcc ttc gac atc agt gtg 1731
Thr Gly Gly Val Met Ala Thr Leu Pro Ile Ala Phe Asp Ile Ser Val
160 165 170
gac atc gag aag gag aag gtc ctg ttc gtg acc atc aag gac act gag 1779
Asp Ile Glu Lys Glu Lys Val Leu Phe Val Thr Ile Lys Asp Thr Glu
175 180 185
aat tac gcc gtc acc gtc aag gcg atc aac gtg gtc cag gca ctc cag 1827
Asn Tyr Ala Val Thr Val Lys Ala Ile Asn Val Val Gln Ala Leu Gln
190 195 200
tct agc agg gat tct aag gtg gtt gat gcg ttc aaa tcg cca cgg cac 1875
Ser Ser Arg Asp Ser Lys Val Val Asp Ala Phe Lys Ser Pro Arg His
205 210 215 220
tta ccc cgg aag agg cat aag att tgc tct aac tcg tga tga 1917
Leu Pro Arg Lys Arg His Lys Ile Cys Ser Asn Ser
225 230
attcggatcc aagggcgaat tctgcatgcg tttggacgta tgctcattca ggttggagcc 1977
aatttggttg atgtgtgtgc gagttcttgc gagtctgatg agacatctct gtattgtgtt 2037
tctttcccca gtgttttctg tacttgtgta atcggctaat cgccaacaga ttcggcgatg 2097
aataaatgag aaataaattg ttctgatttt gagtgcaaaa aaaaaggaat t 2148
<210>35
<211>232
<212>PRT
<213>人工序列
<220>
<223>合成构建体
<400>35
Met Ala Phe Phe Asn Arg Val Ile Thr Leu Thr Val Pro Ser Ser Asp
1 5 10 15
Val Val Asn Tyr Ser Glu Ile Tyr Gln Val Ala Pro Gln Tyr Val Asn
20 25 30
Gln Ala Leu Thr Leu Ala Lys Tyr Phe Gln Gly Ala Ile Asp Gly Ser
35 40 45
Thr Leu Arg Phe Asp Phe Glu Lys Ala Leu Gln Ile Ala Asn Asp Ile
50 55 60
Pro Gln Ala Ala Val Val Asn Thr Leu Asn Gln Thr Val Gln Gln Gly
65 70 75 80
Thr Val Gln Val Ser Val Met Ile Asp Lys Ile Val Asp Ile Met Lys
85 90 95
Asn Val Leu Ser Ile Val Ile Asp Asn Lys Lys Phe Trp Asp Gln Val
100 105 110
Thr Ala Ala Ile Thr Asn Thr Phe Thr Asn Leu Asn Ser Gln Glu Ser
115 120 125
Glu Ala Trp Ile Phe Tyr Tyr Lys Glu Asp Ala His Lys Thr Ser Tyr
130 135 140
Tyr Tyr Asn Ile Leu Phe Ala Ile Gln Asp Glu Glu Thr Gly Gly Val
145 150 155 160
Met Ala Thr Leu Pro Ile Ala Phe Asp Ile Ser Val Asp Ile Glu Lys
165 170 175
Glu Lys Val Leu Phe Val Thr Ile Lys Asp Thr Glu Asn Tyr Ala Val
180 185 190
Thr Val Lys Ala Ile Asn Val Val Gln Ala Leu Gln Ser Ser Arg Asp
195 200 205
Ser Lys Val Val Asp Ala Phe Lys Ser Pro Arg His Leu Pro Arg Lys
210 215 220
Arg His Lys Ile Cys Ser Asn Ser
225 230
<210>36
<211>2541
<212>DNA
<213>人工序列
<220>
<223>合成序列;在pMON70514中编码TIC809的表达盒
<220>
<221>其他特征
<223>pMON70514植物表达盒
<220>
<221>启动子
<222>(1)..(614)
<223>e35S
<220>
<221>5’UTR
<222>(650)..(710)
<223>小麦CAB前导序列
<220>
<221>内含子
<222>(727)..(1206)
<223>水稻肌动蛋白
<220>
<221>转运肽
<222>(1230)..(1270)
<223>Zm RbcS
<220>
<221>内含子
<222>(1371)..(1539)
<223>Zm RbcS
<220>
<221>转运肽
<222>(1540)..(1626)
<223>Zm RbcS
<220>
<221>CDS
<222>(1627)..(2328)
<223>TIC809
<220>
<221>终止子
<222>(2332)..(2541)
<223>小麦Hsp17
<400>36
ggtccgatgt gagacttttc aacaaagggt aatatccgga aacctcctcg gattccattg 60
cccagctatc tgtcacttta ttgtgaagat agtggaaaag gaaggtggct cctacaaatg 120
ccatcattgc gataaaggaa aggccatcgt tgaagatgcc tctgccgaca gtggtcccaa 180
agatggaccc ccacccacga ggagcatcgt ggaaaaagaa gacgttccaa ccacgtcttc 240
aaagcaagtg gattgatgtg atggtccgat gtgagacttt tcaacaaagg gtaatatccg 300
gaaacctcct cggattccat tgcccagcta tctgtcactt tattgtgaag atagtggaaa 360
aggaaggtgg ctcctacaaa tgccatcatt gcgataaagg aaaggccatc gttgaagatg 420
cctctgccga cagtggtccc aaagatggac ccccacccac gaggagcatc gtggaaaaag 480
aagacgttcc aaccacgtct tcaaagcaag tggattgatg tgatatctcc actgacgtaa 540
gggatgacgc acaatcccac tatccttcgc aagacccttc ctctatataa ggaagttcat 600
ttcatttgga gaggacacgc tgacaagctg actctagcag atcctctaga accatcttcc 660
acacactcaa gccacactat tggagaacac acagggacaa cacaccataa gatccaaggg 720
aggcctccgc cgccgccggt aaccaccccg cccctctcct ctttctttct ccgttttttt 780
ttccgtctcg gtctcgatct ttggccttgg tagtttgggt gggcgagagg cggcttcgtg 840
cgcgcccaga tcggtgcgcg ggaggggcgg gatctcgcgg ctggggctct cgccggcgtg 900
gatccggccc ggatctcgcg gggaatgggg ctctcggatg tagatctgcg atccgccgtt 960
gttgggggag atgatggggg gtttaaaatt tccgccgtgc taaacaagat caggaagagg 1020
ggaaaagggc actatggttt atatttttat atatttctgc tgcttcgtca ggcttagatg 1080
tgctagatct ttctttcttc tttttgtggg tagaatttga atccctcagc attgttcatc 1140
ggtagttttt cttttcatga tttgtgacaa atgcagcctc gtgcggagct tttttgtagg 1200
tagaagtgat caacctctag aggatcagca tggcgcccac cgtgatgatg gcctcgtcgg 1260
ccaccgccgt cgctccgttc ctggggctca agtccaccgc cagcctcccc gtcgcccgcc 1320
gctcctccag aagcctcggc aacgtcagca acggcggaag gatccggtgc atgcaggtaa 1380
caaatgcatc ctagctagta gttctttgca ttgcagcagc tgcagctagc gagttagtaa 1440
taggaaggga actgatgatc catgcatgga ctgatgtgtg ttgcccatcc catcccatcc 1500
catttcccaa acgaaccgaa aacaccgtac tacgtgcagg tgtggcccta cggcaacaag 1560
aagttcgaga cgctgtcgta cctgccgccg ctgtcgaccg gcgggcgcat ccgctgcatg 1620
caggcc atg gcc ttc ttc aac cgg gtg atc acc ctc acg gtg ccg tcg 1668
Met Ala Phe Phe Asn Arg Val Ile Thr Leu Thr Val Pro Ser
1 5 10
tca gac gtg gtc aac tac tcg gag atc tac cag gtg gct cct cag tat 1716
Ser Asp Val Val Asn Tyr Ser Glu Ile Tyr Gln Val Ala Pro Gln Tyr
15 20 25 30
gtc aac cag gcc ctg acc ctg gcc aag tac ttc cag ggc gcc atc gac 1764
Val Asn Gln Ala Leu Thr Leu Ala Lys Tyr Phe Gln Gly Ala Ile Asp
35 40 45
ggc agc acc ctg agg ttc gac ttc gag aag gcg tta cag atc gcc aac 1812
Gly Ser Thr Leu Arg Phe Asp Phe Glu Lys Ala Leu Gln Ile Ala Asn
50 55 60
gac atc ccg cag gcc gcg gtg gtc aac acc ctg aac cag acc gtc cag 1860
Asp Ile Pro Gln Ala Ala Val Val Asn Thr Leu Asn Gln Thr Val Gln
65 70 75
cag ggg acc gtc cag gtc agc gtc atg atc gac aag atc gtg gac atc 1908
Gln Gly Thr Val Gln Val Ser Val Met Ile Asp Lys Ile Val Asp Ile
80 85 90
atg aag aat gtc ctg tcc atc gtg ata gac aac aag aag ttt tgg gat 1956
Met Lys Asn Val Leu Ser Ile Val Ile Asp Asn Lys Lys Phe Trp Asp
95 100 105 110
cag gtc acg gct gcc atc acc aac acc ttc acg aac ctg aac agc cag 2004
Gln Val Thr Ala Ala Ile Thr Asn Thr Phe Thr Asn Leu Asn Ser Gln
115 120 125
gag tcg gag gcc tgg atc ttc tat tac aag gag gac gcc cac aag acg 2052
Glu Ser Glu Ala Trp Ile Phe Tyr Tyr Lys Glu Asp Ala His Lys Thr
130 135 140
tcc tac tat tac aac atc ctc ttc gcc atc cag gac gaa gag acg ggt 2100
Ser Tyr Tyr Tyr Asn Ile Leu Phe Ala Ile Gln Asp Glu Glu Thr Gly
145 150 155
ggc gtg atg gcc acg ctg ccc atc gcc ttc gac atc agt gtg gac atc 2148
Gly Val Met Ala Thr Leu Pro Ile Ala Phe Asp Ile Ser Val Asp Ile
160 165 170
gag aag gag aag gtc ctg ttc gtg acc atc aag gac act gag aat tac 2196
Glu Lys Glu Lys Val Leu Phe Val Thr Ile Lys Asp Thr Glu Asn Tyr
175 180 185 190
gcc gtc acc gtc aag gcg atc aac gtg gtc cag gca ctc cag tct agc 2244
Ala Val Thr Val Lys Ala Ile Asn Val Val Gln Ala Leu Gln Ser Ser
195 200 205
agg gat tct aag gtg gtt gat gcg ttc aaa tcg cca cgg cac tta ccc 2292
Arg Asp Ser Lys Val Val Asp Ala Phe Lys Ser Pro Arg His Leu Pro
210 215 220
cgg aag agg cat aag att tgc tct aac tcg tga tga attctgcatg 2338
Arg Lys Arg His Lys Ile Cys Ser Asn Ser
225 230
cgtttggacg tatgctcatt caggttggag ccaatttggt tgatgtgtgt gcgagttctt 2398
gcgagtctga tgagacatct ctgtattgtg tttctttccc cagtgttttc tgtacttgtg 2458
taatcggcta atcgccaaca gattcggcga tgaataaatg agaaataaat tgttctgatt 2518
ttgagtgcaa aaaaaaagga att 2541
<210>37
<211>232
<212>PRT
<213>人工序列
<220>
<223>合成构建体
<400>37
Met Ala Phe Phe Asn Arg Val Ile Thr Leu Thr Val Pro Ser Ser Asp
1 5 10 15
Val Val Asn Tyr Ser Glu Ile Tyr Gln Val Ala Pro Gln Tyr Val Asn
20 25 30
Gln Ala Leu Thr Leu Ala Lys Tyr Phe Gln Gly Ala Ile Asp Gly Ser
35 40 45
Thr Leu Arg Phe Asp Phe Glu Lys Ala Leu Gln Ile Ala Asn Asp Ile
50 55 60
Pro Gln Ala Ala Val Val Asn Thr Leu Asn Gln Thr Val Gln Gln Gly
65 70 75 80
Thr Val Gln Val Ser Val Met Ile Asp Lys Ile Val Asp Ile Met Lys
85 90 95
Asn Val Leu Ser Ile Val Ile Asp Asn Lys Lys Phe Trp Asp Gln Val
100 105 110
Thr Ala Ala Ile Thr Asn Thr Phe Thr Asn Leu Asn Ser Gln Glu Ser
115 120 125
Glu Ala Trp Ile Phe Tyr Tyr Lys Glu Asp Ala His Lys Thr Ser Tyr
130 135 140
Tyr Tyr Asn Ile Leu Phe Ala Ile Gln Asp Glu Glu Thr Gly Gly Val
145 150 155 160
Met Ala Thr Leu Pro Ile Ala Phe Asp Ile Ser Val Asp Ile Glu Lys
165 170 175
Glu Lys Val Leu Phe Val Thr Ile Lys Asp Thr Glu Asn Tyr Ala Val
180 185 190
Thr Val Lys Ala Ile Asn Val Val Gln Ala Leu Gln Ser Ser Arg Asp
195 200 205
Ser Lys Val Val Asp Ala Phe Lys Ser Pro Arg His Leu Pro Arg Lys
210 215 220
Arg His Lys Ile Cys Ser Asn Ser
225 230
<210>38
<211>4083
<212>DNA
<213>人工序列
<220>
<223>合成序列;在pMON64144中编码TIC809的表达盒
<220>
<221>其他特征
<223>pMON64144植物表达盒
<220>
<221>启动子
<222>(1)..(1844)
<223>水稻Rcc3
<220>
<221>5’UTR
<222>(1845)..(1943)
<223>水稻Rcc3
<220>
<221>内含子
<222>(1952)..(2755)
<223>HSP70
<220>
<221>转运肽
<222>(2772)..(2912)
<223>Zm RbcS
<220>
<221>内含子
<222>(2913)..(3081)
<223>Zm RbcS
<220>
<221>转运肽
<222>(3082)..(3168)
<223>Zm RbcS
<220>
<221>CDS
<222>(3169)..(3870)
<223>TIC809
<220>
<221>终止子
<222>(3874)..(4083)
<223>小麦Hsp17
<400>38
gcaatcaacc aacatatact gaatatggga aagtttcttt tagcttttct aaattaagta 60
ctgattctta aacttaagtg agaatctagc ctgttcaggg gcgacggcta aaggacatag 120
caccactagt ctacgcgatt gcaaaaaaga agaatgcaag cctgcaacaa gtatcgcttt 180
cccgaccaat ggttggttga cctcggtttg ccggtaacct caggctggac gacagaacta 240
attagccaac ttgtcaatgt ctagggtgct gttcatagcc tgcagttgac agagtacgaa 300
aaggacaaga tcacatggaa gctaactagt cacggcgaat acatgacgac atcggcctac 360
aacgcacaac ttcttggcat aaaagcttca atttcaatgc ccctatctgg aagccctagg 420
cgccgcgcaa atgtaaaaca ttcgcttcgc ttggcttgtt atccaaaata gagtatggac 480
ctccgacaga ttggcaaccc gtgggtaatc gaaaatggct ccatctgccc ctttgtcgaa 540
ggaatcagga aacggccctc acctcctggc ggagtgtaga tatgtgaaag aatctaggcg 600
acacttgcag actggacaac atgtgaacaa ataagaccaa cgttatggca acaagcctcg 660
acgctactca agtggtggga ggccaccgca tgttccaacg aagcgccaaa gaaagccttg 720
cagactctaa tgctattagt cgcctaggat atttggaatg aaaggaaccg cagagttttt 780
cagcaccaag agcttccggt ggctagtctg atagccaaaa ttaaggagga tgccaaaaca 840
tgggtcttgg cgggcgcgaa acaccttgat aggtggctta ccttttaaca tgttcgggcc 900
aaaggccttg agacggtaaa gttttctatt tgcgcttgcg catgtacaat tttattcctc 960
tattcaatga aattggtggc tcactggttc attaaaaaaa aaagaatcta gcctgttcgg 1020
gaagaagagg attttgttcg tgagagagag agagagagag agagagagag agagagagaa 1080
ggaggaggag gattttcagg cttcgcattg cccaacctct gcttctgttg gcccaagaag 1140
aatcccaggc gcccatgggc tggcagttta ccacggacct acctagccta ccttagctat 1200
ctaagcgggc cgacctagta gccacgtgcc tagtgtagat taaagttgcc gggccagcag 1260
gaagccacgc tgcaatggca tcttcccctg tccttcgcgt acgtgaaaac aaacccaggt 1320
aagcttagaa tcttcttgcc cgttggactg ggacacccac caatcccacc atgccccgat 1380
attcctccgg tctcggttca tgtgatgtcc tctcttgtgt gatcacggag caagcattct 1440
taaacggcaa aagaaaatca ccaacttgct cacgcagtca cgctgcaccg cgcgaagcga 1500
cgcccgatag gccaagatcg cgagataaaa taacaaccaa tgatcataag gaaacaagcc 1560
cgcgatgtgt cgtgtgcagc aatcttggtc atttgcggga tcgagtgctt cacagctaac 1620
caaatattcg gccgatgatt taacacatta tcagcgtaga tgtacgtacg atttgttaat 1680
taatctacga gccttgctag ggcaggtgtt ctgccagcca atccagatcg ccctcgtatg 1740
cacgctcaca tgatggcagg gcagggttca catgagctct aacggtcgat taattaatcc 1800
cggggctcga ctataaatac ctccctaatc ccatgatcaa aaccatctca agcagcctaa 1860
tcatctccag ctgatcaaga gctcttaatt agctagctag tgattagctg cgcttgtgat 1920
cgatcgatct cgggtacgta gcaatagatc taccgtcttc ggtacgcgct cactccgccc 1980
tctgcctttg ttactgccac gtttctctga atgctctctt gtgtggtgat tgctgagagt 2040
ggtttagctg gatctagaat tacactctga aatcgtgttc tgcctgtgct gattacttgc 2100
cgtcctttgt agcagcaaaa tatagggaca tggtagtacg aaacgaagat agaacctaca 2160
cagcaatacg agaaatgtgt aatttggtgc ttagcggtat ttatttaagc acatgttggt 2220
gttatagggc acttggattc agaagtttgc tgttaattta ggcacaggct tcatactaca 2280
tgggtcaata gtatagggat tcatattata ggcgatacta taataatttg ttcgtctgca 2340
gagcttatta tttgccaaaa ttagatattc ctattctgtt tttgtttgtg tgctgttaaa 2400
ttgttaacgc ctgaaggaat aaatataaat gacgaaattt tgatgtttat ctctgctcct 2460
ttattgtgac cataagtcaa gatcagatgc acttgtttta aatattgttg tctgaagaaa 2520
taagtactga cagtattttg atgcattgat ctgcttgttt gttgtaacaa aatttaaaaa 2580
taaagagttt cctttttgtt gctctcctta cctcctgatg gtatctagta tctaccaact 2640
gacactatat tgcttctctt tacatacgta tcttgctcga tgccttctcc ctagtgttga 2700
ccagtgttac tcacatagtc tttgctcatt tcattgtaat gcagatacca agcggcctct 2760
agaggatcag catggcgccc accgtgatga tggcctcgtc ggccaccgcc gtcgctccgt 2820
tcctggggct caagtccacc gccagcctcc ccgtcgcccg ccgctcctcc agaagcctcg 2880
gcaacgtcag caacggcgga aggatccggt gcatgcaggt aacaaatgca tcctagctag 2940
tagttctttg cattgcagca gctgcagcta gcgagttagt aataggaagg gaactgatga 3000
tccatgcatg gactgatgtg tgttgcccat cccatcccat cccatttccc aaacgaaccg 3060
aaaacaccgt actacgtgca ggtgtggccc tacggcaaca agaagttcga gacgctgtcg 3120
tacctgccgc cgctgtcgac cggcgggcgc atccgctgca tgcaggcc atg gcc ttc 3177
Met Ala Phe
1
ttc aac cgg gtg atc acc ctc acg gtg ccg tcg tca gac gtg gtc aac 3225
Phe Asn Arg Val Ile Thr Leu Thr Val Pro Ser Ser Asp Val Val Asn
5 10 15
tac tcg gag atc tac cag gtg gct cct cag tat gtc aac cag gcc ctg 3273
Tyr Ser Glu Ile Tyr Gln Val Ala Pro Gln Tyr Val Asn Gln Ala Leu
20 25 30 35
acc ctg gcc aag tac ttc cag ggc gcc atc gac ggc agc acc ctg agg 3321
Thr Leu Ala Lys Tyr Phe Gln Gly Ala Ile Asp Gly Ser Thr Leu Arg
40 45 50
ttc gac ttc gag aag gcg tta cag atc gcc aac gac atc ccg cag gcc 3369
Phe Asp Phe Glu Lys Ala Leu Gln Ile Ala Asn Asp Ile Pro Gln Ala
55 60 65
gcg gtg gtc aac acc ctg aac cag acc gtc cag cag ggg acc gtc cag 3417
Ala Val Val Asn Thr Leu Asn Gln Thr Val Gln Gln Gly Thr Val Gln
70 75 80
gtc agc gtc atg atc gac aag atc gtg gac atc atg aag aat gtc ctg 3465
Val Ser Val Met Ile Asp Lys Ile Val Asp Ile Met Lys Asn Val Leu
85 90 95
tcc atc gtg ata gac aac aag aag ttt tgg gat cag gtc acg gct gcc 3513
Ser Ile Val Ile Asp Asn Lys Lys Phe Trp Asp Gln Val Thr Ala Ala
100 105 110 115
atc acc aac acc ttc acg aac ctg aac agc cag gag tcg gag gcc tgg 3561
Ile Thr Asn Thr Phe Thr Asn Leu Asn Ser Gln Glu Ser Glu Ala Trp
120 125 130
atc ttc tat tac aag gag gac gcc cac aag acg tcc tac tat tac aac 3609
Ile Phe Tyr Tyr Lys Glu Asp Ala His Lys Thr Ser Tyr Tyr Tyr Asn
135 140 145
atc ctc ttc gcc atc cag gac gaa gag acg ggt ggc gtg atg gcc acg 3657
Ile Leu Phe Ala Ile Gln Asp Glu Glu Thr Gly Gly Val Met Ala Thr
150 155 160
ctg ccc atc gcc ttc gac atc agt gtg gac atc gag aag gag aag gtc 3705
Leu Pro Ile Ala Phe Asp Ile Ser Val Asp Ile Glu Lys Glu Lys Val
165 170 175
ctg ttc gtg acc atc aag gac act gag aat tac gcc gtc acc gtc aag 3753
Leu Phe Val Thr Ile Lys Asp Thr Glu Asn Tyr Ala Val Thr Val Lys
180 185 190 195
gcg atc aac gtg gtc cag gca ctc cag tct agc agg gat tct aag gtg 3801
Ala Ile Asn Val Val Gln Ala Leu Gln Ser Ser Arg Asp Ser Lys Val
200 205 210
gtt gat gcg ttc aaa tcg cca cgg cac tta ccc cgg aag agg cat aag 3849
Val Asp Ala Phe Lys Ser Pro Arg His Leu Pro Arg Lys Arg His Lys
215 220 225
att tgc tct aac tcg tga tga attctgcatg cgtttggacg tatgctcatt 3900
Ile Cys Ser Asn Ser
230
caggttggag ccaatttggt tgatgtgtgt gcgagttctt gcgagtctga tgagacatct 3960
ctgtattgtg tttctttccc cagtgttttc tgtacttgtg taatcggcta atcgccaaca 4020
gattcggcga tgaataaatg agaaataaat tgttctgatt ttgagtgcaa aaaaaaagga 4080
att 4083
<210>39
<211>232
<212>PRT
<213>人工序列
<220>
<223>合成构建体
<400>39
Met Ala Phe Phe Asn Arg Val Ile Thr Leu Thr Val Pro Ser Ser Asp
1 5 10 15
Val Val Asn Tyr Ser Glu Ile Tyr Gln Val Ala Pro Gln Tyr Val Asn
20 25 30
Gln Ala Leu Thr Leu Ala Lys Tyr Phe Gln Gly Ala Ile Asp Gly Ser
35 40 45
Thr Leu Arg Phe Asp Phe Glu Lys Ala Leu Gln Ile Ala Asn Asp Ile
50 55 60
Pro Gln Ala Ala Val Val Asn Thr Leu Asn Gln Thr Val Gln Gln Gly
65 70 75 80
Thr Val Gln Val Ser Val Met Ile Asp Lys Ile Val Asp Ile Met Lys
85 90 95
Asn Val Leu Ser Ile Val Ile Asp Asn Lys Lys Phe Trp Asp Gln Val
100 105 110
Thr Ala Ala Ile Thr Asn Thr Phe Thr Asn Leu Asn Ser Gln Glu Ser
115 120 125
Glu Ala Trp Ile Phe Tyr Tyr Lys Glu Asp Ala His Lys Thr Ser Tyr
130 135 140
Tyr Tyr Asn Ile Leu Phe Ala Ile Gln Asp Glu Glu Thr Gly Gly Val
145 150 155 160
Met Ala Thr Leu Pro Ile Ala Phe Asp Ile Ser Val Asp Ile Glu Lys
165 170 175
Glu Lys Val Leu Phe Val Thr Ile Lys Asp Thr Glu Asn Tyr Ala Val
180 185 190
Thr Val Lys Ala Ile Asn Val Val Gln Ala Leu Gln Ser Ser Arg Asp
195 200 205
Ser Lys Val Val Asp Ala Phe Lys Ser Pro Arg His Leu Pro Arg Lys
210 215 220
Arg His Lys Ile Cys Ser Asn Ser
225 230
<210>40
<211>6641
<212>DNA
<213>人工序列
<220>
<223>合成序列;在pMON64150中编码TIC809和TIC810的表达盒
<220>
<221>其他特征
<223>pMON64150第一和第二植物表达盒
<220>
<221>启动子
<222>(1)..(1844)
<223>水稻Rcc3
<220>
<221>5’UTR
<222>(1845)..(1943)
<223>水稻Rcc3前导序列
<220>
<221>内含子
<222>(1952)..(2755)
<223>HSP70内含子
<220>
<221>转运肽
<222>(2772)..(2912)
<223>Zm RbcS
<220>
<221>内含子
<222>(2913)..(3081)
<223>Zm RbcS
<220>
<221>转运肽
<222>(3082)..(3168)
<223>Zm RbcS
<220>
<221>CDS
<222>(3169)..(3870)
<223>TIC809
<220>
<221>终止子
<222>(3874)..(4083)
<223>小麦Hsp17
<220>
<221>启动子
<222>(4139)..(4750)
<223>e35S
<220>
<221>5’UTR
<222>(4751)..(4759)
<223>前导序列
<220>
<221>5’UTR
<222>(4786)..(4846)
<223>小麦CAB前导序列
<220>
<221>内含子
<222>(4863)..(5342)
<223>水稻肌动蛋白
<220>
<221>转运肽
<222>(5366)..(5506)
<223>Zm.RbcS
<220>
<221>内含子
<222>(5507)..(5676)
<223>Zm.RhcS
<220>
<221>转运肽
<222>(5677)..(5766)
<223>Zm RbcS
<220>
<221>CDS
<222>(5767)..(6426)
<223>TIC810
<220>
<221>终止子
<222>(6432)..(6641)
<223>小麦Hsp17
<400>40
gcaatcaacc aacatatact gaatatggga aagtttcttt tagcttttct aaattaagta 60
ctgattctta aacttaagtg agaatctagc ctgttcaggg gcgacggcta aaggacatag 120
caccactagt ctacgcgatt gcaaaaaaga agaatgcaag cctgcaacaa gtatcgcttt 180
cccgaccaat ggttggttga cctcggtttg ccggtaacct caggctggac gacagaacta 240
attagccaac ttgtcaatgt ctagggtgct gttcatagcc tgcagttgac agagtacgaa 300
aaggacaaga tcacatggaa gctaactagt cacggcgaat acatgacgac atcggcctac 360
aacgcacaac ttcttggcat aaaagcttca atttcaatgc ccctatctgg aagccctagg 420
cgccgcgcaa atgtaaaaca ttcgcttcgc ttggcttgtt atccaaaata gagtatggac 480
ctccgacaga ttggcaaccc gtgggtaatc gaaaatggct ccatctgccc ctttgtcgaa 540
ggaatcagga aacggccctc acctcctggc ggagtgtaga tatgtgaaag aatctaggcg 600
acacttgcag actggacaac atgtgaacaa ataagaccaa cgttatggca acaagcctcg 660
acgctactca agtggtggga ggccaccgca tgttccaacg aagcgccaaa gaaagccttg 720
cagactctaa tgctattagt cgcctaggat atttggaatg aaaggaaccg cagagttttt 780
cagcaccaag agcttccggt ggctagtctg atagccaaaa ttaaggagga tgccaaaaca 840
tgggtcttgg cgggcgcgaa acaccttgat aggtggctta ccttttaaca tgttcgggcc 900
aaaggccttg agacggtaaa gttttctatt tgcgcttgcg catgtacaat tttattcctc 960
tattcaatga aattggtggc tcactggttc attaaaaaaa aaagaatcta gcctgttcgg 1020
gaagaagagg attttgttcg tgagagagag agagagagag agagagagag agagagagaa 1080
ggaggaggag gattttcagg cttcgcattg cccaacctct gcttctgttg gcccaagaag 1140
aatcccaggc gcccatgggc tggcagtt a ccacggacct acctagccta ccttagctat 1200
ctaagcgggc cgacctagta gccacgtgcc tagtgtagat taaagttgcc gggccagcag 1260
gaagccacgc tgcaatggca tcttcccctg tccttcgcgt acgtgaaaac aaacccaggt 1320
aagcttagaa tcttcttgcc cgttggactg ggacacccac caatcccacc atgccccgat 1380
attcctccgg tctcggttca tgtgatgtcc tctcttgtgt gatcacggag caagcattct 1440
taaacggcaa aagaaaatca ccaacttgct cacgcagtca cgctgcaccg cgcgaagcga 1500
cgcccgatag gccaagatcg cgagataaaa taacaaccaa tgatcataag gaaacaagcc 1560
cgcgatgtgt cgtgtgcagc aatcttggtc atttgcggga tcgagtgctt cacagctaac 1620
caaatattcg gccgatgatt taacacatta tcagcgtaga tgtacgtacg atttgttaat 1680
taatctacga gccttgctag ggcaggtgtt ctgccagcca atccagatcg ccctcgtatg 1740
cacgctcaca tgatggcagg gcagggttca catgagctct aacggtcgat taattaatcc 1800
cggggctcga ctataaatac ctccctaatc ccatgatcaa aaccatctca agcagcctaa 1860
tcatctccag ctgatcaaga gctcttaatt agctagctag tgattagctg cgcttgtgat 1920
cgatcgatct cgggtacgta gcaatagatc taccgtcttc ggtacgcgct cactccgccc 1980
tctgcctttg ttactgccac gtttctctga atgctctctt gtgtggtgat tgctgagagt 2040
ggtttagctg gatctagaat tacactctga aatcgtgttc tgcctgtgct gattacttgc 2100
cgtcctttgt agcagcaaaa tatagggaca tggtagtacg aaacgaagat agaacctaca 2160
cagcaatacg agaaatgtgt aatttggtgc ttagcggtat ttatttaagc acatgttggt 2220
gttatagggc acttggattc agaagtttgc tgttaattta ggcacaggct tcatactaca 2280
tgggtcaata gtatagggat tcatattata ggcgatacta taataatttg ttcgtctgca 2340
gagcttatta tttgccaaaa ttagatattc ctattctgtt tttgtttgtg tgctgttaaa 2400
ttgttaacgc ctgaaggaat aaatataaat gacgaaattt tgatgtttat ctctgctcct 2460
ttattgtgac cataagtcaa gatcagatgc acttgtttta aatattgttg tctgaagaaa 2520
taagtactga cagtattttg atgcattgat ctgcttgttt gttgtaacaa aatttaaaaa 2580
taaagagttt cctttttgtt gctctcctta cctcctgatg gtatctagta tctaccaact 2640
gacactatat tgcttctctt tacatacgta tcttgctcga tgccttctcc ctagtgttga 2700
ccagtgttac tcacatagtc tttgctcatt tcattgtaat gcagatacca agcggcctct 2760
agaggatcag catggcgccc accgtgatga tggcctcgtc ggccaccgcc gtcgctccgt 2820
tcctggggct caagtccacc gccagcctcc ccgtcgcccg ccgctcctcc agaagcctcg 2880
gcaacgtcag caacggcgga aggatccggt gcatgcaggt aacaaatgca tcctagctag 2940
tagttctttg cattgcagca gctgcagcta gcgagttagt aataggaagg gaactgatga 3000
tccatgcatg gactgatgtg tgttgcccat cccatcccat cccatttccc aaacgaaccg 3060
aaaacaccgt actacgtgca ggtgtggccc tacggcaaca agaagttcga gacgctgtcg 3120
tacctgccgc cgctgtcgac cggcgggcgc atccgctgca tgcaggcc atg gcc ttc 3177
Met Ala Phe
1
ttc aac cgg gtg atc acc ctc acg gtg ccg tcg tca gac gtg gtc aac 3225
Phe Asn Arg Val Ile Thr Leu Thr Val Pro Ser Ser Asp Val Val Asn
5 10 15
tac tcg gag atc tac cag gtg gct cct cag tat gtc aac cag gcc ctg 3273
Tyr Ser Glu Ile Tyr Gln Val Ala Pro Gln Tyr Val Asn Gln Ala Leu
20 25 30 35
acc ctg gcc aag tac ttc cag ggc gcc atc gac ggc agc acc ctg agg 3321
Thr Leu Ala Lys Tyr Phe Gln Gly Ala Ile Asp Gly Ser Thr Leu Arg
40 45 50
ttc gac ttc gag aag gcg tta cag atc gcc aac gac atc ccg cag gcc 3369
Phe Asp Phe Glu Lys Ala Leu Gln Ile Ala Asn Asp Ile Pro Gln Ala
55 60 65
gcg gtg gtc aac acc ctg aac cag acc gtc cag cag ggg acc gtc cag 3417
Ala Val Val Asn Thr Leu Asn Gln Thr Val Gln Gln Gly Thr Val Gln
70 75 80
gtc agc gtc atg atc gac aag atc gtg gac atc atg aag aat gtc ctg 3465
Val Ser Val Met Ile Asp Lys Ile Val Asp Ile Met Lys Asn Val Leu
85 90 95
tcc atc gtg ata gac aac aag aag ttt tgg gat cag gtc acg gct gcc 3513
Ser Ile Val Ile Asp Asn Lys Lys Phe Trp Asp Gln Val Thr Ala Ala
100 105 110 115
atc acc aac acc ttc acg aac ctg aac agc cag gag tcg gag gcc tgg 3561
Ile Thr Asn Thr Phe Thr Asn Leu Asn Ser Gln Glu Ser Glu Ala Trp
120 125 130
atc ttc tat tac aag gag gac gcc cac aag acg tcc tac tat tac aac 3609
Ile Phe Tyr Tyr Lys Glu Asp Ala His Lys Thr Ser Tyr Tyr Tyr Asn
135 140 145
atc ctc ttc gcc atc cag gac gaa gag acg ggt ggc gtg atg gcc acg 3657
Ile Leu Phe Ala Ile Gln Asp Glu Glu Thr Gly Gly Val Met Ala Thr
150 155 160
ctg ccc atc gcc ttc gac atc agt gtg gac atc gag aag gag aag gtc 3705
Leu Pro Ile Ala Phe Asp Ile Ser Val Asp Ile Glu Lys Glu Lys Val
165 170 175
ctg ttc gtg acc atc aag gac act gag aat tac gcc gtc acc gtc aag 3753
Leu Phe Val Thr Ile Lys Asp Thr Glu Asn Tyr Ala Val Thr Val Lys
180 185 190 195
gcg atc aac gtg gtc cag gca ctc cag tct agc agg gat tct aag gtg 3801
Ala Ile Asn Val Val Gln Ala Leu Gln Ser Ser Arg Asp Ser Lys Val
200 205 210
gtt gat gcg ttc aaa tcg cca cgg cac tta ccc cgg aag agg cat aag 3849
Val Asp Ala Phe Lys Ser Pro Arg His Leu Pro Arg Lys Arg His Lys
215 220 225
att tgc tct aac tcg tga tga attctgcatg cgtttggacg tatgctcatt 3900
Ile Cys Ser Asn Ser
230
caggttggag ccaatttggt tgatgtgtgt gcgagttctt gcgagtctga tgagacatct 3960
ctgtattgtg tttctttccc cagtgttttc tgtacttgtg taatcggcta atcgccaaca 4020
gattcggcga tgaataaatg agaaataaat tgttctgatt ttgagtgcaa aaaaaaagga 4080
attagatctg tgtgtgtttt ttggatcccc ggggcggccg cgttaacaag cttctgcagg 4140
tccgattgag acttttcaac aaagggtaat atccggaaac ctcctcggat tccattgccc 4200
agctatctgt cactttattg tgaagatagt ggaaaaggaa ggtggctcct acaaatgcca 4260
tcattgcgat aaaggaaagg ccatcgttga agatgcctct gccgacagtg gtcccaaaga 4320
tggaccccca cccacgagga gcatcgtgga aaaagaagac gttccaacca cgtcttcaaa 4380
gcaagtggat tgatgtgatg gtccgattga gacttttcaa caaagggtaa tatccggaaa 4440
cctcctcgga ttccattgcc cagctatctg tcactttatt gtgaagatag tggaaaagga 4500
aggtggctcc tacaaatgcc atcattgcga taaaggaaag gccatcgttg aagatgcctc 4560
tgccgacagt ggtcccaaag atggaccccc acccacgagg agcatcgtgg aaaaagaaga 4620
cgttccaacc acgtcttcaa agcaagtgga ttgatgtgat atctccactg acgtaaggga 4680
tgacgcacaa tcccactatc cttcgcaaga cccttcctct atataaggaa gttcatttca 4740
tttggagagg acacgctgac aagctgactc tagcagatcc tctagaacca tcttccacac 4800
actcaagcca cactattgga gaacacacag ggacaacaca ccataagatc caagggaggc 4860
ctccgccgcc gccggtaacc accccgcccc tctcctcttt ctttctccgt ttttttttcc 4920
gtctcggtct cgatctttgg ccttggtagt ttgggtgggc gagaggcggc ttcgtgcgcg 4980
cccagatcgg tgcgcgggag gggcgggatc tcgcggctgg ggctctcgcc ggcgtggatc 5040
cggcccggat ctcgcgggga atggggctct cggatgtaga tctgcgatcc gccgttgttg 5100
ggggagatga tggggggttt aaaatttccg ccgtgctaaa caagatcagg aagaggggaa 5160
aagggcacta tggtttatat ttttatatat ttctgctgct tcgtcaggct tagatgtgct 5220
agatctttct ttcttctttt tgtgggtaga atttgaatcc ctcagcattg ttcatcggta 5280
gtttttcttt tcatgatttg tgacaaatgc agcctcgtgc ggagcttttt tgtaggtaga 5340
agtgatcaac ctctagagga tcagcatggc gcccaccgtg atgatggcct cgtcggccac 5400
cgccgtcgct ccgttccagg ggctcaagtc caccgccagc ctccccgtcg cccgccgctc 5460
ctccagaagc ctcggcaacg tcagcaacgg cggaaggatc cggtgcatgc aggtaacaaa 5520
tgcatcctag ctagtagttc tttgcattgc agcagctgca gctagcgagt tagtaatagg 5580
aagggaactg atgatccatg catggactga tgtgtgttgc ccatcccatc ccatttccca 5640
accccaaacg aaccaaaaca cacgtactac gtgcaggtgt ggccggccta cggcaacaag 5700
aagttcgaga cgctgtcgta cctgccgccg ctgtcgaccg gcgggcgcat ccgctgcatg 5760
caggcc atg agc aaa gaa atc agg ctc aac ctt tct cgt gag agc ggc 5808
Met Ser Lys Glu Ile Arg Leu Asn Leu Ser Arg Glu Ser Gly
235 240 245
gcc gac ctg tac ctc aag atc ctc gcc ttc gtg aag ccc gag cac ttc 5856
Ala Asp Leu Tyr Leu Lys Ile Leu Ala Phe Val Lys Pro Glu His Phe
250 255 260
ttt cag gcg tac ctc ctg tgc cgc gag ttc gag agc atc gtg gat cct 5904
Phe Gln Ala Tyr Leu Leu Cys Arg Glu Phe Glu Ser Ile Val Asp Pro
265 270 275
aca acc cgc gag tct gac ttc gac aag acg ctg acc atc gtg aag tcg 5952
Thr Thr Arg Glu Ser Asp Phe Asp Lys Thr Leu Thr Ile Val Lys Ser
280 285 290
gac tcc acc ctc gtg acc gtg ggc acg atg aac acc aag ctg gtc aat 6000
Asp Ser Thr Leu Val Thr Val Gly Thr Met Asn Thr Lys Leu Val Asn
295 300 305 310
agc caa gag atc ctc gtg tcg gac ttg atc act caa gtc ggt tcc cag 6048
Ser Gln Glu Ile Leu Val Ser Asp Leu Ile Thr Gln Val Gly Ser Gln
315 320 325
atc gcc gat acc ctc ggc atc acg gac atc gac gcc aac acc cag caa 6096
Ile Ala Asp Thr Leu Gly Ile Thr Asp Ile Asp Ala Asn Thr Gln Gln
330 335 340
cag ctc acg gag ctg atc ggc aac ctc ttc gtg aac ctc aat tcc caa 6144
Gln Leu Thr Glu Leu Ile Gly Asn Leu Phe Val Asn Leu Asn Ser Gln
345 350 355
gtt cag gag tac atc tac ttc tac gag gag aag gag aag cag acc tcc 6192
Val Gln Glu Tyr Ile Tyr Phe Tyr Glu Glu Lys Glu Lys Gln Thr Ser
360 365 370
tac cgc tac aac atc ctc ttc gtg ttc gaa aag gag tcg ttc atc acc 6240
Tyr Arg Tyr Asn Ile Leu Phe Val Phe Glu Lys Glu Ser Phe Ile Thr
375 380 385 390
att ctg cca atg ggc ttc gac gtg acc gtg aac acg aac aag gag gcc 6288
Ile Leu Pro Met Gly Phe Asp Val Thr Val Asn Thr Asn Lys Glu Ala
395 400 405
gtc ctg aag ctg acc ccg aag gac aag gtt acc tac ggc cac gtc agc 6336
Val Leu Lys Leu Thr Pro Lys Asp Lys Val Thr Tyr Gly His Val Ser
410 415 420
gtc aag gcc ctc aac atc atc cag ctc att acg gag gac aag ttc aac 6384
Val Lys Ala Leu Asn Ile Ile Gln Leu Ile Thr Glu Asp Lys Phe Asn
425 430 435
ttc ctc gca acc ctc aag aag gct ctc aag acc ctg tga tga 6426
Phe Leu Ala Thr Leu Lys Lys Ala Leu Lys Thr Leu
440 445 450
gaattctgca tgcgtttgga cgtatgctca ttcaggttgg agccaatttg gttgatgtgt 6486
gtgcgagttc ttgcgagtct gatgagacat ctctgtattg tgtttctttc cccagtgttt 6546
tctgtacttg tgtaatcggc taatcgccaa cagattcggc gatgaataaa tgagaaataa 6606
attgttctga ttttgagtgc aaaaaaaaag gaatt 6641
<210>41
<211>232
<212>PRT
<213>人工序列
<220>
<223>合成构建体
<400>41
Met Ala Phe Phe Asn Arg Val Ile Thr Leu Thr Val Pro Ser Ser Asp
1 5 10 15
Val Val Asn Tyr Ser Glu Ile Tyr Gln Val Ala Pro Gln Tyr Val Asn
20 25 30
Gln Ala Leu Thr Leu Ala Lys Tyr Phe Gln Gly Ala Ile Asp Gly Ser
35 40 45
Thr Leu Arg Phe Asp Phe Glu Lys Ala Leu Gln Ile Ala Asn Asp Ile
50 55 60
Pro Gln Ala Ala Val Val Asn Thr Leu Asn Gln Thr Val Gln Gln Gly
65 70 75 80
Thr Val Gln Val Ser Val Met Ile Asp Lys Ile Val Asp Ile Met Lys
85 90 95
Asn Val Leu Ser Ile Val Ile Asp Asn Lys Lys Phe Trp Asp Gln Val
100 105 110
Thr Ala Ala Ile Thr Asn Thr Phe Thr Asn Leu Asn Ser Gln Glu Ser
115 120 125
Glu Ala Trp Ile Phe Tyr Tyr Lys Glu Asp Ala His Lys Thr Ser Tyr
130 135 140
Tyr Tyr Asn Ile Leu Phe Ala Ile Gln Asp Glu Glu Thr Gly Gly Val
145 150 155 160
Met Ala Thr Leu Pro Ile Ala Phe Asp Ile Ser Val Asp Ile Glu Lys
165 170 175
Glu Lys Val Leu Phe Val Thr Ile Lys Asp Thr Glu Asn Tyr Ala Val
180 185 190
Thr Val Lys Ala Ile Asn Val Val Gln Ala Leu Gln Ser Ser Arg Asp
195 200 205
Ser Lys Val Val Asp Ala Phe Lys Ser Pro Arg His Leu Pro Arg Lys
210 215 220
Arg His Lys Ile Cys Ser Asn Ser
225 230
<210>42
<211>218
<212>PRT
<213>人工序列
<220>
<223>合成构建体
<400>42
Met Ser Lys Glu Ile Arg Leu Asn Leu Ser Arg Glu Ser Gly Ala Asp
1 5 10 15
Leu Tyr Leu Lys Ile Leu Ala Phe Val Lys Pro Glu His Phe Phe Gln
20 25 30
Ala Tyr Leu Leu Cys Arg Glu Phe Glu Ser Ile Val Asp Pro Thr Thr
35 40 45
Arg Glu Ser Asp Phe Asp Lys Thr Leu Thr Ile Val Lys Ser Asp Ser
50 55 60
Thr Leu Val Thr Val Gly Thr Met Asn Thr Lys Leu Val Asn Ser Gln
65 70 75 80
Glu Ile Leu Val Ser Asp Leu Ile Thr Gln Val Gly Ser Gln Ile Ala
85 90 95
Asp Thr Leu Gly Ile Thr Asp Ile Asp Ala Asn Thr Gln Gln Gln Leu
100 105 110
Thr Glu Leu Ile Gly Asn Leu Phe Val Asn Leu Asn Ser Gln Val Gln
115 120 125
Glu Tyr Ile Tyr Phe Tyr Glu Glu Lys Glu Lys Gln Thr Ser Tyr Arg
130 135 140
Tyr Asn Ile Leu Phe Val Phe Glu Lys Glu Ser Phe Ile Thr Ile Leu
145 150 155 160
Pro Met Gly Phe Asp Val Thr Val Asn Thr Asn Lys Glu Ala Val Leu
165 170 175
Lys Leu Thr Pro Lys Asp Lys Val Thr Tyr Gly His Val Ser Val Lys
180 185 190
Ala Leu Asn Ile Ile Gln Leu Ile Thr Glu Asp Lys Phe Asn Phe Leu
195 200 205
Ala Thr Leu Lys Lys Ala Leu Lys Thr Leu
210 215
<210>43
<212>DNA
<213>人工序列
<220>
<223>合成序列;在pMON64151中编码TIC809和TIC810的表达盒
<220>
<221>其他特征
<223>pMON64151第一和第二植物表达盒
<220>
<221>启动子
<222>(1)..(1844)
<223>水稻Rcc3
<220>
<221>5’UTR
<222>(1845)..(1943)
<223>水稻Rcc3前导序列
<220>
<221>内含子
<222>(1952)..(2755)
<223>HSP70内含子
<220>
<221>CDS
<222>(2772)..(3473)
<223>TIC809
<220>
<221>终止子
<222>(3477)..(3686)
<223>小麦Hsp17
<220>
<221>启动子
<222>(3724)..(4337)
<223>e35S
<220>
<221>5’UTR
<222>(4373)..(4433)
<223>小麦CAB前导序列
<220>
<221>内含子
<222>(4450)..(4929)
<223>水稻肌动蛋白
<220>
<221>CDS
<222>(4939)..(5598)
<223>TIC810
<220>
<221>终止子
<222>(5604)..(5813)
<223>小麦Hsp17
<400>43
gcaatcaacc aacatatact gaatatggga aagtttcttt tagcttttct aaattaagta 60
ctgattctta aacttaagtg agaatctagc ctgttcaggg gcgacggcta aaggacatag 120
caccactagt ctacgcgatt gcaaaaaaga agaatgcaag cctgcaacaa gtatcgcttt 180
cccgaccaat ggttggttga cctcggtttg ccggtaacct caggctggac gacagaacta 240
attagccaac ttgtcaatgt ctagggtgct gttcatagcc tgcagttgac agagtacgaa 300
aaggacaaga tcacatggaa gctaactagt cacggcgaat acatgacgac atcggcctac 360
aacgcacaac ttcttggcat aaaagcttca atttcaatgc ccctatctgg aagccctagg 420
cgccgcgcaa atgtaaaaca ttcgcttcgc ttggcttgtt atccaaaata gagtatggac 480
ctccgacaga ttggcaaccc gtgggtaatc gaaaatggct ccatctgccc ctttgtcgaa 540
ggaatcagga aacggccctc acctcctggc ggagtgtaga tatgtgaaag aatctaggcg 600
acacttgcag actggacaac atgtgaacaa ataagaccaa cgttatggca acaagcctcg 660
acgctactca agtggtggga ggccaccgca tgttccaacg aagcgccaaa gaaagccttg 720
cagactctaa tgctattagt cgcctaggat atttggaatg aaaggaaccg cagagttttt 780
cagcaccaag agcttccggt ggctagtctg atagccaaaa ttaaggagga tgccaaaaca 840
tgggtcttgg cgggcgcgaa acaccttgat aggtggctta ccttttaaca tgttcgggcc 900
aaaggccttg agacggtaaa gttttctatt tgcgcttgcg catgtacaat tttattcctc 960
tattcaatga aattggtggc tcactggttc attaaaaaaa aaagaatcta gcctgttcgg 1020
gaagaagagg attttgttcg tgagagagag agagagagag agagagagag agagagagaa 1080
ggaggaggag gattttcagg cttcgcattg cccaacctct gcttctgttg gcccaagaag 1140
aatcccaggc gcccatgggc tggcagttta ccacggacct acctagccta ccttagctat 1200
ctaagcgggc cgacctagta gccacgtgcc tagtgtagat taaagttgcc gggccagcag 1260
gaagccacgc tgcaatggca tcttcccctg tccttcgcgt acgtgaaaac aaacccaggt 1320
aagcttagaa tcttcttgcc cgttggactg ggacacccac caatcccacc atgccccgat 1380
attcctccgg tctcggttca tgtgatgtcc tctcttgtgt gatcacggag caagcattct 1440
taaacggcaa aagaaaatca ccaacttgct cacgcagtca cgctgcaccg cgcgaagcga 1500
cgcccgatag gccaagatcg cgagataaaa taacaaccaa tgatcataag gaaacaagcc 1560
cgcgatgtgt cgtgtgcagc aatcttggtc atttgcggga tcgagtgctt cacagctaac 1620
caaatattcg gccgatgatt taacacatta tcagcgtaga tgtacgtacg atttgttaat 1680
taatctacga gccttgctag ggcaggtgtt ctgccagcca atccagatcg ccctcgtatg 1740
cacgctcaca tgatggcagg gcagggttca catgagctct aacggtcgat taattaatcc 1800
cggggctcga ctataaatac ctccctaatc ccatgatcaa aaccatctca agcagcctaa 1860
tcatctccag ctgatcaaga gctcttaatt agctagctag tgattagctg cgcttgtgat 1920
cgatcgatct cgggtacgta gcaatagatc taccgtcttc ggtacgcgct cactccgccc 1980
tctgcctttg ttactgccac gtttctctga atgctctctt gtgtggtgat tgctgagagt 2040
ggtttagctg gatctagaat tacactctga aatcgtgttc tgcctgtgct gattacttgc 2100
cgtcctttgt agcagcaaaa tatagggaca tggtagtacg aaacgaagat agaacctaca 2160
cagcaatacg agaaatgtgt aatttggtgc ttagcggtat ttatttaagc acatgttggt 2220
gttatagggc acttggattc agaagtttgc tgttaattta ggcacaggct tcatactaca 2280
tgggtcaata gtatagggat tcatattata ggcgatacta taataatttg ttcgtctgca 2340
gagcttatta tttgccaaaa ttagatattc ctattctgtt tttgtttgtg tgctgttaaa 2400
ttgttaacgc ctgaaggaat aaatataaat gacgaaattt tgatgtttat ctctgctcct 2460
ttattgtgac cataagtcaa gatcagatgc acttgtttta aatattgttg tctgaagaaa 2520
taagtactga cagtattttg atgcattgat ctgcttgttt gttgtaacaa aatttaaaaa 2580
taaagagttt cctttttgtt gctctcctta cctcctgatg gtatctagta tctaccaact 2640
gacactatat tgcttctctt tacatacgta tcttgctcga tgccttctcc ctagtgttga 2700
ccagtgttac tcacatagtc tttgctcatt tcattgtaat gcagatacca agcggcctct 2760
agaggatctc c atg gcc ttc ttc aac cgg gtg atc acc ctc acg gtg ccg 2810
Met Ala Phe Phe Asn Arg Val Ile Thr Leu Thr Val Pro
1 5 10
tcg tca gac gtg gtc aac tac tcg gag atc tac cag gtg gct cct cag 2858
Ser Ser Asp Val Val Asn Tyr Ser Glu Ile Tyr Gln Val Ala Pro Gln
15 20 25
tat gtc aac cag gcc ctg acc ctg gcc aag tac ttc cag ggc gcc atc 2906
Tyr Val Asn Gln Ala Leu Thr Leu Ala Lys Tyr Phe Gln Gly Ala Ile
30 35 40 45
gac ggc agc acc ctg agg ttc gac ttc gag aag gcg tta cag atc gcc 2954
Asp Gly Ser Thr Leu Arg Phe Asp Phe Glu Lys Ala Leu Gln Ile Ala
50 55 60
aac gac atc ccg cag gcc gcg gtg gtc aac acc ctg aac cag acc gtc 3002
Asn Asp Ile Pro Gln Ala Ala Val Val Asn Thr Leu Asn Gln Thr Val
65 70 75
cag cag ggg acc gtc cag gtc agc gtc atg atc gac aag atc gtg gac 3050
Gln Gln Gly Thr Val Gln Val Ser Val Met Ile Asp Lys Ile Val Asp
80 85 90
atc atg aag aat gtc ctg tcc atc gtg ata gac aac aag aag ttt tgg 3098
Ile Met Lys Asn Val Leu Ser Ile Val Ile Asp Asn Lys Lys Phe Trp
95 100 105
gat cag gtc acg gct gcc atc acc aac acc ttc acg aac ctg aac agc 3146
Asp Gln Val Thr Ala Ala Ile Thr Asn Thr Phe Thr Asn Leu Asn Ser
110 115 120 125
cag gag tcg gag gcc tgg atc ttc tat tac aag gag gac gcc cac aag 3194
Gln Glu Ser Glu Ala Trp Ile Phe Tyr Tyr Lys Glu Asp Ala His Lys
130 135 140
acg tcc tac tat tac aac atc ctc ttc gcc atc cag gac gaa gag acg 3242
Thr Ser Tyr Tyr Tyr Asn Ile Leu Phe Ala Ile Gln Asp Glu Glu Thr
145 150 155
ggt ggc gtg atg gcc acg ctg ccc atc gcc ttc gac atc agt gtg gac 3290
Gly Gly Val Met Ala Thr Leu Pro Ile Ala Phe Asp Ile Ser Val Asp
160 165 170
atc gag aag gag aag gtc ctg ttc gtg acc atc aag gac act gag aat 3338
Ile Glu Lys Glu Lys Val Leu Phe Val Thr Ile Lys Asp Thr Glu Asn
175 180 185
tac gcc gtc acc gtc aag gcg atc aac gtg gtc cag gca ctc cag tct 3386
Tyr Ala Val Thr Val Lys Ala Ile Asn Val Val Gln Ala Leu Gln Ser
190 195 200 205
agc agg gat tct aag gtg gtt gat gcg ttc aaa tcg cca cgg cac tta 3434
Ser Arg Asp Ser Lys Val Val Asp Ala Phe Lys Ser Pro Arg His Leu
210 215 220
ccc cgg aag agg cat aag att tgc tct aac tcg tga tga attctgcatg 3483
Pro Arg Lys Arg His Lys Ile Cys Ser Asn Ser
225 230
cgtttggacg tatgctcatt caggttggag ccaatttggt tgatgtgtgt gcgagttctt 3543
gcgagtctga tgagacatct ctgtattgtg tttctttccc cagtgttttc tgtacttgtg 3603
taatcggcta atcgccaaca gattcggcga tgaataaatg agaaataaat tgttctgatt 3663
ttgagtgcaa aaaaaaagga attagatctg tgtgtgtttt ttggatcccc agcttctgca 3723
ggtccgatgt gagacttttc aacaaagggt aatatccgga aacctcctcg gattccattg 3783
cccagctatc tgtcacttta ttgtgaagat agtggaaaag gaaggtggct cctacaaatg 3843
ccatcattgc gataaaggaa aggccatcgt tgaagatgcc tctgccgaca gtggtcccaa 3903
agatggaccc ccacccacga ggagcatcgt ggaaaaagaa gacgttccaa ccacgtcttc 3963
aaagcaagtg gattgatgtg atggtccgat gtgagacttt tcaacaaagg gtaatatccg 4023
gaaacctcct cggattccat tgcccagcta tctgtcactt tattgtgaag atagtggaaa 4083
aggaaggtgg ctcctacaaa tgccatcatt gcgataaagg aaaggccatc gttgaagatg 4143
cctctgccga cagtggtccc aaagatggac ccccacccac gaggagcatc gtggaaaaag 4203
aagacgttcc aaccacgtct tcaaagcaag tggattgatg tgatatctcc actgacgtaa 4263
gggatgacgc acaatcccac tatccttcgc aagacccttc ctctatataa ggaagttcat 4323
ttcatttgga gaggacacgc tgacaagctg actctagcag atcctctaga accatcttcc 4383
acacactcaa gccacactat tggagaacac acagggacaa cacaccataa gatccaaggg 4443
aggcctccgc cgccgccggt aaccaccccg cccctctcct ctttctttct ccgttttttt 4503
ttccgtctcg gtctcgatct ttggccttgg tagtttgggt gggcgagagg cggcttcgtg 4563
cgcgcccaga tcggtgcgcg ggaggggcgg gatctcgcgg ctggggctct cgccggcgtg 4623
gatccggccc ggatctcgcg gggaatgggg ctctcggatg tagatctgcg atccgccgtt 4683
gttgggggag atgatggggg gtttaaaatt tccgccgtgc taaacaagat caggaagagg 4743
ggaaaagggc actatggttt atatttttat atatttctgc tgcttcgtca ggcttagatg 4803
tgctagatct ttctttcttc tttttgtggg tagaatttga atccctcagc attgttcatc 4863
ggtagttttt cttttcatga tttgtgacaa atgcagcctc gtgcggagct tttttgtagg 4923
tagaagtgat caacc atg agc aaa gaa atc agg ctc aac ctt tct cgt gag 4974
Met Ser Lys Glu Ile Arg Leu Asn Leu Ser Arg Glu
235 240
agc ggc gcc gac ctg tac ctc aag atc ctc gcc ttc gtg aag ccc gag 5022
Ser Gly Ala Asp Leu Tyr Leu Lys Ile Leu Ala Phe Val Lys Pro Glu
245 250 255 260
cac ttc ttt cag gcg tac ctc ctg tgc cgc gag ttc gag agc atc gtg 5070
His Phe Phe Gln Ala Tyr Leu Leu Cys Arg Glu Phe Glu Ser Ile Val
265 270 275
gat cct aca acc cgc gag tct gac ttc gac aag acg ctg acc atc gtg 5118
Asp Pro Thr Thr Arg Glu Ser Asp Phe Asp Lys Thr Leu Thr Ile Val
280 285 290
aag tcg gac tcc acc ctc gtg acc gtg ggc acg atg aac acc aag ctg 5166
Lys Ser Asp Ser Thr Leu Val Thr Val Gly Thr Met Asn Thr Lys Leu
295 300 305
gtc aat agc caa gag atc ctc gtg tcg gac ttg atc act caa gtc ggt 5214
Val Asn Ser Gln Glu Ile Leu Val Ser Asp Leu Ile Thr Gln Val Gly
310 315 320
tcc cag atc gcc gat acc ctc ggc atc acg gac atc gac gcc aac acc 5262
Ser Gln Ile Ala Asp Thr Leu Gly Ile Thr Asp Ile Asp Ala Asn Thr
325 330 335 340
cag caa cag ctc acg gag ctg atc ggc aac ctc ttc gtg aac ctc aat 5310
Gln Gln Gln Leu Thr Glu Leu Ile Gly Asn Leu Phe Val Asn Leu Asn
345 350 355
tcc caa gtt cag gag tac atc tac ttc tac gag gag aag gag aag cag 5358
Ser Gln Val Gln Glu Tyr Ile Tyr Phe Tyr Glu Glu Lys Glu Lys Gln
360 365 370
acc tcc tac cgc tac aac atc ctc ttc gtg ttc gaa aag gag tcg ttc 5406
Thr Ser Tyr Arg Tyr Asn Ile Leu Phe Val Phe Glu Lys Glu Ser Phe
375 380 385
atc acc att ctg cca atg ggc ttc gac gtg acc gtg aac acg aac aag 5454
Ile Thr Ile Leu Pro Met Gly Phe Asp Val Thr Val Asn Thr Asn Lys
390 395 400
gag gcc gtc ctg aag ctg acc ccg aag gac aag gtt acc tac ggc cac 5502
Glu Ala Val Leu Lys Leu Thr Pro Lys Asp Lys Val Thr Tyr Gly His
405 410 415 420
gtc agc gtc aag gcc ctc aac atc atc cag ctc att acg gag gac aag 5550
Val Ser Val Lys Ala Leu Asn Ile Ile Gln Leu Ile Thr Glu Asp Lys
425 430 435
ttc aac ttc ctc gca acc ctc aag aag gct ctc aag acc ctg tga tga 5598
Phe Asn Phe Leu Ala Thr Leu Lys Lys Ala Leu Lys Thr Leu
440 445 450
gaattctgca tgcgtttgga cgtatgctca ttcaggttgg agccaatttg gttgatgtgt 5658
gtgcgagttc ttgcgagtct gatgagacat ctctgtattg tgtttctttc cccagtgttt 5718
tctgtacttg tgtaatcggc taatcgccaa cagattcggc gatgaataaa tgagaaataa 5778
attgttctga ttttgagtgc aaaaaaaaag gaatt 5813
<210>44
<211>232
<212>PRT
<213>人工序列
<220>
<223>合成构建体
<400>44
Met Ala Phe Phe Asn Arg Val Ile Thr Leu Thr Val Pro Ser Ser Asp
1 5 10 15
Val Val Asn Tyr Ser Glu Ile Tyr Gln Val Ala Pro Gln Tyr Val Asn
20 25 30
Gln Ala Leu Thr Leu Ala Lys Tyr Phe Gln Gly Ala Ile Asp Gly Ser
35 40 45
Thr Leu Arg Phe Asp Phe Glu Lys Ala Leu Gln Ile Ala Asn Asp Ile
50 55 60
Pro Gln Ala Ala Val Val Asn Thr Leu Asn Gln Thr Val Gln Gln Gly
65 70 75 80
Thr Val Gln Val Ser Val Met Ile Asp Lys Ile Val Asp Ile Met Lys
85 90 95
Asn Val Leu Ser Ile Val Ile Asp Asn Lys Lys Phe Trp Asp Gln Val
100 105 110
Thr Ala Ala Ile Thr Asn Thr Phe Thr Asn Leu Asn Ser Gln Glu Ser
115 120 125
Glu Ala Trp Ile Phe Tyr Tyr Lys Glu Asp Ala His Lys Thr Ser Tyr
130 135 140
Tyr Tyr Asn Ile Leu Phe Ala Ile Gln Asp Glu Glu Thr Gly Gly Val
145 150 155 160
Met Ala Thr Leu Pro Ile Ala Phe Asp Ile Ser Val Asp Ile Glu Lys
165 170 175
Glu Lys Val Leu Phe Val Thr Ile Lys Asp Thr Glu Asn Tyr Ala Val
180 185 190
Thr Val Lys Ala Ile Asn Val Val Gln Ala Leu Gln Ser Ser Arg Asp
195 200 205
Ser Lys Val Val Asp Ala Phe Lys Ser Pro Arg His Leu Pro Arg Lys
210 215 220
Arg His Lys Ile Cys Ser Asn Ser
225 230
<210>45
<211>218
<212>PRT
<213>人工序列
<220>
<223>合成构建体
<400>45
Met Ser Lys Glu Ile Arg Leu Asn Leu Ser Arg Glu Ser Gly Ala Asp
1 5 10 15
Leu Tyr Leu Lys Ile Leu Ala Phe Val Lys Pro Glu His Phe Phe Gln
20 25 30
Ala Tyr Leu Leu Cys Arg Glu Phe Glu Ser Ile Val Asp Pro Thr Thr
35 40 45
Arg Glu Ser Asp Phe Asp Lys Thr Leu Thr Ile Val Lys Ser Asp Ser
50 55 60
Thr Leu Val Thr Val Gly Thr Met Asn Thr Lys Leu Val Asn Ser Gln
65 70 75 80
Glu Ile Leu Val Ser Asp Leu Ile Thr Gln Val Gly Ser Gln Ile Ala
85 90 95
Asp Thr Leu Gly Ile Thr Asp Ile Asp Ala Asn Thr Gln Gln Gln Leu
100 105 110
Thr Glu Leu Ile Gly Asn Leu Phe Val Asn Leu Asn Ser Gln Val Gln
115 120 125
Glu Tyr Ile Tyr Phe Tyr Glu Glu Lys Glu Lys Gln Thr Ser Tyr Arg
130 135 140
Tyr Asn Ile Leu Phe Val Phe Glu Lys Glu Ser Phe Ile Thr Ile Leu
145 150 155 160
Pro Met Gly Phe Asp Val Thr Val Asn Thr Asn Lys Glu Ala Val Leu
165 170 175
Lys Leu Thr Pro Lys Asp Lys Val Thr Tyr Gly His Val Ser Val Lys
180 185 190
Ala Leu Asn Ile Ile Gln Leu Ile Thr Glu Asp Lys Phe Asn Phe Leu
195 200 205
Ala Thr Leu Lys Lys Ala Leu Lys Thr Leu
210 215
<210>46
<211>1413
<212>DNA
<213>人工序列
<220>
<221>CDS
<222>(1)..(1407)
<223>TIC127
<220>
<223>合成构建体
<220>
<223>(1)..(696)
<223>TIC809氨基酸序列
<220>
<223>(697)..(753)
<223>对蛋白水解敏感的间隔或连接氨基酸序列
<220>
<223>(754)..(1407)
<223>TIC810氨基酸序列
<400>46
atg gcc ttc ttc aac cgg gtg atc acc ctc acg gtg ccg tcg tca gac 48
Met Ala Phe Phe Asn Arg Val Ile Thr Leu Thr Val Pro Ser Ser Asp
1 5 10 15
gtg gtc aac tac tcg gag atc tac cag gtg gct cct cag tat gtc aac 96
Val Val Asn Tyr Ser Glu Ile Tyr Gln Val Ala Pro Gln Tyr Val Asn
20 25 30
cag gcc ctg acc ctg gcc aag tac ttc cag ggc gcc atc gac ggc agc 144
Gln Ala Leu Thr Leu Ala Lys Tyr Phe Gln Gly Ala Ile Asp Gly Ser
35 40 45
acc ctg agg ttc gac ttc gag aag gcg tta cag atc gcc aac gac atc 192
Thr Leu Arg Phe Asp Phe Glu Lys Ala Leu Gln Ile Ala Asn Asp Ile
50 55 60
ccg cag gcc gcg gtg gtc aac acc ctg aac cag acc gtc cag cag ggg 240
Pro Gln Ala Ala Val Val Asn Thr Leu Asn Gln Thr Val Gln Gln Gly
65 70 75 80
acc gtc cag gtc agc gtc atg atc gac aag atc gtg gac atc atg aag 288
Thr Val Gln Val Ser Val Met Ile Asp Lys Ile Val Asp Ile Met Lys
85 90 95
aat gtc ctg tcc atc gtg ata gac aac aag aag ttt tgg gat cag gtc 336
Asn Val Leu Ser Ile Val Ile Asp Asn Lys Lys Phe Trp Asp Gln Val
100 105 110
acg gct gcc atc acc aac acc ttc acg aac ctg aac agc cag gag tcg 384
Thr Ala Ala Ile Thr Asn Thr Phe Thr Asn Leu Asn Ser Gln Glu Ser
115 120 125
gag gcc tgg atc ttc tat tac aag gag gac gcc cac aag acg tcc tac 432
Glu Ala Trp Ile Phe Tyr Tyr Lys Glu Asp Ala His Lys Thr Ser Tyr
130 135 140
tat tac aac atc ctc ttc gcc atc cag gac gaa gag acg ggt ggc gtg 480
Tyr Tyr Asn Ile Leu Phe Ala Ile Gln Asp Glu Glu Thr Gly Gly Val
145 150 155 160
atg gcc acg ctg ccc atc gcc ttc gac atc agt gtg gac atc gag aag 528
Met Ala Thr Leu Pro Ile Ala Phe Asp Ile Ser Val Asp Ile Glu Lys
165 170 175
gag aag gtc ctg ttc gtg acc atc aag gac act gag aat tac gcc gtc 576
Glu Lys Val Leu Phe Val Thr Ile Lys Asp Thr Glu Asn Tyr Ala Val
180 185 190
acc gtc aag gcg atc aac gtg gtc cag gca ctc cag tct agc agg gat 624
Thr Val Lys Ala Ile Asn Val Val Gln Ala Leu Gln Ser Ser Arg Asp
195 200 205
tct aag gtg gtt gat gcg ttc aaa tcg cca cgg cac tta ccc cgg aag 672
Ser Lys Val Val Asp Ala Phe Lys Ser Pro Arg His Leu Pro Arg Lys
210 215 220
agg cat aag att tgc tct aac tcg aag ccc gcc ctg ctc aag gag gct 720
Arg His Lys Ile Cys Ser Asn Ser Lys Pro Ala Leu Leu Lys Glu Ala
225 230 235 240
ccc cgc gcc gag gag gag ctg cct ccc cgc aag atg agc aaa gaa atc 768
Pro Arg Ala Glu Glu Glu Leu Pro Pro Arg Lys Met Ser Lys Glu Ile
245 250 255
agg ctc aac ctt tct cgt gag agc ggc gcc gac ctg tac ctc aag atc 816
Arg Leu Asn Leu Ser Arg Glu Ser Gly Ala Asp Leu Tyr Leu Lys Ile
260 265 270
ctc gcc ttc gtg aag ccc gag cac ttc ttt cag gcg tac ctc ctg tgc 864
Leu Ala Phe Val Lys Pro Glu His Phe Phe Gln Ala Tyr Leu Leu Cys
275 280 285
cgc gag ttc gag agc atc gtg gat cct aca acc cgc gag tct gac ttc 912
Arg Glu Phe Glu Ser Ile Val Asp Pro Thr Thr Arg Glu Ser Asp Phe
290 295 300
gac aag acg ctg acc atc gtg aag tcg gac tcc acc ctc gtg acc gtg 960
Asp Lys Thr Leu Thr Ile Val Lys Ser Asp Ser Thr Leu Val Thr Val
305 310 315 320
ggc acg atg aac acc aag ctg gtc aat agc caa gag atc ctc gtg tcg 1008
Gly Thr Met Asn Thr Lys Leu Val Asn Ser Gln Glu Ile Leu Val Ser
325 330 335
gac ttg atc act caa gtc ggt tcc cag atc gcc gat acc ctc ggc atc 1056
Asp Leu Ile Thr Gln Val Gly Ser Gln Ile Ala Asp Thr Leu Gly Ile
340 345 350
acg gac atc gac gcc aac acc cag caa cag ctc acg gag ctg atc ggc 1104
Thr Asp Ile Asp Ala Asn Thr Gln Gln Gln Leu Thr Glu Leu Ile Gly
355 360 365
aac ctc ttc gtg aac ctc aat tcc caa gtt cag gag tac atc tac ttc 1152
Asn Leu Phe Val Asn Leu Asn Ser Gln Val Gln Glu Tyr Ile Tyr Phe
370 375 380
tac gag gag aag gag aag cag acc tcc tac cgc tac aac atc ctc ttc 1200
Tyr Glu Glu Lys Glu Lys Gln Thr Ser Tyr Arg Tyr Asn Ile Leu Phe
385 390 395 400
gtg ttc gaa aag gag tcg ttc atc acc att ctg cca atg ggc ttc gac 1248
Val Phe Glu Lys Glu Ser Phe Ile Thr Ile Leu Pro Met Gly Phe Asp
405 410 415
gtg acc gtg aac acg aac aag gag gcc gtc ctg aag ctg acc ccg aag 1296
Val Thr Val Asn Thr Asn Lys Glu Ala Val Leu Lys Leu Thr Pro Lys
420 425 430
gac aag gtt acc tac ggc cac gtc agc gtc aag gcc ctc aac atc atc 1344
Asp Lys Val Thr Tyr Gly His Val Ser Val Lys Ala Leu Asn Ile Ile
435 440 445
cag ctc att acg gag gac aag ttc aac ttc ctc gca acc ctc aag aag 1392
Gln Leu Ile Thr Glu Asp Lys Phe Asn Phe Leu Ala Thr Leu Lys Lys
450 455 460
gct ctc aag acc ctg tgatga 1413
Ala Leu Lys Thr Leu
465
<210>47
<211>469
<212>PRT
<213>人工序列
<400>47
Met Ala Phe Phe Asn Arg Val Ile Thr Leu Thr Val Pro Ser Ser Asp
1 5 10 15
Val Val Asn Tyr Ser Glu Ile Tyr Gln Val Ala Pro Gln Tyr Val Asn
20 25 30
Gln Ala Leu Thr Leu Ala Lys Tyr Phe Gln Gly Ala Ile Asp Gly Ser
35 40 45
Thr Leu Arg Phe Asp Phe Glu Lys Ala Leu Gln Ile Ala Asn Asp Ile
50 55 60
Pro Gln Ala Ala Val Val Asn Thr Leu Asn Gln Thr Val Gln Gln Gly
65 70 75 80
Thr Val Gln Val Ser Val Met Ile Asp Lys Ile Val Asp Ile Met Lys
85 90 95
Asn Val Leu Ser Ile Val Ile Asp Asn Lys Lys Phe Trp Asp Gln Val
100 105 110
Thr Ala Ala Ile Thr Asn Thr Phe Thr Asn Leu Asn Ser Gln Glu Ser
115 120 125
Glu Ala Trp Ile Phe Tyr Tyr Lys Glu Asp Ala His Lys Thr Ser Tyr
130 135 140
Tyr Tyr Asn Ile Leu Phe Ala Ile Gln Asp Glu Glu Thr Gly Gly Val
145 150 155 160
Met Ala Thr Leu Pro Ile Ala Phe Asp Ile Ser Val Asp Ile Glu Lys
165 170 175
Glu Lys Val Leu Phe Val Thr Ile Lys Asp Thr Glu Asn Tyr Ala Val
180 185 190
Thr Val Lys Ala Ile Asn Val Val Gln Ala Leu Gln Ser Ser Arg Asp
195 200 205
Ser Lys Val Val Asp Ala Phe Lys Ser Pro Arg His Leu Pro Arg Lys
210 215 220
Arg His Lys Ile Cys Ser Asn Ser Lys Pro Ala Leu Leu Lys Glu Ala
225 230 235 240
Pro Arg Ala Glu Glu Glu Leu Pro Pro Arg Lys Met Ser Lys Glu Ile
245 250 255
Arg Leu Asn Leu Ser Arg Glu Ser Gly Ala Asp Leu Tyr Leu Lys Ile
260 265 270
Leu Ala Phe Val Lys Pro Glu His Phe Phe Gln Ala Tyr Leu Leu Cys
275 280 285
Arg Glu Phe Glu Ser Ile Val Asp Pro Thr Thr Arg Glu Ser Asp Phe
290 295 300
Asp Lys Thr Leu Thr Ile Val Lys Ser Asp Ser Thr Leu Val Thr Val
305 310 315 320
Gly Thr Met Asn Thr Lys Leu Val Asn Ser Gln Glu Ile Leu Val Ser
325 330 335
Asp Leu Ile Thr Gln Val Gly Ser Gln Ile Ala Asp Thr Leu Gly Ile
340 345 350
Thr Asp Ile Asp Ala Asn Thr Gln Gln Gln Leu Thr Glu Leu Ile Gly
355 360 365
Asn Leu Phe Val Asn Leu Asn Ser Gln Val Gln Glu Tyr Ile Tyr Phe
370 375 380
Tyr Glu Glu Lys Glu Lys Gln Thr Ser Tyr Arg Tyr Asn Ile Leu Phe
385 390 395 400
Val Phe Glu Lys Glu Ser Phe Ile Thr Ile Leu Pro Met Gly Phe Asp
405 410 415
Val Thr Val Asn Thr Asn Lys Glu Ala Val Leu Lys Leu Thr Pro Lys
420 425 430
Asp Lys Val Thr Tyr Gly His Val Ser Val Lys Ala Leu Asn Ile Ile
435 440 445
Gln Leu Ile Thr Glu Asp Lys Phe Asn Phe Leu Ala Thr Leu Lys Lys
450 455 460
Ala Leu Lys Thr Leu
465
Claims (40)
1.一种编码杀虫蛋白的分离纯化的核苷酸序列,所述蛋白选自SEQ ID NO:2、SEQ ID NO:4、SEQ ID NO:6、SEQ ID NO:8、SEQ IDNO:10、SEQ ID NO:12和SEQ ID NO:47。
2.权利要求1的分离纯化的核苷酸序列,所述核苷酸序列选自SEQ ID NO:1、SEQ ID NO:3、SEQ ID NO:5、SEQ ID NO:7、SEQ IDNO:9、SEQ ID NO:11和SEQ ID NO:46。
3.一种分离纯化的杀虫蛋白,所述杀虫蛋白选自ET29、ET37、TIC810、TIC812、TIC809和TIC127。
4.一种用编码杀虫蛋白的核苷酸序列转化的植物或植物细胞,所述杀虫蛋白选自ET29、ET37、TIC810、TIC812、TIC809和TIC127。
5.权利要求4的植物的子代或种子,其中所述子代或种子包含所述核苷酸序列。
6.一种载体,所述载体包含的核苷酸序列编码的蛋白选自ET29、ET37、TIC810、TIC812、TIC809和TIC127。
7.一种在宿主细胞中增强第一种杀虫蛋白累积的方法,所述方法包括同时表达选自ET29、TIC809和ET37的所述第一种蛋白和选自TIC810和TIC812的第二种杀虫蛋白,其中,相比于在没有所述第二种蛋白时表达的所述第一种蛋白的累积,所述同时表达增强第一种蛋白的累积。
8.一种表现出杀虫活性的组合物,所述组合物包含选自ET29、ET37和TIC809的第一种蛋白以及选自TIC810和TIC812的第二种蛋白。
9.权利要求8的组合物,其中所述第一种蛋白是TIC809,所述第二种蛋白是TIC810。
10.权利要求8的组合物,其中所述第一种蛋白与所述第二种蛋白的比率为约1∶1至约2∶1。
11.权利要求8的组合物,所述组合物以农学上可接受的制剂在鞘翅目或半翅目植物害虫的食物中提供。
12.权利要求11的组合物,其中所述制剂由在植物细胞中表达的两种蛋白组成。
13.权利要求11的组合物,其中所述鞘翅目害虫为玉米根虫,所述半翅目害虫为草盲蝽。
14.一种商品,所述商品包括可检测量的选自ET29、ET37和TIC809的第一种蛋白,以及可检测量的选自TIC810和TIC812的第二种蛋白。
15.一种商品,所述商品包括可检测量的(1)第一种核苷酸序列和第二种核苷酸序列,所述第一种核苷酸序列编码选自ET29、ET37和TIC809的第一种蛋白,所述第二种核苷酸序列编码选自TIC810和TIC812的第二种蛋白;(2)选自ET29、ET37和TIC809的第一种蛋白,和选自TIC810和TIC812的第二种蛋白;(3)(1)和(2)这二者;或(4)(1)和(2)的任何组合。
16.权利要求14-15中任一项的商品,所述商品选自棉籽、棉籽油、棉籽饼、大豆种子、大豆油、大豆饼、大豆粉、蒸馏干谷物固体、玉米种子、玉米粉、玉米饼、玉米浆、玉米油、玉米淀粉、动物饲料以及全部或部分生产以含有大豆和/玉米副产物的谷物产品。
17.一种制备抗害虫的植物细胞的方法,所述方法包括转化所述植物细胞,以表达杀虫有效量的毒素组合物,所述毒素组合物包含选自ET29、ET37和TIC809的第一种蛋白以及选自TIC810和TIC812的第二种蛋白。
18.权利要求17的方法,其中所述植物细胞再生为能育转基因植物。
19.权利要求17的方法,其中所述植物细胞选自单子叶植物细胞和双子叶植物细胞。
20.权利要求19的方法,其中(1)所述单子叶植物细胞选自玉米、小麦、燕麦、水稻、高粱、蜀黍、荞麦、黑麦、草(牛毛草、梯牧草、雀麦草、鸭茅、圣奥古斯丁草、百慕大草、翦股颖)和大麦植物细胞,(2)所述双子叶植物细胞选自苜蓿、苹果、杏、芦笋、菜豆、浆果、黑莓、蓝莓、油菜、胡萝卜、花椰菜、芹菜、樱桃、鹰嘴豆、柑橘树、棉花、豇豆、酸果蔓、黄瓜、葫芦、茄子、果树、葡萄、柠檬、莴苣、亚麻子、香瓜、芥菜、结坚果的树、秋葵、橘、豌豆、桃、花生、梨、李、马铃薯、大豆、南瓜、草莓、糖甜菜、向日葵、甘薯、烟草、番茄、芜菁和蔬菜植物细胞。
21.权利要求18的方法,其中子代植物和种子由所述转基因植物产生,其中所述子代植物和种子包含所述第一种和第二种蛋白。
22.一种包含编码cyt蛋白的核苷酸序列的宿主细胞,所述蛋白选自TIC809、ET37、TIC810、TIC812和TIC127。
23.权利要求22的宿主细胞,所述宿主细胞选自植物细胞、真菌细胞和细菌细胞。
24.权利要求23的宿主细胞,其中所述植物细胞选自单子叶植物细胞和双子叶植物细胞。
25.权利要求24的宿主细胞,其中(1)所述单子叶植物细胞选自玉米、小麦、燕麦、水稻、高粱、蜀黍、荞麦、黑麦、草(牛毛草、梯牧草、雀麦草、鸭茅、圣奥古斯丁草、百慕大草、翦股颖)和大麦植物细胞,(2)所述双子叶植物细胞选自苜蓿、苹果、杏、芦笋、菜豆、浆果、黑莓、蓝莓、油菜、胡萝卜、花椰菜、芹菜、樱桃、鹰嘴豆、柑橘树、棉花、豇豆、酸果蔓、黄瓜、葫芦、茄子、果树、葡萄、柠檬、莴苣、亚麻子、香瓜、芥菜、结坚果的树、秋葵、橘、豌豆、桃、花生、梨、李、马铃薯、大豆、南瓜、草莓、糖甜菜、向日葵、甘薯、烟草、番茄、芜菁和蔬菜植物细胞。
26.一种用于在宿主细胞中表达杀虫蛋白的表达盒,其中所述表达盒包含可操作连接的、在所述宿主细胞中有功能的启动子序列以及编码所述蛋白的核苷酸序列,所述蛋白选自TIC809、TIC810、TIC812、ET37和TIC127。
27.权利要求26的表达盒,其中所述宿主细胞选自细菌细胞、真菌细胞、哺乳动物细胞和植物细胞。
28.权利要求27的表达盒,其中
a)所述细菌细胞选自芽孢杆菌细胞、肠杆菌细胞、假单胞菌细胞、梭菌细胞和根瘤菌细胞,以及土壤杆菌细胞;和
b)所述植物细胞选自由双子叶植物和单子叶植物组成的植物,所述双子叶植物又选自苜蓿、苹果、杏、芦笋、菜豆、浆果、黑莓、蓝莓、油菜、胡萝卜、花椰菜、芹菜、樱桃、鹰嘴豆、柑橘树、棉花、豇豆、酸果蔓、黄瓜、葫芦、茄子、果树、葡萄、柠檬、莴苣、亚麻子、香瓜、芥菜、结坚果的树、秋葵、橘、豌豆、桃、花生、梨、李、马铃薯、大豆、南瓜、草莓、糖甜菜、向日葵、甘薯、烟草、番茄、芜菁和蔬菜,而所述单子叶植物又选自玉米、小麦、燕麦、水稻、高粱、蜀黍、荞麦、黑麦、草(牛毛草、梯牧草、雀麦草、鸭茅、圣奥古斯丁草、百慕大草、翦股颖)和大麦。
29.权利要求26的表达盒,其中所述宿主细胞为植物细胞,所述表达盒还包含可操作连接的序列,所述序列选自表达增强子序列、非翻译前导序列、内含子序列、靶向叶绿体的肽编码序列以及转录终止和聚腺苷酸化序列。
30.一种载体,所述载体包含权利要求26-29中任一项的表达盒。
31.一种抗昆虫侵袭的转基因植物或植物细胞,所述转基因植物或植物细胞包含编码第一种cyt毒素和第二种cyt毒素的核苷酸序列,所述第一种cyt毒素选自ET29、TIC809和ET37,所述第二种cyt毒素选自TIC810和TIC812。
32.权利要求31的转基因植物或植物细胞,其中所述转基因植物选自双子叶植物或双子叶植物细胞和单子叶植物或单子叶植物细胞,所述双子叶植物或双子叶植物细胞又选自苜蓿、苹果、杏、芦笋、菜豆、浆果、黑莓、蓝莓、油菜、胡萝卜、花椰菜、芹菜、樱桃、鹰嘴豆、柑橘树、棉花、豇豆、酸果蔓、黄瓜、葫芦、茄子、果树、葡萄、柠檬、莴苣、亚麻子、香瓜、芥菜、结坚果的树、秋葵、橘、豌豆、桃、花生、梨、李、马铃薯、大豆、南瓜、草莓、糖甜菜、向日葵、甘薯、烟草、番茄、芜菁和蔬菜,而所述单子叶植物又选自玉米、小麦、燕麦、水稻、高粱、蜀黍、荞麦、黑麦、草(牛毛草、梯牧草、雀麦草、鸭茅、圣奥古斯丁草、百慕大草、翦股颖)和大麦。
33.权利要求32的转基因植物或植物细胞,所述转基因植物或植物细胞能抵抗(1)鞘翅目昆虫侵袭和(2)半翅目昆虫侵袭,其中所述鞘翅目昆虫选自叶甲科、扁甲科、金龟子科、谷盗科、拟步甲科、象虫科、叩甲科和豆象科,所述半翅目选自异翅亚目和同翅亚目。
34.权利要求33的转基因植物或植物细胞,其中所述叶甲科为选自以下的叶甲虫:西方玉米根虫(Diabrotica virgifera)、南方玉米根虫(Diabrotica undecempunctata)、北方玉米根虫(Diabrotica barberi)、墨西哥玉米根虫(Diabrotica virgifera zeae)、巴西玉米根虫(Diabroticabalteata)以及巴西玉米根虫复合群(BCR),所述巴西玉米根虫复合群还包括Diabrotica viridula和南美叶甲(Diabrotica speciosa)。
35.权利要求33的转基因植物或植物细胞的子代或种子,其中所述子代或种子包含所述核苷酸序列。
36.一种防治鞘翅目或半翅目昆虫侵袭植物的方法,所述方法通过在昆虫食物中提供一种或多种用核酸序列转化的植物细胞,所述核酸序列包含有效连接的植物功能性启动子和编码融合蛋白的核苷酸序列,其中所述融合蛋白包含选自SEQ ID NO:2、SEQ ID NO:8和SEQID NO:14的第一种氨基酸序列以及选自SEQ ID NO:4和SEQ ID NO:6的第二种氨基酸序列。
37.一种保护大田中作物免受昆虫侵袭的方法,所述方法包括培育含杀虫有效量的第一种蛋白和第二种蛋白的转基因作物,所述第一种蛋白选自ET37、ET29和TIC809,所述第二种蛋白选自TIC810和TIC812,并在昆虫食物中一起提供所述蛋白,以抑制昆虫在所述转基因作物上存活。
38.权利要求37的方法,其中所述昆虫选自鞘翅目昆虫和半翅目昆虫。
39.权利要求37的方法,其中所述转基因作物还包含与所述第一种和第二种蛋白具有相同昆虫毒性的额外杀虫剂,其中所述额外杀虫剂选自芽孢杆菌毒素、致病杆菌毒素、发光杆菌毒素和特异性抑制所述昆虫中的一种或多种必需基因的dsRNA。
40.权利要求37-39中任一项的方法,其中所述作物的产量相比于没有所述杀虫蛋白的等基因作物的产量被提高。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US71311105P | 2005-08-31 | 2005-08-31 | |
US60/713,111 | 2005-08-31 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN101300353A true CN101300353A (zh) | 2008-11-05 |
Family
ID=37698224
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNA2006800404600A Pending CN101300353A (zh) | 2005-08-31 | 2006-08-30 | 用于制备抗虫转基因植物的杀虫组合物和方法 |
Country Status (15)
Country | Link |
---|---|
US (1) | US9121035B2 (zh) |
EP (1) | EP1920060B1 (zh) |
CN (1) | CN101300353A (zh) |
AR (1) | AR057515A1 (zh) |
AT (1) | ATE505548T1 (zh) |
AU (1) | AU2006284856B2 (zh) |
BR (1) | BRPI0615649A2 (zh) |
CA (1) | CA2618430A1 (zh) |
CR (1) | CR9769A (zh) |
DE (1) | DE602006021318D1 (zh) |
EC (1) | ECSP088226A (zh) |
MX (1) | MX2008002801A (zh) |
RU (1) | RU2008112187A (zh) |
WO (1) | WO2007027776A2 (zh) |
ZA (1) | ZA200801477B (zh) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102993282A (zh) * | 2012-11-19 | 2013-03-27 | 北京大北农科技集团股份有限公司 | 杀虫蛋白质、其编码基因及用途 |
CN107406850A (zh) * | 2014-12-16 | 2017-11-28 | 美国陶氏益农公司 | 用于控制半翅目害虫的KRUPPEL基因的亲代RNAi抑制 |
CN108882716A (zh) * | 2015-11-02 | 2018-11-23 | 孟山都技术公司 | 棉花转基因事件mon 88702以及其检测和使用方法 |
Families Citing this family (38)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20060200878A1 (en) | 2004-12-21 | 2006-09-07 | Linda Lutfiyya | Recombinant DNA constructs and methods for controlling gene expression |
US8956436B2 (en) | 2006-06-30 | 2015-02-17 | Corning Incorporated | Cordierite aluminum magnesium titanate compositions and ceramic articles comprising same |
US10501375B2 (en) | 2006-06-30 | 2019-12-10 | Corning Incorporated | Cordierite aluminum magnesium titanate compositions and ceramic articles comprising same |
AU2007352460C1 (en) | 2006-10-12 | 2013-05-23 | Monsanto Technology, Llc | Plant microRNAs and methods of use thereof |
US10036036B1 (en) * | 2007-03-15 | 2018-07-31 | Monsanto Technology Llc | Compositions and methods for deploying a transgenic refuge as a seed blend |
US8609936B2 (en) | 2007-04-27 | 2013-12-17 | Monsanto Technology Llc | Hemipteran-and coleopteran active toxin proteins from Bacillus thuringiensis |
US8809625B2 (en) * | 2008-01-17 | 2014-08-19 | Pioneer Hi-Bred International, Inc. | Compositions and methods for the suppression of target polynucleotides from Lygus |
US8513493B2 (en) | 2008-08-29 | 2013-08-20 | Monsanto Technology Llc | Hemipteran and coleopteran active toxin proteins from Bacillus thuringiensis |
MX348430B (es) * | 2008-10-15 | 2017-06-05 | Centro De Investigación Y De Estudios Avanzados Del Instituto Politécnico Nac | Proteinas derivadas de genes cry de bacillus thuringiensis. |
US8937214B2 (en) | 2009-10-23 | 2015-01-20 | Monsanto Technology Llc | Methods and compositions for expression of transgenes in plants |
MX346662B (es) | 2011-04-07 | 2017-03-27 | Monsanto Technology Llc | Familia de toxinas inhibidoras de insectos activas contra insectos hemipteros y lepidopteros. |
BR112014003911A2 (pt) * | 2011-08-19 | 2017-03-14 | Synthetic Genomics Inc | método integrado para a identificação de alto rendimento de novas composições pesticidas e seu uso |
DK3453711T3 (da) | 2011-09-02 | 2021-10-18 | Univ California | Llp2a-bisphosphonatkonjugater til behandling mod osteoporose |
US9657308B2 (en) * | 2011-10-06 | 2017-05-23 | Dow Agrosciences Llc | Nucleic acid molecules that target PP1-87B and confer resistance to coleopteran pests |
AU2013229777B2 (en) | 2012-03-09 | 2017-10-26 | Vestaron Corporation | Toxic peptide production, peptide expression in plants and combinations of cysteine rich peptides |
US11692016B2 (en) | 2012-03-09 | 2023-07-04 | Vestaron Corporation | High gene expression yeast strain |
UY34731A (es) | 2012-04-06 | 2013-11-29 | Monsanto Technology Llc | ?proteínas tóxicas para especies de insectos hemípteros, su uso y métodos de uso, secuencias codifi cantes, métodos de detección, y aislamiento?. |
US9688730B2 (en) * | 2012-07-02 | 2017-06-27 | Pioneer Hi-Bred International, Inc. | Insecticidal proteins and methods for their use |
US10968446B2 (en) | 2012-11-01 | 2021-04-06 | Massachusetts Institute Of Technology | Directed evolution of synthetic gene cluster |
WO2014182473A1 (en) * | 2013-05-08 | 2014-11-13 | Monsanto Technology Llc | Compositions and methods for deploying a transgenic refuge seed blend |
CN106232820A (zh) * | 2013-08-16 | 2016-12-14 | 先锋国际良种公司 | 杀昆虫蛋白及其使用方法 |
EP3221454B1 (en) * | 2014-11-20 | 2020-12-23 | Monsanto Technology LLC | Novel insect inhibitory proteins |
BR102016005404A2 (pt) * | 2015-03-13 | 2016-09-20 | Dow Agrosciences Llc | moléculas de ácido nucléico de rna polimerase ii33 para controlar as pragas de inseto |
BR102016005432A2 (pt) * | 2015-03-13 | 2016-09-13 | Dow Agrosciences Llc | moléculas de ácido nucléico de rna polimerase ii33 para controlar as pragas de inseto |
KR102197507B1 (ko) | 2015-07-13 | 2020-12-31 | 피벗 바이오, 인크. | 식물 형질 개선을 위한 방법 및 조성물 |
HUE057364T2 (hu) * | 2015-07-30 | 2022-05-28 | Monsanto Technology Llc | Új rovargátló fehérjék |
CN116003550A (zh) | 2015-08-06 | 2023-04-25 | 先锋国际良种公司 | 植物来源的杀昆虫蛋白及其使用方法 |
WO2017034853A1 (en) * | 2015-08-25 | 2017-03-02 | Dow Agrosciences Llc | Irdig17912 insecticidal cry toxins |
US11479516B2 (en) | 2015-10-05 | 2022-10-25 | Massachusetts Institute Of Technology | Nitrogen fixation using refactored NIF clusters |
US10612037B2 (en) | 2016-06-20 | 2020-04-07 | Monsanto Technology Llc | Insecticidal proteins toxic or inhibitory to hemipteran pests |
US11447531B2 (en) | 2016-10-21 | 2022-09-20 | Vestaron Corporation | Cleavable peptides and insecticidal and nematicidal proteins comprising same |
AU2018207204B2 (en) | 2017-01-12 | 2023-11-30 | Pivot Bio, Inc. | Methods and compositions for improving plant traits |
KR20200088342A (ko) | 2017-10-25 | 2020-07-22 | 피벗 바이오, 인크. | 질소를 고정하는 유전자조작 미생물을 개선하는 방법 및 조성물 |
WO2019103986A1 (en) * | 2017-11-27 | 2019-05-31 | Bayer Cropscience Lp | Microtiter plates designed for high-throughput screening of piercing-sucking pests such as arthropods |
KR20200123144A (ko) | 2018-02-22 | 2020-10-28 | 지머젠 인코포레이티드 | 바실러스가 농축된 게놈 라이브러리를 생성하고 새로운 cry 독소를 동정하기 위한 방법 |
JP2021514643A (ja) | 2018-03-02 | 2021-06-17 | ザイマージェン インコーポレイテッド | 殺虫タンパク質発見プラットフォームおよびそこから発見される殺虫タンパク質 |
CN112739668A (zh) | 2018-06-27 | 2021-04-30 | 皮沃特生物股份有限公司 | 包括重构固氮微生物的农业组合物 |
CN116676304B (zh) * | 2023-07-20 | 2023-09-26 | 隆平生物技术(海南)有限公司 | 转基因玉米事件lp016-1及其检测方法 |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0413019B1 (en) | 1989-02-24 | 2001-10-04 | Monsanto Technology LLC | Synthetic plant genes and method for preparation |
US5849870A (en) * | 1993-03-25 | 1998-12-15 | Novartis Finance Corporation | Pesticidal proteins and strains |
US5322687A (en) | 1993-07-29 | 1994-06-21 | Ecogen Inc. | Bacillus thuringiensis cryet4 and cryet5 toxin genes and proteins toxic to lepidopteran insects |
US5723440A (en) * | 1995-06-07 | 1998-03-03 | Mycogen Corporation | Controlling hemipteran insect pests with Bacillus thuringiensis |
US6093695A (en) * | 1996-09-26 | 2000-07-25 | Monsanto Company | Bacillus thuringiensis CryET29 compositions toxic to coleopteran insects and ctenocephalides SPP |
US6063597A (en) | 1997-12-18 | 2000-05-16 | Monsanto Company | Polypeptide compositions toxic to coleopteran insects |
EP1263281B1 (en) * | 2000-02-29 | 2012-04-11 | Auburn University | Multiple gene expression for engineering novel pathways and hyperexpression of foreign proteins in plants |
-
2006
- 2006-08-30 DE DE602006021318T patent/DE602006021318D1/de active Active
- 2006-08-30 WO PCT/US2006/033867 patent/WO2007027776A2/en active Application Filing
- 2006-08-30 AU AU2006284856A patent/AU2006284856B2/en not_active Ceased
- 2006-08-30 AT AT06813950T patent/ATE505548T1/de active
- 2006-08-30 US US12/064,840 patent/US9121035B2/en active Active
- 2006-08-30 EP EP06813950A patent/EP1920060B1/en active Active
- 2006-08-30 CN CNA2006800404600A patent/CN101300353A/zh active Pending
- 2006-08-30 BR BRPI0615649-5A patent/BRPI0615649A2/pt not_active IP Right Cessation
- 2006-08-30 MX MX2008002801A patent/MX2008002801A/es unknown
- 2006-08-30 CA CA002618430A patent/CA2618430A1/en not_active Abandoned
- 2006-08-30 RU RU2008112187/13A patent/RU2008112187A/ru not_active Application Discontinuation
- 2006-08-31 AR ARP060103814A patent/AR057515A1/es unknown
-
2008
- 2008-02-13 ZA ZA200801477A patent/ZA200801477B/xx unknown
- 2008-02-26 EC EC2008008226A patent/ECSP088226A/es unknown
- 2008-02-27 CR CR9769A patent/CR9769A/es unknown
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102993282A (zh) * | 2012-11-19 | 2013-03-27 | 北京大北农科技集团股份有限公司 | 杀虫蛋白质、其编码基因及用途 |
CN102993282B (zh) * | 2012-11-19 | 2014-10-15 | 北京大北农科技集团股份有限公司 | 杀虫蛋白质、其编码基因及用途 |
CN107406850A (zh) * | 2014-12-16 | 2017-11-28 | 美国陶氏益农公司 | 用于控制半翅目害虫的KRUPPEL基因的亲代RNAi抑制 |
CN108882716A (zh) * | 2015-11-02 | 2018-11-23 | 孟山都技术公司 | 棉花转基因事件mon 88702以及其检测和使用方法 |
CN108882716B (zh) * | 2015-11-02 | 2021-12-28 | 孟山都技术公司 | 棉花转基因事件mon 88702以及其检测和使用方法 |
US11286499B2 (en) | 2015-11-02 | 2022-03-29 | Monsanto Technology Llc | Cotton transgenic event MON 88702 and methods for detection and uses thereof |
Also Published As
Publication number | Publication date |
---|---|
WO2007027776A2 (en) | 2007-03-08 |
WO2007027776A3 (en) | 2007-08-02 |
BRPI0615649A2 (pt) | 2011-05-24 |
DE602006021318D1 (de) | 2011-05-26 |
AR057515A1 (es) | 2007-12-05 |
ECSP088226A (es) | 2008-03-26 |
EP1920060B1 (en) | 2011-04-13 |
US9121035B2 (en) | 2015-09-01 |
ATE505548T1 (de) | 2011-04-15 |
MX2008002801A (es) | 2008-04-07 |
CA2618430A1 (en) | 2007-03-08 |
RU2008112187A (ru) | 2009-10-10 |
CR9769A (es) | 2008-08-06 |
AU2006284856A1 (en) | 2007-03-08 |
AU2006284856B2 (en) | 2011-06-02 |
EP1920060A2 (en) | 2008-05-14 |
ZA200801477B (en) | 2009-09-30 |
US20090068159A1 (en) | 2009-03-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101300353A (zh) | 用于制备抗虫转基因植物的杀虫组合物和方法 | |
CN101686705B (zh) | 来自Bacillus thuringiensis的半翅目和鞘翅目活性的毒素蛋白 | |
KR102238620B1 (ko) | 신규한 곤충 저해 단백질 | |
CN101268094A (zh) | 编码杀虫蛋白的核苷酸序列 | |
WO2011084622A1 (en) | Combined use of cry1ca and cry1ab proteins for insect resistance management | |
CN107109417B (zh) | 新型昆虫抑制性蛋白 | |
AU2011343472B2 (en) | Combined use of Vip3Ab and CrylAb for management of resistance insects | |
CN109952024B (zh) | 新型昆虫抑制蛋白 | |
CN110678067B (zh) | 新型昆虫抑制蛋白 | |
CN107849571B (zh) | 新型昆虫抑制蛋白 | |
EP3445160A1 (en) | Combination of four vip and cry protein toxins for management of insect pests in plants | |
JP5913124B2 (ja) | サトウキビでのCry抵抗性のシュガーケーンボーラーの防除および昆虫抵抗性管理のためのCRY1FaおよびCRY1Abタンパク質の併用 | |
CN117616117A (zh) | 新型昆虫抑制蛋白 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication |
Open date: 20081105 |