US20190203267A1 - Detection of microorganisms in food samples and food processing facilities - Google Patents
Detection of microorganisms in food samples and food processing facilities Download PDFInfo
- Publication number
- US20190203267A1 US20190203267A1 US15/927,958 US201815927958A US2019203267A1 US 20190203267 A1 US20190203267 A1 US 20190203267A1 US 201815927958 A US201815927958 A US 201815927958A US 2019203267 A1 US2019203267 A1 US 2019203267A1
- Authority
- US
- United States
- Prior art keywords
- food
- sample
- microorganism
- sequencing
- assay
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 235000013305 food Nutrition 0.000 title claims abstract description 252
- 244000005700 microbiome Species 0.000 title claims abstract description 148
- 238000012545 processing Methods 0.000 title description 83
- 238000001514 detection method Methods 0.000 title description 32
- 238000012163 sequencing technique Methods 0.000 claims abstract description 138
- 238000000034 method Methods 0.000 claims abstract description 104
- 230000007613 environmental effect Effects 0.000 claims abstract description 62
- 241000607142 Salmonella Species 0.000 claims abstract description 33
- 241000186781 Listeria Species 0.000 claims abstract description 29
- 241000589876 Campylobacter Species 0.000 claims abstract description 24
- 241000588722 Escherichia Species 0.000 claims abstract description 21
- 150000007523 nucleic acids Chemical group 0.000 claims description 108
- 238000003556 assay Methods 0.000 claims description 74
- 238000006243 chemical reaction Methods 0.000 claims description 43
- 238000003860 storage Methods 0.000 claims description 19
- 239000011148 porous material Substances 0.000 claims description 16
- 244000144977 poultry Species 0.000 claims description 15
- 235000013601 eggs Nutrition 0.000 claims description 10
- 235000013311 vegetables Nutrition 0.000 claims description 9
- 241000589875 Campylobacter jejuni Species 0.000 claims description 8
- 230000003321 amplification Effects 0.000 claims description 8
- 238000003199 nucleic acid amplification method Methods 0.000 claims description 8
- 241000251468 Actinopterygii Species 0.000 claims description 7
- 238000012070 whole genome sequencing analysis Methods 0.000 claims description 7
- 235000013399 edible fruits Nutrition 0.000 claims description 5
- 235000020989 red meat Nutrition 0.000 claims description 5
- 241000282898 Sus scrofa Species 0.000 claims description 4
- 238000012258 culturing Methods 0.000 claims description 4
- 238000003753 real-time PCR Methods 0.000 claims description 4
- 238000007480 sanger sequencing Methods 0.000 claims description 4
- 241000589877 Campylobacter coli Species 0.000 claims description 3
- 241000589986 Campylobacter lari Species 0.000 claims description 3
- FEWJPZIEWOKRBE-JCYAYHJZSA-N Dextrotartaric acid Chemical compound OC(=O)[C@H](O)[C@@H](O)C(O)=O FEWJPZIEWOKRBE-JCYAYHJZSA-N 0.000 claims description 3
- 235000021374 legumes Nutrition 0.000 claims description 3
- 238000003906 pulsed field gel electrophoresis Methods 0.000 claims description 3
- 238000012286 ELISA Assay Methods 0.000 claims description 2
- 244000052769 pathogen Species 0.000 abstract description 40
- 230000001717 pathogenic effect Effects 0.000 abstract description 38
- 244000000010 microbial pathogen Species 0.000 abstract description 34
- 230000001052 transient effect Effects 0.000 abstract description 5
- 244000078673 foodborn pathogen Species 0.000 abstract description 2
- 239000000523 sample Substances 0.000 description 222
- 102000039446 nucleic acids Human genes 0.000 description 61
- 108020004707 nucleic acids Proteins 0.000 description 61
- 241000894006 Bacteria Species 0.000 description 40
- 241000588724 Escherichia coli Species 0.000 description 38
- 210000004027 cell Anatomy 0.000 description 35
- 238000004458 analytical method Methods 0.000 description 30
- 239000011324 bead Substances 0.000 description 30
- 206010012735 Diarrhoea Diseases 0.000 description 24
- 238000013461 design Methods 0.000 description 24
- 238000003752 polymerase chain reaction Methods 0.000 description 24
- 230000035945 sensitivity Effects 0.000 description 23
- 239000000203 mixture Substances 0.000 description 21
- 230000000670 limiting effect Effects 0.000 description 20
- 230000008569 process Effects 0.000 description 19
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 18
- 238000004590 computer program Methods 0.000 description 17
- 239000000047 product Substances 0.000 description 17
- 241000894007 species Species 0.000 description 17
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 17
- 238000004422 calculation algorithm Methods 0.000 description 16
- 238000002360 preparation method Methods 0.000 description 16
- 239000003153 chemical reaction reagent Substances 0.000 description 15
- 238000005070 sampling Methods 0.000 description 15
- 108020004414 DNA Proteins 0.000 description 14
- 102000053602 DNA Human genes 0.000 description 14
- 238000010801 machine learning Methods 0.000 description 14
- 235000013594 poultry meat Nutrition 0.000 description 14
- 208000019331 Foodborne disease Diseases 0.000 description 13
- 206010047700 Vomiting Diseases 0.000 description 13
- 238000012360 testing method Methods 0.000 description 13
- 230000008673 vomiting Effects 0.000 description 13
- 241001465754 Metazoa Species 0.000 description 12
- 206010037660 Pyrexia Diseases 0.000 description 12
- 238000011109 contamination Methods 0.000 description 12
- 235000013372 meat Nutrition 0.000 description 12
- 108090000623 proteins and genes Proteins 0.000 description 12
- 241000287828 Gallus gallus Species 0.000 description 11
- 230000001580 bacterial effect Effects 0.000 description 11
- 235000013330 chicken meat Nutrition 0.000 description 11
- 108091092878 Microsatellite Proteins 0.000 description 10
- 208000007101 Muscle Cramp Diseases 0.000 description 10
- 238000004891 communication Methods 0.000 description 10
- 208000015181 infectious disease Diseases 0.000 description 10
- 238000004519 manufacturing process Methods 0.000 description 10
- 239000008188 pellet Substances 0.000 description 10
- 229920002477 rna polymer Polymers 0.000 description 10
- 239000006228 supernatant Substances 0.000 description 10
- 238000012706 support-vector machine Methods 0.000 description 10
- 235000013336 milk Nutrition 0.000 description 9
- 239000008267 milk Substances 0.000 description 9
- 210000004080 milk Anatomy 0.000 description 9
- 241000588914 Enterobacter Species 0.000 description 8
- 241001646719 Escherichia coli O157:H7 Species 0.000 description 8
- 206010028813 Nausea Diseases 0.000 description 8
- 230000003187 abdominal effect Effects 0.000 description 8
- 244000052616 bacterial pathogen Species 0.000 description 8
- 230000008693 nausea Effects 0.000 description 8
- 230000000737 periodic effect Effects 0.000 description 8
- 102000040430 polynucleotide Human genes 0.000 description 8
- 108091033319 polynucleotide Proteins 0.000 description 8
- 238000007894 restriction fragment length polymorphism technique Methods 0.000 description 8
- 239000000243 solution Substances 0.000 description 8
- 239000000126 substance Substances 0.000 description 8
- 108020004635 Complementary DNA Proteins 0.000 description 7
- 241000881810 Enterobacter asburiae Species 0.000 description 7
- 108090000790 Enzymes Proteins 0.000 description 7
- 102000004190 Enzymes Human genes 0.000 description 7
- 241000607477 Yersinia pseudotuberculosis Species 0.000 description 7
- 238000010804 cDNA synthesis Methods 0.000 description 7
- 238000013145 classification model Methods 0.000 description 7
- 239000002299 complementary DNA Substances 0.000 description 7
- 230000006870 function Effects 0.000 description 7
- 239000002773 nucleotide Substances 0.000 description 7
- 125000003729 nucleotide group Chemical group 0.000 description 7
- 235000014102 seafood Nutrition 0.000 description 7
- 238000012549 training Methods 0.000 description 7
- 241000588923 Citrobacter Species 0.000 description 6
- 241000982938 Enterobacter cancerogenus Species 0.000 description 6
- 241000588697 Enterobacter cloacae Species 0.000 description 6
- 241000043309 Enterobacter hormaechei Species 0.000 description 6
- 108091028043 Nucleic acid sequence Proteins 0.000 description 6
- 241001138501 Salmonella enterica Species 0.000 description 6
- 241000607766 Shigella boydii Species 0.000 description 6
- 241000607265 Vibrio vulnificus Species 0.000 description 6
- 241000700605 Viruses Species 0.000 description 6
- 210000004369 blood Anatomy 0.000 description 6
- 239000008280 blood Substances 0.000 description 6
- 235000019688 fish Nutrition 0.000 description 6
- 230000002068 genetic effect Effects 0.000 description 6
- 239000002157 polynucleotide Substances 0.000 description 6
- 230000037452 priming Effects 0.000 description 6
- 208000024891 symptom Diseases 0.000 description 6
- 230000004568 DNA-binding Effects 0.000 description 5
- 241000694513 Enterobacter bugandensis Species 0.000 description 5
- 241001245440 Enterobacter kobei Species 0.000 description 5
- 241001217893 Enterobacter ludwigii Species 0.000 description 5
- 241000737206 Enterobacter soli Species 0.000 description 5
- 241000191967 Staphylococcus aureus Species 0.000 description 5
- 238000013528 artificial neural network Methods 0.000 description 5
- 235000013351 cheese Nutrition 0.000 description 5
- 238000011161 development Methods 0.000 description 5
- 201000010099 disease Diseases 0.000 description 5
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 5
- 238000000126 in silico method Methods 0.000 description 5
- 239000011159 matrix material Substances 0.000 description 5
- 210000002381 plasma Anatomy 0.000 description 5
- 229920000642 polymer Polymers 0.000 description 5
- 230000000717 retained effect Effects 0.000 description 5
- 239000002689 soil Substances 0.000 description 5
- 229910001220 stainless steel Inorganic materials 0.000 description 5
- 239000010935 stainless steel Substances 0.000 description 5
- 210000001519 tissue Anatomy 0.000 description 5
- 210000002700 urine Anatomy 0.000 description 5
- 240000002129 Malva sylvestris Species 0.000 description 4
- 235000006770 Malva sylvestris Nutrition 0.000 description 4
- 241000607768 Shigella Species 0.000 description 4
- 241000607598 Vibrio Species 0.000 description 4
- 241000607272 Vibrio parahaemolyticus Species 0.000 description 4
- 238000013459 approach Methods 0.000 description 4
- 235000015278 beef Nutrition 0.000 description 4
- 210000001185 bone marrow Anatomy 0.000 description 4
- 210000001175 cerebrospinal fluid Anatomy 0.000 description 4
- 238000013500 data storage Methods 0.000 description 4
- 230000034994 death Effects 0.000 description 4
- 231100000517 death Toxicity 0.000 description 4
- 239000003651 drinking water Substances 0.000 description 4
- 235000020188 drinking water Nutrition 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 210000000887 face Anatomy 0.000 description 4
- 210000003608 fece Anatomy 0.000 description 4
- 238000007667 floating Methods 0.000 description 4
- 230000036541 health Effects 0.000 description 4
- 238000003780 insertion Methods 0.000 description 4
- 230000037431 insertion Effects 0.000 description 4
- 230000033001 locomotion Effects 0.000 description 4
- 239000002609 medium Substances 0.000 description 4
- 238000007481 next generation sequencing Methods 0.000 description 4
- 102000054765 polymorphisms of proteins Human genes 0.000 description 4
- 210000003296 saliva Anatomy 0.000 description 4
- 235000015170 shellfish Nutrition 0.000 description 4
- 231100000765 toxin Toxicity 0.000 description 4
- 230000000007 visual effect Effects 0.000 description 4
- 208000004998 Abdominal Pain Diseases 0.000 description 3
- 241000193830 Bacillus <bacterium> Species 0.000 description 3
- 241000193403 Clostridium Species 0.000 description 3
- 238000002965 ELISA Methods 0.000 description 3
- 206010016952 Food poisoning Diseases 0.000 description 3
- 241000233866 Fungi Species 0.000 description 3
- 240000008415 Lactuca sativa Species 0.000 description 3
- 241000186779 Listeria monocytogenes Species 0.000 description 3
- 241000191940 Staphylococcus Species 0.000 description 3
- 239000000427 antigen Substances 0.000 description 3
- 108091007433 antigens Proteins 0.000 description 3
- 102000036639 antigens Human genes 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 230000002596 correlated effect Effects 0.000 description 3
- 230000000875 corresponding effect Effects 0.000 description 3
- -1 crop production Substances 0.000 description 3
- 238000012272 crop production Methods 0.000 description 3
- 238000003066 decision tree Methods 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 230000000369 enteropathogenic effect Effects 0.000 description 3
- 230000000688 enterotoxigenic effect Effects 0.000 description 3
- 230000001973 epigenetic effect Effects 0.000 description 3
- 238000011156 evaluation Methods 0.000 description 3
- 235000012041 food component Nutrition 0.000 description 3
- 210000001035 gastrointestinal tract Anatomy 0.000 description 3
- 230000012010 growth Effects 0.000 description 3
- 239000003547 immunosorbent Substances 0.000 description 3
- 238000011534 incubation Methods 0.000 description 3
- 230000000266 injurious effect Effects 0.000 description 3
- 239000010871 livestock manure Substances 0.000 description 3
- 238000007477 logistic regression Methods 0.000 description 3
- 230000011987 methylation Effects 0.000 description 3
- 238000007069 methylation reaction Methods 0.000 description 3
- DXHWIAMGTKXUEA-UHFFFAOYSA-O propidium monoazide Chemical compound C12=CC(N=[N+]=[N-])=CC=C2C2=CC=C(N)C=C2[N+](CCC[N+](C)(CC)CC)=C1C1=CC=CC=C1 DXHWIAMGTKXUEA-UHFFFAOYSA-O 0.000 description 3
- 238000007637 random forest analysis Methods 0.000 description 3
- 235000020185 raw untreated milk Nutrition 0.000 description 3
- 239000012488 sample solution Substances 0.000 description 3
- 210000002784 stomach Anatomy 0.000 description 3
- 239000003053 toxin Substances 0.000 description 3
- 108700012359 toxins Proteins 0.000 description 3
- 229920001621 AMOLED Polymers 0.000 description 2
- 208000004429 Bacillary Dysentery Diseases 0.000 description 2
- 241000193755 Bacillus cereus Species 0.000 description 2
- 241000589562 Brucella Species 0.000 description 2
- 241000193155 Clostridium botulinum Species 0.000 description 2
- 241000193468 Clostridium perfringens Species 0.000 description 2
- 108020004394 Complementary RNA Proteins 0.000 description 2
- 241000606678 Coxiella burnetii Species 0.000 description 2
- 241000989055 Cronobacter Species 0.000 description 2
- 238000001712 DNA sequencing Methods 0.000 description 2
- 241000194033 Enterococcus Species 0.000 description 2
- 208000005577 Gastroenteritis Diseases 0.000 description 2
- 206010019233 Headaches Diseases 0.000 description 2
- 235000003228 Lactuca sativa Nutrition 0.000 description 2
- 206010024641 Listeriosis Diseases 0.000 description 2
- 201000009906 Meningitis Diseases 0.000 description 2
- 241001263478 Norovirus Species 0.000 description 2
- 208000002193 Pain Diseases 0.000 description 2
- 208000005374 Poisoning Diseases 0.000 description 2
- 238000011529 RT qPCR Methods 0.000 description 2
- 208000001647 Renal Insufficiency Diseases 0.000 description 2
- 241000210647 Salmonella enterica subsp. enterica serovar Montevideo Species 0.000 description 2
- 241001135250 Salmonella enterica subsp. enterica serovar Oranienburg Species 0.000 description 2
- 241000194017 Streptococcus Species 0.000 description 2
- 241000607626 Vibrio cholerae Species 0.000 description 2
- 241000607734 Yersinia <bacteria> Species 0.000 description 2
- 241000607447 Yersinia enterocolitica Species 0.000 description 2
- 125000003275 alpha amino acid group Chemical group 0.000 description 2
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 description 2
- 239000012148 binding buffer Substances 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 2
- 239000003184 complementary RNA Substances 0.000 description 2
- 238000004883 computer application Methods 0.000 description 2
- 239000000470 constituent Substances 0.000 description 2
- 238000002790 cross-validation Methods 0.000 description 2
- 238000007405 data analysis Methods 0.000 description 2
- 230000007423 decrease Effects 0.000 description 2
- 238000007847 digital PCR Methods 0.000 description 2
- 239000012149 elution buffer Substances 0.000 description 2
- 239000012530 fluid Substances 0.000 description 2
- 239000005417 food ingredient Substances 0.000 description 2
- 239000012634 fragment Substances 0.000 description 2
- 235000011389 fruit/vegetable juice Nutrition 0.000 description 2
- 239000000446 fuel Substances 0.000 description 2
- 235000013882 gravy Nutrition 0.000 description 2
- 231100000869 headache Toxicity 0.000 description 2
- 238000009396 hybridization Methods 0.000 description 2
- 210000000987 immune system Anatomy 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 230000000968 intestinal effect Effects 0.000 description 2
- 150000002500 ions Chemical class 0.000 description 2
- 238000011901 isothermal amplification Methods 0.000 description 2
- 201000006370 kidney failure Diseases 0.000 description 2
- 239000004973 liquid crystal related substance Substances 0.000 description 2
- 239000012139 lysis buffer Substances 0.000 description 2
- 230000000813 microbial effect Effects 0.000 description 2
- 235000015097 nutrients Nutrition 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 244000045947 parasite Species 0.000 description 2
- 230000002093 peripheral effect Effects 0.000 description 2
- 231100000572 poisoning Toxicity 0.000 description 2
- 230000000607 poisoning effect Effects 0.000 description 2
- 235000013324 preserved food Nutrition 0.000 description 2
- 230000002265 prevention Effects 0.000 description 2
- 238000000513 principal component analysis Methods 0.000 description 2
- 238000011002 quantification Methods 0.000 description 2
- 125000002652 ribonucleotide group Chemical group 0.000 description 2
- 238000012502 risk assessment Methods 0.000 description 2
- 239000010979 ruby Substances 0.000 description 2
- 229910001750 ruby Inorganic materials 0.000 description 2
- 239000012146 running buffer Substances 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 201000005113 shigellosis Diseases 0.000 description 2
- 235000000346 sugar Nutrition 0.000 description 2
- 230000008685 targeting Effects 0.000 description 2
- 238000010200 validation analysis Methods 0.000 description 2
- 239000003643 water by type Substances 0.000 description 2
- 229940098232 yersinia enterocolitica Drugs 0.000 description 2
- 241000607534 Aeromonas Species 0.000 description 1
- 241000607528 Aeromonas hydrophila Species 0.000 description 1
- 208000031729 Bacteremia Diseases 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 208000003508 Botulism Diseases 0.000 description 1
- 206010051226 Campylobacter infection Diseases 0.000 description 1
- 241000700199 Cavia porcellus Species 0.000 description 1
- 241001660259 Cereus <cactus> Species 0.000 description 1
- 240000001817 Cereus hexagonus Species 0.000 description 1
- 108010077544 Chromatin Proteins 0.000 description 1
- 241000949032 Citrobacter sedlakii Species 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- 241001445332 Coxiella <snail> Species 0.000 description 1
- 208000008953 Cryptosporidiosis Diseases 0.000 description 1
- 206010011502 Cryptosporidiosis infection Diseases 0.000 description 1
- 241000223935 Cryptosporidium Species 0.000 description 1
- 241000179197 Cyclospora Species 0.000 description 1
- 206010061802 Cyclosporidium infection Diseases 0.000 description 1
- 206010012741 Diarrhoea haemorrhagic Diseases 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 241001493237 Enterobacter mori Species 0.000 description 1
- 241000059459 Escherichia coli O104 Species 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- 206010017915 Gastroenteritis shigella Diseases 0.000 description 1
- 208000032843 Hemorrhage Diseases 0.000 description 1
- 241000724675 Hepatitis E virus Species 0.000 description 1
- 241000709721 Hepatovirus A Species 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 206010061598 Immunodeficiency Diseases 0.000 description 1
- 206010022004 Influenza like illness Diseases 0.000 description 1
- 206010023126 Jaundice Diseases 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 240000004658 Medicago sativa Species 0.000 description 1
- 235000017587 Medicago sativa ssp. sativa Nutrition 0.000 description 1
- 208000010428 Muscle Weakness Diseases 0.000 description 1
- 206010028372 Muscular weakness Diseases 0.000 description 1
- 241000186359 Mycobacterium Species 0.000 description 1
- 241000186366 Mycobacterium bovis Species 0.000 description 1
- 241000187479 Mycobacterium tuberculosis Species 0.000 description 1
- 101000822797 Naja naja Long neurotoxin 5 Proteins 0.000 description 1
- 235000010676 Ocimum basilicum Nutrition 0.000 description 1
- 240000007926 Ocimum gratissimum Species 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 241000237502 Ostreidae Species 0.000 description 1
- 241000566145 Otus Species 0.000 description 1
- 208000012868 Overgrowth Diseases 0.000 description 1
- 238000002944 PCR assay Methods 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- 241000286209 Phasianidae Species 0.000 description 1
- 241000607000 Plesiomonas Species 0.000 description 1
- 241000606999 Plesiomonas shigelloides Species 0.000 description 1
- 206010036595 Premature delivery Diseases 0.000 description 1
- 208000034809 Product contamination Diseases 0.000 description 1
- 238000001190 Q-PCR Methods 0.000 description 1
- 208000004756 Respiratory Insufficiency Diseases 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- PYMYPHUHKUWMLA-LMVFSUKVSA-N Ribose Natural products OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 1
- 241000702670 Rotavirus Species 0.000 description 1
- 206010039438 Salmonella Infections Diseases 0.000 description 1
- 241001354013 Salmonella enterica subsp. enterica serovar Enteritidis Species 0.000 description 1
- 241000293869 Salmonella enterica subsp. enterica serovar Typhimurium Species 0.000 description 1
- 241000607764 Shigella dysenteriae Species 0.000 description 1
- 241000607762 Shigella flexneri Species 0.000 description 1
- 206010040550 Shigella infections Diseases 0.000 description 1
- 241000607760 Shigella sonnei Species 0.000 description 1
- 244000061456 Solanum tuberosum Species 0.000 description 1
- 235000002595 Solanum tuberosum Nutrition 0.000 description 1
- 244000300264 Spinacia oleracea Species 0.000 description 1
- 235000009337 Spinacia oleracea Nutrition 0.000 description 1
- 238000000692 Student's t-test Methods 0.000 description 1
- 241001441724 Tetraodontidae Species 0.000 description 1
- 208000025865 Ulcer Diseases 0.000 description 1
- 244000290333 Vanilla fragrans Species 0.000 description 1
- 235000009499 Vanilla fragrans Nutrition 0.000 description 1
- 235000012036 Vanilla tahitensis Nutrition 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- 208000027418 Wounds and injury Diseases 0.000 description 1
- 230000001154 acute effect Effects 0.000 description 1
- 239000003570 air Substances 0.000 description 1
- 235000013334 alcoholic beverage Nutrition 0.000 description 1
- HMFHBZSHGGEWLO-UHFFFAOYSA-N alpha-D-Furanose-Ribose Natural products OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 1
- XAGFODPZIPBFFR-UHFFFAOYSA-N aluminium Chemical compound [Al] XAGFODPZIPBFFR-UHFFFAOYSA-N 0.000 description 1
- 229910052782 aluminium Inorganic materials 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 150000001413 amino acids Chemical class 0.000 description 1
- 230000000890 antigenic effect Effects 0.000 description 1
- 230000036528 appetite Effects 0.000 description 1
- 235000019789 appetite Nutrition 0.000 description 1
- 230000001651 autotrophic effect Effects 0.000 description 1
- 235000012019 baked potatoes Nutrition 0.000 description 1
- 235000021028 berry Nutrition 0.000 description 1
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 1
- 235000013361 beverage Nutrition 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 238000001369 bisulfite sequencing Methods 0.000 description 1
- 230000000740 bleeding effect Effects 0.000 description 1
- 230000023555 blood coagulation Effects 0.000 description 1
- 235000012206 bottled water Nutrition 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- 201000004927 campylobacteriosis Diseases 0.000 description 1
- 230000006037 cell lysis Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 229940112822 chewing gum Drugs 0.000 description 1
- 235000015218 chewing gum Nutrition 0.000 description 1
- 210000003483 chromatin Anatomy 0.000 description 1
- 206010009887 colitis Diseases 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 235000009508 confectionery Nutrition 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 239000000356 contaminant Substances 0.000 description 1
- 238000010411 cooking Methods 0.000 description 1
- 239000006071 cream Substances 0.000 description 1
- 238000012864 cross contamination Methods 0.000 description 1
- 201000002641 cyclosporiasis Diseases 0.000 description 1
- 235000013365 dairy product Nutrition 0.000 description 1
- 230000006378 damage Effects 0.000 description 1
- 230000002498 deadly effect Effects 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 235000015872 dietary supplement Nutrition 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 210000003278 egg shell Anatomy 0.000 description 1
- 244000000015 environmental pathogen Species 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 239000011888 foil Substances 0.000 description 1
- 235000015219 food category Nutrition 0.000 description 1
- 235000013350 formula milk Nutrition 0.000 description 1
- 235000013611 frozen food Nutrition 0.000 description 1
- 235000012055 fruits and vegetables Nutrition 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 244000005709 gut microbiome Species 0.000 description 1
- 235000015220 hamburgers Nutrition 0.000 description 1
- 231100000206 health hazard Toxicity 0.000 description 1
- 244000000013 helminth Species 0.000 description 1
- 230000002008 hemorrhagic effect Effects 0.000 description 1
- 208000006454 hepatitis Diseases 0.000 description 1
- 231100000283 hepatitis Toxicity 0.000 description 1
- 208000005252 hepatitis A Diseases 0.000 description 1
- 238000012165 high-throughput sequencing Methods 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 208000014674 injury Diseases 0.000 description 1
- 238000009830 intercalation Methods 0.000 description 1
- 210000000936 intestine Anatomy 0.000 description 1
- 239000003621 irrigation water Substances 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- DNHVXYDGZKWYNU-UHFFFAOYSA-N lead;hydrate Chemical compound O.[Pb] DNHVXYDGZKWYNU-UHFFFAOYSA-N 0.000 description 1
- 231100000518 lethal Toxicity 0.000 description 1
- 230000001665 lethal effect Effects 0.000 description 1
- 208000019423 liver disease Diseases 0.000 description 1
- 244000144972 livestock Species 0.000 description 1
- 230000014759 maintenance of location Effects 0.000 description 1
- 230000002503 metabolic effect Effects 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 210000003097 mucus Anatomy 0.000 description 1
- 210000003205 muscle Anatomy 0.000 description 1
- 210000000653 nervous system Anatomy 0.000 description 1
- 235000008935 nutritious Nutrition 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 235000020636 oyster Nutrition 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 235000014594 pastries Nutrition 0.000 description 1
- 238000003909 pattern recognition Methods 0.000 description 1
- 230000002085 persistent effect Effects 0.000 description 1
- 239000000575 pesticide Substances 0.000 description 1
- 230000035790 physiological processes and functions Effects 0.000 description 1
- 231100000614 poison Toxicity 0.000 description 1
- 230000007096 poisonous effect Effects 0.000 description 1
- 229920001184 polypeptide Polymers 0.000 description 1
- 235000015277 pork Nutrition 0.000 description 1
- 238000012805 post-processing Methods 0.000 description 1
- 235000013613 poultry product Nutrition 0.000 description 1
- 239000003755 preservative agent Substances 0.000 description 1
- 230000002335 preservative effect Effects 0.000 description 1
- 108090000765 processed proteins & peptides Proteins 0.000 description 1
- 102000004196 processed proteins & peptides Human genes 0.000 description 1
- 238000003908 quality control method Methods 0.000 description 1
- 239000002994 raw material Substances 0.000 description 1
- 235000020995 raw meat Nutrition 0.000 description 1
- 239000011535 reaction buffer Substances 0.000 description 1
- 235000021487 ready-to-eat food Nutrition 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 230000008439 repair process Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 201000004193 respiratory failure Diseases 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 239000007320 rich medium Substances 0.000 description 1
- 235000012045 salad Nutrition 0.000 description 1
- 206010039447 salmonellosis Diseases 0.000 description 1
- 235000015067 sauces Nutrition 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 239000013049 sediment Substances 0.000 description 1
- 238000010187 selection method Methods 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 229940007046 shigella dysenteriae Drugs 0.000 description 1
- 229940115939 shigella sonnei Drugs 0.000 description 1
- 235000020183 skimmed milk Nutrition 0.000 description 1
- 239000010454 slate Substances 0.000 description 1
- 235000011888 snacks Nutrition 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000007790 solid phase Substances 0.000 description 1
- 238000003045 statistical classification method Methods 0.000 description 1
- 238000007619 statistical method Methods 0.000 description 1
- 235000013547 stew Nutrition 0.000 description 1
- 208000002254 stillbirth Diseases 0.000 description 1
- 231100000537 stillbirth Toxicity 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 238000001356 surgical procedure Methods 0.000 description 1
- 230000009747 swallowing Effects 0.000 description 1
- LZNWYQJJBLGYLT-UHFFFAOYSA-N tenoxicam Chemical compound OC=1C=2SC=CC=2S(=O)(=O)N(C)C=1C(=O)NC1=CC=CC=N1 LZNWYQJJBLGYLT-UHFFFAOYSA-N 0.000 description 1
- 229960002871 tenoxicam Drugs 0.000 description 1
- CFMYXEVWODSLAX-QOZOJKKESA-N tetrodotoxin Chemical compound O([C@@]([C@H]1O)(O)O[C@H]2[C@@]3(O)CO)[C@H]3[C@@H](O)[C@]11[C@H]2[C@@H](O)N=C(N)N1 CFMYXEVWODSLAX-QOZOJKKESA-N 0.000 description 1
- 229950010357 tetrodotoxin Drugs 0.000 description 1
- CFMYXEVWODSLAX-UHFFFAOYSA-N tetrodotoxin Natural products C12C(O)NC(=N)NC2(C2O)C(O)C3C(CO)(O)C1OC2(O)O3 CFMYXEVWODSLAX-UHFFFAOYSA-N 0.000 description 1
- 239000010409 thin film Substances 0.000 description 1
- 238000009966 trimming Methods 0.000 description 1
- 239000003656 tris buffered saline Substances 0.000 description 1
- 231100000397 ulcer Toxicity 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6888—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms
- C12Q1/689—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms for bacteria
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/1034—Isolating an individual clone by screening libraries
- C12N15/1065—Preparation or screening of tagged libraries, e.g. tagged microorganisms by STM-mutagenesis, tagged polynucleotides, gene tags
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/02—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving viable microorganisms
- C12Q1/04—Determining presence or kind of microorganism; Use of selective media for testing antibiotics or bacteriocides; Compositions containing a chemical indicator therefor
- C12Q1/10—Enterobacteria
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/02—Food
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/02—Food
- G01N33/08—Eggs, e.g. by candling
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/02—Food
- G01N33/12—Meat; Fish
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/53—Immunoassay; Biospecific binding assay; Materials therefor
- G01N33/569—Immunoassay; Biospecific binding assay; Materials therefor for microorganisms, e.g. protozoa, bacteria, viruses
- G01N33/56911—Bacteria
- G01N33/56916—Enterobacteria, e.g. shigella, salmonella, klebsiella, serratia
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/53—Immunoassay; Biospecific binding assay; Materials therefor
- G01N33/569—Immunoassay; Biospecific binding assay; Materials therefor for microorganisms, e.g. protozoa, bacteria, viruses
- G01N33/56911—Bacteria
- G01N33/56922—Campylobacter
-
- G06F19/22—
-
- G06F19/24—
-
- G06F19/28—
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B20/00—ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
- G16B20/20—Allele or variant detection, e.g. single nucleotide polymorphism [SNP] detection
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B30/00—ICT specially adapted for sequence analysis involving nucleotides or amino acids
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B40/00—ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B40/00—ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
- G16B40/20—Supervised data analysis
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B40/00—ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
- G16B40/30—Unsupervised data analysis
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B50/00—ICT programming tools or database systems specially adapted for bioinformatics
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B50/00—ICT programming tools or database systems specially adapted for bioinformatics
- G16B50/30—Data warehousing; Computing architectures
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2535/00—Reactions characterised by the assay type for determining the identity of a nucleotide base or a sequence of oligonucleotides
- C12Q2535/122—Massive parallel sequencing
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2563/00—Nucleic acid detection characterized by the use of physical, structural and functional properties
- C12Q2563/116—Nucleic acid detection characterized by the use of physical, structural and functional properties electrical properties of nucleic acids, e.g. impedance, conductivity or resistance
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2565/00—Nucleic acid analysis characterised by mode or means of detection
- C12Q2565/60—Detection means characterised by use of a special device
- C12Q2565/631—Detection means characterised by use of a special device being a biochannel or pore
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/154—Methylation markers
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A50/00—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE in human health protection, e.g. against extreme weather
- Y02A50/30—Against vector-borne diseases, e.g. mosquito-borne, fly-borne, tick-borne or waterborne diseases whose impact is exacerbated by climate change
Definitions
- the disclosure provides a method comprising: (a) detecting a presence or an absence of a non-pathogenic microorganism in a sample; (b) predicting, by a computer system, a presence or an absence of a pathogenic microorganism in said sample based on said presence or said absence of said non-pathogenic microorganism.
- said predicting is performed by a machine learning algorithm in a computer, such as a support vector machine (SVM), a Naive Bayes classification, a random forest, Logistic Regression, and a neural network.
- said sample is a food sample or an environmental sample associated with said food sample.
- said food sample is a perishable, such as a meat, a poultry, a red meat, a fish, a swine, a fruit, an egg, a vegetable, a produce, or a legume.
- said environmental sample is a surface swab or a surface rinse of said environment.
- said environmental sample is a food storage container, a food handling equipment, or a piece of clothing from a worker of said environment associated with said food processing facility.
- said sample is a non-food sample.
- said sample comprises blood, plasma, urine, tissue, faces, bone marrow, saliva or cerebrospinal fluid.
- said non-pathogenic microorganism is selected from the group consisting of: Enterobacter asburiae, Enterobacter bugandensis, Enterobacter cancerogenus, Enterobacter cloacae, Enterobacter endosymbiont, Enterobacter hormaechei, Enterobacter kobei, Enterobacter ludwigii, Enterobacter mori , and Enterobacter soli .
- said pathogenic microorganism is selected from the group consisting of: a microorganism of the Salmonella genus, a microorganism of the Campylobacter genus, a microorganism of the Listeria genus, and a microorganism of the Escherichia genus.
- said pathogenic microorganism is selected from the group consisting of Vibrio parahaemolyticus, Vibrio cholera, Vibrio vulnificus, Escherichia coli, Salmonella enterica, Shigella boydii, Campylobacter jejuni, Staphylococcus aureus, Listeria monocytogenes, Clostridium botulinum, Yersinia pseudotuberculosis, Clostridium perfringens, Yersinia enterocolitica, Coxiella burnetii, Yersinia pseudotuberculosis, Vibrio parahaemolyticus, Bacillus cereus, Mycobacterium tuberculosis, Shigella flexneri, Shigella boydii, Shigella dysenteriae , and Shigella sonnei .
- said detecting comprises a nucleic acid characterization assay selected from the group consisting of a pore sequencing reaction, a next generation sequencing reaction, a shotgun next generation sequencing, Sanger sequencing, or hybridization assay.
- the method further comprises performing an assay to confirm the prediction of (b), such as a polymerase chain reaction (PCR) assay, an enzyme-linked immunosorbent (ELISA) assay, or an enzyme-linked fluorescent assay (ELFA) assay.
- PCR polymerase chain reaction
- ELISA enzyme-linked immunosorbent
- ELFA enzyme-linked fluorescent assay
- the disclosure provides a method comprising: (a) sequencing a plurality of nucleic acid sequences from a food sample or from an environmental sample associated with said food sample for a period of time; and (b) performing an assay on said food sample or said environment associated with said food sample if said sequencing for said period of time identifies a threshold level of nucleic acid sequences from a microorganism in said food sample.
- said period of time is less than 30 minutes.
- said threshold is no more than 0.1% of said nucleic acid sequences from said microorganism.
- the method further comprises performing an amplification reaction on said plurality of nucleic acid sequences prior to sequencing.
- said sequencing is a pore sequencing reaction.
- said assay is a serotyping assay, a culturing assay, a Pulse Field Gel Electrophoresis (PFGE) assay, a RiboPrinter® assay, a q-PCR assay, a Sanger sequencing assay, an ELISA assay, a Whole Genome Sequencing (WGS) assay, a targeted sequencing assay, or a shotgun metagenomics assay.
- PFGE Pulse Field Gel Electrophoresis
- RiboPrinter® RiboPrinter® assay
- q-PCR assay q-PCR assay
- Sanger sequencing assay an ELISA assay
- WGS Whole Genome Sequencing
- said microorganism is selected from the group consisting of: a microorganism of the Salmonella genus, a microorganism of the Campylobacter genus, a microorganism of the Listeria genus, and a microorganism of the Escherichia genus.
- said microorganism of the Salmonella genus has a serotype selected from the group consisting of: Enteritidis, Typhimurium, Newport, Javiana, Infantis, Montevideo, Heidelberg, Muenchen, Saintpaul, Oranienburg, Braenderup, Paratyphi B var. L(+) Tartrate+, Agona, Thompson, and Kentucky.
- said microorganism of the Escherichia genus has a serotype selected from the group consisting of: O103, O111, O121, O145, O26, O45, and O157.
- said microorganism of the Listeria genus has a serotype selected from the group consisting of: 2a, 1/2b, 1/2c, 3a, 3b, 3c, 4a, 4b, 4ab, 4c, 4d, and 4e.
- said microorganism of the Campylobacter genus is C. jejunis, C. lari , or C. coli .
- said food sample is a meat, a poultry, a red meat, a fish, or a swine, a fruit, an egg, a vegetable, a produce or a legume.
- said environmental sample is a surface swab of said environment, a surface rinse of said environment, a food storage container, a food handling equipment, or a piece of clothing from a worker of said environment associated with said food sample.
- FIG. 1 illustrates the deploying of a sequencing assay 101 to one or more food processing facilities, food testing lab, or any other diagnostic lab 102 for performing a sequencing reaction of a food sample or of an environmental sample from said food processing facilities such as, for example, soil, water, air, animal product(s), feed, manure, crop production, or any sample associated with a manufacturing plant.
- a sequencing assay 101 to one or more food processing facilities, food testing lab, or any other diagnostic lab 102 for performing a sequencing reaction of a food sample or of an environmental sample from said food processing facilities such as, for example, soil, water, air, animal product(s), feed, manure, crop production, or any sample associated with a manufacturing plant.
- FIG. 2 ( FIG. 2 ): illustrates a transmission of an electronic communication comprising a data set associated with a sequencing reaction from one or more food processing facilities to a server.
- FIG. 3 ( FIG. 3 ): is a chart illustrating that a redundancy in genetic markers decreases a false negative rate of a method of the disclosure.
- FIG. 4 ( FIG. 4 ): illustrates a process for predictive risk assessment based on a detection of a non-pathogenic microorganism.
- FIG. 5 ( FIG. 5 ): is a heat map illustrating predictive pathogen detection through machine learning.
- FIG. 6 ( FIG. 6 ): illustrates a process for predicting a shelf-life of a food based on the detection of a microorganism.
- FIG. 7 ( FIG. 7 ): is a diagram illustrating the tunable resolution of various assays.
- FIG. 8 ( FIG. 8 ): is a schematic illustrating various serotypes of various microorganisms that can be detected by an analysis of a plurality of nucleic acid sequences as described herein and further validated with a serotyping assay.
- FIG. 9 is a schematic illustrating one process for distinguishing a live microorganism from a food or from an environmental sample.
- FIG. 10 ( FIG. 10 ): illustrates a process for re-using flow cells with distinct indexes.
- FIG. 11 ( FIG. 11 ): illustrates an automated sequencing apparatus of the disclosure.
- FIG. 12 ( FIG. 12 ): illustrates a sequencing process with no human touch points after enrichment.
- FIG. 13 ( FIG. 13 ): illustrates the PMAxx-induced removal of free-floating DNA.
- FIG. 14 ( FIG. 14 ): illustrates a priming port in a flow cell.
- FIG. 15 ( FIG. 15 ): illustrates a dispensing of a loading library on a flow cell.
- FIG. 16 ( FIG. 16 ): illustrates the simultaneous targeting of multiple pathogens.
- FIG. 17 ( FIG. 17 ): illustrates the in silico prediction of primer sensitivity/specificity.
- FIG. 18 ( FIG. 18 ): illustrates the reuse of MinION/GridION flow cells.
- FIG. 19 ( FIG. 19 ): illustrates the number of reads per sample during reuse of MinION/GridION flow cells.
- FIG. 20 illustrates the performance of the disclosed automated handling system on samples spiked with 10 different Salmonella serotypes ( Enteritidis, Thyphimurium, 14_[5]_12:i:, Newport, Javiana, Infantis, Montevideo, Heidelberg, Muenchen ).
- FIG. 21 ( FIG. 21 ): illustrates a principal component analysis to chicken wing chicken data sets.
- FIG. 22 ( FIG. 22 ): illustrates a principal component analysis to ground chicken data sets.
- FIG. 23 ( FIG. 23 ): illustrates periodic and nonperiodic barcode designs.
- FIG. 24 ( FIG. 24 ): illustrates a principle component analysis of Listeria sequences identifying clusters of closely related bacteria which likely originated from the same source.
- Food safety is a complex issue that has an impact on multiple segments of society.
- a food is considered to be adulterated if it contains: (1) a poisonous or otherwise harmful substance that is not an inherent natural constituent of the food itself, in an amount that poses a reasonable possibility of injury to health, or (2) a substance that is an inherent natural constituent of the food itself; is not the result of environmental, agricultural, industrial, or other contamination; and is present in an amount that ordinarily renders the food injurious to health.
- the first includes, for example, a pathogenic bacterium, fungus, parasite or virus, if the amount present in the food may be injurious to health.
- microorganisms can contaminate foods, and there are many different foodborne infections. Although our scientific understanding of pathogenic microorganisms and their toxins is continually advancing, some of the most common microorganisms associated with foodborne illnesses include microorganisms of the Salmonella, Campylobacter, Listeria , and Escherichia genus.
- Salmonella for example is widely dispersed in nature. It can colonize the intestinal tracts of vertebrates, including livestock, wildlife, domestic pets, and humans, and may also live in environments such as pond-water sediment. It is spread through the fecal-oral route and through contact with contaminated water. (Certain protozoa may act as a reservoir for the organism). It may, for example, contaminate poultry, red meats, farm-irrigation water (thereby contaminating produce in the field), soil and insects, factory equipment, hands, and kitchen surfaces and utensils.
- Campylobacter jejuni is estimated to be the third leading bacterial cause of foodborne illness in the U.S.
- the symptoms this bacterium causes generally last from 2 to 10 days and, while the diarrhea (sometimes bloody), vomiting, and cramping are unpleasant, they usually go away by themselves in people who are otherwise healthy.
- Raw poultry, unpasteurized (“raw”) milk and cheeses made from it, and contaminated water (for example, unchlorinated water, such as in streams and ponds) are major sources, but C. jejuni also occurs in other kinds of meats and has been found in seafood and vegetables.
- this bacterium is one of the leading causes of death from foodborne illness. It can cause two forms of disease. One can range from mild to intense symptoms of nausea, vomiting, aches, fever, and, sometimes, diarrhea, and usually goes away by itself. The other, more deadly, form occurs when the infection spreads through the bloodstream to the nervous system (including the brain), resulting in meningitis and other potentially fatal problems.
- Escherichia microorganisms are also diverse in nature. For instance, at least four groups of pathogenic Escherichia coli have been identified: a) Enterotoxigenic Escherichia coli (ETEC), b) Enteropathogenic Escherichia coli (EPEC), c) Enterohemorrhagic Escherichia coli (EHEC), and Enteroinvasive Escherichia coli (EIEC). While ETEC is generally associated with traveler's diarrhea some members of the EHEC group, such as E. coli 0157:H7, can cause bloody diarrhea, blood-clotting problems, kidney failure, and death. Thus, it is important to be able not only to identify individual microorganism, but also to distinguish them.
- ETEC Enterotoxigenic Escherichia coli
- EPEC Enteropathogenic Escherichia coli
- EHEC Enterohemorrhagic Escherichia coli
- EIEC Enteroinvasive Escherichia
- the disclosure solves existing challenges encountered in identifying food borne pathogens, including pathogens of the Salmonella, Campylobacter, Listeria , and Escherichia genus in a timely and efficient manner.
- the disclosure also provides methods for differentiating a transient versus a resident pathogen, correlating presence of non-pathogenic with pathogenic microorganisms, and distinguishing live versus dead microorganisms by sequencing, amongst others.
- the term “food processing facility” includes facilities that manufacture, process, pack, or hold food in any location globally.
- a food processing facility can, for example, determine the location and source of an outbreak of food-borne illness or a potential bioterrorism incident.
- the term “food” includes any nutritious substance that people or animals eat or drink, or that plants absorb, in order to maintain life and growth.
- foods include red meat, poultry, fruits, vegetables, fish, pork, seafood, dairy products, eggs, egg shells, raw agricultural commodities for use as food or components of food, canned foods, frozen foods, bakery goods, snack food, candy (including chewing gum), dietary supplements and dietary ingredients, infant formula, beverages (including alcoholic beverages and bottled water), animal feeds and pet food, and live food animals.
- the term environmental sample includes a surface swab of a food contact substance, a surface rinse of a food contact substance, a food storage container, a food handling equipment, a piece of clothing from a subject in contact with a food processing facility, or another suitable sample from a food processing facility.
- sample as used herein, generally refers to any sample that can be informative of an environment or a food, such as a sample that comprises soil, water, water quality, air, animal production, feed, manure, crop production, manufacturing plants, environmental samples or food samples directly.
- sample may also refer to other non-food sample, such as samples derived from a subject, such as comprise blood, plasma, urine, tissue, faces, bone marrow, saliva or cerebrospinal fluid. Such samples may be derived from a hospital or a clinic.
- the term “subject,” can refer to a human or to another animal.
- An animal can be a mouse, a rat, a guinea pig, a dog, a cat, a horse, a rabbit, and various other animals.
- a subject can be of any age, for example, a subject can be an infant, a toddler, a child, a pre-adolescent, an adolescent, an adult, or an elderly individual.
- disease generally refers to conditions associated with the presence of a microorganism in a food, e.g., outbreaks or incidents of foodborne disease.
- nucleic acid or “polynucleotide,” as used herein, refers to a polymeric form of nucleotides of any length, either ribonucleotides or deoxyribonucleotides.
- Polynucleotides include sequences of deoxyribonucleic acid (DNA), ribonucleic acid (RNA), or DNA copies of ribonucleic acid (cDNA).
- polyribonucleotide generally refers to polynucleotide polymers that comprise ribonucleic acids. The term also refers to polynucleotide polymers that comprise chemically modified ribonucleotides.
- a polyribonucleotide can be formed of D-ribose sugars, which can be found in nature, and L-ribose sugars, which are not found in nature.
- polypeptides generally refers to polymer chains comprised of amino acid residue monomers which are joined together through amide bonds (peptide bonds).
- the amino acids may be the L-optical isomer or the D-optical isomer.
- barcode generally refers to a label, or identifier, that conveys or is capable of conveying information about one or more nucleic acid sequences from a food sample or from an environmental sample associated with said food sample.
- a barcode can be part of a nucleic acid sequence.
- a barcode can be independent of a nucleic acid sequence.
- a barcode can be a tag attached to a nucleic acid molecule.
- a barcode can have a variety of different formats.
- barcodes can include: polynucleotide barcodes; random nucleic acid and/or amino acid sequences; and synthetic nucleic acid and/or amino acid sequences.
- a barcode can be added to, for example, a fragment of a deoxyribonucleic acid (DNA) or ribonucleic acid (RNA) sample before, during, and/or after sequencing of the sample. Barcodes can allow for identification and/or quantification of individual sequencing-reads. Examples of such barcodes and uses thereof, as may be used with methods, apparatus and systems of the present disclosure, are provided in U.S. Patent Pub. No. 2016/0239732, which is entirely incorporated herein by reference.
- a “molecular index” can either be a barcode itself or it can be a building block, i.e., a component or portion of a larger barcode.
- sequencing generally refers to methods and technologies for determining the sequence of nucleotide bases in one or more nucleic acid polymers, i.e., polynucleotides. Sequencing can be performed by various systems currently available, such as, without limitation, a sequencing system by Illumina®, Pacific Biosciences (PacBio®), Oxford Nanopore®, Genia (Roche) or Life Technologies (Ion Torrent®). Alternatively or in addition, sequencing may be performed using nucleic acid amplification, polymerase chain reaction (PCR) (e.g., digital PCR, quantitative PCR, or real time PCR), or isothermal amplification.
- PCR polymerase chain reaction
- Such systems may provide a plurality of raw data corresponding to the genetic information associated with a food sample or an environmental sample.
- such systems provide nucleic acid sequences (also “reads” or “sequencing reads” herein).
- the term also refers to epigenetics which is the study of heritable changes in gene function that do not involve changes in the DNA sequence.
- a read may include a string of nucleic acid bases corresponding to a sequence of a nucleic acid molecule that has been sequenced.
- pathogenic microorganisms including pathogens of the Salmonella, Campylobacter, Listeria , and Escherichia genus.
- foods that have been associated with such outbreaks include milk, cheeses, vegetables, meats (notably beef and poultry), fish, seafood, and many others.
- Potential contamination sources for various pathogens include raw materials, food workers, incoming air, water, and food processing environments. Among those, post-processing contamination at food-contact surfaces in a food processing facility poses a great threat to product contamination.
- the disclosure provides a method for the identification of a microorganism associated with a food or with a food processing facility.
- the method comprises deploying an assay to one or more food processing facilities; performing a sequencing reaction of a food sample or of an environmental sample from said one or more food processing facilities; transmitting an electronic communication comprising a data set associated with said sequencing reaction of said food sample or of said environmental sample from said one or more food processing facilities to a server; and scanning, by a computer, at least a fraction of said transmitted data set for one or more genes associated with a microorganism.
- the scanning scans fewer than 1%, fewer than 0.1%, fewer than 0.001% of said transmitted data set for one or more genes associated with said microorganism.
- Said scanning can be performed to identify a variety of polymorphic gene regions (comprising SNP's, RFLP's, STRs, VNTR's, hypervariable regions, minisatellites, dinucleotide repeats, trinucleotide repeats, tetranucleotide repeats, simple sequence repeats, indels, and insertion elements) associated with a wide diversity of microorganisms.
- the variety of polymorphic regions to be searched for can be determined by creating a large database of sequences from dozens, hundreds and thousands of food and environmental samples.
- a database of such polymorphic regions can be constructed by performing sequencing reactions on at least 5,000, at least 10,000, at least 15,000, at least 20,000, at least 25,000, at least 30,000, at least 35,000, at least 40,000, at least 45,000, at least 50,000 different food or environmental samples.
- the sequences obtained can be used to compile information in a database that includes: a) the composition of each sample; and b) the presence or absence of a variety of pathogenic and non-pathogenic organisms associated on each sample.
- databases comprise data from polymorphic gene regions of a variety of strains that are variants of a single species.
- a plurality of sequences in the database might correspond to one or more serovars, morphovars, biovars, or other strain specific information.
- a variety of sequencing techniques such as a pore sequencing reaction, a next generation sequencing reaction, a shotgun next generation sequencing, or Sanger sequencing can be used to create a collection of polymorphic regions.
- said sequencing reaction is a pore sequencing reaction and said pore sequencing reaction distinguishes an epigenetic pattern on a nucleic acid from said food sample or from said environmental sample.
- said microorganism may be pre-selected by a customer.
- a customer can be an individual or an entity, such as one or more food processing facilities.
- a customer can be a food packaging facility; a food distribution center; a food storage center; a facilities handling meat, poultry, egg, or another edible product; a farm; a retail food establishment; a fishing vessel; or another type of facility that also manufactures, processes, packs, or holds foods for any period of time.
- a customer may pre-select a microorganism of interest to be identified with any of the methods disclosed herein.
- raw or undercooked ground beef and beef products are vehicles often implicated in E. coli O157:H7 outbreaks.
- Produce, including bagged lettuce, spinach, and alfalfa sprouts, are also increasingly being implicated in E. coli O157:H7 outbreaks.
- a food processing facility producing raw meats or other produce associated with E. coli O157:H7 may be a customer that pre-selects E. coli as a microorganism for analysis.
- a customer may pre-select one or more types of microorganisms for analysis.
- a microorganism can be one or more of types of bacteria, fungus, parasites, protozoa, and viruses.
- Non-limiting examples of bacteria that can be pre-selected by a customer and detected with the methods of the disclosure include: bacteria in the Escherichia genus, including enterotoxigenic Escherichia coli (ETEC), enteropathogenic Escherichia coli (EPEC), enterohemorrhagic Escherichia coli (EHEC), and enteroinvasive Escherichia coli (EIEC); bacteria of the Salmonella genus; bacteria of the Campylobacter genus; bacteria of the Listeria genus; bacteria of the Yersinia genus; bacteria of the Shigella genus; bacteria of the Vibrio genus; bacteria of the Coxiella genus; bacteria of the Mycobacterium genus; bacteria of the Brucella genus; bacteria of the Vibrio genus; bacteria of the Cronobacter genus; bacteria of the Aeromonas genus; bacteria of the Plesiomonas genus; bacteria of the Clostridium
- a microorganism can be a virus.
- viruses that can be pre-selected by a customer and detected with the methods of the disclosure include: noroviruses, Hepatitis A virus, Hepatitis E virus, rotavirus.
- the performing of a sequencing reaction of a food sample or of an environmental sample from said one or more food processing facilities often generates a plurality of nucleic acids sequences that contain redundant information or information associated with genes that are not from a microorganism.
- the disclosed methods empower efficient data analysis by facilitating the targeted analysis of a smaller data set.
- the generated data could be in the range of Kb, Mb, Gb, Tb or more per analyzed sample.
- said scanning scans fewer than 1/10, fewer than 1/20, fewer than 1/30, fewer than 1/40, fewer than 1/50, fewer than 1/60, fewer than 1/70, fewer than 1/80, fewer than 1/90, fewer than 1/100, fewer than 1/200, fewer than 1/300, fewer than 1/400, fewer than 1/500, fewer than 1/600, fewer than 1/700, fewer than 1/800, fewer than 1/900, fewer than 1/1,000, fewer than 1/10,000, or fewer than 1/100,000 of a data set, such as a transmitted data set for one or more genes associated with a microorganism.
- a data set such as a transmitted data set for one or more genes associated with a microorganism.
- said scanning scans at least a fraction of said transmitted data set for one or more genes associated with two or more, three or more, four or more, five or more, six or more, seven or more, eight or more, nine or more, ten or more microorganisms or another suitable number. In some instances, said scanning comprises scanning said transmitted data set for one or more polymorphic gene regions.
- said one or more polymorphic regions comprise one or more single nucleotide polymorphisms (SNP's), one or more restriction fragment length polymorphisms (RFLP's), one or more short tandem repeats (STRs), one or more variable number of tandem repeats (VNTR's), one or more hypervariable regions, one or more minisatellites, one or more dinucleotide repeats, one or more trinucleotide repeats, one or more tetranucleotide repeats, one or more simple sequence repeats, one or more indel, or one or more insertion elements.
- said one or more polymorphic regions comprise one or more single nucleotide polymorphisms (SNP's).
- a data set associated with a sequencing reaction of a food sample or of an environmental sample can be transmitted to a server and scanned by a computer.
- a method can detect a microorganism selected from the group consisting of: a microorganism of the Salmonella genus, a microorganism of the Campylobacter genus, a microorganism of the Listeria genus, and a microorganism of the Escherichia genus.
- the detected microorganisms may be of any serotype and a scanning, by a computer, of one or more genes associated with a microorganism may detect a microorganism independently of its serotype.
- a sequencing reaction of a food sample, an environmental sample, or another sample is a pore sequencing reaction, such as an Oxford Nanopore® sequencing reaction.
- at least one barcode is added to one or more nucleic acid polymers derived from a food sample, from an environmental sample, or from another sample prior to performing said sequencing reaction.
- a plurality of mutually exclusive barcodes are added to a plurality of food processing facilities, thereby creating a barcode identifier that can be associated with each food processing facility.
- a barcoded sequencing read comprising sequences from a pathogenic microorganism can be associated with a food or processing facility.
- a method disclosed herein further comprises creating, in a computer, a data file that associates said at least one barcode with a source of said food sample, of said environmental sample, or of another sample.
- FIG. 1 illustrates the deploying of a sequencing assay 101 to one or more food processing facilities 102 , food testing lab, or any other diagnostic lab and performing a sequencing reaction of a food sample or of an environmental sample from said one or more food processing facilities 102 .
- the food processing facility, food testing lab, or any other diagnostic lab may have one or more computer systems that can be used to transmit the results of the sequencing reads to a server, either on premise or remotely deployed cloud environment.
- FIG. 2 illustrates a transmission of an electronic communication comprising a data set associated with a sequencing reaction from one or more food processing facilities, food testing labs, or any other diagnostic labs to a server.
- the raw sequence data collected from the sequencing reaction includes a large set of data that includes all individual sequences as well as the quality at each base. From this large data set, the Clear Labs bioinformatics pipeline extracts a final report that is orders of magnitudes smaller.
- the final report (e.g. electronic communication) is essentially limited to the presence or absence of an organism of interest, for instance pathogens, and a further classification of the organism in terms of serotypes, strains, or other subclassifications.
- the collected data not used in the report comprises the following:
- the raw sequences include information on the quality of the sequences per base.
- the quality scores can be used in a Bayesian model where classifications are statistically sensitive to these quality scores. Furthermore the quality scores can reveal more on possible relations that content of samples have with the accuracy of sequencing platform.
- Sequence time The raw sequences also include information on the time when the sequence was read by the sequencer. The number of sequences form the same source as a function of time can reveal a lot more information than we currently have. In addition, using these time data, can be useful in generating reports for all or some of the samples earlier than it is currently done.
- Clustering An important step in the pipeline involves clustering sequences that are close enough to each other and representing all the sequences within a cluster by a consensus sequence. This reduces the data significantly and make is easier to classify these sequences. However these differences, even if minute, carry information that gets lost with clustering. Clustering with more stringent criteria, or no clustering can lead into higher resolution and perhaps finer classification.
- a computer system 201 can be programmed or otherwise configured to process and transmit a data set from a food processing facility, food testing labs, or any other diagnostic labs.
- the computer system 201 includes a central processing unit (CPU, also “processor” and “computer processor” herein) 204 , which can be a single core or multi core processor, or a plurality of processors for parallel processing.
- CPU central processing unit
- processor also “processor” and “computer processor” herein
- the computer system 201 also includes memory or memory location 205 (e.g., random-access memory, read-only memory, flash memory), electronic storage unit 206 (e.g., hard disk), communication interface 202 (e.g., network adapter) for communicating with one or more other systems, such as for instance transmitting a data set associated with said sequencing reads, and peripheral devices 204 , such as cache, other memory, data storage and/or electronic display adapters.
- memory 205 , storage unit 206 , interface 202 and peripheral devices 203 are in communication with the CPU 204 through a communication bus (solid lines), such as a motherboard.
- the storage unit 206 can be a data storage unit (or data repository) for storing data.
- the data storage unit 206 can store a plurality of sequencing reads and provide a library of sequences associated with one or more strains from one or more microorganisms associated with a food processing facility, food testing labs, or any other diagnostic labs.
- the computer system 201 can be operatively coupled to a computer network (“network”) 207 with the aid of the communication interface 202 .
- the network 207 can be the Internet, an interne and/or extranet, or an intranet and/or extranet that is in communication with the Internet.
- the network 207 in some cases is a telecommunication and/or data network.
- the network 207 can include one or more computer servers, which can enable distributed computing, such as cloud computing.
- the network 207 in some cases with the aid of the computer system 201 , can implement a peer-to-peer network, which may enable devices coupled to the computer system 201 to behave as a client or a server.
- Escherichia family of pathogens comprise lethal and harmless strains of E. coli . Thus it is not only relevant to be able to identify a pathogen in a sample, but it is also relevant to be able to characterize it with high sensitivity.
- the disclosure provides a method comprising obtaining a plurality of nucleic acid sequences from a food sample, from an environment associated with said food sample or from another sample, such as non-food derived samples from clinical sources, including blood, plasma, urine, tissue, faces, bone marrow, saliva or cerebrospinal fluid samples; scanning, by a computer, at least a fraction of said plurality of said nucleic acid sequences for a plurality of nucleic acid regions from one or more microorganisms selected from the group consisting of: a microorganism of the Salmonella genus, a microorganism of the Campylobacter genus, a microorganism of the Listeria genus, and a microorganism of the Escherichia genus, wherein said scanning characterizes said one or more microorganisms with greater than 98% sensitivity, greater than 98.5% sensitivity, greater than 99% sensitivity, greater than 99.5% sensitivity, or greater than 99.9% sensitivity.
- said scanning characterizes said one or more microorganisms with greater than 98% specificity, greater than 98.5% specificity, greater than 99% specificity, greater than 99.5% specificity, or greater than 99.9% specificity.
- Sensitivity can be a measure of a microorganism that is correctly identified (e.g. the percentage of a microorganism that can be correctly identified based on sequencing read analyses).
- Specificity also called the true negative rate
- measures the proportion of negatives that are correctly identified as such e.g. the percentage of food samples or environmental samples that are correctly identified as not having the microorganism therein).
- said method can distinguish a genetic variant or subtype of a microorganism (e.g., one or more bacterial strains).
- said plurality of nucleic acid sequences comprise complementary DNA (cDNA) sequences, ribonucleic acid (RNA) sequences, genomic deoxyribonucleic acid (gDNA) sequences or a mixture of cDNA, RNA, and gDNA sequences.
- cDNA complementary DNA
- RNA ribonucleic acid
- gDNA genomic deoxyribonucleic acid
- the high sensitivity of the disclosed method, the high specificity of the disclosed method, or both, can be accomplished by scanning said plurality of said nucleic acid sequences for one or more polymorphic gene regions associated with said microorganisms.
- said one or more polymorphic regions is selected from the group consisting of one or more single nucleotide polymorphisms (SNP's), one or more restriction fragment length polymorphisms (RFLP's), one or more short tandem repeats (STRs), one or more variable number of tandem repeats (VNTR's), one or more hypervariable regions, one or more minisatellites, one or more dinucleotide repeats, one or more trinucleotide repeats, one or more tetranucleotide repeats, one or more simple sequence repeats, one or more indel, or one or more insertion elements.
- SNP's single nucleotide polymorphisms
- RFLP's restriction fragment length polymorphisms
- STRs short tandem repeats
- VNTR's variable number of tandem repeats
- hypervariable regions one or more minisatellites, one or more dinucleotide repeats, one or more trinucleotide repeats,
- said scanning compares a scanned polymorphism with a library of sequences comprising sequences from dozens, hundreds, or thousands of unique strains of a microorganism.
- the higher sensitivity is achieved by comparing the sequence information of the target region that can discriminate different microorganisms through the lens of SNPs, indels or other non-universal target specific markers that are only present within the genome of target micromicroorganisms.
- FIG. 3 is a chart illustrating that a redundancy in genetic markers decreases a false negative rate of a method of the disclosure and increases its sensitivity as compared to PCR based methods.
- three commercially available q/PCR based pathogen detection kits revealed that they would not detect all known Salmonella or Listeria genomes.
- 301 illustrates percentages of Salmonella detection by existing commercial kits.
- 302 illustrates percentages of Listeria detection by existing commercial kits.
- a scanning of a plurality of nucleic acid regions within said plurality of nucleic acid sequences can characterize said one or more microorganisms with a desired specificity, sensitivity, or both.
- a scanning of no more than 0.001%, 0.01%, 0.1%, 1%, 5%, 10%, 25%, 50%, 90%, 99%, 100% or any number in between of nucleic acid regions within said plurality of nucleic acid sequences characterizes said one or more microorganisms with greater than 90%, 95%, 98%, 99%, 99.9%, 99.99% and 99.999% sensitivity.
- the method has fewer than 2%, fewer than 1.5%, fewer than 1.0%, fewer than 0.5%, or fewer than 0.1% of a false positive identification rate. In some aspects, a scanning of no more than 1% of a whole genome can characterize said microorganism.
- the high sensitivity and specificity of the disclosed methods are independent of a serotype of the microorganism.
- a scanning of a plurality of nucleic acid regions can identify a microorganism of the Salmonella genus that has a serotype selected from the group consisting of: Enteritidis, Typhimurium, Newport, Javiana, Infantis, Montevideo, Heidelberg, Muenchen, Saintpaul, Oranienburg, Braenderup, Paratyphi B var.
- a microorganism of the Escherichia genus has a serotype selected from the group consisting of: O103, O111, O121, O145, O26, O45, and O157; a microorganism of the Listeria genus that has a serotype selected from the group consisting of: 2a, 1/2b, 1/2c, 3a, 3b, 3c, 4a, 4b, 4ab, 4c, 4d, and 4e; a microorganism of the Campylobacter genus with the C. jejuni, C. lari , or C. coli serotype and others.
- a non-pathogenic strain of Citrobacter namely Citrobacter sedlakii , expresses the Escherichia coli O157:H7 antigen. This is usually associated with a false positive detection of E. coli in a sample.
- Citrobacter is erroneously classified as E. coli
- a food lot may be unnecessarily disposed of and a food processing facility may be erroneously classified as a contaminated facility.
- the high sensitivity of the disclosed methods can be used to distinguish a microorganism from the Escherichia genus from a microorganism of the Citrobacter genus.
- the disclosure provides a method comprising: scanning, by a computer, a plurality of sequencing reads from a food sample or from an environment associated with said food sample, whereby said scanning distinguishes a microorganism of a Citrobacter genus from a microorganism of an Escherichia genus by identifying one or more single nucleotide polymorphisms that are associated with either said Citrobacter genus or said Escherichia genus.
- Other examples include E. coli O157:H7 assay cross-reacting with E. coli O55 (which is not an STEC). Also some assays deliver false positives against E. coli O104 (which is not an STEC). Citrobacter is also a long-understood challenge for the some systems E. coli O157:H7.
- the disclosure provides methods for the rapid identification of a microorganism from a food sample.
- the disclosure provides a method for sequencing a plurality of nucleic acid sequences from a food sample, from an environmental sample associated with said food sample or from another sample (such as a clinically derived sample) for a period of time; and performing an assay on said food sample or said environment associated with said food sample if said sequencing for said period of time identifies a threshold level of nucleic acid sequences from a microorganism in said food sample.
- FIG. 4 is a schematic illustrating a sequencing of a plurality of nucleic acid sequences from a food sample for a period of time and the advantages of performing an assay on said food sample if said sequencing for said period of time identifies a threshold level of nucleic acid sequences from a microorganism in said food sample.
- a microorganism that can injure its host e.g., by competing with it for metabolic resources, destroying its cells or tissues, or secreting toxins can be considered a pathogenic microorganism.
- pathogenic microorganisms include viruses, bacteria, mycobacteria, fungi, protozoa, and some helminths.
- the disclosure provides methods for detecting one or more microorganisms from a food sample or from an environment associated with said food sample—such as from a table, a floor, a boot cover, an equipment of a food processing facility—or from a food related sample that comprise soil, water, water quality, air, animal production, feed, manure, crop production, manufacturing plants, environmental samples, or non-food derived samples, such as samples from clinical sources that comprise blood, plasma, urine, tissue, faces, bone marrow, saliva or cerebrospinal fluid by analyzing a plurality of nucleic acid sequencing reads from such samples.
- Salmonella enterica subspecies enterica is further divided into numerous serotypes, including S. enteritidis and S. typhimurium .
- the methods of the disclosure can distinguish between such subspecies of a variety of Salmonella by analyzing their nucleic acid sequences.
- Escherichia coli Escherichia coli
- E. coli Escherichia coli
- Many E. coli are harmless and in some aspects are an important part of a healthy human intestinal tract.
- many E. coli can cause illnesses, including diarrhea or illness outside of the intestinal tract and should be distinguished from less pathogenic strains.
- the methods of the disclosure can distinguish between various subspecies of a variety of Escherichia bacteria by analyzing their nucleic acid sequences.
- Listeria is a harmful bacterium that can be found in refrigerated, ready-to-eat foods (meat, poultry, seafood, and dairy—unpasteurized milk and milk products or foods made with unpasteurized milk), and produce harvested from soil contaminated with, for example, L. monocytogenes . Many animals can carry this bacterium without appearing ill, which increases the challenges in identifying the pathogen derived from a food source. In addition, some species of Listeria can grow at refrigerator temperatures where most other foodborne bacteria do not, another factor that increases the challenges of identifying Listeria . When eaten, Listeria may cause listeriosis, an illness to which pregnant women and their unborn children are very susceptible. In some aspects, the methods of the disclosure can distinguish between various subspecies of a variety of Listeria bacteria by analyzing their nucleic acid sequences.
- Campylobacter jejuni is estimated to be the third leading bacterial cause of foodborne illness in the United States.
- Raw poultry, unpasteurized (“raw”) milk and cheeses made from it, and contaminated water (for example, unchlorinated water, such as in streams and ponds) are major sources of Campylobacter , but it also occurs in other kinds of meats and has been found in seafood and vegetables.
- the methods of the disclosure can distinguish between various subspecies of a variety of Campylobacter bacteria by analyzing their nucleic acid sequences.
- Non-limiting examples of pathogenic microorganisms that can be detected with the methods of the disclosure include: pathogenic Escherichia coli group, including Enterotoxigenic Escherichia coli (ETEC), Enteropathogenic Escherichia coli (EPEC), Enterohemorrhagic Escherichia coli (EHEC), Enteroinvasive Escherichia coli (EIEC), Salmonella spp., Campylobacter jejuni, Listeria, Yersinia enterocolitica, Shigella spp., Vibrio parahaemolyticus, Coxiella burnetii, Mycobacterium bovis, Brucella spp., Vibrio cholera, Vibrio vulnificus, Cronobacter, Aeromonas hydrophila and other spp., Plesiomonas shigelloides, Clostridium perfringens, Clostridium botulinum, Staphylococcus aureus, Bacill
- resident microorganisms reflect a persistent contamination within a location, e.g., a food processing facility or a hospital, that is very different than the transient pathogens that are being repeatedly introduced into the locations.
- Discriminating resident and transient pathogens provides more clarity for differentiation of source of contaminations and intervention strategies. This strategy can be used, for example, to manage contaminations with managing contaminations with Listeria monocytogensis .
- Campylobacter is part of the natural gut microflora of most food-producing animals, such as chickens, turkeys, swine, cattle, and sheep.
- each contaminated poultry carcass can carry from about 100 to about 100,000 Campylobacter cells.
- Campylobacter cells can be carried from about 100 to about 100,000 Campylobacter cells.
- Campylobacter cells pose a significant risk for consumers who mishandle fresh or processed poultry during preparation or who undercook it.
- one must be able to distinguish a normal level of a Campylobacter on a food carcass from a Campylobacter overgrowth in a sample or from the presence of a new strain of Campylobacter in a food processing facility, environment, or food sample.
- FIG. 4 illustrates a process for predictive risk assessment based on a detection of a non-pathogenic microorganism.
- a food sample such as a steak sample illustrated as 401 is processed and an assay, such as a nucleic acid sequencing reaction is performed.
- An analysis of a plurality of nucleic acid sequencing reads from 401 may, in some instances, not detect a particular pathogen, such as the E. coli pathogen illustrated in this example.
- an analysis 403 of the microbiome 402 of the food sample 401 may indicate high risk for a presence of a pathogen, such as E. coli .
- the food sample may be re-sampled and re-processed to confirm the presence of a pathogenic microorganism therein.
- the methods disclosed herein further comprise performing an additional assay to confirm the presence of the pathogenic microorganism in the sample, such as a serotyping assay, a polymerase chain reaction (PCR) assay, an enzyme-linked immunosorbent (ELISA) assay, or an enzyme-linked fluorescent assay (ELFA) assay, restriction fragment length polymorphisms (RFLP) assay, pulse field gel electrophoresis (PFGE) assay, multi-locus sequence typing (MLST) assay, targeted DNA sequencing assay, whole genome sequencing (WGS) assay, or shotgun sequencing assay.
- an additional assay to confirm the presence of the pathogenic microorganism in the sample
- a serotyping assay such as a polymerase chain reaction (PCR) assay, an enzyme-linked immunosorbent (ELISA) assay, or an enzyme-linked fluorescent assay (ELFA) assay, restriction fragment length polymorphisms (RFLP) assay, pulse field gel electrophoresis (PFGE) assay, multi-
- the disclosure provides a method comprising obtaining a first plurality of nucleic acid sequences from a first sample of a food processing facility; creating a data file in a computer that associates one or more of said first plurality of nucleic acid sequences with said food processing facility; obtaining a second plurality of nucleic acid sequences from a second food sample of said food processing facility; and scanning a plurality of sequences from said second plurality of nucleic acid sequences for one or more sequences associated with said food processing facility in the created data file.
- One or more data files can be created that associate a microorganism with a food processing facility.
- a data file can provide a collection of sequencing reads that can be associated with one or more strains of a microorganism present in the processing facility.
- more than 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, or 1000 bacterial strains can be associated with one or more food processing facilities.
- the instance disclosure recognizes that a presence of some non-pathogenic microorganisms, i.e. indicator microorganisms, can be correlated with a presence of pathogenic bacteria in food, in environmental samples, or another sample.
- the disclosure provides a method comprising detecting a presence or an absence of a non-pathogenic microorganism in a food sample, an environment associated with said food sample, or another sample described herein, by a computer system, and a presence or an absence of a pathogenic microorganism in said food sample, environment associated, or another sample based on said presence or said absence of said non-pathogenic microorganism.
- FIG. 5 is a heat map illustrating predictive pathogen detection through machine learning using associated non-pathogenic microorganisms.
- the data was supplemented by alpha diversity measures including Shannon entropy, number of observed OTUs, and Faith's phylogenetic diversity measure.
- the quantification of the bacteria in the samples and these supplemented measures provided coordinates for the data points used in the final classification.
- the distance between the data points was computed as a combination of unifrac distance and the euclidean distance restricted to the supplemented coordinates.
- the data points were split into training and test subsets. We used stratified 10-fold cross validation to train support vector machine model on the training set. The performance of the model was measured on the previously separated test set. The scores with regard to detection of some of the pathogens is presented in FIG. 5 .
- the coefficients of the support vector machine classifier were used to determine bacteria that play significance in determining presence or absence of the pathogens and therefore to provide signatures that can be used independently of the model.
- This analysis determined a set of non-pathogenic microorganisms that had statistically significant correlation with the presence of pathogenic organisms, including members of the genus Enterobacter. Enterobacter asburiae, Enterobacter bugandensis, Enterobacter cancerogenus, Enterobacter cloacae, Enterobacter endosymbiont, Enterobacter hormaechei, Enterobacter kobei, Enterobacter ludwigii , and Enterobacter soli were among the top 9 examples of non-pathogenic bacteria associated with_our set of pathogenic bacteria.
- Yersinia pseudotuberculosis was associated with Enterobacter asburiae ; Vibrio vulnificus was associated with Enterobacter bugandensis, Enterobacter endosymbiont , and Enterobacter soli; Escherichia coli, Salmonella enterica , and Shigella boydii were associated with Enterobacter cancerogenus, Enterobacter cloacae , and Enterobacter hormaechei; Staphylococcus Aureus was associated with Enterobacter kobei ; and Yersinia pseudotuberculosis was associated with Enterobacter asburiae and Enterobacter ludwigii.
- a variety of other samples described herein can be analyzed as described. Briefly, a sample may be screened with any one of the methods described herein and a plurality of nucleic acid sequences may be obtained. Numerous sequences within said plurality of nucleic acid sequences may be correlated by a machine learning algorithm with a variety of microorganisms. A prediction can then be created and a visual output of such prediction, such as the illustrated a heat map can be created by detecting statistically significant correlations. For instance, a heat map created by a machine learning algorithm may illustrate a correlation between a presence of E.
- a machine learning algorithm including the machine learning algorithm's described herein, can be used to create such predictions.
- a statistical analysis can be performed to identify the top nonpathogenic species/food ingredients associated with the presence of Vibrio/Staphylococcus/Yersinia/Shigella/Salmonella/Escherichia (an illustrative cluster-based representation of such analysis is presented in FIG. 5 ).
- This analysis determined a set of non-pathogenic microorganisms that had statistically significant correlation with the presence of pathogenic organisms, including members of the genus Enterobacter.
- Enterobacter asburiae, Enterobacter bugandensis, Enterobacter cancerogenus, Enterobacter cloacae, Enterobacter endosymbiont, Enterobacter hormaechei, Enterobacter kobei, Enterobacter ludwigii , and Enterobacter soli were among the top 9 examples of non-pathogenic bacteria associated with_our set of pathogenic bacteria.
- Yersinia pseudotuberculosis was associated with Enterobacter asburiae ; Vibrio vulnificus was associated with Enterobacter bugandensis, Enterobacter endosymbiont , and Enterobacter soli; Escherichia coli, Salmonella enterica , and Shigella boydii were associated with Enterobacter cancerogenus, Enterobacter cloacae , and Enterobacter hormaechei; Staphylococcus Aureus was associated with Enterobacter kobei ; and Yersinia pseudotuberculosis was associated with Enterobacter asburiae and Enterobacter ludwigii.
- Food is a chemically complex matrix. Predicting whether, or how fast, microorganisms will grow in a food, or how quickly a food may spoil, is difficult. For instance, most foods contain sufficient nutrients to support microbial growth. Furthermore, there are many additional factors that encourage, prevent, or limit growth of microorganisms in foods including pH, temperature, and relative humidity. In some aspects, the instant disclosure recognizes that a presence of some microorganism, whether or not pathogenic, can be correlated with a sell-by date, i.e., a spoilage date of a food.
- the disclosure provides a method comprising: detecting a presence or an absence of a microorganism in a food sample or in an environmental sample from a food processing facility; and predicting, by a computer system, a risk presented by said food sample or by said food processing facility based on said presence or said absence of said microorganism.
- FIG. 6 illustrates a process for predicting a shelf-life of a food based on machine learning.
- FIG. 6 illustrates a screening of a sample, such as a screening of a plurality of nucleic acid sequencing reads.
- a machine learning algorithm is used to create a risk profile, whereby said risk profile associates a presence of some microorganism with a low or a high likelihood of food spoilage, thereby predicting the sell-by date of a food.
- a machine learning algorithm can be used to associate any number of sequencing reads with a presence of microorganism in a food sample, a food related sample, or another sample. Similarly, a machine learning algorithm may be able to associate any number of sequencing reads with a presence of a pathogenic microorganism, even if the sequence reads themselves are not from the pathogenic microorganism.
- Computer-implemented methods for generating a machine learning-based classifier in a system may require a number of input datasets in order for the classifier to produce highly accurate predictions.
- a machine learning algorithm is selected from the group consisting of: a support vector machine (SVM), a Naive Bayes classification, a random forest, Logistic regression and a neural network.
- FIG. 7 is a diagram illustrating the tunable resolution of various assays. Briefly, one or more assays can be used sequentially to obtain a desired level of sensitivity, such as to determine a genus, a species, a serotype, a sub-serotype, or a strain of said microorganism.
- the assays can be identical or they can be distinct.
- a sequencing assay can be used to identify a strain or a sub-serotype of a microorganism whereas a PCR reaction may be able to identify a species or, in some cases, a serotype of a particular microorganism.
- the disclosure provides a method comprising: obtaining a plurality of nucleic acid sequences of a food sample, of an environmental sample or of another non-food derived sample from a food processing facility or another facility; performing a first assay in said plurality of nucleic acid sequences of said food sample, whereby said assay predicts a presence or predicts an absence of a microorganism in said food sample; and determining, based on said predicted presence or said predicted absence of said microorganism of the first assay whether to perform a second assay, whereby a sensitivity of said second assay is selected to determine a genus, a species, a serotype, a sub-serotype, or a strain of said microorganism.
- PCR polymerase chain reaction
- sequencing can be performed by various systems currently available, such as, without limitation, a sequencing system by Illumina®, Pacific Biosciences (PacBio®), Oxford Nanopore®, Genia (Roche) or Life Technologies (Ion Torrent®).
- sequencing may be performed using nucleic acid amplification, polymerase chain reaction (PCR) (e.g., digital PCR, quantitative PCR, or real time PCR), or isothermal amplification.
- PCR polymerase chain reaction
- the assay is an enzyme-linked immunosorbent (ELISA) assay or an enzyme-linked fluorescent assay (ELFA) assay.
- the assay is a serotyping assay.
- a serotype or serovar is a distinct variation within a species of bacteria or virus. These microorganisms can be classified together based on their cell surface antigens, allowing the epidemiologic classification of microorganisms to the sub-species level. A group of serovars with common antigens is called a serogroup or sometimes serocomplex.
- the disclosure provides methods for performing a sequencing assay on a plurality of nucleic acids derived from a sample and a serotyping assay on a derivative of said sample.
- FIG. 8 is a schematic illustrating various serotypes of various microorganisms that can be detected by an analysis of a plurality of nucleic acid sequences as described herein and further validated with a serotyping assay.
- FIG. 9 is a schematic illustrating one process for distinguishing a live microorganism from a food or from an environmental sample. Briefly, FIG. 9 illustrates than an amount of a microorganism in a sample can be increased, i.e., enriched 901 , by growing the microorganism in a rich medium for a period of time.
- a reagent such as a photoreactive DNA-binding dye, a DNA intercalating reagent, or another suitable reagent may be added to enriched sample 901 .
- a photoreactive DNA-binding dye such as a DNA intercalating reagent, or another suitable reagent
- Such reagents distinguish live 902 microorganisms from dead 903 microorganisms by interacting with the nucleic acid sequence of dead microorganisms only.
- the disclosure contemplates using propidium monoazide or a derivative thereof as a dye.
- the modified sample can be prepared for a subsequent reaction 904 , such as a sequencing reaction 905 .
- the disclosure provides a method comprising adding a reagent to a plurality of nucleic acid molecules from a food sample, or food related sample or another sample described herein thereby forming a modified plurality of nucleic acid molecules, whereby said reagent (i) interacts with and modifies a structure of a plurality of nucleic acid molecules derived from one or more dead microorganisms; and (ii) does not interact with or modify a structure of a nucleic acid molecule derived from one or more live microorganisms; thereby providing a modified plurality of nucleic acid molecules; and sequencing said modified plurality of nucleic acid molecules, thereby distinguishing one or more live organisms from said food sample or from another sample.
- the disclosure provides a method comprising performing a pore sequencing or other DNA sequencing or hybridization assay on a plurality of nucleic acid molecules from a food sample or from another sample whereby said pore sequencing reaction distinguishes one or more nucleic acid molecules derived from a dead microorganism from one or more nucleic acid molecules derived from a live microorganism based on a methylation or other epigenetic pattern of said one or more nucleic acid molecules derived from said dead microorganism.
- epigenetic patterns such as methylation
- Such methods include, but are not limited to, bisulfite sequencing (including targeted bisfulfite sequencing, see e.g. Ziller et al. Epigenetics Chromatin. 2016 Dec. 3; 9:55 and Masser et al. J Vis Exp. 2015; (96): 52488) and methylation-sensitive restriction digestion (see e.g. Bitinaite et al. U.S. Pat. No. 9,034,597).
- Unique identifiers can be added to one or more nucleic acids isolated from a sample from a food processing facility, from a hospital or clinic, or from another sources. Barcodes can be used to associate a sample with a source; e.g., to associate an environmental sample with a specific food processing facility or with a particular location within said food processing facility. Barcodes can also be used to identify a processing of a sample, as described in U.S. Patent Pub. No. 2016/0239732, which is entirely incorporated herein by reference.
- the disclosure provides a method comprising adding a first barcode to a first plurality of nucleic acid sequences from a sample, thereby providing a first plurality of barcoded nucleic acid sequences; performing a first sequencing reaction on said first plurality of barcoded nucleic acid sequences, wherein said sequencing reaction is performed on a sequencing apparatus comprising a flow cell; adding a second barcode to a second plurality of nucleic acid sequences from a second sample, thereby providing a second plurality of barcoded nucleic acid sequences; and performing a second sequencing reaction on said second plurality of barcoded nucleic acid sequences, wherein said second sequencing reaction is performed on said sequencing apparatus comprising said flow cell, thereby reusing said flow cell.
- FIG. 10 illustrates a process for re-using flow cells with distinct indexes as described herein.
- two distinct indexes, 1001 and 1002 can be added to different samples prior to sequencing 1003 . Since a first sample can be associated with a first index 1001 and a second sample can be associated with a second index 1002 this process effectively allows for the re-using of a flow cell.
- FIG. 18 and FIG. 19 demonstrate the re-use of MinION/GridION flow cells.
- Example 21 demonstrates how certain primer design schemes, such as a nonperiodic design, can reduce crosstalk in situations with high multiplexing or closely related sequences, as may happen with reuse of flow cells.
- One or more barcodes or block of barcodes may be added to a nucleic acid sequence from a food sample or another sample from a food processing facility, such as a first, a second, a third, or any subsequent sample.
- 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 identical barcodes are added to such samples.
- distinct barcodes are added to such samples.
- 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 distinct barcodes are added to such samples.
- the serial addition of two or more barcodes, either identical in sequence or distinct in sequence can provide an indexing of a sample that is used in its analyses.
- a barcode is added to a nucleic acid sequence comprising complementary DNA (cDNA) sequences, ribonucleic acid (RNA) sequences, genomic deoxyribonucleic acid (gDNA) sequences, or a mixture of cDNA, RNA, and gDNA sequences.
- cDNA complementary DNA
- RNA ribonucleic acid
- gDNA genomic deoxyribonucleic acid
- Automated nucleic acid sequencing apparatuses can provide a robust platform for the generation of nucleic acid sequencing reads.
- many apparatuses have a high rate of failure, i.e., high rate of error of the sequencing reaction itself, which require manual intervention in such instances, such as re-loading of samples into flow cells.
- the disclosure provides an automated nucleic acid sequencing apparatus that requires no manual intervention in the event of a failure of a sequencing reaction.
- the disclosure provides a nucleic acid sequencing apparatus comprising: a nucleic acid library preparation compartment comprising two or more chambers configured to prepare a plurality of nucleic acids for a sequencing reaction, wherein said compartment is operatively connected to a nucleic acid sequencing chamber; a nucleic acid sequencing chamber, wherein said nucleic acid sequencing chamber comprises: (i) one or more flow cells comprising a plurality of pores configured for the passage of a nucleic acid strand, wherein said two or more flow cells are juxtaposed to one another; and an automated platform, wherein said automated platform is programmed to robotically move a sample from said nucleic acid library preparation compartment into said nucleic acid sequencing chamber.
- FIG. 11 illustrates an automated sequencing apparatus of the disclosure.
- Nucleic acid library preparation compartment 1103 shows a variety of chambers configured to prepare a plurality of nucleic acids for a sequencing reaction in close proximity to a sequencing chamber 1104 , which comprises one or more flow cells.
- an automated apparatus of the disclosure is programmed to move one or more samples from the library preparation chambers 1103 into a sequencing chamber 1104 upon detecting a failure in a sequencing reaction. This provides a sequencing process with no human touch points after a sample is added to the library preparation chamber, as illustrated in FIG. 12 .
- FIG. 12 FIG.
- a sample from a food processing facility, from a hospital or clinical setting, or from another source can be manually processed between 6 am to 6 pm or any shorter or longer incubation window by incubating the sample in a presence of a growth medium (e.g., enrichment) and automatically processed after the sample is added to a nucleic acid preparation chamber 1103 .
- a growth medium e.g., enrichment
- the disclosed apparatus is programmed in such a manner that said automated platform moves one or more samples from said nucleic acid library preparation compartment into said nucleic acid sequencing chamber. Upon detecting a failure of a sequencing reaction, the automated platform moves one or more samples from the failed sequencing flow cell or apparatus to the next sequencing flow cell or apparatus.
- samples comprise nucleic acid sequences that include one or more barcodes.
- a plurality of mutually exclusive barcodes are added to a plurality of nucleic acids in said two or more chambers of the nucleic acid library preparation compartment 1103 , thereby providing a plurality of mutually exclusive barcoded nucleic acids within the apparatus.
- the automated platform robotically moves two or more of said mutually exclusive barcoded nucleic acids into said nucleic acid sequencing chamber, in some instances by moving said mutually exclusive barcoded nucleic acids into a same flow cell of said one or more flow cells.
- Microbiome data (data representing the presence or absence of particular species or serotypes of microbes as determined by sequencing) of the invention can be used to classify a sample.
- a sample can be classified as, or predicted to be: a) containing a particular pathogenic microbe, b) containing a particular serotype of a pathogenic microbe, and/or c) contaminated with at least one species/serotype of pathogenic microbe.
- Many statistical classification techniques are known to those of skill in the art. In supervised learning approaches, a group of samples from two or more groups (e.g. contaminated with a pathogen and not) are analyzed with a statistical classification method.
- Microbe presence/absence data can be used as a classifier that differentiates between the two or more groups.
- a new sample can then be analyzed so that the classifier can associate the new sample with one of the two or more groups.
- supervised classifiers include without limitation the neural network (multi-layer perceptron), support vector machines, k-nearest neighbours, Gaussian mixture model, Gaussian, naive Bayes, decision tree and radial basis function (RBF) classifiers.
- Linear classification methods include Fisher's linear discriminant, logistic regression, naive Bayes classifier, perceptron, and support vector machines (SVMs).
- Other classifiers for use with the invention include quadratic classifiers, k-nearest neighbor, boosting, decision trees, random forests, neural networks, pattern recognition, Bayesian networks and Hidden Markov models.
- Classification using supervised methods is generally performed by the following methodology:
- Gather a training set can include, for example, samples that are from a food or environment contaminated or not contaminated with a particular microbe, samples that are contaminated with different serotypes of the same microbe, samples that are or are not contaminated with a combination of different species and serotypes of microbes, etc.
- the training samples are used to “train” the classifier.
- the accuracy of the learned function depends on how the input object is represented.
- the input object is transformed into a feature vector, which contains a number of features that are descriptive of the object.
- the number of features should not be too large, because of the curse of dimensionality; but should be large enough to accurately predict the output.
- the features might include a set of bacterial species or serotypes present in a food or environmental sample derived as described herein.
- a learning algorithm is chosen, e.g., artificial neural networks, decision trees, Bayes classifiers or support vector machines. The learning algorithm is used to build the classifier.
- the learning algorithm is run on the gathered training set. Parameters of the learning algorithm may be adjusted by optimizing performance on a subset (called a validation set) of the training set, or via cross-validation. After parameter adjustment and learning, the performance of the algorithm may be measured on a test set of naive samples that is separate from the training set.
- a validation set a subset of the training set
- the classifier e.g. classification model
- a sample e.g., that of food sample or environment that is being analyzed by the methods of the invention.
- Clustering is an unsupervised learning approach wherein a clustering algorithm correlates a series of samples without the use the labels. The most similar samples are sorted into “clusters.” A new sample could be sorted into a cluster and thereby classified with other members that it most closely associates.
- the disclosed provides quality control methods or methods to assess a risk associated with a food, with a hospital, with a clinic, or any other location where the presence of a bacterium poses a certain risk to one or more subjects.
- systems, platforms, software, networks, and methods described herein include a digital processing device, or use of the same.
- the digital processing device includes one or more hardware central processing units (CPUs), i.e., processors that carry out the device's functions, such as the automated sequencing apparatus disclosed herein or a computer system used in the analyses of a plurality of nucleic acid sequencing reads from samples derived from a food processing facility or from any other facility, such as a hospital a clinical or another.
- CPUs hardware central processing units
- the digital processing device further comprises an operating system configured to perform executable instructions.
- the digital processing device is optionally connected a computer network.
- the digital processing device is optionally connected to the Internet such that it accesses the World Wide Web.
- the digital processing device is optionally connected to a cloud computing infrastructure.
- the digital processing device is optionally connected to an intranet.
- the digital processing device is optionally connected to a data storage device.
- the digital processing device could be deployed on premise or remotely deployed in the cloud.
- suitable digital processing devices include, by way of non-limiting examples, server computers, desktop computers, laptop computers, notebook computers, sub-notebook computers, netbook computers, netpad computers, set-top computers, handheld computers, Internet appliances, mobile smartphones, tablet computers, personal digital assistants, video game consoles, and vehicles.
- smartphones are suitable for use in the system described herein.
- Suitable tablet computers include those with booklet, slate, and convertible configurations, known to those of skill in the art.
- the disclosure contemplates any suitable digital processing device that can either be deployed to a food processing facility, or is used within said food processing facility to process and analyze a variety of nucleic acids from a variety of samples.
- a digital processing device includes an operating system configured to perform executable instructions.
- the operating system is, for example, software, including programs and data, which manages the device's hardware and provides services for execution of applications.
- server operating systems include, by way of non-limiting examples, FreeBSD, OpenBSD, NetBSD®, Linux, Apple® Mac OS X Server®, Oracle® Solaris®, Windows Server®, and Novell® NetWare®.
- suitable personal computer operating systems include, by way of non-limiting examples, Microsoft® Windows®, Apple® Mac OS X®, UNIX®, and UNIX-like operating systems such as GNU/Linux®.
- the operating system is provided by cloud computing.
- suitable mobile smart phone operating systems include, by way of non-limiting examples, Nokia® Symbian® OS, Apple® iOS®, Research In Motion® BlackBerry OS®, Google® Android®, Microsoft® Windows Phone® OS, Microsoft® Windows Mobile® OS, Linux®, and Palm® WebOS®.
- a digital processing device includes a storage and/or memory device.
- the storage and/or memory device is one or more physical apparatuses used to store data or programs on a temporary or permanent basis.
- the device is volatile memory and requires power to maintain stored information.
- the device is non-volatile memory and retains stored information when the digital processing device is not powered.
- the non-volatile memory comprises flash memory.
- the non-volatile memory comprises dynamic random-access memory (DRAM).
- the non-volatile memory comprises ferroelectric random access memory (FRAM).
- the non-volatile memory comprises phase-change random access memory (PRAM).
- the device is a storage device including, by way of non-limiting examples, CD-ROMs, DVDs, flash memory devices, magnetic disk drives, magnetic tapes drives, optical disk drives, and cloud computing based storage.
- the storage and/or memory device is a combination of devices such as those disclosed herein.
- a digital processing device includes a display to send visual information to a user.
- the display is a cathode ray tube (CRT).
- the display is a liquid crystal display (LCD).
- the display is a thin film transistor liquid crystal display (TFT-LCD).
- the display is an organic light emitting diode (OLED) display.
- OLED organic light emitting diode
- on OLED display is a passive-matrix OLED (PMOLED) or active-matrix OLED (AMOLED) display.
- the display is a plasma display.
- the display is a video projector.
- the display is a combination of devices such as those disclosed herein.
- a digital processing device includes an input device to receive information from a user.
- the input device is a keyboard.
- the input device is a pointing device including, by way of non-limiting examples, a mouse, trackball, track pad, joystick, game controller, or stylus.
- the input device is a touch screen or a multi-touch screen.
- the input device is a microphone to capture voice or other sound input.
- the input device is a video camera to capture motion or visual input.
- the input device is a combination of devices such as those disclosed herein.
- a digital processing device includes a digital camera.
- a digital camera captures digital images.
- the digital camera is an autofocus camera.
- a digital camera is a charge-coupled device (CCD) camera.
- a digital camera is a CCD video camera.
- a digital camera is a complementary metal-oxide-semiconductor (CMOS) camera.
- CMOS complementary metal-oxide-semiconductor
- a digital camera captures still images.
- a digital camera captures video images.
- suitable digital cameras include 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, and higher megapixel cameras, including increments therein.
- a digital camera is a standard definition camera.
- a digital camera is an HD video camera.
- an HD video camera captures images with at least about 1280 ⁇ about 720 pixels or at least about 1920 ⁇ about 1080 pixels.
- a digital camera captures color digital images.
- a digital camera captures grayscale digital images.
- digital images are stored in any suitable digital image format.
- Suitable digital image formats include, by way of non-limiting examples, Joint Photographic Experts Group (JPEG), JPEG 2000, Exchangeable image file format (Exif), Tagged Image File Format (TIFF), RAW, Portable Network Graphics (PNG), Graphics Interchange Format (GIF), Windows® bitmap (BMP), portable pixmap (PPM), portable graymap (PGM), portable bitmap file format (PBM), and WebP.
- JPEG Joint Photographic Experts Group
- JPEG 2000 Exchangeable image file format
- Exif Tagged Image File Format
- TIFF Portable Network Graphics
- GIF Portable Network Graphics
- GIF Portable Network Graphics
- BMP Portable Network Graphics
- PPM Portable Network Graphics
- PPM Portable graymap
- PBM portable bitmap file format
- WebP WebP.
- digital images are stored in any suitable digital video format.
- Suitable digital video formats include, by way of non-limiting examples, AVI, MPEG, Apple® QuickTime®, MP4, AVCHD®, Windows Media®, Div
- the systems, platforms, software, networks, and methods disclosed herein include one or more non-transitory computer readable storage media encoded with a program including instructions executable by the operating system of an optionally networked digital processing device.
- the methods comprise creating data files associated with a plurality of sequencing reads from a plurality of samples associated with a food processing facility.
- a computer readable storage medium is a tangible component of a digital processing device.
- a computer readable storage medium is optionally removable from a digital processing device.
- a computer readable storage medium includes, by way of non-limiting examples, CD-ROMs, DVDs, flash memory devices, solid state memory, magnetic disk drives, magnetic tape drives, optical disk drives, cloud computing systems and services, and the like.
- the program and instructions are permanently, substantially permanently, semi-permanently, or non-transitorily encoded on the media.
- the systems, platforms, software, networks, and methods disclosed herein include at least one computer program.
- a computer program includes a sequence of instructions, executable in the digital processing device's CPU, written to perform a specified task. In light of the disclosure provided herein, those of skill in the art will recognize that a computer program may be written in various versions of various languages.
- a computer program comprises one sequence of instructions.
- a computer program comprises a plurality of sequences of instructions.
- a computer program is provided from one location. In other embodiments, a computer program is provided from a plurality of locations.
- a computer program includes one or more software modules.
- a computer program includes, in part or in whole, one or more web applications, one or more mobile applications, one or more standalone applications, one or more web browser plug-ins, extensions, add-ins, or add-ons, or combinations thereof.
- a computer program includes a web application.
- a web application in various embodiments, utilizes one or more software frameworks and one or more database systems.
- a web application is created upon a software framework such as Microsoft®.NET or Ruby on Rails (RoR).
- a web application utilizes one or more database systems including, by way of non-limiting examples, relational, non-relational, object oriented, associative, and XML database systems.
- suitable relational database systems include, by way of non-limiting examples, Microsoft® SQL Server, mySQLTM, and Oracle®.
- a web application in various embodiments, is written in one or more versions of one or more languages.
- a web application may be written in one or more markup languages, presentation definition languages, client-side scripting languages, server-side coding languages, database query languages, or combinations thereof.
- a web application is written to some extent in a markup language such as Hypertext Markup Language (HTML), Extensible Hypertext Markup Language (XHTML), or eXtensible Markup Language (XML).
- a web application is written to some extent in a presentation definition language such as Cascading Style Sheets (CSS).
- CSS Cascading Style Sheets
- a web application is written to some extent in a client-side scripting language such as Asynchronous Javascript and XML (AJAX), Flash® Actionscript, Javascript, or Silverlight®.
- AJAX Asynchronous Javascript and XML
- Flash® Actionscript Javascript
- Javascript or Silverlight®
- a web application is written to some extent in a server-side coding language such as Active Server Pages (ASP), ColdFusion®, Perl, JavaTM, JavaServer Pages (JSP), Hypertext Preprocessor (PHP), PythonTM, Ruby, Tcl, Smalltalk, WebDNA®, or Groovy.
- a web application is written to some extent in a database query language such as Structured Query Language (SQL).
- SQL Structured Query Language
- a web application integrates enterprise server products such as IBM® Lotus Domino®.
- a web application for providing a career development network for artists that allows artists to upload information and media files includes a media player element.
- a media player element utilizes one or more of many suitable multimedia technologies including, by way of non-limiting examples, Adobe® Flash®, HTML 5, Apple® QuickTime®, Microsoft® Silverlight®, JavaTM, and Unity®.
- a computer program includes a mobile application provided to a mobile digital processing device.
- the mobile application is provided to a mobile digital processing device at the time it is manufactured.
- the mobile application is provided to a mobile digital processing device via the computer network described herein.
- a mobile application is created by techniques known to those of skill in the art using hardware, languages, and development environments known to the art. Those of skill in the art will recognize that mobile applications are written in several languages. Suitable programming languages include, by way of non-limiting examples, C, C++, C#, Objective-C, JavaTM, Javascript, Pascal, Object Pascal, PythonTM, Ruby, VB.NET, WML, and XHTML/HTML with or without CSS, or combinations thereof.
- Suitable mobile application development environments are available from several sources.
- Commercially available development environments include, by way of non-limiting examples, AirplaySDK, alcheMo, Appcelerator®, Celsius, Bedrock, Flash Lite, .NET Compact Framework, Rhomobile, and WorkLight Mobile Platform.
- Other development environments are available without cost including, by way of non-limiting examples, Lazarus, MobiFlex, MoSync, and Phonegap.
- mobile device manufacturers distribute software developer kits including, by way of non-limiting examples, iPhone and iPad (iOS) SDK, AndroidTM SDK, BlackBerry® SDK, BREW SDK, Palm® OS SDK, Symbian SDK, webOS SDK, and Windows® Mobile SDK.
- a computer program includes a standalone application, which is a program that is run as an independent computer process, not an add-on to an existing process, e.g., not a plug-in.
- standalone applications are often compiled.
- a compiler is a computer program(s) that transforms source code written in a programming language into binary object code such as assembly language or machine code. Suitable compiled programming languages include, by way of non-limiting examples, C, C++, Objective-C, COBOL, Delphi, Eiffel, JavaTM, Lisp, PythonTM, Visual Basic, and VB .NET, or combinations thereof. Compilation is often performed, at least in part, to create an executable program.
- a computer program includes one or more executable complied applications.
- a software module comprises a file, a section of code, a programming object, a programming structure, or combinations thereof.
- a software module comprises a plurality of files, a plurality of sections of code, a plurality of programming objects, a plurality of programming structures, or combinations thereof.
- the one or more software modules comprise, by way of non-limiting examples, a web application, a mobile application, and a standalone application.
- software modules are in one computer program or application. In other embodiments, software modules are in more than one computer program or application. In some embodiments, software modules are hosted on one machine. In other embodiments, software modules are hosted on more than one machine. In further embodiments, software modules are hosted on cloud computing platforms. In some embodiments, software modules are hosted on one or more machines in one location. In other embodiments, software modules are hosted on one or more machines in more than one location.
- Food and environmental samples may be processed for various purposes, such as the enrichment of one or more microorganism from the sample, or the isolation of one or more microorganism from the sample.
- the following protocol was used in the preparation of various food and environmental samples including: carcass rinses, stainless steel, primary production boot covers, dry pet food and shell eggs.
- carcass food samples are generated by aseptically draining excess fluid from a carcass and transferring the carcass to a large sterile sampling bag.
- 100 mL of an enriched broth, in this case, Clear Salmonella media (CSM) was poured into the cavity of the carcass in the sampling bag.
- the carcass was rinsed inside and out with a rocking motion for about one minute, while assuring that all surfaces (interior and exterior of the carcass) were rinsed.
- About 20 ⁇ 0.5 mL of the CSM was added to the sample bag and homogenized by massaging sample bag for approximately 1.5-2 min. The sample was incubated at 42 ⁇ 1° C. for 9-24 h, providing an enriched sample.
- a stainless steel surface environmental sample was generated by moistening a sterile sampling sponge in 10 mL of Dey-Engley Broth prior to sampling, or using a sponge pre-moistened in the same.
- the sponge was used to touch, scrub, or otherwise contact the stainless steel surface and it was subsequently placed into a sampling bag.
- About 10 ⁇ 0.5 mL of CSM was added to the sampling sponge.
- the sponge was pressed to expel the collection broth into the CSM solution.
- the sample was incubated at 42 ⁇ 1° C. for 9-24 h, providing an enriched sample.
- an environmental sample from a boot cover was first pre-moistened in skim milk. About 50 ⁇ 1 mL of CSM was then added to the sampling bag containing boot cover environmental sample. The contents were mixed thoroughly for approximately 1.5-2 min, and incubated at 42 ⁇ 1° C. for 9-24 h, thereby providing an enriched sample. The enriched sample was removed from incubator and briefly mixed.
- a homogenized egg sample was added to a filtered sampling bag.
- About 200 ⁇ 2 mL CSM was then added to the sampling bag containing said homogenized egg sample.
- the contents were mixed thoroughly for approximately 1.5-2 min, and incubated at 42 ⁇ 1° C. for 9-24 h, thereby providing an enriched sample.
- the enriched sample was removed from incubator and briefly mixed.
- a photoreactive DNA-binding dye namely propidium monoazide (PMA) was added to various food and environmental samples, including the samples described in Examples 1-6.
- PMA propidium monoazide
- 5 ⁇ L of a PMAxx solution was added to a well in a 200 ⁇ L 96-well PCR plate.
- Approximately 45 ⁇ L of each enriched sample from the sampling bags described in Examples 1-6 was added to individual wells in PCR plate containing PMAxx.
- the samples were mixed thoroughly by gentle pipetting and placed in the dark for 10 min at room temperature. Subsequently, the plates were incubated under a blue LED light for 20 min.
- Step Temperature Time 1 37° C. 20 min 2 95° C. 10 min
- the plate was then incubated in a thermocycler as shown below. Analysis of the sample readouts showed that the addition of PMAxx solution (25 ⁇ L) to the sample solution was sufficient to reduce the number of free-floating DNA by at least 2 orders of magnitude, as shown in FIG. 13 .
- the samples described in Examples 1-8 were subjected to an amplification reaction. Briefly 15 ⁇ L of primer cocktail and polymerase master mix was added to individual wells of an empty 200 ⁇ L 96-well PCR plate. About 5 ⁇ l of each sample treated with a photoreactive DNA-binding dye treatment was added to the respective wells containing the polymerase master mix. The solution was mixed gently by pipetting up and down and placed in a thermocycler with the conditions described below.
- Step Temperature Time 1 95° C. 3 min 2 95° C. 30 sec 3 57° C. 1 min 4 72° C. 1 min 5 Go to step 2, 37 times 6 72° C. 10 min 7 10° C. Hold
- Solid Phase Reversible Immobilization (SPRI) Magnetic Beads were used to purify and quantify one or more of the samples described in Examples 1-9. Briefly, the SPRI beads were removed from 4° C. storage and allowed to reach room temperature for approximately 15 min. About 1 mL of 80% ethanol was prepared by combining 800 ⁇ L of ethanol and 200 ⁇ L of molecular biology grade water. Equal volumes of each samples amplification product (described in Example 9) was used to obtain at least 100 ⁇ L of pooled products, which was purified using the SPRI beads along with standard manufacturing protocols.
- SPRI Solid Phase Reversible Immobilization
- the tube was aspirated fully and the ethanol solution discarded. The process was repeated twice.
- the sample was allowed to dry for 3-5 min at room temperature, or until no visible ethanol remained. Once thoroughly dry, the tube was removed from the magnetic stand and re-suspended in 50 ⁇ L of 10 mM RSB into the tube.
- the tube was mixed thoroughly by gently pipetting up and down approximately 10 times and incubate at room temperature for 2 min.
- the tube was moved to a magnetic stand and incubated at room temperature for 2 min to allow the beads to pellet. Remove and retain 50 ⁇ L of the eluate.
- Example 10 the terminal ends of fragment nucleic acids described in Example 10 were repaired as described below.
- the following reagents were combined and mixed well by pipetting up and down approximately 10 times.
- Step Temperature Time 1 20° C. 5 min 2 65° C. 5 min 3 25° C. 5 min
- the samples were spun for approximately 5 s using a benchtop minifuge.
- 60 ⁇ L of SPRI beads were added to the end-repaired product and mixed by pipetting up and down approximately 10 times.
- the samples were incubated for 5 min at room temperature.
- the sample/bead mixture was placed in a magnetic stand and the beads were allowed to pellet in a ring around the middle portion of the tube for approximately 30-60 s, leaving a clear supernatant.
- the supernatant was discarded by leaving the tube in the magnetic stand while placing the pipette tip to the bottom center of the tube when aspirating to avoid disturbing the beads.
- 190 ⁇ L of 80% ethanol was added to the samples.
- the 80% ethanol solution was incubated in the tube for 5-10 s, and the ethanol was aspirated and discarded. This process was repeated twice.
- the sample was allowed to dry for 5 min at room temperature, or until no visible ethanol remained.
- the beads were resuspended with 31 ⁇ L molecular biology grade water and mixed by gently pipetting up and down approximately 10 times and incubate for 2 min at room temperature.
- the tube was moved to a magnetic stand and the beads were allowed to pellet for approximately 30-60 s.
- the eluate was retained as the “end-repaired product”.
- Reagent Volume End-repaired product 30 ⁇ L ONT Adapter Mix (AMX 1D) 20 ⁇ L NEB Blunt/TA Ligase Master Mix 50 ⁇ L Total 100 ⁇ L
- the reagents were gently mixed by pipetting up and down approximately 10 times and were incubated at room temperature for 10 min. About 40 ⁇ L of SPRI beads were added to the mixture, gently mixed, and incubated at room temperature for 5 min.
- the sample/bead mixture was placed in a magnetic stand and the beads were allowed to pellet in a ring around the middle portion of the tube for approximately 30-60 s, leaving a clear supernatant. The supernatant was discarded by leaving the tube in the magnetic stand while placing the pipette tip to the bottom center of the tube when aspirating to avoid disturbing the beads.
- the tube was removed from the magnetic rack and 140 ⁇ L of ONT-Adapter Bead Binding buffer was pipetted onto the beads.
- the sample was mixed by gently pipetting up and down approximately 10 times to resuspend the pellet.
- the tube was returned to the magnetic stand and the beads were allowed to pellet in a ring around the middle portion of the tube for approximately 30-60 s, leaving a clear supernatant.
- the supernatant was discarded by leaving the tube in the magnetic stand while placing the pipette tip to the bottom center of the tube when aspirating to avoid disturbing the beads.
- the tube was removed from the magnetic rack and an additional 140 ⁇ L of Adapter Bead Binding buffer was added and pipetted up and down to resuspend the pellet.
- the sample/bead mixture was placed in a magnetic stand and the beads were allowed to pellet into a ring around the middle portion of the tube for approximately 30-60 s, leaving a clear supernatant. The supernatant was discarded by leaving the tube in the magnetic stand while placing the pipette tip to the bottom center of the tube when aspirating to avoid disturbing the beads. The tube was then removed from the magnetic stand. About 15 ⁇ L of Elution Buffer (ELB) was added to the beads, and the beads were mixed thoroughly by pipetting up and down approximately 10 times and incubate for 10 minutes at room temperature for 5 min. The tubes were moved to a magnetic stand and the beads allowed to pellet for approximately 30-60 s. About 15 ⁇ L of eluate was remove and retained as the “final ligated product” for sequencing.
- ELB Elution Buffer
- a food or an environmental sample was processed by pore sequencing using standard manufacturer protocols. Briefly, one or more flow cells were primed by combining the following reagents per flow cell:
- Reagent Volume ONT-Running Buffer with Fuel Mix (RBF) 480 ⁇ L Molecular grade H 2 O 520 ⁇ L Total 1,000 ⁇ L
- a loading library was prepared by combining the following reagents:
- the priming port on the Flow Cell was gently opened and approximately 50 ⁇ L of the preservative buffer and any small bubbles were removed, as illustrated by FIG. 14 .
- About 800 ⁇ L of the priming mix was added into the priming port of the Flow Cell.
- 200 ⁇ L of the priming mix was dispensed into the Priming port.
- the final loading library was mixed thoroughly and 75 ⁇ L were added into the SpotON port, as illustrated by FIG. 15 .
- the lid of the pore sequencing device was closed and the sequencing was executed.
- an electronic communication comprising a data set associated with the sequencing reaction described in Example 13 was transmitted over the cloud for analysis.
- the results of the analysis were reported back to customer.
- FIG. 16 in this particular example the customer requested an analysis of the sample for the presence or absence of Listeria, Salmonella, Campylobacter , and E. coli , which required the simultaneous targeting of multiple pathogens.
- Table 2 Exemplary Pathogenic Microorganisms Identified by Methods According to This Disclosure Onset Common Name Time After Signs & Duration Organism of Illness Ingesting Symptoms of Ilness Food Sources Bacillus B. cereus food 10-16 hrs Abdominal 24-48 hours Meats, stews, cereus poisoning cramps, watery gravies, vanilla diarrhea, nausea sauce Campylobacter Campylobacteriosis 2-5 days Diarrhea, cramps, 2-10 days Raw and jejuni fever, and undercooked vomiting; diarrhea poultry, may be bloody unpasteurized milk, contaminated water Clostridium Botulism 12-72 hours Vomiting, Variable Improperly botulinum diarrhea, blurred canned foods, vision, double especially vision, difficulty home-canned in swallowing, vegetables, muscle weakness.
- Unpasteurized Usually, little or milk and juice, no fever is raw fruits and present. More vegetables (e.g. common in sprouts), and children 4 years contaminated or younger. Can water lead to kidney failure.
- the elderly or immuno- compromised patients may develop bacteremia or meningitis.
- Noroviruses Variously called 12-48 hrs Nausea, vomiting, 12-60 hrs Raw produce, viral abdominal contaminated gastroenteritis, cramping, drinking water, winter diarrhea, diarrhea, fever, uncooked acute non- bacterial headache. foods and gastroenteritis, Diarrhea is more cooked foods food poisoning, prevalent in that are not and food infection adults, vomiting reheated after more common in contact with an children.
- infected food handler shellfish from contaminated waters Salmonella Salmonellosis 6-48 hours Diarrhea, fever, 4-7 days Eggs, poultry, abdominal meat, cramps, vomiting unpasteurized milk or juice, cheese, contaminated raw fruits and vegetables Shigella Shigellosis or 4-7 days Abdominal 24-48 hrs Raw produce, Bacillary dysentery cramps, fever, and contaminated diarrhea. Stools drinking water, may contain blood uncooked and mucus. foods and cooked foods that are not reheated after contact with an infected food handler Staphylococcus Staphylococcal 1-6 hours Sudden onset of 24-48 hours Unrefrigerated aureus food poisoning severe nausea and or improperly vomiting. refrigerated Abdominal meats, potato cramps.
- Diarrhea and egg salads, and fever may be cream pastries present.
- a database was constructed using data from approximately 35,000 food or environmental samples (of which about 10% contained traces of pathogenic microorganisms as shown in Table 3) using two components: microorganism presence and chemical composition.
- Pore sequencing in combination with use of characteristic polymorphic gene regions comprising SNP's, RFLP's, STRs, VNTR's, hypervariable regions, minisatellites, dinucleotide repeats, trinucleotide repeats, tetranucleotide repeats, simple sequence repeats, indels, and insertion elements
- characteristic polymorphic gene regions comprising SNP's, RFLP's, STRs, VNTR's, hypervariable regions, minisatellites, dinucleotide repeats, trinucleotide repeats, tetranucleotide repeats, simple sequence repeats, indels, and insertion elements
- data on sample composition was collected for 4,600 food ingredients in each environmental/food sample.
- the data using the top bacteria associated with pathogen contamination (exemplified in FIG. 5 ) was used to train a classification model, which was tested for overfitting by machine learning techniques.
- This example describes the in silico evaluation of primer sensitivity and specificity for pathogen detection in PCR assays.
- a candidate primer pair was mapped against inclusion and exclusion sequences in sequence databases.
- the identified hits are tabulated based on predicted amplification patterns in order to then determine the sensitivity and specificity of the primer pair in silico.
- a primer pair was designed to target Salmonella Montevideo and Salmonella Oranienburg.
- the composition of the sequence database for in silico evaluation contained 7705 Salmonella genomes, including 98 Montevideo/Oranienburg genomes, and 1707 non- Salmonella genomes (total of 9412 genomes). Tabulation of the analysis results showed that the exact number of 98 Salmonella Montevideo and Oranienburg genomes was identified as true positive hits. The remaining 9314 (which equals the total number of 9412 genomes minus the 98 true positive hits identified) genomes were characterized as true negative results. The results are shown in FIG. 17 .
- FIG. 18 illustrates that the number of reads per sample for reused MinION/GridION flow cells was well above the acceptable minimum threshold of 10,000 (10 K) reads per sample.
- a significant source of confounding data in pathogen risk detection is contamination of samples by resident microorganisms on human handlers. Accordingly, we deployed a biomek-based sample sequencing platform that requires no human handling after enrichment (see FIG. 11 and FIG. 12 ) to implement the methods of Examples 10-13 and 15. Automation included every step of library preparation post incubation of the samples as in Examples 1-6, and included cell lysis, PCR, clean up, and sequencing. An automated handling system is illustrated in FIG. 11 .
- a significant limitation of existing environmental pathogen detection methods is that they involve culturing, which involves the use of multiple different specialized media to detect different classes of pathogens (e.g. bacteria autotrophic for one or more nutrient vs those not). This severely limits the ability to detect food contamination during storage. Accordingly, we applied our environmental sampling/pore sequencing technique as outlined in Examples 1-13 on 100 samples of chicken wings and 100 samples of ground chicken. Each sample was analyzed for the presence/absence of 17,800 pathogenic and non-pathogenic bacteria.
- the principle components analysis suggested a classification model could be built to detect whether or not a whole or ground chicken sample had expired.
- the data on the presence/absence of 17,800 pathogenic and non-pathogenic bacteria was used to generate a classification model.
- this classifier When tested on an independent data set of samples, this classifier showed 97% accuracy in detecting samples past their expiration date using an ROC analysis.
- a defined Levenshtein distance between each “building block” or molecular index can be used to form larger barcodes.
- Such larger barcodes can have a period block design, such as barcodes created by repeating each block multiple times with the largest possible Levenshtein distance between the individual blocks (see FIG. 23 ).
- barcodes can also have a nonperiod block design, such as barcodes created by concatenative multiple blocks that are unique to each barcode with the largest possible Levenshtein distance between the individual blocks (see FIG. 23 ).
- Both barcode designs present distinct advantages. Both increase the number of retained sequences and allow for adjustable precision by choosing 1, 2, or 3 blocks in demultiplexing, but the periodic design requires fewer repeat blocks and presents less complexity in demultiplexing, whereas the nonperiodic design allows for improved crosstalk prevention.
- the improved crosstalk prevention of the nonperiodic design suggests a method of reducing crosstalk during highly multiplexed runs or when a flowcell is reused.
- Listeria-containing food and environmental samples were prepared, libraries were constructed, and sequencing was performed as in Examples 1-13 and 15. Samples were analyzed for the presence of Listeria by analyzing highly polymorphic genetic markers. A principle component analysis of the Listeria sequences isolated from sequencing (see FIG. 24 ) identified clusters of closely related bacteria which likely originated from the same source.
- the length of time for a full sequencing run represents a major limitation in the speed of detection or serotyping of pathogenic bacterial strains by high-throughput sequencing.
- We hypothesized that using “live” detection calls during sequencing runs (which can be performed as early as 1 hour for ONT MinION and GridION, and 5 hours for Illumina MiSeq) would allow for certain bacteria to be detected/serotyped on a preliminary basis based on sequencing, with follow-up confirmation by other non-sequencing-based tests (e.g. Q-PCR).
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Physics & Mathematics (AREA)
- General Health & Medical Sciences (AREA)
- Analytical Chemistry (AREA)
- Biotechnology (AREA)
- Immunology (AREA)
- Organic Chemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Biophysics (AREA)
- Biochemistry (AREA)
- Medical Informatics (AREA)
- Food Science & Technology (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Molecular Biology (AREA)
- Genetics & Genomics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Theoretical Computer Science (AREA)
- Evolutionary Biology (AREA)
- Microbiology (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Pathology (AREA)
- General Physics & Mathematics (AREA)
- Medicinal Chemistry (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Bioethics (AREA)
- Databases & Information Systems (AREA)
- Urology & Nephrology (AREA)
- Hematology (AREA)
- Software Systems (AREA)
- Public Health (AREA)
- Epidemiology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Artificial Intelligence (AREA)
Abstract
Description
- This application claims priority to Provisional Patent Application Ser. No. 62/611,846, filed on Dec. 29, 2017, which is incorporated herein by reference in its entirety.
- Food producers recall their products from the marketplace when the products are mislabeled or when the food may present a health hazard to consumers because the food is contaminated or has caused a foodborne illness outbreak. Although these producers rely on several existing monitoring programs for pathogens, natural toxins, pesticides, and other contaminants about 48 million cases of foodborne illness are still identified annually in the United States alone—the equivalent of sickening 1 in 6 Americans each year. And each year these illnesses result in an estimated 128,000 hospitalizations and 3,000 deaths. The threats are numerous and varied, with symptoms ranging from relatively mild discomfort to very serious, life-threatening illness. While the very young, the elderly, and persons with weakened immune systems are at greatest risk of serious consequences from most foodborne illnesses, some of the microorganisms detected in foods pose grave threats to all persons.
- In some aspects the disclosure provides a method comprising: (a) detecting a presence or an absence of a non-pathogenic microorganism in a sample; (b) predicting, by a computer system, a presence or an absence of a pathogenic microorganism in said sample based on said presence or said absence of said non-pathogenic microorganism. In some aspects, said predicting is performed by a machine learning algorithm in a computer, such as a support vector machine (SVM), a Naive Bayes classification, a random forest, Logistic Regression, and a neural network. In some aspects, said sample is a food sample or an environmental sample associated with said food sample. In some instances, said food sample is a perishable, such as a meat, a poultry, a red meat, a fish, a swine, a fruit, an egg, a vegetable, a produce, or a legume. In some aspects, said environmental sample is a surface swab or a surface rinse of said environment. In some aspects, said environmental sample is a food storage container, a food handling equipment, or a piece of clothing from a worker of said environment associated with said food processing facility. In some aspects, said sample is a non-food sample. In some aspects, said sample comprises blood, plasma, urine, tissue, faces, bone marrow, saliva or cerebrospinal fluid. In some instances, said non-pathogenic microorganism is selected from the group consisting of: Enterobacter asburiae, Enterobacter bugandensis, Enterobacter cancerogenus, Enterobacter cloacae, Enterobacter endosymbiont, Enterobacter hormaechei, Enterobacter kobei, Enterobacter ludwigii, Enterobacter mori, and Enterobacter soli. In some instances, said pathogenic microorganism is selected from the group consisting of: a microorganism of the Salmonella genus, a microorganism of the Campylobacter genus, a microorganism of the Listeria genus, and a microorganism of the Escherichia genus. In some instances, said pathogenic microorganism is selected from the group consisting of Vibrio parahaemolyticus, Vibrio cholera, Vibrio vulnificus, Escherichia coli, Salmonella enterica, Shigella boydii, Campylobacter jejuni, Staphylococcus aureus, Listeria monocytogenes, Clostridium botulinum, Yersinia pseudotuberculosis, Clostridium perfringens, Yersinia enterocolitica, Coxiella burnetii, Yersinia pseudotuberculosis, Vibrio parahaemolyticus, Bacillus cereus, Mycobacterium tuberculosis, Shigella flexneri, Shigella boydii, Shigella dysenteriae, and Shigella sonnei. In some instances, said detecting comprises a nucleic acid characterization assay selected from the group consisting of a pore sequencing reaction, a next generation sequencing reaction, a shotgun next generation sequencing, Sanger sequencing, or hybridization assay. In some instances, the method further comprises performing an assay to confirm the prediction of (b), such as a polymerase chain reaction (PCR) assay, an enzyme-linked immunosorbent (ELISA) assay, or an enzyme-linked fluorescent assay (ELFA) assay.
- In some aspects, the disclosure provides a method comprising: (a) sequencing a plurality of nucleic acid sequences from a food sample or from an environmental sample associated with said food sample for a period of time; and (b) performing an assay on said food sample or said environment associated with said food sample if said sequencing for said period of time identifies a threshold level of nucleic acid sequences from a microorganism in said food sample. In some instances, said period of time is less than 30 minutes. In some instances, said threshold is no more than 0.1% of said nucleic acid sequences from said microorganism. In some aspects, the method further comprises performing an amplification reaction on said plurality of nucleic acid sequences prior to sequencing. In some aspects, said sequencing is a pore sequencing reaction. In some aspects, said assay is a serotyping assay, a culturing assay, a Pulse Field Gel Electrophoresis (PFGE) assay, a RiboPrinter® assay, a q-PCR assay, a Sanger sequencing assay, an ELISA assay, a Whole Genome Sequencing (WGS) assay, a targeted sequencing assay, or a shotgun metagenomics assay. In some aspects, said microorganism is selected from the group consisting of: a microorganism of the Salmonella genus, a microorganism of the Campylobacter genus, a microorganism of the Listeria genus, and a microorganism of the Escherichia genus. In some aspects, said microorganism of the Salmonella genus has a serotype selected from the group consisting of: Enteritidis, Typhimurium, Newport, Javiana, Infantis, Montevideo, Heidelberg, Muenchen, Saintpaul, Oranienburg, Braenderup, Paratyphi B var. L(+) Tartrate+, Agona, Thompson, and Kentucky. In some aspects, said microorganism of the Escherichia genus has a serotype selected from the group consisting of: O103, O111, O121, O145, O26, O45, and O157. In some aspects, said microorganism of the Listeria genus has a serotype selected from the group consisting of: 2a, 1/2b, 1/2c, 3a, 3b, 3c, 4a, 4b, 4ab, 4c, 4d, and 4e. In some aspects, said microorganism of the Campylobacter genus is C. jejunis, C. lari, or C. coli. In some aspects, said food sample is a meat, a poultry, a red meat, a fish, or a swine, a fruit, an egg, a vegetable, a produce or a legume. In some aspects said environmental sample is a surface swab of said environment, a surface rinse of said environment, a food storage container, a food handling equipment, or a piece of clothing from a worker of said environment associated with said food sample.
- All publications, patents, and patent applications mentioned in this specification are herein incorporated by reference to the same extent as if each individual publication, patent, or patent application was specifically and individually indicated to be incorporated by reference.
- The novel features of the invention are set forth with particularity in the appended claims. A better understanding of the features and advantages of the present invention will be obtained by reference to the following detailed description that sets forth illustrative embodiments, in which the principles of the invention are utilized, and the accompanying drawings of which:
-
FIG. 1 (FIG. 1 ): illustrates the deploying of asequencing assay 101 to one or more food processing facilities, food testing lab, or any otherdiagnostic lab 102 for performing a sequencing reaction of a food sample or of an environmental sample from said food processing facilities such as, for example, soil, water, air, animal product(s), feed, manure, crop production, or any sample associated with a manufacturing plant. -
FIG. 2 (FIG. 2 ): illustrates a transmission of an electronic communication comprising a data set associated with a sequencing reaction from one or more food processing facilities to a server. -
FIG. 3 (FIG. 3 ): is a chart illustrating that a redundancy in genetic markers decreases a false negative rate of a method of the disclosure. -
FIG. 4 (FIG. 4 ): illustrates a process for predictive risk assessment based on a detection of a non-pathogenic microorganism. -
FIG. 5 (FIG. 5 ): is a heat map illustrating predictive pathogen detection through machine learning. -
FIG. 6 (FIG. 6 ): illustrates a process for predicting a shelf-life of a food based on the detection of a microorganism. -
FIG. 7 (FIG. 7 ): is a diagram illustrating the tunable resolution of various assays. -
FIG. 8 (FIG. 8 ): is a schematic illustrating various serotypes of various microorganisms that can be detected by an analysis of a plurality of nucleic acid sequences as described herein and further validated with a serotyping assay. -
FIG. 9 (FIG. 9 ): is a schematic illustrating one process for distinguishing a live microorganism from a food or from an environmental sample. -
FIG. 10 (FIG. 10 ): illustrates a process for re-using flow cells with distinct indexes. -
FIG. 11 (FIG. 11 ): illustrates an automated sequencing apparatus of the disclosure. -
FIG. 12 (FIG. 12 ): illustrates a sequencing process with no human touch points after enrichment. -
FIG. 13 (FIG. 13 ): illustrates the PMAxx-induced removal of free-floating DNA. -
FIG. 14 (FIG. 14 ): illustrates a priming port in a flow cell. -
FIG. 15 (FIG. 15 ): illustrates a dispensing of a loading library on a flow cell. -
FIG. 16 (FIG. 16 ): illustrates the simultaneous targeting of multiple pathogens. -
FIG. 17 (FIG. 17 ): illustrates the in silico prediction of primer sensitivity/specificity. -
FIG. 18 (FIG. 18 ): illustrates the reuse of MinION/GridION flow cells. -
FIG. 19 (FIG. 19 ): illustrates the number of reads per sample during reuse of MinION/GridION flow cells. -
FIG. 20 (FIG. 20 ): illustrates the performance of the disclosed automated handling system on samples spiked with 10 different Salmonella serotypes (Enteritidis, Thyphimurium, 14_[5]_12:i:, Newport, Javiana, Infantis, Montevideo, Heidelberg, Muenchen). -
FIG. 21 (FIG. 21 ): illustrates a principal component analysis to chicken wing chicken data sets. -
FIG. 22 (FIG. 22 ): illustrates a principal component analysis to ground chicken data sets. -
FIG. 23 (FIG. 23 ): illustrates periodic and nonperiodic barcode designs. -
FIG. 24 (FIG. 24 ): illustrates a principle component analysis of Listeria sequences identifying clusters of closely related bacteria which likely originated from the same source. - Food safety is a complex issue that has an impact on multiple segments of society. Usually a food is considered to be adulterated if it contains: (1) a poisonous or otherwise harmful substance that is not an inherent natural constituent of the food itself, in an amount that poses a reasonable possibility of injury to health, or (2) a substance that is an inherent natural constituent of the food itself; is not the result of environmental, agricultural, industrial, or other contamination; and is present in an amount that ordinarily renders the food injurious to health. The first includes, for example, a pathogenic bacterium, fungus, parasite or virus, if the amount present in the food may be injurious to health. An example of the second is the tetrodotoxin that occurs naturally in some organs of some types of pufferfish and that ordinarily will make the fish injurious to health. In either case, foods adulterated with these agents are generally deemed unfit for consumption.
- Many different disease-causing microorganisms can contaminate foods, and there are many different foodborne infections. Although our scientific understanding of pathogenic microorganisms and their toxins is continually advancing, some of the most common microorganisms associated with foodborne illnesses include microorganisms of the Salmonella, Campylobacter, Listeria, and Escherichia genus.
- Salmonella for example is widely dispersed in nature. It can colonize the intestinal tracts of vertebrates, including livestock, wildlife, domestic pets, and humans, and may also live in environments such as pond-water sediment. It is spread through the fecal-oral route and through contact with contaminated water. (Certain protozoa may act as a reservoir for the organism). It may, for example, contaminate poultry, red meats, farm-irrigation water (thereby contaminating produce in the field), soil and insects, factory equipment, hands, and kitchen surfaces and utensils.
- Campylobacter jejuni is estimated to be the third leading bacterial cause of foodborne illness in the U.S. The symptoms this bacterium causes generally last from 2 to 10 days and, while the diarrhea (sometimes bloody), vomiting, and cramping are unpleasant, they usually go away by themselves in people who are otherwise healthy. Raw poultry, unpasteurized (“raw”) milk and cheeses made from it, and contaminated water (for example, unchlorinated water, such as in streams and ponds) are major sources, but C. jejuni also occurs in other kinds of meats and has been found in seafood and vegetables.
- Although the number of people infected by foodborne Listeria is comparatively small, this bacterium is one of the leading causes of death from foodborne illness. It can cause two forms of disease. One can range from mild to intense symptoms of nausea, vomiting, aches, fever, and, sometimes, diarrhea, and usually goes away by itself. The other, more deadly, form occurs when the infection spreads through the bloodstream to the nervous system (including the brain), resulting in meningitis and other potentially fatal problems.
- Escherichia microorganisms are also diverse in nature. For instance, at least four groups of pathogenic Escherichia coli have been identified: a) Enterotoxigenic Escherichia coli (ETEC), b) Enteropathogenic Escherichia coli (EPEC), c) Enterohemorrhagic Escherichia coli (EHEC), and Enteroinvasive Escherichia coli (EIEC). While ETEC is generally associated with traveler's diarrhea some members of the EHEC group, such as E. coli 0157:H7, can cause bloody diarrhea, blood-clotting problems, kidney failure, and death. Thus, it is important to be able not only to identify individual microorganism, but also to distinguish them.
- Provided herein are methods and apparatus for the identification of pathogenic and non-pathogenic microorganisms in food and environmental samples. The disclosure solves existing challenges encountered in identifying food borne pathogens, including pathogens of the Salmonella, Campylobacter, Listeria, and Escherichia genus in a timely and efficient manner. The disclosure also provides methods for differentiating a transient versus a resident pathogen, correlating presence of non-pathogenic with pathogenic microorganisms, and distinguishing live versus dead microorganisms by sequencing, amongst others.
- As used herein, the term “food processing facility” includes facilities that manufacture, process, pack, or hold food in any location globally. A food processing facility can, for example, determine the location and source of an outbreak of food-borne illness or a potential bioterrorism incident.
- As used herein, the term “food” includes any nutritious substance that people or animals eat or drink, or that plants absorb, in order to maintain life and growth. Non-limiting examples of foods include red meat, poultry, fruits, vegetables, fish, pork, seafood, dairy products, eggs, egg shells, raw agricultural commodities for use as food or components of food, canned foods, frozen foods, bakery goods, snack food, candy (including chewing gum), dietary supplements and dietary ingredients, infant formula, beverages (including alcoholic beverages and bottled water), animal feeds and pet food, and live food animals. The term “environmental sample,” as used herein, includes all food contact substances or items from a food processing facility. The term environmental sample includes a surface swab of a food contact substance, a surface rinse of a food contact substance, a food storage container, a food handling equipment, a piece of clothing from a subject in contact with a food processing facility, or another suitable sample from a food processing facility. The term “sample” as used herein, generally refers to any sample that can be informative of an environment or a food, such as a sample that comprises soil, water, water quality, air, animal production, feed, manure, crop production, manufacturing plants, environmental samples or food samples directly. The term “sample” may also refer to other non-food sample, such as samples derived from a subject, such as comprise blood, plasma, urine, tissue, faces, bone marrow, saliva or cerebrospinal fluid. Such samples may be derived from a hospital or a clinic.
- As used herein, the term “subject,” can refer to a human or to another animal. An animal can be a mouse, a rat, a guinea pig, a dog, a cat, a horse, a rabbit, and various other animals. A subject can be of any age, for example, a subject can be an infant, a toddler, a child, a pre-adolescent, an adolescent, an adult, or an elderly individual.
- As used herein, the term “disease,” generally refers to conditions associated with the presence of a microorganism in a food, e.g., outbreaks or incidents of foodborne disease.
- The term “nucleic acid” or “polynucleotide,” as used herein, refers to a polymeric form of nucleotides of any length, either ribonucleotides or deoxyribonucleotides. Polynucleotides include sequences of deoxyribonucleic acid (DNA), ribonucleic acid (RNA), or DNA copies of ribonucleic acid (cDNA).
- The term “polyribonucleotide,” as used herein, generally refers to polynucleotide polymers that comprise ribonucleic acids. The term also refers to polynucleotide polymers that comprise chemically modified ribonucleotides. A polyribonucleotide can be formed of D-ribose sugars, which can be found in nature, and L-ribose sugars, which are not found in nature.
- The term “polypeptides,” as used herein, generally refers to polymer chains comprised of amino acid residue monomers which are joined together through amide bonds (peptide bonds). The amino acids may be the L-optical isomer or the D-optical isomer.
- The term “barcode,” as used herein, generally refers to a label, or identifier, that conveys or is capable of conveying information about one or more nucleic acid sequences from a food sample or from an environmental sample associated with said food sample. A barcode can be part of a nucleic acid sequence. A barcode can be independent of a nucleic acid sequence. A barcode can be a tag attached to a nucleic acid molecule. A barcode can have a variety of different formats. For example, barcodes can include: polynucleotide barcodes; random nucleic acid and/or amino acid sequences; and synthetic nucleic acid and/or amino acid sequences. A barcode can be added to, for example, a fragment of a deoxyribonucleic acid (DNA) or ribonucleic acid (RNA) sample before, during, and/or after sequencing of the sample. Barcodes can allow for identification and/or quantification of individual sequencing-reads. Examples of such barcodes and uses thereof, as may be used with methods, apparatus and systems of the present disclosure, are provided in U.S. Patent Pub. No. 2016/0239732, which is entirely incorporated herein by reference. In some instances, as described herein, a “molecular index” can either be a barcode itself or it can be a building block, i.e., a component or portion of a larger barcode.
- The term “sequencing,” as used herein, generally refers to methods and technologies for determining the sequence of nucleotide bases in one or more nucleic acid polymers, i.e., polynucleotides. Sequencing can be performed by various systems currently available, such as, without limitation, a sequencing system by Illumina®, Pacific Biosciences (PacBio®), Oxford Nanopore®, Genia (Roche) or Life Technologies (Ion Torrent®). Alternatively or in addition, sequencing may be performed using nucleic acid amplification, polymerase chain reaction (PCR) (e.g., digital PCR, quantitative PCR, or real time PCR), or isothermal amplification. Such systems may provide a plurality of raw data corresponding to the genetic information associated with a food sample or an environmental sample. In some examples, such systems provide nucleic acid sequences (also “reads” or “sequencing reads” herein). The term also refers to epigenetics which is the study of heritable changes in gene function that do not involve changes in the DNA sequence. A read may include a string of nucleic acid bases corresponding to a sequence of a nucleic acid molecule that has been sequenced.
- Many food poisoning outbreaks have been associated with pathogenic microorganisms including pathogens of the Salmonella, Campylobacter, Listeria, and Escherichia genus. Examples of foods that have been associated with such outbreaks include milk, cheeses, vegetables, meats (notably beef and poultry), fish, seafood, and many others. Potential contamination sources for various pathogens include raw materials, food workers, incoming air, water, and food processing environments. Among those, post-processing contamination at food-contact surfaces in a food processing facility poses a great threat to product contamination.
- There are many challenges in ensuring the safety of our food supply. Some of these challenges include changes in a food processing environment that lead to food contamination, such as the introduction of a new lot of contaminated raw products. Other challenges include changes in food production and supply, which include importing and exporting foods from different jurisdictions, which may have distinct standards to assess a risk associated with a food. In addition, new and emerging bacteria strains, toxins, and antibiotic resistance may not be detected by traditional serotyping or PCR methods of detection.
- In some aspects, the disclosure provides a method for the identification of a microorganism associated with a food or with a food processing facility. In some aspects the method comprises deploying an assay to one or more food processing facilities; performing a sequencing reaction of a food sample or of an environmental sample from said one or more food processing facilities; transmitting an electronic communication comprising a data set associated with said sequencing reaction of said food sample or of said environmental sample from said one or more food processing facilities to a server; and scanning, by a computer, at least a fraction of said transmitted data set for one or more genes associated with a microorganism.
- In some instances, the scanning scans fewer than 1%, fewer than 0.1%, fewer than 0.001% of said transmitted data set for one or more genes associated with said microorganism. Said scanning can be performed to identify a variety of polymorphic gene regions (comprising SNP's, RFLP's, STRs, VNTR's, hypervariable regions, minisatellites, dinucleotide repeats, trinucleotide repeats, tetranucleotide repeats, simple sequence repeats, indels, and insertion elements) associated with a wide diversity of microorganisms. The variety of polymorphic regions to be searched for can be determined by creating a large database of sequences from dozens, hundreds and thousands of food and environmental samples. For instance, a database of such polymorphic regions can be constructed by performing sequencing reactions on at least 5,000, at least 10,000, at least 15,000, at least 20,000, at least 25,000, at least 30,000, at least 35,000, at least 40,000, at least 45,000, at least 50,000 different food or environmental samples. The sequences obtained can be used to compile information in a database that includes: a) the composition of each sample; and b) the presence or absence of a variety of pathogenic and non-pathogenic organisms associated on each sample. In addition to containing information about various types of genus and species, such databases comprise data from polymorphic gene regions of a variety of strains that are variants of a single species. For example, a plurality of sequences in the database might correspond to one or more serovars, morphovars, biovars, or other strain specific information.
- A variety of sequencing techniques, such as a pore sequencing reaction, a next generation sequencing reaction, a shotgun next generation sequencing, or Sanger sequencing can be used to create a collection of polymorphic regions. In some instances, said sequencing reaction is a pore sequencing reaction and said pore sequencing reaction distinguishes an epigenetic pattern on a nucleic acid from said food sample or from said environmental sample.
- In some cases, said microorganism may be pre-selected by a customer. A customer can be an individual or an entity, such as one or more food processing facilities. For example, a customer can be a food packaging facility; a food distribution center; a food storage center; a facilities handling meat, poultry, egg, or another edible product; a farm; a retail food establishment; a fishing vessel; or another type of facility that also manufactures, processes, packs, or holds foods for any period of time.
- A customer may pre-select a microorganism of interest to be identified with any of the methods disclosed herein. For example, raw or undercooked ground beef and beef products are vehicles often implicated in E. coli O157:H7 outbreaks. Produce, including bagged lettuce, spinach, and alfalfa sprouts, are also increasingly being implicated in E. coli O157:H7 outbreaks. A food processing facility producing raw meats or other produce associated with E. coli O157:H7 may be a customer that pre-selects E. coli as a microorganism for analysis. A customer may pre-select one or more types of microorganisms for analysis. A microorganism can be one or more of types of bacteria, fungus, parasites, protozoa, and viruses.
- Non-limiting examples of bacteria that can be pre-selected by a customer and detected with the methods of the disclosure include: bacteria in the Escherichia genus, including enterotoxigenic Escherichia coli (ETEC), enteropathogenic Escherichia coli (EPEC), enterohemorrhagic Escherichia coli (EHEC), and enteroinvasive Escherichia coli (EIEC); bacteria of the Salmonella genus; bacteria of the Campylobacter genus; bacteria of the Listeria genus; bacteria of the Yersinia genus; bacteria of the Shigella genus; bacteria of the Vibrio genus; bacteria of the Coxiella genus; bacteria of the Mycobacterium genus; bacteria of the Brucella genus; bacteria of the Vibrio genus; bacteria of the Cronobacter genus; bacteria of the Aeromonas genus; bacteria of the Plesiomonas genus; bacteria of the Clostridium genus; bacteria of the Staphylococcus genus; bacteria of the Bacillus genus; bacteria of the Streptococcus genus; bacteria of the Clostridium genus; and bacteria of the Enterococcus genus.
- A microorganism can be a virus. Non-limiting examples of viruses that can be pre-selected by a customer and detected with the methods of the disclosure include: noroviruses, Hepatitis A virus, Hepatitis E virus, rotavirus.
- The performing of a sequencing reaction of a food sample or of an environmental sample from said one or more food processing facilities often generates a plurality of nucleic acids sequences that contain redundant information or information associated with genes that are not from a microorganism. In some aspects, the disclosed methods empower efficient data analysis by facilitating the targeted analysis of a smaller data set. The generated data could be in the range of Kb, Mb, Gb, Tb or more per analyzed sample. In some aspects, said scanning scans fewer than 1/10, fewer than 1/20, fewer than 1/30, fewer than 1/40, fewer than 1/50, fewer than 1/60, fewer than 1/70, fewer than 1/80, fewer than 1/90, fewer than 1/100, fewer than 1/200, fewer than 1/300, fewer than 1/400, fewer than 1/500, fewer than 1/600, fewer than 1/700, fewer than 1/800, fewer than 1/900, fewer than 1/1,000, fewer than 1/10,000, or fewer than 1/100,000 of a data set, such as a transmitted data set for one or more genes associated with a microorganism. In some aspects, said scanning scans at least a fraction of said transmitted data set for one or more genes associated with two or more, three or more, four or more, five or more, six or more, seven or more, eight or more, nine or more, ten or more microorganisms or another suitable number. In some instances, said scanning comprises scanning said transmitted data set for one or more polymorphic gene regions. In some instances, said one or more polymorphic regions comprise one or more single nucleotide polymorphisms (SNP's), one or more restriction fragment length polymorphisms (RFLP's), one or more short tandem repeats (STRs), one or more variable number of tandem repeats (VNTR's), one or more hypervariable regions, one or more minisatellites, one or more dinucleotide repeats, one or more trinucleotide repeats, one or more tetranucleotide repeats, one or more simple sequence repeats, one or more indel, or one or more insertion elements. In some instances said one or more polymorphic regions comprise one or more single nucleotide polymorphisms (SNP's). A data set associated with a sequencing reaction of a food sample or of an environmental sample can be transmitted to a server and scanned by a computer.
- In some cases, a method can detect a microorganism selected from the group consisting of: a microorganism of the Salmonella genus, a microorganism of the Campylobacter genus, a microorganism of the Listeria genus, and a microorganism of the Escherichia genus. The detected microorganisms may be of any serotype and a scanning, by a computer, of one or more genes associated with a microorganism may detect a microorganism independently of its serotype.
- In some cases, a sequencing reaction of a food sample, an environmental sample, or another sample is a pore sequencing reaction, such as an Oxford Nanopore® sequencing reaction. In some instances, at least one barcode is added to one or more nucleic acid polymers derived from a food sample, from an environmental sample, or from another sample prior to performing said sequencing reaction. In some instances, a plurality of mutually exclusive barcodes are added to a plurality of food processing facilities, thereby creating a barcode identifier that can be associated with each food processing facility. For instance, a barcoded sequencing read comprising sequences from a pathogenic microorganism can be associated with a food or processing facility. In some aspects, a method disclosed herein further comprises creating, in a computer, a data file that associates said at least one barcode with a source of said food sample, of said environmental sample, or of another sample.
- In some aspects, the disclosed methods comprise computer systems that are programmed to implement methods of the disclosure.
FIG. 1 illustrates the deploying of asequencing assay 101 to one or morefood processing facilities 102, food testing lab, or any other diagnostic lab and performing a sequencing reaction of a food sample or of an environmental sample from said one or morefood processing facilities 102. The food processing facility, food testing lab, or any other diagnostic lab may have one or more computer systems that can be used to transmit the results of the sequencing reads to a server, either on premise or remotely deployed cloud environment.FIG. 2 illustrates a transmission of an electronic communication comprising a data set associated with a sequencing reaction from one or more food processing facilities, food testing labs, or any other diagnostic labs to a server. - The raw sequence data collected from the sequencing reaction includes a large set of data that includes all individual sequences as well as the quality at each base. From this large data set, the Clear Labs bioinformatics pipeline extracts a final report that is orders of magnitudes smaller. The final report (e.g. electronic communication) is essentially limited to the presence or absence of an organism of interest, for instance pathogens, and a further classification of the organism in terms of serotypes, strains, or other subclassifications. The collected data not used in the report comprises the following:
- (a) Read quality: The raw sequences include information on the quality of the sequences per base. The quality scores can be used in a Bayesian model where classifications are statistically sensitive to these quality scores. Furthermore the quality scores can reveal more on possible relations that content of samples have with the accuracy of sequencing platform.
- (b) Sequence time: The raw sequences also include information on the time when the sequence was read by the sequencer. The number of sequences form the same source as a function of time can reveal a lot more information than we currently have. In addition, using these time data, can be useful in generating reports for all or some of the samples earlier than it is currently done.
- (c) Trimmed portions of sequences: During demultiplexing of the sequences initial and terminal portions of those sequences are trimmed. Those portions include adapters, index barcodes, and primers. The main data extracted from the trimmed portions, identifies which sample the sequence belonged to. This decision however is influenced by sequencing errors, and special properties of the involved sequences. The information on accuracy of this decision, and other factors gets lost with trimming. Moreover the quality of these portions can be used as an indicator for the quality of the entire sequence.
- (d) Clustering: An important step in the pipeline involves clustering sequences that are close enough to each other and representing all the sequences within a cluster by a consensus sequence. This reduces the data significantly and make is easier to classify these sequences. However these differences, even if minute, carry information that gets lost with clustering. Clustering with more stringent criteria, or no clustering can lead into higher resolution and perhaps finer classification.
- A
computer system 201 can be programmed or otherwise configured to process and transmit a data set from a food processing facility, food testing labs, or any other diagnostic labs. Thecomputer system 201 includes a central processing unit (CPU, also “processor” and “computer processor” herein) 204, which can be a single core or multi core processor, or a plurality of processors for parallel processing. Thecomputer system 201 also includes memory or memory location 205 (e.g., random-access memory, read-only memory, flash memory), electronic storage unit 206 (e.g., hard disk), communication interface 202 (e.g., network adapter) for communicating with one or more other systems, such as for instance transmitting a data set associated with said sequencing reads, andperipheral devices 204, such as cache, other memory, data storage and/or electronic display adapters. Thememory 205,storage unit 206,interface 202 andperipheral devices 203 are in communication with theCPU 204 through a communication bus (solid lines), such as a motherboard. Thestorage unit 206 can be a data storage unit (or data repository) for storing data. For instance, in some cases, thedata storage unit 206 can store a plurality of sequencing reads and provide a library of sequences associated with one or more strains from one or more microorganisms associated with a food processing facility, food testing labs, or any other diagnostic labs. - The
computer system 201 can be operatively coupled to a computer network (“network”) 207 with the aid of thecommunication interface 202. Thenetwork 207 can be the Internet, an interne and/or extranet, or an intranet and/or extranet that is in communication with the Internet. Thenetwork 207 in some cases is a telecommunication and/or data network. Thenetwork 207 can include one or more computer servers, which can enable distributed computing, such as cloud computing. Thenetwork 207, in some cases with the aid of thecomputer system 201, can implement a peer-to-peer network, which may enable devices coupled to thecomputer system 201 to behave as a client or a server. - Some families of microorganisms comprise both harmless and highly pathogenic bugs. The Escherichia family of pathogens, for example, comprise lethal and harmless strains of E. coli. Thus it is not only relevant to be able to identify a pathogen in a sample, but it is also relevant to be able to characterize it with high sensitivity. In some aspects, the disclosure provides a method comprising obtaining a plurality of nucleic acid sequences from a food sample, from an environment associated with said food sample or from another sample, such as non-food derived samples from clinical sources, including blood, plasma, urine, tissue, faces, bone marrow, saliva or cerebrospinal fluid samples; scanning, by a computer, at least a fraction of said plurality of said nucleic acid sequences for a plurality of nucleic acid regions from one or more microorganisms selected from the group consisting of: a microorganism of the Salmonella genus, a microorganism of the Campylobacter genus, a microorganism of the Listeria genus, and a microorganism of the Escherichia genus, wherein said scanning characterizes said one or more microorganisms with greater than 98% sensitivity, greater than 98.5% sensitivity, greater than 99% sensitivity, greater than 99.5% sensitivity, or greater than 99.9% sensitivity. In some aspects, said scanning characterizes said one or more microorganisms with greater than 98% specificity, greater than 98.5% specificity, greater than 99% specificity, greater than 99.5% specificity, or greater than 99.9% specificity. Sensitivity can be a measure of a microorganism that is correctly identified (e.g. the percentage of a microorganism that can be correctly identified based on sequencing read analyses). Specificity (also called the true negative rate) measures the proportion of negatives that are correctly identified as such (e.g. the percentage of food samples or environmental samples that are correctly identified as not having the microorganism therein). In some instances, said method can distinguish a genetic variant or subtype of a microorganism (e.g., one or more bacterial strains).
- In some instances said plurality of nucleic acid sequences comprise complementary DNA (cDNA) sequences, ribonucleic acid (RNA) sequences, genomic deoxyribonucleic acid (gDNA) sequences or a mixture of cDNA, RNA, and gDNA sequences. In some instances, the high sensitivity of the disclosed method, the high specificity of the disclosed method, or both, can be accomplished by scanning said plurality of said nucleic acid sequences for one or more polymorphic gene regions associated with said microorganisms. In some instances, said one or more polymorphic regions is selected from the group consisting of one or more single nucleotide polymorphisms (SNP's), one or more restriction fragment length polymorphisms (RFLP's), one or more short tandem repeats (STRs), one or more variable number of tandem repeats (VNTR's), one or more hypervariable regions, one or more minisatellites, one or more dinucleotide repeats, one or more trinucleotide repeats, one or more tetranucleotide repeats, one or more simple sequence repeats, one or more indel, or one or more insertion elements. In some instances, said scanning compares a scanned polymorphism with a library of sequences comprising sequences from dozens, hundreds, or thousands of unique strains of a microorganism. The higher sensitivity is achieved by comparing the sequence information of the target region that can discriminate different microorganisms through the lens of SNPs, indels or other non-universal target specific markers that are only present within the genome of target micromicroorganisms.
- In some aspects, an analysis of a redundancy in genetic markers increases a specificity and sensitivity of a method disclosed herein.
FIG. 3 is a chart illustrating that a redundancy in genetic markers decreases a false negative rate of a method of the disclosure and increases its sensitivity as compared to PCR based methods. As shown inFIG. 3 , three commercially available q/PCR based pathogen detection kits revealed that they would not detect all known Salmonella or Listeria genomes. 301 illustrates percentages of Salmonella detection by existing commercial kits. 302 illustrates percentages of Listeria detection by existing commercial kits. - A scanning of a plurality of nucleic acid regions within said plurality of nucleic acid sequences can characterize said one or more microorganisms with a desired specificity, sensitivity, or both. In some aspects, a scanning of no more than 0.001%, 0.01%, 0.1%, 1%, 5%, 10%, 25%, 50%, 90%, 99%, 100% or any number in between of nucleic acid regions within said plurality of nucleic acid sequences characterizes said one or more microorganisms with greater than 90%, 95%, 98%, 99%, 99.9%, 99.99% and 99.999% sensitivity. In some aspects, the method has fewer than 2%, fewer than 1.5%, fewer than 1.0%, fewer than 0.5%, or fewer than 0.1% of a false positive identification rate. In some aspects, a scanning of no more than 1% of a whole genome can characterize said microorganism.
- In some instances, the high sensitivity and specificity of the disclosed methods are independent of a serotype of the microorganism. For instance, a scanning of a plurality of nucleic acid regions can identify a microorganism of the Salmonella genus that has a serotype selected from the group consisting of: Enteritidis, Typhimurium, Newport, Javiana, Infantis, Montevideo, Heidelberg, Muenchen, Saintpaul, Oranienburg, Braenderup, Paratyphi B var. L(+) Tartrate+, Agona, Thompson, and Kentucky; a microorganism of the Escherichia genus has a serotype selected from the group consisting of: O103, O111, O121, O145, O26, O45, and O157; a microorganism of the Listeria genus that has a serotype selected from the group consisting of: 2a, 1/2b, 1/2c, 3a, 3b, 3c, 4a, 4b, 4ab, 4c, 4d, and 4e; a microorganism of the Campylobacter genus with the C. jejuni, C. lari, or C. coli serotype and others.
- A non-pathogenic strain of Citrobacter, namely Citrobacter sedlakii, expresses the Escherichia coli O157:H7 antigen. This is usually associated with a false positive detection of E. coli in a sample. Typically, when Citrobacter is erroneously classified as E. coli, a food lot may be unnecessarily disposed of and a food processing facility may be erroneously classified as a contaminated facility. In some aspects, the high sensitivity of the disclosed methods can be used to distinguish a microorganism from the Escherichia genus from a microorganism of the Citrobacter genus. In some instances, the disclosure provides a method comprising: scanning, by a computer, a plurality of sequencing reads from a food sample or from an environment associated with said food sample, whereby said scanning distinguishes a microorganism of a Citrobacter genus from a microorganism of an Escherichia genus by identifying one or more single nucleotide polymorphisms that are associated with either said Citrobacter genus or said Escherichia genus. Other examples include E. coli O157:H7 assay cross-reacting with E. coli O55 (which is not an STEC). Also some assays deliver false positives against E. coli O104 (which is not an STEC). Citrobacter is also a long-understood challenge for the some systems E. coli O157:H7.
- In many cases, disease outbreaks require a rapid response, often including multijurisdictional coordination. In some aspects, the disclosure provides methods for the rapid identification of a microorganism from a food sample. In some instances, the disclosure provides a method for sequencing a plurality of nucleic acid sequences from a food sample, from an environmental sample associated with said food sample or from another sample (such as a clinically derived sample) for a period of time; and performing an assay on said food sample or said environment associated with said food sample if said sequencing for said period of time identifies a threshold level of nucleic acid sequences from a microorganism in said food sample. In some instances said period of time is less than 12 hours, less than 6 hours, less than 4 hours, less than 2 hours, less than 1 hour, less than 30 minutes, less than 20 minutes, less than 15 minutes or another suitable time.
FIG. 4 is a schematic illustrating a sequencing of a plurality of nucleic acid sequences from a food sample for a period of time and the advantages of performing an assay on said food sample if said sequencing for said period of time identifies a threshold level of nucleic acid sequences from a microorganism in said food sample. - In general, a microorganism that can injure its host, e.g., by competing with it for metabolic resources, destroying its cells or tissues, or secreting toxins can be considered a pathogenic microorganism. Examples of classes of pathogenic microorganisms include viruses, bacteria, mycobacteria, fungi, protozoa, and some helminths. In some aspects, the disclosure provides methods for detecting one or more microorganisms from a food sample or from an environment associated with said food sample—such as from a table, a floor, a boot cover, an equipment of a food processing facility—or from a food related sample that comprise soil, water, water quality, air, animal production, feed, manure, crop production, manufacturing plants, environmental samples, or non-food derived samples, such as samples from clinical sources that comprise blood, plasma, urine, tissue, faces, bone marrow, saliva or cerebrospinal fluid by analyzing a plurality of nucleic acid sequencing reads from such samples.
- Many pathogenic microorganisms are further subdivided into serotypes, which can differentiate strains by their surface and antigenic properties. For instance Salmonella species are commonly referred to by their serotype names. For example, Salmonella enterica subspecies enterica is further divided into numerous serotypes, including S. enteritidis and S. typhimurium. In some aspects, the methods of the disclosure can distinguish between such subspecies of a variety of Salmonella by analyzing their nucleic acid sequences.
- Escherichia coli (E. coli) bacteria normally live in the intestines of people and animals. Many E. coli are harmless and in some aspects are an important part of a healthy human intestinal tract. However, many E. coli can cause illnesses, including diarrhea or illness outside of the intestinal tract and should be distinguished from less pathogenic strains. In some aspects, the methods of the disclosure can distinguish between various subspecies of a variety of Escherichia bacteria by analyzing their nucleic acid sequences.
- Listeria is a harmful bacterium that can be found in refrigerated, ready-to-eat foods (meat, poultry, seafood, and dairy—unpasteurized milk and milk products or foods made with unpasteurized milk), and produce harvested from soil contaminated with, for example, L. monocytogenes. Many animals can carry this bacterium without appearing ill, which increases the challenges in identifying the pathogen derived from a food source. In addition, some species of Listeria can grow at refrigerator temperatures where most other foodborne bacteria do not, another factor that increases the challenges of identifying Listeria. When eaten, Listeria may cause listeriosis, an illness to which pregnant women and their unborn children are very susceptible. In some aspects, the methods of the disclosure can distinguish between various subspecies of a variety of Listeria bacteria by analyzing their nucleic acid sequences.
- Campylobacter jejuni is estimated to be the third leading bacterial cause of foodborne illness in the United States. Raw poultry, unpasteurized (“raw”) milk and cheeses made from it, and contaminated water (for example, unchlorinated water, such as in streams and ponds) are major sources of Campylobacter, but it also occurs in other kinds of meats and has been found in seafood and vegetables. In some aspects, the methods of the disclosure can distinguish between various subspecies of a variety of Campylobacter bacteria by analyzing their nucleic acid sequences.
- Non-limiting examples of pathogenic microorganisms that can be detected with the methods of the disclosure include: pathogenic Escherichia coli group, including Enterotoxigenic Escherichia coli (ETEC), Enteropathogenic Escherichia coli (EPEC), Enterohemorrhagic Escherichia coli (EHEC), Enteroinvasive Escherichia coli (EIEC), Salmonella spp., Campylobacter jejuni, Listeria, Yersinia enterocolitica, Shigella spp., Vibrio parahaemolyticus, Coxiella burnetii, Mycobacterium bovis, Brucella spp., Vibrio cholera, Vibrio vulnificus, Cronobacter, Aeromonas hydrophila and other spp., Plesiomonas shigelloides, Clostridium perfringens, Clostridium botulinum, Staphylococcus aureus, Bacillus cereus and other Bacillus spp., Listeria monocytogenes, Streptococcus spp., Enterococcus, and others.
- Disclosed herein are methods and apparatuses that allow the distinction of a microorganism that has been newly introduced into a food processing facility or any other environmental setting in which tracking hygiene is critical, such as a hospital or a clinic. In some instances, resident microorganisms reflect a persistent contamination within a location, e.g., a food processing facility or a hospital, that is very different than the transient pathogens that are being repeatedly introduced into the locations. Discriminating resident and transient pathogens provides more clarity for differentiation of source of contaminations and intervention strategies. This strategy can be used, for example, to manage contaminations with managing contaminations with Listeria monocytogensis. For example, Campylobacter is part of the natural gut microflora of most food-producing animals, such as chickens, turkeys, swine, cattle, and sheep. Typically, each contaminated poultry carcass can carry from about 100 to about 100,000 Campylobacter cells. On one hand, given the fact that less than 500 Campylobacter cells can cause infection, poultry products pose a significant risk for consumers who mishandle fresh or processed poultry during preparation or who undercook it. On another hand, one must be able to distinguish a normal level of a Campylobacter on a food carcass from a Campylobacter overgrowth in a sample or from the presence of a new strain of Campylobacter in a food processing facility, environment, or food sample. One must also be able to identify a new source of contamination in a facility from existing sources.
FIG. 4 illustrates a process for predictive risk assessment based on a detection of a non-pathogenic microorganism. Briefly, a food sample, such as a steak sample illustrated as 401 is processed and an assay, such as a nucleic acid sequencing reaction is performed. An analysis of a plurality of nucleic acid sequencing reads from 401 may, in some instances, not detect a particular pathogen, such as the E. coli pathogen illustrated in this example. Nevertheless, ananalysis 403 of themicrobiome 402 of thefood sample 401 may indicate high risk for a presence of a pathogen, such as E. coli. In such instances, the food sample may be re-sampled and re-processed to confirm the presence of a pathogenic microorganism therein. - In some instances, the methods disclosed herein further comprise performing an additional assay to confirm the presence of the pathogenic microorganism in the sample, such as a serotyping assay, a polymerase chain reaction (PCR) assay, an enzyme-linked immunosorbent (ELISA) assay, or an enzyme-linked fluorescent assay (ELFA) assay, restriction fragment length polymorphisms (RFLP) assay, pulse field gel electrophoresis (PFGE) assay, multi-locus sequence typing (MLST) assay, targeted DNA sequencing assay, whole genome sequencing (WGS) assay, or shotgun sequencing assay.
- In some aspects, the disclosure provides a method comprising obtaining a first plurality of nucleic acid sequences from a first sample of a food processing facility; creating a data file in a computer that associates one or more of said first plurality of nucleic acid sequences with said food processing facility; obtaining a second plurality of nucleic acid sequences from a second food sample of said food processing facility; and scanning a plurality of sequences from said second plurality of nucleic acid sequences for one or more sequences associated with said food processing facility in the created data file.
- One or more data files can be created that associate a microorganism with a food processing facility. In some instances, a data file can provide a collection of sequencing reads that can be associated with one or more strains of a microorganism present in the processing facility. In some cases, more than 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, or 1000 bacterial strains can be associated with one or more food processing facilities.
- Correlating a Presence of a Microorganism with the Risk Associated with a Food Sample
- The instance disclosure recognizes that a presence of some non-pathogenic microorganisms, i.e. indicator microorganisms, can be correlated with a presence of pathogenic bacteria in food, in environmental samples, or another sample. In some aspects the disclosure provides a method comprising detecting a presence or an absence of a non-pathogenic microorganism in a food sample, an environment associated with said food sample, or another sample described herein, by a computer system, and a presence or an absence of a pathogenic microorganism in said food sample, environment associated, or another sample based on said presence or said absence of said non-pathogenic microorganism.
FIG. 5 is a heat map illustrating predictive pathogen detection through machine learning using associated non-pathogenic microorganisms. Data was collected from more than 20,000 food samples varying over the food categories identified by CODEX, with presentation proportional to their market share. Among those about 950 samples were identified to have pathogens present. The pathogens were detected via Clear Labs sequencing platform, as well as, with traditional culturing. Via sequencing multiple regions, the bacteria present in the samples were detected and quantified (relative to each other) at the species level. - The data was supplemented by alpha diversity measures including Shannon entropy, number of observed OTUs, and Faith's phylogenetic diversity measure. The quantification of the bacteria in the samples and these supplemented measures, provided coordinates for the data points used in the final classification. The distance between the data points was computed as a combination of unifrac distance and the euclidean distance restricted to the supplemented coordinates.
- The data points were split into training and test subsets. We used stratified 10-fold cross validation to train support vector machine model on the training set. The performance of the model was measured on the previously separated test set. The scores with regard to detection of some of the pathogens is presented in
FIG. 5 . - The coefficients of the support vector machine classifier were used to determine bacteria that play significance in determining presence or absence of the pathogens and therefore to provide signatures that can be used independently of the model. This analysis determined a set of non-pathogenic microorganisms that had statistically significant correlation with the presence of pathogenic organisms, including members of the genus Enterobacter. Enterobacter asburiae, Enterobacter bugandensis, Enterobacter cancerogenus, Enterobacter cloacae, Enterobacter endosymbiont, Enterobacter hormaechei, Enterobacter kobei, Enterobacter ludwigii, and Enterobacter soli were among the top 9 examples of non-pathogenic bacteria associated with_our set of pathogenic bacteria. For example, Yersinia pseudotuberculosis was associated with Enterobacter asburiae; Vibrio vulnificus was associated with Enterobacter bugandensis, Enterobacter endosymbiont, and Enterobacter soli; Escherichia coli, Salmonella enterica, and Shigella boydii were associated with Enterobacter cancerogenus, Enterobacter cloacae, and Enterobacter hormaechei; Staphylococcus Aureus was associated with Enterobacter kobei; and Yersinia pseudotuberculosis was associated with Enterobacter asburiae and Enterobacter ludwigii.
- Without being limited by theory, a variety of other samples described herein can be analyzed as described. Briefly, a sample may be screened with any one of the methods described herein and a plurality of nucleic acid sequences may be obtained. Numerous sequences within said plurality of nucleic acid sequences may be correlated by a machine learning algorithm with a variety of microorganisms. A prediction can then be created and a visual output of such prediction, such as the illustrated a heat map can be created by detecting statistically significant correlations. For instance, a heat map created by a machine learning algorithm may illustrate a correlation between a presence of E. coli, Salmonella enterica, and Shigella boydii of one or more non-pathogenic microorganisms from the Enterobacter genus, such as Enterobacter cancerogenus, Enterobacter cloacae, and Enterobacter hormaechei or any other bacterial genera. In some aspects, a machine learning algorithm, including the machine learning algorithm's described herein, can be used to create such predictions.
- A statistical analysis can be performed to identify the top nonpathogenic species/food ingredients associated with the presence of Vibrio/Staphylococcus/Yersinia/Shigella/Salmonella/Escherichia (an illustrative cluster-based representation of such analysis is presented in
FIG. 5 ). This analysis determined a set of non-pathogenic microorganisms that had statistically significant correlation with the presence of pathogenic organisms, including members of the genus Enterobacter. Enterobacter asburiae, Enterobacter bugandensis, Enterobacter cancerogenus, Enterobacter cloacae, Enterobacter endosymbiont, Enterobacter hormaechei, Enterobacter kobei, Enterobacter ludwigii, and Enterobacter soli were among the top 9 examples of non-pathogenic bacteria associated with_our set of pathogenic bacteria. For example, Yersinia pseudotuberculosis was associated with Enterobacter asburiae; Vibrio vulnificus was associated with Enterobacter bugandensis, Enterobacter endosymbiont, and Enterobacter soli; Escherichia coli, Salmonella enterica, and Shigella boydii were associated with Enterobacter cancerogenus, Enterobacter cloacae, and Enterobacter hormaechei; Staphylococcus Aureus was associated with Enterobacter kobei; and Yersinia pseudotuberculosis was associated with Enterobacter asburiae and Enterobacter ludwigii. - Food is a chemically complex matrix. Predicting whether, or how fast, microorganisms will grow in a food, or how quickly a food may spoil, is difficult. For instance, most foods contain sufficient nutrients to support microbial growth. Furthermore, there are many additional factors that encourage, prevent, or limit growth of microorganisms in foods including pH, temperature, and relative humidity. In some aspects, the instant disclosure recognizes that a presence of some microorganism, whether or not pathogenic, can be correlated with a sell-by date, i.e., a spoilage date of a food. In some aspects the disclosure provides a method comprising: detecting a presence or an absence of a microorganism in a food sample or in an environmental sample from a food processing facility; and predicting, by a computer system, a risk presented by said food sample or by said food processing facility based on said presence or said absence of said microorganism.
-
FIG. 6 illustrates a process for predicting a shelf-life of a food based on machine learning. Briefly,FIG. 6 illustrates a screening of a sample, such as a screening of a plurality of nucleic acid sequencing reads. Subsequently, a machine learning algorithm is used to create a risk profile, whereby said risk profile associates a presence of some microorganism with a low or a high likelihood of food spoilage, thereby predicting the sell-by date of a food. - A machine learning algorithm can be used to associate any number of sequencing reads with a presence of microorganism in a food sample, a food related sample, or another sample. Similarly, a machine learning algorithm may be able to associate any number of sequencing reads with a presence of a pathogenic microorganism, even if the sequence reads themselves are not from the pathogenic microorganism. Computer-implemented methods for generating a machine learning-based classifier in a system may require a number of input datasets in order for the classifier to produce highly accurate predictions. Depending on the microorganism, matrix, and the microorganisms abundance in the real life samples of the matrix, the data can be in range of 100, 1000, 10000, 100000, 1000000, 10000000, 100000000 sequencing reads. A machine learning algorithm is selected from the group consisting of: a support vector machine (SVM), a Naive Bayes classification, a random forest, Logistic regression and a neural network.
- One can tune the resolution for the detection of a microorganism based on the source of the sample, e.g., food versus surface swab; and the sensitivity of the assay itself, e.g., genus, species, serotype, versus strain (obtained via whole genome sequencing).
FIG. 7 is a diagram illustrating the tunable resolution of various assays. Briefly, one or more assays can be used sequentially to obtain a desired level of sensitivity, such as to determine a genus, a species, a serotype, a sub-serotype, or a strain of said microorganism. The assays can be identical or they can be distinct.FIG. 7 illustrates that a sequencing assay can be used to identify a strain or a sub-serotype of a microorganism whereas a PCR reaction may be able to identify a species or, in some cases, a serotype of a particular microorganism. - In some aspects, the disclosure provides a method comprising: obtaining a plurality of nucleic acid sequences of a food sample, of an environmental sample or of another non-food derived sample from a food processing facility or another facility; performing a first assay in said plurality of nucleic acid sequences of said food sample, whereby said assay predicts a presence or predicts an absence of a microorganism in said food sample; and determining, based on said predicted presence or said predicted absence of said microorganism of the first assay whether to perform a second assay, whereby a sensitivity of said second assay is selected to determine a genus, a species, a serotype, a sub-serotype, or a strain of said microorganism.
- There are various approaches for processing nucleic acids from food samples or from environmental samples, such as polymerase chain reaction (PCR) and sequencing. In some cases said assay is a sequencing assay that provides the ability to obtain sequencing-reads in real time, such as pore sequencing assay. Sequencing can be performed by various systems currently available, such as, without limitation, a sequencing system by Illumina®, Pacific Biosciences (PacBio®), Oxford Nanopore®, Genia (Roche) or Life Technologies (Ion Torrent®). Alternatively or in addition, sequencing may be performed using nucleic acid amplification, polymerase chain reaction (PCR) (e.g., digital PCR, quantitative PCR, or real time PCR), or isothermal amplification. In some cases, the assay is an enzyme-linked immunosorbent (ELISA) assay or an enzyme-linked fluorescent assay (ELFA) assay.
- In some cases, the assay is a serotyping assay. A serotype or serovar is a distinct variation within a species of bacteria or virus. These microorganisms can be classified together based on their cell surface antigens, allowing the epidemiologic classification of microorganisms to the sub-species level. A group of serovars with common antigens is called a serogroup or sometimes serocomplex. In some aspects, the disclosure provides methods for performing a sequencing assay on a plurality of nucleic acids derived from a sample and a serotyping assay on a derivative of said sample.
FIG. 8 is a schematic illustrating various serotypes of various microorganisms that can be detected by an analysis of a plurality of nucleic acid sequences as described herein and further validated with a serotyping assay. - Differentiating Live versus Dead Microorganisms
- Nucleic acid-based targeted analytical methods, such as PCR provide only limited information on the activities and physiological states of microorganisms in samples and cannot distinguish viable cells from dead cells. In some aspects, the disclosure provides methods for distinguishing a live microorganism in a food sample or in another sample, from a dead microorganism within the same sample.
FIG. 9 is a schematic illustrating one process for distinguishing a live microorganism from a food or from an environmental sample. Briefly,FIG. 9 illustrates than an amount of a microorganism in a sample can be increased, i.e., enriched 901, by growing the microorganism in a rich medium for a period of time. A reagent, such as a photoreactive DNA-binding dye, a DNA intercalating reagent, or another suitable reagent may be added to enrichedsample 901. Such reagents distinguish live 902 microorganisms from dead 903 microorganisms by interacting with the nucleic acid sequence of dead microorganisms only. In some cases, the disclosure contemplates using propidium monoazide or a derivative thereof as a dye. The modified sample can be prepared for asubsequent reaction 904, such as asequencing reaction 905. - In some instances the disclosure provides a method comprising adding a reagent to a plurality of nucleic acid molecules from a food sample, or food related sample or another sample described herein thereby forming a modified plurality of nucleic acid molecules, whereby said reagent (i) interacts with and modifies a structure of a plurality of nucleic acid molecules derived from one or more dead microorganisms; and (ii) does not interact with or modify a structure of a nucleic acid molecule derived from one or more live microorganisms; thereby providing a modified plurality of nucleic acid molecules; and sequencing said modified plurality of nucleic acid molecules, thereby distinguishing one or more live organisms from said food sample or from another sample.
- In other aspects the disclosure provides a method comprising performing a pore sequencing or other DNA sequencing or hybridization assay on a plurality of nucleic acid molecules from a food sample or from another sample whereby said pore sequencing reaction distinguishes one or more nucleic acid molecules derived from a dead microorganism from one or more nucleic acid molecules derived from a live microorganism based on a methylation or other epigenetic pattern of said one or more nucleic acid molecules derived from said dead microorganism.
- In some embodiments, epigenetic patterns, such as methylation, can be detected in DNA derived from food or environmental samples by chemical or enzymatic selection methods prior to sequencing. Such methods include, but are not limited to, bisulfite sequencing (including targeted bisfulfite sequencing, see e.g. Ziller et al. Epigenetics Chromatin. 2016 Dec. 3; 9:55 and Masser et al. J Vis Exp. 2015; (96): 52488) and methylation-sensitive restriction digestion (see e.g. Bitinaite et al. U.S. Pat. No. 9,034,597).
- Unique identifiers, such as barcodes, can be added to one or more nucleic acids isolated from a sample from a food processing facility, from a hospital or clinic, or from another sources. Barcodes can be used to associate a sample with a source; e.g., to associate an environmental sample with a specific food processing facility or with a particular location within said food processing facility. Barcodes can also be used to identify a processing of a sample, as described in U.S. Patent Pub. No. 2016/0239732, which is entirely incorporated herein by reference.
- In some aspects, the disclosure provides a method comprising adding a first barcode to a first plurality of nucleic acid sequences from a sample, thereby providing a first plurality of barcoded nucleic acid sequences; performing a first sequencing reaction on said first plurality of barcoded nucleic acid sequences, wherein said sequencing reaction is performed on a sequencing apparatus comprising a flow cell; adding a second barcode to a second plurality of nucleic acid sequences from a second sample, thereby providing a second plurality of barcoded nucleic acid sequences; and performing a second sequencing reaction on said second plurality of barcoded nucleic acid sequences, wherein said second sequencing reaction is performed on said sequencing apparatus comprising said flow cell, thereby reusing said flow cell.
FIG. 10 illustrates a process for re-using flow cells with distinct indexes as described herein. As illustrated byFIG. 10 two distinct indexes, 1001 and 1002, such as two different barcodes, can be added to different samples prior tosequencing 1003. Since a first sample can be associated with afirst index 1001 and a second sample can be associated with asecond index 1002 this process effectively allows for the re-using of a flow cell.FIG. 18 andFIG. 19 demonstrate the re-use of MinION/GridION flow cells. Example 21 demonstrates how certain primer design schemes, such as a nonperiodic design, can reduce crosstalk in situations with high multiplexing or closely related sequences, as may happen with reuse of flow cells. - One or more barcodes or block of barcodes may be added to a nucleic acid sequence from a food sample or another sample from a food processing facility, such as a first, a second, a third, or any subsequent sample. In some cases, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 identical barcodes are added to such samples. In other cases, distinct barcodes are added to such samples. In some cases, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 distinct barcodes are added to such samples. The serial addition of two or more barcodes, either identical in sequence or distinct in sequence, can provide an indexing of a sample that is used in its analyses. The presence of additional barcode or barcode blocks make the system more robust against any barcode manufacturing error and can also significantly reduce the chance of cross contamination between barcodes. In some cases, a barcode is added to a nucleic acid sequence comprising complementary DNA (cDNA) sequences, ribonucleic acid (RNA) sequences, genomic deoxyribonucleic acid (gDNA) sequences, or a mixture of cDNA, RNA, and gDNA sequences.
- Automated nucleic acid sequencing apparatuses can provide a robust platform for the generation of nucleic acid sequencing reads. Unfortunately, many apparatuses have a high rate of failure, i.e., high rate of error of the sequencing reaction itself, which require manual intervention in such instances, such as re-loading of samples into flow cells. In some aspects, the disclosure provides an automated nucleic acid sequencing apparatus that requires no manual intervention in the event of a failure of a sequencing reaction. In some aspects, the disclosure provides a nucleic acid sequencing apparatus comprising: a nucleic acid library preparation compartment comprising two or more chambers configured to prepare a plurality of nucleic acids for a sequencing reaction, wherein said compartment is operatively connected to a nucleic acid sequencing chamber; a nucleic acid sequencing chamber, wherein said nucleic acid sequencing chamber comprises: (i) one or more flow cells comprising a plurality of pores configured for the passage of a nucleic acid strand, wherein said two or more flow cells are juxtaposed to one another; and an automated platform, wherein said automated platform is programmed to robotically move a sample from said nucleic acid library preparation compartment into said nucleic acid sequencing chamber.
FIG. 11 illustrates an automated sequencing apparatus of the disclosure. 1101 is a diagram of the apparatus comprising the nucleicacid sequencing compartment 1102. Nucleic acidlibrary preparation compartment 1103 shows a variety of chambers configured to prepare a plurality of nucleic acids for a sequencing reaction in close proximity to asequencing chamber 1104, which comprises one or more flow cells. Briefly, an automated apparatus of the disclosure is programmed to move one or more samples from thelibrary preparation chambers 1103 into asequencing chamber 1104 upon detecting a failure in a sequencing reaction. This provides a sequencing process with no human touch points after a sample is added to the library preparation chamber, as illustrated inFIG. 12 .FIG. 12 illustrates an embodiment where a sample from a food processing facility, from a hospital or clinical setting, or from another source can be manually processed between 6 am to 6 pm or any shorter or longer incubation window by incubating the sample in a presence of a growth medium (e.g., enrichment) and automatically processed after the sample is added to a nucleicacid preparation chamber 1103. - The disclosed apparatus is programmed in such a manner that said automated platform moves one or more samples from said nucleic acid library preparation compartment into said nucleic acid sequencing chamber. Upon detecting a failure of a sequencing reaction, the automated platform moves one or more samples from the failed sequencing flow cell or apparatus to the next sequencing flow cell or apparatus. In many cases, such samples comprise nucleic acid sequences that include one or more barcodes. In some cases, a plurality of mutually exclusive barcodes are added to a plurality of nucleic acids in said two or more chambers of the nucleic acid
library preparation compartment 1103, thereby providing a plurality of mutually exclusive barcoded nucleic acids within the apparatus. In some instances, the automated platform robotically moves two or more of said mutually exclusive barcoded nucleic acids into said nucleic acid sequencing chamber, in some instances by moving said mutually exclusive barcoded nucleic acids into a same flow cell of said one or more flow cells. - Microbiome data (data representing the presence or absence of particular species or serotypes of microbes as determined by sequencing) of the invention can be used to classify a sample. For example, a sample can be classified as, or predicted to be: a) containing a particular pathogenic microbe, b) containing a particular serotype of a pathogenic microbe, and/or c) contaminated with at least one species/serotype of pathogenic microbe. Many statistical classification techniques are known to those of skill in the art. In supervised learning approaches, a group of samples from two or more groups (e.g. contaminated with a pathogen and not) are analyzed with a statistical classification method. Microbe presence/absence data can be used as a classifier that differentiates between the two or more groups. A new sample can then be analyzed so that the classifier can associate the new sample with one of the two or more groups. Commonly used supervised classifiers include without limitation the neural network (multi-layer perceptron), support vector machines, k-nearest neighbours, Gaussian mixture model, Gaussian, naive Bayes, decision tree and radial basis function (RBF) classifiers. Linear classification methods include Fisher's linear discriminant, logistic regression, naive Bayes classifier, perceptron, and support vector machines (SVMs). Other classifiers for use with the invention include quadratic classifiers, k-nearest neighbor, boosting, decision trees, random forests, neural networks, pattern recognition, Bayesian networks and Hidden Markov models. One of skill will appreciate that these or other classifiers, including improvements of any of these, are contemplated within the scope of the invention.
- Classification using supervised methods is generally performed by the following methodology:
- In order to solve a given problem of supervised learning (e.g. learning to recognize handwriting) one has to consider various steps:
- 1. Gather a training set. These can include, for example, samples that are from a food or environment contaminated or not contaminated with a particular microbe, samples that are contaminated with different serotypes of the same microbe, samples that are or are not contaminated with a combination of different species and serotypes of microbes, etc. The training samples are used to “train” the classifier.
- 2. Determine the input “feature” representation of the learned function. The accuracy of the learned function depends on how the input object is represented. Typically, the input object is transformed into a feature vector, which contains a number of features that are descriptive of the object. The number of features should not be too large, because of the curse of dimensionality; but should be large enough to accurately predict the output. The features might include a set of bacterial species or serotypes present in a food or environmental sample derived as described herein.
- 3. Determine the structure of the learned function and corresponding learning algorithm. A learning algorithm is chosen, e.g., artificial neural networks, decision trees, Bayes classifiers or support vector machines. The learning algorithm is used to build the classifier.
- 4. Build the classifier (e.g. classification model). The learning algorithm is run on the gathered training set. Parameters of the learning algorithm may be adjusted by optimizing performance on a subset (called a validation set) of the training set, or via cross-validation. After parameter adjustment and learning, the performance of the algorithm may be measured on a test set of naive samples that is separate from the training set.
- Once the classifier (e.g. classification model) is determined as described above, it can be used to classify a sample, e.g., that of food sample or environment that is being analyzed by the methods of the invention.
- Unsupervised learning approaches can also be used with the invention. Clustering is an unsupervised learning approach wherein a clustering algorithm correlates a series of samples without the use the labels. The most similar samples are sorted into “clusters.” A new sample could be sorted into a cluster and thereby classified with other members that it most closely associates.
- In some aspects, the disclosed provides quality control methods or methods to assess a risk associated with a food, with a hospital, with a clinic, or any other location where the presence of a bacterium poses a certain risk to one or more subjects. In many instances, systems, platforms, software, networks, and methods described herein include a digital processing device, or use of the same. In further embodiments, the digital processing device includes one or more hardware central processing units (CPUs), i.e., processors that carry out the device's functions, such as the automated sequencing apparatus disclosed herein or a computer system used in the analyses of a plurality of nucleic acid sequencing reads from samples derived from a food processing facility or from any other facility, such as a hospital a clinical or another. In still further embodiments, the digital processing device further comprises an operating system configured to perform executable instructions. In some embodiments, the digital processing device is optionally connected a computer network. In further embodiments, the digital processing device is optionally connected to the Internet such that it accesses the World Wide Web. In still further embodiments, the digital processing device is optionally connected to a cloud computing infrastructure. In other embodiments, the digital processing device is optionally connected to an intranet. In other embodiments, the digital processing device is optionally connected to a data storage device. In other embodiments, the digital processing device could be deployed on premise or remotely deployed in the cloud.
- In accordance with the description herein, suitable digital processing devices include, by way of non-limiting examples, server computers, desktop computers, laptop computers, notebook computers, sub-notebook computers, netbook computers, netpad computers, set-top computers, handheld computers, Internet appliances, mobile smartphones, tablet computers, personal digital assistants, video game consoles, and vehicles. Those of skill in the art will recognize that many smartphones are suitable for use in the system described herein. Those of skill in the art will also recognize that select televisions, video players, and digital music players with optional computer network connectivity are suitable for use in the system described herein. Suitable tablet computers include those with booklet, slate, and convertible configurations, known to those of skill in the art. In many aspects, the disclosure contemplates any suitable digital processing device that can either be deployed to a food processing facility, or is used within said food processing facility to process and analyze a variety of nucleic acids from a variety of samples.
- In some embodiments, a digital processing device includes an operating system configured to perform executable instructions. The operating system is, for example, software, including programs and data, which manages the device's hardware and provides services for execution of applications. Those of skill in the art will recognize that suitable server operating systems include, by way of non-limiting examples, FreeBSD, OpenBSD, NetBSD®, Linux, Apple® Mac OS X Server®, Oracle® Solaris®, Windows Server®, and Novell® NetWare®. Those of skill in the art will recognize that suitable personal computer operating systems include, by way of non-limiting examples, Microsoft® Windows®, Apple® Mac OS X®, UNIX®, and UNIX-like operating systems such as GNU/Linux®. In some embodiments, the operating system is provided by cloud computing. Those of skill in the art will also recognize that suitable mobile smart phone operating systems include, by way of non-limiting examples, Nokia® Symbian® OS, Apple® iOS®, Research In Motion® BlackBerry OS®, Google® Android®, Microsoft® Windows Phone® OS, Microsoft® Windows Mobile® OS, Linux®, and Palm® WebOS®.
- In some embodiments, a digital processing device includes a storage and/or memory device. The storage and/or memory device is one or more physical apparatuses used to store data or programs on a temporary or permanent basis. In some embodiments, the device is volatile memory and requires power to maintain stored information. In some embodiments, the device is non-volatile memory and retains stored information when the digital processing device is not powered. In further embodiments, the non-volatile memory comprises flash memory. In some embodiments, the non-volatile memory comprises dynamic random-access memory (DRAM). In some embodiments, the non-volatile memory comprises ferroelectric random access memory (FRAM). In some embodiments, the non-volatile memory comprises phase-change random access memory (PRAM). In other embodiments, the device is a storage device including, by way of non-limiting examples, CD-ROMs, DVDs, flash memory devices, magnetic disk drives, magnetic tapes drives, optical disk drives, and cloud computing based storage. In further embodiments, the storage and/or memory device is a combination of devices such as those disclosed herein.
- In some embodiments, a digital processing device includes a display to send visual information to a user. In some embodiments, the display is a cathode ray tube (CRT). In some embodiments, the display is a liquid crystal display (LCD). In further embodiments, the display is a thin film transistor liquid crystal display (TFT-LCD). In some embodiments, the display is an organic light emitting diode (OLED) display. In various further embodiments, on OLED display is a passive-matrix OLED (PMOLED) or active-matrix OLED (AMOLED) display. In some embodiments, the display is a plasma display. In other embodiments, the display is a video projector. In still further embodiments, the display is a combination of devices such as those disclosed herein.
- In some embodiments, a digital processing device includes an input device to receive information from a user. In some embodiments, the input device is a keyboard. In some embodiments, the input device is a pointing device including, by way of non-limiting examples, a mouse, trackball, track pad, joystick, game controller, or stylus. In some embodiments, the input device is a touch screen or a multi-touch screen. In other embodiments, the input device is a microphone to capture voice or other sound input. In other embodiments, the input device is a video camera to capture motion or visual input. In still further embodiments, the input device is a combination of devices such as those disclosed herein.
- In some embodiments, a digital processing device includes a digital camera. In some embodiments, a digital camera captures digital images. In some embodiments, the digital camera is an autofocus camera. In some embodiments, a digital camera is a charge-coupled device (CCD) camera. In further embodiments, a digital camera is a CCD video camera. In other embodiments, a digital camera is a complementary metal-oxide-semiconductor (CMOS) camera. In some embodiments, a digital camera captures still images. In other embodiments, a digital camera captures video images. In various embodiments, suitable digital cameras include 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, and higher megapixel cameras, including increments therein. In some embodiments, a digital camera is a standard definition camera. In other embodiments, a digital camera is an HD video camera. In further embodiments, an HD video camera captures images with at least about 1280× about 720 pixels or at least about 1920× about 1080 pixels. In some embodiments, a digital camera captures color digital images. In other embodiments, a digital camera captures grayscale digital images. In various embodiments, digital images are stored in any suitable digital image format. Suitable digital image formats include, by way of non-limiting examples, Joint Photographic Experts Group (JPEG), JPEG 2000, Exchangeable image file format (Exif), Tagged Image File Format (TIFF), RAW, Portable Network Graphics (PNG), Graphics Interchange Format (GIF), Windows® bitmap (BMP), portable pixmap (PPM), portable graymap (PGM), portable bitmap file format (PBM), and WebP. In various embodiments, digital images are stored in any suitable digital video format. Suitable digital video formats include, by way of non-limiting examples, AVI, MPEG, Apple® QuickTime®, MP4, AVCHD®, Windows Media®, DivX™, Flash Video, Ogg Theora, WebM, and RealMedia.
- In many aspects, the systems, platforms, software, networks, and methods disclosed herein include one or more non-transitory computer readable storage media encoded with a program including instructions executable by the operating system of an optionally networked digital processing device. For instance, in some aspects, the methods comprise creating data files associated with a plurality of sequencing reads from a plurality of samples associated with a food processing facility. In further embodiments, a computer readable storage medium is a tangible component of a digital processing device. In still further embodiments, a computer readable storage medium is optionally removable from a digital processing device. In some embodiments, a computer readable storage medium includes, by way of non-limiting examples, CD-ROMs, DVDs, flash memory devices, solid state memory, magnetic disk drives, magnetic tape drives, optical disk drives, cloud computing systems and services, and the like. In some cases, the program and instructions are permanently, substantially permanently, semi-permanently, or non-transitorily encoded on the media.
- In some embodiments, the systems, platforms, software, networks, and methods disclosed herein include at least one computer program. A computer program includes a sequence of instructions, executable in the digital processing device's CPU, written to perform a specified task. In light of the disclosure provided herein, those of skill in the art will recognize that a computer program may be written in various versions of various languages. In some embodiments, a computer program comprises one sequence of instructions. In some embodiments, a computer program comprises a plurality of sequences of instructions. In some embodiments, a computer program is provided from one location. In other embodiments, a computer program is provided from a plurality of locations. In various embodiments, a computer program includes one or more software modules. In various embodiments, a computer program includes, in part or in whole, one or more web applications, one or more mobile applications, one or more standalone applications, one or more web browser plug-ins, extensions, add-ins, or add-ons, or combinations thereof.
- In some embodiments, a computer program includes a web application. In light of the disclosure provided herein, those of skill in the art will recognize that a web application, in various embodiments, utilizes one or more software frameworks and one or more database systems. In some embodiments, a web application is created upon a software framework such as Microsoft®.NET or Ruby on Rails (RoR). In some embodiments, a web application utilizes one or more database systems including, by way of non-limiting examples, relational, non-relational, object oriented, associative, and XML database systems. In further embodiments, suitable relational database systems include, by way of non-limiting examples, Microsoft® SQL Server, mySQL™, and Oracle®. Those of skill in the art will also recognize that a web application, in various embodiments, is written in one or more versions of one or more languages. A web application may be written in one or more markup languages, presentation definition languages, client-side scripting languages, server-side coding languages, database query languages, or combinations thereof. In some embodiments, a web application is written to some extent in a markup language such as Hypertext Markup Language (HTML), Extensible Hypertext Markup Language (XHTML), or eXtensible Markup Language (XML). In some embodiments, a web application is written to some extent in a presentation definition language such as Cascading Style Sheets (CSS). In some embodiments, a web application is written to some extent in a client-side scripting language such as Asynchronous Javascript and XML (AJAX), Flash® Actionscript, Javascript, or Silverlight®. In some embodiments, a web application is written to some extent in a server-side coding language such as Active Server Pages (ASP), ColdFusion®, Perl, Java™, JavaServer Pages (JSP), Hypertext Preprocessor (PHP), Python™, Ruby, Tcl, Smalltalk, WebDNA®, or Groovy. In some embodiments, a web application is written to some extent in a database query language such as Structured Query Language (SQL). In some embodiments, a web application integrates enterprise server products such as IBM® Lotus Domino®. A web application for providing a career development network for artists that allows artists to upload information and media files, in some embodiments, includes a media player element. In various further embodiments, a media player element utilizes one or more of many suitable multimedia technologies including, by way of non-limiting examples, Adobe® Flash®,
HTML 5, Apple® QuickTime®, Microsoft® Silverlight®, Java™, and Unity®. - In some embodiments, a computer program includes a mobile application provided to a mobile digital processing device. In some embodiments, the mobile application is provided to a mobile digital processing device at the time it is manufactured. In other embodiments, the mobile application is provided to a mobile digital processing device via the computer network described herein.
- In view of the disclosure provided herein, a mobile application is created by techniques known to those of skill in the art using hardware, languages, and development environments known to the art. Those of skill in the art will recognize that mobile applications are written in several languages. Suitable programming languages include, by way of non-limiting examples, C, C++, C#, Objective-C, Java™, Javascript, Pascal, Object Pascal, Python™, Ruby, VB.NET, WML, and XHTML/HTML with or without CSS, or combinations thereof.
- Suitable mobile application development environments are available from several sources. Commercially available development environments include, by way of non-limiting examples, AirplaySDK, alcheMo, Appcelerator®, Celsius, Bedrock, Flash Lite, .NET Compact Framework, Rhomobile, and WorkLight Mobile Platform. Other development environments are available without cost including, by way of non-limiting examples, Lazarus, MobiFlex, MoSync, and Phonegap. Also, mobile device manufacturers distribute software developer kits including, by way of non-limiting examples, iPhone and iPad (iOS) SDK, Android™ SDK, BlackBerry® SDK, BREW SDK, Palm® OS SDK, Symbian SDK, webOS SDK, and Windows® Mobile SDK.
- Those of skill in the art will recognize that several commercial forums are available for distribution of mobile applications including, by way of non-limiting examples, Apple® App Store, Android™ Market, BlackBerry® App World, App Store for Palm devices, App Catalog for webOS, Windows® Marketplace for Mobile, Ovi Store for Nokia® devices, Samsung® Apps, and Nintendo® DSi Shop.
- In some embodiments, a computer program includes a standalone application, which is a program that is run as an independent computer process, not an add-on to an existing process, e.g., not a plug-in. Those of skill in the art will recognize that standalone applications are often compiled. A compiler is a computer program(s) that transforms source code written in a programming language into binary object code such as assembly language or machine code. Suitable compiled programming languages include, by way of non-limiting examples, C, C++, Objective-C, COBOL, Delphi, Eiffel, Java™, Lisp, Python™, Visual Basic, and VB .NET, or combinations thereof. Compilation is often performed, at least in part, to create an executable program. In some embodiments, a computer program includes one or more executable complied applications.
- The systems, platforms, software, networks, and methods disclosed herein include, in various embodiments, software, server, and database modules. In view of the disclosure provided herein, software modules are created by techniques known to those of skill in the art using machines, software, and languages known to the art. The software modules disclosed herein are implemented in a multitude of ways. In various embodiments, a software module comprises a file, a section of code, a programming object, a programming structure, or combinations thereof. In further various embodiments, a software module comprises a plurality of files, a plurality of sections of code, a plurality of programming objects, a plurality of programming structures, or combinations thereof. In various embodiments, the one or more software modules comprise, by way of non-limiting examples, a web application, a mobile application, and a standalone application. In some embodiments, software modules are in one computer program or application. In other embodiments, software modules are in more than one computer program or application. In some embodiments, software modules are hosted on one machine. In other embodiments, software modules are hosted on more than one machine. In further embodiments, software modules are hosted on cloud computing platforms. In some embodiments, software modules are hosted on one or more machines in one location. In other embodiments, software modules are hosted on one or more machines in more than one location.
- Food and environmental samples may be processed for various purposes, such as the enrichment of one or more microorganism from the sample, or the isolation of one or more microorganism from the sample. The following protocol was used in the preparation of various food and environmental samples including: carcass rinses, stainless steel, primary production boot covers, dry pet food and shell eggs.
-
TABLE 1 Food and Environmental Sample Preparation Table 1: Food and Environmental Sample Preparation Enrichment Amount determined by volume or Matrix Sample Size weight Incubation Carcass Rinse 30 ± 0.6 mL sample rinse fluid 20 ± 0.5 mL of Clear 42 ± 1° C. for Salmonella media (CSM) 9-24 h Stainless Steel 1 sponge pre moistened with 10 10 ± 0.5 mL Clear 42 ± 1° C. for mL tris-buffered saline Salmonella media (CSM) 9-24 h Environmental 1 environmental sampling bootie 50 ± 1 mL Clear 42 ± 1° C. for Boot Cover pre-moistened with 10 mL skim Salmonella media (CSM) 9-24 h milk Pet Food 25 ± 0.5 g 100 ± 1 mL Clear 42 ± 1° C. for Salmonella media (CSM) 9-24 h Shell Eggs 100 ± 2 g 200 ± 2 mL Clear 42 ± 1° C. for Salmonella media (CSM) 9-24 h - In this example, carcass food samples are generated by aseptically draining excess fluid from a carcass and transferring the carcass to a large sterile sampling bag. 100 mL of an enriched broth, in this case, Clear Salmonella media (CSM) was poured into the cavity of the carcass in the sampling bag. The carcass was rinsed inside and out with a rocking motion for about one minute, while assuring that all surfaces (interior and exterior of the carcass) were rinsed. About 20±0.5 mL of the CSM was added to the sample bag and homogenized by massaging sample bag for approximately 1.5-2 min. The sample was incubated at 42±1° C. for 9-24 h, providing an enriched sample.
- In this example, a stainless steel surface environmental sample was generated by moistening a sterile sampling sponge in 10 mL of Dey-Engley Broth prior to sampling, or using a sponge pre-moistened in the same. The sponge was used to touch, scrub, or otherwise contact the stainless steel surface and it was subsequently placed into a sampling bag. About 10±0.5 mL of CSM was added to the sampling sponge. Subsequently, the sponge was pressed to expel the collection broth into the CSM solution. The sample was incubated at 42±1° C. for 9-24 h, providing an enriched sample.
- In this example, an environmental sample from a boot cover was first pre-moistened in skim milk. About 50±1 mL of CSM was then added to the sampling bag containing boot cover environmental sample. The contents were mixed thoroughly for approximately 1.5-2 min, and incubated at 42±1° C. for 9-24 h, thereby providing an enriched sample. The enriched sample was removed from incubator and briefly mixed.
- In this example, about 25±0.5 g of a pet food sample were added into a filtered sampling bag. About 100±1 mL CSM was then added to the sampling bag containing said pet food. The contents were mixed thoroughly for approximately 1.5-2 min, and incubated at 42±1° C. for 9-24 h, thereby providing an enriched sample. The enriched sample was removed from incubator and briefly mixed.
- In this example, about 100±2 g of a homogenized egg sample was added to a filtered sampling bag. About 200±2 mL CSM was then added to the sampling bag containing said homogenized egg sample. The contents were mixed thoroughly for approximately 1.5-2 min, and incubated at 42±1° C. for 9-24 h, thereby providing an enriched sample. The enriched sample was removed from incubator and briefly mixed.
- In this example, a photoreactive DNA-binding dye, namely propidium monoazide (PMA) was added to various food and environmental samples, including the samples described in Examples 1-6. In general, 5 μL of a PMAxx solution was added to a well in a 200 μL 96-well PCR plate. Approximately 45 μL of each enriched sample from the sampling bags described in Examples 1-6 was added to individual wells in PCR plate containing PMAxx. The samples were mixed thoroughly by gentle pipetting and placed in the dark for 10 min at room temperature. Subsequently, the plates were incubated under a blue LED light for 20 min. 10 μL of each sample were then diluted with 90 μL of Lysis Buffer in a new 200 μL 96-well PCR plate. The plate was then incubated in a thermocycler as shown below. Alternatively the sample could have been incubated in a water bath.
-
Step Temperature Time 1 37° C. 20 min 2 95° C. 10 min - This example demonstrates that addition of a solution of the photoreactive DNA-binding dye PMAxx to a sample solution reduced the number of free-floating and contaminating DNA in said sample. Specifically, 45 μL of each enriched sample from the sampling bags as described in Examples 1-7 was added to individual wells of the 96-well PCR plate containing 25 μL of PMAxx solution. The sample solutions were mixed thoroughly by gentle pipetting and placed in the dark for 10 min at room temperature. Subsequently, the plates were incubated under a blue LED light for 20 min. 10 μL of each sample were then diluted with 90 μL of Lysis Buffer in a new 200 μL 96-well PCR plate. The plate was then incubated in a thermocycler as shown below. Analysis of the sample readouts showed that the addition of PMAxx solution (25 μL) to the sample solution was sufficient to reduce the number of free-floating DNA by at least 2 orders of magnitude, as shown in
FIG. 13 . - In this example, the samples described in Examples 1-8 were subjected to an amplification reaction. Briefly 15 μL of primer cocktail and polymerase master mix was added to individual wells of an empty 200 μL 96-well PCR plate. About 5 μl of each sample treated with a photoreactive DNA-binding dye treatment was added to the respective wells containing the polymerase master mix. The solution was mixed gently by pipetting up and down and placed in a thermocycler with the conditions described below.
-
Step Temperature Time 1 95° C. 3 min 2 95° C. 30 sec 3 57° C. 1 min 4 72° C. 1 min 5 Go to step 2, 37 times 6 72° C. 10 min 7 10° C. Hold - In this example, Solid Phase Reversible Immobilization (SPRI) Magnetic Beads were used to purify and quantify one or more of the samples described in Examples 1-9. Briefly, the SPRI beads were removed from 4° C. storage and allowed to reach room temperature for approximately 15 min. About 1 mL of 80% ethanol was prepared by combining 800 μL of ethanol and 200 μL of molecular biology grade water. Equal volumes of each samples amplification product (described in Example 9) was used to obtain at least 100 μL of pooled products, which was purified using the SPRI beads along with standard manufacturing protocols. Briefly, 100 μL of vortexed, pooled PCR product was pipetted into a 0.2 mL PCR tube and add 60 μL of SPRI beads. The tube was mixed thoroughly by pipetting up and down approximately 10 times and incubated at room temperature for 5 min. The sample/bead mixture was placed in a magnetic stand and the beads were allowed to pellet in a ring for approximately 30-60 s, leaving a clear supernatant. The supernatant was discarded by leaving the tube in the magnetic stand while placing the pipette tip to the bottom center of the tube when aspirating to avoid disturbing the beads. 190 μL of 80% ethanol was then added to the tube, and incubated for 5-10 s. The tube was aspirated fully and the ethanol solution discarded. The process was repeated twice. The sample was allowed to dry for 3-5 min at room temperature, or until no visible ethanol remained. Once thoroughly dry, the tube was removed from the magnetic stand and re-suspended in 50 μL of 10 mM RSB into the tube. The tube was mixed thoroughly by gently pipetting up and down approximately 10 times and incubate at room temperature for 2 min. The tube was moved to a magnetic stand and incubated at room temperature for 2 min to allow the beads to pellet. Remove and retain 50 μL of the eluate.
- In this example, the terminal ends of fragment nucleic acids described in Example 10 were repaired as described below. First, the following reagents were combined and mixed well by pipetting up and down approximately 10 times.
-
Reagent Volume Purified Pooled Libraries 45 μL NEB Ultra II end- prep reaction buffer 7 μL NEB Ultra II End- prep enzyme mix 3 μL ONT DNA CS (DCS) 5 μL Total 60 μL - The samples were then spun for approximately 5 s using a benchtop minifuge. End-repair was performed in a thermal cycler with the following conditions:
-
Step Temperature Time 1 20° C. 5 min 2 65° C. 5 min 3 25° C. 5 min - Subsequently, the samples were spun for approximately 5 s using a benchtop minifuge. 60 μL of SPRI beads were added to the end-repaired product and mixed by pipetting up and down approximately 10 times. The samples were incubated for 5 min at room temperature. The sample/bead mixture was placed in a magnetic stand and the beads were allowed to pellet in a ring around the middle portion of the tube for approximately 30-60 s, leaving a clear supernatant. The supernatant was discarded by leaving the tube in the magnetic stand while placing the pipette tip to the bottom center of the tube when aspirating to avoid disturbing the beads. 190 μL of 80% ethanol was added to the samples. The 80% ethanol solution was incubated in the tube for 5-10 s, and the ethanol was aspirated and discarded. This process was repeated twice. The sample was allowed to dry for 5 min at room temperature, or until no visible ethanol remained. The beads were resuspended with 31 μL molecular biology grade water and mixed by gently pipetting up and down approximately 10 times and incubate for 2 min at room temperature. The tube was moved to a magnetic stand and the beads were allowed to pellet for approximately 30-60 s. The eluate was retained as the “end-repaired product”.
- In this example, using the end-repaired product of Example 11, the following reagents were combined:
-
Reagent Volume End-repaired product 30 μL ONT Adapter Mix (AMX 1D) 20 μL NEB Blunt/TA Ligase Master Mix 50 μL Total 100 μL - The reagents were gently mixed by pipetting up and down approximately 10 times and were incubated at room temperature for 10 min. About 40 μL of SPRI beads were added to the mixture, gently mixed, and incubated at room temperature for 5 min. The sample/bead mixture was placed in a magnetic stand and the beads were allowed to pellet in a ring around the middle portion of the tube for approximately 30-60 s, leaving a clear supernatant. The supernatant was discarded by leaving the tube in the magnetic stand while placing the pipette tip to the bottom center of the tube when aspirating to avoid disturbing the beads. The tube was removed from the magnetic rack and 140 μL of ONT-Adapter Bead Binding buffer was pipetted onto the beads. The sample was mixed by gently pipetting up and down approximately 10 times to resuspend the pellet. The tube was returned to the magnetic stand and the beads were allowed to pellet in a ring around the middle portion of the tube for approximately 30-60 s, leaving a clear supernatant. The supernatant was discarded by leaving the tube in the magnetic stand while placing the pipette tip to the bottom center of the tube when aspirating to avoid disturbing the beads. The tube was removed from the magnetic rack and an additional 140 μL of Adapter Bead Binding buffer was added and pipetted up and down to resuspend the pellet. The sample/bead mixture was placed in a magnetic stand and the beads were allowed to pellet into a ring around the middle portion of the tube for approximately 30-60 s, leaving a clear supernatant. The supernatant was discarded by leaving the tube in the magnetic stand while placing the pipette tip to the bottom center of the tube when aspirating to avoid disturbing the beads. The tube was then removed from the magnetic stand. About 15 μL of Elution Buffer (ELB) was added to the beads, and the beads were mixed thoroughly by pipetting up and down approximately 10 times and incubate for 10 minutes at room temperature for 5 min. The tubes were moved to a magnetic stand and the beads allowed to pellet for approximately 30-60 s. About 15 μL of eluate was remove and retained as the “final ligated product” for sequencing.
- In this example, a food or an environmental sample was processed by pore sequencing using standard manufacturer protocols. Briefly, one or more flow cells were primed by combining the following reagents per flow cell:
-
Reagent Volume ONT-Running Buffer with Fuel Mix (RBF) 480 μL Molecular grade H2O 520 μL Total 1,000 μL - A loading library was prepared by combining the following reagents:
-
Reagent Volume ONT-Running Buffer with Fuel Mix (RBF) 35 μL ONT-Library Loading Beads (LLB) 25.5 μL Final ligated product 12 μL Molecular grade H2O 2.5 μL Total 75 μL - The priming port on the Flow Cell was gently opened and approximately 50 μL of the preservative buffer and any small bubbles were removed, as illustrated by
FIG. 14 . About 800 μL of the priming mix was added into the priming port of the Flow Cell. Subsequently, 200 μL of the priming mix was dispensed into the Priming port. The final loading library was mixed thoroughly and 75 μL were added into the SpotON port, as illustrated byFIG. 15 . The lid of the pore sequencing device was closed and the sequencing was executed. - In this example, an electronic communication comprising a data set associated with the sequencing reaction described in Example 13 was transmitted over the cloud for analysis. The results of the analysis were reported back to customer.
FIG. 16 in this particular example, the customer requested an analysis of the sample for the presence or absence of Listeria, Salmonella, Campylobacter, and E. coli, which required the simultaneous targeting of multiple pathogens. - In this example, data from pore sequencing was used to identify foodborne disease-causing microorganisms. Briefly, the methods and processes described in Examples 1-13 were used to identify food or environmental samples comprising one or more of the organism shown below.
-
TABLE 2 Table 2: Exemplary Pathogenic Microorganisms Identified by Methods According to This Disclosure Onset Common Name Time After Signs & Duration Organism of Illness Ingesting Symptoms of Ilness Food Sources Bacillus B. cereus food 10-16 hrs Abdominal 24-48 hours Meats, stews, cereus poisoning cramps, watery gravies, vanilla diarrhea, nausea sauce Campylobacter Campylobacteriosis 2-5 days Diarrhea, cramps, 2-10 days Raw and jejuni fever, and undercooked vomiting; diarrhea poultry, may be bloody unpasteurized milk, contaminated water Clostridium Botulism 12-72 hours Vomiting, Variable Improperly botulinum diarrhea, blurred canned foods, vision, double especially vision, difficulty home-canned in swallowing, vegetables, muscle weakness. fermented fish, Can result in baked potatoes respiratory failure in aluminum and death foil Perfringens Perfringens food 8-16 hours Intense abdominal Usually Meats, poultry, poisoning cramps, watery 24 hours gravy, dried or diarrhea precooked foods, time and/or temperature- abused foods Cryptosporidium Intestinal 2-10 days Diarrhea (usually May be Uncooked food cryptosporidiosis watery), stomach remitting and or food cramps, upset relapsing over contaminated stomach, slight weeks to by an ill food fever months handler after cooking, contaminated drinking water Cyclospora Cyclosporiasis 1-14 days, Diarrhea (usually May be Various types cayetanensis usually at watery), loss of remitting and of fresh least 1 appetite, relapsing over produce week substantial loss of weeks to (imported weight, stomach months berries, lettuce, cramps, nausea, basil) vomiting, fatigue E. coli E. coli infection 1-3 days Watery diarrhea, 3-7 or Water or food (Escherichia (common cause of abdominal more days contaminated coli) “travelers' cramps, some with human producing diarrhea”) vomiting feces toxin E. coli Hemorrhagic 1-8 days Severe (often 5-10 days Undercooked O157:H7 colitis or bloody) diarrhea, beef (especially E. coli O157:H7 abdominal pain hamburger), infection and vomiting. unpasteurized Usually, little or milk and juice, no fever is raw fruits and present. More vegetables (e.g. common in sprouts), and children 4 yearscontaminated or younger. Can water lead to kidney failure. Hepatitis A Hepatitis 28 days Diarrhea, dark Variable, Raw produce, average urine, jaundice, 2 weeks-3 months contaminated (15-50 days) and flu-like drinking water, symptoms, i.e., uncooked fever, headache, foods and nausea, and cooked foods abdominal pain that are not reheated after contact with an infected food handler; shellfish from contaminated waters Lisieria Listeriosis 9-48 hrs for Fever, muscle Variable Unpasteurized monocytogenes gastro- aches, and nausea milk, soft intestinal or diarrhea. cheeses symptoms, Pregnant women made with 2-6 weeks may have mild unpasteurized for invasive flu-like illness, milk, ready-to- disease and infection can eat deli meats lead to premature delivery or stillbirth. The elderly or immuno- compromised patients may develop bacteremia or meningitis. Noroviruses Variously called 12-48 hrs Nausea, vomiting, 12-60 hrs Raw produce, viral abdominal contaminated gastroenteritis, cramping, drinking water, winter diarrhea, diarrhea, fever, uncooked acute non- bacterial headache. foods and gastroenteritis, Diarrhea is more cooked foods food poisoning, prevalent in that are not and food infection adults, vomiting reheated after more common in contact with an children. infected food handler; shellfish from contaminated waters Salmonella Salmonellosis 6-48 hours Diarrhea, fever, 4-7 days Eggs, poultry, abdominal meat, cramps, vomiting unpasteurized milk or juice, cheese, contaminated raw fruits and vegetables Shigella Shigellosis or 4-7 days Abdominal 24-48 hrs Raw produce, Bacillary dysentery cramps, fever, and contaminated diarrhea. Stools drinking water, may contain blood uncooked and mucus. foods and cooked foods that are not reheated after contact with an infected food handler Staphylococcus Staphylococcal 1-6 hours Sudden onset of 24-48 hours Unrefrigerated aureus food poisoning severe nausea and or improperly vomiting. refrigerated Abdominal meats, potato cramps. Diarrhea and egg salads, and fever may be cream pastries present. Vibrio V. 4-96 hours Watery 2-5 days Undercooked parahaemolyticus parahaemolyticus- (occasionally or raw seafood, infection bloody) diarrhea, such as abdominal shellfish cramps, nausea, vomiting, fever Vibrio V. vulnificus 1-7 days Vomiting, 2-8 days Undercooked vulnificus infection diarrhea, or raw seafood, abdominal pain, such as blood borne shellfish infection. Fever, (especially bleeding within oysters) the skin, ulcers requiring surgical removal. Can be fatal to persons with liver disease or weakened immune systems. - First, a database was constructed using data from approximately 35,000 food or environmental samples (of which about 10% contained traces of pathogenic microorganisms as shown in Table 3) using two components: microorganism presence and chemical composition. Pore sequencing in combination with use of characteristic polymorphic gene regions (comprising SNP's, RFLP's, STRs, VNTR's, hypervariable regions, minisatellites, dinucleotide repeats, trinucleotide repeats, tetranucleotide repeats, simple sequence repeats, indels, and insertion elements) associated with a wide diversity of microorganisms were used to analyze each sample for the presence or absence of 17,800 different bacterial species (representing both pathogenic and non-pathogenic bacterial species). Additionally, data on sample composition was collected for 4,600 food ingredients in each environmental/food sample.
- The data using the top bacteria associated with pathogen contamination (exemplified in
FIG. 5 ) was used to train a classification model, which was tested for overfitting by machine learning techniques. - We further tested the performance of the model by testing a set of unknown food or environmental samples (50% of each). The full results of and ROC analysis of accuracy and precision of the classification models are presented in Table 3. In the cases of all the pathogens in Table 3, the metagenomics-based classification model had higher than 95% precision and 97% accuracy for pathogen detection.
-
TABLE 3 Table 3: Independent Validation of Pathogen Prediction in Unknown Samples Accuracy Precision Pathogen Score Score Vibrio parahaemolyticus 99.78% 96.55% Staphylococcus aureus 99.67% 100.00% Yersinia pseudotuberculosis 99.45% 100.00% Vibrio vulnificus 99.12% 100.00% Shigella boydii 99.12% 100.00% Salmonella enterica 96.16% 94.39% Escherichia coli 97.48% 98.40% - This example describes the in silico evaluation of primer sensitivity and specificity for pathogen detection in PCR assays. First, a candidate primer pair was mapped against inclusion and exclusion sequences in sequence databases. Secondly, the identified hits are tabulated based on predicted amplification patterns in order to then determine the sensitivity and specificity of the primer pair in silico.
- Specifically, a primer pair was designed to target Salmonella Montevideo and Salmonella Oranienburg. The composition of the sequence database for in silico evaluation contained 7705 Salmonella genomes, including 98 Montevideo/Oranienburg genomes, and 1707 non-Salmonella genomes (total of 9412 genomes). Tabulation of the analysis results showed that the exact number of 98 Salmonella Montevideo and Oranienburg genomes was identified as true positive hits. The remaining 9314 (which equals the total number of 9412 genomes minus the 98 true positive hits identified) genomes were characterized as true negative results. The results are shown in
FIG. 17 . - This example shows that the MinION/GridION flow cell can be reused for sequence sample analysis for at least 2 times. Between each sample analysis (50 samples analyzed in each analysis) the flow cell was washed with a buffer system resulting in 30,000 reads and 26,000 reads per sample during the second and third reuse, respectively, compared to 36,000 reads per sample when using a new flow cell (
FIG. 18 ).FIG. 19 illustrates that the number of reads per sample for reused MinION/GridION flow cells was well above the acceptable minimum threshold of 10,000 (10 K) reads per sample. - A significant source of confounding data in pathogen risk detection is contamination of samples by resident microorganisms on human handlers. Accordingly, we deployed a biomek-based sample sequencing platform that requires no human handling after enrichment (see
FIG. 11 andFIG. 12 ) to implement the methods of Examples 10-13 and 15. Automation included every step of library preparation post incubation of the samples as in Examples 1-6, and included cell lysis, PCR, clean up, and sequencing. An automated handling system is illustrated inFIG. 11 . - To determine the performance of our automated handling system, we analyzed samples spiked with 10 different Salmonella serotypes (Enteritidis, Thyphimurium, I 4—[5]_12:i:-, Newport, Javiana, Infantis, Montevideo, Heidelberg, Muenchen) by automated or manual handling. The results are presented in
FIG. 20 . Serotype detection accorded 100% between manual and automatic handling, and a student's T-test of the number of sequencing reads generated indicated no significant difference between manual and automated handling. - A significant limitation of existing environmental pathogen detection methods is that they involve culturing, which involves the use of multiple different specialized media to detect different classes of pathogens (e.g. bacteria autotrophic for one or more nutrient vs those not). This severely limits the ability to detect food contamination during storage. Accordingly, we applied our environmental sampling/pore sequencing technique as outlined in Examples 1-13 on 100 samples of chicken wings and 100 samples of ground chicken. Each sample was analyzed for the presence/absence of 17,800 pathogenic and non-pathogenic bacteria.
- We applied a principle components analysis to the whole or ground chicken data sets, which is presented in
FIGS. 21 andFIG. 22 . Data points for both whole and ground chicken samples cluster along a discernable trajectory more than 2 days prior to their expiration date (see movement along PC2 in the whole chicken sample and PC1/PC3 in the ground chicken sample), while data points 1-2 days from expiration begin to rapidly diverge. - The principle components analysis suggested a classification model could be built to detect whether or not a whole or ground chicken sample had expired. The data on the presence/absence of 17,800 pathogenic and non-pathogenic bacteria was used to generate a classification model. When tested on an independent data set of samples, this classifier showed 97% accuracy in detecting samples past their expiration date using an ROC analysis.
- To improve detection of desired sequences during sequencing runs, we tested the performance of different barcoding designs on sequence detection. We generated unique sequences of nucleotides with maximum Levenschtein distances and used them to generate two formats of barcodes to be applied to sequences during library preparation: a) a periodic block design, in which each barcode consisted of a unique block sequence repeated 3 times, and b) a nonperiodic block design, in which 3 unique blocks were combined in tandem for each barcode sequence.
- We tested these nonperiodic and periodic block designs alongside a conventional barcode design (which were designed barcodes provided by our sequencing platform provider) when applied to the same samples in test sequencing runs (see
FIG. 23 ). Briefly, a defined Levenshtein distance between each “building block” or molecular index can be used to form larger barcodes. Such larger barcodes can have a period block design, such as barcodes created by repeating each block multiple times with the largest possible Levenshtein distance between the individual blocks (seeFIG. 23 ). Alternatively, such barcodes can also have a nonperiod block design, such as barcodes created by concatenative multiple blocks that are unique to each barcode with the largest possible Levenshtein distance between the individual blocks (seeFIG. 23 ). - We performed 10 ONT MinION runs and averaged the % of retained sequences and crosstalk for each run. The results are presented in Table 4. Both periodic and nonperiodic barcode designs showed improvements in retention and crosstalk versus the conventional design, with the nonperiodic design being the best in both metrics.
- Both barcode designs present distinct advantages. Both increase the number of retained sequences and allow for adjustable precision by choosing 1, 2, or 3 blocks in demultiplexing, but the periodic design requires fewer repeat blocks and presents less complexity in demultiplexing, whereas the nonperiodic design allows for improved crosstalk prevention. The improved crosstalk prevention of the nonperiodic design suggests a method of reducing crosstalk during highly multiplexed runs or when a flowcell is reused.
-
TABLE 4 Table 4: Performance of Conventional Barcode Design vs Periodic and Nonperiodic Block Designs Conventional Periodic Block Nonperiodic Design Design Block Design Retained Sequences 85% 96% 98 % Crosstalk 6% 5% 2% - Listeria-containing food and environmental samples were prepared, libraries were constructed, and sequencing was performed as in Examples 1-13 and 15. Samples were analyzed for the presence of Listeria by analyzing highly polymorphic genetic markers. A principle component analysis of the Listeria sequences isolated from sequencing (see
FIG. 24 ) identified clusters of closely related bacteria which likely originated from the same source. - The length of time for a full sequencing run represents a major limitation in the speed of detection or serotyping of pathogenic bacterial strains by high-throughput sequencing. We hypothesized that using “live” detection calls during sequencing runs (which can be performed as early as 1 hour for ONT MinION and GridION, and 5 hours for Illumina MiSeq) would allow for certain bacteria to be detected/serotyped on a preliminary basis based on sequencing, with follow-up confirmation by other non-sequencing-based tests (e.g. Q-PCR).
- We performed a test analysis of 50 environmental samples with about 15% positive for one of the pathogens identified in Table 3; positive samples were spiked with Salmonella, Listeria, E. coli, and campylobacter (2 samples each) from the top known pathogenic top strain/serotypes. Pathogen species was detected by detection of characteristic genomic markers. We compared the accuracy of species detection and serotyping at “live” and complete timepoints for the sequencing runs. The results are presented in Table 5. Early detection (1 hour for ONT MinION, and 5 hours for Illumina MiSeq) was 100% accurate for both formats, while MinION showed improved accuracy for serotyping.
-
TABLE 5 Table 5: “Early call” Detection of Bacterial Species and Serotype Sequences at Detection Serotyping Final Platform early call calls calls serotyping call MiSeq 425,000 100% 20% 100% MinION 630,000 100% 60% 100% - While preferred embodiments of the present invention have been shown and described herein, such embodiments are provided by way of example only. It is not intended that the invention be limited by the specific examples provided within the specification. While the invention has been described with reference to the aforementioned specification, the descriptions and illustrations of the embodiments herein are not meant to be construed in a limiting sense. Numerous variations, changes, and substitutions will now occur to those skilled in the art without departing from the invention. Furthermore, it shall be understood that all aspects of the invention are not limited to the specific depictions, configurations or relative proportions set forth herein which depend upon a variety of conditions and variables. It should be understood that various alternatives to the embodiments of the invention described herein may be employed in practicing the invention. It is therefore contemplated that the invention shall also cover any such alternatives, modifications, variations or equivalents. It is intended that the following claims define the scope of the invention and that methods and structures within the scope of these claims and their equivalents be covered thereby.
Claims (14)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US15/927,958 US20190203267A1 (en) | 2017-12-29 | 2018-03-21 | Detection of microorganisms in food samples and food processing facilities |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201762611846P | 2017-12-29 | 2017-12-29 | |
US15/927,958 US20190203267A1 (en) | 2017-12-29 | 2018-03-21 | Detection of microorganisms in food samples and food processing facilities |
Publications (1)
Publication Number | Publication Date |
---|---|
US20190203267A1 true US20190203267A1 (en) | 2019-07-04 |
Family
ID=63761644
Family Applications (4)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US15/927,958 Abandoned US20190203267A1 (en) | 2017-12-29 | 2018-03-21 | Detection of microorganisms in food samples and food processing facilities |
US15/928,023 Active US10246704B1 (en) | 2017-12-29 | 2018-03-21 | Detection of microorganisms in food samples and food processing facilities |
US15/927,913 Active US10101328B1 (en) | 2017-12-29 | 2018-03-21 | Detection of microorganisms in food samples and food processing facilities |
US16/054,682 Active 2038-10-09 US10676794B2 (en) | 2017-12-29 | 2018-08-03 | Detection of microorganisms in food samples and food processing facilities |
Family Applications After (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US15/928,023 Active US10246704B1 (en) | 2017-12-29 | 2018-03-21 | Detection of microorganisms in food samples and food processing facilities |
US15/927,913 Active US10101328B1 (en) | 2017-12-29 | 2018-03-21 | Detection of microorganisms in food samples and food processing facilities |
US16/054,682 Active 2038-10-09 US10676794B2 (en) | 2017-12-29 | 2018-08-03 | Detection of microorganisms in food samples and food processing facilities |
Country Status (1)
Country | Link |
---|---|
US (4) | US20190203267A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11282587B2 (en) | 2017-12-29 | 2022-03-22 | Clear Labs, Inc. | Automated priming and library loading device |
Families Citing this family (23)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA2972902A1 (en) | 2014-12-31 | 2016-07-07 | Wal-Mart Stores, Inc. | System and method for monitoring gas emission of perishable products |
MX2019013936A (en) * | 2017-05-23 | 2020-01-30 | Walmart Apollo Llc | Automated inspection system. |
US20190203267A1 (en) | 2017-12-29 | 2019-07-04 | Clear Labs, Inc. | Detection of microorganisms in food samples and food processing facilities |
US10597714B2 (en) | 2017-12-29 | 2020-03-24 | Clear Labs, Inc. | Automated priming and library loading device |
US11448632B2 (en) | 2018-03-19 | 2022-09-20 | Walmart Apollo, Llc | System and method for the determination of produce shelf life |
US11393082B2 (en) | 2018-07-26 | 2022-07-19 | Walmart Apollo, Llc | System and method for produce detection and classification |
US11715059B2 (en) | 2018-10-12 | 2023-08-01 | Walmart Apollo, Llc | Systems and methods for condition compliance |
WO2020106332A1 (en) | 2018-11-20 | 2020-05-28 | Walmart Apollo, Llc | Systems and methods for assessing products |
EP3977461A4 (en) * | 2019-05-24 | 2023-06-28 | Clear Labs, Inc. | Methods and kits for detecting pathogens |
CN110408707B (en) * | 2019-07-23 | 2021-02-12 | 华中农业大学 | Molecular marker cloned from InDel fragment and related to pig hair color property |
US10557105B1 (en) | 2019-08-09 | 2020-02-11 | Bao Tran | Extraction systems and methods |
CN111020039B (en) * | 2019-12-30 | 2022-12-09 | 广东省科学院微生物研究所(广东省微生物分析检测中心) | Campylobacter jejuni species specific molecular target and rapid detection method thereof |
CN110951899B (en) * | 2020-01-03 | 2022-12-27 | 广东顺德工业设计研究院(广东顺德创新设计研究院) | PCR detection system, kit and detection method for detecting vibrio parahaemolyticus |
CN114078568B (en) * | 2020-09-14 | 2022-07-05 | 青岛欧易生物科技有限公司 | Metagenome sequencing data processing system and processing method based on IIB type restriction endonuclease characteristics |
CN112308426A (en) * | 2020-11-02 | 2021-02-02 | 北京工商大学 | Training method, evaluation method and device for food heavy metal pollution risk evaluation model |
CN112345712A (en) * | 2020-11-06 | 2021-02-09 | 四川省丹丹郫县豆瓣集团股份有限公司 | Storage method for safety risk prevention and control of fermented food |
US11248265B1 (en) | 2020-11-19 | 2022-02-15 | Clear Labs, Inc | Systems and processes for distinguishing pathogenic and non-pathogenic sequences from specimens |
MX2023012316A (en) * | 2021-04-22 | 2023-11-28 | Basepaws | Oral swab-based test for the detection of dental disease states in domestic cats, dogs and other mammals. |
CN113215235A (en) * | 2021-06-17 | 2021-08-06 | 嘉兴允英医学检验有限公司 | Method for rapidly detecting pathogenic microorganisms in high flux |
CA3224390A1 (en) * | 2021-07-14 | 2023-01-19 | Damian KAO | Oral swab-based test for the detection of various disease states in domestic cats |
WO2023288279A2 (en) * | 2021-07-14 | 2023-01-19 | Basepaws | Oral swab-based test for the detection of various disease states in domestic cats |
US11775918B2 (en) | 2021-09-08 | 2023-10-03 | International Business Machines Corporation | Analysis of handling parameters for transporting sensitive items using artificial intelligence |
CN113848291B (en) * | 2021-12-01 | 2022-03-18 | 江苏中农生物科技有限公司 | Egg product safety sampling rapid detection system and method |
Family Cites Families (18)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
PT1356080E (en) | 2001-02-01 | 2006-09-29 | Profos Ag | DETECTION AND IDENTIFICATION OF BACTERIAL GROUPS |
WO2003072805A2 (en) | 2002-02-21 | 2003-09-04 | Asm Scientific, Inc. | Recombinase polymerase amplification |
US20170022558A1 (en) | 2007-10-30 | 2017-01-26 | Complete Genomics, Inc. | Integrated system for nucleic acid sequence and analysis |
WO2010080617A2 (en) | 2008-12-19 | 2010-07-15 | The Board Of Trustees Of The University Of Illinois | Detecting and sorting methylated dna using a synthetic nanopore |
WO2011025819A1 (en) | 2009-08-25 | 2011-03-03 | New England Biolabs, Inc. | Detection and quantification of hydroxymethylated nucleotides in a polynucleotide preparation |
JP2013535193A (en) | 2010-07-23 | 2013-09-12 | ベックマン コールター, インコーポレイテッド | System and method including an analyzer |
WO2012024658A2 (en) | 2010-08-20 | 2012-02-23 | IntegenX, Inc. | Integrated analysis system |
EP2674501A1 (en) | 2012-06-14 | 2013-12-18 | Agence nationale de sécurité sanitaire de l'alimentation,de l'environnement et du travail | Method for detecting and identifying enterohemorrhagic Escherichia coli |
CA2880583C (en) | 2012-09-10 | 2021-02-16 | Randy P. Rasmussen | Multiple amplification cycle detection |
WO2014060305A1 (en) | 2012-10-15 | 2014-04-24 | Technical University Of Denmark | Database-driven primary analysis of raw sequencing data |
US20140161686A1 (en) | 2012-12-10 | 2014-06-12 | Advanced Liquid Logic, Inc. | System and method of dispensing liquids in a microfluidic device |
US10302614B2 (en) | 2014-05-06 | 2019-05-28 | Safetraces, Inc. | DNA based bar code for improved food traceability |
US20160239732A1 (en) | 2014-11-20 | 2016-08-18 | Clear Labs Inc. | System and method for using nucleic acid barcodes to monitor biological, chemical, and biochemical materials and processes |
CN112126675B (en) | 2015-01-12 | 2022-09-09 | 10X基因组学有限公司 | Method and system for preparing nucleic acid sequencing library and library prepared by using same |
US10159971B2 (en) | 2015-05-03 | 2018-12-25 | Clear Labs Inc. | Apparatus and method for economic, fast and easy sampling of food and environmental samples |
SG11201805118XA (en) | 2015-12-18 | 2018-07-30 | Biofire Defense Llc | Solid fluorescence standard |
CA3018187A1 (en) | 2016-03-24 | 2017-09-28 | Biofire Diagnostics, Llc | Methods for quantitative amplification |
US20190203267A1 (en) | 2017-12-29 | 2019-07-04 | Clear Labs, Inc. | Detection of microorganisms in food samples and food processing facilities |
-
2018
- 2018-03-21 US US15/927,958 patent/US20190203267A1/en not_active Abandoned
- 2018-03-21 US US15/928,023 patent/US10246704B1/en active Active
- 2018-03-21 US US15/927,913 patent/US10101328B1/en active Active
- 2018-08-03 US US16/054,682 patent/US10676794B2/en active Active
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11282587B2 (en) | 2017-12-29 | 2022-03-22 | Clear Labs, Inc. | Automated priming and library loading device |
US11568958B2 (en) | 2017-12-29 | 2023-01-31 | Clear Labs, Inc. | Automated priming and library loading device |
US11581065B2 (en) | 2017-12-29 | 2023-02-14 | Clear Labs, Inc. | Automated nucleic acid library preparation and sequencing device |
Also Published As
Publication number | Publication date |
---|---|
US10101328B1 (en) | 2018-10-16 |
US20190204317A1 (en) | 2019-07-04 |
US10246704B1 (en) | 2019-04-02 |
US10676794B2 (en) | 2020-06-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10676794B2 (en) | Detection of microorganisms in food samples and food processing facilities | |
US11282587B2 (en) | Automated priming and library loading device | |
Choi et al. | Metagenomic analysis of chicken gut microbiota for improving metabolism and health of chickens—a review | |
EFSA Biohaz Panel et al. | Pathogenicity assessment of Shiga toxin‐producing Escherichia coli (STEC) and the public health risk posed by contamination of food with STEC | |
Ortiz‐Estrada et al. | Predictive functional profiles using metagenomic 16S rRNA data: a novel approach to understanding the microbial ecology of aquaculture systems | |
Michel et al. | The gut of the finch: uniqueness of the gut microbiome of the Galápagos vampire finch | |
Videvall et al. | Major shifts in gut microbiota during development and its relationship to growth in ostriches | |
Forsythe | The microbiology of safe food | |
US10597714B2 (en) | Automated priming and library loading device | |
Bell et al. | Ecological characterization of the colonic microbiota of normal and diarrheic dogs | |
US20220084630A1 (en) | Methods and kits for detecting pathogens | |
Hill et al. | Polymerase chain reaction screening for Salmonella and enterohemorrhagic Escherichia coli on beef products in processing establishments | |
Yang et al. | Microevolution and gain or loss of mobile genetic elements of outbreak-related Listeria monocytogenes in food processing environments identified by whole genome sequencing analysis | |
Baiz et al. | Gut microbiome composition better reflects host phylogeny than diet diversity in breeding wood‐warblers | |
Bolinger et al. | Utilizing the microbiota and machine learning algorithms to assess risk of Salmonella contamination in poultry rinsate | |
Gökmen et al. | Prevalence and molecular characterization of shiga toxin-producing Escherichia coli in animal source foods and green leafy vegetables | |
Sallam et al. | Cefotaxime-, ciprofloxacin-, and extensively drug-resistant Escherichia coli O157: H7 and O55: H7 in camel meat | |
GB2569831A (en) | Detection of microorganisms in food samples and food processing facilities | |
Okyere et al. | Analysis of fish commonly sold in local supermarkets reveals the presence of pathogenic and multidrug-resistant bacterial communities | |
Sirangelo | Food Microbiology and Multi-Omics Approaches | |
BR112019009341A2 (en) | automated initiation and library loading device | |
Cazer | Modeling and Mining Antimicrobial Resistance in Human and Animal Populations | |
Soverini | HOLOBIOMICS-Use of microbiomics for the exploration of microbial communities in holobionts. | |
Merrick et al. | A genetically related cluster of Salmonella Typhimurium cases in humans associated with ruminant livestock and related food chains, United Kingdom, August 2021-December 2022 | |
McKenna | Campylobacter Spp. Within the UK Poultry Industry: Prevalence, Risk Factors and the Chicken Microbiome |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: CLEAR LABS, INC., CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:AMINI, SASAN;KHAKSAR, RAMIN;TAYLOR, MICHAEL;AND OTHERS;SIGNING DATES FROM 20180327 TO 20180328;REEL/FRAME:045578/0888 |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: NON FINAL ACTION MAILED |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: FINAL REJECTION MAILED |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: FINAL REJECTION MAILED |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: NON FINAL ACTION MAILED |
|
STCB | Information on status: application discontinuation |
Free format text: EXPRESSLY ABANDONED -- DURING EXAMINATION |