Abstract
Food safety is of vital interest for public health and the stability of society. In this paper, we analyzed the characteristics of food safety incidents (FSIs), including spatial distribution, food categories, risk factors, and supply chain links, reported by mainstream media in China. Based on our analysis, we constructed a semantic template for text data related to FSIs. Furthermore, we introduced a multi-layer, multi-level semantic structure of rank (MMSS-Rank) algorithm to measure the similarity between collected food safety data and the semantic template. We then calculated the overall scores (i.e., text layer weight, semantic template weight, and keyword density matrix) and selected an appropriate threshold to determine the accuracy of the FSI data. Results showed that, compared with traditional methods, MMSS-Rank is an efficient and robust method for identifying large-scale FSI data with higher accuracy and recall rate.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Alexandr Andoni PI (2006) Near-optimal hashing algorithms for approximate nearest neighbor in high dimensions. In: IEEE symposium on foundations of computer science. IEEE Computer Society
Anonymous (1997) A simple guide to understanding and applying the hazard analysis critical control point concept, 2nd edn. International Life Sciences Institute (ILSI), Brussels
Bollegala D, Matsuo Y, Ishizuka M (2007) Measuring semantic similarity between words using web search engines. In: Proceedings of the 16th international conference on World Wide Web. ACM, Banff. pp 757–766
Burlingame B, Pineiro M (2007) The essential balance: risks and benefits in food safety and quality. J Food Compos Anal 20(3–4):139–146
Cai XB, Chen HP, Zhao PP (2009) A deep web sources focused Crawler’s Crawling strategy. Microelectron Comput 26(8):117–120
Chowdhury A, Frieder O et al (2002) Collection statistics for fast duplicate document detection. ACM Trans Inf Syst 2002:171–191
Dai Y, Kong D, Wang M (2013) Investor reactions to food safety incidents: evidence from the Chinese milk industry. Food Policy 43:23–31
FAO (1997) Risk Management and Food Safety. Food and Nutrition Paper, Rome
Fukuda K (2015) Food safety in a globalized world. Bull World Health Organ 93(4):212
Gratt LB (1987) Uncertainty in risk assessment, risk management and decision making. Plenum Press, New York, pp 147–154
He Z, Zhai G, Suzuki T (2014) The immediate influence of a food safety incident on Japanese consumers’ food choice decisions and willingness to pay for safer food. Hum Ecol Risk Assess 20(4):1099–1112
Huang CH, Yin J, Hou F (2011) A text similarity measurement combining word semantic information with TF-IDF method. Chin J Comput 34(5):856–864
Li Q, Liu W, Wang J, Dai Y (2011) Application of content analysis in food safety reports on the Internet in China. Food Control 22(2):252–256
Li S, Chen L, Chen B (2014) The analysis of food safety incidents exposed by the media from 2004 to 2012 in China. J Chin Inst Food Sci Technol 14(3):1–8
Liu H, Kerr WA, Hobbs JE (2012) A review of Chinese food safety strategies implemented after several food safety incidents involving export of Chinese aquatic products. Br Food J 114(3):372–386
Liu YP, Wang M, Hu BG (2014) Research on the management of the safety risks of livestock food in China based on the empirical analysis of livestock food safety events. J Anhui Agric Sci 19:6373–6375 (6378)
Liu Y, Liu F, Zhang J et al (2015) Insights into the nature of food safety issues in Beijing through content analysis of an Internet database of food safety incidents in China. Food Control 51:206–211
Lu YC, Lu MY et al (2002) Analysis and structural word weighting function space vector method. J Comput Res Dev 39(10):1205–1210
Luo L, An YF, Gu C, Li Y (2013) Analysis of sources of risk and regulatory strategy of Chinese food safety. J Food Sci Technol 2013(2):77–82
Mihalcea R, Tarau P, Figa E (2004) PageRank on semantic networks, with application to word sense disambiguation. Unit Scholarly Works
Mo M, An YF, He ZW (2014) Key supervision points and control countermeasures of food safety in supermarkets based on the analysis of 359 food safety incidents in supermarkets. Theory Pract Finance Econ 1:137–140
Pirro G (2009) A semantic similarity metric combining features and intrinsic information content. Data Knowl Eng 68(11):1289–1308
Qureshi MA, Younus A, O’Riordan C et al (2018) A wikipedia-based semantic relatedness framework for effective dimensions classification in online reputation management. J Ambient Intell Humaniz Comput 9:1403–1413
Rong F, Zhang Y, Wang Z et al (2019) Influencing factors of consumer willingness to pay for cold chain logistics: an empirical analysis in China. J Ambient Intell Humaniz Comput 10:3279–3285
Shi JP (2010) Exploration on food safety supervision mechanism of catering service. China Food Drug Adm Mag 2:21–23
Theobald M, Siddharth J, Paepcke A (2008) SpotSigs: robust and efficient near duplicate detection in large web collections. In: Proceedings of the 31st annual international ACM SIGIR conference on research and development in information retrieval, vol 2008. ACM, pp 563–570
Valeeva NI, Meuwissen MPM, Huirne RBM (2004) Economics of food safety in chains: a review of general principles. NJAS Wagening J Life Sci 51(4):369–390
Wang JF (2010) Research and application of web news extraction based on structure and visual consistency. Zhejiang University, Hangzhou
Wu LH, Qian H (2012) China food safety development report, 2012th edn. Peking University Press, Beijing
Zadeh PDH, Reformat MZ (2013) Context-aware similarity assessment within semantic space formed in linked data. J Ambient Intell Humaniz Comput 4:515–532
Zhang HX, An YF (2013) Sources and prevention measures of food safety risks in food producing enterprises: based on content analysis of food safety incidents. On Econ Probl 5:73–76
Zhang C, Chen ZY, Gu P (2008) Automatic Blog recognition with DOM tree. Appl Res Comput 25(5):1489–1491
Zhang DB, Xu JP, Li CG (2010) Model for food safety warning based on inspection data and BP neural network. Trans CSAE 26(1):221–226
Zhang HX, An YF, Zhang WS (2013) China’s food safety risk identification, assessment and management based on the empirical analysis of food safety incidents. Inq Econ Issues 6:135–141
Zhao X, Zhang W, He W et al (2019) Research on customer purchase behaviors in online take-out platforms based on semantic fuzziness and deep web crawler. J Ambient Intell Humaniz Comput. https://doi.org/10.1007/s-12652-019-01533-605
Acknowledgement
This work was supported in part by the 2019 Key Research Project Sponsored by National Social Science: Research on the Scientific Connotation and the Design of Food Safety System Framework, Project No. 19AGL021. The Philosophy and Social Science Fund of Education Department of Jiangsu Province (15JD005).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Zhang, J., Chen, M., Hu, E. et al. Data mining model for food safety incidents based on structural analysis and semantic similarity. J Ambient Intell Human Comput (2020). https://doi.org/10.1007/s12652-020-01750-4
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s12652-020-01750-4