Europe PMC
Nothing Special   »   [go: up one dir, main page]

Europe PMC requires Javascript to function effectively.

Either your web browser doesn't support Javascript or it is currently turned off. In the latter case, please turn on Javascript support in your web browser and reload this page.

This website requires cookies, and the limited processing of your personal data in order to function. By using the site you are agreeing to this as outlined in our privacy notice and cookie policy.

Abstract 


Background:

A direct consequence of global warming, and strongly correlated with poor physical and mental health, food insecurity is a rising global concern associated with low dietary intake. The Coronavirus pandemic has further aggravated food insecurity among vulnerable communities, and thus has sparked the global conversation of equal food access, food distribution, and improvement of food support programs. This research was designed to identify the key features associated with food insecurity during the COVID-19 pandemic using Machine learning techniques. Seven machine learning algorithms were used in the model, which used a dataset of 32 features. The model was designed to predict food insecurity across ten Arab countries in the Gulf and Mediterranean regions. A total of 13,443 participants were extracted from the international Corona Cooking Survey conducted by 38 different countries during the COVID -19 pandemic.

Results:

: The findings indicate that Jordanian, Palestinian, Lebanese, and Saudi Arabian respondents reported the highest rates of food insecurity in the region (15.4%,13.7%,13.7% and 11.3% respectively). On the other hand, Oman and Bahrain reported the lowest rates (5.4% and 5.5% respectively). Our model obtained accuracy levels of 70%-82% in all algorithms. Gradient Boosting and Random Forest techniques had the highest performance levels in predicting food insecurity (82% and 80% respectively). Place of residence, age, financial instability, difficulties in accessing food, and depression were found to be the most relevant features associated with food insecurity.

Conclusions:

: Overall, ML algorithms seem to be an effective method in early detection and prediction of food insecurity. Future research would benefit from utilizing the proposed model in developing more complex and accurate models aiming to enhance granularity, with the ability to share data, to incorporate wide range of variables, and to make use of automation for effective prevention and intervention programs at the regional and individual levels.

Free full text 


Research Square

PPRID: PPR636673
EMSID: EMS173425
Res Sq preprint, version 1, posted 2023 March 27
https://doi.org/10.21203/rs.3.rs-2545270/v1

Machine Learning Techniques for the Identification of Risk Factors Associated with Food Insecurity among Adults in Arab countries during the COVID-19 Pandemic

Affiliations

  1. 1.Al Quds University: Al-Quds University
  2. 2.Lebanese University: Universite Libanaise
  3. 3.Qatar University College of Health Sciences
  4. 4.Taibah University Faculty of Medicine
  5. 5.Zayed University Department of Natural Science and Public Health
  6. 6.Cairo Research Center
  7. 7.Public Authority for Applied Education and Training
  8. 8.University of Bahrain College of Health Sciences
  9. 9.Sultan Qaboos University
  10. 10.Mutah University
  11. 11.Al Quds University School of Medicine: Al Quds University Faculty of Medicine

Copyright and license information

This article is a preprint. A journal published article is available.

Abstract

Background

A direct consequence of global warming, and strongly correlated with poor physical and mental health, food insecurity is a rising global concern associated with low dietary intake. The Coronavirus pandemic has further aggravated food insecurity among vulnerable communities, and thus has sparked the global conversation of equal food access, food distribution, and improvement of food support programs. This research was designed to identify the key features associated with food insecurity during the COVID-19 pandemic using Machine learning techniques. Seven machine learning algorithms were used in the model, which used a dataset of 32 features. The model was designed to predict food insecurity across ten Arab countries in the Gulf and Mediterranean regions. A total of 13,443 participants were extracted from the international Corona Cooking Survey conducted by 38 different countries during the COVID -19 pandemic.

Results

The findings indicate that Jordanian, Palestinian, Lebanese, and Saudi Arabian respondents reported the highest rates of food insecurity in the region (15.4%,13.7%,13.7% and 11.3% respectively). On the other hand, Oman and Bahrain reported the lowest rates (5.4% and 5.5% respectively). Our model obtained accuracy levels of 70%-82% in all algorithms. Gradient Boosting and Random Forest techniques had the highest performance levels in predicting food insecurity (82% and 80% respectively). Place of residence, age, financial instability, difficulties in accessing food, and depression were found to be the most relevant features associated with food insecurity.

Conclusions

Overall, ML algorithms seem to be an effective method in early detection and prediction of food insecurity. Future research would benefit from utilizing the proposed model in developing more complex and accurate models aiming to enhance granularity, with the ability to share data, to incorporate wide range of variables, and to make use of automation for effective prevention and intervention programs at the regional and individual levels.

Keywords: food insecurityl, COVID-192, food consumption score3, machine learning4, prediction5, Arab countries6

Introduction

Food insecurity (FI) has been a major global health concern, particularly in low- and middle-income countries. FI is defined as a lack of access to sufficient and affordable food (1, 2). Improper diet, unhealthy eating habits and lifestyles(35), limited food availability, lack of access to food, and proper food consumption to meet dietary needs have been evidenced as major factors that contribute to food insecurity (4, 6, 7) y. According to the State of Food Security and Nutrition in the World (SOFI) report, food insecurity can have a serious impact on people’s health of all ages and may include several adverse health effects such as depression, diabetes, obesity, anxiety, and hypertension (8, 9). Furthermore, the SOFI report indicated that the COVID-19 pandemic increased chronic hunger between 2019 and 2020(8). The Global Humanitarian Overview report indicated that about 265 million people were in need of humanitarian assistance during the year 2021 due to the COVID-19 pandemic (8, 10). Several studies reported that the COVID-19 pandemic significantly impacted global food insecurity and poverty, mainly in low- and middle-income countries that have limited access to food, as they experienced high rates of unemployment, income inequality, and disruption of social safety programs(1114).

In the Arab region, mainly in Jordan, Lebanon and Palestine, the pandemic has severely affected the countries’ national economies, in particular the services sectors, the supply chain, the markets, and trade, all of which have directly impacted food supply, demand, and access (1517). Several studies reported that low- and middle-income countries were most affected by the COVID-19 pandemic, and thus presented an increase in food insecurity(8, 18). A United Nations Economic and Social Commission for Western Asia (ESCWA) study reported that around 8.3 million people in the Arab region ran the risk of falling into poverty and income crises due to lockdowns, which increased the number of food-insecure and undernourished persons(19). Furthermore, Kharroubi et al.’s study indicated that food insecurity among Lebanese increased from 27–36%-39% due to the pandemic(12). Similarly, World Food Program (WFP) reports indicated that 53% and 34% of Jordanians and Palestinians are vulnerable to food insecurity respectively(2022). Although Gulf Cooperation Council (GCC) countries have been considered the most food-secure countries in the Arab world, the COVID-19 pandemic and the current global crises have affected food accessibility and caused food insecurity concerns in the GCC region(23, 24).

Food insecurity is an interdimensional concept that encompasses food availability, access to food, and food consumption diversity. Several indicators have been developed for the assessment and depiction of households’ food insecurity. The Food Consumption Score (FCS) was used to measure the quantity and diversity of food consumption and has been considered a valid indicator for food insecurity(2528). Data mining and Machine Learning (ML) techniques have been used in several studies as efficient tools for predicting and identifying the risk factors associated with food insecurity (2932). K-Nearest Neighbor (KNN), Random Forest (RF), Logistic regression and Support Vector Machine (SVM) are among the machine learning models that have been assessed for the prediction of food insecurity(29, 30, 33).

Gao et al. examined the machine learning models used to identify vulnerable household characteristics and to predict food-insecure households. The models indicated that household size, income, and access to farm production and resources are risk factors associated with food insecurity(34). In Georgiana et al.’s study, the machine learning models were used for identifying food insecurity in the food sharing network, the authors indicated that Random Forest and AdaBoost had higher prediction accuracies and produced a complex features structure that contributes to food insecurity(29). In Doreswamy et al.’s study, the machine learning models were used in household food insecurity classifications, among them are the KNN, Logistic Regression (LR), RF, and SVM models; the RF reported the best accuracy for features classifications and importance with an accuracy rate of 99.98% (30).

Thus, this study aims to further investigate the performance of different machine learning techniques in predicting and identifying risk factors associated with food insecurity in low- and middle-income countries during the COVID-19 pandemic. Furthermore, we seek to identify the direct consequences of the COVID-19 pandemic on people’s health and poverty indices in the region. The study utilized ML algorithms to detect food insecurity and food secure groups based on the food consumption score, key features were extracted from the Regional Corona Cooking Survey dataset. Furthermore, the study determined the features most associated with food insecurity.

Methods

Aim

The study aimed to identify the key determinants of food insecurity within the Arab region during the COVID-19 pandemic through a novel approach, Machine Learning algorithms.

Setting

The “Corona Cooking Survey April 2020”(35) conducted by 38 different countries during the COVID - 19 pandemic was used for this study. The survey was distributed to over 300 participants per country in each of the following Arab countries: (Palestine, Lebanon, Jordan, Kuwait, Oman, Qatar, Saudi Arabia, United Arab Emirates, and Bahrain. The survey data was collected by the research team between April 17th and June 25th, 2020.

Design

Data Source

The dataset was extracted from the international “Corona Cooking Survey April 2020”(35). The survey was designed as a multi-language survey including the Arabic language to facilitate data collection among different countries. The data collection instrument assessed the effect of the COVID-19 lockdown on adults’ health and nutrition. The survey included several types of information, such as COVID-19 Lockdown Measures, COVID-19 Feelings (Kessler K6 scale), Food Literacy Scale, Shopping Experiences and Behavior, Cooking Behavior and Attitudes, Seeking Recipes & Food Content, Eating Behaviors (Food Frequency Questionnaire), Food Advice Sources, E-drinking and E-dining, anthropometric measurements, and lifestyle and eating habits. The survey was distributed through different social media platforms and through the partner universities’ networks. The final sample included over 300 participants per country. Countries that acquired a lesser number of samples were excluded from the analysis. The data relating to food insecurity in Arab countries were extracted from the International Survey. Overall, 13,443 participants aged over 18 years were included in the ML model development.

Study Features

The features were extracted from the main study variables, accessible via https://osf.io/nz9xf/files/ (35). The study features contain respondents’ data from before and after the COVID-19 lockdown. The machine learning models used food insecurity as the main target variable for assessing the performance of machine learning in predicting and identifying associated risk factors. Food insecurity was defined in reference to the Food Consumption Score (FCS), which is an indicator used to assess dietary diversity before and after the pandemic. The FCS was categorized into two groups: Low FCS if the FCS < 42 (Unacceptable), and high FCS score if the FCS scores >= 42 (Acceptable). Thus, food insecurity was determined based on the FCS classification. The low FCS group was considered a food insecure group, while the high FCS group was considered a normal group. Detailed information about the study variables’ definition and calculation can be found in Hoteit et al.(36).

Data Preprocessing

Data underwent a preprocessing procedure prior to building the ML models. The preprocessing phase included data cleaning, formatting, missing data treatment, and data categorization. The data cleaning process included the null value data, the text-to-numeric conversion, and the missing data treatment.

In the data set, imbalanced data was encountered as 1529 participants were categorized as food insecure, and 11914 participants were categorized as food secure. An imbalanced data set might bias the ML model’s estimation by providing more weight to the dominant class(26). The simple and effective Min- Max Normalization technique was used to scale features to a common range to ensure that all the features were on the same scale, and to allow the model to make more accurate predictions. The minimum and maximum values for each feature were first calculated, followed by subtracting the minimum value from all feature values and dividing the resulting tables by the range (i.e., the difference between the maximum and minimum values) to ensure that all the features were scaled between 0 and 1.

Machine Learning Model

Seven machine learning models were used in this study to assess the performance of ML in predicting food insecurity among Arab countries following the COVID-19 pandemic. Logistic Regression (LR), Gradient Boosting (GB), Support Vector Machine (SVM), Random Forest (RF), Artificial Neural Network (ANN), Naïve Bayes (NB), and k-nearest neighbors’ algorithm (k-NN) were built and evaluated considering their performance measures. The models’ features were categorized into two groups: 1) the dependent variable, represented by the food consumption score, and 2) the independent variables, including all associated features. The features are further represented in Table 1.

Table 1. The list of variables used in the machine learning model
CodeNameDescriptionValue
1GenderGender1 = Male, 2 = Female
2FCS_categoryFCS Status1 = Not Acceptable, 2 = Acceptable
3FCS_variationVariation of FCS during lockdown1 = Decreased, 2 = The same, 3 = Increased
4Region_of_livingLiving region1 = MENA, 2 = GULF
5Country _LivingArabic CountriesBahrain, Egypt, Jordan, Palestine, Lebanon, Saudi Arabia, Emirate, Oman, Kuwait
6Family_IncomeFamily Income Level1 = Low, 2 = Average, 3 = High
7EducationEducation Level>= Secondary, >Secondary
8WatchTVNumber of Hours watching TV1 =< One Hour, 2 = 1-2 Hours, 3 = 3-4 Hours, 5 = 5hours +
9Computer useNumber of Hours using computer1 =< One Hour, 2 = 1-2 Hours, 3 = 3-4 Hours, 5 = 5hours +
10EmploymentEmployment status1 = Employed, 2 = Unemployed
11BMIBody Mass Index (BMI)1 = Normal, 2 = Overweight, 3 = Obese
12AgeGroupAge in years1 = 18-23, 2 = 24-29, 3 = 30-39. 4 = 40+
13Cooking_MoneyDon’t have Money for Cooking1 = Yes, 2 = No
14Access_FoodDon’t have access to food for Cooking1 = Yes, 2 = No
15Cooking_FacilitiesDon’t have access to cooking tools for cooking1 = Yes, 2 = No
16Famiy_SizeFamily Size1=<= 5 persons, 2 => 5 Person
17Financial_ProblemsFinancial difficulties until the end of the month> 5 per
18Financial_ShoppingFinancial difficulties in Shopping1 = Yes, 2 = No
19PHAPhysical Activity during lockdown1 = Low Active, 2 = Moderate, 3 = Highly Active
20SmokingSmoking Before COVID-191 = Yes, 2 = No
21Mother_SHOP_3Mother usually did food shopping1 = Yes, 2 = No
22Father_SHOP_4Father usually did food shopping1 = Yes, 2 = No
23HopelessI feel hopeless1 = Yes, 2 = No
24restlessI feel restless or fidgety1 = Yes, 2 = No
25require_effortsI feel that everything requires effort1 = Yes, 2 = No
26worthlessI feel worthless1 = Yes, 2 = No
27nervousI feel nervous1 = Yes, 2 = No
28depressedI feel so depressed that nothing could cheer me up1 = Yes, 2 = No
29Moretime_activityI feel I have more time than usual in doing activities1 = Yes, 2 = No
30struggle_financiallyI feel I struggle financially1 = Yes, 2 = No
31connectedI feel more connected than usual1 = Yes, 2 = No
32lockdown_durationLockdown duration (Weeks)1 =< 12 weeks, 2 >= 12 weeks

The ML models were built based on the data ratio of 70:20:10 for training, testing, and validation. The grid search method and cross-validation with 10-folds were used for parameters’ optimizations. The following parameters were set for the ANN, RF, and SVM:

  • In Artificial Neural Networks, the hidden layer had 1000 neurons, with a 600-maximum number of iterations in reference to the logistic activation function.
  • The Random Forest trees were set to 1000 with 5 maximum depth trees, and the leaf node minimum number was set to 1, while the maximum number of samples to split the internal nodes was set to 2.
  • The SVM regularization parameter was set to 10, the RFB kernel was set to 0.001, and the bias error control factor was set to 1.

Based on the parameters optimization results, the optimized algorithms (ANN, SVM, RF) were used in identifying and predicting food insecurity.

Data Analysis

Data cleaning, transformation, and normalization processes were conducted prior to building the ML data analysis. The final dataset consisted of 13,446 participants. The seven ML models were built and performed using the python orange data mining software(37), which was then used for testing and validating the machine learning models.

Different performance measures were used to evaluate whether the ML models are able to predict food insecurity levels and the associated risk factors, such as accuracy, specificity, precision, recall, and F-measure. The calculating equations for performance measures are as follows: Specificity =TNFP+TN(1) Precision=TPTP+FP(2) Recall=TPTP+FN(3) FMeasure=2*precision*recallprecision+recall(4) Accuracy=TP+TN(TP+TN+FP+FN)(5)

Results

Statistical Analysis

The food consumption score (FCS) was used to identify the food insecure (FI) and food secure (FS) participants. The borderline and not acceptable FCS group were considered food insecure, while those with acceptable FCS were considered food secure. Overall, 9.3% out of 13,443 participants reported low and borderline FI rates among Arab countries. Results in Fig. 1 show the distribution of FI and FS by country. The findings indicate that the Jordanian, Palestinian, Lebanese, and Saudi Arabian respondents reported the highest FI rates in the region (15.4%,13.7%,13.7% and 11.3% respectively). On the other hand, Oman and Bahrain reported the lowest FI rates (5.4% and 5.5% respectively).

Figure 1
Open in new tabFigure 1: Food Consumption Score distribution by country

The ML models used a balanced data set of 4,259 participants extracted from the general data set of 13,443 participants. The connectivity between the associated features and food insecurity was described by the correlation matrix as illustrated in Table 2. The correlation matrix compares the target variable (FCS) with the study features and identifies which features are most correlated with the outcome variable. The results indicated that most of the study features are significantly correlated with the outcome feature except for five features: “Family Size”, “Smoking”, “I feel hopeless”, “I feel that everything requires effort”, “I feel worthless”, “I feel so depressed”, “Food shopping by mother”, “Food shopping by father”, and the “Lockdown period”. The highest correlations were found between the region and Body Mass Index (BMI) (correlation values = 0.202 and 0.143 respectively).

Table 2. Correlation between study features and food consumption score
FeaturesCorrelation-ValueFeaturesCorrelation-Value
Gender- .090**Use Computer.149**
BMI.143**Education Level- .057**
Region.202**I feel hopeless0.017
Don’t have money for cooking.090**I feel restless or fidgety.046**
Don’t have access to food for cooking facilities.103**I feel that everything requires effort0.023
Don’t have access to cooking tools for cooking.110**I feel worthless-0.017
Family Size-0.029I feel nervous.037*
Facing Financial difficulties to the end of the month.035*I feel so depressed0.012
Financial difficulties in Shopping0.014*I feel I have more time than usual.045**
Physical Activity during lockdown.075**I feel I struggle financially0.04*
Smoking During COVID-190.004I feel more connected than usual.063**
Food shopping by mother0.003Lockdown duration (Weeks)0.015
Food shopping by father0.005Age (Year)- .375**
Watch TV.071**Country of living during COVID 19- .139**

* P<0·05

** P<0·01

Machine Learning Models Performance Analysis

Table 3 depicts the performance measures in predicting participants’ FI, their accuracy, sensitivity, specificity, F1-score, and Receiver Operating Characteristic (ROC) classification curve. The accuracy score presents the crossover over the accuracy of the model, sensitivity measures the segment of FI participants correctly predicted, while specificity shows the identified segment of FS participants. According to the models’ results, the overall accuracy in predicting food insecurity ranged from 70–82%. Sensitivity ranged from 72–81.6%, while the f1-score ranged from 70.3–80.5%. Comparing the different ML algorithms, results indicated that GB and RF reported the highest accuracy rates (81.6% and 81.4% respectively), while the KNN algorithm had the lowest accuracy rate.

Table 3. FI ML prediction performance measures
#ModelAUCCAF1PrecisionRecall
1Gradient Boosting8582838382
2Random Forest8482808382
3Logistic Regression8480828181
4Support Vector Machine8380838181
5Neural Network8480818181
6Naive Bayes8076777777
7K-nearest neighbor7270717272

Figures’ Legends

The results in Table 3 illustrate that the algorithms’ performance is determined according to the performance measure. Thus, GB had the highest accuracy, F1-score, sensitivity, and AUC scores. However, the other algorithms reported acceptable performance measures and can be used in predicting food insecurity, except KNN and NB, which had the lowest performance levels and therefore are not recommended. The models’ performance was further evaluated using the Area Under the Curve-Receiver Operating Characteristics curve (AUC- ROC). The food consumption scores were classified into two categories: FI and FS. The ROC was obtained for the FI as indicated in Fig. 2. Results in Fig. 2 illustrate that the ROC characteristics are in the upper left side of the curve for the gradient boosting and logistic regression models, thus the two models reported the highest accuracy rates (AUC of 82%, and 80% respectively).

Figure 2
Open in new tabFigure 2: Gradient Boosting and Logistic Regression ROC curve for below average cognitive scores (TP rate of sensitivity against FP rate of specificity)

The SHapley Additive exPlanations (SHAP) values analysis was used to identify the importance of study features in predicting the outcome variable and associated features. The SHAP value analysis was conducted on the GB model as it presented the best FI prediction accuracy levels. Figure 3 illustrates the correlation plot of FI participants with model predictors. The y-axis shows the FI predictors, while the x-axis shows the SHAP value. Based on the results in Fig. 3, the most important features that positively affect FI show to be (1) country of residence, (2) age, (3) financial difficulties in food shopping, (4) depression, (5) having financial problems, (6) don’t have access to food. The place of residence (living country) had the highest positive significant impact on the outcomes of FI participants. On the other hand, BMI, physical activity, smoking, food shopping by the father, and food shopping by the mother had a negative impact on the outcome of FI participants. Notably, age, depression, and feeling nervous were found to be relevant factors that play a significant role in predicting respondents’ food insecurity.

Figure 3
Open in new tabFigure 3: Correlation of Food Insecurity with the study features

Discussion

Food insecurity and food inequity have been increasingly relevant during and following the COVID-19 pandemic (11, 13, 27). Millions of households around the world relied on food support programs during COVID-19(38). The lockdown-imposed work and movement limitations that significantly increased the risk of food insecurity at the global level(38)(14, 39, 40). In this paper, we used machine learning models to predict food insecurity using a dataset containing relevant data from 10 Arab countries during the COVID-19 pandemic. The models utilized the food consumption score (FCS) as a key variable in identifying food insecure (FI) and food secure (FS) participants as it has been used in several studies as an indicator for the prediction and identification of household FI (26, 41, 42).

This study has found that some of the features used are associated to food insecurity. Nonetheless, limited studies have used ML algorithms in identifying and predicting food insecurity, particularly in the Arab region and during the COVID-19 pandemic. The results showed significant FI levels in Arab countries, mainly in Jordan, Palestine, and Lebanon. Similarly, significantly high FI levels were evidenced in Saudi Arabia and Kuwait in the Gulf states. Furthermore, the study showed a significant relationship between the country of residence, age, and participants’ mental health status with FI levels. These findings are consistent with other similar studies that indicated a significant association between household FI and age, place of residence, and other sociodemographic variables (12, 4347). Interestingly, we found that BMI had a strong correlation and association with FI. The negative association between BMI and FCS indicated that food-insecure people had higher BMI scores. This finding was consistent with other similar studies(6, 48, 49).

The implementation of the ML models demonstrates the power of ML in predicting FI from the associated factors. The performance analysis included the accuracy, recall, precision, F1-score, and ROC measures. The results showed that GB and RF are among the highest-performing ML models in predicting FI. However, the other employed models showed an acceptable performance rate, while KNN reported the lowest accuracy rate of all. Our results were found to be consistent with other studies that used ML in predicting FI and confirmed the feasibility of ML use in identifying FI (41,5052). The models in this study reported accuracy rates ranging from 70–82%, while other models reviewed from previous research reported accuracy rates ranging from 55–85%. Other studies found that RF and SVM had high performance rates in predicting FI (34). Furthermore, the ML models identified a significant association between mental health factors and FI outcomes, including depression, stress, hopelessness, and negative feelings. The findings were consistent with other studies that showed a significant association between FI outcomes and depression, anxiety, despair, and hopelessness (38, 53).

In this study, the importance of each feature on FI prediction was examined using the SHAP values analysis(51). The country of residence, age, financial difficulties in shopping, depression, inaccessibility to food, and financial problems were found to be the most important features affecting the model outcomes. Our findings were consistent with other studies indicating that food prices, poor access to food and market, poverty, living place, and lack of education were associated with food insecurity. Notably, depression, nervous feelings and age were among the most important factors associated with FI outcomes(41, 51, 52).

The pandemic encouraged the consideration of food access equity at the global level, mainly with regards to ensuring the availability of basic food, housing, health, and education. Our approach introduced an important contribution to improving the currently available methods of predicting and early warning of food insecurity as it has proven effective in identifying and predicting food insecure people from food consumption levels. The development of this model was based on international data from Arab countries to ensure its replicability at the global level, among marginalized and conflict-prone areas.

Strengths and Limitations

The pandemic calls for a shift in thinking and consideration of access to food and ensuring food security among low- and middle-income countries. This model was designed to respond to the need of early detection of food insecurity to ensure rapid and accessible humanitarian response. The model provides a precise and accurate FI identification and prediction tool that is less dependent on traditional assessment and analysis. Thus, the model improves automatic and early detection of food insecurity, which in turn can enhance rapid intervention and policy-making programs for combatting food insecurity. Thus, this research study not only introduces ML techniques in predicting FI during and after pandemic conditions, but also enhances data driven decision making and early intervention.

Nonetheless, the study was limited by the number of features used in predicting FI. In some countries, the model data set presented a lower number of participants, which limited the ability of building ML models per country. The research findings encourage future research by building more complex ML models that might improve prediction and classifications accuracy.

Conclusions

The study assessed the performance measures of seven ML algorithms in predicting the risk factors associated with food insecurity during the COVID-19 pandemic. The research found that Gradient Boosting and Random Forest are among the highest-performing Machine Learning algorithms for the accurate prediction of food insecurity. The study developed an ML model that can enhance the early detection of FI, and that can be replicated in other regions. The study contributes to the literature on food insecurity based on FCS by utilizing the ML methods in identifying the key characteristics of food insecurity in the Arab region, to determine the relationship between the food consumption score and the associated factors, and to support policy makers with advanced food insecurity identification and prediction tools. Furthermore, the use of ML models is a valuable tool in improving food insecurity prediction and detection over time, with enhanced granularity, with the ability to share data, to incorporate wide range of variables, and to make use of automation for effective prevention and intervention programs at the regional and individual levels.

Acknowledgements

The authors wish to thank the research participants for sharing their invaluable information. Furthermore, this paper and the research behind it would not have been possible without the support of the Corona Cooking Survey Regional Study Group and all its members, as listed below:

Belgium: Charlotte De Backer, Lauranna Teunissen, Kathleen Van Royen, Isabelle Cuykx, Paulien Decorte, Gaёlle Ouvrein, Karolien Poels, Heidi Vandebosch, Katrien Maldoy (University of Antwerp); Sara Pabian (Tilburg University), Christophe Matthys, Tim Smits, Jules Vrinten (KULeuven); Ann DeSmet (University of Antwerp, Université Libre de Bruxelles); Nelleke Teughels, Maggie Geuens, Iris Vermeir, Viktor Proesmans, Liselot Hudders (Ghent University);

Bahrain: Mariam Al-Mannai, Tariq Alalwan (University of Bahrain);

Lebanon: Elissa Naim (Lebanese University), Rania Mansour (Lebanese

University), Nour Yazbeck (Lebanese University);

Palestine: Hazem Agha, Rania Abu Seir (Al Quds University);

Saudi Arabia: Jamila Arrish, Ghadir Fallata, Omar Alhumaidan, Shihana Alakeel, Norah AlBuayjan, Sarah Alkhunein, Budur Binobaydan, and Aeshah Alshaya (National Nutrition Committee (NNC) at Saudi Food and Drug Authority (Saudi FDA);

United Arab Emirates: Ayesha Aldhaheri (United Arab Emirates University).

Funding

This project did not receive any external funds.

Author Information

ude.sduqla.ffats@nawdar

Authors’ contributions: CB, LT, IC, PD, SP and KR: conceptualization, software, validation, supervision, project administration, and funding acquisition. CB, LT, IC, PD, SP, KR, MH, HM, AJ-J, RM, MA, KB, RT, HA, LC, RQ, RA, IK, SD, SA, MA-M, HB, MW: methodology. RQ, DA, and SV: Design the ML model, formal analysis, data curation and writing—original draft preparation. RQ, RT, HA, and MH: investigation. All authors and the research group contributed to the article and approved the submitted version.

Competing interests: The authors declare that they have no competing interests

Availability of data and materials

The datasets generated and/or analysed during the current study are available in the Corona Survey repository, https://osf.io/nz9xf/files/.

Notes

Ethics approval and consent to participate: This study was conducted according to the guidelines laid down in the Declaration of Helsinki and all procedures involving research study participants were approved by the University of Antwerp (SHW_46). Written consent was obtained from all subjects/patients at the start of the study.

Consent for publication: Not applicable

Abbreviations

FI
Food insecurity
SOFI
State of Food Insecurity and Nutrition in the World
COVID-19
Coronavirus disease 2019
ESCWA
Social Commission for Western Asia
WFP
World Food Program
GCC
Gulf Cooperation Council
FCS
Food Consumption Score
ML
Machine Learning
KNN
k-Nearest Neighbor
RF
Random Forest
SVM
Support Vector Machine
LR
Logistic Regression
NB
Naïve Bayes
ANN
Artificial Neural Network
FS
Food Secure
BMI
Body Mass Index
ROC
Receiver Operating Characteristics
AUC
Area under the curve

References

History

  • Posted March 27, 2023.

Citations & impact 


This article has not been cited yet.

Impact metrics

Alternative metrics

Altmetric item for https://www.altmetric.com/details/144490803
Altmetric
Discover the attention surrounding your research
https://www.altmetric.com/details/144490803