Nothing Special   »   [go: up one dir, main page]

You seem to have javascript disabled. Please note that many of the page functionalities won't work as expected without javascript enabled.
 
 
Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (680)

Search Parameters:
Keywords = robust variable selection

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
27 pages, 762 KiB  
Article
Innovative Approach to Detecting Autism Spectrum Disorder Using Explainable Features and Smart Web Application
by Mohammad Abu Tareq Rony, Fatama Tuz Johora, Nisrean Thalji, Ali Raza, Norma Latif Fitriyani, Muhammad Syafrudin and Seung Won Lee
Mathematics 2024, 12(22), 3515; https://doi.org/10.3390/math12223515 - 11 Nov 2024
Viewed by 59
Abstract
Autism Spectrum Disorder (ASD) is a complex developmental condition marked by challenges in social interaction, communication, and behavior, often involving restricted interests and repetitive actions. The diversity in symptoms and skill profiles across individuals creates a diagnostic landscape that requires a multifaceted approach [...] Read more.
Autism Spectrum Disorder (ASD) is a complex developmental condition marked by challenges in social interaction, communication, and behavior, often involving restricted interests and repetitive actions. The diversity in symptoms and skill profiles across individuals creates a diagnostic landscape that requires a multifaceted approach for accurate understanding and intervention. This study employed advanced machine-learning techniques to enhance the accuracy and reliability of ASD diagnosis. We used a standard dataset comprising 1054 patient samples and 20 variables. The research methodology involved rigorous preprocessing, including selecting key variables through data mining (DM) visualization techniques including Chi-Square tests, analysis of variance, and correlation analysis, along with outlier removal to ensure robust model performance. The proposed DM and logistic regression (LR) with Shapley Additive exPlanations (DMLRS) model achieved the highest accuracy at 99%, outperforming state-of-the-art methods. eXplainable artificial intelligence was incorporated using Shapley Additive exPlanations to enhance interpretability. The model was compared with other approaches, including XGBoost, Deep Models with Residual Connections and Ensemble (DMRCE), and fast lightweight automated machine learning systems. Each method was fine-tuned, and performance was verified using k-fold cross-validation. In addition, a real-time web application was developed that integrates the DMLRS model with the Django framework for ASD diagnosis. This app represents a significant advancement in medical informatics, offering a practical, user-friendly, and innovative solution for early detection and diagnosis. Full article
(This article belongs to the Section Fuzzy Sets, Systems and Decision Making)
21 pages, 623 KiB  
Article
Attribute Relevance Score: A Novel Measure for Identifying Attribute Importance
by Pablo Neirz, Hector Allende and Carolina Saavedra
Algorithms 2024, 17(11), 518; https://doi.org/10.3390/a17110518 - 9 Nov 2024
Viewed by 244
Abstract
This study introduces a novel measure for evaluating attribute relevance, specifically designed to accurately identify attributes that are intrinsically related to a phenomenon, while being sensitive to the asymmetry of those relationships and noise conditions. Traditional variable selection techniques, such as filter and [...] Read more.
This study introduces a novel measure for evaluating attribute relevance, specifically designed to accurately identify attributes that are intrinsically related to a phenomenon, while being sensitive to the asymmetry of those relationships and noise conditions. Traditional variable selection techniques, such as filter and wrapper methods, often fall short in capturing these complexities. Our methodology, grounded in decision trees but extendable to other machine learning models, was rigorously evaluated across various data scenarios. The results demonstrate that our measure effectively distinguishes relevant from irrelevant attributes and highlights how relevance is influenced by noise, providing a more nuanced understanding compared to established methods such as Pearson, Spearman, Kendall, MIC, MAS, MEV, GMIC, and Phik. This research underscores the importance of phenomenon-centric explainability, reproducibility, and robust attribute relevance evaluation in the development of predictive models. By enhancing both the interpretability and contextual accuracy of models, our approach not only supports more informed decision making but also contributes to a deeper understanding of the underlying mechanisms in diverse application domains, such as biomedical research, financial modeling, astronomy, and others. Full article
(This article belongs to the Special Issue Algorithms for Feature Selection (2nd Edition))
Show Figures

Figure 1

Figure 1
<p>Scatter plots illustrating various bivariate relationships generated. Each subplot represents a different synthetic dataset: (<b>A</b>–<b>G</b>) are datasets generated from multivariate normal distributions with varying correlations; (<b>H</b>–<b>N</b>) are datasets generated from rotated normal distributions, illustrating different linear relationships; (<b>O</b>–<b>U</b>) represent other complex, nonlinear patterns.</p>
Full article ">Figure 2
<p>Performance of ARS with different base models on Equation (<a href="#FD15-algorithms-17-00518" class="html-disp-formula">15</a>) without the noise term <math display="inline"><semantics> <mi>η</mi> </semantics></math> on the benchmark dataset. The informative variables (<math display="inline"><semantics> <msub> <mi mathvariant="bold">x</mi> <mn>1</mn> </msub> </semantics></math>, <math display="inline"><semantics> <msub> <mi mathvariant="bold">x</mi> <mn>2</mn> </msub> </semantics></math>, and <math display="inline"><semantics> <msub> <mi mathvariant="bold">x</mi> <mn>3</mn> </msub> </semantics></math>) and noninformative variables (<math display="inline"><semantics> <msub> <mi mathvariant="bold">x</mi> <mn>4</mn> </msub> </semantics></math>, <math display="inline"><semantics> <msub> <mi mathvariant="bold">x</mi> <mn>5</mn> </msub> </semantics></math>, and <math display="inline"><semantics> <msub> <mi mathvariant="bold">x</mi> <mn>6</mn> </msub> </semantics></math>) are evaluated across three base models: decision trees, linear regression, and k-nearest neighbors.</p>
Full article ">Figure 3
<p>Performance of various dependency measures on informative variables <math display="inline"><semantics> <msubsup> <mi mathvariant="bold">v</mi> <mn>1</mn> <mrow> <mo>(</mo> <mi>j</mi> <mo>)</mo> </mrow> </msubsup> </semantics></math>, <math display="inline"><semantics> <msubsup> <mi mathvariant="bold">v</mi> <mn>2</mn> <mrow> <mo>(</mo> <mi>j</mi> <mo>)</mo> </mrow> </msubsup> </semantics></math>, and <math display="inline"><semantics> <msubsup> <mi mathvariant="bold">v</mi> <mn>3</mn> <mrow> <mo>(</mo> <mi>j</mi> <mo>)</mo> </mrow> </msubsup> </semantics></math> across different noise levels <math display="inline"><semantics> <mrow> <mo>(</mo> <mi>j</mi> <mo>)</mo> </mrow> </semantics></math>. This figure illustrates the mean and standard deviation of each measure, highlighting the stability of ARS compared to other metrics.</p>
Full article ">Figure 4
<p>Performance of various dependency measures on noninformative variables <math display="inline"><semantics> <msubsup> <mi mathvariant="bold">v</mi> <mn>4</mn> <mrow> <mo>(</mo> <mi>j</mi> <mo>)</mo> </mrow> </msubsup> </semantics></math>, <math display="inline"><semantics> <msubsup> <mi mathvariant="bold">v</mi> <mn>5</mn> <mrow> <mo>(</mo> <mi>j</mi> <mo>)</mo> </mrow> </msubsup> </semantics></math>, and <math display="inline"><semantics> <msubsup> <mi mathvariant="bold">v</mi> <mn>6</mn> <mrow> <mo>(</mo> <mi>j</mi> <mo>)</mo> </mrow> </msubsup> </semantics></math> across different noise levels (j). This figure shows the mean and standard deviation for each measure, demonstrating the high variability of traditional metrics compared to the consistently low scores of ARS. Absolute values are considered for readability.</p>
Full article ">
16 pages, 3188 KiB  
Article
Species-Specific and Pollution-Induced Changes in Gene Expression and Metabolome of Closely Related Noccaea Species Under Natural Conditions
by Valentina Bočaj, Paula Pongrac, Sina Fischer and Matevž Likar
Plants 2024, 13(22), 3149; https://doi.org/10.3390/plants13223149 - 9 Nov 2024
Viewed by 216
Abstract
Hyperaccumulators within the Noccaea genus possess many promising genetic and metabolic adaptations that could be potentially exploited to support phytoremediation efforts and/or crop improvement and biofortification. Although hyperaccumulation is very common in this genus, individual species display specific traits as they can accumulate [...] Read more.
Hyperaccumulators within the Noccaea genus possess many promising genetic and metabolic adaptations that could be potentially exploited to support phytoremediation efforts and/or crop improvement and biofortification. Although hyperaccumulation is very common in this genus, individual species display specific traits as they can accumulate different elements (e.g., zinc, cadmium, and/or nickel). Moreover, there appears to be some populational variability with natural selection increasing the metal tolerance in metallicolous populations. Therefore, employing robust methods, such as integrated analysis of the transcriptome and metabolome, is crucial for uncovering pivotal candidate genes and pathways orchestrating the response to metal stress in Noccaea hyperaccumulators. Our study highlights several species-specific traits linked to the detoxification of metals and metal-induced oxidative stress in hyperaccumulating N. praecox when compared to a closely related model species, N. caerulescens, when grown in the field. Transcriptome analysis revealed distinct differences between the three studied natural Noccaea populations. Notably, we observed several pathways frequently connected to metal stress, i.e., glutathione metabolism, phenylpropanoid biosynthesis, and flavonoid biosynthesis, which were enriched. These differences were observed despite the relative evolutionary closeness of studied species, which emphasizes the importance of further expanding our knowledge on hyperaccumulators if we want to exploit their mechanisms for phytoremediation efforts or food quality improvements. Full article
(This article belongs to the Section Plant Physiology and Metabolism)
Show Figures

Figure 1

Figure 1
<p>Schematic representation of the study.</p>
Full article ">Figure 2
<p>Transcriptome data of <span class="html-italic">Noccaea caerulescens</span> (Nc) and <span class="html-italic">N. praecox</span> (Np) from non-polluted (Lo) and polluted (Ze) sites. (<b>a</b>) Pearson correlation coefficient analysis, (<b>b</b>) PCA analysis, (<b>c</b>) number of differentially expressed genes (DEGs), and (<b>d</b>) Venn diagram of DEGs for species (Np_Lo vs. Nc_Lo), environment (Np_Ze vs. Np_Lo), and species_x_environment (Np_Ze vs. Nc_Lo) comparisons.</p>
Full article ">Figure 3
<p>The enrichment analysis of KEGG functional pathways (<b>a</b>,<b>b</b>) and GO terms (<b>c</b>,<b>d</b>) among all DEGs (<b>a</b>,<b>c</b>) and DEGs in <span class="html-italic">N. praecox</span> grown in different environments (<b>b</b>,<b>d</b>). The inset Venn diagram depicts the differentially expressed genes used for pathway enrichment analysis, as depicted in <a href="#plants-13-03149-f002" class="html-fig">Figure 2</a>d. The colors of dots represent the adjusted <span class="html-italic">p</span> values and sizes represent the counts of transcripts in the category.</p>
Full article ">Figure 4
<p>Metabolome analysis of <span class="html-italic">Noccaea praecox</span> (Np) and <span class="html-italic">N. caerulescens</span> (Nc): (<b>a</b>) the number of different metabolites; (<b>b</b>) the number of differentially accumulated metabolites (DAMs); (<b>c</b>) the heatmap of DAMs for species (Np_Lo vs. Nc_Lo), environment (Np_Ze vs. Np_Lo), and species_x_environment (Np_Ze vs. Nc_Lo) comparisons.</p>
Full article ">Figure 5
<p>Heatmap of fold changes for differentially expressed genes and differentially accumulated metabolites (in bold) in (<b>a</b>) the glutathione metabolism pathway, (<b>b</b>) the phenylpropanoid biosynthesis pathway, and (<b>c</b>) the flavonoid biosynthesis pathway in for between-species (Np vs. Nc), environment (Ze vs. Lo), and species × environment (Np_Ze vs. Nc_Loc) comparisons. Colors represent log2 fold change: red—up-regulated; blue—down-regulated.</p>
Full article ">Figure 6
<p>Main results of the integrated view on transcriptomics and metabolomics of studied <span class="html-italic">Noccaea</span> species for between-species (Np vs. Nc), environment (Ze vs. Lo), and species × environment (Np_Ze vs. Nc_Loc) comparisons.</p>
Full article ">
15 pages, 2544 KiB  
Systematic Review
Prevalence of Mitral Valve Prolapse Among Individuals with Pectus Excavatum: A Systematic Review and Meta-Analysis
by Andrea Sonaglioni, Antonino Bruno, Alessio Polymeropoulos, Gian Luigi Nicolosi, Michele Lombardo and Paola Muti
Diagnostics 2024, 14(22), 2488; https://doi.org/10.3390/diagnostics14222488 - 7 Nov 2024
Viewed by 265
Abstract
Background: During the last decades, a small number of studies reported a wide range of variability in the estimated prevalence of mitral valve prolapse (MVP) among individuals with pectus excavatum (PE). The present systematic review and meta-analysis has been primarily designed to [...] Read more.
Background: During the last decades, a small number of studies reported a wide range of variability in the estimated prevalence of mitral valve prolapse (MVP) among individuals with pectus excavatum (PE). The present systematic review and meta-analysis has been primarily designed to summarize the main findings of these studies and to estimate the overall prevalence of MVP among PE individuals. Methods: All imaging studies assessing the prevalence of MVP in PE individuals vs. healthy controls, selected from PubMed and EMBASE databases, were included. The risk of bias was assessed by using the National Institutes of Health (NIH) Quality Assessment of Case–Control Studies. Events (presence of MVP) and nonevents (absence of MVP) in PE individuals and control groups were recorded. The main outcome was the measure of odds ratio (OR) for MVP presence pooled with 95% confidence intervals, using a fixed-effects model. Results: The full texts of eight studies with 303 PE patients (mean age 25.7 yrs) and 498 healthy controls (mean age 31 yrs) were analyzed. Three studies assessed MVP prevalence in children and early adolescents, whereas the remaining five studies examined PE adults. The prevalence of MVP in PE individuals and healthy controls was 40.6% and 12.8%, respectively. In the pooled sample, the OR for MVP presence was significantly higher in PE individuals compared to controls (OR = 5.80, 95%CI = 3.83–8.78, Z = 8.30, p < 0.001). Subgroup analysis revealed that MVP prevalence was approximately three-fold higher among PE children and early adolescents compared with PE adults. Overall, high consistency was observed in the pooled effect sizes, due to the low statistical heterogeneity among the included studies (I2 = 22.7%, p = 0.25). Egger’s test for a regression intercept gave a p-value of 0.07, indicating no publication bias. The sensitivity analysis supported the robustness of the results. Conclusions: PE individuals are nearly six times more likely to have MVP than controls. MVP prevalence is three-fold higher in PE individuals during childhood and early adolescence, compared to PE adults. Given the strong association between MVP and PE, MVP should be suspected in all individuals with anterior chest wall deformity. Full article
Show Figures

Figure 1

Figure 1
<p>Flow diagram used for identifying included studies. MVP, mitral valve prolapse; PE, pectus excavatum.</p>
Full article ">Figure 2
<p>Forest plots of ORs for MVP presence in PE individuals (categorized in children and adults) vs. healthy controls across included studies [<a href="#B23-diagnostics-14-02488" class="html-bibr">23</a>,<a href="#B24-diagnostics-14-02488" class="html-bibr">24</a>,<a href="#B25-diagnostics-14-02488" class="html-bibr">25</a>,<a href="#B26-diagnostics-14-02488" class="html-bibr">26</a>,<a href="#B27-diagnostics-14-02488" class="html-bibr">27</a>,<a href="#B28-diagnostics-14-02488" class="html-bibr">28</a>,<a href="#B29-diagnostics-14-02488" class="html-bibr">29</a>,<a href="#B30-diagnostics-14-02488" class="html-bibr">30</a>].</p>
Full article ">Figure 3
<p>Begg’s funnel plot for publication bias detection [<a href="#B23-diagnostics-14-02488" class="html-bibr">23</a>,<a href="#B24-diagnostics-14-02488" class="html-bibr">24</a>,<a href="#B25-diagnostics-14-02488" class="html-bibr">25</a>,<a href="#B26-diagnostics-14-02488" class="html-bibr">26</a>,<a href="#B27-diagnostics-14-02488" class="html-bibr">27</a>,<a href="#B28-diagnostics-14-02488" class="html-bibr">28</a>,<a href="#B29-diagnostics-14-02488" class="html-bibr">29</a>,<a href="#B30-diagnostics-14-02488" class="html-bibr">30</a>].</p>
Full article ">Figure 4
<p>An example of MHI assessment in a PE individual. Panel (<b>A</b>): L-L thoracic diameter, measured with the individual in the standing position and with open arms by using a rigid ruler in centimeters coupled to a level (the measuring device), placed at the distal third of the sternum, at the point of maximum depression of the sternum. Panel (<b>B</b>): A-P thoracic diameter, obtained with the individual in the left-lateral decubitus position during conventional transthoracic echocardiography by placing a 2.5 mHz transducer near the sternum in the left third or fourth intercostal space, to obtain a parasternal long-axis view, and measuring the distance between the true apex of the sector and the anterior surface of the vertebral body. The vertebral body is identified by using, as a reference point, the posterior wall of the descending thoracic aorta, visualized behind the left atrium. Ao, aorta; A-P, antero-posterior; Asc, ascending; Desc, descending; LA; left atrium; L-L, latero-lateral; LV, left ventricle; MHI, modified Haller index; PE, pectus excavatum; RV, right ventricle.</p>
Full article ">
19 pages, 9044 KiB  
Article
Advanced Short-Term Load Forecasting with XGBoost-RF Feature Selection and CNN-GRU
by Jingping Cui, Wei Kuang, Kai Geng, Aiying Bi, Fengjiao Bi, Xiaogang Zheng and Chuan Lin
Processes 2024, 12(11), 2466; https://doi.org/10.3390/pr12112466 - 7 Nov 2024
Viewed by 281
Abstract
Accurate and efficient short-term load forecasting (STLF) is essential for optimizing power system operations. This study proposes a novel hybrid forecasting model that integrates XGBoost-RF feature selection with a CNN-GRU neural network to enhance prediction performance while reducing model complexity. The XGBoost-RF approach [...] Read more.
Accurate and efficient short-term load forecasting (STLF) is essential for optimizing power system operations. This study proposes a novel hybrid forecasting model that integrates XGBoost-RF feature selection with a CNN-GRU neural network to enhance prediction performance while reducing model complexity. The XGBoost-RF approach is first applied to select the most predictive features from historical load data, weather conditions, and time-based variables. A convolutional neural network (CNN) is then employed to extract spatial features, while a gated recurrent unit (GRU) captures temporal dependencies for load forecasting. By leveraging a dual-channel structure that combines long- and short-term historical load trends, the proposed model significantly mitigates cumulative errors from recursive predictions. Experimental results demonstrate that the model achieves superior performance with an average root mean square error (RMSE) of 53.29 and mean absolute percentage error (MAPE) of 3.56% on the test set. Compared to traditional models, the prediction accuracy improves by 28.140% to 110.146%. Additionally, the model exhibits strong robustness across different climatic conditions. This research validates the efficacy of integrating XGBoost-RF feature selection with CNN-GRU for STLF, offering reliable decision support for power system management. Full article
(This article belongs to the Section Automation Control Systems)
Show Figures

Figure 1

Figure 1
<p>Schematic diagram of multi-step prediction.</p>
Full article ">Figure 2
<p>Strategies for multi-step ahead predictions.</p>
Full article ">Figure 3
<p>The neuronal structure of the GRU.</p>
Full article ">Figure 4
<p>Convolution process.</p>
Full article ">Figure 5
<p>Electricity load demand map for the Panama region.</p>
Full article ">Figure 6
<p>Weekly and hourly heat map of the area.</p>
Full article ">Figure 7
<p>Hybrid feature extraction method.</p>
Full article ">Figure 8
<p>Ranking of feature importance in the hybrid feature extraction method.</p>
Full article ">Figure 9
<p>Structure of the parallel CNN-GRU short-term electric load forecasting model.</p>
Full article ">Figure 10
<p>Single-channel CNN-GRU models.</p>
Full article ">Figure 11
<p>CNN-GRU model diagram.</p>
Full article ">Figure 12
<p>Model flowchart.</p>
Full article ">Figure 13
<p>Comparison of forecast results from different models.</p>
Full article ">Figure 14
<p>Electricity load forecasting charts of different models.</p>
Full article ">Figure 15
<p>Comparison results between the GRU model and NFS-GRU model.</p>
Full article ">Figure 16
<p>Load forecast curve plot of the GRU model vs. NFS-GRU model.</p>
Full article ">Figure 17
<p>Comparison of results between the GRU and SC-GRU models.</p>
Full article ">Figure 18
<p>Load forecasting curves of the GRU and SC-GRU models.</p>
Full article ">
37 pages, 4062 KiB  
Article
Heart Sound Classification Using Harmonic and Percussive Spectral Features from Phonocardiograms with a Deep ANN Approach
by Anupinder Singh, Vinay Arora and Mandeep Singh
Appl. Sci. 2024, 14(22), 10201; https://doi.org/10.3390/app142210201 - 6 Nov 2024
Viewed by 381
Abstract
Cardiovascular diseases (CVDs) are a leading cause of mortality worldwide, with a particularly high burden in India. Non-invasive methods like Phonocardiogram (PCG) analysis capture the acoustic activity of the heart. This holds significant potential for the early detection and diagnosis of heart conditions. [...] Read more.
Cardiovascular diseases (CVDs) are a leading cause of mortality worldwide, with a particularly high burden in India. Non-invasive methods like Phonocardiogram (PCG) analysis capture the acoustic activity of the heart. This holds significant potential for the early detection and diagnosis of heart conditions. However, the complexity and variability of PCG signals pose considerable challenges for accurate classification. Traditional methods of PCG signal analysis, including time-domain, frequency-domain, and time-frequency domain techniques, often fall short in capturing the intricate details necessary for reliable diagnosis. This study introduces an innovative approach that leverages harmonic–percussive source separation (HPSS) to extract distinct harmonic and percussive spectral features from PCG signals. These features are then utilized to train a deep feed-forward artificial neural network (ANN), classifying heart conditions as normal or abnormal. The methodology involves advanced digital signal processing techniques applied to PCG recordings from the PhysioNet 2016 dataset. The feature set comprises 164 attributes, including the Chroma STFT, Chroma CENS, Mel-frequency cepstral coefficients (MFCCs), and statistical features. These are refined using the ROC-AUC feature selection method to ensure optimal performance. The deep feed-forward ANN model was rigorously trained and validated on a balanced dataset. Techniques such as noise reduction and outlier detection were used to improve model training. The proposed model achieved a validation accuracy of 93.40% with sensitivity and specificity rates of 82.40% and 80.60%, respectively. These results underscore the effectiveness of harmonic-based features and the robustness of the ANN in heart sound classification. This research highlights the potential for deploying such models in non-invasive cardiac diagnostics, particularly in resource-constrained settings. It also lays the groundwork for future advancements in cardiac signal analysis. Full article
(This article belongs to the Special Issue Machine Learning in Biomedical Applications)
Show Figures

Figure 1

Figure 1
<p>Types of heart signals [<a href="#B7-applsci-14-10201" class="html-bibr">7</a>].</p>
Full article ">Figure 2
<p>General approach of PCG classification.</p>
Full article ">Figure 3
<p>Research methodology for classification of PCG signals.</p>
Full article ">Figure 4
<p>Abnormal PCG (a0006.wav [<a href="#B11-applsci-14-10201" class="html-bibr">11</a>]): (<b>a</b>) original file; (<b>b</b>) harmonic and percussive components.</p>
Full article ">Figure 5
<p>Normal PCG (a0007.wav [<a href="#B11-applsci-14-10201" class="html-bibr">11</a>]): (<b>a</b>) original file; (<b>b</b>) harmonic and percussive components.</p>
Full article ">Figure 6
<p>Time–frequency diagrams of abnormal PCGs (a0006.wav [<a href="#B11-applsci-14-10201" class="html-bibr">11</a>]): (<b>a</b>) original file, (<b>b</b>) noise-filtered clipped to 600 Hz, (<b>c</b>) harmonic component of noise-filtered signal clipped to 600 Hz, (<b>d</b>) percussive component of noise-filtered signal clipped to 600 Hz, (<b>e</b>) harmonic component with margin 4 of noise-filtered signal clipped to 600 Hz, and (<b>f</b>) percussive component with margin 4 of noise-filtered signal clipped to 600 Hz.</p>
Full article ">Figure 7
<p>Time–frequencydiagrams of normal PCGs (a0007.wav [<a href="#B11-applsci-14-10201" class="html-bibr">11</a>]): (<b>a</b>) original file, (<b>b</b>) noise-filtered clipped to 600 Hz, (<b>c</b>) harmonic component of noise-filtered signal clipped to 600 Hz, (<b>d</b>) percussive component of noise-filtered signal clipped to 600 Hz, (<b>e</b>) harmonic component with margin 4 of noise-filtered signal clipped to 600 Hz, and (<b>f</b>) percussive component with margin 4 of noise-filtered signal clipped to 600 Hz.</p>
Full article ">Figure 8
<p>Waveplots of abnormal PCGs (a0006.wav [<a href="#B11-applsci-14-10201" class="html-bibr">11</a>]): (<b>a</b>) original signal; (<b>b</b>) harmonic and percussive components of original file; (<b>c</b>) with margin 4 on harmonic and percussive components of original signal.</p>
Full article ">Figure 9
<p>Waveplots of abnormal PCGs (a0006.wav [<a href="#B11-applsci-14-10201" class="html-bibr">11</a>]) with noise-filtered frequency clipped to 600Hz: (<b>a</b>) noise-filtered signal; (<b>b</b>) harmonic and percussive components; (<b>c</b>) with margin 4 on harmonic and percussive components.</p>
Full article ">Figure 10
<p>Waveplots of normal PCGs (a0007.wav [<a href="#B11-applsci-14-10201" class="html-bibr">11</a>]): (<b>a</b>) original signal; (<b>b</b>) harmonic and percussive components of original file; (<b>c</b>) with margin 4 on harmonic and percussive components of original signal.</p>
Full article ">Figure 11
<p>Waveplots of normal PCGs (a0007.wav [<a href="#B11-applsci-14-10201" class="html-bibr">11</a>]) with noise-filtered frequency clipped to 600 Hz: (<b>a</b>) noise-filtered signal; (<b>b</b>) harmonic and percussive components; (<b>c</b>) with margin 4 on harmonic and percussive components.</p>
Full article ">Figure 12
<p>Feed-forward ANN.</p>
Full article ">Figure 13
<p>Confusion matrix of harmonic features. (<b>a</b>) all features, (<b>b</b>) chroma, (<b>c</b>) Chroma CENS, (<b>d</b>) MFCC, (<b>e</b>) Statistical, (<b>f</b>) chroma and statistical, (<b>g</b>) Chroma CENS and statistical, and (<b>h</b>) MFCC and statistical.</p>
Full article ">Figure 14
<p>ROC of harmonic features. (<b>a</b>) All features, (<b>b</b>) chroma, (<b>c</b>) Chroma CENS, (<b>d</b>) MFCC, (<b>e</b>) statistical, (<b>f</b>) chroma and statistical, (<b>g</b>) Chroma CENS and statistical, and (<b>h</b>) MFCC and statistical.</p>
Full article ">Figure 15
<p>Confusion matrix of percussive features. (<b>a</b>) All percussive, (<b>b</b>) chroma, (<b>c</b>) Chroma CENS, (<b>d</b>) MFCC, (<b>e</b>) statistical, (<b>f</b>) chroma and statistical, (<b>g</b>) Chroma CENS and statistical, and (<b>h</b>) MFCC and statistical.</p>
Full article ">Figure 16
<p>ROC of percussive features. (<b>a</b>) All percussive, (<b>b</b>) chroma, (<b>c</b>) Chroma CENS, (<b>d</b>) MFCC, (<b>e</b>) statistical, (<b>f</b>) chroma and statistical, (<b>g</b>) Chroma CENS and statistical, and (<b>h</b>) MFCC and statistical.</p>
Full article ">
21 pages, 402 KiB  
Systematic Review
Enhancing IoT Security in Vehicles: A Comprehensive Review of AI-Driven Solutions for Cyber-Threat Detection
by Rafael Abreu, Emanuel Simão, Carlos Serôdio, Frederico Branco and António Valente
AI 2024, 5(4), 2279-2299; https://doi.org/10.3390/ai5040112 - 6 Nov 2024
Viewed by 902
Abstract
Background: The Internet of Things (IoT) has improved many aspects that have impacted the industry and the people’s daily lives. To begin with, the IoT allows communication to be made across a wide range of devices, from household appliances to industrial machinery. This [...] Read more.
Background: The Internet of Things (IoT) has improved many aspects that have impacted the industry and the people’s daily lives. To begin with, the IoT allows communication to be made across a wide range of devices, from household appliances to industrial machinery. This connectivity allows for a better integration of the pervasive computing, making devices “smart” and capable of interacting with each other and with the corresponding users in a sublime way. However, the widespread adoption of IoT devices has introduced some security challenges, because these devices usually run in environments that have limited resources. As IoT technology becomes more integrated into critical infrastructure and daily life, the need for stronger security measures will increase. These devices are exposed to a variety of cyber-attacks. This literature review synthesizes the current research of artificial intelligence (AI) technologies to improve IoT security. This review addresses key research questions, including: (1) What are the primary challenges and threats that IoT devices face?; (2) How can AI be used to improve IoT security?; (3) What AI techniques are currently being used for this purpose?; and (4) How does applying AI to IoT security differ from traditional methods? Methods: We included a total of 33 peer-reviewed studies published between 2020 and 2024, specifically in journal and conference papers written in English. Studies irrelevant to the use of AI for IoT security, duplicate studies, and articles without full-text access were excluded. The literature search was conducted using scientific databases, including MDPI, ScienceDirect, IEEE Xplore, and SpringerLink. Results were synthesized through a narrative synthesis approach, with the help of the Parsifal tool to organize and visualize key themes and trends. Results: We focus on the use of machine learning, deep learning, and federated learning, which are used for anomaly detection to identify and mitigate the security threats inherent to these devices. AI-driven technologies offer promising solutions for attack detection and predictive analysis, reducing the need for human intervention more significantly. This review acknowledges limitations such as the rapidly evolving nature of IoT technologies, the early-stage development or proprietary nature of many AI techniques, the variable performance of AI models in real-world applications, and potential biases in the search and selection of articles. The risk of bias in this systematic review is moderate. While the study selection and data collection processes are robust, the reliance on narrative synthesis and the limited exploration of potential biases in the selection process introduce some risk. Transparency in funding and conflict of interest reporting reduces bias in those areas. Discussion: The effectiveness of these AI-based approaches can vary depending on the performance of the model and the computational efficiency. In this article, we provide a comprehensive overview of existing AI models applied to IoT security, including machine learning (ML), deep learning (DL), and hybrid approaches. We also examine their role in enhancing the detection accuracy. Despite all the advances, challenges still remain in terms of data privacy and the scalability of AI solutions in IoT security. Conclusion: This review provides a comprehensive overview of ML applications to enhance IoT security. We also discuss and outline future directions, emphasizing the need for collaboration between interested parties and ongoing innovation to address the evolving threat landscape in IoT security. Full article
Show Figures

Figure 1

Figure 1
<p>Bar chart of selected vs. accepted articles.</p>
Full article ">Figure 2
<p>PRISMA 2020 diagram.</p>
Full article ">
25 pages, 1578 KiB  
Review
Systematic Review of Dimensions and Indicators in Sustainable and Smart Cities: Trends, Interdependencies, and Continental Variations
by Roger Alejandro Parra-Pulido, Yolanda Teresa Hernández-Peña and Carlos Alfonso Zafra-Mejía
Urban Sci. 2024, 8(4), 202; https://doi.org/10.3390/urbansci8040202 - 6 Nov 2024
Viewed by 505
Abstract
Accelerated urbanization challenges cities globally. Ensuring good quality of life and environmental protection requires innovative, data-driven solutions. Sustainable and smart cities (SSCs) offer a promising response, but effective implementation depends on selecting and using appropriate dimensions and indicators. The main objective of this [...] Read more.
Accelerated urbanization challenges cities globally. Ensuring good quality of life and environmental protection requires innovative, data-driven solutions. Sustainable and smart cities (SSCs) offer a promising response, but effective implementation depends on selecting and using appropriate dimensions and indicators. The main objective of this article is to present a systematic review within the framework of SSCs to study trends in the use of dimensions and indicators. A citation frequency index (Q) considering quartiles is used to determine the importance of the variables considered in this study. A spatiotemporal analysis of the dimensions and indicators detected is also performed. The results suggest that the multifaceted nature of SSCs necessitates a holistic approach, with environmental (Q3 = 0.536) and social (Q2 = 0.403) dimensions being principal. Despite the lower ranking (Q1 = 0.080) of the information and communication technology (ICT) dimension, its pervasive influence highlights its evolving importance. Effective ICT implementation relies on robust social and environmental foundations, illustrating their interdependence. The governance (Q1 = 0.169) dimension is also essential for the creation of transparent and equitable urban policies. Key indicators related to energy systems (Q2 = 0.379), data analysis (Q2 = 0.327), and resource management (Q2 = 0.291) are crucial for operational efficiency and sustainability. Continental differences in prioritizing SSC dimensions highlight the importance of context-specific strategies, reflecting unique challenges and opportunities faced by different continental areas. By understanding and addressing these continental differences, policymakers and urban planners can more effectively promote sustainable and resilient urban development worldwide. Full article
(This article belongs to the Special Issue Sustainable Urbanization, Regional Planning and Development)
Show Figures

Figure 1

Figure 1
<p>Temporal variation in SSC dimensions detected in Scopus database (2015–2023; <span class="html-italic">n</span> = 7558).</p>
Full article ">Figure 2
<p>A dendrogram for the dimensions of SSCs detected in the Scopus database (2015–2023; <span class="html-italic">n</span> = 7558).</p>
Full article ">Figure 3
<p>Temporal variations in the most reported SSC indicators in the Scopus database (2015–2023; <span class="html-italic">n</span> = 7558).</p>
Full article ">Figure 4
<p>A dendrogram for the SSC indicators detected in the Scopus database (2015–2023; <span class="html-italic">n</span> = 7558).</p>
Full article ">Figure 5
<p>Spatial variations in SSC dimensions detected in the Scopus database (2015–2023; <span class="html-italic">n</span> = 7558).</p>
Full article ">
21 pages, 8936 KiB  
Article
A Proposal for a New Python Library Implementing Stepwise Procedure
by Luiz Paulo Fávero, Helder Prado Santos, Patrícia Belfiore, Alexandre Duarte, Igor Pinheiro de Araújo Costa, Adilson Vilarinho Terra, Miguel Ângelo Lellis Moreira, Wilson Tarantin Junior and Marcos dos Santos
Algorithms 2024, 17(11), 502; https://doi.org/10.3390/a17110502 - 4 Nov 2024
Viewed by 302
Abstract
Carefully selecting variables in problems with large volumes of data are extremely important, as it reduces the complexity of the model, improves the interpretation of the results, and increases computational efficiency, ensuring more accurate and relevant analyses. This paper presents a comprehensive approach [...] Read more.
Carefully selecting variables in problems with large volumes of data are extremely important, as it reduces the complexity of the model, improves the interpretation of the results, and increases computational efficiency, ensuring more accurate and relevant analyses. This paper presents a comprehensive approach to selecting variables in multiple regression models using the stepwise procedure. As the main contribution of this study, we present the stepwise function implemented in Python to improve the effectiveness of statistical analyses, allowing the intuitive and efficient selection of statistically significant variables. The application of the function is exemplified in a real case study of real estate pricing, validating its effectiveness in improving the fit of regression models. In addition, we presented a methodological framework for treating joint problems in data analysis, such as heteroskedasticity, multicollinearity, and nonadherence of residues to normality. This framework offers a robust computational implementation to mitigate such issues. This study aims to advance the understanding and application of statistical methods in Python, providing valuable tools for researchers, students, and professionals from various areas. Full article
(This article belongs to the Special Issue Algorithms for Feature Selection (2nd Edition))
Show Figures

Figure 1

Figure 1
<p>Methodological workflow.</p>
Full article ">Figure 2
<p>Descriptive statistics of the quantitative variables.</p>
Full article ">Figure 3
<p>Heatmap of the Pearson correlation correlations matrix between the metric variables.</p>
Full article ">Figure 4
<p>Distribution of the metric variables.</p>
Full article ">Figure 5
<p>MLR model.</p>
Full article ">Figure 6
<p>Variance Inflation Factor and Tolerance.</p>
Full article ">Figure 7
<p>New model (‘model_step_apartments’) obtained after applying the stepwise.</p>
Full article ">Figure 8
<p>Shapiro–Francia test.</p>
Full article ">Figure 9
<p>Breusch–Pagan test.</p>
Full article ">Figure 10
<p><span class="html-italic">p</span>-values of the model variables after the Box–Cox transformation.</p>
Full article ">Figure 11
<p>Stepwise procedure applied to the ‘model_bc_apartments’.</p>
Full article ">Figure 12
<p>Comparison between the two models.</p>
Full article ">Figure 13
<p>Application of the Shapiro–Francia test.</p>
Full article ">Figure 14
<p>Breusch–Pagan test.</p>
Full article ">
20 pages, 6875 KiB  
Article
Identification of a Novel Biomarker Panel for Breast Cancer Screening
by Maria Vaida, Kamala K. Arumalla, Pavan Kumar Tatikonda, Bharadwaj Popuri, Rashid A. Bux, Paramjit S. Tappia, Guoyu Huang, Jean-François Haince and W. Randolph Ford
Int. J. Mol. Sci. 2024, 25(21), 11835; https://doi.org/10.3390/ijms252111835 - 4 Nov 2024
Viewed by 393
Abstract
Breast cancer remains a major public health concern, and early detection is crucial for improving survival rates. Metabolomics offers the potential to develop non-invasive screening and diagnostic tools based on metabolic biomarkers. However, the inherent complexity of metabolomic datasets and the high dimensionality [...] Read more.
Breast cancer remains a major public health concern, and early detection is crucial for improving survival rates. Metabolomics offers the potential to develop non-invasive screening and diagnostic tools based on metabolic biomarkers. However, the inherent complexity of metabolomic datasets and the high dimensionality of biomarkers complicates the identification of diagnostically relevant features, with multiple studies demonstrating limited consensus on the specific metabolites involved. Unlike previous studies that rely on singular feature selection techniques such as Partial Least Square (PLS) or LASSO regression, this research combines supervised and unsupervised machine learning methods with random sampling strategies, offering a more robust and interpretable approach to feature selection. This study aimed to identify a parsimonious and robust set of biomarkers for breast cancer diagnosis using metabolomics data. Plasma samples from 185 breast cancer patients and 53 controls (from the Cooperative Human Tissue Network, USA) were analyzed. This study also overcomes the common issue of dataset imbalance by using propensity score matching (PSM), which ensures reliable comparisons between cancer and control groups. We employed Univariate Naïve Bayes, L2-regularized Support Vector Classifier (SVC), Principal Component Analysis (PCA), and feature engineering techniques to refine and select the most informative features. Our best-performing feature set comprised 11 biomarkers, including 9 metabolites (SM(OH) C22:2, SM C18:0, C0, C3OH, C14:2OH, C16:2OH, LysoPC a C18:1, PC aa C36:0 and Asparagine), a metabolite ratio (Kynurenine-to-Tryptophan), and 1 demographic variable (Age), achieving an area under the ROC curve (AUC) of 98%. These results demonstrate the potential for a robust, cost-effective, and non-invasive breast cancer screening and diagnostic tool, offering significant clinical value for early detection and personalized patient management. Full article
(This article belongs to the Special Issue Molecular Biomarkers in Cancers: Advances and Challenges)
Show Figures

Graphical abstract

Graphical abstract
Full article ">Figure 1
<p>Word cloud of significant metabolites identified as significant indicators of breast cancer in the reviewed breast cancer literature [<a href="#B13-ijms-25-11835" class="html-bibr">13</a>,<a href="#B14-ijms-25-11835" class="html-bibr">14</a>,<a href="#B15-ijms-25-11835" class="html-bibr">15</a>,<a href="#B16-ijms-25-11835" class="html-bibr">16</a>,<a href="#B17-ijms-25-11835" class="html-bibr">17</a>,<a href="#B18-ijms-25-11835" class="html-bibr">18</a>,<a href="#B19-ijms-25-11835" class="html-bibr">19</a>,<a href="#B20-ijms-25-11835" class="html-bibr">20</a>,<a href="#B21-ijms-25-11835" class="html-bibr">21</a>,<a href="#B22-ijms-25-11835" class="html-bibr">22</a>,<a href="#B23-ijms-25-11835" class="html-bibr">23</a>,<a href="#B24-ijms-25-11835" class="html-bibr">24</a>,<a href="#B25-ijms-25-11835" class="html-bibr">25</a>,<a href="#B26-ijms-25-11835" class="html-bibr">26</a>]. Text size represents the prevalence of the biomarkers in the reviewed literature. The word cloud was generated using the <span class="html-italic">wordcloud</span> Python library [<a href="#B27-ijms-25-11835" class="html-bibr">27</a>].</p>
Full article ">Figure 2
<p>Receiver Operating Characteristic (ROC) curves for classification models on propensity-score-matched (PSM) data using different feature sets. The Matthew’s correlation coefficient (MCC), accuracy, and Area Under the Curve (AUC) values are reported within each panel to quantify the overall model performance. Panels <b>A</b>–<b>R</b> display AUC curves for the 18 feature sets described in <a href="#ijms-25-11835-t001" class="html-table">Table 1</a>. The best performance across all models is observed in Panel <b>B</b> using the 59 features identified by SVC feature selector, with an AUC of 99%, an MCC score of 91%, and an accuracy of 96%. The 11-feature set (Panel <b>Q</b>) scores a point lower on AUC and accuracy and 2 values lower for MCC score, while reducing the number of features 5-fold.</p>
Full article ">Figure 3
<p>Receiver Operating Characteristic (ROC) curves for classification models on age-excluded propensity-score-matched (PSM) data using different feature sets. The Receiver Operating Characteristic (ROC) curves for classification models applied to PSM feature sets after excluding the age variable. Panels <b>A</b>–<b>R</b> display AUC curves for the 18 feature sets described in <a href="#ijms-25-11835-t002" class="html-table">Table 2</a>. Panels <b>B</b>, <b>E</b>, <b>L</b>, <b>K</b>, and <b>N</b> consistently maintained an accuracy of 94% and an AUC of 99%. However, when the feature set size was reduced to fewer than 12 features, the removal of age had a significant impact on performance. Specifically, the AUC for the 11-feature set (Panel <b>Q</b>) decreased from 99% to 94%, with accuracy dropping by 3 percentage points. The MCC showed the most substantial decline, decreasing from 89% to 72%.</p>
Full article ">Figure 4
<p>This figure shows the Receiver Operating Characteristic (ROC) curves for classification models applied to the unmatched data. Panels <b>A</b>–<b>R</b> display AUC curves for the 18 feature sets described in <a href="#ijms-25-11835-t003" class="html-table">Table 3</a>. When the analysis was restricted to just 54 control cases, performance dropped across all datasets. The feature set in Panel <b>B</b> achieved the highest AUC at 96%, followed by those in Panels <b>F</b> and <b>Q</b>, each at 95%. All panels demonstrate wider confidence intervals and lower performance metrics.</p>
Full article ">Figure 5
<p>Receiver Operating Characteristic (ROC) curves for classification models applied to the unmatched data after excluding age variable. Panels <b>A</b>–<b>R</b> display AUC curves for the 18 feature sets described in <a href="#ijms-25-11835-t004" class="html-table">Table 4</a>. When the analysis was limited to only 54 control cases, performance decreased across all datasets, with notably wide confidence intervals, similar to the PSM dataset after the age variable was excluded. The highest AUC (94%) was achieved by the feature sets in Panels <b>B</b> and <b>F</b>, while the AUC in Panel <b>Q</b> was 10 percentage points lower, reflecting a significant drop in model performance, likely due to the high imbalance between the cancer and control cases.</p>
Full article ">Figure 6
<p>Violin plots of selected features between the cancer and control groups displaying the distribution of feature values for the cancer (blue) and control (orange) groups for 11 selected features. Each violin plot visualizes the kernel density estimate of the feature values, where the width represents the distribution frequency. The <span class="html-italic">p</span>-values, computed using the Wilcoxon rank-sum test, are shown below each feature. Features with <span class="html-italic">p</span>-values ≤ 0.05 show a statistically significant difference in distribution between the cancer and control groups, whereas the Kynurenine–Tryptophan ratio (<span class="html-italic">p</span> = 0.117) and C3OH (<span class="html-italic">p</span> = 0.139) display less pronounced statistical significance.</p>
Full article ">Figure 7
<p>Heatmap of selected 11 features for the cancer and control groups. The color intensity represents the standardized values (Z-scores) of each feature, with the color scale ranging from purple (low expression, −2) to yellow (high expression, +4). The bar at the bottom of the heatmap indicates the sample group assignments, with blue representing cancer and orange representing control. This visual clearly shows the variability in feature expression patterns across samples, highlighting the distinct expression profiles between the two groups.</p>
Full article ">Figure 8
<p>Panels <b>A</b>–<b>R</b> display PCA visualizations for the 18 feature sets described in <a href="#ijms-25-11835-t003" class="html-table">Table 3</a>. The 12-, 11-, and 10-feature sets (Panels <b>P</b>, <b>Q</b> and <b>R</b>) exhibit superior separation between control and cancer samples when compared to other feature sets. While other sets, such as SVC–92 and PCA ∩ NB-35 (Panels <b>B</b> and <b>N</b>) show some separation, refining the feature selection improves the overall distinction between the groups.</p>
Full article ">
9 pages, 210 KiB  
Article
Mitigating Bias Due to Race and Gender in Machine Learning Predictions of Traffic Stop Outcomes
by Kevin Saville, Derek Berger and Jacob Levman
Information 2024, 15(11), 687; https://doi.org/10.3390/info15110687 - 1 Nov 2024
Viewed by 397
Abstract
Traffic stops represent a crucial point of interaction between citizens and law enforcement, with potential implications for bias and discrimination. This study performs a rigorously validated comparative machine learning model analysis, creating artificial intelligence (AI) technologies to predict the results of traffic stops [...] Read more.
Traffic stops represent a crucial point of interaction between citizens and law enforcement, with potential implications for bias and discrimination. This study performs a rigorously validated comparative machine learning model analysis, creating artificial intelligence (AI) technologies to predict the results of traffic stops using a dataset sourced from the Montgomery County Maryland Data Centre, focusing on variables such as driver demographics, violation types, and stop outcomes. We repeated our rigorous validation of AI for the creation of models that predict outcomes with and without race and with and without gender informing the model. Feature selection employed regularly selects for gender and race as a predictor variable. We also observed correlations between model performance and both race and gender. While these findings imply the existence of discrimination based on race and gender, our large-scale analysis (>600,000 samples) demonstrates the ability to produce top performing models that are gender and race agnostic, implying the potential to create technology that can help mitigate bias in traffic stops. The findings encourage the need for unbiased data and robust algorithms to address biases in law enforcement practices and enhance public trust in AI technologies deployed in this domain. Full article
(This article belongs to the Section Artificial Intelligence)
27 pages, 2540 KiB  
Article
Forecasting Multi-Step Soil Moisture with Three-Phase Hybrid Wavelet-Least Absolute Shrinkage Selection Operator-Long Short-Term Memory Network (moDWT-Lasso-LSTM) Model
by W. J. M. Lakmini Prarthana Jayasinghe, Ravinesh C. Deo, Nawin Raj, Sujan Ghimire, Zaher Mundher Yaseen, Thong Nguyen-Huy and Afshin Ghahramani
Water 2024, 16(21), 3133; https://doi.org/10.3390/w16213133 - 1 Nov 2024
Viewed by 593
Abstract
To develop agricultural risk management strategies, the early identification of water deficits during the growing cycle is critical. This research proposes a deep learning hybrid approach for multi-step soil moisture forecasting in the Bundaberg region in Queensland, Australia, with predictions made for 1-day, [...] Read more.
To develop agricultural risk management strategies, the early identification of water deficits during the growing cycle is critical. This research proposes a deep learning hybrid approach for multi-step soil moisture forecasting in the Bundaberg region in Queensland, Australia, with predictions made for 1-day, 14-day, and 30-day, intervals. The model integrates Geospatial Interactive Online Visualization and Analysis Infrastructure (Giovanni) satellite data with ground observations. Due to the periodicity, transience, and trends in soil moisture of the top layer, time series datasets were complex. Hence, the Maximum Overlap Discrete Wavelet Transform (moDWT) method was adopted for data decomposition to identify the best correlated wavelet and scaling coefficients of the predictor variables with the target top layer moisture. The proposed 3-phase hybrid moDWT-Lasso-LSTM model used the Least Absolute Shrinkage and Selection Operator (Lasso) method for feature selection. Optimal hyperparameters were identified using the Hyperopt algorithm with deep learning LSTM method. This proposed model’s performances were compared with benchmarked machine learning (ML) models. In total, nine models were developed, including three standalone models (e.g., LSTM), three integrated feature selection models (e.g., Lasso-LSTM), and three hybrid models incorporating wavelet decomposition and feature selection (e.g., moDWT-Lasso-LSTM). Compared to alternative models, the hybrid deep moDWT-Lasso-LSTM produced the superior predictive model across statistical performance metrics. For example, at 1-day forecast, The moDWT-Lasso-LSTM model exhibits the highest accuracy with the highest R20.92469 and the lowest RMSE 0.97808, MAE 0.76623, and SMAPE 4.39700%, outperforming other models. The moDWT-Lasso-DNN model follows closely, while the Lasso-ANN and Lasso-DNN models show lower accuracy with higher RMSE and MAE values. The ANN and DNN models have the lowest performance, with higher error metrics and lower R2 values compared to the deep learning models incorporating moDWT and Lasso techniques. This research emphasizes the utility of the advanced complementary ML model, such as the developed moDWT-Lasso-LSTM 3-phase hybrid model, as a robust data-driven tool for early forecasting of soil moisture. Full article
(This article belongs to the Section Soil and Water)
Show Figures

Figure 1

Figure 1
<p>Study site geographical location and land use of the region and surrounding areas [<a href="#B41-water-16-03133" class="html-bibr">41</a>].</p>
Full article ">Figure 2
<p>Schematic view of the development of benchmark models and proposed 3-phase hybrid moDWT-Lasso-LSTM model for multi-step SM forecasting at <span class="html-italic">t</span> + 1, <span class="html-italic">t</span> + 14 and <span class="html-italic">t</span> + 30 lead times.</p>
Full article ">Figure 3
<p>Wavelet and scaling data series resulted from moDWT decomposition process given for the predictor variable: SM10-40 when decomposition level 4 and wavelet filter “haar” is used at <span class="html-italic">t</span> + 1 lead time.</p>
Full article ">Figure 4
<p>Heatmap of the Root Mean Square Error (RMSE) and Symmetric Mean Absolute Percentage Error (SMAPE) for the moDWT-Lasso-LSTM model and other benchmark models at <span class="html-italic">t</span> + 1, <span class="html-italic">t</span> + 14 and <span class="html-italic">t</span> + 30 lead time SM forecasting.</p>
Full article ">Figure 5
<p>Redar plots for the Mean Absolute Error (MAE) for the moDWT-Lasso-LSTM model and other benchmark models at <span class="html-italic">t</span> + 1, <span class="html-italic">t</span> + 14 and <span class="html-italic">t</span> + 30 lead time SM forecasting.</p>
Full article ">Figure 6
<p>Box plot of forecast errors in the testing phase generated by the moDWT-Lasso-LSTM hybrid model and other benchmark models at <span class="html-italic">t</span> + 1, <span class="html-italic">t</span> + 14 and <span class="html-italic">t</span> + 30 lead time SM forecasting.</p>
Full article ">Figure 7
<p>Stem plots of the Nash-Sutcliffe Coefficient (<math display="inline"><semantics> <mrow> <mi>N</mi> <mi>S</mi> </mrow> </semantics></math>) for the hybrid moDWT-Lasso-LSTM model and the benchmark models in testing phase at <span class="html-italic">t</span> + 1, <span class="html-italic">t</span> + 14 and <span class="html-italic">t</span> + 30 lead time SM forecasting.</p>
Full article ">Figure 8
<p>Scatter plots of moDWT-Lasso-LSTM model and other benchmark models in testing phase at t+30 lead time SM forecasting.</p>
Full article ">
15 pages, 1678 KiB  
Article
Multi-Objective Decision-Making Evaluation Method of Environmental Impact Associated with the Life Cycle of Agro-Friendly Biochar Materials
by Shunyang Wang, Jing Wei, Hua Li, Da Ding, Yaxin Zhang, Yuen Zhu, Shaopo Deng and Yongming Luo
Agronomy 2024, 14(11), 2583; https://doi.org/10.3390/agronomy14112583 - 1 Nov 2024
Viewed by 468
Abstract
The urgency of addressing farmland contamination is undeniable. However, the environmental impacts associated with soil remediation, especially during the production of remediation materials, are often overlooked. This study seeks to fill this gap by conducting a comprehensive environmental impact assessment of remediation material [...] Read more.
The urgency of addressing farmland contamination is undeniable. However, the environmental impacts associated with soil remediation, especially during the production of remediation materials, are often overlooked. This study seeks to fill this gap by conducting a comprehensive environmental impact assessment of remediation material production processes. We apply a Life Cycle Assessment (LCA) framework, enhanced by a multi-objective optimization model combining the Analytic Hierarchy Process (AHP) and Techniques for Order Preference by Similarity to an Ideal Solution (TOPSIS). This method enables the integration of multiple environmental indicators into a high-dimensional reference system, reducing subjectivity in decision-making. The study focuses on the environmental impacts of 11 types of biochar materials used in soil remediation. Among these, alkali-modified biochar loaded with nano TiO2 exhibited the highest environmental impact index. Sensitivity analysis further confirmed the robustness of the method, with impact variations ranging from 0.44 to 0.52, suggesting the model’s reliability in comparing different remediation materials. Our findings highlight the significant environmental variability between remediation materials and underscore the necessity of incorporating comprehensive environmental assessments in material selection processes. This study provides a valuable framework for optimizing the environmental sustainability of soil remediation efforts. Full article
(This article belongs to the Special Issue Soil Evolution, Management, and Sustainable Utilization)
Show Figures

Figure 1

Figure 1
<p>Diagram of the system boundary.</p>
Full article ">Figure 2
<p>Classification of data indicators.</p>
Full article ">Figure 3
<p>Evaluation results of environmental impact potential for each material.</p>
Full article ">
18 pages, 1463 KiB  
Review
Depletion Estimation, Stock–Recruitment Relationships, and Interpretation of Biomass Reference Points
by Mark N. Maunder and Kevin R. Piner
Fishes 2024, 9(11), 447; https://doi.org/10.3390/fishes9110447 - 1 Nov 2024
Viewed by 377
Abstract
Stock depletion level is an important concept in the assessment and management of exploited fish stocks because it is often used in conjunction with reference points to infer stock status. Both the depletion level and reference points can be highly dependent on the [...] Read more.
Stock depletion level is an important concept in the assessment and management of exploited fish stocks because it is often used in conjunction with reference points to infer stock status. Both the depletion level and reference points can be highly dependent on the stock–recruitment relationship. Here, we show how depletion level is estimated in stock assessment models, what data inform the depletion level, and how the stock–recruitment relationship influences the depletion level. There are a variety of data that provide information on abundance. In addition, to estimate the depletion level, unexploited absolute abundance needs to be determined. This often means extrapolating the abundance back in time to the start of the fishery, accounting for the removals and the productivity. Uncertainty in the depletion level arises because the model can account for the same removals by either estimating low productivity (e.g., low natural mortality) and high carrying capacity or high productivity and a low carrying capacity, and by estimating different relationships between productivity and depletion level, which are strongly controlled by the stock–recruitment relationship. Therefore, estimates of depletion are particularly sensitive to uncertainty in the biological processes related to natural mortality and the stock–recruitment relationship and to growth when length composition data are used. In addition, depletion-based reference points are highly dependent on the stock–recruitment relationship and need to account for recruitment variability, particularly autocorrelation, trends, and regime shifts. Future research needs to focus on estimating natural mortality, the stock–recruitment relationship, asymptotic length, shape of the selectivity curve, or management strategies that are robust to uncertainty in these parameters. Tagging studies, including close-kin mark-recapture, can address some of these issues. However, the stock–recruitment relationship will remain uncertain. Full article
(This article belongs to the Special Issue Assessment and Management of Fishery Resources)
Show Figures

Figure 1

Figure 1
<p>Illustration of the calculation of the depletion level corresponding to MSY. The upper figure shows that the biomass of a cohort changes as it ages due to a tradeoff between increases from individual growth and reductions from natural mortality. The maximum yield-per-recruit (YPR) is obtained when all the individuals are caught at the age that maximizes the biomass of the cohort, which occurs when the growth rate equals natural mortality. For illustrative purposes, natural mortality is assumed to be constant across age, while individual growth rates reduce as they age, forming a peak in population biomass growth. The lower figure shows how MSY is calculated by combing the YPR curve with the stock–recruitment relationship, which also gives the shape of the production function. For a given selectivity that catches multiple ages of fish, fish are caught at a younger age as the fishing mortality increases and the population becomes more depleted. The YPR initial increases with fishing mortality because fewer fish are left in the population and lost due to natural mortality, then decreases as more fish are caught at an age younger than the age that maximizes the biomass of the cohort. Yield is a tradeoff between increases in yield-per-recruit as the stock becomes more depleted and individuals are caught before the loss due to natural mortality is greater than the gain due to growth and losses in recruitment due to reduced spawning stock size through the stock–recruitment relationship.</p>
Full article ">Figure 2
<p>Estimates of historic abundance reconstructed with different steepness values of the stock–recruitment relationship from a given absolute abundance level in year 10 (<b>top</b>) under a given catch trajectory (<b>bottom</b>). Units in number of individuals. The simple population dynamics model is described in <a href="#app1-fishes-09-00447" class="html-app">Appendix A</a> and is fit to an absolute biomass estimate in year 10.</p>
Full article ">Figure 3
<p>Uncertainty in depletion level represented by the posterior distribution for a total catch history model (<span class="html-italic">R</span><sub>0</sub> estimated) (<b>left</b>) with known h = 0.75 and no recruitment variation (<b>top</b>), uniform prior on h = U(0.5, 1.0) and no recruitment variation (<b>middle</b>), uniform prior on h = U(0.5, 1.0) and recruitment variation sd = 0.6 (<b>bottom</b>) versus short-term model (<span class="html-italic">R</span><sub>0</sub>, initial recruitment, and initial fishing mortality estimated) (<b>right</b>) using the simple model described in <a href="#app1-fishes-09-00447" class="html-app">Appendix A</a> fit to an absolute biomass estimate in year 10. The prior on <span class="html-italic">R</span><sub>0</sub> is U(0, 200).</p>
Full article ">Figure 4
<p>Recruitment as a fraction of virgin recruitment (<b>top</b>) and yield and a fraction of maximum sustainable yield (MSY, <b>bottom</b>) for different levels of depletion (spawning biomass divided by virgin spawning biomass, S/S<sub>0</sub>) and different steepness values of the stock–recruitment relationship. The latter represents the production function.</p>
Full article ">Figure 5
<p>The age-structure of an exploited population at the start of the modeling time period in a short-term model is represented by <span class="html-italic">R<sub>init</sub></span> that scales the recruitment and compensates for the stock–recruitment relationship and any other trends or regime shifts in recruitment that occurred before the start of the modeling time period, <span class="html-italic">F<sub>init</sub></span> that represents the historic fishing, and <span class="html-italic">R<sub>dev</sub></span> that represents individual variation in recruitment (and fishing mortality or other process variation) related to that cohort [<a href="#B27-fishes-09-00447" class="html-bibr">27</a>].</p>
Full article ">Figure 6
<p>Estimates of depletion of a hypothetical application under different assumed values of <span class="html-italic">h</span> and the associated MSY-based biomass reference points when fit to an absolute abundance estimate in year 10 (<b>top</b>) and to the change in the index from year 10 to year 11 (<b>bottom</b>).</p>
Full article ">Figure 6 Cont.
<p>Estimates of depletion of a hypothetical application under different assumed values of <span class="html-italic">h</span> and the associated MSY-based biomass reference points when fit to an absolute abundance estimate in year 10 (<b>top</b>) and to the change in the index from year 10 to year 11 (<b>bottom</b>).</p>
Full article ">
14 pages, 2644 KiB  
Systematic Review
Concurrent Validity of Virtual Reality-Based Assessment of Executive Function: A Systematic Review and Meta-Analysis
by Si-An Lee, Ji-Yea Kim and Jin-Hyuck Park
J. Intell. 2024, 12(11), 108; https://doi.org/10.3390/jintelligence12110108 - 31 Oct 2024
Viewed by 436
Abstract
This meta-analysis investigated the concurrent validity between virtual reality (VR)-based assessments and traditional neuropsychological assessments of executive function, with a focus on subcomponents such as cognitive flexibility, attention, and inhibition. A total of 1605 articles were identified through searches of PubMed, Web of [...] Read more.
This meta-analysis investigated the concurrent validity between virtual reality (VR)-based assessments and traditional neuropsychological assessments of executive function, with a focus on subcomponents such as cognitive flexibility, attention, and inhibition. A total of 1605 articles were identified through searches of PubMed, Web of Science, and ScienceDirect from 2013 to 2023. After removing duplicates, 1313 articles were screened based on their titles and abstracts, with 77 articles selected for full-text eligibility review. Of these, nine articles fully met the inclusion and exclusion criteria for this study. The effect size for overall executive function was assessed, with subcomponents categorized based on the specific assessment tools used in the studies. The effect size for each subcomponent—cognitive flexibility, attention, and inhibition—was then analyzed to provide a more detailed understanding of their relationships with traditional measures. The results revealed statistically significant correlations between VR-based assessments and traditional measures across all subcomponents. Additionally, sensitivity analyses confirmed the robustness of the findings, even when low-quality studies were excluded. These results support the use of VR-based assessments as a valid alternative to traditional methods for evaluating executive function. Moreover, the study highlights the potential of VR-based assessments as a valid alternative to traditional methods, emphasizing the need to address variability in executive function subcomponents and integrate diverse cognitive and motor metrics for greater ecological validity. Full article
Show Figures

Figure 1

Figure 1
<p>Flow chart of the study selection process.</p>
Full article ">Figure 2
<p>Quality Assessment of Diagnostic Accuracy Studies 2 (QUADAS-2) assessment of included studies.</p>
Full article ">Figure 3
<p>Forest plot of overall executive function. The lines with solid square represent the effect sizes for each study. The diamond symbol represents the pooled effect size and 95% confidence interval (<a href="#B19-jintelligence-12-00108" class="html-bibr">Miskowiak et al. 2022</a>; <a href="#B29-jintelligence-12-00108" class="html-bibr">Seesjärvi et al. 2023</a>; <a href="#B28-jintelligence-12-00108" class="html-bibr">Seesjärvi et al. 2022</a>; <a href="#B2-jintelligence-12-00108" class="html-bibr">Borgnis et al. 2023</a>; <a href="#B32-jintelligence-12-00108" class="html-bibr">Voinescu et al. 2023</a>; <a href="#B35-jintelligence-12-00108" class="html-bibr">Zeng et al. 2023</a>; <a href="#B25-jintelligence-12-00108" class="html-bibr">Plotnik et al. 2017</a>; <a href="#B16-jintelligence-12-00108" class="html-bibr">Kourtesis et al. 2021</a>; <a href="#B24-jintelligence-12-00108" class="html-bibr">Parsons and Courtney 2014</a>).</p>
Full article ">Figure 4
<p>Funnel plot of overall executive function. The dots represent individual studies in a funnel plot. The diamond represents the observed effect size.</p>
Full article ">Figure 5
<p>Forest plot of executive function of sensitivity analysis. The lines with solid square represent the effect sizes for each study. The diamond symbol represents the pooled effect size and 95% confidence interval (<a href="#B19-jintelligence-12-00108" class="html-bibr">Miskowiak et al. 2022</a>; <a href="#B29-jintelligence-12-00108" class="html-bibr">Seesjärvi et al. 2023</a>; <a href="#B35-jintelligence-12-00108" class="html-bibr">Zeng et al. 2023</a>; <a href="#B25-jintelligence-12-00108" class="html-bibr">Plotnik et al. 2017</a>; <a href="#B24-jintelligence-12-00108" class="html-bibr">Parsons and Courtney 2014</a>).</p>
Full article ">Figure 6
<p>Forest plot of executive function on attention. The lines with solid square represent the effect sizes for each study. The diamond symbol represents the pooled effect size and 95% confidence interval (<a href="#B2-jintelligence-12-00108" class="html-bibr">Borgnis et al. 2023</a>; <a href="#B32-jintelligence-12-00108" class="html-bibr">Voinescu et al. 2023</a>; <a href="#B25-jintelligence-12-00108" class="html-bibr">Plotnik et al. 2017</a>; <a href="#B16-jintelligence-12-00108" class="html-bibr">Kourtesis et al. 2021</a>; <a href="#B24-jintelligence-12-00108" class="html-bibr">Parsons and Courtney 2014</a>).</p>
Full article ">Figure 7
<p>Forest plot of executive function on inhibition. The lines with solid square represent the effect sizes for each study. The diamond symbol represents the pooled effect size and 95% confidence interval (<a href="#B2-jintelligence-12-00108" class="html-bibr">Borgnis et al. 2023</a>; <a href="#B24-jintelligence-12-00108" class="html-bibr">Parsons and Courtney 2014</a>).</p>
Full article ">Figure 8
<p>Forest plot of executive function on cognitive flexibility. The lines with solid square represent the effect sizes for each study. The diamond symbol represents the pooled effect size and 95% confidence interval (<a href="#B2-jintelligence-12-00108" class="html-bibr">Borgnis et al. 2023</a>; <a href="#B32-jintelligence-12-00108" class="html-bibr">Voinescu et al. 2023</a>; <a href="#B35-jintelligence-12-00108" class="html-bibr">Zeng et al. 2023</a>; <a href="#B25-jintelligence-12-00108" class="html-bibr">Plotnik et al. 2017</a>; <a href="#B16-jintelligence-12-00108" class="html-bibr">Kourtesis et al. 2021</a>).</p>
Full article ">
Back to TopTop