Nothing Special   »   [go: up one dir, main page]

You seem to have javascript disabled. Please note that many of the page functionalities won't work as expected without javascript enabled.
 
 
Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (9,628)

Search Parameters:
Keywords = randomized algorithms

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
27 pages, 2585 KiB  
Article
Technology-Driven Financial Risk Management: Exploring the Benefits of Machine Learning for Non-Profit Organizations
by Hao Huang
Systems 2024, 12(10), 416; https://doi.org/10.3390/systems12100416 - 8 Oct 2024
Viewed by 96
Abstract
This study explores how machine learning can optimize financial risk management for non-profit organizations by evaluating various algorithms aimed at mitigating loan default risks. The findings indicate that ensemble learning models, such as random forest and LightGBM, significantly improve prediction accuracy, thereby enabling [...] Read more.
This study explores how machine learning can optimize financial risk management for non-profit organizations by evaluating various algorithms aimed at mitigating loan default risks. The findings indicate that ensemble learning models, such as random forest and LightGBM, significantly improve prediction accuracy, thereby enabling non-profits to better manage financial risk. In the context of the 2008 subprime mortgage crisis, which underscored the volatility of financial markets, this research assesses a range of risks—credit, operational, liquidity, and market risks—while exploring both traditional machine learning and advanced ensemble techniques, with a particular focus on stacking fusion to enhance model performance. Emphasizing the importance of privacy and adaptive methods, this study advocates for interdisciplinary approaches to overcome limitations such as stress testing, data analysis rule formulation, and regulatory collaboration. The research underscores machine learning’s crucial role in financial risk control and calls on regulatory authorities to reassess existing frameworks to accommodate evolving risks. Additionally, it highlights the need for accurate data type identification and the potential for machine learning to strengthen financial risk management amid uncertainty, promoting interdisciplinary efforts that address broader issues like environmental sustainability and economic development. Full article
Show Figures

Figure 1

Figure 1
<p>Technology Roadmap.</p>
Full article ">Figure 2
<p>System architecture based on machine learning.</p>
Full article ">Figure 3
<p>Multi-layer neural network structure.</p>
Full article ">Figure 4
<p>Confusion matrix under downsampling.</p>
Full article ">Figure 5
<p>Confusion matrix under oversampling.</p>
Full article ">Figure 6
<p>Relationship between credit rating and risk limit.</p>
Full article ">Figure 7
<p>The performance measurement diagram of the initial logistic regression algorithm.</p>
Full article ">Figure 8
<p>Comparison chart of performance metrics of each model.</p>
Full article ">
23 pages, 4153 KiB  
Article
Analyzing Supervised Machine Learning Models for Classifying Astronomical Objects Using Gaia DR3 Spectral Features
by Orestes Javier Pérez Cruz, Cynthia Alejandra Martínez Pinto, Silvana Guadalupe Navarro Jiménez, Luis José Corral Escobedo and Minia Manteiga Outeiro
Appl. Sci. 2024, 14(19), 9058; https://doi.org/10.3390/app14199058 (registering DOI) - 8 Oct 2024
Viewed by 248
Abstract
In this paper, we present an analysis of the effectiveness of various machine learning algorithms in classifying astronomical objects using data from the third release (DR3) of the Gaia space mission. The dataset used includes spectral information from the satellite’s red and blue [...] Read more.
In this paper, we present an analysis of the effectiveness of various machine learning algorithms in classifying astronomical objects using data from the third release (DR3) of the Gaia space mission. The dataset used includes spectral information from the satellite’s red and blue spectrophotometers. The primary goal is to achieve reliable classification with high confidence for symbiotic stars, planetary nebulae, and red giants. Symbiotic stars are binary systems formed by a high-temperature star (a white dwarf in most cases) and an evolved star (Mira type or red giant star); their spectra varies between the typical for these objects (depending on the orbital phase of the object) and present emission lines similar to those observed in PN spectra, which is the reason for this first selection. Several classification algorithms are evaluated, including Random Forest (RF), Support Vector Machine (SVM), Artificial Neural Networks (ANN), Gradient Boosting (GB), and Naive Bayes classifier. The evaluation is based on different metrics such as Precision, Recall, F1-Score, and the Kappa index. The study confirms the effectiveness of classifying the mentioned stars using only their spectral information. The models trained with Artificial Neural Networks and Random Forest demonstrated superior performance, surpassing an accuracy rate of 94.67%. Full article
(This article belongs to the Section Computing and Artificial Intelligence)
Show Figures

Figure 1

Figure 1
<p>The figures represent three different types of stars, (blue) symbiotic star [EM* AS 323], (red) red giant [2MASS J04390924+2847545], and (green) planetary nebula [Hen 2-442]. In the initial diagram, the flux values are denoted in Watts per nanometer per square meter (W/nm/m<sup>2</sup>), derived from external calibration using GaiaXpy library. In the adjacent figure, the flux values are normalized to a range between 0 and 1.</p>
Full article ">Figure 2
<p>The figure displays the original spectrum of the symbiotic star (blue) and the spectrum resulting from the addition of noise (green), following a normal distribution with a mean of 0 and a standard deviation of 0.05.</p>
Full article ">Figure 3
<p>t-SNE representation of unbalanced and balanced data in a two-dimensional space. The points represent samples from different classes. (<b>Left</b>) t-SNE projection of unbalanced data. A prominent cluster of the RG class is evident in the left side, whereas samples from the PN class are in the upper right region. The SS class exhibits weak clustering, blending with the other classes. (<b>Right</b>) t-SNE projection of balanced data in a two-dimensional space reveals improved separation and clustering of classes. The class clusters are more well-defined, although there still exist points that overlap with other classes.</p>
Full article ">Figure 4
<p>Confusion matrix resulting from the execution of the Random Forest algorithm. On the left are the results with the unbalanced dataset and on the right with balanced dataset.</p>
Full article ">Figure 5
<p>Confusion matrix resulting from the execution of the SVM algorithm. On the left are the results derived with unbalanced dataset and on the right with balanced dataset.</p>
Full article ">Figure 6
<p>Confusion matrix resulting from the execution of the Gradient Boosting algorithm. On the left are the results derived with the unbalanced dataset and on the right with the balanced dataset.</p>
Full article ">Figure 7
<p>Confusion matrix resulting from the execution of the Naive Bayes algorithm. On the left are the results derived with the unbalanced dataset and on the right with the balanced dataset.</p>
Full article ">Figure 8
<p>Confusion matrix resulting from the execution of the ANN algorithm. On the left are the results derived with the unbalanced dataset and on the right with the balanced dataset.</p>
Full article ">Figure 9
<p>The figure displays the spectra obtained from the Gaia DR3 catalog that were sourced from the article “A catalogue of symbiotic stars” [<a href="#B28-applsci-14-09058" class="html-bibr">28</a>].</p>
Full article ">Figure 10
<p>The figure displays the spectra obtained from the Gaia DR3 catalog that were sourced from the article “A machine learning approach for identification and classification of symbiotic stars using 2MASS and WISE” [<a href="#B29-applsci-14-09058" class="html-bibr">29</a>].</p>
Full article ">Figure 11
<p>The figure displays the spectra of suspected planetary nebulae stars obtained from the Gaia DR3 catalog that were sourced from Symbad database online.</p>
Full article ">
21 pages, 950 KiB  
Article
Optimizing Maintenance of Energy Supply Systems in City Logistics with Heuristics and Reinforcement Learning
by Antoni Guerrero, Angel A. Juan, Alvaro Garcia-Sanchez and Luis Pita-Romero
Mathematics 2024, 12(19), 3140; https://doi.org/10.3390/math12193140 - 7 Oct 2024
Viewed by 379
Abstract
In urban logistics, effective maintenance is crucial for maintaining the reliability and efficiency of energy supply systems, impacting both asset performance and operational stability. This paper addresses the scheduling and routing plans for maintenance of power generation assets over a multi-period horizon. We [...] Read more.
In urban logistics, effective maintenance is crucial for maintaining the reliability and efficiency of energy supply systems, impacting both asset performance and operational stability. This paper addresses the scheduling and routing plans for maintenance of power generation assets over a multi-period horizon. We model this problem as a multi-period team orienteering problem. To address this multi-period challenge, we propose a dual approach: a novel reinforcement learning (RL) framework and a biased-randomized heuristic algorithm. The RL-based method dynamically learns from real-time operational data and evolving asset conditions, adapting to changes in asset health and failure probabilities to optimize decision making. In addition, we develop and apply a biased-randomized heuristic algorithm designed to provide effective solutions within practical computational limits. Our approach is validated through a series of computational experiments comparing the RL model and the heuristic algorithm. The results demonstrate that, when properly trained, the RL-based model is able to offer equivalent or even superior performance compared to the heuristic algorithm. Full article
(This article belongs to the Special Issue Planning and Scheduling in City Logistics Optimization)
17 pages, 1656 KiB  
Article
Improving Alzheimer’s Disease Prediction with Different Machine Learning Approaches and Feature Selection Techniques
by Hala Alshamlan, Arwa Alwassel, Atheer Banafa and Layan Alsaleem
Diagnostics 2024, 14(19), 2237; https://doi.org/10.3390/diagnostics14192237 - 7 Oct 2024
Viewed by 385
Abstract
Machine learning (ML) has increasingly been utilized in healthcare to facilitate disease diagnosis and prediction. This study focuses on predicting Alzheimer’s disease (AD) through the development and comparison of ML models using Support Vector Machine (SVM), Random Forest (RF), and Logistic Regression (LR) [...] Read more.
Machine learning (ML) has increasingly been utilized in healthcare to facilitate disease diagnosis and prediction. This study focuses on predicting Alzheimer’s disease (AD) through the development and comparison of ML models using Support Vector Machine (SVM), Random Forest (RF), and Logistic Regression (LR) algorithms. Additionally, feature selection techniques including Minimum Redundancy Maximum Relevance (mRMR) and Mutual Information (MI) were employed to enhance the model performance. The research methodology involved training and testing these models on the OASIS-2 dataset, evaluating their predictive accuracies. Notably, LR combined with mRMR achieved the highest accuracy of 99.08% in predicting AD. These findings underscore the efficacy of ML algorithms in AD prediction and highlight the utility of the feature selection methods in improving the model performance. This study contributes to the ongoing efforts to leverage ML for more accurate disease prognosis and underscores the potential of these techniques in advancing clinical decision-making. Full article
Show Figures

Figure 1

Figure 1
<p>The result of first 5 rows.</p>
Full article ">Figure 2
<p>Descriptive statistics of the participant demographics within the dataset. (<b>a</b>) The distribution of group classifications. (<b>b</b>) The gender representation.</p>
Full article ">Figure 3
<p>Histograms illustrating the distribution of socioeconomic status. (<b>a</b>) SES scores among participants. (<b>b</b>) MMSE scores among participants.</p>
Full article ">Figure 4
<p>Boxplots representing the distribution and variability of numeric attributes within the dataset, facilitating the identification of potential outliers.</p>
Full article ">Figure 5
<p>Probability plot depicting the distribution of MMSE scores, highlighting the assessment of cognitive function among participants.</p>
Full article ">Figure 6
<p>Visualization of feature selection results using mRMR.</p>
Full article ">Figure 7
<p>Heat map of correlation coefficient feature selection.</p>
Full article ">Figure 8
<p>Mutual Information feature selection.</p>
Full article ">Figure 9
<p>Performance comparison of the SVM model. (<b>a</b>) Accuracy before outlier removal. (<b>b</b>) Accuracy after outlier removal.</p>
Full article ">Figure 10
<p>Performance comparison of the LR model. (<b>a</b>) Accuracy before outlier removal. (<b>b</b>) Accuracy after outlier removal.</p>
Full article ">Figure 11
<p>Performance comparison of the RF model. (<b>a</b>) Accuarcy before outlier removal. (<b>b</b>) Accuracy after outlier removal.</p>
Full article ">Figure 12
<p>Illustration of the 10-fold cross-validation process, showing the division of the dataset into 10 equal-sized folds for model evaluation.</p>
Full article ">Figure 13
<p>Illustration of the 5-fold cross-validation process, demonstrating the partitioning of the dataset into five subsets for model evaluation.</p>
Full article ">Figure 14
<p>Illustration of the Leave-One-Out (LOO) cross-validation method, depicting the evaluation process where each individual data point serves as a separate validation set.</p>
Full article ">Figure 15
<p>Comparison of SVM model performance. (<b>a</b>) Accuracy prior to outlier removal. (<b>b</b>) Accuracy following outlier removal.</p>
Full article ">Figure 16
<p>Comparison of LR model performance. (<b>a</b>) Accuracy prior to outlier removal. (<b>b</b>) Accuracy following outlier removal.</p>
Full article ">Figure 17
<p>Comparison of Random Forest model performance. (<b>a</b>) Accuracy prior to outlier removal. (<b>b</b>) Accuracy following outlier removal.</p>
Full article ">
15 pages, 4494 KiB  
Communication
Analysis of the Grid Quantization for the Microwave Radar Coincidence Imaging Based on Basic Correlation Algorithm
by Yiheng Nian, Mengran Zhao, Die Li, Ming Zhang, Anxue Zhang, Tong Li and Shitao Zhu
Remote Sens. 2024, 16(19), 3726; https://doi.org/10.3390/rs16193726 - 7 Oct 2024
Viewed by 251
Abstract
In Microwave Radar Coincidence Imaging (MRCI), the imaging region is typically discretized into a fine grid. In other words, it assumes that the equivalent scatterers of the target are precisely located at the centers of these pre-discretized grids. However, this approach usually encounters [...] Read more.
In Microwave Radar Coincidence Imaging (MRCI), the imaging region is typically discretized into a fine grid. In other words, it assumes that the equivalent scatterers of the target are precisely located at the centers of these pre-discretized grids. However, this approach usually encounters the off-grid problem, which can significantly degrade the imaging performance. In this paper, to establish a criterion for grid quantization, the performance of the MRCI system related to the grid size and the distribution of imaging points is investigated. First, the discretization of the imaging scene is regarded as a random sampling problem, and the off-grid imaging model for MRCI is established. Then, the probability distribution function (PDF) of the imaging amplitude for a single point target is analyzed, and the mean first-order imaging error (MFE) for multiple point targets is derived based on the Basic Correlation Algorithm (BCA). Finally, the relationship between the grid quantization of the imaging area and the performance of the MRCI system is analyzed, providing a theoretical guidance for enhancing the performance of MRCI. The validity of the analyses is verified through simulation experiments. Full article
Show Figures

Figure 1

Figure 1
<p>Typical scene of MRCI.</p>
Full article ">Figure 2
<p>Imaging process using BCA.</p>
Full article ">Figure 3
<p>Schematic diagram of PDF of the imaging amplitude for one point target.</p>
Full article ">Figure 4
<p>Single point target. (<b>a</b>) Target expression result. (<b>b</b>) Imaging result; the grid offset ratio is −0.25.</p>
Full article ">Figure 5
<p>Two point targets. (<b>a</b>) Target expression result. (<b>b</b>) Imaging result; the grid offset ratio is −0.25 and −0.5.</p>
Full article ">Figure 6
<p>PDF of the imaging amplitude of one grid cell for one point target.</p>
Full article ">Figure 7
<p>The DP of a single point target.</p>
Full article ">Figure 8
<p>Flowchart for the derivation of PDF and DP.</p>
Full article ">Figure 9
<p>The MFE for one point target.</p>
Full article ">Figure 10
<p>MFE for two point targets with different spacing.</p>
Full article ">Figure 11
<p>MFE for different number of point targets, <math display="inline"><semantics> <mrow> <mi>d</mi> <mo>=</mo> <mn>1.0</mn> <mi>ρ</mi> </mrow> </semantics></math>.</p>
Full article ">Figure 12
<p>The reconstructed image of simulation and theoretical under different SNRs (<b>a</b>) SNR = 10 dB (<b>b</b>) SNR = 0 dB (<b>c</b>) SNR = −10 dB (<b>d</b>) SNR = −20 dB.</p>
Full article ">Figure 13
<p>RMSE with respect to SNR.</p>
Full article ">Figure 14
<p>NCF with respect to MCC.</p>
Full article ">Figure 15
<p>MFE for a single point target (calculated and simulated results).</p>
Full article ">Figure 16
<p>MFE for two point targets with different spacing (calculated and simulated results).</p>
Full article ">Figure 17
<p>MFE for different number of point targets, <math display="inline"><semantics> <mrow> <mi>d</mi> <mo>=</mo> <mn>0.5</mn> <mi>ρ</mi> </mrow> </semantics></math> (calculated and simulated results).</p>
Full article ">Figure 18
<p>MFE for different number of point targets, <math display="inline"><semantics> <mrow> <mi>d</mi> <mo>=</mo> <mn>1.0</mn> <mi>ρ</mi> </mrow> </semantics></math> (calculated and simulated results).</p>
Full article ">Figure 19
<p>MFE for different number of point targets, <math display="inline"><semantics> <mrow> <mi>d</mi> <mo>=</mo> <mn>1.5</mn> <mi>ρ</mi> </mrow> </semantics></math> (calculated and simulated results).</p>
Full article ">
14 pages, 526 KiB  
Article
Assessment of Ensemble-Based Machine Learning Algorithms for Exoplanet Identification
by Thiago S. F. Luz, Rodrigo A. S. Braga and Enio R. Ribeiro
Electronics 2024, 13(19), 3950; https://doi.org/10.3390/electronics13193950 - 7 Oct 2024
Viewed by 306
Abstract
This paper presents a comprehensive assessment procedure for evaluating Ensemble-based Machine Learning algorithms in the context of exoplanet classification. Each of the algorithm hyperparameter values were tuned. Deployments were carried out using the cross-validation method. Performance metrics, including accuracy, sensitivity, specificity, precision, and [...] Read more.
This paper presents a comprehensive assessment procedure for evaluating Ensemble-based Machine Learning algorithms in the context of exoplanet classification. Each of the algorithm hyperparameter values were tuned. Deployments were carried out using the cross-validation method. Performance metrics, including accuracy, sensitivity, specificity, precision, and F1 score, were evaluated using confusion matrices generated from each implementation. Machine Learning (ML) algorithms were trained and used to identify exoplanet data. Most of the current research deals with traditional ML algorithms for this purpose. The Ensemble algorithm is another type of ML technique that combines the prediction performance of two or more algorithms to obtain an improved final prediction. Few studies have applied Ensemble algorithms to predict exoplanets. To the best of our knowledge, no paper that has exclusively assessed Ensemble algorithms exists, highlighting a significant gap in the literature about the potential of Ensemble methods. Five Ensemble algorithms were evaluated in this paper: Adaboost, Random Forest, Stacking, Random Subspace Method, and Extremely Randomized Trees. They achieved an average performance of more than 80% in all metrics. The results underscore the substantial benefits of fine tuning hyperparameters to enhance predictive performance. The Stacking algorithm achieved a higher performance than the other algorithms. This aspect is discussed in this paper. The results of this work show that it is worth increasing the use of Ensemble algorithms to improve exoplanet identification. Full article
(This article belongs to the Section Computer Science & Engineering)
Show Figures

Figure 1

Figure 1
<p>Ensemble algorithm concept for predicting data.</p>
Full article ">Figure 2
<p>Confusion matrix with the (<b>a</b>) initial and (<b>b</b>) improved hyperparameter values for Adaboost.</p>
Full article ">Figure 3
<p>Confusion matrix with the (<b>a</b>) initial and (<b>b</b>) improved hyperparameter values for Random Forest.</p>
Full article ">Figure 4
<p>Confusion matrix for the (<b>a</b>) first and (<b>b</b>) second implementation of Stacking.</p>
Full article ">Figure 5
<p>Accuracy results for the algorithms tested as estimators for Stacking.</p>
Full article ">Figure 6
<p>Confusion matrix with the (<b>a</b>) initial and (<b>b</b>) improved hyperparameter values for the Random Subspace Method.</p>
Full article ">Figure 7
<p>Confusion matrix with the (<b>a</b>) initial and (<b>b</b>) improved hyperparameter values for Extremely Randomized Trees.</p>
Full article ">
27 pages, 5306 KiB  
Article
Area-Time-Efficient Secure Comb Scalar Multiplication Architecture Based on Recoding
by Zhantao Zhang, Weijiang Wang, Jingqi Zhang, Xiang He, Mingzhi Ma, Shiwei Ren and Hua Dang
Micromachines 2024, 15(10), 1238; https://doi.org/10.3390/mi15101238 - 7 Oct 2024
Viewed by 261
Abstract
With the development of mobile communication, digital signatures with low latency, low area, and high security are in increasing demand. Elliptic curve cryptography (ECC) is widely used because of its security and lightweight. Elliptic curve scalar multiplication (ECSM) is the basic arithmetic in [...] Read more.
With the development of mobile communication, digital signatures with low latency, low area, and high security are in increasing demand. Elliptic curve cryptography (ECC) is widely used because of its security and lightweight. Elliptic curve scalar multiplication (ECSM) is the basic arithmetic in ECC. Based on this background information, we propose our own research objectives. In this paper, a low-latency and low-area ECSM architecture based on the comb algorithm is proposed. The detailed methodology is as follows. The recoding-k algorithm and randomization-Z algorithm are used to improve security, which can resist sample power analysis (SPA) and differential power analysis (DPA). A low-area multi-functional architecture for comb is proposed, which takes into account different stages of the comb algorithm. Based on this, the data dependency is considered and the comb architecture is optimized to achieve a uniform and efficient execution pattern. The interleaved modular multiplication algorithm and modified binary inverse algorithm are used to achieve short clock cycle delay and high frequency while taking into account the need for a low area. The proposed architecture has been implemented on Xilinx Virtex-7 series FPGA to perform ECSM on 256-bits prime field GF(p). In the hardware architecture with only 7351 slices of resource usage, a single ECSM only takes 0.74 ms, resulting in an area-time product (ATP) of 5.41. The implementation results show that our design can compete with the existing state-of-the-art engineering in terms of performance and has higher security. Our design is suitable for computing scenarios where security and computing speed are required. The implementation of the overall architecture is of great significance and inspiration to the research community. Full article
Show Figures

Figure 1

Figure 1
<p>Comparison of different <math display="inline"><semantics> <mi>ω</mi> </semantics></math> and the Co-Z algorithm and Montgomery Ladder algorithm with the increase in ECSM calculation times. In the figure, Combx_pre means the pre-calculation burden of comb-x.</p>
Full article ">Figure 2
<p>Date dependence and stream of pre-calculation in different modules.</p>
Full article ">Figure 3
<p>Hardware finite field operation scheduling diagram of the overall ALU of ECPDPA with ECPD and ECPA inside.</p>
Full article ">Figure 4
<p>Overall ECSM architecture on FPGA and top input/output.</p>
Full article ">Figure 5
<p>Circuit structure diagram of Fibonacci linear feedback shift register.</p>
Full article ">Figure 6
<p>Inner architecture of ALU and different internal modules.</p>
Full article ">Figure 7
<p>Jump flow chart of state machine in ALU in different modes.</p>
Full article ">Figure 8
<p>Scheduling of three multipliers in ALU in which the yellow background is ECPD and the green background is ECPA.</p>
Full article ">Figure 9
<p>Block diagram of interleaved modular multiplication algorithm. In this figure, the same color means the same function.</p>
Full article ">Figure 10
<p>Block diagram of modular addition/subtraction. In this figure, the same color means the same function.</p>
Full article ">Figure 11
<p>Block diagram of critical registers that need to be iterated in binary inversion algorithm. In this figure, the same color means the same function.</p>
Full article ">Figure 12
<p>The upper and lower limits of <math display="inline"><semantics> <mrow> <mi>C</mi> <msub> <mi>C</mi> <mrow> <mi>t</mi> <mi>o</mi> <mi>t</mi> <mi>a</mi> <mi>l</mi> </mrow> </msub> </mrow> </semantics></math> and the trend of change with <span class="html-italic">n</span>.</p>
Full article ">Figure 13
<p>The simulation diagram related to FPGA implementation. In this figure, the correct data can be read by the yellow line.</p>
Full article ">
14 pages, 11769 KiB  
Article
Research on Longitudinal Control of Electric Vehicle Platoons Based on Robust UKF–MPC
by Jiading Bao, Zishan Lin, Hui Jing, Huanqin Feng, Xiaoyuan Zhang and Ziqiang Luo
Sustainability 2024, 16(19), 8648; https://doi.org/10.3390/su16198648 - 6 Oct 2024
Viewed by 402
Abstract
In a V2V communication environment, the control of electric vehicle platoons faces issues such as random communication delays, packet loss, and external disturbances, which affect sustainable transportation systems. In order to solve these problems and promote the development of sustainable transportation, a longitudinal [...] Read more.
In a V2V communication environment, the control of electric vehicle platoons faces issues such as random communication delays, packet loss, and external disturbances, which affect sustainable transportation systems. In order to solve these problems and promote the development of sustainable transportation, a longitudinal control algorithm for the platoon based on robust Unscented Kalman Filter (UKF) and Model Predictive Control (MPC) is designed. First, a longitudinal kinematic model of the vehicle platoon is constructed, and discrete state–space equations are established. The robust UKF algorithm is derived by enhancing the UKF algorithm with Huber-M estimation. This enhanced algorithm is then used to estimate the state information of the leading vehicle. Based on the vehicle state information obtained from the robust UKF estimation, feedback correction and compensation are added to the MPC algorithm to design the robust UKF–MPC longitudinal controller. Finally, the effectiveness of the proposed controller is verified through CarSim/Simulink joint simulation. The simulation results show that in the presence of communication delay and data loss, the robust UKF–MPC controller outperforms the MPC and UKF–MPC controllers in terms of MSE and IAE metrics for vehicle spacing error and acceleration tracking error and exhibits stronger robustness and stability. Full article
Show Figures

Figure 1

Figure 1
<p>Schematic diagram of the longitudinal movement of the vehicle platoon.</p>
Full article ">Figure 2
<p>Framework diagram of robust UKF–MPC based longitudinal controller.</p>
Full article ">Figure 3
<p>Variation of velocity from vehicle 0 to vehicle 3 under different controllers in an environment without communication delay and data loss.</p>
Full article ">Figure 4
<p>Variation of acceleration from vehicle 0 to vehicle 3 under different controllers in an environment without communication delay and data loss.</p>
Full article ">Figure 5
<p>Variation of spacing error from vehicle 0 to vehicle 3 under different controllers in an environment without communication delay and data loss.</p>
Full article ">Figure 6
<p>Variation of velocity from vehicle 0 to vehicle 3 under different controllers with 10~100 ms random communication delay and 50% data packet loss.</p>
Full article ">Figure 7
<p>Variation of acceleration from vehicle 0 to vehicle 3 under different controllers with 10~100 ms random communication delay and 50% data packet loss.</p>
Full article ">Figure 8
<p>Variation of spacing error from vehicle 0 to vehicle 3 under different controllers with 10~100 ms random communication delay and 50% data packet loss.</p>
Full article ">
19 pages, 5113 KiB  
Article
Urban Air Logistics with Unmanned Aerial Vehicles (UAVs): Double-Chromosome Genetic Task Scheduling with Safe Route Planning
by Marco Rinaldi, Stefano Primatesta, Martin Bugaj, Ján Rostáš and Giorgio Guglieri
Smart Cities 2024, 7(5), 2842-2860; https://doi.org/10.3390/smartcities7050110 - 6 Oct 2024
Viewed by 596
Abstract
In an efficient aerial package delivery scenario carried out by multiple Unmanned Aerial Vehicles (UAVs), a task allocation problem has to be formulated and solved in order to select the most suitable assignment for each delivery task. This paper presents the development methodology [...] Read more.
In an efficient aerial package delivery scenario carried out by multiple Unmanned Aerial Vehicles (UAVs), a task allocation problem has to be formulated and solved in order to select the most suitable assignment for each delivery task. This paper presents the development methodology of an evolutionary-based optimization framework designed to tackle a specific formulation of a Drone Delivery Problem (DDP) with charging hubs. The proposed evolutionary-based optimization framework is based on a double-chromosome task encoding logic. The goal of the algorithm is to find optimal (and feasible) UAV task assignments such that (i) the tasks’ due dates are met, (ii) an energy consumption model is minimized, (iii) re-charge tasks are allocated to ensure service persistency, (iv) risk-aware flyable paths are included in the paradigm. Hard and soft constraints are defined such that the optimizer can also tackle very demanding instances of the DDP, such as tens of package delivery tasks with random temporal deadlines. Simulation results show how the algorithm’s development methodology influences the capability of the UAVs to be assigned to different tasks with different temporal constraints. Monte Carlo simulations corroborate the results for two different realistic scenarios in the city of Turin, Italy. Full article
(This article belongs to the Special Issue Smart Urban Air Mobility)
Show Figures

Figure 1

Figure 1
<p>Snapshot of a simplified example in a portion of the city of Turin (Italy), with three delivery tasks, one charge hub, and a fleet of four UAVs.</p>
Full article ">Figure 2
<p>Example of PMC operator for creation of offspring <math display="inline"><semantics> <mrow> <mi>I</mi> </mrow> </semantics></math> from <math display="inline"><semantics> <mrow> <msub> <mrow> <mi>C</mi> </mrow> <mrow> <mi>I</mi> </mrow> </msub> </mrow> </semantics></math> and <math display="inline"><semantics> <mrow> <msub> <mrow> <mi>C</mi> </mrow> <mrow> <mi>I</mi> <mi>I</mi> </mrow> </msub> </mrow> </semantics></math>.</p>
Full article ">Figure 3
<p>Example of slide mutation.</p>
Full article ">Figure 4
<p>Example of flip mutation.</p>
Full article ">Figure 5
<p>Example of swap mutation.</p>
Full article ">Figure 6
<p>(<b>a</b>) Graph-based representation of the final schedule related to the solution of Algorithm 1 with <math display="inline"><semantics> <mrow> <msub> <mrow> <mi>m</mi> </mrow> <mrow> <mi>d</mi> <mi>w</mi> </mrow> </msub> <mo>=</mo> <mn>0</mn> </mrow> </semantics></math> with a simple instance of the DDP. (<b>b</b>) Evolution of the fitness function <math display="inline"><semantics> <mrow> <mi>J</mi> </mrow> </semantics></math> at each iteration of Algorithm 1 with <math display="inline"><semantics> <mrow> <msub> <mrow> <mi>m</mi> </mrow> <mrow> <mi>d</mi> <mi>w</mi> </mrow> </msub> <mo>=</mo> <mn>0</mn> </mrow> </semantics></math>.</p>
Full article ">Figure 7
<p>Risk maps of the operational area of <a href="#smartcities-07-00110-f001" class="html-fig">Figure 1</a> computed after taking into account UAV A and UAV C, the latter both without and with a payload. The black line is the minimum risk path computed with the risk-aware path planning. The dashed black line is the minimum distance path.</p>
Full article ">
14 pages, 4952 KiB  
Article
A Three-Dimensional Modeling Approach for Carbon Nanotubes Filled Polymers Utilizing the Modified Nearest Neighbor Algorithm
by Junpu Wang, Xiaozhuang Yue, Yuxuan Wang, Liupeng Di, Wenzhi Wang, Jingchao Wei and Fei Yu
Polymers 2024, 16(19), 2824; https://doi.org/10.3390/polym16192824 - 6 Oct 2024
Viewed by 323
Abstract
Carbon nanotubes (CNTs) are extensively utilized in the fabrication of high-performance composites due to their exceptional mechanical, electrical, and thermal characteristics. To investigate the mechanical properties of CNTs filled polymers accurately and effectively, a 3D modeling approach that incorporates the microstructural attributes of [...] Read more.
Carbon nanotubes (CNTs) are extensively utilized in the fabrication of high-performance composites due to their exceptional mechanical, electrical, and thermal characteristics. To investigate the mechanical properties of CNTs filled polymers accurately and effectively, a 3D modeling approach that incorporates the microstructural attributes of CNTs was introduced. Initially, a representative volume element model was constructed utilizing the modified nearest neighbor algorithm. During the modeling phase, a corresponding interference judgment method was suggested, taking into account the potential positional relationships among the CNTs. Subsequently, stress–strain curves of the model under various loading conditions were derived through finite element analysis employing the volume averaging technique. To validate the efficacy of the modeling approach, the stress within a CNT/epoxy resin composite with varying volume fractions under different axial strains was computed. The resulting stress–strain curves were in good agreement with experimental data from the existing literature. Hence, the modeling method proposed in this study provides a more precise representation of the random distribution of CNTs in the matrix. Furthermore, it is applicable to a broader range of aspect ratios, thereby enabling the CNT simulation model to more closely align with real-world models. Full article
(This article belongs to the Special Issue Polymer Nanoparticles: Synthesis and Applications—2nd Edition)
Show Figures

Figure 1

Figure 1
<p>A schematic representation of the multi-scale analysis method: (<b>a</b>) generation of the CNTs; (<b>b</b>) establishment of the RVE model; (<b>c</b>) FE analysis; (<b>d</b>) comparison of the stress–strain curves.</p>
Full article ">Figure 2
<p>Modeling a 3D CNT: (<b>a</b>) defining a CNT and (<b>b</b>) configuring its position and orientation.</p>
Full article ">Figure 3
<p>The positional relationship between two spatial line segments.</p>
Full article ">Figure 4
<p>A schematic representation of the modified NNA algorithm.</p>
Full article ">Figure 5
<p>The flowchart of the modeling process.</p>
Full article ">Figure 6
<p>The stress–strain curve of epoxy resin [<a href="#B28-polymers-16-02824" class="html-bibr">28</a>].</p>
Full article ">Figure 7
<p>The 3D RVE model of CNTs filled epoxy resin is illustrated in two parts: (<b>a</b>) geometry and (<b>b</b>) meshed configuration.</p>
Full article ">Figure 8
<p>The displacement distribution of a 3D RVE model of an epoxy resin composite filled with 1.5% volume fraction of CNTs is analyzed under various uniaxial elongation ratios (λ).</p>
Full article ">Figure 9
<p>A comparison of simulation results with experimental data.</p>
Full article ">
20 pages, 1522 KiB  
Article
Forecasting Foreign Direct Investment Inflow to Bangladesh: Using an Autoregressive Integrated Moving Average and a Machine Learning-Based Random Forest Approach
by Md. Monirul Islam, Arifa Jannat, Kentaka Aruga and Md Mamunur Rashid
J. Risk Financial Manag. 2024, 17(10), 451; https://doi.org/10.3390/jrfm17100451 - 5 Oct 2024
Viewed by 472
Abstract
This study focuses on the challenge of accurately forecasting foreign direct investment (FDI) inflows to Bangladesh, which are crucial for the country’s sustainable economic growth. Although Bangladesh has strong potential as an investment destination, recent FDI inflows have sharply declined due to global [...] Read more.
This study focuses on the challenge of accurately forecasting foreign direct investment (FDI) inflows to Bangladesh, which are crucial for the country’s sustainable economic growth. Although Bangladesh has strong potential as an investment destination, recent FDI inflows have sharply declined due to global economic uncertainties and the impact of the COVID-19 pandemic. There is a clear gap in applying advanced forecasting models, particularly the autoregressive integrated moving average (ARIMA) model and machine learning techniques like random forest (RF), to predict FDI inflows in Bangladesh. This study aims to analyze and forecast FDI inflows in Bangladesh by employing a hybrid approach that integrates the ARIMA model and the RF algorithm. This study covers the period from 1986 to 2022. The analysis reveals that net FDI inflow in Bangladesh is integrated into the first order, and the ARIMA (3,1,2) model is identified as the most suitable based on the Akaike Information Criterion (AIC). Diagnostic tests confirm its consistency and appropriateness for forecasting net FDI inflows in the country. This study’s findings indicate a decreasing trend in net FDI inflows over the forecasted period, with an average of USD 1664 million, similar to recent values. The results from the RF model also support these findings, projecting average net FDI values of USD 1588.99 million. To achieve the aims of Vision 2041, which include eradicating extreme poverty and becoming a high-economic nation, an increasing trend of FDI inflow is crucial. The current forecasting trends provide insights into the potential trajectory of FDI inflows in Bangladesh, highlighting the importance of attracting higher FDI to accomplish their economic goals. Additionally, strengthening bilateral investment agreements and leveraging technology transfer through FDI will also be essential for fostering sustainable economic growth. Full article
(This article belongs to the Special Issue Advances in Macroeconomics and Financial Markets)
Show Figures

Figure 1

Figure 1
<p>Research framework to forecast FDI net inflows in Bangladesh.</p>
Full article ">Figure 2
<p>(<b>a</b>,<b>b</b>) The trend of FDI in Bangladesh from 1986 to 2022. (<b>a</b>) d = 0; and (<b>b</b>) d = 1.</p>
Full article ">Figure 3
<p>(<b>a</b>). ACF plot of first differenced FDI in Bangladesh; (<b>b</b>). PACF plot of first differenced FDI in Bangladesh.</p>
Full article ">Figure 4
<p>R-studio output of ARIMA (3,1,2) forecasts for 2023 to 2032. Note: FDI value is in millions of USD. The black line represents the actual observed data until 2022. The shaded regions in blue represent the forecast intervals generated by the ARIMA (3,1,2) model, with different shades indicating levels of confidence.</p>
Full article ">Figure A1
<p>ARIMA (3,1,2) residual plot.</p>
Full article ">Figure A2
<p>Random forest-generated forecasts for the next 10 years (2023–2032). Source: authors’ computation using Python software.</p>
Full article ">
25 pages, 41563 KiB  
Article
Estimation, Spatiotemporal Dynamics, and Driving Factors of Grassland Biomass Carbon Storage Based on Machine Learning Methods: A Case Study of the Hulunbuir Grassland
by Qiuying Zhi, Xiaosheng Hu, Ping Wang, Ming Li, Yi Ding, Yuxuan Wu, Tiantian Peng, Wenjie Li, Xiao Guan, Xiaoming Shi and Junsheng Li
Remote Sens. 2024, 16(19), 3709; https://doi.org/10.3390/rs16193709 - 5 Oct 2024
Viewed by 596
Abstract
Precisely estimating the grassland biomass carbon storage is vital for evaluating grassland carbon sequestration potential and the monitoring and management of grassland resources. With the increasing intensity of climate change (CC) and human activities (HA), it is necessary to explore spatiotemporal variations in [...] Read more.
Precisely estimating the grassland biomass carbon storage is vital for evaluating grassland carbon sequestration potential and the monitoring and management of grassland resources. With the increasing intensity of climate change (CC) and human activities (HA), it is necessary to explore spatiotemporal variations in biomass carbon storage and its response to CC and HA. In this study, we focused on the Hulunbuir Grassland, utilizing sample plots data, MODIS data, environmental factors (terrain, soil, and climate), location factor, and texture characteristics to assess the performance of four machine learning algorithms: random forest, support vector machine, gradient boosting decision tree, and extreme gradient boosting in estimating grassland aboveground biomass (AGB). Based on the optimal model combined with root-shoot ratio data, grassland distribution data, and carbon content coefficients, the spatiotemporal characteristics and driving factors of biomass carbon storage from 2001–2022 were analyzed. The results showed that (1) the random forest achieved the highest prediction accuracy for grassland AGB, making it appropriate for AGB estimation in the Hulunbuir Grassland. (2) The spectral indices were the key variables of the grassland AGB, especially the enhanced vegetation index and difference vegetation index. (3) The 22-year average total biomass (TB) of the study area was 1037.10 gC/m2, of which the 22-year average AGB was 48.73 gC/m2 and 22-year average belowground biomass was 988.37 gC/m2, showing a spatial distribution feature of gradual increase from west to east. (4) From 2001–2022, TB carbon storage showed an insignificant growth trend (p > 0.05). The 22-year average carbon storage of TB was 72.34 ± 18.07 gC. (5) Climate factors were the main driving factors for the spatial pattern of grassland TB carbon density, while the combined effects of CC and HA were the main contributors to the interannual increase in grassland TB carbon density. Full article
(This article belongs to the Section Ecological Remote Sensing)
Show Figures

Figure 1

Figure 1
<p>Location of study area and sample plot distribution.</p>
Full article ">Figure 2
<p>Research framework.</p>
Full article ">Figure 3
<p>Variables selection based on RFECV method.</p>
Full article ">Figure 4
<p>Hyperparameter tuning for RF, SVM, GBDT, and XGBoost models.</p>
Full article ">Figure 5
<p>Scatter plot of the optimal model constructed by four ML algorithms.</p>
Full article ">Figure 6
<p>Average grassland biomass carbon density in the growing peak season in Hulunbuir Grassland over the period 2001–2022: (<b>a</b>) AGB; (<b>b</b>) BGB; and (<b>c</b>) TB.</p>
Full article ">Figure 7
<p>Interannual variation trend of grassland TB carbon density (<b>a</b>), grassland area (<b>b</b>), and TB carbon storage (<b>c</b>).</p>
Full article ">Figure 8
<p>Change trend (<b>a</b>) and significance test (<b>b</b>) of grassland biomass carbon density in Hulunbuir Grassland from 2001 to 2022.</p>
Full article ">Figure 9
<p>Factor detection (<b>a</b>) and interaction detection (<b>b</b>) results of the spatial distribution of grassland biomass carbon density.</p>
Full article ">Figure 10
<p>Spatial distribution of contributions of CC and HA to variations in grassland biomass carbon density from 2001 to 2022. (<b>a</b>) Contribution types; (<b>b</b>) the contributions of CC; and (<b>c</b>) the contributions of HA.</p>
Full article ">
17 pages, 1035 KiB  
Article
Predicting the Malignancy Grade of Soft Tissue Sarcomas on MRI Using Conventional Image Reading and Radiomics
by Fabian Schmitz, Hendrik Voigtländer, Hyungseok Jang, Heinz-Peter Schlemmer, Hans-Ulrich Kauczor and Sam Sedaghat
Diagnostics 2024, 14(19), 2220; https://doi.org/10.3390/diagnostics14192220 - 5 Oct 2024
Viewed by 308
Abstract
Objectives: This study aims to investigate MRI features predicting the grade of STS malignancy using conventional image reading and radiomics. Methods: Pretherapeutic imaging data regarding size, tissue heterogeneity, peritumoral changes, necrosis, hemorrhage, and cystic degeneration were evaluated in conventional image reading. [...] Read more.
Objectives: This study aims to investigate MRI features predicting the grade of STS malignancy using conventional image reading and radiomics. Methods: Pretherapeutic imaging data regarding size, tissue heterogeneity, peritumoral changes, necrosis, hemorrhage, and cystic degeneration were evaluated in conventional image reading. Furthermore, the tumors’ apparent diffusion coefficient (ADC) values and radiomics features were extracted and analyzed. A random forest machine learning algorithm was trained and evaluated based on the extracted features. Results: A total of 139 STS cases were included in this study. The mean tumor ADC and the ratio between tumor ADC to healthy muscle ADC were significantly lower in high-grade tumors (p = 0.001 and 0.005, respectively). Peritumoral edema (p < 0.001) and peritumoral contrast enhancement (p < 0.001) were significantly more extensive in high-grade tumors. Tumor heterogeneity was significantly increased in high-grade sarcomas, particularly in T2w- and contrast-enhanced sequences using conventional image reading (p < 0.001) as well as in the radiomics analysis (p < 0.001). Our trained random forest machine learning model predicted high-grade status with an area under the curve (AUC) of 0.97 and an F1 score of 0.93. Biopsy-underestimated tumors exhibited differences in tumor heterogeneity and peritumoral changes. Conclusions: Tumor heterogeneity is a key characteristic of high-grade STSs, which is discernible through conventional imaging reading and radiomics analysis. Higher STS grades are also associated with low ADC values, peritumoral edema, and peritumoral contrast enhancement. Full article
(This article belongs to the Special Issue Soft Tissue Sarcoma: From Diagnosis to Prognosis)
Show Figures

Figure 1

Figure 1
<p>Three different STSs are illustrated—(<b>A</b>) G3 pleomorphic liposarcoma with peritumoral contrast enhancement and intratumoral heterogeneity on a contrast-enhanced T1w image, (<b>B</b>) G1 spindle cell sarcoma without peritumoral contrast enhancement and homogenous intratumoral enhancement on a contrast--enhanced T1w image, and (<b>C</b>) G2 myxofibrosarcoma of the thigh with extensive peritumoral edema on a T2 TIRM image.</p>
Full article ">Figure 2
<p>ROC curve of the random forest model to predict the STS malignancy grade.</p>
Full article ">
19 pages, 12812 KiB  
Article
Design and Implementation of a Shipborne Echo Sounder Simulator Based on a Seabed Echo Scattering and Noise Model
by Shihao Li, Xiao Yang, Hongxiang Ren and Chang Li
J. Mar. Sci. Eng. 2024, 12(10), 1762; https://doi.org/10.3390/jmse12101762 - 5 Oct 2024
Viewed by 315
Abstract
The Manila Amendment 2010 to the STCW International Convention has made clear requirements for seafarers to use the navigation simulation system for training. A shipborne echo sounder is an important navigation aid equipment necessary for a ship’s bridge. Proper use of this equipment [...] Read more.
The Manila Amendment 2010 to the STCW International Convention has made clear requirements for seafarers to use the navigation simulation system for training. A shipborne echo sounder is an important navigation aid equipment necessary for a ship’s bridge. Proper use of this equipment can effectively prevent ship grounding accidents. Given the lack of research on simulating different seabed substrate echoes within echo sounder simulations, this paper proposes an algorithm for generating echoes and clutter from various seabed substrates, based on the Jackson model and noise model. Using the seabed echo generation algorithm, the bathymetric data and seabed echo under the influence of ship rolling are generated, the seabed echo simulation of the sounder under the influence of four different grazing angles and six different substrates is realized. Clutter images, including random noise, bubble interference, co-frequency interference, and fish school interference, are also simulated. A typical ship echo sounder simulator is designed and developed. The echo sounder simulator developed in this paper has high realism in seabed echoes and clutter simulation, complete functions, and friendly human–computer interaction. The system has been used by college students and crews, with satisfactory results, which can effectively meet the needs of actual training of seafarers. Full article
(This article belongs to the Section Ocean Engineering)
Show Figures

Figure 1

Figure 1
<p>The schematic diagram of seabed acoustic scattering.</p>
Full article ">Figure 2
<p>Bresenham algorithm.</p>
Full article ">Figure 3
<p>Process of seabed echo generation.</p>
Full article ">Figure 4
<p>Ship rolling data.</p>
Full article ">Figure 5
<p>Sounder depth in roll attitude.</p>
Full article ">Figure 6
<p>Simulated echo images for ship stationary and roll conditions.</p>
Full article ">Figure 7
<p>Simulation result of the seabed echo image.</p>
Full article ">Figure 8
<p>The curve of echo intensity change.</p>
Full article ">Figure 9
<p>Echo images of different grazing angles under clay substrate.</p>
Full article ">Figure 10
<p>Echo images of different grazing angles under rock substrate.</p>
Full article ">Figure 11
<p>Different sediment echo images.</p>
Full article ">Figure 12
<p>The effect of echo image under different gain.</p>
Full article ">Figure 13
<p>Bubble noise; (<b>a</b>) real equipment; (<b>b</b>) simulation image.</p>
Full article ">Figure 14
<p>Interference noise from other ships: (<b>a</b>) real equipment; (<b>b</b>) simulation image.</p>
Full article ">Figure 15
<p>Reverberation noise and fish school clutter.</p>
Full article ">Figure 16
<p>Simulation structure diagram of echo sounder device.</p>
Full article ">Figure 17
<p>Simulation effect of main interface.</p>
Full article ">Figure 17 Cont.
<p>Simulation effect of main interface.</p>
Full article ">Figure 18
<p>Simulation effect of function implementation.</p>
Full article ">
28 pages, 5528 KiB  
Article
Estimating Rootzone Soil Moisture by Fusing Multiple Remote Sensing Products with Machine Learning
by Shukran A. Sahaar and Jeffrey D. Niemann
Remote Sens. 2024, 16(19), 3699; https://doi.org/10.3390/rs16193699 - 4 Oct 2024
Viewed by 590
Abstract
This study explores machine learning for estimating soil moisture at multiple depths (0–5 cm, 0–10 cm, 0–20 cm, 0–50 cm, and 0–100 cm) across the coterminous United States. A framework is developed that integrates soil moisture from Soil Moisture Active Passive (SMAP), precipitation [...] Read more.
This study explores machine learning for estimating soil moisture at multiple depths (0–5 cm, 0–10 cm, 0–20 cm, 0–50 cm, and 0–100 cm) across the coterminous United States. A framework is developed that integrates soil moisture from Soil Moisture Active Passive (SMAP), precipitation from the Global Precipitation Measurement (GPM), evapotranspiration from the Ecosystem Spaceborne Thermal Radiometer Experiment on Space Station (ECOSTRESS), vegetation data from the Moderate Resolution Imaging Spectroradiometer (MODIS), soil properties from gridded National Soil Survey Geographic (gNATSGO), and land cover information from the National Land Cover Database (NLCD). Five machine learning algorithms are evaluated including the feed-forward artificial neural network, random forest, extreme gradient boosting (XGBoost), Categorical Boosting, and Light Gradient Boosting Machine. The methods are tested by comparing to in situ soil moisture observations from several national and regional networks. XGBoost exhibits the best performance for estimating soil moisture, achieving higher correlation coefficients (ranging from 0.76 at 0–5 cm depth to 0.86 at 0–100 cm depth), lower root mean squared errors (from 0.024 cm3/cm3 at 0–100 cm depth to 0.039 cm3/cm3 at 0–5 cm depth), higher Nash–Sutcliffe Efficiencies (from 0.551 at 0–5 cm depth to 0.694 at 0–100 cm depth), and higher Kling–Gupta Efficiencies (0.511 at 0–5 cm depth to 0.696 at 0–100 cm depth). Additionally, XGBoost outperforms the SMAP Level 4 product in representing the time series of soil moisture for the networks. Key factors influencing the soil moisture estimation are elevation, clay content, aridity index, and antecedent soil moisture derived from SMAP. Full article
Show Figures

Figure 1

Figure 1
<p>Locations and climates of the in situ soil moisture stations used in this study.</p>
Full article ">Figure 2
<p>Performance metrics (<span class="html-italic">R</span>, MBE, RMSE, ubRMSE, NSE, and KGE) for the soil moisture estimates of the machine learning algorithms when compared to the testing data, including all depths and stations. For each performance metric, the line inside the box indicates the median value and the box represents the interquartile range.</p>
Full article ">Figure 3
<p>RMSE of the soil moisture estimates from the machine learning algorithms for the testing dataset when the data are divided according to the (<b>a</b>) in situ soil moisture networks and (<b>b</b>) depths. For each performance metric, the line inside the box indicates the median and the box represents the interquartile range.</p>
Full article ">Figure 4
<p>Density plots comparing the observed and XGBoost estimates of soil moisture for each depth using the testing datasets for each climate. Darker blues represent higher concentrations of data, while lighter blues represent lower concentrations.</p>
Full article ">Figure 5
<p>Time series of soil moisture at (<b>a</b>) 0–5 cm and (<b>b</b>) 0–100 cm depths at the arid USCRN Las Cruces 20N station (a member of the testing dataset). The plotted soil moisture data include hourly in situ measurements, estimates from the XGBoost model, and 3 h SMAP L4 soil moisture estimates. Daily GPM precipitation data at the site are also shown.</p>
Full article ">Figure 5 Cont.
<p>Time series of soil moisture at (<b>a</b>) 0–5 cm and (<b>b</b>) 0–100 cm depths at the arid USCRN Las Cruces 20N station (a member of the testing dataset). The plotted soil moisture data include hourly in situ measurements, estimates from the XGBoost model, and 3 h SMAP L4 soil moisture estimates. Daily GPM precipitation data at the site are also shown.</p>
Full article ">Figure 6
<p>Time series of soil moisture at (<b>a</b>) 0–5 cm and (<b>b</b>) 0–100 cm depths at the humid USCRN Versailles 3NNW station (a member of the testing dataset). The plotted soil moisture data include hourly in situ measurements, estimates from the XGBoost model, and 3 h SMAP L4 soil moisture estimates. Daily GPM precipitation data for the site are also shown.</p>
Full article ">Figure 7
<p>Correlations between predictor variables and in situ soil moisture at different depths. Positive correlations are shown in blue and negative correlations are shown in red.</p>
Full article ">Figure 8
<p>Relative importance of each predictor variable in the RF, XGBoost, CatBoost, and LightGBM models and the average importance among the four models.</p>
Full article ">
Back to TopTop