Nothing Special   »   [go: up one dir, main page]

You seem to have javascript disabled. Please note that many of the page functionalities won't work as expected without javascript enabled.
 
 
Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (10,566)

Search Parameters:
Keywords = noise models

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
35 pages, 1085 KiB  
Article
Multi-Channel Speech Enhancement Using Labelled Random Finite Sets and a Neural Beamformer in Cocktail Party Scenario
by Jayanta Datta, Ali Dehghan Firoozabadi, David Zabala-Blanco and Francisco R. Castillo-Soria
Appl. Sci. 2025, 15(6), 2944; https://doi.org/10.3390/app15062944 (registering DOI) - 8 Mar 2025
Abstract
In this research, a multi-channel target speech enhancement scheme is proposed that is based on deep learning (DL) architecture and assisted by multi-source tracking using a labeled random finite set (RFS) framework. A neural network based on minimum variance distortionless response (MVDR) beamformer [...] Read more.
In this research, a multi-channel target speech enhancement scheme is proposed that is based on deep learning (DL) architecture and assisted by multi-source tracking using a labeled random finite set (RFS) framework. A neural network based on minimum variance distortionless response (MVDR) beamformer is considered as the beamformer of choice, where a residual dense convolutional graph-U-Net is applied in a generative adversarial network (GAN) setting to model the beamformer for target speech enhancement under reverberant conditions involving multiple moving speech sources. The input dataset for this neural architecture is constructed by applying multi-source tracking using multi-sensor generalized labeled multi-Bernoulli (MS-GLMB) filtering, which belongs to the labeled RFS framework, to obtain estimations of the sources’ positions and the associated labels (corresponding to each source) at each time frame with high accuracy under the effect of undesirable factors like reverberation and background noise. The tracked sources’ positions and associated labels help to correctly discriminate the target source from the interferers across all time frames and generate time–frequency (T-F) masks corresponding to the target source from the output of a time-varying, minimum variance distortionless response (MVDR) beamformer. These T-F masks constitute the target label set used to train the proposed deep neural architecture to perform target speech enhancement. The exploitation of MS-GLMB filtering and a time-varying MVDR beamformer help in providing the spatial information of the sources, in addition to the spectral information, within the neural speech enhancement framework during the training phase. Moreover, the application of the GAN framework takes advantage of adversarial optimization as an alternative to maximum likelihood (ML)-based frameworks, which further boosts the performance of target speech enhancement under reverberant conditions. The computer simulations demonstrate that the proposed approach leads to better target speech enhancement performance compared with existing state-of-the-art DL-based methodologies which do not incorporate the labeled RFS-based approach, something which is evident from the 75% ESTOI and PESQ of 2.70 achieved by the proposed approach as compared with the 46.74% ESTOI and PESQ of 1.84 achieved by Mask-MVDR with self-attention mechanism at a reverberation time (RT60) of 550 ms. Full article
23 pages, 7419 KiB  
Article
A Deep Learning-Based Detection and Segmentation System for Multimodal Ultrasound Images in the Evaluation of Superficial Lymph Node Metastases
by Roxana Rusu-Both, Marius-Cristian Socaci, Adrian-Ionuț Palagos, Corina Buzoianu, Camelia Avram, Honoriu Vălean and Romeo-Ioan Chira
J. Clin. Med. 2025, 14(6), 1828; https://doi.org/10.3390/jcm14061828 (registering DOI) - 8 Mar 2025
Abstract
Background/Objectives: Even with today’s advancements, cancer still represents a major cause of mortality worldwide. One important aspect of cancer progression that has a big impact on diagnosis, prognosis, and treatment plans is accurate lymph node metastasis evaluation. However, regardless of the imaging [...] Read more.
Background/Objectives: Even with today’s advancements, cancer still represents a major cause of mortality worldwide. One important aspect of cancer progression that has a big impact on diagnosis, prognosis, and treatment plans is accurate lymph node metastasis evaluation. However, regardless of the imaging method used, this process is challenging and time-consuming. This research aimed to develop and validate an automatic detection and segmentation system for superficial lymph node evaluation based on multimodal ultrasound images, such as traditional B-mode, Doppler, and elastography, using deep learning techniques. Methods: The suggested approach incorporated a Mask R-CNN architecture designed specifically for the detection and segmentation of lymph nodes. The pipeline first involved noise reduction preprocessing, after which morphological and textural feature segmentation and analysis were performed. Vascularity and stiffness parameters were further examined in Doppler and elastography pictures. Metrics, including accuracy, mean average precision (mAP), and dice coefficient, were used to assess the system’s performance during training and validation on a carefully selected dataset of annotated ultrasound pictures. Results: During testing, the Mask R-CNN model showed an accuracy of 92.56%, a COCO AP score of 60.7 and a validation score of 64. Furter on, to improve diagnostic capabilities, Doppler and elastography data were added. This allowed for improved performance across several types of ultrasound images and provided thorough insights into the morphology, vascularity, and stiffness of lymph nodes. Conclusions: This paper offers a novel use of deep learning for automated lymph node assessment in ultrasound imaging. This system offers a dependable tool for doctors to evaluate lymph node metastases efficiently by fusing sophisticated segmentation techniques with multimodal image processing. It has the potential to greatly enhance patient outcomes and diagnostic accuracy. Full article
19 pages, 3377 KiB  
Article
AI-Enhanced Detection of Heart Murmurs: Advancing Non-Invasive Cardiovascular Diagnostics
by Maria-Alexandra Zolya, Elena-Laura Popa, Cosmin Baltag, Dragoș-Vasile Bratu, Simona Coman and Sorin-Aurel Moraru
Sensors 2025, 25(6), 1682; https://doi.org/10.3390/s25061682 (registering DOI) - 8 Mar 2025
Viewed by 4
Abstract
Cardiovascular diseases (CVDs) are the leading cause of death worldwide, claiming over 17 million lives annually. Early detection of conditions like heart murmurs, often indicative of heart valve abnormalities, is critical for improving patient outcomes. Traditional diagnostic methods, including physical auscultation and advanced [...] Read more.
Cardiovascular diseases (CVDs) are the leading cause of death worldwide, claiming over 17 million lives annually. Early detection of conditions like heart murmurs, often indicative of heart valve abnormalities, is critical for improving patient outcomes. Traditional diagnostic methods, including physical auscultation and advanced imaging techniques, are constrained by their reliance on specialized clinical expertise, inherent procedural invasiveness, substantial financial costs, and limited accessibility, particularly in resource-limited healthcare environments. This study presents a novel convolutional recurrent neural network (CRNN) model designed for the non-invasive classification of heart murmurs. The model processes heart sound recordings using advanced pre-processing techniques such as z-score normalization, band-pass filtering, and data augmentation (Gaussian noise, time shift, and pitch shift) to enhance robustness. By combining convolutional and recurrent layers, the CRNN captures spatial and temporal features in audio data, achieving an accuracy of 90.5%, precision of 89%, and recall of 87%. These results underscore the potential of machine-learning technologies to revolutionize cardiac diagnostics by offering scalable, accessible solutions for the early detection of cardiovascular conditions. This approach paves the way for broader applications of AI in healthcare, particularly in underserved regions where traditional resources are scarce. Full article
Show Figures

Figure 1

Figure 1
<p>(<b>a</b>) The original sound wave; (<b>b</b>) The associated spectogram.</p>
Full article ">Figure 2
<p>(<b>a</b>) The original recording; (<b>b</b>) The recording after pre-processing stage.</p>
Full article ">Figure 3
<p>(<b>a</b>) The original sound; (<b>b</b>) The sound after Gaussian noise was added.</p>
Full article ">Figure 4
<p>(<b>a</b>) The original sound; (<b>b</b>) The sound after time shift.</p>
Full article ">Figure 5
<p>(<b>a</b>) The original sound; (<b>b</b>) The sound after pitch shift.</p>
Full article ">Figure 6
<p>(<b>a</b>) The original sound; (<b>b</b>) The sound after standardization of sampling rate.</p>
Full article ">Figure 7
<p>(<b>a</b>) The original sound; (<b>b</b>) The sound after normalization of recording length (15 s).</p>
Full article ">Figure 8
<p>(<b>a</b>) The original sound; (<b>b</b>) The sound after band-pass filter was applied.</p>
Full article ">Figure 9
<p>(<b>a</b>) The original sound; (<b>b</b>) The sound after Z-score normalization.</p>
Full article ">Figure 10
<p>The proposed CRNN architecture.</p>
Full article ">Figure 11
<p>Evolution of the loss function and the accuracy during training and validation for the first model: (<b>a</b>) Loss; (<b>b</b>) Accuracy.</p>
Full article ">Figure 12
<p>ROC curve and confusion matrix for first model: (<b>a</b>) ROC; (<b>b</b>) Confusion matrix.</p>
Full article ">Figure 13
<p>Evolution of the loss function and the accuracy during training and validation for the second model: (<b>a</b>) Loss; (<b>b</b>) Accuracy.</p>
Full article ">Figure 14
<p>ROC curve and confusion matrix for second model: (<b>a</b>) ROC; (<b>b</b>) Confusion matrix.</p>
Full article ">Figure 15
<p>Evolution of the loss function and the accuracy during training and validation for the third model: (<b>a</b>) Loss; (<b>b</b>) Accuracy.</p>
Full article ">Figure 16
<p>ROC curve and confusion matrix for third model: (<b>a</b>) ROC; (<b>b</b>) Confusion matrix.</p>
Full article ">Figure 17
<p>Evolution of the loss function and the accuracy during training and validation for the first model: (<b>a</b>) Loss; (<b>b</b>) Accuracy.</p>
Full article ">Figure 18
<p>ROC curve and confusion matrix for the fourth model: (<b>a</b>) ROC; (<b>b</b>) Confusion matrix.</p>
Full article ">
20 pages, 6467 KiB  
Article
A Lightweight TA-YOLOv8 Method for the Spot Weld Surface Anomaly Detection of Body in White
by Weijie Liu, Miao Jia, Shuo Zhang, Siyu Zhu, Jin Qi and Jie Hu
Appl. Sci. 2025, 15(6), 2931; https://doi.org/10.3390/app15062931 (registering DOI) - 8 Mar 2025
Viewed by 12
Abstract
The deep learning architecture YOLO (You Only Look Once) has demonstrated its superior visual detection performance in various computer vision tasks and has been widely applied in the field of automatic surface defect detection. In this paper, we propose a lightweight YOLOv8-based method [...] Read more.
The deep learning architecture YOLO (You Only Look Once) has demonstrated its superior visual detection performance in various computer vision tasks and has been widely applied in the field of automatic surface defect detection. In this paper, we propose a lightweight YOLOv8-based method for the quality inspection of car body welding spots. We developed a TA-YOLOv8 network structure which has an improved Task-Aligned (TA) head detection, designed to handle a small sample size, imbalanced positive and negative samples, and high-noise characteristics of Body-in-White welding spot data. By learning with fewer parameters, the model achieves more efficient and accurate classification. Additionally, our algorithm framework can perform anomaly segmentation and classification on our open-world raw datasets obtained from actual production environments. The experimental results show that the lightweight module improves the processing speed by an average of 2.8%, with increases in detection the mAP@50-95 and recall rate of 1.35% and 0.1226, respectively. Full article
(This article belongs to the Special Issue Motion Control for Robots and Automation)
Show Figures

Figure 1

Figure 1
<p>Architecture of YOLOv8 model. The different color parts of the input batches represent different image data. The different color parts of the architecture represent different function modules.</p>
Full article ">Figure 2
<p>Improved backbone network of our architecture. On the left is a schematic diagram of the backbone network process for detecting spot weld images, while the right side shows the corresponding structure layer parameters and related information. The different color parts are the same as <a href="#applsci-15-02931-f001" class="html-fig">Figure 1</a>.</p>
Full article ">Figure 3
<p>Proposed Multiple Cross-Layer FPN (MC-FPN) network. The different color dotted lines represent multiple cross layers, with P<sub>2</sub>-P<sub>5</sub> being simplified representations of the intermediate connection layers.</p>
Full article ">Figure 4
<p>Task-Aligned head structure: to learn extensive task-interactive features from multiple convolutional layers.</p>
Full article ">Figure 5
<p>Welding spots sample images and annotated data in Body-in-White production lines. (<b>a</b>) shows the samples we collected in the production lines, while (<b>b</b>) shows the pretraining dataset and the labels (yellow squares).</p>
Full article ">Figure 5 Cont.
<p>Welding spots sample images and annotated data in Body-in-White production lines. (<b>a</b>) shows the samples we collected in the production lines, while (<b>b</b>) shows the pretraining dataset and the labels (yellow squares).</p>
Full article ">Figure 6
<p>Performance comparison with typical object detection algorithms on test set.</p>
Full article ">Figure 7
<p>Some results of WSDDM and comparison between small welding spot detection models. We use green to represent the detected weld spots are normal, and red to represent the detected weld spots have defects or abnormalities.</p>
Full article ">Figure 8
<p>The weld spot dataset obtained from image segmentation using the WSDDM.</p>
Full article ">Figure 9
<p>Data augmentation and labeling.</p>
Full article ">Figure 10
<p>Visualization and validation sample results for model testing. The model effectively captures the location of welding defects through highlighted (green) regions.</p>
Full article ">Figure 11
<p>Validation sample results for model generalization ability.</p>
Full article ">Figure 12
<p>Experimental pipeline and integrated detection systems.</p>
Full article ">
27 pages, 4262 KiB  
Article
A Self-Supervised Method for Speaker Recognition in Real Sound Fields with Low SNR and Strong Reverberation
by Xuan Zhang, Jun Tang, Huiliang Cao, Chenguang Wang, Chong Shen and Jun Liu
Appl. Sci. 2025, 15(6), 2924; https://doi.org/10.3390/app15062924 - 7 Mar 2025
Viewed by 244
Abstract
Speaker recognition is essential in smart voice applications for personal identification. Current state-of-the-art techniques primarily focus on ideal acoustic conditions. However, the traditional spectrogram struggles to differentiate between noise, reverberation, and speech. To overcome this challenge, MFCC can be replaced with the output [...] Read more.
Speaker recognition is essential in smart voice applications for personal identification. Current state-of-the-art techniques primarily focus on ideal acoustic conditions. However, the traditional spectrogram struggles to differentiate between noise, reverberation, and speech. To overcome this challenge, MFCC can be replaced with the output from a self-supervised learning model. This study introduces a TDNN enhanced with a pre-trained model for robust performance in noisy and reverberant environments, referred to as PNR-TDNN. The PNR-TDNN employs HuBERT as its backbone, while the TDNN is an improved ECAPA-TDNN. The pre-trained model employs the Canopy/Mini Batch k-means++ strategy. In the TDNN architecture, several enhancements are implemented, including a cross-channel fusion mechanism based on Res2Net. Additionally, a non-average attention mechanism is applied to the pooling operation, focusing on the weight information of each channel within the Squeeze-and-Excitation Net. Furthermore, the contribution of individual channels to the pooling of time-domain frames is enhanced by substituting attentive statistics with multi-head attention statistics. Validated by zhvoice in noisy conditions, the minimized PNR-TDNN demonstrates a 5.19% improvement in EER compared to CAM++. In more challenging environments with noise and reverberation, the minimized PNR-TDNN further improves EER by 3.71% and 9.6%, respectively, and MinDCF by 3.14% and 3.77%, respectively. The proposed method has also been validated on the VoxCeleb1 and cn-celeb_v2 datasets, representing a significant breakthrough in the field of speaker recognition under challenging conditions. This advancement is particularly crucial for enhancing safety and protecting personal identification in voice-enabled microphone applications. Full article
(This article belongs to the Section Computing and Artificial Intelligence)
19 pages, 9096 KiB  
Article
Speech Enhancement Based on Unidirectional Interactive Noise Modeling Assistance
by Yuewei Zhang, Huanbin Zou and Jie Zhu
Appl. Sci. 2025, 15(6), 2919; https://doi.org/10.3390/app15062919 - 7 Mar 2025
Viewed by 166
Abstract
It has been demonstrated that interactive speech and noise modeling outperforms traditional speech modeling-only methods for speech enhancement (SE). With a dual-branch topology that simultaneously predicts target speech and noise signals and employs bidirectional information communication between the two branches, the quality of [...] Read more.
It has been demonstrated that interactive speech and noise modeling outperforms traditional speech modeling-only methods for speech enhancement (SE). With a dual-branch topology that simultaneously predicts target speech and noise signals and employs bidirectional information communication between the two branches, the quality of the enhanced speech is significantly improved. However, the dual-branch topology greatly increases the model complexity and deployment cost, thus limiting its practicality. In this paper, we propose UniInterNet, a unidirectional information interaction-based dual-branch network to achieve noise modeling-assisted SE without any increase in complexity. Specifically, the noise branch still receives information from the speech branch to achieve more accurate noise modeling. Subsequently, the noise modeling results are utilized to assist the learning of the speech branch during backpropagation, while the speech branch no longer receives the auxiliary information from the noise branch, so only the speech branch is required during model deployment. Experimental results demonstrate that under the causal inference condition, the performance of UniInterNet only marginally decreases compared to the corresponding bidirectional information interaction scheme, while the model inference complexity is reduced by about 75%. With comparable overall performance, UniInterNet also outperforms previous interactive speech and noise modeling-based benchmarks in terms of causal inference and model complexity. Furthermore, UniInterNet surpasses other existing competitive methods. Full article
(This article belongs to the Section Computing and Artificial Intelligence)
Show Figures

Figure 1

Figure 1
<p>Overall architecture of unidirectional information interaction-based dual-branch network (UniInterNet).</p>
Full article ">Figure 2
<p>(<b>a</b>) The detail of the two-dimensional convolutional (Conv2d) block. (<b>b</b>) The detail of the two-dimensional deconvolutional (DeConv2d) block.</p>
Full article ">Figure 3
<p>The diagram of time–frequency sequence modeling (TFSM) block. During temporal sequence modeling, a causal gated recurrent unit (GRU) layer is employed in the speech branch, while a non-causal bidirectional GRU (BiGRU) layer is utilized in the noise branch.</p>
Full article ">Figure 4
<p>Structure of the unidirectional interaction module.</p>
Full article ">Figure 5
<p>Visualization of the spectrum of the following: (<b>a</b>) noisy speech; (<b>b</b>) clean speech; (<b>c</b>) enhanced speech by SiNet; (<b>d</b>) enhanced speech by BiInterNet; (<b>e</b>) enhanced speech by UniInterNet-CausalNoise; (<b>f</b>) enhanced speech by UniInterNet. The noise type is open area cafeteria noise.</p>
Full article ">Figure 6
<p>Visualization of the spectrum of the following: (<b>a</b>) noisy speech; (<b>b</b>) clean speech; (<b>c</b>) enhanced speech by UniInterNet w/o EncInter; (<b>d</b>) enhanced speech by UniInterNet w/o RecInter; (<b>e</b>) enhanced speech by UniInterNet w/o DecInter; (<b>f</b>) enhanced speech by UniInterNet. The noise type is public square noise.</p>
Full article ">
19 pages, 4685 KiB  
Article
Differential Privacy in Federated Learning: An Evolutionary Game Analysis
by Zhengwei Ni and Qi Zhou
Appl. Sci. 2025, 15(6), 2914; https://doi.org/10.3390/app15062914 - 7 Mar 2025
Viewed by 143
Abstract
This paper examines federated learning, a decentralized machine learning paradigm, focusing on privacy challenges. We introduce differential privacy mechanisms to protect privacy and quantify their impact on global model performance. Using evolutionary game theory, we establish a framework to analyze strategy dynamics and [...] Read more.
This paper examines federated learning, a decentralized machine learning paradigm, focusing on privacy challenges. We introduce differential privacy mechanisms to protect privacy and quantify their impact on global model performance. Using evolutionary game theory, we establish a framework to analyze strategy dynamics and define utilities for different strategies based on Gaussian noise powers and training iterations. A differential privacy federated learning model (DPFLM) is analyzed within this framework. A key contribution is the thorough existence and stability analysis, identifying evolutionarily stable strategies (ESSs) and confirming their stability through simulations. This research provides theoretical insights for enhancing privacy protection in federated learning systems. Full article
(This article belongs to the Special Issue Multimedia Smart Security)
Show Figures

Figure 1

Figure 1
<p>Localized differential privacy data processing framework.</p>
Full article ">Figure 2
<p>Implementation of differential privacy.</p>
Full article ">Figure 3
<p>Federated learning system model.</p>
Full article ">Figure 4
<p>Utilities for different scenarios: (<b>a</b>) Two-strategy scenario. (<b>b</b>) Four-strategy scenario.</p>
Full article ">Figure 5
<p>Population shares and utilities for two-strategy scenario (Equation (<a href="#FD23-applsci-15-02914" class="html-disp-formula">23</a>) holds): (<b>a</b>) Population shares. (<b>b</b>) Utilities.</p>
Full article ">Figure 6
<p>Population shares and utilities for two-strategy scenario (Equation (<a href="#FD24-applsci-15-02914" class="html-disp-formula">24</a>) holds): (<b>a</b>) Population shares. (<b>b</b>) Utilities.</p>
Full article ">Figure 7
<p>Population shares for four-strategy scenario.</p>
Full article ">Figure 8
<p>Population shares for eight-strategy scenario.</p>
Full article ">
28 pages, 13595 KiB  
Article
Research on Optimization of Diesel Engine Speed Control Based on UKF-Filtered Data and PSO Fuzzy PID Control
by Jun Fu, Shuo Gu, Lei Wu, Nan Wang, Luchen Lin and Zhenghong Chen
Processes 2025, 13(3), 777; https://doi.org/10.3390/pr13030777 - 7 Mar 2025
Viewed by 194
Abstract
With the continuous development of industrial automation, diesel engines play an increasingly important role in various types of construction machinery and power generation equipment. Improving the dynamic and static performance of the speed control system of single-cylinder diesel engines can not only significantly [...] Read more.
With the continuous development of industrial automation, diesel engines play an increasingly important role in various types of construction machinery and power generation equipment. Improving the dynamic and static performance of the speed control system of single-cylinder diesel engines can not only significantly improve the efficiency of the equipment, but also effectively reduce energy consumption and emissions. Particle swarm optimization (PSO) fuzzy PID control algorithms have been widely used in many complex engineering problems due to their powerful global optimization capability and excellent adaptability. Currently, PSO-based fuzzy PID control research mainly integrates hybrid algorithmic strategies to avoid the local optimum problem, and lacks optimization of the dynamic noise suppression of the input error and the rate of change of the error. This makes the algorithm susceptible to the coupling of the system uncertainty and measurement disturbances during the parameter optimization process, leading to performance degradation. For this reason, this study proposes a new framework based on the synergistic optimization of the untraceable Kalman filter (UKF) and PSO fuzzy PID control for the speed control system of a single-cylinder diesel engine. A PSO-optimized fuzzy PID controller is designed by obtaining accurate speed estimation data using the UKF. The PSO is capable of quickly adjusting the fuzzy PID parameters so as to effectively alleviate the nonlinearity and uncertainty problems during the operation of diesel engines. By establishing a Matlab/Simulink simulation model, the diesel engine speed step response experiments (i.e., startup experiments) and load mutation experiments were carried out, and the measurement noise and process noise were imposed. The simulation results show that the optimized diesel engine speed control system is able to reduce the overshoot by 76%, shorten the regulation time by 58%, and improve the noise reduction by 25% compared with the conventional PID control. Compared with the PSO fuzzy PID control algorithm without UKF noise reduction, the optimized scheme reduces the overshoot by 20%, shortens the regulation time by 48%, and improves the noise reduction effect by 23%. The results show that the PSO fuzzy PID control method with integrated UKF has superior control performance in terms of system stability and accuracy. The algorithm significantly improves the responsiveness and stability of diesel engine speed, achieves better control effect in the optimization of diesel engine speed control, and provides a useful reference for the optimization of other diesel engine control systems. In addition, this study establishes the GT-POWER model of a 168 F single-cylinder diesel engine, and compares the cylinder pressure and fuel consumption under four operating conditions through bench tests to ensure the physical reasonableness of the kinetic input parameters and avoid algorithmic optimization on the distorted front-end model. Full article
(This article belongs to the Section Process Control and Monitoring)
Show Figures

Figure 1

Figure 1
<p>Diesel engine speed control system schematic diagram.</p>
Full article ">Figure 2
<p>Diesel engine system schematic diagram.</p>
Full article ">Figure 3
<p>Schematic diagram of the overall architecture of the speed control system.</p>
Full article ">Figure 4
<p>Diesel engine test bench.</p>
Full article ">Figure 5
<p>GT-POWER model of 168 F single cylinder diesel engine.</p>
Full article ">Figure 6
<p>Comparison of cylinder pressure under different loads.</p>
Full article ">Figure 7
<p>Fuel consumption comparison chart under different loads.</p>
Full article ">Figure 8
<p>Schematic diagram of overall technical scheme.</p>
Full article ">Figure 9
<p>Schematic diagram of the PSO fuzzy PID controller based on UKF data.</p>
Full article ">Figure 10
<p>UKF algorithm flowchart.</p>
Full article ">Figure 11
<p>Unscented kalman filtering noise reduction effect diagram.</p>
Full article ">Figure 12
<p>Characteristic face of the fuzzy inference system: (<b>a</b>) <math display="inline"><semantics> <mrow> <mi mathvariant="sans-serif">Δ</mi> <msub> <mrow> <mi>k</mi> </mrow> <mrow> <mi>p</mi> </mrow> </msub> </mrow> </semantics></math> (Proportional term characteristic surface), (<b>b</b>) <math display="inline"><semantics> <mrow> <mi mathvariant="sans-serif">Δ</mi> <msub> <mrow> <mi>k</mi> </mrow> <mrow> <mi>i</mi> </mrow> </msub> </mrow> </semantics></math> (Integral term characteristic surface), (<b>c</b>) <math display="inline"><semantics> <mrow> <mi mathvariant="sans-serif">Δ</mi> <msub> <mrow> <mi>k</mi> </mrow> <mrow> <mi>d</mi> </mrow> </msub> </mrow> </semantics></math> (Derivative term characteristic surface).</p>
Full article ">Figure 13
<p>Particle swarm optimization flowchart.</p>
Full article ">Figure 14
<p>Fitness value optimization results.</p>
Full article ">Figure 15
<p>Model of fuzzy PID control algorithm optimized by particle swarm optimization based on UKF in Matlab/Simulink (R2022b).</p>
Full article ">Figure 16
<p>Model of PID, Fuzzy PID, Fuzzy PID based on data of UKF, and PSO Fuzzy PID based on data of UKF in Matlab/Simulink (R2022b).</p>
Full article ">Figure 17
<p>Step response experiment results.</p>
Full article ">Figure 18
<p>Load disturbance experiment results.</p>
Full article ">
21 pages, 24146 KiB  
Article
SMEP-DETR: Transformer-Based Ship Detection for SAR Imagery with Multi-Edge Enhancement and Parallel Dilated Convolutions
by Chushi Yu and Yoan Shin
Remote Sens. 2025, 17(6), 953; https://doi.org/10.3390/rs17060953 - 7 Mar 2025
Viewed by 158
Abstract
Synthetic aperture radar (SAR) serves as a pivotal remote sensing technology, offering critical support for ship monitoring, environmental observation, and national defense. Although optical detection methods have achieved good performance, SAR imagery still faces challenges, including speckle, complex backgrounds, and small, dense targets. [...] Read more.
Synthetic aperture radar (SAR) serves as a pivotal remote sensing technology, offering critical support for ship monitoring, environmental observation, and national defense. Although optical detection methods have achieved good performance, SAR imagery still faces challenges, including speckle, complex backgrounds, and small, dense targets. Reducing false alarms and missed detections while improving detection performance remains a key objective in the field. To address these issues, we propose SMEP-DETR, a transformer-based model with multi-edge enhancement and parallel dilated convolutions. This model integrates a speckle denoising module, a multi-edge information enhancement module, and a parallel dilated convolution and attention pyramid network. Experimental results demonstrate that SMEP-DETR achieves the high mAP 98.6% on SSDD, 93.2% in HRSID, and 80.0% in LS-SSDD-v1.0, surpassing several state-of-the-art algorithms. Visualization results validate the model’s capability to effectively mitigate the impact of speckle noise while preserving valuable information in both inshore and offshore scenarios. Full article
(This article belongs to the Special Issue Remote Sensing Image Thorough Analysis by Advanced Machine Learning)
Show Figures

Figure 1

Figure 1
<p>The architecture of the proposed SMEP-DETR. Ⓒ denotes the concatenate operation and ⊕ denotes the element-wise add operation.</p>
Full article ">Figure 2
<p>The structure of the multi-edge information enhancement module.</p>
Full article ">Figure 3
<p>Diagram of the parallel dilated convolution and attention pyramid network.</p>
Full article ">Figure 4
<p>Visualization of SMEP-DETR and comparison detectors on SSDD: (<b>a</b>) inshore scene with large-scale ship targets, (<b>b</b>) inshore scene with both large and small ships, (<b>c</b>) offshore scene with significant speckle interference, (<b>d</b>) offshore scene with multiple targets. Red bounding boxes represent predicted ships, yellow ellipses indicate missing detections, and blue ellipses denote false alarms.</p>
Full article ">Figure 4 Cont.
<p>Visualization of SMEP-DETR and comparison detectors on SSDD: (<b>a</b>) inshore scene with large-scale ship targets, (<b>b</b>) inshore scene with both large and small ships, (<b>c</b>) offshore scene with significant speckle interference, (<b>d</b>) offshore scene with multiple targets. Red bounding boxes represent predicted ships, yellow ellipses indicate missing detections, and blue ellipses denote false alarms.</p>
Full article ">Figure 4 Cont.
<p>Visualization of SMEP-DETR and comparison detectors on SSDD: (<b>a</b>) inshore scene with large-scale ship targets, (<b>b</b>) inshore scene with both large and small ships, (<b>c</b>) offshore scene with significant speckle interference, (<b>d</b>) offshore scene with multiple targets. Red bounding boxes represent predicted ships, yellow ellipses indicate missing detections, and blue ellipses denote false alarms.</p>
Full article ">Figure 5
<p>Visualization of SMEP-DETR and comparison detectors on HRSID and LS-SSDD-v1.0. (<b>a</b>,<b>b</b>) Samples from HRSID, (<b>c</b>,<b>d</b>) samples of LS-SSDD-v1.0. (<b>a</b>) Offshore scene with closely spaced targets, (<b>b</b>) inshore scene with docked objects near the shoreline, (<b>c</b>) offshore scene containing extremely small targets, (<b>d</b>) inshore scene with extensive background information. Red bounding boxes represent predicted ships, yellow ellipses indicate missing detections, and blue ellipses denote false alarms.</p>
Full article ">Figure 5 Cont.
<p>Visualization of SMEP-DETR and comparison detectors on HRSID and LS-SSDD-v1.0. (<b>a</b>,<b>b</b>) Samples from HRSID, (<b>c</b>,<b>d</b>) samples of LS-SSDD-v1.0. (<b>a</b>) Offshore scene with closely spaced targets, (<b>b</b>) inshore scene with docked objects near the shoreline, (<b>c</b>) offshore scene containing extremely small targets, (<b>d</b>) inshore scene with extensive background information. Red bounding boxes represent predicted ships, yellow ellipses indicate missing detections, and blue ellipses denote false alarms.</p>
Full article ">Figure 5 Cont.
<p>Visualization of SMEP-DETR and comparison detectors on HRSID and LS-SSDD-v1.0. (<b>a</b>,<b>b</b>) Samples from HRSID, (<b>c</b>,<b>d</b>) samples of LS-SSDD-v1.0. (<b>a</b>) Offshore scene with closely spaced targets, (<b>b</b>) inshore scene with docked objects near the shoreline, (<b>c</b>) offshore scene containing extremely small targets, (<b>d</b>) inshore scene with extensive background information. Red bounding boxes represent predicted ships, yellow ellipses indicate missing detections, and blue ellipses denote false alarms.</p>
Full article ">
19 pages, 13798 KiB  
Article
RANFIS-Based Sensor System with Low-Cost Multi-Sensors for Reliable Measurement of VOCs
by Keunyoung Kim and Woosung Yang
Technologies 2025, 13(3), 111; https://doi.org/10.3390/technologies13030111 - 7 Mar 2025
Viewed by 270
Abstract
This study describes a sensor system for continuous monitoring of volatile organic compounds (VOCs) emitted from small industrial facilities in urban centers, such as automobile paint facilities and printing facilities. Previously, intermittent measurements were made using expensive flame ionization detector (FID)-type instruments that [...] Read more.
This study describes a sensor system for continuous monitoring of volatile organic compounds (VOCs) emitted from small industrial facilities in urban centers, such as automobile paint facilities and printing facilities. Previously, intermittent measurements were made using expensive flame ionization detector (FID)-type instruments that were impossible to install, resulting in a lack of continuous management. This paper develops a low-cost sensor system for full-time management and consists of multi-sensor systems to increase the spatial resolution in the pipe. To improve the accuracy and reliability of this system, a new reinforced adaptive neuro fuzzy inference system (RANFIS) model with enhanced preprocessing based on the adaptive neuro fuzzy inference system (ANFIS) model is proposed. For this purpose, a smart sensor module consisting of low-cost metal oxide semiconductors (MOSs) and photo-ionization detectors (PIDs) is fabricated, and an operating controller is configured for real-time data acquisition, analysis, and evaluation. In the front part of the RANFIS, interquartile range (IQR) is used to remove outliers, and gradient analysis is used to detect and correct data with abnormal change rates to solve nonlinearities and outliers in sensor data. In the latter stage, the complex nonlinear relationship of the data was modeled using the ANFIS to reliably handle data uncertainty and noise. For practical verification, a toluene evaporation chamber with a sensor system for monitoring was built, and the results of real-time data sensing after training based on real data were compared and evaluated. As a result of applying the RANFIS model, the RMSE of the MQ135, MQ138, and PID-A15 sensors were 3.578, 11.594, and 4.837, respectively, which improved the performance by 87.1%, 25.9%, and 35.8% compared to the existing ANFIS. Therefore, the precision within 5% of the measurement results of the two experimentally verified sensors shows that the proposed RANFIS-based sensor system can be sufficiently applied in the field. Full article
Show Figures

Figure 1

Figure 1
<p>(<b>a</b>) Multi-sensor system structure; (<b>b</b>) a unit multi-sensor module; and (<b>c</b>) a multi-sensor system.</p>
Full article ">Figure 2
<p>Sensor module attachment position inside the chamber.</p>
Full article ">Figure 3
<p>Setting of VOC measurement sensor system.</p>
Full article ">Figure 4
<p>Comparison of normalized Sensor 1 data and reference data.</p>
Full article ">Figure 5
<p>ANFIS structure.</p>
Full article ">Figure 6
<p>RANFIS structure.</p>
Full article ">Figure 7
<p>Before and after outlier correction of Sensor 1 data positions 1, 4, 5, and 8.</p>
Full article ">Figure 8
<p>(<b>a</b>) Gradient compensation of Sensor 1 and (<b>b</b>) reconstructed data of Sensor 1.</p>
Full article ">Figure 9
<p>REF sensor and ANFIS and RANFIS results for Sensor 1 data.</p>
Full article ">Figure 10
<p>REF sensor and ANFIS and RANFIS results for Sensor 2 data.</p>
Full article ">Figure 11
<p>REF sensor and ANFIS and RANFIS results for Sensor 3 data.</p>
Full article ">Figure A1
<p>Comparison of normalized reference data and (<b>a</b>) Sensor 1, (<b>b</b>) Sensor 2, and (<b>c</b>) Sensor 3.</p>
Full article ">Figure A1 Cont.
<p>Comparison of normalized reference data and (<b>a</b>) Sensor 1, (<b>b</b>) Sensor 2, and (<b>c</b>) Sensor 3.</p>
Full article ">Figure A2
<p>Graph comparison by offset of (<b>a</b>) Sensor 1 (MQ135), (<b>b</b>) Sensor 2 (MQ138), and (<b>c</b>) Sensor 3 (PID-A15).</p>
Full article ">
13 pages, 5323 KiB  
Article
Advances in the Detection and Identification of Bacterial Biofilms Through NIR Spectroscopy
by Cristina Allende-Prieto, Lucía Fernández, Pablo Rodríguez-Gonzálvez, Pilar García, Ana Rodríguez, Carmen Recondo and Beatriz Martínez
Foods 2025, 14(6), 913; https://doi.org/10.3390/foods14060913 - 7 Mar 2025
Viewed by 89
Abstract
Bacterial biofilms play an important role in the pathogenesis of infectious diseases but are also very relevant in other fields such as the food industry. This fact has led to an increased focus on the early identification of these structures as prophylaxes to [...] Read more.
Bacterial biofilms play an important role in the pathogenesis of infectious diseases but are also very relevant in other fields such as the food industry. This fact has led to an increased focus on the early identification of these structures as prophylaxes to prevent biofilm-related contaminations or infections. One of the objectives of the present study was to assess the effectiveness of NIR (Near Infrared) spectroscopy in the detection and differentiation of biofilms from different bacterial species, namely Staphylococcus epidermidis, Staphylococcus aureus, Enterococcus faecium, Salmonella Typhymurium, Escherichia coli, Listeria monocytogenes, and Lactiplantibacillus plantarum. Additionally, we aimed to examine the capability of this technology to specifically identify S. aureus biofilms on glass surfaces commonly used as storage containers and processing equipment. We developed a detailed methodology for data acquisition and processing that takes into consideration the biochemical composition of these biofilms. To improve the quality of the spectral data, SNV (Standard Normal Variate) and Savitzky–Golay filters were applied, which correct systematic variations and eliminate random noise, followed by an exploratory analysis that revealed significant spectral differences in the NIR range. Then, we performed principal component analysis (PCA) to reduce data dimensionality and, subsequently, a Random Forest discriminant statistical analysis was used to classify biofilms accurately and reliably. The samples were organized into two groups, a control set and a test set, for the purpose of performing a comparative analysis. Model validation yielded an accuracy of 80.00% in the first analysis (detection and differentiation of biofilm) and 93.75% in the second (identification of biofilm on glass surfaces), thus demonstrating the efficacy of the proposed method. These results demonstrate that this technique is effective and reliable, indicating great potential for its application in the field of biofilm detection. Full article
(This article belongs to the Section Food Microbiology)
Show Figures

Figure 1

Figure 1
<p>Spectral signatures obtained after NIR measurement of each bacterial biofilm. Bacterial species and control are indicated on the bottom left.</p>
Full article ">Figure 2
<p>Random Forest performance: Influence of <span class="html-italic">mtry</span> on accuracy and stability.</p>
Full article ">Figure 3
<p>Distribution of the bacterial samples.</p>
Full article ">Figure 4
<p>Performance metrics of the Random Forest model.</p>
Full article ">Figure 5
<p>Average spectral signatures of contaminated and uncontaminated samples.</p>
Full article ">Figure 6
<p>Principal component analysis: cumulative variance explained.</p>
Full article ">
29 pages, 5292 KiB  
Article
Parameter Estimation of Noise-Disturbed Multivariate Systems Using Support Vector Regression Integrated with Random Search and Bayesian Optimization
by Jiawei Zheng and Xinchun Jie
Processes 2025, 13(3), 773; https://doi.org/10.3390/pr13030773 - 7 Mar 2025
Viewed by 62
Abstract
To achieve accurate control of Multi-Input and Multi-Output (MIMO) physical plants, it is crucial to obtain correct model expressions. In practice, the prevalence of both outliers and colored noise can cause serious interference with the industrial process, thus reducing the accuracy of the [...] Read more.
To achieve accurate control of Multi-Input and Multi-Output (MIMO) physical plants, it is crucial to obtain correct model expressions. In practice, the prevalence of both outliers and colored noise can cause serious interference with the industrial process, thus reducing the accuracy of the identification algorithm. The algorithm of support vector regression (SVR) is proposed to address the problem of parameter estimation for MIMO systems under interference from outliers and colored noise. In order to further improve the speed of parameter estimation, random search and Bayesian optimization algorithms were introduced, and the support vector regression combining stochastic search and Bayesian optimization (RSBO-SVR) algorithm was proposed. It was verified by simulation and tank experiments. The results showed that the method has strong anti-interference ability and can achieve high-precision parameter identification. The maximum relative error of the RSBO-SVR algorithm did not exceed 4% in both the simulation and experiment. It had a maximum reduction of 99.38% in runtime compared to SVR. Full article
(This article belongs to the Section Process Control and Monitoring)
Show Figures

Figure 1

Figure 1
<p>Schematic diagram of the SVR principle. (The circles are the data points that need to be fitted).</p>
Full article ">Figure 2
<p>MIMO system architecture diagram.</p>
Full article ">Figure 3
<p>The flowchart of SVR.</p>
Full article ">Figure 4
<p>Input signals.</p>
Full article ">Figure 5
<p>Diagrams of the RLS identification process of a and b (with outliers). (<b>a</b>) The identification process of parameter a; (<b>b</b>) The identification process of parameter b.</p>
Full article ">Figure 6
<p>The training and test fit plots of SVR (with outliers). (<b>a</b>) The training process of SVR; (<b>b</b>) The test process of SVR.</p>
Full article ">Figure 7
<p>The training and test fit plots of RSBO-SVR (with outliers). (<b>a</b>) The training process of RSBO-SVR; (<b>b</b>) The test process of RSBO-SVR.</p>
Full article ">Figure 8
<p>The training and test fit plots of SVR (with colored noise). (<b>a</b>) The training process of SVR; (<b>b</b>) The test process of SVR.</p>
Full article ">Figure 9
<p>The training and test fit plots of RSBO-SVR (with colored noise). (<b>a</b>) The training process of RSBO-SVR; (<b>b</b>) The test process of RSBO-SVR.</p>
Full article ">Figure 10
<p>The model diagram of the real water tank.</p>
Full article ">Figure 11
<p>The real input and output signals of tank 1. (<b>a</b>) The input signals; (<b>b</b>) The output signal.</p>
Full article ">Figure 12
<p>The training and test fit plots of tank 1 using SVR. (<b>a</b>) The training process of SVR; (<b>b</b>) The test process of SVR.</p>
Full article ">Figure 13
<p>The training and test fit plots of tank 1 using RSBO-SVR. (<b>a</b>) The training process of RSBO-SVR; (<b>b</b>) The test process of RSBO-SVR.</p>
Full article ">Figure 14
<p>(<b>a</b>) The output signals of the real and the estimated system (SVR); (<b>b</b>) the error between the real and estimated output (SVR).</p>
Full article ">Figure 15
<p>(<b>a</b>) The output signals of the real and the estimated system (RSBO-SVR); (<b>b</b>) the error between the real and estimated output (RSBO-SVR).</p>
Full article ">
9 pages, 2292 KiB  
Proceeding Paper
Noise-Aware UAV Path Planning in Urban Environment with Reinforcement Learning
by Shahin Sarhan, Marco Rinaldi, Stefano Primatesta and Giorgio Guglieri
Eng. Proc. 2025, 90(1), 3; https://doi.org/10.3390/engproc2025090003 - 7 Mar 2025
Viewed by 55
Abstract
This research presents a comprehensive approach for mitigating noise pollution from Unmanned Aerial Vehicles (UAVs) in urban environment by using Reinforcement Learning (RL) for flight path planning. Focusing on the city of Turin, Italy, the study utilizes its diverse urban architecture to develop [...] Read more.
This research presents a comprehensive approach for mitigating noise pollution from Unmanned Aerial Vehicles (UAVs) in urban environment by using Reinforcement Learning (RL) for flight path planning. Focusing on the city of Turin, Italy, the study utilizes its diverse urban architecture to develop a detailed 3D occupancy grid map, and a population density map. A dynamic noise source model adjusts noise emissions based on the UAV velocity, while acoustic ray tracing simulates noise propagation in the environment. The Deep Deterministic Policy Gradient (DDPG) algorithm optimizes flight paths, minimizing the noise impact, and balancing both the path length and the population density located under the UAV path. The simulation results demonstrate significant noise reduction, suggesting scalability and adaptability for global urban environments, contributing to sustainable urban air mobility by addressing noise pollution. Full article
Show Figures

Figure 1

Figure 1
<p>The geographical location of the area of study (Turin, Italy): (<b>a</b>) satellite image of the study area; (<b>b</b>) obstacle map of the study area; (<b>c</b>) population density map of the study area.</p>
Full article ">Figure 2
<p>The noise source: (<b>a</b>) the acoustic rays emitted from the source at an altitude of 40 m; (<b>b</b>) collision points of the unobstructed rays on the ground. (<b>c</b>) Synthetic representation of the scenario with buildings and the consequent reflection of acoustic rays.</p>
Full article ">Figure 3
<p>(<b>a</b>) Flight paths for the RL, A* and direct paths. (<b>b</b>)Velocity distribution along the RL flight path. (<b>c</b>) Heat Map of the SPL distribution along the RL path.</p>
Full article ">Figure 4
<p>(<b>a</b>) SPL distribution along the RL path; (<b>b</b>) SPL distribution along the A* path; (<b>c</b>) SPL distribution along the direct path; (<b>d</b>) Top view of noise impact in the environment for RL path; (<b>e</b>) Top view of noise impact in the environment for A* path; (<b>f</b>) Top view of noise impact in the environment for direct path.</p>
Full article ">
25 pages, 4295 KiB  
Article
Sound Change and Consonant Devoicing in Word-Final Sibilants: A Study of Brazilian Portuguese Plural Forms
by Wellington Mendes
Languages 2025, 10(3), 48; https://doi.org/10.3390/languages10030048 - 7 Mar 2025
Viewed by 169
Abstract
This study investigates consonant devoicing in Brazilian Portuguese (BP), in order to assess whether an ongoing sound change is taking place. We examine plural forms consisting of a stop consonant followed by a word-final sibilant, such as in redes [hedz] ~ [heds] ~ [...] Read more.
This study investigates consonant devoicing in Brazilian Portuguese (BP), in order to assess whether an ongoing sound change is taking place. We examine plural forms consisting of a stop consonant followed by a word-final sibilant, such as in redes [hedz] ~ [heds] ~ [hets] and sedes [sɛdz] ~ [sɛds] ~ [sɛts], focusing on the emergence of voiceless sibilants before word-initial vowels (e.g., redes amarelas, ‘yellow hammocks’). If sibilants remain voiceless despite a following vowel, this challenges the expected regressive voicing assimilation in BP and raises the question of the conditions under which this devoicing occurs. Data were collected through recordings of oral production from twenty Brazilian speakers, using reading and picture naming tasks. Sibilant voicing was quantified using harmonics-to-noise ratio (HNR). A linear mixed-effects model—including random intercepts and slopes for both speakers and words—reveals that sibilants are significantly more voiced before a vowel than before a pause, but this voicing is substantially reduced when the sibilant is preceded by voiceless consonants. These findings indicate an ongoing devoicing process at pre-vocalic word boundaries in BP, affecting clusters [pz, tz, kz] and [bz, dz, gz] alike. Spectrographic analyses indicate that not only the sibilants but also their preceding stop may exhibit devoicing. Moreover, minimal-pair considerations suggest that speakers potentially maintain sibilant voicing in certain lexical items to preserve intelligibility (e.g., gra[dz] ‘grades’ and se[dz] ‘headquarters’ vs. grá[ts] ‘free’ and se[ts] ‘sets’). Drawing on Exemplar Theory, we propose a competition between the influence of the phonological environment and word-final devoicing: sibilants are sometimes voiced due to a following vowel (e.g., botes argentinos [bɔtz ah.ʒẽ.’tʃi.nus] ‘Argentine boats’), but they often emerge as voiceless due to consonantal devoicing (e.g., [bɔts ah.ʒẽ.’tʃi.nus]), resulting in both expected and unexpected forms. We suggest that fine phonetic detail, whether associated with allophonic or emergent sound patterns, contributes to the construction of phonological representations. Full article
(This article belongs to the Special Issue Phonetics and Phonology of Ibero-Romance Languages)
Show Figures

Figure 1

Figure 1
<p>Spectrographic analysis of the phrase “redes argentinas” by three different speakers. In (<b>a</b>), speaker 1 pronounces the final /s/ as fully voiced [z]. In (<b>b</b>), speaker 2 pronounces the final /s/ with partial voicing. In (<b>c</b>), speaker 3 pronounces the final /s/ as fully voiceless [s].</p>
Full article ">Figure 2
<p>Voicing rates of final sibilants per following phonological context.</p>
Full article ">Figure 3
<p>Spectrographic analysis of the phrase “os alpes italianos”.</p>
Full article ">Figure 4
<p>Spectrographic analysis of the phrase “duas redes argentinas”.</p>
Full article ">Figure 5
<p>Voicing rates of final sibilants per preceding phonological context.</p>
Full article ">Figure 6
<p>Voicing rates per task type.</p>
Full article ">Figure 7
<p>Voicing rates per word and lexical frequency.</p>
Full article ">Figure 8
<p>Voicing rates per word.</p>
Full article ">Figure 9
<p>Voicing rates per individual.</p>
Full article ">
26 pages, 11365 KiB  
Article
Angle Estimation Based on Wave Path Difference Rate of Change Ambiguity Function
by Jianye Xu, Maozhong Fu and Zhenmiao Deng
Remote Sens. 2025, 17(5), 943; https://doi.org/10.3390/rs17050943 - 6 Mar 2025
Viewed by 113
Abstract
Modern radar systems commonly utilize monopulse angle estimation techniques for target angle estimation, with the phase comparison method being one of the most widely adopted approaches. While the phase comparison method achieves high estimation precision, it is highly susceptible to noise and exhibits [...] Read more.
Modern radar systems commonly utilize monopulse angle estimation techniques for target angle estimation, with the phase comparison method being one of the most widely adopted approaches. While the phase comparison method achieves high estimation precision, it is highly susceptible to noise and exhibits a suboptimal performance under low Signal-to-Noise Ratio (SNR) conditions, leading to a high SNR threshold. Moreover, conventional monopulse angle estimation methods provide limited target information, as a single measurement cannot reveal the target’s motion direction. To address these shortcomings, a novel approach based on the phase comparison method is proposed in this study, with the variation in the wave path difference modeled as a first-order motion model. By accumulating the conjugate-multiplied signals over multiple time steps, the Wave Path Difference Rate of Change Ambiguity Function (WPD-ROC AF) is constructed. A fast algorithm employing the 2D Chirp-Z Transform (2D-CZT) is proposed, enabling multi-pulse angle estimation through the identification of frequency and phase values corresponding to spectral peaks. Simulation results validate that the proposed method outperforms traditional monopulse angle estimation techniques under low-SNR conditions and effectively suppresses static clutter interference. Furthermore, the sign of the WPD-ROC AF is shown to be correlated with the target’s motion direction, providing practical utility for determining the direction of movement in remote sensing scenarios. Full article
(This article belongs to the Section Remote Sensing Image Processing)
Show Figures

Figure 1

Figure 1
<p>The schematic of the radar antenna array plane [<a href="#B22-remotesensing-17-00943" class="html-bibr">22</a>].</p>
Full article ">Figure 2
<p>Phase comparison angle estimation method principle diagram.</p>
Full article ">Figure 3
<p>The impact of conjugate multiplication on SNR. (<b>a</b>) The relationship between the SNR before and after conjugate multiplication; (<b>b</b>) the relationship between SNR loss and the original input SNR.</p>
Full article ">Figure 4
<p>The spectrum of the conjugate cross-correlation signal at eight consecutive time instances.</p>
Full article ">Figure 5
<p>The WPD-ROC AF obtained by accumulating eight conjugate cross-correlation signals. (<b>a</b>) WPD-ROC AF; (<b>b</b>) the cross-sectional plot of WPD-ROC AF.</p>
Full article ">Figure 6
<p>2D-FFT spectrum.</p>
Full article ">Figure 7
<p>2D-CZT spectrum.</p>
Full article ">Figure 8
<p>Comparison of spectrum at high and low SNR. (<b>a</b>) Doppler transform spectrum at high SNR; (<b>b</b>) Doppler transform spectrum at low SNR; (<b>c</b>) two-dimensional transform spectrum at low SNR.</p>
Full article ">Figure 9
<p>Comparison of WPD-ROC AF with and without denoising. (<b>a</b>) WPD-ROC AF without denoising; (<b>b</b>) denoised WPD-ROC AF.</p>
Full article ">Figure 10
<p>Comparison of angle estimation precision with and without denoising.</p>
Full article ">Figure 11
<p>Joint angle estimation algorithm flowchart.</p>
Full article ">Figure 12
<p>Monopulse signals spectrum from each quadrant. (<b>a</b>) Monopulse signal spectrum of quadrant A; (<b>b</b>) monopulse signal spectrum of quadrant B.</p>
Full article ">Figure 13
<p>Monopulse cross-correlation signal spectrum.</p>
Full article ">Figure 14
<p>The 2D spectrum of the signals from each quadrant. (<b>a</b>) The 2D spectrum of quadrant A signal; (<b>b</b>) the 2D spectrum of quadrant B signal.</p>
Full article ">Figure 15
<p>The 2D spectrum of the echo signals from each quadrant with static clutter removal. (<b>a</b>) The 2D spectrum of quadrant A signal with static clutter removal; (<b>b</b>) the 2D spectrum of quadrant B signal with static clutter removal.</p>
Full article ">Figure 16
<p>The comparison of WPD-ROC AF spectrum with and without static clutter removal. (<b>a</b>) The WPD-ROC AF after static clutter removal; (<b>b</b>) the WPD-ROC AF without static clutter removal.</p>
Full article ">Figure 17
<p>Comparison of WPD-ROC AF spectrum with different target wave path difference rates. (<b>a</b>) WPD-ROC AF with the target wave path difference rate of 1 <math display="inline"><semantics> <mrow> <mi mathvariant="normal">m</mi> <mo>/</mo> <mi mathvariant="normal">s</mi> </mrow> </semantics></math>; (<b>b</b>) WPD-ROC AF with the target wave path difference rate of −1 <math display="inline"><semantics> <mrow> <mi mathvariant="normal">m</mi> <mo>/</mo> <mi mathvariant="normal">s</mi> </mrow> </semantics></math>.</p>
Full article ">Figure 18
<p>Simulation of first-order model approximation error impact. (<b>a</b>) WPD Variation with Constant Rate; (<b>b</b>) WPD Variation with Acceleration.</p>
Full article ">Figure 19
<p>The WPD estimation precision.</p>
Full article ">Figure 20
<p>The ROC estimation precision.</p>
Full article ">Figure 21
<p>The ROC estimation precision of phase comparison monopulse angle estimation method.</p>
Full article ">Figure 22
<p>The angle estimation precision.</p>
Full article ">Figure 23
<p>The angle estimation accuracy.</p>
Full article ">Figure 24
<p>The comparison of angle estimation precision.</p>
Full article ">Figure 25
<p>The comparison of angle estimation accuracy.</p>
Full article ">Figure 26
<p>The angle estimation precision under different quadrant array spacings.</p>
Full article ">
Back to TopTop