Search Results (83)

Search Parameters:
Keywords = Wasserstein GAN

25 pages, 2484 KiB  
Article
Automatic Fault Classification in Photovoltaic Modules Using Denoising Diffusion Probabilistic Model, Generative Adversarial Networks, and Convolutional Neural Networks
by Carlos Roberto da Silveira Junior, Carlos Eduardo Rocha Sousa and Ricardo Henrique Fonseca Alves
Energies 2025, 18(4), 776; https://doi.org/10.3390/en18040776 - 7 Feb 2025
Viewed by 459
Abstract
Current techniques for fault analysis in photovoltaic (PV) plants involve either electrical performance measurements or image processing, as well as infrared thermography for visual inspection. Deep convolutional neural networks (CNNs) are machine learning algorithms that perform tasks involving images, such as image classification and object recognition. However, to train a model effectively to recognize different patterns, it is crucial to have a sufficiently balanced dataset. Unfortunately, this is not always feasible owing to the limited availability of publicly accessible datasets for PV thermographic data and the unequal distribution of different faults in real-world systems. In this study, three data augmentation techniques, namely geometric transformations (GTs), generative adversarial networks (GANs), and the denoising diffusion probabilistic model (DDPM), were combined with a CNN to classify faults in PV modules through thermographic images and identify the type of fault among 11 different classes (e.g., soiling, shadowing, and diode faults). Under cross-validation, the Wasserstein GAN (WGAN) and DDPM networks combined with the CNN achieved anomaly classification testing accuracies of 86.98% and 89.83%, respectively. These results demonstrate the effectiveness of both networks for accurately classifying anomalies in the dataset, and they corroborate the use of the diffusion model as a PV data augmentation technique when compared with other methods such as GANs and GTs.
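For orientation, the augmentation-then-classification pipeline summarized above can be sketched in a few lines of PyTorch; the folder names and the 1:2 real-to-synthetic ratio are illustrative assumptions, not the authors' exact setup.

```python
from torch.utils.data import ConcatDataset, DataLoader
from torchvision import datasets, transforms

# Hypothetical layout: one subfolder per fault class, with synthetic
# thermographic images pre-generated by a WGAN or DDPM.
tf = transforms.Compose([transforms.Grayscale(), transforms.ToTensor()])
real = datasets.ImageFolder("data/real_thermal", transform=tf)         # e.g., 20,000 images
synthetic = datasets.ImageFolder("data/ddpm_synthetic", transform=tf)  # e.g., 40,000 images

# The CNN fault classifier is then trained on the union of both sources.
train_loader = DataLoader(ConcatDataset([real, synthetic]),
                          batch_size=64, shuffle=True)
```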
Figure 1. Visualization of the layer order and dimensions within the convolutional neural network (CNN) architecture for the purpose of identifying and classifying faults in PV modules.
Figure 2. Architecture of a generative adversarial network (GAN). The generator leverages a latent vector to create synthetic data, which are then evaluated by the discriminator using both real and generated data. The discriminator outputs a probability indicating whether the input is real or fake, helping to improve the generator’s output quality over time. The diagram also highlights key components, such as convolutional layers, batch normalization, activation functions (ReLU, Sigmoid), and dropout for regularization.
Figure 3. Schematic representation of the layer order and dimensions in the diffusion model architecture, illustrating the process for generating synthetic images of photovoltaic (PV) modules. The layers are progressively concatenated to refine image synthesis, with the indicated dimensions showing the resolution at each stage of the process. Time embedding is incorporated to enhance the temporal consistency of the generated images.
Figure 4. Graph of the training accuracy of 100 epochs of the CNN anomaly identification network. The training was performed with 60,000 images, including 20,000 original images and 40,000 images generated using the GT data augmentation technique.
Figure 5. Confusion matrix graph of the CNN anomaly identification network. The training was performed with 60,000 images, including 20,000 original images and 40,000 images generated using the data augmentation technique.
Figure 6. Graph of the training accuracy of 100 epochs of the CNN anomaly identification network. Training was performed with 60,000 images, including 20,000 original images and 40,000 images generated using the Wasserstein GAN (WGAN) technique.
Figure 7. Confusion matrix graph of the CNN anomaly identification network. Training was performed with 60,000 images, including 20,000 original images and 40,000 images generated using the WGAN technique.
Figure 8. Graph of the training accuracy of 100 epochs of the CNN anomaly identification network. Training was performed with 60,000 images, including 20,000 original images and 40,000 images generated using the denoising diffusion probabilistic model (DDPM) technique.
Figure 9. Confusion matrix graph of the CNN anomaly identification network. Training was performed with 60,000 images, including 20,000 original images and 40,000 images generated using the DDPM technique.
Figure 10. Graph of the training accuracy of 100 epochs of the CNN anomaly classification network with balancing images from the database. The training was performed with 5775 images, including 1925 original images and 3850 images generated using the data augmentation technique.
Figure 11. Confusion matrix graph of the CNN anomaly classification network with database image balancing. The training was performed with 5775 images, including 1925 original images and 3850 images generated using the data augmentation technique.
Figure 12. Graph of the training accuracy of 100 epochs of the CNN anomaly classification network without balancing images from the database. The training was performed with 30,000 images, including 10,000 original images and 20,000 images generated using the data augmentation technique.
Figure 13. Confusion matrix graph of the CNN anomaly classification network without balancing images from the database. The training was performed with 30,000 images, including 10,000 original images and 20,000 images generated using the data augmentation technique.
Figure 14. Graph of the training accuracy of 100 epochs of the CNN anomaly classification network with image balancing using the WGAN. Training was performed with 33,000 images, 3300 images from each of the 11 classes of anomalies, which corresponds to 10,000 original images and 23,000 images generated by the GAN.
Figure 15. Confusion matrix graph of the CNN anomaly classification network with balancing images from the database using the WGAN. Training was carried out with 33,000 images, 3300 images from each of the 11 classes of anomalies, which corresponds to 10,000 original images and 23,000 images generated by the GAN.
Figure 16. Graph of the training accuracy of 100 epochs of the CNN anomaly classification network with image balancing using a diffusion network.
Figure 17. Confusion matrix graph of the CNN anomaly classification network with balancing images from the database using a diffusion network.
Figure 18. Number of images for each PV fault class. InfraredSolarModule (ISM) dataset, geometric transformation (GT), WGAN, DDPM [27].
Figure 19. Radar graph of the presented scenarios used to describe the success rates for each failure class analyzed. CNN CLASS BAL is the CNN with balanced data; CLASS UNB + DA is the CNN with unbalanced data and data augmentation by GT; CNN CLASS BAL + WGAN is the CNN with balanced data using the WGAN synthetic image generation technique; and CNN CLASS BAL + DIFF is the CNN with balanced data using the diffusion synthetic image generation technique.

13 pages, 1650 KiB  
Technical Note
Pano-GAN: A Deep Generative Model for Panoramic Dental Radiographs
by Søren Pedersen, Sanyam Jain, Mikkel Chavez, Viktor Ladehoff, Bruna Neves de Freitas and Ruben Pauwels
J. Imaging 2025, 11(2), 41; https://doi.org/10.3390/jimaging11020041 - 2 Feb 2025
Viewed by 493
Abstract
This paper presents the development of a generative adversarial network (GAN) for the generation of synthetic dental panoramic radiographs. While this is an exploratory study, the ultimate aim is to address the scarcity of data in dental research and education. A deep convolutional GAN (DCGAN) with the Wasserstein loss and a gradient penalty (WGAN-GP) was trained on a dataset of 2322 radiographs of varying quality. The focus of this study was on the dentoalveolar part of the radiographs; other structures were cropped out. Significant data cleaning and preprocessing were conducted to standardize the input formats while maintaining anatomical variability. Four candidate models were identified by varying the critic iterations, the number of features, and the use of denoising prior to training. To assess the quality of the generated images, a clinical expert evaluated a set of generated synthetic radiographs using a ranking system based on visibility and realism, with scores ranging from 1 (very poor) to 5 (excellent). Most generated radiographs showed moderate depictions of dentoalveolar anatomical structures, although they were considerably impaired by artifacts. The mean evaluation scores showed a trade-off between the model trained on non-denoised data, which showed the highest subjective quality for finer structures, such as the mandibular canal and trabecular bone, and one of the models trained on denoised data, which offered better overall image quality, especially in terms of clarity, sharpness, and overall realism. These outcomes serve as a foundation for further research into GAN architectures for dental imaging applications.
(This article belongs to the Special Issue Tools and Techniques for Improving Radiological Imaging Applications)
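As background for the WGAN-GP objective used in this and several of the following papers, here is a minimal PyTorch sketch of the gradient penalty term; the critic is any image-scoring network, and the penalty weight of 10 is the standard WGAN-GP default rather than a detail taken from the paper.

```python
import torch

def gradient_penalty(critic, real, fake):
    """WGAN-GP term: penalize deviation of the critic's gradient norm
    from 1 on random interpolations between real and fake batches."""
    eps = torch.rand(real.size(0), 1, 1, 1, device=real.device)
    interp = (eps * real + (1 - eps) * fake).requires_grad_(True)
    scores = critic(interp)
    grads = torch.autograd.grad(outputs=scores, inputs=interp,
                                grad_outputs=torch.ones_like(scores),
                                create_graph=True)[0]
    return ((grads.flatten(1).norm(2, dim=1) - 1) ** 2).mean()

# Critic loss: E[D(fake)] - E[D(real)] + 10 * gradient_penalty(critic, real, fake)
```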
Figure 1. An overview of WGAN-GP. The generator receives input noise and produces samples, which are compared against real data from the dataset in the discriminator. The figure illustrates the calculation of the critic loss (D loss) by real, fake, and gradient penalty terms, along with the generator loss (G loss). The gradient penalty term is highlighted alongside the Wasserstein distance in a separate box.
Figure 2. Generator G(z) and discriminator D(I) networks used in the methodology.
Figure 3. Examples of real (R), fake (F), and Gaussian noise (G) images. The t-SNE plot compares the feature embedding for R (blue cluster), F (red cluster), and G (green cluster) images.
Figure 4. Boxplots of observer scores for Models 1 and 2.
Figure 5. Radar plots for Models 1 (red) and 2 (blue).
Figure 6. Best images generated using Models 1 (top row) and 2 (bottom row).
Figure 7. Worst images among all model variants, showing poor overall anatomical depiction and severe artifacts.

14 pages, 17020 KiB  
Article
A Long Short-Term Memory–Wasserstein Generative Adversarial Network-Based Data Imputation Method for Photovoltaic Power Output Prediction
by Zhu Liu, Lingfeng Xuan, Dehuang Gong, Xinlin Xie and Dongguo Zhou
Energies 2025, 18(2), 399; https://doi.org/10.3390/en18020399 - 17 Jan 2025
Viewed by 467
Abstract
To address inaccurate predictions caused by missing data in PV power records, a photovoltaic power data imputation method based on a Wasserstein Generative Adversarial Network (WGAN) and a Long Short-Term Memory (LSTM) network is proposed. The method introduces a data-driven GAN framework with quasi-convex characteristics to ensure that the imputed data join smoothly with the existing data, and it employs a gradient penalty mechanism and a single-batch multi-iteration strategy for stable training. Through frequency-domain analysis, t-Distributed Stochastic Neighbor Embedding (t-SNE) metrics, and prediction performance validation of the generated data, the proposed method is shown to improve the continuity and reliability of data in photovoltaic prediction tasks.
(This article belongs to the Special Issue Forecasting of Photovoltaic Power Generation and Model Optimization)
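The frequency-domain check described above can be reproduced in outline with SciPy's DCT; the toy series and the stand-in for the imputed output below are assumptions for illustration, loosely following the comparison in the paper's Figures 3 and 5.

```python
import numpy as np
from scipy.fft import dct

rng = np.random.default_rng(0)
pv = np.clip(np.sin(np.linspace(0, 20 * np.pi, 1000)), 0, None)  # toy PV-like series
gaps = rng.choice(1000, size=100, replace=False)                 # simulated missing records

zero_filled = pv.copy()
zero_filled[gaps] = 0.0   # naive zero-padding baseline
imputed = pv.copy()       # stand-in for the LSTM-WGAN imputation output

# Compare the top five lowest-frequency DCT components of each version.
print("zero-fill:", dct(zero_filled, norm="ortho")[:5])
print("imputed:  ", dct(imputed, norm="ortho")[:5])
```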
Figure 1. LSTM-WGAN architecture for repairing the missing PV output data.
Figure 2. Intermittent power-generation records of photovoltaic power plants.
Figure 3. DCT results of PV power generation records: (a) PV power output of a power plant located in Alice Springs, Australia; (b,c) top five lowest-frequency components of the DCT following the zero-padding and 200-value filling methods for missing data, respectively.
Figure 4. Variation in generator and discriminator loss during training epochs.
Figure 5. DCT results of power generation records: (a) power generation of the power plant located in Alice Springs, Australia, after data imputation; (b,c) top five lowest-frequency components of the DCT following the zero-padding and our filling methods for missing data, respectively.
Figure 6. t-SNE visualization of real and generated data.

20 pages, 42222 KiB  
Article
WGAN-GP for Synthetic Retinal Image Generation: Enhancing Sensor-Based Medical Imaging for Classification Models
by Héctor Anaya-Sánchez, Leopoldo Altamirano-Robles, Raquel Díaz-Hernández and Saúl Zapotecas-Martínez
Sensors 2025, 25(1), 167; https://doi.org/10.3390/s25010167 - 31 Dec 2024
Viewed by 850
Abstract
Accurate synthetic image generation is crucial for addressing data scarcity challenges in medical image classification tasks, particularly in sensor-derived medical imaging. In this work, we propose a novel method using a Wasserstein Generative Adversarial Network with Gradient Penalty (WGAN-GP) and nearest-neighbor interpolation to generate high-quality synthetic images for diabetic retinopathy classification. Our approach enhances training datasets by generating realistic retinal images that retain critical pathological features. We evaluated the method across multiple retinal image datasets, including Retinal-Lesions, Fine-Grained Annotated Diabetic Retinopathy (FGADR), the Indian Diabetic Retinopathy Image Dataset (IDRiD), and the Kaggle Diabetic Retinopathy dataset. The proposed method outperformed traditional generative models, such as conditional GANs and PathoGAN, achieving the best performance on key metrics on the Kaggle dataset: a Fréchet Inception Distance (FID) of 15.21, a Mean Squared Error (MSE) of 0.002025, and a Structural Similarity Index (SSIM) of 0.89. Additionally, expert evaluations revealed that only 56.66% of synthetic images could be distinguished from real ones, demonstrating the high fidelity and clinical relevance of the generated data. These results highlight the effectiveness of our approach in improving medical image classification by generating realistic and diverse synthetic datasets.
(This article belongs to the Collection Medical Applications of Sensor Systems and Devices)
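The nearest-neighbor interpolation named in the abstract is commonly used inside GAN generators in place of transposed convolutions to avoid checkerboard artifacts; a generic PyTorch block of that kind might look as follows (the channel sizes and layer ordering are illustrative, not the authors' architecture).

```python
import torch.nn as nn

class NNUpsampleBlock(nn.Module):
    """Nearest-neighbor upsampling followed by a 3x3 convolution,
    a common substitute for transposed convolutions in generators."""
    def __init__(self, in_ch, out_ch):
        super().__init__()
        self.block = nn.Sequential(
            nn.Upsample(scale_factor=2, mode="nearest"),
            nn.Conv2d(in_ch, out_ch, kernel_size=3, padding=1),
            nn.BatchNorm2d(out_ch),
            nn.ReLU(inplace=True),
        )

    def forward(self, x):
        return self.block(x)
```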
Figure 1. Methodology diagram.
Figure 2. Diagram of the lesion extraction technique.
Figure 3. Style Transfer diagrams. (a) Diagram illustrating the perceptual loss process, utilizing VGG19 for feature extraction. (b) Diagram depicting the severity loss process, where a pretrained CNN is employed for retinal classification.
Figure 4. Images of each configuration where the real image does not have lesions. The image with the PathoGAN label is the implementation of [13]. The others are using WGAN-GP with different resizing algorithms, except cGAN. Underlined is the best FID result.
Figure 5. Comparison of generated images across different configurations, where the real image does not contain lesions. The images generated by WGAN-GP and PathoGAN exhibit smoothing effects, while the cGAN successfully transfers the noise from the original image. Underlined is the best FID result.
Figure 6. Comparison with generated and real image samples. The images generated using the proposed method exhibit colors and textures that are more similar to the real image. In contrast, the images generated by the cGAN and PathoGAN show color variations in areas where the real image does not present them. Underlined is the best FID result.
Figure 7. Comparison with generated and real image samples. The proposed method successfully extracts and preserves the color and texture of the original image, while the cGAN method displays different tones. Underlined is the best FID result.
Figure 8. Comparison with generated and real image samples. The proposed method successfully transfers lesions from the original images.
Figure 9. Sample images with lesions from the Retinal-Lesions database. Underlined is the best FID result.
Figure 10. Sample images with lesions from the FGADR database. Underlined is the best FID result.
Figure 11. Sample images with lesions from the IDRiD database. Underlined is the best FID result.
Figure 12. Sample images with lesions from the Kaggle database. Underlined is the best FID result.

22 pages, 5995 KiB  
Article
Research on 3D Localization of Indoor UAV Based on Wasserstein GAN and Pseudo Fingerprint Map
by Junhua Yang, Jinhang Tian, Yang Qi, Wei Cheng, Yang Liu, Gang Han, Shanzhe Wang, Yapeng Li, Chenghu Cao and Santuan Qin
Drones 2024, 8(12), 740; https://doi.org/10.3390/drones8120740 - 9 Dec 2024
Viewed by 835
Abstract
In addition to outdoor environments, unmanned aerial vehicles (UAVs) have a wide range of applications in indoor environments. The complex and changeable indoor environment and relatively small space make indoor localization of UAVs more difficult and urgent. An innovative 3D localization method for indoor UAVs using a Wasserstein generative adversarial network (WGAN) and a pseudo fingerprint map (PFM) is proposed in this paper, with the primary aim of enhancing localization accuracy and robustness in complex indoor environments. The proposed method integrates four classic matching localization algorithms with WGAN and PFM, demonstrating significant improvements in localization precision. Simulation results show that both the WGAN and PFM algorithms significantly reduce localization errors and enhance environmental adaptability and robustness in both small and large simulated indoor environments, confirming the robustness and efficiency of the proposed method in realistic indoor localization scenarios. In the inertial measurement unit (IMU)-based tracking algorithm, locating the UAV with the WGAN-processed fingerprint database instead of the initial coarse-grained fingerprint database reduces the localization error of the four algorithms by 30.3% on average. After using the PFM algorithm for matching localization, the localization error of the UAV is reduced by 28% on average.
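For context on the matching step, one of the classic fingerprint matching localization algorithms that such a pipeline builds on is weighted k-nearest neighbors (WKNN); the sketch below is a generic version under assumed array shapes, not the paper's implementation.

```python
import numpy as np

def wknn_locate(rss, db_rss, db_pos, k=4):
    """Weighted k-nearest-neighbor fingerprint matching: find the k
    reference cells whose stored RSS vectors best match the measured
    one, then average their 3D positions with inverse-distance weights."""
    d = np.linalg.norm(db_rss - rss, axis=1)   # distances in signal space
    idx = np.argsort(d)[:k]
    w = 1.0 / (d[idx] + 1e-9)
    return (w[:, None] * db_pos[idx]).sum(axis=0) / w.sum()

# db_rss: (n_cells, n_aps) fingerprint database; db_pos: (n_cells, 3) cell centers.
```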
Figure 1. Block diagram of indoor UAV localization proposed in this paper.
Figure 2. Different amounts of fingerprint data are extracted from the dense fingerprint database. (a) Full initial fingerprint database; (b) 1/200 of the initial fingerprint database; (c) 1/500 of the initial fingerprint database; (d) 1/1000 of the initial fingerprint database.
Figure 3. Schematic diagram of the fingerprint segmentation model for indoor drone localization.
Figure 4. Three-dimensional velocity over the minimum time interval calculated from IMU data.
Figure 5. Algorithm flow chart of the enhanced fingerprint database with WGAN.
Figure 6. Schematic diagram of the difference between WGAN and WGAN-IM.
Figure 7. Schematic diagram of the simulated environment, where four green cubes represent the routers. (a) Three-dimensional view; (b) plane view.
Figure 8. Display of the initial fingerprint database and the upgraded fingerprint database after the WGAN algorithm. (a) Initial fingerprint database R_I, which shows the fingerprint data of 4 APs in 75 RCs; (b) upgraded fingerprint database R_g, which shows the fingerprint data of 4 APs in 600 RCs.
Figure 9. The effect of UAV localization is shown using the coarse-grained initial fingerprint database before WGAN. (a) 3D view display; (b) plane view display.
Figure 10. The effect of UAV localization is shown using the upgraded fingerprint database after WGAN. (a) 3D view display; (b) plane view display.
Figure 11. Comparison of the localization results of the initial fingerprint database and the fingerprint database after WGAN processing; the average value for each database is taken after 1000 tracks are located.
Figure 12. Schematic diagram of UAV localization Scenario 2. This is a large indoor environment with a length, width, and height of 50 m, 30 m, and 6 m, respectively, in which 15 obstacles are randomly arranged. There are 15 APs evenly arranged on the ceiling, represented by small bright green cubes.
Figure 13. Display of the fingerprint database after the WGAN algorithm in Scenario 2, which shows the fingerprint data of 15 APs in 9000 RCs.
Figure 14. UAV real trajectory and results of four localization algorithms using single real fingerprint information.
Figure 15. UAV real trajectory and results of four localization algorithms using PFM.
Figure 16. Comparison results of localization without and with the PFM algorithm when the transmitted signal strength is set to −30 dBm.

26 pages, 3161 KiB  
Review
Survey of Quantum Generative Adversarial Networks (QGAN) to Generate Images
by Mohammadsaleh Pajuhanfard, Rasoul Kiani and Victor S. Sheng
Mathematics 2024, 12(23), 3852; https://doi.org/10.3390/math12233852 - 6 Dec 2024
Viewed by 1341
Abstract
Quantum Generative Adversarial Networks (QGANs) represent a useful development in quantum machine learning, using the particular properties of quantum mechanics to solve the challenges of data analysis and modeling. This paper presents a general analysis of five QGAN architectures, focusing on their evolution, strengths, weaknesses, and limitations on noisy intermediate-scale quantum (NISQ) devices. Foundational methods like the Entangling Quantum GAN (EQ-GAN) and Quantum state fidelity (QuGAN) concentrate on stability, convergence, and robust performance on small-scale datasets such as 2 × 2 grayscale images. Intermediate models such as the Image Quantum GAN (IQGAN) and Experimental Quantum GAN (EXQGAN) introduce new ideas like trainable encoders and patch-based sub-generators that scale to 8 × 8 datasets with increasing noise resilience. The most advanced method, the Parameterized Quantum Wasserstein GAN (PQWGAN), uses a hybrid quantum-classical structure to achieve high-resolution image processing for 28 × 28 grayscale datasets while aiming to maintain parameter efficiency. This study explores, analyzes, and summarizes critical problems of QGANs, including accuracy, convergence, parameter efficiency, image quality, performance metrics, and training stability under noisy conditions. In addition, QGAN development can inform the generation and training of parameters in quantum approximate optimization algorithms. One useful application of QGANs is generating medical images from limited datasets to train disease-recognition models.
Figure 1. Operation of the GANs loss function.
Figure 2. The CC means the data and the algorithms are classic, but the quantum concept, methods, or process has helped improve the classical algorithms. The CQ means the data is classic and the algorithms are quantum. The QC means the data is quantum (such as chemistry data) and the algorithms are classic. The QQ means the data and the algorithms are quantum. https://commons.wikimedia.org/wiki/File:Qml_approaches.tif?page=1 (accessed on 2 November 2024).
Figure 3. The view of QGAN.
Figure 4. The general structure of QGAN.
Figure 5. The structure of Quantum state fidelity.
Figure 6. Scheme of quantum generator in quantum patch GAN.
Figure 7. Scheme of quantum patch GAN.

21 pages, 3915 KiB  
Article
Boosting EEG and ECG Classification with Synthetic Biophysical Data Generated via Generative Adversarial Networks
by Archana Venugopal and Diego Resende Faria
Appl. Sci. 2024, 14(23), 10818; https://doi.org/10.3390/app142310818 - 22 Nov 2024
Viewed by 1014
Abstract
This study presents a novel approach using Wasserstein Generative Adversarial Networks with Gradient Penalty (WGAN-GP) to generate synthetic electroencephalography (EEG) and electrocardiogram (ECG) waveforms. The synthetic EEG data represent concentration and relaxation mental states, while the synthetic ECG data correspond to normal and abnormal states. By addressing the challenges of limited biophysical data, including privacy concerns and restricted volunteer availability, our model generates realistic synthetic waveforms learned from real data. Combining real and synthetic datasets improved classification accuracy from 92% to 98.45%, highlighting the benefits of dataset augmentation for machine learning performance. The WGAN-GP model achieved 96.84% classification accuracy for synthetic EEG data representing relaxation states and optimal accuracy for concentration states when classified using a fusion of convolutional neural networks (CNNs). A 50% combination of synthetic and real EEG data yielded the highest accuracy of 98.48%. For EEG signals, the real dataset consisted of 60-s recordings across four channels (TP9, AF7, AF8, and TP10) from four individuals, providing approximately 15,000 data points per subject per state. For ECG signals, the dataset contained 1200 real samples, each comprising 140 data points, representing normal and abnormal states. WGAN-GP outperformed a basic generative adversarial network (GAN) in generating reliable synthetic data. For ECG data, a support vector machine (SVM) classifier achieved an accuracy of 98% with real data and 95.8% with synthetic data. Synthetic ECG data improved the random forest (RF) classifier’s accuracy from 97% with real data alone to 98.40% when combined with synthetic data. Statistical significance was assessed using the Wilcoxon signed-rank test, demonstrating the robustness of the WGAN-GP model. Techniques such as discrete wavelet transform, downsampling, and upsampling were employed to enhance data quality. This method shows significant potential in addressing biophysical data scarcity and advancing applications in assistive technologies, human-robot interaction, and mental health monitoring, among other medical applications.
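The five-level discrete wavelet transform mentioned above is a one-liner with PyWavelets; the wavelet family ('db4') and the signal length below are assumptions for illustration, since the abstract does not specify them.

```python
import numpy as np
import pywt

rng = np.random.default_rng(0)
eeg = rng.standard_normal(1024)  # placeholder for one EEG channel

# Five-level DWT: wavedec returns [cA5, cD5, cD4, cD3, cD2, cD1].
cA5, cD5, cD4, cD3, cD2, cD1 = pywt.wavedec(eeg, wavelet="db4", level=5)

# The transform is invertible, so denoising can be done by thresholding
# detail coefficients and reconstructing with waverec.
restored = pywt.waverec([cA5, cD5, cD4, cD3, cD2, cD1], wavelet="db4")
print(np.allclose(eeg, restored[:len(eeg)]))  # True
```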
Figure 1. Five-level decomposition of EEG waves using discrete wavelet transform (DWT) into the approximation coefficient (cA) and detailed coefficients (cD1–cD5).
Figure 2. WGAN-GP architecture.
Figure 3. Workflow of synthetic EEG wave generation using the WGAN-GP model.
Figure 4. Two-dimensional CNN for EEG classification.
Figure 5. Interface of the synthetic EEG generator, visualization, and CNN classification.
Figure 6. EEG plot of the TP9 channel for Subject A in concentration and relaxation states using WGAN-GP.
Figure 7. PSD plot of the TP9 channel for Subject A in EEG concentration and relaxation states.
Figure 8. Real and synthetic normal ECG samples.
Figure 9. Real and synthetic abnormal ECG samples.
Figure 10. Bar chart of model accuracies with significance annotations. The label "ns" stands for no statistical significance and the label "*" indicates comparisons with statistical significance.
Figure 11. Heatmap of pairwise statistical significance.

Full article ">
32 pages, 8354 KiB  
Article
Estimation of Fractal Dimension and Detection of Fake Finger-Vein Images for Finger-Vein Recognition
by Seung Gu Kim, Jin Seong Hong, Jung Soo Kim and Kang Ryoung Park
Fractal Fract. 2024, 8(11), 646; https://doi.org/10.3390/fractalfract8110646 - 31 Oct 2024
Cited by 1 | Viewed by 1002
Abstract
With recent advancements in deep learning, spoofing techniques have advanced and generative adversarial networks (GANs) have become an emerging threat to finger-vein recognition systems. Previous research has therefore generated finger-vein images for training spoof detectors; however, these efforts remain limited and cannot yet produce elaborate fake finger-vein images. We therefore develop a new densely updated contrastive learning-based self-attention generative adversarial network (DCS-GAN) to create elaborate fake finger-vein images, enabling the training of corresponding spoof detectors. Additionally, we propose an enhanced convolutional network for a next-dimension (ConvNeXt)-Small model with a large kernel attention module as a new spoof detector capable of distinguishing the generated fake finger-vein images. To improve the spoof detection performance of the proposed method, we introduce fractal dimension estimation to analyze the complexity and irregularity of class activation maps from real and fake finger-vein images, enabling the generation of more realistic and sophisticated fake finger-vein images. Experimental results obtained using two open databases showed that the fake images produced by the DCS-GAN exhibited Fréchet inception distances (FID) of 7.601 and 23.351, with Wasserstein distances (WD) of 18.158 and 10.123, respectively, confirming the possibility of spoof attacks against existing state-of-the-art (SOTA) spoof detection frameworks. Furthermore, experiments conducted with the proposed spoof detector yielded average classification error rates of 0.4% and 0.12% on the two aforementioned open databases, respectively, outperforming existing SOTA methods for spoof detection.
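The Wasserstein distance (WD) reported above is, in its one-dimensional form, available directly in SciPy; the sketch below compares two synthetic stand-in distributions rather than the paper's image feature embeddings.

```python
import numpy as np
from scipy.stats import wasserstein_distance

rng = np.random.default_rng(0)
real_feats = rng.normal(0.0, 1.0, size=5000)  # stand-in for real-image features
fake_feats = rng.normal(0.3, 1.1, size=5000)  # stand-in for generated-image features

# 1-D earth mover's distance between the two empirical distributions;
# smaller values indicate the generator matches the real distribution better.
print(wasserstein_distance(real_feats, fake_feats))
```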
Figure 1. Overall flowchart of the proposed method.
Figure 2. Architecture of DCS-GAN.
Figure 3. Samples for the selection of input and target image for training the generator and discriminator of DCS-GAN. * denotes one image randomly chosen in the intra-class of the input image, excluding the input image.
Figure 4. Architecture of enhanced ConvNeXt-Small.
Figure 5. Sample images of real finger-veins in the databases. (a) Examples from the ISPR database and (b) examples from the Idiap database.
Figure 6. Examples of data augmentation on the Idiap database. (a) Original image, (b) image shifted upward, (c) image shifted downward, (d) image shifted to the left, (e) image shifted to the right.
Figure 7. Graphs for the training and validation loss of DCS-GAN. (a) Training loss graph of the generator and the discriminator. (b) Validation loss graph of the generator and the discriminator.
Figure 8. Training and validation accuracy (Acc) and loss (Loss) graphs of the enhanced ConvNeXt-Small. (a) Training accuracy and loss graphs. (b) Validation accuracy and loss graphs.
Figure 9. Sample images of fake finger-vein images generated by DCS-GAN and other SOTA methods. Examples of (a) original image and images generated by (b) Pix2Pix, (c) Pix2PixHD, (d) CycleGAN, (e) CUT, and (f) DCS-GAN.
Figure 10. FD estimation analysis for comparison between real and fake vein images: the first to the fourth images, from the left, in (a–h) mean finger vein image, CAM, BCAM, and FD graph, respectively. (a,c,e,g) show the real finger-vein images whereas (b,d,f,h) present the corresponding fake finger-vein images.
Figure 11. ROC curves of TPR according to FPR by the proposed and the SOTA methods on (a) the ISPR database and (b) the Idiap database.
Figure 12. Jetson TX2 board.
Figure 13. Examples of correct spoof detection by the proposed method. (a) and (c) are examples of real images from the ISPR and Idiap databases, respectively, and (b) and (d) are corresponding examples of fake images.
Figure 14. Examples of incorrect spoof detection by the proposed method. (a) and (c) are examples of real images from the ISPR and Idiap databases, respectively, and (b) and (d) are corresponding examples of fake images. In the proposed method, (b) and (d) are incorrectly identified as real images.
Figure 15. Grad-CAM images. (a) shows Grad-CAM images for real images, while (b) shows Grad-CAM images for fake images generated from the real images in (a). In both (a,b), the first row is from the ISPR database, and the second row is from the Idiap database. Each row starts with the input image on the far left, followed by Grad-CAM images acquired from the first ConvNeXt block, second ConvNeXt block, third ConvNeXt block, fourth ConvNeXt block, and LKA attention of Table 4, respectively.

12 pages, 1581 KiB  
Article
Airfoil Shape Generation and Feature Extraction Using the Conditional VAE-WGAN-gp
by Kazuo Yonekura, Yuki Tomori and Katsuyuki Suzuki
AI 2024, 5(4), 2092-2103; https://doi.org/10.3390/ai5040102 - 28 Oct 2024
Cited by 1 | Viewed by 1182
Abstract
A machine learning method was applied to solve an inverse airfoil design problem. A conditional VAE-WGAN-gp model, which couples a conditional variational autoencoder (VAE) with a Wasserstein generative adversarial network with gradient penalty (WGAN-gp), is proposed as an airfoil generation method and compared with the WGAN-gp and VAE models. The VAEGAN model couples the VAE and GAN models, which enables feature extraction in the GAN framework. In airfoil generation tasks that must produce shapes satisfying lift coefficient requirements, VAE is known to outperform WGAN-gp with respect to the accuracy of reproducing the lift coefficient, whereas GAN outperforms VAE with respect to the smoothness and variation of the generated shapes. In this study, VAE-WGAN-gp demonstrated good performance in all three aspects. The latent distribution was also studied to compare the feature extraction ability of the proposed method.
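Since the proposed model couples a conditional VAE with WGAN-gp, its encoder must sample latent vectors differentiably; the standard VAE reparameterization trick at the heart of any such coupling is sketched below (the encoder and decoder calls in the comments are hypothetical).

```python
import torch

def reparameterize(mu: torch.Tensor, logvar: torch.Tensor) -> torch.Tensor:
    """Standard VAE reparameterization: z = mu + sigma * eps keeps the
    latent sampling step differentiable w.r.t. the encoder outputs."""
    std = torch.exp(0.5 * logvar)
    return mu + std * torch.randn_like(std)

# mu, logvar = encoder(airfoil_coords, lift_condition)   # hypothetical encoder
# z = reparameterize(mu, logvar)
# shape = decoder(z, lift_condition)                     # hypothetical decoder
```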
Figure 1. Conditional GAN.
Figure 2. Conditional VAE.
Figure 3. Conditional VAEGAN.
Figure 4. Network architectures of the encoder, decoder, and discriminator.
Figure 5. Shape discretization.
Figure 6. Histogram of C_L.
Figure 7. Learning curve.
Figure 8. Generated shapes. Numbers on top of each shape represent the re-calculated C_L. Red figures indicate that the C_L calculation did not converge.
Figure 9. Generated shapes for C_L = 0.5. Different colors represent different shapes.
Figure 10. Latent distribution.

17 pages, 18662 KiB  
Article
Symmetric Connected U-Net with Multi-Head Self Attention (MHSA) and WGAN for Image Inpainting
by Yanyang Hou, Xiaopeng Ma, Junjun Zhang and Chenxian Guo
Symmetry 2024, 16(11), 1423; https://doi.org/10.3390/sym16111423 - 25 Oct 2024
Cited by 1 | Viewed by 1159
Abstract
This study presents a new image inpainting model based on U-Net and incorporating the Wasserstein Generative Adversarial Network (WGAN). The model uses skip connections to connect every encoder block to the corresponding decoder block, resulting in a strictly symmetrical architecture referred to as Symmetric Connected U-Net (SC-Unet). By combining SC-Unet with a GAN, the study aims to reconstruct images more effectively and seamlessly. Traditional discriminators only classify the entire image as real or fake. In this study, the discriminator calculated the probability of each pixel belonging to the hole and non-hole regions, which provided the generator with more gradient loss information for image inpainting. Additionally, every block of SC-Unet incorporated a Dilated Convolutional Neural Network (DCNN) to increase the receptive field of the convolutional layers. Our model also integrated Multi-Head Self-Attention (MHSA) into selected blocks to enable it to efficiently search the entire image for suitable content to fill the missing areas. This study adopts the publicly available CelebA-HQ and ImageNet datasets for evaluation. Our proposed algorithm demonstrates a 10% improvement in PSNR and a 2.94% improvement in SSIM compared to existing representative image inpainting methods in our experiments.
(This article belongs to the Section Computer)
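The PSNR and SSIM gains quoted above are standard full-reference metrics; a minimal evaluation sketch with scikit-image (assuming version 0.19 or later and float images in [0, 1]) is:

```python
import numpy as np
from skimage.metrics import peak_signal_noise_ratio, structural_similarity

rng = np.random.default_rng(0)
gt = rng.random((256, 256, 3))                                   # stand-in ground truth
pred = np.clip(gt + 0.02 * rng.standard_normal(gt.shape), 0, 1)  # stand-in inpainting result

print("PSNR:", peak_signal_noise_ratio(gt, pred, data_range=1.0))
print("SSIM:", structural_similarity(gt, pred, channel_axis=-1, data_range=1.0))
```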
Figure 1. SC-Unet architecture, with a 3 × 256 × 256 image as input.
Figure 2. (left) The original convolutional block of U-Net; (right) the Dilated Convolutional Neural Network in the convolutional block of SC-Unet.
Figure 3. Multi-Head Self-Attention in the convolutional block of SC-Unet.
Figure 4. The main framework of our image inpainting model.
Figure 5. Sample of results with the CelebA-HQ dataset. (a) Ground truth image. (b) Input image. (c) Reconstructed image. (d) Mask image. (e) Predicted image.
Figure 6. Sample of results with the ImageNet dataset. (a) Ground truth image. (b) Input image. (c) Reconstructed image. (d) Mask image. (e) Predicted image.
Figure 7. Comparison with GLCIC and CA models on ImageNet.
Figure 8. Comparison with GLCIC and CA models on CelebA-HQ.
Figure 9. Ablation study with different model modules on ImageNet.
Figure 10. Ablation study with different model modules on CelebA-HQ.
Figure A1. Ablation study with different model modules on ImageNet.
Figure A2. Ablation study with different model modules on ImageNet.
Figure A3. Ablation study with different model modules on CelebA-HQ.
Figure A4. Ablation study with different model modules on CelebA-HQ.

13 pages, 7413 KiB  
Article
A Study on Enhancing the Visual Fidelity of Aviation Simulators Using WGAN-GP for Remote Sensing Image Color Correction
by Chanho Lee, Hyukjin Kwon, Hanseon Choi, Jonggeun Choi, Ilkyun Lee, Byungkyoo Kim, Jisoo Jang and Dongkyoo Shin
Appl. Sci. 2024, 14(20), 9227; https://doi.org/10.3390/app14209227 - 11 Oct 2024
Viewed by 959
Abstract
When implementing outside-the-window (OTW) visuals in aviation tactical simulators, maintaining terrain image color consistency is critical for enhancing pilot immersion and focus. However, due to various environmental factors, inconsistent terrain image colors can cause visual confusion and diminish realism. To address these issues, a color correction technique based on a Wasserstein Generative Adversarial Network with Gradient Penalty (WGAN-GP) is proposed. The proposed WGAN-GP model uses multi-scale feature extraction and the Wasserstein distance to effectively measure and adjust the color distribution difference between the input image and the reference image, preserving the texture and structural characteristics of the image while maintaining color consistency. In particular, by converting Bands 2, 3, and 4 of the BigEarthNet-S2 dataset into RGB images to serve as references and preprocessing them to serve as inputs, it is demonstrated that the proposed model can handle large-scale remote sensing images containing various lighting conditions and color differences. The experimental results showed that the proposed WGAN-GP model outperformed traditional methods, such as histogram matching and color transfer, and was effective in transferring the style of the reference image to the target image while maintaining the target's structural elements during training. Quantitative analysis demonstrated that the mid-stage model achieved a PSNR of 28.93 dB and an SSIM of 0.7116, significantly outperforming traditional methods. Furthermore, the LPIPS score was reduced to 0.3978, indicating improved perceptual similarity. This approach can contribute to improving the visual elements of the simulator to enhance pilot immersion and has the potential to significantly reduce time and costs compared with the manual methods currently used by the Republic of Korea Air Force.
(This article belongs to the Special Issue Applications of Machine Learning Algorithms in Remote Sensing)
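One of the traditional baselines the paper compares against, histogram matching, is available directly in scikit-image (assuming version 0.19 or later); the arrays below are random placeholders for a terrain tile and a BigEarthNet-S2 RGB reference.

```python
import numpy as np
from skimage.exposure import match_histograms

rng = np.random.default_rng(0)
target = rng.random((128, 128, 3))     # tile whose colors need correcting
reference = rng.random((128, 128, 3))  # reference RGB tile

# Classic color-correction baseline: match each channel's histogram
# of the target image to the reference image.
corrected = match_histograms(target, reference, channel_axis=-1)
```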
Figure 1. An overview of the architecture of the WGAN-GP model.
Figure 2. Architecture of the Generator and Critic in the WGAN-GP model. (a) Architecture of the Generator; (b) architecture of the Critic.
Figure 3. BigEarthNet-S2 RGB images (reference images).
Figure 4. BigEarthNet-S2 preprocessing images (target images).
Figure 5. Color matching results of BigEarthNet-S2 RGB images and BigEarthNet-S2 preprocessing images. (a) Generated image by model at the early stage of training, (b) generated image by model at the mid-stage of training, and (c) generated image by fully trained model.
Figure 6. Precise texture reproduction results. (a) Generated image by model at the early stage of training, (b) generated image by model at the mid-stage of training, and (c) generated image by fully trained model.
Figure 7. Comparison with other methods' results. (a) Image processed using histogram matching, showing limitations in maintaining color consistency and significant information loss in texture and detail; (b) image processed using the color transfer technique, which also shows limitations in maintaining the ground truth's color consistency and lacks texture reproduction; (c) image generated by the early stage of the WGAN-GP-based model, where color distribution is irregular and texture representation is still underdeveloped; (d) image generated by the mid-stage model, demonstrating improved color matching and texture reproduction, with textures becoming more similar to the ground truth; and (e) image generated by the fully trained WGAN-GP model, showing a slight decrease in color consistency compared to the mid-stage model but offering superior texture reproduction compared to the other methods.

16 pages, 5561 KiB  
Article
A Hybrid GAN-Inception Deep Learning Approach for Enhanced Coordinate-Based Acoustic Emission Source Localization
by Xuhui Huang, Ming Han and Yiming Deng
Appl. Sci. 2024, 14(19), 8811; https://doi.org/10.3390/app14198811 - 30 Sep 2024
Viewed by 1712
Abstract
In this paper, we propose a novel approach to coordinate-based acoustic emission (AE) source localization to address the challenges of limited and imbalanced datasets from fiber-optic AE sensors used for structural health monitoring (SHM). We have developed a hybrid deep learning model combining four generative adversarial network (GAN) variants for data augmentation with an adapted Inception neural network for regression-based prediction. The experimental setup features a single fiber-optic AE sensor based on a tightly coiled fiber-optic Fabry-Perot interferometer formed by two identical fiber Bragg gratings. AE signals were generated using the Hsu-Nielsen pencil lead break test on a grid-marked thin aluminum plate with 35 distinct locations, simulating real-world structural monitoring conditions in bounded isotropic plate-like structures. It is demonstrated that the single-sensor configuration can achieve precise localization, avoiding the need for a multi-sensor array. The GAN-based signal augmentation expanded the dataset from 900 to 4500 samples, with the Wasserstein distance between the original and synthetic datasets decreasing by 83% after 2000 training epochs, demonstrating the high fidelity of the synthetic data. Among the GAN variants, the standard GAN architecture proved the most effective, outperforming the other variants in this specific application. The hybrid model exhibits superior performance compared with non-augmented deep learning approaches, with median error distribution comparisons revealing a significant 50% reduction in prediction errors, accompanied by substantially improved consistency across various AE source locations. Overall, this hybrid approach offers a promising solution for enhancing AE-based SHM in complex infrastructures, improving damage detection accuracy and reliability for more efficient predictive maintenance strategies.
(This article belongs to the Special Issue Advanced Optical-Fiber-Related Technologies)
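The t-SNE comparison of original and synthetic datasets used in the paper's evaluation can be reproduced in outline with scikit-learn; the feature matrices here are random stand-ins for the AE signal features.

```python
import numpy as np
from sklearn.manifold import TSNE

rng = np.random.default_rng(0)
real = rng.standard_normal((300, 64))                     # stand-in AE signal features
synthetic = real + 0.1 * rng.standard_normal(real.shape)  # stand-in generated features

emb = TSNE(n_components=2, perplexity=30,
           random_state=0).fit_transform(np.vstack([real, synthetic]))
# The first 300 rows embed the real samples, the rest the synthetic ones;
# overlapping clusters indicate high-fidelity generated data.
```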
Figure 1. Schematic of the fiber-optic coil-based acoustic emission sensing system. Inset: close-up image of the sensor, showing the flexible mounting and dimensions (8 mm outer, 6 mm inner diameter).
Figure 2. (a) Aluminum plate with the grid and fiber-optic sensor for AE testing; (b) schematic representation of the aluminum plate detailing the grid layout and test points.
Figure 3. Time series augmentation showing the original data (orange) and generated data (green) to ensure each label has a balanced and sufficient number of samples for improved deep learning model performance.
Figure 4. (a) Workflow of the hybrid network for AE source localization; (b) architecture of the Inception network for regression.
Figure 5. Architecture of the generator and discriminator networks in the GAN for AE signal augmentation.
Figure 6. The t-SNE visualization of synthetic and original datasets. (a) The training epoch of 1 for GAN; (b) the training epoch of 2000 for GAN; (c) the training epoch of 2000 for WGAN; (d) the training epoch of 2000 for DCGAN; (e) the training epoch of 2000 for TSAGAN; (f) augmentation via addition of noise.
Figure 7. The comparison of Wasserstein distance convergence across epochs for the four GAN variants (GAN, TSAGAN, WGAN, and DCGAN).
Figure 8. The comparison of acoustic emission (AE) source localization performance. (a) Results from the hybrid deep learning model with GAN-based data augmentation and Inception network. (b) Results from the Inception network alone without GAN-based augmentation. Square markers represent actual source locations, star markers show predicted locations, and the large circular marker indicates the sensor position. The x and y axes represent dimensions in inches.
Figure 9. The comparison of errors for the different methods.

12 pages, 2079 KiB  
Article
Research on Default Classification of Unbalanced Credit Data Based on PixelCNN-WGAN
by Yutong Sun, Yanting Ji and Xiangxing Tao
Electronics 2024, 13(17), 3419; https://doi.org/10.3390/electronics13173419 - 28 Aug 2024
Viewed by 1082
Abstract
Personal credit assessment plays a crucial role in the financial system; it not only relates to the financial activities of individuals but also affects the overall credit system and economic health of society. However, the problem of data imbalance affecting classification results in the field of personal credit assessment has not been fully solved. To solve this problem, we propose a data-enhanced classification algorithm based on a Pixel Convolutional Neural Network (PixelCNN) and a Wasserstein Generative Adversarial Network (WGAN). First, historical data containing borrowers' borrowing information are transformed into grayscale maps; then, data enhancement of default images is performed using the improved PixelCNN-WGAN model; and finally, the expanded image dataset is fed into CNN, AlexNet, SqueezeNet, and MobileNetV2 for classification. Results on the real-world LendingClub dataset show that the data enhancement algorithm designed in this paper improves the accuracy of the four algorithms by 1.548–3.568% compared with the original dataset, effectively improving the classification of credit data and offering a new approach to classification tasks in personal credit assessment.
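The grayscale-map transformation of borrower records described above can be sketched as follows; the 8 x 8 image size, min-max scaling, and zero-padding are illustrative assumptions, since the abstract does not specify the exact encoding.

```python
import numpy as np

def record_to_grayscale(record, size=8):
    """Min-max scale one borrower record to [0, 255] and reshape it into
    a size x size grayscale image, zero-padding short records."""
    x = np.asarray(record, dtype=np.float64)
    x = (x - x.min()) / (x.max() - x.min() + 1e-9)
    pixels = np.zeros(size * size)
    pixels[:min(len(x), size * size)] = x[:size * size]
    return (pixels.reshape(size, size) * 255).astype(np.uint8)

img = record_to_grayscale([35, 60000, 0.42, 12, 1, 3, 0.18])  # toy feature vector
```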
Figure 1: GAN structure diagram.
Figure 2: PixelCNN-WGAN model structure diagram.
Figure 3: Structure of PixelCNN.
Figure 4: Structure of the discriminator.
Figure 5: Pixel-WGAN default prediction flowchart.
Figure 6: A grayscale map of credit data; there are obvious differences between the images for normal samples and default samples in some areas.
16 pages, 4446 KiB  
Article
Method for Recognition of Communication Interference Signals under Small-Sample Conditions
by Rong Ge, Yusheng Li, Yonggang Zhu, Xiuzai Zhang, Kai Zhang and Minghu Chen
Appl. Sci. 2024, 14(13), 5869; https://doi.org/10.3390/app14135869 - 4 Jul 2024
Viewed by 892
Abstract
To address the difficulty of obtaining large numbers of labeled jamming signals in complex electromagnetic environments, this paper proposes a small-sample communication jamming signal recognition method based on WDCGAN-SA (Wasserstein Deep Convolutional Generative Adversarial Network with Self-Attention) and C-ResNet (Convolutional Block Attention Module–Residual Network). First, building on the DCGAN architecture, we integrate the Wasserstein distance and a gradient penalty mechanism to design the jamming signal generation model WDCGAN for data augmentation. Second, we introduce a self-attention mechanism so that the generative model attends to global correlation features in the time–frequency maps, and we optimize the training strategy to improve the quality of generated samples. Finally, real samples are mixed with generated samples and fed into the classification network, which exploits cross-channel and spatial information to improve the jamming signal recognition rate. Simulation results show that, under small-sample conditions with a Jamming-to-Noise Ratio (JNR) from −10 dB to 10 dB, the proposed algorithm significantly outperforms the GAN, WGAN, and DCGAN baselines in recognizing six types of communication jamming signals. Full article
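The Wasserstein distance plus gradient penalty that WDCGAN borrows is the standard WGAN-GP critic objective: score real and generated batches, then penalize the critic's gradient norm away from 1 on random interpolates between them. A minimal PyTorch sketch (the critic network, image shapes, and penalty weight are assumptions for illustration):

```python
import torch

def gradient_penalty(critic, real, fake, lambda_gp=10.0):
    """WGAN-GP penalty: push the critic's gradient norm toward 1 on
    random interpolates between real and generated image batches."""
    eps = torch.rand(real.size(0), 1, 1, 1, device=real.device)
    interp = (eps * real + (1 - eps) * fake).requires_grad_(True)
    scores = critic(interp)
    grads = torch.autograd.grad(
        outputs=scores, inputs=interp,
        grad_outputs=torch.ones_like(scores),
        create_graph=True, retain_graph=True,
    )[0]
    grad_norm = grads.reshape(grads.size(0), -1).norm(2, dim=1)
    return lambda_gp * ((grad_norm - 1.0) ** 2).mean()

# One critic step then combines the Wasserstein estimate with the penalty:
# d_loss = critic(fake).mean() - critic(real).mean() + gradient_penalty(critic, real, fake)
```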
Figure 1: The overall architecture of WDCGAN-SA and C-ResNet.
Figure 2: Time–frequency plots of six types of jamming signals.
Figure 3: Self-attention module.
Figure 4: Self-attention-embedded generative model WDCGAN-SA.
Figure 5: C-ResNet network structure.
Figure 6: Comparison of real samples and samples generated by two different network models.
Figure 7: Recognition accuracies of five different classification networks.
Figure 8: Confusion matrices for (a) the CNN classification network and (b) the proposed method.
Figure 9: Recognition accuracy with different quantities of generated samples.
Figure 10: Recognition rates of four different generative networks under various levels of sample-size enhancement: (a) 30 images; (b) 60 images; (c) 90 images.
23 pages, 5171 KiB  
Article
Image Enhancement Based on Dual-Branch Generative Adversarial Network Combining Spatial and Frequency Domain Information for Imbalanced Fault Diagnosis of Rolling Bearing
by Yuguang Huang, Bin Wen, Weiqing Liao, Yahui Shan, Wenlong Fu and Renming Wang
Symmetry 2024, 16(5), 512; https://doi.org/10.3390/sym16050512 - 24 Apr 2024
Cited by 2 | Viewed by 1253
Abstract
Existing 2D image-based imbalanced fault diagnosis methods for rolling bearings generate images with inadequate texture detail and degraded color. To address this, this paper proposes a novel image enhancement model for imbalanced rolling bearing fault diagnosis, based on a dual-branch generative adversarial network (GAN) that combines spatial- and frequency-domain information. First, the original vibration signals are converted into 2D time–frequency (TF) images by a continuous wavelet transform, and a dual-branch GAN with a symmetric structure is constructed: one branch uses an auxiliary classifier GAN (ACGAN) to process the spatial information of the TF images, while the other uses a GAN with a frequency generator and a frequency discriminator to handle the frequency information of the input images after a fast Fourier transform. A shuffle attention (SA) module is then integrated into the proposed model to improve the network's expressive power and reduce the computational burden. Mean squared error (MSE) terms are added to both generators' loss functions to enforce frequency consistency in the generated images, and a Wasserstein distance with gradient penalty is incorporated into both discriminators' losses to prevent vanishing gradients and mode collapse. Under the supervision of the frequency WGAN-GP branch, the ACWGAN-GP can generate high-quality fault samples to balance the dataset. Finally, the balanced dataset is used to train the auxiliary classifier for fault diagnosis. The effectiveness of the proposed method is validated on two rolling bearing datasets: with imbalance ratios of 0.5, 0.2, 0.1, and 0.05, the average classification accuracy reaches 99.35% on the CWRU bearing dataset and 96.62% on the MFS bearing dataset. Full article
(This article belongs to the Section Engineering and Materials)
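The frequency-consistency idea described above (an MSE term in the generator loss, computed after an FFT, alongside the adversarial term) can be sketched for the generator as follows; the weighting factor, tensor shapes, and use of FFT magnitudes are illustrative assumptions rather than the paper's exact formulation:

```python
import torch
import torch.nn.functional as F

def generator_loss(critic_scores_fake, fake_imgs, real_imgs, alpha=1.0):
    """Sketch of a WGAN-style generator loss with a frequency-consistency
    term: adversarial loss plus MSE between FFT magnitudes of generated
    and real time-frequency images. `alpha` weights the frequency term
    and is an assumed hyperparameter."""
    adv = -critic_scores_fake.mean()             # standard WGAN generator term
    fake_spec = torch.fft.fft2(fake_imgs).abs()  # frequency content of fakes
    real_spec = torch.fft.fft2(real_imgs).abs()  # frequency content of reals
    freq_mse = F.mse_loss(fake_spec, real_spec)  # frequency-consistency term
    return adv + alpha * freq_mse
```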
Figure 1: The structure of the ACGAN.
Figure 2: Framework of the proposed model.
Figure 3: Structure of the SA module: GAP denotes global average pooling; GN, group norm; F(x) = ωx + b; σ(·), the activation function; ⊗, the element-wise product; and C and S, the concat and channel-shuffle operators, respectively.
Figure 4: General structure of the proposed fault diagnosis method.
Figure 5: Comparison of image generation ability among different models: (a) normal; (b) BF7; (c) ORF14; (d) IRF14.
Figure 6: Confusion matrices of classification results for the CWRU datasets with different imbalance ratios.

To further illustrate the feature learning performance of the proposed diagnostic model on the various unbalanced datasets, we utilized the t-distributed stochastic neighbor embedding (t-SNE) algorithm [41] to visualize the classification results on the test set. As shown in Figure 7, the feature distributions of test samples with different health states in the four datasets exhibit significant differences; although a few samples have conflated feature distributions, their impact on the model's diagnosis results can be largely disregarded. This further validates the confusion matrices' classification results and underscores the proposed model's feature learning and fault diagnosis capability.

Figure 7: t-SNE visualization of classification results on the CWRU datasets with different imbalance ratios.
Figure 8: Comparison of test accuracies among different models across four unbalanced datasets.
Figure 9: Mechanical fault comprehensive simulation platform.
Figure 10: Bearings with different fault types: (a) IRF; (b) ORF; (c) BF; (d) CF.
Figure 11: Confusion matrices of classification results for the MFS datasets with different imbalance ratios.

To evaluate the diagnostic performance of the proposed model more intuitively, the t-SNE dimensionality-reduction results are shown in Figure 12. Each fault category still exhibits a distinct classification boundary, further indicating the model's excellent data generation and fault diagnosis performance, along with its strong generalization ability.

Figure 12: t-SNE visualization of classification results on the MFS datasets with different imbalance ratios.
Figure 13: Comparison of test accuracies among different models across four imbalanced datasets.