
Search Results (30)

Search Parameters:
Keywords = automatic mask removal

15 pages, 2289 KiB  
Article
Automatic Watershed Segmentation of Cancerous Lesions in Unsupervised Breast Histology Images
by Vincent Majanga and Ernest Mnkandla
Appl. Sci. 2024, 14(22), 10394; https://doi.org/10.3390/app142210394 - 12 Nov 2024
Viewed by 422
Abstract
Segmentation of nuclei in histology images is key to analyzing and quantifying morphological changes in nuclei features and tissue structures. Conventional diagnosis, segmentation, and detection methods have relied heavily on manual visual inspection of histology images; they are effective only on clearly visible cancerous lesions and are therefore limited by the complexity of tissue structures. Early detection of breast cancer is key for treatment and benefits from computer-aided diagnostic (CAD) systems introduced to efficiently and automatically segment and detect nuclei in pathology images. This paper proposes an automatic watershed segmentation method for cancerous lesions in unsupervised human breast histology images. Firstly, the approach pre-processes the data through various augmentation methods to increase the number of dataset images, and a stain normalization technique is then applied to the augmented images to isolate nuclei features from tissue structures. Secondly, data enhancement techniques, namely erosion, dilation, and the distance transform, are used to highlight foreground and background pixels while removing unwanted regions from the highlighted nuclei objects. The connected components method then groups highlighted pixels with similar intensity values and assigns them to their corresponding labeled binary mask. Once all binary mask groups have been determined, a deep-learning recurrent neural network built with Keras uses this information to automatically segment nuclei objects with cancerous lesions and their edges via watershed filling. The method is evaluated on an unsupervised, augmented human breast cancer histology dataset of 11,151 images and achieves an F1 score of 98%. Full article
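The pre-processing and watershed steps this abstract walks through (erosion/dilation, a distance transform, connected-component markers, watershed filling) map closely onto a classic OpenCV pipeline. The sketch below is a minimal illustration on a synthetic patch, not the authors' Keras-based method; the kernel size, iteration counts, and the 0.5 distance-transform threshold are assumptions.

```python
# Marker-based watershed sketch (synthetic data, assumed parameters); the paper's
# augmentation, stain normalization, and recurrent network are omitted.
import cv2
import numpy as np

# Synthetic "histology" patch: three overlapping bright nuclei on a dark background.
img = np.zeros((200, 200, 3), np.uint8)
for center in [(70, 80), (100, 95), (135, 115)]:
    cv2.circle(img, center, 30, (160, 60, 180), thickness=-1)

gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
_, binary = cv2.threshold(gray, 0, 255, cv2.THRESH_BINARY + cv2.THRESH_OTSU)

kernel = np.ones((3, 3), np.uint8)
opened = cv2.morphologyEx(binary, cv2.MORPH_OPEN, kernel, iterations=2)  # erosion + dilation
sure_bg = cv2.dilate(opened, kernel, iterations=3)                       # sure background

dist = cv2.distanceTransform(opened, cv2.DIST_L2, 5)                     # distance transform
_, sure_fg = cv2.threshold(dist, 0.5 * dist.max(), 255, 0)               # sure foreground cores
sure_fg = sure_fg.astype(np.uint8)
unknown = cv2.subtract(sure_bg, sure_fg)                                 # ambiguous border band

_, markers = cv2.connectedComponents(sure_fg)   # one labelled marker per nucleus
markers = markers + 1                           # keep label 0 free for the unknown band
markers[unknown == 255] = 0

markers = cv2.watershed(img, markers)           # flood-fill the image from the markers
img[markers == -1] = (0, 0, 255)                # object boundaries drawn in red
cv2.imwrite("watershed_overlay.png", img)
```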
Figures
Figure 1: Breast cancer histology images.
Figure 2: Original H&E image, augmented H&E image, normalized H&E image, normalized H image, and normalized E image, respectively.
Figure 3: Images after Otsu thresholding.
Figure 4: Image after noise removal via thresholding.
Figure 5: Image after clearing borders via the opening morphology operation.
Figure 6: Sure background image after dilation morphology.
Figure 7: Distance transform image.
Figure 8: Thresholding after distance transformation.
Figure 9: Connected component images.
Figure 10: Unsupervised BC (normalized H) histology images resulting from the proposed watershed segmentation method.
Figure 11: First row: original histology images of different human glands provided by the Warwick QU Dataset in the Kaggle dataset repository. Second row: resultant images after segmentation using the proposed watershed method.
Figure 12: Training loss and accuracy curves.
Figure 13: Validation loss and accuracy curves.

20 pages, 7605 KiB  
Article
A Novel Adversarial Example Detection Method Based on Frequency Domain Reconstruction for Image Sensors
by Shuaina Huang, Zhiyong Zhang and Bin Song
Sensors 2024, 24(17), 5507; https://doi.org/10.3390/s24175507 - 25 Aug 2024
Viewed by 1184
Abstract
Convolutional neural networks (CNNs) have been extensively used in numerous remote sensing image detection tasks owing to their exceptional performance. Nevertheless, CNNs are often vulnerable to adversarial examples, which limits their use in safety-critical scenarios. Recently, how to efficiently detect adversarial examples and improve the robustness of CNNs has drawn considerable attention. Existing adversarial example detection methods require modifying the CNN, which not only affects model performance but also greatly increases the training cost. To solve these problems, this study proposes a detection algorithm for adversarial examples that does not require modification of the CNN models and simultaneously retains the classification accuracy of normal examples. Specifically, we design a method to detect adversarial examples using frequency domain reconstruction. After converting the input adversarial examples into the frequency domain by Fourier transform, the disturbance introduced by adversarial attacks can be eliminated by modifying the frequencies of the example. The inverse Fourier transform is then used to maximize the recovery of the original example. Firstly, we train a CNN to reconstruct input examples. Then, we insert a Fourier transform, a convolution operation, and an inverse Fourier transform into the feature pipeline of the input examples to automatically filter out adversarial frequencies. We refer to the proposed method as FDR (frequency domain reconstruction); it removes adversarial interference by converting input samples into the frequency domain and reconstructing them back into the spatial domain to restore the image. In addition, we introduce gradient masking into FDR to enhance the detection accuracy for complex adversarial examples. We conduct extensive experiments with five mainstream adversarial attacks on three benchmark datasets, and the results show that FDR outperforms state-of-the-art solutions in detecting adversarial examples. Additionally, FDR does not require any modifications to the detector and can be integrated with other adversarial example detection methods and installed in sensing devices to ensure detection safety. Full article
(This article belongs to the Section Sensing and Imaging)
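As a rough illustration of the frequency-domain reconstruction idea (not the trained FDR network itself), the sketch below low-pass filters an example in the Fourier domain and reconstructs it with the inverse transform; a large change in the classifier's output between an input and its reconstruction would then flag a likely adversarial example. The cutoff radius and the random stand-in input are assumptions.

```python
# Frequency-domain reconstruction sketch (assumed cutoff, synthetic input).
import numpy as np

def frequency_reconstruct(image: np.ndarray, keep_radius: int = 24) -> np.ndarray:
    """Suppress high frequencies in the Fourier domain and return the reconstruction."""
    spectrum = np.fft.fftshift(np.fft.fft2(image, axes=(0, 1)), axes=(0, 1))
    h, w = image.shape[:2]
    yy, xx = np.ogrid[:h, :w]
    mask = (yy - h // 2) ** 2 + (xx - w // 2) ** 2 <= keep_radius ** 2
    if spectrum.ndim == 3:                      # broadcast the mask over colour channels
        mask = mask[..., None]
    recon = np.fft.ifft2(np.fft.ifftshift(spectrum * mask, axes=(0, 1)), axes=(0, 1))
    return np.real(recon)

x = np.random.rand(224, 224, 3).astype(np.float32)   # stand-in for an input example
x_rec = frequency_reconstruct(x)
# Comparing a classifier's predictions on x and x_rec would provide the detection signal.
print(x_rec.shape, float(np.abs(x - x_rec).mean()))
```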
Figures
Figure 1: Comparison of frequency domain graphs of original example, adversarial example and reconstructed example. The first column is the spatial domain diagram of the original example, and the rest is the frequency domain diagram.
Figure 2: (a) The FDR structure first extracts the features of the input attack example, then performs Fourier transform to the frequency domain, and finally eliminates the attack in the frequency domain and reconstructs the original image. (b) The FDR with gradient masking structure first resizes the attack examples and extracts the features of the input attack examples, then performs Fourier transform to the frequency domain, and finally eliminates the attack in the frequency domain and reconstructs the original image before resizing to recover the input size.
Figure 3: We select MJSMA attack algorithms to conduct experiments on three datasets and the number of iterations is 10–100.
Figure 4: We select four attack algorithms to conduct experiments on the MNIST dataset and the number of iterations of DeepFool and C&W attack algorithm is 10 or 100, respectively. We compare the proposed algorithm with 3 different detection algorithms.
Figure 5: We select four attack algorithms to conduct experiments on the CIFAR10 dataset, and the disturbance value of the FGSM and BIM attack algorithms is 0.03–0.30. We compare the proposed algorithm with 3 different detection algorithms.
Figure 6: We select four attack algorithms to conduct experiments on the SVHN dataset, and the disturbance value of the FGSM and BIM attack algorithms is 0.03–0.30. We compare the proposed algorithm with 3 different detection algorithms.
Figure 7: We visualize the recovery results of the FDR GM method and other attack example detection methods under different attack algorithms.

20 pages, 63242 KiB  
Article
Crater Detection and Population Statistics in Tianwen-1 Landing Area Based on Segment Anything Model (SAM)
by Yaqi Zhao and Hongxia Ye
Remote Sens. 2024, 16(10), 1743; https://doi.org/10.3390/rs16101743 - 14 May 2024
Viewed by 1132
Abstract
Crater detection is useful for research on dating a planetary surface's age and for geological mapping. The high-resolution imaging camera (HiRIC) carried by the Tianwen-1 rover provides digital image model (DIM) datasets with a resolution of 0.7 m/pixel, which are suitable for detecting meter-scale craters. Existing deep-learning-based automatic crater detection algorithms require large crater annotation datasets for training; however, there is currently a lack of optical-image datasets of small craters. In this study, we propose a model based on the Segment Anything Model (SAM) to detect craters in Tianwen-1's landing area and perform statistical analysis. The SAM network was used to obtain segmentation masks of the craters from the DIM images. Non-circular filtering was then used to remove irregular craters. Finally, deduplication and removal of false positives were performed to obtain accurate circular craters, whose center positions and diameters were obtained through circular fitting analysis. We extracted 841,727 craters in total, with diameters ranging from 1.57 m to 7910.47 m. These data are useful for extending Martian crater catalogs and crater datasets. Additionally, the crater size–frequency distribution (CSFD) was analyzed, indicating that the surface age of the Tianwen-1 landing area is ~3.25 billion years, with subsequent surface resurfacing events occurring ~1.67 billion years ago. Full article
(This article belongs to the Special Issue Planetary Geologic Mapping and Remote Sensing (Second Edition))
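The geometric post-processing described above is straightforward to sketch: each SAM segment's circularity (4πA/P²) decides whether it is kept, and a fitted circle yields the crater centre and diameter. SAM inference itself is omitted here, and the 0.85 circularity threshold is an assumed value, not necessarily the one used in the paper.

```python
# Non-circular filtering and circular fitting of a crater mask (assumed threshold).
import cv2
import numpy as np

def crater_from_mask(mask: np.ndarray, min_circularity: float = 0.85):
    """Return (cx, cy, diameter) for a near-circular binary mask, else None."""
    contours, _ = cv2.findContours(mask.astype(np.uint8), cv2.RETR_EXTERNAL,
                                   cv2.CHAIN_APPROX_NONE)
    if not contours:
        return None
    contour = max(contours, key=cv2.contourArea)
    area = cv2.contourArea(contour)
    perimeter = cv2.arcLength(contour, closed=True)
    if perimeter == 0:
        return None
    circularity = 4.0 * np.pi * area / perimeter ** 2
    if circularity < min_circularity:                     # non-circular filtering
        return None
    (cx, cy), radius = cv2.minEnclosingCircle(contour)    # circular fitting
    return cx, cy, 2.0 * radius

demo = np.zeros((200, 200), np.uint8)                     # synthetic circular segment
cv2.circle(demo, (100, 100), 40, 1, thickness=-1)
print(crater_from_mask(demo))                             # roughly (100, 100, 80)
```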
Figures
Figure 1: Digital image map (DIM) of Tianwen-1's landing area. (a) The 15 DIMs of the landing area. The red star in the image represents the landing location of the Zhurong rover. (b) DIM of the first region in the landing area (HX1_GRAS_HIRIC_DIM_0.7_0001_254537N1095850E_A).
Figure 2: Network structure of the SAM for the automatic identification of craters (different colors in the output image represent different segmentation targets).
Figure 3: Segmented images from SAM (different colors in the output image represent different segmentation targets). (a) DIM of the landing area. (b) Segmented images outputted by SAM.
Figure 4: Cumulative distribution curve of the circularity of objects segmented using SAM.
Figure 5: Non-circular filtering of the segmentation objects generated by SAM (green represents non-circular segmentation targets, while red represents circles). (a) Segmentation results outputted by SAM. (b) Results after filtering non-circular objects.
Figure 6: Circular fitting of the mask edges extracted by SAM: (a) before circular fitting; (b) after circular fitting.
Figure 7: The distribution of crater sizes and quantities resulting from different cropping sizes.
Figure 8: Results of extracting craters from datasets with different cropping sizes.
Figure 9: (a) Before removing false craters (IOU = 0.8). (b) After removing false craters (IOU = 0.7).
Figure 10: Crater labels for the three subregions of the landing area.
Figure 11: The results of extracting craters in two small regions.
Figure 12: Crater extraction results from different regions of the two landing areas. (a) HX1_GRAS_HIRIC_DIM_0.7_0001_254537N1095850E_A. (b) HX1_GRAS_HIRIC_DIM_0.7_0007_244453N1095850E_A.
Figure 13: (a) Statistics of craters with different diameter ranges. (b) Map of crater density in the main landing region.
Figure 14: Craters' size–frequency distribution and estimation of the age of the landing area. Craters with diameters greater than 1 km, black; diameters ranging from 50 to 800 m, red; estimation of the age using the RH_2012 dataset, green [5,44,46].
Figure 15: Morphological analysis of undetected craters: (a) irregularly shaped craters; (b) overlooked large-sized craters.

16 pages, 1308 KiB  
Article
Classification of Rainfall Intensity and Cloud Type from Dash Cam Images Using Feature Removal by Masking
by Kodai Suemitsu, Satoshi Endo and Shunsuke Sato
Climate 2024, 12(5), 70; https://doi.org/10.3390/cli12050070 - 12 May 2024
Cited by 1 | Viewed by 1752
Abstract
Weather Report is an initiative from Weathernews Inc. to obtain sky images and current weather conditions from the users of its weather app. This approach can provide supplementary weather information to radar observations and can potentially improve the accuracy of forecasts. However, since the time and location of the contributed images are limited, gathering data from other sources is also necessary. This study proposes a system that automatically submits weather reports using a dash cam with communication capabilities and image recognition technology. The system aims to provide detailed weather information by classifying rainfall intensities and cloud formations from images captured by dash cams. In fine-grained image classification tasks, there are very subtle differences between some classes and only a few samples per class, so models tend to pick up irrelevant details, such as the background, during training, leading to bias. One solution is to remove useless features from the images by masking them using semantic segmentation, then train on each masked dataset using EfficientNet and evaluate the resulting accuracy. In the classification of rainfall intensity, the model utilizing the features of the entire image achieved up to 92.61% accuracy, which is 2.84% higher than the model trained specifically on road features. This outcome suggests the significance of considering information from the whole image when determining rainfall intensity. Furthermore, analysis using the Grad-CAM visualization technique revealed that classifiers trained on masked dash cam images focused particularly on car headlights when classifying rainfall intensity. For cloud type classification, the model focusing solely on the sky region attained an accuracy of 68.61%, which is 3.16% higher than that of the model trained on the entire image. This indicates that concentrating on the features of clouds and the sky enables more accurate classification and that eliminating irrelevant areas reduces misclassifications. Full article
(This article belongs to the Special Issue Extreme Weather Detection, Attribution and Adaptation Design)
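The feature-removal-by-masking step can be sketched as follows: a segmentation mask (here a hypothetical, hard-coded sky region) zeroes out everything outside the region of interest, and the masked image is what the classifier would then be trained on. The frame and mask are synthetic stand-ins, and the EfficientNet training loop is omitted.

```python
# Feature removal by masking (synthetic frame and mask).
import numpy as np

def apply_region_mask(image: np.ndarray, region_mask: np.ndarray) -> np.ndarray:
    """Keep pixels where region_mask is True; set everything else to black."""
    return image * region_mask[..., None].astype(image.dtype)

frame = np.random.randint(0, 256, (180, 320, 3), dtype=np.uint8)  # stand-in dash cam frame
sky_mask = np.zeros((180, 320), dtype=bool)
sky_mask[:80, :] = True                                           # hypothetical sky region

masked_frame = apply_region_mask(frame, sky_mask)                 # input to the cloud-type model
print(masked_frame.shape, int(masked_frame[120:].max()))          # lower (non-sky) rows are zero
```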
Figures
Figure 1: Example of a weather report. The user selects the current weather conditions, the perceived temperature, and the five-sense forecast from a list of options.
Figure 2: Flow of automatic reporting. The proposed system consists of six steps.
Figure 3: Example of images for each weather label.
Figure 4: Procedure for creating masked images (upper: rainfall intensity dataset, lower: cloud dataset).
Figure 5: Examples of masked images. (a) Rainfall intensity dataset: road mask, (b) rainfall intensity dataset: failed road mask, (c) cloud type dataset: sky mask, (d) cloud type dataset: failed sky mask.
Figure 6: Example images for each rainfall intensity label.
Figure 7: Visualization of classification basis via Grad-CAM (upper: whole-area model, lower: road-area model). Red boxes: focus area.
Figure 8: Example images for each cloud label. Cb: cumulonimbus, Ns: nimbostratus, Other: other nine clouds.
Figure 9: Example images for Ns: nimbostratus clouds, Sc: stratocumulus clouds, and St: stratus clouds.
Figure 10: Visualization of classification basis via Grad-CAM (upper: whole-area model; lower: sky-area model). Red boxes: focus area.

21 pages, 20756 KiB  
Article
A Novel Method for Cloud and Cloud Shadow Detection Based on the Maximum and Minimum Values of Sentinel-2 Time Series Images
by Kewen Liang, Gang Yang, Yangyan Zuo, Jiahui Chen, Weiwei Sun, Xiangchao Meng and Binjie Chen
Remote Sens. 2024, 16(8), 1392; https://doi.org/10.3390/rs16081392 - 15 Apr 2024
Viewed by 1621
Abstract
Automatic and accurate detection of clouds and cloud shadows is a critical aspect of optical remote sensing image preprocessing. This paper provides a time series maximum and minimum mask method (TSMM) for cloud and cloud shadow detection. Firstly, the Cloud Score+S2_HARMONIZED (CS+S2) is employed as a preliminary mask for clouds and cloud shadows. Secondly, we calculate the ratio of the maximum and sub-maximum values of the blue band in the time series, as well as the ratio of the minimum and sub-minimum values of the near-infrared band in the time series, to eliminate noise from the time series data. Finally, the maximum value of the clear blue band and the minimum value of the near-infrared band after noise removal are employed for cloud and cloud shadow detection, respectively. A national and a global dataset were used to validate the TSMM, and it was quantitatively compared against five other advanced methods or products. When clouds and cloud shadows are detected simultaneously, in the S2ccs dataset, the overall accuracy (OA) reaches 0.93 and the F1 score reaches 0.85. Compared with the most advanced CS+S2, there are increases of 3% and 9%, respectively. In the CloudSEN12 dataset, compared with CS+S2, the producer’s accuracy (PA) and F1 score show increases of 10% and 4%, respectively. Additionally, when applied to Landsat-8 images, TSMM outperforms Fmask, demonstrating its strong generalization capability. Full article
(This article belongs to the Special Issue Satellite-Based Cloud Climatologies)
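A hedged numpy sketch of the TSMM compositing logic, under these assumptions: a preliminary mask (the paper uses CS+S2) flags obviously contaminated observations; per-pixel clear-sky bounds come from the blue-band maximum and near-infrared minimum over the remaining dates, with the sub-maximum/sub-minimum ratio test discarding residual spikes; an observation brighter than the clear blue maximum is flagged as cloud and one darker than the clear NIR minimum as shadow. The 1.5 and 0.5 ratio thresholds are illustrative, not the paper's values.

```python
# TSMM-style maximum/minimum compositing sketch (assumed thresholds, synthetic data).
import numpy as np

def tsmm_detect(blue, nir, prelim_clear, max_ratio=1.5, min_ratio=0.5):
    """blue, nir, prelim_clear: (time, H, W) stacks; prelim_clear is True where a
    preliminary mask (e.g. CS+S2) considers the observation clear."""
    blue_sorted = np.sort(np.where(prelim_clear, blue, -np.inf), axis=0)
    nir_sorted = np.sort(np.where(prelim_clear, nir, np.inf), axis=0)

    blue_max, blue_submax = blue_sorted[-1], blue_sorted[-2]
    nir_min, nir_submin = nir_sorted[0], nir_sorted[1]

    # Noise removal: an isolated spike relative to the runner-up is residual
    # cloud/shadow missed by the preliminary mask, so fall back to the runner-up.
    eps = 1e-6
    clear_blue_max = np.where(blue_max / (blue_submax + eps) > max_ratio,
                              blue_submax, blue_max)
    clear_nir_min = np.where(nir_min / (nir_submin + eps) < min_ratio,
                             nir_submin, nir_min)

    cloud = blue > clear_blue_max[None]     # brighter than any clear observation
    shadow = nir < clear_nir_min[None]      # darker than any clear observation
    return cloud, shadow

rng = np.random.default_rng(0)
t, h, w = 12, 64, 64
cloud, shadow = tsmm_detect(rng.random((t, h, w)), rng.random((t, h, w)),
                            np.ones((t, h, w), dtype=bool))
print(cloud.shape, float(cloud.mean()), float(shadow.mean()))
```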
Figures
Graphical abstract
Figure 1: Seven study areas of the S2ccs dataset.
Figure 2: Seven representative study areas in CloudSEN12.
Figure 3: TSMM technical flow chart. It is mainly composed of (I) pretreatment (in gray), (II) maximum and minimum composite (in yellow), (III) cloud and cloud shadow extraction (in green).
Figure 4: Sequence composite diagram. The upper half of the image represents the sequence image of the cloud and the cloud shadow mask. (a) The maximum composite image with cloud noise identifier, (b) the maximum composite image, (c) the sub-maximum composite image, (d) the sub-minimum composite image, (e) the minimum composite image, and (f) the minimum composite image with shadow noise identifier.
Figure 5: Time series length and max–min magnification sensitivity experiments. The first row is the accuracy evaluation of clouds, the second row is the accuracy evaluation of cloud shadows, the third row is the accuracy evaluation of clouds and cloud shadows, and the fourth row is the clear accuracy evaluation. The accuracy evaluations were OA, UA, PA, and F1, respectively. The red point coordinates are expressed as the parameters with the highest accuracy.
Figure 6: Convolution kernel size and neighborhood mean sensitivity experiments. The first row is the accuracy evaluation of clouds, the second row is the accuracy evaluation of cloud shadows, the third row is the accuracy evaluation of clouds and cloud shadows, and the fourth row is the clear accuracy evaluation. The accuracy evaluations were OA, UA, PA, and F1, respectively. The red point coordinates are expressed as the parameters with the highest accuracy.
Figure 7: From top to bottom, two representative research areas in the S2ccs dataset. Two images and their cloud and cloud shadow detection classification maps, which are divided into cloud (red), cloud shadow (yellow), cloud and cloud shadow (orange), and clear (light blue).
Figure 8: Two representative research areas in CloudSEN12. Two images and their cloud and cloud shadow detection classification maps, which are divided into cloud (red), cloud shadow (yellow), cloud and cloud shadow (orange), and clear (light blue).
Figure 9: Two images of the Ningbo area. The classification map of cloud and cloud shadow detection is divided into cloud and cloud shadow (orange) and clear (light blue). From left to right are the original images, Fmask's classification map, and TSMM's classification map. The red box line indicates that TSMM detects clouds and cloud shadows, while Fmask does not.
Figure A1: From top to bottom, the seven research areas in the S2ccs dataset. Fourteen images and their cloud and cloud shadow detection classification maps, which are divided into cloud (red), cloud shadow (yellow), cloud and cloud shadow (orange), and clear (light blue), are presented. A–G represents the images of the seven regions in Figure 1, A1 and A2 represent the shooting dates, and the other codes likewise represent the time relationship.
Figure A2: Seven representative research areas in CloudSEN12. Fourteen images and their cloud and cloud shadow detection classification maps, which are divided into cloud (red), cloud shadow (yellow), cloud and cloud shadow (orange), and clear (light blue), are presented. H–N represents the images of the seven regions in Figure 2, H1 and H2 represent the shooting dates, and the other codes likewise represent the time relationship.

22 pages, 32270 KiB  
Article
A Cloud Coverage Image Reconstruction Approach for Remote Sensing of Temperature and Vegetation in Amazon Rainforest
by Emili Bezerra, Salomão Mafalda, Ana Beatriz Alvarez, Diego Armando Uman-Flores, William Isaac Perez-Torres and Facundo Palomino-Quispe
Appl. Sci. 2023, 13(23), 12900; https://doi.org/10.3390/app132312900 - 1 Dec 2023
Cited by 2 | Viewed by 1856
Abstract
Remote sensing involves actions to obtain information about an area located on Earth. In the Amazon region, the presence of clouds is a common occurrence, and visualizing important terrestrial information in the image, such as vegetation and temperature, can be difficult. In order to estimate land surface temperature (LST) and the normalized difference vegetation index (NDVI) from satellite images with cloud coverage, an inpainting approach is applied to remove clouds and restore the image in the removed region. This paper proposes the use of the LaMa (large mask inpainting) neural network and the scalable model named Big LaMa for the automatic reconstruction process in satellite images. Experiments are conducted on Landsat-8 satellite images of the Amazon rainforest in the state of Acre, Brazil. To evaluate the architecture's accuracy, the RMSE (root mean squared error), SSIM (structural similarity index) and PSNR (peak signal-to-noise ratio) metrics were used. The LST and NDVI of the reconstructed images were calculated and compared qualitatively and quantitatively, using scatter plots and the chosen metrics, respectively. The experimental results show that the Big LaMa architecture performs more effectively and robustly in restoring images in terms of visual quality, while the LaMa network shows a slight advantage in the measured metrics when addressing medium-sized marked areas. When the NDVI and LST of images reconstructed under real cloud coverage were compared, Big LaMa produced very good visual results. Full article
(This article belongs to the Section Environmental Sciences)
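Two of the data-preparation steps mentioned here are easy to sketch: NDVI from the Landsat-8 red (B4) and near-infrared (B5) bands, and the LaMa-style four-channel input obtained by stacking the masked image with its cloud mask (the x, m, x' stacking shown in Figure 2). The shapes and values below are synthetic, and the LST/SEBAL computation is not included.

```python
# NDVI and four-channel inpainting input (synthetic data).
import numpy as np

def ndvi(red: np.ndarray, nir: np.ndarray) -> np.ndarray:
    """Normalized difference vegetation index from red (B4) and NIR (B5) reflectance."""
    return (nir - red) / (nir + red + 1e-6)

def inpainting_input(image: np.ndarray, cloud_mask: np.ndarray) -> np.ndarray:
    """Stack a (H, W, C) image with a (H, W) binary mask into a (H, W, C+1) tensor x'."""
    masked_image = image * (1.0 - cloud_mask[..., None])   # blank out the cloudy pixels
    return np.concatenate([masked_image, cloud_mask[..., None]], axis=-1)

rng = np.random.default_rng(0)
h, w = 256, 256
red, nir = rng.random((h, w)), rng.random((h, w))
image = rng.random((h, w, 3))                               # stand-in for a stacked B4/B5/B10 patch
cloud_mask = (rng.random((h, w)) > 0.9).astype(np.float32)  # synthetic cloud mask
print(ndvi(red, nir).shape, inpainting_input(image, cloud_mask).shape)  # (256, 256) (256, 256, 4)
```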
Figures
Figure 1: Application example of the inpainting approach. (a) Image for restoration; (b) detail of areas (in red) for reconstruction; (c) reconstructed output image.
Figure 2: Image stacking with the image x, the mask m, and the resultant four-channel input tensor x′.
Figure 3: The scheme of the method for large-mask inpainting (LaMa).
Figure 4: Sample of masks generated to represent synthetic clouds.
Figure 5: SEBAL model steps used in the image processing.
Figure 6: Study area.
Figure 7: Study area with Landsat-8 scenes highlighted.
Figure 8: Process of stacking the B4, B5 and B10 bands to generate a single image.
Figure 9: Deskew applied to generate the resulting image.
Figure 10: Cropping applied in the images to generate patches.
Figure 11: The ground truth image and its respective LST and NDVI.
Figure 12: Reconstruction of satellite images contaminated by synthetic clouds using the LaMa network. Each row shows images corresponding to a specific scenario and each column shows images with synthetic clouds, the reconstructed image and its LST and NDVI, respectively.
Figure 13: Reconstruction of satellite images contaminated by synthetic clouds with the Big LaMa model. Each row shows images corresponding to a specific scenario and each column shows images with synthetic clouds, the reconstructed image and its LST and NDVI, respectively.
Figure 14: LaMa network scatter plots. The black straight line represents the original image, while the red line represents the analysis of the reconstructed image versus the original image for three different scenarios. Each row corresponds to a specific scenario and each column corresponds to the reconstructed image and its LST and NDVI, respectively.
Figure 15: Scatter plots of the image reconstruction using the Big LaMa model. The black straight line represents the original image, while the red line represents the analysis of the reconstructed image versus the original image for three different scenarios. Each row corresponds to a specific scenario and each column corresponds to the reconstructed image, its LST and NDVI, respectively.
Figure 16: Reconstruction of satellite images with cloud coverage, using LaMa and Big LaMa.
Figure 17: The original image with clouds and its respective LST and NDVI.

16 pages, 742 KiB  
Article
REKP: Refined External Knowledge into Prompt-Tuning for Few-Shot Text Classification
by Yuzhuo Dang, Weijie Chen, Xin Zhang and Honghui Chen
Mathematics 2023, 11(23), 4780; https://doi.org/10.3390/math11234780 - 27 Nov 2023
Cited by 1 | Viewed by 1137
Abstract
Text classification is a machine learning technique employed to assign a given text to predefined categories, facilitating the automatic analysis and processing of textual data. However, an important problem is that the number of new text categories is growing faster than the amount of human-annotated data, leaving many new categories of text with little annotation data. As a result, conventional deep neural networks are forced to over-fit, which hampers real-world application. As a solution to this problem, academics recommend addressing data scarcity through few-shot learning. One of the efficient methods is prompt-tuning, which transforms the input text into a mask prediction problem featuring [MASK]. By utilizing descriptors, the model maps output words to labels, enabling accurate prediction. Nevertheless, previous prompt-based adaptation approaches often relied on manually produced verbalizers or a single label to represent the entire label vocabulary, which makes the mapping granularity low and results in words not being accurately mapped to their labels. To address these issues, we propose to enhance the verbalizer and construct the refined external knowledge into a prompt-tuning (REKP) model. We employ external knowledge bases to increase the mapping space of tagged terms and design three refinement methods to remove noisy data. We conduct comprehensive experiments on four benchmark datasets, namely AG's News, Yahoo, IMDB, and Amazon. The results demonstrate that REKP can outperform the state-of-the-art baselines in terms of Micro-F1 on knowledge-enhanced text classification. In addition, we conduct an ablation study to ascertain the functionality of each module in our model, revealing that the refinement module significantly contributes to enhancing classification accuracy. Full article
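The verbalizer step can be sketched as follows: given the language model's probability distribution at the [MASK] position, each class receives a weighted aggregate over its (refined) label words, and the highest-scoring class is the prediction. The vocabulary size, token ids, and weights below are toy values, not the paper's.

```python
# Weighted verbalizer aggregation over expanded label words (toy values).
import numpy as np

def verbalizer_scores(mask_probs: np.ndarray, label_words: dict, word_weights: dict) -> dict:
    """mask_probs: (vocab_size,) probabilities predicted for the [MASK] token."""
    scores = {}
    for label, token_ids in label_words.items():
        w = word_weights[label]
        w = w / w.sum()                                    # importance-refined weights
        scores[label] = float(np.dot(w, mask_probs[token_ids]))
    return scores

vocab_size = 30522                                         # e.g. a BERT-sized vocabulary
mask_probs = np.random.dirichlet(np.ones(vocab_size))      # stand-in [MASK] distribution
label_words = {"sports": [2998, 4715, 6001], "business": [3171, 5661]}  # toy token ids
word_weights = {k: np.random.rand(len(v)) for k, v in label_words.items()}

scores = verbalizer_scores(mask_probs, label_words, word_weights)
print(max(scores, key=scores.get), scores)                 # predicted class and class scores
```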
Figures
Figure 1: The framework of REKP. First, the whole label word goes through Label Word Refinement and Correlation Refinement to make its granularity more in line with the target task, and then, through Importance Refinement, the weight value of each label word is calculated. Finally, the verbalizer maps the predictions over label words into labels.
Figure 2: WR process diagram.
Figure 3: CR process diagram.
Figure 4: IR process diagram. (Instance Encoder: a tag word is vectorized through PLMs. Instance-level Attention: pay more attention to tag words related to the label and reduce the influence of noise. The whole process can be described as: tag words and the label are transformed into feature vectors through the Instance Encoder, then the results are input into Instance-level Attention, and the weight of the label words is obtained after the weighted sum.)
Figure 5: The impact of sample quantity on model performance.
Figure 6: The remaining number of label words after WR and CR. (In our code, the number of tag thesaurus of each label will be output after WR and CR, and we add all the tag words to get the number of tag words in WR and CR of this dataset.)

12 pages, 485 KiB  
Article
Habitual Mask Wearing as Part of COVID-19 Control in Japan: An Assessment Using the Self-Report Habit Index
by Tianwen Li, Marie Fujimoto, Katsuma Hayashi, Asami Anzai and Hiroshi Nishiura
Behav. Sci. 2023, 13(11), 951; https://doi.org/10.3390/bs13110951 - 19 Nov 2023
Cited by 4 | Viewed by 3628
Abstract
Although the Japanese government removed mask-wearing requirements in 2023, relatively high rates of mask wearing have continued in Japan. We aimed to assess psychological reasons for and the strength of habitual mask wearing in Japan. An Internet-based cross-sectional survey was conducted with non-random participant recruitment. We explored the frequency of mask usage and investigated psychological reasons for wearing masks. A regression analysis examined the association between psychological reasons and the frequency of mask wearing. The habitual use of masks was assessed in the participant's most frequently visited indoor space and on public transport using the Self-Report Habit Index. A principal component analysis with varimax rotation revealed distinct habitual characteristics. Among the 2640 participants surveyed from 6 to 9 February 2023, only 4.9% reported not wearing masks at all. Conformity to social norms was the most important reason for wearing masks. Participants exhibited a slightly higher degree of habituation towards mask wearing on public transport than in indoor spaces. The mask-wearing rate was higher in females than in males, and no significant difference was identified by age group. Daily mask wearing in indoor spaces was characterized by two traits (automaticity and behavioral frequency). A high mask-wearing frequency has been maintained in Japan during the social reopening transition period. Mask wearing has become part of daily habit, especially on public transport, largely driven by automatic and frequent practice. Full article
(This article belongs to the Special Issue Health Psychology and Behaviors during COVID-19)
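For reference, the Self-Report Habit Index used here can be scored in a few lines, assuming the common 12-item, 7-point form implied by the reported 12 to 84 range: each respondent's habit strength is the sum of the 12 item ratings. The responses below are synthetic.

```python
# SRHI scoring sketch (assumed 12-item, 7-point scale; synthetic responses).
import numpy as np

rng = np.random.default_rng(1)
n_respondents = 2640                                           # sample size reported above
indoor_items = rng.integers(1, 8, size=(n_respondents, 12))    # ratings 1-7 per item
transport_items = rng.integers(1, 8, size=(n_respondents, 12))

srhi_indoor = indoor_items.sum(axis=1)                         # scores fall in [12, 84]
srhi_transport = transport_items.sum(axis=1)
print(float(np.median(srhi_indoor)), float(np.median(srhi_transport)))
```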
Figures
Figure 1: Distribution of Self-Report Habit Index (SRHI) scores. This figure shows the distribution of SRHI scores of different subgroups in indoor space and public transport. The horizontal axis represents the different subgroups, and the vertical axis represents the SRHI score. The SRHI score represents the strength of the habit of mask wearing, ranging from 12 to 84. Higher SRHI scores indicate stronger habits. (A) Distribution of SRHI scores of all participants in indoor space and on public transport; (B,C) distribution of the SRHI scores by sex in indoor space and on public transport; (D,E) distribution of SRHI scores by age group in indoor space and on public transport. The box plot ranges from lower to upper quartiles, and the middle line represents the median value. Whiskers extend to minimum and maximum scores.

19 pages, 8540 KiB  
Article
Bone Metastases Lesion Segmentation on Breast Cancer Bone Scan Images with Negative Sample Training
by Yi-You Chen, Po-Nien Yu, Yung-Chi Lai, Te-Chun Hsieh and Da-Chuan Cheng
Diagnostics 2023, 13(19), 3042; https://doi.org/10.3390/diagnostics13193042 - 25 Sep 2023
Cited by 1 | Viewed by 2511
Abstract
The use of deep learning methods for the automatic detection and quantification of bone metastases in bone scan images holds significant clinical value. A fast and accurate automated system for segmenting bone metastatic lesions can assist clinical physicians in diagnosis. In this study, a small internal dataset comprising 100 breast cancer patients (90 cases of bone metastasis and 10 cases of non-metastasis) and 100 prostate cancer patients (50 cases of bone metastasis and 50 cases of non-metastasis) was used for model training. Initially, all image labels were binary. We used the Otsu thresholding method or negative mining to generate a non-metastasis mask, thereby transforming the image labels into three classes. We adopted Double U-Net as the baseline model and modified its output activation function, changing it to SoftMax to accommodate multi-class segmentation. Several methods were used to enhance model performance, including background pre-processing to remove background information, adding negative samples to improve model precision, and using transfer learning to leverage shared features between the two datasets. The performance was investigated via 10-fold cross-validation and computed at the pixel level. The best model we achieved had a precision of 69.96%, a sensitivity of 63.55%, and an F1-score of 66.60%. Compared to the baseline model, this represents an 8.40% improvement in precision, a 0.56% improvement in sensitivity, and a 4.33% improvement in the F1-score. The developed system has the potential to provide pre-diagnostic reports to support physicians' final decisions and, in combination with bone skeleton segmentation, the calculation of the bone scan index (BSI). Full article
(This article belongs to the Special Issue Artificial Intelligence in Clinical Medical Imaging)
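The label-expansion step can be sketched as follows: Otsu thresholding separates the body region of a bone scan from the background, and subtracting the metastasis ground truth gives a three-class map (background, non-metastasis, metastasis). The synthetic scan and class encoding are illustrative; negative mining and the Double U-Net training are not shown.

```python
# Otsu-based generation of a three-class (BG / NM / M) label map (synthetic scan).
import cv2
import numpy as np

def three_class_mask(scan: np.ndarray, metastasis_gt: np.ndarray) -> np.ndarray:
    """scan: 8-bit grayscale bone scan; metastasis_gt: binary lesion ground truth."""
    _, body = cv2.threshold(scan, 0, 1, cv2.THRESH_BINARY + cv2.THRESH_OTSU)
    labels = np.zeros_like(scan, dtype=np.uint8)          # 0 = background (BG)
    labels[(body == 1) & (metastasis_gt == 0)] = 1        # 1 = non-metastasis (NM)
    labels[metastasis_gt == 1] = 2                        # 2 = metastasis (M)
    return labels

scan = np.zeros((128, 128), dtype=np.uint8)
cv2.rectangle(scan, (40, 10), (90, 120), 90, thickness=-1)   # synthetic "body" uptake
cv2.circle(scan, (65, 60), 8, 220, thickness=-1)             # synthetic metastasis hotspot
metastasis_gt = (scan > 200).astype(np.uint8)                # binary lesion label

labels = three_class_mask(scan, metastasis_gt)
print(np.bincount(labels.ravel(), minlength=3))              # pixel counts per class
```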
Figures
Figure 1: Bone scan images of breast cancer patients. (a) With metastasis; (b) without metastasis.
Figure 2: The schematic of the manually annotated results. (a) Bone scan image; (b) overlay of bone scan image with ground truth; (c) ground truth.
Figure 3: Flowchart of brightness normalization.
Figure 4: The modified architecture diagram of Double U-Net, the baseline network.
Figure 5: Illustration of negative sample productions. Notably, the metastasis hotspots are eliminated (the black holes), if the image has metastasis. (a) Otsu thresholding; (b) negative mining.
Figure 6: The qualitative result of the baseline network. (a) Ground truth; (b) segmentation results (precision: 79.14; sensitivity: 78.22; F1-score: 78.68).
Figure 7: Illustration of applying Otsu thresholding to positive samples to generate NM masks. Three classes are included: BG, NM, and M.
Figure 8: Illustration of applying Otsu thresholding to negative samples to generate NM masks. Three classes are included: BG, NM, and M.
Figure 9: Illustration of applying negative mining to positive samples to generate NM masks. Three classes are included: BG, NM, and M.
Figure 10: Illustration of applying negative mining to negative samples to generate NM masks. Three classes are included: BG, NM, and M.
Figure 11: The qualitative results after transfer learning. (a) Ground truth; (b) segmentation results with dice loss (precision: 79.14, sensitivity: 73.41, F1-score: 76.17); (c) segmentation results with focal Tversky loss (precision: 74.02, sensitivity: 86.24, F1-score: 79.67).
Figure 12: Mis-segmentation of non-metastatic lesions. (a) Bone fracture (head region) (precision: 88.46; sensitivity: 60.97; F1-score: 72.19); (b) motion artifact (head region) (precision: 69.32; sensitivity: 47.84; F1-score: 56.61); (c) injection site (wrist) (precision: 43.55; sensitivity: 70.65; F1-score: 53.88); (d) injection site (elbow) (precision: 82.81; sensitivity: 55.52; F1-score: 66.47); (e) kidney (precision: 51.85; sensitivity: 47.89; F1-score: 49.79); (f) bladder (precision: 47.47; sensitivity: 78.28; F1-score: 59.10).
Figure 13: Artifacts in bone scan images of prostate cancer. (a) Catheter; (b) urinary bag; (c) diaper.

26 pages, 31605 KiB  
Article
An Automatic Method for Rice Mapping Based on Phenological Features with Sentinel-1 Time-Series Images
by Guixiang Tian, Heping Li, Qi Jiang, Baojun Qiao, Ning Li, Zhengwei Guo, Jianhui Zhao and Huijin Yang
Remote Sens. 2023, 15(11), 2785; https://doi.org/10.3390/rs15112785 - 26 May 2023
Cited by 4 | Viewed by 2424
Abstract
Rice is one of the most important staple foods in the world, feeding more than 50% of the global population. However, rice is also a significant emitter of greenhouse gases and plays a role in global climate change. As a result, quickly and accurately mapping rice is crucial for ensuring global food security and mitigating global warming. In this study, we proposed an automated rice mapping method called automated rice mapping using V-shaped phenological features of rice (Auto-RMVPF), based on time-series Sentinel-1A images, which is composed of four main steps. First, the dynamic threshold method automatically extracts abundant rice samples from flooding signals. Second, the second-order difference method automatically extracts the phenological period of rice based on the scattering feature of the rice samples. Then, the key "V" feature of the VH backscatter time series, which rises before and after rice transplanting due to flooding, is used for rice mapping. Finally, the farmland mask is extracted to avoid interference from non-farmland features on the rice map, and a median filter is applied to remove noise from the rice map and obtain the final spatial distribution of rice. The results show that the Auto-RMVPF method not only can automatically obtain abundant rice samples but also can extract the accurate phenological period of rice. At the same time, the accuracy of rice mapping is also satisfactory, with an overall accuracy of more than 95% and an F1 score of over 0.91. The overall accuracy of the Auto-RMVPF method is improved by 2.8–12.2% compared with support vector machine (SVM) with an overall accuracy of 89.9% (25 training samples) and 92.2% (124 training samples), random forest (RF) with an overall accuracy of 82.8% (25 training samples) and 88.3% (124 training samples), and automated rice mapping using synthetic aperture radar flooding signals (ARM-SARFS) with an overall accuracy of 89.9%. Altogether, these experimental results suggest that the Auto-RMVPF method has broad prospects for automatic rice mapping, especially for mountainous regions where ground samples are often not easily accessible. Full article
(This article belongs to the Section Biogeosciences Remote Sensing)
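The "V"-feature extraction can be sketched as below, under stated assumptions: a Savitzky-Golay filter smooths a VH backscatter series, local minima are located, and a minimum that sits several dB below its neighbourhood is taken as the flooding dip around transplanting. The synthetic series, smoothing window, and 3 dB depth test are illustrative, not the paper's exact parameters.

```python
# V-shaped phenological feature on a VH backscatter series (synthetic data).
import numpy as np
from scipy.signal import savgol_filter, argrelmin

dates = np.arange(0, 360, 12)                                 # ~12-day revisit, day of year
rng = np.random.default_rng(0)
vh = -16.0 + 2.0 * np.sin(dates / 60.0)                       # slow background variation (dB)
vh[8:11] -= 6.0                                               # flooding dip around transplanting
vh += rng.normal(0.0, 0.3, vh.size)                           # speckle-like noise

smooth = savgol_filter(vh, window_length=7, polyorder=2)      # SG smoothing
minima = argrelmin(smooth, order=2)[0]                        # candidate bottoms of the "V"

v_bottoms = [i for i in minima
             if smooth[max(i - 3, 0):i + 4].max() - smooth[i] > 3.0]   # keep deep dips only
if v_bottoms:
    print("estimated transplanting around day", int(dates[v_bottoms[0]]))
```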
Figures
Graphical abstract
Figure 1: The location of the study area in Taishan County, Guangdong Province, China. Red points are the locations of each field survey. The purple line represents the route of the ground campaigns. The yellow box represents the verification area, and the pink box represents the study area.
Figure 2: Rice cropping calendar and schedule of acquisitions organized as Sentinel images and ground campaigns. Black circles denote acquired Sentinel-1A images, and red crosses denote a lack of images. Black triangles denote the date of ground campaigns. There are no Sentinel-1A images with the acquisition date of 25 May 2019. The backscattering coefficients of rice for 25 May 2019 were calculated based on the mean value of backscattering coefficients for 13 May and 5 June 2019.
Figure 3: VH backscattering coefficient curve of different land-cover types.
Figure 4: The framework of the proposed Auto-RMVPF method. The rice samples extraction based on the dynamic threshold method. The rice phenological-period extraction based on the second-order method. The rice mapping based on the improved ARM-SARFS method.
Figure 5: Filtering process of VH backscattering curve of rice. (a) SG filter; (b) time filter.
Figure 6: Process diagram of extracting local maximum and local minimum points from the rice-growth curve. (a) The original VH band time series of single rice; (b) results of step 2; (c) results of step 3; (d) results of the final step; the red rectangle represents a growth period of rice (i.e., the growing period of rice).
Figure 7: Schematic diagram of the principle of the median filter.
Figure 8: The rice samples extraction of 20 paddy fields.
Figure 9: Spatial distribution of rice samples extracted by the dynamic threshold method.
Figure 10: Comparison before and after the farmland mask and median filter of Auto-RMVPF. (a) Without the farmland mask and median filter. (b) With the farmland mask; some noise points remain after applying the farmland mask in the red cycle. (c) With the farmland mask and median filter.
Figure 11: The result of rice phenological-period extraction. (a) Statistical results of the number of local minimum points; (b) statistical results of the number of local maximum points. a–c are the local maximum points of the number of local minimum points; d–f are the local maximum points of the number of local maximum points.
Figure 12: VH backscattering coefficients of 20 paddy fields change with time. The black line is the original data. The red line is the data that have been processed by the SG filter and time filter. The red solid square is the extracted transplanting date. The blue solid circle is the extracted harvest date. The black dotted line is the transplanting date obtained by ground survey, and the green dotted line is the harvest date obtained by ground survey.
Figure 13: The results of noise removal in the verification area through different filters.
Figure 14: The results of different median filter window sizes. (a) Window size is 3, unfinished noise removal in the red cycle; (b) window size is 5, paddy fields with better boundary information protection in the red rectangle; (c) window size is 7, paddy field boundary information is not well protected.
Figure 15: Experimental results of the methods.
Figure 16: Results of rice classification in the verification area.
Figure 17: Results of rice classification in the single-rice area by different methods. The red box represents the single-rice area.

22 pages, 6060 KiB  
Article
Effects of Image Size on Deep Learning
by Olivier Rukundo
Electronics 2023, 12(4), 985; https://doi.org/10.3390/electronics12040985 - 16 Feb 2023
Cited by 27 | Viewed by 4038
Abstract
In this work, the best size for late gadolinium enhancement (LGE) magnetic resonance imaging (MRI) images in the training dataset was determined to optimize deep learning training outcomes. Non-extra-pixel and extra-pixel interpolation algorithms were used to determine the new size of the LGE-MRI images. A novel strategy was introduced to handle interpolation masks and remove extra class labels in interpolated ground truth (GT) segmentation masks. The expectation maximization, weighted intensity, a priori information (EWA) algorithm was used for the quantification of myocardial infarction (MI) in automatically segmented LGE-MRI images. An arbitrary threshold, comparison of the sums, and sums of differences were used to estimate the relationship between semi-automatic or manual and fully automated quantification of MI results. This relationship was found to be closer for the bigger LGE-MRI images (55.5% closer to manual results) than for the smaller LGE-MRI images (22.2% closer to manual results). Full article
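The extra-class-label problem described above can be shown directly: nearest-neighbour ("non-extra-pixel") interpolation preserves the label set when a ground-truth mask is resized, whereas bicubic ("extra-pixel") interpolation introduces intermediate values, which a simple rule can snap back to the nearest valid label. The four-class toy mask and the snapping rule are illustrative and not necessarily the authors' Equation (1)-based function.

```python
# Resizing a GT segmentation mask and removing extra class labels (toy example).
import cv2
import numpy as np

valid_labels = np.array([0, 1, 2, 3], dtype=np.float32)       # assumed class labels
mask = np.random.default_rng(0).integers(0, 4, (128, 128)).astype(np.float32)

nn_up = cv2.resize(mask, (256, 256), interpolation=cv2.INTER_NEAREST)   # non-extra pixel
bic_up = cv2.resize(mask, (256, 256), interpolation=cv2.INTER_CUBIC)    # extra pixel

def snap_to_labels(interpolated: np.ndarray, valid: np.ndarray) -> np.ndarray:
    """Replace every pixel value with the closest valid class label."""
    diffs = np.abs(interpolated[..., None] - valid[None, None, :])
    return valid[np.argmin(diffs, axis=-1)]

print("labels after NN:          ", np.unique(nn_up))           # still {0, 1, 2, 3}
print("distinct values after BIC:", np.unique(bic_up).size)     # many spurious in-between values
print("labels after snapping:    ", np.unique(snap_to_labels(bic_up, valid_labels)))
```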
Figures
Figure 1: A schematic representation of two examples of the most commonly used networks/architectures—(A) CNN and (B) multi-stream CNN—for automated medical image analysis. Each block contains relevant layer nodes, while relevant layer connections are generalized by a blue arrow symbol.
Figure 2: U-net architecture. Conv means convolution. ReLU is rectified linear unit. DepthConv is depth concatenation. UpConv means up-convolution or transposed convolution. MaxPool is max pooling.
Figure 3: Histograms: (top left) GT segmentation mask of the size 128 × 128; (top right) NN-based GT segmentation mask of the size 256 × 256; (bottom left) BIC-based GT segmentation mask of the size 256 × 256; (bottom right) LCZ-based GT segmentation mask of the size 256 × 256.
Figure 4: Example showing the bicubic (BIC) interpolated GT segmentation mask after removing extra class labels using the Equation (1)-based function.
Figure 5: Five steps to remove extra class labels in BIC interpolated GT segmentation masks.
Figure 6: (a) S1 and (b) S2 output images of size 256 × 256.
Figure 7: (a) S3 and (b) S4 output images of size 256 × 256.
Figure 8: (a) Input mask of size 128 × 128. (b) S5 output mask of size 256 × 256.
Figure 9: Segmentation results: Region 1.
Figure 10: Segmentation results: Region 2.
Figure 11: Segmentation results: Region 3.
Figure 12: U-net vs. Segnet | Segmentation results | Region 1.
Figure 13: U-net vs. Segnet | Segmentation results | Region 2.
Figure 14: U-net vs. Segnet | Segmentation results | Region 3.
Figure 15: C128 segmented output masks | From top to bottom: dice indices are equal to 0.9953, 0.9945, 0.9873, and 0.9929.
Figure 16: N256 segmented output masks | From top to bottom: dice indices are equal to 0.9961, 0.9963, 0.9909, 0.9925.
Figure 17: B256F segmented output masks | From top to bottom: dice indices are equal to 0.9945, 0.9956, 0.9900, 0.9944.
Figure 18: L256F segmented output masks | From top to bottom: dice indices are equal to 0.9953, 0.9957, 0.9902, 0.9942.
Figure 19: B256U segmented output masks | From top to bottom: dice indices are equal to 0.9718, 0.9554, 0.8868, 0.9130.
Figure 20: L256U segmented output masks | From top to bottom: dice indices equal to 0.9694, 0.9558, 0.8854, 0.9150.
Figure 21: MI quantification results—scar (mL).
Figure 22: MI quantification results—scar (%).
Figure 23: MI quantification results—mo (%).

31 pages, 4577 KiB  
Article
Unsupervised Building Extraction from Multimodal Aerial Data Based on Accurate Vegetation Removal and Image Feature Consistency Constraint
by Yan Meng, Shanxiong Chen, Yuxuan Liu, Li Li, Zemin Zhang, Tao Ke and Xiangyun Hu
Remote Sens. 2022, 14(8), 1912; https://doi.org/10.3390/rs14081912 - 15 Apr 2022
Cited by 9 | Viewed by 2672
Abstract
Accurate building extraction from remotely sensed data is difficult to perform automatically because of the complex environments and the complex shapes, colours and textures of buildings. Supervised deep-learning-based methods offer a possible solution to this problem. However, these methods generally require many high-quality, manually labelled samples to obtain satisfactory test results, and their production is time and labour intensive. For multimodal data with sufficient information, it is therefore desirable to extract buildings accurately in as unsupervised a manner as possible. Combining remote sensing images and LiDAR point clouds for unsupervised building extraction is not a new idea, but existing methods often experience two problems: (1) the accuracy of vegetation detection is often not high, which leads to limited building extraction accuracy, and (2) they lack a proper mechanism to further refine the building masks. We propose two methods to address these problems, combining aerial images and aerial LiDAR point clouds. First, we improve two recently developed vegetation detection methods to generate accurate initial building masks. We then refine the building masks based on the image feature consistency constraint, which can replace inaccurate LiDAR-derived boundaries with accurate image-based boundaries, remove the remaining vegetation points and recover some missing building points. Our methods do not require manual parameter tuning or manual data labelling, but still exhibit a competitive performance compared to 29 methods: our methods achieve accuracies higher than or comparable to 19 state-of-the-art methods (including 8 deep-learning-based methods and 11 unsupervised methods, 9 of which combine remote sensing images and 3D data), and outperform the top 10 methods (4 of which combine remote sensing images and LiDAR data) evaluated using all three test areas of the Vaihingen dataset on the official website of the ISPRS Test Project on Urban Classification and 3D Building Reconstruction in average area quality. These comparative results verify that our unsupervised methods combining multisource data are very effective. Full article
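A minimal sketch of the initial-building-mask step, under stated assumptions: pixels that are elevated in the nDSM but not flagged as vegetation form the initial mask, which morphological closing and opening then clean up. The 2.5 m height threshold and 5 × 5 kernel are assumptions, and the image-feature-consistency refinement with GS/SLIC/ERS segmentations is not shown.

```python
# Initial building mask from an nDSM and a vegetation mask (assumed parameters).
import cv2
import numpy as np

def initial_building_mask(ndsm: np.ndarray, vegetation: np.ndarray,
                          min_height: float = 2.5) -> np.ndarray:
    """ndsm: height above ground in metres; vegetation: boolean vegetation mask."""
    mask = ((ndsm > min_height) & ~vegetation).astype(np.uint8)
    kernel = np.ones((5, 5), np.uint8)
    mask = cv2.morphologyEx(mask, cv2.MORPH_CLOSE, kernel)   # fill small gaps
    mask = cv2.morphologyEx(mask, cv2.MORPH_OPEN, kernel)    # drop isolated specks
    return mask.astype(bool)

rng = np.random.default_rng(0)
ndsm = rng.random((256, 256)) * 10.0                         # synthetic heights, 0-10 m
vegetation = rng.random((256, 256)) > 0.7                    # synthetic vegetation mask
print(float(initial_building_mask(ndsm, vegetation).mean())) # fraction flagged as building
```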
Show Figures

Figure 1
Illustration of the generation of the initial building mask. (a) The first image of the Vaihingen dataset; (b) vegetation detection result of the proposed iSH+eSH method; (c) the nDSM processed by the moment-preserving method [73]; (d) initial building mask (the nDSM where the vegetation recognised by the iSH+eSH method has been removed). The detected vegetation regions in (b) and building regions in (d), and the non-ground points of the nDSM in (c) are marked in white.
Figure 2">
Figure 2
Workflow of our building-mask refinement method based on the image feature consistency constraint. Seg denotes the image segmentation obtained by a non-semantic image segmentation algorithm, such as the GS [80], SLIC [81] and ERS [82] algorithms used by us; Res denotes a region matching result; and RM stands for region matching.
Figure 3
Illustration of preprocessing before the region matching of our building-mask refinement method. (a): An enlarged initial building mask, which is part of Figure 1d. (b): (a) after being processed by the first morphological closing operation. (c): (a) after being processed by both the first morphological closing and the first morphological opening operations.
Figure 4
Detailed workflow for the region matching of our method.
Figure 5
Illustration of postprocessing after the region matching of our building mask refinement method. (a): An enlarged building mask after our region matching method, which has the same spatial range as Figure 3a. (b): (a) after being processed by the second morphological opening operation. (c): (a) after being processed by both the second morphological opening and the second closing operation.
Figure 6
Illustration of our building mask refinement method on real data. (a,e): Region-matching result of the GS algorithm. (b,f): Region-matching result of the SLIC algorithm. (c,g): Region-matching result of the ERS algorithm. (d,h): Union of the above three region-matching results that has undergone postprocessing. In (e–h), the region-matching results are overlaid with the ground truth, with yellow regions denoting correct detection, red regions denoting false detection (overdetection), and green regions denoting missing detection (underdetection).
Figure 7
Building extraction results on the first test area. (a) Orthorectified image; (b) ground truth (white regions denote the buildings); (c) nDSM; (d–h): result of DeepLabv3+, U-Net, the HOA method, the beSH+IFCC method, and the biSH+IFCC method.
Figure 8
The building extraction results on the second test area. (a) Orthorectified image; (b) ground truth (white regions denote the buildings); (c) nDSM; (d–h): result of DeepLabv3+, U-Net, the HOA method, the beSH+IFCC method and the biSH+IFCC method.
Figure 9
The building extraction results on the third test area. (a) Orthorectified image; (b) ground truth (white regions denote the buildings); (c) nDSM; (d–h): result of DeepLabv3+, U-Net, the HOA method, the beSH+IFCC method and the biSH+IFCC method.
Figure 10
Rectangle-marked comparison of the biSH+IFCC method with DeepLabv3+ (a,d), U-Net (b,e) and the HOA method (c,f) on the first test area. In (a–c), the rectangles are drawn on the results of the compared methods, while in (d–f), the rectangles are drawn on the result of the biSH+IFCC method.
Figure 11
Rectangle-marked comparison of the biSH+IFCC method with DeepLabv3+ (a,d); U-Net (b,e); and the HOA method (c,f) on the second test area. In (a–c), the rectangles are drawn on the results of the compared methods, while in (d–f), the rectangles are drawn on the result of the biSH+IFCC method.
Figure 12
Rectangle-marked comparison of the biSH+IFCC method with DeepLabv3+ (a,d); U-Net (b,e); and the HOA method (c,f) on the third test area. In (a–c), the rectangles are drawn on the results of the compared methods, while in (d–f), the rectangles are drawn on the result of the biSH+IFCC method.
Figure 13
Illustration of some enlarged unfavourable results of our biSH+IFCC method. (a) Remote sensing image; (b) nDSM; (c) initial building mask (the nDSM where the vegetation recognised by the iSH+eSH method has been removed); (d) the final building mask.
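">
As a rough illustration of the pipeline summarized in this listing (remove detected vegetation from the nDSM-derived above-ground mask, then smooth the mask with morphological closing and opening around region matching), here is a minimal NumPy/scikit-image sketch. The array names and the structuring-element size are illustrative assumptions, not taken from the authors' code.

```python
import numpy as np
from skimage.morphology import binary_closing, binary_opening, disk

def initial_building_mask(above_ground: np.ndarray, vegetation: np.ndarray) -> np.ndarray:
    """Above-ground (nDSM) pixels that were not detected as vegetation."""
    return above_ground.astype(bool) & ~vegetation.astype(bool)

def smooth_mask(mask: np.ndarray, radius: int = 3) -> np.ndarray:
    """Morphological closing followed by opening with a disk-shaped structuring
    element (the radius here is a guess, not the paper's setting)."""
    footprint = disk(radius)
    return binary_opening(binary_closing(mask, footprint), footprint)

# Toy example with random masks standing in for real nDSM/vegetation rasters.
rng = np.random.default_rng(0)
above_ground = rng.random((128, 128)) > 0.5
vegetation = rng.random((128, 128)) > 0.7
refined = smooth_mask(initial_building_mask(above_ground, vegetation))
print(refined.shape, refined.dtype)
```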
21 pages, 47782 KiB  
Article
AFD-StackGAN: Automatic Mask Generation Network for Face De-Occlusion Using StackGAN
by Abdul Jabbar, Xi Li, Muhammad Assam, Javed Ali Khan, Marwa Obayya, Mimouna Abdullah Alkhonaini, Fahd N. Al-Wesabi and Muhammad Assad
Sensors 2022, 22(5), 1747; https://doi.org/10.3390/s22051747 - 23 Feb 2022
Cited by 8 | Viewed by 3167
Abstract
To address the problem of automatically detecting and removing the mask without user interaction, we present a GAN-based automatic approach for face de-occlusion, called Automatic Mask Generation Network for Face De-occlusion Using Stacked Generative Adversarial Networks (AFD-StackGAN). In this approach, we decompose the problem into two primary stages (i.e., Stage-I Network and Stage-II Network) and employ a separate GAN in each stage. Stage-I Network (Binary Mask Generation Network) automatically creates a binary mask for the masked region in the input images (occluded images). Then, Stage-II Network (Face De-occlusion Network) removes the mask object and synthesizes the damaged region with fine details while retaining the restored face’s appearance and structural consistency. Furthermore, we create a paired synthetic face-occluded dataset using the publicly available CelebA face images to train the proposed model. AFD-StackGAN is evaluated using real-world test images gathered from the Internet. Our extensive experimental results confirm the robustness and efficiency of the proposed model in removing complex mask objects from facial images compared to previous image manipulation approaches. Additionally, we provide ablation studies comparing performance between the user-defined and auto-defined masks and demonstrate the benefits of refiner networks in the generation process. Full article
(This article belongs to the Special Issue Big Data Analytics in Internet of Things Environment)
Show Figures

Figure 1
The proposed AFD-StackGAN results on real-world images.
Figure 2
The architecture of the automatic mask removal network for face de-occlusion. It consists of Stage-I Network that generates a binary mask and Stage-II Network that removes the mask object from input facial images.
Figure 3
Some images of our synthetic dataset.
Figure 4
The results of Stage-I Network on real-world images.
Figure 5
The results of AFD-StackGAN (Stage-I Network + Stage-II Network) on real-world images.
Figure 6
Visual assessment of the proposed AFD-StackGAN with the baseline models on real-world images.
Figure 7
AFD-StackGAN performance for real face images with occlusion masks that have very different structures and locations in the face images than the occlusion masks used in the synthetic dataset. The first row shows occluded input facial images, and the second row shows de-occluded output face images.
Figure 8
Visual comparison of the automatic mask removal network (auto-generated mask) with FD-StackGAN (user-defined mask).
Figure 9
Results of the image refiner network on real-world images, which further improves the results by rectifying what is missing or wrong in the mask base network results.
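">
As a hedged sketch of the two-stage decomposition described in this listing (Stage-I predicts a binary mask from the occluded face; Stage-II inpaints the masked region conditioned on that mask), the toy PyTorch modules below show only the data flow. Every layer size is a placeholder; the actual networks, losses and refiner stages in the paper are far larger.

```python
import torch
import torch.nn as nn

class StageI(nn.Module):
    """Toy stand-in for the Stage-I binary mask generation network."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(3, 32, 3, padding=1), nn.ReLU(),
            nn.Conv2d(32, 1, 3, padding=1), nn.Sigmoid(),
        )

    def forward(self, occluded):
        return self.net(occluded)  # predicted mask in [0, 1]

class StageII(nn.Module):
    """Toy stand-in for the Stage-II face de-occlusion network."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(4, 32, 3, padding=1), nn.ReLU(),
            nn.Conv2d(32, 3, 3, padding=1), nn.Tanh(),
        )

    def forward(self, occluded, mask):
        # Condition the inpainting on both the occluded image and the mask.
        return self.net(torch.cat([occluded, mask], dim=1))

occluded = torch.randn(1, 3, 128, 128)                 # placeholder occluded face
mask = StageI()(occluded)                              # Stage-I: binary mask
restored = StageII()(occluded, (mask > 0.5).float())   # Stage-II: de-occluded face
print(restored.shape)                                  # torch.Size([1, 3, 128, 128])
```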
11 pages, 9315 KiB  
Article
Throwaway Shadows Using Parallel Encoders Generative Adversarial Network
by Kamran Javed, Nizam Ud Din, Ghulam Hussain and Tahir Farooq
Appl. Sci. 2022, 12(2), 824; https://doi.org/10.3390/app12020824 - 14 Jan 2022
Cited by 3 | Viewed by 2232
Abstract
Face photographs taken on a bright sunny day or in floodlight contain unnecessary shadows of objects on the face. Most previous works deal with removing shadows from scene images and struggle to do so for facial images. Faces have a complex semantic structure, which makes shadow removal challenging. The aim of this research is to remove the shadow of an object in facial images. We propose a novel generative adversarial network (GAN) based image-to-image translation approach for shadow removal in face images. The first stage of our model automatically produces a binary segmentation mask for the shadow region. Then, the second stage, which is a GAN-based network, removes the object shadow and synthesizes the affected region. The generator network of our GAN has two parallel encoders: one is a standard convolution path and the other is a partial convolution path. We find that this combination in the generator results not only in learning an incorporated semantic structure but also in disentangling visual discrepancy problems under the shadow area. In addition to the GAN loss, we exploit a low-level L1 loss, a structural-level SSIM loss and a perceptual loss from a pre-trained loss network for better texture and perceptual quality. Since there is no paired dataset for the shadow removal problem, we created a synthetic shadow dataset for training our network in a supervised manner. The proposed approach effectively removes shadows from real and synthetic test samples, while retaining complex facial semantics. Experimental evaluations consistently show the advantages of the proposed method over several representative state-of-the-art approaches. Full article
(This article belongs to the Section Computing and Artificial Intelligence)
Show Figures

Figure 1
Proposed Network Architecture for Shadow Removal.
Figure 2
Visual comparison of shadow removal. (a) Input image, (b) EdgeConnect [10], (c) Partial Convolution [14], (d) Gated Convolution [15], (e) Ghost-free Shadow removal [6], (f) Ours, (g) Ground truth. Note: There is no ground truth for the first couple of rows since these samples are real world shadow images collected from the Internet. The last two samples are from our synthetic database.
Figure 3
Additional qualitative results of our model for complex and large size shadow samples in our synthetic database.
Figure 4
Shadow removal results of our proposed method on the scene images from ISTD dataset [4].
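">
The loss design summarized in this listing combines several terms additively. The minimal PyTorch sketch below shows that pattern with just the L1 and adversarial terms; the SSIM and perceptual terms would be added in the same way, and the weights shown are assumptions rather than the paper's values.

```python
import torch
import torch.nn.functional as F

def generator_loss(pred, target, disc_fake_logits, w_l1=1.0, w_adv=0.01):
    """Weighted sum of a pixel-level L1 term and an adversarial term.
    The paper also adds SSIM and VGG-based perceptual terms in the same
    additive way; the weights here are illustrative, not the paper's."""
    l1 = F.l1_loss(pred, target)
    adv = F.binary_cross_entropy_with_logits(
        disc_fake_logits, torch.ones_like(disc_fake_logits))
    return w_l1 * l1 + w_adv * adv

pred = torch.rand(2, 3, 64, 64)      # generator output
target = torch.rand(2, 3, 64, 64)    # shadow-free ground truth
fake_logits = torch.randn(2, 1)      # discriminator logits for the fakes
print(generator_loss(pred, target, fake_logits).item())
```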
19 pages, 10525 KiB  
Article
Moving Car Recognition and Removal for 3D Urban Modelling Using Oblique Images
by Chong Yang, Fan Zhang, Yunlong Gao, Zhu Mao, Liang Li and Xianfeng Huang
Remote Sens. 2021, 13(17), 3458; https://doi.org/10.3390/rs13173458 - 31 Aug 2021
Cited by 12 | Viewed by 4150
Abstract
With the progress of photogrammetry and computer vision technology, three-dimensional (3D) reconstruction using aerial oblique images has been widely applied in urban modelling and smart city applications. However, state-of-the-art image-based automatic 3D reconstruction methods cannot effectively handle the unavoidable geometric deformation and incorrect texture mapping problems caused by moving cars in a city. This paper proposes a method to address this situation and prevent the influence of moving cars on 3D modelling by recognizing moving cars and combining the recognition results with a photogrammetric 3D modelling procedure. Through car detection using a deep learning method and multiview geometry constraints, we can analyse the state of a car’s movement and apply a proper preprocessing method to the geometric model generation and texture mapping steps of 3D reconstruction pipelines. First, we apply the traditional Mask R-CNN object detection method to detect cars from oblique images. Then, a detected car and its corresponding image patches, computed via geometry constraints in the other view images, are used to identify the moving state of the car. Finally, the geometry and texture information corresponding to the moving car is processed according to its moving state. Experiments on three different urban datasets demonstrate that the proposed method is effective in recognizing and removing moving cars and can repair the geometric deformation and erroneous texture mapping problems caused by moving cars. In addition, the methods proposed in this paper can be applied to eliminate other moving objects in 3D modelling applications. Full article
(This article belongs to the Special Issue Urban Multi-Category Object Detection Using Aerial Images)
Show Figures

Figure 1
The influence of moving cars on the 3D reconstruction results, in which (a) displays the mesh deformation and (b) displays the texture distortion.
Figure 2
A comparison of the results before and after the optimization of the moving car regions ((a) was obtained using the traditional reconstruction method, and (b) was obtained with our method).
Figure 3
Pipelines of the multiview 3D reconstruction method that integrates moving car recognition and removal.
Figure 4
A schematic diagram of car information recognition based on the Mask R-CNN model.
Figure 5
The recognition results of car information.
Figure 6
Classification diagrams of different car states, in which (a) illustrates cars in the moving state, (b) and (c) illustrate cars that are temporarily stationary, and (d) illustrates stationary cars.
Figure 7
CMS identification using multiview images.
Figure 8
Loss values of model training.
Figure 9
Car information detection results under different shooting conditions, in which (a) is the detection result under sunny and ortho-shooting conditions, (b) is the detection result under cloudy and ortho-shooting conditions, and (c) shows the detection results under sunny and oblique-shooting conditions.
Figure 10
The car state recognition diagram of the viaduct scene, where (a) is the original image, (b) is the geometric mesh of the scene, (c) shows the different textures in the multiview images corresponding to the red frame area in (a), (d) shows the detection results of car information in (a), and (e) shows the recognition results of the car states in (d).
Figure 11
The car state recognition diagram of a public parking lot, in which (a) is the original image, (b) is the geometric mesh of the scene, (c) shows the state recognition results of the cars detected in (a), and (d) shows the geometric mesh cleaning results for the car areas where a temporary stay occurred.
Figure 12
The car state recognition diagram of a traffic light intersection area, in which (a) is the original image, (b) shows the detection results of the car information in (a), (c) is the geometric mesh of the scene, and (d) shows the state recognition results of the cars detected in (b).
Figure 13
A comparison of the moving car removal results in urban viaduct scenes, where (a) is the scene model constructed using the traditional method and (b) is the 3D reconstruction result using the method in this article.
Figure 14
A comparison of the removal results of moving cars in the parking lot area, where (a) is the 3D model constructed using the traditional method and (b) is the 3D reconstruction result after the moving cars are removed using the method in this paper.
Figure 15
A comparison of the removal results of moving cars in the intersection area, where (a) is the scene model constructed using the traditional method and (b) is the 3D reconstruction result using the method in this article.
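">
To make the detection step of this pipeline concrete, the sketch below runs an off-the-shelf Mask R-CNN from torchvision and keeps only detections labelled as cars. It is a stand-in for the paper's detector, which is trained on the authors' own data; the label index, score threshold and input tile are assumptions.

```python
import torch
import torchvision

# Off-the-shelf Mask R-CNN as a stand-in for the paper's detector; the authors
# train on their own oblique-image data, which is skipped here.
# Assumes torchvision >= 0.13 for the `weights` argument.
model = torchvision.models.detection.maskrcnn_resnet50_fpn(weights="DEFAULT")
model.eval()

CAR_LABEL = 3  # index of 'car' in torchvision's COCO label map (assumption)

@torch.no_grad()
def detect_cars(image: torch.Tensor, score_threshold: float = 0.7):
    """Return bounding boxes and soft instance masks for car detections."""
    output = model([image])[0]
    keep = (output["labels"] == CAR_LABEL) & (output["scores"] > score_threshold)
    return output["boxes"][keep], output["masks"][keep]

image = torch.rand(3, 512, 512)  # placeholder oblique-image tile, values in [0, 1]
boxes, masks = detect_cars(image)
print(boxes.shape, masks.shape)
```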