Abstract
The containment liner plate (CLP) is a thin layer of carbon steel material applied as a base for concrete structures protecting nuclear material. The structural health monitoring of the CLP is critical to ensure the safety of nuclear power plants. Hidden defects in the CLP can be identified utilizing ultrasonic tomographic imaging techniques such as the reconstruction algorithm for the probabilistic inspection of damage (RAPID) methodology. However, Lamb waves have a multimodal dispersion feature, which makes the selection of a single mode more difficult. Thus, sensitivity analysis was utilized since it allows for the determination of each mode's level of sensitivity as a function of frequency; the S0 mode was chosen after examining the sensitivity. Even though proper Lamb wave mode was selected, the tomographic image had blurred zones. Blurring reduces the precision of an ultrasonic image and makes it more difficult to distinguish the dimensions of the flaw. To enhance the tomographic image of the CLP, deep learning architecture such as U-Net was utilized for the segmentation of the experimental ultrasonic tomographic image, which includes an encoder and decoder part for better visualization of the tomographic image. Nevertheless, collecting enough ultrasonic images to train the U-Net model was not economically feasible, and only a small number of the CLP specimens can be tested. Thus, it was necessary to utilize transfer learning and get the values of the parameters from a pre-trained model with a much larger dataset as a starting point for a new task, rather than training a new model from scratch. Through these deep learning approaches, we were able to eliminate the blurred section of the ultrasonic tomography, leading to images with clear edges of defects and no blurred zones.
Graphical abstract
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Structural health monitoring of nuclear power plants is crucial for ensuring public and environmental safety. Almost all nuclear power plants are covered by concrete with a carbon steel plate base layer with a thickness of 6 mm, which is called CLP. Corrosion affects the CLP during its operation. Inspection and maintenance are critical to prevent wall thinning in the corroded area of the CLP. Otherwise, there is the possibility of diminishing the CLP’s protection capability, resulting in a reduction in safety and operational lifespan. Corrosion defects appear between the CLP and the thick concrete layer, and visual inspection of the CLP is physically not possible. Due to the large dimensions and operating environment of nuclear power plants, an ultrasonic guided wave-based inspection of the CLP is the most suitable technique among all other nondestructive methods (Basu et al. 2021). A Lamb wave is a type of guided wave that can propagate for a long distance as well as its sensitivity to various types of defects. The Lamb wave can be used for the inspection of plate corrosion, flaws, debonding, and other kinds of defects (Zhang et al. 2023; Malikov et al. 2023). In addition, Lamb waves can propagate along curved structures as well, making them useful for monitoring the structural health of curved structures. Using the waveform propagation pattern, an ultrasonic image of the hidden defects can be reconstructed using a set of ultrasound probes arranged around the specimen. Ultrasonic tomography of CLP is the most effective method of inspecting and detecting localized defects. The main limitation of ultrasonic testing is that sufficient experimental skills are required for the ultrasonic NDT technician, because of the nonlinearities introduced due to the instrument and variation in the sensitivity of the ultrasonic elements, etc. These nonlinear effects significantly reduce the accuracy of the ultrasonic tomographic images, thus increasing the risk of missing defects.
Most of the significant improvements in ultrasonic tomographic detection have occurred in the industrial and medical fields in recent years (Malikov et al. 2021). For instance, the application of the RAPID algorithm with variable shape functions for detecting multiple defects in space (Martucci et al. 2021). Variable shape factors to improve ultrasonic tomography's reconstruction of images and microdamage in space (Basu et al. 2021). The Westervelt equation is employed to scan the surface structure using pulse-echo scanning (Bao et al. 2022). A correlation between the shape function and the skew parameter of the ultrasound beam has been applied to improve tomographic imaging (Duan and Gan 2019). Furthermore, fast ultrasonic tomography has been employed for underwater samples, including non-contact ultrasonic samples (Hay et al. 2006; Zhang et al. 2021).
The increasing use of deep learning in computer vision applications is due to the advancements in deep learning architecture, including the popularly used U-Net (Siddique et al. 2021; Malikov et al. 2022). U-Net is mainly adopted within medical imaging and specifically created for image segmentation (Antonioli et al. 2021). The encoder and decoder architecture are used in the U-Net model and image segmentation allows automatically detecting the particular objects in the image (Wu et al. 2021). As previously stated, U-Net is largely employed in medical image analysis, not only for image segmentation but also for computed tomography image reconstruction (De Carvalho et al. 2022). In the area of nondestructive testing (NDT), the U-Net was also utilized by using image segmentation to extract the defect from X-ray images (Liu and Kim 2022), to locate the defect with step-heating thermography for composite laminates (Pedrayes et al. 2022), to reconstruct ultrasonic images for curved parts and ultrasound reflection tomography images (Almansouri et al. 2019; Mei et al. 2021). Moreover, the U-Net architecture is famous for the application of the ready frameworks model as an encoder and a decoder(Costa et al. 2021). Particularly EfficientNet is a convenient architecture that can be adopted as an encoder of the U-Net architecture and allows it to reach higher accuracy within the limited number of the dataset (Nguon et al. 2022; Sharma et al. 2023).
Meanwhile, transfer learning enables the enhancement of prediction accuracy by transferring information from different models with huge datasets (Mushtaq et al. 2021). Therefore, the purpose of transfer learning is to enhance accuracy while utilizing fewer computational resources (Hussain et al. 2019; Zhuang et al. 2021). In addition, transfer learning has been applied for classification and segmentation in computer vision. In the case of image segmentation, transfer learning allows getting better predictions. The transfer learning technique was implemented in U-Net deep learning architecture for breast tumor cell nuclei segmentation in histopathology images and segmentation of COVID-19 CT images (Le Dinh et al. 2021; Wang et al. 2021). In addition, transfer learning can be applied to a range of applications by using pre-built architectures like EfficientNet as the encoder in U-Net models (Wang et al. 2021). The application of transfer learning for segmentation purposes has extended beyond medical images and has been utilized for inspecting materials and detecting flaws in ultrasonic images (Medak et al. 2022), applying EfficientNet as an encoder of U-Net (Wu et al. 2019). The most popular database utilized to train models and extract the pre-trained parameters of these models is ImageNet (Jia Deng et al. 2009). ImageNet is a large-scale image database widely used for transfer learning in deep learning applications. In the context of U-Net architecture, ImageNet has been used to pre-train the encoder part of the U-Net model, resulting in improved segmentation performance for various types of images.
The objective of this research is to improve ultrasonic imaging by eliminating blurred zones in the reconstructed image by the RAPID algorithm. The application of the ultrasonic reconstruction algorithm leads to images with blurred which might cause misclassification presence of defects in structures and makes it difficult to define the real dimension of the defects. In this research, the most sensitive wave mode was chosen based on an analysis of the sensitivity of the Lamb wave to wall thinning. Moreover, the image was processed by a deep learning algorithm such as U-Net. Applied U-Net architecture was simplified by using a ready model such as EfficientNet as an encoder. Additionally, the weight of the U-Net architecture was initiated by using transfer learning techniques based on the ImageNet database, which allows for fast and accurate training of the U-Net model to high accuracy even though a limited number of data is present. Through the application of the selected methods, blurred zones of the ultrasonic tomography images were eliminated, and the boundaries of defective zones were clearly defined.
2 Materials and methods
2.1 Lamb waves
The Lamb wave is a guided wave that can propagate for a long distance, which has many advantages over pointwise bulk waves. In contrast to the Rayleigh surface wave, the Lamb wave can be applied to typical structures with a curved shape. Moreover, the Lamb wave has a dispersive property, and it is calculated according to the material properties of the specimen under inspection. The dispersions property of the Lamb waves depends on the material properties and they vary with direction. A CLP has isotropic properties, which means that its material property is independent of the inspection direction. Isotropic properties of materials simplify the Lamb wave dispersion equilibrium equations. The displacement component of a Lamb wave can be separated into symmetric and antisymmetric modes as it propagates through the structure. The characteristic equation of the symmetric mode can be written as follows (Shi et al. 2020):
Symmetric modes are also known as quasi-extensional modes because they have a dominant wave polarization component along the propagation of the wave. In contrast, antisymmetric modes (or flexural modes) have quasi-flexural behavior, and they can be mathematically defined as follows (Šofer et al. 2021):
The half thickness of the plate is defined by \(h\) and the wave number is assigned as \(k\) in Eqs. (1) and (2), whereas parameters \(D\) and \(J\) define as unitless quantities based on angular velocity (\(\omega\)). In the following equation \(C_{L}\) and \(C_{S}\) longitudinal and shear wave velocities correspondingly.
In the experiments, mock-up CLP specimens without concrete were used, and the main concern was the nondestructive detection of thickness reduction in the CLP. The Lamb wave constitutive equation is based on the assumption of uniform plate thickness and traction-free boundary conditions. This model is frequently employed in the detection of defects since variations in plate thickness or boundary conditions significantly alter wave propagation properties. In the case of CLP, the wall thinning occurs in the delaminated zone between CLP and the concrete layer. An important advantage of the Lamb wave is its ability to facilitate ultrasonic testing of large areas. As mentioned earlier, the CLP specimen was made of carbon steel and material property such as the longitudinal and transverse wave propagation speeds were measured as \(C_{L}\) = 5.9 mm/\(\upmu\)s and \(C_{S}\) = 3.2 mm/\(\upmu\)s respectively. Taking into consideration the material properties of the CLP, the phase velocity curve of the Lamb wave derived, and it is shown in Fig. 1.
Phase velocity represents the velocity of the crest of the wave, and its magnitude is defined as \(C_{P} = \omega /k.\) In the case of isotropic materials such as CLP, the magnitude of phase velocity depends only on the wave number. In Fig. 1, symmetric (or extensional) modes are labeled \(S_{n}\), and anti-symmetric (or flexural) modes are labeled \(A_{n}\).
In Fig. 2, the group velocity dispersion curve for the CLP is represented, which determines the propagation speed of wave packets. The Lamb dispersion curve can be classified as dispersive or non-dispersive. The dispersive region of the Lamb wave mode corresponds to the region where propagation speed varies with frequency. Generally, the region where Lamb wave speed does not depend on frequency is called non-dispersive. In the literature, it is often considered that Lamb waves with a lower frequency range are appropriate since only two modes coexist only. In contrast, a Lamb wave at a higher frequency has a multimodal dispersion property, making excitation more challenging.
2.2 Sensitivity
As mentioned previously, the Lamb wave has multimodal and dispersive properties. As a result, determining proper mode selection is crucial for ultrasonic inspection. Depending on the field of the application the different Lamb modes can be selected since each mode has a different level of sensitivity to a certain kind of defect. The sensitivity of the selected Lamb wave mode to wall thinning is essential to obtain precise ultrasonic tomography image of the CLPs. Also, direct evaluation of each mode's sensitivity from the dispersion curve is relatively difficult. Sensitivity analysis allows us to compare the level of sensitivity of different modes to wall thinning. It is obvious from the dispersion curve that wall thinning will affect wave propagation speed. This is because the product of frequency and thickness will be reduced in case of wall thinning. The variation in the time of flight due to wall thinning can be written as follows (Gao and Rose 2010):
From Eq. 4, the expression of the time of flight shifts \((\Delta t)\) is due to the change of the wave propagation speed \((dC_{g} )\) caused by the wall thinning \((dh)\) of the plate as the wave propagates the distance \(L\). Considering the time of flight variation the sensitivity of the Lamb wave mode can be derived (Cheng et al. 2019):
Considering the dispersion property of the Lamb wave, the sensitivity curve of each mode was derived as a function of the frequency. The sensitivity curve of the Lamb wave modes is nonlinearly proportional to the value of the magnitude of the group velocity and inversely proportional to the amount of wall thinning. Figure 3 shows the sensitivity plots of the Lamb wave modes. As can be observed from the graph, each mode of the Lamb wave has a different level of sensitivity.
From the sensitivity plots in Fig. 3, it is shown that the low-frequency range of the mode S0 mode of the Lamb wave for the CLP does not show very high sensitivity due to the fact that it has nondispersive properties. Nevertheless, the sensitivity of the S0 mode increases rapidly as the frequency increases, and the mode dispersion property becomes more noticeable. The sensitivity of the S0 mode reaches its peak at 2.25 MHz*mm. In terms of sensitivity, the S0 mode was the most appropriate for detecting wall thinning on the CLP because it was the most sensitive among the other modes. The excited waveform has a certain frequency band, so concentrating at a single point of the dispersion curve would not be possible. Considering the frequency band of the excited Lamb wave, the 2.35 MHz*mm mode was used in the experiments. Furthermore, in many similar studies, it was found that waveforms with a low group velocity and dispersive properties were most suitable for evaluating wall thinning (Gao and Rose 2010). Ultrasonic tomographic algorithms were performed to analyze the CLP using the sensitive frequency of the S0 mode. In the next section, we will discuss the ultrasonic tomographic algorithm in more detail.
2.3 Ultrasonic tomography algorithms
The main idea behind the RAPID algorithm is that the waveform parameters such as time of flight, phase, and amplitude vary as the defect is present in the structure. In ultrasonic tomographic imaging, sets of pulser and receiver are implemented. Probabilistic reconstruction algorithms are based on the signal difference coefficient \((SDC_{ij} )\) between the measured waveform and the reference signal (measured from an intact specimen). The SDC value can be determined as follows (Li et al. 2019):
Whenever the specimen is free of defects, the SDC value is zero, while as defect size increases, the SDC value also increases. As we can see in Eq. (6), the SDC value is computed for each set of pulser \(i\) and receiver \(j\). Reference signal from the intact section of the CLP is denoted by \(w_{ij}\), whereas \(m_{ij}\) is waveform from the examining specimen, and the \(\mu\) is the corresponding mean value of the signal. The number \(\begin{gathered} p \hfill \\ \hfill \\ \end{gathered}\) corresponds to the order of an element in the signal vector, which has a length of \(n\). The signal difference coefficients are distributed along the surface in an elliptical shape as shown in Fig. 4. In order to define the ultrasonic tomographic image, the estimated numerical coefficient \(SDC_{ij}\) must be distributed over the plane. This is done by the spatial distribution parameter \((s_{ij} )\) is calculated by:
In Eq. (7), the \(\begin{gathered} R_{ij} \left( {x,y} \right) \end{gathered}\) is a unitless parameter determined by the ratio of the sum of distances among the pulser (with coordinates \(\left( {x_{i} ,y_{i} } \right)\) and receiver \(\left( {x_{j} ,y_{j} } \right)\) and a point with coordinates \(\left( {x,y} \right)\)). This unitless parameter \(\begin{gathered} \left( {R_{ij} \left( {x,y} \right)} \right) \end{gathered}\) can be expressed as (Wang et al. 2020):
Using the calculated \(SDC_{ij}\) parameters, the damage of the sample can be reconstructed by adding local probability values from each pulser-receiver pair. The number of pulser-receiver pairs required depends on the scanning area and the properties of the ultrasonic probe. The \(SDC_{ij}\) is distributed in an elliptical shape over the tomographic image, based on the spatial distribution described in Eq. (7). The dimension of the ellipse is determined by a single coefficient called the shape parameter, which is illustrated graphically in Fig. 4 and expressed numerically in Eq. (9).
According to the literature, there is no predefined method for determining the shape parameter \((\beta )\). It is common for ultrasonic tomography imaging to set the shape parameter at around 1.05 (Zhao et al. 2007). A given value of the shape parameter might cover a wide range of defects because the expression of the shape parameter possesses degrees of freedom. It is also useful to note that the values \(\begin{gathered} R_{ij} \left( {x,y} \right) \end{gathered}\) vary for each pulser and receiver position, whereas the shape parameter is constant and it determines the distribution of the spatial distribution values as expressed in Eq. (7) (Guo et al. 2022).
Damage reconstruction image can be obtained by adding all probability distributions of all pulser receiver sets. With Eq. (10), the reconstruction of tomographic images is possible by summing the probability distribution of the defect from each pulser and receiver. In Eq. (10) the \(SDC_{ij} s_{ij}\) corresponds to the defect probability for a single pulser and receiver pair and \(M\) is the total number of pulser and receiver sets (Hay et al. 2006) so that the reconstructed image pixels are the linear summation of the probabilities.
2.4 Instrumental setup
In the experimental analysis of the ultrasonic inspection of the CLP, the conventional ultrasonic testing setup was applied. The schematic diagram of the ultrasonic instruments can be observed in Fig. 5. The wave signal is excited and received by tone burst (Hay et al. 2006), which enables the setting of several parameters including the center frequency, the number of cycles of the excited waveform, and also the power of the signal. The maximum sampling frequency of the oscilloscope was 40 GHz.
In the ultrasonic inspection of the CLP, a piezoelectric transducer was used to convert electrical energy to mechanical vibration and vice versa. Piezoelectric elements are known for their high-power transformation efficiency and low acoustic impedance, making them suitable for material inspection (Pramanik and Arockiarajan 2019). In the experiment, three sets of ultrasonic piezoelectric probes were utilized: HAGISONICS M501-SB, Technisonic ABS-000408-GP, and Panametrics NDT A413S. Different probe types were used to account for any variations introduced by the ultrasonic probes and to increase the dataset for the training of the deep learning model. The probes were all piezoelectric and exhibited both wideband and narrowband characteristics. The HAGISONICS M501-SB 0.5 MHz and Technisonic ABS-000408-GP were narrowband probes, and the Panamsetrics NDT A413S was a broadband probe. Despite the use of different probe types, the experimental setup remained without changes for all ultrasonic probes. To obtain a maximum resolution of the ultrasonic tomographic reconstruction algorithm, 16 transducer positions were used to scan each of the sections of the CLP, as shown in Fig. 5a. In each scanned zone of the plate, 160 waveforms were obtained from ultrasonic scanning. The excitation frequency was set to 0.39 MHz, which corresponds to the fundamental symmetric mode (S0) of the Lamb wave. The selected Lamb wave phase velocity was determined using the dispersion curve (as shown in Fig. 1), which was found to be 4.3 mm/μs, while its group velocity was 1.9 mm/μs (derived from Fig. 2). An acrylic angle wedge was adopted to excite the selected Lamb wave mode, and wave propagation speed in the wedge was 2.7 mm/μs. Based on Snell’s law, the wedge angle was calculated, and a 40˚ wedge angle was used during all experiments, as shown in Fig. 5b and c.
3 Results
3.1 Experimental results
Experimental investigations were conducted using the instrumental setup shown in Fig. 5. Ultrasonic tomography imaging was performed over a section of the mock-up CLP specimen shown in Fig. 6, which had dimensions of 2500 × 1400 × 6 millimeters. The specimen contained ten defects of different shapes, but because they occur between the concrete and CLP layers, they cannot be visually detected in real applications. Only one side of the CLP plate had defects, while the backside was defect-free. Ultrasonic scanning of the CLP was performed from the backside, which was free of any structural defects. Additionally, the defects cannot be recognized visually in a real inspection of the CLP because they are located between the concrete structure and the CLP. The S0 mode of the Lamb wave at 2.35 MHz*mm was used in the experiment, and details of the Lamb wave mode selection are described in Sect. 2.1, while the details of the instrumental setup are in Sect. 2.3. To obtain good time–frequency resolution, the exciting wave signal was set to five cycles. When an ultrasonic signal has fewer cycles, it will have a wider band of frequencies in the frequency domain, which increases the chance of exciting nearby modes.
The ultrasonic inspection of the CLP specimen was conducted in sections, and defects were identified using the ultrasonic tomographic reconstruction (RAPID) algorithm. The ultrasonic tomographic reconstruction algorithm provides an approximate image of the defects (wall thinning). Waveforms were processed by the method described in Sect. 2.2. For all reconstructed images, the value of the shape parameter was set to 1.05, which is very common for the ultrasonic reconstruction method.
The ultrasonic image of the CLP specimen is shown in Fig. 7, whereas the original image of the CLP specimen is in Fig. 6. The coincidence between the image of the defect and the ultrasonic tomographic image can be obtained by comparing those two figures. As can see in Fig. 7, the ultrasonic tomography image does not clearly illustrate the borders of the defect. This makes it very difficult to determine its size and evaluate its severity. In some fields of application, the size of the defect might have an impact on the decision regarding the condition (whether intact or defected) of the CLP. The defect's shape and dimension become critical since there exists a particular dimension threshold below which the defect is considered as not significant. The blurred part in the ultrasonic tomographic image in Fig. 7 corresponds to an intact area, but the exact shape of the defects cannot be seen from the image due to the gradual change of color. NDT technicians who lack sufficient skill might misclassify blurred zones as defects in the real testing environment. Image processing is necessary to avoid the misdetection of CLP defects. In this case, conventional image processing methods such as binarization may not provide sufficient precision. This is because these blurred zones are caused by nonlinearities regarding the ultrasonic tomographic procedure as well as the experimental instrumentation. Due to the presence of multiple sources of nonlinearity in the system of ultrasonic tomographic imaging methods, the only way to increase accuracy in defect detection is by using artificial intelligence systems. One such method is the U-Net model. In the next section of this chapter, the details of applying the U-Net model for defect segmentation will be described.
3.2 U-Net architecture
Image segmentation can be used to improve ultrasonic tomographic imaging and to eliminate blurred zones caused by nonlinearity in the ultrasonic inspection method. The U-Net architecture is a powerful technique used for image processing in many fields. There is a wide application of deep learning approaches, such as U-Net in ultrasonic tomographic imaging processing (Lyu et al. 2020). In contrast to other conventional fully connected neural networks, U-Net trains faster and achieves higher accuracy, thereby revolutionizing deep learning for image segmentation. Image segmentation allows the filtering of the image pixels by classifying them into certain groups or segments. In the case of CLP, the tomographic image pixels can be classified as intact and defective areas.
The conventional U-Net architecture is composed of an encoder. Usually, in the encoder part, an input image passes through to the convolution layers, which contain a set of filters or kernels for performing a convolution operation. After the convolution layers, batch normalization is usually performed, and then the activation function is computed. Batch normalization is commonly utilized in order to eliminate noise that may occur after the convolution layers and also provides faster and more stable training results. Moreover, the pooling layer is applied in order to downsize the dimension of the feature maps output from the convolutional layer. The U-Net architecture relies on pooling layers to extract significant structural features from an image.
The second half of the U-Net architecture consists of the decoder, which is used to restore and enhance the spatial resolution of the feature maps. The decoder consists of a set of convolutional blocks that perform up-sampling convolution operations. The up-sampling process doubles the dimensions of all the encoder outputs. Additionally, concatenation can be used to combine the feature maps from the encoder to the decoder. Concatenation plays an important role in preserving information from previous layers, which reflects more strongly on overall values and accelerates model convergence.
Overall, the performance of the U-Net architecture is extremely dependent on the values of its hyperparameters. Hyperparameters are variables that are set by the user to tune and optimize the performance of the U-Net architecture. These hyperparameters include the number of encoder and decoder stages, the number and size of filters, the type of activation function and its values, the learning rate, the batch size, and other parameters that strictly affect the performance of the U-Net architecture. However, tuning the hyperparameters of the U-Net architecture becomes challenging when there is a limited amount of training data. Additionally, tuning the hyperparameters is time-consuming and requires significant computational resources.
The process of building a U-Net model from scratch is very time-consuming and requires sufficient data to guarantee high accuracy. However, as a rule of thumb in literature, there are off-the-shelf architectures that balance the accuracy of segmentation, computational efficiency, and model size. There is a wide range of neural network architectures that can be implemented for this purpose, and in this research, we implement the Efficient Net framework as the encoder of the U-Net model. Efficient Net is a cutting-edge neural network architecture that has achieved exceptional performance on various computer vision tasks, utilizing fewer parameters and computation resources than earlier architectures. The main motivation for selecting the EfficientNet is its flexibility and capability to reach higher accuracy with a limited number of epochs or iterations. Along with Efficient Net, there are many kinds of ready neural network architectures. However, in comparison to other architectures, Efficient Net's training requires less computational resources and is faster. Moreover, many research studies have found that EfficientNet is more effective and accurate than others. This has been observed in various fields such as medical (Zhou et al. 2022; Sharma et al. 2023) as well as engineering (Nguon et al. 2022; Jin et al. 2022).
EfficientNet is employed as an encoder module in the proposed U-Net architecture, as depicted in Fig. 8. The EfficientNet structure can be divided into seven blocks depending on the size of the filters, and different values of channels. The core of each encoder block relies on the Mobile Inverter Bottleneck Convolution (MBConv) layers and that layer can be implemented in different values. It can be seen from Table 1 that the number of the MBConv layer in each block is not the same. Table 1 displays the specifications for each block of the EfficientNet model. Each block has different input and output resolution, different input and output channel dimensions, and a different number of MBConv blocks. The input resolution of the input image is decreased as it moves from block 1 to block 7, while the number of output channels is increased. The number of MBConv blocks also varies among the blocks. Blocks 1, 2, and 7 have 4 MBConv blocks each, while block 3 has 7 MBConv blocks. Blocks 4, 5, and 6 have the most MBConv blocks, with 10 each.
The MBConv block can be decomposed into pointwise and depthwise convolutional layers, as well as squeeze and excitation layers. Figure 9a shows the layer of the MBConv block, where the input data pass through the pointwise convolutional layer, which expands the channels. Then, the channels are reduced in the depthwise convolutional layer. The squeeze and excitation layer focuses on the most important features of the channel. The pointwise convolutional layer is used to reduce the dimensionality of the feature space (Nayak et al. 2022).
The U-Net decoder is composed of multiple stages and utilizes a concentration connection mechanism (as shown in Fig. 8) to transfer features from the encoder to the decoder, which helps to improve the accuracy of the segmentation process. The concentration connection allows the decoder to access features from the corresponding level of the encoder. In other words, this connection enables the decoder to concentrate on and extract relevant information from the encoder's output, which it then uses to generate its own output. The proposed U-Net decoder consists of five stages, and at each stage, up-sampling, convolution 2D, batch normalization, and ReLU operations are performed to refine the feature maps (Fig. 9b).
3.3 Training dataset
Building a U-Net architecture from scratch is very time-consuming and requires sufficient data to guarantee high accuracy. When there is a limited amount of training data, these tasks become even more challenging. Therefore, pre-trained neural networks are commonly used when there is a limited amount of data available as a starting point for a new task, rather than initializing the parameters to be trained from scratch. Moreover, a pre-trained model offers several advantages: The parameters are already optimally tuned based on the different datasets, and the number of layers and hyperparameters does not need to be modified. It is only required to update the weights of a pre-trained model for this particular application. In the literature, this approach is called transfer learning. Transfer learning was employed in the proposed U-Net architecture with the use of an ImageNet dataset. The ImageNet database is a widely used large-scale dataset that comprises over 14 million images, annotated with over 20,000 categories. Additionally, taking advantage of pre-trained models on large-scale datasets such as ImageNet, highly informative features can be extracted from images that can be effectively employed for various downstream tasks.
In the given case, the number of ultrasonic measurements has limited value because most ultrasonic scanning has confidentiality issues, and laboratory-made specimens are exceedingly costly. In this experiment, we employed three sets of ultrasonic piezoelectric probes used equally to scan the CLP specimen and 60 images of defects were captured. The details of the ultrasonic probes were described in Sect. 2.3. It should be noted that the CLP specimen which is shown in Fig. 6 had 10 different types of defects. Overall, 70% (42 out of 60) of the ultrasonic tomographic images of the CLP were used to train the deep learning model, and 30% (18 out of 60) of images were used to test the U-Net model. Due to the limited number of specimens, 70% of the dataset was used to train the U-net architecture. The distribution of the dataset might be analyzed with a larger dataset. Nevertheless, dataset distribution analysis was beyond the scope of this study. The dataset distribution for the U-Net model's training and testing processes is illustrated in Fig. 10, where a subset of the ultrasonic tomography images of the CLP was used for training, and the remaining images were analyzed for testing.
Figure 11a depicts a picture of a specimen, which measures 2500 mm by 1400 mm. Because the CLP has a large size, a segment of the plate has been used for the typical experiment. Moreover, the “Computer Vision Annotation Tool" was implemented to obtain ground truth images of defects (shown in Fig. 11b). This particular online platform was chosen due to personal preference, but other software options could be used for the same purpose. According to the literature, this strategy of preserving the original shape of the defect is effective in a variety of different fields (Liu and Kim 2022). Similarly, other specimens were utilized to train the U-Net architecture, along with the match of ultrasound tomography and ground truth.
3.4 Image processing with U-Net
The proposed U-Net architecture was built on the Google Colab cloud simulation platform, due to sufficient computational resources provided for free. However, in some cases, the U-Net model can also be designed with other conventional IDEs (Integrated Development Environments). The specification of the simulation platform is shown in Table 2.
Using the GPU provided by Google Colab, the developed U-Net model was trained using the transfer learning method as well as using real experimental images obtained from ultrasonic tomography. The number of CUDA Cores is also helpful to reduce computation time. In this study, 400 epochs were needed to train the model, and its learning rate was 0.001.
The result of the U-Net segmentation is shown in Fig. 12, where the first column corresponds to the ultrasonic tomographic images from the experimental analysis, and the second column shows the ground truth images (obtained by the image annotation method), which corresponds to the original defect shape. Finally, the third column is associated with the segmented images by the U-Net architecture. It should be noted that the images for testing of the U-Net model were taken from the "Testing dataset" shown in Fig. 10 and were not used to train the U-Net model. The main limitation of the research method is the limited number of datasets for testing and validation. For this purpose, we selected sections of images with single defects as well as with multiple defects from the testing dataset. As a result, different shapes and dimensions of images are produced. Images with large or non-square shapes were compressed to a uniform square view to fit the fixed input size of the U-Net model, which was set at 300 × 300 pixels. In other words, all images in Fig. 12 correspond to the images from the “Testing dataset” in Fig. 10.
As can be seen from the segmented tomographic images in Fig. 12c, there are no blurred ones, and the edge of the defect can be clearly distinguished. In the case of the ultrasonic tomographic image (Fig. 12a), the gradually varying color makes it difficult to detect the clear location of the defects. Moreover, the segmented image closely resembles the shape of the ground truth image, indicating that the segmentation model has successfully localized the defects in the images with a high degree of accuracy. Despite variations in the shapes of the defects, the segmented images exhibit a high degree of similarity to the original defective images. To assess the accuracy of the segmented tomographic images, we calculated their intersection over union (IoU) metrics (Eq. 11). IoU is a common evaluation metric in image segmentation tasks, and it measures how well the predicted image segmentation map matches the ground truth segmentation map. To calculate IoU, we need to first calculate the intersection \(G \cap P\) and union of the ground truth and predicted image segmentations \(G \cup P\). The intersection is the area where the ground truth and predicted segmentations overlap, and the union is the area where either the ground truth or predicted segmentation exists.
where G is the ground truth image (Fig. 12b), and P is the predicted segmentation (Fig. 12c). We calculated the IoU for each image in the testing dataset to gain a better evaluation of the performance of our trained model. In this research, the IoU value was determined for each image, and the average of all tested images was taken, which resulted in a prediction of 83.5% for the U-Net architecture.
4 Discussions
This research investigates the method to improve ultrasonic tomographic images by using the deep learning algorithm and selecting the most suitable Lamb wave mode. In CLP, wall thinning is primarily the result of the surrounding environment. Sensitivity curves were used to determine the Lamb wave mode with the highest sensitivity to the defect. Based on the sensitivity curve in Fig. 3, it was found that the S0 mode had the highest sensitivity to wall thinning, particularly at a frequency of 2.35 MHz*mm. Therefore, this mode was used for the experiment, as any change in wall thickness would affect the time of flight of the waveform.
Ultrasonic tomographic imaging algorithm (RAPID) was employed to scan the mock-up CLP specimen from the defect-free side. Although the Lamb wave mode has been optimally selected, a tomographic image shows some blurred zones near defects, as can be seen in Fig. 7. In the case of a blurred ultrasonic image, it is not possible to evaluate the actual shape, and the intact section of the CLP will likely be misclassified as a defective zone or vice versa. Therefore, there is an increased risk of missing the CLP defect. To eliminate the blurred zone of the ultrasonic image, we implemented a U-Net-based segmentation method.
Due to the limited number of available datasets, we used the transfer learning technique using the ImageNet database. Moreover, experimental tomographic images were used to train U-Net, and datasets were collected by using three different kinds of ultrasonic probes with broadband and narrowband properties. The application of three different kinds of ultrasonic probes allowed us to collect 60 images from a mock-up CLP plate. The overall surface of the mock-up CLP specimen was divided into sections and an overall 60 ultrasonic tomographic images were collected, 42 of which were from the “Training dataset” subsection of the mock-up CLP specimen (as shown in Fig. 10) and the remaining 18 images from the “Testing dataset” subsection of the specimen (Fig. 10). Images were divided into a 70% (42 images) and 30% (18 images) ratio, and those image sets were used for training and testing of the proposed U-Net architecture. The photo of the mock-up CLP specimen was processed by an image annotation tool to obtain a ground truth image, as can be seen in Fig. 11b. The performance of the U-Net model was checked with the proposed data distribution by taking images of single, double, and 3 defects. It should be noted that the input image dimension for the U-Net architecture was 300 × 300 pixels. The predicted image by the U-Net architecture was represented in Fig. 11 by comparing the ultrasonic image with ground truth and segmented image by the U-Net architecture. To evaluate the accuracy of the segmentation result, we compared the ground truth and segmented images of the defect using IoU. Despite the limited training data implemented for the U-Net architecture, we achieved high accuracy with an IoU parameter, showing 83.5% accuracy.
5 Conclusions
A method of enhancing the ultrasonic tomographic image was investigated by selecting both the most sensitive Lamb wave mode and by applying the U-Net image segmentation method. According to the experimental results, the following conclusions can be drawn:
-
1.
The CLP's main structural defect is corrosion caused by harsh conditions between concrete and carbon steel plates, resulting in thinning of the walls. The sensitive curve was applied to select the optimal Lamb wave mode for detecting wall thinning, and the S0 mode at 2.35 MHz*mm was found to be the most suitable one.
-
2.
The ultrasonic tomographic image segmentation method is used to process ultrasonic tomographic images that have been reconstructed by applying the optimal Lamb wave mode. This method allows for the achievement of ultrasonic tomographic images without any blurred zones, which are more accurate than images obtained through conventional ultrasonic tomographic scanning methods. These combined techniques can be implemented as an assistant during the ultrasonic tomographic scanning process.
-
3.
An image annotation tool was applied to obtain the ground-truth image of the defects in a mock-up CLP specimen. The ground-truth image of the defects was obtained by image annotation tool using the photo of the mock-up CLP specimen and those images further used to train the U-Net model. Application of the image annotation tool is an effective technique to obtain the ground truth images of defects when a specimen of CLP is available.
-
4.
The U-Net architecture with transfer learning was used for tomographic image segmentation, and EfficientNet was used as the encoder module to achieve high ultrasonic tomography image resolution for better flaw identification. Despite the limited training data used in the U-Net model, relatively high accuracy was achieved. The segmentation accuracy was evaluated using the IoU technique, achieving an average accuracy of 83.5%.
References
Almansouri H, Venkatakrishnan S V, Buzzard GT et al (2019) Deep neural networks for non-linear model-based ultrasound reconstruction. In: 2018 IEEE global conference on signal and information processing, GlobalSIP 2018—proceedings 6–10. https://doi.org/10.1109/GlobalSIP.2018.8646704
Antonioli L, Pella A, Ricotti R et al (2021) Convolutional neural networks cascade for automatic pupil and iris detection in ocular proton therapy. Sensors 21:1–14. https://doi.org/10.3390/s21134400
Bao Y, Tan C, Dong F (2022) Oil–water two-phase flow volume fraction measurement based on nonlinear ultrasound technique. IEEE Trans Instrum Meas https://doi.org/10.1109/TIM.2022.3157368
Basu S, Thirumalaiselvi A, Sasmal S, Kundu T (2021) Nonlinear ultrasonics-based technique for monitoring damage progression in reinforced concrete structures. Ultrasonics 115:106472. https://doi.org/10.1016/j.ultras.2021.106472
De Carvalho OLF, Junior OADC, De Albuquerque AO et al (2022) Bounding box-free instance segmentation using semi-supervised iterative learning for vehicle detection. IEEE J Select Top Appl Earth Obs Remote Sens 15:3403–3420. https://doi.org/10.1109/JSTARS.2022.3169128
Cheng Y, Song Z, Jin J, Yang T (2019) Attenuation characteristics of stress wave peak in sandstone subjected to different axial stresses. Adv Mater Sci Eng. https://doi.org/10.1155/2019/6320601
da Costa MVCV, de Carvalho OLF, Orlandi AG et al (2021) Remote sensing for monitoring photovoltaic solar plants in Brazil using deep semantic segmentation. Energies 14:1–15. https://doi.org/10.3390/en14102960
Deng J, Dong W, Socher R et al (2009) ImageNet: a large-scale hierarchical image database, pp 248–255. https://doi.org/10.1109/cvprw.2009.5206848
Le Dinh T, Kwon S-G, Lee S-H, Kwon K-R (2021) Breast tumor cell nuclei segmentation in histopathology images using EfficientUnet++ and multi-organ transfer learning. J Korea Multimed Soc 24:1000–1011
Duan W, Gan TH (2019) Investigation of guided wave properties of anisotropic composite laminates using a semi-analytical finite element method. Compos Part B Eng 173:106898. https://doi.org/10.1016/j.compositesb.2019.106898
Gao H, Rose JL (2010) Goodness dispersion curves for ultrasonic guided wave based SHM: a sample problem in corrosion monitoring. Aeronaut J 114:49–56. https://doi.org/10.1017/S0001924000003523
Guo J, Zeng X, Liu Q, Qing X (2022) Lamb wave-based damage localization and quantification in composites using probabilistic imaging algorithm and statistical method. Sensors. https://doi.org/10.3390/s22134810
Hay TR, Royer RL, Gao H et al (2006) A comparison of embedded sensor Lamb wave ultrasonic tomography approaches for material loss detection. Smart Mater Struct 15:946–951. https://doi.org/10.1088/0964-1726/15/4/007
Hussain M, Bird JJ, Faria DR (2019) A study on CNN transfer learning for image classification. Adv Intell Syst Comput 840:191–202. https://doi.org/10.1007/978-3-319-97982-3_16
Jin Z, Li X, Zhang Y et al (2022) Boundary regression-based reep neural network for thyroid nodule segmentation in ultrasound images. Neural Comput Appl 34:22357–22366. https://doi.org/10.1007/S00521-022-07719-Y/METRICS
Li W, Xu Y, Qing X, Yang Z (2019) Quantitative imaging of surface cracks in polymer bonded explosives by surface wave tomographic approach. Polym Test 74:63–71. https://doi.org/10.1016/j.polymertesting.2018.12.020
Liu J, Kim JH (2022) A variable attention nested UNet++ network-based NDT X-ray image defect segmentation method. Coatings. https://doi.org/10.3390/coatings12050634
Lyu J, Tan C, Dong F (2020) image reconstruction for ultrasonic tomography using multi-channel convolutional neural network. In: Chinese control conference, CCC 2020-July:7304–7309. https://doi.org/10.23919/CCC50068.2020.9188702
Malikov AKU, Cho Y, Kim YH et al (2021) Ultrasonic assessment of thickness and bonding quality of coating layer based on short-time fourier transform and convolutional neural networks. Coatings. https://doi.org/10.3390/coatings11080909
Martucci D, Civera M, Surace C (2021) The extreme function theory for damage detection: an application to civil and aerospace structures. Appl Sci (switzerland) 11:1–23. https://doi.org/10.3390/app11041716
Medak D, Posilović L, Subašić M et al (2022) DefectDet: a deep learning architecture for detection of defects with extreme aspect ratios in ultrasonic images. Neurocomputing 473:107–115. https://doi.org/10.1016/j.neucom.2021.12.008
Mei Y, Jin H, Yu B et al (2021) Visual geometry Group-UNet: deep learning ultrasonic image reconstruction for curved parts. J Acoust Soc Am 149:2997–3009. https://doi.org/10.1121/10.0004827
Mushtaq Z, Su SF, Tran QV (2021) Spectral images based environmental sound classification using CNN with meaningful data augmentation. Appl Acoust 172:107581. https://doi.org/10.1016/j.apacoust.2020.107581
Nayak DR, Padhy N, Mallick PK et al (2022) Brain tumor classification using dense efficient-net. Axioms. https://doi.org/10.3390/axioms11010034
Nguon LS, Seo J, Seo K et al (2022) Reconstruction for plane-wave ultrasound imaging using modified U-Net-based beamformer. Comput Med Imaging Graph. https://doi.org/10.1016/J.COMPMEDIMAG.2022.102073
Pedrayes OD, Lema DG, Usamentiaga R et al (2022) Semantic segmentation for non-destructive testing with step-heating thermography for composite laminates. Meas J Int Meas Confed 200:111653. https://doi.org/10.1016/j.measurement.2022.111653
Pramanik R, Arockiarajan A (2019) Effective properties and nonlinearities in 1–3 piezocomposites: a comprehensive review. Smart Mater Struct. https://doi.org/10.1088/1361-665X/ab350a
Sharma N, Gupta S, Koundal D et al (2023) U-Net model with transfer learning model as a backbone for segmentation of gastrointestinal tract. Bioengineering 10:119. https://doi.org/10.3390/bioengineering10010119
Shi Z, Zhao Y, Ma C, Zhang J (2020) Parametric study of ultrasonic De-icing method on a plate with coating. Coatings. https://doi.org/10.3390/coatings10070631
Siddique N, Paheding S, Elkin CP, Devabhaktuni V (2021) U-net and its variants for medical image segmentation: a review of theory and applications. IEEE Access 9:82031–82057. https://doi.org/10.1109/ACCESS.2021.3086020
Šofer M, Šofer P, Ferfecki P et al (2021) Lamb wave mode scattering analysis on adhesively bonded single lap joint using modal decomposition method. Appl Math Model 89:413–427. https://doi.org/10.1016/j.apm.2020.08.017
ugli Malikov AK, Cho Y, Kim YH et al (2023) A novel ultrasonic inspection method of the heat exchangers based on circumferential waves and deep neural networks. Sci Prog 106:003685042211460. https://doi.org/10.1177/00368504221146081
ugli Malikov AK, Kim YH, Yi J-H et al (2022) Neural-network-based ultrasonic inspection of offshore coated concrete specimens. Coatings 12:773. https://doi.org/10.3390/coatings12060773
Wang B, Li J, Luo J et al (2021) Intelligent deblending of seismic data based on U-net and transfer learning. IEEE Trans Geosci Remote Sens 59:8885–8894. https://doi.org/10.1109/TGRS.2020.3048746
Wang S, Wu W, Shen Y et al (2020) Influence of the pzt sensor array configuration on lamb wave tomography imaging with the rapid algorithm for hole and crack detection. Sensors (switzerland). https://doi.org/10.3390/s20030860
Wu J, Chen XY, Zhang H et al (2019) Hyperparameter optimization for machine learning models based on Bayesian optimization. J Electron Sci Technol 17:26–40. https://doi.org/10.11989/JEST.1674-862X.80904120
Wu J, Xu X, Liu C et al (2021) Lamb wave-based damage detection of composite structures using deep convolutional neural network and continuous wavelet transform. Compos Struct 276:114590. https://doi.org/10.1016/j.compstruct.2021.114590
Zhang J, Cho Y, Kim J et al (2021) Non-destructive evaluation of coating thickness using water immersion ultrasonic testing. Coatings 11:1–10. https://doi.org/10.3390/COATINGS11111421
Zhang J, Cho Y, Kim J et al (2023) Nondestructive inspection of underwater coating layers using ultrasonic lamb waves
Zhao X, Gao H, Zhang G et al (2007) Active health monitoring of an aircraft wing with embedded piezoelectric sensor/actuator network: I. Defect detection, localization and growth monitoring. Smart Mater Struct 16:1208–1217. https://doi.org/10.1088/0964-1726/16/4/032
Zhou Q, Wang Q, Bao Y et al (2022) LAEDNet: a lightweight attention encoder–decoder network for ultrasound medical image segmentation. Comput Electr Eng 99:107777. https://doi.org/10.1016/J.COMPELECENG.2022.107777
Zhuang F, Qi Z, Duan K et al (2021) A comprehensive survey on transfer learning. Proc IEEE 109:43–76. https://doi.org/10.1109/JPROC.2020.3004555
Funding
This work was supported by the National Research Foundation of Korea (NRF) grant funded by Korea government (MSIT) (No.NRF-2020M2D2A1A02069933).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Malikov, A.K.u., Flores Cuenca, M.F., Kim, B. et al. Ultrasonic tomography imaging enhancement approach based on deep convolutional neural networks. J Vis 26, 1067–1083 (2023). https://doi.org/10.1007/s12650-023-00922-6
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12650-023-00922-6