CN112699736A

CN112699736A - Bridge bearing fault identification method based on space attention

Info

Publication number: CN112699736A
Application number: CN202011442501.7A
Authority: CN
Inventors: 艾志勇; 崔弥达; 荣耀; 张恺; 吴刚
Original assignee: JIANGXI TRAFFIC SCIENCE RESEARCH INSTITUTE; Southeast University
Current assignee: JIANGXI TRAFFIC SCIENCE RESEARCH INSTITUTE; Southeast University
Priority date: 2020-12-08
Filing date: 2020-12-08
Publication date: 2021-04-23
Anticipated expiration: 2040-12-08
Also published as: CN112699736B

Abstract

The present invention provides a method for identifying diseases of bridge supports based on spatial attention, which includes the following steps: acquiring image data of bridge supports, and assigning labels by manual labeling. The labels include normal supports and possible occurrences during service of the bridge. of various bearing diseases. Construct a neural network model with a spatial attention mechanism, in which the spatial attention mechanism generates 4 attention coordinate values through a small neural network, screen out the valuable areas in the image according to these 4 coordinate values, and pass the grid points The generating function and bilinear interpolation method are scaled to the specified size; the output of the spatial attention mechanism is used as the input of the convolutional neural network for training, and the neural network model with predicting bearing disease is obtained. The attention model of the present invention can allow the network model to automatically extract valuable areas in the bridge support image for learning, and compared with the traditional convolutional neural network model, the identification accuracy of the support disease can be effectively improved.

Description

Bridge bearing fault identification method based on space attention

Technical Field

The invention relates to a bridge bearing fault identification method based on space attention, and belongs to the technical field of civil engineering and artificial intelligence interaction.

Background

With the rapid development of infrastructure construction, the civil engineering industry develops rapidly, and after a large number of bridges are constructed, the bridge support is an important stressed component of the bridge, and diseases such as aging, cracking and the like can occur in the long-term service process, so that the normal use function of the bridge support is influenced. People always rely on means such as daily and periodic inspection, sampling and temporary inspection and the like to acquire relevant information of the structure. However, the existing bridge appearance detection mainly depends on manual detection, the method is low in efficiency, long in time consumption and high in cost, the manual detection method is greatly influenced by factors such as environment, professional technical literacy of detection personnel and the like, and the detection result is uncertain.

The image processing technology based on deep learning is rapidly developed and widely applied to various industries, however, a deep learning model usually needs a large amount of labeled data to be trained so as to achieve higher precision. In actual engineering, some disease image data are difficult to obtain, and the support is used as a connecting member of a bridge, so that other members and some background information are included in the image acquisition engineering, which is difficult to avoid, and the identification result of the model is influenced. Therefore, a method for improving the identification precision of the model under the conditions of limited data set and complex scene is urgently needed.

Disclosure of Invention

The invention aims to solve the technical problem of providing a bridge support fault identification method based on space attention, which is used for improving the training efficiency and the precision of a neural network for identifying bridge support faults.

The invention adopts the following technical scheme for solving the technical problems: the invention designs a bridge support fault identification method based on space attention, which is characterized in that a target neural network based on a space attention system for identifying bridge support faults is trained and obtained by applying the following steps A to E based on a plurality of sample images respectively only containing one bridge support; the target neural network comprises a space coordinate generation network and a classification network; then, sequentially applying the space coordinates to generate a network and a classification network aiming at a target image to be detected, so as to obtain a classification result of the bridge bearing contained in the target image;

a, respectively associating each sample image with a preset classification label type of a bridge support contained in the sample image to construct a bridge support sample image database;

b, defining a minimum rectangular area where a bridge support is located in a sample image as a spatial attention area, and constructing a spatial coordinate generation network by taking the sample image as input and four spatial coordinates of the spatial attention area in the sample image as output based on a bridge support sample image database, wherein the four spatial coordinates of the spatial attention area are x_min,y_min,x_max,y_maxWherein x is_min,y_minThe abscissa and ordinate, x, of the upper left corner of the spatial attention area_max,y_maxThe horizontal coordinate and the vertical coordinate of the right lower corner point of the spatial attention area are shown;

step C, aiming at each sample image, according to four space coordinates of a space attention area in the sample image, applying a lattice point generating function to obtain coordinates of each lattice point under the corresponding preset proportion grid division in the space attention area in the sample image, and then entering the step D;

step D, aiming at each sample image, obtaining the pixel value of each lattice point in a space attention area in the sample image through a preset interpolation method, and obtaining a processed image which corresponds to each sample image and is based on an attention mechanism;

and E, constructing a classification network which takes the pictures of the sample images based on the attention mechanism as input and the classes of the support images in the corresponding sample images as output based on the labels in the bridge support sample image database and the pictures of the sample images based on the attention mechanism, taking the difference between the output classes of the classification network and the label classes in the step A as a target of network optimization, and performing end-to-end iterative training on the spatial coordinates to generate a network and the classification network to obtain a target neural network.

As a preferred technical scheme of the invention, in the step a, the different states of the bridge support included in the constructed bridge support sample image database include both a normal support state and states of various support diseases preset in the service process of the bridge.

In a preferred embodiment of the present invention, in step a, the resolution of the image in the constructed bridge support sample image database is above 800 × 600.

As a preferred technical solution of the present invention, the spatial coordinate generation network in step B includes a convolutional layer, a pooling layer, and a full connection layer.

As a preferred embodiment of the present invention, in step C, the lattice point generating function generates image lattice points according to a proportional relationship between 4 image coordinates generated by the attention mechanism and the original image, and the lattice point generating function is described as follows:

wherein, x'_ijIs the corresponding abscissa, y 'of the grid point i, j in the original image'_ijIs the corresponding ordinate of the grid point i, j in the original image, w 'is the width of the preset grid point image, h' is the height of the corresponding preset grid point image, x_ijIs the abscissa, y, of the grid point i, j in the preset grid point image_ijIs the ordinate of the grid point i, j in the preset grid point image.

As a preferred embodiment of the present invention, in step D, the preset interpolation method is a differentiable interpolation method to meet the requirement of error back propagation.

As a preferred technical solution of the present invention, in step E, the classification network is a neural network that satisfies the preset specified feature characterization capability.

As a preferred technical solution of the present invention, the neural network with the preset designated feature characterization capability is VGG or ResNet.

As a preferred technical solution of the present invention, in step E, a gradient descent method is selected for iterative training of the target neural network.

Compared with the prior art, the bridge bearing disease identification method based on space attention has the following technical effects:

according to the method, under a limited data set, a target neural network consisting of a spatial attention mechanism is obtained through a combined training spatial coordinate generation network and a classification network, aiming at a bridge support image to be detected, an interested area in the bridge support image is firstly identified, and then whether the bridge support is damaged or not and the type of the damage are obtained through the classification network.

Drawings

FIG. 1 is a flow chart of the present invention;

FIG. 2 is a schematic diagram of a spatial coordinate generation network in accordance with the present invention;

FIG. 3 is a schematic flow chart of the method for obtaining attention area according to the present invention;

fig. 4 is a schematic diagram of a VGG-16 classification network used in the present invention.

Detailed Description

The following description will explain embodiments of the present invention in further detail with reference to the accompanying drawings.

The hardware conditions for training the convolutional neural network of the present embodiment are: the amazon AWS cloud computing service is enabled to configure an amazon EC2P2.xlarge instance, the instance is configured with 1 GPU, 4 vCPUs, 61GB random access memories, the system adopts an ubuntu system, the programming language adopts python, and the deep learning platform uses pytorch.

As shown in the flowchart of fig. 1, in the method for identifying a bridge bearer fault based on spatial attention implemented in this embodiment, 5000 sample images with a resolution of more than 800 × 600 are selected, each sample image respectively includes different states of the bridge bearer, including a normal bearer state and states of various bearer faults preset in the bridge service process, where the bridge bearer states in the sample images in this embodiment include three types of a normal bearer, a cracked bearer, and a shear deformation bearer.

Based on the sample image, the following steps A to E are applied, and a target neural network which is used for identifying the bridge support diseases and is based on a space attention mechanism is trained and obtained; the target neural network comprises a space coordinate generation network and a classification network.

And A, respectively associating each sample image with a preset classification label type of the bridge support contained in the sample image to construct a bridge support sample image database, wherein the classification label in the embodiment contains three types of bridge supports, namely a normal support, a cracking support and a shearing deformation support.

Step B, as shown in FIG. 2, defining a minimum rectangular area where a bridge support is located in a sample image as a spatial attention area, constructing a spatial coordinate generation network based on a bridge support sample image database, wherein the spatial coordinate generation network takes the sample image as input and four spatial coordinates of the spatial attention area in the sample image as output, and comprises a convolution layer, a pooling layer and a full-connection layer, wherein the four spatial coordinates of the spatial attention area are x_min,y_min,x_max,y_maxWherein x is_min,y_minThe abscissa and ordinate, x, of the upper left corner of the spatial attention area_max,y_maxThe abscissa and the ordinate of the lower right corner of the spatial attention area.

And step C, as shown in FIG. 3, applying a grid point generating function according to the four spatial coordinates of the spatial attention area in the sample image respectively for each sample image, and obtaining the coordinates of each grid point under the corresponding preset proportion grid division in the spatial attention area in the sample image.

The lattice point generating function generates image lattice points according to the proportional relation between 4 image coordinates generated by the attention mechanism and the original image, and the lattice point generating function is described as follows:

And D, aiming at each sample image, obtaining the pixel value of each lattice point in a spatial attention area in the sample image by a differentiable interpolation method meeting the requirement of error back propagation, wherein a bilinear interpolation method is specifically selected in the embodiment, and obtaining a processed image corresponding to each sample image based on an attention mechanism.

Step e, as shown in fig. 4, based on the tags in the bridge support sample image database and the attention-based pictures of each sample image, constructing a classification network that uses the attention-based pictures of each sample image as input and uses the category of the support image in the corresponding sample image as output and satisfies the preset specified feature characterization capability, specifically selecting a VGG network in a classical convolutional neural network in the present embodiment, then using the difference between the output category of the classification network and the category marked in step a as a target of network optimization, and performing joint training on the space coordinate generation network and the classification network by using a gradient descent method to obtain a target neural network.

And then, sequentially applying the space coordinates to generate a network and a classification network aiming at the target image to be detected, so as to obtain a classification result of the bridge bearing included in the target image.

The embodiments of the present invention have been described in detail with reference to the drawings, but the present invention is not limited to the above embodiments, and various changes can be made within the knowledge of those skilled in the art without departing from the gist of the present invention.

Claims

1. a method for identifying diseases of a bridge bearing based on spatial attention, is characterized in that, based on a plurality of sample images respectively comprising only a bridge bearing, applying the following steps A to step E, training and obtaining are used to identify the bridge bearing. A target neural network based on the spatial attention mechanism of the disease; wherein, the target neural network includes a spatial coordinate generation network and a classification network; then, for the target image to be detected, the spatial coordinate generation network and the classification network are sequentially applied to obtain the target image. The classification results of the included bridge supports;

Step A. Correlate each sample image with the preset classification label type of the bridge support it contains, and construct a bridge support sample image database;

Step B. Define the minimum rectangular area where the bridge support is located in the sample image as the spatial attention area. Based on the bridge support sample image database, construct the sample image as the input, and the four spatial coordinates of the spatial attention area in the sample image are: The output spatial coordinate generation network, where the four spatial coordinates of the spatial attention area are x _min , y _min , x _max , y _max , where x _min , y _min are the abscissa and ordinate of the upper left corner of the spatial attention area Coordinates, x _max , y _max are the abscissa and ordinate of the lower right corner of the spatial attention area;

Step C. For each sample image, according to the four spatial coordinates of the spatial attention area in the sample image, apply the grid point generation function to obtain the corresponding preset scale grid division in the spatial attention area in the sample image, each The coordinates of the grid point, and then go to step D;

Step D. For each sample image, obtain the pixel value of each grid point in the spatial attention area in the sample image by a preset interpolation method, and obtain a processed image corresponding to each sample image based on the attention mechanism;

Step E. Based on the labels in the bridge support sample image database and the pictures based on the attention mechanism of each sample image, construct the pictures based on the attention mechanism of each sample image as input, and use the corresponding sample images in the support The category of the image is the output classification network. According to the output category of the classification network, the difference between the output category and the labeled category in step A is the goal of network optimization. After end-to-end iterative training of the spatial coordinate generation network and the classification network, the target neural network is obtained.

2. a kind of bridge bearing disease identification method based on spatial attention according to claim 1, is characterized in that, in step A, the bridge bearing different states that the constructed bridge bearing sample image database comprises, both include normal The state of the bearing also includes the state of various bearing diseases preset during the service of the bridge.

3. A method for identifying diseases of bridge bearing based on spatial attention according to claim 1, wherein in step A, in the constructed bridge bearing sample image database, the resolution of the image is above 800×600 .

4 . The method for identifying diseases of bridge supports based on spatial attention according to claim 1 , wherein the spatial coordinate generation network in step B comprises a convolution layer, a pooling layer and a fully connected layer. 5 .

5. a kind of bridge bearing disease identification method based on spatial attention according to claim 1, is characterized in that, in step C, grid point generation function according to the 4 image coordinates that attention mechanism generates and original image proportional relationship To generate image lattice points, the description of the lattice point generation function is as follows:

Among them, x' _ij is the abscissa corresponding to grid point i, j in the original image, y' _ij is the ordinate corresponding to grid point i, j in the original image, w' is the width of the preset grid image, h ' is the height of the corresponding preset grid image, x _ij is the abscissa of the grid point i, j in the preset grid image, y _ij is the ordinate of the grid point i, j in the preset grid image.

6. a kind of bridge bearing disease identification method based on spatial attention according to claim 1, is characterized in that, in step D, the preset interpolation method is a differentiable interpolation method, so as to satisfy the error of back propagation. Require.

7 . The method for identifying diseases of bridge supports based on spatial attention according to claim 1 , wherein, in step E, the classification network is a neural network that satisfies the preset specified feature representation ability. 8 .

8 . The method for identifying diseases of bridge support based on spatial attention according to claim 7 , wherein the neural network of the preset specified feature representation capability is VGG or ResNet. 9 .

9 . The method for identifying diseases of bridge support based on spatial attention according to claim 1 , wherein, in step E, gradient descent method is selected for the iterative training of the target neural network. 10 .