CN116129198B

CN116129198B - Multi-domain tire pattern image classification method, system, medium and equipment

Info

Publication number: CN116129198B
Application number: CN202310382056.7A
Authority: CN
Inventors: 刘萌; 张风雷; 王旭峰; 周迪; 李达; 厉盛华
Original assignee: Shandong Jianzhu University
Current assignee: Shandong Jianzhu University
Priority date: 2023-04-12
Filing date: 2023-04-12
Publication date: 2023-07-18
Anticipated expiration: 2043-04-12
Also published as: CN116129198A

Abstract

The invention belongs to the technical field of image processing, and provides a multi-domain tire pattern image classification method, a system, a medium and equipment, aiming at the problem of multi-scene tire pattern image classification, the proposal is as follows: classifying the tire pattern image data set and the trained tire pattern classification model to obtain classification probability distribution to obtain tire pattern image classification results; the construction process of the tire pattern classification model comprises the following steps: dividing the tire pattern image data set into a source domain and a target domain; extracting features of the tire pattern image data to obtain global features and local features of the tire pattern image; based on the global features and the local features, a double-memory module is constructed, and visual knowledge and cross-domain transmission knowledge of the local features in two domains are stored through the double-memory module, so that holographic features containing information of the two domains are constructed by utilizing the features of one domain. The universality and the noise resistance are improved.

Description

Multi-domain tire pattern image classification method, system, medium and equipment

Technical Field

The invention belongs to the technical field of image processing, and particularly relates to a multi-domain tire pattern image classification method, a multi-domain tire pattern image classification system, a multi-domain tire pattern image classification medium and multi-domain tire pattern image classification equipment.

Background

The statements in this section merely provide background information related to the present disclosure and may not necessarily constitute prior art.

The tire pattern image classification task is a task of extracting features of tire pattern images by using a deep learning technology and classifying the tire pattern images by using a classifier according to probability distribution calculated by the extracted features, and is a basic task in the field of computer vision. The task also has a wide range of application scenarios in production and life, such as the need to classify tire patterns left on site, which is definitely inefficient and laborious if the pictures are classified manually due to the wide variety of vehicles. The task can effectively alleviate the problems, and greatly saves manpower and material resources on the basis of guaranteeing classification accuracy.

However, there are also certain difficulties in the investigation of tire pattern classification tasks, particularly for the following reasons:

1) There is a great difference in the tire pictures taken in different time domains. For the tire pictures shot in the same time period, such as the pictures shot in noon, the displayed pattern features are clear, the textures are clear, the negative shadow areas represented in the tire pictures are fewer, good classification effects can be obtained only by relying on the existing classical deep learning classification model, but the tire pictures shot in the timing period have no generality. For tire pattern pictures in actual application scenes, the shooting time is often very random, the brightness requirements of the tire pattern images on shooting environments are very high, different brightness can cause the texture features of the tire pattern images to show very large differences, and different background noise can be generated due to the influences of the shooting environments and the brightness, so that great difficulty is brought to classification of the tire pattern images.

2) The tire patterns photographed in different spatial domains are greatly different. Under the actual application scene, the acquisition place of the tire pattern pictures has extremely high instability, and the angles of the pictures acquired by photographers are different, so that a large number of tire pattern pictures with extremely high difference are generated, and when the tire pattern pictures are classified, the model is required to accurately identify the tire pattern pictures shot in different environments, and the pictures shot in different angles are required to have extremely high identification degree, so that the model is required to have extremely high noise resistance.

3) A single tire image contains multiple types of patterns. Because the road sections of the image acquisition are different, multiple vehicles can possibly pass through the acquisition points, which causes another problem of classifying the multiple images, the image often contains more complex characteristics, the noise resistance requirement on the model is very high, and the classifying performance of the current classifying model on the image still needs to be improved.

In view of the above challenges, existing methods propose a multi-period tire pattern image classification method that successfully improves the performance of one picture (e.g., a tire pattern picture taken in dusk) by transferring knowledge from a picture in another scene (e.g., a tire pattern picture taken in noon), but they have limitations in general because the source and target fields are taken at fixed angles. On the other hand, the detection performance of the multi-tire pattern image cannot reach the expected performance, which proves that the anti-noise performance of the model is not good, and the complicated tire image cannot be correctly classified.

Disclosure of Invention

In order to solve at least one technical problem in the background technology, the invention provides a multi-domain tire pattern image classification method, a multi-domain tire pattern image classification system, a multi-domain tire pattern image classification medium and multi-domain tire pattern image classification equipment, which can greatly inhibit background noise of pictures through multi-loss optimization by learning characteristics of target images from multiple angles, so that visual characteristics learned by a model are more comprehensive.

In order to achieve the above purpose, the present invention adopts the following technical scheme:

a first aspect of the present invention provides a multi-domain tire pattern image classification method comprising the steps of:

acquiring a tire pattern image dataset;

classifying the tire pattern image data set and the trained tire pattern classification model to obtain classification probability distribution to obtain tire pattern image classification results;

the construction process of the tire pattern classification model comprises the following steps:

dividing the tire pattern image data set into a source domain and a target domain; extracting features of the tire pattern image data to obtain global features and local features of the tire pattern image; based on the global features and the local features, a double-memory module is constructed, and visual knowledge and cross-domain transmission knowledge of the local features in two domains are stored through the double-memory module, so that holographic features containing information of the two domains are constructed by utilizing the features of one domain.

Further, the feature extraction of the tire pattern image data to obtain the global feature and the local feature of the tire pattern image includes:

carrying out feature extraction on the tire pattern picture data by adopting a pre-training neural network to obtain integral image features;

carrying out average value pool processing on the integral image characteristics;

introducing a self-attention mechanism to encode the image characteristics processed by the mean value pool to obtain image encoding characteristics;

based on the image coding features, factorization is carried out by adopting a depth residual mapping method, and global features and local features of the tire pattern image are obtained.

Further, the process of constructing the dual memory module includes:

the mass center of the distinguishing characteristics of different tire pictures is used as a basic component of the memory;

in the basic component of each memory, each centroid is initialized to a mean vector of the local features of all tread image samples belonging to the respective class.

Further, the method for storing the visual knowledge of the local features in two domains and transmitting the knowledge across domains by using the dual memory module, so as to construct the holographic features containing the information of the two domains by using the features of one domain specifically comprises the following steps:

constructing and obtaining heterogeneous holographic features based on local features of a source domain memory module and global image features of a target domain;

based on the local characteristics of the memory module of the source domain or the target domain and the global characteristics thereof, constructing and obtaining homogeneous holographic characteristics;

and splicing the heterogeneous holographic features and the homogeneous holographic features to obtain the holographic features.

Further, the constructing to obtain the heterogeneous holographic feature based on the local feature of the source domain memory module and the global image feature of the target domain includes:

based on the local characteristics of a given domain, obtaining the memory characteristics of the opposite domain by retrieving the memory;

adopting a memory selector based on an attention mechanism to obtain the participated heterogeneous memory characteristics by self-adaptive selection in a soft mode;

and combining the memory characteristics of the opposite domains with the participating heterogeneous memory characteristics to obtain the heterogeneous holographic characteristics.

Further, the constructing the homogeneous holographic feature based on the local feature and the global feature of the memory module of the source domain or the target domain includes:

based on the given global feature, obtaining global memory features by retrieving the memory;

adopting a memory selector based on an attention mechanism to obtain the participated homogeneous memory characteristics in a soft mode in a self-adaptive mode;

and combining the global memory characteristics with the participated homogeneous memory characteristics to obtain homogeneous holographic characteristics.

Further, the loss function when the tire pattern classification model is trained is as follows:

wherein,,to counter the balance weight of the loss, +.>Balance weight for memory loss, +.>For the discrimination loss of holographic features, < >>For discriminating loss of local features, < >>To optimize the contrast loss of the domain arbiter and the dynamic holographic module,is good atDissolving the fight loss function of the decomposition module +.>Is a memory penalty.

A second aspect of the present invention provides a multi-domain tire pattern image classification system comprising:

a data acquisition module configured to: acquiring a tire pattern image dataset;

an image classification module configured to: classifying the tire pattern image data set and the trained tire pattern classification model to obtain classification probability distribution to obtain tire pattern image classification results;

A third aspect of the present invention provides a computer-readable storage medium.

A computer readable storage medium having stored thereon a computer program which when executed by a processor performs the steps of a multi-domain tire pattern image classification method according to the first aspect.

A fourth aspect of the invention provides an electronic device.

A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, the processor implementing the steps in a multi-domain tire pattern image classification method according to the first aspect when the program is executed.

Compared with the prior art, the invention has the beneficial effects that:

1. the invention obtains the global feature and the local feature of the tire pattern image by carrying out feature extraction on the tire pattern image data; based on the global features and the local features, a double-memory module is constructed, visual knowledge and cross-domain transmission knowledge of the local features in two domains are stored through the double-memory module, and therefore holographic features containing information of the two domains are constructed by utilizing the features of one domain; compared with the previous method, the tire pattern classification method has the advantages that the accuracy of tire pattern classification tasks is improved to a certain extent, and the universality and the noise resistance are improved.

2. Aiming at the problem of classifying multi-scene tire pattern pictures, the invention provides a multi-domain tire pattern image classifying method which can learn the characteristics of a target image from multiple angles, and can greatly inhibit the background noise of the pictures through multi-loss optimization, so that the visual characteristics learned by a model are more comprehensive.

3. Aiming at the problem of single-picture multi-pattern, the invention can learn by setting a single-pattern tire image as a source domain and a multi-pattern image as a target domain, thereby greatly increasing the intake of useful information and interfering the influence of patterns.

Drawings

The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the invention.

FIG. 1 is a flow chart of a multi-domain tire pattern image classification method provided by an embodiment of the invention.

Detailed Description

The invention will be further described with reference to the drawings and examples.

It should be noted that the following detailed description is illustrative and is intended to provide further explanation of the invention. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs.

It is noted that the terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of exemplary embodiments according to the present invention. As used herein, the singular is also intended to include the plural unless the context clearly indicates otherwise, and furthermore, it is to be understood that the terms "comprises" and/or "comprising" when used in this specification are taken to specify the presence of stated features, steps, operations, devices, components, and/or combinations thereof.

Interpretation of the terms

Source domain: (source domain) represents a different domain than the test sample, but has rich supervisory information;

target domain: (target domain) indicates the domain in which the test sample is located, with no label or only a small number of labels.

The source domain and the target domain often belong to the same class of tasks, but are distributed differently.

Example 1

As shown in fig. 1, the present embodiment provides a multi-domain tire pattern image classification method, which includes the following steps:

step 1: acquiring tire pattern image data of different scenes and shooting angles, so that tire pattern pictures under different scenes and different times are respectively classified;

and randomly selecting two types of tire pattern picture data of different scenes and shooting angles to be placed in a source domain and a target domain.

Tire pattern picture assuming target areaAnd the picture set of the source domain->。

Wherein,,representing the +.>Picture (or->Representation->Corresponding pattern types of tiresTo represent one of two data sets, wherein +.>Representing the picture domain index.

Step 2: extracting features of tire pattern image data by using ResNet-152 pre-trained on ImageNet to obtain integral image features, and carrying out average value pool processing on images processed by a pre-training network to obtain feature embedding dimension as followsImage characteristics of->，/>Is a CNN feature dimension.

Wherein,,for integral image features->And->For the features of the ith and jth image, -/->And->Respectively 3 1 x 1 convolution layers, < >>For the attention score, the relationship between the jth image and the ith image is expressed, and T represents processing T pictures at a time.

Using trainable parametersThe participating features are added back to the original features and then pooling is used to compute a representation of the input features:

in order to obtain fine grain characteristic representation of the texture characteristics of the tire pattern picture, a self-attention mechanism is introduced to the preprocessed image characteristics, and the image characteristics after the mean value pool processing are encoded to obtain image encoding characteristics. Thus, the final image coding feature is obtained.

Step 3: and factorizing the final overall image features by adopting a depth residual mapping method to obtain global features and local texture features of the tire pattern image so as to transmit specific knowledge among different features.

In order to pass knowledge between different domains, it is necessary to put the knowledge in a stateDecomposition into local features->And global features->This allows domain-specific knowledge to be passed between different domains to construct holographic features.

The depth residual mapping method is adopted to decompose the characteristics, and a residual decomposition module is as follows:

wherein,,representing local features->Representing global features->Is made up of parameters->A nonlinear transducer composed of a fully attached layer and a tanh activation layer.

Step 4: building dual memory modulesAnd->And respectively storing the local characteristics of all tire pattern categories in the two domains through the double memory modules.

The mass center of the distinguishing characteristics of different tire pictures is used as a basic component of the memory, so thatVisual feature memory representing target domain training data;

in a similar manner to that described above,visual characteristics memory representing the source domain.

Wherein K is the tire pattern number, each centroidKnowledge of local features representing the ith tire class held in the target domain, each centroid +.>Knowledge of the local features of the ith tire class stored in the source domain is shown.

At the position ofAnd->Each centroid being initialized to a mean vector of local features of all tire tread image samples belonging to the respective class:

wherein,,is a support set containing training samples of all markers in field v of the ith tire tread category,/->Is thatLocal features of the j-th picture in (a).

When constructing the holographic features of the source domain, the corresponding features of the target domain are required to be supplemented, and then the realization is proposed to save the local features of each domain through a memory block, wherein the dual memory module comprises two memory blocks for saving the visual knowledge of the local features of the image and the cross-domain transmission knowledge in the two domains.

Step 5: the method for constructing the holographic feature containing the information of the two domains by using the characteristics of one domain comprises the steps of:

step 501: combining knowledge from local features of a source domain memory module and global image features of a target domain to construct heterogeneous holographic features, wherein the method specifically comprises the following steps:

step 5011: given local featuresBy retrieving memory->Obtain memory characteristics of opposite domains->The operation is as follows:

Norm(/>),/>

wherein,,is a learnable linear transducer constructed based on a fully connected layer and is used for calculating holographic coefficients +.>By retrieving the memory of the opposite domain +.>Norm is a regularization function, a Softmax function.

Step 5012: a memory selector based on an attention mechanism is designed, and important memories are adaptively selected in a soft mode:

wherein,,is a learnable linear transducer constructed based on fully connected layers>Representing multiplication of elements>Is a participating heterogeneous memory feature.

Step 5013: bonding ofAnd->Construction of heterogeneous holographic features->The following is shown:

step 502: combining knowledge from local features of a memory module of a source domain or a target domain and global features thereof to construct homogeneous holographic features, wherein the method specifically comprises the following steps:

step 5021: given global featuresBy retrieving memory->Obtain global memory feature->The operation is as follows:

Norm(/>),/>

wherein,,is a learnable linear transducer constructed based on a fully connected layer and is used for calculating holographic coefficients +.>By retrieving domain memory->Norm is a regularization function, a Softmax function.

Step 5022: a memory selector based on an attention mechanism is designed, and important memories are adaptively selected in a soft mode:

wherein,,is a learnable linear transducer constructed based on fully connected layers>Representing multiplication of elements>Is a participated homogeneous memory feature.

Step 5023: bonding ofAnd->Construction of homogeneous holographic features->The following is shown:

step 503: and splicing the heterogeneous holographic features and the homogeneous holographic features to obtain holographic features, wherein the holographic features comprise image features from a source domain and a target domain.

The holographic features were constructed as follows:

wherein,,representing vector concatenation, wherein +_in the above formula>And->Image information containing a target field is provided,and->Image information containing a source field.

Step 6: based on holographic features, a tire pattern classification module is designed, which generates a final probability distribution through a Softmax full-connection layer.

Based on holographic featuresA classification module is created>The module can generate a final classification probability distribution through the fully connected layer with Softmax.

In the multi-classification recognition task, the same classification module is used to respectively base onAnd->Calculating classification scores of the source domain and the target domain, < ->And->Share the same holographic feature space.

Step 7: using a weighted sum of multiple loss functions as the total loss for the method, the losses involved are as follows:

wherein,,to counter the balance weight of the loss, +.>Balance weight for memory loss, +.>For the discrimination loss of holographic features, < >>For discriminating loss of local features, < >>To optimize the contrast loss of the domain arbiter and the dynamic holographic module,to optimize the fight loss function of the decomposition module +.>Is a memory penalty.

Application ofAnd->Make holographic feature->And local features->The discrimination is provided for different tire types. The dynamic holographic module is connected with the classification module to construct holographic characteristics and predict the probability distribution y of final classification.

Wherein,,constraint using cross entropy loss:

wherein,,is a group-trunk type tag +.>Is encoded by one-hot->Representation->。

Discrimination loss: discrimination loss makes the holographic and local features of different domains distinguishable.

For both domains, the local features of the domain are specified with negative log likelihood of the group-trunk tire class:

wherein,,by being based on local features->Classification module of->Generate (I)>。

Countering losses: the global features of different domains share a global feature space, and the holographic features of different domains share a holographic feature space.

By limiting the dynamic holographic block so that a trained arbiter cannot reliably predict domain labels of holographic features. This is achieved by gradient inversion layer (GRL) and domain discriminatorsRealized on the basis of holographic features->Predictive domain label->。

Optimizing a domain discriminator and a dynamic hologram module by using cross entropy loss:

wherein,,is a group-trunk domain label +.>Is described.

Creating another GRL layer and view discriminatorIt is based on->Predictive domain label->。

The fight loss function of the optimization decomposition module is defined as follows:

wherein,,is a group-trunk domain label +.>Is described.

Memory loss: to optimize the dual memory module, local feature information is updated by minimizing intra-class distances and inter-class distances between view-specific features and each memory centroid to ensure intra-class compactness and inter-class differentiation.

Specifically, there are two components of memory loss:

wherein,,and->The similarity loss and the inter-class large marginal loss are respectively. Wherein:

wherein,,for manually defined boundary factors +.>Is based on image->Local features calculated by feature coding and decomposition module, < >>For specific domain->Middle->The characteristic memory centroid of each tire class.

In this embodiment, the final classification results include "dandulp 1956016", "dongfeng 165R13", "kantai 2056516", and the like.

Example two

The present embodiment provides a multi-domain tire pattern image classification system, comprising:

Example III

The present embodiment provides a computer-readable storage medium having stored thereon a computer program which, when executed by a processor, implements the steps of a multi-domain tire pattern image classification method of the first embodiment.

Example IV

The present embodiment provides an electronic device including a memory, a processor, and a computer program stored on the memory and executable on the processor, wherein the processor implements the steps in the multi-domain tire pattern image classification method of the first embodiment when executing the program.

The above description is only of the preferred embodiments of the present invention and is not intended to limit the present invention, but various modifications and variations can be made to the present invention by those skilled in the art. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims

1. The multi-domain tire pattern image classification method is characterized by comprising the following steps of:

acquiring a tire pattern image dataset;

dividing the tire pattern image data set into a source domain and a target domain; extracting features of the tire pattern image data to obtain global features and local features of the tire pattern image; based on the global features and the local features, a double-memory module is constructed, visual knowledge and cross-domain transmission knowledge of the local features in two domains are stored through the double-memory module, and therefore holographic features containing information of the two domains are constructed by utilizing the features of one domain;

the method for saving the visual knowledge and the cross-domain transmission knowledge of the local features in two domains by the double memory module, thereby constructing the holographic features containing the information of the two domains by utilizing the features of one domain comprises the following steps:

based on the local characteristics of the memory module of the source domain or the target domain and the global characteristics thereof, constructing and obtaining homogeneous holographic characteristics; splicing the heterogeneous holographic features and the homogeneous holographic features to obtain holographic features;

the heterogeneous holographic feature is constructed based on the local feature of the source domain memory module and the global image feature of the target domain, and comprises the following steps:

based on the local characteristics of a given domain, obtaining the memory characteristics of the opposite domain by retrieving the memory; adopting a memory selector based on an attention mechanism to obtain the participated heterogeneous memory characteristics by self-adaptive selection in a soft mode; combining the memory characteristics of the opposite domains with the participating heterogeneous memory characteristics to obtain heterogeneous holographic characteristics;

the construction of the local feature and the global feature of the memory module based on the source domain or the target domain to obtain the homogeneous holographic feature comprises the following steps:

based on the given global feature, obtaining global memory features by retrieving the memory; adopting a memory selector based on an attention mechanism to obtain the participated homogeneous memory characteristics in a soft mode in a self-adaptive mode; combining global memory characteristics and participated homogeneous memory characteristics to obtain homogeneous holographic characteristics;

based on holographic features, a tire pattern classification module is designed that generates a final probability distribution through a Softmax full tie layer, comprising:

in the multi-classification recognition task, the same classification module is used for calculating classification scores of a source domain and a target domain based on holographic features respectively, and the holographic features of different domains share the same holographic feature space.

2. A multi-domain tire pattern image classification method as in claim 1, wherein said feature extraction of tire pattern image data to obtain global features and local features of tire pattern images comprises:

3. A multi-domain tire tread image classification method as in claim 1, wherein said process of constructing a dual memory module comprises:

4. A multi-domain tire pattern image classification method as claimed in claim 1, wherein the loss function of the tire pattern classification model when trained is:

wherein,,to counter the balance weight of the loss, +.>Balance weight for memory loss, +.>For the discrimination loss of holographic features, < >>For discriminating loss of local features, < >>To optimize the contrast loss of the domain arbiter and the dynamic holographic module +.>To optimize the fight loss function of the decomposition module +.>Is a memory penalty.

5. A multi-domain tire pattern image classification system, comprising:

constructing and obtaining heterogeneous holographic features based on local features of a source domain memory module and global image features of a target domain; based on the local characteristics of the memory module of the source domain or the target domain and the global characteristics thereof, constructing and obtaining homogeneous holographic characteristics; splicing the heterogeneous holographic features and the homogeneous holographic features to obtain holographic features;

6. A computer readable storage medium having stored thereon a computer program which when executed by a processor performs the steps of a multi-domain tire pattern image classification method as claimed in any one of claims 1-4.

7. An electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor performs the steps of a multi-domain tire pattern image classification method as claimed in any one of claims 1-4 when the program is executed.