CN113052245B

CN113052245B - Image clustering method and device, electronic equipment and storage medium

Info

Publication number: CN113052245B
Application number: CN202110340584.7A
Authority: CN
Inventors: 熊永福; 陈维立; 黄文雯
Original assignee: Chongqing Unisinsight Technology Co Ltd
Current assignee: Chongqing Unisinsight Technology Co Ltd
Priority date: 2021-03-30
Filing date: 2021-03-30
Publication date: 2023-08-25
Anticipated expiration: 2041-03-30
Also published as: CN113052245A

Abstract

The embodiment of the application provides an image clustering method and device, electronic equipment and a storage medium, and relates to the technical field of image clustering. The image clustering method comprises the following steps: firstly, acquiring image characteristics to be processed; secondly, inputting the image features to be processed into a preset image clustering model to perform first clustering processing to obtain a first image clustering result and a first type of center feature set; and then, inputting the first image clustering result and the first type center feature set into a preset image clustering model to perform second clustering processing, so as to obtain a second image clustering result. By the method, secondary clustering can be realized according to the same clustering model, and the problem of low image clustering efficiency caused by the fact that only pictures to be clustered are clustered in the prior art and a large number of pictures of the same individual are still distributed in a plurality of categories is solved.

Description

Image clustering method and device, electronic equipment and storage medium

Technical Field

The present application relates to the field of image clustering technologies, and in particular, to an image clustering method and apparatus, an electronic device, and a storage medium.

Background

According to the research of the inventor, in an actual application scene, the prior art only clusters pictures to be clustered, so that higher file splitting and lower recall rate are easy to generate, namely, a large number of pictures of the same individual are still distributed in a plurality of categories, so that the clustering result has very limited utilization value, and the problem of low image clustering efficiency exists.

Disclosure of Invention

Accordingly, an object of the present application is to provide an image clustering method and apparatus, an electronic device and a storage medium, so as to improve the problems in the prior art.

In order to achieve the above purpose, the embodiment of the present application adopts the following technical scheme:

in a first aspect, the present application provides an image clustering method, including:

acquiring image characteristics to be processed;

inputting the image features to be processed into a preset image clustering model to perform first clustering processing to obtain a first image clustering result and a first type center feature set;

and inputting the first image clustering result and the first type center feature set into the preset image clustering model to perform second clustering processing to obtain a second image clustering result.

In an alternative embodiment, the step of acquiring the feature of the image to be processed includes:

acquiring an image feature vector set, wherein the image feature vector set comprises at least one image feature vector;

performing similarity comparison on the image feature vector set and a historical clustering result to obtain similarity of each image feature vector, wherein the historical clustering result is a clustering result obtained by performing image clustering in advance;

judging whether the similarity of each image feature vector exceeds a preset similarity threshold, and taking the image feature vector which does not exceed the preset similarity threshold as the image feature to be processed.

In an alternative embodiment, the step of obtaining the set of image feature vectors includes:

acquiring image data to be processed;

extracting the image data to be processed to obtain image characteristics and structural information;

and filtering the image features according to the structural information to obtain an image feature vector set.

In an optional embodiment, the step of extracting the image data to be processed to obtain image features and structural information includes:

performing feature extraction processing on the image data to be processed to obtain image features;

and carrying out structured information extraction processing on the image data to be processed to obtain structured information.

In an optional embodiment, the step of inputting the image feature to be processed into a preset image clustering model to perform a first clustering process to obtain a first image clustering result and a first class center feature set includes:

judging whether the number of the image features to be processed is larger than a preset number threshold;

if yes, inputting the image features to be processed which are larger than a preset quantity threshold value into a preset image clustering model for processing, and obtaining a first image clustering result and a first type center feature set.

In an optional embodiment, the step of inputting the image features to be processed greater than a preset number threshold into a preset image clustering model to process to obtain a first image clustering result and a first class center feature set includes:

processing the image characteristics to be processed based on a preset image clustering model to obtain a first image clustering result;

and carrying out weighted average calculation on the first image clustering result to obtain a first class center feature set.

In an optional embodiment, the step of inputting the first image clustering result and the first class center feature set into the preset image clustering model to perform second clustering processing to obtain a second image clustering result includes:

judging whether the clustering batch of the first image clustering result is larger than a preset batch or not;

if yes, processing the first image clustering result and the first type center feature set based on the preset image clustering model to obtain a second image clustering result.

In a second aspect, the present application provides an image clustering apparatus, comprising:

the feature acquisition module is used for acquiring the feature of the image to be processed;

the first clustering module is used for inputting the image features to be processed into a preset image clustering model to perform first clustering processing to obtain a first image clustering result and a first type of center feature set;

and the second clustering module is used for inputting the first image clustering result and the first type center feature set into the preset image clustering model to perform second clustering processing to obtain a second image clustering result.

In a third aspect, the present application provides an electronic device comprising: a memory, a processor and a computer program stored on the memory and executable on the processor, the processor implementing the image clustering method of any one of the preceding embodiments when the program is executed.

In a fourth aspect, the present application provides a storage medium, where the storage medium includes a computer program, where the computer program controls an electronic device in which the storage medium is located to execute the image clustering method according to any one of the foregoing embodiments.

According to the image clustering method and device, the electronic equipment and the storage medium, the image to be processed is input into the preset image clustering model to obtain the first image clustering result and the first type center feature set, and the first image clustering result and the first type center feature set are input into the preset image clustering model to obtain the second type result, so that secondary clustering according to the same clustering model is realized, the problem that in the prior art, only images to be clustered are clustered, a large number of images of the same individual are still distributed in a plurality of types is solved, the clustering value is very limited, and the image clustering efficiency is low.

Drawings

In order to more clearly illustrate the technical solutions of the embodiments of the present application, the drawings that are needed in the embodiments will be briefly described below, it being understood that the following drawings only illustrate some embodiments of the present application and therefore should not be considered as limiting the scope, and other related drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.

Fig. 1 shows a block diagram of an electronic device according to an embodiment of the present application.

Fig. 2 shows a flow chart of an image clustering method according to an embodiment of the present application.

Fig. 3 shows another flow diagram of an image clustering method according to an embodiment of the present application.

Fig. 4 shows a block diagram of an image clustering apparatus according to an embodiment of the present application.

Icon: 100-an electronic device; 110-a first memory; 120-a first processor; 130-a communication module; 400-image clustering means; 410-a feature acquisition module; 420-a first clustering module; 430-second aggregation module.

Detailed Description

In recent years, with the increasing maturation and development of artificial intelligence, face clustering is required for faces in a large number of videos or pictures in many industrial intelligent business scenes. The basic principle of face clustering is to archive faces of the same person into a file, and the file can be used as a unique mark of the person. For example, in the intelligent security industry, a large number of faces in a surveillance video stream or a snapshot photo stream need to be clustered, and the clustered unique mark can be applied to the services of querying and displaying the track of a specific person or performing track collision, fusion and the like on other tracks.

In the existing face clustering method, the face pictures to be clustered are subjected to feature extraction through a face recognition model, then are subjected to similarity comparison with a target file, and are clustered through threshold judgment, and the method is generally suitable for small-data-volume scenes, and has the problems of lower performance and lower recall rate in large-data-volume scenes; or batch clustering is carried out on the batch face pictures by using a supervised model or an unsupervised model, so that the method is suitable for batch processing of large data volume, but the real-time requirement is difficult to meet. In an actual application scene, no matter what method is adopted, if only face pictures to be clustered are clustered, higher file splitting and lower recall rate are very easy to generate, namely, a large number of faces of the same person are still distributed in a plurality of categories, so that the clustering result has very limited utilization value; in addition, for the clustering result, only one or a plurality of specific faces with highest quality scores are generally selected at present as the representative class of a certain clustering result, when the data volume is large, the problem of low recall rate or low calculation efficiency often exists, and how to define the representative characteristics of the clustering result is also a difficult problem of the current face clustering. In a word, how to simultaneously ensure or balance the instantaneity and the clustering efficiency of the face clusters, and how to improve the recall rate of the clusters and reduce the splitting situation under the condition of ensuring the clustering accuracy is a problem to be solved in the practical application process of the face clusters.

In order to improve at least one of the above technical problems, an embodiment of the present application provides an image clustering method and apparatus, an electronic device, and a storage medium, and a possible implementation manner is used to describe the technical scheme of the present application.

The present application is directed to a method for manufacturing a semiconductor device, and a semiconductor device manufactured by the method.

For the purpose of making the objects, technical solutions and advantages of the embodiments of the present application more apparent, the technical solutions of the embodiments of the present application will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present application, and it is apparent that the described embodiments are some embodiments of the present application, but not all embodiments of the present application. The components of the embodiments of the present application generally described and illustrated in the figures herein may be arranged and designed in a wide variety of different configurations.

Thus, the following detailed description of the embodiments of the application, as presented in the figures, is not intended to limit the scope of the application, as claimed, but is merely representative of selected embodiments of the application. All other embodiments, which can be made by those skilled in the art based on the embodiments of the application without making any inventive effort, are intended to be within the scope of the application.

It should be noted that the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising one … …" does not exclude the presence of other like elements in a process, method, article, or apparatus that comprises an element.

It should be noted that: like reference numerals and letters denote like items in the following figures, and thus once an item is defined in one figure, no further definition or explanation thereof is necessary in the following figures.

It should be noted that the features of the embodiments of the present application may be combined with each other without conflict.

Fig. 1 is a block diagram of an electronic device 100 according to an embodiment of the present application, where the electronic device 100 in this embodiment may be a server, a processing device, a processing platform, etc. capable of performing data interaction and processing. The electronic device 100 includes a first memory 110, a first processor 120, and a communication module 130. The first memory 110, the first processor 120, and the communication module 130 are electrically connected directly or indirectly to each other to realize data transmission or interaction. For example, the components may be electrically connected to each other via one or more communication buses or signal lines.

Wherein the first memory 110 is used for storing programs or data. The first Memory 110 may be, but is not limited to, a random access Memory (Random Access Memory, RAM), a Read Only Memory (ROM), a programmable Read Only Memory (Programmable Read-Only Memory, PROM), an erasable Read Only Memory (Erasable Programmable Read-Only Memory, EPROM), an electrically erasable Read Only Memory (Electric Erasable Programmable Read-Only Memory, EEPROM), etc.

The first processor 120 is used to read/write data or programs stored in the first memory 110 and perform corresponding functions. The communication module 130 is used for establishing a communication connection between the electronic device 100 and other communication terminals through a network, and for transceiving data through the network.

It should be understood that the structure shown in fig. 1 is merely a schematic diagram of the structure of the electronic device 100, and that the electronic device 100 may also include more or fewer components than shown in fig. 1, or have a different configuration than shown in fig. 1. The components shown in fig. 1 may be implemented in hardware, software, or a combination thereof.

Referring to fig. 2 in combination, a flowchart of an image clustering method according to an embodiment of the present application may be executed by the electronic device 100 of fig. 1, for example, may be executed by the first processor 120 in the electronic device 100. It should be understood that, in other embodiments, the order of some steps in the image clustering method of the present embodiment may be interchanged according to actual needs, or some steps therein may be omitted or deleted. The flow of the image clustering method shown in fig. 2 is described in detail below.

Step S210, obtaining the image characteristics to be processed.

Step S220, inputting the image features to be processed into a preset image clustering model to perform first clustering processing, and obtaining a first image clustering result and a first type center feature set.

And step S230, inputting the first image clustering result and the first type center feature set into a preset image clustering model to perform second clustering processing, and obtaining a second image clustering result.

According to the method, the image to be processed is input into the preset image clustering model to obtain the first image clustering result and the first type center feature set, and the first image clustering result and the first type center feature set are input into the preset image clustering model to obtain the second type result, so that secondary clustering according to the same clustering model is realized, the problem that in the prior art, only pictures to be clustered are clustered, a large number of pictures of the same individual are still distributed in a plurality of categories is solved, the clustering value is very limited, and the image clustering efficiency is low.

For step S210, it should be noted that the specific manner of acquiring the image feature to be processed is not limited, and may be set according to the actual application requirement. For example, in an alternative example, step S210 may include a step of taking, as the image feature to be processed, an image feature vector that does not exceed a preset similarity threshold. Therefore, on the basis of fig. 2, fig. 3 is a schematic flow chart of another image clustering method provided by an embodiment of the present application, referring to fig. 3, step S210 may include:

step S211, an image feature vector set is acquired.

Wherein the set of image feature vectors comprises at least one image feature vector.

And S212, comparing the similarity between the image feature vector set and the historical clustering result to obtain the similarity of each image feature vector.

The historical clustering result is a clustering result obtained by carrying out image clustering in advance.

Step S213, judging whether the similarity of each image feature vector exceeds a preset similarity threshold, and taking the image feature vector which does not exceed the preset similarity threshold as the image feature to be processed.

In step S211, it should be noted that a specific manner of acquiring the image feature vector set is not limited, and may be set according to actual application requirements. For example, in an alternative example, step S211 may include the sub-steps of:

acquiring image data to be processed; extracting the image data to be processed to obtain image characteristics and structural information; and filtering the image features according to the structural information to obtain an image feature vector set.

Alternatively, the specific type of the image data is not limited, and may be set according to actual application requirements. For example, in one alternative example, the image data may be a face image. That is, face images to be clustered may be acquired and noted as a first set of face images to be clustered. The face image to be clustered can be obtained from a video file or a picture file, the video file can be a real-time video or an offline video of a multi-channel front-end camera, the video file comprises a plurality of frames, and each frame can contain one or more faces; the picture file may be a picture, and a picture may contain a plurality of faces or a face. The method comprises the steps of obtaining a first face image set to be clustered by inputting a video file or a picture file into a face detection model, wherein the first face image set to be clustered at least comprises one face image. The face detection model is generally a neural network model, such as an object detection model of MTCNN, YOLO series and the like. In one example, the face image acquisition source is a real-time snapshot image of the front-end camera, and the face detection is performed through the face detection model to acquire the real-time face image.

Alternatively, the specific step of extracting the image data to be processed is not limited, and may be set according to actual application requirements. For example, in one alternative example, the following sub-steps may be included:

carrying out feature extraction processing on the image data to be processed to obtain image features; and carrying out structured information extraction processing on the image data to be processed to obtain structured information.

That is, the face feature vector and the structured information are extracted from the first face image set to be clustered, so that a second face image feature vector set to be clustered and a second face structured information set to be clustered can be obtained.

In detail, the second face image feature vector set to be clustered and the second face structural information set to be clustered are obtained by inputting the first face image set to be clustered into a face analysis model. In one example, the first set of face images to be clustered may include a plurality of sets of face images, e.g., images from different front-end cameras may be different sets of faces, which may be parsed in parallel by the same face parsing model in a plurality of parsing services. And finally obtaining a unique face feature and a piece of structured information for each face image, and finally obtaining a second face image feature vector set to be clustered and a second face structured information set to be clustered for a batch of face data, wherein the second face image feature vector set to be clustered and the second face structured information set to be clustered have a one-to-one correspondence.

Exemplary embodimentsThe ground can be if there is K way front end to take a candid photograph the camera, record respectively as: c (C) ₁ ,C ₂ ,…,C _K In a certain unit time, for example, 10s, the number of face images captured by each path of cameras is N respectively ₁ ,N ₂ ,…,N _K If better real-time performance is required, smaller unit time, such as every 5s, 1s, etc., can be set. For example, M face parsing services P ₁ ,P ₂ ,…,P _M In general M<K, let us assume for C _i Subscript i of (C), C _i Road snapshot face image data at P _i％M The analysis service analyzes the face to obtain a K face feature set and a face structural information set in the unit time, and marks a second face image feature vector set to be clustered as F ₂₀ ＝{p _i :f _i The second face structural information set to be clustered is marked as F ₂₁ ＝{p _i :(t _i1 ,t _i2 ,…t _in ) P is }, where _i Representing a unique mark of a face image, and F ₂₀ And F is equal to ₂₁ P in (b) _i One-to-one correspondence, f _i Represents p _i Corresponding characteristic value, (t) _i1 ,t _i2 ,…t _in ) Represents p _i Is included in the n structured information of (a). It is obvious that the number of the assumed snap faces isHere, theAnd i is an integer.

For the second face structural information set F to be clustered ₂₁ ＝{p _i :(t _i1 ,t _i2 ,…t _in )}，t _in Representing structured information may include corresponding values of face pitch angle, horizontal angle, gender, age, face mass fraction, etc., such as face pitch angle, for example: 0, horizontal angle: 10, sex: 0, age: 30, face mass fraction: 80, etc.

The face analysis model is generally one or more multi-task neural network analysis models, and the multi-task neural network analysis model represents that one model can simultaneously identify some or all structural information and face characteristic information of a face image. For example, if the face analysis model is a multitasking neural network model, the face features can be obtained through analysis by the model, and all structural information values such as face pitching angle, horizontal angle, gender, age, face quality score and the like can be predicted in a regression mode.

Further, through the one-to-one correspondence between the second face image feature vector set to be clustered and the second face structural information set to be clustered, the second face image feature vector to be clustered can be filtered according to the second face structural information set to be clustered, and a third face image feature set to be clustered can be obtained. Based on the face structured information contained in the second face structured information set to be clustered, the faces to be clustered are screened, low-quality faces are removed, and the faces are not involved in face clustering, so that the clustering efficiency and the clustering accuracy are improved.

In detail, for the second face structured information set F to be clustered ₂₁ ＝{p _i :(t _i1 ,t _i2 ,…t _in ) Face structured information t contained in } _in Condition screening is performed. Typically, the screening of face images is performed only by face quality scores, such as face image quality scores, for example<And 40, directly placing the face image into a waste film library to indicate that the face is low in identification degree and unavailable. However, in practical application, the filtering is too single or limited only by the quality score of the face image, so that a plurality of filtering conditions can be fully added, and the unavailable face can be filtered as far as possible. By way of example, faces to be clustered with overlarge pitching angles and overlarge horizontal angles can be removed, the faces are generally low in identification degree, the quality of the corresponding face features is poor, the final clustering effect is greatly and continuously influenced, dirty data can be regarded as being filtered, the faces are not involved in face clustering, and the clustering efficiency and the clustering accuracy are improved. Illustratively, the pitch angle of the face may be set>40 or horizontal angle>60, the face is processedThe image is directly put into the scrap stock. Finally obtaining a second face structured information set to be clustered after filtering, and marking as F measurement ₃₁ ＝{p _i :(t _i1 ,t _i2 ,…t _in ) }. Through F ₃₁ ＝{p _i :(t _i1 ,t _i2 ,…t _in ) Face image unique mark p in } _i Can be related to F ₂₀ ＝{p _i :f _i Obtaining a third face image feature set to be clustered, and marking the feature set as F ₃₀ ＝{p _i :f _i }。

For the step S212 and the step S213, it should be noted that the similarity between the third face image feature vector set to be clustered and the class feature center of the existing clustering result may be compared, if the similarity threshold is met, the face image feature vector set to be clustered is classified directly into the corresponding class, and if the similarity threshold is not met, the face image feature vector set to be clustered is classified into the fourth face image feature vector set (i.e., the feature of the image to be processed).

The existing clustering result is an existing history clustering result, and as an example, the existing clustering result may be: f (F) _h ＝{a _i :[fc _i ,(p _i1 ,p _i2 ,…p _in )](wherein a) _i Unique sign, fc, representing the clustering result _i Representation a _i Class characteristic center of (p) _i1 ,p _i2 ,…p _in ) Representing the clustering result a _i The class feature center of the face image set contained in the clustering result is a representative feature vector of each clustering result.

Feature vector set F of face image to be clustered third ₃₀ ＝{p _i :f _i Face feature vector f in } _i With the existing clustering result F _h ＝{a _i :[fc _i ,(p _i1 ,p _i2 ,…p _in )]All class feature center fc _i And (5) carrying out similarity calculation, wherein the similarity is cosine similarity. Illustratively, as F ₃₀ ＝{p _i :f _i Face and its feature p ₁ :f ₁ Calculating f ₁ And all fcs _i Cosine similarity to fc ₁ The similarity of (2) is the largest and 95%, the similarity threshold is set to 90%,obviously p ₁ If the maximum similarity 95% of the existing clustering result is greater than the set similarity threshold value by 90%, the face is classified into the category corresponding to the maximum similarity, namely p ₁ Falls under the clustering result a ₁ Is a face image set (p _i1 ,p _i2 ,…p _in ) And finishing the clustering of the faces. Here, the real-time classification for each face does not make real-time changes to the class feature center, because in a large number of scenarios, the face clustering efficiency will be greatly affected. In addition, because the clustering precision is preferentially ensured when the face is classified in real time, the similarity threshold value can be set higher, the similarity between the classified face features and the class center features is obviously high, the class center features are not obviously changed before and after the change, the class center features are not changed when the face is classified in real time, and the gear gathering efficiency can be greatly improved. Setting a smaller unit time according to the example in the steps, and repeating the steps to ensure the real-time performance of clustering. If the maximum similarity between a face and the existing clustering result does not meet the set similarity threshold, the maximum similarity is classified into a fourth face image feature vector set to be clustered, and the feature vector set is not limited as F ₄₀ ＝{p _i :f _i And waiting for clustering again.

For step S220, it should be noted that the specific manner of performing the first clustering process is not limited, and may be set according to the actual application requirement. For example, in an alternative example, step S220 may include the sub-steps of:

judging whether the number of the image features to be processed is larger than a preset number threshold value or not;

That is, when the fourth face image feature vector set to be clustered meets the set quantity threshold, the fourth face image feature vector set to be clustered is input into a face clustering model to perform batch face clustering, and a first face clustering result feature center set are obtained.

While a large number of faces are classified, partial unclassified faces are left, and can be classified into F ₄₀ ＝{p _i :f _i In }. When F ₄₀ ＝{p _i :f _i And after the number of faces in the sequence number reaches a preset number threshold, directly clustering the faces in batches. Illustratively, as when F ₄₀ ＝{p _i :f _i When the number of faces reaches 10000, a batch of face clustering is started.

The batch face clustering method is a supervised neural network model or an unsupervised clustering model, wherein the common supervised neural network model for clustering is generally a neural network model based on graph convolution, such as L-GCN, GCN-D, GCN-V and other models, and DBSCAN, KMEANS is common unsupervised model for clustering. Illustratively, the neural network model GCN-V based on graph convolution can be selected for batch face clustering, and in practical application, the face clustering model based on graph convolution generally has better clustering effect than an unsupervised clustering model.

The specific mode of inputting the image features to be processed into the preset image clustering model for processing is not limited, and the image features can be set according to actual application requirements. For example, in one alternative example, the following sub-steps may be included:

processing the image characteristics to be processed based on a preset image clustering model to obtain a first image clustering result; and carrying out weighted average calculation on the first image clustering result to obtain a first class center feature set.

In detail, F ₄₀ ＝{p _i :f _i Inputting the first face clustering result into the face clustering model, and exemplarily, it is assumed that the first face clustering result may be expressed as follows:

F _t ＝{a _i :(p _i1 ,p _i2 ,…p _in )}；

wherein a is _i Unique sign indicating clustering result, (p) _i1 ,p _i2 ,…p _in ) Representing the clustering result a _i A set of facial images contained therein.

For each clustering result, the class feature center needs to be calculated, i.e. for each a _i It is necessary to calculate its class center feature fc _i For each fc _i All are related to a _i Feature vector p of (2) _in Is to select all p _in And performs a weighted average calculation on some representative feature vectors. Illustratively, for a clustered result a ₂ The 9 face images included are (p ₂₁ ,p ₂₂ ,p ₂₃ ,p ₂₄ …p ₂₉ ) And (p) ₂₁ ,p ₂₂ ,p ₂₃ ,p ₂₄ …p ₂₉ ) The corresponding face features are (f ₂₁ ,f ₂₂ ,f ₂₃ ,f ₂₄ ,…,f ₂₉ )。

The selection average calculation method is as follows:

for a ₂ In relation to p _in Is not limited to the feature vector (f) ₂₁ ,f ₂₂ ,f ₂₃ ,f ₂₄ ,…,f ₂₉ ) Generating a similarity matrix, which may be denoted as S _imk Each element in the matrix may be denoted as s _mk ，s _mk Also denoted p _im Feature vector and p of (2) _ik Is cosine similarity, wherein j, k<=9. Illustratively, assume S _imk The specific values of (2) are as follows:

table 1 similarity matrix

	f ₂₁	f ₂₂	f ₂₃	f ₂₄	f ₂₅	f ₂₆	f ₂₇	f ₂₈	f ₂₉
										f ₂₁	1	0.85	0.78	0.92	0.74	0.73	0.73	0.74	0.72
f ₂₂	0.85	1	0.88	0.85	0.72	0.69	0.68	0.72	0.72
										f ₂₃	0.78	0.88	1	0.74	0.71	0.74	0.69	0.73	0.71
f ₂₄	0.92	0.85	0.74	1	0.74	0.74	0.72	0.71	0.74
										f ₂₅	0.74	0.72	0.71	0.74	1	0.86	0.92	0.68	0.74
f ₂₆	0.73	0.69	0.74	0.74	0.86	1	0.88	0.72	0.73
										f ₂₇	0.73	0.68	0.69	0.72	0.92	0.88	1	0.73	0.7
f ₂₈	0.74	0.72	0.73	0.71	0.68	0.72	0.73	1	0.94
										f ₂₉	0.72	0.72	0.71	0.74	0.74	0.73	0.7	0.94	1

For example, assuming that the similarity threshold of the subset partition is 0.75, according to the similarity matrix, as shown in the specific data in table 1, since the similarity matrix is a square of symmetry, only all the elements of the upper triangle or the lower triangle need to be observed or traversed. It can be found that f ₂₁ 、f ₂₂ 、f ₂₃ 、f ₂₄ A subset can be formed because f ₂₁ 、f ₂₂ 、f ₂₃ The similarity between each other is greater than 0.75, although f ₂₃ 、f ₂₄ The similarity between them is less than 0.75, but f ₂₃ At least with f ₂₁ 、f ₂₂ One of the similarities is greater than 0.75, so that f is the final value ₂₁ 、f ₂₂ 、f ₂₃ 、f ₂₄ A subset of the connections may be formed. Similarly, f ₂₅ 、f ₂₆ 、f ₂₇ Form a subset f ₂₈ 、f ₂₉ To form a subset, then finally a ₂ The division into three subsets is possible, and may be set to subset 1, subset 2, and subset 3 in this order.

For the three subsets obtained, it is assumed that the face with the highest face quality score in each subset is p ₂₁ 、p ₂₆ 、p ₂₉ Selecting the corresponding face feature f ₂₁ 、f ₂₆ 、f ₂₉ As representative features of the subsets, the number of faces contained in each subset may be calculated to be 4, 3, and 2. A is calculated by weighted average ₂ Class center fc of (2) ₂ The specific calculation formula is as follows:

it should be noted that, in step S230, the specific manner of performing the second aggregation process is not limited, and may be set according to the actual application requirement. For example, in an alternative example, step S230 may include the sub-steps of:

if yes, processing the first image clustering result and the first type center feature set based on a preset image clustering model to obtain a second image clustering result.

In detail, after clustering of the face images of multiple batches, clustering results of different batches can be clustered secondarily through the class feature center set. The batch setting or time setting may be defined for a plurality of batches of face image clusters, indicating batches or time ranges in which face image clusters are continuously performed. For example, regarding the batch setting, it may be set to 3 times, 5 times, or the like, to indicate that the face image clustering is performed 3 times or 5 times followed by the secondary clustering. Regarding the setting of the time range, for example, the time range can be set to be 6:00-23:00, which means that only face images are clustered in the time range of 6:00-23:00, and secondary clustering is performed in the time range of not 6:00-23:00.

In an alternative example, after every 5 batches of face clustering, a secondary clustering of clustered results is performed, and the existing clustering result F is set _h ＝{a _i :[fc _i ,(p _i1 ,p _i2 ,…p _in )]Using each class a } _i Class center feature fc _i And adopting the same face clustering model to perform batch secondary clustering. The secondary clustering is to cluster the existing face clustering results again, so that the final face clustering is more sufficient, the recall rate of the clustering results is improved, the splitting situation is reduced, and the utilization value of the clustering results is improved.

Illustratively, the result of the secondary clustering is denoted as F _rh ＝{a _ri :[fc _ri ,(a _ri1 ,a _ri2 ,…,a _rin )](wherein a) _ri Unique flag fc representing secondary clustering result _ri Representation class a _ri Class-characteristic center of (a) _ri1 ,a _ri2 ,…,a _rin ) Representation class a _ri From the combination of existing face clustering results, and for fc _ri Can be calculated by adopting a weighted average modeAnd (5) calculating.

Regarding fc _ri Specific calculation mode and fc of (C) _i In a similar manner, since each fc _i The method is characterized by being a representative feature of each face clustering result, and the method does not need to divide and select the representative feature by sub-sets, and can directly perform weighted average. Illustratively, a certain secondary clustering result a _r1 From (a) _r11 ,a _r12 ,…,a _r1n ) Clustering, for each a _r1j The center of class is characterized by fc _r1j Each a _r1j The number of faces is c _r1j Then class center feature fc _ri The specific calculation mode of (a) is as follows:

further, face image clustering and secondary clustering are alternately performed in sequence, and a final face clustering result is obtained. That is, the face images to be clustered are continuously clustered, the clustered results are secondarily clustered according to the set face clustering batch or time interval in step S230, and the two processes are alternately executed to perform full clustering, so that a final clustering result is obtained.

Through the method, the application provides a face clustering method, under a large-scale face clustering scene, firstly, threshold judgment clustering is carried out through similarity comparison to ensure the real-time performance and accuracy of clustering, and model clustering of mass faces which are not classified through similarity is carried out to ensure the efficiency and recall rate of face clustering; the clustering result is clustered more fully through the secondary clustering of the class center features, the recall rate of the clustering result is improved, the splitting condition is reduced, and the practical application value of the face clustering result is greatly improved; the method for constructing the class center features by selecting the weighted average can effectively extract all the representative features in each clustering result through subset division, and generates class center features with representative significance through weighted fusion.

In connection with fig. 4, the embodiment of the present application further provides an image clustering device 400, where the functions implemented by the image clustering device 400 correspond to the steps performed by the above-mentioned method. The image clustering apparatus 400 may be understood as a processor of the electronic device 100, or may be understood as a component that is independent of the electronic device 100 or the processor and that is controlled by the electronic device 100 to implement the functions of the present application. The image clustering apparatus 400 may include a feature acquisition module 410, a first clustering module 420, and a second clustering module 430, among others.

The feature acquisition module 410 is configured to acquire features of an image to be processed. In an embodiment of the present application, the feature acquisition module 410 may be used to perform step S210 shown in fig. 2, and the description of step S210 may be referred to above with respect to the relevant content of the feature acquisition module 410.

The first clustering module 420 is configured to input the image features to be processed into a preset image clustering model to perform a first clustering process, so as to obtain a first image clustering result and a first class center feature set. In an embodiment of the present application, the first clustering module 420 may be used to perform step S220 shown in fig. 2, and the description of step S220 may be referred to above with respect to the relevant content of the first clustering module 420.

The second clustering module 430 is configured to input the first image clustering result and the first class center feature set into a preset image clustering model for performing a second clustering process, so as to obtain a second image clustering result. In an embodiment of the present application, the second aggregation module 430 may be used to perform step S230 shown in fig. 2, and the description of step S230 may be referred to above for the relevant content of the second aggregation module 430.

In addition, the embodiment of the application also provides a computer readable storage medium, and the computer readable storage medium stores a computer program which executes the steps of the image clustering method when being executed by a processor.

The computer program product of the image clustering method provided by the embodiment of the present application includes a computer readable storage medium storing a program code, where instructions included in the program code may be used to execute steps of the image clustering method in the above method embodiment, and specifically, reference may be made to the above method embodiment, which is not described herein.

In summary, the image clustering method and device, the electronic device and the storage medium provided by the embodiment of the application have the advantages that the image to be processed is input into the preset image clustering model to obtain the first image clustering result and the first type center feature set, and the first image clustering result and the first type center feature set are input into the preset image clustering model to obtain the second type result, so that the secondary clustering according to the same clustering model is realized, the problem that in the prior art, only the images to be clustered are clustered, a large number of images of the same individual are still distributed in a plurality of categories is avoided, the clustering value is very limited, and the image clustering efficiency is low.

The above is only a preferred embodiment of the present application, and is not intended to limit the present application, but various modifications and variations can be made to the present application by those skilled in the art. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present application should be included in the protection scope of the present application.

Claims

1. An image clustering method, comprising:

acquiring image characteristics to be processed;

if yes, performing second clustering on the first image clustering result and the first type center feature set based on the preset image clustering model to obtain a second image clustering result, wherein after the first image clustering processing of each preset batch is completed, performing second clustering on all the existing first image clustering results and first type center feature sets once, and the image features to be processed, which are aimed at by the first image clustering processing of each batch, are obtained based on different images;

the step of acquiring the image characteristics to be processed comprises the following steps:

comparing the similarity between the image feature vector set and a historical clustering result to obtain the similarity of each image feature vector, wherein the historical clustering result is a clustering result obtained by carrying out image clustering in advance, the historical clustering result comprises a plurality of class feature centers, and the similarity of the image feature vector is the maximum similarity between the image feature vector and all class feature centers;

2. The image clustering method as claimed in claim 1, wherein said step of acquiring a set of image feature vectors comprises:

acquiring image data to be processed;

3. The image clustering method as claimed in claim 2, wherein the step of extracting the image data to be processed to obtain the image features and the structural information includes:

4. The image clustering method as claimed in claim 1, wherein the step of inputting the image features to be processed into a preset image clustering model to perform a first clustering process to obtain a first image clustering result and a first class center feature set includes:

5. The image clustering method as claimed in claim 4, wherein the step of inputting the image features to be processed greater than a preset number threshold into a preset image clustering model for processing to obtain a first image clustering result and a first type of center feature set includes:

6. An image clustering apparatus, comprising:

the second clustering module is used for judging whether the clustering batch of the first image clustering result is larger than a preset batch or not; if yes, processing the first image clustering result and the first type center feature set based on the preset image clustering model to obtain a second image clustering result, wherein after the first image clustering processing of which the number meets the preset batch is completed, performing second clustering processing on all the existing first image clustering results and first type center feature sets once, and the image features to be processed, which are aimed at by the first image clustering processing of each batch, are obtained based on different images;

the feature acquisition module is specifically configured to:

7. An electronic device, comprising: memory, a processor and a computer program stored on the memory and executable on the processor, which processor implements the image clustering method according to any one of claims 1 to 5 when executing the program.

8. A storage medium comprising a computer program which, when run, controls an electronic device in which the storage medium is located to perform the image clustering method of any one of claims 1 to 5.