Nothing Special   »   [go: up one dir, main page]

CN103473327A - Image retrieval method and image retrieval system - Google Patents

Image retrieval method and image retrieval system Download PDF

Info

Publication number
CN103473327A
CN103473327A CN2013104202879A CN201310420287A CN103473327A CN 103473327 A CN103473327 A CN 103473327A CN 2013104202879 A CN2013104202879 A CN 2013104202879A CN 201310420287 A CN201310420287 A CN 201310420287A CN 103473327 A CN103473327 A CN 103473327A
Authority
CN
China
Prior art keywords
picture
similarity
query
text
list
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2013104202879A
Other languages
Chinese (zh)
Inventor
钟海兰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
GUANGDONG TUTUSOU NETWORK TECHNOLOGY Co Ltd
Original Assignee
GUANGDONG TUTUSOU NETWORK TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by GUANGDONG TUTUSOU NETWORK TECHNOLOGY Co Ltd filed Critical GUANGDONG TUTUSOU NETWORK TECHNOLOGY Co Ltd
Priority to CN2013104202879A priority Critical patent/CN103473327A/en
Publication of CN103473327A publication Critical patent/CN103473327A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses an image retrieval method and an image retrieval system. The method includes for given query texts and/or query images, acquiring multiple similarity ordered lists of in-base images according to text relevance and image content relevance, and then returning a comprehensive ordered list by combining the acquired ordered lists and comprehensively considering the text similarity and the image content similarity. Through the multi-mode mixed retrieval mechanism, shortcomings of conventional single-mode retrieval mechanisms are overcome, respective advantages of a text retrieval method and an image content retrieval method are developed, and accuracy of image retrieval is greatly improved. Since only ordering results of single retrieval models are fused, the single retrieval models can be increased, decreased and replaced conveniently, text and image content feature retrieval models are configured flexibly, and performance of the image retrieval system is improved.

Description

Image retrieval method and system
Technical Field
The invention relates to the technical field of information retrieval, in particular to an image retrieval method and an image retrieval system.
Background
Image retrieval has been a hot topic of research in the multimedia field for the last decade. Image retrieval systems are a class of specialized search engine systems that provide users with retrieval services for relevant graphical image material on the internet based on descriptive text or visual features of the image (i.e., image content). For example, a series of search engines such as google, hundredth, etc. can provide a picture search service.
Conventional image retrieval relies on descriptive text of pictures, typically searching for pictures in a database based on keywords. However, in many cases, text keywords do not accurately describe the visual features of an image (for example, a specific pattern), and therefore, image content-based search techniques have been developed. At present, there are many features describing the image content, such as color features, texture features, shape features, and the like. However, although the image content features can capture the visual similarity of the pictures, the visual similarity does not necessarily represent semantic similarity, i.e. there is a "semantic gap" problem. Therefore, both text-based image retrieval and content-based image retrieval have advantages and disadvantages, and cannot well meet user requirements.
Disclosure of Invention
Based on the above situation, the invention provides an image retrieval method and system to improve the accuracy of image retrieval.
An image retrieval method, comprising the steps of:
receiving a query picture and/or a query text submitted by a user;
extracting various content characteristics of the query picture, and segmenting the query text;
comparing various content characteristics of the query picture with corresponding content characteristics of each picture in a database, and sequencing the pictures in the database according to the similarity to obtain various lists of content similarity; comparing the query text after word segmentation with descriptive documents corresponding to each picture in the database, and sequencing the pictures in the database according to the similarity to obtain a text similarity list;
and assigning a score to each picture in the database according to the position in each list and the weight of the list, reordering according to the assigned scores to obtain a comprehensive similarity ranking list, and returning the list to the user.
An image retrieval system comprising:
the query information receiving terminal is used for receiving a query picture and/or a query text submitted by a user;
the query information processing module is used for extracting various content characteristics of the query picture and segmenting the query text;
the similarity single-item sequencing module is used for comparing various content characteristics of the inquired pictures with corresponding content characteristics of each picture in the database, and sequencing the pictures in the database according to the similarity to obtain various lists of content similarity; comparing the query text after word segmentation with descriptive documents corresponding to each picture in the database, and sequencing the pictures in the database according to the similarity to obtain a text similarity list;
and the comprehensive similarity sorting module is used for assigning scores to each picture in the database according to the positions in the lists and the weight of the list where the picture is located, re-sorting the pictures according to the assigned scores to obtain a comprehensive similarity sorting list, and returning the list to the user.
The image retrieval method and the system of the invention are used for obtaining a plurality of similarity ranking lists of pictures in a library respectively according to the text relevance and the picture content relevance aiming at a given query text and/or query pictures, then combining the obtained plurality of ranking lists, comprehensively considering the text similarity and the picture content similarity, and returning a comprehensive ranking list. The multi-modal mixed retrieval mechanism avoids the defects of the traditional single-modal retrieval mechanism, exerts the advantages of the text retrieval method and the image content retrieval method, and greatly improves the accuracy of image retrieval. As only the sorting results of the single retrieval models are fused, the single retrieval models can be increased, decreased and replaced conveniently, flexible configuration of the text and image content characteristic retrieval models is realized, and the performance of the image retrieval system is improved.
Drawings
FIG. 1 is a flow chart illustrating an image retrieval method according to the present invention;
FIG. 2 is a flow chart of the combined search of image content and text in the image search method of the present invention;
FIG. 3 is a comparison of the search results of the image search method of the present invention and the conventional search method;
FIG. 4 is a schematic diagram of an image retrieval system according to the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention will be described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the detailed description and specific examples, while indicating the scope of the invention, are intended for purposes of illustration only and are not intended to limit the scope of the invention.
The image retrieval method of the present invention, as shown in fig. 1, includes the steps of:
step S101, receiving a query picture and/or a query text submitted by a user;
s102, extracting various content characteristics of the query picture, and segmenting the query text;
s103, comparing various content characteristics of the inquired pictures with corresponding content characteristics of each picture in a database, and sequencing the pictures in the database according to the similarity to obtain various lists of content similarity; comparing the query text after word segmentation with descriptive documents corresponding to each picture in the database, and sequencing the pictures in the database according to the similarity to obtain a text similarity list;
and step S104, assigning a score to each picture in the database according to the position in each list and the weight of the list, reordering the pictures according to the assigned scores to obtain a comprehensive similarity ranking list, and returning the list to the user.
In a traditional retrieval method, retrieval is performed according to text description information submitted by a user, or a feature is extracted from a picture submitted by the user for retrieval, namely, single-mode retrieval. By adopting the retrieval method, the user can perform retrieval only according to the picture or text description information and can perform combined retrieval simultaneously according to the picture and text description information. In the case where the user submits only pictures, as described in step S102, the present retrieval method extracts not only one content feature, but also a plurality of content features, and performs comprehensive ranking. In conclusion, compared with the traditional retrieval method, the retrieval method is a multi-modal mixed retrieval method. Experiments prove that the mixed retrieval mechanism is greatly improved in the aspect of returning result accuracy compared with the conventional single-mode retrieval mechanism. The above steps are described in detail below.
After the user submits the query information, in step S102, content features are extracted from the submitted query image, and the submitted text is segmented. In the embodiment of the present invention, the image content features preferably include color features, texture features and shape features, which are features that are commonly used at present and reflect the contents of the image more typically. The approach taken by the word segmentation is a Hidden Markov Model (HMM). Let the set of states be Q = (Q)1,q2,…qN) I.e., a corpus of tagged parts of speech (e.g., beginning of word, middle of word, end of word); the observation set is V ═ V (V)1,v2,…qM) I.e. a complete set of characters to be participled input by the user; the observed sequence is O = (O)1,o2,…oT) Namely, the input character sequence to be divided; the state sequence is I ═ I (I)1,i2,…iT) I.e. a possible sequence of part-of-speech tags to be divided into character sequences. Firstly, determining a used corpus, and then obtaining three parameters of a hidden Markov model by a statistical method, wherein the three parameters are respectively a state transition probability matrix A ═ aij]N×NObservation probability matrix B = [ B ]j(k)]N×MThe initial state probability vector pi ═ pi (pi)i). Wherein:
a ij = P ( q j | q i ) = P ( q i , q j ) P ( q i ) = count ( q i , q j ) count ( q i )
b j ( k ) = P ( v k | q j ) = P ( v k , q j ) P ( q j ) = count ( v k , q j ) count ( q j )
πi=P(qi)
where count represents frequency, obtained from training data.
When HMM model λ ═ (a, B, pi) is determined, the word segmentation is performed using the viterbi algorithm. Defining all the individual paths (i) with state i at time t1,i2,…it-1,itOf) the maximum value of the probability is δt(i) Setting the t-1 th node of the path with the maximum probability as psit(i) In that respect First, initialize, make delta1(i)=πibi(o1),Ψ1(i) =0, i =1, 2, …, N. Then, recurrently, for T2, …, T, respectively:
δ t ( i ) = max 1 ≤ j ≤ N [ δ t - 1 ( j ) a ji ] b i ( o t ) ,
Ψ t ( i ) = arg max 1 ≤ j ≤ N [ δ t - 1 ( j ) a ji ] ,
i=1,2,…,N
finally, let P*=max1≤i≤NδT(i) And is
Figure BDA0000381827360000045
P*The probability of the optimal path is represented,
Figure BDA0000381827360000046
representing the end point of the optimal path. After finding the end point of the optimal path, backtracking is carried out, and for T-1, T-2 and … 1, the order is given
Figure BDA0000381827360000047
Finding an optimal path
Figure BDA0000381827360000048
The optimal path is the output hidden state sequence, namely the corresponding word segmentation result. The same method is used for segmenting the document information of the pictures in the library, and the document is indexed by using the classical inverted index, so that the efficient retrieval is facilitated.
Next, we extract the content features of the query picture, including color features, texture features, and shape features. First we build a color histogram. When a user submits a query picture Q, preprocessing the submitted picture Q, and then counting a histogram in a color vector space, wherein the color histogram is a one-dimensional discrete function, namely:
Figure BDA0000381827360000051
in the formula, nkThe quantized color feature value is the number of pixels with k, N is the total number of image pixels, and l is the quantized color feature value number, i.e., the dimension of the one-dimensional vector H. Thereby obtaining a color histogram vector H of the query image QQ. Under the offline condition, the color histogram features are extracted and the indexes are built for the pictures in the library in the same way.
Next we extract Scale-Invariant Feature (sift-Invariant Feature), describing the texture features of the picture. When a user submits a query picture I (x, y), the scale space of the query picture is as follows:
L(x,y,σ)=G(x,y,σ)*I(x,y)
g (x, y, σ) is a gaussian function (sigma is a scale parameter). The Difference of gaussians (Difference of Gaussian) of the neighboring scale images is then calculated, i.e.:
D(x,y,σ)=L(x,y,kσ)-L(x,y,σ)
wherein k is typically 21/3
After the Gaussian difference of the adjacent scale images is calculated, a series of images are obtained, and an extreme point is obtained in the image space. And respectively comparing a pixel point in each Gaussian difference image with all adjacent points thereof to see whether the pixel point is larger or smaller than the adjacent points of the image domain and the scale domain. After the extreme points are solved, curve fitting needs to be carried out on the DoG function in the scale space to screen the extreme points, and the points on the low contrast ratio and the edge are removed:
D ( X ‾ ) = D + 1 2 ∂ D T ∂ X X ‾
wherein,
Figure BDA0000381827360000053
it is indicated the offset of the sample points,
Figure BDA0000381827360000054
is the extreme value of X. For each candidate extreme point
Figure BDA0000381827360000055
If the value is less than a threshold (generally 0.03), the candidate extreme point is determined to be an unstable extreme point with low contrast and removed.
In order to obtain stable extreme points, the influence of the edges should be removed, when
Tr ( H ) 2 Det ( H ) < ( r + 1 ) 2 r
And reserving key points, and otherwise, removing the key points. The key points are the feature points we are looking for. Wherein,
H = D xx D xy D xy D yy
is the session matrix, DxxIs to derive twice in the x-direction of an image of a certain scale in DoG space. Tr (H) is the trace of the H matrix, and Det (H) is the determinant of matrix H. α is a large eigenvalue of the H matrix, β is a small eigenvalue of the H matrix, and γ is α/β.
After the positions of the feature points of the image are determined, next, a direction is assigned to the feature points of the image by solving the gradient of the neighborhood of each feature point, and then the gradient amplitude m (x, y) and the gradient direction θ (x, y) are defined as:
m ( x , y ) = ( L ( x + 1 , y ) - L ( x - 1 , y ) ) 2 + ( L ( x , y + 1 ) - L ( x , y - 1 ) ) 2
&theta; ( x , y ) = tan - 1 ( L ( x , y + 1 ) - L ( x , y - 1 ) ) / ( L ( x + 1 , y ) - L ( x - 1 , y ) )
a region is defined by taking the characteristic point as a center, and a direction histogram is formed by utilizing the gradients of all points in the region. And selecting one item with the largest ordinate value from the histogram as the main direction of the feature point. If there are other directions, the magnitude of the ordinate is greater than 80% of the principal direction ordinate, and this direction is also taken as the direction of the feature point.
And after the feature point detection is finished, determining a descriptor of the feature point. Firstly, the neighborhood of the feature point is rotated by theta (adjusted to 0 °) with the feature point as the center, where theta is the direction of the feature point. In the rotated image, a neighborhood window of 16 × 16 is taken with the feature point as the center, and each cell represents one pixel in the neighborhood window of the feature point. Uniformly dividing a 16 × 16 rectangular window into 16 sub-regions, increasing the weight value of a neighborhood close to the feature point and decreasing the weight value of a neighborhood far away from the feature point by adopting a Gaussian blur method, and then calculating gradient histograms of 8 directions in each region to obtain a feature vector of the feature point descriptor, wherein the feature vector is a 4 × 4 × 8= 128-dimensional vector. Next, the feature point descriptor is normalized, and D is the feature point descriptor, that is, D is (D)1,d2,…d128) And obtaining after normalization:
D &OverBar; = D &Sigma; i = 1 128 d i 2 = ( d 1 &OverBar; , d 2 &OverBar; , &CenterDot; &CenterDot; &CenterDot; d 128 &OverBar; ) in order to reduce the influence of the large gradient value, a threshold value of 0.2 is set for the large gradient value, if the value of a certain dimension in the vector is greater than 0.2, the value is set to be 0.2, and normalization processing is carried out again.
In the off-line case, all feature point descriptors of the pictures in the library are obtained by the above steps, and the descriptors are clustered. The obtained clusters are used as visual Words, and a Bag of Words model (Bag of Words) is applied to carry out inverted indexing on the pictures in the library. And then applying the same bag-of-words model to obtain the feature vector expression of the query picture.
Finally we index the global shape features of the picture. After a user submits a query picture, firstly, a Gabor filter is used for sampling and filtering the query picture according to the following formula:
Figure BDA0000381827360000071
wherein,
x r &theta; i = x cos ( &theta; i ) + y sin ( &theta; i ) y r &theta; i = - x sin ( &theta; i ) + y cos ( &theta; i )
l is the size of the filter; k is a normal number; σ is the standard deviation of the Gaussian function; thetai=π(i-1)/θl,i=1,2,...,θllIs the total number of directions in the dimension l. Subjecting the image to Gabor filterAnd (4) convolution is carried out, and the filtered image is obtained as follows:
F &theta; i l = G &theta; i l * I
dividing the filtered picture into 4 x 4 grids, taking the average value in each grid, and finally putting the average values obtained in the grids of all directions and scales in a vector as the shape characteristics of the query picture. In the off-line indexing step, the same calculation is carried out on the pictures in the library to obtain a shape feature index (a k-d tree index and a hash index are established for efficient retrieval) so as to match the picture shape features.
After the text information after word segmentation and the content feature information of the query picture are obtained, in step S103, a related picture is searched for in the joint index of the text and the image according to the obtained information.
Fig. 2 is a schematic flow chart of the search method, and shows a specific implementation method of step S103: we build a search (IR) model based on each single-modality (single-item) feature separately (step S201). Each IR model is independently operated and freely configured, so that different IR models can be selected to be combined according to actual conditions, and finally, a list containing results is returned according to corresponding sorting algorithms. Then, in step S202, the text information after word segmentation and/or the content features extracted from the query picture are input into the corresponding IR model to obtain a plurality of ordered lists, and in step S203, the ordered lists are fused to finally obtain a comprehensive ordered list and returned to the user. The embodiment of the joint search method combines the result output by the text-based retrieval model and the result output by the image-content-based retrieval model to obtain a comprehensive ordered list, wherein the comprehensive ordered list comprises the returned picture results and is arranged according to the descending order of the correlation degree with the query information.
Specifically, in step S201, a text-based IR model is first established. Preferably, the text IR model is created using Statistical Language Modeling (Statistical Language Modeling). Let V denote a dictionary (vocarbulariy) of a certain language, V ═ ω1,ω2,…,ω|v|}, call omegaiIs a term (term), D is a document in the document set C, D ═ D1d2…dn,diE.g. V. In the statistical translation model, when a user submits text query information Q, Q ═ Q1q2…qm,qiE.g. V, the probability that the document D is "translated" into the query information Q is:
P ( Q | D ) = &Pi; i = 1 m &Sigma; &omega; t ( q i | &omega; ) P ( &omega; | D ) - - - ( 1 )
where P (ω | D) is the basic document language model, t (q)iω) is the translation probability. After P (Q | D) is calculated, the ranking of the documents in the document set needs to be returned. At this time, we need to estimate the posterior summaryThe ratio P (D | Q) is, according to the bayesian formula:
P ( D | Q ) = P ( Q | D ) P ( D ) P ( Q ) &Proportional; P ( Q | D ) P ( D ) - - - ( 2 )
where P (D) may take some query-independent measure, this term is not considered in the model. After the posterior probability P (D | Q) is calculated, the documents in the document set may be ranked according to the probability values, and a ranked list may be returned, as in step S202.
When the user submits the query picture, in step S201, a plurality of IR models based on image content features are established. Each IR model corresponds to an image feature, including color features, texture features, shape features, and the like. As previously mentioned, these image content features are all represented in vector space. Therefore, the similarity between feature vectors needs to be measured. The similarity is preferably calculated using euclidean distance. Two n-dimensional vectors (x)11,x12,…x1n) And (x)21,x22,…x2n) The Euclidean distance between them is:
d 12 = &Sigma; k = 1 n ( x 1 k - x 2 k ) 2
larger distances indicate less correlation. After the similarity calculation is finished, the sorted list is returned in descending order of the degree of correlation as by step S202.
It should be noted that several lists are returned for several features extracted from the picture. For example, if two features, color and texture, of an image are extracted separately, two sorted lists are returned. Each list is sorted in descending order according to the degree of correlation with the query picture on the corresponding visual feature.
After obtaining a plurality of picture relevancy ranking lists, respectively giving each picture d on the listjAssigning a score SHLFIRMThe formula is as follows:
S HLFIRM ( d j ) = ( &Sigma; i = 1 N 1 d j &Element; L i ) &times; ( &alpha; i &times; 1 &psi; ( d j , L i , ) )
where ψ (x, H) denotes the position of picture x in list H, 1aIs an indicator function when a is true, i.e. when d is truejBelong to list LiIf so, 1 is taken, otherwise, 0 is taken. Alpha is alphaiIs the weight of the ith IR model, andintuitively, pictures that appear at a front position in the lists will get a higher score, with higher scoring pictures being more relevant to the query information. A score is computationally defined and sorted by the score as in steps S203 and S204.
In the above equation, if one of the IR models has better performance than the other IR models, a higher weight value should be assigned to the model to improve the performance of the whole system. In the embodiment of the invention, the weight alpha is set by adopting an automatic optimization methodi. Alternative automatic optimization methods include genetic algorithms, annealing algorithms, and the like.For example, in a genetic algorithm, we first initialize the weights of each model, and generate an initial population of weight vectors through multiple random initializations. Next, the fitness of each individual in the population is measured. In image retrieval, the adaptability of a weight vector is measured by the performance of a search result generated by the weight vector, namely, a plurality of test queries are given and relevant picture sets corresponding to the queries are obtained, and then the performance of the search result is output as an adaptability index according to a relevant picture set calculation program. There are many indexes for measuring the performance of the search result, such as F1 score, Normalized counted relational Gain, Mean Average Precision, etc. Then, the individuals are selected, crossed and mutated according to the adaptability, and a new generation of population is generated. The new generation of population information is superior to the previous generation. And (3) repeatedly, continuously improving the fitness of the weight vector until the algorithm termination condition is met: within the maximum iteration time limit, obtaining an individual meeting a preset weight vector fitness target value; or the maximum iteration number is reached, and then all the individuals with the highest fitness among all the generated individuals are returned.
The embodiment of the present invention may further include step S105: if the user is satisfied with the output result, the search process is ended; if the user is not satisfied with the output result or deviates from his/her previous idea, the query text and/or query picture can be supplemented or modified based on the previous query information. The method performs word segmentation and feature extraction on the modified text and/or picture, and repeats steps S102 to S105 until a result satisfied by the user is output.
Fig. 3 shows the results of the retrieval mechanism of the present invention compared to a single modality image retrieval mechanism. The upper part of the right half part of the figure shows the result of simple text retrieval, the middle part shows the result of simple image content retrieval, and the lower part is the result returned by the hybrid retrieval mechanism provided by the invention. The returned image results show that the performance of the method is greatly improved compared with the performance of the traditional single-mode retrieval method, and the requirements of users are met to a greater extent.
The image search system according to the present invention is a system corresponding to the above method, and as shown in fig. 4, includes:
the query information receiving terminal is used for receiving a query picture and/or a query text submitted by a user;
the query information processing module is used for extracting various content characteristics of the query picture and segmenting the query text;
the similarity single-item sequencing module is used for comparing various content characteristics of the inquired pictures with corresponding content characteristics of each picture in the database, and sequencing the pictures in the database according to the similarity to obtain various lists of content similarity; comparing the query text after word segmentation with descriptive documents corresponding to each picture in the database, and sequencing the pictures in the database according to the similarity to obtain a text similarity list;
and the comprehensive similarity sorting module is used for assigning scores to each picture in the database according to the positions in the lists and the weight of the list where the picture is located, re-sorting the pictures according to the assigned scores to obtain a comprehensive similarity sorting list, and returning the list to the user.
As a preferred embodiment, the content features of the picture include color features, texture features, and shape features.
As a preferred embodiment, the query information processing module performs word segmentation on the query text by using a hidden markov model.
As a preferred embodiment, the similarity single item ranking module measures the similarity between the query text after word segmentation and the descriptive document corresponding to each picture in the database by adopting a statistical language modeling method; and calculating the similarity between various content characteristics of the query picture and corresponding content characteristics of each picture in the database by adopting the Euclidean distance.
As a preferred embodiment, the comprehensive ranking module of similarity uses a genetic algorithm or an annealing algorithm to set the weight of each list.
In conclusion, the beneficial effects of the invention are as follows:
the invention improves the practicability of image search: first, most of the conventional search methods are based on monomodal search, and the way of expressing the query intention of the user is limited to a certain extent. Second, content-based image retrieval faces the semantic gap problem. The invention changes the status by combining the multi-modal information of text and image content.
The invention improves the flexibility of image search: most of the conventional image searching methods utilize several fixed features for searching, but the invention has the characteristic that feature combinations can be flexibly and freely configured.
The invention improves the accuracy of image search: according to the method, the return results based on the text image retrieval method and the content image retrieval method are combined, so that a more accurate picture relevancy ranking list is obtained, and the accuracy of the return results is greatly improved.
The above-mentioned embodiments only express several embodiments of the present invention, and the description thereof is more specific and detailed, but not construed as limiting the scope of the present invention. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the inventive concept, which falls within the scope of the present invention. Therefore, the protection scope of the present patent shall be subject to the appended claims.

Claims (10)

1. An image retrieval method, comprising the steps of:
receiving a query picture and/or a query text submitted by a user;
extracting various content characteristics of the query picture, and segmenting the query text;
comparing various content characteristics of the query picture with corresponding content characteristics of each picture in a database, and sequencing the pictures in the database according to the similarity to obtain various lists of content similarity; comparing the query text after word segmentation with descriptive documents corresponding to each picture in the database, and sequencing the pictures in the database according to the similarity to obtain a text similarity list;
and assigning a score to each picture in the database according to the position in each list and the weight of the list, reordering according to the assigned scores to obtain a comprehensive similarity ranking list, and returning the list to the user.
2. The image retrieval method according to claim 1,
the content features of the picture include color features, texture features, and shape features.
3. The image retrieval method according to claim 1 or 2,
and segmenting the query text by adopting a hidden Markov model.
4. The image retrieval method according to claim 1 or 2,
measuring the similarity between the query text after word segmentation and the descriptive document corresponding to each picture in the database by adopting a statistical language modeling method;
and calculating the similarity between various content characteristics of the query picture and corresponding content characteristics of each picture in the database by adopting the Euclidean distance.
5. The image retrieval method according to claim 1 or 2,
the weight of each list is set using a genetic algorithm or an annealing algorithm.
6. An image retrieval system, comprising:
the query information receiving terminal is used for receiving a query picture and/or a query text submitted by a user;
the query information processing module is used for extracting various content characteristics of the query picture and segmenting the query text;
the similarity single-item sequencing module is used for comparing various content characteristics of the inquired pictures with corresponding content characteristics of each picture in the database, and sequencing the pictures in the database according to the similarity to obtain various lists of content similarity; comparing the query text after word segmentation with descriptive documents corresponding to each picture in the database, and sequencing the pictures in the database according to the similarity to obtain a text similarity list;
and the comprehensive similarity sorting module is used for assigning scores to each picture in the database according to the positions in the lists and the weight of the list where the picture is located, re-sorting the pictures according to the assigned scores to obtain a comprehensive similarity sorting list, and returning the list to the user.
7. The image retrieval system of claim 6,
the content features of the picture include color features, texture features, and shape features.
8. Image retrieval system according to claim 6 or 7,
and the query information processing module adopts a hidden Markov model to perform word segmentation on the query text.
9. Image retrieval system according to claim 6 or 7,
the similarity single item ordering module measures the similarity between the query text after word segmentation and a descriptive document corresponding to each picture in a database by adopting a statistical language modeling method; and calculating the similarity between various content characteristics of the query picture and corresponding content characteristics of each picture in the database by adopting the Euclidean distance.
10. Image retrieval system according to claim 6 or 7,
the comprehensive similarity sorting module adopts a genetic algorithm or an annealing algorithm to set the weight of each list.
CN2013104202879A 2013-09-13 2013-09-13 Image retrieval method and image retrieval system Pending CN103473327A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2013104202879A CN103473327A (en) 2013-09-13 2013-09-13 Image retrieval method and image retrieval system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2013104202879A CN103473327A (en) 2013-09-13 2013-09-13 Image retrieval method and image retrieval system

Publications (1)

Publication Number Publication Date
CN103473327A true CN103473327A (en) 2013-12-25

Family

ID=49798175

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2013104202879A Pending CN103473327A (en) 2013-09-13 2013-09-13 Image retrieval method and image retrieval system

Country Status (1)

Country Link
CN (1) CN103473327A (en)

Cited By (37)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104765764A (en) * 2015-02-06 2015-07-08 南京理工大学 Indexing method based on large-scale image
CN105518678A (en) * 2015-06-29 2016-04-20 北京旷视科技有限公司 Searching method, searching apparatus, user device and computer program product
CN105989001A (en) * 2015-01-27 2016-10-05 北京大学 Image searching method and device, and image searching system
WO2016201992A1 (en) * 2015-06-17 2016-12-22 杭州海康威视数字技术股份有限公司 Video storage and retrieval method for cloud storage server, and video cloud storage system
CN106407323A (en) * 2016-08-31 2017-02-15 上海交通大学 Picture retrieving method based on diversity spatial distance
CN106484831A (en) * 2016-09-29 2017-03-08 百度在线网络技术(北京)有限公司 Search system, method and apparatus
CN106529444A (en) * 2016-11-03 2017-03-22 广州市万表科技股份有限公司 Method and device for acquiring watch information based on image identification
CN106529606A (en) * 2016-12-01 2017-03-22 中译语通科技(北京)有限公司 Method of improving image recognition accuracy
CN106708943A (en) * 2016-11-22 2017-05-24 安徽睿极智能科技有限公司 Image retrieval reordering method and system based on arrangement fusion
CN106980664A (en) * 2017-03-21 2017-07-25 苏州大学 A kind of bilingual comparable corpora mining method and device
CN107315837A (en) * 2017-07-17 2017-11-03 梧州井儿铺贸易有限公司 The accurate image indexing system of one kind retrieval
CN107358266A (en) * 2017-07-17 2017-11-17 陈剑桃 A kind of mobile terminal with identification function
CN107491456A (en) * 2016-06-13 2017-12-19 阿里巴巴集团控股有限公司 Image ranking method and device
CN107516020A (en) * 2017-08-17 2017-12-26 中国科学院深圳先进技术研究院 Determination method, apparatus, equipment and the storage medium of sequence site importance
CN107590291A (en) * 2017-10-30 2018-01-16 广东欧珀移动通信有限公司 A kind of searching method of picture, terminal device and storage medium
CN107679208A (en) * 2017-10-16 2018-02-09 广东欧珀移动通信有限公司 A kind of searching method of picture, terminal device and storage medium
CN107741972A (en) * 2017-10-12 2018-02-27 广东欧珀移动通信有限公司 A kind of searching method of picture, terminal device and storage medium
CN107748779A (en) * 2017-10-20 2018-03-02 百度在线网络技术(北京)有限公司 information generating method and device
CN107766853A (en) * 2016-08-16 2018-03-06 阿里巴巴集团控股有限公司 A kind of generation, display methods and the electronic equipment of the text message of image
CN108280209A (en) * 2018-01-31 2018-07-13 湖北工业大学 A kind of image search method and system based on fireworks algorithm
CN108287833A (en) * 2017-01-09 2018-07-17 北京艺鉴通科技有限公司 It is a kind of for the art work identification to scheme to search drawing method
WO2018213943A1 (en) * 2017-05-24 2018-11-29 上海冠勇信息科技有限公司 Image retrieval method based on feature extraction
CN110069654A (en) * 2019-03-15 2019-07-30 平安城市建设科技(深圳)有限公司 Source of houses searching method, device, equipment and computer readable storage medium
CN110070512A (en) * 2019-04-30 2019-07-30 秒针信息技术有限公司 The method and device of picture modification
CN110737794A (en) * 2019-10-16 2020-01-31 北京锐安科技有限公司 Image query method, system, server and storage medium
CN110868543A (en) * 2019-11-25 2020-03-06 三星电子(中国)研发中心 Intelligent photographing method and device and computer readable storage medium
CN111753118A (en) * 2019-03-29 2020-10-09 杭州海康威视数字技术股份有限公司 Vehicle retrieval method and device, electronic equipment and readable storage medium
WO2021087770A1 (en) * 2019-11-05 2021-05-14 深圳市欢太科技有限公司 Picture classification method and apparatus, and storage medium and electronic device
CN112819052A (en) * 2021-01-25 2021-05-18 哈尔滨工业大学(深圳)(哈尔滨工业大学深圳科技创新研究院) Multi-modal fine-grained mixing method, system, device and storage medium
CN112861944A (en) * 2021-01-28 2021-05-28 中山大学 Image retrieval method and device based on mixed modal input
CN113094464A (en) * 2021-03-31 2021-07-09 哈尔滨理工大学 Establishment and auxiliary identification method of extensible crop disease analysis library
CN113139121A (en) * 2020-01-20 2021-07-20 阿里巴巴集团控股有限公司 Query method, model training method, device, equipment and storage medium
CN113590854A (en) * 2021-09-29 2021-11-02 腾讯科技(深圳)有限公司 Data processing method, data processing equipment and computer readable storage medium
WO2021232464A1 (en) * 2020-05-20 2021-11-25 南京理工大学 Character offset detection method and system
CN113920335A (en) * 2021-09-28 2022-01-11 苏州冷王网络科技有限公司 Deep learning-based image and text embedded drug label identification method
CN114238588A (en) * 2022-02-24 2022-03-25 江西医之健科技有限公司 Data retrieval method, system, readable storage medium and computer equipment
CN116301655A (en) * 2023-04-12 2023-06-23 深圳市英唐数码科技有限公司 Method, system and readable storage medium for loading historical note pictures

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1920818A (en) * 2006-09-14 2007-02-28 浙江大学 Transmedia search method based on multi-mode information convergence analysis
CN101419606A (en) * 2008-11-13 2009-04-29 浙江大学 Semi-automatic image labeling method based on semantic and content
CN101901249A (en) * 2009-05-26 2010-12-01 复旦大学 Text-based query expansion and sort method in image retrieval
CN101984420A (en) * 2010-09-03 2011-03-09 百度在线网络技术(北京)有限公司 Method and equipment for searching pictures based on word segmentation processing
CN102129477A (en) * 2011-04-23 2011-07-20 山东大学 Multimode-combined image reordering method
CN102262642A (en) * 2011-01-28 2011-11-30 北京理工大学 Web image search engine and realizing method thereof
CN102402593A (en) * 2010-11-05 2012-04-04 微软公司 Multi-modal approach to search query input

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1920818A (en) * 2006-09-14 2007-02-28 浙江大学 Transmedia search method based on multi-mode information convergence analysis
CN101419606A (en) * 2008-11-13 2009-04-29 浙江大学 Semi-automatic image labeling method based on semantic and content
CN101901249A (en) * 2009-05-26 2010-12-01 复旦大学 Text-based query expansion and sort method in image retrieval
CN101984420A (en) * 2010-09-03 2011-03-09 百度在线网络技术(北京)有限公司 Method and equipment for searching pictures based on word segmentation processing
CN102402593A (en) * 2010-11-05 2012-04-04 微软公司 Multi-modal approach to search query input
CN102262642A (en) * 2011-01-28 2011-11-30 北京理工大学 Web image search engine and realizing method thereof
CN102129477A (en) * 2011-04-23 2011-07-20 山东大学 Multimode-combined image reordering method

Cited By (49)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105989001B (en) * 2015-01-27 2019-09-06 北京大学 Image search method and device, image search system
CN105989001A (en) * 2015-01-27 2016-10-05 北京大学 Image searching method and device, and image searching system
CN104765764A (en) * 2015-02-06 2015-07-08 南京理工大学 Indexing method based on large-scale image
WO2016201992A1 (en) * 2015-06-17 2016-12-22 杭州海康威视数字技术股份有限公司 Video storage and retrieval method for cloud storage server, and video cloud storage system
CN105518678A (en) * 2015-06-29 2016-04-20 北京旷视科技有限公司 Searching method, searching apparatus, user device and computer program product
WO2017000109A1 (en) * 2015-06-29 2017-01-05 北京旷视科技有限公司 Search method, search apparatus, user equipment, and computer program product
CN107491456A (en) * 2016-06-13 2017-12-19 阿里巴巴集团控股有限公司 Image ranking method and device
CN107766853A (en) * 2016-08-16 2018-03-06 阿里巴巴集团控股有限公司 A kind of generation, display methods and the electronic equipment of the text message of image
CN107766853B (en) * 2016-08-16 2021-08-06 阿里巴巴集团控股有限公司 Image text information generation and display method and electronic equipment
CN106407323A (en) * 2016-08-31 2017-02-15 上海交通大学 Picture retrieving method based on diversity spatial distance
CN106484831A (en) * 2016-09-29 2017-03-08 百度在线网络技术(北京)有限公司 Search system, method and apparatus
CN106529444A (en) * 2016-11-03 2017-03-22 广州市万表科技股份有限公司 Method and device for acquiring watch information based on image identification
CN106708943A (en) * 2016-11-22 2017-05-24 安徽睿极智能科技有限公司 Image retrieval reordering method and system based on arrangement fusion
CN106529606A (en) * 2016-12-01 2017-03-22 中译语通科技(北京)有限公司 Method of improving image recognition accuracy
CN108287833A (en) * 2017-01-09 2018-07-17 北京艺鉴通科技有限公司 It is a kind of for the art work identification to scheme to search drawing method
CN106980664B (en) * 2017-03-21 2020-11-10 苏州大学 Bilingual comparable corpus mining method and device
CN106980664A (en) * 2017-03-21 2017-07-25 苏州大学 A kind of bilingual comparable corpora mining method and device
US11157767B2 (en) 2017-05-24 2021-10-26 Shanghai Firstbrave Information Technology Co., Ltd. Image searching method based on feature extraction
WO2018213943A1 (en) * 2017-05-24 2018-11-29 上海冠勇信息科技有限公司 Image retrieval method based on feature extraction
CN107358266A (en) * 2017-07-17 2017-11-17 陈剑桃 A kind of mobile terminal with identification function
CN107358266B (en) * 2017-07-17 2020-11-10 浙江信电技术股份有限公司 Mobile terminal with recognition function
CN107315837A (en) * 2017-07-17 2017-11-03 梧州井儿铺贸易有限公司 The accurate image indexing system of one kind retrieval
CN107516020B (en) * 2017-08-17 2021-05-14 中国科学院深圳先进技术研究院 Method, device, equipment and storage medium for determining importance of sequence sites
CN107516020A (en) * 2017-08-17 2017-12-26 中国科学院深圳先进技术研究院 Determination method, apparatus, equipment and the storage medium of sequence site importance
CN107741972A (en) * 2017-10-12 2018-02-27 广东欧珀移动通信有限公司 A kind of searching method of picture, terminal device and storage medium
CN107679208A (en) * 2017-10-16 2018-02-09 广东欧珀移动通信有限公司 A kind of searching method of picture, terminal device and storage medium
CN107748779A (en) * 2017-10-20 2018-03-02 百度在线网络技术(北京)有限公司 information generating method and device
CN107590291A (en) * 2017-10-30 2018-01-16 广东欧珀移动通信有限公司 A kind of searching method of picture, terminal device and storage medium
CN108280209B (en) * 2018-01-31 2020-07-07 湖北工业大学 Image retrieval method and system based on firework algorithm
CN108280209A (en) * 2018-01-31 2018-07-13 湖北工业大学 A kind of image search method and system based on fireworks algorithm
CN110069654A (en) * 2019-03-15 2019-07-30 平安城市建设科技(深圳)有限公司 Source of houses searching method, device, equipment and computer readable storage medium
CN111753118A (en) * 2019-03-29 2020-10-09 杭州海康威视数字技术股份有限公司 Vehicle retrieval method and device, electronic equipment and readable storage medium
CN110070512A (en) * 2019-04-30 2019-07-30 秒针信息技术有限公司 The method and device of picture modification
CN110737794A (en) * 2019-10-16 2020-01-31 北京锐安科技有限公司 Image query method, system, server and storage medium
WO2021087770A1 (en) * 2019-11-05 2021-05-14 深圳市欢太科技有限公司 Picture classification method and apparatus, and storage medium and electronic device
CN110868543A (en) * 2019-11-25 2020-03-06 三星电子(中国)研发中心 Intelligent photographing method and device and computer readable storage medium
CN113139121A (en) * 2020-01-20 2021-07-20 阿里巴巴集团控股有限公司 Query method, model training method, device, equipment and storage medium
WO2021232464A1 (en) * 2020-05-20 2021-11-25 南京理工大学 Character offset detection method and system
CN112819052A (en) * 2021-01-25 2021-05-18 哈尔滨工业大学(深圳)(哈尔滨工业大学深圳科技创新研究院) Multi-modal fine-grained mixing method, system, device and storage medium
CN112861944A (en) * 2021-01-28 2021-05-28 中山大学 Image retrieval method and device based on mixed modal input
CN112861944B (en) * 2021-01-28 2022-09-23 中山大学 Image retrieval method and device based on mixed modal input
CN113094464A (en) * 2021-03-31 2021-07-09 哈尔滨理工大学 Establishment and auxiliary identification method of extensible crop disease analysis library
CN113094464B (en) * 2021-03-31 2024-05-28 哈尔滨理工大学 Method for establishing and assisting in identifying expandable crop disease analysis library
CN113920335A (en) * 2021-09-28 2022-01-11 苏州冷王网络科技有限公司 Deep learning-based image and text embedded drug label identification method
CN113590854A (en) * 2021-09-29 2021-11-02 腾讯科技(深圳)有限公司 Data processing method, data processing equipment and computer readable storage medium
CN113590854B (en) * 2021-09-29 2021-12-31 腾讯科技(深圳)有限公司 Data processing method, data processing equipment and computer readable storage medium
CN114238588A (en) * 2022-02-24 2022-03-25 江西医之健科技有限公司 Data retrieval method, system, readable storage medium and computer equipment
CN116301655A (en) * 2023-04-12 2023-06-23 深圳市英唐数码科技有限公司 Method, system and readable storage medium for loading historical note pictures
CN116301655B (en) * 2023-04-12 2023-11-07 深圳市英唐数码科技有限公司 Method, system and readable storage medium for loading historical note pictures

Similar Documents

Publication Publication Date Title
CN103473327A (en) Image retrieval method and image retrieval system
Goëau et al. Lifeclef plant identification task 2014
Li et al. Contextual bag-of-words for visual categorization
Jang et al. Car-Rec: A real time car recognition system
CN106202256A (en) Propagate based on semanteme and mix the Web graph of multi-instance learning as search method
CN106095829A (en) Cross-media retrieval method based on degree of depth study with the study of concordance expression of space
CN110929498B (en) Method and device for calculating similarity of short text and readable storage medium
Arrish et al. Shape-based plagiarism detection for flowchart figures in texts
CN114461890A (en) Hierarchical multi-modal intellectual property search engine method and system
Dharani et al. Content based image retrieval system using feature classification with modified KNN algorithm
CN112100212A (en) Case scenario extraction method based on machine learning and rule matching
Wang et al. Beauty product image retrieval based on multi-feature fusion and feature aggregation
CN115982144A (en) Similar text duplicate removal method and device, storage medium and electronic device
Karamti et al. Content-based image retrieval system using neural network
CN107423294A (en) A kind of community image search method and system
JP6017277B2 (en) Program, apparatus and method for calculating similarity between contents represented by set of feature vectors
CN103049570B (en) Based on the image/video search ordering method of relevant Preserving map and a sorter
CN115827990B (en) Searching method and device
JP5833499B2 (en) Retrieval device and program for retrieving content expressed by high-dimensional feature vector set with high accuracy
Huang et al. Multi-query image retrieval using CNN and SIFT features
Tian et al. Research on image classification based on a combination of text and visual features
CN104715071B (en) A kind of specific pedestrian retrieval method described based on imperfect text
Thollard et al. Content-based re-ranking of text-based image search results
Kumar et al. Automatic feature weight determination using indexing and pseudo-relevance feedback for multi-feature content-based image retrieval
CN107092875B (en) Novel scene recognition method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
AD01 Patent right deemed abandoned

Effective date of abandoning: 20180123

AD01 Patent right deemed abandoned