CN103473327A - Image retrieval method and image retrieval system - Google Patents
Image retrieval method and image retrieval system Download PDFInfo
- Publication number
- CN103473327A CN103473327A CN2013104202879A CN201310420287A CN103473327A CN 103473327 A CN103473327 A CN 103473327A CN 2013104202879 A CN2013104202879 A CN 2013104202879A CN 201310420287 A CN201310420287 A CN 201310420287A CN 103473327 A CN103473327 A CN 103473327A
- Authority
- CN
- China
- Prior art keywords
- picture
- similarity
- query
- text
- list
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 49
- 230000011218 segmentation Effects 0.000 claims description 17
- 238000012163 sequencing technique Methods 0.000 claims description 15
- 238000004422 calculation algorithm Methods 0.000 claims description 12
- 230000002068 genetic effect Effects 0.000 claims description 5
- 230000010365 information processing Effects 0.000 claims description 5
- 238000000137 annealing Methods 0.000 claims description 4
- 230000007246 mechanism Effects 0.000 abstract description 9
- 230000003247 decreasing effect Effects 0.000 abstract description 3
- 239000013598 vector Substances 0.000 description 18
- 239000011159 matrix material Substances 0.000 description 7
- 230000000007 visual effect Effects 0.000 description 6
- 230000006870 function Effects 0.000 description 5
- 239000000284 extract Substances 0.000 description 3
- 238000004364 calculation method Methods 0.000 description 2
- 238000010606 normalization Methods 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 238000013459 approach Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000007781 pre-processing Methods 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 238000007619 statistical method Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 238000012549 training Methods 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
Images
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention discloses an image retrieval method and an image retrieval system. The method includes for given query texts and/or query images, acquiring multiple similarity ordered lists of in-base images according to text relevance and image content relevance, and then returning a comprehensive ordered list by combining the acquired ordered lists and comprehensively considering the text similarity and the image content similarity. Through the multi-mode mixed retrieval mechanism, shortcomings of conventional single-mode retrieval mechanisms are overcome, respective advantages of a text retrieval method and an image content retrieval method are developed, and accuracy of image retrieval is greatly improved. Since only ordering results of single retrieval models are fused, the single retrieval models can be increased, decreased and replaced conveniently, text and image content feature retrieval models are configured flexibly, and performance of the image retrieval system is improved.
Description
Technical Field
The invention relates to the technical field of information retrieval, in particular to an image retrieval method and an image retrieval system.
Background
Image retrieval has been a hot topic of research in the multimedia field for the last decade. Image retrieval systems are a class of specialized search engine systems that provide users with retrieval services for relevant graphical image material on the internet based on descriptive text or visual features of the image (i.e., image content). For example, a series of search engines such as google, hundredth, etc. can provide a picture search service.
Conventional image retrieval relies on descriptive text of pictures, typically searching for pictures in a database based on keywords. However, in many cases, text keywords do not accurately describe the visual features of an image (for example, a specific pattern), and therefore, image content-based search techniques have been developed. At present, there are many features describing the image content, such as color features, texture features, shape features, and the like. However, although the image content features can capture the visual similarity of the pictures, the visual similarity does not necessarily represent semantic similarity, i.e. there is a "semantic gap" problem. Therefore, both text-based image retrieval and content-based image retrieval have advantages and disadvantages, and cannot well meet user requirements.
Disclosure of Invention
Based on the above situation, the invention provides an image retrieval method and system to improve the accuracy of image retrieval.
An image retrieval method, comprising the steps of:
receiving a query picture and/or a query text submitted by a user;
extracting various content characteristics of the query picture, and segmenting the query text;
comparing various content characteristics of the query picture with corresponding content characteristics of each picture in a database, and sequencing the pictures in the database according to the similarity to obtain various lists of content similarity; comparing the query text after word segmentation with descriptive documents corresponding to each picture in the database, and sequencing the pictures in the database according to the similarity to obtain a text similarity list;
and assigning a score to each picture in the database according to the position in each list and the weight of the list, reordering according to the assigned scores to obtain a comprehensive similarity ranking list, and returning the list to the user.
An image retrieval system comprising:
the query information receiving terminal is used for receiving a query picture and/or a query text submitted by a user;
the query information processing module is used for extracting various content characteristics of the query picture and segmenting the query text;
the similarity single-item sequencing module is used for comparing various content characteristics of the inquired pictures with corresponding content characteristics of each picture in the database, and sequencing the pictures in the database according to the similarity to obtain various lists of content similarity; comparing the query text after word segmentation with descriptive documents corresponding to each picture in the database, and sequencing the pictures in the database according to the similarity to obtain a text similarity list;
and the comprehensive similarity sorting module is used for assigning scores to each picture in the database according to the positions in the lists and the weight of the list where the picture is located, re-sorting the pictures according to the assigned scores to obtain a comprehensive similarity sorting list, and returning the list to the user.
The image retrieval method and the system of the invention are used for obtaining a plurality of similarity ranking lists of pictures in a library respectively according to the text relevance and the picture content relevance aiming at a given query text and/or query pictures, then combining the obtained plurality of ranking lists, comprehensively considering the text similarity and the picture content similarity, and returning a comprehensive ranking list. The multi-modal mixed retrieval mechanism avoids the defects of the traditional single-modal retrieval mechanism, exerts the advantages of the text retrieval method and the image content retrieval method, and greatly improves the accuracy of image retrieval. As only the sorting results of the single retrieval models are fused, the single retrieval models can be increased, decreased and replaced conveniently, flexible configuration of the text and image content characteristic retrieval models is realized, and the performance of the image retrieval system is improved.
Drawings
FIG. 1 is a flow chart illustrating an image retrieval method according to the present invention;
FIG. 2 is a flow chart of the combined search of image content and text in the image search method of the present invention;
FIG. 3 is a comparison of the search results of the image search method of the present invention and the conventional search method;
FIG. 4 is a schematic diagram of an image retrieval system according to the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention will be described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the detailed description and specific examples, while indicating the scope of the invention, are intended for purposes of illustration only and are not intended to limit the scope of the invention.
The image retrieval method of the present invention, as shown in fig. 1, includes the steps of:
step S101, receiving a query picture and/or a query text submitted by a user;
s102, extracting various content characteristics of the query picture, and segmenting the query text;
s103, comparing various content characteristics of the inquired pictures with corresponding content characteristics of each picture in a database, and sequencing the pictures in the database according to the similarity to obtain various lists of content similarity; comparing the query text after word segmentation with descriptive documents corresponding to each picture in the database, and sequencing the pictures in the database according to the similarity to obtain a text similarity list;
and step S104, assigning a score to each picture in the database according to the position in each list and the weight of the list, reordering the pictures according to the assigned scores to obtain a comprehensive similarity ranking list, and returning the list to the user.
In a traditional retrieval method, retrieval is performed according to text description information submitted by a user, or a feature is extracted from a picture submitted by the user for retrieval, namely, single-mode retrieval. By adopting the retrieval method, the user can perform retrieval only according to the picture or text description information and can perform combined retrieval simultaneously according to the picture and text description information. In the case where the user submits only pictures, as described in step S102, the present retrieval method extracts not only one content feature, but also a plurality of content features, and performs comprehensive ranking. In conclusion, compared with the traditional retrieval method, the retrieval method is a multi-modal mixed retrieval method. Experiments prove that the mixed retrieval mechanism is greatly improved in the aspect of returning result accuracy compared with the conventional single-mode retrieval mechanism. The above steps are described in detail below.
After the user submits the query information, in step S102, content features are extracted from the submitted query image, and the submitted text is segmented. In the embodiment of the present invention, the image content features preferably include color features, texture features and shape features, which are features that are commonly used at present and reflect the contents of the image more typically. The approach taken by the word segmentation is a Hidden Markov Model (HMM). Let the set of states be Q = (Q)1,q2,…qN) I.e., a corpus of tagged parts of speech (e.g., beginning of word, middle of word, end of word); the observation set is V ═ V (V)1,v2,…qM) I.e. a complete set of characters to be participled input by the user; the observed sequence is O = (O)1,o2,…oT) Namely, the input character sequence to be divided; the state sequence is I ═ I (I)1,i2,…iT) I.e. a possible sequence of part-of-speech tags to be divided into character sequences. Firstly, determining a used corpus, and then obtaining three parameters of a hidden Markov model by a statistical method, wherein the three parameters are respectively a state transition probability matrix A ═ aij]N×NObservation probability matrix B = [ B ]j(k)]N×MThe initial state probability vector pi ═ pi (pi)i). Wherein:
πi=P(qi)
where count represents frequency, obtained from training data.
When HMM model λ ═ (a, B, pi) is determined, the word segmentation is performed using the viterbi algorithm. Defining all the individual paths (i) with state i at time t1,i2,…it-1,itOf) the maximum value of the probability is δt(i) Setting the t-1 th node of the path with the maximum probability as psit(i) In that respect First, initialize, make delta1(i)=πibi(o1),Ψ1(i) =0, i =1, 2, …, N. Then, recurrently, for T2, …, T, respectively:
i=1,2,…,N
finally, let P*=max1≤i≤NδT(i) And isP*The probability of the optimal path is represented,representing the end point of the optimal path. After finding the end point of the optimal path, backtracking is carried out, and for T-1, T-2 and … 1, the order is givenFinding an optimal pathThe optimal path is the output hidden state sequence, namely the corresponding word segmentation result. The same method is used for segmenting the document information of the pictures in the library, and the document is indexed by using the classical inverted index, so that the efficient retrieval is facilitated.
Next, we extract the content features of the query picture, including color features, texture features, and shape features. First we build a color histogram. When a user submits a query picture Q, preprocessing the submitted picture Q, and then counting a histogram in a color vector space, wherein the color histogram is a one-dimensional discrete function, namely:
in the formula, nkThe quantized color feature value is the number of pixels with k, N is the total number of image pixels, and l is the quantized color feature value number, i.e., the dimension of the one-dimensional vector H. Thereby obtaining a color histogram vector H of the query image QQ. Under the offline condition, the color histogram features are extracted and the indexes are built for the pictures in the library in the same way.
Next we extract Scale-Invariant Feature (sift-Invariant Feature), describing the texture features of the picture. When a user submits a query picture I (x, y), the scale space of the query picture is as follows:
L(x,y,σ)=G(x,y,σ)*I(x,y)
g (x, y, σ) is a gaussian function (sigma is a scale parameter). The Difference of gaussians (Difference of Gaussian) of the neighboring scale images is then calculated, i.e.:
D(x,y,σ)=L(x,y,kσ)-L(x,y,σ)
wherein k is typically 21/3。
After the Gaussian difference of the adjacent scale images is calculated, a series of images are obtained, and an extreme point is obtained in the image space. And respectively comparing a pixel point in each Gaussian difference image with all adjacent points thereof to see whether the pixel point is larger or smaller than the adjacent points of the image domain and the scale domain. After the extreme points are solved, curve fitting needs to be carried out on the DoG function in the scale space to screen the extreme points, and the points on the low contrast ratio and the edge are removed:
wherein,
it is indicated the offset of the sample points,is the extreme value of X. For each candidate extreme pointIf the value is less than a threshold (generally 0.03), the candidate extreme point is determined to be an unstable extreme point with low contrast and removed.
In order to obtain stable extreme points, the influence of the edges should be removed, when
And reserving key points, and otherwise, removing the key points. The key points are the feature points we are looking for. Wherein,
is the session matrix, DxxIs to derive twice in the x-direction of an image of a certain scale in DoG space. Tr (H) is the trace of the H matrix, and Det (H) is the determinant of matrix H. α is a large eigenvalue of the H matrix, β is a small eigenvalue of the H matrix, and γ is α/β.
After the positions of the feature points of the image are determined, next, a direction is assigned to the feature points of the image by solving the gradient of the neighborhood of each feature point, and then the gradient amplitude m (x, y) and the gradient direction θ (x, y) are defined as:
a region is defined by taking the characteristic point as a center, and a direction histogram is formed by utilizing the gradients of all points in the region. And selecting one item with the largest ordinate value from the histogram as the main direction of the feature point. If there are other directions, the magnitude of the ordinate is greater than 80% of the principal direction ordinate, and this direction is also taken as the direction of the feature point.
And after the feature point detection is finished, determining a descriptor of the feature point. Firstly, the neighborhood of the feature point is rotated by theta (adjusted to 0 °) with the feature point as the center, where theta is the direction of the feature point. In the rotated image, a neighborhood window of 16 × 16 is taken with the feature point as the center, and each cell represents one pixel in the neighborhood window of the feature point. Uniformly dividing a 16 × 16 rectangular window into 16 sub-regions, increasing the weight value of a neighborhood close to the feature point and decreasing the weight value of a neighborhood far away from the feature point by adopting a Gaussian blur method, and then calculating gradient histograms of 8 directions in each region to obtain a feature vector of the feature point descriptor, wherein the feature vector is a 4 × 4 × 8= 128-dimensional vector. Next, the feature point descriptor is normalized, and D is the feature point descriptor, that is, D is (D)1,d2,…d128) And obtaining after normalization:
In the off-line case, all feature point descriptors of the pictures in the library are obtained by the above steps, and the descriptors are clustered. The obtained clusters are used as visual Words, and a Bag of Words model (Bag of Words) is applied to carry out inverted indexing on the pictures in the library. And then applying the same bag-of-words model to obtain the feature vector expression of the query picture.
Finally we index the global shape features of the picture. After a user submits a query picture, firstly, a Gabor filter is used for sampling and filtering the query picture according to the following formula:
wherein,
l is the size of the filter; k is a normal number; σ is the standard deviation of the Gaussian function; thetai=π(i-1)/θl,i=1,2,...,θl,θlIs the total number of directions in the dimension l. Subjecting the image to Gabor filterAnd (4) convolution is carried out, and the filtered image is obtained as follows:
dividing the filtered picture into 4 x 4 grids, taking the average value in each grid, and finally putting the average values obtained in the grids of all directions and scales in a vector as the shape characteristics of the query picture. In the off-line indexing step, the same calculation is carried out on the pictures in the library to obtain a shape feature index (a k-d tree index and a hash index are established for efficient retrieval) so as to match the picture shape features.
After the text information after word segmentation and the content feature information of the query picture are obtained, in step S103, a related picture is searched for in the joint index of the text and the image according to the obtained information.
Fig. 2 is a schematic flow chart of the search method, and shows a specific implementation method of step S103: we build a search (IR) model based on each single-modality (single-item) feature separately (step S201). Each IR model is independently operated and freely configured, so that different IR models can be selected to be combined according to actual conditions, and finally, a list containing results is returned according to corresponding sorting algorithms. Then, in step S202, the text information after word segmentation and/or the content features extracted from the query picture are input into the corresponding IR model to obtain a plurality of ordered lists, and in step S203, the ordered lists are fused to finally obtain a comprehensive ordered list and returned to the user. The embodiment of the joint search method combines the result output by the text-based retrieval model and the result output by the image-content-based retrieval model to obtain a comprehensive ordered list, wherein the comprehensive ordered list comprises the returned picture results and is arranged according to the descending order of the correlation degree with the query information.
Specifically, in step S201, a text-based IR model is first established. Preferably, the text IR model is created using Statistical Language Modeling (Statistical Language Modeling). Let V denote a dictionary (vocarbulariy) of a certain language, V ═ ω1,ω2,…,ω|v|}, call omegaiIs a term (term), D is a document in the document set C, D ═ D1d2…dn,diE.g. V. In the statistical translation model, when a user submits text query information Q, Q ═ Q1q2…qm,qiE.g. V, the probability that the document D is "translated" into the query information Q is:
where P (ω | D) is the basic document language model, t (q)iω) is the translation probability. After P (Q | D) is calculated, the ranking of the documents in the document set needs to be returned. At this time, we need to estimate the posterior summaryThe ratio P (D | Q) is, according to the bayesian formula:
where P (D) may take some query-independent measure, this term is not considered in the model. After the posterior probability P (D | Q) is calculated, the documents in the document set may be ranked according to the probability values, and a ranked list may be returned, as in step S202.
When the user submits the query picture, in step S201, a plurality of IR models based on image content features are established. Each IR model corresponds to an image feature, including color features, texture features, shape features, and the like. As previously mentioned, these image content features are all represented in vector space. Therefore, the similarity between feature vectors needs to be measured. The similarity is preferably calculated using euclidean distance. Two n-dimensional vectors (x)11,x12,…x1n) And (x)21,x22,…x2n) The Euclidean distance between them is:
larger distances indicate less correlation. After the similarity calculation is finished, the sorted list is returned in descending order of the degree of correlation as by step S202.
It should be noted that several lists are returned for several features extracted from the picture. For example, if two features, color and texture, of an image are extracted separately, two sorted lists are returned. Each list is sorted in descending order according to the degree of correlation with the query picture on the corresponding visual feature.
After obtaining a plurality of picture relevancy ranking lists, respectively giving each picture d on the listjAssigning a score SHLFIRMThe formula is as follows:
where ψ (x, H) denotes the position of picture x in list H, 1aIs an indicator function when a is true, i.e. when d is truejBelong to list LiIf so, 1 is taken, otherwise, 0 is taken. Alpha is alphaiIs the weight of the ith IR model, andintuitively, pictures that appear at a front position in the lists will get a higher score, with higher scoring pictures being more relevant to the query information. A score is computationally defined and sorted by the score as in steps S203 and S204.
In the above equation, if one of the IR models has better performance than the other IR models, a higher weight value should be assigned to the model to improve the performance of the whole system. In the embodiment of the invention, the weight alpha is set by adopting an automatic optimization methodi. Alternative automatic optimization methods include genetic algorithms, annealing algorithms, and the like.For example, in a genetic algorithm, we first initialize the weights of each model, and generate an initial population of weight vectors through multiple random initializations. Next, the fitness of each individual in the population is measured. In image retrieval, the adaptability of a weight vector is measured by the performance of a search result generated by the weight vector, namely, a plurality of test queries are given and relevant picture sets corresponding to the queries are obtained, and then the performance of the search result is output as an adaptability index according to a relevant picture set calculation program. There are many indexes for measuring the performance of the search result, such as F1 score, Normalized counted relational Gain, Mean Average Precision, etc. Then, the individuals are selected, crossed and mutated according to the adaptability, and a new generation of population is generated. The new generation of population information is superior to the previous generation. And (3) repeatedly, continuously improving the fitness of the weight vector until the algorithm termination condition is met: within the maximum iteration time limit, obtaining an individual meeting a preset weight vector fitness target value; or the maximum iteration number is reached, and then all the individuals with the highest fitness among all the generated individuals are returned.
The embodiment of the present invention may further include step S105: if the user is satisfied with the output result, the search process is ended; if the user is not satisfied with the output result or deviates from his/her previous idea, the query text and/or query picture can be supplemented or modified based on the previous query information. The method performs word segmentation and feature extraction on the modified text and/or picture, and repeats steps S102 to S105 until a result satisfied by the user is output.
Fig. 3 shows the results of the retrieval mechanism of the present invention compared to a single modality image retrieval mechanism. The upper part of the right half part of the figure shows the result of simple text retrieval, the middle part shows the result of simple image content retrieval, and the lower part is the result returned by the hybrid retrieval mechanism provided by the invention. The returned image results show that the performance of the method is greatly improved compared with the performance of the traditional single-mode retrieval method, and the requirements of users are met to a greater extent.
The image search system according to the present invention is a system corresponding to the above method, and as shown in fig. 4, includes:
the query information receiving terminal is used for receiving a query picture and/or a query text submitted by a user;
the query information processing module is used for extracting various content characteristics of the query picture and segmenting the query text;
the similarity single-item sequencing module is used for comparing various content characteristics of the inquired pictures with corresponding content characteristics of each picture in the database, and sequencing the pictures in the database according to the similarity to obtain various lists of content similarity; comparing the query text after word segmentation with descriptive documents corresponding to each picture in the database, and sequencing the pictures in the database according to the similarity to obtain a text similarity list;
and the comprehensive similarity sorting module is used for assigning scores to each picture in the database according to the positions in the lists and the weight of the list where the picture is located, re-sorting the pictures according to the assigned scores to obtain a comprehensive similarity sorting list, and returning the list to the user.
As a preferred embodiment, the content features of the picture include color features, texture features, and shape features.
As a preferred embodiment, the query information processing module performs word segmentation on the query text by using a hidden markov model.
As a preferred embodiment, the similarity single item ranking module measures the similarity between the query text after word segmentation and the descriptive document corresponding to each picture in the database by adopting a statistical language modeling method; and calculating the similarity between various content characteristics of the query picture and corresponding content characteristics of each picture in the database by adopting the Euclidean distance.
As a preferred embodiment, the comprehensive ranking module of similarity uses a genetic algorithm or an annealing algorithm to set the weight of each list.
In conclusion, the beneficial effects of the invention are as follows:
the invention improves the practicability of image search: first, most of the conventional search methods are based on monomodal search, and the way of expressing the query intention of the user is limited to a certain extent. Second, content-based image retrieval faces the semantic gap problem. The invention changes the status by combining the multi-modal information of text and image content.
The invention improves the flexibility of image search: most of the conventional image searching methods utilize several fixed features for searching, but the invention has the characteristic that feature combinations can be flexibly and freely configured.
The invention improves the accuracy of image search: according to the method, the return results based on the text image retrieval method and the content image retrieval method are combined, so that a more accurate picture relevancy ranking list is obtained, and the accuracy of the return results is greatly improved.
The above-mentioned embodiments only express several embodiments of the present invention, and the description thereof is more specific and detailed, but not construed as limiting the scope of the present invention. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the inventive concept, which falls within the scope of the present invention. Therefore, the protection scope of the present patent shall be subject to the appended claims.
Claims (10)
1. An image retrieval method, comprising the steps of:
receiving a query picture and/or a query text submitted by a user;
extracting various content characteristics of the query picture, and segmenting the query text;
comparing various content characteristics of the query picture with corresponding content characteristics of each picture in a database, and sequencing the pictures in the database according to the similarity to obtain various lists of content similarity; comparing the query text after word segmentation with descriptive documents corresponding to each picture in the database, and sequencing the pictures in the database according to the similarity to obtain a text similarity list;
and assigning a score to each picture in the database according to the position in each list and the weight of the list, reordering according to the assigned scores to obtain a comprehensive similarity ranking list, and returning the list to the user.
2. The image retrieval method according to claim 1,
the content features of the picture include color features, texture features, and shape features.
3. The image retrieval method according to claim 1 or 2,
and segmenting the query text by adopting a hidden Markov model.
4. The image retrieval method according to claim 1 or 2,
measuring the similarity between the query text after word segmentation and the descriptive document corresponding to each picture in the database by adopting a statistical language modeling method;
and calculating the similarity between various content characteristics of the query picture and corresponding content characteristics of each picture in the database by adopting the Euclidean distance.
5. The image retrieval method according to claim 1 or 2,
the weight of each list is set using a genetic algorithm or an annealing algorithm.
6. An image retrieval system, comprising:
the query information receiving terminal is used for receiving a query picture and/or a query text submitted by a user;
the query information processing module is used for extracting various content characteristics of the query picture and segmenting the query text;
the similarity single-item sequencing module is used for comparing various content characteristics of the inquired pictures with corresponding content characteristics of each picture in the database, and sequencing the pictures in the database according to the similarity to obtain various lists of content similarity; comparing the query text after word segmentation with descriptive documents corresponding to each picture in the database, and sequencing the pictures in the database according to the similarity to obtain a text similarity list;
and the comprehensive similarity sorting module is used for assigning scores to each picture in the database according to the positions in the lists and the weight of the list where the picture is located, re-sorting the pictures according to the assigned scores to obtain a comprehensive similarity sorting list, and returning the list to the user.
7. The image retrieval system of claim 6,
the content features of the picture include color features, texture features, and shape features.
8. Image retrieval system according to claim 6 or 7,
and the query information processing module adopts a hidden Markov model to perform word segmentation on the query text.
9. Image retrieval system according to claim 6 or 7,
the similarity single item ordering module measures the similarity between the query text after word segmentation and a descriptive document corresponding to each picture in a database by adopting a statistical language modeling method; and calculating the similarity between various content characteristics of the query picture and corresponding content characteristics of each picture in the database by adopting the Euclidean distance.
10. Image retrieval system according to claim 6 or 7,
the comprehensive similarity sorting module adopts a genetic algorithm or an annealing algorithm to set the weight of each list.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2013104202879A CN103473327A (en) | 2013-09-13 | 2013-09-13 | Image retrieval method and image retrieval system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2013104202879A CN103473327A (en) | 2013-09-13 | 2013-09-13 | Image retrieval method and image retrieval system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN103473327A true CN103473327A (en) | 2013-12-25 |
Family
ID=49798175
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2013104202879A Pending CN103473327A (en) | 2013-09-13 | 2013-09-13 | Image retrieval method and image retrieval system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103473327A (en) |
Cited By (37)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104765764A (en) * | 2015-02-06 | 2015-07-08 | 南京理工大学 | Indexing method based on large-scale image |
CN105518678A (en) * | 2015-06-29 | 2016-04-20 | 北京旷视科技有限公司 | Searching method, searching apparatus, user device and computer program product |
CN105989001A (en) * | 2015-01-27 | 2016-10-05 | 北京大学 | Image searching method and device, and image searching system |
WO2016201992A1 (en) * | 2015-06-17 | 2016-12-22 | 杭州海康威视数字技术股份有限公司 | Video storage and retrieval method for cloud storage server, and video cloud storage system |
CN106407323A (en) * | 2016-08-31 | 2017-02-15 | 上海交通大学 | Picture retrieving method based on diversity spatial distance |
CN106484831A (en) * | 2016-09-29 | 2017-03-08 | 百度在线网络技术(北京)有限公司 | Search system, method and apparatus |
CN106529444A (en) * | 2016-11-03 | 2017-03-22 | 广州市万表科技股份有限公司 | Method and device for acquiring watch information based on image identification |
CN106529606A (en) * | 2016-12-01 | 2017-03-22 | 中译语通科技(北京)有限公司 | Method of improving image recognition accuracy |
CN106708943A (en) * | 2016-11-22 | 2017-05-24 | 安徽睿极智能科技有限公司 | Image retrieval reordering method and system based on arrangement fusion |
CN106980664A (en) * | 2017-03-21 | 2017-07-25 | 苏州大学 | A kind of bilingual comparable corpora mining method and device |
CN107315837A (en) * | 2017-07-17 | 2017-11-03 | 梧州井儿铺贸易有限公司 | The accurate image indexing system of one kind retrieval |
CN107358266A (en) * | 2017-07-17 | 2017-11-17 | 陈剑桃 | A kind of mobile terminal with identification function |
CN107491456A (en) * | 2016-06-13 | 2017-12-19 | 阿里巴巴集团控股有限公司 | Image ranking method and device |
CN107516020A (en) * | 2017-08-17 | 2017-12-26 | 中国科学院深圳先进技术研究院 | Determination method, apparatus, equipment and the storage medium of sequence site importance |
CN107590291A (en) * | 2017-10-30 | 2018-01-16 | 广东欧珀移动通信有限公司 | A kind of searching method of picture, terminal device and storage medium |
CN107679208A (en) * | 2017-10-16 | 2018-02-09 | 广东欧珀移动通信有限公司 | A kind of searching method of picture, terminal device and storage medium |
CN107741972A (en) * | 2017-10-12 | 2018-02-27 | 广东欧珀移动通信有限公司 | A kind of searching method of picture, terminal device and storage medium |
CN107748779A (en) * | 2017-10-20 | 2018-03-02 | 百度在线网络技术(北京)有限公司 | information generating method and device |
CN107766853A (en) * | 2016-08-16 | 2018-03-06 | 阿里巴巴集团控股有限公司 | A kind of generation, display methods and the electronic equipment of the text message of image |
CN108280209A (en) * | 2018-01-31 | 2018-07-13 | 湖北工业大学 | A kind of image search method and system based on fireworks algorithm |
CN108287833A (en) * | 2017-01-09 | 2018-07-17 | 北京艺鉴通科技有限公司 | It is a kind of for the art work identification to scheme to search drawing method |
WO2018213943A1 (en) * | 2017-05-24 | 2018-11-29 | 上海冠勇信息科技有限公司 | Image retrieval method based on feature extraction |
CN110069654A (en) * | 2019-03-15 | 2019-07-30 | 平安城市建设科技(深圳)有限公司 | Source of houses searching method, device, equipment and computer readable storage medium |
CN110070512A (en) * | 2019-04-30 | 2019-07-30 | 秒针信息技术有限公司 | The method and device of picture modification |
CN110737794A (en) * | 2019-10-16 | 2020-01-31 | 北京锐安科技有限公司 | Image query method, system, server and storage medium |
CN110868543A (en) * | 2019-11-25 | 2020-03-06 | 三星电子(中国)研发中心 | Intelligent photographing method and device and computer readable storage medium |
CN111753118A (en) * | 2019-03-29 | 2020-10-09 | 杭州海康威视数字技术股份有限公司 | Vehicle retrieval method and device, electronic equipment and readable storage medium |
WO2021087770A1 (en) * | 2019-11-05 | 2021-05-14 | 深圳市欢太科技有限公司 | Picture classification method and apparatus, and storage medium and electronic device |
CN112819052A (en) * | 2021-01-25 | 2021-05-18 | 哈尔滨工业大学(深圳)(哈尔滨工业大学深圳科技创新研究院) | Multi-modal fine-grained mixing method, system, device and storage medium |
CN112861944A (en) * | 2021-01-28 | 2021-05-28 | 中山大学 | Image retrieval method and device based on mixed modal input |
CN113094464A (en) * | 2021-03-31 | 2021-07-09 | 哈尔滨理工大学 | Establishment and auxiliary identification method of extensible crop disease analysis library |
CN113139121A (en) * | 2020-01-20 | 2021-07-20 | 阿里巴巴集团控股有限公司 | Query method, model training method, device, equipment and storage medium |
CN113590854A (en) * | 2021-09-29 | 2021-11-02 | 腾讯科技(深圳)有限公司 | Data processing method, data processing equipment and computer readable storage medium |
WO2021232464A1 (en) * | 2020-05-20 | 2021-11-25 | 南京理工大学 | Character offset detection method and system |
CN113920335A (en) * | 2021-09-28 | 2022-01-11 | 苏州冷王网络科技有限公司 | Deep learning-based image and text embedded drug label identification method |
CN114238588A (en) * | 2022-02-24 | 2022-03-25 | 江西医之健科技有限公司 | Data retrieval method, system, readable storage medium and computer equipment |
CN116301655A (en) * | 2023-04-12 | 2023-06-23 | 深圳市英唐数码科技有限公司 | Method, system and readable storage medium for loading historical note pictures |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1920818A (en) * | 2006-09-14 | 2007-02-28 | 浙江大学 | Transmedia search method based on multi-mode information convergence analysis |
CN101419606A (en) * | 2008-11-13 | 2009-04-29 | 浙江大学 | Semi-automatic image labeling method based on semantic and content |
CN101901249A (en) * | 2009-05-26 | 2010-12-01 | 复旦大学 | Text-based query expansion and sort method in image retrieval |
CN101984420A (en) * | 2010-09-03 | 2011-03-09 | 百度在线网络技术(北京)有限公司 | Method and equipment for searching pictures based on word segmentation processing |
CN102129477A (en) * | 2011-04-23 | 2011-07-20 | 山东大学 | Multimode-combined image reordering method |
CN102262642A (en) * | 2011-01-28 | 2011-11-30 | 北京理工大学 | Web image search engine and realizing method thereof |
CN102402593A (en) * | 2010-11-05 | 2012-04-04 | 微软公司 | Multi-modal approach to search query input |
-
2013
- 2013-09-13 CN CN2013104202879A patent/CN103473327A/en active Pending
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1920818A (en) * | 2006-09-14 | 2007-02-28 | 浙江大学 | Transmedia search method based on multi-mode information convergence analysis |
CN101419606A (en) * | 2008-11-13 | 2009-04-29 | 浙江大学 | Semi-automatic image labeling method based on semantic and content |
CN101901249A (en) * | 2009-05-26 | 2010-12-01 | 复旦大学 | Text-based query expansion and sort method in image retrieval |
CN101984420A (en) * | 2010-09-03 | 2011-03-09 | 百度在线网络技术(北京)有限公司 | Method and equipment for searching pictures based on word segmentation processing |
CN102402593A (en) * | 2010-11-05 | 2012-04-04 | 微软公司 | Multi-modal approach to search query input |
CN102262642A (en) * | 2011-01-28 | 2011-11-30 | 北京理工大学 | Web image search engine and realizing method thereof |
CN102129477A (en) * | 2011-04-23 | 2011-07-20 | 山东大学 | Multimode-combined image reordering method |
Cited By (49)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105989001B (en) * | 2015-01-27 | 2019-09-06 | 北京大学 | Image search method and device, image search system |
CN105989001A (en) * | 2015-01-27 | 2016-10-05 | 北京大学 | Image searching method and device, and image searching system |
CN104765764A (en) * | 2015-02-06 | 2015-07-08 | 南京理工大学 | Indexing method based on large-scale image |
WO2016201992A1 (en) * | 2015-06-17 | 2016-12-22 | 杭州海康威视数字技术股份有限公司 | Video storage and retrieval method for cloud storage server, and video cloud storage system |
CN105518678A (en) * | 2015-06-29 | 2016-04-20 | 北京旷视科技有限公司 | Searching method, searching apparatus, user device and computer program product |
WO2017000109A1 (en) * | 2015-06-29 | 2017-01-05 | 北京旷视科技有限公司 | Search method, search apparatus, user equipment, and computer program product |
CN107491456A (en) * | 2016-06-13 | 2017-12-19 | 阿里巴巴集团控股有限公司 | Image ranking method and device |
CN107766853A (en) * | 2016-08-16 | 2018-03-06 | 阿里巴巴集团控股有限公司 | A kind of generation, display methods and the electronic equipment of the text message of image |
CN107766853B (en) * | 2016-08-16 | 2021-08-06 | 阿里巴巴集团控股有限公司 | Image text information generation and display method and electronic equipment |
CN106407323A (en) * | 2016-08-31 | 2017-02-15 | 上海交通大学 | Picture retrieving method based on diversity spatial distance |
CN106484831A (en) * | 2016-09-29 | 2017-03-08 | 百度在线网络技术(北京)有限公司 | Search system, method and apparatus |
CN106529444A (en) * | 2016-11-03 | 2017-03-22 | 广州市万表科技股份有限公司 | Method and device for acquiring watch information based on image identification |
CN106708943A (en) * | 2016-11-22 | 2017-05-24 | 安徽睿极智能科技有限公司 | Image retrieval reordering method and system based on arrangement fusion |
CN106529606A (en) * | 2016-12-01 | 2017-03-22 | 中译语通科技(北京)有限公司 | Method of improving image recognition accuracy |
CN108287833A (en) * | 2017-01-09 | 2018-07-17 | 北京艺鉴通科技有限公司 | It is a kind of for the art work identification to scheme to search drawing method |
CN106980664B (en) * | 2017-03-21 | 2020-11-10 | 苏州大学 | Bilingual comparable corpus mining method and device |
CN106980664A (en) * | 2017-03-21 | 2017-07-25 | 苏州大学 | A kind of bilingual comparable corpora mining method and device |
US11157767B2 (en) | 2017-05-24 | 2021-10-26 | Shanghai Firstbrave Information Technology Co., Ltd. | Image searching method based on feature extraction |
WO2018213943A1 (en) * | 2017-05-24 | 2018-11-29 | 上海冠勇信息科技有限公司 | Image retrieval method based on feature extraction |
CN107358266A (en) * | 2017-07-17 | 2017-11-17 | 陈剑桃 | A kind of mobile terminal with identification function |
CN107358266B (en) * | 2017-07-17 | 2020-11-10 | 浙江信电技术股份有限公司 | Mobile terminal with recognition function |
CN107315837A (en) * | 2017-07-17 | 2017-11-03 | 梧州井儿铺贸易有限公司 | The accurate image indexing system of one kind retrieval |
CN107516020B (en) * | 2017-08-17 | 2021-05-14 | 中国科学院深圳先进技术研究院 | Method, device, equipment and storage medium for determining importance of sequence sites |
CN107516020A (en) * | 2017-08-17 | 2017-12-26 | 中国科学院深圳先进技术研究院 | Determination method, apparatus, equipment and the storage medium of sequence site importance |
CN107741972A (en) * | 2017-10-12 | 2018-02-27 | 广东欧珀移动通信有限公司 | A kind of searching method of picture, terminal device and storage medium |
CN107679208A (en) * | 2017-10-16 | 2018-02-09 | 广东欧珀移动通信有限公司 | A kind of searching method of picture, terminal device and storage medium |
CN107748779A (en) * | 2017-10-20 | 2018-03-02 | 百度在线网络技术(北京)有限公司 | information generating method and device |
CN107590291A (en) * | 2017-10-30 | 2018-01-16 | 广东欧珀移动通信有限公司 | A kind of searching method of picture, terminal device and storage medium |
CN108280209B (en) * | 2018-01-31 | 2020-07-07 | 湖北工业大学 | Image retrieval method and system based on firework algorithm |
CN108280209A (en) * | 2018-01-31 | 2018-07-13 | 湖北工业大学 | A kind of image search method and system based on fireworks algorithm |
CN110069654A (en) * | 2019-03-15 | 2019-07-30 | 平安城市建设科技(深圳)有限公司 | Source of houses searching method, device, equipment and computer readable storage medium |
CN111753118A (en) * | 2019-03-29 | 2020-10-09 | 杭州海康威视数字技术股份有限公司 | Vehicle retrieval method and device, electronic equipment and readable storage medium |
CN110070512A (en) * | 2019-04-30 | 2019-07-30 | 秒针信息技术有限公司 | The method and device of picture modification |
CN110737794A (en) * | 2019-10-16 | 2020-01-31 | 北京锐安科技有限公司 | Image query method, system, server and storage medium |
WO2021087770A1 (en) * | 2019-11-05 | 2021-05-14 | 深圳市欢太科技有限公司 | Picture classification method and apparatus, and storage medium and electronic device |
CN110868543A (en) * | 2019-11-25 | 2020-03-06 | 三星电子(中国)研发中心 | Intelligent photographing method and device and computer readable storage medium |
CN113139121A (en) * | 2020-01-20 | 2021-07-20 | 阿里巴巴集团控股有限公司 | Query method, model training method, device, equipment and storage medium |
WO2021232464A1 (en) * | 2020-05-20 | 2021-11-25 | 南京理工大学 | Character offset detection method and system |
CN112819052A (en) * | 2021-01-25 | 2021-05-18 | 哈尔滨工业大学(深圳)(哈尔滨工业大学深圳科技创新研究院) | Multi-modal fine-grained mixing method, system, device and storage medium |
CN112861944A (en) * | 2021-01-28 | 2021-05-28 | 中山大学 | Image retrieval method and device based on mixed modal input |
CN112861944B (en) * | 2021-01-28 | 2022-09-23 | 中山大学 | Image retrieval method and device based on mixed modal input |
CN113094464A (en) * | 2021-03-31 | 2021-07-09 | 哈尔滨理工大学 | Establishment and auxiliary identification method of extensible crop disease analysis library |
CN113094464B (en) * | 2021-03-31 | 2024-05-28 | 哈尔滨理工大学 | Method for establishing and assisting in identifying expandable crop disease analysis library |
CN113920335A (en) * | 2021-09-28 | 2022-01-11 | 苏州冷王网络科技有限公司 | Deep learning-based image and text embedded drug label identification method |
CN113590854A (en) * | 2021-09-29 | 2021-11-02 | 腾讯科技(深圳)有限公司 | Data processing method, data processing equipment and computer readable storage medium |
CN113590854B (en) * | 2021-09-29 | 2021-12-31 | 腾讯科技(深圳)有限公司 | Data processing method, data processing equipment and computer readable storage medium |
CN114238588A (en) * | 2022-02-24 | 2022-03-25 | 江西医之健科技有限公司 | Data retrieval method, system, readable storage medium and computer equipment |
CN116301655A (en) * | 2023-04-12 | 2023-06-23 | 深圳市英唐数码科技有限公司 | Method, system and readable storage medium for loading historical note pictures |
CN116301655B (en) * | 2023-04-12 | 2023-11-07 | 深圳市英唐数码科技有限公司 | Method, system and readable storage medium for loading historical note pictures |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103473327A (en) | Image retrieval method and image retrieval system | |
Goëau et al. | Lifeclef plant identification task 2014 | |
Li et al. | Contextual bag-of-words for visual categorization | |
Jang et al. | Car-Rec: A real time car recognition system | |
CN106202256A (en) | Propagate based on semanteme and mix the Web graph of multi-instance learning as search method | |
CN106095829A (en) | Cross-media retrieval method based on degree of depth study with the study of concordance expression of space | |
CN110929498B (en) | Method and device for calculating similarity of short text and readable storage medium | |
Arrish et al. | Shape-based plagiarism detection for flowchart figures in texts | |
CN114461890A (en) | Hierarchical multi-modal intellectual property search engine method and system | |
Dharani et al. | Content based image retrieval system using feature classification with modified KNN algorithm | |
CN112100212A (en) | Case scenario extraction method based on machine learning and rule matching | |
Wang et al. | Beauty product image retrieval based on multi-feature fusion and feature aggregation | |
CN115982144A (en) | Similar text duplicate removal method and device, storage medium and electronic device | |
Karamti et al. | Content-based image retrieval system using neural network | |
CN107423294A (en) | A kind of community image search method and system | |
JP6017277B2 (en) | Program, apparatus and method for calculating similarity between contents represented by set of feature vectors | |
CN103049570B (en) | Based on the image/video search ordering method of relevant Preserving map and a sorter | |
CN115827990B (en) | Searching method and device | |
JP5833499B2 (en) | Retrieval device and program for retrieving content expressed by high-dimensional feature vector set with high accuracy | |
Huang et al. | Multi-query image retrieval using CNN and SIFT features | |
Tian et al. | Research on image classification based on a combination of text and visual features | |
CN104715071B (en) | A kind of specific pedestrian retrieval method described based on imperfect text | |
Thollard et al. | Content-based re-ranking of text-based image search results | |
Kumar et al. | Automatic feature weight determination using indexing and pseudo-relevance feedback for multi-feature content-based image retrieval | |
CN107092875B (en) | Novel scene recognition method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
AD01 | Patent right deemed abandoned |
Effective date of abandoning: 20180123 |
|
AD01 | Patent right deemed abandoned |