CN113128554B

CN113128554B - A target positioning method, system, device and medium based on template matching

Info

Publication number: CN113128554B
Application number: CN202110257976.7A
Authority: CN
Inventors: 彭绍湖; 朱希诚; 胡晓; 刘长红; 杨兴鑫
Original assignee: Guangzhou University
Current assignee: Guangzhou University
Priority date: 2021-03-10
Filing date: 2021-03-10
Publication date: 2022-05-24
Anticipated expiration: 2041-03-10
Also published as: CN113128554A

Abstract

The invention discloses a target positioning method, a system, a device and a medium based on template matching, wherein the method comprises the following steps: constructing a template image pyramid and constructing a first grid; selecting a first grid with the average gradient amplitude being greater than or equal to a preset first threshold value as a second grid; determining first gradient amplitude and first gradient direction entropy of all pixel points in a second grid, determining a first local feature point according to the first gradient amplitude and the first gradient direction entropy, and establishing a first LBP histogram feature vector according to the first local feature point; and establishing a second LBP histogram feature vector of the target image pyramid, and determining the position and the rotation angle of the target to be positioned according to the similarity matching of the first LBP histogram feature vector and the second LBP histogram feature vector. The invention reduces the requirement on system computing power, overcomes the defect of large influence of rotation in the prior art, improves the accuracy of target positioning, and can be widely applied to the technical field of computer vision.

Description

A target positioning method, system, device and medium based on template matching

技术领域technical field

本发明涉及计算机视觉技术领域，尤其是一种基于模板匹配的目标定位方法、系统、装置及介质。The present invention relates to the technical field of computer vision, in particular to a target positioning method, system, device and medium based on template matching.

背景技术Background technique

模板匹配在计算机视觉和图像处理领域中属于经典的方法，通过给定模板图像和目标图像，提取图像的特征向量，计算模板图像与目标图像中候选窗口的相似度，与模板图像最相似的候选窗口则为匹配结果。Template matching is a classic method in the field of computer vision and image processing. By giving a template image and a target image, the feature vector of the image is extracted, the similarity between the template image and the candidate window in the target image is calculated, and the candidate window most similar to the template image is calculated. The window is the matching result.

LBP(Local Binary Patterns)算法，自从2002年被Timo Ojala等人提出，通过比较像素点与八邻域的灰度值大小，共有36组二值模式用于表示像素点的旋转不变性，其中uniform≤2的9个二值模式被称为等价模式，其余二值模式被称为混合模式。基于LBP的变式也不断在创新，但是基于LBP算法的核心都是局部像素点的二值模式。Li Liu等人提出的MRELBP算法，包括基于中心强度的ELBP_CI方法、基于邻域强度的ELBP_NI方法、基于径向差异的ELBP_RD方法和基于角差的ELBP_AD方法，并且在ELBP的基础上，使用中值滤波器的响应来代替单个像素的值，最终在从混合模式中挑选出5个具有对称性的模式与等价模式的9个模式合并，但是混合模式中旋转不变稳定性不够，受旋转的影响大。Ioan Buciu等人提出的LBP-NMF算法针对面部表情，提取双眼、鼻子和嘴部这四个部分的图片的LBP直方图特征做模板匹配，因为区域内每个点的LBP直方图信息都需要被用于计算，所以计算消耗的时间一定更多，并且结果的准确性也与这四个部分的位置划分相关。The LBP (Local Binary Patterns) algorithm has been proposed by Timo Ojala et al since 2002. By comparing the gray value of pixels and eight neighborhoods, a total of 36 sets of binary patterns are used to represent the rotation invariance of pixels, among which uniform The 9 binary modes ≤ 2 are called equivalent modes, and the remaining binary modes are called mixed modes. Variants based on LBP are also constantly innovating, but the core of the LBP-based algorithm is the binary pattern of local pixels. The MRELBP algorithm proposed by Li Liu et al. includes the ELBP_CI method based on the center strength, the ELBP_NI method based on the neighborhood strength, the ELBP_RD method based on the radial difference and the ELBP_AD method based on the angular difference, and on the basis of ELBP, the median value is used. The response of the filter replaces the value of a single pixel, and finally 5 modes with symmetry are selected from the blending mode and merged with 9 modes of the equivalent mode, but the rotation invariant stability in the blending mode is not enough, affected by the rotation. Great impact. The LBP-NMF algorithm proposed by Ioan Buciu et al. is aimed at facial expressions and extracts the LBP histogram features of the four parts of the eyes, nose and mouth for template matching, because the LBP histogram information of each point in the region needs to be It is used for calculation, so the calculation time must be more, and the accuracy of the result is also related to the location division of these four parts.

发明内容SUMMARY OF THE INVENTION

本发明的目的在于至少一定程度上解决现有技术中存在的技术问题之一。The purpose of the present invention is to solve one of the technical problems existing in the prior art at least to a certain extent.

为此，本发明实施例的一个目的在于提供一种基于模板匹配的目标定位方法，该方法一方面根据筛选出的局部特征点建立LBP直方图特征向量，无需对图像内每个像素点都进行计算，减少了对系统算力的要求，另一方面根据LBP直方图特征向量的相似度匹配确定出稳定像素点，克服了现有技术受旋转影响大的缺点，提高了目标定位的准确度。To this end, an object of the embodiments of the present invention is to provide a target localization method based on template matching. On the one hand, the method establishes an LBP histogram feature vector according to the selected local feature points, without performing the detection of each pixel in the image. On the other hand, the stable pixel points are determined according to the similarity matching of the feature vector of the LBP histogram, which overcomes the shortcomings of the prior art that are greatly affected by rotation, and improves the accuracy of target positioning.

本发明实施例的另一个目的在于提供一种基于模板匹配的目标定位系统。Another object of the embodiments of the present invention is to provide a target positioning system based on template matching.

为了达到上述技术目的，本发明实施例所采取的技术方案包括：In order to achieve the above technical purpose, the technical solutions adopted in the embodiments of the present invention include:

第一方面，本发明实施例提供了一种基于模板匹配的目标定位方法，包括以下步骤：In a first aspect, an embodiment of the present invention provides a template matching-based target positioning method, including the following steps:

获取模板图像，根据所述模板图像构建模板图像金字塔，并根据所述模板图像金字塔构建第一网格；obtaining a template image, constructing a template image pyramid according to the template image, and constructing a first grid according to the template image pyramid;

确定所述第一网格中所有像素点的平均梯度幅度，并选取平均梯度幅度大于等于预设的第一阈值的第一网格作为第二网格；Determine the average gradient magnitude of all pixel points in the first grid, and select the first grid whose average gradient magnitude is greater than or equal to a preset first threshold as the second grid;

确定所述第二网格中所有像素点的第一梯度幅度和第一梯度方向熵值，并根据所述第一梯度幅度和所述第一梯度方向熵值确定第一局部特征点，并根据所述第一局部特征点建立第一LBP直方图特征向量；Determine the first gradient magnitude and the first gradient direction entropy value of all the pixel points in the second grid, and determine the first local feature point according to the first gradient magnitude and the first gradient direction entropy value, and according to The first local feature point establishes a first LBP histogram feature vector;

获取目标图像，根据所述目标图像构建目标图像金字塔，并确定所述目标图像金字塔的第二局部特征点，进而根据所述第二局部特征点建立第二LBP直方图特征向量，再根据所述第一LBP直方图特征向量和所述第二LBP直方图特征向量的相似度匹配确定待定位目标的位置和旋转角度。Acquiring a target image, constructing a target image pyramid according to the target image, and determining a second local feature point of the target image pyramid, and then establishing a second LBP histogram feature vector according to the second local feature point, and then according to the The similarity between the first LBP histogram feature vector and the second LBP histogram feature vector determines the position and rotation angle of the target to be located.

进一步地，在本发明的一个实施例中，所述获取模板图像，根据所述模板图像构建模板图像金字塔，并根据所述模板图像金字塔构建第一网格这一步骤，其具体包括：Further, in an embodiment of the present invention, the step of acquiring a template image, constructing a template image pyramid according to the template image, and constructing a first grid according to the template image pyramid specifically includes:

获取模板图像，对所述模板图像的宽和高进行扩展得到扩展模板图像；Obtain a template image, and expand the width and height of the template image to obtain an expanded template image;

对所述扩展模板图像进行下采样，生成模板图像金字塔；down-sampling the expanded template image to generate a template image pyramid;

提取所述模板图像金字塔中每一层的第一金字塔图像，并根据所述第一金字塔图像通过四叉树分割自适应构建第一网格；Extracting the first pyramid image of each layer in the template image pyramid, and adaptively constructing a first grid by quadtree segmentation according to the first pyramid image;

其中，所述第一金字塔图像被对应的第一网格所覆盖。Wherein, the first pyramid image is covered by the corresponding first grid.

进一步地，在本发明的一个实施例中，所述根据所述第一金字塔图像通过四叉树分割自适应构建第一网格这一步骤中，当所述第一网格的宽或高小于预设的第六阈值时，不再进行四叉树分割，所述第六阈值满足以下条件：Further, in an embodiment of the present invention, in the step of adaptively constructing a first grid through quadtree segmentation according to the first pyramid image, when the width or height of the first grid is less than When the preset sixth threshold is used, no quadtree segmentation is performed, and the sixth threshold satisfies the following conditions:

其中，μ_s表示第六阈值，W_TL表示模板图像金字塔的顶层图像的宽，H_TL表示模板图像金字塔最顶层图像的高。Among them, μ _s represents the sixth threshold, W _TL represents the width of the top image of the template image pyramid, and H _TL represents the height of the top image of the template image pyramid.

进一步地，在本发明的一个实施例中，所述确定所述第一网格中所有像素点的平均梯度幅度，并选取平均梯度幅度大于等于预设的第一阈值的第一网格作为第二网格这一步骤，其具体包括：Further, in an embodiment of the present invention, the average gradient magnitude of all pixels in the first grid is determined, and the first grid whose average gradient magnitude is greater than or equal to a preset first threshold is selected as the first grid. The second grid step, which specifically includes:

计算所述第一网格中所有像素点的梯度幅度总和，并根据所述第一网格中像素点的数量和所述梯度幅度总和确定平均梯度幅度；Calculate the sum of the gradient magnitudes of all pixels in the first grid, and determine the average gradient magnitude according to the number of pixels in the first grid and the sum of the gradient magnitudes;

确定平均梯度幅度大于等于所述第一阈值，则获取对应的第一网格作为第二网格。If it is determined that the average gradient magnitude is greater than or equal to the first threshold, the corresponding first grid is acquired as the second grid.

进一步地，在本发明的一个实施例中，所述确定所述第二网格中所有像素点的第一梯度幅度和第一梯度方向熵值，并根据所述第一梯度幅度和所述第一梯度方向熵值确定第一局部特征点，并根据所述第一局部特征点建立第一LBP直方图特征向量这一步骤，其具体包括：Further, in an embodiment of the present invention, the first gradient magnitude and the first gradient direction entropy value of all the pixel points in the second grid are determined, and the first gradient magnitude and the first gradient direction entropy value are determined according to the first gradient magnitude and the A gradient direction entropy value determines a first local feature point, and the step of establishing a first LBP histogram feature vector according to the first local feature point specifically includes:

确定所述第二网格中所有像素点的第一梯度幅度和第一梯度方向熵值；determining the first gradient magnitude and the first gradient direction entropy value of all pixels in the second grid;

获取预设的第二阈值和第三阈值，选取所述第一梯度幅度大于等于所述第二阈值且所述第一梯度方向熵值大于等于所述第三阈值的像素点作为第一局部特征点；Obtain a preset second threshold and a third threshold, and select pixels whose first gradient magnitude is greater than or equal to the second threshold and whose first gradient direction entropy value is greater than or equal to the third threshold as the first local feature point;

将所述第一局部特征点作为中心像素点提取出第一LBP直方图特征向量。The first LBP histogram feature vector is extracted by taking the first local feature point as the central pixel point.

进一步地，在本发明的一个实施例中，所述将所述第一局部特征点作为中心像素点提取出第一LBP直方图特征向量这一步骤，其具体为：Further, in an embodiment of the present invention, the step of extracting the first LBP histogram feature vector by using the first local feature point as a central pixel point is specifically:

将所述第一局部特征点作为中心像素点，并结合LBP等价模式下uniform≤2的九种模式，建立第一局部特征点的第一LBP直方图特征向量。The first LBP histogram feature vector of the first local feature point is established by taking the first local feature point as the central pixel point and combining the nine modes of uniform≤2 in the LBP equivalent mode.

进一步地，在本发明的一个实施例中，所述获取目标图像，根据所述目标图像构建目标图像金字塔，并确定所述目标图像金字塔的第二局部特征点，进而根据所述第二局部特征点建立第二LBP直方图特征向量，再根据所述第一LBP直方图特征向量和所述第二LBP直方图特征向量的相似度匹配确定待定位目标的位置和旋转角度这一步骤，其具体包括：Further, in an embodiment of the present invention, the acquiring a target image, constructing a target image pyramid according to the target image, and determining a second local feature point of the target image pyramid, and then according to the second local feature point to establish the second LBP histogram feature vector, and then according to the similarity matching of the first LBP histogram feature vector and the second LBP histogram feature vector to determine the position and rotation angle of the target to be located. include:

获取目标图像，根据所述目标图像构建目标图像金字塔，并根据所述目标图像金字塔构建第三网格；Obtaining a target image, constructing a target image pyramid according to the target image, and constructing a third grid according to the target image pyramid;

确定所述第三网格中所有像素点的平均梯度幅度，并选取平均梯度幅度大于等于预设的第四阈值的第三网格作为第四网格；Determine the average gradient magnitude of all pixel points in the third grid, and select the third grid whose average gradient magnitude is greater than or equal to a preset fourth threshold as the fourth grid;

确定所述第四网格中所有像素点的第二梯度幅度和第二梯度方向熵值，并根据所述第二梯度幅度和所述第二梯度方向熵值确定第二局部特征点，并根据所述第二局部特征点建立第二LBP直方图特征向量；Determine the second gradient magnitude and the second gradient direction entropy value of all the pixel points in the fourth grid, and determine the second local feature point according to the second gradient magnitude and the second gradient direction entropy value, and according to The second local feature point establishes a second LBP histogram feature vector;

计算所述模板图像金字塔和所述目标图像金字塔中每一层的第一LBP直方图特征向量和对应位置的第二LBP直方图特征向量的相似度，从顶层到底层依次进行相似度匹配，并将相似度小于预设的第五阈值的第二局部特征点舍去，进而在所述目标图像金字塔的底层筛选出相似度大于等于所述第五阈值的第二局部特征点，从而确定待定位目标的位置和旋转角度。Calculate the similarity of the first LBP histogram feature vector of each layer in the template image pyramid and the target image pyramid and the second LBP histogram feature vector of the corresponding position, and perform similarity matching from the top layer to the bottom layer in turn, and The second local feature points whose similarity is less than the preset fifth threshold are discarded, and the second local feature points whose similarity is greater than or equal to the fifth threshold are screened out at the bottom of the target image pyramid, so as to determine the to-be-located feature. The position and rotation angle of the target.

第二方面，本发明实施例提供了一种基于模板匹配的目标定位系统，包括：In a second aspect, an embodiment of the present invention provides a target positioning system based on template matching, including:

网格构建模块，用于获取模板图像，根据所述模板图像构建模板图像金字塔，并根据所述模板图像金字塔构建第一网格；a grid construction module for acquiring a template image, constructing a template image pyramid according to the template image, and constructing a first grid according to the template image pyramid;

网格选取模块，用于确定所述第一网格中所有像素点的平均梯度幅度，并选取平均梯度幅度大于等于预设的第一阈值的第一网格作为第二网格；a grid selection module, configured to determine the average gradient amplitude of all the pixels in the first grid, and select the first grid whose average gradient amplitude is greater than or equal to a preset first threshold as the second grid;

LBP直方图特征向量建立模块，用于确定所述第二网格中所有像素点的第一梯度幅度和第一梯度方向熵值，并根据所述第一梯度幅度和所述第一梯度方向熵值确定第一局部特征点，并根据所述第一局部特征点建立第一LBP直方图特征向量；The LBP histogram feature vector establishment module is used to determine the first gradient magnitude and the first gradient direction entropy value of all the pixels in the second grid, and according to the first gradient magnitude and the first gradient direction entropy value determines a first local feature point, and establishes a first LBP histogram feature vector according to the first local feature point;

相似度匹配模块，用于获取目标图像，根据所述目标图像构建目标图像金字塔，并确定所述目标图像金字塔的第二局部特征点，进而根据所述第二局部特征点建立第二LBP直方图特征向量，再根据所述第一LBP直方图特征向量和所述第二LBP直方图特征向量的相似度匹配确定待定位目标的位置和旋转角度。The similarity matching module is used to obtain a target image, construct a target image pyramid according to the target image, and determine a second local feature point of the target image pyramid, and then establish a second LBP histogram according to the second local feature point feature vector, and then determine the position and rotation angle of the target to be located according to similarity matching between the first LBP histogram feature vector and the second LBP histogram feature vector.

第三方面，本发明实施例提供了一种基于模板匹配的目标定位装置，包括：In a third aspect, an embodiment of the present invention provides a target positioning device based on template matching, including:

至少一个处理器；at least one processor;

至少一个存储器，用于存储至少一个程序；at least one memory for storing at least one program;

当所述至少一个程序被所述至少一个处理器执行时，使得所述至少一个处理器实现上述的一种基于模板匹配的目标定位方法。When the at least one program is executed by the at least one processor, the at least one processor is caused to implement the above-mentioned method for locating objects based on template matching.

第四方面，本发明实施例还提供了一种计算机可读存储介质，其中存储有处理器可执行的程序，所述处理器可执行的程序在由处理器执行时用于执行上述的一种基于模板匹配的目标定位方法。In a fourth aspect, an embodiment of the present invention further provides a computer-readable storage medium, in which a program executable by a processor is stored, and the program executable by the processor is used to execute the above one when executed by the processor Object localization method based on template matching.

本发明的优点和有益效果将在下面的描述中部分给出，部分将从下面的描述中变得明显，或通过本发明的实践了解到：The advantages and beneficial effects of the present invention will, in part, be given in the following description, and in part will become apparent from the following description, or be learned by practice of the present invention:

本发明实施例基于梯度幅度和梯度方向熵值筛选出金字塔图像中属于边缘和纹理的局部特征点，并根据局部特征点建立LBP直方图特征向量，进而根据LBP直方图特征向量的相似度匹配确定出稳定像素点，从而获取待定位目标的位置和旋转角度，本发明实施例相较现有技术而言，一方面根据筛选出的局部特征点建立LBP直方图特征向量，无需对图像内每个像素点都进行计算，减少了对系统算力的要求，另一方面根据LBP直方图特征向量的相似度匹配确定出稳定像素点，克服了现有技术受旋转影响大的缺点，提高了目标定位的准确度。In the embodiment of the present invention, local feature points belonging to edges and textures in the pyramid image are screened based on the gradient magnitude and gradient direction entropy value, and the LBP histogram feature vector is established according to the local feature points, and then the LBP histogram feature vector is determined according to the similarity matching of the LBP histogram feature vectors. Compared with the prior art, the embodiment of the present invention establishes the LBP histogram feature vector according to the selected local feature points on the one hand, and does not need to All pixel points are calculated, which reduces the requirement for system computing power. On the other hand, stable pixel points are determined according to the similarity matching of LBP histogram feature vectors, which overcomes the shortcomings of the prior art that are greatly affected by rotation and improves target positioning. accuracy.

附图说明Description of drawings

为了更清楚地说明本发明实施例中的技术方案，下面对本发明实施例中所需要使用的附图作以下介绍，应当理解的是，下面介绍中的附图仅仅为了方便清晰表述本发明的技术方案中的部分实施例，对于本领域的技术人员来说，在无需付出创造性劳动的前提下，还可以根据这些附图获取到其他附图。In order to explain the technical solutions in the embodiments of the present invention more clearly, the following descriptions are given to the accompanying drawings that are used in the embodiments of the present invention. It should be understood that the accompanying drawings in the following introduction are only for the convenience of clearly expressing the technology of the present invention. For some of the embodiments in the solution, for those skilled in the art, other drawings can also be obtained from these drawings without the need for creative work.

图1为本发明实施例提供的一种基于模板匹配的目标定位方法的步骤流程图；1 is a flowchart of steps of a template matching-based target positioning method provided by an embodiment of the present invention;

图2为本发明实施例提供的一种基于模板匹配的目标定位系统的结构框图；2 is a structural block diagram of a template matching-based target positioning system provided by an embodiment of the present invention;

图3为本发明实施例提供的一种基于模板匹配的目标定位装置的结构框图。FIG. 3 is a structural block diagram of a target positioning apparatus based on template matching according to an embodiment of the present invention.

具体实施方式Detailed ways

下面详细描述本发明的实施例，所述实施例的示例在附图中示出，其中自始至终相同或类似的标号表示相同或类似的元件或具有相同或类似功能的元件。下面通过参考附图描述的实施例是示例性的，仅用于解释本发明，而不能理解为对本发明的限制。对于以下实施例中的步骤编号，其仅为了便于阐述说明而设置，对步骤之间的顺序不做任何限定，实施例中的各步骤的执行顺序均可根据本领域技术人员的理解来进行适应性调整。The following describes in detail the embodiments of the present invention, examples of which are illustrated in the accompanying drawings, wherein the same or similar reference numerals refer to the same or similar elements or elements having the same or similar functions throughout. The embodiments described below with reference to the accompanying drawings are exemplary, only used to explain the present invention, and should not be construed as a limitation of the present invention. The numbers of the steps in the following embodiments are only set for the convenience of description, and the sequence between the steps is not limited in any way, and the execution sequence of each step in the embodiments can be adapted according to the understanding of those skilled in the art Sexual adjustment.

在本发明的描述中，多个的含义是两个或两个以上，如果有描述到第一、第二只是用于区分技术特征为目的，而不能理解为指示或暗示相对重要性或者隐含指明所指示的技术特征的数量或者隐含指明所指示的技术特征的先后关系。此外，除非另有定义，本文所使用的所有的技术和科学术语与本技术领域的技术人员通常理解的含义相同。In the description of the present invention, the meaning of multiple is two or more. If the first and second are described, they are only for the purpose of distinguishing technical features, and should not be understood as indicating or implying relative importance or implicit Indicates the number of the indicated technical features or implicitly indicates the order of the indicated technical features. Also, unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art.

参照图1，本发明实施例提供了一种基于模板匹配的目标定位方法，具体包括以下步骤：Referring to FIG. 1 , an embodiment of the present invention provides a target positioning method based on template matching, which specifically includes the following steps:

S101、获取模板图像，根据模板图像构建模板图像金字塔，并根据模板图像金字塔构建第一网格；S101, obtaining a template image, constructing a template image pyramid according to the template image, and constructing a first grid according to the template image pyramid;

具体地，模板图像金字塔中每一层均有对应的金字塔图像，第一网格用于覆盖该金字塔图像，第一网格中包含该金字塔图像的所有像素点，便于后续根据第一网格进行筛选以及确定属于边缘和纹理的局部特征点。步骤S101包括以下步骤：Specifically, each layer in the template image pyramid has a corresponding pyramid image, the first grid is used to cover the pyramid image, and the first grid contains all the pixels of the pyramid image, which is convenient for subsequent processing according to the first grid. Screen and identify local feature points belonging to edges and textures. Step S101 includes the following steps:

S1011、获取模板图像，对模板图像的宽和高进行扩展得到扩展模板图像；S1011. Obtain a template image, and expand the width and height of the template image to obtain an expanded template image;

S1012、对扩展模板图像进行下采样，生成模板图像金字塔；S1012, down-sampling the extended template image to generate a template image pyramid;

S1013、提取模板图像金字塔中每一层的第一金字塔图像，并根据第一金字塔图像通过四叉树分割自适应构建第一网格；S1013, extract the first pyramid image of each layer in the template image pyramid, and construct the first grid adaptively by quadtree segmentation according to the first pyramid image;

其中，第一金字塔图像被对应的第一网格所覆盖。Wherein, the first pyramid image is covered by the corresponding first grid.

具体地，获取模板图像，分别对模板图像的宽和高进行扩展，扩展为最接近的2的幂次方。对扩展后的图像以比例因子为2的高斯平滑进行下采样，生成模板图像金字塔。Specifically, a template image is obtained, and the width and height of the template image are respectively expanded to the nearest power of 2. The expanded image is downsampled with Gaussian smoothing with a scale factor of 2 to generate a template image pyramid.

对模板图金字塔中每一层的图像，以Angle_j角度进行旋转，Angle_j角度通过下式计算：Rotate the image of each layer in the template image pyramid at the angle of Angle _j , _which is calculated by the following formula:

其中，Angle_s是旋转的起始角度，Angle_E是旋转的终止角度，旋转角度基准步长μ_A被模板图金字塔的层数和旋转角度的范围限制，因此当旋转角度的范围小，μ_A的值也应该减小，而当金字塔的总层数小，μ_A的值应该增大。自金字塔顶层往下的每一层，旋转角度步长d_Ai相应减小，精确角度的同时减少计算所耗时间，通常μ_A设置为1。Among them, Angle _s is the starting angle of the rotation, Angle _E is the ending angle of the rotation, and the reference step size μ _A of the rotation angle is limited by the number of layers of the template image pyramid and the range of the rotation angle, so when the range of the rotation angle is small, μ _A should also decrease, while the value of _μA should increase when the total number of pyramid levels is small. For each layer down from the top layer of the pyramid, the rotation angle step size d _Ai is correspondingly reduced, and the calculation time is reduced while the angle is accurate. Usually μ _A is set to 1.

模板图金字塔的顶层图像的尺寸越小，与目标图匹配的计算耗时越短，但是匹配结果的正确率也降低，当金字塔上一层的结果在下一层匹配得分未高于阈值，则意味着该结果的下采样匹配的终止。The smaller the size of the top-level image of the template image pyramid, the shorter the calculation time for matching the target image, but the accuracy of the matching result is also reduced. When the result of the previous layer of the pyramid is not higher than the threshold, it means The termination of the downsampling match with the result.

进一步作为可选的实施方式，根据第一金字塔图像通过四叉树分割自适应构建第一网格这一步骤中，当第一网格的宽或高小于预设的第六阈值时，不再进行四叉树分割，第六阈值满足以下条件：Further as an optional embodiment, in the step of adaptively constructing the first grid through quadtree segmentation according to the first pyramid image, when the width or height of the first grid is less than the preset sixth threshold, no longer For quadtree segmentation, the sixth threshold satisfies the following conditions:

具体地，提取模板图像金字塔中所有图像，包括以一定角度旋转每层金字塔图像获得的图像，基于四叉树原理自适应构建第一网格。Specifically, all images in the template image pyramid are extracted, including images obtained by rotating each layer of pyramid images at a certain angle, and the first grid is adaptively constructed based on the quadtree principle.

网格尺寸大小可为3*3，同时所有图像需要被构建的相应第一网格完全覆盖。The grid size can be 3*3, while all images need to be completely covered by the corresponding first grid constructed.

第六阈值μ_s越小，通过筛选的网格越多，提取特征像素点的范围增加，更多的特征像素点能使匹配的精度提升，但会浪费计算与内存资源。同时更多信息量不够的像素点也随之包含在了特征像素点中，这将会导致匹配结果不稳定。第六阈值μ_s越大，通过筛选的网格越少，特征像素点的个数一定会减少，匹配计算的耗时相应减少，但用于匹配的特征像素点可能不够充足，导致匹配结果不准确。因此设定的第六阈值μ_s，应使得金字塔顶层图像至少能建立三层四叉树进行分割。The smaller the sixth threshold _μs is, the more grids are filtered, and the range of extracting feature pixels increases. More feature pixels can improve the matching accuracy, but it will waste computing and memory resources. At the same time, more pixels with insufficient information are also included in the feature pixels, which will lead to unstable matching results. The larger the sixth threshold _μs is, the fewer grids pass the screening, the number of feature pixels will be reduced, and the time-consuming matching calculation will be reduced accordingly, but the feature pixels used for matching may not be sufficient, resulting in matching results. Inaccurate. Therefore, the set sixth threshold μ _s should enable at least a three-level quadtree to be established for the top-level image of the pyramid for segmentation.

S102、确定第一网格中所有像素点的平均梯度幅度，并选取平均梯度幅度大于等于预设的第一阈值的第一网格作为第二网格。S102. Determine the average gradient magnitude of all the pixel points in the first grid, and select the first grid whose average gradient magnitude is greater than or equal to a preset first threshold as the second grid.

具体地，平均梯度幅度大于等于第一阈值的第一网格可初步确定包含边缘和纹理的局部特征点，通过第一阈值的设定筛选出第二网格用于后续确定局部特征点。步骤S102具体包括以下步骤：Specifically, a first grid with an average gradient magnitude greater than or equal to a first threshold can preliminarily determine local feature points including edges and textures, and a second grid can be screened for subsequent determination of local feature points by setting the first threshold. Step S102 specifically includes the following steps:

S1021、计算第一网格中所有像素点的梯度幅度总和，并根据第一网格中像素点的数量和梯度幅度总和确定平均梯度幅度；S1021, calculating the sum of the gradient magnitudes of all pixels in the first grid, and determining the average gradient magnitude according to the number of pixels in the first grid and the sum of the gradient magnitudes;

S1022、确定平均梯度幅度大于等于第一阈值，则获取对应的第一网格作为第二网格。S1022. It is determined that the average gradient magnitude is greater than or equal to the first threshold, and then acquire the corresponding first grid as the second grid.

具体地，计算第一网格中所有像素点的梯度幅度总和，梯度幅度总和除以网格中的像素点个数得到的平均梯度幅度，选取出的第二网格的平均梯度幅度应当大于等于第一阈值。Specifically, the sum of the gradient magnitudes of all the pixels in the first grid is calculated, and the average gradient magnitude obtained by dividing the sum of the gradient magnitudes by the number of pixels in the grid, the average gradient magnitude of the selected second grid should be greater than or equal to first threshold.

平均梯度幅度小于第一阈值，则很有可能为背景，或者是图像中缺少纹理和边缘的部分，在这些部分中无法提取出有充足信息量的特征点。If the average gradient magnitude is less than the first threshold, it is likely to be the background, or the parts of the image lacking texture and edges, and feature points with sufficient information cannot be extracted from these parts.

本发明实施例使用平均梯度幅度进行筛选，相当于图像的全局滤波器，过滤平均梯度幅度较小的网格，相当于不考虑图像中的少边缘少纹理或者无边缘无纹理的区域，缩小从图像中提取强特征点的范围。In this embodiment of the present invention, the average gradient amplitude is used for screening, which is equivalent to the global filter of the image. Filtering grids with a smaller average gradient amplitude is equivalent to ignoring the areas in the image with few edges and few textures or no edges and no textures. The range of extracting strong feature points in the image.

S103、确定第二网格中所有像素点的第一梯度幅度和第一梯度方向熵值，并根据第一梯度幅度和第一梯度方向熵值确定第一局部特征点，并根据第一局部特征点建立第一LBP直方图特征向量；S103. Determine the first gradient magnitude and the first gradient direction entropy value of all the pixel points in the second grid, and determine the first local feature point according to the first gradient magnitude and the first gradient direction entropy value, and according to the first local feature point to establish the first LBP histogram feature vector;

具体地，针对每一个筛选出的第二网格，基于梯度幅度和梯度方向熵值评估第二网格中像素点的特征信息量，检测出图像中属于边缘和纹理的局部特征点，然后根据该局部特征点建议LBP直方图特征向量。步骤S103具体包括以下步骤：Specifically, for each screened second grid, the feature information amount of the pixel points in the second grid is evaluated based on the gradient magnitude and gradient direction entropy value, and the local feature points belonging to edges and textures in the image are detected, and then according to This local feature point suggests the LBP histogram feature vector. Step S103 specifically includes the following steps:

S1031、确定第二网格中所有像素点的第一梯度幅度和第一梯度方向熵值；S1031, determine the first gradient magnitude and the first gradient direction entropy value of all pixels in the second grid;

S1032、获取预设的第二阈值和第三阈值，选取第一梯度幅度大于等于第二阈值且第一梯度方向熵值大于等于第三阈值的像素点作为第一局部特征点；S1032, obtaining a preset second threshold and a third threshold, and selecting pixels whose first gradient magnitude is greater than or equal to the second threshold and whose first gradient direction entropy value is greater than or equal to the third threshold as the first local feature point;

S1033、将第一局部特征点作为中心像素点提取出第一LBP直方图特征向量。S1033 , extracting a first LBP histogram feature vector by using the first local feature point as a central pixel point.

具体地，对于每一个筛选出的第二网格，一种基于梯度幅度和梯度方向熵方法被用于检测图像中属于边缘和纹理的局部特征点。一个像素点的梯度幅度越大，这个像素点包含的梯度信息越丰富；图像某一区域的梯度方向熵值越大，该区域的像素点的梯度方向变化越大，则图像角点存在于该区域的可能性加大，而角点处通常意味着目标轮廓改变，这能为模板匹配提供更多的特征信息。Specifically, for each filtered second grid, a gradient magnitude and gradient direction entropy method is used to detect local feature points belonging to edges and textures in the image. The larger the gradient magnitude of a pixel point, the richer the gradient information contained in this pixel point; the larger the gradient direction entropy value of a certain area of the image, the larger the gradient direction change of the pixel point in this area, the image corners exist in this area. The probability of the region is increased, and the corner point usually means that the target contour has changed, which can provide more feature information for template matching.

计算像素点的梯度方向熵，以该像素点为n*n区域的中心，n可设为3，即计算9个像素点的梯度方向的熵值。Calculate the gradient direction entropy of a pixel point, take the pixel point as the center of the n*n area, and n can be set to 3, that is, calculate the entropy value of the gradient direction of 9 pixel points.

在每个第二网格中选择出梯度幅度和梯度方向熵分别大于对应阈值的像素点，作为局部特征点。In each second grid, pixels whose gradient magnitude and gradient direction entropy are respectively greater than the corresponding threshold are selected as local feature points.

进一步作为可选的实施方式，将第一局部特征点作为中心像素点提取出第一LBP直方图特征向量这一步骤，其具体为：Further as an optional embodiment, the step of extracting the first LBP histogram feature vector by using the first local feature point as the central pixel point is specifically:

将第一局部特征点作为中心像素点，并结合LBP等价模式下uniform≤2的九种模式，建立第一局部特征点的第一LBP直方图特征向量。Taking the first local feature point as the center pixel, and combining the nine modes of uniform≤2 in the LBP equivalent mode, the first LBP histogram feature vector of the first local feature point is established.

具体地，将LBP等价模式下uniform≤2的九种模式，作为稳定旋转模式，共有9类。其余不属于稳定旋转模式都归为另一类，因此共有10类二值模式用于提取区域内中心像素点的直方图特征向量。Specifically, the nine modes with uniform≤2 in the LBP equivalent mode are regarded as stable rotation modes, and there are nine categories in total. The rest that do not belong to the stable rotation mode are classified into another category, so there are 10 types of binary modes used to extract the histogram feature vector of the central pixel in the region.

图像的一个3*3区域内，中心像素点的灰度值与周围8个像素点进行比较，周围像素点比中心像素点的灰度值大则标记为1，否则标记为0。因此8个像素点中连续被标记为1的个数会有0、1、2...8，共9类，分别对应着LBP等价模式下uniform≤2的九种模式，作为稳定旋转模式。其余不属于稳定旋转模式都归为另一类，因此共有10类二值模式用于提取每个局部特征点的直方图特征向量。In a 3*3 area of the image, the gray value of the central pixel is compared with the surrounding 8 pixels. If the gray value of the surrounding pixel is greater than that of the central pixel, it is marked as 1, otherwise it is marked as 0. Therefore, the number of consecutively marked 1 in the 8 pixels will be 0, 1, 2...8, a total of 9 categories, corresponding to the nine modes of uniform≤2 in the LBP equivalent mode, as the stable rotation mode . The rest that do not belong to stable rotation patterns are classified into another category, so there are 10 types of binary patterns used to extract the histogram feature vector of each local feature point.

同样，半径为r的区域大小为R_r*R_r，r＝1,2,3,...，R_r＝2r+1，则每一个半径为r的区域都有R_r*R_r个像素，除中心像素点以外，区域内共有R_r*R_r-1个像素点通过10类二值模式提取中心像素点的直方图特征向量。Similarly, the size of the area with radius r is R _r *R _r , r=1,2,3,..., R _r =2r+1, then each area with radius r has R _r *R _r Pixels, except for the center pixel, there are a total of R _r *R _r -1 pixels in the area, and the histogram feature vector of the center pixel is extracted by 10 types of binary mode.

对模板图金字塔中的所有图像检测局部特征点，每个局部特征点作为提取LBP特征区域的中心像素点，提取中心像素点的第一LBP直方图特征向量。Local feature points are detected for all images in the template image pyramid, each local feature point is used as the central pixel point of the extracted LBP feature area, and the first LBP histogram feature vector of the central pixel point is extracted.

获取第一LBP直方图特征向量的区域半径越大，包含的信息越丰富，但是提取特征向量所需的时间增加。The larger the radius of the region where the feature vector of the first LBP histogram is obtained, the richer the information contained, but the time required to extract the feature vector increases.

对模板图金字塔中的所有图像检测局部特征点，以每个局部特征点为区域中心，提取改进LBP直方图特征向量。Detect local feature points for all images in the template image pyramid, and extract the improved LBP histogram feature vector with each local feature point as the center of the region.

S104、获取目标图像，根据目标图像构建目标图像金字塔，并确定目标图像金字塔的第二局部特征点，进而根据第二局部特征点建立第二LBP直方图特征向量，再根据第一LBP直方图特征向量和第二LBP直方图特征向量的相似度匹配确定待定位目标的位置和旋转角度。S104: Acquire a target image, construct a target image pyramid according to the target image, and determine a second local feature point of the target image pyramid, and then establish a second LBP histogram feature vector according to the second local feature point, and then according to the first LBP histogram feature The similarity matching between the vector and the second LBP histogram feature vector determines the position and rotation angle of the target to be located.

具体地，对目标图像的处理与模板图像处理类似，同样先构建目标图像金字塔，然后确定第二局部特征点，进而建议第二LBP直方图特征向量，然后进行相似度匹配，根据匹配的结果确定待定位目标的位置和旋转角度。步骤S104具体包括以下步骤：Specifically, the processing of the target image is similar to the processing of the template image. The target image pyramid is also constructed first, and then the second local feature points are determined, and then the second LBP histogram feature vector is suggested, and then similarity matching is performed, which is determined according to the matching result. The position and rotation angle of the target to be positioned. Step S104 specifically includes the following steps:

S1041、获取目标图像，根据目标图像构建目标图像金字塔，并根据目标图像金字塔构建第三网格；S1041, obtaining a target image, constructing a target image pyramid according to the target image, and constructing a third grid according to the target image pyramid;

S1042、确定第三网格中所有像素点的平均梯度幅度，并选取平均梯度幅度大于等于预设的第四阈值的第三网格作为第四网格；S1042, determine the average gradient magnitude of all pixels in the third grid, and select the third grid whose average gradient magnitude is greater than or equal to a preset fourth threshold as the fourth grid;

S1043、确定第四网格中所有像素点的第二梯度幅度和第二梯度方向熵值，并根据第二梯度幅度和第二梯度方向熵值确定第二局部特征点，并根据第二局部特征点建立第二LBP直方图特征向量；S1043: Determine the second gradient magnitude and the second gradient direction entropy value of all the pixel points in the fourth grid, and determine the second local feature point according to the second gradient magnitude and the second gradient direction entropy value, and according to the second local feature point to establish the second LBP histogram feature vector;

S1044、计算模板图像金字塔和目标图像金字塔中每一层的第一LBP直方图特征向量和对应位置的第二LBP直方图特征向量的相似度，从顶层到底层依次进行相似度匹配，并将相似度小于预设的第五阈值的第二局部特征点舍去，进而在目标图像金字塔的底层筛选出相似度大于等于第五阈值的第二局部特征点，从而确定待定位目标的位置和旋转角度。S1044: Calculate the similarity between the first LBP histogram feature vector of each layer in the template image pyramid and the target image pyramid and the second LBP histogram feature vector at the corresponding position, perform similarity matching from the top layer to the bottom layer in sequence, and compare the similarity The second local feature points whose degree is less than the preset fifth threshold are discarded, and then the second local feature points whose similarity is greater than or equal to the fifth threshold are screened at the bottom of the target image pyramid, thereby determining the position and rotation angle of the target to be located. .

应当认识到，第二LBP直方图特征向量的建立过程与前述第一LBP直方图特征向量的建立过程基本类似，在此不做赘述。It should be recognized that the process of establishing the second LBP histogram feature vector is basically similar to the foregoing process of establishing the first LBP histogram feature vector, and details are not described herein.

具体地，获取待定位的目标图像后，构建与模板图像金字塔相同金字塔层数的目标图像金字塔，建立目标图像金字塔的第二LBP直方图特征向量，从目标图像金字塔和模板图像金字塔的顶层开始，在目标图像上滑动候选窗口，候选窗口中已记录模板图像中所有局部特征点的相对位置信息，不断将候选窗口的局部特征点的特征向量和对应目标图像上的特征向量进行匹配，通过模板图像的所有局部特征点的直方图特征向量(即第一LBP直方图特征向量)与目标图像对应位置的特征点的直方图特征向量(即第二LBP直方图特征向量)计算相似度。Specifically, after acquiring the target image to be positioned, constructing a target image pyramid with the same number of pyramid layers as the template image pyramid, establishing a second LBP histogram feature vector of the target image pyramid, starting from the top layer of the target image pyramid and the template image pyramid, Sliding the candidate window on the target image, the relative position information of all local feature points in the template image has been recorded in the candidate window, and the eigenvectors of the local feature points of the candidate window are continuously matched with the eigenvectors on the corresponding target image. The similarity is calculated between the histogram feature vectors of all local feature points (ie the first LBP histogram feature vector) and the histogram feature vectors of the feature points at the corresponding positions of the target image (ie the second LBP histogram feature vector).

每一层可能有多个结果，对于相似度低于设定的第五阈值的结果舍去，最终在金字塔底层的相似度匹配完成后检测出相似度高且稳定的局部特征点，从而根据该局部特征点获得定待位目标的位置和旋转角度。Each layer may have multiple results, and the results whose similarity is lower than the set fifth threshold are discarded. Finally, after the similarity matching at the bottom of the pyramid is completed, local feature points with high similarity and stable are detected. The local feature points obtain the position and rotation angle of the target to be positioned.

以上对本发明实施例的方法步骤进行了说明。本发明实施例提供了一种基于模板匹配的目标定位方法，首先，基于梯度幅度和梯度方向，检测出目标图像和模板图像中属于边缘和纹理的特征点；其次，使用一种LBP改进算法，从检测出的特征点中筛选出旋转稳定的局部特征点并建立LBP直方图特征向量；最后，采用一种缩小范围的搜索方法，基于检测出的稳定局部特征点，从金字塔顶层到底层逐层筛选，确定出相似度高于预设阈值且稳定的局部特征点，从而获得准确的目标位置和旋转角度。本发明实施例能够在目标图像存在模糊、遮挡、噪声、复杂的背景以及角度旋转的情况下，准确计算出目标位置和目标旋转角度。The method steps in the embodiments of the present invention are described above. The embodiment of the present invention provides a target positioning method based on template matching. First, based on the gradient magnitude and gradient direction, the feature points belonging to edges and textures in the target image and the template image are detected; secondly, an improved LBP algorithm is used, The rotation-stable local feature points are screened out from the detected feature points and the LBP histogram feature vector is established; finally, a narrow-range search method is adopted, based on the detected stable local feature points, from the top of the pyramid to the bottom layer by layer Screening to determine the local feature points whose similarity is higher than the preset threshold and stable, so as to obtain the accurate target position and rotation angle. The embodiment of the present invention can accurately calculate the target position and the target rotation angle when the target image has blur, occlusion, noise, complex background and angle rotation.

参照图2，本发明实施例提供了一种基于模板匹配的目标定位系统，包括：Referring to FIG. 2, an embodiment of the present invention provides a target positioning system based on template matching, including:

网格构建模块，用于获取模板图像，根据模板图像构建模板图像金字塔，并根据模板图像金字塔构建第一网格；The grid construction module is used for obtaining the template image, constructing the template image pyramid according to the template image, and constructing the first grid according to the template image pyramid;

网格选取模块，用于确定第一网格中所有像素点的平均梯度幅度，并选取平均梯度幅度大于等于预设的第一阈值的第一网格作为第二网格；The grid selection module is used to determine the average gradient magnitude of all the pixels in the first grid, and select the first grid whose average gradient magnitude is greater than or equal to the preset first threshold as the second grid;

LBP直方图特征向量建立模块，用于确定第二网格中所有像素点的第一梯度幅度和第一梯度方向熵值，并根据第一梯度幅度和第一梯度方向熵值确定第一局部特征点，并根据第一局部特征点建立第一LBP直方图特征向量；The LBP histogram feature vector building module is used to determine the first gradient magnitude and the first gradient direction entropy value of all pixels in the second grid, and determine the first local feature according to the first gradient magnitude and the first gradient direction entropy value point, and establish the first LBP histogram feature vector according to the first local feature point;

相似度匹配模块，用于获取目标图像，根据目标图像构建目标图像金字塔，并确定目标图像金字塔的第二局部特征点，进而根据第二局部特征点建立第二LBP直方图特征向量，再根据第一LBP直方图特征向量和第二LBP直方图特征向量的相似度匹配确定待定位目标的位置和旋转角度。The similarity matching module is used to obtain the target image, construct the target image pyramid according to the target image, and determine the second local feature point of the target image pyramid, and then establish the second LBP histogram feature vector according to the second local feature point, and then according to the second local feature point. The similarity matching between the feature vector of the first LBP histogram and the feature vector of the second LBP histogram determines the position and rotation angle of the target to be located.

上述方法实施例中的内容均适用于本系统实施例中，本系统实施例所具体实现的功能与上述方法实施例相同，并且达到的有益效果与上述方法实施例所达到的有益效果也相同。The contents in the above method embodiments are all applicable to the present system embodiments, the specific functions implemented by the present system embodiments are the same as the above method embodiments, and the beneficial effects achieved are also the same as those achieved by the above method embodiments.

参照图3，本发明实施例提供了一种基于模板匹配的目标定位装置，包括：Referring to FIG. 3, an embodiment of the present invention provides a target positioning device based on template matching, including:

至少一个处理器；at least one processor;

当上述至少一个程序被上述至少一个处理器执行时，使得上述至少一个处理器实现上述的一种基于模板匹配的目标定位方法。When the above-mentioned at least one program is executed by the above-mentioned at least one processor, the above-mentioned at least one processor is caused to implement the above-mentioned method for target positioning based on template matching.

上述方法实施例中的内容均适用于本装置实施例中，本装置实施例所具体实现的功能与上述方法实施例相同，并且达到的有益效果与上述方法实施例所达到的有益效果也相同。The contents in the above method embodiments are all applicable to the present device embodiments, the specific functions implemented by the present device embodiments are the same as the above method embodiments, and the beneficial effects achieved are also the same as those achieved by the above method embodiments.

本发明实施例还提供了一种计算机可读存储介质，其中存储有处理器可执行的程序，该处理器可执行的程序在由处理器执行时用于执行上述一种基于模板匹配的目标定位方法。An embodiment of the present invention further provides a computer-readable storage medium, in which a program executable by a processor is stored, and the program executable by the processor, when executed by the processor, is used to perform the above-mentioned target positioning based on template matching method.

本发明实施例的一种计算机可读存储介质，可执行本发明方法实施例所提供的一种基于模板匹配的目标定位方法，可执行方法实施例的任意组合实施步骤，具备该方法相应的功能和有益效果。A computer-readable storage medium according to an embodiment of the present invention can execute a template matching-based target positioning method provided by the method embodiment of the present invention, can execute any combination of implementation steps of the method embodiment, and has corresponding functions of the method. and beneficial effects.

本发明实施例还公开了一种计算机程序产品或计算机程序，该计算机程序产品或计算机程序包括计算机指令，该计算机指令存储在计算机可读存介质中。计算机设备的处理器可以从计算机可读存储介质读取该计算机指令，处理器执行该计算机指令，使得该计算机设备执行图1所示的方法。The embodiment of the present invention also discloses a computer program product or computer program, where the computer program product or computer program includes computer instructions, and the computer instructions are stored in a computer-readable storage medium. A processor of the computer device can read the computer instructions from the computer-readable storage medium, and the processor executes the computer instructions to cause the computer device to perform the method shown in FIG. 1 .

在一些可选择的实施例中，在方框图中提到的功能/操作可以不按照操作示图提到的顺序发生。例如，取决于所涉及的功能/操作，连续示出的两个方框实际上可以被大体上同时地执行或上述方框有时能以相反顺序被执行。此外，在本发明的流程图中所呈现和描述的实施例以示例的方式被提供，目的在于提供对技术更全面的理解。所公开的方法不限于本文所呈现的操作和逻辑流程。可选择的实施例是可预期的，其中各种操作的顺序被改变以及其中被描述为较大操作的一部分的子操作被独立地执行。In some alternative implementations, the functions/operations noted in the block diagrams may occur out of the order noted in the operational diagrams. For example, two blocks shown in succession may, in fact, be executed substantially concurrently or the blocks may sometimes be executed in the reverse order, depending upon the functionality/operations involved. Furthermore, the embodiments presented and described in the flowcharts of the present invention are provided by way of example in order to provide a more comprehensive understanding of the technology. The disclosed methods are not limited to the operations and logic flows presented herein. Alternative embodiments are contemplated in which the order of the various operations are altered and in which sub-operations described as part of larger operations are performed independently.

此外，虽然在功能性模块的背景下描述了本发明，但应当理解的是，除非另有相反说明，上述的功能和/或特征中的一个或多个可以被集成在单个物理装置和/或软件模块中，或者一个或多个功能和/或特征可以在单独的物理装置或软件模块中被实现。还可以理解的是，有关每个模块的实际实现的详细讨论对于理解本发明是不必要的。更确切地说，考虑到在本文中公开的装置中各种功能模块的属性、功能和内部关系的情况下，在工程师的常规技术内将会了解该模块的实际实现。因此，本领域技术人员运用普通技术就能够在无需过度试验的情况下实现在权利要求书中所阐明的本发明。还可以理解的是，所公开的特定概念仅仅是说明性的，并不意在限制本发明的范围，本发明的范围由所附权利要求书及其等同方案的全部范围来决定。Furthermore, although the invention is described in the context of functional modules, it is to be understood that, unless stated to the contrary, one or more of the above-described functions and/or features may be integrated in a single physical device and/or In software modules, or one or more functions and/or features may be implemented in separate physical devices or software modules. It will also be appreciated that a detailed discussion of the actual implementation of each module is not necessary to understand the present invention. Rather, given the attributes, functions, and internal relationships of the various functional modules in the apparatus disclosed herein, the actual implementation of the modules will be within the routine skill of the engineer. Accordingly, those skilled in the art, using ordinary skill, can implement the invention as set forth in the claims without undue experimentation. It is also to be understood that the specific concepts disclosed are illustrative only and are not intended to limit the scope of the invention, which is to be determined by the appended claims along with their full scope of equivalents.

上述功能如果以软件功能单元的形式实现并作为独立的产品销售或使用时，可以存储在一个计算机可读取存储介质中。基于这样的理解，本发明的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的部分可以以软件产品的形式体现出来，该计算机软件产品存储在一个存储介质中，包括若干指令用以使得一台计算机设备(可以是个人计算机，服务器，或者网络设备等)执行本发明各个实施例上述方法的全部或部分步骤。而前述的存储介质包括：U盘、移动硬盘、只读存储器(ROM，Read-Only Memory)、随机存取存储器(RAM，Random Access Memory)、磁碟或者光盘等各种可以存储程序代码的介质。If the above functions are implemented in the form of software functional units and sold or used as independent products, they may be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present invention can be embodied in the form of a software product in essence, or the part that contributes to the prior art or the part of the technical solution. The computer software product is stored in a storage medium, including Several instructions are used to cause a computer device (which may be a personal computer, a server, or a network device, etc.) to execute all or part of the steps of the above-mentioned methods in various embodiments of the present invention. The aforementioned storage medium includes: U disk, mobile hard disk, Read-Only Memory (ROM, Read-Only Memory), Random Access Memory (RAM, Random Access Memory), magnetic disk or optical disk and other media that can store program codes .

在流程图中表示或在此以其他方式描述的逻辑和/或步骤，例如，可以被认为是用于实现逻辑功能的可执行指令的定序列表，可以具体实现在任何计算机可读介质中，以供指令执行系统、装置或设备(如基于计算机的系统、包括处理器的系统或其他可以从指令执行系统、装置或设备取指令并执行指令的系统)使用，或结合这些指令执行系统、装置或设备而使用。就本说明书而言，“计算机可读介质”可以是任何可以包含、存储、通信、传播或传输程序以供指令执行系统、装置或设备或结合这些指令执行系统、装置或设备而使用的装置。The logic and/or steps represented in flowcharts or otherwise described herein, for example, may be considered an ordered listing of executable instructions for implementing the logical functions, may be embodied in any computer-readable medium, For use with, or in conjunction with, an instruction execution system, apparatus, or device (such as a computer-based system, a system including a processor, or other system that can fetch instructions from and execute instructions from an instruction execution system, apparatus, or apparatus) or equipment. For the purposes of this specification, a "computer-readable medium" can be any device that can contain, store, communicate, propagate, or transport the program for use by or in connection with an instruction execution system, apparatus, or apparatus.

计算机可读介质的更具体的示例(非穷尽性列表)包括以下：具有一个或多个布线的电连接部(电子装置)，便携式计算机盘盒(磁装置)，随机存取存储器(RAM)，只读存储器(ROM)，可擦除可编辑只读存储器(EPROM或闪速存储器)，光纤装置，以及便携式光盘只读存储器(CDROM)。另外，计算机可读介质甚至可以是可在其上打印上述程序的纸或其他合适的介质，因为可以例如通过对纸或其他介质进行光学扫描，接着进行编辑、解译或必要时以其他合适方式进行处理来以电子方式获得上述程序，然后将其存储在计算机存储器中。More specific examples (non-exhaustive list) of computer readable media include the following: electrical connections with one or more wiring (electronic devices), portable computer disk cartridges (magnetic devices), random access memory (RAM), Read Only Memory (ROM), Erasable Editable Read Only Memory (EPROM or Flash Memory), Fiber Optic Devices, and Portable Compact Disc Read Only Memory (CDROM). In addition, the computer readable medium may even be paper or other suitable medium on which the above-mentioned program can be printed, as it is possible, for example, by optically scanning the paper or other medium, followed by editing, interpretation or other suitable means if necessary Processing is performed to obtain the above program electronically and then stored in computer memory.

应当理解，本发明的各部分可以用硬件、软件、固件或它们的组合来实现。在上述实施方式中，多个步骤或方法可以用存储在存储器中且由合适的指令执行系统执行的软件或固件来实现。例如，如果用硬件来实现，和在另一实施方式中一样，可用本领域公知的下列技术中的任一项或他们的组合来实现：具有用于对数据信号实现逻辑功能的逻辑门电路的离散逻辑电路，具有合适的组合逻辑门电路的专用集成电路，可编程门阵列(PGA)，现场可编程门阵列(FPGA)等。It should be understood that various parts of the present invention may be implemented in hardware, software, firmware or a combination thereof. In the above-described embodiments, various steps or methods may be implemented in software or firmware stored in memory and executed by a suitable instruction execution system. For example, if implemented in hardware, as in another embodiment, it can be implemented by any one or a combination of the following techniques known in the art: Discrete logic circuits, application specific integrated circuits with suitable combinational logic gates, Programmable Gate Arrays (PGA), Field Programmable Gate Arrays (FPGA), etc.

在本说明书的上述描述中，参考术语“一个实施方式/实施例”、“另一实施方式/实施例”或“某些实施方式/实施例”等的描述意指结合实施方式或示例描述的具体特征、结构、材料或者特点包含于本发明的至少一个实施方式或示例中。在本说明书中，对上述术语的示意性表述不一定指的是相同的实施方式或示例。而且，描述的具体特征、结构、材料或者特点可以在任何的一个或多个实施方式或示例中以合适的方式结合。In the above description of the present specification, reference to the description of the terms "one embodiment/example", "another embodiment/example" or "certain embodiments/examples" etc. means the description in conjunction with the embodiment or example. Particular features, structures, materials, or characteristics are included in at least one embodiment or example of the present invention. In this specification, schematic representations of the above terms do not necessarily refer to the same embodiment or example. Furthermore, the particular features, structures, materials or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.

尽管已经示出和描述了本发明的实施方式，本领域的普通技术人员可以理解：在不脱离本发明的原理和宗旨的情况下可以对这些实施方式进行多种变化、修改、替换和变型，本发明的范围由权利要求及其等同物限定。Although embodiments of the present invention have been shown and described, it will be understood by those of ordinary skill in the art that various changes, modifications, substitutions and alterations can be made in these embodiments without departing from the principles and spirit of the invention, The scope of the invention is defined by the claims and their equivalents.

以上是对本发明的较佳实施进行了具体说明，但本发明并不限于上述实施例，熟悉本领域的技术人员在不违背本发明精神的前提下还可做作出种种的等同变形或替换，这些等同的变形或替换均包含在本申请权利要求所限定的范围内。The above is a specific description of the preferred implementation of the present invention, but the present invention is not limited to the above-mentioned embodiments, and those skilled in the art can also make various equivalent deformations or replacements on the premise of not violating the spirit of the present invention. Equivalent modifications or substitutions are included within the scope defined by the claims of the present application.

Claims

1. A target positioning method based on template matching is characterized by comprising the following steps:

acquiring a template image, constructing a template image pyramid according to the template image, and constructing a first grid according to the template image pyramid;

determining the average gradient amplitude of all pixel points in the first grid, and selecting the first grid with the average gradient amplitude being greater than or equal to a preset first threshold value as a second grid;

determining first gradient amplitude and first gradient direction entropy of all pixel points in the second grid, determining a first local feature point according to the first gradient amplitude and the first gradient direction entropy, and establishing a first LBP histogram feature vector according to the first local feature point;

acquiring a target image, constructing a target image pyramid according to the target image, determining a second local feature point of the target image pyramid, further establishing a second LBP histogram feature vector according to the second local feature point, and determining the position and the rotation angle of a target to be positioned according to the similarity matching of the first LBP histogram feature vector and the second LBP histogram feature vector;

the method comprises the steps of obtaining a target image, constructing a target image pyramid according to the target image, determining a second local feature point of the target image pyramid, further establishing a second LBP histogram feature vector according to the second local feature point, and determining the position and the rotating angle of a target to be positioned according to the similarity matching of the first LBP histogram feature vector and the second LBP histogram feature vector, wherein the steps specifically comprise:

acquiring a target image, constructing a target image pyramid according to the target image, and constructing a third grid according to the target image pyramid;

determining the average gradient amplitude of all pixel points in the third grid, and selecting the third grid with the average gradient amplitude being greater than or equal to a preset fourth threshold value as a fourth grid;

determining second gradient magnitude and second gradient direction entropy of all pixel points in the fourth grid, determining second local feature points according to the second gradient magnitude and the second gradient direction entropy, and establishing second LBP histogram feature vectors according to the second local feature points;

and calculating the similarity of the first LBP histogram feature vector of each layer in the template image pyramid and the target image pyramid and the second LBP histogram feature vector of the corresponding position, sequentially performing similarity matching from the top layer to the bottom layer, eliminating second local feature points of which the similarity is smaller than a preset fifth threshold value, and screening out second local feature points of which the similarity is larger than or equal to the fifth threshold value from the bottom layer of the target image pyramid, thereby determining the position and the rotation angle of the target to be positioned.

2. The target positioning method based on template matching according to claim 1, wherein the step of obtaining a template image, constructing a template image pyramid according to the template image, and constructing a first grid according to the template image pyramid specifically comprises:

acquiring a template image, and expanding the width and the height of the template image to obtain an expanded template image;

downsampling the extended template image to generate a template image pyramid;

extracting a first pyramid image of each layer in the template image pyramid, and constructing a first grid in a self-adaptive manner through quadtree segmentation according to the first pyramid image;

wherein the first pyramid image is covered by a corresponding first grid.

3. The template matching-based target positioning method according to claim 2, wherein in the step of adaptively constructing the first mesh from the first pyramid image through quadtree partitioning, when the width or height of the first mesh is smaller than a preset sixth threshold, the quadtree partitioning is not performed, and the sixth threshold satisfies the following condition:

wherein,μ _sa sixth threshold value is indicated which is,W _TLthe width of the top level image representing the template image pyramid,H _TLrepresenting the height of the topmost image of the template image pyramid.

4. The template matching-based target positioning method according to claim 1, wherein the step of determining an average gradient magnitude of all pixel points in the first mesh, and selecting the first mesh having the average gradient magnitude greater than or equal to a preset first threshold as the second mesh specifically includes:

calculating the sum of the gradient amplitudes of all the pixel points in the first grid, and determining the average gradient amplitude according to the number of the pixel points in the first grid and the sum of the gradient amplitudes;

and if the average gradient amplitude is determined to be larger than or equal to the first threshold, acquiring the corresponding first grid as a second grid.

5. The template matching-based target positioning method according to claim 1, wherein the step of determining a first gradient magnitude and a first gradient direction entropy of all pixel points in the second mesh, determining a first local feature point according to the first gradient magnitude and the first gradient direction entropy, and establishing a first LBP histogram feature vector according to the first local feature point specifically comprises:

determining first gradient amplitude and first gradient direction entropy values of all pixel points in the second grid;

acquiring a preset second threshold and a preset third threshold, and selecting pixel points of which the first gradient amplitude is greater than or equal to the second threshold and the first gradient direction entropy is greater than or equal to the third threshold as first local feature points;

and extracting a first LBP histogram feature vector by taking the first local feature point as a central pixel point.

6. The target positioning method based on template matching according to claim 5, wherein the step of extracting the first LBP histogram feature vector by using the first local feature point as a central pixel point specifically comprises:

and establishing a first local feature point LBP histogram feature vector by taking the first local feature point as a central pixel point and combining nine modes of which the uniform is less than or equal to 2 in the LBP equivalent mode.

7. An object positioning system based on template matching, comprising:

the grid construction module is used for acquiring a template image, constructing a template image pyramid according to the template image and constructing a first grid according to the template image pyramid;

the grid selection module is used for determining the average gradient amplitude of all pixel points in the first grid and selecting the first grid with the average gradient amplitude being greater than or equal to a preset first threshold value as a second grid;

an LBP histogram feature vector establishing module, configured to determine first gradient magnitudes and first gradient direction entropy values of all pixel points in the second grid, determine a first local feature point according to the first gradient magnitudes and the first gradient direction entropy values, and establish a first LBP histogram feature vector according to the first local feature point;

the similarity matching module is used for acquiring a target image, constructing a target image pyramid according to the target image, determining a second local feature point of the target image pyramid, further establishing a second LBP histogram feature vector according to the second local feature point, and determining the position and the rotation angle of the target to be positioned according to the similarity matching of the first LBP histogram feature vector and the second LBP histogram feature vector;

the similarity matching module is specifically configured to:

determining second gradient amplitude and second gradient direction entropy values of all pixel points in the fourth grid, determining second local feature points according to the second gradient amplitude and the second gradient direction entropy values, and establishing second LBP histogram feature vectors according to the second local feature points;

8. An object positioning device based on template matching, comprising:

at least one processor;

at least one memory for storing at least one program;

when executed by the at least one processor, cause the at least one processor to implement a method of target location based on template matching as claimed in any one of claims 1 to 6.

9. A computer readable storage medium, in which a processor executable program is stored, wherein the processor executable program, when executed by a processor, is adapted to perform a method of template matching based object localization according to any of claims 1 to 6.