
1 Introduction

In industrial automation, template matching is a well-known machine vision approach for estimating the pose of objects. As demands for production efficiency have increased, the ability to efficiently match multiple objects with different rotation angles in an image has become increasingly important, especially in integrated circuit (IC) manufacturing. Traditional matching algorithms can no longer meet these demands when objects are arranged closely under varying illumination conditions.

Over the last two decades, various template matching algorithms have been proposed [1–3]. The most popular is normalized cross correlation (NCC) [4], which compensates for both additive and multiplicative variations under uniform illumination changes. However, because NCC lacks rotation invariance, a set of templates at different orientations is required whenever an object in the target image is rotated with respect to the template; this procedure is brute-force and time-consuming. To increase search speed under different rotation angles, algorithms based on salient and distinctive features such as regions, edges, contours and points have been proposed to achieve rotation invariance. Invariant moment matching methods [5–7] are effective, especially for rotated binary patterns, but they perform poorly under noise and occlusion. Geometric hashing (GH) [8] works well for objects with simple geometric structure, but it is sensitive to noise and complex shapes. The generalized Hough transform (GHT) [9], which votes using local edge evidence, is robust against rotation and scaling. However, it needs enormous memory to store the voting space from which the maximum-score point is selected, and it cannot handle matching multiple rotated objects.

Many algorithms using gradient information extracted from the image have been proposed. These algorithms are robust to illumination changes and are widely used in object recognition [10]. Histograms of Oriented Gradients (HOG) [11] significantly outperforms earlier feature sets for human detection, but it struggles to recognize objects rotated by large angles. More recent matching algorithms based on scale- and rotation-invariant keypoints with histograms of gradient information, such as SIFT [12, 13], PCA-SIFT [14], SURF [15] and GLOH [16, 17], achieve impressive recognition performance. However, they fail to find simple shapes with little grayscale variation, and their computational complexity is prohibitive for industrial application.

Orientation codes [18] use gradient orientation histograms to search for a rotated object under illumination fluctuations. The method consists of two stages. The first stage estimates the approximate rotation angle of the object image using histograms of the orientation codes. In the second stage, orientation code matching at the estimated orientation is applied only to the best histogram matches. Marion [19] presented a much faster technique using integral histograms. However, gradient orientation histograms are not intrinsically rotation invariant, and a histogram shift is necessary to find the best angle. Thus, this algorithm cannot deal with arrays of closely arranged objects.

This paper proposes a radial ring code histograms (RRCH) algorithm for multi-object template matching. The algorithm is robust to position translation, angle rotation and illumination changes. By using radial gradient codes, defined as the relative angle between the gradient direction and the position vector, it performs better than the original orientation codes algorithm and improves the discrimination of multiple objects. Distance ring projections are adopted to reduce interference from surrounding clutter. Various types of objects can be matched by adapting adjustable weights in the different regions. The proposed method is invariant to illumination because it uses gradient information rather than pixel brightness directly. In addition, the method is well suited to the multi-object coarse-search step, producing small candidate areas that can then be refined by fine-matching steps.

The rest of this paper is organized as follows: Sect. 2 presents the problem about conventional orientation codes and describes the proposed RRCH algorithm. The implementation details and experimental results are given in Sect. 3. Conclusions follow in Sect. 4.

2 Matching with Radial Ring Code Histograms

The gradient orientation histograms [19] applied in the coarse search are not effective for finding multiple objects. As shown in Fig. 1(a), the subimages under the red circle and the blue circle are obviously different. However, the gradient orientation histograms of the two subimages are similar, as shown in Fig. 1(b). This demonstrates that gradient orientation histograms cannot discriminate well when multiple objects are arranged closely.

Fig. 1. The problem in searching multiple objects: (a) two circled regions in the sample object image, (b) orientation histograms of the two regions (color figure online).

The radial ring code histogram is proposed to tackle this problem. It is based on gradient information combined with spatial structure. First, the gradient direction is transformed into radial gradient codes. Then, ring distance projection is utilized to acquire the radial ring codes. Next, radial ring code histograms are calculated. By comparing histogram similarity between the template image and the target image, the positions of multiple objects are estimated. The detailed procedure is described in this section.

2.1 Radial Ring Codes

Gradient information is chosen to generate the descriptor because of its low sensitivity to illumination changes. Various operators, such as the Sobel operator, can be used to calculate the horizontal derivative \( \nabla f_x \) and vertical derivative \( \nabla f_y \), from which the gradient direction \( \varPhi (x,y) = \tan ^{-1} (\nabla f_y/\nabla f_x) \) and gradient magnitude \( \varOmega (x,y) = \sqrt{\nabla f_y^2+\nabla f_x^2}\) are computed.
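As a concrete sketch of this step, the following Python/NumPy function computes the gradient maps with Sobel kernels. The function name `gradients` and the edge-replication padding are illustrative choices, not part of the paper's implementation.

```python
import numpy as np

def gradients(img):
    """Per-pixel gradient direction and magnitude via Sobel correlation.

    img: 2-D array. Returns (phi, omega) where phi = arctan2(fy, fx)
    lies in [-pi, pi] and omega = sqrt(fx^2 + fy^2).
    """
    kx = np.array([[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]], dtype=float)
    ky = kx.T  # vertical-derivative kernel
    pad = np.pad(img.astype(float), 1, mode="edge")
    h, w = img.shape
    fx = np.zeros((h, w))
    fy = np.zeros((h, w))
    for dy in range(3):           # correlate with both 3x3 kernels
        for dx in range(3):
            win = pad[dy:dy + h, dx:dx + w]
            fx += kx[dy, dx] * win
            fy += ky[dy, dx] * win
    phi = np.arctan2(fy, fx)      # gradient direction Phi(x, y)
    omega = np.hypot(fx, fy)      # gradient magnitude Omega(x, y)
    return phi, omega
```

Using `arctan2` rather than `arctan(fy/fx)` keeps the full \([-\pi, \pi]\) direction range mentioned below and avoids division by zero.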

A circular effective region is selected to obtain rotation invariance. The center of the circular region is regarded as the rotation reference point. The current point P in the region and the reference point O are linked by a straight line. The angle between this line and the gradient direction at P is called the radial gradient angle \(\alpha \), as shown in Fig. 2. If the image is rotated counterclockwise by an angle \( \theta \), P is transformed into \(P^{'}\). Obviously, the relative angle \( \alpha \) does not change with the rotation.

Fig. 2. Radial gradient angle.

Table 1. Radial gradient codes

The range of the gradient direction is \( [-\pi , \pi ] \), while the radial gradient angle lies in \( [0, \pi ] \). To reduce the amount of computation, the radial gradient angles are quantized into radial gradient codes using Eq. (1):

$$\begin{aligned} \varUpsilon (x,y) = round(\frac{\alpha (x,y)}{\triangle \alpha }),{\,}when \quad \varOmega (x,y) > T \end{aligned}$$
(1)

The radial angle range is divided into N groups; N should be chosen neither too large nor too small, considering rotation errors. To suppress the effect of noise, the radial gradient code is computed only when the magnitude is larger than a threshold T. The angle step is \( \triangle \alpha = \pi /N\); the relationship between radial gradient angles and radial gradient codes is shown in Table 1.
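The quantization of Eq. (1) can be sketched as follows. The folding of the direction difference into \([0, \pi]\), the `-1` sentinel for pixels below the magnitude threshold, and the clamping of the boundary case \(\alpha = \pi\) into the last bin are implementation choices not specified by the paper.

```python
import numpy as np

def radial_codes(phi, omega, cx, cy, n_bins, thresh):
    """Quantize radial gradient angles into radial gradient codes (Eq. 1).

    phi, omega: gradient direction/magnitude maps (e.g. from a Sobel pass).
    (cx, cy): reference point O.  Pixels with omega <= thresh get code -1.
    """
    h, w = phi.shape
    ys, xs = np.mgrid[0:h, 0:w]
    radial = np.arctan2(ys - cy, xs - cx)      # direction of the line OP
    alpha = np.abs(phi - radial)               # relative angle
    alpha = np.where(alpha > np.pi, 2 * np.pi - alpha, alpha)  # fold to [0, pi]
    step = np.pi / n_bins                      # delta alpha
    codes = np.clip(np.round(alpha / step).astype(int), 0, n_bins - 1)
    return np.where(omega > thresh, codes, -1)
```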

The radial gradient angle at a point changes with the choice of reference point. In Fig. 3, the relative angles between \({{\varvec{g}}}_{{\varvec{1}}}\) and the line OA or the line \(O^{'}A\) are distinct. The amplitude of the angle change is related to the radial distance between the reference point and the current point: the smaller the radial distance, the more sharply the radial gradient angle changes. The evidence is that \( \angle O^{'}AO\) and \( \angle O^{'}BO\) are different. Thus, it is essential to treat different radial distances separately.

Ring distance projection serves as a partition step in our approach. As shown in Fig. 4, the effective region is segmented into two parts: an inner circle and an outer ring. If the reference point moves by one pixel, the change of the radial gradient angle in the outer ring should be smaller than \(\triangle \alpha \). The radius r of the inner circle can be calculated from Eq. (2).

$$\begin{aligned} \begin{array}{ll} r\ge \frac{1}{ \tan (\triangle \alpha ) }\\ r\le R \end{array} \end{aligned}$$
(2)

Here r is chosen as \(\frac{1}{2}R\). If there is only one region with radial gradient codes, the descriptor size is N; otherwise the descriptor dimension is 2N. An N-dimensional vector is sometimes insufficient to represent an image reliably for matching. The descriptor with radial ring codes increases the vector dimension to improve its representational capability while preserving rotation-invariant feature description. Of course, more dimensions incur more computational cost.
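The inner/outer partition can be expressed as two boolean masks over a square patch. The function name and the convention of centering the region at the patch middle are illustrative assumptions; the choice r = R/2 follows the paper.

```python
import numpy as np

def ring_masks(size, R):
    """Split a circular region of radius R into inner circle and outer ring.

    size: side length of the square patch; the region is centered in it.
    Returns boolean masks (inner, outer) with inner radius r = R/2.
    """
    c = (size - 1) / 2.0
    ys, xs = np.mgrid[0:size, 0:size]
    d = np.hypot(ys - c, xs - c)      # distance to the reference point
    r = R / 2.0                       # inner radius chosen as R/2
    inner = d <= r
    outer = (d > r) & (d <= R)
    return inner, outer
```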

Fig. 3. A contrast between two different reference points.

Fig. 4. Illustration of ring distance projection.

Fig. 5. (a) Radial ring code histograms with N bins, (b) radial ring code histograms with 2N bins.

2.2 Radial Ring Code Histograms for Matching

Within each region, the codes are counted to form radial ring code histograms. The histograms of the different regions are concatenated in a fixed order. Figure 5 shows an example of the two kinds of histograms with \(N=9\): in Fig. 5(b), the first N bins come from the inner circle, while the second N bins come from the outer ring. The radial ring code histograms can be written as

$$\begin{aligned} {{\varvec{V}}}=[\nu (0),\nu (1),\ldots ,\nu (k)], \quad (k=N-1 \quad or \quad k=2N-1) \end{aligned}$$
(3)

RRCH is a truly rotation-invariant feature description that does not depend on a dominant direction. It is stronger and more adaptable than the traditional method, including under nonlinear illumination changes. The weight of the inner circle is \(\omega \) and the weight of the outer ring is \(1-\omega \). Generally, \(\omega \) is less than 0.5 when there are two regions.
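Building the 2N-dimensional descriptor of Eq. (3) then amounts to histogramming the codes in each region and concatenating. The `-1` sentinel for pixels below the magnitude threshold is an assumed convention, consistent with the quantization sketch above.

```python
import numpy as np

def rrch_descriptor(codes, inner, outer, n_bins):
    """Concatenated radial ring code histograms (Eq. 3, k = 2N-1 case).

    codes: per-pixel radial gradient codes, -1 marking pixels below the
    magnitude threshold.  inner/outer: boolean region masks.
    Returns a 2N vector: first N bins from the inner circle, next N
    from the outer ring.
    """
    def hist(mask):
        sel = codes[mask]
        sel = sel[sel >= 0]                       # drop thresholded pixels
        return np.bincount(sel, minlength=n_bins)[:n_bins]
    return np.concatenate([hist(inner), hist(outer)])
```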

The template image and a subimage (a part of the object image with the same size as the template) are each transformed into a vector of radial ring code histograms. Several distance or similarity metrics can compare two such vectors [20], such as the Chi-square statistic, Euclidean distance or Manhattan distance. Multi-object matching makes this comparison more difficult, so the similarity is defined as

$$\begin{aligned} \begin{array}{ll} S = \frac{\sum ^{N-1}_{i=0} min({{\varvec{V}}}_{{\varvec{{M}}}}(i),{{\varvec{V}}}_{{\varvec{{O}}}}(i))}{\sum ^{N-1}_{i=0}{{\varvec{V}}}_{{\varvec{{M}}}}(i)},\quad (k=N-1)\\ S = \frac{\sum ^{N-1}_{i=0} \omega \cdot min({{\varvec{V}}}_{{\varvec{{M}}}}(i),{{\varvec{V}}}_{{\varvec{{O}}}}(i))+\sum ^{2N-1}_{i=N} (1-\omega )\cdot min({{\varvec{V}}}_{{\varvec{{M}}}}(i),{{\varvec{V}}}_{{\varvec{{O}}}}(i))}{\sum ^{N-1}_{i=0}\omega \cdot {{\varvec{V}}}_{{\varvec{{M}}}}(i)+\sum ^{2N-1}_{i=N}(1-\omega )\cdot {{\varvec{V}}}_{{\varvec{{M}}}}(i)},\quad (k=2N-1,\; 0\le \omega \le 1) \end{array} \end{aligned}$$
(4)

The radial ring code histogram of the template image is \({{\varvec{V}}}_{{\varvec{{M}}}}\) and that of the subimage is \({{\varvec{V}}}_{{\varvec{{O}}}}\). \(min({{\varvec{V}}}_{{\varvec{{M}}}},{{\varvec{V}}}_{{\varvec{{O}}}})\) takes the overlapping value of the two histograms in each bin. The formula differs for \(k=N-1\) and \(k=2N-1\) and can be adapted for various objects.
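The weighted histogram-intersection similarity (the \(k=2N-1\) case of Eq. (4)) can be sketched as below; the default \(\omega = 0.4\) is an assumed value consistent with the suggestion \(\omega < 0.5\), not a figure from the paper.

```python
import numpy as np

def similarity(v_m, v_o, n_bins, w=0.4):
    """Weighted histogram-intersection similarity (Eq. 4, k = 2N-1 case).

    v_m: template descriptor, v_o: subimage descriptor (each length 2N).
    w weights the inner circle, 1 - w the outer ring.
    """
    inter = np.minimum(v_m, v_o).astype(float)   # per-bin overlap
    num = w * inter[:n_bins].sum() + (1 - w) * inter[n_bins:].sum()
    den = w * float(v_m[:n_bins].sum()) + (1 - w) * float(v_m[n_bins:].sum())
    return num / den if den > 0 else 0.0
```

Identical descriptors score 1.0; missing template mass in the subimage lowers the score in proportion to its region weight.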

Finally, the template image is slid as a window over the entire image to search for objects and obtain a score matrix. Candidate areas are then selected as local maxima of the score that exceed a threshold.
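The candidate-selection step can be sketched as a simple non-maximum suppression over the score matrix. The 8-neighbour rule and the function name are illustrative assumptions.

```python
import numpy as np

def candidates(score, thresh):
    """Pick local-maximum positions in a score matrix above a threshold.

    A pixel is a candidate if its score exceeds `thresh` and is >= all
    of its 8 neighbours (borders padded with -inf).
    """
    h, w = score.shape
    pad = np.pad(score, 1, mode="constant", constant_values=-np.inf)
    is_max = np.ones((h, w), bool)
    for dy in (-1, 0, 1):
        for dx in (-1, 0, 1):
            if dy == 0 and dx == 0:
                continue
            is_max &= score >= pad[1 + dy:1 + dy + h, 1 + dx:1 + dx + w]
    return [tuple(p) for p in np.argwhere(is_max & (score > thresh))]
```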

Fig. 6. LED machine.

Fig. 7. The histograms of the object image in Fig. 1.

3 Experiment

This section verifies the performance of the proposed method. First, a rotation-invariance test is presented. Second, the computation time is compared with similar techniques. Third, multi-object detection is validated. Finally, experiments examine robustness to illumination variation. The images for these tests were grabbed from the LED machine shown in Fig. 6.

Fig. 8. IC chips for the rotation-invariance test: (a) template image, (b) rotated image, (c) histograms of the IC chips in (a) and (b).

Fig. 9. Time test: (a) template image (19\(\,\times \,\)19 pixels), (b) object image (60\(\,\times \,\)80 pixels), (c) computation time of NCC and RRCH for different angle search ranges.

Fig. 10. IC chips for the robustness test: (a) LED chips with illumination fluctuations, (b) LED chips with illumination fluctuations and noise, (c) Orientation Code Histograms score matrix for (a), (d) Orientation Code Histograms score matrix for (b), (e) RRCH score matrix for (a), (f) RRCH score matrix for (b), (g) x = 26, score contrast for (a), (h) y = 76, score contrast for (b).

To confirm the rotation invariance of the presented algorithm, IC chips were rotated by various arbitrary angles. An example is shown in Fig. 8. As can be seen, the histograms of the IC chips closely resemble each other: when the patterns are rotated, the histograms remain similar. Figure 8 thus shows that the proposed method performs well when matching rotated objects.

As explained above, NCC searches for objects using a set of templates at different rotation angles. The computation time of the proposed algorithm was compared with NCC over different angle ranges, as shown in Fig. 9. All computations were performed in C under the same conditions, without any acceleration. The results show that the computation time of NCC increases with the angle search range, while the computation time of RRCH remains unchanged owing to its rotation invariance.

As presented at the start of Sect. 2, the conventional method cannot distinguish the red region from the blue region in Fig. 1. When RRCH is applied to the two regions, the resulting histograms, shown in Fig. 7, differ far more than those of the conventional method. This means RRCH can solve the multi-object matching problem.

The scenes contain identical chips in arrays. A template image was extracted from one of the chips in Fig. 10(a)(b), and Orientation Code Histograms and RRCH were used to compute the score matrices shown in Fig. 10(c)(d)(e)(f). In Fig. 10(a), the chips are at different angles under illumination fluctuations. The environment in Fig. 10(b) is more complicated: multiple chips with illumination fluctuations and noise. Figure 10(g)(h) verifies that RRCH distinguishes the object positions under these disturbing factors better than Orientation Code Histograms. If the score threshold is set appropriately, small candidate areas can be selected quickly.

4 Conclusion

In this paper, a new multi-object matching method, called RRCH, is proposed. It is built on gradient information in the form of radial ring codes. The method is robust for detecting objects in noisy environments in spite of illumination changes. Experiments demonstrate the rotation and brightness invariance of the proposed method for object search. There remains room for improvement in speed and in parameter selection, and the method can be combined with other template matching algorithms: it is best used as a multi-object coarse-search step to obtain small candidate areas, followed by fine-matching steps.