CN101499132B

CN101499132B - Three-dimensional transformation search method for extracting characteristic points in human face image

Info

Publication number: CN101499132B
Application number: CN 200910037867
Authority: CN
Inventors: 易法令; 熊伟; 黄展鹏; 赵洁
Original assignee: Guangdong Pharmaceutical University
Current assignee: GUANGZHOU HENGBIKANG INFORMATION TECHNOLOGY CO.,LTD.; Guangdong Pharmaceutical University
Priority date: 2009-03-12
Filing date: 2009-03-12
Publication date: 2013-05-01
Anticipated expiration: 2029-03-12
Also published as: CN101499132A

Abstract

The invention discloses a three-dimension transform search method for extracting characteristic point in face image which uses ASM (Active Shape Models) method locating face image as a base, changes two-dimension transform shape search method in present ASM into three-dimension transform shape search method. The method includes steps as follows: firstly, constructing a standard face three-dimension model; secondly, constructing three-dimension coordinates of a two-dimension stat model (basic shape) of ASM train centralizing face characteristic point according with standard face three-dimension model; finally, processing three-dimension transform to the two-dimension stat model having three-dimension coordinate and projecting to the two-dimension plane for approaching the given face characteristic point shape after search, wherein the search process adopts the method of two-step conversion and iterated approximation. The method can reflect real change of face gesture and has more search precision; the test result shows: the method can more approach fact characteristic point compared with present two-dimension transform search method.

Description

The three-dimensional transformation search method of feature point extraction in a kind of facial image

Technical field

The invention belongs to the recognition of face field, be specifically related to the method for face organ's feature point extraction of people's face.

Background technology

Recognition of face is based on a kind of biological identification technology that people's face feature information is carried out identification, and wherein to extract be the basis of carrying out recognition of face to face feature point.Recognition of face has wide practical use as people's identification mode, and at present, though there are some gyp face identification systems to progress into market, still, these technology and system have with a certain distance from practical, and performance and accuracy rate have much room for improvement.At present, what human face characteristic point extracted generally employing is ASM (ActiVe Shape Models, moving shape model) localization method, and the method generally comprised for three steps: (1) obtains a real shape description by the alignment training sample set; (2) catch the statistical information of the shape of having alignd; (3) search for shape instance at image.The method has preferably effect in general front face location, then effect is not good enough but locate for people's face that certain angle deflection is arranged.By analyzing and experiment, we find, the way of search of this and image is relevant: current ASM method, be by the basic configuration to the image of two dimension be rotated, zooming and panning operate and go to approach associated shape; And people's face is the object of a three-dimensional, and aforesaid operations obviously can not reflect the variation of human face posture fully, therefore, can have larger difference in the shape search when approaching.

Summary of the invention

The method that the object of the invention is to consider the problems referred to above and provide a kind of angle removal search human face posture from three-dimensional to change uses the method can improve the accuracy of people's face ASM shape search, and then improves precision and the efficient of whole face identification system.

Technical scheme of the present invention is:

The three-dimensional transformation search method of feature point extraction in a kind of facial image, at first make up the standard three-dimensional model of people's face, secondly obtain the third dimension coordinate of two-dimension human face unique point as the basis take this standard three-dimensional model, carry out the search of three-dimension varying ASM shape as the basis take three-dimensional coordinate at last, namely comprise the steps:

(1) makes up the standard three-dimensional model of people's face, comprise the three-dimensional coordinate (x, y, z) (wherein the front of people's face is the XY plane) of human face characteristic point in the three-dimensional model;

(2) take three-dimensional master pattern as the basis, according to the Two-dimensional Statistical model (x of the human face characteristic point of ASM training set ₁, y ₁, x ₂, y ₂..., x _n, y _n), determine in proportion the third dimension coordinate (z of each unique point in the Two-dimensional Statistical model _i);

(3) with the basic configuration that comprises three-dimensional coordinate respectively from around Z axis, X-axis, Y direction rotation, zooming and panning, the result with conversion projects to the XY plane at last, goes to approach the shape of current search by the result of projection.

Described step (1) people's face standard three-dimensional model takes the method for actual measurement to make up, and at first selects the people's face more than 50 in training set, the three-dimensional coordinate (x of each unique point of actual measurement ₁, y ₁, z ₁, x ₂, y ₂, z ₂..., x _n, y _n, z _n), wherein third dimension coordinate (z axle) begins to measure as zero plane with the central plane of neck, and above data are carried out normalized, then asks its mean value, just obtains the standard faces three-dimensional model.

Described step (2) realizes as follows:

1) the foundation third dimension (Z direction) the coordinate array SZ=[z corresponding with the Two-dimensional Statistical model of the human face characteristic point of ASM training set ₁, z ₂..., z _n], it is data from the third dimension coordinate of standard faces three-dimensional model, and with the ASM training set in human face characteristic point corresponding one by one;

2) in people's face standard three-dimensional model, choose three unique points, and record its two dimensional surface coordinate figure (x, y), calculate accordingly by these three points.Described three unique points are tail of the eyes of choosing two eyes and nose totally three points (P1, P2, P3) (with point 13,26 among Fig. 2,41 corresponding), in the standard faces three-dimensional model, this two dimensional surface coordinate of 3 is known, if coordinate is respectively (xc1, yc1), (xc2, yc2), (xc3, yc3); Corresponding three point coordinate also are known in the Two-dimensional Statistical model of the human face characteristic point of ASM training set, establish its coordinate and are respectively (x1, y1), (x2, y2), (x3, y3).

3) ask horizontal zoom factor C by a P1, P2 _x, ask vertical zoom factor C by mid point and the P3 of P1 and P2 _y, as follows respectively:

\{\begin{matrix} C_{x} = \sqrt{{(x 2 - x 1)}^{2} + {(y 2 - y 1)}^{2}} / (xc 2 - xc 1) \\ C_{y} = \sqrt{{(x 3 - \frac{x 1 + x 2}{2})}^{2} + {(y 3 - \frac{y 1 + y 2}{2})}^{2}} / (yc 3 - yc 1) \end{matrix}

Get both mean value at the zoom factor of Z direction, that is:

C _z＝(C _x+C _y)/2

Pass through C _zThe third dimension coordinate that multiplies each other and just can obtain two-dimension human face image with third dimension coordinate array SZ, i.e. Z axis coordinate.

Described step (3) is the anglec of rotation θ that needs to seek three directions _x, θ _y, θ _z, zooming parameter S _x, S _y, S _z, and the side-play amount T of three directions _x, T _y, T _z, calculate T for convenient _zBe made as 0, the original shape vector x of given people's face and target shape vector x ', both three-dimensional face carries out geometric transformation M to obtain the minor increment of x and x ' in the projection on XY plane, namely minimizes following formula:

E(θ _x，θ _y，θ _z，S _x，S _y，S _z，T _x，T _y)＝|M(x)-x′| ² (1)

Adopt the method for two step conversion iteration approximations to make formula (1) approach optimum parameter value.

The first step conversion of described two step conversion is to rotate and translation around Z axis on the XY plane; Detailed process is as follows: given two similar shape x and x ', seek anglec of rotation θ, yardstick convergent-divergent s, translational movement t, to x make geometric transformation X=M (s, θ) [x]+t so that the x after x ' and the conversion apart from minimum:

E＝(M(s，θ)[x]+t-x′) ^T(M(s，θ)[x]+t-x′) (2)

Wherein:

M (s, θ) [\begin{matrix} x_{i} \\ y_{i} \end{matrix}] = [\begin{matrix} (s \cdot \cos θ) x_{i} - (s \cdot \sin θ) y_{i} \\ (s \cdot \sin θ) x_{i} + (s \cdot \cos θ) y_{i} \end{matrix}]

t＝(t _x，t _y，...t _x，t _y) ^T

Make a=scos θ, b=ssin θ has like this: s ²=a ²+ b ², θ=tan ^-1(b/a)

So:

M (s, θ) [\begin{matrix} x_{i} \\ y_{i} \end{matrix}] = [\begin{matrix} a & - b \\ b & a \end{matrix}] [\begin{matrix} x_{i} \\ y_{i} \end{matrix}] + (\begin{matrix} t_{x} \\ t_{y} \end{matrix}) - - - (3)

Wherein a, b, t _x, t _yBe exactly calculative four attitude parameters, by seeking these four parameters so that in (2) formula the value of E minimum, thereby so that actual variation be consistent with calculating.

The second step conversion of described quadratic transformation is to rotate around Y-axis and Z axis respectively, and projects on the XY plane, and implementation procedure is as follows:

If (X _z, Y _z) be through horizontally rotating (around Z axis) and the displacement after coordinate (the z coordinate is constant), it at first rotates θ around Y-axis _yAngle, zoom factor are S _y, transversal displacement is T _{X '}, then have:

\{\begin{matrix} X_{y} = X_{z} * S_{y} * \cos θ_{y} + Z * S_{y} * \sin θ_{y} + T_{x^{'}} \\ Y_{y} = Y_{z}; \\ Z_{y} = - X_{z} * S_{y} * \sin θ_{y} + Z * S_{y} * \cos θ_{y} \end{matrix} - - - (6)

Rotate θ around X-axis again _xAngle, zoom factor is S simultaneously _x, the vertical misalignment amount is T _{Y '}, then have:

\{\begin{matrix} X_{x} = X_{y}; \\ Y_{x} = Y_{y} * S_{x} * \cos θ_{x} - Z_{y} * S_{x} * \sin θ_{x} + T_{y^{'}}; \\ Z_{x} = Y_{y} * S_{x} * \sin θ_{x} + Z_{y} * S_{x} * \cos θ_{x} \end{matrix} - - - (7)

Above two formulas of simultaneous, and to the XY plane projection can get the coordinate after the conversion:

\{\begin{matrix} X_{e} = X_{x} = X_{z} * S_{y} * \cos θ_{y} + Z * S_{y} * \sin θ_{y} + T_{x^{'}} \\ Y_{e} = Y_{x} = Y_{z} * S_{x} *cos θ_{x} - (- X_{z} * S_{y} * \sin θ_{y} + Z * S_{y} * \cos θ_{y}) * S_{x} *sin θ_{x} + T_{y^{'}} \end{matrix} - - - (8)

Make in the equation (8) a _y=S _y* cos θ _y, b _y=S _y* sin θ _y, a _x=S _x* cos θ _x, b _x=S _x* sin θ _x, and the actual coordinate of establishing after the conversion is (x ', y '), getting in equation (8) the substitution formula (1):

|X _z*a _y+Z*b _y+T _x′-x′| ²+|Y _z*a _x-(-X _z*b _y+Z*a _y)*b _x+T _y′-y′| ² (9)

Make (9) formula minimum, parameter wherein asked local derviation, can get:

X _z*a _y+Z*b _y+T _x′-x′＝0 (10)

Y _z*a _x-(-X _z*b _y+Z*a _y)*b _x+T _y′-y′＝0 (11)

By formula (10) is carried out multiple linear regression analysis, can be in the hope of parameter value a wherein _y, b _y, T _{X '}If total n unique point, detailed process is as follows:

1) averages

\overset{&OverBar;}{X_{z}} = \frac{1}{n} Σ_{i = 1}^{n} X_{zi}

\overset{&OverBar;}{Z} = \frac{1}{n} Σ_{i = 1}^{n} Z_{i}

\overset{&OverBar;}{x^{'}} = \frac{1}{n} Σ_{i = 1}^{n} x_{i}^{'}

2) S_{11} = Σ_{i = 1}^{n} {(X_{zi} - \overset{&OverBar;}{X_{z}})}^{2}

S_{22} = Σ_{i = 1}^{n} {(Z_{i} - \overset{&OverBar;}{Z})}^{2}

L = Σ_{i = 1}^{n} {(x_{i}^{'} - \overset{&OverBar;}{x^{'}})}^{2}

S_{12} = S_{21} = Σ_{i = 1}^{n} (X_{zi} - \overset{&OverBar;}{X}) (Z_{i} - Z)

S_{10} = Σ_{i = 1}^{n} (X_{zi} - \overset{&OverBar;}{X_{z}}) (x_{i}^{'} - \overset{&OverBar;}{x^{'}})

S_{20} = Σ_{i = 1}^{n} (Z_{i} - \overset{&OverBar;}{Z}) (x_{i}^{'} - \overset{&OverBar;}{x^{'}})

{3) a}_{y} = \frac{S_{10} S_{22} - S_{20} S_{12}}{S_{11} S_{22} - {S_{12}}^{2}}

b_{y} = \frac{S_{20} S_{11} - S_{10} S_{21}}{S_{11} S_{22} - {S_{12}}^{2}}

T_{x^{'}} = \overset{&OverBar;}{x^{'}} - a_{y} \overset{&OverBar;}{X_{z}} - b_{y} \overset{&OverBar;}{Z}

A _y, b _y, T _{X '}Value substitution formula (11), using the same method can be in the hope of a _x, b _x, T _{Y '}Second portion conversion M ₂Expression, then:

M_{2} (a_{y}, b_{y}, T_{x^{'}}, a_{x}, b_{x}, T_{y^{'}}) [\begin{matrix} X_{zi} \\ Y_{zi} \end{matrix}]

The most approaching with impact point.

Described iterative approach refer to unique point the XY plane around Z axis rotate and translation after coordinate (X _z, Y _z) intermediateness be to adopt repeatedly the method for iterative approach,

If intermediateness is (X _z, Y _z), concrete steps are as follows:

1) initial seasonal (X _z, Y _z) be end value (x ', y ');

2) (X _z, Y _z) substitution formula (1) replaces x ' wherein, tries to achieve in the formula (3) four transformation parameter a, b, t according to the ASM transform method of current two dimension _x, t _y

3) parameter a, b, t _x, t _ySubstitution formula (3) is obtained intermediateness (X _z, Y _z);

4) (X _z, Y _z) carry out second portion conversion (M ₂), obtain parameter a _y, b _y, T _{X '}, a _x, b _x, T _{Y '}

5) be the basis with (x ', y '), calculate M ₂Inverse transformation, can draw intermediateness (X ' _z, Y ' _z) namely:

({X^{'}}_{z}, {Y^{'}}_{z}) = M_{2 (a_{y}, b_{y}, T_{x^{'}}, a_{x}, b_{x}, T_{y^{'}})}^{- 1} (x^{'}, y^{'})

(X ' _z, Y ' _z) substitution formula (1), try to achieve in the formula (3) four transformation parameter a, b, t according to two-dimentional ASM transform method _x, t _yThen changed for (3) step over to, carry out iterative computation, iteration just can obtain meeting 10 parameters of accuracy requirement for 10 times.

The present invention with respect to the beneficial effect of prior art is: the three-dimensional transformation search method of ASM of the present invention is compared with current two-dimensional search method, has reflected more truly the variation of human face posture, thereby has better unique point search Approximation effect.

Description of drawings

The present invention is further detailed explanation below in conjunction with the drawings and specific embodiments.

Fig. 1 is the process flow diagram of people's face three-dimensional search method of the present invention;

When Fig. 2 is of the present invention for specifically testing two-dimension human face image is carried out the schematic diagram that unique point is demarcated;

Fig. 3 for the given concrete face characteristic point coordinate of test after to the relative approximation degree comparison diagram of view data in the training set;

Fig. 4 for the given concrete face characteristic point coordinate of test after to the relative approximation degree comparison diagram of view data in the non-training set;

Fig. 5 during for the search of test actual persons face to the relative approximation degree comparison diagram of view data in the training set;

Relative approximation degree to view data in the non-training set when Fig. 6 searches for for test actual persons face compares.

Embodiment

The process flow diagram of people's face three-dimensional transformation search method of the present invention as shown in Figure 1, people's head and face organ according to this point, can construct the three-dimensional face model of a standard having similarity aspect shape and the position; The present invention determines the three-dimensional coordinate of face features point just by this master pattern, then carry out the ASM shape search of three-dimension varying as the basis take three-dimensional coordinate; Because be two dimensional image during actual search, so need to project to two dimensional surface at last, the process of three-dimensional search conversion is:

The first, the standard three-dimensional model (x of structure people face ₁, y ₁, z ₁, x ₂, y ₂, z ₂..., x _n, y _n, z _n) (wherein the front of people's face is the XY plane), the unique point in the three-dimensional model should comprise unique point in the actual two dimensional image (be the adaptability of embodiment standard three-dimensional model, can more than the unique point in the actual two dimensional image).

The second, carrying out people's face two dimensional image when search, take three-dimensional master pattern as the basis, according to Two-dimensional Statistical model (the basic configuration) (x of the human face characteristic point of ASM training set ₁, y ₁, x ₂, y ₂..., x _n, y _n), determine in proportion the third dimension coordinate (zi) of each unique point in the Two-dimensional Statistical model.

The 3rd, when shape search conversion, basic configuration is respectively from around Z axis, X-axis, Y direction rotation, zooming and panning, and the result with conversion projects to the XY plane at last, goes to approach the shape of current search by the result of projection.

Below the main content of describing three aspects, the one, the three-dimensional coordinate of the standard obtaining three-dimensional model two-dimension human face unique point by people's face; The 2nd, adopt the method for two step conversion and iteration to obtain 10 transformation parameters when carrying out three-dimensional search; The 3rd, the implementation result test.

(1) third dimension coordinate of the Two-dimensional Statistical model of human face characteristic point obtains

Generally speaking, people's face section organ is not only fixed at the relative position of two dimensional surface, and the height of the third dimension (face contour) also is basically identical.Although the height of people's face section organ may have certain difference, such as: it is low that people's nose has height to have, if but with the reference plane as third dimension coordinate of the central plane (center of three-dimensional rotation) of neck, then this difference is just very little, does not affect search precision when actual search is approached.The standard three-dimensional model of people's face also consists of with unique point, specifically represents with the three-dimensional coordinate form, that is: (x ₁, y ₁, z ₁, x ₂, y ₂, z ₂..., x _n, y _n, z _n); Wherein should comprise all unique points selected in the two-dimension human face, when Fig. 2 was test, the unique point schematic diagram that two dimensional image is selected had been chosen altogether 59 unique points.The acquisition methods of the third dimension coordinate of the Two-dimensional Statistical model of human face characteristic point is as follows:

(1) the foundation third dimension (Z direction) the coordinate array SZ=[z corresponding with the Two-dimensional Statistical model of the human face characteristic point of ASM training set ₁, z ₂..., z _n], it is data from the third dimension coordinate of standard faces three-dimensional model, and with the ASM training set in human face characteristic point corresponding one by one.

(2) in the standard faces three-dimensional model, choose three unique points, and record its planimetric coordinates value.During actual test, what choose is three points of the tail of the eye and nose (P1, P2, P3) (with point 13,26 among Fig. 2,41 corresponding) of two eyes.In the standard faces three-dimensional model, this planar coordinate of 3 is known, establishes coordinate and is respectively (xc1, yc1), (xc2, yc2), (xc3, yc3); Correspondence three point coordinate of the Two-dimensional Statistical model of the human face characteristic point of ASM training set also are known, establish its coordinate and are respectively (x1, y1), (x2, y2), (x3, y3).

(3) ask horizontal zoom factor C by a P1, P2 _x, ask vertical zoom factor C by mid point and the P3 of P1 and P2 _y, as follows respectively:

\{\begin{matrix} C_{x} = \sqrt{{(x 2 - x 1)}^{2} + {(y 2 - y 1)}^{2}} / (xc 2 - xc 1) \\ C_{y} = \sqrt{{(x 3 - \frac{x 1 + x 2}{2})}^{2} + {(y 3 - \frac{y 1 + y 2}{2})}^{2}} / (yc 3 - yc 1) \end{matrix}

Get both mean value at the zoom factor of Z direction, that is:

C _z＝(C _x+C _y)/2

(2) three-dimension varying search

The original shape vector x of given people's face and target shape vector x ', both three-dimensional face is in the projection on XY plane.Realized the three dimensional stress of original shape vector x by the operation of first, the three-dimensional search conversion be exactly carry out on the x basis of three dimensional stress the three-dimension varying removal search approach the target shape vector x '.Compare with the ASM searching method of current facial image, three-dimensional ASM searching method need to be sought the anglec of rotation θ of three directions _x, θ _y, θ _z, zooming parameter S _x, S _y, S _z, and the side-play amount T of three directions _x, T _y, T _z(owing to finally will project to the XY plane, calculate for convenient, Tz can be made as 0) carries out geometric transformation M to obtain the minor increment of x and x ', namely minimizes following formula:

Conventional minimized method is that the parameters on the left side in the following formula is asked partial derivative, and then making it is 0, unites at last to find the solution separate equation and draw the parameters value.But because its parameter is numerous, and parameter all with certain any (x, y) coordinates correlation, above-mentioned conventional method may be obtained its parameter value hardly.Therefore, adopted the method for two step conversion and iteration to remove to approach optimum parameter value.

1) two step conversion

Whole three-dimension varying is divided into two parts: first rotates and translation around Z axis on the XY plane; Second portion is to rotate around Y-axis and Z axis respectively, and projects on the XY plane.Wherein: the conversion of first is consistent with current ASM search shape process, and detailed process is as follows: given two similar shape x and x ', seek anglec of rotation θ, yardstick convergent-divergent s, translational movement t, to x make geometric transformation X=M (s, θ) [x]+t so that the x after x ' and the conversion apart from minimum:

E＝(M(s，θ)[x]+t-x′) ^T(M(s，θ)[x]+t-x′) (2)

Wherein:

M (s, θ) [\begin{matrix} x_{i} \\ y_{i} \end{matrix}] = [\begin{matrix} (s \cdot \cos θ) x_{i} - (s \cdot \sin θ) y_{i} \\ (s \cdot \sin θ) x_{i} + (s \cdot \cos θ) y_{i} \end{matrix}]

t＝(t _x，t _y，...t _x，t _y) ^T

Make a=scos θ, b=ssin θ has like this: s ²=a ²+ b ², θ=tan ^-1(b/a)

So:

M (s, θ) [\begin{matrix} x_{i} \\ y_{i} \end{matrix}] = [\begin{matrix} a & - b \\ b & a \end{matrix}] [\begin{matrix} x_{i} \\ y_{i} \end{matrix}] + (\begin{matrix} t_{x} \\ t_{y} \end{matrix}) - - - (3)

Wherein a, b, t _x, t _yBe exactly calculative four attitude parameters, by seeking these four parameters so that in the formula (2) value of E minimum, thereby so that actual variation be consistent with calculating.Computing method are identical with the ASM transform method of current two dimension.Be provided with n unique point, computation process is as follows:

(1) wushu (3) substitution formula (2) gets:

E (a, b, t_{x}, t_{y}) = | M (x) - x^{' | 2}

= Σ_{i = 1}^{n} {({ax}_{i} - {by}_{i} + t_{x} - {x^{'}}_{i})}^{2} + {({bx}_{i} + {ay}_{i} + t_{y} - {y^{'}}_{i})}^{2} - - - (4)

(2) describe for convenient, define following and value:

S_{x} = \frac{1}{n} Σ_{i = 1}^{n} x_{i};

S_{y} = \frac{1}{n} Σ_{i = 1}^{n} y_{i}

S_{x^{'}} = \frac{1}{n} Σ_{i = 1}^{n} {x^{'}}_{i};

S_{y^{'}} = \frac{1}{n} Σ_{i = 1}^{n} {y^{'}}_{i}

S_{xx} = \frac{1}{n} Σ_{i = 1}^{n} x_{i}^{2};

S_{yy} = \frac{1}{n} Σ_{i = 1}^{n} {y_{i}}^{2}

S_{xy} = \frac{1}{n} Σ_{i = 1}^{n} x_{i} y_{i}

S_{{xx}^{'}} = \frac{1}{n} Σ_{i = 1}^{n} x_{i} {x^{'}}_{i};

S_{{yy}^{'}} = \frac{1}{n} Σ_{i = 1}^{n} y_{i} {y^{'}}_{i}

S_{{xy}^{'}} = \frac{1}{n} Σ_{i = 1}^{n} x_{i} {y^{'}}_{i};

S_{{yx}^{'}} = \frac{1}{n} Σ_{i = 1}^{n} y_{i} {x^{'}}_{i}

(3) each parameter in the formula (4) is asked partial derivative, and makes equation equal 0 can getting:

\{\begin{matrix} a (S_{xx} + S_{yy}) + t_{x} S_{x} + t_{y} S_{y} = S_{{xx}^{'}} + S_{{yy}^{'}} \\ b (S_{xx} + S_{yy}) + t_{y} S_{x} - t_{x} S_{y} = S_{{xy}^{'}} - S_{{yx}^{'}} \\ a S_{x} - b S_{y} + t_{x} = S_{x^{'}} \\ b S_{x} + a S_{y} + t_{y} = S_{y^{'}} \end{matrix} - - - (5)

(4) to top system of equations (5) simultaneous solution, calculate for simplifying, the center of original state x is moved on to initial point, like this S _x=S _y=0.So, can obtain the value of 4 parameters:

t _x＝S _x′；t _y＝S _y′

a＝(S _xx′+S _yy′)/(S _xx+S _yy)

b＝(S _xy′-S _yx′)/(S _xx+S _yy)

The implementation procedure of second portion conversion is as follows:

\{\begin{matrix} X_{y} = X_{z} * S_{y} * \cos θ_{y} + Z * S_{y} * \sin θ_{y} + T_{x^{'}} \\ Y_{y} = Y_{z}; \\ Z_{y} = - X_{z} * S_{y} * \sin θ_{y} + Z * S_{y} * \cos θ_{y} \end{matrix} - - - (6)

\{\begin{matrix} X_{x} = X_{y}; \\ Y_{x} = Y_{y} * S_{x} * \cos θ_{x} - Z_{y} * S_{x} * \sin θ_{x} + T_{y^{'}}; \\ Z_{x} = Y_{y} * S_{x} * \sin θ_{x} + Z_{y} * S_{x} * \cos θ_{x} \end{matrix} - - - (7)

\{\begin{matrix} X_{e} = X_{x} = X_{z} * S_{y} * \cos θ_{y} + Z * S_{y} * \sin θ_{y} + T_{x^{'}} \\ Y_{e} = Y_{x} = Y_{z} * S_{x} *cos θ_{x} - (- X_{z} * S_{y} * \sin θ_{y} + Z * S_{y} * \cos θ_{y}) * S_{x} *sin θ_{x} + T_{y^{'}} \end{matrix} - - - (8)

Make in the equation (8) a _y=S _y* cos θ _y, b _y=S _y* sin θ _y, a _x=S _x* cos θ _x, b _x=S _x* sin θ _x, and the actual coordinate of establishing after the conversion is (x ', y '), and getting in equation (8) the substitution formula (1): | X _z* a _y+ Z*b _y+ T _{X '}-x ' | ²+ | Y _z* a _x-(X _z* b _y+ Z*a _y) * b _x+ T _{Y '}-y ' | ²(9)

Make (9) formula minimum, parameter wherein asked local derviation, can get:

X _z*a _y+Z*b _y+T _x′-x′＝0 (10)

Y _z*a _x-(-X _z*b _y+Z*a _y)*b _x+T _y′-y′＝0 (11)

(1) averages

\overset{&OverBar;}{X_{z}} = \frac{1}{n} Σ_{i = 1}^{n} X_{zi}

\overset{&OverBar;}{Z} = \frac{1}{n} Σ_{i = 1}^{n} Z_{i}

\overset{&OverBar;}{x^{'}} = \frac{1}{n} Σ_{i = 1}^{n} x_{i}^{'}

{(2) S}_{11} = Σ_{i = 1}^{n} {(X_{zi} - \overset{&OverBar;}{X_{z}})}^{2}

S_{22} = Σ_{i = 1}^{n} {(Z_{i} - \overset{&OverBar;}{Z})}^{2}

L = Σ_{i = 1}^{n} {(x_{i}^{'} - \overset{&OverBar;}{x^{'}})}^{2}

S_{12} = S_{21} = Σ_{i = 1}^{n} (X_{zi} - \overset{&OverBar;}{X}) (Z_{i} - Z)

S_{10} = Σ_{i = 1}^{n} (X_{zi} - \overset{&OverBar;}{X_{z}}) (x_{i}^{'} - \overset{&OverBar;}{x^{'}})

S_{20} = Σ_{i = 1}^{n} (Z_{i} - \overset{&OverBar;}{Z}) (x_{i}^{'} - \overset{&OverBar;}{x^{'}})

(3) a_{y} = \frac{S_{10} S_{22} - S_{20} S_{12}}{S_{11} S_{22} - {S_{12}}^{2}}

b_{y} = \frac{S_{20} S_{11} - S_{10} S_{21}}{S_{11} S_{22} - {S_{12}}^{2}}

T_{x^{'}} = \overset{&OverBar;}{x^{'}} - a_{y} \overset{&OverBar;}{X_{z}} - b_{y} \overset{&OverBar;}{Z}

M_{2} (a_{y}, b_{y}, T_{x^{'}}, a_{x}, b_{x}, T_{y^{'}}) [\begin{matrix} X_{zi} \\ Y_{zi} \end{matrix}]

The most approaching with impact point.

2) iterative approach

How to obtain unique point the XY plane around Z axis rotate and translation after coordinate (X _z, Y _z) be the key of carrying out the second step conversion, because wherein intermediateness is unknown, when design, adopts repeatedly the method for iterative approach to obtain actual intermediateness, and finally obtain actual transformation parameter.If intermediateness is (X _z, Y _z) concrete steps are as follows:

(1) initial seasonal (X _z, Y _z) be end value (x ', y '),

(2) (X _z, Y _z) substitution formula (4), try to achieve in the formula (4) four transformation parameter a, b, t according to the ASM transform method of current two dimension _x, t _y

(3) parameter a, b, t _x, t _ySubstitution formula (3) is obtained intermediateness (X _z, Y _z)

(4) (X _z, Y _z) carry out second portion conversion (M ₂), obtain parameter a _y, b _y, T _{X '}, a _x, b _x, T _{Y '}

(5) be the basis with (x ', y '), calculate M ₂Inverse transformation, can draw intermediateness (X ' _z, Y ' _z) namely:

({X^{'}}_{z}, {Y^{'}}_{z}) = M_{2 (a_{y}, b_{y}, T_{x^{'}}, a_{x}, b_{x}, T_{y^{'}})}^{- 1} (x^{'}, y^{'})

(6) (X ' _z, Y ' _z) substitution formula (4), try to achieve in the formula (4) four transformation parameter a, b, t according to two-dimentional ASM transform method _x, t _yThen changed for (3) step over to, carry out loop iteration and calculate, general iteration just can reach accuracy requirement 10 times.

Can finally obtain 10 parameters of image three-dimensional conversion by above-mentioned alternative manner, that is: 6 parameters of 4 parameters of for the first time conversion (rotate and translation around Z axis on the XY plane) and for the second time conversion (rotate and project on the XY plane around Y-axis and Z axis respectively).

(3) implementation result test

By the human face characteristic point extraction system that adopts three-dimensional ASM method is tested, aspect the feature point extraction of carrying out non-training set data, the method improves a lot aspect accuracy than the ASM method of two dimension.The below has carried out two types test: the one, and then given concrete face characteristic point coordinate goes to approach with two kinds of methods, tests its approximation ratio; The 2nd, given concrete people's face is also pressed same searching algorithm removal search unique point, the then difference between comparison search result and the fact characteristic point with two kinds of methods respectively.Wherein every type test all includes the test to training set data and non-training set data.Result's demonstration, not obvious to the improvement degree of training set data this method, but non-training set data then is greatly increased.Because in actual applications, most view data should belong to non-training set, so this method has higher practical value.

When the structure test macro, selected the facial image of the different attitudes of 100 width of cloth as training set data, other has 30 width of cloth images as test data, wherein image resolution-ratio is 125*150, all images are all carried out manual unique point demarcate, as shown in Figure 2, each image has been chosen 59 unique points.In order to compare more exactly both effects, defined the concept of a relative approximation degree.If the unique point that D1 calculates when approaching for employing three-dimensional transformation search method of the present invention and the mean distance between the actual calibration point, D2 is the unique point that adopts conventional two-dimensional transform search and calculate when approaching and the mean distance between the actual calibration point, and relative approximation degree RN is expressed as:

RN＝(D2-D1)/D1*100％

Obviously RN represents then that for just three-dimensional Approximation effect is better, represents then that for bearing two-dimentional Approximation effect is good, and its numerical values recited then represents the degree of approaching.

1. the test search approaches concrete coordinate

We have chosen 12 width of cloth images in training set, the direct substitution of its coordinate is approached with two kinds of methods respectively, its result as shown in Figure 3, as can be seen from the figure, as a rule, both Approximation effects are consistent.Fig. 4 then is the relative approximation degree when image directly approaches in the non-training set, therefrom can find out, in most of the cases, three-dimensional approach method can more approach desired value.

2. test the search of concrete people's face

When the concrete people's face of search, from training set, chosen 15 width of cloth images and searched for its result such as Fig. 5.Basically identical with expected results, both difference is not obvious.Fig. 6 searches for result after the coupling to image in 30 non-training sets, can find out that therefrom three-dimension varying obviously is better than two dimension, and the relative approximation effect is better than directly the approaching of objectives, this be since in approximate procedure target may repeatedly adjust.

Claims

1. the three-dimensional transformation search method of feature point extraction in the facial image, it is characterized in that, at first make up the standard three-dimensional model of people's face, secondly obtain the third dimension coordinate of two-dimension human face unique point as the basis take this standard three-dimensional model, realize the search of three-dimension varying moving shape model ASM shape as the basis take three-dimensional coordinate at last, specifically comprise the steps:

(1) make up the standard three-dimensional model of people's face, comprise the three-dimensional coordinate (x, y, z) of human face characteristic point in the standard three-dimensional model, wherein the front of people's face is the XY plane;

(2) take the standard three-dimensional model as the basis, according to the Two-dimensional Statistical model (x of the human face characteristic point of ASM training set ₁, y ₁, x ₂, y ₂..., x _n, y _n), determine in proportion the third dimension coordinate z of each unique point in the Two-dimensional Statistical model _i

2. the three-dimensional transformation search method of feature point extraction in the facial image according to claim 1, it is characterized in that described step (1) standard three-dimensional model takes the method for actual measurement to make up, at first in training set, select arbitrarily the people's face more than 50, the three-dimensional coordinate (x of each unique point of actual measurement ₁, y ₁, z ₁, x ₂, y ₂, z ₂..., x _n, y _n, z _n), wherein third dimension coordinate z axle begins to measure as zero plane with the central plane of neck, and above data are carried out normalized, then asks its mean value, just obtains the standard three-dimensional model.

3. the three-dimensional transformation search method of feature point extraction in the facial image according to claim 1 is characterized in that described step (2) realizes as follows:

1) the foundation third dimension Z direction coordinate array SZ=[z corresponding with the Two-dimensional Statistical model of the human face characteristic point of ASM training set ₁, z ₂..., z _n], it is data from the third dimension coordinate of standard faces three-dimensional model, and with the ASM training set in human face characteristic point corresponding one by one;

2) in people's face standard three-dimensional model, choose three unique points, and record its two dimensional surface coordinate figure (x, y), calculate accordingly by these three points, described three unique points are to choose the tail of the eye of two eyes and nose totally three points (P1, P2, P3), in the standard three-dimensional model, this two dimensional surface coordinate of 3 is known, establishes coordinate and is respectively (xc1, yc1), (xc2, yc2), (xc3, yc3); Corresponding three point coordinate also are known in the Two-dimensional Statistical model of the human face characteristic point of ASM training set, establish its coordinate and are respectively (x1, y1), (x2, y2), (x3, y3);

\{\begin{matrix} C_{x} = \sqrt{{(x 2 - x 1)}^{2} + {(y 2 - y 1)}^{2}} / (xc 2 - xc 1) \\ C_{y} = \sqrt{{(x 3 - \frac{x 1 + x 2}{2})}^{2} + {(y 3 - \frac{y 1 + y 2}{2})}^{2}} / (yc 3 - yc 1) \end{matrix}

Get both mean value at the zoom factor of Z direction, that is:

C _z＝(C _x+C _y)/2

4. the three-dimensional transformation search method of feature point extraction in the facial image according to claim 1 is characterized in that being that described step (3) is the anglec of rotation θ that needs are sought three directions _x, θ _y, θ _z, zooming parameter S _x, S _y, S _z, and the side-play amount T of three directions _x, T _y, T _z, calculate T for convenient _zBe made as 0; The original shape vector x of given people's face and target shape vector x ', both three-dimensional face carries out three-dimensional rotation, convergent-divergent, translation and projective transformation M to obtain the minor increment of x and x ' in the projection on XY plane, namely minimizes following formula:

5. the three-dimensional transformation search method of feature point extraction in the facial image according to claim 4 is characterized in that adopting the method for two step conversion iteration approximations to make formula (1) approach optimum parameter value.

6. the three-dimensional transformation search method of feature point extraction in the facial image according to claim 5, the first step conversion that it is characterized in that being described two step conversion be on the XY plane around Z axis rotate, zooming and panning; Detailed process is as follows: the original shape vector x of given people's face and target shape vector x ', seek anglec of rotation θ, yardstick convergent-divergent s, translational movement t, to x make geometric transformation X=M (s, θ) [x]+t so that the x after x ' and the conversion apart from minimum:

E＝(M(s，θ)[x]+t-x′) ^T(M(s，θ)[x]+t-x′) (2)

Wherein:

M (s, θ) [\begin{matrix} x_{i} \\ y_{i} \end{matrix}] = [\begin{matrix} (s \cdot \cos θ) x_{i} - (s \cdot \sin θ) y_{i} \\ (s \cdot \sin θ) x_{i} + (s \cdot \cos θ) y_{i} \end{matrix}]

t＝(t _x，t _y，...t _x，t _y) ^T

Make a=scos θ, b=ssin θ has like this: s ²=a ²+ b ², θ=tan ^-1(b/a)

So:

M (s, θ) [\begin{matrix} x_{i} \\ y_{i} \end{matrix}] = [\begin{matrix} a & - b \\ b & a \end{matrix}] [\begin{matrix} x_{i} \\ y_{i} \end{matrix}] + (\begin{matrix} t_{x} \\ t_{y} \end{matrix}) - - - (3)

Wherein a, b, t _x, t _yBe exactly calculative four attitude parameters, by seeking these four parameters so that in (2) formula the value of E minimum, thereby so that actual variation be consistent with calculating;

Computation process is as follows:

For convenience, define first following and value:

S_{x} = \frac{1}{n} Σ_{i = 1}^{n} x_{i}

S_{y} = \frac{1}{n} Σ_{i = 1}^{n} y_{i}

S_{x^{'}} = \frac{1}{n} Σ_{i = 1}^{n} x_{i}^{'}

S_{y^{'}} = \frac{1}{n} Σ_{i = 1}^{n} y_{i}^{'}

S_{x x} = \frac{1}{n} Σ_{i = 1}^{n} x_{i}^{2}

S_{yy} = \frac{1}{n} Σ_{i = 1}^{n} y_{i}^{2}

S_{xy} = \frac{1}{n} Σ_{i = 1}^{n} x_{i} y_{i}

S_{{xx}^{'}} = \frac{1}{n} Σ_{i = 1}^{n} x_{i} x_{i}^{'}

S_{{yy}^{'}} = \frac{1}{n} Σ_{i = 1}^{n} y_{i} y_{i}^{'}

S_{{xy}^{'}} = \frac{1}{n} Σ_{i = 1}^{n} x_{i} y_{i}^{'}

S_{{yx}^{'}} = \frac{1}{n} Σ_{i = 1}^{n} y_{i} x_{i}^{'}

(3) formula substitution (1) Shi Kede:

E (a, b, t_{x}, t_{y}) = Σ_{i = 1}^{n} {({ax}_{i} - {by}_{i} + t_{x} - x_{i}^{'})}^{2} + {({bx}_{i} + {ay}_{i} + t_{y} - y_{i}^{'})}^{2}

Respectively each parameter of following formula is asked partial derivative, and makes that its value is 0 can get:

a(S _xx+S _yy)+t _xS _x+t _yS _y＝S _xx′+S _yy′

b(S _xx+S _yy)+t _yS _x-t _xS _y＝S _xy′-S _yx′

aS _x-bS _y+t _x＝S _x′

bS _x+aS _y+t _y＝S _y′

In order to simplify calculating, can be first the center of vector x be moved to initial point, S is arranged like this _x=0, S _y=0, can get the value of 4 parameters to 4 equation solutions in front:

t _x＝S _x′ t _y＝S _y′

a = \frac{S_{{xx}^{'}} + S_{{yy}^{'}}}{S_{xx} + S_{yy}}

b = \frac{S_{{xy}^{'}} + S_{{yx}^{'}}}{S_{xx} + S_{yy}}

7. the three-dimensional transformation search method of feature point extraction in the facial image according to claim 5 is characterized in that being that the second step conversion of described quadratic transformation is to rotate around Y-axis and Z axis respectively, and projects on the XY plane, and implementation procedure is as follows:

If (X _z, Y _z, be that the Z coordinate is constant through the coordinate after the rotation of horizontal winding Z axis and the displacement Z); It at first rotates θ around Y-axis _yAngle, zoom factor are S _y, transversal displacement is T _{X '}, then have:

\{\begin{matrix} X_{y} = X_{z} * S_{y} * \cos θ_{y} + Z * S_{y} * \sin θ_{y} + T_{x^{'}} \\ Y_{y} = Y_{z}; \\ Z_{y} = - X_{z} * S_{y} * \sin θ_{y} + Z * S_{y} * \cos θ_{y} \end{matrix} - - - (6)

Wherein: (X _y, Y _y, Z _y) expression point (X _z, Y _z, Z) around Y-axis rotate, coordinate after convergent-divergent and the lateral excursion;

\{\begin{matrix} X_{x} = X_{y;} \\ Y_{x} = Y_{y} * S_{x} * \cos θ_{x} - Z_{y} * S_{x} * \sin θ_{x} + T_{y^{'}}; \\ Z_{x} = Y_{y} * S_{x} * \sin θ_{x} + Z_{y} * S_{x} * \cos θ_{x} \end{matrix} - - - (7)

Wherein: (X _x, Y _x, Z _x) expression point (X _y, Y _y, Z _y) around X-axis rotate, coordinate behind convergent-divergent and the vertical misalignment;

\{\begin{matrix} X_{e} = X_{x} = X_{z} * S_{y} * \cos θ_{y} + Z * S_{y} * \sin θ_{y} + T_{x^{'}} \\ Y_{e} = Y_{x} = Y_{z} * S_{x} * \cos θ_{x} - (- X_{z} * S_{y} * \sin θ_{y} + Z * S_{y} * \cos θ_{y}) * S_{x} * \sin θ_{x} + T_{y^{'}} \end{matrix} - - - (8)

Make (9) formula minimum, parameter wherein asked local derviation, can get:

X _z*a _y+Z*b _y+T _x′-x′＝0 (10)

Y _z*a _x-(-X _z*b _y+Z*a _y)*b _x+T _y′-y′＝0 (11)

By formula (10) is carried out multiple linear regression analysis, can be in the hope of parameter value a wherein _y, b _y, T _{X '}If total n unique point, then X in the formula (10) _zCan be expressed as (X _Z1, X _Z2..., X _Zn), Z is expressed as (Z ₁, Z ₂..., Z _n), x ' be expressed as (x ' ₁, x ' ₂..., x ' _n), detailed process is as follows:

1) averages

\overset{&OverBar;}{X_{z}} = \frac{1}{n} Σ_{i = 1}^{n} X_{zi}

\overset{&OverBar;}{Z} = \frac{1}{n} Σ_{i = 1}^{n} Z_{i}

\overset{&OverBar;}{x^{'}} = \frac{1}{n} Σ_{i = 1}^{n} x_{i}^{'}

2) obtain

Value, and calculate its 1 to n with the intermediate value of value as next step calculating, use respectively S ₁₁, S ₂₂, L, S ₁₂, S ₁₀, S ₂₀Expression

S_{11} = Σ_{i = 1}^{n} {(X_{zi} - \overset{&OverBar;}{X_{z}})}^{2}

S_{22} = Σ_{i = 1}^{n} {(Z_{i} - \overset{&OverBar;}{Z})}^{2}

L = Σ_{i = 1}^{n} {(x_{i}^{'} - \overset{&OverBar;}{x^{'}})}^{2}

S_{12} = S_{21} = Σ_{i = 1}^{n} (X_{zi} - \overset{&OverBar;}{X}) (Z_{i} - Z)

S_{10} = Σ_{i = 1}^{n} (X_{zi} - \overset{&OverBar;}{X_{z}}) (x_{i}^{'} - \overset{&OverBar;}{x^{'}})

S_{20} = Σ_{i = 1}^{n} (Z_{i} - \overset{&OverBar;}{Z}) (x_{i}^{'} - \overset{&OverBar;}{x^{'}})

3) ask three parameter a _y, b _y, T _{X '}Value

a_{y} = \frac{S_{10} S_{22} - S_{20} S_{12}}{S_{11} S_{22} - {S_{12}}^{2}}

b_{y} = \frac{S_{20} S_{11} - S_{10} S_{21}}{S_{11} S_{22} - {S_{12}}^{2}}

T_{x^{'}} = \overset{&OverBar;}{x^{'}} - a_{y} \overset{&OverBar;}{X_{z}} - b_{y} \overset{&OverBar;}{Z}

A _y, b _y, T _{X '}Value substitution formula (11), using the same method can be in the hope of a _x, b _x, T _{Y '}Second step conversion M ₂Expression, then:

M_{2} (a_{y}, b_{y}, T_{x^{'}}, a_{x}, b_{x}, T_{y^{'}}) [\begin{matrix} X_{zi} \\ Y_{zi} \end{matrix}]

The most approaching with impact point.

8. the three-dimensional transformation search method of feature point extraction in the facial image according to claim 5, it is characterized in that being described iterative approach refer to unique point the XY plane around Z axis rotate and translation after coordinate (X _z, Y _z) intermediateness be to adopt repeatedly the method for iterative approach,

If intermediateness is (X _z, Y _z), concrete steps are as follows:

1) initial seasonal (X _z, Y _z) be end value (x ', y ');

2) (X _z, Y _z) substitution formula (1) replaces x ' wherein, tries to achieve in the formula (3) four transformation parameter a, b, t according to the ASM method of current two dimension _x, t _y

4) (X _z, Y _z) carry out second step conversion M ₂, obtain parameter a _y, b _y, T _{X '}, a _x, b _x, T _{Y '}

({X^{'}}_{z}, {Y^{'}}_{z}) = M_{2 (a_{y}, b_{y}, T_{x^{'}}, a_{x}, b_{x}, T_{y^{'}})}^{- 1} (x^{'}, y^{'})

6) (X ' _z, Y ' _z) substitution formula (1), try to achieve in the formula (3) four transformation parameter a, b, t according to two-dimentional ASM transform method _x, t _yThen changed for (3) step over to, carry out iterative computation, iteration just can obtain meeting 10 parameters of accuracy requirement for 10 times, i.e. 4 parameter a, b, the t of for the first time conversion _x, t _y, and 6 parameter a of for the second time conversion _y, b _y, T _{X '}, a _x, b _x, T _{Y '}