Nothing Special   »   [go: up one dir, main page]

Skip to main content
Log in

Photometric Stereo by Hemispherical Metric Embedding

  • Published:
Journal of Mathematical Imaging and Vision Aims and scope Submit manuscript

Abstract

Photometric stereo methods seek to reconstruct the 3d shape of an object from motionless images obtained with varying illumination. Most existing methods solve a restricted problem where the physical reflectance model, such as Lambertian reflectance, is known in advance. In contrast, we do not restrict ourselves to a specific reflectance model. Instead, we offer a method that works on a wide variety of reflectances. Our approach uses a simple yet uncommonly used property of the problem— the sought after normals are points on a unit hemisphere. We present a novel embedding method that maps pixels to normals on the unit hemisphere. Our experiments demonstrate that this approach outperforms existing manifold learning methods for the task of hemisphere embedding. We further show successful reconstructions of objects from a wide variety of reflectances including smooth, rough, diffuse and specular surfaces, even in the presence of significant attached shadows. Finally, we empirically prove that under these challenging settings we obtain more accurate shape reconstructions than existing methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

References

  1. Abrams, A., Hawley, C., Pless, R.: Heliometric stereo: shape from sun position. In: CVPR, (2012)

  2. Ackermann, J., Langguth, F., Fuhrmann, S., Goesele, M., Darmstadt, T.: Photometric stereo for outdoor webcams. In: ECCV, (2012)

  3. Alldrin, N., Mallick, S., Kriegman, D.: Resolving the generalized bas-relief ambiguity by entropy minimization. In: CVPR, (2007)

  4. Arnold, V., Cooke, R.: Lectures on partial differential equations. Universitext 1979. Springer, Berlin (2004). http://books.google.co.il/books?id=qlNJAYwmfTcC

  5. Balasubramanian, M., Schwartz, E.L.: The isomap algorithm and topological stability. In: Science, (2002)

  6. Basri, R., Jacobs, D., Kemelmacher, I.: Photometric stereo with general, unknown lighting. In: IJCV, (2007)

  7. Basri, R., Jacobs, D.W.: Lambertian reflectance and linear subspaces. IEEE Trans. Pattern Anal. Mach. Intell. 25(2), 218–233 (2003)

    Article  Google Scholar 

  8. Belhumeur, P.N., Kriegman, D.J., Yuille, A.L.: The bas-relief ambiguity. In: IJCV, (1999)

  9. Durou, J.D., Aujol, J.F., Courteille, F.: Integrating the normal field of a surface in the presence of discontinuities. pp. 261–273. Springer, Berlin (2009)

  10. Durou, J.D., Quéau, Y., Aujol, J.F.: Normal integration–part I: a survey (2016)

  11. Fang, Y., Vishwanathan, S., Sun, M., Ramani, K.: slle: Spherical locally linear embedding with applications to tomography. In: CVPR, (2011)

  12. Frankot, R.T., Chellappa, R., Member, S.: A method for enforcing integrability in shape from shading algorithms. IEEE Trans. Pattern Anal. Mach. Intell. 10, 439–451 (1988)

    Article  MATH  Google Scholar 

  13. Georghiades, A.: Incorporating the torrance and sparrow model of reflectance in uncalibrated photometric stereo. pp. 816–823 (2003)

  14. Georghiades, A.S.: Recovering 3-d shape and reflectance from a small number of photographs. In: EGRW, (2003)

  15. Gotardo, P.F.U., Simon, T., Sheikh, Y., Matthews, I.: Photogeometric scene flow for high-detail dynamic 3d reconstruction. In: The IEEE Int. Conf. Comput. Vis. (ICCV) (2015)

  16. Hayakawa, H.: Photometric stereo under a light-source with arbitrary motion. In: JOSA-A, (1994)

  17. Hertzmann, A., Seitz, S.: Example-based photometric stereo: shape reconstruction with general, varying brdfs. In: PAMI, (2005)

  18. Higo, T., Matsushita, Y., Ikeuchi, K.: Consensus photometric stereo. In: CVPR, (2010)

  19. Hobson, E.W.: The theory of spherical and ellipsoidal harmonics. In: CUP Archive, (1931)

  20. Hoeltgen, L., Quau, Y., Breuss, M., Radow, G.: Optimised photometric stereo via non-convex variational minimisation (regular paper). In: British machine vision conference (BMVC), (2016)

  21. Levina, E., Bickel, P.J.: Maximum likelihood estimation of intrinsic dimension. In: NIPS, (2004)

  22. Littwin, E., Averbuch-Elor, H., Cohen-Or, D.: Spherical embedding of inlier silhouette dissimilarities. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3855–3863 (2015)

  23. Lu, F., Matsushita, Y., Sato, I., Okabe, T., Sato, Y.: Uncalibrated photometric stereo for unknown isotropic reflectances. In: CVPR, (2013)

  24. Mecca, R., Rodolà, E., Cremers, D.: Realistic photometric stereo using partial differential irradiance equation ratios. Comput.Graph. 51, 8–16 (2015)

  25. Midorikawa, K., Yamasaki, T., Aizawa, K.: Uncalibrated photometric stereo by stepwise optimization using principal components of isotropic brdfs. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), (2016)

  26. Nillius, P., Eklundh, J.O.: Classifying materials from their reflectance properties. In: European Conference on Computer Vision (ECCV), pp. 366–376 (2004)

  27. Okabe, T., Sato, I., Sato, Y.: Attached shadow coding: estimating surface normals from shadows under unknown reflectance and lighting conditions. In: ICCV, (2009)

  28. Papadhimitri, T., Favaro, P.: A closed-form solution to uncalibrated photometric stereo via diffuse maxima. IEEE Computer Society, Los Alamitos, (2012)

  29. Papadhimitri, T., Favaro, P.: A new perspective on uncalibrated photometric stereo. In: IEEE computer society CVPR, pp. 1474–1481. (2013)

  30. Papadhimitri, T., Favaro, P.: Uncalibrated near-light photometric stereo. In: Proceedings of the British Machine Vision Conference. BMVA Press (2014)

  31. Queau, Y., Lauze, F., Durou, J.D.: A L1-TV Algorithm for Robust perspective photometric stereo with spatially-varying lightings, In: SSVM, (2015)

  32. Quéau, Y., Lauze, F., Durou, J.D.: Solving uncalibrated photometric stereo using total variation. J. Math. Imag. Vis 52(1), 87–107 (2015)

  33. Queau, Y., Mecca, R., Durou, J.D.: Unbiased photometric stereo for colored surfaces: a variational approach. In: The IEEE Conference on Computer Vision and Pattern Recognition, (2016)

  34. Ramamoorthi, R., Hanrahan, P.: On the relationship between radiance and irradiance: determining the illumination from images of a convex lambertian object. JOSA 18(10), 2448–2459 (2001)

  35. Sato, I., Okabe, T., Yu, Q., Sato, Y.: Shape reconstruction based on similarity in radiance changes under varying illumination. In: ICCV, (2007)

  36. Saul, L.K., Roweis, S.T.: Think globally, fit locally: unsupervised learning of low dimensional manifolds. J. Mach. Learn. Res. 4, 119–155 (2003)

    MathSciNet  MATH  Google Scholar 

  37. Shi, B., Matsushita, Y., Wei, Y., Xu, C., Tan, P.: P.: Self-calibrating photometric stereo. In: CVPR, (2010)

  38. Simchony, T., Chellappa, R., Shao, M.: Direct analytical methods for solving poisson equations in computer vision problems. IEEE Trans. Pattern Anal. Mach. Intell. 12(5), 435–446 (1990)

    Article  MATH  Google Scholar 

  39. Smith, W.A.P., Ramamoorthi, R., Tozza, S.: Linear depth estimation from an uncalibrated, monocular polarisation image. In: Leibe B., Matas J., Sebe N., Welling M. (eds.) Computer Vision - ECCV 2016. ECCV 2016. Lecture Notes in Computer Science, vol. 9912. Springer, Cham (2016)

  40. Tenenbaum, J.B., de Silva, V., Langford, J.C.: A global geometric framework for nonlinear dimensionality reduction. Science 290(5500), 2319–2323 (2000)

  41. Torgerson, W.S.: Theor. Methods Scaling. Wiley, Hoboken (1958)

    Google Scholar 

  42. Tozza, S., Mecca, R., Duocastella, M., DelBue, A.: Direct differential photometric stereo shape recovery of diffuse and specular surfaces. J. Math. Imag. Vis. 56(1), 57–76 (2016)

  43. Woodham, R.J.: Photometric method for determining surface orientation from multiple images. Opt. Eng. 19, 139–144 (1980)

    Article  Google Scholar 

  44. Wu, L., Ganesh, A., Shi, B., Matsushita, Y., Wang, Y., Ma, Y.: Robust photometric stereo via low-rank matrix completion and recovery. In: ACCV, (2011)

  45. Xie, W., Dai, C., Wang, C.C.L.: Photometric stereo with near point lighting: a solution by mesh deformation. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), (2015)

  46. Yuille, A., Snow, D.: Shape and albedo from multiple images using integrability. In: CVPR, (1997)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nati Ofir.

Appendix

Appendix

Our proofs rely on a spherical harmonic representation of reflectance. Below we introduce notation, which for the most part follows the notation of [7]. Spherical harmonics are orthogonal functions from the unit sphere to the complex plain \(Y_{nm}:\mathcal{S}^2 \rightarrow \mathcal{C}\), \(n=0,1,2\ldots \) and \(-n \le m \le n\), defined as

$$\begin{aligned} Y_{nm}(\theta ,\phi ) = \sqrt{\frac{(2n+1)}{4\pi } \frac{(n-|m|)!}{(n+|m|)!}} P_{n|m|}(\cos \theta ) e^{im\phi }, \end{aligned}$$
(15)

where \(P_{nm}\) are the associated Legendre functions, defined as

$$\begin{aligned} P_{nm}(z) = \frac{(1-z^2)^{m/2}}{2^n n!} \frac{d^{n+m}}{dz^{n+m}}(z^2-1)^n. \end{aligned}$$
(16)

Each harmonic function \(Y_{nm}\) is a polynomial of degree n of the variables (xyz) on the unit sphere. We will therefore interchangeably use \(Y(\theta ,\phi )\) and Y(p) with \(p=(x,y,z)^T\) and \(x=\sin \theta \cos \phi \), \(y=\sin \theta \sin \phi \), and \(z=\cos \theta \).

To work with real functions we use the notation \(Y^e_{nm}=\mathrm {Re}(Y_{nm})\) and \(Y^o_{nm}=\mathrm {Im}(Y_{nm})\) (\(1 \le m \le n\)), which denote the even and odd components of the harmonics. Note that the zonal harmonics (\(Y_{n0}\)) are real, and that the real spherical harmonics too are orthogonal. In addition, to simplify notation, we interchangeably use a single subscript notation \(Y_s\) with \(s=1,2,3,\ldots \). For this notation we order the real harmonics by their order n and within every order by m, placing each even harmonic before its corresponding odd one, i.e., \(Y_{00},Y_{10},Y^e_{11},Y^o_{11},Y_{20},\ldots \).

We will be interested in surfaces whose reflectance kernel is isotropic and band limited. An example for such kernels is the Lambertian reflectance kernel \(k(\theta )=\max (\cos \theta ,0)\), which can be approximated to a great accuracy with the first nine harmonics (\(n \le 2\)). As is shown in [7], the reflectance function for such surfaces can be expressed as a convolution of the lighting function with the reflectance kernel. As a consequence, the intensity \(I_p\) at a pixel p with surface normal \(n_p\) and albedo \(\rho _p\) can be expressed in the following form. Let \(\ell (\theta ,\phi )\) denote the environment lighting as a function of direction, and let \(\ell (\theta ,\phi ) = \sum _{s=1}^\infty \ell _s Y_s(\theta ,\phi )\) be its harmonic decomposition, then

$$\begin{aligned} I_p = \rho _p \sum _{s=1}^\infty \alpha _s \ell _s k_s Y_s(n_p), \end{aligned}$$
(17)

where \(\alpha _s\) are constants due to the Funk-Hecke (convolution) theorem, \(k(\theta )=\sum _{n=0}^\infty k_{(n)} Y_{n0}(\theta ,\phi )\) is the harmonic expansion of the kernel and \(k_{s(n,m)}=k_{(n)}\).

For Lambertian surfaces we are interested in harmonics up to order 2. The kernel coefficients and the Funk-Hecke constants for these orders are

$$\begin{aligned} k_{(0)}&=\frac{\sqrt{\pi }}{2}\quad \alpha _{(0)}=\pi \nonumber \\ k_{(1)}&=\sqrt{\frac{\pi }{3}}\quad \alpha _{(1)}=\frac{2\pi }{3} \\ k_{(2)}&=\frac{\sqrt{5\pi }}{8}\quad \alpha _{(2)}=\frac{\pi }{4}, \nonumber \end{aligned}$$
(18)

and the harmonics are

$$\begin{aligned} Y_1 = Y_{00}&=\frac{1}{\sqrt{4\pi }}&Y_2 = Y_{10}&=\sqrt{\frac{3}{4\pi }}z \nonumber \\ Y_3 = Y_{11}^e&=\sqrt{\frac{3}{4\pi }}x&Y_4 = Y_{11}^o&=\sqrt{\frac{3}{4\pi }}y \nonumber \\ Y_5 = Y_{20}&=\frac{1}{2}\sqrt{\frac{5}{4\pi }}(3z^2-1)&Y_6 = Y_{21}^e&=3\sqrt{\frac{5}{12\pi }}xz \nonumber \\ Y_7 = Y_{21}^o&=3\sqrt{\frac{5}{12\pi }}yz&Y_8 = Y_{22}^e&=\frac{3}{2}\sqrt{\frac{5}{12\pi }}(x^2-y^2) \nonumber \\ Y_9 = Y_{22}^o&=3\sqrt{\frac{5}{12\pi }}xy. \end{aligned}$$
(19)

1.1 Equivalence of Distances for Isotropic Reflectance Kernels

Claim 3

Under uniformly distributed directional light sources and an isotropic, band-limited reflectance kernel, the \(\ell _2\) distances of normalized intensity vectors \(\hat{v}_p\) and \(\hat{v}_q\) of points p and q with nearby normals \(n_p\) and \(n_q\) are proportional to the spherical distance \(d(n_p,n_q)\) between the corresponding surface normals, i.e.,

$$\begin{aligned} \Vert \hat{v}_p - \hat{v}_q\Vert _2 = c \cdot d(n_p,n_q), \end{aligned}$$

where the constant c depends on the reflectance kernel.

Proof

We prove the claim by evaluating the expression:

$$\begin{aligned} \Vert \hat{v}_p - \hat{v}_q\Vert _2^2 = 2 - 2\hat{v}_p^T\hat{v}_q. \end{aligned}$$

The (un-normalized) vector \(v_{p}\) contains the intensities observed at pixel \(p\) over a set of images \(k=1..K\), each lit by a point light source from direction \(l_k \in \mathcal{S}^2\) whose magnitudes are equal. Using (17), the intensity \(I_{kp}\) of p can be written as,

$$\begin{aligned} I_{kp}=\rho _p \sum _{s=1}^\infty \ell _{ks} c_s Y_s(n_p), \end{aligned}$$

where \(\rho _p\) and \(n_p\) respectively are the albedo and normal at p, \(\ell _{ks}\) are the harmonic coefficients of the light in the k’th image, and \(c_s=\alpha _s k_s\) are constants whose values depend on the reflectance kernel. As we will be interested in normalized intensity vectors \(\hat{v}_p\), we can assume w.l.o.g. that \(\rho _p=1\) and likewise that the lights are of unit magnitudes.

The inner product of two intensity vectors \(v_{p}\) and \(v_{q}\) is given by,

$$\begin{aligned} v_{p}^Tv_{q}= & {} \sum _{k}^K \left( \sum _{s=1}^\infty \ell _{ks} c_s Y_s(n_p)\right) \left( \sum _{t=1}^\infty \ell _{kt} c_t Y_t(n_q)\right) \\= & {} \sum _{s=1}^\infty \sum _{t=1}^\infty N_{st} L_{st}, \end{aligned}$$

where,

$$\begin{aligned} L_{st}= & {} \sum _{k}^K \ell _{ks} \ell _{kt}\\ N_{st}= & {} c_s k_s Y_s(n_i) c_t k_t Y_t(n_j). \end{aligned}$$

Under the assumption that the light in the K images is distributed uniformly \(L_{st}=\mathbbm {1}_{s=t}\). This is because for a point source light at direction \(l_{k}\), which is expressed as a Dirac delta function \(\delta _{l_{k}}\), \(\ell _{ks}=<\delta _{l_{k}},Y_s>=Y_s(l_{k})\), so that,

$$\begin{aligned} L_{st}=\sum _{k}^K Y_s(l_{k}) Y_t(l_{k}), \end{aligned}$$

and for uniform light of unit intensity, we get

$$\begin{aligned} L_{st} \approx \int _{S^2}Y_s(l) Y_t(l) dl=\mathbbm {1}_{s=t,} \end{aligned}$$

due to the orthonormality of the spherical harmonics. The inner product \(v_{p}^Tv_{q}\) therefore simplifies to

$$\begin{aligned} v_{p}^Tv_{q} = \sum _{s=1}^\infty N_{s}, \end{aligned}$$

where \(N_s=c^2_s Y_s(n_p) Y_s(n_q)\).

We notice next that \(N_s\) can be expressed as a univariate polynomial in \(z=\cos \theta \), where \(\theta \) is the angle between \(n_p\) and \(n_q\). As the inner product \(v_{p}^Tv_{q}\) and the spherical harmonics are invariant to a global rotation of the normals, we can orient our coordinate system so that \(n_p=[0,0,1]^T\) and \(n_q=[\sin \theta , 0, \cos \theta ]^T\) (and \(\phi =0\)). In this coordinate frame \(Y_s(n_p)\) is constant and \(Y_s(n_q)\) is a (scaled) associated Legendre function, which is polynomial of degree n in \(z=\cos \theta \). We can therefore write

$$\begin{aligned} N_s = \sum _{r=0}^n a_{sr}z^r, \end{aligned}$$

with coefficients \(a_{sr}\) that depend on the reflectance kernel and the harmonic order, s. Since \(n_p\) and \(n_q\) are nearby, we use the Taylor approximation \(\cos \theta =1-\frac{\theta ^2}{2}+O(\theta ^4)\), which yields,

$$\begin{aligned} N_s = \sum _{r=0}^n a_{sr}(1-\frac{\theta ^2}{2})^r + O(\theta ^4) = a_{s}-b_{s}\frac{\theta ^2}{2}+O(\theta ^4), \end{aligned}$$

where \(a_s=\sum _{r=0}^n a_{sr}\) and \(b_s=\sum _{r=1}^n r a_{sr}\). We now have,

$$\begin{aligned} v_{p}^Tv_{q} = \sum _{s=1}^\infty \left( a_{s} - b_{s}\frac{\theta ^2}{2} + O(\theta ^4) \right) . \end{aligned}$$

We further assume that the reflectance kernel is band limited, so that harmonic terms of orders \(n>N\) for a finite N can be omitted. Therefore,

$$\begin{aligned} v_{p}^Tv_{q} = \sum _{s=1}^S \left( a_{s} - b_{s}\frac{\theta ^2}{2}\right) + O(N^2\theta ^4), \end{aligned}$$

where \(S=(N+1)^2\). We further denote \(a=\sum _{s=1}^S a_s\) and \(b=\sum _{s=1}^S b_s\).

\(v_{p}^Tv_{p}\) can be computed simply by plugging \(\theta =0\) in the previous expression, obtaining \(v_{p}^Tv_{p} = a\), and \(v_q^Tv_q=v_p^Tv_p\) due to rotation invariance.

We can now evaluate the inner product between the normalized intensity vectors,

$$\begin{aligned} \hat{v}_{p}^T \hat{v}_{q}= & {} \frac{v_{p}^Tv_{q}}{\Vert v_{p}\Vert \Vert v_{q}\Vert } = \frac{a - b \frac{\theta ^2}{2}}{a}+O(N^2\theta ^4) \\= & {} 1-\left( \frac{(c\theta )^2}{2}\right) +O(N^2\theta ^4), \end{aligned}$$

where \(c=\sqrt{b/a}\). Note that a is positive since \(a=\Vert v_p\Vert ^2\) and b is positive for sufficiently small \(\theta \), since \(v_p^T v_q<a\). We conclude that, up to \(O(N^2\theta ^4)\) terms,

$$\begin{aligned} \Vert \hat{v}_p - \hat{v}_q\Vert _2 = \sqrt{2 - 2\hat{v}_p^T\hat{v}_q} = c\theta = c \cdot d(n_p,n_q). \end{aligned}$$

\(\square \)

1.2 Intensity Distances to Geodesic Distances for the Lambertian Reflectance Kernel

Claim 4

The reflectance-dependent constant c can be computed analytically for Lambertian reflectance, and is approximately 0.93.

Proof

As proved in the previous claim,

$$\begin{aligned} \Vert \hat{v}_p - \hat{v}_q\Vert _2 = c \cdot d(n_p,n_q). \end{aligned}$$

We now wish to calculate the constant c for the Lambertian kernel. To this end we assume \(n_p=(0,0,1)^T\), \(n_q=(\sin \theta ,0,\cos \theta )^T\) and compute \(N_{s}=\alpha ^2_{(n)} k^2_{(n)} Y_s(n_p) Y_s(n_q)\) as a function of \(\theta \). Using (18)–(19) we get

$$\begin{aligned} N_{1}&=\frac{\pi }{4} \\ N_{2}&=\frac{\pi }{3}\cos \theta \\ N_{5}&=\frac{5 \pi }{128}(3 \cos ^2 \theta -1), \end{aligned}$$

and the rest of the terms vanish. Therefore,

$$\begin{aligned} v_p^T v_q=\sum _{s=1}^9 N_s = \frac{\pi }{4}+\frac{\pi }{3}\cos \theta +\frac{5 \pi }{128}(3 \cos ^2 \theta -1). \end{aligned}$$

Replacing \(\cos \theta \) by its Taylor approximation, \(\cos \theta \simeq 1-\frac{\theta ^2}{2}\), we get that

$$\begin{aligned} v_p^T v_q\simeq \frac{127\pi }{192}-\frac{109\pi }{384}\theta ^2, \end{aligned}$$

and \(v_p^Tv_p=v_q^Tv_q\) are obtained by plugging \(\theta =0\). Denote, \(a=127\pi /192\) and \(b=109\pi /192\), we can now calculate the inner product of the normalized observation vectors \(\hat{v}_p^T \hat{v}_q\),

$$\begin{aligned} \hat{v}_p^T \hat{v}_q = \frac{v_p^T v_q}{\Vert v_p\Vert \Vert v_q\Vert } \simeq \frac{a-b\frac{\theta ^2}{2}}{a} = 1-\frac{b}{a}\frac{\theta ^2}{2}. \end{aligned}$$

Let \(c=\sqrt{\frac{b}{a}} \approx 0.93\). We obtain,

$$\begin{aligned} \Vert \hat{v}_p - \hat{v}_q\Vert _2 \approx 0.93\theta . \end{aligned}$$

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Bartal, O., Ofir, N., Lipman, Y. et al. Photometric Stereo by Hemispherical Metric Embedding. J Math Imaging Vis 60, 148–162 (2018). https://doi.org/10.1007/s10851-017-0748-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10851-017-0748-y

Keywords

Navigation