Abstract
The extraction of depth information associated with dynamic scenes is an intriguing topic because of its prospective role in many applications, including free-viewpoint and 3D video systems. Time-of-flight (ToF) range cameras allow the acquisition of depth maps at video rate, but they are characterized by a limited resolution, especially when compared with standard color cameras. This paper presents a super-resolution method for depth maps that exploits side information from a standard color camera: the proposed method uses a segmented version of the high-resolution color image acquired by the color camera to identify the main objects in the scene, and a novel surface prediction scheme to interpolate the depth samples provided by the ToF camera. Effective solutions are provided for critical issues such as the joint calibration of the two devices and the unreliability of the acquired data. Experimental results on both synthetic and real-world scenes show that the proposed method yields more accurate interpolation than standard interpolation approaches and state-of-the-art joint depth and color interpolation schemes.
Notes
In the following description, we will call samples the input depth values obtained by reprojecting the ToF data, and pixels, the output pixels of the high-resolution depth map.
Threshold values of 0.1 and 0.4 refer to a depth value range between 0 and 1.
The errors reported in this section are measured in pixels on the high-resolution image of the color cameras.
The acquired data for this setup is available online at the address http://lttm.dei.unipd.it/downloads/superres/.
In both cases, we simply warped the images using a 3D mesh built from the depth data; no ad hoc post-processing algorithms were used.
Appendix: Bilinear interpolation on nonregular grids
In the proposed approach, after the calibration step, the available samples are not regularly distributed over a lattice. This appendix shows how the well-known bilinear interpolation scheme can be extended to nonregular grids.
Referring to Fig. 22, the depth of the red point $\mathbf{p}(x, y)$ is estimated from the depths $D_i = D(\mathbf{p}_i)$, $i = 1, \dots, 4$, of the four blue samples $\mathbf{p}_i(x_i, y_i)$. The procedure works in two steps: first, we estimate the depths of the two yellow points $\mathbf{p}_a(x_a, y_a) = \mathbf{p}_a(x, y_a)$ and $\mathbf{p}_b(x_b, y_b) = \mathbf{p}_b(x, y_b)$; then, the depth of $\mathbf{p}$ is computed by interpolating those of $\mathbf{p}_a$ and $\mathbf{p}_b$. Let $\Delta x_i = |\mathbf{p}_i - \mathbf{p}|_x = |x_i - x|$ and $\Delta y_i = |\mathbf{p}_i - \mathbf{p}|_y = |y_i - y|$, $i = 1, \dots, 4$, denote the absolute differences between the $x$ and $y$ coordinates of the available low-resolution samples (blue) and those of the point being estimated (red), i.e., the absolute values of the $x$ and $y$ components of the vectors connecting the samples $\mathbf{p}_i$ with $\mathbf{p}$. First, the depth $D_a \triangleq D(\mathbf{p}_a)$ of point $\mathbf{p}_a(x, y_a)$ is estimated by linearly interpolating the depths of $\mathbf{p}_1$ and $\mathbf{p}_2$:
$$\hat{D}_a = C_1 D_1 + C_2 D_2 \tag{13}$$
where $C_1 = \Delta x_2 / (\Delta x_1 + \Delta x_2)$ and $C_2 = \Delta x_1 / (\Delta x_1 + \Delta x_2)$. The same procedure is applied to estimate the depth $D_b \triangleq D(\mathbf{p}_b)$ of $\mathbf{p}_b(x, y_b)$ from $\mathbf{p}_3$ and $\mathbf{p}_4$:
$$\hat{D}_b = C_3 D_3 + C_4 D_4 \tag{15}$$
where $C_3 = \Delta x_4 / (\Delta x_3 + \Delta x_4)$ and $C_4 = \Delta x_3 / (\Delta x_3 + \Delta x_4)$. The vertical coordinates $\Delta y_a = y_a - y$ and $\Delta y_b = y - y_b$ of $\mathbf{p}_a$ and $\mathbf{p}_b$ with respect to $\mathbf{p}$ can be computed as follows (since $\mathbf{p}_a$ and $\mathbf{p}_b$ lie on the segments $\overline{\mathbf{p}_1 \mathbf{p}_2}$ and $\overline{\mathbf{p}_3 \mathbf{p}_4}$, their ordinates are interpolated with the same horizontal weights):
$$\Delta y_a = C_1 y_1 + C_2 y_2 - y, \qquad \Delta y_b = y - (C_3 y_3 + C_4 y_4)$$
In the second step, the depths $\hat{D}_a$ and $\hat{D}_b$ of $\mathbf{p}_a$ and $\mathbf{p}_b$ are linearly interpolated to obtain the depth of $\mathbf{p}$:
$$\hat{D}(\mathbf{p}) = C_a \hat{D}_a + C_b \hat{D}_b \tag{20}$$
$$\hat{D}(\mathbf{p}) = \gamma_1 D_1 + \gamma_2 D_2 + \gamma_3 D_3 + \gamma_4 D_4 \tag{21}$$
where $C_a = \Delta y_b / (\Delta y_a + \Delta y_b)$, $C_b = \Delta y_a / (\Delta y_a + \Delta y_b)$, $\gamma_1 = C_a C_1$, $\gamma_2 = C_a C_2$, $\gamma_3 = C_b C_3$, and $\gamma_4 = C_b C_4$. Equation 21 is obtained by replacing $\hat{D}_a$ and $\hat{D}_b$ in Eq. 20 with their expressions from Eqs. 13 and 15. Note that the final result is a weighted average of the four samples, where the weights depend on the positions of the samples, as in standard bilinear interpolation. This approach is applied directly to the low-resolution samples when the segmented region contains all four samples; otherwise, the missing samples are first estimated by the methods of Section 3.2, and then Eq. 22 is applied.
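To make the two-step procedure concrete, the following is a minimal Python sketch of Eqs. 13–21. It is not the authors' implementation: the function name, the argument layout, and the assumption that $\mathbf{p}_1, \mathbf{p}_2$ lie above the target point and $\mathbf{p}_3, \mathbf{p}_4$ below it (as in Fig. 22) are illustrative choices.

```python
import numpy as np

def bilinear_nonregular(p, samples, depths):
    """Depth at p from four irregularly placed samples (Eqs. 13-21).

    p       : (x, y) of the point to estimate (red point in Fig. 22).
    samples : 4x2 array, rows (x_i, y_i) for p1..p4
              (p1, p2 above p; p3, p4 below p).
    depths  : length-4 array with D_1..D_4.
    """
    x, y = p
    dx = np.abs(samples[:, 0] - x)        # horizontal distances Delta x_i

    # Step 1: interpolate each pair of samples along x (Eqs. 13 and 15).
    c1 = dx[1] / (dx[0] + dx[1])          # C1 = Dx2 / (Dx1 + Dx2)
    c2 = dx[0] / (dx[0] + dx[1])
    c3 = dx[3] / (dx[2] + dx[3])
    c4 = dx[2] / (dx[2] + dx[3])
    d_a = c1 * depths[0] + c2 * depths[1]   # depth estimate at p_a
    d_b = c3 * depths[2] + c4 * depths[3]   # depth estimate at p_b

    # Ordinates of p_a and p_b, interpolated with the same weights,
    # give the vertical offsets Delta y_a and Delta y_b from p.
    dy_a = c1 * samples[0, 1] + c2 * samples[1, 1] - y
    dy_b = y - (c3 * samples[2, 1] + c4 * samples[3, 1])

    # Step 2: interpolate along y between p_a and p_b (Eq. 20).
    c_a = dy_b / (dy_a + dy_b)
    c_b = dy_a / (dy_a + dy_b)
    return c_a * d_a + c_b * d_b

# Sanity check: on a regular unit grid the result matches standard
# bilinear interpolation (here 1.75 at the query point).
pts = np.array([[0.0, 1.0], [1.0, 1.0], [0.0, 0.0], [1.0, 0.0]])
print(bilinear_nonregular((0.25, 0.75), pts, np.array([1.0, 2.0, 3.0, 4.0])))
```

On a regular grid, the weights $\gamma_i$ reduce to those of standard bilinear interpolation, which provides a quick sanity check for the sketch.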
Cite this article
Garro, V., Dal Mutto, C., Zanuttigh, P. et al. Edge-preserving interpolation of depth data exploiting color information. Ann. Telecommun. 68, 597–613 (2013). https://doi.org/10.1007/s12243-013-0389-0