Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1080402.1080419acmconferencesArticle/Chapter ViewAbstractPublication PagesapgvConference Proceedingsconference-collections
Article

Depth-of-field-based alpha-matte extraction

Published: 26 August 2005 Publication History

Abstract

In compositing applications, objects depicted in images frequently have to be separated from their background, so that they can be placed in a new environment. Alpha mattes are important tools aiding the selection of objects, but cannot normally be created in a fully automatic way. We present an algorithm that requires as input two images---one where the object is in focus, and one where the background is in focus---and then automatically produces an alpha matte indicating which pixels belong to the object. This algorithm is inspired by human visual processing and involves nonlinear response compression, center-surround mechanisms as well as a filling-in stage. The output can then be refined with standard computer vision techniques.

References

[1]
Adelson, E. H. 1982. Saturation and adaptation in the rod system. Vision Research 22, 1299--1312.
[2]
Boykov, Y., and Jolly, M.-P. 2001. Interactive graph cuts for optimal boundary and region segmentation of objects in nd images. In Proceedings of IEEE International Conference on Computer Vision.
[3]
Caselles, V., Kimmel, R., and Sapiro, G. 1997. Geodesic active contours. International Journal of Computer Vision 22, 1, 61--79.
[4]
Chuang, Y.-Y., Curless, B., Salesin, D., and Szeliski, R. 2001. A bayesian approach to digital matting. In Proceedings of Computer Vision and Pattern Recognition (CVPR 2001), vol. II, 264--271.
[5]
Chuang, Y.-Y., Agarwala, A., Curless, B., Salesin, D. H., and Szeliski, R. 2002. Video matting of complex scenes. In SIGGRAPH '02: Proceedings of the 29th annual conference on Computer graphics and interactive techniques, ACM Press, New York, NY, USA, 243--248.
[6]
Dakin, S. C., and Bex, P. J. 2003. Natural image statistics mediate brightness filling in. Proceedings of the Royal Society of London, B 270, 2341--2348.
[7]
Debevec, P. E., and Malik, J. 1997. Recovering high dynamic range radiance maps from photographs. In SIGGRAPH 97 Conference Proceedings, Annual Conference Series, 369--378.
[8]
Dowling, J. E. 1987. The retina: an approachable part of the brain. Belknap Press, Cambridge.
[9]
Elder, J. H. 1999. Are edges incomplete? International Journal of Computer Vision 34, 2/3, 97--122.
[10]
Ens, J., and Lawrence, P. 1993. An investigation of methods for determining depth from focus. IEEE Transactions on Pattern Analysis and Machine Intelligence 15, 2, 97--108.
[11]
Greig, D., Porteous, B., and Seheult, A. 1989. Exact MAP estimation for binary images. Journal of the Royal Statistical Society, B 51, 271--279.
[12]
Grossberg, S., and Mingolla, E. 1985. Neural dynamics of form perception: boundary adaptation, illusory figures, and neon color spreading. Psychological Review 92, 173--211.
[13]
Hood, D. C., and Finkelstein, M. A. 1979. Comparison of changes in sensitivity and sensation: implications for the response-intensity function of the human photopic system. Journal of Experimental Pssychology: Human Perceptual Performance 5, 391--405.
[14]
Horn, B. K. P., 1968. Focusing. memo 160.
[15]
ITU. 1990. International Telecommunication Union ITU-R Recommendation BT.709, Basic Parameter Values for the HDTV Standard for the Studio and for International Programme Exchange. Geneva. Formerly CCIR Rec. 709.
[16]
Kleinschmidt, J., and Dowling, J. E. 1975. Intracellular recordings from gecko photoreceptors during light and dark adaptation. J gen Physiol 66, 617--648.
[17]
Krauskopf, J. 1963. Effect of retinal stabilization on the appearance of heterochromatic targets. J Opt Soc Am 53, 741--744.
[18]
Kuffler, S. W. 1953. Discharge patterns and functional organization of mammalian retina. Journal of Neurophysiology 16, 37--68.
[19]
Li, Y., Sun, J., Tang, C.-K., and Shum, H.-Y. 2004. Lazy snapping. ACM Transactions on Graphics 23, 3, 303--308.
[20]
Marr, D. 1982. Vision, a computational investigation into the human representation and processing of visual information. W H Freeman and Company, San Fransisco.
[21]
Marshall, J. A., Burbeck, C. A., Ariely, D., Rolland, J. P., and Martin, K. E. 1996. Occlusion edge blur: a cue to relative visual depth. Journal of the Optical Society of America A 13, 681--688.
[22]
McGuire, M., Matusik, W., Pfister, H., Hughes, J. F., and Durand, F. 2005. Defocus video matting. ACM Transactions on Graphics 24, 3.
[23]
Mitsunaga, T., Yokoyama, T., and Totsuka, T. 1995. Autokey: human assisted key extraction. In SIGGRAPH '95: Proceedings of the 22nd annual conference on Computer graphics and interactive techniques, ACM Press, New York, NY, USA, 265--272.
[24]
Mortensen, E. N., and Barrett, W. A. 1995. Intelligent scissors for image composition. In SIGGRAPH '95: Proceedings of the 22nd annual conference on Computer graphics and interactive techniques, ACM Press, New York, NY, USA, 191--198.
[25]
Mortensen, E. N., and Barrett, W. A. 1999. Toboganbased intelligent scissors with a four parameter edge model. In Proceedings IEEE Conference on Computer Vision and Pattern Recognition, vol. 2, 452--458.
[26]
Naka, K. I., and Rushton, W. A. H. 1966. S-potentials from luminosity units in the retina of fish (cyprinidae). J Physiol 185, 587--599.
[27]
Nayar, S. K., and Nakagawa, Y. 1994. Shape from focus: an effective approach for rough surfaces. IEEE Transactions on Pattern Analysis and Machine Intelligence 16, 8, 824--831.
[28]
Neumann, H., Pessoa, L., and Hansen, T. 2001. Visual filling-in for computing perceptual surface properties. Biological Cybernetics 85, 355--369.
[29]
Palmer, S. E. 1999. Vision Science: Photons to Phenomenology. The MIT Press, Cambridge, Massachusetts.
[30]
Pattanaik, S. N., Tumblin, J., Yee, H., and Greenberg, D. P. 2000. Time-dependent visual adaptation for fast realistic display. In SIGGRAPH 2000 Conference Proceedings, 47--54.
[31]
Pentland, A. P. 1987. A new sense for depth of field. IEEE Transactions on Pattern Analysis and Machine Intelligence 9, 4, 523--531.
[32]
Perona, P., and Malik, J. 1990. Scale-space and edge detection using anisotropic diffusion. IEEE Transactions on Pattern Analysis and Machine Intelligence 12, 7, 629--639.
[33]
Reinhard, E., and Devlin, K. 2005. Dynamic range reduction inspired by photoreceptor physiology. IEEE Transactions on Visualization and Computer Graphics 11, 1 (January/February), 13--24.
[34]
Reinhard, E., Stark, M., Shirley, P., and Ferwerda, J. 2002. Photographic tone reproduction for digital images. ACM Transactions on Graphics 21, 3, 267--276.
[35]
Reinhard, E., Ward, G., Pattanaik, S., and Debevec, P. 2005. High Dynamic Range Imaging: Acquisition, Display and Image-Based Lighting. Morgan Kaufmann Publishers, San Francisco.
[36]
Rother, C., Kolmogorov, V., and Blake, A. 2004. "grab-cut" --- interactive foreground extraction using iterated graph cuts. ACM Transactions on Graphics 23, 3, 309--314.
[37]
Ruderman, D. L. 1997. Origins of scaling in natural images. Vision Research 37, 3385--3398.
[38]
Ruderman, D. L. 1997. The statistics of natural images. Network: Computation in Neural Systems 5, 4, 517--548.
[39]
Schlick, C. 1994. Quantization techniques for the visualization of high dynamic range pictures. In Photorealistic Rendering Techniques, Springer-Verlag Berlin Heidelberg New York, P. Shirley, G. Sakas, and S. Müller, Eds., 7--20.
[40]
Subbarao, M., and Surya, G. 1994. Depth from defocus: a spatial domain approach. International Journal of Computer Vision 13, 3, 271--294.
[41]
Subbarao, M. 1988. Parallel depth recovery by changing camera parameters. In Proc. of the International Conference on Computer Vision, 149--155.
[42]
Sun, J., Jia, J., Tang, C.-K., and Shum, H.-Y. 2004. Poisson matting. ACM Trans. Graph. 23, 3, 315--321.
[43]
Tumblin, J., Hodgins, J. K., and Guenter, B. K. 1999. Two methods for display of high contrast images. ACM Transactions on Graphics 18 (1), 56--94.
[44]
Walls, G. 1954. The filling-in process. American Journal of Optometry 31, 329--340.
[45]
Wang, J. Z., Li, J., Gray, R. M., and Wiederhold, G. 2001. Unsupervised multiresolution segmentation for images with low depth of field. IEEE Transactions on Pattern Analysis and Machine Intelligence 23, 1, 85--90.
[46]
Ward, G. 2003. Fast, robust image registration for compositing high dynamic range photographs from hand-held exposures. Journal of Graphics Tools 8, 2, 17--30.
[47]
Watanabe, M., and Nayar. K. 1997. Rational filters for passive depth from defocus. International Journal of Computer Vision 27, 3.
[48]
Weickert, J. 1997. A review of non-linear diffusion filtering. In Scale-Space Theory in Computer Vision, Springer-Verlag, Berlin, B. ter Haar Romeny, L. Florac, J. Koenderink, and M. Viergever, Eds., 3--28.
[49]
Witkin, A. P. 1983. Scale-space filtering. In Proceedings of the Eighth International Joint Conference on Artificial Intelligence, 2, 1019--1022.
[50]
Won, C. S., Pyun, K., and Gray, R. M. 2002. Automatic object segmentation in images with low depth of field. In Proceedings of IEEE International Conference on Image Processing, III.805 - III.808.

Cited By

View all
  • (2018)Automated human body measurement extractionInternational Journal of Clothing Science and Technology10.1108/IJCST-01-2017-000230:2(175-194)Online publication date: 21-Mar-2018
  • (2014)Automatic defocus spectral matting2014 IEEE International Conference on Image Processing (ICIP)10.1109/ICIP.2014.7025879(4328-4332)Online publication date: Oct-2014
  • (2012)Unsupervised and reliable image matting based on modified spectral mattingJournal of Visual Communication and Image Representation10.1016/j.jvcir.2012.03.00323:4(665-676)Online publication date: 1-May-2012
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
APGV '05: Proceedings of the 2nd symposium on Applied perception in graphics and visualization
August 2005
187 pages
ISBN:1595931392
DOI:10.1145/1080402
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 26 August 2005

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. alpha-mattes
  2. depth of field
  3. high dynamic range imaging
  4. human visual perception

Qualifiers

  • Article

Conference

APGV05
Sponsor:

Acceptance Rates

Overall Acceptance Rate 19 of 33 submissions, 58%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 25 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2018)Automated human body measurement extractionInternational Journal of Clothing Science and Technology10.1108/IJCST-01-2017-000230:2(175-194)Online publication date: 21-Mar-2018
  • (2014)Automatic defocus spectral matting2014 IEEE International Conference on Image Processing (ICIP)10.1109/ICIP.2014.7025879(4328-4332)Online publication date: Oct-2014
  • (2012)Unsupervised and reliable image matting based on modified spectral mattingJournal of Visual Communication and Image Representation10.1016/j.jvcir.2012.03.00323:4(665-676)Online publication date: 1-May-2012
  • (2011)Removal of Partial Occlusion from Single ImagesIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2010.18733:3(647-654)Online publication date: 1-Mar-2011
  • (2011)Nonlocal mattingProceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition10.1109/CVPR.2011.5995665(2193-2200)Online publication date: 20-Jun-2011
  • (2010)GrabcutDProceedings of the 2010 ACM workshop on Surreal media and virtual cloning10.1145/1878083.1878099(57-62)Online publication date: 29-Oct-2010
  • (2007)Automated removal of partial occlusion blurProceedings of the 8th Asian conference on Computer vision - Volume Part I10.5555/1775614.1775645(271-281)Online publication date: 18-Nov-2007
  • (2007)Dynamic Adaptation of Projected Imperceptible CodesProceedings of the 2007 6th IEEE and ACM International Symposium on Mixed and Augmented Reality10.1109/ISMAR.2007.4538845(1-10)Online publication date: 13-Nov-2007
  • (2007)Automated Removal of Partial Occlusion BlurComputer Vision – ACCV 200710.1007/978-3-540-76386-4_25(271-281)Online publication date: 2007
  • (2006)Image-based material editingACM SIGGRAPH 2006 Papers10.1145/1179352.1141937(654-663)Online publication date: 30-Jul-2006
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media