Revisiting active perception

Published: 01 February 2018

Abstract

Despite the recent successes in robotics, artificial intelligence and computer vision, a complete artificial agent must necessarily include active perception. A multitude of ideas and methods for how to accomplish this have already appeared in the past, their broader utility perhaps impeded by insufficient computational power or costly hardware. The history of these ideas, perhaps selective due to our perspectives, is presented with the goal of organizing the past literature and highlighting the seminal contributions. We argue that those contributions are as relevant today as they were decades ago and, with the state of modern computational tools, are poised to find new life in the robotic perception systems of the next decade.

Published In

Autonomous Robots, Volume 42, Issue 2, February 2018, 110 pages

Publisher

Kluwer Academic Publishers, United States

Author Tags

  1. Attention
  2. Control
  3. Perception
  4. Sensing
