Online learning and controller adaptation will be an essential component for legged robots in the next few years as they begin to leave the laboratory setting and join our world. I present the first example of a learning system which is able to quickly and reliably acquire a robust feedback control policy for 3D dynamic bipedal walking from a blank slate using only trials implemented on the physical robot. The robot begins walking within a minute and learning converges in approximately 20 minutes. The learning works quickly enough that the robot is able to continually adapt to the terrain as it walks. This success can be attributed in part to the mechanics of our robot, which is capable of stable walking down a small ramp even when the computer is turned off. In this thesis, I analyze the dynamics of passive dynamic walking, starting with reduced planar models and working up to experiments on our real robot. I describe, in detail, the actor-critic reinforcement learning algorithm that is implemented on the return map dynamics of the biped. Finally, I address issues of scaling and controller augmentation using tools from optimal control theory and a simulation of a planar one-leg hopping robot. These learning results provide a starting point for the production of robust and energy efficient walking and running robots that work well initially, and continue to improve with experience. (Copies available exclusively from MIT Libraries, Rm. 14-0551, Cambridge, MA 02139-4307. Ph. 617-253-5668; Fax 617-253-1690.)
Cited By
- You Y, Zhou C, Li Z and Tsagarakis N A study of nonlinear forward models for dynamic walking 2017 IEEE International Conference on Robotics and Automation (ICRA), (3491-3496)
- Li Q, Chatzinikolaidis I, Yang Y, Vijayakumar S and Li Z Robust foot placement control for dynamic walking using online parameter estimation 2017 IEEE-RAS 17th International Conference on Humanoid Robotics (Humanoids), (165-170)
- Muico U, Lee Y, Popović J and Popović Z Contact-aware nonlinear control of dynamic characters ACM SIGGRAPH 2009 papers, (1-9)
- Iida F and Tedrake R Minimalistic control of a compass gait robot in rough terrain Proceedings of the 2009 IEEE international conference on Robotics and Automation, (3246-3251)
- Muico U, Lee Y, Popović J and Popović Z (2009). Contact-aware nonlinear control of dynamic characters, ACM Transactions on Graphics (TOG), 28:3, (1-9), Online publication date: 27-Jul-2009.
- da Silva M, Abe Y and Popović J Interactive simulation of stylized human locomotion ACM SIGGRAPH 2008 papers, (1-10)
- Chu B, Hong D, Park J and Chung J (2008). Passive dynamic walker controller design employing an RLS-based natural actor-critic learning algorithm, Engineering Applications of Artificial Intelligence, 21:7, (1027-1034), Online publication date: 1-Oct-2008.
- da Silva M, Abe Y and Popović J (2008). Interactive simulation of stylized human locomotion, ACM Transactions on Graphics, 27:3, (1-10), Online publication date: 1-Aug-2008.
- Barbič J and Popović J Real-time control of physically based simulations using gentle forces ACM SIGGRAPH Asia 2008 papers, (1-10)
- Ayala D and Bahón C Emerging behaviors by learning joint coordination in articulated mobile robots Proceedings of the 9th international work conference on Artificial neural networks, (806-813)
Recommendations
Gait detection based stable locomotion control system for biped robots
It is a challenge to maintain a steady and stable locomotion when a biped robot navigates an uneven surface or a step. Firstly it needs to detect the gait of the robot and related environmental objectives, and then to perform appropriate controls of ...
Capturability-based analysis and control of legged locomotion, Part 2: Application to M2V2, a lower-body humanoid
This two-part paper discusses the analysis and control of legged locomotion in terms of N-step capturability: the ability of a legged system to come to a stop without falling by taking N or fewer steps. We consider this ability to be crucial to legged ...