Abstract
Robot learning from demonstration (LfD) emerges as a promising solution to transfer human motion to the robot. However, because of the open-loop between the learner and task constraints, the precision of the reproduction at the desired task constraints cannot always be guaranteed and the model is not robust to changes of the training data. This paper proposes a closed-loop framework of LfD based on the bagging method of Gaussian Mixture Model and Gaussian Mixture Regression (GMM/GMR) to obtain a robust learner of LfD with high precision reproduction. The original demonstration data are divided into several sub-training data, from which multiple Gaussian mixture models are developed and combined through weighted average to provide predictions. A closed-loop is built between the reproduction of the combined learner and task constraints, and the weights that satisfy task constraints are estimated in the closed-loop. The prediction uncertainty of the models is automatically eliminated by the closed-loop, therefore, the low robustness of the LfD model to the training date is overcome. In experiments, tasks of the position and velocity are both constrained in dual closed-loop. It is shown that the proposed method can significantly meet the task constraints without increasing the complexity of the algorithm.
Similar content being viewed by others
Abbreviations
- t :
-
Time step
- \(\varvec{\xi }^o\), \(\varvec{\xi }\) :
-
Training data before/after DTW
- \(\varvec{\xi }^s\) :
-
Spatial component of the training data
- \(\varvec{\Theta }\) :
-
Parameters of GMM
- \(\pi _k\) :
-
Prior probability of a Gaussian distribution in a GMM
- \(\varvec{\beta }_k\) :
-
The probability of the component k to be responsible for t
- \(\varvec{\mu }_k\) :
-
Mean of a Gaussian component
- \(\varvec{\Sigma }_k\) :
-
Covariance matrix of a Gaussian distribution
- K :
-
Number of Gaussian components
- \({\mathcal {N}}(\varvec{\mu _k},\varvec{\Sigma _k})\) :
-
Gaussian distribution described by mean \(\varvec{\mu }_k\) and covariance matrix \(\varvec{\Sigma }_k\)
- \({\mathcal {N}}(\varvec{\xi }_i;\varvec{\mu }_k,\varvec{\Sigma }_k) \) :
-
Probability of \(\varvec{\xi }_i\) where the density function is a Gaussian distribution
- \(\hat{\varvec{\xi }^s}\) :
-
Expected mean of the reproduction (spatial component)
- \(\hat{\varvec{\Sigma }}^{ss}\) :
-
Expected covariance of the reproduction (spatial component)
- Q :
-
The number of the base learners
- D :
-
Spatial dimensionality
- N :
-
Number of datapoints
- M :
-
Demonstration number
- L :
-
Key-points number
- \(G,G^{'}\) :
-
Combined learner
- \(G_i\), \(G^{'}_i\) :
-
The i-th base learner
- \(y_i\), y :
-
The output of \(G_i\)/G
- \(\omega _i\) :
-
The generalized weight of base learner \(G_i\)
- \({\mathbb {C}}\) :
-
Task constraints
- \({\mathbf {A}}^d\) :
-
The d-th dimension of \({\mathbb {C}}\)
- \({\mathbf {B}}^d\) :
-
\(L \times Q\) matrix consisted of the output of each base learners
- \({\mathbf {C}}_q\) :
-
\(D \times D\) diagonal matrix
- \(p_i\) :
-
The i-th key-position point
- \(\varvec{d}_i\),\(\varvec{d}^{'}_i\) :
-
The distance between the reproduction of GMR/Bagging-GMR and the key-points \(p_i\)
References
Argall, B. D., Chernova, S., Veloso, M., & Browning, B. (2009). A survey of robot learning from demonstration. Robotics and Autonomous Systems, 57(5), 469–483.
Bänziger, T., Kunz, A., & Wegener, K. (2018). Optimizing human-robot task allocation using a simulation tool based on standardized work descriptions. Journal of Intelligent Manufacturing, 31(7), 1635–1648.
Breiman, L. (1996). Bagging predictors. Machine Learning, 24(2), 123–140.
Calinon, S. (2016). A tutorial on task-parameterized movement learning and retrieval. Intelligent Service Robotics, 9(1), 1–29.
Calinon, S. (2020). Lasa handwriting dataset library. Available in: https://gitlab.idiap.ch/rli/pbdlib-matlab/. Accessed 12 Mar 2018.
Calinon, S., Alizadeh, T., & Caldwell, D. G. (2013). On improving the extrapolation capability of task-parameterized movement models. In: 2013 IEEE/RSJ international conference on intelligent robots and systems (IROS) (pp. 610–616). IEEE.
Calinon, S., & Billard, A. (2007). Incremental learning of gestures by imitation in a humanoid robot. In: Proceedings of the ACM/IEEE international conference on human–robot interaction, (pp. 255–262). ACM.
Calinon, S., & Billard, A. (2007). Active teaching in robot programming by demonstration. In: The 16th IEEE international symposium on robot and human interactive communication, 2007. RO-MAN 2007 (pp. 702–707). IEEE.
Calinon, S., & Billard, A. (2008). A probabilistic programming by demonstration framework handling constraints in joint space and task space. In: Intelligent robots and systems, 2008. IROS 2008. IEEE/RSJ international conference on (pp. 367–372). IEEE.
Calinon, S., Guenter, F., & Billard, A. (2007). On learning, representing, and generalizing a task in a humanoid robot. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 37(2), 286–298.
Chiu, C.-Y., Chao, S.-P., Ming-Yang, W., Yang, S.-N., & Lin, H.-C. (2004). Content-based retrieval for human motion data. Journal of Visual Communication and Image Representation, 15(3), 446–466.
Chen, T., & Ren, J. (2009). Bagging for gaussian process regression. Neurocomputing, 72(7–9), 1605–1610.
Chernova, S., & Thomaz, A. L. (2014). Robot learning from human teachers. Synthesis Lectures on Artificial Intelligence and Machine Learning, 8(3), 1–121.
Derigent, W., Cardin, O., & Trentesaux, D. (2020). Industry 4.0: Contributions of holonic manufacturing control architectures and future challenges. Journal of Intelligent Manufacturing. https://doi.org/10.1007/s10845-020-01532-x.
Duque, D. A., Prieto, F. A., & Hoyos, J. G. (2019). Trajectory generation for robotic assembly operations using learning by demonstration. Robotics and Computer-Integrated Manufacturing, 57, 292–302.
Farahani, M. D., & Mozayani, N. (2020). Acquiring reusable skills in intrinsically motivated reinforcement learning. Journal of Intelligent Manufacturing. https://doi.org/10.1007/s10845-020-01629-3.
Friedman, J. H., & Hall, P. (2007). On bagging and nonlinear estimation. Journal of Statistical Planning and Inference, 137(3), 669–683.
Huang, Y., Rozo, L., Silvério, J., & Caldwell, D. G. (2019). Kernelized movement primitives. International Journal of Robotics Research, 38(7), 833–852.
Huang, Y., Silvério, J., Rozo, L., & Caldwell, D. G. (2018). Generalized task-parameterized skill learning. In: 2018 IEEE international conference on robotics and automation (ICRA) (pp. 1–5). IEEE.
Ijspeert, A. J., Nakanishi, J., Hoffmann, H., Pastor, P., & Schaal, S. (2013). Dynamical movement primitives: Learning attractor models for motor behaviors. Neural Computation, 25(2), 328–373.
Khansari-Zadeh, S. M., & Billard, A. (2010). BM: An iterative algorithm to learn stable non-linear dynamical systems with Gaussian mixture models. In: 2010 IEEE international conference on robotics and automation (pp. 2381–2388). IEEE.
Lemme, A., Meirovitch, Y., Khansari-Zadeh, M., Flash, T., Billard, A., & Steil, J. J. (2015). Open-source benchmarking for learned reaching motion generation in robotics. Paladyn, Journal of Behavioral Robotics, 6, 30–41.
Micheler, S., Goh, Y., Lohse, N., et al. (2020). A transformation of human operation approach to inform system design for automation. Journal of Intelligent Manufacturing. https://doi.org/10.1007/s10845-020-01568-z.
Ni, J., Tang, W. C., & Xing, Y. (2018). Assembly process optimization for reducing the dimensional error of antenna assembly with abundant rivets. Journal of Intelligent Manufacturing, 29(1), 245–258.
Nielsen, I., Dang, Q.-V., Bocewicz, G., & Banaszak, Z. (2017). A methodology for implementation of mobile robot in adaptive manufacturing environments. Journal of Intelligent Manufacturing, 28(5), 1171–1188.
Paraschos, A., Daniel, C., Peters, J. R., & Neumann, G. (2013). Probabilistic movement primitives. In: Advances in neural information processing systems (pp. 2616–2624).
Paraschos, A., Daniel, C., Peters, J., & Neumann, G. (2018). Using probabilistic movement primitives in robotics. Autonomous Robots, 42(3), 529–551.
Pervez, A., & Lee, D. (2018). Learning task-parameterized dynamic movement primitives using mixture of GMMs. Intelligent Service Robotics, 11(1), 61–78.
Petersen, K. B., Pedersen, M. S., et al. (2008). The matrix cookbook. Technical University of Denmark, 7(15), 510.
Pignat, E., & Calinon, S. (2017). Learning adaptive dressing assistance from human demonstration. Robotics and Autonomous Systems, 93, 61–75.
Ueda, N., & Nakano, R. (1998). Deterministic annealing EM algorithm. Neural Networks, 11(2), 271–282.
Wang, K.-J., Rizqi, D. A., & Nguyen, H.-P. (2020). Skill transfer support model based on deep learning. Journal of Intelligent Manufacturing. https://doi.org/10.1007/s10845-020-01606-w.
Wilson, A. D., & Bobick, A. F. (1999). Parametric hidden Markov models for gesture recognition. IEEE Transactions on Pattern Analysis & Machine Intelligence, 21(9), 884–900.
Zhang, J. (1999). Inferential estimation of polymer quality using bootstrap aggregated neural networks. Neural Networks, 12(6), 927–938.
Acknowledgements
This research is supported by the Key Research and Development Plan under Grant No. 2018YFB1308700, the National Natural Science Foundation of China (NSFC) under Grant No. 51535004, and the Fundamental Research Funds for the Central Universities under Grant No. 2020kfyXJJS064.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Ye, C., Yang, J. & Ding, H. Bagging for Gaussian mixture regression in robot learning from demonstration. J Intell Manuf 33, 867–879 (2022). https://doi.org/10.1007/s10845-020-01686-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10845-020-01686-8