Experience Adapter: Adapting Pre-trained Language Models for Continual Task Planning

Jiatao Zhang^15,16,
Jianfeng Liao^16,
Tuocheng Hu^17,
Tian Zhou^15,
Haofu Qian^15,16,
Haoyang Zhang^15,16,
Han Li^15,16,
LanLing Tang^16,18,
Qiwei Meng^16,
Wei Song^15,16 &
Shiqiang Zhu^15,16

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 14271)

Included in the following conference series:

International Conference on Intelligent Robotics and Applications

Abstract

In this paper, we investigate the challenge of adapting Pre-trained Language Models (PLMs) to continual task planning. A PLM-based planner struggles to incorporate incremental experience without risking catastrophic forgetting or overwhelming the model parameters. Inspired by human cognition, we propose the Experience Adapter, a novel method that avoids the need for model re-training or fine-tuning. The adapter continually collects experience outside the model, namely observation memory and human feedback, which it represents as a memory graph and a set of rules. Using these, the adapter directs task planning and corrects behavior that does not align with human expectations. Because our method does not rely on the planner's internal structure, it pairs easily with various foundational planning methods. In experiments on everyday tasks within the VirtualHome environment, our approach improves the task success rate from 47% to 64%, a gain of 17 percentage points. This non-invasive method fits within existing model-serving pipelines without altering model training.
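
The idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the class `ExperienceAdapter`, the `Rule` type, the triple-based memory graph, the string-matching correction step, and the `toy_planner` stand-in for a PLM are all hypothetical names and simplifications chosen for illustration. The sketch only shows the general pattern of keeping experience outside a frozen planner, passing remembered observations in as context, and correcting the returned plan with feedback rules.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Set, Tuple

Plan = List[str]
# Assumed planner interface: (task, context) -> list of plan steps.
Planner = Callable[[str, str], Plan]


@dataclass
class Rule:
    """Hypothetical feedback rule: swap an unwanted step for a preferred one."""
    forbidden: str
    replacement: str


@dataclass
class ExperienceAdapter:
    """Wraps a frozen planner; all experience lives outside the model."""
    planner: Planner
    # Memory graph as adjacency sets: object -> {(relation, location)}.
    memory: Dict[str, Set[Tuple[str, str]]] = field(default_factory=dict)
    rules: List[Rule] = field(default_factory=list)

    def observe(self, obj: str, relation: str, location: str) -> None:
        """Record an observation as an edge in the memory graph."""
        self.memory.setdefault(obj, set()).add((relation, location))

    def add_feedback(self, forbidden: str, replacement: str) -> None:
        """Store human feedback as a correction rule."""
        self.rules.append(Rule(forbidden, replacement))

    def context_for(self, task: str) -> str:
        """Collect remembered facts about objects mentioned in the task."""
        facts = []
        for obj in sorted(self.memory):
            if obj in task:
                for rel, loc in sorted(self.memory[obj]):
                    facts.append(f"{obj} {rel} {loc}")
        return "; ".join(facts)

    def plan(self, task: str) -> Plan:
        """Query the planner with memory context, then apply the rules."""
        raw = self.planner(task, self.context_for(task))
        corrected = []
        for step in raw:
            for rule in self.rules:
                if step == rule.forbidden:
                    step = rule.replacement
            corrected.append(step)
        return corrected


def toy_planner(task: str, context: str) -> Plan:
    """Stand-in for a PLM-based planner; its 'weights' are never changed."""
    target = task.split()[-1]
    place = context.split(";")[0].split()[-1] if context else "table"
    return [f"walk to {place}", f"grab {target}", f"put {target} on sofa"]


adapter = ExperienceAdapter(toy_planner)
before = adapter.plan("fetch milk")
adapter.observe("milk", "inside", "fridge")
adapter.add_feedback("put milk on sofa", "put milk on table")
after = adapter.plan("fetch milk")
print(before)
print(after)
```

In this sketch the planner function is never modified; only the adapter's memory and rule list grow over time, which mirrors the non-invasive property the abstract claims.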

J. Zhang and J. Liao contributed equally to this work.


Acknowledgement

This research was supported by the Key Research Project of Zhejiang Lab (Grant No. G2021NB0AL03) and the National Natural Science Foundation of China (Grant No. U21A20488).

Author information

Authors and Affiliations

Zhejiang University, Hangzhou, China
Jiatao Zhang, Tian Zhou, Haofu Qian, Haoyang Zhang, Han Li, Wei Song & Shiqiang Zhu
Research Center for Intelligent Robotics, Zhejiang Lab, Hangzhou, China
Jiatao Zhang, Jianfeng Liao, Haofu Qian, Haoyang Zhang, Han Li, LanLing Tang, Qiwei Meng, Wei Song & Shiqiang Zhu
University of Electronic Science and Technology of China, Chengdu, China
Tuocheng Hu
Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, China
LanLing Tang

Authors

Jiatao Zhang
Jianfeng Liao
Tuocheng Hu
Tian Zhou
Haofu Qian
Haoyang Zhang
Han Li
LanLing Tang
Qiwei Meng
Wei Song
Shiqiang Zhu

Corresponding authors

Correspondence to Wei Song or Shiqiang Zhu.

Editor information

Editors and Affiliations

Zhejiang University, Hangzhou, China
Huayong Yang
Harbin Institute of Technology, Shenzhen, China
Honghai Liu
Zhejiang University, Hangzhou, China
Jun Zou
Huazhong University of Science and Technology, Wuhan, China
Zhouping Yin
Shenyang Institute of Automation, Shenyang, Liaoning, China
Lianqing Liu
Zhejiang University, Hangzhou, China
Geng Yang
Zhejiang University, Hangzhou, China
Xiaoping Ouyang
Harbin Institute of Technology, Shenzhen, China
Zhiyong Wang

About this paper

Cite this paper

Zhang, J. et al. (2023). Experience Adapter: Adapting Pre-trained Language Models for Continual Task Planning. In: Yang, H., et al. (eds.) Intelligent Robotics and Applications. ICIRA 2023. Lecture Notes in Computer Science, vol 14271. Springer, Singapore. https://doi.org/10.1007/978-981-99-6495-6_33

DOI: https://doi.org/10.1007/978-981-99-6495-6_33
Published: 16 October 2023
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-6494-9
Online ISBN: 978-981-99-6495-6
eBook Packages: Computer Science, Computer Science (R0)