Towards Robust Continual Learning: A Multi-Head Approach with Online Prototype Equilibrium and Adaptive Prototypical Feedback

Quynh-Trang Pham Thi¹⁴,
Duc-Hung Nguyen¹⁴,
Thanh Hai Dang¹⁴,
Duc-Trong Le¹⁴,
Tri-Thanh Nguyen¹⁴ &
…
Quang-Thuy Ha¹⁴

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 14796))

Included in the following conference series:

Asian Conference on Intelligent Information and Database Systems

139 Accesses

Abstract

Continual learning is an approach in machine learning that aims to learn different tasks sequentially, still performing well on all of them. This manner is akin to human learning. However, continual learning faces a significant challenge known as catastrophic forgetting. This refers to as a decrease in the model’s performance on previously learned tasks when learning a new task. Memory-based replay method is one approach that has been proven effective in addressing this issue. After completing each task, the model stores a small amount of data to combine with new data for training an encountering task. This approach is associated with the size of the memory buffer and data security concerns. In this paper, instead of storing actual data points, we only store prototypes representing classes in learned tasks. To this end, we also employ two techniques, i.e. Online Prototype Equilibrium (OPE) and Adaptive Prototypical Feedback (APF) to enhance the quality of prototypes’ hidden representation. Furthermore, to enhance accurate classification, we do not use a single shared head for all classes. Instead, for each task, we add a within-task prediction head and a task-ID prediction head. Experimental results on benchmark datasets demonstrate that our method outperforms several state-of-the-art methods in terms of well-studied average accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 59.99; Price excludes VAT (USA)

Softcover Book: USD 74.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

McCloskey, M., Cohen, N.J.: Catastrophic interference in connectionist networks: the sequential learning problem. In: Psychology of Learning and Motivation, vol. 24, pp. 109–165. Academic Press (1989)
Google Scholar
Aljundi, R., Babiloni, F., Elhoseiny, M., Rohrbach, M., Tuytelaars, T.: Memory aware synapses: learning what (not) to forget. In: Proceedings of the European conference on computer vision (ECCV), pp. 139–154 (2018)
Google Scholar
Serra, J., Suris, D., Miron, M., Karatzoglou, A.: Overcoming catastrophic forgetting with hard attention to the task. In: International Conference on Machine Learning, pp. 4548–4557. PMLR, July 2018
Google Scholar
Mai, Z., Li, R., Jeong, J., Quispe, D., Kim, H., Sanner, S.: Online continual learning in image classification: an empirical survey. Neurocomputing 469, 28–51 (2022)
Article Google Scholar
Chrysakis, A., Moens, M.F.: Online continual learning from imbalanced data. In: International Conference on Machine Learning, pp. 1952–1961. PMLR, November 2020
Google Scholar
He, J., Mao, R., Shao, Z., Zhu, F.: Incremental learning in online scenario. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 13926–13935 (2020)
Google Scholar
Asadi, N., Davari, M., Mudur, S., Aljundi, R., Belilovsky, E.: Prototype-sample relation distillation: towards replay-free continual learning. In: International Conference on Machine Learning, pp. 1093–1106. PMLR, July 2023
Google Scholar
Chaudhry, A., Ranzato, M.A., Rohrbach, M., Elhoseiny, M.: Efficient lifelong learning with a-gem. arXiv preprint arXiv:1812.00420 (2018)
Caccia, M., et al.: Online fast adaptation and knowledge accumulation: a new approach to continual learning. arXiv preprint arXiv:2003.05856 (2020)
Wei, Y., Ye, J., Huang, Z., Zhang, J., Shan, H.: Online prototype learning for online continual learning. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 18764–18774 (2023)
Google Scholar
De Lange, M., Tuytelaars, T.: Continual prototype evolution: learning online from non-stationary data streams. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 8250–8259 (2021)
Google Scholar
Davari, M., Asadi, N., Mudur, S., Aljundi, R., Belilovsky, E.: Probing representation forgetting in supervised and unsupervised continual learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 16712–16721 (2022)
Google Scholar
Fang, Z., Wang, J., Wang, L., Zhang, L., Yang, Y., Liu, Z.: Seed: Self-supervised distillation for visual representation. arXiv preprint arXiv:2101.04731 (2021)
Hinton, G., Vinyals, O., Dean, J.: Distilling the knowledge in a neural network. arXiv preprint arXiv:1503.02531 (2015)
Guo, Y., Liu, B., Zhao, D.: Online continual learning through mutual information maximization. In: International Conference on Machine Learning, pp. 8109–8126. PMLR, June 2022
Google Scholar
Li, Z., Hoiem, D.: Learning without forgetting. IEEE Trans. Pattern Anal. Mach. Intell. 40(12), 2935–2947 (2017)
Article Google Scholar
Cha, H., Lee, J., Shin, J.: Co2l: contrastive continual learning. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 9516–9525 (2021)
Google Scholar
Kim, G., Xiao, C., Konishi, T., Liu, B.: Learnability and Algorithm for Continual Learning. arXiv preprint arXiv:2306.12646 (2023)
Kim, G., Xiao, C., Konishi, T., Ke, Z., Liu, B.: A theoretical study on solving continual learning. Adv. Neural. Inf. Process. Syst. 35, 5065–5079 (2022)
Google Scholar
Rebuffi, S.A., Kolesnikov, A., Sperl, G., Lampert, C.H.: icarl: incremental classifier and representation learning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2001–2010 (2017)
Google Scholar
Buzzega, P., Boschini, M., Porrello, A., Abati, D., Calderara, S.: Dark experience for general continual learning: a strong, simple baseline. Adv. Neural. Inf. Process. Syst. 33, 15920–15930 (2020)
Google Scholar
Zhu, F., Zhang, X.Y., Wang, C., Yin, F., Liu, C.L.: Prototype augmentation and self-supervision for incremental learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 5871–5880 (2021)
Google Scholar
Aljundi, R., Lin, M., Goujaud, B., Bengio, Y.: Gradient based sample selection for online continual learning. Advances in neural information processing systems, 32 (2019)
Google Scholar
Chaudhry, A., et al.: On tiny episodic memories in continual learning. arXiv preprint arXiv:1902.10486 (2019)
Aljundi, R., et al.: Online continual learning with maximal interfered retrieval. In: Advances in Neural Information Processing Systems, vol. 32 (2019). 1, 2, 5, 6, 7
Google Scholar
Prabhu, A., Torr, P.H.S., Dokania, P.K.: GDumb: a simple approach that questions our progress in continual learning. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12347, pp. 524–540. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58536-5_31
Chapter Google Scholar
Shim, D., Mai, Z., Jeong, J., Sanner, S., Kim, H., Jang, J.: Online class-incremental continual learning with adversarial shapley value. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, No. 11, pp. 9630–9638, May 2021
Google Scholar
Mai, Z., Li, R., Kim, H., Sanner, S.: Supervised contrastive replay: revisiting the nearest class mean classifier in online class-incremental continual learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3589–3599 (2021)
Google Scholar
Gu, Y., Yang, X., Wei, K., Deng, C.: Not just selection, but exploration: online class-incremental continual learning via dual view consistency. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7442-7451 (2022)
Google Scholar

Download references

Author information

Authors and Affiliations

VNU University of Engineering and Technology, Hanoi, Vietnam
Quynh-Trang Pham Thi, Duc-Hung Nguyen, Thanh Hai Dang, Duc-Trong Le, Tri-Thanh Nguyen & Quang-Thuy Ha

Authors

Quynh-Trang Pham Thi
View author publications
You can also search for this author in PubMed Google Scholar
Duc-Hung Nguyen
View author publications
You can also search for this author in PubMed Google Scholar
Thanh Hai Dang
View author publications
You can also search for this author in PubMed Google Scholar
Duc-Trong Le
View author publications
You can also search for this author in PubMed Google Scholar
Tri-Thanh Nguyen
View author publications
You can also search for this author in PubMed Google Scholar
Quang-Thuy Ha
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Quynh-Trang Pham Thi .

Editor information

Editors and Affiliations

Wroclaw University of Science and Technology, Wroclaw, Poland
Ngoc Thanh Nguyen
University of Pau and Adour Countries, Pau, France
Richard Chbeir
Open University of Cyprus, Latsia, Cyprus
Yannis Manolopoulos
Iwate Prefectural University, Takizawa, Japan
Hamido Fujita
National University of Kaohsiung, Kaohsiung, Taiwan
Tzung-Pei Hong
Japan Advanced Institute of Science and Technology, Nomi, Japan
Le Minh Nguyen
Wrocław University of Science and Technology, Wrocław, Poland
Krystian Wojtkiewicz

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Pham Thi, QT., Nguyen, DH., Dang, T.H., Le, DT., Nguyen, TT., Ha, QT. (2024). Towards Robust Continual Learning: A Multi-Head Approach with Online Prototype Equilibrium and Adaptive Prototypical Feedback. In: Nguyen, N.T., et al. Intelligent Information and Database Systems. ACIIDS 2024. Lecture Notes in Computer Science(), vol 14796. Springer, Singapore. https://doi.org/10.1007/978-981-97-4985-0_22

Download citation

DOI: https://doi.org/10.1007/978-981-97-4985-0_22
Published: 16 July 2024
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-4984-3
Online ISBN: 978-981-97-4985-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics