Nothing Special   »   [go: up one dir, main page]

×
Please click here if you are not redirected within a few seconds.
Past year
  • Any time
  • Past hour
  • Past 24 hours
  • Past week
  • Past month
  • Past year
All results
Oct 8, 2024 · Ando, “Each Encounter Counts: Modeling Language Learning and Forgetting.” Proceedings of the 13th International Learning Analytics and Knowledge Conference ...
Jul 1, 2024 · We reduce this cost by equipping LLMs with explicit memory, a memory format cheaper than model parameters and text retrieval-augmented generation (RAG).
Nov 23, 2023 · This article is a beginner's guide to Language Modeling and covers how to use pre-trained models for natural language processing tasks using HuggingFace ...
Aug 11, 2024 · In this paper, we aim to overcome the forgetting problem in ToDs and propose a method (HESIT) with hyper-gradient-based exemplar strategy, which samples ...
Oct 23, 2024 · We systematically explored the existence and measurement of forgetting in pre-training, questioning traditional metrics such as perplexity (PPL) and ...
Dec 27, 2023 · This research investigates the cognitive mechanisms underlying the rise and fall of English word forms using two complementary research paradigms.
Nov 8, 2024 · Despite their popularity in non-English NLP, multilingual language models often underper- form monolingual ones due to inter-language.
Mar 15, 2024 · We also leave studies of non-English and multilingual language models to future surveys that can better focus on the many nuances of cross-lingual comparisons.
Mar 31, 2024 · Continual Learning (CL) enables machine learning mod- els to learn from continuously shifting new training data in absence of data from old tasks.