Can LLMs Learn New Concepts Incrementally without Forgetting?

Zheng, Junhao; Qiu, Shengjie; Ma, Qianli

Computer Science > Machine Learning

arXiv:2402.08526 (cs)

[Submitted on 13 Feb 2024 (v1), last revised 18 Jun 2024 (this version, v3)]

Title:Can LLMs Learn New Concepts Incrementally without Forgetting?

Authors:Junhao Zheng, Shengjie Qiu, Qianli Ma

View PDF HTML (experimental)

Abstract:Large Language Models (LLMs) have achieved remarkable success across various tasks, yet their ability to learn incrementally without forgetting remains underexplored. Incremental learning (IL) is crucial as it enables models to acquire new knowledge while retaining previously learned information, akin to human learning. Existing benchmarks for IL are insufficient due to data leakage issues and the overqualification of LLMs. To address these challenges, we introduce Concept-1K, a novel dataset comprising 1,023 recently emerged concepts across diverse domains. The concepts in Concept-1K are discrete, interpretable units of knowledge that allow for fine-grained analysis of learning and forgetting processes. Using Concept-1K as a testbed, we aim to answer the question: ``Can LLMs learn new concepts incrementally without forgetting like humans?'' Our investigation reveals that LLMs still suffer from catastrophic forgetting and that LoRA, despite fine-tuning fewer parameters, may lead to more forgetting on training data. Additionally, we explore the roles of in-context learning, model scale, buffer size, and pretraining in IL performance. These findings highlight the strengths and limitations of LLMs in IL scenarios and provide a robust benchmark for future research.

Comments:	28 pages
Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL)
Cite as:	arXiv:2402.08526 [cs.LG]
	(or arXiv:2402.08526v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2402.08526

Submission history

From: Junhao Zheng [view email]
[v1] Tue, 13 Feb 2024 15:29:50 UTC (5,072 KB)
[v2] Tue, 21 May 2024 08:29:44 UTC (5,072 KB)
[v3] Tue, 18 Jun 2024 06:56:44 UTC (5,667 KB)

Computer Science > Machine Learning

Title:Can LLMs Learn New Concepts Incrementally without Forgetting?

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Computer Science > Machine Learning

Title:Can LLMs Learn New Concepts Incrementally without Forgetting?

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.