Distilling Causal Effect from Miscellaneous Other-Class for Continual Named Entity Recognition

Zheng, Junhao; Liang, Zhanxian; Chen, Haibin; Ma, Qianli

arXiv:2210.03980 (cs)

[Submitted on 8 Oct 2022]

Abstract: Continual Learning for Named Entity Recognition (CL-NER) aims to learn a growing number of entity types over time from a stream of data. However, simply learning Other-Class in the same way as new entity types amplifies catastrophic forgetting and leads to a substantial performance drop. The main cause is that Other-Class samples usually contain old entity types, and the old knowledge in these samples is not properly preserved. Using causal inference, we identify that the forgetting is caused by the missing causal effect from the old data. To this end, we propose a unified causal framework to retrieve the causality from both new entity types and Other-Class. Furthermore, we apply curriculum learning to mitigate the impact of label noise and introduce a self-adaptive weight for balancing the causal effects between new entity types and Other-Class. Experimental results on three benchmark datasets show that our method outperforms the state-of-the-art method by a large margin. Moreover, our method can be combined with existing state-of-the-art methods to improve performance in CL-NER.
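
The observation that Other-Class samples usually contain old entity types can be illustrated with a minimal sketch. It shows only the problem setting, not the proposed method. It assumes a setup in which each learning step annotates only the entity types introduced at that step; the sentence, the BIO tags, the entity types, and the helper names `mask_to_current_types` and `hidden_old_entities` are all hypothetical.

```python
from typing import List, Set


def mask_to_current_types(gold_tags: List[str], current_types: Set[str]) -> List[str]:
    """Keep BIO tags whose entity type is annotated at this step; map the rest to 'O'.

    Assumption: at each step only the newly introduced types are labeled.
    """
    masked = []
    for tag in gold_tags:
        if tag == "O":
            masked.append("O")
            continue
        _, entity_type = tag.split("-", 1)
        masked.append(tag if entity_type in current_types else "O")
    return masked


def hidden_old_entities(gold_tags: List[str], step_tags: List[str],
                        old_types: Set[str]) -> List[int]:
    """Return token positions labeled 'O' at this step that are really old entity types."""
    positions = []
    for i, (gold, seen) in enumerate(zip(gold_tags, step_tags)):
        if seen == "O" and gold != "O" and gold.split("-", 1)[1] in old_types:
            positions.append(i)
    return positions


# Hypothetical sentence with full (never jointly observed) gold annotation.
tokens = ["Alice", "visited", "Paris", "in", "May"]
gold_tags = ["B-PER", "O", "B-LOC", "O", "B-DATE"]

# Step 1 introduces PER; step 2 introduces LOC.
step1_tags = mask_to_current_types(gold_tags, {"PER"})
step2_tags = mask_to_current_types(gold_tags, {"LOC"})

# At step 2 the old PER entity "Alice" is hidden inside Other-Class.
old_in_other = hidden_old_entities(gold_tags, step2_tags, old_types={"PER"})
print(step2_tags)
print([tokens[i] for i in old_in_other])
```

In this toy case, training on the step-2 labels as given would push "Alice" toward Other-Class, which is the kind of forgetting pressure the abstract describes.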

Comments:	Accepted by EMNLP2022
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2210.03980 [cs.CL]
	(or arXiv:2210.03980v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2210.03980

Submission history

From: Junhao Zheng
[v1] Sat, 8 Oct 2022 09:37:06 UTC (1,052 KB)
