Investigating Large Language Models for Complex Word Identification in Multilingual and Multidomain Setups

Smădu, Răzvan-Alexandru; Ion, David-Gabriel; Cercel, Dumitru-Clementin; Pop, Florin; Cercel, Mihaela-Claudia

arXiv:2411.01706 (cs)

[Submitted on 3 Nov 2024]

Abstract: Complex Word Identification (CWI) is an essential step in the lexical simplification task and has recently become a task on its own. Some variations of this binary classification task have emerged, such as lexical complexity prediction (LCP) and complexity evaluation of multi-word expressions (MWE). Large language models (LLMs) have recently become popular in the Natural Language Processing community because of their versatility and capability to solve unseen tasks in zero/few-shot settings. Our work investigates LLM usage, specifically open-source models such as Llama 2, Llama 3, and Vicuna v1.5, and closed-source models such as ChatGPT-3.5-turbo and GPT-4o, in the CWI, LCP, and MWE settings. We evaluate zero-shot, few-shot, and fine-tuning settings and show that LLMs either struggle under certain conditions or achieve results comparable to existing methods. In addition, we provide some views on meta-learning combined with prompt learning. We conclude that current LLMs cannot outperform, or can only barely outperform, existing methods, which are usually much smaller.
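
To make the zero-shot versus few-shot distinction concrete for the binary CWI setting, here is a minimal sketch. The prompt wording, the function names `build_prompt` and `parse_label`, and the yes/no answer format are all assumptions made for illustration; the abstract does not specify the prompts the authors used.

```python
from typing import List, Optional, Tuple

# Hypothetical instruction text; not the prompt used in the paper.
INSTRUCTION = (
    "Decide whether the target word is complex for a non-expert reader. "
    "Answer with 'yes' or 'no'."
)


def build_prompt(
    sentence: str,
    target: str,
    examples: Optional[List[Tuple[str, str, bool]]] = None,
) -> str:
    """Build a CWI prompt: zero-shot if `examples` is empty or None, few-shot otherwise."""
    parts = [INSTRUCTION]
    # Each demonstration is (sentence, target word, is_complex).
    for ex_sentence, ex_target, ex_label in examples or []:
        parts.append(
            f"Sentence: {ex_sentence}\nWord: {ex_target}\n"
            f"Answer: {'yes' if ex_label else 'no'}"
        )
    # The query is left open so the model completes the answer.
    parts.append(f"Sentence: {sentence}\nWord: {target}\nAnswer:")
    return "\n\n".join(parts)


def parse_label(reply: str) -> Optional[bool]:
    """Map a free-text model reply to a binary label; None if it cannot be parsed."""
    text = reply.strip().lower()
    if text.startswith("yes"):
        return True
    if text.startswith("no"):
        return False
    return None
```

Calling `build_prompt(sentence, target)` gives a zero-shot prompt, and passing a list of labeled demonstrations gives a few-shot one. The prefix-based parser is deliberately naive (a reply starting with "not sure" would count as "no"). An LCP variant would ask for a numeric score instead of a yes/no answer.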

Comments:	37 pages, 16 figures, Accepted by EMNLP 2024
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2411.01706 [cs.CL]
	(or arXiv:2411.01706v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2411.01706

Submission history

From: Răzvan-Alexandru Smădu
[v1] Sun, 3 Nov 2024 22:31:02 UTC (786 KB)
