Large Language Models versus Classical Machine Learning: Performance in COVID-19 Mortality Prediction Using High-Dimensional Tabular Data

Ghaffarzadeh-Esfahani, Mohammadreza; Ghaffarzadeh-Esfahani, Mahdi; Salahi-Niri, Arian; Toreyhi, Hossein; Atf, Zahra; Mohsenzadeh-Kermani, Amirali; Sarikhani, Mahshad; Tajabadi, Zohreh; Shojaeian, Fatemeh; Bagheri, Mohammad Hassan; Feyzi, Aydin; Tarighatpayma, Mohammadamin; Gazmeh, Narges; Heydari, Fateme; Afshar, Hossein; Allahgholipour, Amirreza; Alimardani, Farid; Salehi, Ameneh; Asadimanesh, Naghmeh; Khalafi, Mohammad Amin; Shabanipour, Hadis; Moradi, Ali; Zadeh, Sajjad Hossein; Yazdani, Omid; Esbati, Romina; Maleki, Moozhan; Nasr, Danial Samiei; Soheili, Amirali; Majlesi, Hossein; Shahsavan, Saba; Soheilipour, Alireza; Goudarzi, Nooshin; Taherifard, Erfan; Hatamabadi, Hamidreza; Samaan, Jamil S; Savage, Thomas; Sakhuja, Ankit; Soroush, Ali; Nadkarni, Girish; Darazam, Ilad Alavi; Pourhoseingholi, Mohamad Amin; Safavi-Naini, Seyed Amir Ahmad

Computer Science > Machine Learning

arXiv:2409.02136 (cs)

COVID-19 e-print

Important: e-prints posted on arXiv are not peer-reviewed by arXiv; they should not be relied upon without context to guide clinical practice or health-related behavior and should not be reported in news media as established information without consulting multiple experts in the field.

[Submitted on 2 Sep 2024]


Abstract: Background: This study evaluated and compared the performance of classical machine learning models (CMLs) and large language models (LLMs) in predicting COVID-19 mortality from a high-dimensional tabular dataset.
Materials and Methods: We analyzed data from 9,134 COVID-19 patients collected across four hospitals. Seven CML models, including XGBoost and random forest (RF), were trained and evaluated. The structured data were converted into text for zero-shot classification by eight LLMs, including GPT-4 and Mistral-7b. Additionally, Mistral-7b was fine-tuned using the QLoRA approach to improve its predictive performance.
Results: Among the CML models, XGBoost and RF performed best, with F1 scores of 0.87 on internal validation and 0.83 on external validation. Among the LLMs, GPT-4 was the top zero-shot performer, with an F1 score of 0.43. Fine-tuning Mistral-7b raised its recall from 1% to 79%, yielding an F1 score of 0.74 that remained stable on external validation.
Conclusion: While LLMs show moderate performance in zero-shot classification, fine-tuning can substantially improve their effectiveness, bringing them closer to CML models. However, CMLs still outperform LLMs on high-dimensional tabular data tasks.
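
Illustrative sketch (not the authors' code): the abstract states that structured records were converted into text for zero-shot classification, but does not give the template. The Python snippet below shows one plausible way such a conversion might look. The feature names, the prompt wording, and the helper names `row_to_prompt` and `parse_label` are all hypothetical and chosen only for illustration.

```python
from typing import Mapping


def row_to_prompt(row: Mapping[str, object]) -> str:
    """Serialize one tabular record into a zero-shot classification prompt.

    Hypothetical template: the paper's actual wording is not given in the abstract.
    """
    # Skip missing values so the prompt only states what was recorded.
    facts = [f"{name}: {value}" for name, value in row.items() if value is not None]
    return (
        "Patient record:\n"
        + "\n".join(f"- {fact}" for fact in facts)
        + "\n\nWill this patient die in hospital? Answer with 'yes' or 'no'."
    )


def parse_label(answer: str) -> int:
    """Map a free-text model answer to 1 (death) or 0 (survival)."""
    return 1 if answer.strip().lower().startswith("yes") else 0


# Hypothetical record; these column names are assumptions, not the study's variables.
example = {"age": 71, "sex": "male", "oxygen_saturation": 88, "diabetes": "yes", "crp": None}
prompt = row_to_prompt(example)
print(prompt)
```

The resulting string would be sent to an LLM, and the reply mapped back to a binary label with `parse_label` before computing recall or F1. Dropping missing fields is one design choice among several; stating them explicitly as "not recorded" is another.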

Comments:	Code is available at: this https URL and this https URL. The datasets are available from the corresponding author on reasonable request (sdamirsa@ymail.com)
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
MSC classes:	92C50, 68T50
ACM classes:	J.3
Cite as:	arXiv:2409.02136 [cs.LG]
	(or arXiv:2409.02136v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2409.02136

Submission history

From: Seyed Amir Ahmad Safavi-Naini [view email]
[v1] Mon, 2 Sep 2024 14:51:12 UTC (2,557 KB)
