TY - JOUR
T1 - Exploring the role of Large Language Models in haematology
T2 - A focused review of applications, benefits and limitations
AU - Mudrik, Aya
AU - Nadkarni, Girish N.
AU - Efros, Orly
AU - Glicksberg, Benjamin S.
AU - Klang, Eyal
AU - Soffer, Shelly
N1 - Publisher Copyright:
© 2024 The Author(s). British Journal of Haematology published by British Society for Haematology and John Wiley & Sons Ltd.
PY - 2024/11/1
Y1 - 2024/11/1
N2 - Large language models (LLMs) have significantly impacted various fields with their ability to understand and generate human-like text. This study explores the potential benefits and limitations of integrating LLMs, such as ChatGPT, into haematology practices. Utilizing systematic review methodologies, we analysed studies published after 1 December 2022, from databases like PubMed, Web of Science and Scopus, and assessing each for bias with the QUADAS-2 tool. We reviewed 10 studies that applied LLMs in various haematology contexts. These models demonstrated proficiency in specific tasks, such as achieving 76% diagnostic accuracy for haemoglobinopathies. However, the research highlighted inconsistencies in performance and reference accuracy, indicating variability in reliability across different uses. Additionally, the limited scope of these studies and constraints on datasets could potentially limit the generalizability of our findings. The findings suggest that, while LLMs provide notable advantages in enhancing diagnostic processes and educational resources within haematology, their integration into clinical practice requires careful consideration. Before implementing them in haematology, rigorous testing and specific adaptation are essential. This involves validating their accuracy and reliability across different scenarios. Given the field's complexity, it is also critical to continuously monitor these models and adapt them responsively.
AB - Large language models (LLMs) have significantly impacted various fields with their ability to understand and generate human-like text. This study explores the potential benefits and limitations of integrating LLMs, such as ChatGPT, into haematology practices. Utilizing systematic review methodologies, we analysed studies published after 1 December 2022, from databases like PubMed, Web of Science and Scopus, and assessing each for bias with the QUADAS-2 tool. We reviewed 10 studies that applied LLMs in various haematology contexts. These models demonstrated proficiency in specific tasks, such as achieving 76% diagnostic accuracy for haemoglobinopathies. However, the research highlighted inconsistencies in performance and reference accuracy, indicating variability in reliability across different uses. Additionally, the limited scope of these studies and constraints on datasets could potentially limit the generalizability of our findings. The findings suggest that, while LLMs provide notable advantages in enhancing diagnostic processes and educational resources within haematology, their integration into clinical practice requires careful consideration. Before implementing them in haematology, rigorous testing and specific adaptation are essential. This involves validating their accuracy and reliability across different scenarios. Given the field's complexity, it is also critical to continuously monitor these models and adapt them responsively.
KW - ChatGPT
KW - Google Bard
KW - Large Language Models
KW - Microsoft Bing
KW - PaLM
KW - haematology
UR - https://www.scopus.com/pages/publications/85203013943
U2 - 10.1111/bjh.19738
DO - 10.1111/bjh.19738
M3 - Review article
C2 - 39226157
AN - SCOPUS:85203013943
SN - 0007-1048
VL - 205
SP - 1685
EP - 1698
JO - British Journal of Haematology
JF - British Journal of Haematology
IS - 5
ER -