KU-DMIS at MEDIQA-CORR 2024: Exploring the Reasoning Capabilities of Small Language Models in Medical Error Correction

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

Recent advancements in large language models (LM) like OpenAI’s GPT-4 have shown promise in healthcare, particularly in medical question answering and clinical applications. However, their deployment raises privacy concerns and their size limits use in resource-constrained environments. Smaller open-source LMs have emerged as alternatives, but their reliability in medicine remains under-explored. This study evaluates small LMs in the medical field using the MEDIQA-CORR 2024 task, which assesses the ability of models to identify and correct errors in clinical notes. Initially, zero-shot inference and simple fine-tuning of small models resulted in poor performance. When fine-tuning with chain-of-thought (CoT) reasoning using synthetic data generated by GPT-4, their performance significantly improved. Meerkat-7B, a small LM trained with medical CoT reasoning, demonstrated notable performance gains. Our model outperforms other small non-commercial LMs and some larger models, achieving a 73.36 aggregate score on MEDIQA-CORR 2024.

Original languageEnglish
Title of host publicationClinicalNLP 2024 - 6th Workshop on Clinical Natural Language Processing, Proceedings of the Workshop
EditorsTristan Naumann, Asma Ben Abacha, Steven Bethard, Kirk Roberts, Danielle Bitterman
PublisherAssociation for Computational Linguistics (ACL)
Pages526-536
Number of pages11
ISBN (Electronic)9798891761094
Publication statusPublished - 2024
Event6th Workshop on Clinical Natural Language Processing, ClinicalNLP 2024, held at NAACL 2024 - Mexico City, Mexico
Duration: 2024 Jun 21 → …

Publication series

NameClinicalNLP 2024 - 6th Workshop on Clinical Natural Language Processing, Proceedings of the Workshop

Conference

Conference6th Workshop on Clinical Natural Language Processing, ClinicalNLP 2024, held at NAACL 2024
Country/TerritoryMexico
CityMexico City
Period24/6/21 → …

Bibliographical note

Publisher Copyright:
© 2024 Association for Computational Linguistics.

ASJC Scopus subject areas

  • Artificial Intelligence
  • Software

Fingerprint

Dive into the research topics of 'KU-DMIS at MEDIQA-CORR 2024: Exploring the Reasoning Capabilities of Small Language Models in Medical Error Correction'. Together they form a unique fingerprint.

Cite this