Abstract
Accurate medical symptom coding from unstructured clinical text, such as vaccine safety reports, is a critical task with applications in pharmacovigilance and safety monitoring. Symptom coding, as tailored in this study, involves identifying and linking nuanced symptom mentions to standardized vocabularies like MedDRA, differentiating it from broader medical coding tasks. Traditional approaches to this task, which treat symptom extraction and linking as independent workflows, often fail to handle the variability and complexity of clinical narratives, especially for rare cases. Recent advancements in Large Language Models (LLMs) offer new opportunities but face challenges in achieving consistent performance. To address these issues, we propose Task as Context (TACO) Prompting, a novel framework that unifies extraction and linking tasks by embedding task-specific context into LLM prompts. Our study also introduces SYMPCODER, a human-annotated dataset derived from Vaccine Adverse Event Reporting System (VAERS) reports, and a two-stage evaluation framework to comprehensively assess both symptom linking and mention fidelity. Our comprehensive evaluation of multiple LLMs, including Llama2-chat, Jackalope-7b, GPT-3.5 Turbo, GPT-4 Turbo, and GPT-4o, demonstrates TACO's effectiveness in improving flexibility and accuracy for tailored tasks like symptom coding, paving the way for more specific coding tasks and advancing clinical text processing methodologies.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2025 IEEE/ACM International Conference on Connected Health |
| Subtitle of host publication | Applications, Systems and Engineering Technologies, CHASE 2025 |
| Pages | 176-186 |
| Number of pages | 11 |
| ISBN (Electronic) | 9798400715396 |
| DOIs | |
| State | Published - 2025 |
| Event | 10th IEEE/ACM International Conference on Connected Health: Applications, Systems and Engineering Technologies, CHASE 2025 - Manhattan, United States Duration: 24 Jun 2025 → 26 Jun 2025 |
Publication series
| Name | Proceedings - 2025 IEEE/ACM International Conference on Connected Health: Applications, Systems and Engineering Technologies, CHASE 2025 |
|---|
Conference
| Conference | 10th IEEE/ACM International Conference on Connected Health: Applications, Systems and Engineering Technologies, CHASE 2025 |
|---|---|
| Country/Territory | United States |
| City | Manhattan |
| Period | 24/06/25 → 26/06/25 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 3 Good Health and Well-being
Keywords
- Medical coding
- chain-of-thought prompting
- large language models
- task as context
Fingerprint
Dive into the research topics of 'Task as Context Prompting for Accurate Medical Symptom Coding Using Large Language Models'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver