Aslam, A. orcid.org/0000-0002-2654-4255, Walker, L., Abaho, M. et al. (16 more authors) (Submitted: 2024) An automation framework for clinical codelist development validated with UK data from patients with multiple long-term conditions. [Preprint - medRxiv] (Submitted)
Abstract
Background: Codelists play a crucial role in ensuring accurate and standardized communication within healthcare. However, preparation of high-quality codelists is a rigorous and time-consuming process. The literature focuses on transparency of clinical codelists and overlooks the utility of automation.
Method and Automated Framework Design: Here we present a Codelist Generation Framework that can automate generation of codelists with minimal input from clinical experts. We demonstrate the process using a specific project, DynAIRx, producing appropriate codelists and a framework allowing 1future projects to take advantage of automated codelist generation. Both the framework and codelist are publicly available.
Use-case: DynAIRx DynAIRx is an NIHR-funded project aiming to develop AIs to help optimise prescribing of medicines in patients with multiple long-term conditions. DynAIRx requires complex codelists to describe the trajectory of each patient, and the interaction between their conditions. We promptly generated ≈200 codelists for DynAIRx using the proposed framework and validated them with a panel of experts, significantly reducing the amount of time required by making effective use of automation.
Findings and Conclusion: The framework reduced the clinician time required to validate codes, automatically shrunk codelists using trusted sources and added new codes for review against existing codelists. In the DynAIRx case study, a codelist of ≈9600 codes required only 7-9 hours of clinician’s time in the end (while existing methods takes months), and application of the automation framework reduced the workload by >80%.
Competing Interest Statement The authors have declared no competing interest.
Funding Statement DynAIRx has been funded by the National Institute for Health and Care Research (NIHR) Artificial Intelligence for Multiple Long-Term Conditions (AIM) call (NIHR 203986). MG is partly funded by the NIHR Applied Research Collaboration North West Coast (ARC NWC). This research is supported by the NIHR ARC NWC. The views expressed in this publication are those of the author(s) and not necessarily those of the NIHR or the Department of Health and Social Care. IB is funded by an NIHR Senior Investigator award (NIHR205131).
Metadata
| Item Type: | Preprint |
|---|---|
| Authors/Creators: |
|
| Copyright, Publisher and Additional Information: | © 2025 The Author(s). This preprint is made available under a Creative Commons Attribution 4.0 International License. (https://creativecommons.org/licenses/by/4.0/) |
| Keywords: | Health Services and Systems; Biomedical and Clinical Sciences; Health Sciences; Management and decision making |
| Dates: |
|
| Institution: | The University of Sheffield |
| Academic Units: | ?? Sheffield.IJC ?? The University of Sheffield > Faculty of Medicine, Dentistry and Health (Sheffield) |
| Date Deposited: | 20 Nov 2025 15:49 |
| Last Modified: | 20 Nov 2025 15:49 |
| Status: | Submitted |
| Publisher: | Cold Spring Harbor Laboratory |
| Identification Number: | 10.1101/2024.09.25.24314215 |
| Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:234715 |

CORE (COnnecting REpositories)
CORE (COnnecting REpositories)