Newman-Griffis, D. orcid.org/0000-0002-0473-4226 and Fosler-Lussier, E. (2019) Writing habits and telltale neighbors: analyzing clinical concept usage patterns with sublanguage embeddings. In: Holderness, E., Yepes, A.J., Lavelli, A., Minard, A.-L., Pustejovsky, J. and Rinaldi, F., (eds.) Proceedings of the Tenth International Workshop on Health Text Mining and Information Analysis (LOUHI 2019). The Tenth International Workshop on Health Text Mining and Information Analysis (LOUHI 2019), 03 Nov 2019, Hong Kong, China. Association for Computational Linguistics , pp. 146-156. ISBN 9781950737772
Abstract
Natural language processing techniques are being applied to increasingly diverse types of electronic health records, and can benefit from in-depth understanding of the distinguishing characteristics of medical document types. We present a method for characterizing the usage patterns of clinical concepts among different document types, in order to capture semantic differences beyond the lexical level. By training concept embeddings on clinical documents of different types and measuring the differences in their nearest neighborhood structures, we are able to measure divergences in concept usage while correcting for noise in embedding learning. Experiments on the MIMIC-III corpus demonstrate that our approach captures clinically-relevant differences in concept usage and provides an intuitive way to explore semantic characteristics of clinical document collections.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Editors: |
|
Copyright, Publisher and Additional Information: | © 2019 Association for Computational Linguistics. Licensed on a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/). |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Social Sciences (Sheffield) > Information School (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 17 Feb 2023 11:24 |
Last Modified: | 18 Feb 2023 01:16 |
Status: | Published |
Publisher: | Association for Computational Linguistics |
Refereed: | Yes |
Identification Number: | 10.18653/v1/d19-6218 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:196488 |