Jiang, X. orcid.org/0000-0003-4255-5445, Cai, C. orcid.org/0000-0003-3322-4273, Fan, W. orcid.org/0000-0002-6592-9561 et al. (2 more authors) (2022) Contextualised modelling for effective citation function classification. In: Proceedings of the 2022 6th International Conference on Natural Language Processing and Information Retrieval. NLPIR 2022: 2022 6th International Conference on Natural Language Processing and Information Retrieval, 17-19 Dec 2022, Bangkok, Thailand. Association for Computing Machinery , pp. 93-103.
Abstract
Citation function classification is an important task in scientific text mining. The past two decades have witnessed many computerised algorithms working on various citation function datasets tailored to various annotation schemes. Recently, deep learning has pushed the state of the art by a large margin. Several pitfalls exist. Due to annotation difficulty, data sizes, especially the minority classes, are often not big enough for training effective deep learning models. Being less discussed, most state-of-the-art deep learning solutions in fact generate a feature representation for the citation sentence or context, instead of modelling individual in-text citations. This is conceptually flawed as it is common to see multiple in-text citations with different functions in the same citation sentence. In addition, existing deep learning studies have only explored a rather limited design space of encoding citation and its surrounding context. This paper explored a wide range of modelling options based on SciBERT, the popular cross-disciplinary pre-trained scientific language model, and their performances on citation function classification, for the purpose of determining the most effective way of modeling citation and its context. To deal with the data size issue, we created a large-scale citation function dataset by mapping, merging and re-annotating six publicly available datasets from the computational linguistics domain by adapting Teufel et al.'s 12-class scheme. The best F1 scores we achieved were around 66.16%, 71.39% and 73.56% on a 11-class annotation scheme slightly adapted from Teufel et al.'s 12-class scheme, a reduced 7-class scheme by merging comparison functions, and Jurgens et al.'s 6-class scheme respectively. A useful observation is that there is no single best model that is superior for all functions, therefore the trained model variants allow for applications which emphasise on a specific type of or a specific group of citation functions.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: | |
Copyright, Publisher and Additional Information: | © 2022 ACM. This is an author-produced version of a paper subsequently published in NLPIR '22. Uploaded in accordance with the publisher's self-archiving policy. |
Keywords: | Citation function classification; SciBERT; citation context analysis; citation intent identification; deep learning |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Social Sciences (Sheffield) > Information School (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 17 Jun 2025 13:54 |
Last Modified: | 17 Jun 2025 13:55 |
Status: | Published |
Publisher: | Association for Computing Machinery |
Refereed: | Yes |
Identification Number: | 10.1145/3582768.3582769 |
Related URLs: | |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:227918 |