Majumdar, S. orcid.org/0000-0003-3935-4087, Bandyopadhyay, A., Das, P.P. et al. (3 more authors) (2023) Can we predict useful comments in source codes? - Analysis of findings from Information Retrieval in Software Engineering Track @ FIRE 2022. In: The 14th Annual Meeting of the Forum for Information Retrieval Evaluation, 09-13 Dec 2022, Kolkata, India.
Abstract
The Information Retrieval in Software Engineering (IRSE) track aims to develop solutions for automated evaluation of code comments in a machine learning framework. In this track, there is a binary classification task to classify comments as useful and not useful. The dataset consists of 9048 code comments and surrounding code snippet pairs extracted from open source github C based projects. Overall 34 experiments have been submitted by 11 teams from various universities and software companies. The submissions have been evaluated quantitatively using the F1-Score and qualitatively based on the type of features developed, the supervised learning model used and their corresponding hyper-parameters. The best performing architectures mostly have employed transformer architectures coupled with a software development related embedding space.
Metadata
| Item Type: | Conference or Workshop Item |
|---|---|
| Authors/Creators: |
|
| Keywords: | bert, GPT-2, Stanford POS Tagging, neural networks, abstract syntax tree |
| Dates: |
|
| Institution: | The University of Leeds |
| Academic Units: | The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Computing (Leeds) |
| Date Deposited: | 09 Feb 2026 08:56 |
| Last Modified: | 09 Feb 2026 08:56 |
| Published Version: | https://dl.acm.org/doi/10.1145/3574318.3574329 |
| Status: | Published |
| Publisher: | Association for Computing Machinery (ACM) |
| Identification Number: | 10.1145/3574318.3574329 |
| Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:237541 |

CORE (COnnecting REpositories)
CORE (COnnecting REpositories)