This is the latest version of this eprint.
Majumdar, S. orcid.org/0000-0003-3935-4087, Paul, S., Dave, B. et al. (6 more authors) (2023) Generative AI for Software Metadata: Overview of the Information Retrieval in Software Engineering Track at FIRE 2023. In: Ghosh, K., Mandl, T., Majumder, P. and Mitra, M., (eds.) Working Notes of FIRE 2023 - Forum for Information Retrieval Evaluation (FIRE-WN 2023). FIRE 2023 - Forum for Information Retrieval Evaluation (FIRE-WN 2023), 15-18 Dec 2023, Goa, India. CEUR Workshop Proceedings, Aachen, Germany, pp. 598-604. ISSN: 1613-0073.
Abstract
The Information Retrieval in Software Engineering (IRSE) track aims to develop solutions for automated evaluation of code comments in a machine learning framework based on human and large language model generated labels. In this track, there is a binary classification task to classify comments as useful and not useful. The dataset consists of 9048 code comments and surrounding code snippet pairs extracted from open source github C based projects and an additional dataset generated individually by teams using large language models. Overall 56 experiments have been submitted by 17 teams from various universities and software companies. The submissions have been evaluated quantitatively using the F1-Score and qualitatively based on the type of features developed, the supervised learning model used and their corresponding hyper-parameters. The labels generated from large language models increase the bias in the prediction model but lead to less over-fitted results.
Metadata
| Item Type: | Proceedings Paper |
|---|---|
| Authors/Creators: |
|
| Editors: |
|
| Copyright, Publisher and Additional Information: | © 2023 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). |
| Keywords: | bert, GPT-2, Stanford POS Tagging, neural networks, abstract syntax tree |
| Dates: |
|
| Institution: | The University of Leeds |
| Academic Units: | The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Computing (Leeds) |
| Date Deposited: | 06 Feb 2026 16:08 |
| Last Modified: | 06 Feb 2026 16:11 |
| Published Version: | https://ceur-ws.org/Vol-3681/ |
| Status: | Published |
| Publisher: | CEUR Workshop Proceedings |
| Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:237540 |
Available Versions of this Item
-
Generative AI for Software Metadata: Overview of the Information Retrieval in Software Engineering Track at FIRE 2023. (deposited 06 Feb 2026 11:54)
- Generative AI for Software Metadata: Overview of the Information Retrieval in Software Engineering Track at FIRE 2023. (deposited 06 Feb 2026 16:08) [Currently Displayed]
Download
Filename: Generative AI for Software Metadata Overview.pdf
Licence: CC-BY 4.0

CORE (COnnecting REpositories)
CORE (COnnecting REpositories)