Classifying changes to LabVIEW and simulink models via changeset metrics

Abstract

Automated classification of software changes can help to understand the reason why a change was made. Support for the classification of changes can also guide the adoption of quality control practices as bugfix trends are observed, and cluster related sets of changes for similar management of the changed artifacts, thereby reducing maintenance efforts. A number of change classification techniques have been developed based on information extracted from the change author, change message, change size, or changed file. However, most of these approaches have targeted textual general-purpose programming languages. Furthermore, some of these approaches are computationally expensive because they often require the analysis of the whole source code, while others rely on the developers’ ability to describe a commit via a well-written message. In this paper, we present an approach to classify changes to models into the appropriate maintenance type via a set of metrics that are extracted from the version history of models. We developed seven metrics related to changes applied to models and model elements. We then conducted an empirical study involving 10 classifiers to determine the classifier that offers the best performance for automating the change classification process. These classifiers were trained on over 300 changesets extracted from the version history of 28 Simulink repositories, and 60 changesets from 10 LabVIEW repositories. The results of the study show that the Random Forest classifier offers the best performance for Simulink models, while the Bayes Net offers the best performance for LabVIEW models. The Random Forest classifier has also been evaluated by comparing its results with labels extracted from the discussions within the issues reported in a similar time frame. The evaluation results show that the Random Forest classifier is able to achieve an F-1 score of 0.83, thereby showing its ability to classify changes into the appropriate categories intended by the original developers.

Metadata

Item Type:	Article
Authors/Creators:	Popoola, Saheed Zhao, Xin Gray, Jeff Garcia-Dominguez, Antonio https://orcid.org/0000-0002-4744-9150
Copyright, Publisher and Additional Information:	Publisher Copyright: © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024. This is an author-produced version of the published paper. Uploaded in accordance with the University’s Research Publications and Open Access policy.
Keywords:	Change classification,Changeset metrics,Classifier,LabVIEW,Simulink
Dates:	Accepted: 19 August 2024 Published (online): 9 September 2024
Institution:	The University of York
Academic Units:	The University of York > Faculty of Sciences (York) > Computer Science (York) The University of York > Faculty of Arts and Humanities (York) > English and Related Literature (York)
Depositing User:	Pure (York)
Date Deposited:	27 Sep 2024 08:20
Last Modified:	14 Jul 2025 23:26
Published Version:	https://doi.org/10.1007/s11334-024-00577-y
Status:	Published online
Refereed:	Yes
Identification Number:	10.1007/s11334-024-00577-y
Related URLs:	http://www.scopus.com/inward/record.url?...
Open Archives Initiative ID (OAI ID):	oai:eprints.whiterose.ac.uk:217685

Download

Accepted Version

Filename: change-classification-simulink-labview-preprint.pdf

Description: change-classification-simulink-labview-preprint

Licence: CC-BY 2.5

CLICK TO DOWNLOAD

[thumbnail of change-classification-simulink-labview-preprint]

CORE (COnnecting REpositories)

Classifying changes to LabVIEW and simulink models via changeset metrics

Abstract

Metadata

Download

Accepted Version

Export

Statistics