Liu, L. orcid.org/0000-0002-9236-3380, Bogachev, L. orcid.org/0000-0002-2365-2621, Rezaei, M. orcid.org/0000-0003-3892-421X et al. (4 more authors) (2025) OM4AnI: A Novel Overlap Measure for Anomaly Identification in Multi-Class Scatterplots. IEEE Transactions on Visualization and Computer Graphics. ISSN: 1077-2626
Abstract
Scatterplots are widely used across various domains to identify anomalies in datasets, particularly in multi-class settings, such as detecting misclassified or mislabeled data. However, scatterplot effectiveness often declines with large datasets due to limited display resolution. This paper introduces a novel Visual Quality Measure (VQM) - OM4AnI (Overlap Measure for Anomaly Identification) - which quantifies the degree of overlap for identifying anomalies, helping users estimate how effectively anomalies can be observed in multi-class scatterplots. OM4AnI begins by computing anomaly index based on each data point's position relative to its class cluster. The scatterplot is then discretized into a matrix representation by binning the display space into cell-level (pixel-level) grids and computing the coverage for each pixel. It takes into account the anomaly index of data points covering these pixels and visual features (marker shapes, marker sizes, and rendering orders). Building on this foundation, we sum all the coverage information in each cell (pixel) of matrix representation to obtain the final quality score with respect to anomaly identification. We conducted an evaluation to analyze the efficiency, effectiveness, sensitivity of OM4AnI in comparison with six representative baseline methods that are based on different computation granularity levels: data level, marker level, and pixel level. The results show that OM4AnI outperforms baseline methods by exhibiting more monotonic trends against the ground truth and greater sensitivity to rendering order, unlike the baseline methods. It confirms that OM4AnI can inform users about how effectively their scatterplots support anomaly identification. Overall, OM4AnI shows strong potential as an evaluation metric and for optimizing scatterplots through automatic adjustment of visual parameters.
Metadata
| Item Type: | Article |
|---|---|
| Authors/Creators: |
|
| Copyright, Publisher and Additional Information: | This is an author produced version of an article published in IEEE Transactions on Visualization and Computer Graphics, made available via the University of Leeds Research Outputs Policy under the terms of the Creative Commons Attribution License (CC-BY), which permits unrestricted use, distribution and reproduction in any medium, provided the original work is properly cited. |
| Keywords: | Anomaly identification, Visual quality measure, Multi-class scatter plot, Explainable AI |
| Dates: |
|
| Institution: | The University of Leeds |
| Academic Units: | The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Computing (Leeds) The University of Leeds > Faculty of Environment (Leeds) > Institute for Transport Studies (Leeds) The University of Leeds > Faculty of Arts, Humanities and Cultures (Leeds) > School of Design (Leeds) The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Mathematics (Leeds) |
| Funding Information: | Funder Grant number EPSRC (Engineering and Physical Sciences Research Council) EP/X029689/1 |
| Date Deposited: | 19 Dec 2025 12:27 |
| Last Modified: | 19 Dec 2025 21:16 |
| Status: | Published online |
| Publisher: | Institute of Electrical and Electronics Engineers (IEEE) |
| Identification Number: | 10.1109/tvcg.2025.3642219 |
| Related URLs: | |
| Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:235647 |
Download
Filename: _TVCG_R1_VEBreakDown.pdf
Licence: CC-BY 4.0

CORE (COnnecting REpositories)
CORE (COnnecting REpositories)