Comber, A. orcid.org/0000-0002-3652-7846 and Tsutsumida, N. (2023) Geographically weighted accuracy for hard and soft land cover classifications: 5 approaches with coded illustrations. International Journal of Remote Sensing, 44 (19). pp. 6233-6257. ISSN 0143-1161
Abstract
This paper examines different geographically weighted (GW) approaches for calculating spatially distributed measures of accuracy / uncertainty, consolidating current approaches and proposing 2 new ones. GW frameworks use a moving window or kernel to extract and weight data subsets, from which local (ie spatially distributed) statistics or metrics are calculated. A validation dataset with hard and soft classifications is used to illustrate the approaches. It contains observed field survey data (also commonly derived from higher resolution imagery), and predicted data from a fuzzy c-means classification. The hard classes were used to estimate spatially distributed measures of overall, user's and producer's accuracies in two ways. First, by conceptualising them as probabilities to be estimated from generalised linear regression models (GLMs), extended into Geographically Weighted GLMs. Second, by constructing local GW correspondence matrices and then calculating local accuracy measures from these. The soft classes were used to calculate per-class measures of fuzzy certainty from the absolute difference between predicted and observed fuzzy memberships. Then, a novel fuzzy certainty logic is proposed and used to create fuzzy confusion matrices and per-class measures of fuzzy omission and commission error, supporting measures of fuzzy user's and producer's certainties. These were extended to the GW case to generate spatially distributed measures. Finally, the soft classifications were conceptualised as compositional data and measures of difference were estimated using Aitchison distances. In each case, the local hard and soft accuracy and certainty measures were interpolated over a 1 km grid to estimate accuracy surfaces. The context for this review is the increasing operational use of training and validation data, often with high numbers of records, containing both hard and soft classes. The data and R code used to undertake all the analyses in this paper are provided, supporting more nuanced analyses of such data.
Metadata
Item Type: | Article |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2023 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/ licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The terms on which this article has been published allow the posting of the Accepted Manuscript in a repository by the author(s) or with their consent. |
Keywords: | Classification accuracy; geographically weighted regression; logistic regression; fuzzy classification |
Dates: |
|
Institution: | The University of Leeds |
Academic Units: | The University of Leeds > Faculty of Environment (Leeds) > School of Geography (Leeds) > Centre for Spatial Analysis & Policy (Leeds) |
Depositing User: | Symplectic Publications |
Date Deposited: | 11 Oct 2023 15:41 |
Last Modified: | 27 Oct 2023 14:22 |
Published Version: | https://www.tandfonline.com/doi/full/10.1080/01431... |
Status: | Published |
Publisher: | Taylor & Francis |
Identification Number: | 10.1080/01431161.2023.2264503 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:203914 |