Mills, O, Shackleton, N, Colbert, J et al. (3 more authors) (2022) Inter-relationships between geographical scale, socio-economic data suppression and population homogeneity. Applied Spatial Analysis and Policy, 15 (4). pp. 1075-1091. ISSN 1874-463X
Abstract
Over time, technology has greatly enhanced access to vast amounts of public data in government datasets. At the same time there has been an increase in ‘neighbourhood’ level research, in which researchers typically select an administrative unit for their analysis. As the demand for data driven insights and decision making continues to rise, researchers face a tradeoff between data suppression (to protect the privacy of citizens) and homogeneity (the similarity of individuals within an area unit for given characteristics). In this paper, we explore the extent that different scales of geography impact data suppression and spatial homogeneity using the intra-class correlation and the D-Statistic. We use age, sex, ethnicity, education and income data from the 2013 New Zealand Census to assess a) the extent to which data are suppressed, and b) the spatial homogeneity of these variables across 5 scales of ‘small area’ geography available to researchers in NZ. The data used for this paper was accessed via the Integrated Data Infrastructure (IDI), a large data repository of de-identified, linked microdata obtained from government agencies, and nationally representative surveys. The scales used in this study are the 2013 Meshblock, Statistical Area 1, Data Zone, Statistical Area 2 and Area Unit, each of which can be used to analyse patterns at the ‘neighbourhood’ scale. We found that Data Zones are a suitable choice for undertaking analyses of census data as they represent a’medium’ scale geography designed to reduce data suppression while maintaining reasonable levels of population homogeneity. The policy implications for this research relate to zone design and decisions relating to the definition of ‘a small cell count’ for data dissemination for different users of sociodemographic data.
Metadata
Item Type: | Article |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2022, The Author(s), under exclusive licence to Springer Nature B.V. This is an author produced version of an article published in Applied Spatial Analysis and Policy. Uploaded in accordance with the publisher's self-archiving policy. |
Keywords: | Data homogeneity; Data suppression; Data zones; New Zealand; Integrated data infrastructure (IDI); Intraclass correlation; MAUP effects |
Dates: |
|
Institution: | The University of Leeds |
Academic Units: | The University of Leeds > Faculty of Environment (Leeds) > School of Geography (Leeds) > Centre for Spatial Analysis & Policy (Leeds) |
Depositing User: | Symplectic Publications |
Date Deposited: | 08 Mar 2022 10:38 |
Last Modified: | 08 Feb 2023 01:13 |
Status: | Published |
Publisher: | Springer |
Identification Number: | 10.1007/s12061-021-09430-2 |
Related URLs: | |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:184287 |