Lam, T. orcid.org/0000-0002-4902-7293, Huang, C. orcid.org/0000-0002-9218-987X, Torney, A. orcid.org/0000-0003-0373-0482 et al. (12 more authors) (2026) Leveraging web-scraped data to examine alcohol pricing: an Australian feasibility study with retail data. International Journal of Drug Policy, 148. 105115. ISSN: 0955-3959
Abstract
Introduction
Although price is critical in determining alcohol purchase and subsequent harms, researchers rarely have access to comprehensive alcohol price data. Web scraping is an advanced data collection technique that uses automated computer scripts to efficiently gather extensive website data. The aims of this paper are to demonstrate web scraping’s capacity to generate alcohol policy relevant data, and to assess the method’s consistency by comparing datasets collected by a commercial provider with those produced by a university-developed scraper.
Methods
Price and product data from the entire online catalogues of major retailers representing the majority of the Australian market were scraped daily by the commercial provider since 2020, with data collected from all jurisdictions, and products sold by multiple retailers matched. A university-developed web scraper collected a single-day’s catalogue data from the country’s largest alcohol retailer to compare with the commercial dataset as a reliability cross-check.
Results
Of the 16,409 products identified in both the commercial and university databases, there was an excellent match on the product prices (intraclass correlation coefficient=0.997 [95 %CI: 0.9972–0.9973]). A visualisation from the three largest Australian retailers demonstrated how daily prices varied over a 12-month period, for example with more frequent price changes for Australia’s largest retailer compared to the second and third, and across jurisdictions, such as some deeper discounting in Victoria.
Discussion
This study presented an independently cross-checked large-scale and longitudinal web scraping approach to collect alcohol price data, and demonstrated that the adapted data could aid understanding of the alcohol retail market. Web scraping is a feasible method to collect price data to support the development of evidence-based alcohol price policy.
Metadata
| Item Type: | Article |
|---|---|
| Authors/Creators: |
|
| Copyright, Publisher and Additional Information: | © 2025 The Author(s). This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by- nc-nd/4.0/). |
| Keywords: | Alcohol; Alcohol policy; Alcohol price; Health economics; Online surveillance; Pricing; Public health policy |
| Dates: |
|
| Institution: | The University of Sheffield |
| Academic Units: | The University of Sheffield > Faculty of Medicine, Dentistry and Health (Sheffield) > School of Medicine and Population Health |
| Date Deposited: | 20 Jan 2026 11:03 |
| Last Modified: | 20 Jan 2026 11:03 |
| Status: | Published |
| Publisher: | Elsevier BV |
| Refereed: | Yes |
| Identification Number: | 10.1016/j.drugpo.2025.105115 |
| Related URLs: | |
| Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:236727 |
Download
Filename: 1-s2.0-S0955395925004116-main.pdf
Licence: CC-BY-NC-ND 4.0

CORE (COnnecting REpositories)
CORE (COnnecting REpositories)