Coelho, T. orcid.org/0009-0001-4894-0347, Ribeiro, L.S.F. orcid.org/0000-0003-1781-2630, Macedo, J. orcid.org/0000-0001-5558-3088 et al. (2 more authors) (2025) Minimizing risk through minimizing model-data interaction: A protocol for relying on proxy tasks when designing child sexual abuse imagery detection models. In: FAccT '25: Proceedings of the 2025 ACM Conference on Fairness, Accountability, and Transparency. FAccT '25: The 2025 ACM Conference on Fairness, Accountability, and Transparency, 23-26 Jun 2025, Athens, Greece. ACM , pp. 1543-1553. ISBN 9798400714825
Abstract
The distribution of child sexual abuse imagery (CSAI) is an ever-growing concern of our modern world; children who suffered from this heinous crime are revictimized, and the growing amount of illegal imagery distributed overwhelms law enforcement agents (LEAs) with the manual labor of categorization. To ease this burden researchers have explored methods for automating data triage and detection of CSAI, but the sensitive nature of the data imposes restricted access and minimal interaction between real data and learning algorithms, avoiding leaks at all costs. In observing how these restrictions have shaped the literature we formalize a definition of “Proxy Tasks”, i.e., the substitute tasks used for training models for CSAI without making use of CSA data. Under this new terminology we review current literature and present a protocol for making conscious use of Proxy Tasks together with consistent input from LEAs to design better automation in this field. Finally, we apply this protocol to study — for the first time — the task of Few-shot Indoor Scene Classification on CSAI, showing a final model that achieves promising results on a real-world CSAI dataset whilst having no weights actually trained on sensitive data.
Metadata
| Item Type: | Proceedings Paper |
|---|---|
| Authors/Creators: |
|
| Copyright, Publisher and Additional Information: | © 2025 The Authors. Except as otherwise noted, this author-accepted version of a paper published in FAccT '25: Proceedings of the 2025 ACM Conference on Fairness, Accountability, and Transparency is made available via the University of Sheffield Research Publications and Copyright Policy under the terms of the Creative Commons Attribution 4.0 International License (CC-BY 4.0), which permits unrestricted use, distribution and reproduction in any medium, provided the original work is properly cited. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ |
| Keywords: | Child Sexual Abuse Recognition; Few-shot Learning; Scene Classification |
| Dates: |
|
| Institution: | The University of Sheffield |
| Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
| Depositing User: | Symplectic Sheffield |
| Date Deposited: | 04 Jul 2025 10:53 |
| Last Modified: | 04 Jul 2025 10:53 |
| Status: | Published |
| Publisher: | ACM |
| Refereed: | Yes |
| Identification Number: | 10.1145/3715275.3732102 |
| Related URLs: | |
| Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:228740 |

CORE (COnnecting REpositories)
CORE (COnnecting REpositories)