Risk of What? Defining Harm in the Context of AI Safety

This is a preprint and may not have undergone formal peer review

Abstract

For decades, the field of system safety has designed safe systems by reducing the risk of physical harm to humans, property and the environment to an acceptable level. Recently, this definition of safety has come under scrutiny by governments and researchers who argue that the narrow focus on reducing physical harm, whilst necessary, is not sufficient to secure the safety of AI systems. There is growing pressure to expand the scope of safety in the context of AI to address emerging harms, with particular emphasis being placed on the ways AI systems can reinforce and reproduce systemic harms. In this paper, we advocate for expanding the scope of conventional safety to include non-physical harms in the context of AI. However, we caution against broadening the scope to address systemic harms, as doing so presents intractable practical challenges for current safety methodologies. Instead, we propose that the scope of safety-related harms should be expanded to include psychological harms. Our proposal is partly motivated by the debates and evidence on social media, which fundamentally reshaped how harm is understood and addressed in the digital age, prompting new regulatory frameworks which aimed to protect users from the psychological risks of the technology. We draw on this precedent to motivate the inclusion of psychological harms in AI safety assessments. By expanding the scope of AI safety to include psychological harms, we take a critical step toward evolving the discipline of system safety into one that is better tuned and equipped to protect users against the complex and emerging harms propagated by AI systems.

Metadata

Item Type:	Preprint
Authors/Creators:	Fearnley, Laura Christina Anne (laura.fearnley@york.ac.uk); Cairns, Elly; Stoneham, Tom https://orcid.org/0000-0001-5490-4927; Ryan, Philippa Mary https://orcid.org/0000-0003-1307-5207; Chubb, Jennifer Alison https://orcid.org/0000-0002-9716-820X; Iacovides, Jo https://orcid.org/0000-0001-9674-8440; Iglesias Urrutia, Cynthia Paola https://orcid.org/0000-0002-3426-0930; Morgan, Phillip David James https://orcid.org/0000-0002-8797-4216; McDermid, John Alexander https://orcid.org/0000-0003-4745-4272; Habli, Ibrahim https://orcid.org/0000-0003-2736-8238
Dates:	Published: 2025
Institution:	The University of York
Academic Units:	The University of York > Faculty of Sciences (York) > Computer Science (York); The University of York > Faculty of Arts and Humanities (York) > Philosophy (York); The University of York > Faculty of Social Sciences (York) > Sociology (York); The University of York > Faculty of Sciences (York) > Health Sciences (York); The University of York > Faculty of Social Sciences (York) > The York Law School
Date Deposited:	17 Feb 2025 16:00
Last Modified:	03 Mar 2026 00:06
Status:	Published
Related URLs:	https://philpapers.org/archive/FEAROW.pd...
Open Archives Initiative ID (OAI ID):	oai:eprints.whiterose.ac.uk:223407

Download

Submitted Version

Filename: Risk_of_What.pdf

Description: Risk of What

