Using GPT-4 to Generate Failure Logic

Abstract

Large Language Models' (LLMs) ability to explain complex texts has raised the question of whether their encoded knowledge is sufficient to reason about system failures. The current weaknesses in LLMs, like misalignment and hallucinations suggest they make unsuitable safety analysts, but could ``fast but flawed'' analysis still be useful? LLMs can rapidly parse system descriptions for design mitigation strategies like redundancy, they can trace failure propagation from common mode faults (such as loss of power or hydraulics) to higher level events and even incorporate non-functional risks from outside the functional specification into an analysis. But despite their knowledge of hardware component failure modes, we found LLMs remain weak at failure logic reasoning. We used OpenAI's Generative Pre-trained Transformer (GPT) Builder to develop a specific role for analysing failure logic and generating the corresponding fault tree visualisation. Although there are no objective measures that qualitatively assess failure logic analysis (i.e.\/ logical errors have variable significance) or whether the choice of higher-level failure modes is a ``good model'' of system failure, we report on the iterative process of developing the GPT, our inability to override the underlying model behaviour to counter its weaknesses, and conclude by reflecting on the productivity gains of using LLMs despite their flawed reasoning.

Metadata

Item Type:	Conference or Workshop Item
Authors/Creators:	Clegg, Kester Dean https://orcid.org/0000-0002-4484-3291 McDermid, John Alexander https://orcid.org/0000-0003-4745-4272 Habli, Ibrahim https://orcid.org/0000-0003-2736-8238
Keywords:	Large Language Models · Fault Tree Analysis · Failure Logic
Dates:	Published: 16 July 2024
Institution:	The University of York
Academic Units:	The University of York > Faculty of Sciences (York) > Computer Science (York)
Depositing User:	Pure (York)
Date Deposited:	24 Jul 2024 13:30
Last Modified:	17 Jul 2025 05:11
Status:	Published
Refereed:	Yes
Open Archives Initiative ID (OAI ID):	oai:eprints.whiterose.ac.uk:215140

Download

SAFECOMP_SASSUR_2024

Filename: SAFECOMP_SASSUR_2024.pdf

Description: SAFECOMP_SASSUR_2024

CLICK TO DOWNLOAD

CORE (COnnecting REpositories)

Using GPT-4 to Generate Failure Logic

Abstract

Metadata

Download

SAFECOMP_SASSUR_2024

Export

Statistics