Clegg, Kester Dean orcid.org/0000-0002-4484-3291, McDermid, John Alexander orcid.org/0000-0003-4745-4272 and Habli, Ibrahim orcid.org/0000-0003-2736-8238 (2024) Using GPT-4 to Generate Failure Logic. In: UNSPECIFIED.
Abstract
Large Language Models' (LLMs) ability to explain complex texts has raised the question of whether their encoded knowledge is sufficient to reason about system failures. The current weaknesses in LLMs, like misalignment and hallucinations suggest they make unsuitable safety analysts, but could ``fast but flawed'' analysis still be useful? LLMs can rapidly parse system descriptions for design mitigation strategies like redundancy, they can trace failure propagation from common mode faults (such as loss of power or hydraulics) to higher level events and even incorporate non-functional risks from outside the functional specification into an analysis. But despite their knowledge of hardware component failure modes, we found LLMs remain weak at failure logic reasoning. We used OpenAI's Generative Pre-trained Transformer (GPT) Builder to develop a specific role for analysing failure logic and generating the corresponding fault tree visualisation. Although there are no objective measures that qualitatively assess failure logic analysis (i.e.\/ logical errors have variable significance) or whether the choice of higher-level failure modes is a ``good model'' of system failure, we report on the iterative process of developing the GPT, our inability to override the underlying model behaviour to counter its weaknesses, and conclude by reflecting on the productivity gains of using LLMs despite their flawed reasoning.
Metadata
Item Type: | Conference or Workshop Item |
---|---|
Authors/Creators: |
|
Keywords: | Large Language Models · Fault Tree Analysis · Failure Logic |
Dates: |
|
Institution: | The University of York |
Academic Units: | The University of York > Faculty of Sciences (York) > Computer Science (York) |
Depositing User: | Pure (York) |
Date Deposited: | 24 Jul 2024 13:30 |
Last Modified: | 13 Feb 2025 05:35 |
Status: | Published |
Refereed: | Yes |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:215140 |