Kaya, Gulsum Kubra, Bovell, Dominique, Sujan, Mark orcid.org/0000-0001-6895-946X et al. (1 more author) (2025) Large language models powered system safety assessment:applying STPA and FRAM. Safety science. 106960. ISSN: 0925-7535
Abstract
The advancement of large language models (LLMs) shows immense promise in many domains. However, their reliability is still questionable. This study aims to comparatively examine the performance of ChatGPT and Gemini in conducting a stand-alone systems-based risk assessment using System-Theoretic Process Analysis (STPA) and the Functional Resonance Analysis Method (FRAM). Our findings revealed that both LLMs demonstrated weaknesses in their analyses, with ChatGPT generally outperforming Gemini regarding response comprehensiveness and adhering to the prompted format. Specifically, LLMs failed to use systems thinking in their stand-alone applications and failed to follow up on previous prompt outputs. While LLMs can provide substantial amounts of information quickly, the effectiveness of LLMs in system safety assessment is contingent on addressing their limitations and implementing strategies to improve their capabilities.
Metadata
| Item Type: | Article |
|---|---|
| Authors/Creators: |
|
| Copyright, Publisher and Additional Information: | © 2025 The Author(s). |
| Keywords: | AI chatbot,ChatGPT,FRAM,Gemini,LLM,STPA |
| Dates: |
|
| Institution: | The University of York |
| Academic Units: | The University of York > Faculty of Sciences (York) > Computer Science (York) |
| Date Deposited: | 11 Mar 2026 13:00 |
| Last Modified: | 11 Mar 2026 13:00 |
| Published Version: | https://doi.org/10.1016/j.ssci.2025.106960 |
| Status: | Published |
| Refereed: | Yes |
| Identification Number: | 10.1016/j.ssci.2025.106960 |
| Related URLs: | |
| Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:238978 |
Download
Filename: 1-s2.0-S0925753525001857-main.pdf
Description: Large language models powered system safety assessment: applying STPA and FRAM
Licence: CC-BY 2.5

CORE (COnnecting REpositories)
CORE (COnnecting REpositories)