Bin-Hezam, R. and Stevenson, R. orcid.org/0000-0002-9483-6006 (Accepted: 2025) A generalised and adaptable reinforcement learning stopping method. In: Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval. The 48th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2025), 13-17 Jul 2025, Padua, Italy. ACM ISBN 9798400715921 (In Press)
Abstract
This paper presents a Technology Assisted Review (TAR) stopping approach based on Reinforcement Learning (RL). Previous such approaches offered limited control over stopping behaviour, such as fixing the target recall and tradeoff between preferring to maximise recall or cost. These limitations are overcome by introducing a novel RL environment, GRLStop, that allows a single model to be applied to multiple target recalls, balances the recall/cost tradeoff and integrates a classifier. Experiments were carried out on six benchmark datasets (CLEF e-Health datasets 2017-9, TREC Total Recall, TREC Legal and Reuters RCV1) at multiple target recall levels. Results showed that the proposed approach to be effective compared to multiple baselines in addition to offering greater flexibility.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2025 Copyright held by the owner/author(s). |
Keywords: | Reinforcement Learning; Deep Reinforcement Learning; Technology Assisted Review; TAR; Stopping Methods |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 13 May 2025 10:43 |
Last Modified: | 13 May 2025 10:43 |
Status: | In Press |
Publisher: | ACM |
Refereed: | Yes |
Identification Number: | 10.1145/3726302.3729879 |
Related URLs: | |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:226220 |
Download
Filename: SIGIR25___Generalised_RLStop.pdf
