Learning implicit and explicit multi-task interactions for information extraction

Abstract

Information extraction aims at extracting entities, relations, etc., in text to support information retrieval systems. To extract information, researchers have considered multitask learning (ML) approaches. The conventional ML approach learns shared features across tasks, with the assumption that these features capture sufficient task interactions to learn expressive shared representations for task classification. However, such an assumption is flawed in different perspectives. First, the shared representation may contain noise introduced by another task; tasks coupled for multitask learning may have different complexities but this approach treats all tasks equally; the conventional approach has a flat structure which hinders the learning of explicit interactions. This approach however learns implicit interactions across tasks and often has a generalization ability which has benefited the learning of multitasks. In this paper, we take advantage of implicit interactions learned by conventional approaches while alleviating the issues mentioned above by developing a Recurrent Interaction Network with an effective Early Prediction Integration (RIN-EPI) for multitask learning. Specifically, RIN-EPI learns implicit and explicit interactions across two different but related tasks. To effectively learn explicit interactions across tasks, we consider the correlations among the outputs of related tasks. It is however obvious that task outputs are unobservable during training, so we leverage the predictions at intermediate layers (referred to as early predictions) as proxies as well as shared features across tasks to learn explicit interactions through attention mechanisms and sequence learning models. By recurrently learning explicit interactions, we gradually improve predictions for the individual tasks in the multitask learning. We demonstrate the effectiveness of RIN-EPI on the learning of two mainstream multitasks for information extraction: (1) entity recognition and relation classification, (2) aspect and opinion term co-extraction. Extensive experiments demonstrate the effectiveness of the RIN-EPI architecture, where we achieve state-of-the-art results on several benchmark datasets.

Metadata

Item Type:	Article
Authors/Creators:	Sun, K. Zhang, R. Mensah, S. https://orcid.org/0000-0003-0779-5574 Mao, Y. Liu, X.
Copyright, Publisher and Additional Information:	© 2022 Association for Computing Machinery. This is an author-produced version of a paper subsequently published in ACM Transactions on Information Systems. Uploaded in accordance with the publisher's self-archiving policy.
Keywords:	multitask learning; information extraction
Dates:	Accepted: 21 April 2022 Published (online): 11 June 2022 Published: April 2023
Institution:	The University of Sheffield
Academic Units:	The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield)
Funding Information:	Funder Grant number The Leverhulme Trust RPG-2020-148
Depositing User:	Symplectic Sheffield
Date Deposited:	24 Aug 2022 10:58
Last Modified:	11 Jul 2024 08:22
Status:	Published
Publisher:	Association for Computing Machinery (ACM)
Refereed:	Yes
Identification Number:	10.1145/3533020
Open Archives Initiative ID (OAI ID):	oai:eprints.whiterose.ac.uk:190345

CORE (COnnecting REpositories)

Learning implicit and explicit multi-task interactions for information extraction

Abstract

Metadata

Download

Accepted Version

Export

Statistics