Combining pre-trained word embeddings and linguistic features for sequential metaphor identification

Abstract

We tackle the problem of identifying metaphors in text, treating it as a sequence tagging task. The pre-trained word embeddings GloVe, ELMo and BERT have each shown good performance on sequential metaphor identification. These embeddings are produced by different models, training objectives and corpora, and therefore encode different semantic and syntactic information. We show that combining GloVe, ELMo and feature-based BERT in a model built on a multi-channel CNN and a bidirectional LSTM significantly outperforms any single word embedding method, as well as combinations of two embeddings. Incorporating linguistic features further improves the model, yielding state-of-the-art performance on three public metaphor datasets. We also provide an in-depth analysis of the effectiveness of leveraging multiple word embeddings, including analysing the spatial distribution of different embedding methods for metaphors and literals, and showing how well the embeddings complement each other across genres and parts of speech.
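
To make the multi-channel idea concrete, the following is a minimal sketch in plain Python, not the authors' implementation. It treats each embedding method as one channel, slides a single filter over the token axis, and sums the responses across channels so that every token receives one feature. The function name `multi_channel_conv`, the toy vectors, the filter width of 3, the ReLU and the zero padding are all assumptions made for illustration; the bidirectional LSTM tagger and the linguistic features mentioned in the abstract are omitted.

```python
from typing import List

Vector = List[float]
Channel = List[Vector]  # one embedding vector per token


def multi_channel_conv(channels: List[Channel],
                       weights: List[List[Vector]],
                       bias: float = 0.0) -> List[float]:
    """Apply one filter over the token axis, summing across channels.

    channels[c][t] is the embedding of token t in channel c.
    weights[c][k] is the filter vector for channel c at window offset k.
    Zero padding at the sentence edges keeps one output feature per token.
    """
    n_tokens = len(channels[0])
    width = len(weights[0])
    half = width // 2
    features = []
    for t in range(n_tokens):
        total = bias
        for c, channel in enumerate(channels):
            for k in range(width):
                pos = t + k - half
                if 0 <= pos < n_tokens:  # outside the sentence counts as zero
                    total += sum(w * x for w, x in zip(weights[c][k], channel[pos]))
        features.append(max(0.0, total))  # ReLU non-linearity (an assumption)
    return features


tokens = ["she", "devoured", "the", "book"]

# Toy stand-ins for GloVe, ELMo and BERT vectors (hypothetical values;
# real vectors have hundreds of dimensions and may differ in size).
glove = [[0.1, 0.3], [0.9, 0.2], [0.0, 0.1], [0.4, 0.4]]
elmo = [[0.2, 0.1, 0.0], [0.7, 0.8, 0.1], [0.1, 0.0, 0.0], [0.3, 0.5, 0.2]]
bert = [[0.0, 0.2], [0.6, 0.9], [0.1, 0.1], [0.5, 0.3]]

# One width-3 filter per channel, sized to that channel's dimensionality.
weights = [
    [[0.5, 0.5]] * 3,
    [[0.3, 0.3, 0.3]] * 3,
    [[0.5, 0.5]] * 3,
]

features = multi_channel_conv([glove, elmo, bert], weights)
for token, feature in zip(tokens, features):
    print(token, round(feature, 3))
```

Because the output has one value per token, it fits the sequence tagging framing: a downstream tagger could assign a metaphor or literal label to each position. A real model would learn many such filters rather than use fixed weights.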

Metadata

Item Type:	Article
Authors/Creators:	Mao, R.; Lin, C. (ORCID: https://orcid.org/0000-0003-3454-2468); Guerin, F.
Copyright, Publisher and Additional Information:	© 2021 The Author(s). For reuse permissions, please contact the Author(s).
Dates:	Submitted: 7 April 2021
Institution:	The University of Sheffield
Academic Units:	The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield)
Depositing User:	Symplectic Sheffield
Date Deposited:	12 Aug 2021 11:25
Last Modified:	12 Aug 2021 11:58
Published Version:	https://arxiv.org/abs/2104.03285v1
Status:	Submitted
Related URLs:	arXiv URL
Open Archives Initiative ID (OAI ID):	oai:eprints.whiterose.ac.uk:177030
