Miao, Q, Zhu, H, Liu, J et al. (5 more authors) (2022) MuchSUM: A Multi-channel Graph Neural Network for Extractive Summarization. In: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 11-15 Jul 2022, Madrid, Spain. ACM , pp. 2617-2622. ISBN 978-1-4503-8732-3
Abstract
Recent studies of extractive text summarization have leveraged BERT for document encoding with breakthrough performance. However, when using a pre-trained BERT-based encoder, existing approaches for selecting representative sentences for text summarization are inadequate since the encoder is not explicitly trained for representing sentences. Simply providing the BERT-initialized sentences to cross-sentential graph-based neural networks (GNNs) to encode semantic features of the sentences is not ideal because doing so fail to integrate other summary-worthy features like sentence importance and positions. This paper presents MuchSUM, a better approach for extractive text summarization. MuchSUM is a multi-channel graph convolutional network designed to explicitly incorporate multiple salient summary-worthy features. Specifically, we introduce three specific graph channels to encode the node textual features, node centrality features, and node position features, respectively, under bipartite word-sentence heterogeneous graphs. Then, a cross-channel convolution operation is designed to distill the common graph representations shared by different channels. Finally, the sentence representations of each channel are fused for extractive summarization. We also investigate three weighted graphs in each channel to infuse edge features for graph-based summarization modeling. Experimental results demonstrate our model can achieve considerable performance compared with some BERT-initialized graph-based extractive summarization systems.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2022 ACM. This is an author produced version of a conference paper published in Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. Uploaded in accordance with the publisher's self-archiving policy. |
Keywords: | extractive summarization, multi-channel graph, textsummarization, bipartite word-sentence heterogeneous graph. |
Dates: |
|
Institution: | The University of Leeds |
Academic Units: | The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Computing (Leeds) |
Depositing User: | Symplectic Publications |
Date Deposited: | 20 Apr 2022 11:41 |
Last Modified: | 13 Jul 2022 09:38 |
Status: | Published |
Publisher: | ACM |
Identification Number: | 10.1145/3477495.3531906 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:185779 |