Wang, J.K. orcid.org/0000-0003-0048-3893 and Gaizauskas, R. orcid.org/0000-0002-3356-5126 (2015) Generating Image Descriptions with Gold Standard Visual Inputs: Motivation, Evaluation and Baselines. In: Proceedings of the 15th European Workshop on Natural Language Generation (ENLG). 15th European Workshop on Natural Language Generation (ENLG), 10-11 Sep 2015, Brighton, UK. Association for Computational Linguistics , pp. 117-126. ISBN 978-1-941643-78-5
Abstract
In this paper, we present the task of generating image descriptions with gold standard visual detections as input, rather than directly from an image. This allows the Natural Language Generation community to focus on the text generation process, rather than dealing with the noise and complications arising from the visual detection process. We propose a fine-grained evaluation metric specifically for evaluating the content selection capabilities of image description generation systems. To demonstrate the evaluation metric on the task, several baselines are presented using bounding box information and textual information as priors for content selection. The baselines are evaluated using the proposed metric, showing that the fine-grained metric is useful for evaluating the content selection phase of an image description generation system.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2015 The Association for Computational Linguistics. This is an author produced version of a paper subsequently published in Proceedings of the 15th European Workshop on Natural Language Generation (ENLG). Uploaded in accordance with the publisher's self-archiving policy. |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
Funding Information: | Funder Grant number ENGINEERING AND PHYSICAL SCIENCE RESEARCH COUNCIL (EPSRC) EP/K019082/1 |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 18 May 2016 10:36 |
Last Modified: | 25 Mar 2018 14:30 |
Published Version: | http://www.aclweb.org/anthology/W15-4722 |
Status: | Published |
Publisher: | Association for Computational Linguistics |
Refereed: | Yes |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:99026 |