Quattoni, A., Ramisa, A., Madhyastha, P.S. et al. (2 more authors) (2016) Structured Prediction with Output Embeddings for Semantic Image Annotation. In: Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. NAACL-HLT 2016, June 12 to June 17, 2016, San Diego, CA. Association for Computational Linguistics, San Diego, California, pp. 552-557. ISBN 978-1-941643-91-4
Abstract
We address the task of annotating images with semantic tuples. Solving this problem requires an algorithm which is able to deal with hundreds of classes for each argument of the tuple. In such contexts, data sparsity becomes a key challenge, as there will be a large number of classes for which only a few examples are available. We propose handling this by incorporating feature representations of both the inputs (images) and outputs (argument classes) into a factorized log-linear model, and exploiting the flexibility of scoring functions based on bilinear forms. Experiments show that integrating feature representations of the outputs in the structured prediction model leads to better overall predictions. We also conclude that the best output representation is specific for each type of argument.
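The bilinear scoring idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a single tuple argument, a score of the form s(x, y) = phi(x)^T W psi(y), and a softmax over candidate classes to obtain a log-linear distribution. All names, dimensions, and the random data are hypothetical.

```python
import numpy as np

def bilinear_scores(phi_x, W, psi_Y):
    """Score every candidate class y as phi(x)^T W psi(y).

    phi_x: (d_in,) input (image) feature vector
    W:     (d_in, d_out) parameter matrix of the bilinear form
    psi_Y: (n_classes, d_out) one output embedding per class
    """
    return phi_x @ W @ psi_Y.T

def log_softmax(scores):
    """Numerically stable log of the softmax distribution over classes."""
    shifted = scores - scores.max()
    return shifted - np.log(np.exp(shifted).sum())

# Hypothetical sizes and random data, for illustration only.
rng = np.random.default_rng(0)
d_in, d_out, n_classes = 8, 5, 100
phi_x = rng.normal(size=d_in)                # assumed image representation
psi_Y = rng.normal(size=(n_classes, d_out))  # assumed output embeddings
W = rng.normal(size=(d_in, d_out))           # would be learned in practice

log_probs = log_softmax(bilinear_scores(phi_x, W, psi_Y))
predicted = int(np.argmax(log_probs))        # index of the best-scoring class
```

Because classes enter the score only through their embeddings, classes with few training examples share parameters with similar classes, which is one way to read the abstract's point about data sparsity. In a tuple-annotation setting, one such distribution per argument type would presumably be combined, each with its own choice of output embedding.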
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: | |
Copyright, Publisher and Additional Information: | © 2016 Association for Computational Linguistics. This is an author produced version of a paper subsequently published in Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Uploaded in accordance with the publisher's self-archiving policy. |
Keywords: | cs.CV |
Dates: | |
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 03 Nov 2016 08:24 |
Last Modified: | 21 Mar 2018 18:53 |
Published Version: | http://www.aclweb.org/anthology/N16-1068 |
Status: | Published |
Publisher: | Association for Computational Linguistics |
Refereed: | Yes |
Related URLs: | |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:106915 |