Cai, Q. and Abhayaratne, C. orcid.org/0000-0002-2799-7395 (2023) SSDB-Net: a single-step dual branch network for weakly supervised semantic segmentation of food images. In: 2023 IEEE 25th International Workshop on Multimedia Signal Processing (MMSP) Proceedings. 2023 IEEE 25th International Workshop on Multimedia Signal Processing (MMSP), 27-29 Sep 2023, Poitiers, France. Institute of Electrical and Electronics Engineers (IEEE) ISBN 9798350338942
Abstract
Food image segmentation, as a critical task in food and nutrition research, promotes the development of various application domains such as calorie and nutrition estimation, food recommender systems, and daily food monitoring systems. Currently, most of the research is focused on food and non-food segmentation, which simply segments the food and background regions. Differently, semantic food segmentation can identify different specific food ingredients in a food image and provide more detailed and accurate information such as object location, shape and class. This is a more challenging but meaningful task, because the same food may appear in completely different colours, shapes and textures in different dishes, and correspondingly less researched. From the implementation perspective, most previous research is based on deep learning methods with pixel-level labelled data. However, annotating pixel-level labels requires extremely high labour costs. In this paper, a novel single-step dual branch network (SSDB-Net) is proposed to achieve weakly supervised semantic food segmentation. To our knowledge, this research is the first time proposing weakly supervised semantic food segmentation with image-level labels based on convolutional neural networks (CNN). It may serve as a benchmark for future food segmentation research. Our proposal method resulted in an mIoU of 14.79% for 104 categories in the FoodSeg103 dataset compared to 11.49% of the state-of-the-art WSSS method applied in food domains.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2023 The Authors. Except as otherwise noted, this author-accepted version of a paper published in 2023 IEEE 25th International Workshop on Multimedia Signal Processing (MMSP) Proceedings is made available via the University of Sheffield Research Publications and Copyright Policy under the terms of the Creative Commons Attribution 4.0 International License (CC-BY 4.0), which permits unrestricted use, distribution and reproduction in any medium, provided the original work is properly cited. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ |
Keywords: | Weakly supervised semantic segmentation; semantic food segmentation; food image analysis |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Electronic and Electrical Engineering (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 03 Jan 2024 17:05 |
Last Modified: | 04 Jan 2024 13:00 |
Status: | Published |
Publisher: | Institute of Electrical and Electronics Engineers (IEEE) |
Refereed: | Yes |
Identification Number: | 10.1109/mmsp59012.2023.10337656 |
Related URLs: | |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:206917 |