Zhang, H., Mohammed, A. and Dimitrova, V. orcid.org/0000-0002-7001-0891 (2024) Weakly Supervised Short Text Classification for Characterising Video Segments. In: Poquet, O., Ortega-Arranz, A., Viberg, O., Chounta, I-A., McLaren, B. and Jovanovic, J., (eds.) Proceedings of the 16th International Conference on Computer Supported Education - (Volume 2). CSEDU 2024:16th International Conference on Computer Supported Education, 02-04 May 2024, Angiers, France. Scitepress , pp. 197-204. ISBN 978-989-758-697-2
Abstract
In this age of life-wide learning, video-based learning has increasingly become a crucial method of education. However, the challenge lies in watching numerous videos and connecting key points from these videos with relevant study domains. This requires video characterization. Existing research on video characterization focuses on manual or automatic methods. These methods either require substantial human resources (experts to identify domain related videos and domain related areas in the videos) or rely on learner input (by relating video parts to their learning), often overlooking the assessment of their effectiveness in aiding learning. Manual methods are subjective, prone to errors and time consuming. Automatic supervised methods require training data which in many cases is unavailable. In this paper we propose a weakly supervised method that utilizes concepts from an ontology to guide models in thematically classifying and characterising video segments. Our research is concentra ted in the health domain, conducting experiments with several models, including the large language model GPT-4. The results indicate that CorEx significantly outperforms other models, while GLDA and Guided BERTopic show limitations in this task. Although GPT-4 demonstrates consistent performance, it still falls behind CorEx. This study offers an innovative perspective in video-based learning, especially in automating the detection of learning themes in video content.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Editors: |
|
Copyright, Publisher and Additional Information: | © 2024 by SCITEPRESS – Science and Technology Publications, Lda. This is an open access conference paper under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs License (CC BY-NC-ND). |
Keywords: | Video-Based Learning, Weakly Supervised Text Classification, Large Language Model |
Dates: |
|
Institution: | The University of Leeds |
Academic Units: | The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Computing (Leeds) > Artificial Intelligence |
Funding Information: | Funder Grant number EU - European Union 825750 |
Depositing User: | Symplectic Publications |
Date Deposited: | 26 Sep 2024 15:53 |
Last Modified: | 26 Sep 2024 15:53 |
Status: | Published |
Publisher: | Scitepress |
Identification Number: | 10.5220/0012618600003693 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:217642 |