Sivic, J., Everingham, M. and Zisserman, A. (2009) "'Who are you?' - Learning person specific classifiers from video". In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2009, June 20 - 25, 2009, Miami, Florida, pp. 1145-1152.
We investigate the problem of automatically labelling faces of characters in TV or movie material with their names, using only weak supervision from automatically aligned subtitle and script text. Our previous work (Everingham et al.) demonstrated promising results on the task, but the coverage of the method (proportion of video labelled) and generalization was limited by a restriction to frontal faces and nearest neighbour classification. In this paper we build on that method, extending the coverage greatly by the detection and recognition of characters in profile views. In addition, we make the following contributions: (i) seamless tracking, integration and recognition of profile and frontal detections, and (ii) a character specific multiple kernel classifier which is able to learn the features best able to discriminate between the characters. We report results on seven episodes of the TV series “Buffy the Vampire Slayer”, demonstrating significantly increased coverage and performance with respect to previous methods on this material.
|Institution:||The University of Leeds|
|Academic Units:||The University of Leeds > Faculty of Engineering (Leeds) > School of Computing (Leeds)|
|Depositing User:||Miss Jamie Grant|
|Date Deposited:||07 Jul 2009 12:40|
|Last Modified:||16 Sep 2016 13:47|