Optimizing Training Set Construction for Video Semantic Classification

Tang, Jinhui; Hua, Xian-Sheng; Song, Yan; Mei, Tao; Wu, Xiuqing

doi:10.1155/2008/693731

Research Article
Open access
Published: 21 November 2007

Optimizing Training Set Construction for Video Semantic Classification

Jinhui Tang¹,
Xian-Sheng Hua²,
Yan Song¹,
Tao Mei² &
…
Xiuqing Wu¹

EURASIP Journal on Advances in Signal Processing volume 2008, Article number: 693731 (2007) Cite this article

1317 Accesses
2 Citations
3 Altmetric
Metrics details

Abstract

We exploit the criteria to optimize training set construction for the large-scale video semantic classification. Due to the large gap between low-level features and higher-level semantics, as well as the high diversity of video data, it is difficult to represent the prototypes of semantic concepts by a training set of limited size. In video semantic classification, most of the learning-based approaches require a large training set to achieve good generalization capacity, in which large amounts of labor-intensive manual labeling are ineluctable. However, it is observed that the generalization capacity of a classifier highly depends on the geometrical distribution of the training data rather than the size. We argue that a training set which includes most temporal and spatial distribution information of the whole data will achieve a good performance even if the size of training set is limited. In order to capture the geometrical distribution characteristics of a given video collection, we propose four metrics for constructing/selecting an optimal training set, including salience, temporal dispersiveness, spatial dispersiveness, and diversity. Furthermore, based on these metrics, we propose a set of optimization rules to capture the most distribution information of the whole data using a training set with a given size. Experimental results demonstrate these rules are effective for training set construction in video semantic classification, and significantly outperform random training set selection.

Publisher note

To access the full article, please see PDF.

Author information

Authors and Affiliations

Department of Electronic Engineering and Information Science, University of Science and Technology of China, Hefei, 230027, China
Jinhui Tang, Yan Song & Xiuqing Wu
Microsoft Research Asia, Beijing, 100080, China
Xian-Sheng Hua & Tao Mei

Authors

Jinhui Tang
View author publications
You can also search for this author in PubMed Google Scholar
Xian-Sheng Hua
View author publications
You can also search for this author in PubMed Google Scholar
Yan Song
View author publications
You can also search for this author in PubMed Google Scholar
Tao Mei
View author publications
You can also search for this author in PubMed Google Scholar
Xiuqing Wu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jinhui Tang.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Tang, J., Hua, XS., Song, Y. et al. Optimizing Training Set Construction for Video Semantic Classification. EURASIP J. Adv. Signal Process. 2008, 693731 (2007). https://doi.org/10.1155/2008/693731

Download citation

Received: 09 March 2007
Revised: 14 September 2007
Accepted: 12 November 2007
Published: 21 November 2007
DOI: https://doi.org/10.1155/2008/693731

Optimizing Training Set Construction for Video Semantic Classification

Abstract

Publisher note

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords