Conference Publication Details
Mandatory Fields
Guo J.; Gurrin C.
AVMA'12 - Proceedings of the 2012 ACM Workshop on Audio and Multimedia Methods for Large-Scale Video Analysis, Co-located with ACM Multimedia 2012
Short user-generated videos classification using accompanied audio categories
2012
December
Published
1
Optional Fields
MFCC User-generated video Video classification
15
20
This paper investigates the classification of short user-generated videos (UGVs) using the accompanied audio data, since short UGVs account for a great proportion of the Internet UGVs and many short UGVs are accompanied by single-category soundtracks. We define seven types of UGVs corresponding to seven audio categories respectively. We also investigate three modeling approaches for audio feature representation, namely, single Gaussian (1G), Gaussian mixture (GMM) and Bag-of-Audio-Word (BoAW) models. Then, using Support Vector Machine (SVM) with three different distance measurements corresponding to the three feature representations, classifiers are trained to categorize the UGVs. The accompanying evaluation results show that these approaches are effective for categorizing the short UGVs based on their audio track. Experimental results show that a GMM representation with approximated Bhattacharyya distance (ABD) measurement produces the best performance, and a BoAW representation with χ² kernel also reports comparable results. Copyright 2012 ACM.
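As an illustration of the BoAW comparison mentioned in the abstract, the following is a minimal sketch of a chi-square kernel between two audio-word histograms. The abstract does not state which form of the kernel was used, so the exponential form, the `gamma` parameter, and the function names `chi2_distance` and `chi2_kernel` are assumptions made here for illustration only, not the authors' implementation.

```python
import math


def chi2_distance(h1, h2):
    """Chi-square distance between two non-negative histograms of equal length."""
    if len(h1) != len(h2):
        raise ValueError("histograms must have the same number of bins")
    total = 0.0
    for a, b in zip(h1, h2):
        denom = a + b
        if denom > 0:  # bins that are empty in both histograms contribute nothing
            total += (a - b) ** 2 / denom
    return total


def chi2_kernel(h1, h2, gamma=1.0):
    """Exponential chi-square kernel (assumed form): exp(-gamma * distance)."""
    return math.exp(-gamma * chi2_distance(h1, h2))
```

A kernel matrix built this way over all pairs of training histograms could in principle be handed to an SVM that accepts precomputed kernels; identical histograms give a kernel value of 1, and the value decreases as the histograms diverge.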
10.1145/2390214.2390220
Grant Details