AAAI Publications, Twenty-Fourth AAAI Conference on Artificial Intelligence

Font Size: 
Unsupervised Learning of Event Classes from Video
Muralikrishna Sridhar, Anthony G. Cohn, David C. Hogg

Last modified: 2010-07-05


We present a method for unsupervised learning of event classes from videos in which multiple actions might occur simultaneously. It is assumed that all such activities are produced from an underlying set of event class generators. The learning task is then to recover this generative process from visual data. A set of event classes is derived from the most likely decomposition of the tracks into a set of labelled events involving subsets of interacting tracks. Interactions between subsets of tracks are modelled as a relational graph structure that captures qualitative spatio-temporal relationships between these tracks. The posterior probability of candidate solutions favours decompositions in which events of the same class have a similar relational structure, together with other measures of well-formedness. A Markov Chain Monte Carlo (MCMC) procedure is used to efficiently search for the MAP solution. This search moves between possible decompositions of the tracks into sets of unlabelled events and at each move adds a close to optimal labelling (for this decomposition) using spectral clustering. Experiments on real data show that the discovered event classes are often semantically meaningful and correspond well with groundtruth event classes assigned by hand.


unsupervised learning of event classes; video activity understanding; scene interpretation; video activity analysis; learning from observation

Full Text: PDF