 Erratum
 Open Access
Erratum to: Novel KernelBased Recognizers of Human Actions
 Somayeh Danafar^{1},
 Alessandro Giusti^{1}Email author and
 Jürgen Schmidhuber^{1}
https://doi.org/10.1186/168761802012124
© Danafar et al.; licensee Springer. 2012
 Received: 21 May 2012
 Accepted: 21 May 2012
 Published: 21 June 2012
The original article was published in EURASIP Journal on Advances in Signal Processing 2010 2010:202768

Section 2 (Related works), Subsection 2.1 (Features for Action Recognition). The last three sentences of the second paragraph should read as follows:

Jhuang et al. [20] present a hierarchical supervised method with spatiotemporal, gradient and flow filters organized in various layers of complexity. In the last layer a multiclass SVM recognizes the human action.


Section 2 (Related works), Subsection 2.1 (Features for action recognition). The last paragraph should read as follows:

The approach is taken further by Schindler and Van Gool [24], who investigated the detection of actions from very short sequences called snippets. Two separate pathways for motion and shape are considered. Motion is modeled by means of optical flow, computed for different directions and scales. Shape is represented by Gabor filter responses. MAXpooling and comparison with a set of templates (learned using PCA) yield highlevel feature vectors, which are classified through SVMs. In our approach, we feed our classification algorithm by such a powerful feature descriptor, independently computed for each pair of frames.


Section 2 (Related works), Subsection 2.2. (Classification for action recognition). The second paragraph should read as follows:

Previous work [25] proposes 2D spatiotemporal compound features that are learned in a weaklysupervised approach using a data mining algorithm. Several researchers have explored unsupervised methods for motion analysis. Hoey [26] uses a multilevel dynamic Bayesian Network as an unsupervised classifier of facial expressions. Zhong et. al. [27] propose an unsupervised method to detect unusual activities in videos, by comparison with action prototypes. An alternative approach [28] detects unusual activity by spectral clustering and a hierarchical observation Hidden Markov Model. Boiman and Irani [29] explain a video sequence using patches from a database; as dense sampling of the patches is necessary in their approach, the resulting algorithm is very timeconsuming and unpractical for action recognition. Wang et al. [30] adopt spectral clustering to cluster a large set of human action images. In this context, shape features are used to compute distances, by means of a linear programming approach. Niebels et al. [11] deal with unsupervised learning of human action categories. Each action is represented by a probability distribution on spatiotemporal features. Action classes are modeled by latent topic models, such as probabilistic Latent Semantic Analysis and Latent Dirichlet Allocation.


The following sentence should be appended to the caption of figure 3 (page 7).

Illustration adapted from reference [24], Ⓒ2008 IEEE. Reprinted, with permission, from Proc. CVPR 2008.


Section 3 (The Maximum Mean Discrepancy). The second sentence should read as follows.

Recent work [5,6,8] studies the embedding of random variables into a Reproducing Kernel Hilbert Space (RKHS), by using kernels that take into account information from higher order statistics.


Section 3 (The Maximum Mean Discrepancy). Definition 1 should read as follows.

Let$P\in \mathcal{P}$ be a Borel probability measure which is defined on a separable metric space$\phantom{\rule{0.5em}{0ex}}\mathcal{X}$. A positive definite and bounded kernel k denoted as$k:\mathcal{X}\times \mathcal{X}\to \mathbb{R}$ can be used to map$\phantom{\rule{0.5em}{0ex}}\mathcal{P}$ to an RKHS$\phantom{\rule{0.5em}{0ex}}\mathcal{\mathcal{F}}$. For each$x\in \mathcal{X}$ and continuous function ϕ(x), a kernel is an inner product computation,$k(x,{x}^{\u2033}):=<\varphi \left(x\right),\varphi \left({x}^{\u2033}\right){>}_{\mathcal{\mathcal{F}}}$. The Pbased expectation of ϕ(x) is the socalled mean element _{ μ P }[8,9]:${\mu}_{P}:\mathcal{P}\phantom{\rule{1em}{0ex}}\to \phantom{\rule{1em}{0ex}}\mathcal{\mathcal{F}}$(1)$\phantom{\rule{1em}{0ex}}P\mapsto \underset{\mathcal{X}}{\int}\Phi \left(x\right)\text{dP.}$(2)


Section 3. The second paragraph after definition 3 should read as follows.

Theorem 2 defines the distance measure between probability distributions. However, we also need to measure whether the resulting distance is statistically significant for asymptotic distributions P and Q. In a twosample test this is done by comparing the null hypothesis,${\mathcal{\mathscr{H}}}_{0}:P=Q$, against the alternative hypothesis${\mathcal{\mathscr{H}}}_{1}:P\ne Q$; a significance threshold is then obtained.


After Theorem 6, the last sentence in the paragraph should read as follows.

This quantile can be estimated by means of the bootstrap method [8,9].

Notes
Authors’ Affiliations
References
 Danafar S, Giusti A, Schmidhuber J: Novel KernelBased Recognizers of Human Actions. EURASIP J. Adv. Signal Process. 2010. doi:10.1155/2010/202768Google Scholar
Copyright
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.