Skip to main content

Erratum to: Novel Kernel-Based Recognizers of Human Actions

The Original Article was published on 16 May 2010

In our paper Novel Kernel-based Recognizers of Human Actions[1], several sentences should be corrected as indicated below. The corrections do not affect the basic results of the paper.

  • Section 2 (Related works), Subsection 2.1 (Features for Action Recognition). The last three sentences of the second paragraph should read as follows:

    • Jhuang et al. [20] present a hierarchical supervised method with spatio-temporal, gradient and flow filters organized in various layers of complexity. In the last layer a multi-class SVM recognizes the human action.

  • Section 2 (Related works), Subsection 2.1 (Features for action recognition). The last paragraph should read as follows:

    • The approach is taken further by Schindler and Van Gool [24], who investigated the detection of actions from very short sequences called snippets. Two separate pathways for motion and shape are considered. Motion is modeled by means of optical flow, computed for different directions and scales. Shape is represented by Gabor filter responses. MAX-pooling and comparison with a set of templates (learned using PCA) yield high-level feature vectors, which are classified through SVMs. In our approach, we feed our classification algorithm by such a powerful feature descriptor, independently computed for each pair of frames.

  • Section 2 (Related works), Subsection 2.2. (Classification for action recognition). The second paragraph should read as follows:

    • Previous work [25] proposes 2D spatio-temporal compound features that are learned in a weakly-supervised approach using a data mining algorithm. Several researchers have explored unsupervised methods for motion analysis. Hoey [26] uses a multilevel dynamic Bayesian Network as an unsupervised classifier of facial expressions. Zhong et. al. [27] propose an unsupervised method to detect unusual activities in videos, by comparison with action prototypes. An alternative approach [28] detects unusual activity by spectral clustering and a hierarchical observation Hidden Markov Model. Boiman and Irani [29] explain a video sequence using patches from a database; as dense sampling of the patches is necessary in their approach, the resulting algorithm is very time-consuming and unpractical for action recognition. Wang et al. [30] adopt spectral clustering to cluster a large set of human action images. In this context, shape features are used to compute distances, by means of a linear programming approach. Niebels et al. [11] deal with unsupervised learning of human action categories. Each action is represented by a probability distribution on spatio-temporal features. Action classes are modeled by latent topic models, such as probabilistic Latent Semantic Analysis and Latent Dirichlet Allocation.

  • The following sentence should be appended to the caption of figure 3 (page 7).

    • Illustration adapted from reference [24], 2008 IEEE. Reprinted, with permission, from Proc. CVPR 2008.

  • Section 3 (The Maximum Mean Discrepancy). The second sentence should read as follows.

    • Recent work [5,6,8] studies the embedding of random variables into a Reproducing Kernel Hilbert Space (RKHS), by using kernels that take into account information from higher order statistics.

  • Section 3 (The Maximum Mean Discrepancy). Definition 1 should read as follows.

    • LetPP be a Borel probability measure which is defined on a separable metric spaceX. A positive definite and bounded kernel k denoted ask:X×X can be used to mapP to an RKHS. For eachxX and continuous function ϕ(x), a kernel is an inner product computation,k(x, x ):=<ϕ(x),ϕ( x ) > . The P-based expectation of ϕ(x) is the so-called mean element μ P [8,9]:

      μ P : P
      P X Φ ( x ) dP.
  • Section 3. The second paragraph after definition 3 should read as follows.

    • Theorem 2 defines the distance measure between probability distributions. However, we also need to measure whether the resulting distance is statistically significant for asymptotic distributions P and Q. In a two-sample test this is done by comparing the null hypothesis, 0 :P=Q, against the alternative hypothesis 1 :PQ; a significance threshold is then obtained.

  • After Theorem 6, the last sentence in the paragraph should read as follows.

    • This quantile can be estimated by means of the bootstrap method [8,9].


  1. Danafar S, Giusti A, Schmidhuber J: Novel Kernel-Based Recognizers of Human Actions. EURASIP J. Adv. Signal Process. 2010. doi:10.1155/2010/202768

    Google Scholar 

Download references

Author information

Authors and Affiliations


Corresponding author

Correspondence to Alessandro Giusti.

Additional information

The online version of the original article can be found at 10.1155/2010/202768

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License ( ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and Permissions

About this article

Cite this article

Danafar, S., Giusti, A. & Schmidhuber, J. Erratum to: Novel Kernel-Based Recognizers of Human Actions. EURASIP J. Adv. Signal Process. 2012, 124 (2012).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: