A Perceptual Model for Sinusoidal Audio Coding Based on Spectral Integration

van de Par, Steven; Kohlrausch, Armin; Heusdens, Richard; Jensen, Jesper; Jensen, Søren Holdt

doi:10.1155/ASP.2005.1292

Research Article
Open access
Published: 21 June 2005

Steven van de Par¹, Armin Kohlrausch¹,², Richard Heusdens³, Jesper Jensen³ & Søren Holdt Jensen⁴

EURASIP Journal on Advances in Signal Processing volume 2005, Article number: 317529 (2005)

Abstract

Psychoacoustical models have been used extensively within audio coding applications over the past decades. Recently, parametric coding techniques have been applied to general audio and this has created the need for a psychoacoustical model that is specifically suited for sinusoidal modelling of audio signals. In this paper, we present a new perceptual model that predicts masked thresholds for sinusoidal distortions. The model relies on signal detection theory and incorporates more recent insights about spectral and temporal integration in auditory masking. As a consequence, the model is able to predict the distortion detectability. In fact, the distortion detectability defines a (perceptually relevant) norm on the underlying signal space which is beneficial for optimisation algorithms such as rate-distortion optimisation or linear predictive coding. We evaluate the merits of the model by combining it with a sinusoidal extraction method and compare the results with those obtained with the ISO MPEG-1 Layer I-II recommended model. Listening tests show a clear preference for the new model. More specifically, the model presented here leads to a reduction of more than 20% in terms of number of sinusoids needed to represent signals at a given quality level.

Author information

Authors and Affiliations

Digital Signal Processing Group, Philips Research Laboratories, Eindhoven, 5656 AA, The Netherlands
Steven van de Par & Armin Kohlrausch
Department of Technology Management, Eindhoven University of Technology, Eindhoven, 5600 MB, The Netherlands
Armin Kohlrausch
Department of Mediamatics, Delft University of Technology, Delft, 2600 GA, The Netherlands
Richard Heusdens & Jesper Jensen
Department of Communication Technology, Institute of Electronic Systems, Aalborg University, Aalborg, DK-9220, Denmark
Søren Holdt Jensen

Corresponding author

Correspondence to Steven van de Par.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

About this article

Cite this article

van de Par, S., Kohlrausch, A., Heusdens, R. et al. A Perceptual Model for Sinusoidal Audio Coding Based on Spectral Integration. EURASIP J. Adv. Signal Process. 2005, 317529 (2005). https://doi.org/10.1155/ASP.2005.1292

Received: 31 October 2003
Revised: 22 July 2004
Published: 21 June 2005
DOI: https://doi.org/10.1155/ASP.2005.1292
