Performance analysis and improvement of dither modulation under the composite attacks

Zhu, Xinshan; Ding, Jie

doi:10.1186/1687-6180-2012-53

Research
Open access
Published: 02 March 2012

Performance analysis and improvement of dither modulation under the composite attacks

Xinshan Zhu^1,2 &
Jie Ding^1,3

EURASIP Journal on Advances in Signal Processing volume 2012, Article number: 53 (2012) Cite this article

3224 Accesses
4 Citations
Metrics details

Abstract

The first goal of this article is to analyze the performance of dither modulation (DM) against the composite attacks including valumetric scaling, additive noise and constant change. The analyzes are developed under the assumptions that the host vector and noise vector are mutually independent and both of them have independently and identically distributed components. We derive the general expressions of the probability density functions of several concerned signals and the decoding error probability. The specific theoretical results are provided for the case of generalized Gaussian host and noise. Based on the analyzes, the performance of DM is predicted for different scenarios with a high degree of accuracy and evaluated for different distribution models of host and noise signals. Numerical simulations confirm the validity of the given theoretical analyzes. Then, we address to improve the robustness of DM against valumetric scaling plus constant change. The normalized dither modulation (NDM) is presented, which works by constructing a gain-invariant vector with zero mean for quantization. Performance analysis shows that NDM is theoretically invariant to valumetric scaling and constant change and achieves similar performance to DM in other aspects. The performance of NDM is further improved by weighting the quantization errors. Experiments on real images also show the advantage of NDM over DM subject to amplitude scaling and constant change.

1 Introduction

In the past decade, much attention has been paid to the quantization-based watermarking for canceling the host signal interference. One of the most important methods proposed so far is quantization index modulation (QIM) [1]. The basic QIM algorithm includes a number of variants, i.e., dither modulation (DM), distortion compensated dither modulation (DC-DM) (also known as scalar Costa scheme (SCS) [2]) and spread transform dither modulation (STDM) [1]. The theoretical performance of QIM methods is a key issue and has received considerable attention.

Initially, the Gaussian channel is often used in the analyzes and the performance of QIM methods has been extensively investigated in this case. A relatively crude approximation to the decoding error probability of QIM was given in [1] for the additive white Gaussian noise (AWGN) attacks. The performance of SCS was completely analyzed by Eggers et al. [2] under the AWGN attacks. In [3], the performance of scalar DC-QIM against AWGN was theoretically evaluated from the detection viewpoint. Recently, a new logarithmic QIM (LQIM) was presented in [4] and its performance was analyzed in the presence of AWGN. It has been pointed out in [5] that the performance of QIM methods may be overstated under Gaussian channels. In the second phase, a deeper analysis is done for QIM taking into account a much wider variety of attacks. The careful performance analyzes were presented by Pérez-Gonzàlez et al. [5] for a large class of QIM methods in the cases of uniform and Gaussian noise. Bartolini et al. [6] concentrated on analyzing the performance of STDM in the presence of two important classes of non additive attacks, the gain attack plus noise addition and the quantization attack. In [7], the authors proposed an improved DM scheme to resist linear-time-invariant filtering and provided a thorough analysis of it. We notice that most of previous analyzes make use of the Gaussian host assumption and even neglect the statistical properties of the host signal.

The conventional QIM has a serious drawback, i.e., the weakness against valumetric scaling. Spherical codes were utilized to cope with this problem in [8]. However, watermark embedding and recovery get very complicated [9]. Oostveen et al. [10] proposed to choose the adaptive quantization step size to be proportional to a local average of the host signal samples. Despite its robustness against valumetric scaling, the method presents a nonzero probability of error even for null distortions and becomes more sensitive to constant change. Rational dithered modulation (RDM) was developed in [9] using a gain-invariant adaptive quantization step size at both embedder and decoder. The RDM achieves invariance to valumetric scaling, but is still sensitive to constant change. Li and Cox [11] applied the modified Watson's perceptual model to provide resistance to valumetric scaling for QIM watermarking. The modification to Watson's model results in the degradation in quality and performance loss with respect to constant change.

The first objective of this article is to analyze the performance of DM against composite attacks, which is lacking in the literature. Obviously, in watermarking applications, it is more often that the watermark undergoes multiple attacks. Specifically, the combination of valumetric scaling, additive noise and constant change will be considered. On the other hand, most of previous analyzes are restricted to the Gaussian noise channel, even sometime regardless of the distribution of the host signal, which we will try to overcome. The generalized Gaussian distribution (GGD) is adopted to model both the host signal and the noise signal in our analysis. Since the GGD is a parametric family of distributions, we will observe how the choice of distribution model affects the performance of DM. Next, the weakness of DM is concerned. DM itself is largely vulnerable to valumetric scaling as well as constant change. Several existing improved DM schemes achieve the robustness against valumetric scaling, but becomes more sensitive to constant change. We will present the normalized DM (NDM) considering both of them. Under the light of the performance analyzes done for DM in this article, we show that NDM approaches the performance of DM, with the great advantage of insensitivity to valumetric scaling and constant change.

The rest of this article is organized as follows. Section 2 reviews the original DM and describes the problems to be solved. Next, Section 3 accurately derives the general PDF models for several relevant signals. In Section 4, the performance of DM under the composite attacks is mathematically analyzed by the derived PDFs. The decoding error probability is given in closed form and the theoretical results are confirmed by numerical simulations. Then, in Section 5, the NDM method is presented and its performance is theoretically evaluated. Section 6 provides a useful strategy to improve the performance of NDM. In Section 7, a series of tests on real data are done to verify the validity of analytical derivations and evaluate the presented approaches. Finally, Section 8 concludes.

Notation: In the remainder of this article, we use boldface lower-case letters to denote column vectors, e.g., x, and scalar variables are denoted by italicized lower-case letters, e.g., x. The probability distribution function (PDF) of a random variable (r.v.) x is denoted by p_X(x), whereas if x is discrete its probability mass function (PMF) is designated by P_X(x). We write x ~ p_X(x) to indicate that a r.v. x is distributed as p_X(x). p_X|Y(x|y) means the conditional probability of x given y. And the subscripts of the distribution functions will be dropped wherever it is clear the random variables they refer to. Finally, the mathematical expectation and standard deviation are respectively represented by μ_x and σ_x for a r.v. x.

2 Review of DM and problem

We will concentrate our attention on DM in this study. The uncoded binary DM can be summarized as follows.

Let x∈ ℝ ^N be a host signal vector in which we wish to embed the watermark message m. First, the message m is represented by a vector b with NR_m binary antipodal components, i.e., b_j = ± 1, j = 1, ..., NR_m, where R_m denotes the bit rate. The host signal x is then decomposed into NR_m subvectors (blocks) of length L = ⌊1/R_m⌋, denoted by $x_{1}, \dots, {x_{N R}}_{_{m}}$ . In the binary DM, two L-dimensional uniform quantizers Q_-1(·) and Q₊₁(·) are constructed, whose centroids are given by the lattices and Λ_-1 = 2Δℤ^L + d and Λ₊₁ = 2Δℤ^L + d+ Δa with d∈ ℝ^L a key-dependent dithering vector and a = (1, ..., 1)^T. Each message bit b_j is hidden by using ${Q_{b}}_{j} (\cdot)$ on x_j, resulting in the watermarked signal y∈ ℝ^N as

y_{j} = Q_{b_{j}} (x_{j}), j = 1, \dots, N R_{m} .

(1)

The watermark detector receives a distorted, watermarked signal, z, and decodes a message $\hat{m}$ using the minimal distance decoder

{\hat{b}}_{j} = arg min_{- 1, 1} ∥Q_{b_{j}} (z_{j}) - z_{j}∥, j = 1, \dots, N R_{m},

(2)

where ||·|| stands for Euclidean (i.e., ℓ₂) norm.

In practical watermarking applications, the watermarked signal might undergo composite attacks. It is well known that quantization-based watermarking is vulnerable to valumetric scaling attack. While the vector at the input of the decoder is scaled by ρ_j, i.e., z_j=ρ_jy_j, the quantization bins at the decoder are not scaled accordingly, thus producing a mismatch between embedder and decoder that dramatically affects performance. Also, the original DM is not robust to constant change distortion, i.e., z_j= y_j+ c_ja with c_ja constant value. No decoding error is made for |c_j| < Δ/2, however, the bit error rate (BER) is equal to 1 for Δ/2 <|c_j| < 3Δ/2. In this study, the two attacks are considered together with additive noise v_j, yielding the attacked signal as

z_{j} = ρ_{j} y_{j} + ν_{j} + c_{j} a .

(3)

We will analyze the performance of DM in the case of (3), and present an improved DM resisting both valumetric scaling and constant change. In the subsequent analysis, x, y, z and ν are all regarded as random vectors. And we assume that both x and v have independently and identically distributed (i.i.d.) components and v is independent from y. Since the mean value of additive noise ν_jcan be counted by the third term in the right-hand side of (3), it is reasonable to assume that μ_v = 0.

3 PDF models

Define the extracted vector $r, r ≜ Q_{b} (z) - z$ . Obviously, a crucial aspect when performing a rigorous analysis lies in computing the PDF of r. Let us begin with the issue.

3.1 PDF model of the watermarked signal

We use a lower-case letter to indicate any element of the vector denoted by the boldface one. The previously used index j is dropped for no specific values (or subvectors) are concerned. Given x ~ p_X(x), from the relation (1), the PDF of the watermarked signal y conditioned on a transmitted symbol b is written as

p_{Y} (y | b) = \sum_{k = - \infty}^{\infty} δ (y - y_{k}) \int_{y_{k} - Δ}^{y_{k} + Δ} p_{X} (x) d x,

(4)

where the variable y_k is defined as y_k = 2k Δ + (b + 1)Δ/ 2 + d and δ(·) denotes the delta function.

A few observations are in order about the PDF of y. First, for different dither value d, the PDF p_Y(y|b) is different. That means each element of y obeys different distributions by randomly selecting d during embedding. However, due to the fact P_Y(y_k+2Δ|b) = P_Y(y_k₊₁|b) exists, it is sufficient for us to consider the case d ∈ [-Δ,Δ). Further, if the PDF p_X (x) is symmetric, i.e., p_X(x) = p_X(-x), from (4), the PDF p_Y(y) satisfies p_Y(y|b = - 1) = p_Y(-y|b = 1) for the case of d = - Δ/ 2 and p_Y(y|b) = p_Y(-y|b) for the case of d = 0. The former indicates that the PDFs p_Y(y|b = - 1) and p_Y(y|b = 1) are mirrors of each other and the latter indicates that the PDF p_Y(y) is even. These two properties of p_Y(y) are exhibited in Figure 1.

3.2 PDF model of the attacked signal

Taking the Equation (3) into account and using the fact that for any $ρ > 0 {p_{ρ}}_{Y} (y) = \frac{1}{ρ} p_{Y} (\frac{y}{ρ})$ holds, the conditional PDF of z given the transmitted symbol b can be obtained by convolution [12]

p_{Z} (z | b) = \sum_{k = - \infty}^{\infty} P_{Y} (y_{k} | b) p_{v} (z - ρ y_{k} - c),

(5)

where the convolution follows from the independence between y and ν. Observing (5), if the effect of different d on P_Y(y) is ignored (this generally holds when the embedding distortion is acceptable), the PDF p_Z(z|b) with d ≠ 0 can be approximately viewed as the translate of p_Z(z|b) with d = 0, that is, p_Z(z + ρd|b, d ≠ 0) ≈ p_Z (z|b, d = 0).

Moreover, when both x and ν are distributed symmetrically around the origin, we have the mirror property p_Z(z + 2c|b = - 1) = p_Z(-z|b = 1) for the case d = - Δ/ 2, and the symmetric property p_Z(z + 2c|b) = p_Z(-z|b) for the case d = 0.

Figure 2a depicts qualitatively the PDFs of z for zero-mean Gaussian host data with variance 255² and zero-mean Gaussian noise. It can be seen that there is a bell curve present around each discrete value of y due to the existence of Gaussian noise, and the two adjacent ones even overlap for a large noise strength. Meanwhile, the distance between two discrete points of y is scaled by the scaling factor ρ and p_Z(z) is translated by constant value c. The corresponding empirical density curves of z are plotted in Figure 2b. We see that the analytical PDF of z fits well with empirical observations.

3.3 PDF model of the extracted signal

Recalling the definition of r given previously, it is immediate to write

p_{R} (r | b) = \{\begin{matrix} \sum_{j = - \infty}^{\infty} p_{Z} (z_{j} - r | b, d), & r \in [- Δ, Δ) \\ 0, & else, \end{matrix}

where p_R(r|b) is the PDF of r conditioned on the transmitted symbol b, and z_j has the similar definition with y_k. Due to (5), the above equation becomes

\begin{gathered} p_{R} (r | b) = \{\begin{matrix} \sum_{j} \sum_{k} P (y_{k} | b) p_{v} (μ_{j k} - r), & r \in [- Δ, Δ) \\ 0, & else \end{matrix} \end{gathered}

(6)

with μ_jk = z_j- ρy_k- c.

Now, let us analyze the properties of p_R(r). If ignoring the effect of d on P_Y(y), in view of (6), we derive p_R(r - ϵd|b, d ≠ 0) ≈ p_R(r|b, d = 0) with ϵ = ρ - 1. This shows that for the case d ≠ 0 the PDF p_R(r|b) can be approximately obtained by translating p_R(r|b, d = 0). Further, while |ϵ| is small enough for neglecting the term ϵd, we have the property p_R(r|b, d ≠ 0) ≈ p_R(r|b, d = 0). That is, despite the choice of d, p_R(r) approximately remains unchanged for small |ϵ|. Similarly to p_Z(z), by assuming the PDFs p_X(x) and p_ν(ν) are even, we obtain the mirror property p_R(r - 2c|b = 0) = p_R(-r|b = 1) for $d = - \frac{Δ}{2}$ and the symmetric property p_R(r - 2c|b) = p_R(-r|b) for d = 0. At the same time, for any ϵ, we derive p_R(r|b, ρ = 1 + ϵ) = p_R(r|b, ρ = 1-ε) for d = 0 and p_R(r|b = 0, ρ = 1 + ε) = p_R(r|b = 1, ρ = 1-ε) for $d = - \frac{Δ}{2}$ , where p_R(r|b, ρ) denotes the conditional PDF of r given the transmitted symbol b and the scaling factor ρ. These properties of p_R(r) are helpful for us to analyze the performance of DM.

Figure 3 plots the probability density curves of r and the corresponding empirical ones for zero-mean Gaussian host data with variance 255² and zero-mean Gaussian noise. As can be seen, the distribution curve of r is either dilated or compressed by the scale factor ρ, and the PDF p_R(r) with c ≠ 0 corresponds to p_R(r) with c = 0 translated by the constant value c. The probability that the values of r are distributed around zero decreases as attacks become strong, which results in the increase of BER. Comparison of Figure 3a, b reveals the analytical PDF of r fits perfectly with its empirical distribution.

4 Performance analysis of DM against the composite attacks

As the previous literatures, the decoding bit error probability P_e is used as the final performance measurement. Assuming that the symbol b is sent, the bit error probability will be

P_{e} = P (∥ \vec{r} ∥ > ∥ Δ \vec{a} - ∣ \vec{r} ∥ ∥ b)

(7)

where | r | denotes the vector of absolute values of components of r. Defining $s ≜ | r |^{T} a$ , the above expression is equivalent to

P_{e} = \int_{L Δ / 2}^{L Δ} p_{S} (s | b) d s .

(8)

To compute P_e, we need know the PDF p_S(s) of s. The exact solution for p_S(s) may be achieved by several means. One of the standard procedures is by performing multifold integral operation as

\begin{align} p_{S} (s | b) = & \int_{0}^{(L - 1) Δ} \dots \int_{0}^{2 Δ} \int_{0}^{Δ} p_{| R_{1} |} (u_{1} | b) p_{| R_{2} |} (u_{2} - u_{1} | b) \\ . . . p_{| R_{L} |} (s - \sum_{i = 1}^{L - 1} u_{i} ∣ b) d u_{1} d u_{2} \dots d u_{L - 1}, \end{align}

(9)

where p_|Rj|(r_j|b) = p_Rj(r_j|b)+p_Rj(-r_j|b) and p_Rj(r_j) is the PDF of the j th element of r. The computation of p_S(s) is feasible for a small L by (9). However, it becomes impractical as L increases. To solve the problem, it is nature to use mathematically tractable approximations. Let us assume that all components of d are equal, so that the vector r has i.i.d components. At this point, by the well known central limit theorem (CLT), s thus can be approximated by a Gaussian random variable, whose mean and variance are Lμ_|r|and $L σ_{| r |}^{2} .$ Using the derived PDF in (6), μ_|r|and $σ_{| r |}^{2}$ are represented as

μ_{| r |} = \sum_{j} \sum_{k} P (y_{k} | b) \int_{- Δ}^{Δ} ∣ u ∣ p_{ν} (μ_{j k} - u) d u

(10)

σ_{| r |}^{2} = \sum_{j} \sum_{k} P (y_{k} | b) \int_{- Δ}^{Δ} u^{2} p_{ν} (μ_{j k} - u) d u - μ_{| r |}^{2} .

(11)

Then, the probability P_e is computed as

P_{e} \approx Φ (\frac{\sqrt{L} (Δ - μ_{| r |})}{σ_{| r |}}) - Φ (\frac{\sqrt{L} (Δ / 2 - μ_{| r |})}{σ_{| r |}}),

(12)

where Φ(·) stands for the cumulative distribution function (CDF) of the standard Gaussian distribution. It should be pointed out the CLT approximation to P_e is only valid for very large L. In reality, the condition is generally met in order to improve the watermarking robustness.

Now, we can observe several useful properties of P_e from the previous analysis. When |ϵ| is small enough, by the property p_R(r|b, d ≠ 0) ≈ p_R(r|b, d = 0), it is easily understood that P_e approximately remains unchanged for different dither value. Therefore, without loss of generality, d is set to 0. Furthermore, for d = 0, if both p_X (x) and p_ν (ν) are even, using the property p_R(r|b, ρ = 1 + ϵ) = p_R(r|b, ρ = 1 - ϵ), the same value of P_e is obtained for the cases of ρ = 1- ϵ and ρ = 1 + ϵ. As a result, the property of P_e also holds for d ≠ 0 approximately.

4.1 Generalized Gaussian host and noise

Theoretically, P_e can be estimated only if the PDFs p_X(x) and p_ν(ν) are given. For the following analysis we consider a specific case where the host signal and attacking noise are statistically modeled by the GGD. The GGD model is used because it includes a family of distributions and suitable for many practical applications. The PDF of the GGD is defined as

p (t) = \frac{κ β}{2 Γ (β^{- 1})} e^{- ∣ κ (t - μ) ∣^{β}},

(13)

where $κ = \frac{1}{σ} \sqrt{Γ (3 β^{- 1}) / Γ (β^{- 1})}$ , and $Γ (u) = \int_{0}^{\infty} t^{u - 1} e^{- t} d t$ is the Gamma function. Thus, the distribution is completely specified by the mean μ, the standard deviation σ and the shape parameter β, and is denoted as GGD(β; μ, σ). The CDF of the GGD has the form [13]

Ψ (t) = \frac{1}{2} + s g n (t - μ) \frac{γ (β^{- 1}, | κ (t - μ) |^{β})}{2 Γ (β^{- 1})}

where $γ (u_{1}, u_{2}) = \int_{0}^{u_{2}} t^{u_{1} - 1} e^{- t} d t$ is the lower incomplete gamma function, and sgn(·) denotes the sign function, i.e.,

s g n (t) = \{\begin{matrix} 1, & t \geq 0 \\ - 1, & else . \end{matrix}

Note that Gaussian and Laplacian distributions are just two special cases of the GGD with β = 2 and β = 1, respectively.

First, the PMF P_Y(y) is calculated according to the distribution model of x. Given p_X(x) ~ GGD(β_x; μ_x, σ_x) and the corresponding CDF Ψ_x(x), in view of (4), we immediately write

P_{Y} (y_{k} | b) = Ψ_{x} (y_{k} + Δ) - Ψ_{x} (y_{k} - Δ) .

(14)

Then, the integration terms in (10) and (11) are derived for the case p_ν (ν) ~ GGD(β_ν; 0, σ_ν). In appendix, we obtain

\begin{align} \int_{- Δ}^{Δ} | u | p_{ν} (t - u) d u = & \frac{γ (2 β_{ν}^{- 1}, | κ (t - Δ) |^{β_{ν}}) - γ (2 β_{ν}^{- 1}, | κ t |^{β_{ν}})}{2 σ_{ν}^{- 1} \sqrt{Γ (β_{ν}^{- 1}) Γ (3 β_{ν}^{- 1})}} \\ + \frac{γ (2 β_{ν}^{- 1}, | κ (t + Δ) |^{β_{ν}}) - γ (2 β_{ν}^{- 1}, | κ t |^{β_{ν}})}{2 σ_{ν}^{- 1} \sqrt{Γ (β_{ν}^{- 1}) Γ (3 β_{ν}^{- 1})}} \\ + (2 Ψ_{ν} (t) - Ψ_{v} (t - Δ) - Ψ_{ν} (t + Δ)) t \end{align}

(15)

and

\begin{align} \int_{- Δ}^{Δ} u^{2} p_{ν} (t - u) d u = & \frac{γ (3 β_{ν}^{- 1}, 0) - γ (3 β_{ν}^{- 1}, | κ_{ν} (t - Δ) |^{β_{ν}})}{2 s g n (t - Δ) σ_{ν}^{- 2} Γ (3 β_{ν}^{- 1})} \\ + \frac{γ (3 β_{ν}^{- 1}, | κ_{v} (t + Δ) |^{β_{ν}}) - γ (3 β_{ν}^{- 1}, 0)}{2 s g n (t + Δ) σ_{ν}^{- 2} Γ (3 β_{ν}^{- 1})} \\ - \frac{(γ (2 β_{ν}^{- 1}, | κ (t + Δ) |^{β_{ν}}) - γ (2 β_{ν}^{- 1}, | κ (t - Δ) |^{β_{ν}})) t}{σ_{ν}^{- 1} \sqrt{Γ (β_{ν}^{- 1}) Γ (3 β_{ν}^{- 1})}} \\ + (Ψ_{ν} (t + Δ) - Ψ_{ν} (t - Δ)) t^{2} \\ (5) \end{align}

(16)

Now, the decoding bit-error probability can be estimated by computing (10), (11), and (12) with the use of (14), (15), and (16). Since the calculation of p_Y(y) is relatively simple in (4), the above analysis can be easily extended for other host distributions. However, the derivation of the integration terms in (10) and (11) might become very complex for the noise ν with other PDFs. Thus, they are computed numerically when needed.

4.2 Simulations on artificial signals

In order to verify the obtained theoretical results, we conduct a set of experiments on artificial signals. A set of 64000 random data, independently drawn from the GGD with zero mean and variance 255², are used as the host signal. A random message with equiprobable information bit is embedded using DM with L = 64, Δ = 7.79, and d = 0. The noise signal is also generated according to the zero-mean GGD. We calculated the empirical BER under the composite attacks, and compared them to the theoretical values. The obtained results are summarized in Figure 4 for the case of Gaussian host and noise.

Figure 4a shows the BER of DM as a function of the scaling factor ρ while fixing the constant value c and the noise standard deviation σ_ν. As can be seen, DM is definitely very sensitive to the scaling attack. The probability of error is unacceptably high when ρ movies beyond the range [0.98, 1.02]. The existence of noise and constant change causes the increase of BER further. And the effect of constant change becomes relatively distinct for strong noise. The theoretical approximation of BER agrees almost perfectly with the empirical results, particularly in the case of weak attacks. Figure 4a also demonstrates that the BER versus ρ curve is symmetric with respect to the point ρ = 1. Figure 4b depicts the plots of BER versus the constant value c while fixing the scaling factor ρ and the noise standard deviation σ_ν. For small ρ and σ_ν, the BER of DM starts to grow rapidly as long as the absolute value of c approaches to Δ/2. The effect of c on performance decreases as ρ and σ_ν increase. The estimated BERs are approximately equal to the empirical ones, but the estimation accuracy gets worse for a large c. At the same time, Figure 4b shows that the BER versus c curve is symmetrical around c = 0. Figure 4c plots the BER of DM as a function of the noise standard deviation σ_ν while fixing the scaling factor ρ and the constant value c. Obviously, the BER increases as σ_ν becomes large. The curve of BER versus σ_ν seems to be translated due to the effect of valumetric scaling and constant change distortions. Similar to the previous tests, the theoretical BERs fit the empirical ones very well and the maximal difference between them is lower than 0.02.

In the sequel, the sensitivity of DM to statistical properties of the host and noise is investigated. We tested the performance of DM against valumetric scaling attacks for different host PDF shapes controlled by β_x. The results are displayed in Figure 5a. It is remarkable that the performance of DM increases significantly as β_x goes down. For a small β_x, the BER plot becomes relatively flat and the BER grows slowly. This behavior can be explained as follows. For the GGD, the smaller β_x is, the more impulsive the shape, and the heavier the tails. As a result, the probability that a big value of x presents over the range of interests decreases. Thus, the introduced distortion (ρ - 1)y by the scaling attack degrades for the same value of ρ, resulting in the decrease of BER. We also observe that the theoretical approximation agrees almost perfectly with the empirical results for the cases β_x = 2, 8, but does worse for β_x = 1. This is because the CLT approximation to BER may underestimate the importance of the tails of p_X (x) with β_x = 1 and gives the smaller results than the true BERs [5]. However, in terms of constant change and additive noise, the performance of DM is insensitive to the shape parameter β_x, due to the fact that the two operations are independent from the watermarked signal. Hence, we just provide the results for the scaling attack herein. Then, we tested the performance of DM against additive noise with different PDF shapes controlled by β_ν. The results are exihibited in Figure 5b. We observe that the BER of DM goes down as β_ν increases for the same noise variance. Applying the same reasoning above here, we may understand that relatively serious distortions are introduced by the noise attack with a large β_ν, and thus, the performance of DM worsens.

5 Normalized DM and its performance

The robustness improvement for DM is taken into account in this section. A novel normalized DM (NDM), is presented, which is theoretically invariant to valumetric scaling and constant change. On the other hand, the performance of NDM is theoretically evaluated in terms of null distortions and noise addition.

5.1 Normalized DM

The main idea of NDM is to construct a gain-invariant vector with zero mean for quantization. There are many strategies for the construction of such a vector. In the study, the vector is achieved in the way that the host vector subtracts its nonzero sample mean and then is divided by its sample standard deviation. The method is described in details as follows.

Let $ū ≜ u^{T} a / L$ and $S_{u}^{2} ≜ | | u - ū | |^{2} / L$ denote the sample mean and variance of a L-dimensional vector u, respectively. Watermark embedding is performed by

y_{j} = λ_{j} S_{x_{j}} Q_{b_{j}} (\frac{x_{j} - {\bar{x}}_{j} a}{S_{x_{j}}}) + η_{j} a

(17)

for j = 1, ..., NR_m, where the factors λ_j and η_j are determined by two specific distortion situations. For convenience, we define the normalized host vector as $x_{j}^{'} = (x_{j} - {\bar{x}}_{j} a) / S_{x_{j}}$ and the error vector as $q_{e_{j}} = Q_{b_{j}} (x_{j}^{'}) - x_{j}^{'}$ . By (17), the sample variance of y_jsatisfies $S_{y_{j}}^{2} = λ_{j}^{2} S_{x_{j}}^{2} (1 + S_{q_{e_{j}}}^{2} + 2 q_{e_{j}}^{T} x_{j}^{'} / L)$ . An appropriate strategy to choose λ_j is to let $S_{y_{j}}^{2} = S_{x_{j}}^{2} .$ Therefore, we have

λ_{j} = {(1 + S_{q_{e_{j}}}^{2} + 2 q_{e_{j}}^{T} x_{j}^{'} / L)}^{- \frac{1}{2}} .

(18)

Then, η_j is obtained through minimizing the distance ||y_j- x_j||. This leads to

η_{j} = {\bar{x}}_{j} - λ_{j} S_{x_{j}} {\bar{q}}_{e_{j}} .

(19)

At detection time, the received signal z is first normalized as done at the embedder's side and then the minimal distance decoder is applied. The modified detector is represented as

{\hat{b}}_{j} = arg min_{- 1, 1} ∥\frac{z_{j} - {\bar{z}}_{j} a}{S_{z_{j}}} - Q_{b_{j}} (\frac{z_{j} - {\bar{z}}_{j} a}{S_{z_{j}}})∥ .

(20)

Now, it is possible to simultaneously see why NDM is insensitive to valumetric scaling and constant change attacks: Substituting z_j = p_jy_j+c_ja into (20), it can be readily verified that ρ_j and c_j cancel out in the expression, and consequently, the decision ${\hat{b}}_{j}$ does not depend on ρ_j and c_j.

5.2 Performance analysis

Having known that NDM overcomes the main weakness of the conventional DM, we will evaluate the performance of NDM in terms of null distortions and noise addition. Performing the normalization operation on both sides of (17) and applying (18) and (19), we get

\frac{y_{j} - ȳ_{j} a}{S_{y_{j}}} = λ_{j} (Q_{b_{j}} (\frac{x_{j} - {\bar{x}}_{j} a}{S_{x_{j}}}) - {\bar{q}}_{e_{j}} a) .

(21)

The above equation indicates that NDM introduces two extra operations in the absence of channel noise: valumetric scaling with λ_j and constant change with ${\bar{q}}_{e_{j}}$ . In other words, NDM can be regarded as DM undergoing valumetric scaling and constant change distortions. Thus, the theoretical performance of NDM for null distortions is approximately determined by (10), (11), and (12) as the noise standard deviation σ_ν approaches zeros.

To evaluate the effect of λ_j and ${\bar{q}}_{e_{j}}$ in (21) on the performance of NDM, we introduce the document-to-watermark ratio (DWR), defined as $ζ_{j} ≜ L S_{x_{j}}^{2} / ∣ ∣ y_{j} - x_{j} ∣ ∣^{2}$ for the j th subvector. Combining (17), (18) and (19), λ_j can be rewritten as

λ_{j} = (1 - \frac{1}{2 ζ_{j}}) / (1 + \frac{q_{e_{j}}^{_{T}} {x^{'}}_{j}}{L}) .

(22)

For small Δ, it has been shown [14] that each element of the error vector q_eobeys independently a uniform distribution over the interval [-Δ, Δ) and q_eis statistically independent from $x_{j}^{'}$ . Applying the properties, it is easy to derive that $q_{e_{j}}^{T} x_{j}^{'} / L$ in (22) has zero mean and variance Δ²/(3L). Thus, λ_j tends to 1 - 0.5/ζ_j as L → ∞ or Δ → 0 (i.e., ζ_j → ∞). Figure 6 plots the curves of the true average error |λ_j - 1| versus ζ_j for different L, as well as the limit 0.5/ζ_j. Notably, the gap between the factor λ_j and its limit becomes smaller and smaller as L and DWR increase. Over the most interesting range of ζ_j from 25 dB to 50 dB, the value of |λ_j - 1| is less than 0.01 for all the values of L tested. From Figure 4a, it is observed that the valumetric scaling with |λ_j - 1| < 0.01 affects the performance of NDM so less that no decoding error is made.

As to the constant change with $E{q_{e_{j}}}$ , we have a sufficient condition that $|E {q_{e_{j}}}| < Δ / 2$ for making no error. Considering the statistical properties of $q_{e_{j}}$ , it is possible to resort to the CLT to show that for large L, the sample mean ${\bar{q}}_{e_{j}}$ can be accurately modeled by a Gaussian PDF with zero mean and variance Δ²/(3L). Thus, the probability that $| {\bar{q}}_{e_{j}} | < Δ / 2$ holds can be computed as

P (| {\bar{q}}_{e_{j}} | < Δ / 2) \approx 1 - 2 Φ (- \sqrt{3 L / 4}) .

(23)

Since the probability in (23) approaches the value of 1 as L increases, NDM can present a zero probability of error as the original DM for large L in the absence of channel noise. Figure 7 shows the plots of the BER as a function of L. As shown in Figure 7, the probability of error sharply decreases to 0 as L increases. And the agreement of theoretical results with simulations is excellent.

Next, we will analyze the performance of NDM in noise channel. The received signal z_jhas the form z_j= y_j+ ν_j, where ν_jis an unknown noise source with zero sample mean and ν_jand y_jare orthonormal. Since NDM is invariant to valumetric scaling and constant change attacks, it's sufficient to consider the case. To measure the impact of the noise, we will follow the popular watermark-to-noise ratio (WNR), defined as $ξ_{j} ≜ | | y_{j} - x_{j} | |^{2} / | | ν j | |^{2}$ for the j th subvector. Applying the normalization operation to z_jyields

\frac{z_{j} - {\bar{z}}_{j} a}{S_{z_{j}}} = λ_{j}^{'} Q_{b_{j}} (\frac{x_{j} - {\bar{x}}_{j} a}{S_{x_{j}}}) + ν_{j}^{'} + {\bar{q}}_{e_{j}}^{'} a

(24)

with $λ_{j}^{'} = λ_{j} \sqrt{\frac{ζ_{j} ξ_{j}}{ζ_{j} ξ_{j} + 1}}$ , $ν_{j}^{'} = \sqrt{\frac{ζ_{j} ξ_{j}}{ζ_{j} ξ_{j} + 1}} \frac{ν_{j}}{s_{x_{j}}}$ , and ${\bar{q}}_{e_{j}}^{'} = - λ_{j} {\bar{q}}_{e_{j}}$ . Expression (24) illustrates that NDM undergoes the composite attacks as considered in (3). Therefore, the previously obtained theoretical results can be used to predict the performance of NDM.

Generally speaking, the factor $λ_{j}^{'}$ in (24) is approximately equal to the value of 1 by the fact that ζ_jξ_j ≫ 1 holds in practical applications. Figure 8 shows that the value of $| λ_{j}^{'} - 1 |$ is rather less even for serious distortions (e.g., WNR = -10 dB). On the other hand, for large L, the effect of ${\bar{q}}_{e_{j}}^{'}$ in (24) can be neglected. Based on these two considerations, the increase of BER is mainly derived from the term $ν_{j}^{'}$ in (24). As a result, we can draw the conclusion that NDM almost resists the same amount of noise as the original DM. Figure 9 illustrates the performance difference between NDM and DM under the additive noise attacks. As can be seen, NDM performs slightly worst than DM when the WNR is within the range [-1 dB, 3 dB], but outperforms it once WNR is lower than -1 dB. In principal, their performance is very close in this regard. Under the light of the above analysis we conclude that NDM achieves the performance approximately equal to DM, still keeping invariance against valumetric scaling and constant change attacks.

6 The improvement of NDM

The previous analysis shows that when λ_j ≠ 1 and ${\bar{q}}_{e_{j}} \neq 0$ the two factors have the negative impact on the performance of NDM. Thus, the influence of them should be decreased or eliminated so as to obtain the improved performance. Based on this idea, we present the improved NDM (IM-NDM) in the sequel.

In IM-NDM, the watermarked vector is generated by weighting the quantization error signal and adding it back to the host signal. The modified embedder is expressed as

y_{j} = λ_{j} S_{x_{j}} (x_{j}^{'} + α_{j} \cdot q_{e_{j}}) + η_{j} a,

(25)

where α_jdenotes the weight vector whose element is between 0 and 1, and $α_{j} \cdot q_{e_{j}}$ indicates that each dimension of α_jis multiplied by the corresponding dimension of $q_{e_{j}}$ · Similarly to (18) and (19), it is derived that

λ_{j} = {(1 + S_{q_{e_{j}}}^{2} + 2 q_{e_{j}}^{T} x_{j}^{'} / L)}^{- \frac{1}{2}}

(26)

and

η_{j} = {\bar{x}}_{j} - λ_{j} S_{x_{j}} {\bar{q}}_{e_{j}} .

(27)

Note that NDM is a special case of IM-NDM with α_j= a. The weight vector α_jplays an important role in the performance of IM-NDM. Through a careful choice of α_j, the influence of both λ_j and ${\bar{q}}_{e_{j}}$ in (25) can be decreased (or even eliminated), and at the same time the distortion-compensation (DC) mechanism is introduced. The latter is proved to be an effective way to improve the performance of quantization-based watermarking [1].

By letting λ_j = 1 and $η_{j} = {\bar{x}}_{j}$ , we have

\begin{align} {(α_{j} \cdot q_{e_{j}})}^{T} (α_{j} \cdot q_{e_{j}} + 2 {x^{'}}_{j}) & = 0 \\ {(α_{j} \cdot q_{e_{j}})}^{T} a & = 0 . \end{align}

(28)

Taking use of one solution of (28) in (25) allows us to eliminate the negative impact of λ_j and η_j. Obviously, it is easy to obtain one solution of (28) for one of the two equations in (28) is linear. If (28) has multiple solutions, an appreciate one should be chosen by the performance of IM-NDM. Obtaining the appropriate solution for α_jand investigation of its effect on watermarking performance is beyond the scope of this article and is a good direction for future research. If (28) has none solution, α_jshould be chosen to minimize |λ_j - 1| under the situation $η_{j} = {\bar{x}}_{j}$ . This is a constraint optimization problem and can be solved using the Lagrangian multiplier method.

Figure 9 illustrates the performance of IM-NDM described above under the additive noise attack, together with DC-DM, and the distortion compensated NDM (DC-NDM), namely IM-NDM taking the same weight for each element of $q_{e_{j}}$ , where the DC value is set to 0.66 for the latter two schemes. Obviously, DC-NDM almost presents the same robustness as DC-DM against weak attacks, and performs a little better facing very serious distortions (WNR <-4 dB). And they are noticeably outperformed by IM-NDM.

7 Experimental results

In this section, a series of experiments are conducted on real images to evaluate the validity of analytical derivations and performance of the proposed method.

7.1 Theoretical verification

In the experiments, we use three standard images, shown in Figure 10. The DM method is implemented in the spatial domain so as to observe its performance without the impact of transform operations. Specifically, all pixels of one image are rearranged in a vector as the host signal. A random binary message is embedded into the host vector by DM when given the quantization step Δ, the dither value d and the number of dimensions L. The watermarking algorithm is tested under the composite attacks of valumetric scaling with the factor ρ, constant change with the value c and additive noise ν following the distribution GGD(β_ν; 0, σ_ν). The distribution parameters of image pixels used for the computation of theoretical BERs are displayed in Table 1, which are obtained by the maximum likelihood estimator [15]. The experimental results are summarized in Figure 11 for L = 32, Δ = 8, and d = 0.

Table 1 Mean, variance, and shape parameter of image pixels

Full size table

Figure 11a depicts the plots of BER as a function of the scaling factor ρ for each image. On the Crowd image, which has the smallest shape parameter among the tested images, DM achieves the best performance. This behavior is consistent with the results in Figure 5a. The shape parameter of the Lena image is larger than one of Mandrill, but better performance is achieved on the Lena image. This can be explained as follows. The valumetric scaling operation introduces the serious distortions on the Mandrill image with a distinctively large mean luminance. As a result, not only the performance gain caused by the host PDF shape cancels out but also the BER grows up. The analytical curves closely fit the empirical data for the Lena and Mandrill images. By contrast, the prediction accuracy becomes slightly worse for the Crowd image. That is mainly due to the fact that the GGD is a poor model for this image. Figure 11b illustrates the sensitivity of DM to the addition/subtraction of a constant luminance value while fixing ρ and σ_ν. In the test, DM performs closely for all the test images. That is, the performance of DM with respect to constant change attack is insensitive to the statistical properties of host signal. It is remarkable that the empirical performance of DM is predicted by the theoretical results with a high degree of accuracy. The plots of BER versus the standard deviation σ_ν are shown in Figure 11c for β_ν = 1 and Figure 11d for β_ν = 8 while fixing ρ and c. As to the attack, the obvious performance difference is observed between different images. The effect is actually caused by the valumetric scaling operation, and thus can be removed by setting ρ = 1. Comparing Figure 11c with Figure 11d, it becomes clear that the additive noise with a flat PDF is a worst-case attack for DM. This agrees with the observation in Figure 5b. In the two cases, the predictions are desirable, but there are small discrepancies at some points.

7.2 Performance evaluation

We tested the performance of the proposed NDM in terms of imperceptibility and robustness and compared it with DM, DC-DM, Oostveen's method [10] and RDM [9]. Experiments were carried out on a database of 4000 images from the Corel database, each of dimension 256 × 384. The watermark embedding was performed in the spatial domain in order to see the sensitivity of the tested schemes to constant intensity change. Specifically, we divided the target image into nonoverlapping blocks of size 8× 8 and extracted a total of 225 blocks with the highest local variance. Each of the extracted blocks was modulated with two random message bits, so a total of 450 bits can be embedded into one image. The DC value was set to 0.66 for DC-DM. The L₂ vector norm of 50th-order was used as the division function in RDM.

In the experiment on watermarking imperceptibility, the watermark energy induced by all the tested schemes is kept the same and in this case the watermarked images' quality is assessed with several objective image quality metrics. The weighted peak signal-to-noise ratio (wPSNR) and the total perceptual error (TPE) are used to measure the global image quality, as well as the number of blocks greater than the first local perceptual error threshold (NLPE1) and the second local perceptual error threshold (NLPE2) to measure the local image quality. The parameters for them take the default values as suggested in Checkmark [16]. Table 2 reports the experimental results averaged over all the test images when the DWR is fixed at 21 dB.

Table 2 Watermark imperceptibility assessment in use of several image quality metrics

Full size table

As shown in Table 2, among all the tested watermarking schemes, NDM and its improved version offer the highest wPSNR values (in dB), the smallest TPE, NLPE1, and NLPE2 values for the same watermark energy. They all indicate that the performance of NDM, in terms of imperceptibility, is better than that of other ones. This is because the adaptive quantization step size is chosen to be proportional to the local variance of the host image in NDM (see (17)). The image quality produced by IM-NDM degrades when compared with NDM. The situation also presents between DM and DC-DM. This is attributed to the fact that a large quantization step is used for watermark embedding with distortion compensation. Surprisingly, RDM manifests the worst performance in this regard.

In what follows, the watermark robustness will be evaluated with respect to some typical image processing operations. The watermarked images were produced by the tested schemes when fixing DWR at 21 dB. All the given BERs are averaged over the test set of images, except otherwise indicated.

Figure 12 shows the robustness to amplitude scaling for all schemes. Clearly, except the conventional DM and DC-DM, the others manifest strong robustness against this attack. Particularly, the lowest values of BER are achieved by IM-NDM over the whole range of scaling factor ρ tested. However, when ρ exceeds 1.2, the robustness of IM-NDM goes down slightly. That can be attributed to the increasing rounding and clipping distortions.

Figure 13 illustrates the sensitivity of all schemes to the addition/subtraction of a constant luminance value c. It can be seen that both DM and DC-DM are very fragile to constant change attack. The BER of them sharply increases to 1 when c gets close to 10 or -10. Although Oostveen's method and RDM perform better than the original QIM schemes, they are still sensitive to this kind of attack. Our methods are evidently more robust in this regard than other ones. They are almost invariant to constant change and approximately keep the BER of 0 over the range of c tested.

The robustness to AWGN is shown in Figure 14 for each watermarking scheme. In this regard, NDM clearly outperforms Oosteen' method and RDM. Comparing with DM, NDM achieves higher BER for weak noise. This can be explained by the fact that the introduced noise causes the errors in the estimation of the quantization step size for NDM. However, as the noise becomes strong, the BER of DM grows rapidly and is finally lower than one of NDM. The situation is in accordance with the analytical results in Section 5.2. Note that IM-NDM behaves like NDM but presents the improved performance.

The robustness of NDM against AWGN was also tested on the Lena image to verify the analytical derivations for NDM. Since the performance of NDM depends on the local variance of the host image, the empirical BER can not be accurately predicted by exploiting the information from a certain image block. Thus, for the computation of the theoretical BER, we chose three image blocks with different variance: the middle one is around the average variance over those image blocks for watermark embedding and other two ones are respectively a little larger and smaller than it. The theoretical and empirical results are depicted in Figure 15. As can be seen, the upper analytical curve relatively fits well to empirical observations in the weak noise case, and other two curves respectively do well for the moderate noise and strong noise cases respectively. In principle, the theoretical results are effective for real image.

The sensitivity to JPEG compression is investigated in Figure 16. In this test, NDM performs a little worse than DM. IM-NDM improves the robustness of NDM, but still falls behind DC-DM. It is worth seeing that RDM has superior performance with respect to JPEG compression. That can be explained by the nature of JPEG compression. Unlike the AWGN, JPEG compression is an image-dependent processing operation. The goal of it is to reduce an image file size without noticeable image quality degradation. Thus, the perceptually unrelevant data are removed from an image after compression. The test results of image quality reveals that RDM modifies the image data to be easily noticed more largely than other ones, so that it is impaired less by compression. The situation is opposite for NDM. If the perceptual quality is set to be same for all the tested schemes, it is reasonable to believe that NDM will manifest better performance.

NDM is just a basic watermarking algorithm like DM. The above tests allow us to evaluate its performance baseline and the implementation is coarse. If one wants to design a NDM based watermarking scheme for practical applications, some effective technologies on performance improvement should be carried out, such as the choice of transform domain, the use of error-correction coding, etc. Several image-adaptive DM algorithms are presented by exploiting the characteristics of the human visual system in [11]. The same ideas can be straightforwardly applied to improve the performance of NDM. Recently, a new Logarithmic QIM is developed by introducing the μ-Law concept in [4]. NDM can also attempt to use the concept for the improvement of performance.

8 Conclusion

The contribution of this article is twofold. First, we have been theoretically evaluated the performance of DM facing the combination of valumetric scaling, additive noise and constant change. The analyzes were developed under the assumptions that both the host vector and the noise vector have i.i.d components and the two vectors are independent. We accurately derived the general expressions of the PDFs of the watermarked signal, the attacked signal and the extracted signal. By these derived PDFs, the decoding error probability was generally expressed in closed form. The specific analytical results were presented for the case of generalized Gaussian host and noise. Moreover, the theoretical results can be easily extended by modeling the host and noise signals with other distributions.

According to our analyzes, DM is largely vulnerable to valumetric scaling. And constant change and additive noise give rise to the relatively large performance loss of DM by combining them with valumetric scaling. Particularly, we have seen the effect of statistical properties of the host and noise signals on the performance of DM. The more impulsive the PDF shape of the host signal, the more robust DM is to valumetric scaling. The more flat the PDF shape of the noise source, the more sensitive DM is to additive noise. Simulations on artificial signals and real images show us that the bit-error probability is accurately predicted by the given theories for a wide range of host and noise PDF shapes. These can ultimately guide the design of efficient watermarking algorithms based on DM.

Second, a novel watermarking method, called NDM, has been developed. In the method, the normalized host signal vector is constructed for quantization. The NDM achieves its theoretical invariance to both valumetric scaling and constant change, but leads to small performance loss in the absence of channel noise. The BER of NDM against additive noise can be predicted by applying the presented theoretical results of DM. Further, the NDM is improved by weighting the quantization errors. Experiments on images demonstrate that the proposed method achieves better watermark imperceptibility and extremely strong robustness against valumetric scaling and constant change attacks comparing with the original QIM schemes and other improved versions.

Appendix

Here, we will derive the integration terms in (10) and (11) when the attacking noise obeys the distribution GGD(β_ν; 0, σ_ν). For this purpose, using a variable t instead of μ_jk, they are, respectively, rewritten as

\begin{align} \int_{- Δ}^{Δ} | u | p_{ν} (t - u) d u & = \int_{0}^{Δ} u (p_{ν} (t + u) + p_{ν} (u - t)) d u \\ = \int_{- t}^{Δ - t} u p_{ν} (u) d u + \int_{t}^{t + Δ} u p_{ν} (u) d u \\ + t (\int_{- t}^{Δ - t} p_{ν} (u) d u - \int_{t}^{t + Δ} p_{ν} (u) d u)) \end{align}

(29)

and

\begin{align} \int_{- Δ}^{Δ} u^{2} p_{ν} (t - u) d u & = \int_{t - Δ}^{t + Δ} (u^{2} - 2 t u + t^{2}) p_{ν} (u) d u \\ = \int_{t - Δ}^{t + Δ} u^{2} p_{ν} (u) d u - 2 t \int_{t - Δ}^{t + Δ} u p_{ν} (u) d u + t^{2} \int_{t - Δ}^{t + Δ} p_{ν} (u) d u \end{align}

(30)

Thus, achieving the two integrations can be attributed to the computation of I(t₁, t₂), defined as $I (t_{1}, t_{2}) ≜ \int_{t_{1}}^{t_{2}} u^{l} p_{ν} (u) d u$ with l being an integer.

Considering the case of t₁ ≥ 0 and t₂ ≥ 0, we have

\begin{align} I (t_{1}, t_{2}) & = \frac{κ_{ν} β_{ν}}{2 Γ (β_{ν}^{- 1})} \int_{t_{1}}^{t_{2}} u^{l} e^{- | κ_{ν} u |^{β_{ν}}} d u \\ = \frac{1}{2 Γ (β_{ν}^{- 1}) κ_{ν}^{l}} \int_{{(κ_{ν} t_{1})}^{β_{ν}}}^{{(κ_{ν} t_{2})}^{β_{ν}}} u \frac{l + 1}{β_{ν}} - 1 e^{- u} d u \\ = \frac{γ ((l + 1) β_{ν}^{- 1}, {(κ_{ν} t_{2})}^{β_{ν}}) - γ ((l + 1) β_{ν}^{- 1}, {(κ_{ν} t_{1})}^{β_{ν}})}{2 σ_{ν}^{- l} \sqrt{{(Γ (β_{ν}^{- 1}))}^{2 - l} {(Γ (3 β_{ν}^{- 1}))}^{l}}} \end{align}

(31)

where the first equality follows from (13) and the final equality follows from the definition of the lower incomplete gamma function.

In the case of t₁ ≤ 0 and t₂ ≤ 0, I(t₁, t₂) has the form

\begin{align} I (t_{1}, t_{2}) & = \frac{κ_{ν} β_{ν}}{2 Γ (β_{ν}^{- 1})} \int_{t_{1}}^{t_{2}} u^{l} e^{- {(- κ_{ν} u)}^{β_{ν}}} d u \\ = \frac{{(- 1)}^{l + 1} κ_{ν} β_{ν}}{2 Γ (β_{ν}^{- 1})} \int_{- t_{1}}^{- t_{2}} u^{l} e^{- {(- κ_{ν} u)}^{β_{ν}}} d u \\ = \frac{γ ((l + 1) β_{ν}^{- 1}, {(- κ_{ν} t_{2})}^{β_{ν}}) - γ ((l + 1) β_{ν}^{- 1}, {(- κ_{ν} t_{1})}^{β_{ν}})}{2 {(- 1)}^{l + 1} σ_{ν}^{- l} \sqrt{{(Γ (β_{ν}^{- 1}))}^{2 - l} {(Γ (3 β_{ν}^{- 1}))}^{l}}} \end{align}

(32)

where the final equality follows from (31).

Last, while t₁ ≤ 0 and t₂ ≥ 0, it follows that

\begin{align} I (t_{1}, t_{2}) & = \frac{κ_{ν} β_{ν}}{2 Γ (β_{ν}^{- 1})} (\int_{t_{1}}^{0} u^{l} e^{- {(- κ_{ν} u)}^{β_{ν}}} d u + \int_{0}^{t_{2}} u^{l} e^{- {(- κ_{ν} u)}^{β_{ν}}} d u) \\ = \frac{γ ((l + 1) β_{ν}^{- 1}, 0) - γ ((l + 1) β_{ν}^{- 1}, {(- κ_{ν} t_{1})}^{β_{ν}})}{2 {(- 1)}^{l + 1} σ_{ν}^{- l} \sqrt{{(Γ (β_{ν}^{- 1}))}^{2 - l} {(Γ (3 β_{ν}^{- 1}))}^{l}}} \\ + \frac{γ ((l + 1) β_{ν}^{- 1}, {(κ_{ν} t_{2})}^{β_{U}}) - γ ((l + 1) β_{ν}^{- 1}, 0)}{2 σ_{ν}^{- l} \sqrt{{(Γ (β_{ν}^{- 1}))}^{2 - l} {(Γ (3 β_{ν}^{- 1}))}^{l}}} \end{align}

(33)

where the final equality is due to (31) and (32). Combining the three cases, a unified form of I(t₁, t₂) is

\begin{align} I (t_{1}, t_{2}) & = \frac{γ ((l + 1) β_{ν}^{- 1}, 0) - γ ((l + 1) β_{ν}^{- 1}, | κ_{ν} t_{1} |^{β_{ν}})}{2 {(s g n (t_{1}))}^{l + 1} σ_{ν}^{- l} \sqrt{{(Γ (β_{ν}^{- 1}))}^{2 - l} {(Γ (3 β_{ν}^{- 1}))}^{l}}} \\ + \frac{γ ((l + 1) β_{ν}^{- 1}, | κ_{ν} t_{2} |^{β_{ν}}) - γ ((l + 1) β_{ν}^{- 1}, 0)}{2 {(s g n (t_{2}))}^{l + 1} σ_{ν}^{- l} \sqrt{{(Γ (β_{ν}^{- 1}))}^{2 - l} {(Γ (3 β_{ν}^{- 1}))}^{l}}} \end{align}

(34)

By the formula (34) and the CDF of the GGD, (29) becomes

\begin{align} \int_{- Δ}^{Δ} | u | p_{ν} (t - u) d u & = \frac{γ (2 β_{ν}^{- 1}, | κ (t - Δ) |^{β_{ν}}) - γ (2 β_{ν}^{- 1}, | κ t |^{β_{ν}})}{2 σ_{ν}^{- 1} \sqrt{Γ (β_{ν}^{- 1}) Γ (3 β_{ν}^{- 1})}} \\ + \frac{γ (2 β_{ν}^{- 1}, | κ (t + ▵) |^{β_{ν}}) - γ (2 β_{ν}^{- 1}, | κ t |^{β_{ν}})}{2 σ_{ν}^{- 1} \sqrt{Γ (β_{ν}^{- 1}) Γ (3 β_{ν}^{- 1})}} \\ + (2 Ψ_{ν} (t) - Ψ_{ν} (t - Δ) - Ψ_{ν} (t + Δ)) t \end{align}

and (30) becomes

\begin{align} \int_{- Δ}^{Δ} u^{2} p_{ν} (t - u) d u & = \frac{γ (3 β_{ν}^{- 1}, 0) - γ (3 β_{ν}^{- 1}, | κ_{ν} (t - ▵) |^{β_{ν}})}{2 s g n (t - Δ) σ_{ν}^{- 2} Γ (3 β_{ν}^{- 1})} \\ + \frac{γ (3 β_{ν}^{- 1}, | κ_{ν} (t + ▵) |^{β_{ν}}) - γ (3 β_{ν}^{- 1}, 0)}{2 s g n (t + Δ) σ_{ν}^{- 2} Γ (3 β_{ν}^{- 1})} \\ - \frac{(γ (2 β_{ν}^{- 1}, | κ (t + Δ) |^{β_{ν}}) - γ (2 β_{ν}^{- 1}, | κ (t - Δ) |^{β_{ν}})) t}{σ_{ν}^{- 1} \sqrt{Γ (β_{ν}^{- 1}) Γ (3 β_{ν}^{- 1})}} \\ + (Ψ_{ν} (t + Δ) - Ψ_{ν} (t - Δ)) t^{2} \end{align}

Acknowledgements

This study was supported by the National Natural Science Foundation of China (Grant No. 60803122, 61103018), by the Natural Science Foundation of Jiangsu Province (Grant No. BK2011442), by the Innovative Foundation of Yangzhou University (Grant No. 2011CXJ023), by the Opening Project of State Key Laboratory of Digital Publishing Technology, and by the Opening Project of State Key Laboratory of Software Development Environment (Grant No. SKLSDE-2011KF-08). The authors would like to thank the anonymous reviewers for their detailed comments that improved both the editorial and technical quality of this article substantially.

Abbreviations

AWGN:: additive white Gaussian noise
CDF:: cumulative distribution function
CLT:: central limit theorem
DC:: distortion-compensation
DC-DM:: distortion compensated dither modulation
DC-NDM:: distortion compensated NDM
DM:: dither modulation
DWR:: document-to-watermark ratio
GGD:: generalized Gaussian distribution
i.i.d.:: independently and identically distributed
IM-NDM:: improved NDM
LQIM:: logarithmic QIM
NDM:: normalized dither modulation
NLPE1:: number of blocks greater than the first local perceptual error threshold
NLPE2:: number of blocks greater than the second local perceptual error thresh-old
PDF:: probability distribution function
PMF:: probability mass function
BER:: bit error rate
QIM:: quantization index modulation
r.v.:: random variable
RDM:: rational dithered modulation
SCS:: scalar Costa scheme
STDM:: spread transform dither modulation
TPE:: total perceptual error
WNR:: watermark-to-noise ratio
wPSNR:: weighted peak signal-to-noise ratio.

References

Chen B, Wornell GW: Quantization index modulation: a class of provably good methods fordigital watermarking and information embedding. IEEE Trans Inf Theory 2001, 47(4):1423-1443. 10.1109/18.923725
Article MathSciNet MATH Google Scholar
Eggers JJ, Bauml R, Tzschoppe R, Girod B: Scalar costa scheme for information embedding. IEEE Trans Signal Process 2003, 51(4):1003-1019. 10.1109/TSP.2003.809366
Article MathSciNet Google Scholar
Boyer JP, Duhamel P, Blanc-Talon J: Performance analysis of scalar DC-QIM for zero-bit watermarking. IEEE Trans Inf Foren Secur 2007, 2(2):283-289.
Article Google Scholar
Kalantari NK, Ahadi SM: A logarithmic quantization index modulation for perceptually better data hiding. IEEE Trans Image Process 2010, 19(6):1504-1517.
Article MathSciNet Google Scholar
Pérez-Gonzàlez F, Balado F, Martin JRH: Performance analysis of existing and new methods for data hiding with known-host information in additive channels. IEEE Trans Signal Process 2003, 51(4):960-980. 10.1109/TSP.2003.809368
Article MathSciNet Google Scholar
Bartolini F, Barni M, Piva A: Performance analysis of ST-DM watermarking in presence of nonadditive attacks. IEEE Trans Signal Process 2004, 52(10):2965-2974. 10.1109/TSP.2004.833868
Article Google Scholar
Pérez-Gonzàlez F, Mosquera C: Quantization-based data hiding robust to linear-time-invariant filtering. IEEE Trans Inf Foren Secur 2008, 3(2):137-152.
Article Google Scholar
Conway JH, Sloane NJA: Sphere Packings, Lattices, and Groups. Springer, New York; 1988.
Book MATH Google Scholar
Pérez-Gonzàlez F, Mosquera C, Barni M, Abrardo A: Rational dither modulation: a high-rate data-hiding method invariant to gain attacks. IEEE Trans Signal Process 2005, 53(10):3960-3975.
Article MathSciNet Google Scholar
Oostveen JC, Kalker AAC, Staring M: Adaptive quantization watermarking. In Proc of SPIE: Security, Steganography, and Watermarking of Multimedia Contents VI. Volume 5306. San Jose, CA; 2004:296-303.
Chapter Google Scholar
Li Q, Cox IJ: Using perceptual models to improve fidelity and provide resistance to valumetric scaling for quantization index modulation watermarking. IEEE Trans Inf Forens Secur 2007, 2(2):127-139.
Article Google Scholar
Papoulis A: Probability, Random Variables, and Stochastic Processes. McGraw-Hill, New York; 1991.
MATH Google Scholar
Saralees N: A generalized normal distribution. J Appl Stat 2005, 32(7):685-694. 10.1080/02664760500079464
Article MathSciNet MATH Google Scholar
Schuchman L: Dither signals and their effect on quantization noise. IEEE Trans Commun Technol 1964, CT-12: 162-165.
Article Google Scholar
Do MN, Vetterli M: Wavelet-based texture retrieval using generalized gaussian density and Kullback-Leibler distance. IEEE Trans Image Process 2002, 11(2):146-158. 10.1109/83.982822
Article MathSciNet Google Scholar
Voloshynovskiy S, Pereira S, Iquise V, Pun T: Attack modelling: towards a second generation watermarking benchmark. Signal Process 2001, 81(Special 6):1177-1214.
Article MATH Google Scholar

Download references

Author information

Authors and Affiliations

School of Information Engineering, Yangzhou University, Yangzhou, 225009, China
Xinshan Zhu & Jie Ding
State Key Laboratory of Digital Publishing Technology, Chengfu Road 298, Beijing, 100871, China
Xinshan Zhu
State Key Laboratory of Software Development Environment, Beihang University, Beijing, 100191, China
Jie Ding

Authors

Xinshan Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Jie Ding
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xinshan Zhu.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Authors’ original file for figure 7

Authors’ original file for figure 8

Authors’ original file for figure 9

Authors’ original file for figure 10

Authors’ original file for figure 11

Authors’ original file for figure 12

Authors’ original file for figure 13

Authors’ original file for figure 14

Authors’ original file for figure 15

Authors’ original file for figure 16

Authors’ original file for figure 17

Authors’ original file for figure 18

Authors’ original file for figure 19

Authors’ original file for figure 20

Authors’ original file for figure 21

Authors’ original file for figure 22

Authors’ original file for figure 23

Authors’ original file for figure 24

Authors’ original file for figure 25

Authors’ original file for figure 26

Authors’ original file for figure 27

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Zhu, X., Ding, J. Performance analysis and improvement of dither modulation under the composite attacks. EURASIP J. Adv. Signal Process. 2012, 53 (2012). https://doi.org/10.1186/1687-6180-2012-53

Download citation

Received: 20 June 2011
Accepted: 02 March 2012
Published: 02 March 2012
DOI: https://doi.org/10.1186/1687-6180-2012-53

Performance analysis and improvement of dither modulation under the composite attacks

Abstract

1 Introduction

2 Review of DM and problem

3 PDF models

3.1 PDF model of the watermarked signal

3.2 PDF model of the attacked signal

3.3 PDF model of the extracted signal

4 Performance analysis of DM against the composite attacks

4.1 Generalized Gaussian host and noise

4.2 Simulations on artificial signals

5 Normalized DM and its performance

5.1 Normalized DM

5.2 Performance analysis

6 The improvement of NDM

7 Experimental results

7.1 Theoretical verification

7.2 Performance evaluation

8 Conclusion

Appendix

Acknowledgements

Abbreviations

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ original submitted files for images

Rights and permissions

About this article

Cite this article

Share this article

Keywords