# Efficient cooperative compressive spectrum sensing by identifying multi-candidate and exploiting deterministic matrix

Jia Li^{1}, Qiang Wang^{1} (Email author), Wenjie Yan^{2} and Yi Shen^{1}

**2015**:85

https://doi.org/10.1186/s13634-015-0267-1

© Li et al. 2015

**Received:** 9 February 2015

**Accepted:** 29 August 2015

**Published:** 22 September 2015

## Abstract

Cooperative spectrum sensing exploits spatial diversity to improve the detection of occupied channels in cognitive radio networks (CRNs). Cooperative compressive spectrum sensing (CCSS), which utilizes the sparsity of channel occupancy, further improves efficiency by reducing the number of reports without degrading detection performance. In this paper, we first and mainly propose the multi-candidate orthogonal matrix matching pursuit (MOMMP) algorithms to efficiently and effectively detect occupied channels at the fusion center (FC), where multi-candidate identification reduces the number of required iterations and orthogonal projection improves the probability of exact identification. Second, two different approaches, based on a threshold and on the Gaussian distribution, are introduced to realize the multi-candidate identification. Moreover, to improve the detection accuracy and energy efficiency, we propose the matrix construction based on shrinkage and gradient descent (MCSGD) algorithm to provide a deterministic filter coefficient matrix of low *t*-averaged coherence. Finally, several numerical simulations validate that our proposals provide satisfactory performance, with higher probability of detection, lower probability of false alarm, and less detection time.

## 1 Introduction

Cognitive radio (CR), as a promising technology to improve spectrum efficiency in wireless communication, has drawn great attention in recent years [1]. By detecting the channel states or spectrum holes, CR users are allowed to communicate over the spectrum allocated to primary users (PUs) when the PUs are not fully utilizing it. Spectrum sensing is the key technology for this application [2]. Considering that the detection of a single CR user may suffer from wireless channel fading and the hidden terminal problem, cooperative spectrum sensing (CSS), which exploits several spatially distributed CR users, is introduced to improve the detection performance of whole CR networks (CRNs) [3].

Although CSS benefits from cooperative gain to improve the detection accuracy, it also incurs cooperation overhead, which refers to any extra sensing time, delay, energy, reports, and operations devoted to CSS [4–6]. Traditionally, in centralized CSS, the fusion center (FC) collects all sensing data and then makes decisions on channel occupancy. However, this reporting requires significant bandwidth over a wired high-speed backbone. One natural idea to address this issue is that each CR user reports a summary statistic. Considering that, in compressive sensing (CS) theory [7], sparse signals can be exactly recovered from fewer measurements than are usually considered necessary, cooperative compressive spectrum sensing (CCSS) using linear combinations as summary statistics is feasible. This is supported by recent studies on the actual spectrum utilization efficiency, which have revealed that a large portion of the licensed spectrum experiences low utilization, i.e., the channel occupancy is sparse. Thus, the energy efficiency of CRNs can be improved by reducing the number of reports.

A novel CCSS model, which exploits a frequency-selective filter at each CR user to linearly combine multiple channel states, was first introduced in [8]. Two detection algorithms, based respectively on matrix completion and jointly sparse recovery, were also proposed there to recover the channel occupancy; however, even the relatively faster jointly sparse recovery algorithm still involves multiple constrained optimization sub-problems. Many researchers have since paid attention to the design of jointly sparse recovery algorithms. In [9], a decentralized consensus optimization algorithm was derived to estimate the spectrum, where an *l*_{1}-regularized least squares problem was solved at each CR user. To further reduce the transmission burden by exchanging only the estimated joint support with neighboring users, Ling et al. [10] introduced several centralized or decentralized reweighted *l*_{q} algorithms (*q* = 1, 2). However, the above algorithms have high computational complexity, which limits their application in real-time and large-scale systems. In [11], for a support with infinite cardinality, a continuous-to-finite (CTF) block, consisting of the construction of a finite frame and the recovery of the joint support, was proposed for the recovery stage. However, in this paper, the support has finite cardinality, which makes the above block inapplicable. Recently, based on the idea of the matching pursuit (MP) algorithm [12], three more efficient algorithms were proposed in [13, 14]. Unfortunately, they suffer from the non-orthogonality between the selected atoms and the residual data, which reduces the possibility of exactly detecting more occupied channels. Just as in the extension from MP to the orthogonal MP (OMP) algorithm [15], the orthogonal projection (or least squares) technique can overcome this shortcoming. To further improve the efficiency, one simple and intuitive idea is to reduce the number of required iterations by identifying multiple candidates in each iteration, resulting in the multi-candidate orthogonal matrix matching pursuit (MOMMP) algorithm. One such algorithm, based on the subspace pursuit (SP) algorithm [16], has been introduced in [17]. However, it requires a priori knowledge of the unknown and varying sparsity, which makes it practically inapplicable. Motivated respectively by the idea of the generalized OMP (gOMP) algorithm [18] and by that of the stagewise OMP (StOMP) algorithm [19] and the adaptive OMP (AOMP) algorithm [14], this paper proposes two simple approaches to realize the multi-candidate identification.

In addition to the design of the detection algorithm, the selection of the filter coefficient matrix also greatly affects the detection performance, because detection is essentially a jointly sparse recovery problem. According to CS theory, one can guarantee an exact recovery when the filter coefficient matrix satisfies the restricted isometry property (RIP) [20]. Although random matrices under the Gaussian distribution have been proven to obey the RIP with overwhelming probability, it is NP-hard to certify that a given random matrix indeed satisfies the RIP. Therefore, it is of interest to find or construct a deterministic RIP matrix. In addition, a deterministic matrix benefits the energy efficiency of CSS, since its elements do not need to be transmitted and it reduces the number of reports required under a detection accuracy constraint. To date, some researchers have focused on matrix construction [21, 22]. Along this research line, we propose a novel shrinkage function and a gradient descent method to construct a deterministic matrix, which has low *t*-averaged coherence and improves the detection effectiveness and efficiency.

Under the assumption that the received wide-sense stationary signal is sparse in the frequency domain, many studies focus on the reduction of the sampling rate and the recovery of the original signal [23, 24]. Recently, power spectrum sensing has drawn much attention, since the above sparsity assumption can be removed and an efficient recovery can be achieved at the same time. Along this research line, several sampling systems are introduced in [11, 25–31]. The multi-coset sampling used in [25–28] is based on multi-rate signal processing and employs parallel cosets that uniformly sample the signal at a decimated rate. Similarly, multiple branches are considered in [11, 30, 32], where each branch employs a different periodic mixing function followed by low-pass filtering and low-rate uniform sampling or an integrate-and-dump process. In [29], a coprime sampling scheme is employed, which uses two sampling branches whose sampling rates are coprime. In addition, a distributed scheme to estimate the power spectrum from lossy observations is proposed in [31]. Different from the above works, this paper considers the sparsity of channel occupancy, which represents the states of all channels. Instead of time samples, each CR user transmits fewer linear combinations of multiple channel states to the FC. Compared with the *L* compressions in each CR sensor [28], where *L* is the number of samplers in each coset, this paper implements only one compression in each CR user, which can be described as a product of the filter coefficient matrix and the sparse channel occupancy. As for the detection step, similar to the works in [27, 33], our proposal does not concentrate on the exact recovery of the original signal either. Based on the energy distribution of the row vectors of the identification matrix, only the locations of nonzero rows are identified to obtain the sparse channel occupancy. Meanwhile, though the MOMMP algorithms execute one orthogonal projection step in every iteration, the multi-candidate identification significantly reduces the computational burden by decreasing the number of required iterations. In terms of detection from lossy measurements due to channel fading [31, 33], this paper performs extensive numerical experiments to validate the superiority of our proposal.

The main contributions of this paper are summarized as follows:

1. We propose the MOMMP algorithms to reduce detection time without degrading detection accuracy, in which multi-candidate identification and orthogonal projection are implemented in each iteration to detect several occupied channels and to recover partial state data, respectively;
2. We introduce two simple approaches to realize the multi-candidate identification. In the threshold-based MOMMP (TMOMMP) algorithm, a certain number of coordinates are identified in each iteration, and a threshold-based scheme is then utilized to redeclare the occupied channels. Furthermore, in the MOMMP based on Gaussian distribution (MOMMP-GD) algorithm, an uncertain number of coordinates are identified in each iteration according to the distribution of the energy of the row vectors in the identification matrix;
3. We propose the matrix construction based on shrinkage and gradient descent (MCSGD) algorithm to construct a deterministic filter coefficient matrix, which is of low *t*-averaged coherence;
4. The advantages of integrating the MOMMP and MCSGD algorithms are a higher probability of exactly detecting occupied channels, less detection time, and no need to transmit the components of the deterministic filter coefficient matrix.

The rest of this paper is organized as follows. The second section briefly reviews the CS theory and the system model of CCSS. In the third section, the proposed TMOMMP and MOMMP-GD algorithms are described in detail. The proposed MCSGD algorithm providing deterministic matrix as well as its experimental validations are presented in the fourth section. Several numerical experiments validating detection accuracy and time are shown in the fifth section, followed by the Conclusions section.

## 2 System model of cooperative compressive spectrum sensing (CCSS)

### 2.1 Compressive sensing (CS) theory

A vector *x* ∈ *R*^{n} is said to be a sparse signal with sparsity *s* (*s* ≪ *n*) if |*Γ*| ≤ *s*, where *Γ* = {*i* | *x*_{i} ≠ 0, *i* = 1,…,*n*} denotes its support. Assume the measurement signal *y* ∈ *R*^{p} is perturbed as

\[ y = \Phi x + e, \tag{1} \]

where *Φ* ∈ *R*^{p×n} (*p* < *n*) is the measurement matrix with each normalized column termed an atom, and *e* is a noise vector. The recovery of *x* can be achieved by solving an *l*_{0} (pseudo)-norm minimization problem.
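The measurement model can be illustrated with a small numerical sketch. The dimensions, amplitude range, noise level, and random seed below are illustrative assumptions and are not taken from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)
n, p, s = 50, 20, 3          # signal length, measurements, sparsity (illustrative values)

# Sparse signal x whose support Gamma has size s
support = np.sort(rng.choice(n, size=s, replace=False))
x = np.zeros(n)
x[support] = rng.uniform(1.0, 2.0, size=s)

# Measurement matrix Phi with unit-norm columns (atoms)
Phi = rng.standard_normal((p, n))
Phi /= np.linalg.norm(Phi, axis=0)

# Perturbed measurement y = Phi x + e
e = 0.01 * rng.standard_normal(p)
y = Phi @ x + e

print("support:", support, "| y has", y.size, "entries instead of", n)
```

The point of the sketch is only that *y* carries *p* < *n* numbers while *x* is determined by its few nonzero entries.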

### 2.2 System model

We consider a CRN with *m* CR users that locally sense the state of *n* channels. A channel is either occupied by its licensed PU or unoccupied, corresponding to the states 1 and 0, respectively. Traditionally, each CR user sends the *n* states in order to the centralized FC, which makes global decisions on channel occupancy. Supposing that there is at most one PU on each channel and only *s* (*s* < *n*) channels are occupied, the channel occupancy can be described as a vector of length *n* and sparsity *s*. Because each CR user can only sense a limited number of channels at a time, and because of limited energy reserves and hardware constraints, it is impossible for each CR user to detect all *n* channels simultaneously.

Instead, each CR user is equipped with a frequency-selective filter that takes *p* (*p* < *n*) linear combinations of multiple channel states [8]. In this way, each CR user sends only *p* linear combinations to the FC, which can be described as a compression by a filter coefficient matrix *F* ∈ *R*^{p×n}. We set an *n*×*n* diagonal matrix *R* to represent the states of the *n* channels, where the *j*th (1 ≤ *j* ≤ *n*) diagonal entry is the state of the *j*th channel (0 or 1). The *p* reports from each CR user form the vector *F*·diag(*R*) ∈ *R*^{p×1}, whose elements are collected in order by the FC. The channel gain between the *m* CR users and the *n* channels can be described by an *m*×*n* channel gain matrix *C* [34], whose (*i*,*j*)th component is determined by the following quantities: *P*_{j} is the transmitted power of the PU using the *j*th channel, *d*_{i,j} denotes the distance between the PU using the *j*th channel and the *i*th CR user, *α* is the propagation loss factor, and *h*_{i,j} is the channel fading gain. Here, we consider *h*_{i,j} = 1, ∀*i*,*j* for the additive white Gaussian noise (AWGN) channel and |*h*_{i,j}| following an independent Rayleigh distribution for the Rayleigh channel [34].
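As a rough illustration of how such a gain matrix could be generated, the sketch below assumes the path-loss form C[i, j] = P_j · d_{i,j}^(−α/2) · |h_{i,j}|. This closed form is an assumption made here for the example, since the expression itself is not reproduced in the text; the function name `channel_gain_matrix`, the node positions, and the unit Rayleigh scale are likewise hypothetical.

```python
import numpy as np

def channel_gain_matrix(cr_xy, pu_xy, power, alpha=2.0, rayleigh=False, rng=None):
    """Return the m x n gain matrix C under the ASSUMED form
    C[i, j] = P_j * d_ij**(-alpha / 2) * |h_ij|."""
    rng = np.random.default_rng() if rng is None else rng
    # d[i, j]: distance between CR user i and the PU on channel j
    d = np.linalg.norm(cr_xy[:, None, :] - pu_xy[None, :, :], axis=2)
    if rayleigh:
        h = rng.rayleigh(scale=1.0, size=d.shape)   # |h_ij| Rayleigh distributed
    else:
        h = np.ones_like(d)                         # AWGN channel: h_ij = 1
    return power[None, :] * d ** (-alpha / 2.0) * h

rng = np.random.default_rng(1)
m, n = 4, 6
cr_xy = rng.uniform(-250, 250, size=(m, 2))    # CR users (hypothetical positions)
pu_xy = rng.uniform(-500, 500, size=(n, 2))    # PUs (hypothetical positions)
power = np.ones(n)
C = channel_gain_matrix(cr_xy, pu_xy, power, alpha=2.0)
print(C.shape)
```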

Applying *F* ∈ *R*^{p×n} on each CR user, we obtain the measurement data *M* ∈ *R*^{p×m}. Considering the inevitable perturbation in transmission, *M* is further perturbed as

\[ M = F R C^{T} + N = F X + N, \tag{4} \]

where *X* = *R**C*^{T} ∈ *R*^{n×m} denotes the state data, and *N* ∈ *R*^{p×m} is Gaussian noise in the reporting channel. In this way, the number of reports is reduced from *nm* to *pm*, which saves the energy consumed in sharing channel information. Generally, transmission failures may make some entries of *M* missing. Therefore, the entries of the received data *M*^{E} are described as

\[ M^{E}_{i,j} = \begin{cases} M_{i,j}, & (i,j) \in E, \\ 0, & \text{otherwise}, \end{cases} \tag{5} \]

where the index set *E* denotes the successfully received locations in *M*.
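The reporting model can be sketched numerically as follows. The dimensions, the placeholder channel gains, the noise level, and the 90 % delivery rate are hypothetical values, and zero-filling the lost reports is an assumed convention for the example.

```python
import numpy as np

rng = np.random.default_rng(2)
n, p, m, s = 30, 12, 5, 3

# Diagonal state matrix R: 1 on occupied channels, 0 elsewhere
occupied = np.sort(rng.choice(n, size=s, replace=False))
state = np.zeros(n)
state[occupied] = 1.0
R = np.diag(state)

C = rng.uniform(0.5, 1.5, size=(m, n))   # placeholder channel gains (hypothetical values)
F = rng.standard_normal((p, n))
F /= np.linalg.norm(F, axis=0)           # unit-norm columns

X = R @ C.T                              # state data, n x m, nonzero rows = occupied channels
N = 0.01 * rng.standard_normal((p, m))   # reporting noise
M = F @ X + N                            # p*m reports instead of n*m

# Lost reports: keep entries indexed by E, zero the rest (assumed convention)
received = rng.random((p, m)) < 0.9      # boolean mask standing in for the index set E
M_E = np.where(received, M, 0.0)

print("reports sent:", p * m, "instead of", n * m, "| received:", int(received.sum()))
```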

According to the definition of *R* and the expression in (4), the detection can be achieved by solving a jointly sparse recovery problem. To guarantee the uniqueness and existence of its solution, *F* should satisfy the RIP [20]. Considering that CCSS only focuses on the locations of occupied channels, i.e., the nonzero elements in diag(*R*), we propose instead to find the nonzero rows of *X*.

## 3 Multi-candidate orthogonal matrix matching pursuit (MOMMP) algorithms

### 3.1 The description of MOMMP algorithm

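Since only the rows of *X* indexed by the occupied channels are nonzero, the received data *M*^{E} is written as

\[ M^{E} = F_{:,\Gamma} X_{\Gamma,:} + N, \tag{6} \]

where *Γ* ⊂ {1,…,*n*} is the support of diag(*R*). The following identification matrix is firstly generated

\[ B = F^{T} M^{E}. \tag{7} \]

In terms of the columns of *F*, we further express the *i*th row of *B* as

\[ B_{i,:} = \sum_{j \in \Gamma} F_{:,i}^{T} F_{:,j} X_{j,:} + F_{:,i}^{T} N. \tag{8} \]

Up to the noise term, the nonzero rows of *X* could be read directly from *B* when the columns of *F* are orthogonal. However, it is the fact *p* ≪ *n* that makes the above case impossible. To be specific, the lower bound of the absolute inner product of distinct columns in *F* ∈ *R*^{p×n}, i.e., the following Welch bound [35], makes all the rows of *B* nonzero.

\[ \max_{i \neq j} |F_{:,i}^{T} F_{:,j}| \geq \mu_{w} = \sqrt{\frac{n-p}{p(n-1)}}. \tag{9} \]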
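For a sense of scale, the Welch bound for a *p*×*n* matrix with unit-norm columns is sqrt((n − p) / (p(n − 1))). The short sketch below evaluates it; the helper name `welch_bound` and the chosen values of *p* are illustrative.

```python
import math

def welch_bound(p: int, n: int) -> float:
    """Lower bound on the coherence of any p x n matrix with unit-norm columns (p <= n)."""
    if not (1 <= p <= n) or n < 2:
        raise ValueError("require 1 <= p <= n and n >= 2")
    return math.sqrt((n - p) / (p * (n - 1)))

for p in (100, 200, 400):
    print(p, round(welch_bound(p, 500), 4))
```

The bound is strictly positive whenever *p* < *n*, which is why the off-support rows of *B* cannot be expected to vanish.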

Fortunately, the normalized columns of *F* still guarantee that *B* approximates *X*, since \(|F_{:,i}^{T} F_{:,j}| \ll |F_{:,i}^{T} F_{:,i}|=1\) for all *i* ≠ *j*. This means the *i*th row of *B* has a large norm if the *i*th row of *X* is nonzero. Therefore, we instead compute ∥*B*_{i,:}∥_{1}, 1 ≤ *i* ≤ *n*, to judge, in a probabilistic sense, whether *X*_{i,:} is a null vector.
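A minimal sketch of this identification step is given below. The helper name `row_energies`, the problem sizes, the support, and the noise-free setting are assumptions chosen for illustration.

```python
import numpy as np

def row_energies(F, Y):
    """1-norm of every row of the identification matrix B = F^T Y."""
    B = F.T @ Y
    return np.abs(B).sum(axis=1)

rng = np.random.default_rng(3)
n, p, m = 60, 40, 8
support = np.array([5, 17, 42])                       # hypothetical occupied channels
X = np.zeros((n, m))
X[support] = rng.uniform(1.0, 2.0, size=(support.size, m))
F = rng.standard_normal((p, n))
F /= np.linalg.norm(F, axis=0)

h = row_energies(F, F @ X)                            # noise-free reports for clarity
ranked = np.argsort(h)[::-1]                          # coordinates by decreasing energy
print("true support:", support)
print("largest energies at:", ranked[:6])
```

With an identity matrix in place of *F*, the energies are nonzero exactly on the support; with *p* < *n*, every row picks up some leakage through the column correlations, so the decision becomes a matter of relative magnitude.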

Following the idea of the MMP and revised MMP (RMMP) algorithms in [13], we can detect one occupied channel in each iteration and terminate the detection after the *s*th iteration. However, the actual sparsity *s* is often unknown, since it corresponds to the instantaneous spectrum occupancy of PUs, which is dynamically varying. Although the typically available upper bound *s*_{max}, which can be measured from the maximum spectrum utilization observed statistically over a time period, provides a conservative determination of *s*, the cost is still high due to the *O*(*s*) iterations in detection. In CRNs, the detection time shall be small enough compared with the channel coherence time and user dynamics, such that transmission opportunities can be utilized before the spectrum occupancy profile has changed. Thus, it is still worth investigating how to reduce the detection time without obviously degrading accuracy.

Initializing the index set *Λ* = ∅, we detect several occupied channels in the *l*th iteration by identifying a set *Υ*^{l}, where |*Υ*^{l}| ≥ 1. This is followed by the set expansion *Λ* = *Λ* ∪ *Υ*^{l}. To combat the non-orthogonality between the selected atoms and the residual data [14], which adversely affects the possibility of exactly detecting more occupied channels, we implement an orthogonal projection process to compute the partial state data as \( {X_{\Lambda,:}}= F_{:,\Lambda }^{\dag } {M}\). Finally, the residual data is updated as *Y*_{r} = *M* − *F*_{:,Λ}*X*_{Λ,:}. In this way, the issue of repeated detection is addressed due to the following result

\[ F_{:,\Lambda}^{T} Y_{r} = F_{:,\Lambda}^{T} \left( M - F_{:,\Lambda} F_{:,\Lambda}^{\dag} M \right) = 0, \tag{10} \]

so the rows of the next identification matrix indexed by *Λ* vanish.

With the aid of the multi-candidate identification and orthogonal projection techniques, we improve the detection efficiency and effectiveness, respectively. In this paper, the algorithms following the above idea are collectively referred to as multi-candidate orthogonal MMP (MOMMP). In the following, based on two different approaches to multi-candidate identification, we propose two simple MOMMP algorithms.
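The orthogonal projection step and its effect on the residual can be checked numerically. The sketch below uses a least-squares solve, which coincides with the pseudo-inverse solution when the selected columns are linearly independent; the helper name `project_and_update`, the sizes, and the index set are illustrative.

```python
import numpy as np

def project_and_update(F, M, Lam):
    """Least-squares estimate of X on the selected atoms and the resulting residual."""
    F_sel = F[:, Lam]
    # Equals pinv(F_sel) @ M when F_sel has full column rank
    X_sel, *_ = np.linalg.lstsq(F_sel, M, rcond=None)
    Y_r = M - F_sel @ X_sel
    return X_sel, Y_r

rng = np.random.default_rng(4)
p, n, m = 20, 50, 6
F = rng.standard_normal((p, n))
F /= np.linalg.norm(F, axis=0)
M = rng.standard_normal((p, m))
Lam = [3, 11, 27]                       # hypothetical set of selected coordinates

X_sel, Y_r = project_and_update(F, M, Lam)
# Correlations between the selected atoms and the residual are numerically zero
print(np.abs(F[:, Lam].T @ Y_r).max())
```

Because these correlations vanish, an already selected coordinate contributes nothing to the next identification matrix.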

### 3.2 Threshold-based MOMMP (TMOMMP) algorithm

Inspired by the idea of the gOMP algorithm, we propose to detect *k* occupied channels in each iteration, where *k* ≥ 1 is a fixed integer. Based on the available *s*_{max}, we terminate the detection by determining a maximum iteration number *iter*_{max}.

In each iteration, \(h^{l}=[ {\| {B_{1,:}^{l}} \|}_{1},\ldots,{\| {B_{n,:}^{l}} \|}_{1} ]\) is firstly computed from the generated identification matrix *B*^{l} = *F*^{T}*Y*_{r}. Then, *Υ*^{l} is identified as the set of coordinates corresponding to the *k* largest entries of *h*^{l}, and *Λ* is expanded as *Λ* = *Λ* ∪ *Υ*^{l}. Finally, the partial state data *X*_{Λ,:} is computed and the residual data *Y*_{r} is updated. In terms of its termination, we preset the maximum iteration number *iter*_{max} = ⌈*s*_{max}/*k*⌉, where ⌈*a*⌉ is the smallest integer that is not smaller than *a*.

Once the iterations terminate, *s*_{max} channels are declared as occupied, indicating *Γ* ⊂ *Λ*. Different from the classic sparse signal recovery based on Lemma 3.1 in [19], CCSS aims to exactly obtain *Γ* = *Λ*, since \(\Lambda - \Gamma \neq \varnothing \) indicates the existence of false alarms. Thus, to reduce the probability of false alarm, it is necessary to refine the obtained *Λ*, i.e., to eliminate the coordinates in *Λ* − *Γ*. In this paper, we propose a threshold-based method to achieve the refinement, where *λ*_{1} > 0 is a preset decision threshold. Empirically, the threshold is set as \(\lambda _{1} = \frac {{\rho _{1} + m \cdot \sigma _{1} }}{t_{1}}\), where *ρ*_{1} is the average 1-norm value of all the nonzero rows in *X*, *σ*_{1} is the standard deviation of all the noise components, and *t*_{1} is a parameter selected from [2.5,3.5]. In practice, a conservative determination of the unknown *σ*_{1} can be obtained from statistical observation over a time period.

Considering that the modification mainly stems from the threshold-based refinement, we refer to the above algorithm as TMOMMP, which is presented in Algorithm 1.
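The steps described in this subsection can be sketched as follows. This is not a transcription of Algorithm 1: the function name `tmommp` is hypothetical, and the refinement rule (keep a coordinate when the 1-norm of its recovered row exceeds *λ*_{1}) is an assumption made for the example, as are the test scenario and the value of `lam1`.

```python
import math
import numpy as np

def tmommp(F, M, k, s_max, lam1):
    """Sketch of threshold-based MOMMP; returns the sorted list of declared channels."""
    Lam = np.array([], dtype=int)
    Y_r = M.copy()
    X_sel = np.zeros((0, M.shape[1]))
    for _ in range(math.ceil(s_max / k)):              # iter_max = ceil(s_max / k)
        h = np.abs(F.T @ Y_r).sum(axis=1)              # row 1-norms of B = F^T Y_r
        cand = np.argsort(h)[-k:]                      # coordinates of the k largest entries
        Lam = np.union1d(Lam, cand)                    # Lambda <- Lambda U Upsilon
        X_sel, *_ = np.linalg.lstsq(F[:, Lam], M, rcond=None)   # orthogonal projection
        Y_r = M - F[:, Lam] @ X_sel                    # residual update
    # ASSUMED refinement rule: keep rows whose recovered 1-norm exceeds lam1
    keep = np.abs(X_sel).sum(axis=1) > lam1
    return Lam[keep].tolist()

rng = np.random.default_rng(6)
n, p, m = 60, 40, 8
support = [5, 17, 42]                                  # hypothetical occupied channels
X = np.zeros((n, m))
X[support] = rng.uniform(1.0, 2.0, size=(len(support), m))
F = rng.standard_normal((p, n))
F /= np.linalg.norm(F, axis=0)
M = F @ X + 0.01 * rng.standard_normal((p, m))
print(tmommp(F, M, k=2, s_max=6, lam1=2.0))
```

Selecting *k* coordinates per iteration caps the loop at ⌈*s*_{max}/*k*⌉ least-squares solves, at the price of over-selecting; the final thresholding is what removes the surplus coordinates.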

### 3.3 MOMMP based on Gaussian distribution (MOMMP-GD) algorithm

In addition to the multi-candidate identification of the former subsection, identifying an uncertain number of coordinates in each iteration is also worth studying [19]. Here, we propose to take full advantage of the energy distribution of the row vectors of the identification matrix.

In each iteration, we firstly determine the available set *Ω* − *Λ* and generate the vector \(h^{l}=[ {{{\| {B_{{l_{1}},:}^{l}} \|}_{1}}, \ldots,{{\| {B_{{l_{d}},:}^{l}} \|}_{1}}} ]\), where *Ω* = {1,…,*n*}, *d* = |*Ω* − *Λ*|, and *l* denotes the iteration counter. According to the experimental results in [14] and the property \({{\| {B_{{l_{i}},:}^{l}} \|}_{1}} \leq \sqrt m \cdot {{\| {B_{{l_{i}},:}^{l}} \|}_{2}}\), we conclude that the elements of *h*^{l} will also approximately follow the Gaussian distribution. This means only several entries of *h*^{l} are large, while the others are concentrated around a certain positive constant. Thus, we propose to iteratively identify the coordinates whose corresponding values in *h*^{l} are among these large entries. By *ρ*_{2} and *σ*_{2}, we denote the mean and standard deviation of *h*^{l}, respectively. Selecting a parameter *t*_{2} from [2.5,3.5], we fix the threshold *λ*_{2} = *ρ*_{2} + *t*_{2}·*σ*_{2}. Secondly, we identify the set \(\Upsilon ^{l} = \left \{ {{l_{i}}\left | {{h^{l}_{i}} > \lambda _{2}}, ~ i=1,\ldots,d \right.} \right \}\). The detection terminates if no new coordinate is identified; otherwise, the set is expanded as *Λ* = *Λ* ∪ *Υ*^{l}. Finally, the partial state data is computed as \({X_{\Lambda,:}} = F_{:,\Lambda }^{\dag } {M^{E}}\) and the residual data is updated as *Y*_{r} = *M*^{E} − *F*_{:,Λ}*X*_{Λ,:}.

Due to the Gaussian distribution of the elements of *h*^{l}, we refer to the above algorithm as the MOMMP-GD algorithm and show it in Algorithm 2. Compared with the similar AOMP algorithm in [14], we propose to calculate the 1-norm of each row vector and to compute *X*_{Λ,:} rather than \(X_{\Upsilon ^{l},:}\). These modifications significantly improve the probability of exact detection, which can be seen from the extensive numerical experiments. Meanwhile, the MOMMP-GD algorithm adaptively terminates the detection without prior knowledge of *s*_{max} or of the standard deviation of the noise components, which is more promising in CRNs with fewer statistical observations.
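A compact sketch of the procedure described in this subsection is given below. It is not a transcription of Algorithm 2: the function name `mommp_gd`, the iteration cap `max_iter`, and the example scenario are assumptions added for illustration.

```python
import numpy as np

def mommp_gd(F, M, t2=3.0, max_iter=50):
    """Sketch of MOMMP-GD; returns the sorted list of declared channels."""
    n = F.shape[1]
    Lam = np.array([], dtype=int)
    Y_r = M.copy()
    for _ in range(max_iter):                          # max_iter is a safety cap (assumption)
        avail = np.setdiff1d(np.arange(n), Lam)        # Omega - Lambda
        if avail.size == 0:
            break
        h = np.abs(F[:, avail].T @ Y_r).sum(axis=1)    # row energies on the available set
        lam2 = h.mean() + t2 * h.std()                 # lambda_2 = rho_2 + t_2 * sigma_2
        new = avail[h > lam2]
        if new.size == 0:                              # no new coordinate: terminate
            break
        Lam = np.union1d(Lam, new)
        X_sel, *_ = np.linalg.lstsq(F[:, Lam], M, rcond=None)   # orthogonal projection
        Y_r = M - F[:, Lam] @ X_sel                    # residual update
    return Lam.tolist()

rng = np.random.default_rng(7)
n, p, m = 100, 60, 8
support = [10, 40, 70]                                 # hypothetical occupied channels
X = np.zeros((n, m))
X[support] = rng.uniform(1.0, 2.0, size=(len(support), m))
F = rng.standard_normal((p, n))
F /= np.linalg.norm(F, axis=0)
M = F @ X + 0.01 * rng.standard_normal((p, m))
print(mommp_gd(F, M))
```

Unlike the fixed-*k* variant, the number of coordinates accepted per iteration adapts to how far the largest energies sit above the bulk, and the loop stops on its own once nothing exceeds the threshold.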

## 4 Deterministic matrix from MCSGD algorithm

To improve the detection accuracy, based on the expression in (8), we construct a deterministic matrix by reducing the value of \(|F_{:,i}^{T} F_{:,j}|, \forall i \neq j\) while preserving that of \(|F_{:,i}^{T} F_{:,i}|, \forall i \in \Omega \) to be 1. On the other hand, the system model in the second section indicates that the components of *F* are required to guarantee an exact recovery of *X* and should be transmitted to the FC, which in practice increases the energy consumption considerably. Fortunately, this issue can be alleviated by utilizing a deterministic matrix *F*, since it can be constructed and stored in the FC beforehand, so that its transmission becomes unnecessary.

Making *F* as close as possible to an equiangular tight frame (ETF) is more practical, since an ETF has the minimal coherence among all matrices of the same dimension. However, the coherence *μ*, which is defined as the largest absolute and normalized inner product between different columns, is analyzed to be a pessimistic measure in CS [21]. Thus, we also adjust our main objective to the reduction of the *t*-averaged coherence *μ*_{t}(*F*) [21], which is defined as the average of all absolute and normalized inner products between different columns in *F* (that is, the off-diagonal elements of the Gram matrix *G* = *F*^{T}*F*) that are above the predefined threshold *t*.

For *t* = 0, we obtain the average of all the absolute off-diagonal entries of *G*. As *t* grows, the value of *μ*_{t}(*F*) grows and approaches the coherence *μ* from below.
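The verbal definition above translates directly into a few lines of code. In this sketch, entries are counted when their absolute value is at least *t* (so that *t* = 0 includes every off-diagonal entry), and returning 0 when no entry qualifies is a convention chosen here; the function names are illustrative.

```python
import numpy as np

def t_averaged_coherence(F, t=0.0):
    """Average of the absolute off-diagonal Gram entries that are >= t (columns normalized first)."""
    Fn = F / np.linalg.norm(F, axis=0)
    G = np.abs(Fn.T @ Fn)
    off = G[~np.eye(G.shape[0], dtype=bool)]      # off-diagonal entries only
    sel = off[off >= t]
    return float(sel.mean()) if sel.size else 0.0  # empty selection -> 0 (convention chosen here)

def coherence(F):
    """Largest absolute normalized inner product between different columns."""
    Fn = F / np.linalg.norm(F, axis=0)
    G = np.abs(Fn.T @ Fn)
    np.fill_diagonal(G, 0.0)
    return float(G.max())

rng = np.random.default_rng(8)
F = rng.standard_normal((40, 50))
for t in (0.0, 0.1, 0.2):
    print(t, t_averaged_coherence(F, t))
print("coherence:", coherence(F))
```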

### 4.1 The description of MCSGD algorithm

The MCSGD algorithm aims to reduce *μ*_{t}(*F*). Our approach is inspired by the design of optimized projections using the shrinkage process in [21] and the gradient descent process in [22]. Consider the following convex set \(\mathcal {H}\), which contains the ideal ETFs; the deterministic matrix construction problem can be solved by alternately projecting onto \(\mathcal {H}\)

\[ \mathcal{H} = \left\{ H \in R^{n \times n} : H = H^{T},\ \mathrm{diag}(H) = \mathbf{1},\ \max_{i \neq j} |H_{i,j}| \leq \mu_{w} \right\}, \]

where the Welch bound *μ*_{w} is described in (9).

Initializing *F* as a Gaussian random matrix with normalized columns, we firstly find \(H \in {\mathcal {H}}\) that minimizes \(\left \| {F^{T}F - H} \right \|_{F}^{2}\), and then update *F* to minimize \(\left \| {F^{T}F - H} \right \|_{F}^{2}\). The *t*-averaged coherence of *F* is thus gradually reduced. The iterations are carried on until the difference of *μ*_{t}(*F*) between two consecutive iterations falls below a tolerable precision or the algorithm reaches a maximum iteration number. After the final iteration, we obtain the optimized and deterministic matrix *F*. The proposed MCSGD algorithm is presented in Algorithm 3.

### 4.2 Find *H* using shrinkage process

In [21], *H* is obtained by applying a shrinkage operation to the Gram matrix *G*. In this part, we introduce another simple but effective shrinkage function to obtain the minimizing *H*, where *γ* ∈ [*μ*_{w},1) is the shrinkage factor, which adjusts the shrinkage range of the elements in *G*.

Take *γ* = 0.4 as an example, with the input values ranging over [−1,1]. The range of the output values with the non-shrinkage function remains the same, while that with the other three techniques is [−0.4,0.4]. These results demonstrate that the shrinkage function reduces the range of values from [−1,1] to [−0.4,0.4], i.e., the relatively large absolute off-diagonal entries in *G* are effectively reduced.
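The range-reduction effect can be illustrated with a plain clipping rule. Note that clipping is only a stand-in assumed here for illustration; it is not the shrinkage function proposed in this paper, whose expression is not reproduced in the text. The function name `shrink_gram` is hypothetical.

```python
import numpy as np

def shrink_gram(G, gamma):
    """Stand-in shrinkage: clip entries to [-gamma, gamma], then restore a unit diagonal."""
    H = np.clip(G, -gamma, gamma)
    np.fill_diagonal(H, 1.0)
    return H

g = np.linspace(-1.0, 1.0, 9)
print(np.clip(g, -0.4, 0.4))     # inputs in [-1, 1] are mapped into [-0.4, 0.4]

rng = np.random.default_rng(9)
F = rng.standard_normal((8, 16))
F /= np.linalg.norm(F, axis=0)
H = shrink_gram(F.T @ F, gamma=0.4)
off_diag = H[~np.eye(16, dtype=bool)]
print(np.abs(off_diag).max())
```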

A matrix *F* ∈ *R*^{400×500} is adopted to validate the proposed MCSGD algorithm. The input parameters are fixed as: *β* = 0.01, *iter*_{ext} = 200, *iter*_{int} = 100, and *ε* = 10^{−4}. Selecting the shrinkage factor *γ* from {0.4,0.55,0.65,0.75}, we present the corresponding *t*-averaged coherence *μ*_{t}(*F*) in Fig. 2. Clearly, the value of *μ*_{t}(*F*) decreases with iterations. Moreover, a smaller *γ* requires fewer iterations to provide a lower *μ*_{t}(*F*). These simulations experimentally demonstrate the effectiveness of the shrinkage process in the MCSGD algorithm.

### 4.3 Update *F* using gradient descent process

After the implementation of the above shrinkage process, we get a temporary \(H \in {\mathcal {H}}\). The next issue to address is how to update *F*. In each iteration of the algorithm in [21], the singular value decomposition (SVD) method is applied to reduce the rank of *H*. Afterwards, its square root is built and the minimizing *F* is found. However, this algorithm creates some large values that are not present in the original Gram matrix. Large off-diagonal values in *G* sometimes completely ruin the original signal recovery algorithms.

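Instead, we use a gradient descent process to update *F*. Denoting the cost function by \(J = \left \| {{F^{T}}F - H} \right \|_{F}^{2}\), the minimization problem is solved by the gradient descent process \({F_{i,j}} \leftarrow {F_{i,j}} - \eta \frac {{\partial J}}{{\partial {F_{i,j}}}}\), where *η* > 0 is the stepsize. According to matrix analysis [36], the following result holds

\[ \frac{\partial J}{\partial F} = 4F\left(F^{T}F - H\right), \qquad F^{(l+1)} = F^{(l)} - \beta F^{(l)}\left(F^{(l)T}F^{(l)} - H\right), \]

where *l* is the iteration counter and the stepsize *β* = 4*η*.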
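For a symmetric *H*, the gradient of *J* with respect to *F* is 4*F*(*F*^{T}*F* − *H*), which is where the factor *β* = 4*η* comes from. The sketch below checks one gradient entry against a central finite difference; the matrix sizes, the choice *H* = *I*, and the function names are illustrative assumptions.

```python
import numpy as np

def cost(F, H):
    """J = ||F^T F - H||_F^2."""
    return np.linalg.norm(F.T @ F - H, "fro") ** 2

def gradient(F, H):
    """dJ/dF = 4 F (F^T F - H), valid for symmetric H."""
    return 4.0 * F @ (F.T @ F - H)

def gd_step(F, H, beta):
    """One update F <- F - beta * F (F^T F - H), i.e. eta = beta / 4."""
    return F - beta * F @ (F.T @ F - H)

rng = np.random.default_rng(5)
F = rng.standard_normal((3, 4))
H = np.eye(4)

# Central finite-difference check of a single gradient entry
eps = 1e-6
E = np.zeros_like(F)
E[1, 2] = eps
numeric = (cost(F + E, H) - cost(F - E, H)) / (2 * eps)
print(numeric, gradient(F, H)[1, 2])
```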

The influence of the stepsize *β* is also experimentally investigated. The input parameters are the same as in the previous subsection, except that *γ* is fixed at 0.4. For each *β* selected from {0.001,0.005,0.01,0.03,0.05}, the value of the *t*-averaged coherence *μ*_{t}(*F*) is shown as a function of iterations in Fig. 3. It can be seen that both *μ*_{t}(*F*) and the number of required iterations decrease as *β* increases, and they finally become stable when *β* ≥ 0.03.

### 4.4 MCSGD analysis

According to the result in [37], constructing ETFs using alternating minimization methods attains a global minimum subject to an appropriate initialization. In detail, there exists at least one accumulation point for \(\min _{G,H} \left \| {G - H} \right \|_{F}^{2}\), and the minimization leads to a global minimum after an infinite number of iterations.

As shown in Algorithm 3, from the macro perspective we also solve the matrix construction problem by the alternating projection method. To be specific, the shrinkage process and the gradient descent process update *H* and *F*, respectively, by solving two minimization problems in the alternating projection procedure, and this procedure is repeated for a number of iterations. Due to the operations in alternating projection, the structures of *G* = *F*^{T}*F* and *H* change frequently. Under such operations, the set \(\mathcal {H}\) cannot remain convex all the time, which makes the whole problem a nonconvex one. Since the set \(\mathcal {H}\) is generally nonconvex, alternating projection may not converge as one might wish. However, it appears that norm convergence always occurs in practice [37]. The convergence of the first process is guaranteed only if *γ* is chosen not far from 1; then the structures of *G* and *H* change smoothly and slowly, and their distance becomes shorter in the norm measure. Although a smaller *γ* leads to a much faster convergence rate, *γ* cannot be set too small, since this causes all the off-diagonal entries of the Gram matrix to break through the Welch bound and even approach zero, which may lead to severe non-convergence [21]. Facing the above tradeoff, and according to the simulation results in the former subsection, we choose *γ* = 0.4 as a reasonable choice. Regarding the second minimization, the proposed method uses gradient descent to update *F*, which guarantees a gradual reduction of the cost function *J* and convergence to a local minimum [22].

For matrices with *p* varying from 100 to 400 in increments of 25 and fixed *n* = 500, their values of the *t*-averaged coherence *μ*_{t}(*F*) as well as the corresponding Welch bound are computed and shown in Fig. 4. For each *p*, the *t*-averaged coherence of *F* from the MCSGD algorithm is much lower than that of the other algorithms and closer to the Welch bound. Therefore, the proposed MCSGD algorithm performs best among the compared algorithms in providing the deterministic matrix *F*.

Combining the above theoretical and experimental results, we conjecture that the deterministic matrix *F* constructed by MCSGD algorithm further improves the detection performance of MOMMP algorithms. Referring to the realization of deterministic matrix *F* on each CR user, frequency-selective surfaces (FSSs) [38] and random filtering [39, 40] can be utilized as effective approaches.

## 5 Numerical experiments

In this section, we validate the detection accuracy and time of two MOMMP algorithms by performing numerical experiments under different sampling rates, SNRs, numbers of CR users, and reports, where the deterministic matrix *F* from MCSGD algorithm is utilized as a filter coefficient matrix.

### 5.1 Simulation setting

The *m* CR users are uniformly and randomly located within a 500×500 m^{2} area centered at the FC, and they cooperatively detect the presence of PUs within a 1000×1000 m^{2} area centered at the same FC, to which *n* = 500 channels are licensed. Suppose that only *s* channels are occupied, and their locations are chosen uniformly at random. A simple model of the CRN is illustrated in Fig. 5. Rather than *n* channel states, only *p* linear combinations are reported by each CR user. Thus, the measurement data *M* ∈ *R*^{p×m}, instead of the state data *X* ∈ *R*^{n×m}, is transmitted from the *m* CR users to the FC, where the deterministic matrix *F* ∈ *R*^{p×n} from the MCSGD algorithm is adopted as the filter coefficient matrix. During the transmission, *M* is corrupted by Gaussian noise *N* with SNR varying from −15 to 30 dB.

*E*| means the number of reports received at the FC. After the implementation of MOMMP algorithms, the detection performance is evaluated by computing the following probability of detection

*P*

_{ D }and probability of false alarm

*P*

_{ F }

where No.Hit is the number of exact detections of the presence of PUs, No.Miss is the number of miss detections, No.False is the number of false alarms.
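To make the counting concrete, here is a minimal sketch of how No.Hit, No.Miss, and No.False could be tallied from a true and a declared set of occupied channels. The ratios used are assumptions, not formulas taken from this section: *P*_{D} is taken as No.Hit/(No.Hit+No.Miss), and *P*_{F} is normalized by the number of truly unoccupied channels. The function name and the toy channel indices are hypothetical.

```python
def detection_rates(true_occupied, declared_occupied, n):
    """Return (P_D, P_F) for n channels under the assumed definitions."""
    true_set, declared_set = set(true_occupied), set(declared_occupied)
    no_hit = len(true_set & declared_set)     # occupied and declared occupied
    no_miss = len(true_set - declared_set)    # occupied but not declared
    no_false = len(declared_set - true_set)   # declared but actually free
    n_free = n - len(true_set)
    # Assumption: P_D = hits / occupied channels, P_F = false alarms / free channels.
    p_d = no_hit / (no_hit + no_miss) if true_set else 0.0
    p_f = no_false / n_free if n_free else 0.0
    return p_d, p_f

# Toy example with n = 10 channels (illustrative values only).
p_d, p_f = detection_rates({1, 3, 5, 7}, {1, 3, 5, 8}, n=10)
print(p_d, p_f)
```

In this toy case three of the four occupied channels are found and one of the six free channels is wrongly declared occupied.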

At the beginning, we investigate the influence of the number of candidates *k* upon the detection of the TMOMMP algorithm, because the corresponding *iter*_{max} = ⌈*s*_{max}/*k*⌉ greatly affects the detection time. Therefore, a reasonable *k* requiring less detection time while providing equivalent detection accuracy should be determined. Selecting *k* from {1,5,10,20} and determining its corresponding *iter*_{max} subject to *k*∗*iter*_{max} = *s*_{max}=100, we experimentally obtain equivalent values of *P*_{D} and *P*_{F} for all choices. For the sake of brevity, we omit these experimental results and fix *k*=20 in the following experiments. In addition, to verify the average performance of our proposals in detecting occupied channels, we repeat the experiment with the same scenario *Q*=50 times and report the average values of *P*_{D} and *P*_{F} as well as the detection times.
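The iteration budget implied by each candidate number follows directly from *iter*_{max} = ⌈*s*_{max}/*k*⌉. A short sketch of that arithmetic for the values of *k* listed above (the variable names are illustrative):

```python
import math

s_max = 100                      # upper bound on the number of occupied channels
candidate_numbers = (1, 5, 10, 20)

# iter_max = ceil(s_max / k): larger k means fewer greedy iterations.
iteration_budget = {k: math.ceil(s_max / k) for k in candidate_numbers}

for k, iter_max in iteration_budget.items():
    print(f"k={k:2d} -> iter_max={iter_max:3d}, k*iter_max={k * iter_max}")
```

With *k*=20 the budget drops to 5 iterations, compared with 100 iterations for single-candidate selection (*k*=1).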

### 5.2 Detection accuracy versus sampling rate

Considering that the sampling rate generally reflects the transmission loss and report compression, we first perform numerical simulations to demonstrate its influence upon the detection performance. Fix the following parameters: number of occupied channels *s*=50, number of reports *p*=400, and SNR = 15 dB. The case *s*=50 corresponds to a spectrum utilization efficiency of 10 %, which is large enough in practice according to the FCC [41] and the spectrum utilization efficiency in Singapore [42]. In this part, the AOMP algorithm in [14], the jointly sparse recovery with missing observation (JSR-MO) algorithm in [17], and the centralized reweighted *l*_{q} (CRLq) algorithms (*q*=1,2) in [10] are also involved for comparison. Since the time-varying *s* is unknown in practice, only *s*_{max}=100 is available and is utilized in the TMOMMP and JSR-MO algorithms.

*P*_{D} and *P*_{F} for AWGN fading channel and plot them in Fig. 6. Obviously, from the perspective of *P*_{D}, CRL1 and CRL2 are superior to the others in case of serious loss since they can provide higher *P*_{D} when the sampling rate is small. However, when the sampling rate exceeds a threshold, namely 0.5, they are caught up with or even surpassed. On the other hand, CRL1 and CRL2 take thousands of seconds to finish the detection while the others require a few seconds, as shown in the next subsection. Generally speaking, among all other greedy-type algorithms, the JSR-MO algorithm seems to be the best except when the sampling rate falls between 0.1 and 0.2. Unfortunately, when the sampling rate is larger than 0.2, the higher *P*_{F} provided by JSR-MO will reduce the throughput [24] since some unoccupied channels are declared as occupied and not allocated to CR users. Compared with the AOMP algorithm, the proposed MOMMP-GD algorithm performs significantly better up to sampling rate=0.6. Afterwards, both MOMMP-GD and AOMP algorithms provide perfect detection, i.e., *P*_{D}≈1 and *P*_{F}≈0. The above superiority experimentally demonstrates the positive effect of the 1-norm in measuring the row vector and of orthogonal projection in recovering state data. Moreover, the TMOMMP algorithm is superior to the AOMP algorithm when the sampling rate falls between 0.25 and 0.45. In terms of *P*_{F}, the proposed TMOMMP and MOMMP-GD algorithms provide almost the same detection as the AOMP algorithm. Therefore, these results indicate that the MOMMP-GD algorithm is more suitable for practical application with serious transmission failure.

*P*_{D} and *P*_{F} in Fig. 7. Overall, although CRL1 and CRL2 can provide higher *P*_{D} when sampling rate≤0.4, their significantly high *P*_{F} and computational complexity limit their application in practice. With respect to *P*_{D}, the proposed TMOMMP algorithm is almost the best one in case of serious loss, and the JSR-MO algorithm outperforms it only when 0.45≤sampling rate≤0.7. Moreover, the proposed TMOMMP and MOMMP-GD algorithms always outperform the AOMP algorithm, except that AOMP is slightly superior to TMOMMP when 0.5≤sampling rate≤0.7. From the perspective of *P*_{F}, the MOMMP-GD algorithm is equivalent to the AOMP algorithm while the TMOMMP algorithm degrades when 0.2≤sampling rate≤0.5; both are much better than the CRL1, CRL2, and JSR-MO algorithms. Comparing the results in Figs. 6 and 7, we conclude that better detection performance is obtained when Rayleigh fading channel is utilized as a sensing channel. For instance, to obtain *P*_{D}≥0.9 from all three algorithms, sampling rate=0.47 is enough for Rayleigh fading channel while sampling rate=0.52 is needed for AWGN fading channels. Thus, the above results demonstrate that a proper choice of algorithm greatly benefits the detection in a certain fading channel.

Combining the above results, we conclude that our proposals provide a higher probability of exactly detecting occupied channels.
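The two ingredients credited in this subsection, scoring each channel by the 1-norm of its row and refining by orthogonal projection, can be illustrated with a generic multi-candidate greedy pursuit. The sketch below is not the TMOMMP or MOMMP-GD algorithm: it omits the threshold-based and Gaussian-distribution-based identification rules and any refinement step, and it uses a random Gaussian matrix rather than an MCSGD matrix. The function name, problem sizes, and noise-free setting are assumptions chosen for illustration.

```python
import numpy as np

def multi_candidate_pursuit(F, M, k, n_iter):
    """Greedy joint-support estimate that adds k new candidates per iteration."""
    n = F.shape[1]
    support = np.array([], dtype=int)
    residual = M.copy()
    coef = np.zeros((0, M.shape[1]))
    for _ in range(n_iter):
        # Score every channel by the 1-norm of its row in F^T R.
        h = np.abs(F.T @ residual).sum(axis=1)
        h[support] = -np.inf                  # never re-select a chosen channel
        candidates = np.argsort(h)[-k:]       # k largest scores
        support = np.union1d(support, candidates)
        # Orthogonal projection: least-squares fit of M on the selected columns.
        coef, *_ = np.linalg.lstsq(F[:, support], M, rcond=None)
        residual = M - F[:, support] @ coef
    X_hat = np.zeros((n, M.shape[1]))
    X_hat[support] = coef
    return support, X_hat

# Noise-free toy problem (hypothetical sizes): s occupied channels shared by m users.
rng = np.random.default_rng(1)
p, n, m, s = 256, 512, 6, 4
F = rng.standard_normal((p, n))
F /= np.linalg.norm(F, axis=0)
true_support = np.sort(rng.choice(n, size=s, replace=False))
X = np.zeros((n, m))
X[true_support] = rng.choice([-1.0, 1.0], size=(s, m)) * (1.0 + rng.random((s, m)))
M = F @ X

support, X_hat = multi_candidate_pursuit(F, M, k=4, n_iter=2)
print("true:", true_support, "estimated:", support)
```

Because several candidates are accepted per iteration, the returned support may contain extra channels; in the noise-free case their least-squares coefficients are close to zero, which is the kind of surplus a later thresholding step would be expected to remove.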

### 5.3 Detection time versus sampling rate

In addition to detection accuracy, computational complexity is also an important evaluation indicator. The reason is that, in order not to interfere with the PUs, CR users should detect their presence with a high probability and vacate the channels or reduce transmit power within a certain amount of time. In other words, the time window for detection should be small enough. In each window with fixed duration, the first part is used for performing spectrum sensing and the rest is reserved for transmitting data. During the sensing interval, all CR users keep quiet as enforced by protocols, e.g., at the media access control layer. Therefore, an efficient detection algorithm helps improve the throughput.
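The link between detection time and throughput can be seen from the window structure described above: whatever part of a fixed window is spent sensing is unavailable for data. A small sketch, assuming a simple model in which throughput scales with the fraction of the window left for transmission; the function name and the window and sensing durations are hypothetical and are not values from the experiments.

```python
def transmission_fraction(window_s, sensing_s):
    """Fraction of a fixed-duration window left for data after sensing."""
    if not 0 <= sensing_s <= window_s:
        raise ValueError("sensing time must lie within the window")
    return (window_s - sensing_s) / window_s

# Hypothetical 100 ms window: a 5 ms detector versus a 50 ms detector.
fast = transmission_fraction(0.100, 0.005)
slow = transmission_fraction(0.100, 0.050)
print(f"fast detector: {fast:.2f}, slow detector: {slow:.2f}")
```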

### 5.4 Detection accuracy versus SNR

*s*=50, *p*=400, and sampling rate=0.8, and varying SNR from −15 to 30 dB with increment 5 dB, we adopt AWGN fading channel and validate the detection performance of our proposals. From Fig. 9, we can see that the value of *P*_{D} increases with SNR. At a high noise level, e.g., SNR = −5 dB, the TMOMMP algorithm is obviously superior to the MOMMP-GD algorithm in the sense of *P*_{D}, while the MOMMP-GD algorithm significantly outperforms it with respect to *P*_{F} due to the non-existence of false alarm. In the MOMMP-GD algorithm, a high noise level will destroy the property of approximate Gaussian distribution, which results in the identification of fewer coordinates and hence small *P*_{D} and *P*_{F}. On the contrary, as for the TMOMMP algorithm in the same scenario, the range of difference between *h*_{i}, ∀*i*∈*Γ* and *h*_{i}, ∀*i*∉*Γ* is reduced. Thus, a larger *λ*_{1} is required to identify suitable coordinates. However, the utilization of *t*_{1} makes *λ*_{1} smaller, resulting in larger *P*_{D} and *P*_{F}. Therefore, we propose to choose the detection algorithm by balancing *P*_{D} and *P*_{F} in practical application.
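For readers reproducing an SNR sweep of this kind, the sketch below shows one way to corrupt a measurement matrix with white Gaussian noise at a prescribed SNR. It assumes the SNR is defined as the ratio of average signal power to average noise power over all entries of *M*; the section does not spell out the definition, so this is an assumption, as are the function name and the stand-in matrix.

```python
import numpy as np

def add_awgn(M, snr_db, rng):
    """Return (noisy M, noise) with noise scaled to the requested SNR in dB."""
    signal_power = np.mean(M ** 2)
    noise_power = signal_power / 10 ** (snr_db / 10)
    noise = np.sqrt(noise_power) * rng.standard_normal(M.shape)
    return M + noise, noise

# SNR grid used in this subsection: -15 dB to 30 dB in 5 dB steps.
snr_grid_db = list(range(-15, 31, 5))

rng = np.random.default_rng(0)
M = rng.standard_normal((400, 50))           # stand-in for the p x m reports
for snr_db in snr_grid_db:
    noisy, noise = add_awgn(M, snr_db, rng)
    measured = 10 * np.log10(np.mean(M ** 2) / np.mean(noise ** 2))
    print(f"target {snr_db:4d} dB, measured {measured:6.2f} dB")
```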

### 5.5 Detection accuracy versus number of CR users

*m* upon the detection performance over AWGN fading channel. Setting *s*=50, *p*=400, sampling rate=0.6, and SNR = 15 dB, and varying *m* from 1 to 10, we obtain and present the corresponding *P*_{D} and *P*_{F} in Fig. 10. Obviously, the MOMMP-GD algorithm is always superior to the TMOMMP algorithm except when *m* is about 2. To be specific, with detection accuracy constraint *P*_{D}≥0.95, the TMOMMP algorithm requires *m*≥5 CR users while the MOMMP-GD algorithm requires only *m*=4 CR users.

### 5.6 Detection accuracy versus number of reports

*F*∈*R*^{p×n} to obtain *p* incoherent linear combinations of *n* reports on each CR user, achieving report compression. Then, a problem intuitively arises: how many linear combinations are enough to guarantee an exact detection of *s* occupied channels? Here, we calculate *P*_{D} and *P*_{F} for different values of *p* to address the above problem. The input parameters are fixed as *s*=50, loss rate=1, and SNR = 15 dB. For each *p* varying from 100 to 400 with increment 25, we obtain the corresponding *P*_{D} and *P*_{F} in Fig. 11 to illustrate the influence of *p* over AWGN fading channel. Obviously, the TMOMMP algorithm always provides higher *P*_{D} than MOMMP-GD up to *p*=225. However, only when *p*≥175, the TMOMMP algorithm is also superior with respect to *P*_{F}. Thus, the TMOMMP algorithm is empirically validated to be a competitive detection algorithm.
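The amount of report compression at each point of this sweep is simple arithmetic on *p* and *n*. A short sketch for *n*=500 channels and the stated range of *p* (the variable names are illustrative):

```python
n = 500                                   # licensed channels
p_grid = list(range(100, 401, 25))        # p = 100, 125, ..., 400

# Compression ratio p/n and number of reports saved per CR user.
compression = {p: (p / n, n - p) for p in p_grid}

for p, (ratio, saved) in compression.items():
    print(f"p={p:3d}: p/n={ratio:.2f}, reports saved per user={saved}")
```

The sweep therefore covers 13 settings, from sending 20 % of the channel-state reports (*p*=100) to 80 % (*p*=400).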

## 6 Conclusions

In this paper, we first proposed MOMMP algorithms to efficiently detect occupied channels in CRN, in which multi-candidate identification reduces the number of required iterations without degrading the detection accuracy. Second, two common but different approaches were introduced to realize the multi-candidate identification. With prior knowledge of *s*_{max}, the TMOMMP algorithm iteratively identified a fixed number of coordinates until a preset maximum iteration number, followed by a threshold-based refinement. Alternatively, without any prior knowledge, the MOMMP-GD algorithm iteratively identified an uncertain number of coordinates until there was no new coordinate. Third, the deterministic matrix of low *t*-average coherence from the proposed MCSGD algorithm further improved the efficiency of CRN, since the number of reports was reduced under the detection performance constraint and the transmission of the deterministic filter coefficient matrix was unnecessary. Finally, numerical simulations under different influencing factors were performed to validate that our proposals provide satisfactory performance with respect to *P*_{D}, *P*_{F}, and detection time. Specifically, our proposals are more suitable for practical application with serious transmission failure and noise perturbation. In the future, we will investigate the application of the proposed MOMMP algorithms in other jointly sparse recovery problems.

## Declarations

### Acknowledgements

This work was financially supported by National Science Foundations of China (No.61174016).

**Open Access** This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

## References

1. J Mitola, GQ Maguire Jr, Cognitive radio: making software radios more personal. Pers. Commun. IEEE. 6(4), 13–18 (1999).
2. Y Zeng, Y-C Liang, AT Hoang, R Zhang, A review on spectrum sensing for cognitive radio: challenges and solutions. EURASIP J. Adv. Signal Process. 2010, 15 (2010).
3. A Ghasemi, ES Sousa, in New Frontiers in Dynamic Spectrum Access Networks (DySPAN), 2005 First IEEE International Symposium on. Collaborative spectrum sensing for opportunistic access in fading environments, (2005), pp. 131–136.
4. J Lundén, V Koivunen, A Huttunen, HV Poor, Collaborative cyclostationary spectrum sensing for cognitive radio systems. IEEE Trans. Signal Process. 57(11), 4182–4195 (2009).
5. L Zhang, Q Wu, G Ding, S Feng, J Wang, Performance analysis of probabilistic soft SSDF attack in cooperative spectrum sensing. EURASIP J. Adv. Signal Process. 2014(1), 1–9 (2014).
6. J Wang, S Feng, Q Wu, X Zheng, Y Xu, G Ding, A robust cooperative spectrum sensing scheme based on Dempster-Shafer theory and trustworthiness degree calculation in cognitive radio networks. EURASIP J. Adv. Signal Process. 2014(1), 1–12 (2014).
7. DL Donoho, Compressed sensing. IEEE Trans. Inf. Theory. 52(4), 1289–1306 (2006).
8. J Meng, W Yin, H Li, E Hossain, Z Han, Collaborative spectrum sensing from sparse observations in cognitive radio networks. IEEE J. Selected Areas Commun. 29(2), 327–337 (2011).
9. F Zeng, C Li, Z Tian, Distributed compressive spectrum sensing in cooperative multihop cognitive networks. IEEE J. Selected Topics Signal Process. 5(1), 37–48 (2011).
10. Q Ling, Z Wen, W Yin, Decentralized jointly sparse optimization by reweighted minimization. IEEE Trans. Signal Process. 61(5), 1165–1170 (2013).
11. M Mishali, YC Eldar, From theory to practice: Sub-nyquist sampling of sparse wideband analog signals. IEEE J. Sel. Top. Signal Process. 4(2), 375–391 (2010).
12. SG Mallat, Z Zhang, Matching pursuits with time-frequency dictionaries. IEEE Trans. Signal Process. 41(12), 3397–3415 (1993).
13. L Jiao, J Chen, J Wu, X Wang, S Zhang, Efficient collaborative spectrum sensing with low sample rate. Wirel. Pers. Commun. 67(4), 923–936 (2012).
14. C Jianrui, L Jiao, J Wu, X Wang, Compressive spectrum sensing in the cognitive radio networks by exploiting the sparsity of active radios. Wirel. Netw. 19, 661–671 (2013).
15. JA Tropp, AC Gilbert, Signal recovery from random measurements via orthogonal matching pursuit. IEEE Trans. Inf. Theory. 53(12), 4655–4666 (2007).
16. W Dai, O Milenkovic, Subspace pursuit for compressive sensing signal reconstruction. IEEE Trans. Inf. Theory. 55(5), 2230–2249 (2009).
17. S Jin, X Zhang, in Global Communications Conference (GLOBECOM), 2014 IEEE. Collaborative compressive spectrum sensing with missing observations for Cognitive Radio networks, (2014), pp. 828–833.
18. J Wang, S Kwon, B Shim, Generalized orthogonal matching pursuit. IEEE Trans. Signal Process. 60(12), 6202–6216 (2012).
19. DL Donoho, Y Tsaig, I Drori, J-L Starck, Sparse solution of underdetermined systems of linear equations by stagewise orthogonal matching pursuit. IEEE Trans. Inf. Theory. 58(2), 1094–1121 (2012).
20. EJ Candes, The restricted isometry property and its implications for compressed sensing. C. R. Math. 346(9), 589–592 (2008).
21. M Elad, Optimized projections for compressed sensing. IEEE Trans. Signal Process. 55(12), 5695–5702 (2007).
22. V Abolghasemi, S Ferdowsi, S Sanei, A gradient-based alternating minimization approach for optimization of the measurement matrix in compressive sensing. Signal Proc. 92(4), 999–1009 (2012).
23. Y Wang, Z Tian, C Feng, Sparsity order estimation and its application in compressive spectrum sensing for cognitive radios. IEEE Trans. Wireless Commun. 11(6), 2116–2125 (2012).
24. H Sun, W-Y Chiu, A Nallanathan, Compressive power spectral density estimation. Commun. Lett. IEEE. 16(11), 1812–1815 (2012).
25. MA Lexa, ME Davies, JS Thompson, J Nikolic, in Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on. Compressive power spectral density estimation, (2011), pp. 3884–3887.
26. DD Ariananda, G Leus, Compressive wideband power spectrum estimation. IEEE Trans. Signal Process. 60(9), 4775–4789 (2012).
27. C-P Yen, Y Tsai, X Wang, Wideband spectrum sensing based on sub-nyquist sampling. IEEE Trans. Signal Process. 61(12), 3028–3040 (2013).
28. D Dony Ariananda, D Romero, G Leus, in Sensor Array and Multichannel Signal Processing Workshop (SAM), 2014 IEEE 8th. Cooperative compressive power spectrum estimation, (2014), pp. 97–100.
29. P Pal, PP Vaidyanathan, in Digital Signal Processing Workshop and IEEE Signal Processing Education Workshop (DSP/SPE), 2011 IEEE. Coprime sampling and the MUSIC algorithm, (2011), pp. 289–294.
30. D Cohen, YC Eldar, Sub-nyquist sampling for power spectrum sensing in cognitive radios: A unified approach. IEEE Trans. Signal Process. 62(15), 3897–3910 (2014).
31. O Mehanna, N Sidiropoulos, Frugal sensing: Wideband power spectrum sensing from few bits. IEEE Trans. Signal Process. 61(10), 2693–2703 (2013).
32. G Leus, DD Ariananda, Power spectrum blind sampling. IEEE Signal Process. Lett. 18(8), 443–446 (2011).
33. D Romero, G Leus, Wideband spectrum sensing from compressed measurements using spectral prior information. IEEE Trans. Signal Process. 61(24), 6232–6246 (2013).
34. Wireless communications: principles and practice, 2nd Edition (Prentice Hall, New Jersey, NJ, USA, 2012).
35. JA Tropp, Greed is good: Algorithmic results for sparse approximation. IEEE Trans. Inf. Theory. 50(10), 2231–2242 (2004).
36. Matrix analysis, 2nd Edition (Cambridge university press, New York, NY, USA, 2012).
37. J Tropp, IS Dhillon, RW Heath Jr, T Strohmer, et al, Designing structured tight frames via an alternating projection method. IEEE Trans. Inf. Theory. 51(1), 188–209 (2005).
38. R Mittra, CH Chan, T Cwik, Techniques for analyzing frequency selective surfaces-a review. Proceedings of the IEEE. 76(12), 1593–1615 (1988).
39. JA Tropp, MB Wakin, MF Duarte, D Baron, RG Baraniuk, in Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on, 3. Random filters for compressive sampling and reconstruction, (2006), pp. III 872–III 875.
40. J Zhang, Y Ban, D Zhu, G Zhang, Random filtering structure-based compressive sensing radar. EURASIP J. Adv. Signal Process. 2014(1), 1–11 (2014).
41. Federal Communications Commission and others, Spectrum policy task force report. ET Docket. 2(135), (2002).
42. MH Islam, CL Koh, SW Oh, X Qing, YY Lai, C Wang, et al, in Cognitive Radio Oriented Wireless Networks and Communications, 2008. CrownCom 2008. 3rd International Conference on. Spectrum survey in Singapore: Occupancy measurements and analyses, (2008), pp. 1–7.