Universal and efficient compressed sensing by spread spectrum and application to realistic Fourier imaging techniques

We advocate a compressed sensing strategy that consists of multiplying the signal of interest by a wide bandwidth modulation before projection onto randomly selected vectors of an orthonormal basis. Firstly, in a digital setting with random modulation, considering a whole class of sensing bases including the Fourier basis, we prove that the technique is universal in the sense that the required number of measurements for accurate recovery is optimal and independent of the sparsity basis. This universality stems from a drastic decrease of coherence between the sparsity and the sensing bases, which for a Fourier sensing basis relates to a spread of the original signal spectrum by the modulation (hence the name"spread spectrum"). The approach is also efficient as sensing matrices with fast matrix multiplication algorithms can be used, in particular in the case of Fourier measurements. Secondly, these results are confirmed by a numerical analysis of the phase transition of the l1- minimization problem. Finally, we show that the spread spectrum technique remains effective in an analog setting with chirp modulation for application to realistic Fourier imaging. We illustrate these findings in the context of radio interferometry and magnetic resonance imaging.


Introduction
In this section we concisely recall some basics of compressed sensing, emphasizing on the role of mutual coherence between the sparsity and sensing bases.We discuss the interest of improving the standard acquisition strategy in the context of Fourier imaging techniques such as radio interferometry and magnetic resonance imaging (MRI).Finally, we highlight the main contributions of our work, advocating a universal and efficient compressed sensing strategy coined spread spectrum, and describe the organization of this article.

Compressed sensing basics
Compressed sensing is a recent theory aiming at merging data acquisition and compression [1][2][3][4][5][6][7].It predicts that sparse or compressible signals can be recovered from a small number of linear and non-adaptative measurements.In this context, Gaussian and Bernouilli random matrices, respectively with independent standard normal and ±1 entries, have encountered a particular interest as they provide optimal conditions in terms of the number of measurements needed to recover sparse signals [3][4][5].However, the use of these matrices for real-world applications is limited for several reasons: no fast matrix multiplication algorithm is available, huge memory requirements for large scale problems, difficult implementation on hardware, etc.
Let us consider s-sparse digital signals x ∈ C N in an orthonormal basis Ψ = (ψ 1 , ..., ψ N ) ∈ C N ×N .The decomposition of x in this basis is denoted α = (α i ) 1 i N ∈ C N , α = Ψ * x (• * denotes the conjugate transpose), and contains s non-zero entries.The original signal x is then probed by projection onto m randomly selected vectors of another orthonormal basis Φ = (φ 1 , ..., φ N ) ∈ C N ×N .The indices Ω = {l 1 , . . ., l m } of the selected vectors are chosen independently and uniformly at random from {1, . . ., N }.We denote Φ * Ω the m × N matrix made of the selected rows of Φ * .The measurement vector y ∈ C m thus reads as We also denote A = Φ * Ψ ∈ C N ×N .Finally, we aim at recovering α by solving the 1 -minimization problem α = arg min where α 1 = N i=1 |α i | (|•| denotes the complex magnitude).The reconstructed signal x satisfies x = Ψα .
The theory of compressed sensing already demonstrates that a small number m N of random measurements are sufficient for an accurate and stable reconstruction of x [6,7].However, the recovery conditions depend on the mutual coherence µ between Φ and Ψ.This value is a similarity measure between the sensing and sparsity bases.It is defined as µ = max 1 i,j N | φ i , ψ j | and satisfies N −1/2 µ 1.The performance is optimal when the bases are perfectly incoherent, i.e. µ = N −1/2 , and unavoidably decreases when µ increases.

Fourier imaging applications and mutual coherence
The dependence of performance on the mutual coherence µ is a key concept in compressed sensing.It has significant implications for Fourier imaging applications, in particular radio interferometry or magnetic resonance imaging (MRI), where signals are probed in the orthonormal Fourier basis.In radio interferometry, one of the main challenges is to reconstruct accurately the original signal from a limited number of accessible measurements [8][9][10][11][12].In MRI, accelerating the acquisition process by reducing the number of measurements is of huge interest in, for example, static and dynamic imaging [13][14][15][16][17], parallel MRI [18][19][20], or MR spectroscopic imaging [21][22][23].The theory of compressed sensing shows that Fourier acquisition is the best sampling strategy when signals are sparse in the Dirac basis.The sensing system is indeed optimally incoherent.Unfortunately, natural signals are usually rather sparse in multi-scale bases, e.g., wavelet bases, which are coherent with the Fourier basis.Many measurements are thus needed to reconstruct accurately the original signal.In the perspective of accessing better performance, sampling strategies that improve the incoherence of the sensing scheme should be considered.

Main contributions and organization
In the present work, we advocate a compressed sensing strategy coined spread spectrum that consists of a wide bandwidth pre-modulation of the signal x before projection onto randomly selected vectors of an orthonormal basis.In the particular case of Fourier measurements, the pre-modulation amounts to a convolution in the Fourier domain which spreads the power spectrum of the original signal x (hence the name "spread spectrum"), while preserving its norm.Equivalently, this spread spectrum phenomenon acts on each sparsity basis vector describing x so that information of each of them is accessible whatever the Fourier coefficient selected.This effect implies a decrease of coherence between the sparsity and sensing bases and enables an enhancement of the reconstruction quality.
In Section 2, we study the spread spectrum technique in a digital setting for arbitrary pairs of sensing and sparsity bases (Φ, Ψ).We consider a digital pre-modulation c = (c l ) 1 l N ∈ C N with |c l | = 1 and random phases identifying a random Rademacher or Steinhaus sequence.We show that the recovery conditions do not depend anymore on the coherence of the system but on a new parameter β (Φ, Ψ) called modulus-coherence and defined as where φ ki and ψ kj are respectively the k th entries of the vectors φ i and ψ j .We then show that this parameter reaches its optimal value β (Φ, Ψ) = N −1/2 whatever the sparsity basis Ψ, for particular sensing matrices Φ including the Fourier matrix, thus providing universal recovery performances.It is also efficient as sensing matrices with fast matrix multiplication algorithms can be used, thus reducing the need in memory requirement and computational power.In Section 3, these theoretical results are confirmed numerically through an analysis of the empirical phase transition of the 1 -minimization problem for different pairs of sensing and sparsity bases.In Section 4, we show that the spread spectrum technique remains effective in an analog setting with chirp modulation for application to realistic Fourier imaging, and illustrate these findings in the context of radio interferometry and MRI.Finally, we conclude in Section 5.
In the context of compressed sensing, the spread spectrum technique was already briefly introduced by the authors for compressive sampling of pulse trains in [24], applied to radio interferometry in [25,26] and to MRI in [27][28][29][30].This paper provides theoretical foundations for this technique, both in the digital and analog settings.Note that other acquisition strategies can be related to the spread spectrum technique as discussed in Section 2.5.
Let us also acknowledge that spread spectrum techniques are very popular in telecommunications.For example, one can cite the direct sequence spread spectrum (DSSS) and the frequency hopping spread spectrum (FHSS) techniques.The former is sometimes used over wireless local area networks, the latter is used in Bluetooth systems [31].In general, spread spectrum techniques are used for their robustness to narrowband interference and also to establish secure communications.

Compressed sensing by spread spectrum
In this section, we first recall the standard recovery conditions of sparse signals randomly sampled in a bounded orthonormal system.These recovery results depend on the mutual coherence µ of the system.Hence, we study the effect of a random pre-modulation on this value and deduce recovery conditions for the spread spectrum technique.We finally show that the number of measurements needed to recover sparse signals becomes universal for a family of sensing matrices Φ which includes the Fourier basis.

Recovery results in a bounded orthonormal system
For the setting presented in Section 1, the theory of compressed sensing already provides sufficient conditions on the number of measurements needed to recover the vector α from the measurements y by solving the 1 -minimization problem (2) [6,7].
Let us acknowledge that even if the measurements are corrupted by noise or if α is non-exactly sparse, the theory of compressed sensing also shows that the reconstruction obtained by solving the 1 -minimization problem remains accurate and stable: Theorem 2 ( [7], Theorem 4.4).Let A = Φ * Ψ, Ω = {l 1 , . . ., l m } be a set of m indices chosen independently and uniformly at random from {1, . . ., N }, and T s (α) be the best s-sparse approximation of the (possibly non-sparse) vector α ∈ C N .Let the noisy measurements y = A Ω α + n ∈ C m be given with For some universal constants D, E > 0 and γ > 1, if relation (4) holds, then the solution α of the 1 -minimization problem with probability at least 1 − N −γ log 3 (N ) .
In the above theorems, the role of the mutual coherence µ is crucial as the number of measurements needed to reconstruct x scales quadratically with its value.In the worst case where Φ and Ψ are identical, µ = 1 and the signal x is probed in a domain where it is also sparse.According to relation (4), the number of measurements necessary to recover x is of order N .This result is actually very intuitive.For an accurate reconstruction of signals sampled in their sparsity domain, all the non-zero entries need to be probed.It becomes highly probable when m N .On the contrary, when Φ and Ψ are as incoherent as possible, i.e., µ = N −1/2 , the energy of the sparsity basis vectors spreads equally over the sensing basis vectors.Consequently, whatever the sensing basis vector selected, one always gets information of all the sparsity basis vectors describing the signal x, therefore reducing the need in the number of measurements.This is confirmed by relation ( 4) which shows that the number of measurements is of the order of s when µ = N −1/2 .To achieve much better performance when the mutual coherence is not optimal, one would naturally try to modify the measurement process to achieve a better global incoherence.We will see in the next section that a simple random pre-modulation is an efficient way to achieve this goal whatever the sparsity matrix Ψ.

Pre-modulation effect on the mutual coherence
The spread spectrum technique consists of pre-modulating the signal x by a wide-band signal c = (c l ) 1 l N ∈ C N , with |c l | = 1 and random phases, before projecting the resulting signal onto m vectors of the basis Φ.The measurement vector y thus satisfies where the additional matrix C ∈ R N ×N stands for the diagonal matrix associated to the sequence c.
In this setting, the matrix A c is orthonormal.Therefore, the recovery condition of sparse signals sampled with this matrix depends on the mutual coherence µ = max 1 i,j N | φ i , C ψ j |.With a pre-modulation by a random Rademacher or Steinhaus sequence, Lemma 1 shows that the mutual coherence µ is essentially bounded by the modulus-coherence β (Φ, Ψ) defined in equation ( 3).Lemma 1.Let ∈ (0, 1), c ∈ C N be a random Rademacher or Steinhaus sequence and C ∈ C N ×N be the associated diagonal matrix.Then, the mutual coherence µ = max with probabilty at least 1 − .
The proof of Lemma 1 relies on a simple application of the Hoeffding's inequality and the union bound.

Sparse recovery with the spread spectrum technique
Combining Theorem 1 with the previous estimate on the mutual coherence, we can state the following theorem: 2, be a random Rademacher or Steinhaus sequence, C ∈ C N ×N be the diagonal matrix associated to c, α ∈ C N be an s-sparse vector, Ω = {l 1 , . . ., l m } be a set of m indices chosen independently and uniformly at random from {1, . . ., N }, and then α is the unique minimizer of the 1 -minimization problem ( 2) with probability at least 1 − O (N −ρ ).
Proof.It is straightforward to check that C * C = CC * = I where I is the identity matrix.The matrix A c = Φ * CΨ is thus orthonormal and Theorem 1 applies.To keep the notations simple, let us denote F the event of failure of the 1 -minimization problem (2), X the event m CN µ 2 s log 4 (N ), and Y the event β (Φ, Ψ) 2 log (2N 2 / ) µ.According to Theorem 1 and Lemma 1, the probability of F given X satisfies and the probability of Y satisfies P(Y ) 1 − .We will shortly see that for a proper choice of , when condition ( 9) holds, we have Using this fact, we compute the probability of failure P(F ) of the 1 minimization problem.We start by noticing that where X c denotes the complement of event X.In the first inequality, the probability P(X) and P(F |X c ) are saturated to 1.One can also note that if β (Φ, Ψ) 2 log (2N 2 / ) µ, i.e., Y occurs, condition (10) implies that m CN µ 2 s log 4 (N ), i.e., X occurs.Therefore P(X|Y ) = 1, P(X c |Y ) = 0 and The probability of failure is thus bounded above by N −γ log 3 (N ) + .Consequently, if condition (10) holds with = N −ρ and 0 < ρ < log 3 (N ), α is the unique minimizer of the 1 -minimization problem (2) with probability at least 1 − O(N −ρ ).
Finally, noticing that for = N −ρ with N 2, condition (10) always holds when condition (9), with Note that relation (9) also ensures the stability of the spread spectrum technique relative to noise and compressibility by combination of Theorem 2 and Lemma 1.

Universal sensing bases with ideal modulus-coherence
Theorem 3 shows that the performance of the spread spectrum technique is driven by the modulus-coherence β (Φ, Ψ).In general the spread spectrum technique is not universal and the number of measurements required for accurate reconstructions depends on the value of this parameter.Definition 1. (Universal sensing basis) An orthonormal basis Φ ∈ C N ×N is called a universal sensing basis if all its entries φ ki , 1 k, i N , are of equal complex magnitude.
For universal sensing bases, e.g. the Fourier transform or the Hadamard transform, we have |φ ki | = N −1/2 for all 1 k, i N .It follows that β (Φ, Ψ) = N −1/2 and µ N −1/2 , i.e. its optimal value up to a logarithmic factor, whatever the sparsity matrix considered!For such sensing matrices, the spread spectrum technique is thus a simple and efficient way to render a system incoherent independently of the sparsity matrix.
Corollary 1. (Spread spectrum universality) Let c ∈ C N , with N 2, be a random Rademacher or Steinhaus sequence, C ∈ C N ×N be the diagonal matrix associated to c, α ∈ C N be an s-sparse vector, Ω = {l 1 , . . ., l m } be a set of m indices chosen independently and uniformly at random from {1, . . ., N }, and then α is the unique minimizer of the 1 -minimization problem ( 2) with probability at least 1 − O (N −ρ ).
For universal sensing bases, the spread spectrum technique is thus universal : the recovery condition does not depend on the sparsity basis and the number of measurements needed to reconstruct sparse signals is optimal in the sense that it is reduced to the sparsity level s.The technique is also efficient as the pre-modulation only requires a sample-by-sample multiplication between x and c.Furthermore, fast multiplication matrix algorithms are available for several universal sensing bases such as the Fourier or Hadamard bases.
In light of Corollary 1, one can notice that sampling sparse signals in the Fourier basis is a universal encoding strategy whatever the sparsity basis Ψ -even if the original signal is itself sparse in the Fourier basis!We will confirm these results experimentally in Section 3.

Related work
Let us acknowledge that the techniques proposed in [32][33][34][35][36][37] can be related to the spread spectrum technique.The benefit of a random pre-modulation in the measurement system is already briefly suggested in [32].The proofs of the claims presented in that conference paper have very recently been accepted for publication in [33] during the review process of this paper.The authors obtain similar recovery results as those presented here.In [34], the author proposes to convolve the signal x with a random waveform and randomly under-sample the result in time-domain.The random convolution is performed through a random pre-modulation in the Fourier domain and the signal thus spreads in time-domain.In our setting, this method actually corresponds to taking Φ as the Fourier matrix and Ψ as the composition of the Fourier matrix and the initial sparsity matrix.In [35], the authors propose a technique to sample signals sparse in the Fourier domain.They first pre-modulate the signal by a random sequence, then apply a low-pass antialiasing filter, and finally sample it at low rate.Finally, random pre-modulation is also used in [36] and [37] but for dimension reduction and low dimensional embedding.
We recover similar results, albeit in a different way.We also have a more general interpretation.In particular, we proved that changing the sensing matrix from the Fourier basis to the Hadamard does not change the recovery condition (11).

Numerical simulations
In this section, we confirm our theoretical predictions by showing, through a numerical analysis of the phase transition of the 1 -minimization problem, that the spread spectrum technique is universal for the Fourier and Hadamard sensing bases.

Settings
For the first set of simulations, we consider the Dirac, Fourier and Haar wavelet bases as sparsity basis Ψ and choose the Fourier basis as the sensing matrix Φ.We generate complex s-sparse signals of size N = 1024 with s ∈ {1, . . ., N }.The positions of the non-zero coefficients are chosen uniformly at random in {1, . . ., N }, their phases are set by generating a Steinhaus sequence, and their amplitudes follow a uniform distribution over [0, 1].The signals are then probed according to relation ( 1) or (7) and reconstructed from different number of measurements m ∈ {s, . . ., 10s} by solving the 1 -minimization problem (2) with the SPGL1 toolbox [38,39].For each pair (m, s), we compute the probability of recovery1 over 100 simulations.
For the second set of simulations, the same protocol is applied with the same sparsity basis but with the Hadamard basis as the sensing matrix Φ.

Results
Figure 1 shows the phase transitions of the 1 -minimization problem obtained for sparse signals in the Dirac, Haar and Fourier sparsity bases and probed in the Fourier basis with and without random pre-modulation.Figure 2 shows the same graphs but with measurements performed in the Hadamard basis.
In the absence of pre-modulation, one can note that the phase transitions depend on the mutual coherence of the system as predicted by Theorem 1.For the pairs Fourier-Dirac and Hadamard-Dirac, the mutual coherence is optimal and the experimental phase transitions match the one of Donoho-Tanner (dashed green line) [5].For all the other cases, the coherence is not optimal and the region where the signals are recovered is much smaller.The worst case is obtained for the pair Fourier-Fourier for which µ = 1.
In the presence of pre-modulation, Corollary 1 predicts that the performance should not depend on the sparsity basis and should become optimal.It is confirmed by the phases transition showed on Figures 1 and  2 as they all match the phase transition of Donoho-Tanner, even for the pair Fourier-Fourier!

Application to realistic Fourier imaging
In this section, we discuss the application of the spread spectrum technique to realistic analog Fourier imaging such as radio interferometric imaging or MRI.Firstly, we introduce the exact sensing matrix needed to account for the analog nature of the imaging problem.Secondly, while our original theoretical results strictly hold only in a digital setting, we derive explicit performance guarantees for the analog version of the spread spectrum technique.We also confirm on the basis of simulations that the spread spectrum technique drastically enhances the quality of reconstructed signals.

Sensing model
Radio interferometry dates back to more than sixty years ago [40][41][42][43].It allows observations of the sky with angular resolutions and sensitivities inaccessible with a single telescope.In a few words, radio telescope arrays synthesize the aperture of a unique telescope whose size would be the maximum projected distance between two telescopes of the array on the plane perpendicular to line of sight.Considering small field of views, the signal probed can be considered as a planar image on the plane perpendicular to the pointing direction of the instrument.Measurements are obtained through correlation of the incoming electric fields between each pair of telescopes.As showed by the van Cittert-Zernike theorem [43], these measurements correspond to the Fourier transform of the image multiplied by an illumination function.In general, the number of spatial frequencies probed are much smaller than the number of frequencies required by the Nyquist-Shannon theorem, so that the Fourier coverage is incomplete.An ill-posed inverse problem is thus defined for reconstruction of the original image.To address this problem, approaches based on compressed sensing have recently been developed [10][11][12].MR images are created by nuclear magnetic resonance in the tissues to be imaged.Standard MR measurements take the form of Fourier (also called k-space) coefficients of the image of interest.These measurements are obtained by application of linear gradient magnetic fields that provides the Fourier coefficient of the signal at a spatial frequency proportional the gradient strength and its duration of application.Accelerating the acquisition process, or equivalently increasing the achievable resolution for a fixed acquisition time, is of major interest for MRI applications.To address this problem, recent approaches based on compressed sensing seek to reconstruct the signal from incomplete information.In this context, several approaches have been designed [13,[28][29][30][44][45][46][47][48].
In light of the results of Section 2, Fourier imaging is a perfect framework for the spread spectrum technique, apart from the analog nature of the corresponding imaging problems.In the quoted applications, the random pre-modulation is replaced by a linear chirp pre-modulation [25][26][27][28][29][30].In radio interferometry, this modulation is inherently part of the acquisition process [25,26].In MRI, it is easily implemented through the use of dedicated coils or RF pulses [29,30].For two-dimensional signals, the linear chirp with chirp rate w ∈ R reads as a complex-valued function c : τ → e iπwτ 2 of the spatial variable τ ∈ R 2 .Note that for high chirp rates w, i.e. for chirp whose band-limit is of the same order of the band-limit of the signal, this chirp shares the following important properties with the random modulation: it is a wide-band signal which does not change the norm of the signal x, as |c(τ In this setting, the complete linear relationship between the signal and the measurements is given by In the above equation, the matrix U represents an up-sampling operator needed to avoid any aliasing of the modulated signal due to a lack of sampling resolution in a digital description of the originally analog problem.The convolution in Fourier space induced by the analog modulation implies, in contrast with the digital setting studied before, that the band limit of the modulated signal is the sum of the individual band limits of the original signal and of the chirp c.We assume here that, on its finite field of view L, the signal x is approximately band-limited with a cut-off frequency at B, i.e., its energy beyond the frequency B is negligible.The signal x is thus discretized on a grid of N = 2LB points.On this field of view L, the linear chirp c may be approximated by a band limited function of band limit identified by its maximum instantaneous frequency |w|L/2.This band limit can also be parametrized in terms of a discrete chirp rate w = wL 2 /N and thus |w|L/2 = | w| B. Therefore, an up-sampled grid with at least N w = (1 + | w|)N points needs to be considered and the modulated signal is correctly obtained by applying the chirp modulation on the signal after up-sampling on the N w points grid2 .The up-sampling operator U, implemented in Fourier space by zero padding, is of size N w × N and satisfies U * U = I ∈ C N ×N .Finally, the matrix C ∈ C Nw×Nw is the diagonal matrix implementing the chirp modulation on this up-sampled grid and the matrix F = (f i ) 1 i Nw ∈ C Nw×Nw stands for the discrete Fourier basis on the same grid.The indices Ω = {l 1 , . . ., l m } of the Fourier vectors selected to probe the signal are chosen independently and uniformly at random from {1, . ., N w }.

Illustration
Up to the introduction of the matrix U and the substitution of the linear chirp modulation for the random modulation, we are in the same setting as the one studied in Section 2. To illustrate the effectiveness of the spread spectrum technique, we consider two images of size N = 256 × 256 showed in Figure 3.The first image shows the radio emission associated with the encounter of a galaxy with its northern neighbor.It was acquired with the Very Large Array in New Mexico [49].The second image shows a brain acquired in an MRI scanner.This image is part of the Brainix database [50].These images are probed according to relation (12) in the absence ( w = 0) and presence ( w = 0.1) of a linear chirp modulation.Independent and identically distributed Gaussian noise with zero-mean is also added to the measurements.The variance σ 2 of the noise is defined such that the input snr = −10 log 10 σ 2 /Σ 2 x is 30 dB (Σ x stands for the sampled standard deviation of x).The images are reconstructed from m = 0.4N complex Fourier measurements by solving the 1 -minimization problem (5).Note that in order to stay in the setting of the theorems presented so far, no reality constraint is enforced in the reconstructions, so that the reconstructed images are complex valued.The sparsity bases Ψ used are the Daubechies-6 and Haar wavelet bases for the galaxy and the brain respectively.In each case, 20 reconstructions are performed for different noise and mask realizations.The complex magnitudes of reconstructed images with median mean squared errors are presented in Figure 3.
In the absence of linear chirp modulation, the quality of the reconstructed image is very low.However, one can already note that the fine scale structures are much better reconstructed than those at large scales.The fine details live at the small scales of the wavelet decomposition whereas the large structures live at larger scales.The small scale wavelets being more incoherent with the Fourier basis than the larger wavelets, the high frequency details are naturally better recovered.
In the presence of the linear chirp modulation, all the wavelets in Ψ become optimally incoherent with the Fourier basis thanks to the universality of the spread spectrum technique.Consequently, as one can observe on Figure 3, the low and high frequency details are better reconstructed and the image quality is drastically enhanced.Note that much better reconstructions can be obtained for the brain image by substituting the Total Variation norm 3 for the 1 norm in (5) [13,30].However, Theorems 1 and 3 do not hold for such a norm.
Let us acknowledge that these simulations are not fully realistic.For example, in radio-interferometry, the spatial frequency cannot be chosen at random.To simulate realistic acquisitions, one would have to consider non-random measurements in the continuous Fourier plane.Such a study is beyond the scope of this work.However, in the context of MRI, part of the authors implemented and tested this technique on a real scanner with in vivo acquisitions [29,30].

Modified recovery condition
Because of the modifications introduced in (12) to account for the analog nature of the problem, the digital theory associated with the measurement matrix (7) does not explicitly apply.Nevertheless, the previous illustration shows that the spread spectrum technique is indeed still very effective in this analog setting.Actually, performance guarantees similar to Theorem 1 may be obtained in this setting.
then α is the unique minimizer of the 1 -minimization problem (2) with probability at least 1 − N −γ log 3 (N ) .
3 1 norm of the magnitude of the gradient.
Proof.The proof follows directly from Theorem 4.4 in [7].Indeed, Theorem 4.4 applies to any matrices A Ω associated to an orthonormal system (with respect to the probability measure used to draw Ω) that satisfies the so-called boundedness condition (see Section 4.1 of [7] for more details).
Let us denote Ãw = (ã ik ) 1 i,k N = N 1/2 w A w the normalized measurement matrix.It is easy to check that this new matrix is orthonormal relative to the discrete uniform probability measure on {1, . . ., N w }.
w µ w .Applying Theorem 4.4 in [7] to the matrix Ãw terminates the proof.
Note that Theorem 4.4 in [7] also ensures that our analog sensing scheme is stable relative to noise and non exact sparsity if condition ( 13) is satisfied.Also note that one can obtain a similar results using Theorem 1.1 and 1.2 of the very recently accepted paper [51].
In view of this theorem, one can notice that the number of measurements needed for accurate reconstructions of sparse signals is proportional to the sparsity s times the product N w µ 2 w , with factors depending on the chirp rate w.In the analog framework, two effects are actually competing.On the one hand, the mutual coherence µ 2 w of the system is decreasing with the spread spectrum phenomenon, but, on the other hand, the number of accessible frequencies N w that bear information is increasing linearly with the chirp rate.The optimal recovery conditions are reached for a chirp rate w that ensures that the product N w µ 2 w is at its minimum.This minimum might change depending on the sparsity matrix, so the universality of the recovery is formally lost.
To illustrate this effect, Table 1 shows values of the product N w µ 2 w for different chirp rates w and two different sparsity matrices: the Fourier and the Dirac bases.The values are computed numerically for a size of signal N = 1024.In the case of the Dirac basis, one can notice that the product N w µ 2 w slightly increases with the chirp rate, thus predicting that the performance should even slightly diminish in the presence of a chirp.On the contrary, the product is drastically reduced for the Fourier basis as w increases, predicting a significant performance improvement in the presence of chirp modulation.

Experiments
To confirm the theoretical predictions of the previous section, we consider the Dirac and Fourier bases as sparsity matrices Ψ.We then generate complex s-sparse signals of size N = 1024 with s = 10.The positions of the non-zero coefficients are chosen uniformly at random in {1, . . ., N }, their signs are set by generating

Sparsity basis
Dirac Fourier N w µ 2 w at w = 0 1.00For each pair (m, s), we compute the probability of recovery over 100 simulations for different chirp rate w ∈ {0, 0.1, 0.25, 0.5}. Figure 4 shows the probability of recovery as a function of the number of measurements.Firstly, in the case where Ψ is the Dirac basis, one can notice that the number of measurements needed to reach a probability of recovery of 1 slightly increases with the chirp rate w.This is in line with the value in Table 1 and Theorem 4.
Secondly, in the case where Ψ is the Fourier basis, the performance becomes much better in the presence of a chirp.As predicted by the value in Table 1, the improvement is drastic when w goes from 0 to 0.1 and then starts to saturate between 0.1 and 0.5.
Thirdly, according to Table 1, the product N w µ 2 w is equal to 4.13 for the Fourier basis when w = 0.5.This is nearly the value obtained with the Dirac basis for the same chirp rate, suggesting the same probability of recovery for the same number of measurements.Indeed, one can notice on Figure 4 that the number of measurement needed to reach a probability of recovery of 1 is around 100 in both cases.
Finally, these results also suggest that the spread spectrum technique in the modified setting is almost universal in practice.Indeed, for the perfectly incoherent pair Fourier-Dirac of sensing-sparsity bases, the number of measurements needed for perfect recovery is around 100 and this number remains almost unchanged in presence of the linear chirp modulation.Furthermore, for the pair Fourier-Fourier, the spread spectrum technique allows to reduce the number of measurements for perfect recovery close to this optimal value.

Conclusion
We have presented a compressed sensing strategy that consists of a wide bandwidth pre-modulation of the signal of interest before projection onto randomly selected vectors of an orthonormal basis.In a digital setting with a random pre-modulation, the technique was proved to be universal for sensing bases such as the Fourier or Hadamard bases, where it may be implemented efficiently.Our results were confirmed through a numerical analysis of the phase transition of the 1 -minimization problem for different pairs of sensing and sparsity bases.
The spread spectrum technique was also shown to be of great interest for realistic analog Fourier imaging.In applications such as radio interferometry and MRI, the originally digital random pre-modulation may be mimicked by an analog linear chirp.Explicit performance guarantees for the analog version of the technique with a chirp modulation were derived.It shows that recovery results are still enhanced in this setting, though universality does not strictly hold anymore.Numerical simulations have shown that the quality of reconstructed signals is drastically enhanced in this more realistic setting, also for pairs of sensing-sparsity bases initially highly coherent, such as the Fourier-Fourier pair.

Figure 1 :Figure 2 :
Figure 1: Phase transition of the 1 -minimization problem for different sparsity bases and random selection of Fourier measurements without (left panels) and with (right panels) random modulation.The sparsity bases considered are the Dirac basis (top), the Haar wavelet basis (center), and the Fourier basis (bottom).The dashed green line indicates the phase transition of Donoho-Tanner [5].The color bar goes from white to black indicating a probability of recovery from 0 to 1.

1 Figure 3 :
Figure 3: Top panels: Image of the giant elliptical galaxy NGC1316 (center of the image) devouring its small northern neighbor.The image shows the radio emission associated with this encounter superimposed on an optical image.The radio emission was imaged using the Very Large Array in New Mexico (Image courtesy of NRAO/AUI and J. M. Uson).The image size is N = 256 × 256.Bottom panels: MRI image of a brain from the Brainix database.From left to right: original image; complex magnitudes of the reconstructed images from m = 0.4N measurements without chirp modulation; complex magnitudes of the reconstructed images from m = 0.4N measurements with chirp modulation.

Theorem 4 .
Let A w = F * CUΨ ∈ C m×N , µ w = max 1 i,j N | f i , CUψ j |, α ∈ C N bean s-sparse vector, Ω = {l 1 , . . ., l m } be a set of m indices chosen independently and uniformly at random from {1, . . ., N w }, and y = A c Ω α ∈ C m .For some universal constants C > 0 and γ > 1, if the number of measurements m satisfies m CN w µ 2 w s log 4 (N ), (

5 Figure 4 :
Figure 4: Probability of recovery of 10-sparse signals as a function of the number measurement m obtained with the measurement matrix (12) for two different sparsity basis: the Dirac basis (left) and the Fourier basis (right).The continuous black curve corresponds to the probability of recovery for w = 0.The dot-dashed blue curve corresponds to the probability of recovery for w = 0.1.The dashed black curve corresponds to the probability of recovery for w = 0.25.The continuous red curve corresponds to the probability of recovery for w = 0.5.

Table 1 :
Influence of a chirp modulation on N w µ 2 w .