 Research
 Open Access
 Published:
Roles of equalization in radar imaging: modeling for superesolution in 3D reconstruction
EURASIP Journal on Advances in Signal Processing volume 2012, Article number: 106 (2012)
Abstract
In radar imaging, resolution is generally dictated by its corresponding system point spread function, the response to a point source as a result of an external excitation. This notion of resolution turns out to be rather questionable, as the interpretation of echoes received from a range of continuous targets according to a linear model allows one to cast the imaging problem as a communication system that maps the target reflectivity function onto measurements, which in turn suggests that by virtue of sampling and equalization, one can achieve unlimited spatial resolution. This article reviews the fundamental problem inherent to pulse compression in a multistatic multiinputmultioutput (MIMO) scenario, from a communications viewpoint, in both focused and unfocused scenarios. We generalize the notion of 1D range compression and replace it by a more general 4D pulse compression. The process of focusing and scanning over a 3D object can be interpreted as a MIMO 4D convolution between a reflectivity tensor and a spacevarying system, which naturally induces a 4D MIMO channel convolution model. This implies that several wellestablished block and linear equalization methods can be easily extended to a 3D scenario with the purpose of achieving exact reconstruction of a given reflectivity volume. That is, assuming that no multiple scattering occurs, resolution is only limited in range by the sampling device in the unfocused case, while unlimited in case of focusing at multiple depths. Exact reconstruction under a zeroforcing or leastsquares criterion depends solely on the amount of diversity induced by sampling in both space (via scanning rate) and time (via sampling rate), which further allows for a tradeoff between range and crossrange resolution. For instance, the fastest scanning rate is achieved by steering non overlapping beams, in which case portions of the object can be reconstructed independently from each other.
1 Introduction
Pulse compression is a term widely used within the radar systems community, and was originally achieved by transmitting a Chirp waveform, which, when convolved with its matched version, allows the target information to be recovered without the need of transmitting a narrow pulse, in many cases unfeasible in a practical implementation [1]. The basic idea to restore resolution relies on the assertion that the equivalent of two successively received narrow pulses, whose widths are determined by the chirp bandwidth, are not allowed to interfere. After sampling the received signal, since either a matched^{a} or mismatched receiver is FIR, the overall response can at best only approximate an impulse, therefore limiting resolution. Thus, for a long time, there has been a common belief, especially in military and geophysics applications, that target information recovery for transmissions beyond the limits imposed by the system ambiguity function is not possible, the main reason stemming from the fact that, apparently, accuracy in detection has always been linked to a bandwidth issue.
The notion of resolution in imaging systems turns out to be rather questionable as raised by several authors, be it in the electromagnetic domain [2] for radar applications, or in the acoustic context for ultrasound imaging [3] and geophysics applications [4]. The interpretation of echoes received from a range of targets according to a linear model is by no means new [5, 6] and even though the idea of deconvolution is wellestablished in related fields as seismic exploration [4], a more precise formulation and connections of the imaging problem to signal processing and communications jargons in a 3D scenario is in order.
As noted in [6], the idea of a resolution criterion dates back to Lord Rayleigh (1879) in the context of light intensity distribution, who proposed that two components of equal intensity should be considered resolved, when the first maximum of one component sits at the first minimum of the other. Even though Rayleigh resolution has been considered a bound on resolution, the socalled superresolution algorithms became known as way to surpass this limit. These include minimum variance unbiased estimate (MVUE) and minimum norm solutions, which are very well established estimation techniques [7]. In pulse radar systems, we can interpret such two components as two pulses received from a straight line of range, which can still be resolved even in cases where the Rayleigh resolution limit is exceeded. Of course, this is the definition of resolution in one dimension.
More generally speaking, in an imaging system, resolution is defined independently in range and crossrange (azimuth and elevation). While the former is dictated by the pulse width, or, by the system ambiguity function (which usually assumes a matched filter structure for the receiver) [1], crossrange resolution is achieved by increasing the antenna aperture, or the transmitted signal frequency; In this scenario, the term superresolution refers to additional processing in which resolution beyond what is determined by the antenna features, transmitted pulse, and the usual matched filter can be achieved, by making use of additional diversity and/or signal processing methods.
Synthetic aperture radars (SAR) are one common example where a virtual array can be generated via antenna motion [1, 8]. There is a mathematically rich literature on SARs which exploits spatial diversity of transmitters and receivers in order to sharpen the system pointspreadfunction [8]. Most of these derivations assume weak scattering, although schemes that exploit multiple scattering are also possible. This can be developed in a monostatic, bistatic, or more generally, in multistatic and wideband configurations. Readers should also refer to [9] for a grouptheoretic based approach on radar imaging, assuming a freespace environment. In all these references, a common implication is that the synthesis of an ideal ambiguity function, i.e., an exact Diracdelta, is not possible, due to strong constraints in its mathematical form.
Thus, while mathematicians have extensively studied the imaging formation from an inverse operator perspective, the signal processing community has tackled the same issue using estimation and information theoretic tools, which have been widely established in digital communications [10]. In particular, the subject of pulse compression has evolved into a more general design of waveforms in multiinputmultioutput (MIMO) radars, capable of transmitting different waveforms from each antenna. They can be classified into bistatic MIMO radar and colocated MIMO radar. In the former, the transmitting antennas are widely separated, so that each antenna sees a different radar cross sections (RCS). In colocated MIMO radar, the antennas are closely positioned, so that they share the same RCS. The latter provides interference rejection capabilities, improved parameter identification and flexibility in the design of a beam pattern. Each specific scheme is determined by the antennas and imaging area positions in space, which, in turn, are defined by the application at hand. For instance, beamformers provide a flexible way of steering and focusing a microwave or ultrasound beam by timedelaying the corresponding signals a teach antenna [11]. For narrowband transmitted signals, the MIMO radar is thus approximated by a singleinputmultipleoutput (SIMO) system, where scaled versions of a single waveform are transmitted (also known as phasedarray radar).
Regardless of the application, the final goal continues to be the optimization of the corresponding MIMO ambiguity function, either by optimizing the waveform (aiming range compression) or its covariance matrix (which affects resolution in crossrange). In the latter, the MIMO ambiguity function is optimized by minimizing the side lobes of the autocorrelations and cross correlations of the waveforms. While most of these references consider point targets, the waveforms can be optimized considering extended targets as well [10]. In particular, [10] considers waveform and receiver optimization assuming that the target and clutter impulse responses are known, by maximizing the SINR in the colocated MIMO radar scheme. One important issue that results from the study of waveform design is whether, from an estimation point of view, one is really required to compress, as raised in [12] considering both CramérRao bound (CRB) and sufficient statistics for parameter estimation.
Unfortunately, be it for imaging point targets or extended targets, the resolution issue has been somewhat overlooked, in the sense that it is still considered bounded by the system ambiguity function (even in the MIMO case). A situation where this becomes of paramount importance arises frequently in radarbased medical imaging systems [13]. For example, detecting tumors or imaging microcalcifications in the human body often requires submillimeter resolution, in which case the method should also cope with the practical aspects that worsen the resolution limits normally encountered in longdistance communications applicationssee [14] and the references in the same issue. One of the main differences is in the choice of the carrier frequency [15]. Note that the higher the carrier frequency, the better the crossrange resolution (which justifies a 1D model, considering that an entire area can be covered via Ascan lines). On the other hand, in order not to incur into Rayleigh scattering effects [16], one must pick f_{ c } > v/ Δ_{min}, where v is the wave speed inside the medium, and Δ_{min} the desired minimum range resolution. Now, the higher the carrier, the more the signal is attenuated by the medium, which constitutes one of the fundamental tradeoffs in radar systems, namely, Crossrange Resolution × Depth of penetration. For instance, in Xray imaging, the exposure to ionizing radiation inherent to Xrays can be damaging to tissues, and, in general, harmful to the human body, since their effects are cumulative. These two factors are the main reasons why microwave imaging systems (MIS) have emerged as an attractive alternative in breast imaging for tumor detection [17–19]. Note that lower frequencies also limit the size of the antenna employed, and there should be a fair compromise among all the above factors.
Highly focused beams can also be achieved without necessarily increasing the carrier frequency, with an idea known as Time Reversal [20], another superresolution technique that generates a virtual aperture by timereversing the signals received from a point emitting source via multiple paths, exploiting the nonhomogeneous nature of a medium by retransmitting them in order to achieve high focusing at the source. The timereversal concept is identical to a matched filtering process, whose principle can be shown analytically, for instance, in [21]. It had been proposed recently in conjunction with a binary hypothesis test of detection in the presence or absence of a target in a highly cluttered environment considering a single antenna [22], and further in a beamforming configuration for target detection imaging [23], the latter further applied to a breast cancer scenario. It has also been combined with Spotlight SAR in [24].
An interesting fact with all current radarbased approaches is that resolution in range seems to be treated quite differently from resolution in crossrange. That is, while the former requires "pulse compression", the latter is solved via "beam compression". Note that because in general we have a collection of signals transmitted and received from several directions, the pulse is in reality a 3D function whose shape is given by the combined effect of the transmitted pulse and the MIMO medium's Green function. This motivates us to think of compression as an operation that is performed in all three dimensions in space, in a way that the term "pulse compression" is more properly applied to achieve volumetric, and not only range resolution. Moreover, methods for recovering resolution in radar imaging systems have always assumed a continuous model for the reflectivity information. This is perhaps the major reason why the solution to the inverse problem could only be approximated, normally done so via matched filtering. The fact is, in theory, reflectivity is discrete information whose granularity is dictated by the amount of resolution one is interested in achieving, for both range and crossrange. In this article, this will be accomplished by recasting the imaging problem into one that maps a reflectivity volume into measurements collected by illuminating a target object with a generally unfocused beam. As a result, the commonalities related to minimum range resolution for target detection can be considered irrelevant, once the pulse transmission problem is restated as a standard, albeit, 3D communication problem.
This presentation proposes to formulate the (static) imaging problem in both unfocused and focused cases as a MIMO multirate multidimensional transmission problem. We show how steering at several depths in the focused case can be interpreted as an induced 4D MIMO convolution model, subject to standard linear estimation/equalization problems. In the unfocused case, the amount of induced space and time diversity will be the determinant factor for the accuracy in reflectivity estimation. For the focused scenario, the central idea is to perform estimation of the entire object in one shot, once data from all points inside the object have been collected. As an important special case, we consider the singleantenna, freespace, narrowband, farfield scenario. This results in a Toeplitz structured transmission matrices so that when the beam pattern function is separable, efficient superfast estimation receivers based on the known transmitted pulse exist. This has significant impact on all radarbased applications requiring submillimeter resolution, which call for efficient receivers in order to cope with the often massive amount of data involved.
Notation: We denote by ^{T} the transpose and * the transpose complex conjugate operator, while we use * to denote convolution. We use vec(·) to stack the columns of a matrix into a vector, and resh_{ M } (·) to reshape a matrix with M rows into a column vector.
2 Background on related work
To contextualize the problem, we shall refer to [25], where an isotropic pointsource model p(t, x) = p(t)δ(x  x_{0}), is considered in a wideband distributed aperture. We assume that the transmitted signal wavelength is smaller than the minimum diameter of a target. Otherwise, Rayleigh scattering theory must apply, and the received signal will also depend on the wavelength and target diameter [16]. Hence, under the socalled distorted wave Born approximation, the scattered field measured at point x and time t from a continuous distribution of targets is given by
where
is the incident field, s(z) is the reflectivity function, and g(x, z, t) is the Green's function associated with the background medium [25]. Defining
we can write
In the development of [25], the scattering operator ℋ(s) represents a linear mapping from the input p(t) to the measurements, in a distributed array pattern. For simplicity, consider a single source of waveform. The operator ℋ is then seen as a linear mapping from the reflectivity variable s(·) to the scattering operator ℋ(s). In [25], the main contribution is towards inverting ℋ(s), or, more generally speaking, estimating s. That is, given the Green's function of the medium, an inverse scattering operator ℋ^{1} or a leastsquares estimator of the form
is considered (or a MMSE approach assuming knowledge of noise statistics); For such, the adjoint ℋ* is computed based on the underlying Green's function. Then, assuming a finite number of transmit vectors, the spectral decomposition of (ℋ*ℋ) is precomputed, a fact that allows for an offline computation of the action of the estimator on the received signal.
The points below raise important questions with regards to a practical and thus more precise modeling of the imaging system, which shall reveal the extent to which the information on the reflectivity function s(z) can be recovered:

1.
The above widebeam, wideband formulation does not convey any information on the geometry of the problem, and does not tell us what limits on resolution can be achieved for a particular antenna configuration. A general question that remains is, how do we quantify resolution by focusing (or unfocusing) when illuminating a target? How does the geometry of a beam and the way it is steered over an object affect its overall reconstruction?

2.
For a given carrier frequency, in which case we assume that we have sufficient penetration in range, the minimum range resolution Δ_{min} obtained from the function s(z) is translated into time resolution by virtue of sampling. This immediately raises some fundamental questions regarding feasibility and complexity of implementation of the corresponding receiver. That is, is it possible to sample the received signal fast enough? What kind of transmit/receive structure must be employed to achieve a certain resolution level? What is the impact of sampling in the amount of complexity required for a specific application?

3.
In a practical imaging system subject to sampling, and considering finite extent signals in the freespace, what should be the counterpart of (5)? Is there an efficient way of reconstruction that exploits structure, rather than performing the spectral factorization of (ℋ*ℋ)?
In the sequel, we connect fundamental aspects of radar imaging with modeling and equalization in digital communications as a starting point to tackle the above issues.
3 Motivation
Assume that the transmitter is able to produce a highly collimated beam, so that it can be approximately confined to a straight line, as illustrated in Figure 1.
In this case, our problem reduces to that of imaging a line, defined by the reflectivity function s(z). This is the central idea in radar imaging; to recover s(z) from measurements y(t, x, z), by converting time delay into range z.
An important case of study arises when the background medium is the freespace, where
the solution of the freespace wave equation. Let $p\left(t\right)=a\left(t\right){e}^{j{\omega}_{0}t}$ be the transmitted pulse from an antenna at point x, where ω_{ c } = 2πf_{ c } is the carrier frequency, with f_{ c } given in Hertz. Considering that a(t) is slowly varying, we have
where v(t) is an additive noise term, in order to model an additional uncertainty. Let τ = 2r  x_{0}/c, and define
Substituting it into (7), we get
Figure 2 shows the equivalent baseband system, obtained after demodulation of the received signal.
Given a predetermined target resolution, in general two received pulses will overlap, resulting in what is known in the communications jargon as inter symbol interference (ISI). Thus, a natural question that follows is whether such waveform can be designed in order to avoid ISI. The answer to this question is well known in communications theory; for such, the transmitted pulse must have a Nyquist property, meaning that its response should have zero crossings at multiples of the sampling instant T_{ s }. In theory, this can be easily achieved via, for instance, a raisedcosine pulse, which naturally conforms with the Rayleigh resolution criterion previously discussed. Two implementation issues arise in such design: in practice, sampling is performed at delayed instants kT_{ s } + t_{0}, which again results in ISI; Also, for highresolution sampling, one needs to design a very wide bandwidth pulse, which implies that we should be able to squeeze a narrow, highenergy lobe in the interval [T_{ s }, T_{ s }]. This may not be feasible in many applications (as medical ones), so that, in general, the pulse width will span a wider range. This will result in ISI as well.
All of the above assume no processing at the receiver. When the latter is employed, several techniques for ISI removal can be considered, so that the main question now is how to design both transmitter (pulse) and receiver according to some design criteria, in a way that ISI be completely removed or minimized. For instance, when the received pulses are sufficiently separated in time (yielding a certain spacial resolution), so that they do not overlap, MMSE equalization implies that the receive filter must be matched to a(t), in which case the output SNR is maximized. This is in fact the standard form of pulse compression used in radar systems, where a(t) is normally chosen as a chirp waveform. The chirp waveform is already by itself a form of coding, where the socalled time bandwidth product quantifies the amount of redundancy introduced [13]. Bark codes are also used as a pulse design technique, and in general, the idea is to look for good codes whose orthogonality properties can contribute to undistorted transmission [1, 26]. On the other hand, if one is concerned with the minimization of false alarms due to erroneous detection of sidelobes, a mismatched filter design must be pursued instead [26, 27]. Note that, without considering noise, a mismatched, possibly infinite impulse response (IIR) filter provides much higher resolution compared to a matched one (if the receiver is allowed to be IIR, the overall transmission can be an exact impulse). Moreover, even in the case where the received pulses do overlap, a general MMSE or leastsquares equalizer can be considered instead. Compensating for the inherent ISI in(8) in the MSE sense is the central idea behind most communications systems, even though several strategies of design and transmit/receive schemes can be envisioned.
In general, how fast one can sample is determined by the stateoftheart of analogtodigital converters (ADC). For instance, at the speed of light, submillimeter resolution implies that two received pulses are separated in time by T_{ s } = 2R_{min}/c = 2 · 10^{3}/(3 · 10^{8}) = 6 · 10^{12}s. Since 20 MHz ADCs are the most easily available, achieving such sampling rate becomes a formidable challenge in practice. On the other hand, in the acoustic context these issues are no longer at stake, since ultrasound waves propagate with much lower velocity (average 1540 m/s inside the body). However, the low velocity of ultrasound waves limits the frame rate in which the image is produced, especially for narrow beams and multiple focal points. These facts motivate us to look into general widebeam solutions, where backscatter from several points is received at the same point, or, possibly, at several receiving locations at the same time. Nevertheless, regardless of the nature of the transmissions (electromagnetic or acoustic), in this article we shall first assume that one can sample as fast as necessary, and will examine a few wellknown structures into which the radar problem can be straightforwardly cast. The range resolution limit is then tackled by considering focusing at multiple depths, at the desired spatial resolution.
4 Multidimensional multirate model
Equation (4) can be seen as a mapping from the transmitted waveform to measurements, through an operator that depends on the reflectivity function. A more insightful view of this problem, however, is to express the system as one that maps reflectivity in space onto measurements at the receiving antenna, through the combination of a known waveform and the medium's Green function. In this case, different transmission and reception architectures can be envisioned, depending on the level of focusing employed. This is intimately related to (1) the geometry of the antennas and object in space; (2) how many antenna elements are activated for transmission and reception in a certain scheme; and (3) on the nature of the transmitted signals.
4.1 Unfocused imaging: MIMO radar
In the MIMO case the transmitting and receiving beams are unfocused, and each antenna element transmits a different waveform. This is illustrated in Figure 3, where a particular patch from an antenna transmits omnidirectionally from each element illuminating the target completely. Thus, for a given point z inside the volume, the received signal at an arbitrary point y on the antenna, given the transmitted pulse p(t, x) at point x only, can be compactly written as [1]
where p"(t) denotes the second derivative with respect to t, and J_{ s }(x) and J_{ r }(y) are the time derivatives of the current density at the transmitting and receiving points on the antenna.
Our goal is to reconstruct the entire object volume, or, equivalently, to recover all the cross sections that form a 3D image. Here is where the geometry of the problem comes into picture. First, it is assumed that the aspect angle of all antennas elements is such that they see the same RCS. Second, we would like to preserve the geometry of the volume we want to reconstruct, in the sense that each cross section is "unrolled" to form a 3D tensor representing the object image. That is, we shall map each point of the reflectivity function s(z) onto the corresponding point of the unrolled object, which we define as continuous tensor defined by S(r), where r= [r_{1} r_{2} r]^{T}. Its discretetime version is denoted by the R_{1} × R_{2} × R_{3} tensor S, which is obtained by sampling S at [n_{0}T_{0} n_{1}T_{1} kT_{ r }]^{T}. The quantities {R_{1}, R_{2}} define the target resolution in cross range, while R_{3} defines resolution in range. We shall denote by S(k), k = 0, 1, . . . , R_{3}  1 the R_{1} × R_{2} matrix containing a rectangular lattice of points within a cross section of the object at range k, and by S(n)a vector of reflectivities within the tensor S,$\mathit{n}={\mathit{n}}_{0},{\mathit{n}}_{1},\dots ,{{\mathit{n}}_{R}}_{{}_{1}{R}_{2}}$, along direction n. Moreover, we define by s_{ k } the reflectivity at point r_{ k } for $k=0,1,\dots ,\stackrel{\u0303}{R}1$, $\stackrel{\u0303}{R}={R}_{1}{R}_{2}{R}_{3}$. We further define R = R_{1}R_{2}. Let us spatially sample the antenna surface, so that the transmitting patch contains Q_{ t } = Q_{1}Q_{2} antenna elements, which transmit Q_{ t } waveforms, i.e., p_{ ℓ }(t), ℓ = 0, 1, . . . , Q_{ t }  1. These in turn are received at Q_{ r } receiving elements, at y_{ i }(t), i = 0, 1, . . . , Q_{ r }  1. The complete model is illustrated in Figure 4a, which comprises the forward and backward MIMO channels denoted by G_{ f }(t) and G_{ b }(t), respectively. Let ${u}_{k}\left(t\right),\phantom{\rule{0.3em}{0ex}}\phantom{\rule{0.3em}{0ex}}k=0,1,\dots ,\stackrel{\u0303}{R}$ be the k th output of G_{ f }(t), at the corresponding point r_{ k },
and note that we can instead express the problem as a mapping from reflectivities to measurements as shown in Figure 4b.
We thus see that our main problem is how to recover the tensor S, given the measurements y_{ i }(t), at the desired resolution {R_{1}, R_{2}, R_{3}}. Before discussing the possible solutions to this problem, we first analyze what happens in the more general case of a focused (transmitted) beam.
4.2 Focused imaging
Focusing is achieved by concentrating energy in a small volume within the object, so that only echoes received from that particular volume become significant. This is achieved by integration of delayed copies of the transmitted signal, weighted by the time derivative of the current density over the antenna, giving rise to a beam pattern, as illustrated in Figure 5.
Assuming that the antenna elements are closely located, and that p(t) is narrowband, the transmitter becomes a bidimensional phased array, modeled as a SIMO system represented by a steering vector. On the other hand, the receiving beam can be unfocused in general, so that the received signals are collected at all antenna elements and processed by a MIMO system.
Let the m th entry of the underlying SIMO system be given by ${e}^{j{\omega}_{c}{t}_{m}}$, where t_{ m } is the delay applied to the signal point m relative to the signal at the center of the antenna x_{ o }, with respect to the focal point, which we denote by r_{ f } . We shall omit the dependence on r_{ f } for simplicity of notation. Let u_{ k }(t) be the signal at r_{ k }, defined within the depth of field corresponding to r_{ f } . Equation (10) in this case can be written as
where the term J(ℓ) consists of a 2D window that provides apodization, in order to shape the transmitted beam pattern. Let a_{ ℓ }= x_{ ℓ } x_{ o } be a vector from the center of the antenna to an arbitrary point on the antenna, and define the wave number κ = ω_{ o }/c_{ o }. Thus, expressing g_{ k }(x_{ ℓ }, t) explicitly as ${g}_{k}\left({\mathit{x}}_{\ell},\phantom{\rule{2.77695pt}{0ex}}t\right)={\mathit{\text{\u1e21}}}_{k}\left({\mathit{x}}_{\ell},\phantom{\rule{2.77695pt}{0ex}}t\right)\star \frac{\delta \left(t{\mathit{r}}_{k}{\mathit{x}}_{\ell}/c\right)}{4\pi \mathit{r}{\mathit{x}}_{\ell}}$, and using the Fraunhofer condition at the focal plane, we have
where $w\left(\hat{{\mathit{r}}_{k}{\mathit{x}}_{o}},\phantom{\rule{2.77695pt}{0ex}}t\right)$ is the 2D discrete time Fourier transform of the time varying tapering function $J\left(\ell \right){e}^{j{\omega}_{c}{t}_{\ell}}{\mathit{\u1e21}}_{k}\left({\mathit{x}}_{\ell},\phantom{\rule{2.77695pt}{0ex}}t\right)$ at r_{ k } at time t, with $^$ denoting the corresponding unitnorm vector. We shall define β_{ k }(r_{ f }, t) as the combined effect of the beam pattern and the transmitted pulse at time t, with f denoting the focal point. The complete system is illustrated by the spatial multirate structure of Figure 6a.
The interpretation of this scheme is as follows. Assume we have the antenna beam focused at r_{ f } . Because the beam is narrow at the focal plane, and due to a finite (extent) depth of field, the illuminated portion of the object can be represented by a K_{1} × K_{2} × K_{3} parallelepiped ${\stackrel{\u0304}{\mathit{s}}}_{{\mathit{r}}_{j}}$, which contains the corresponding most significant reflectivity samples (i.e., the ones shaped by the transmitted pulse, lateral beam function and the depth of field). Their positions relative to the point r_{ f } are set by the spatial shifts $z\left({\mathit{m}}_{i}\right)\triangleq {z}_{0}^{{m}_{i},0}{z}_{1}^{{m}_{i},1}{z}_{2}^{{m}_{i},2},i=0,\dots ,K1$, where K ≜ K_{1}K_{2}K_{3}. We further define $\stackrel{\u0303}{K}\triangleq {K}_{1}{K}_{2}$. The columns of the downsampling matrix B define the geometry in which these spatial blocks are processed, as the object is scanned at a given rate. That is, let ${\stackrel{\u0304}{\mathit{s}}}_{{\mathit{r}}_{a}}$ and ${\stackrel{\u0304}{\mathit{s}}}_{{\mathit{r}}_{b}}$ be two parallelepipeds (tensors), possibly intersecting, at focal points r_{ a } and r_{ b }, respectively. For instance, when K =  det B  (i.e., the volume of the fundamental parallelepiped defined by B), it implies that nonoverlapping blocks are being processed each time. This is actually the lowest spatial rate for which the object can be processed without loss of useful information. On the other hand, when choosing B= I the object is being scanned at the highest possible rate, which is given by the desired spatial resolution. For simplicity, we shall assume rectangular parallelepipeds, so that the beam region of support coincides with a rectangular parallelepiped as well. Each tensor block is then (continuoustime) convolved with the K_{1} × K_{2} × Q_{ r } tensor ${{\mathcal{F}}_{\mathit{r}}}_{{}_{f}}\left(t\right)$, the resulting impulse response of the cascade of the β_{ k }(r_{ f } , t)p(t) with G_{ b }(t), for f = {a, b}.
More specifically, assume J_{ r }(x_{ i }) = 1, and let ${{\mathcal{F}}_{\mathit{r}}}_{{}_{j},q}\left(t\right)$, q = 0, 1, . . . , Q_{ r }  1, be the K_{1} × K_{2} rectangular matrices that constitute ${{\mathcal{F}}_{\mathit{r}}}_{{}_{f}}\left(t\right)$ at the focal point r_{ j }, $j=0,1,\dots ,\mathcal{D}1$, where $\mathcal{D}$ is number of foci. The Q_{ r } × 1 received vector is defined as ${\mathit{y}}_{{\mathit{r}}_{j}}\left(t\right)={\left[{y}_{0}\left(t\right)\phantom{\rule{0.3em}{0ex}}\phantom{\rule{0.3em}{0ex}}{y}_{1}\left(t\right)\phantom{\rule{0.3em}{0ex}}\phantom{\rule{0.3em}{0ex}}\phantom{\rule{0.3em}{0ex}}\dots \phantom{\rule{0.3em}{0ex}}\phantom{\rule{0.3em}{0ex}}{y}_{{Q}_{r}1}\right]}^{\mathbf{T}}$ and expressed as
where ${\mathit{F}}_{{\mathit{r}}_{j}}\left(t\right)$ is ${Q}_{r}\times \stackrel{\u0303}{K}$ given by
and
for ℓ = 0, 1, . . . , K_{3}  1. The quantity ${\mathit{v}}_{{\mathit{r}}_{j}}\left(t\right)$ is the corresponding vectorized noise samples.
Here is the crucial point in understanding the imaging process in general. First, it is important to note the difference between the range resolution defined by the spatial sampling T_{ r }, and the time separation between two consecutive received pulses (here, more generally we speak of MIMO pulses), which leads to the sampling rate T_{ s } = 2Δ_{min}/c, where Δ_{min} is the minimum target separation required for a certain application, in the colocated radar. This coincides with the spatial sampling T_{ r } in the far field scenario, since in this case we are allowed only one focal point in range. Hence, T_{ r } = Δ_{min}, i.e., we are limited by the achievable time sampling. On the other hand, focusing allows sampling at the desired spatial resolution, so that Δ_{min} is only defined within the incoming tensors of reflectivities. That is, sampling the received vectors at the rate nT_{ s } yields the discrete model
where ${\mathit{F}}_{{\mathit{r}}_{j}}\left(0\right)\dots {\mathit{F}}_{{\mathit{r}}_{j}}\left(N1\right)$ is the sampled version of ${\mathit{F}}_{{\mathit{r}}_{j}}\left(t\right)$ at T_{ s }, while ${\mathit{s}}_{{\mathit{r}}_{j}}\left(V\ell \right)$ corresponds to the downsampled version of ${\mathit{s}}_{{\mathit{r}}_{j}}\left(\ell \right)$, with V = Δ_{min}/T_{ r }. Figure 6b illustrates the difference between T_{ r } and Δ_{min}.
5 Modern structures for reflectivity estimation
Suppose we are to estimate a tensor arriving from one particular focal point. Essentially, tensor imaging can be cast like any other MIMO estimation problem, formulated as either a Bayesian or classical estimation minimum variance unbiased estimator (MVUE) problem. Due to the difficulty of finding such nonlinear estimators in practice, it is common to restrict the receiver to be linear. Now, assume that the output signal is sampled at T_{ s }. In formulating the convolution model for the received echoes, two common structures can be induced, depending on the receiver design:
5.1 Block processing
Let ℒ be the received signal window length, and define the vectorized tensor of reflectivities and its corresponding received vector around r_{ j }, respectively:
where B' = N + ℒ1, for k = 0, 1,...,B1. This choice of ℒ induces a blockbyblock transmission scheme with interblock interference (IBI), where each block has size ℒ. That is, let $\mathcal{D}$ be the number of desired foci set to cover the object space. Then,
where
with dimensions ${Q}_{r}\mathcal{L}\times \stackrel{\u0303}{K}\left(N+\mathcal{L}1\right)$, and ${\stackrel{\u0304}{\mathit{v}}}_{{\mathit{r}}_{j}}\left(k\right)$ are the vectorized noise samples. This can be the case in a monostatic scenario illustrated in Figure 7a, where at the time the radar starts listening, part of the signal has already returned to the radar site.
On the other hand, if the observation window is sufficiently long so that all incoming MIMO pulses can be completely observed, transmission becomes memoryless, meaning that at the time the radar starts listening, no information of near targets is present (Figure 7b). In this case, we replace (19) and (20) by
where ${\mathit{H}}_{{\mathit{r}}_{j}}$ has the form
with dimensions $\left(N+B1\right){Q}_{r}\times B\stackrel{\u0303}{K}$.
When ${\mathit{H}}_{{\mathit{r}}_{j}}$ is a tall fullrank matrix, a solution is found rather independently for every focal point. Two important solutions can be envisioned:
(i) Matrix Inversion. Here, the solution is given by
where
The optimal delay ${\delta}_{opt}\in \left[0,\phantom{\rule{2.77695pt}{0ex}}N{Q}_{r}\right]$ is chosen such that it minimizes the output noise power, by minimizing the matrix norm $\u2225{\left({\mathit{I}}_{\delta}^{T}{\mathit{H}}_{{\mathit{r}}_{j}}\right)}^{1}\u2225$ (see [28], for the case of Toeplitz ${\mathit{H}}_{{\mathit{r}}_{j}}$).
(ii) Minimum variance (leastsquares). This is given by
In Equation (22), the dimension of ${\mathit{H}}_{{\mathit{r}}_{j}}$ is ${Q}_{r}\mathcal{L}\times \stackrel{\u0303}{K}\left(N+\mathcal{L}1\right)$, so that all we need is ${Q}_{r}\mathcal{L}\ge \stackrel{\u0303}{K}\left(N+\mathcal{L}1\right)$. The same goes for Equation (24), where the condition is $\left(N+M1\right){Q}_{r}>B\stackrel{\u0303}{K}$.
5.2 T_{ s }spaced spacetime linear equalization
In linear equalization, the final goal is to approximate the overall response as much as possible to an impulse (aside from minimizing the noise effect), for which a zero forcing equalizer is in general IIR. Now, if one is restricted to an FIR receiver, several solutions exist, all of which depending on the desired cost function, aiming to minimize the effect of sidelobes and noise at the output.
Referring to Figure 2, if the received signal is sampled at T_{ s } > T_{ p }, where T_{ p } is the pulse width, the standard pulse compression technique attempts to recover the reflectivity information inherent to s(k) via matched filtering. That is the case if one is primarily interested in maximizing the SNR. Alternatively, the requirement on the SNR can be relaxed, and a mismatched filter can be considered instead. In [27, 29], significant reduction on the sidelobes of the overall impulse response can be obtained, if different L_{ p }norm minimization criteria are considered. Observe that regardless of the optimization criteria employed, it is not possible in general to reconstruct the input perfectly, unless an IIR receiver is used, which brings up noise amplification issues in practice.
Given the structure of (22), one cannot find a receive MIMO filter ${\mathit{W}}_{{\mathit{r}}_{j}}\left(n\right)$, so that when convolved with ${\mathit{F}}_{{\mathit{r}}_{j}}\left(n\right)$ yields an impulse, unless ${\mathit{H}}_{{\mathit{r}}_{j}}$ is a tall fullrank matrix. This condition is satisfied if ${Q}_{r}\mathcal{L}\ge \stackrel{\u0303}{K}\left(N+\mathcal{L}1\right)$ and the subchannels of the MIMO channel do not share common zeros. Still, a solution to this problem can be found in the MVUE sense, given by
where ${h}_{\left(\delta {Q}_{r}\mathcal{L}+1:{Q}_{r}\mathcal{L}\left(\delta +1\right),:\right)}$ corresponds to the (δ + 1)th block column of ${\mathit{H}}_{{\mathit{r}}_{j}}$, and δ is the optimal delay for this estimation.
Observe that either in the unfocused scenario, or in case each tensor block ${\mathit{s}}_{{\mathit{r}}_{j}}$ is to be recovered independently from other blocks, its 3D resolution can only rely on the feasibility of the sampling rate device. That is, while lateral resolution in these cases depends on the spatial grid density, resolution in range will be limited to Δ_{min}. The point is thus how accurately one can estimate ${\mathit{s}}_{{\mathit{r}}_{j}}$ by using (27) or (28), specially when ${\mathit{H}}_{{\mathit{r}}_{j}}$ is not a tall fullrank matrix.
The fact is that, in addition to the 1D time convolution model for each focal point, scanning at an arbitrary resolution rate yields overlap of successive blocks, which naturally characterizes a volumetric convolution between the object (reflectivity tensor), and the spacevarying tensor defined by the beam pattern for different focal points. The spatial model for this operation will depend on the number of directions and depths the beam is moved to, whereby, similarly to block processing in time, motion can induce a particular processing structure in space (e.g., block processing, with or without overlap). The overall process is in general a 4D MIMO space (time)varying convolution model. For instance, choosing $\mathcal{D}=\left({R}_{1}+{K}_{1}1\right)\phantom{\rule{0.3em}{0ex}}\left({R}_{2}+{K}_{2}1\right)\phantom{\rule{0.3em}{0ex}}\left({R}_{3}+{K}_{3}1\right)$ implies scanning at the spatial resolution {T_{0}, T_{1}, T_{ r }}, and corresponds to a complete 4D MIMO (spacevarying) convolution. On the other hand, for the fastest scanning, we can pick $\mathcal{D}={R}_{1}{R}_{2}{R}_{3}/det\mathit{B}$ foci, and the whole space is scanned with nonoverlapping blocks. This tells us that unlike the unfocused scenario, or in the case where information received from around the focal point is not sufficient for independent estimations of each block, focusing allows for further diversity; in this sense, resolution is unlimited, and ultimately dictated by the desired spatial sampling of S, and not only by the sampling device. As a result, estimation must be accomplished jointly, after data received from around all foci have been acquired. Of course, the type of receiver will depend heavily on the nature of the signals involved. For instance, the ultrasound wave speed is much slower compared to the one of a microwave, so that quicker estimates must be produced for every focal point in the former case. As we have mentioned, the beam foci and their positions relative to the antenna will define the convolution model employed. For this reason, we shall consider the beam motion in two separate steps, first with respect to lateral motion, covering all azimuths and elevations, which is then followed by depth motion.
a) Lateral Motion
Consider the signals in (18), and assume that the object is scanned at a fixed depth r = d_{ k } across all possible azimuths and elevations such that the object is still within the beam. This corresponds to picking ${\mathit{r}}_{j}={\mathit{r}}_{l,k}=\left[{r}_{1}\phantom{\rule{1em}{0ex}}{r}_{2}\phantom{\rule{1em}{0ex}}{d}_{k}\right]$, where {r_{1}, r_{2}} are varied in order to cover ${\mathcal{D}}_{1}=\left({R}_{1}+{K}_{1}1\right)\phantom{\rule{0.3em}{0ex}}\left({R}_{2}+{K}_{2}1\right)$ focal points, for $l=0,1,\dots ,{\mathcal{D}}_{1}1$. We recognize the sequence $\left\{{\mathit{y}}_{{\mathit{r}}_{j}}^{\prime}\left(n\right)={\mathit{F}}_{{\mathit{r}}_{j}}\left(n\ell \right){\mathit{s}}_{{\mathit{r}}_{j}}\left(V\ell \right)\right\}$ as the output of a MIMO 2D space varying convolution between the constituting matrices ${\mathcal{F}}_{{\mathit{r}}_{j},q}\left(n\right)$ defined in (16), and the tensor slice at range Vℓ corresponding to depth d_{ k }, which we denote by ${\mathit{S}}_{{d}_{k}}\left(V\ell \right)$. This convolution at depth d_{ k } results in Q_{ r } images, or equivalently, a matrix with vector entries, each one having Q_{ r } coefficients, which we denote by ${\mathcal{Y}}_{{d}_{k},n}^{\prime}$. Note that similarly to the definition of a time window, which sets the number of received images from a focal point, we can also determine a spatial window, with dimension smaller than ${\mathcal{D}}_{1}$. However, in order to exploit full space and time diversity, we shall continue to assume a full spatial convolution model as well. We thus have
so we can replace (18) by a more compact form as
where ${\mathcal{H}}_{{d}_{k}}\left(n\ell \right)$ is a twolevel block banded matrix of size Q_{ r }(R_{1}+K_{1}1)(R_{2}+K_{2}1) × R_{1}R_{2}, representing a MIMO 2D convolution. Defining
and denoting ${\mathcal{X}}_{{d}_{k}}=vec\left({\mathcal{S}}_{{d}_{k}}\right)$ and ${\stackrel{\u0304}{\mathit{y}}}_{{d}_{k}}^{\prime}=vec\left({\mathit{Y}}_{{d}_{k}}\right)$, we have
where
is a Q_{ r }(N + B  1) (R_{1} + K_{1}  1) (R_{2} + K_{2}  1) × R_{1}R_{2}B block toeplitz matrix, with block banded blocks.
The 3D model (33) fully describes the farfield transmission scenario for a single depth at d_{0} = ∞; we see that while resolution is unlimited in crossrange, we are still limited to estimate B out of K_{3} images within the tensor S.
b) Depth Motion
The lateral scanning can be performed at several depths, giving rise to a full 4D convolution model. In other words, (30) can be defined for $k=0,1,\dots ,{\mathcal{D}}_{2}$, where ${\mathcal{D}}_{2}=\left({R}_{3}+{K}_{3}1\right)$, so that overall r_{ j } covers $\mathcal{D}=\left({R}_{1}+{K}_{1}1\right)\phantom{\rule{0.3em}{0ex}}\left({R}_{2}+{K}_{2}1\right)\phantom{\rule{0.3em}{0ex}}\left({R}_{3}+{K}_{3}1\right)$ focal points. There is, however, a subtle difference in the way convolution is performed in the axial direction. While the 2D convolution sum can be computed at the minimum resolution step {T_{1}, T_{2}}, the volumetric convolution is obtained by combining images at ranges Vℓ, ℓ = 0, 1, . . . , B  1, and at times 0, 1, . . . , N + B  1, for each depth d_{ k }. Let ${\mathcal{C}}_{{d}_{k},c}$, with c = 0, 1, . . . , B  1 be the c th block column
of ${\mathcal{H}}_{{d}_{k}}$and define
Then, denoting y= vec(Y) and x= vec(X) allows as to write
where $\mathcal{H}$ is now a 4level block banded convolution matrix, with dimensions Q_{ r }(N + B 1) (R_{1} + K_{1}  1) (R_{2} + K_{2}  1) (R_{3} + K_{3}  1) × BR_{1}R_{2}R_{3} given by Equation (38). Again, we can proceed in finding an estimate of the reflectivity tensor S in (37) via matrix inversion or LS, similarly to (25), with ${\mathit{H}}_{{\mathit{r}}_{j}}$ replaced by $\mathcal{H}$. Note that the tall fullrank condition imposed for separate blocks is much more relaxed in this case due to the redundancy of reflectivity in adjacent blocks.^{b}
5.3 Fractionallyspaced equalization
Due to the massive amount of data generated in radar imaging, a MIMO FIR receiver that reconstructs an image at a fast rate can be highly desirable. As we have mentioned in Section 5.2, for T_{ s }spaced equalization, this is possible only if ${\mathit{H}}_{{\mathit{r}}_{j}}$ is tall fullrank. If not, exact reconstruction based on an FIR receiver is still possible by inducing some form of redundancy in the equivalent discretetime system. As we have mentioned, coding is one form of redundancy, which leads to bandwidth expansion at the transmitter. One way to induce redundancy without physically introducing it at the transmitter, and more importantly, without incurring on bandwidth expansion, is by oversampling the received signal in time. Oversampling was originally performed by integer factors, giving rise to the socalled fractionally spaced equalizers. Recently, it has been shown that sampling the output at fractional rates, say $n{T}_{S}\left(\frac{L}{M}\right)$, where L > M, also allows for exact FIR receivers, in which case we further assume that L and M are coprime.
After equalization, a rate reduction step restores the original sampling rate (that means resolution in our context). Fractional sampling allows for significant reduction in overhead when compared to the more common T_{ s }/ 2spaced FS equalizer, which corresponds to the minimum integer sampling rate. Figure 8a illustrates the fractionallyspaced receiver, while Figure 8b shows the equivalent discretetime model with fractional sampling. The filters ${\mathit{F}}_{{\mathit{r}}_{j}}\left(z\right)$ and ${\mathit{W}}_{{\mathit{r}}_{j}}\left(z\right)$ are known as fractional biorthonormal partners [30] if in the absence of noise, the overall transmission is an exact delay (δ).
Oversampling the output is what makes exact reconstruction possible. In fact, using wellknown multirate noble identities [31], this scalar system can be further expressed via its block transmission equivalent, according to Figure 9. The polynomials E(z) and R(z) correspond to polyphase matrices, which are uniquely determined by polyphase components ${\mathit{F}}_{{\mathit{r}}_{j}}\left(z\right)$ and ${\mathit{W}}_{{\mathit{r}}_{j}}\left(z\right)$ respectively, via their order L type2 and type1 decompositions (we drop the notation on r_{ j } in their factors for simplicity)see [30]:
Define the functions, for 0 ≤ k ≤ L  1,
with m and l such that ℓL + mM = 1. Since L and M are coprime, the smallest integers m and ℓ are obtained by the Euclid's algorithm. Defining the order M polyphase decompositions
the entries of the polyphase matrices E(z) and R(z) are simply {E_{ k, j } (z), R_{ i, k }(z)}.
In [30], the authors develop a zero forcing solution for R(z) based on the Smith form of E(z). It is shown that there exists an FIR right vector fractional biorthogonal partner of F_{ i }(z) if and only if ${Q}_{r}L>\stackrel{\u0303}{K}M$, and the greatest common divisor (gcd) of all the $\stackrel{\u0303}{K}M\times \stackrel{\u0303}{K}M$minors of E(z) is a delay. If theses conditions are satisfied, exact reconstruction of the input can be achieved. The additional degree of freedom offered by the resulting receiver is then exploited in order to minimize the noise power at the output. As mentioned in [30], the advantage of the method relies on the fact that neither the input nor the noise autocorrelation matrices are needed when the noise is uncorrelated, in comparison with an MMSE solution. Note that the ZF solution of [30] is not the only one independent of statistics when the noise is uncorrelated; a pure minimumvariance (LS) based receiver also minimizes the output noise power. Also, in the solution proposed in [30], their ZF method is compared to the MMSE solution considering a zero decision delay in the filter design. This could deteriorate the MMSE performance in comparison with the ZF approach. In this sense, a MVUE approach is also a good candidate for equalization (moreover, computing the Smith form is not a systematic task, and represents an illconditioned factorization problem, similar to the Jordan form).
Let N_{ E } and N_{ R } be the lengths of the polynomial matrices E(z) and R(z), respectively, and define their corresponding matrix coefficients ${\mathcal{E}}_{j}=\left[{\mathcal{E}}_{0}{\mathcal{E}}_{1}\cdots {\mathcal{E}}_{{N}_{E}1}\right],{\mathcal{R}}_{j}=\left[{\mathcal{R}}_{0}{\mathcal{R}}_{1}\cdots {\mathcal{R}}_{{N}_{R}1}\right]$.
Define the QLN_{ R } × KM(N_{ E } + N_{ R }  1) block convolution matrix
Hence, as long as we have ${Q}_{r}L{N}_{R}\ge \stackrel{\u0303}{K}M\left({N}_{E}+{N}_{R}1\right)$and T is fullrank, if the noise is uncorrelated, similarly to (28) the MVUE of s_{ j }(k) is given by
6 Special case: single antenna, narrowband, freespace, farfield
A special case of general interest arises when focusing is performed in both transmission and reception, giving rise to two distinct antenna beam patterns. Let ${\beta}_{k}^{\prime}\left({\mathit{r}}_{f}\right)={w}_{1}\left(\hat{{\mathit{r}}_{k}{\mathit{x}}_{0}}\right){w}_{2}\left(\hat{{\mathit{r}}_{k}{\mathit{x}}_{0}}\right)$ be their product, corresponding to point r_{ k } within the reflectivity tensor at r_{ f } . In a freespace scenario, (9) simplifies to
where ${\beta}_{k}\left({\mathit{r}}_{f}\right)\triangleq {\beta}_{k}^{\prime}\left({\mathit{r}}_{f}\right)/32{\pi}^{2}\left{\mathit{x}}_{0}{\mathit{r}}_{k}\right$, and τ ≜ x_{0}r_{k}/c. We have also dropped the dependency on x_{0}, since it is fixed in our development. This corresponds to Q_{ r } = 1, so that we now deal with a simple multipleinputsingleoutput (MISO) model, as illustrated in Figure 10.
We observe that the 2D filter with coefficients given by the antenna beam pattern defined by β_{ k }(r_{ f }) = β_{ k }(r) is now spaceinvariant in crossrange, while filtering along range is accomplished through the transmitted pulse only, which is a timeinvariant scalar function p(t). That is, ${\mathit{F}}_{{r}_{j}}\left(n\ell \right)$ in (18) becomes
where ${\underset{}{\mathit{F}}}_{\phantom{\rule{0.3em}{0ex}}V\ell}=\left[{\beta}_{0}\left(V\ell \right)\phantom{\rule{2.77695pt}{0ex}}{\beta}_{1}\left(V\ell \right)\phantom{\rule{2.77695pt}{0ex}}.\phantom{\rule{2.77695pt}{0ex}}.\phantom{\rule{2.77695pt}{0ex}}.\phantom{\rule{2.77695pt}{0ex}}{\beta}_{\stackrel{\u0303}{K}1}\left(V\ell \right)\right]$, so that ${\mathcal{H}}_{{d}_{k}}\left(n\ell \right)$ in (30) is replaced by
At the farfield we focus on a single depth, d_{0}, so that ${\mathcal{H}}_{{d}_{k}}\left(n\ell \right)=p\left(n\ell \right)\underset{}{\mathcal{H}}$, where $\underset{}{\mathcal{H}}$ becomes a (R_{1} +K_{1} 1)(R_{2} +K_{2} 1) × K_{1}K_{2} twolevel block Toeplitz matrix with Toeplitz blocks. Hence, (33) simplifies to
or, equivalently,
in terms of the reshaped model, where $\mathcal{T}$ is tall, Toeplitzlike, similar to (24). Assume that the beam pattern function β_{ k }(r) is separable, and consider two (discrete) sequences b_{0}(n) and b_{1}(n), which define two Toeplitz matrices A_{0} and A_{1}, respectively, at a certain range, similarly to the structure of ${{\mathit{H}}_{\mathit{r}}}_{{}_{0}}$ in (24), with the ${\mathit{F}}_{{\mathit{r}}_{j}}\left(n\right)$ replaced by b_{0}(n) or b_{1}(n). In this case, we write $\underset{}{\mathcal{H}}$ as $\underset{}{\mathcal{H}}={\mathit{A}}_{1}\otimes {\mathit{A}}_{0}$, where A_{0} is (R_{1} + K_{1}  1) × R_{1}, and A_{1} is (R_{2} + K_{2}  1) × R_{2}. Therefore, referring to (29), we have
where [·]_{:, N + B2n}denotes the (N + B  2  n)th column of the argument, and ${\mathcal{Y}}_{{d}_{0},n}^{\prime}$ is defined in (29). This implies that
for n = 0, 1, . . . , N + B  2, where we use ${\mathcal{Y}}_{{d}_{0},n}$ to define the output that includes the noise term ${\mathcal{V}}_{{d}_{0}}\left(n\right)$, unlike ${\mathcal{Y}}_{{d}_{0},n}^{\prime}$ in (29). This model suggests, in light of (25), two main procedures for estimating the volumetric information ${\mathcal{S}}_{{d}_{0}}$:
(i) Matrix inverse. Given the separable model above, we can estimate ${\mathcal{S}}_{{d}_{0}}$ as
where δ_{0}, δ_{1}, and δ_{2} are chosen optimally. The restriction matrices ${\mathit{I}}_{{\delta}_{\mathsf{\text{0}}}},{\mathit{I}}_{{\delta}_{1}},{\mathit{I}}_{{\delta}_{2}}$ are defined similarly to (26), with δ_{0} ∈ [0, R_{1}], δ_{1} ∈ [0, R_{2}], and δ_{2} ∈ [0, B]. In [28], it is shown that for the 1D case, when the delay δ is equal to the number of zeros of the corresponding polynomial outside the unit circle, say e.g., p(n), the matrix norm $\u2225{\left({\mathcal{T}}^{\mathbf{T}}{\mathit{I}}_{{\delta}_{2}}\right)}^{1}\u2225$ is minimized. Here, because {b_{0}(n), b_{1}(n), p(n)} are known a priori, {δ_{0}, δ_{1}, δ_{2}} can be found offline, so that $\u2225{\left({\mathit{I}}_{{\delta}_{0}}^{\mathbf{T}}{\mathit{A}}_{0}\right)}^{1}\u2225$, $\u2225{\left({\mathit{I}}_{{\delta}_{1}}^{\mathbf{T}}{\mathit{A}}_{1}\right)}^{1}\u2225$, and $\u2225{\left({\mathit{I}}_{{\delta}_{2}}^{\mathbf{T}}\mathcal{T}\right)}^{1}\u2225$ can be minimized as well.
(ii) Leastsquares. Using well known properties of Kronecker products, the leastsquares estimate of ${\mathcal{S}}_{{d}_{0}}$ is given by
or, reshaping ${\widehat{\mathit{S}}}_{{d}_{0}}$ and ${\stackrel{\u0304}{\mathit{y}}}_{{d}_{0}}^{\prime}$ according to (32) and (32),
If $\underset{}{\mathcal{H}}$ is separable, we obtain
so that
We highlight that in this configuration, p(n), b_{ o }(n), and b_{1}(n) play the role of a known channel in a communication system, which can be optimally chosen in order to meet certain design specification.
Remark: Regarding implementation issues, as we have mentioned, the structure of the receiver is paramount for feasibility of the solution, specially when a significant amount of data is involved.
Exploiting the structure induced by the Green's function of a specific medium has thus a powerful impact on the actual implementable solution of the corresponding inverse. While in [25] an inverse or leastsquares operator is applied to the received signal (the latter by considering the adjoint ℋ* and precomputing the spectral decomposition of (ℋ*ℋ)), by virtue of sampling in a 3D freespace scenario, ℋ is replaced by $\left(\mathcal{T}\otimes \underset{}{\mathcal{H}}\right)$, so that the LS estimator is computed via (56). This allows us to replace a spectral factorization by a superfast receiver whose defining parameters are computed offline. This subject is closely connected with the field of fast equalization receivers in communications [7, 32], where an efficient estimation of the transmitted sequence is available.
7 Preliminary experiments
We perform some preliminary simulations considering the matrix inversion and leastsquares receivers for an exact modeling, 3D scenario in freespace aforementioned. We also plot the corresponding 1D equalization receivers based on Ascan lines. The behavior of these equalizers is well known, and the challenge in their implementations lies in cases where the conditioning of $\mathcal{T}$, A_{0}, and A_{1} becomes too high. Good numerical properties and minimum MSE occur when the columns of these matrices are orthogonal, so that the LS solution becomes a matched filter. This is the case when the transmitted pulses do not overlap, or can be approximated by well designed codes. Two fundamental questions must be taken into consideration when computing the above solutions: (1) What sort of algorithm or method must be employed when implementing (52) or (56). (2) How does the designed pulse affect the numerical properties of the underlying solution, and more importantly, how does it affect the implementation of the method chosen in (1). These issues commonly arise in several different areas where LS type solutions are deployed, and are currently a (hot) topic of considerable activity in the field of fast matrix equalizers [32, 33]. In hostile wireless communications environments, long delay spreads and deep fades are responsible for the illconditioning of the corresponding convolution matrix, a fact that can be similarly observed in $\left\{\mathcal{T},{\mathit{A}}_{0},{\mathit{A}}_{\mathsf{\text{1}}}\right\}$ depending on their design criteria. Fortunately, these matrices are known quantities, and can be designed in order to avoid numerical difficulties. Several practical aspects must be taken into consideration, including ease of implementation of the pulse, antenna dimensions, etc.
We assume that the beam pattern is generated by a 1 m × 1 m antenna with uniform tapering function and current density, producing two separable sinc functions b_{0}(n) and b_{1}(n). We estimate a 10 × 10 × 10 tensor, with resolution T_{1} = T_{2} = Δ_{min} = 0.1. The pulse length is T_{ p } = 10^{7}, and the carrier frequency is set to f_{ c } = 4 GHz. In Ascan imaging, the beam is swept over the object. For each received signal, the estimator is computed based solely on the transmitted pulse, while the beam pattern is usually designed in order to minimize the blurring effect in this direction. In the 3D modeling, however, whatever the beam pattern in cross range is, we make use of its shape when estimating along this direction as well. Observe that a tall matrix can always be induced by moving the beam over the object in order to perform a complete spatial convolution. Hence, improved results may be expected compared to the case of Ascans.
Figure 11a shows the MSE decay normalized by the Frobenius norm of $\mathcal{S}$, for a 10 MHz chirp pulse p(t) = cos(2πf_{ c }t + πκt^{2}), where κ is the chirp rate. For the matrix inversion approach, in theory the optimal delay δ_{2} should be equal to the number of zeros outside the unit circle, in this example given by δ_{2} = 99. However, the condition number of $\left({\mathit{I}}_{{\delta}_{2}}^{\mathbf{T}}\mathcal{T}\right)$ in this case is 7 · 10^{13}, and produces no meaningful result. The optimal delay was verified empirically to be δ_{2} = 0 instead. The corresponding condition number of $\left({\mathit{I}}_{{\delta}_{2}}^{\mathbf{T}}\mathcal{T}\right)$ is 7.5, and its performance is close to the one of the LS receiver. For the latter, the condition number of T*T is equal to 784.
Figure 11b shows the same scenario for a random pulse whose taps are drawn from zeromean, unitvariance gaussian variables. The condition number of T*T is 1.97, while for $\left({\mathit{I}}_{{\delta}_{2}}^{\mathbf{T}}\mathcal{T}\right)$, 13.2. This is to illustrate that conditioning is fundamental, and is not necessarily achieved when the transmitted signal is designed as a Chirp, also aiming implementation issues.
We consider an 80 × 80 real scene image Figure 12a for simplicity at a fixed range and an SNR of 10 dB, for a randomly generated pulse with Gaussian samples, and additive white Gaussian noise. The optimal minimumnorm matrix inverse and 3DLS estimation are illustrated in Figure 12b, c, respectively, which can be compared to the standard matchedfilter receiver in Figure 12d. The corresponding Ascan schemes are shown in Figure 13a, b. We further compare these recovered scenes with the ones obtained with a Chirp transmission, in Figure 14a, b. We clearly see the effect of conditioning in the matrix inverse solutions, and superiority of the full convolution informationbased receivers.
In this article, we have presented preliminary simulations considering the effect of the transmitted signal in Ascan and 3D estimation scenarios. A thorough comparison with other methods in a fair equivalent scenario is still to be pursued in a separate future study.
8 Conclusions
We have approached the 3D radar imaging reconstruction problem as a tensorial estimation, according to a multidimensional multirate model. This includes both unfocused and focused scenarios. Exact reconstruction of the reflectivity tensor is possible through standard equalization techniques, such as linear equalizers and block estimation. The conditions and accuracy in reconstruction depend on the amount of space and/or time diversity introduced, and can be induced by oversampling or focusing, where in the latter redundancy is achieved via overlapping tensor blocks. The entire process can be seen as a 4D space and time convolution model, whose knowledge allows for enhanced resolution compared to standard matched (or mismatched) filter approaches. In the unfocused scenario, or when scanning is performed via nonoverlapping blocks, oversampling through biorthogonal partners is one way of achieving exact reconstruction. Range resolution in these cases is limited by stateoftheart ADCs, and for focused beams, in theory there is no limit on achievable resolution. An important special case arises in the farfield, narrowband, freespace scenario, which results in a 3D separable convolution model, described via Toeplitzlike linear systems in all 3 dimensions. Because the transmitted pulse and beam pattern functions are known, this yields a superfast equalizer receiver, which can be precomputed via either matrix inversion or a LS solution. The complexity and numerical properties of the solution are one of the key factors for feasibility of implementation.
Endnotes
^{a}Matched Filtering is also called migration in the geophysics jargon, and backprojection in Xray tomography terminology. ^{b}DecisionDirected Receiver. In situations where the range to be imaged is constituted by isolated targets, with a well defined threshold for deciding upon their existence or not, one can rely on decisiondirected estimates in order to improve the imaging of the targets. This can be achieved by MIMO DFE, or approximately, by first obtaining a rough estimate of the reflectivity, and then rewriting the linear model in order to focus only on the range where the targets exist. This opens up several possibilities for structures based on decisiondirected feedback, and makes the estimation more accurate by focusing only at the ranges where the target is located.
References
 1.
Cheney M: Tutorial on Radar Imaging, 10 Lectures at IMA. 2005.
 2.
Lane RO: Superresolution and the radar point spread function. Proceedings of the London Communications Symposium, UCL Dept., London 2005, 58.
 3.
Taxt T, Strand J: Two dimensional noiserobust blind deconvolution of Ultrasound Images. IEEE Trans Ultrason Ferroelectr Freq Control 2001, 48(4):555566.
 4.
Tavares PL: Seismic Data Multichannel Deconvolution Based on Digital Communications Blind Equalization Algorithms. In Masters Thesis. COPPE/UFRJ; 2007.
 5.
Zemp RJ, Abbey CK, Insana MF: Linear system models for ultrasonic imaging: application to signal statistics. IEEE Trans Ultrason Ferroelectr Freq Control 2003, 50(6):642654.
 6.
Haykin S: Adaptive Filter Theory. 2nd edition. Prentice Hall, Upper Saddle River, NJ; 1991.
 7.
Sayed AH: Fundamentals of Adaptive Filtering. John Wiley & Sons, NJ; 2003.
 8.
Nolan CJ, Cheney M: Synthetic aperture inversion for arbitrary flight paths and nonflat topography. IEEE Trans Image Process 2003, 12: 10351043. 10.1109/TIP.2003.814243
 9.
Yazıcı B, Xie G: Wideband extended rangedoppler imaging and waveform design in the presence of clutter and noise. IEEE Trans Inf Theory 2006, 52: 45634580.
 10.
Chen CY, Vaidyanathan PP: MIMO radar waveform optimization with prior information of the extended target and clutter. IEEE Trans Signal Process 2009, 57(9):35333544.
 11.
Xie Y, Guo B, Xu L, Li J, Stoica P: Multistatic adaptive microwave imaging for early breast cancer detection. IEEE Trans Biomed Eng 2006, 53: 16471657. 10.1109/TBME.2006.878058
 12.
Li J, Xu L, Stoica P, Forsythe KW, Bliss DW: Range compression and waveform optimization for MIMO radar: a CramérRao bound based study. IEEE Trans Biomed Eng 2008, 56: 218232.
 13.
Misaridis T, Jensen JA: Use of modulated excitation signals in medical imagingPart 1: basic concepts and expexted benefits. IEEE Trans Ultrason Ferroelectr Freq Control 2005, 52(2):177191.
 14.
Levanda R, Leshem A: Synthetic aperture radio telescopes. IEEE Signal Process Mag 2010, 27(1):1429.
 15.
Natterer F, Cheney M, Borden B: Resolution for Radar and xray tomography. Inst Phys Publ Inverse Probl 2003, 19: S55S64.
 16.
Jackson JD: Classical Electrodynamics. 3rd edition. John Wiley & Sons, Inc., NY; 1998.
 17.
YBarra GA, Liu QH, Stang JP, Joines WT: Microwave breast imaging. In Emerging Tech Breast Imaging Mammography Edited by: Suri J et al. 2007, Chapter 16: 112. USA/Canada
 18.
Fitzgerald AJ, Berry E, Zinovev NN, Walker GC, Smith MA, Chamberlain JM: An Introduction to medical imaging with coherent terahertz frequency radiation. Phys Med Bio 2002, 47: R67R84. 10.1088/00319155/47/7/201
 19.
Guo B, Wang Y, Wu R: Microwave imaging via adaptive beamforming methods for breast cancer detection. J Electromag Waves Appl 2006, 20(1):5363. 10.1163/156939306775777350
 20.
Fink M: Time reversal of ultrasonic fieldsPart 1: basic principles. IEEE Trans Ultrason Ferro Freq Control 1992, 39(5):555566.
 21.
Asif A, Bai Q, Moura JMF: Time reversal matched field processing: an analytical justification. IEEE Int Symposium on Signal Processing and Information Technology, Vancouver, BC 2006, 8186.
 22.
Moura JMF, Jin Y: Detection by time reversal: Single antenna. IEEE Trans Signal Process 2007, 55(1):187201.
 23.
Jin Y, Moura JMF, Jiang Y, Wahl M, Zhu H, He Q: Breast cancer detection by time reversal imaging. ISBI'08 IEEE International Symposium on Biomedical imaging, Paris, France 2008, 816819.
 24.
Yuanwei J, Moura JMF: TRSAR: time reversal target focusing in spotlight SAR. Proceedings of ICASSP, Honolulu, HI 2007, 2: 957960.
 25.
Varslot T, Yazıcı B, Cheney M: Wideband pulseecho imaging with distributed apertures in multipath environments. Inverse Probl 2008, 28: 128.
 26.
Griep KR, Ritcey JA, Burlingame JJ: Polyphase codes and optimal filters for multiple user ranging. IEEE Trans Aerosp Electron Syst 1995, 31(2):752767.
 27.
Stoica P, Li J, Xue M: Transmit codes and receive filters for pulse compression radar systems. Proceedidng of ICASSP, Las Vegas, NV 2008, 36493652.
 28.
Scaglione A, Giannakis GB, Barbarossa S: Redundant filterbank precoders and equalizers Part II: blind channel estimation, synchronization, and direct equaization. IEEE Trans Signal Process 1999, 47: 20072022. 10.1109/78.771048
 29.
Cilliers JE, Smit JC: Pulse compression sidelobe reduction by minimization of L_{ p }norms. IEEE Trans Aerosp Electron Syst 2007, 43(3):12381247.
 30.
Vrcelj B, Vaidyanathan PP: Fractional biorthogonal partners in channel equalization and signal interpolation. IEEE Trans Signal Process 2003, 51: 19281940. 10.1109/TSP.2003.812844
 31.
Vaidyanathan PP: Multirate Systems and Filter Banks. Prentice Hall, NJ; 1993.
 32.
Merched R: Application of superfast algorithms to pilotbased channel estimation schemes. Proc SPAWC, Recife, Brazil 2008, 141145.
 33.
Merched R: Superresolution and superfast receivers in freespace radar imaging. Proceedings of ICASSP, Prague, Czech Republic 2011, 12411244.
Author information
Affiliations
Corresponding author
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
About this article
Cite this article
Merched, R. Roles of equalization in radar imaging: modeling for superesolution in 3D reconstruction. EURASIP J. Adv. Signal Process. 2012, 106 (2012). https://doi.org/10.1186/168761802012106
Received:
Accepted:
Published:
Keywords
 radar imaging
 pulse compression
 resolution
 equalization
 tensor estimation