Skip to main content

A robust power allocation strategy based on benefit–cost ratio for multiple target guidance in the C-MIMO radar system under blanket jamming


How to utilize the limited power budget to accurately track more targets plays a critical role for the radar system in air defense applications, especially in the weapon guidance application. In this paper, we propose a robust power allocation (RPA) strategy in the collocated multiple-input and multiple-output (C-MIMO) radar system for multiple target guidance (MTG) under blanket jamming. The optimization model is established with the aim of increasing the number of effective tracking targets (ETTs) and improving the overall tracking accuracy among those targets. Since the mutual information (MI) quantifies the parameter estimation performance and can be predicted, the MI under blanket jamming is derived and utilized as the optimization criterion. We then propose a two-step optimization algorithm based on benefit–cost ratio (BCR) to solve the non-convex problem. Finally, numerical results are provided to demonstrate the effectiveness of the proposed algorithm.

1 Introduction

By adopting the simultaneous multi-beam (SM) working mode and exploiting the orthogonal signals, the collocated multi-input multi-output (C-MIMO) radar possesses the advantages such as waveform diversity [1], virtual aperture extension [2], and simultaneous multi-function [3], and has attracted much attention in recent years [1,2,3,4]. In practice, the limited resources are often the main factor restricting radar performance [5]. Therefore, in order to take full advantages of the C-MIMO radar system, the finite system resource should be efficiently managed.

Power allocation (PA) is a crucial theme in radar resource management for multiple target tracking (MTT). In many practical scenarios, the PA problem for MTT is mainly focused on: (1) Maximizing the overall tracking accuracy or the worst target tracking accuracy in all targets with the finite power budget [6,7,8,9,10]; (2) Minimizing the power consumption for given MTT performance requirements [11,12,13,14]. However, in practice, when combat missions are urgent, in order to guide more weapons to attack enemy objects accurately, the guidance radar system usually adopts in full power mode. In this case, the aforesaid PA strategies cannot be directly applied. In addition, for the guidance radar, the weapon launching conditions and enough guidance accuracy should be both satisfied [15]. Hence, the number of the effective tracking targets (ETTs) and the corresponding target tracking accuracy are both important performance indicators. Herein, the ETT denotes the target which satisfies the given tracking accuracy. However, to the best of our knowledge, the robust power allocation (RPA) strategy considering the above two optimization objectives is very limited.

Moreover, most of the existing studies on PA in the MIMO radar system are carried out under ideal detection condition, which is not common seen on the modern battlefields. Since the posterior Cramer–Rao lower bound (PCRLB) can provide a tight lower bound for any unbiased estimator [16,17,18], the PCRLB is usually used as the optimization metric in the resource allocation problems for MTT [5,6,7,8,9,10,11,12,13]. However, under the condition of electromagnetic interference, the process of target parameter estimation can be often very complex, which may cause the PCRLB cannot accurately quantify tracking accuracy. The MI is an information theoretic criterion, it has been proved that the more MI between radar echo and the target impulse response implies better capability of radar to estimate the target parameters [19]. In this case, the MI has been used in [20,21,22] for power allocation among the MIMO radars and jammers.

In this paper, the RPA strategy in the C-MIMO radar for multiple target guidance (MTG) application is studied under the blanket jamming environment. The MI between the reflected target signal and the path gain matrix is adopted to be the performance criterion for target parameter estimation. Based on the predicted MI, the RPA model is established to consider both the number of ETTs and their corresponding tracking accuracy, which formulated as a non-convex problem. In order to solve this problem, a two-step optimization algorithm based on benefit–cost ratio (BCR) [23] is proposed.

The rest of this paper is organized as follows: The system model of the C-MIMO radar along with the target motion model is demonstrated in Sect. 2. Section 3 derives the MI in terms of the PA and establishes a cognitive tracking scheme. In Sect. 4, the optimization model is formulated and an efficient solving algorithm is proposed. The numerical results and analysis are given in Sect. 5. Section 6 concludes this paper.

2 System model

Consider that a narrowband C-MIMO radar is located at (x0, y0). In order to track Q point-like enemy objects which carry with active oppressive jammers, a set of orthogonal and coherent pulse train signals are transmitted. In order to simplify the model analysis, we make the following assuptions:

  1. (1)

    The number and initial position of the tracked targets are known in advance as priori knwoledge;

  2. (2)

    Each target carries a self-defense jammer that continuously transmits jamming signal to the radar, which is modeled as Gaussian white noise;

  3. (3)

    The C-MIMO radar works in the SM pattern, and simultaneously transmits multiple orthogonal beams to track targets.

2.1 Radar signal model

Consider that the transmit signal for the qth target at the kth sample interval is normalized as \(s_{k,q} (t)\), and the number of transmit pulse trains in one measurement is \(L\), thus the lth pulse is [5]

$$s_{k,q,l} \left( t \right) = s_{k,q} \left[ {t^{\prime} + \left( {l - 1} \right)T_{{\text{p}}} } \right]$$

where Tp is the pulse repetition period, and \(t^{\prime}\) is the slow time. Moreover, the lth pulse in the received baseband signal is given by

$$r_{{k,q,l}} \left( t \right) = \delta _{{k,q}} h_{{k,q,l}} \sqrt {P_{{k,q}} } s_{{k,q}} \left[ {t - \left( {l - 1} \right)T_{p} - \tau _{{k,q}} } \right]e^{{ - j2\pi f_{{k,q}}^{{\text{d}}} t}} + n_{{k,q,l}} \left( t \right) + j_{{k,q,l}} \left( t \right)$$

where \(\delta_{k,q}\) denotes the attenuation of signal strength due to the path loss. \(h_{k,q,l} = h_{k,q,l}^{{\text{R}}} + h_{k,q,l}^{{\text{I}}}\) is the radar cross-section (RCS), modeled as a zero-mean white complex noise with variance \(\sigma_{k,q}^{2}\), denoted as \(h_{k,q,l} \sim CN(0,\sigma_{k,q}^{2} )\). \(P_{k,q}\) is the transient transmit power, the terms of \(\tau_{k,q}\) and \(f_{k,q}^{{\text{d}}}\) are the time delay and the Doppler frequency, respectively. \(n_{k,q,l} (t)\) represents the inherent environmental noise, modeled as \(n_{k,q,l} (t) \sim CN(0,\alpha_{k,q}^{2} )\), and \(j_{k,q,l} (t)\) denotes the oppressive jamming noise imposed by the jammer, distributed as \(j_{k,q,l} (t) \sim CN(0,\beta_{k,q}^{2} )\).

Aim at improving the echo signal–noise ratio (SNR), the coherent pulse accumulation technique is applied to the echo signal processing. Thus, the sampled signals of \(r_{k,q,l} (t)\) are

$${\hat{\mathbf{r}}}_{k,q,l} = {\hat{\mathbf{s}}}_{k,q,l} g_{k,q,l} + {\hat{\mathbf{n}}}_{k,q,l} + {\hat{\mathbf{j}}}_{k,q,l}$$

where \({\hat{\mathbf{s}}}_{k,q,l} \in {\mathbb{C}}^{M \times 1}\) denotes the sampling of \(s_{k,q,l} (t)\), and M indicates the sampling length. \({\hat{\mathbf{n}}}_{k,q,l} \in {\mathbb{C}}^{M \times 1}\) represents the sampling of \(n_{k,q,l} (t)\), \({\hat{\mathbf{j}}}_{k,q,l} \in {\mathbb{C}}^{M \times 1}\) is the sampling of \(j_{k,q,l} (t)\), and \(g_{k,q,l}\) dentoes the path gain coefficient, distributed as \(g_{k,q,l} \sim N(0,\gamma_{k,q}^{2} )\), with [7]

$$\gamma_{k,q}^{2} \propto P_{k,q} \sigma_{k,q}^{2} /R_{k,q}^{4}$$

where \(\propto\) is the proportional notation, and \(R_{k,q}\) is the distance from the qth target to the radar center at sample interval k.

Suppose that the C-MIMO radar continuously transmits \(N \le L\) pulses to the qth target at the kth sample interval, which means the number of coherent accumulation pulses in a detection is N. Then, the relevant echo signal can be denoted as

$$\begin{aligned} {\hat{\mathbf{R}}}_{k,q} & = \left( {{\hat{\mathbf{r}}}_{k,q,1} ,{\hat{\mathbf{r}}}_{k,q,2,} ...,{\hat{\mathbf{r}}}_{k,q,N} } \right) \\ & { = }{\hat{\mathbf{S}}}_{k,q} {\hat{\mathbf{G}}}_{k,q} + {\hat{\mathbf{N}}}_{k,q} + {\hat{\mathbf{J}}}_{k,q} \\ \end{aligned}$$


$$\left\{ {\begin{array}{*{20}l} {{\hat{\mathbf{S}}}_{k,q} = \left( {{\hat{\mathbf{s}}}_{k,q,1} ,{\hat{\mathbf{s}}}_{k,q,2} , \ldots ,{\hat{\mathbf{s}}}_{k,q,N} } \right)} \hfill \\ {{\hat{\mathbf{G}}}_{k,q} = {\text{diag}}\left( {g_{k,q,1} ,g_{k,q,2} , \ldots ,g_{k,q,N} } \right)} \hfill \\ {{\hat{\mathbf{N}}}_{k,q} = \left( {{\hat{\mathbf{n}}}_{k,q,1} ,{\hat{\mathbf{n}}}_{k,q,2} , \ldots ,{\hat{\mathbf{n}}}_{k,q,N} } \right)} \hfill \\ {{\hat{\mathbf{J}}}_{k,q} = \left( {{\hat{\mathbf{j}}}_{k,q,1} ,{\hat{\mathbf{j}}}_{k,q,2} , \ldots ,{\hat{\mathbf{j}}}_{k,q,N} } \right)} \hfill \\ \end{array} } \right.$$

In addition, to simplify the signal model, we assume that the transmit pulse waveforms are exactly the same. Thus, we have \({\hat{\mathbf{s}}}_{{k,q,1}} = {\hat{\mathbf{s}}}_{{k,q,2}} = \cdots = {\hat{\mathbf{s}}}_{{k,q,N}} = {\hat{\mathbf{s}}}_{{k,q}}\).

2.2 Target motion model

Without loss of generality, we assume that the target motion model can be described by the constant velocity (CV) model [6]. In this case, the qth target state is denoted by \({\mathbf{x}}_{k,q} = [x_{k,q} ,\dot{x}_{k,q} ,y_{k,q} ,\dot{y}_{k,q} ]^{{\text{T}}}\), where \([x_{k,q} ,y_{k,q} ]^{{\text{T}}}\) and \([\dot{x}_{k,q} ,\dot{y}_{k,q} ]^{{\text{T}}}\) represent the position and velocity at the kth sample interval in Cartesian coordinates. The target state transition model can be expressed by [7]

$${\mathbf{x}}_{k + 1,q} = {\mathbf{Fx}}_{k,q} + {\mathbf{w}}_{k,q}$$

where \({\mathbf{F}}\) denotes the state transition matrix of the CV model. The term of \({\mathbf{w}}_{k,q}\) represents an uncorrelated process noise sequence and is assumed to be a zero-mean Gaussian noise with the covariance matrix \({\mathbf{Q}}_{k,q}\). Herein, \({\mathbf{F}}\) and \({\mathbf{Q}}_{k,q}\) are given by [8]

$${\mathbf{F}} = \left[ {\begin{array}{*{20}c} 1 & {T_{{\text{s}}} } \\ 0 & 1 \\ \end{array} } \right] \otimes {\mathbf{I}}_{2}$$


$$\begin{aligned} {\mathbf{Q}}_{k,q} & = {\mathbb{E}}\left[ {{\mathbf{w}}_{k,q} \left( {{\mathbf{w}}_{k,q} } \right)^{{\text{T}}} } \right] \\ & = \left( {\int_{0}^{{T_{{\text{S}}} }} {m_{k,q} \left[ {\begin{array}{*{20}c} {T_{{\text{s}}} - t} \\ 1 \\ \end{array} } \right]} \left[ {\begin{array}{*{20}c} {T_{{\text{s}}} - t} & 1 \\ \end{array} } \right]{\text{d}}t} \right) \otimes {\mathbf{I}}_{2} \\ & { = }m_{k,q} \left[ {\begin{array}{*{20}c} {T_{{\text{s}}}^{{3}} /3} & {T_{{\text{s}}}^{{2}} /2} \\ {T_{{\text{s}}}^{{2}} /2} & {T_{s} } \\ \end{array} } \right] \otimes {\mathbf{I}}_{2} \\ \end{aligned}$$

where \(T_{{\text{s}}}\) is the sample interval, \({\mathbf{I}}_{2}\) denotes the second-order identity matrix, \(\otimes\) represents the Kronecker product operator, and \(m_{k,q}\) is the relevant process noise intensity [8].

2.3 Measurement model

According to the receive signal model in (2) and (5), the conditional probability density function (PDF) \(p({\hat{\mathbf{R}}}_{k,q} |{{\varvec{\upxi}}}_{k,q} )\) is given by

$$p\left( {\left. {{\hat{\mathbf{R}}}_{{k,q}} } \right|{\mathbf{\xi }}_{{k,q}} } \right) \propto \exp \left\{ { - \frac{1}{{\delta _{{k,q}}^{2} }}} \right.\int {|r_{{k,q,l}}^{{}} \left( t \right) - \alpha _{{k,q}}^{{}} h_{{k,q,l}}^{{}} } \left. {\sqrt {P_{{k,q}} } {\text{ }}s_{{k,q}} \left( {t - \tau _{{k,q}} } \right)e^{{ - j2\pi f_{{k,q}}^{{\text{d}}} t}} - j_{{k,q,l}}^{{}} \left( t \right)|{\text{d}}t} \right\}$$

where \({{\varvec{\upxi}}}_{k,q}^{{}} = [R_{k,q}^{{}} ,f_{k,q}^{{\text{d}}} ,\theta_{k,q}^{{}} ]^{{\text{T}}}\). Moreover, by adopting the maximum likelihood (ML) estimate method [24], the ML estimate of \({{\varvec{\upxi}}}_{k,q}^{{}}\) can be calculated as

$$\begin{aligned} \left\{ {{\hat{\mathbf{\xi }}}_{k,q}^{{}} } \right\}_{{{\text{ML}}}} & = \arg \left\{ {\mathop {\max }\limits_{{{{\varvec{\upxi}}}_{k,q} }} \left[ {\ln p\left( {\left. {{\hat{\mathbf{R}}}_{k,q} } \right|{{\varvec{\upxi}}}_{k,q} } \right)} \right]} \right\} \\ & = \arg \left\{ {\mathop {\max }\limits_{{{{\varvec{\upxi}}}_{k,q} }} \left[ {\sum\limits_{l = 1}^{L} {\left| {\int {r_{k,q,l}^{{}} \left( t \right) \times s_{k,q}^{ * } \left( {t - \tau_{k,q}^{{}} } \right)} } \right.} } \right.} \right.\left. {\left. {\left. { \times e^{{{ - j2\pi f_{k,q}{\text{d}}} t}} dt} \right|^{2} } \right]} \right\} \\ \end{aligned}$$

Therefore, the target information can be extracted from the receive signal, e.g., the time-delay, the Doppler frequency and the bearing angle. The measurement model can be expressed as

$${\mathbf{z}}_{k,q} = h\left( {{\mathbf{x}}_{k,q} } \right) + {{\varvec{\upnu}}}_{k,q}$$

where \(h({\mathbf{x}}_{k,q} ) = [R_{k,q} ,\dot{R}_{k,q} ,\theta_{k,q} ]^{{\text{T}}}\), \(R_{k,q}\), \(\dot{R}_{k,q}\) and \(\theta_{k,q}\) denote the range, radial velocity, and bearing angle of the qth target at the kth sample interval, respectively, i.e., given by

$$\left\{ {\begin{array}{*{20}l} {R_{k,q} = \sqrt {\left( {x_{k,q} - x_{0} } \right)^{2} + \left( {y_{k,q} - y_{0} } \right)^{2} } } \hfill \\ {\dot{R}_{k,q} = \left( {\dot{x}_{k,q} ,\dot{y}_{k,q} } \right)\left( \begin{gathered} x_{k,q} - x_{0} \hfill \\ y_{k,q} - y_{0} \hfill \\ \end{gathered} \right)/R_{k,q} } \hfill \\ {\theta_{k,q} = \arctan {{\left( {y_{k,q} - y_{0} } \right)} \mathord{\left/ {\vphantom {{\left( {y_{k,q} - y_{0} } \right)} {\left( {x_{k,q} - x_{0} } \right)}}} \right. \kern-\nulldelimiterspace} {\left( {x_{k,q} - x_{0} } \right)}}} \hfill \\ \end{array} } \right.$$

The measurement noise \({{\varvec{\upnu}}}_{k,q} \sim N({\mathbf{0}},{\mathbf{\Re }}_{k,q} )\), with \({\mathbf{\Re }}_{k,q} = {\text{diag}}(\sigma_{{R_{k,q} }}^{2} ,\sigma_{{\dot{R}_{k,q} }}^{2} ,\sigma_{{\theta_{k,q} }}^{2} )\). In addition, the elements in \({\mathbf{\Re }}_{k,q}\) are given by [9]

$$\left\{ {\begin{array}{*{20}l} {\sigma_{{R_{k,q} }}^{2} \propto \left( {\alpha_{k,q} P_{k,q} \left| {h_{k,q} } \right|^{2} \beta_{k,q}^{2} } \right)^{ - 1} } \hfill \\ {\sigma_{{\dot{R}_{k,q} }}^{2} \propto \left( {\alpha_{k,q} P_{k,q} \left| {h_{k,q} } \right|^{2} T_{k,q}^{2} } \right)^{ - 1} } \hfill \\ {\sigma_{{\theta_{{_{k,q} }} }}^{2} \propto \left( {\alpha_{k,q} P_{k,q} \left| {h_{k,q} } \right|^{2} /B_{{\text{w}}} } \right)^{ - 1} } \hfill \\ \end{array} } \right.$$

where \(\beta_{k,q}\) denotes the effective bandwidth, \(T_{k,q}\) is the effective time width, and \(B_{{\text{w}}}\) is the null-to-null beam width of receive antennas [25]. Moreover, it should be noted that all the elements in (14) are inversely linear with \(P_{k,q}\) [12], and thus, the measurement covariance can be rewritten as \({\mathbf{\Re }}_{k,q} = P_{k,q}^{ - 1} {{\varvec{\upchi}}}_{k,q}\). In this case, it is theoretically possible to achieve higher tracking accuracy by increasing the transmit power allocation for a certain target.

3 Cognitive tracking scheme based on MI under blanket jamming

3.1 Multi-beam target tracking with PSCKF

In the tracking process, the C-MIMO radar works in multi-beam mode and each beam operates in the “focused transmit focused receive” (FTFR) manner [13]. Therefore, the MTT task can be divided into a series of single target tracking [7]. In view of the nonlinear Gaussian state space model given in Sects. 2.2 and 2.3, the parallel square-root cubature Kalman filter (PSCKF) is adopted in target state estimation.

From the Bayesian theory perspective, based on recursive method, the PDF for target tracking is obtained by utilizing the known initial state probability density and system measurement value. In this case, after the system state transition model and the measurement model are probabilized, the state estimation of the nonlinear discrete system model (7) and (12) in the Gaussian domain can be normalized as

$$\left\{ {\begin{array}{*{20}l} {{\mathbf{x}}_{{k + 1\left| {k + 1,q} \right.}} = {\hat{\mathbf{x}}}_{{k + 1\left| {k + 1,q} \right.}} + {\mathbf{L}}_{k + 1,q} \left( {{\mathbf{z}}_{k + 1,q} - {\hat{\mathbf{z}}}_{k + 1,q} } \right)} \hfill \\ {{\mathbf{P}}_{{k + 1\left| {k + 1,q} \right.}} = {\mathbf{P}}_{{k + 1\left| {k,q} \right.}} - {\mathbf{L}}_{k + 1,q} {\mathbf{P}}_{k + 1,q}^{zz} {\mathbf{L}}_{k + 1,q}^{{\text{T}}} } \hfill \\ {{\mathbf{L}}_{k + 1,q} = {\mathbf{P}}_{k + 1,q}^{xz} \left( {{\mathbf{P}}_{k + 1,q}^{zz} } \right)^{ - 1} } \hfill \\ \end{array} } \right.$$

where \({\mathbf{P}}_{{k + 1\left| {k + 1,q} \right.}}\) is the state transition covariance matrix, \({\mathbf{P}}_{{k + 1\left| {k,q} \right.}}\) is the one-step predictive covariance matrix, \({\mathbf{L}}_{k + 1,q}\) is the Kalman gain, \({\mathbf{P}}_{k + 1,q}^{zz}\) is the measurement autocorrelation covariance matrix, and \({\mathbf{P}}_{k + 1,q}^{xz}\) is the state-measurement cross-correlation covariance matrix. To compute all the parameters in (15), the recursive calculation of PSCKF is given below.

FOR \(q = 1,2,...,Q\).

  1. (1)

    Time Update

Assume that the state transition covariance matrix \({\mathbf{P}}_{{k\left| {k,q} \right.}}\) is known, and the posterior density at the kth sample interval satisfies that \(p({\mathbf{x}}_{k,q} |{\mathbf{z}}_{k,q} )\sim N({\hat{\mathbf{x}}}_{{k\left| {k,q} \right.}} ,{\mathbf{P}}_{{k\left| {k,q} \right.}} )\). Then, we have.

Step 1.1: Cholesky decomposition.

$${\mathbf{P}}_{{k\left| {k,q} \right.}} = {\mathbf{S}}_{{k\left| {k,q} \right.}} \left( {{\mathbf{S}}_{{k\left| {k,q} \right.}} } \right)^{{\text{T}}}$$

Step 1.2: Calculate each cubature point and make one-step prediction.

$$\left\{ {\begin{array}{*{20}l} {{\mathbf{x}}_{{k\left| {k,q} \right.}}^{i} = {\hat{\mathbf{x}}}_{{k\left| {k,q} \right.}}^{{}} + {\mathbf{S}}_{{k\left| {k,q} \right.}} {{\varvec{\upxi}}}_{i} } \hfill \\ {{\mathbf{x}}_{{k + 1\left| {k,q} \right.}}^{i * } = {\mathbf{Fx}}_{{k\left| {k,q} \right.}}^{i} } \hfill \\ \end{array} } \right.$$

where \(i = 1,2,...,m\), \(m = 2n\), and n is the dimension of the state vector. Herein, \({{\varvec{\upxi}}}_{i} = \sqrt {m/2} [{\mathbf{1}}]\), where \([{\mathbf{1}}]\) is given by

$$\left[ {\mathbf{1}} \right] = \left\{ {\left( {\begin{array}{*{20}c} 1 \\ 0 \\ \ldots \\ 0 \\ \end{array} } \right),\left( {\begin{array}{*{20}c} 0 \\ 1 \\ \ldots \\ 0 \\ \end{array} } \right), \ldots \left( {\begin{array}{*{20}c} 0 \\ 0 \\ \ldots \\ 1 \\ \end{array} } \right),\left( {\begin{array}{*{20}c} { - 1} \\ 0 \\ \ldots \\ 0 \\ \end{array} } \right),\left( {\begin{array}{*{20}c} 0 \\ { - 1} \\ \ldots \\ 0 \\ \end{array} } \right), \ldots \left( {\begin{array}{*{20}c} 0 \\ 0 \\ \ldots \\ { - 1} \\ \end{array} } \right)} \right\}$$

Step 1.3: Estimate the square-root coefficient of the covariance matrix of state prediction value and state prediction error.

$$\left\{ {\begin{array}{*{20}l} {{\hat{\mathbf{x}}}_{{k + 1\left| {k,q} \right.}} = \frac{1}{m}\sum\limits_{i = 1}^{m} {{\mathbf{x}}_{k + 1\left| k \right.,q}^{i \, * } } } \hfill \\ {{\mathbf{S}}_{k + 1\left| k \right.,q} = {\text{Tria}}\left( {\left[ {{\mathbf{x}}_{k + 1\left| k \right.,q}^{ * } ,{\text{Chol}}\left( {{\mathbf{Q}}_{k,q} } \right)} \right]} \right)} \hfill \\ {{\mathbf{z}}_{{k + 1\left| {k,q} \right.}} = \frac{1}{\sqrt m }\left[ {{\mathbf{z}}_{{k + 1\left| {k,q} \right.}}^{1} - {\hat{\mathbf{z}}}_{{k + 1\left| {k,q} \right.}} ,{\mathbf{z}}_{{k + 1\left| {k,q} \right.}}^{2} - {\hat{\mathbf{z}}}_{{k + 1\left| {k,q} \right.}} , \ldots ,{\mathbf{z}}_{{k + 1\left| {k,q} \right.}}^{m} - {\hat{\mathbf{z}}}_{{k + 1\left| {k,q} \right.}} } \right]} \hfill \\ \end{array} } \right.$$

where \({\text{Tria(}} \cdot {)}\) and \({\text{Chol(}} \cdot {)}\) denote the QR decomposition operator and the Cholesky decomposition operator, respectively.

  1. (2)

    Measurement Update

Step 2.1: Estimate each cubature point and the square-root coefficient of the new covariance.

$$\left\{ {\begin{array}{*{20}l} {{\mathbf{x}}_{{k + 1\left| {k,q} \right.}}^{i} = {\hat{\mathbf{x}}}_{k + 1\left| k \right.,q}^{{}} + {\mathbf{S}}_{k + 1\left| k \right.,q} {{\varvec{\upxi}}}_{i} } \hfill \\ {{\mathbf{z}}_{k + 1\left| k \right.,q}^{i} = h\left( {{\mathbf{x}}_{k + 1\left| k \right.,q}^{i} } \right)} \hfill \\ {{\hat{\mathbf{z}}}_{k + 1\left| k \right.,q} = \frac{1}{m}\sum\limits_{i = 1}^{m} {{\mathbf{z}}_{k + 1\left| k \right.,q}^{i} } } \hfill \\ {{\mathbf{z}}_{k + 1\left| k \right.,q} = \frac{1}{\sqrt m }\left[ {{\mathbf{z}}_{k + 1\left| k \right.,q}^{1} - {\hat{\mathbf{z}}}_{k + 1\left| k \right.,q} ,{\mathbf{z}}_{k + 1\left| k \right.,q}^{2} - {\hat{\mathbf{z}}}_{k + 1\left| k \right.,q} , \ldots ,{\mathbf{z}}_{k + 1\left| k \right.,q}^{m} - {\hat{\mathbf{z}}}_{k + 1\left| k \right.,q} } \right]} \hfill \\ {{\mathbf{S}}_{k + 1\left| k \right.,q}^{ZZ} = {\text{Tria}}\left( {\left[ {{\mathbf{z}}_{k + 1\left| k \right.,q}^{{}} ,{\text{Chol}}\left( {{\mathbf{\Re }}_{k,q} } \right)} \right]} \right)} \hfill \\ \end{array} } \right.$$

Step 2.2: Calculate the new information covariance matrix and the cross-covariance matrix of state and measurement.

$$\left\{ {\begin{array}{*{20}l} {{\mathbf{P}}_{k + 1\left| k \right.,q}^{zz} = {\mathbf{S}}_{k + 1\left| k \right.,q}^{zz} \left( {{\mathbf{S}}_{k + 1\left| k \right.,q}^{zz} } \right)^{{\text{T}}} } \hfill \\ {{\mathbf{x}}_{k + 1\left| k \right.,q} = \frac{1}{\sqrt m }\left[ {{\mathbf{x}}_{k + 1\left| k \right.,q}^{1} - {\hat{\mathbf{x}}}_{k + 1\left| k \right.,q}^{{}} ,{\mathbf{x}}_{k + 1\left| k \right.,q}^{2} - {\hat{\mathbf{x}}}_{k + 1\left| k \right.,q} , \ldots ,{\mathbf{x}}_{k + 1\left| k \right.,q}^{m} - {\hat{\mathbf{x}}}_{k + 1\left| k \right.,q} } \right]} \hfill \\ {{\mathbf{P}}_{k + 1\left| k \right.,q}^{xz} = {\mathbf{x}}_{k + 1\left| k \right.,q} \left( {{\mathbf{z}}_{k + 1\left| k \right.,q} } \right)^{{\text{T}}} } \hfill \\ \end{array} } \right.$$

Step 2.3: Update the filter gain and compute the posterior state estimation.

$$\left\{ \begin{gathered} {\mathbf{L}}_{k + 1,q}^{{}} = {\mathbf{P}}_{k + 1\left| k \right.,q}^{xz} \left( {{\mathbf{P}}_{k + 1\left| k \right.,q}^{zz} } \right)^{ - 1} \hfill \\ {\hat{\mathbf{x}}}_{k + 1,q}^{{}} = {\hat{\mathbf{x}}}_{k + 1\left| k \right.,q}^{{}} + {\mathbf{L}}_{k + 1,q}^{{}} \left( {{\mathbf{z}}_{k + 1,q}^{{}} - {\hat{\mathbf{z}}}_{k + 1\left| k \right.,q}^{{}} } \right) \hfill \\ \end{gathered} \right.$$

Step 2.4: Recersion. Update the square-root coefficient of state error covariance matrix

$${\mathbf{S}}_{{k + 1\left| {k + 1} \right.,q}} = {\text{Tria}}\left( {\left[ {{\mathbf{x}}_{k + 1\left| k \right.,q}^{{}} - {\mathbf{L}}_{k + 1,q}^{{}} {\mathbf{z}}_{k + 1\left| k \right.,q}^{{}} ,{\mathbf{L}}_{k + 1,q}^{{}} {\text{Chol}}\left( {{\mathbf{\Re }}_{k,q} } \right)} \right]} \right)$$

Then, let \(k = k + 1\), and return to step 1.1.

3.1.1 END FOR.

3.2 MI derivation under blanket jamming

Assume that the transmit signal of \({\hat{\mathbf{s}}}_{k,q}\) is known, thus the pulse matrix \({\hat{\mathbf{S}}}_{k,q}\) can be obtained. Then, the MI between \({\hat{\mathbf{R}}}_{k,q}\) and \({\hat{\mathbf{G}}}_{k,q}\) can be expressed as [26]

$$\begin{aligned} I\left( {{\hat{\mathbf{R}}}_{k,q} ;{\hat{\mathbf{G}}}_{k,q} \left| {{\hat{\mathbf{S}}}_{k,q} } \right.} \right) & = H\left( {{\hat{\mathbf{R}}}_{k,q} \left| {{\hat{\mathbf{S}}}_{k,q} } \right.} \right) - H\left( {{\hat{\mathbf{R}}}_{k,q} \left| {{\hat{\mathbf{G}}}_{k,q} ,{\hat{\mathbf{S}}}_{k,q} } \right.} \right) \\ & { = }H\left( {{\hat{\mathbf{R}}}_{k,q} \left| {{\hat{\mathbf{S}}}_{k,q} } \right.} \right) - H\left( {{\hat{\mathbf{N}}}_{k,q} + {\hat{\mathbf{J}}}_{k,q} } \right) \\ \end{aligned}$$

where \(I({\hat{\mathbf{R}}}_{k,q} ;{\hat{\mathbf{G}}}_{k,q} |{\hat{\mathbf{S}}}_{k,q} )\) denotes the MI for the qth target. The term of \(H( \cdot )\) represents the differential entropy operator, which satisfies that, i.e., \(H({\mathbf{x}}) = - \int {p({\mathbf{x}})} \log p{(}{\mathbf{x}}{\text{)d}}{\mathbf{x}}\) and \(H({\mathbf{x}}\left| {\mathbf{y}} \right.) = - \int {\int {p({\mathbf{x}}\left| {\mathbf{y}} \right.)\log p{(}{\mathbf{x}}\left| {\mathbf{y}} \right.{\text{)d}}{\mathbf{x}}{\text{d}}{\mathbf{y}}} }\). Herein, \(p({\mathbf{x}})\) and \(p({\mathbf{x}}\left| {\mathbf{y}} \right.)\) are the PDF of \({\mathbf{x}}\) and the conditional PDF of \({\mathbf{x}}\) with respect to \({\mathbf{y}}\), respectively. In this case, \(p({\hat{\mathbf{R}}}_{k,q} |{\hat{\mathbf{S}}}_{k,q} )\) and \(p({\hat{\mathbf{N}}}_{k,q} + {\hat{\mathbf{J}}}_{k,q} )\) are given by

$$p\left( {{\hat{\mathbf{R}}}_{k,q} \left| {{\hat{\mathbf{S}}}_{k,q} } \right.} \right) = \frac{{\exp \left\{ { - {\text{tr}}\left[ {\left( {\gamma_{k,q}^{2} {\hat{\mathbf{s}}}_{k,q} {\hat{\mathbf{s}}}_{k,q}^{{\text{H}}} + \left( {\alpha_{k,q}^{2} + \beta_{k,q}^{2} } \right){\mathbf{I}}_{M} } \right)^{ - 1} {\hat{\mathbf{R}}}_{k,q} {\hat{\mathbf{R}}}_{k,q}^{{\text{H}}} } \right]} \right\}}}{{\pi^{MN} \left[ {\det \left( {\gamma_{k,q}^{2} {\hat{\mathbf{s}}}_{k,q} {\hat{\mathbf{s}}}_{k,q}^{{\text{H}}} + \left( {\alpha_{k,q}^{2} + \beta_{k,q}^{2} } \right){\mathbf{I}}_{M} } \right)} \right]^{N} }}$$


$$p\left( {{\hat{\mathbf{N}}}_{k,q} + {\hat{\mathbf{J}}}_{k,q} } \right) = \frac{{\exp \left\{ { - {\text{tr}}\left[ {\left( {\left( {\alpha_{k,q}^{2} + \beta_{k,q}^{2} } \right){\mathbf{I}}_{M} } \right)^{ - 1} \left( {{\hat{\mathbf{N}}}_{k,q} + {\hat{\mathbf{J}}}_{k,q} } \right)\left( {{\hat{\mathbf{N}}}_{k,q} + {\hat{\mathbf{J}}}_{k,q} } \right)^{{\text{H}}} } \right]} \right\}}}{{\pi^{MN} \left[ {\det \left( {\left( {\alpha_{k,q}^{2} + \beta_{k,q}^{2} } \right){\mathbf{I}}_{M} } \right)} \right]^{N} }}$$

Thus, \(H({\hat{\mathbf{R}}}_{k,q} |{\hat{\mathbf{S}}}_{k,q} )\) and \(H({\hat{\mathbf{N}}}_{k,q} + {\hat{\mathbf{J}}}_{k,q} )\) can be calculated as

$$\begin{aligned} H\left( {{\hat{\mathbf{R}}}_{k,q} \left| {{\hat{\mathbf{S}}}_{k,q} } \right.} \right) & = - \int {\int {p\left( {{\hat{\mathbf{R}}}_{k,q} \left| {{\hat{\mathbf{S}}}_{k,q} } \right.} \right)\log p\left( {{\hat{\mathbf{R}}}_{k,q} \left| {{\hat{\mathbf{S}}}_{k,q} } \right.} \right){\text{d}}{\hat{\mathbf{R}}}_{k,q} {\text{d}}{\hat{\mathbf{S}}}_{k,q} } } \\ & = - \int {p\left( {{\hat{\mathbf{R}}}_{k,q} \left| {{\hat{\mathbf{S}}}_{k,q} } \right.} \right)\log p\left( {{\hat{\mathbf{R}}}_{k,q} \left| {{\hat{\mathbf{S}}}_{k,q} } \right.} \right){\text{d}}{\hat{\mathbf{R}}}_{k,q} } \\ & = MN + N\log \left[ {\pi^{M} \det \left( {\gamma_{k,q}^{2} {\hat{\mathbf{s}}}_{k,q} {\hat{\mathbf{s}}}_{k,q}^{{\text{H}}} + \left( {\alpha_{k,q}^{2} + \beta_{k,q}^{2} } \right){\mathbf{I}}_{M} } \right)} \right] \\ \end{aligned}$$


$$\begin{aligned} H\left( {{\hat{\mathbf{N}}}_{k,q} + {\hat{\mathbf{J}}}_{k,q} } \right) & = - \int {p\left( {{\hat{\mathbf{N}}}_{k,q} + {\hat{\mathbf{J}}}_{k,q} } \right)\log p\left( {{\hat{\mathbf{N}}}_{k,q} + {\hat{\mathbf{J}}}_{k,q} } \right){\text{d}}{\hat{\mathbf{N}}}_{k,q} } \\ & \quad - \int {p\left( {{\hat{\mathbf{N}}}_{k,q} + {\hat{\mathbf{J}}}_{k,q} } \right)\log p\left( {{\hat{\mathbf{N}}}_{k,q} + {\hat{\mathbf{J}}}_{k,q} } \right){\text{d}}{\hat{\mathbf{J}}}_{k,q} \, } \\ & = MN + N\log \left[ {\pi^{M} \det \left( {\left( {\alpha_{k,q}^{2} + \beta_{k,q}^{2} } \right){\mathbf{I}}_{M} } \right)} \right] \\ \end{aligned}$$

Accordingly, combined with (24), (27) and (28), we have

$${\text{MI}}_{k,q} = N\log \frac{{\det \left( {\gamma_{k,q}^{2} {\hat{\mathbf{s}}}_{k,q} {\hat{\mathbf{s}}}_{k,q}^{{\text{H}}} + \left( {\alpha_{k,q}^{2} + \beta_{k,q}^{2} } \right){\mathbf{I}}_{M} } \right)}}{{\left( {\alpha_{k,q}^{2} + \beta_{k,q}^{2} } \right)^{M} }}$$

It can be seen from (4) and (29) that \(P_{k,q}\) is the only variable of \({\text{MI}}_{k,q}\) under the condition that the electromagnetic environment is regular and the pulse number is constant.

3.3 Cognitive tracking scheme based on transmit power

After obtaining the measurement information of all the tracked targets, the power-limited C-MIMO radar can calculate the power allocation results at the next tracking period according to the above steps. Hence, by feeding back the power allocation results to the C-MIMO radar transmitter, a close-loop feedback scheme for power allocation is established. The established cognitive tracking scheme is shown in Fig. 1.

Fig. 1
figure 1

Illustration of the proposed cognitive tracking scheme for RPA strategy

4 Optimization model establishment and solution

In this section, we formulate the optimization problem of the RPA based on the MI. In order to solve the optimization problem, a two-step optimization scheme based on the idea of BCR is proposed.

4.1 Problem formulation

Since the MI between the echo signals and path gain matrix is closely related to the accuracy of parameter estimation [27, 28] and has an inverse relationship with mean square error (MSE) [29], we adopt the MI as the criterion of target tracking accuracy. Moreover, in order to establish a closed-loop tracking recursive cycle, the predicted MI should be calculated to guide PA. To be specific, by adopting the PSCKF, the target state estimation \({\tilde{\mathbf{x}}}_{k - 1,q}\) can be obtained. Then, the predicted MI is obtained by combining with the predicted target state \({\tilde{\mathbf{x}}}_{\left. k \right|k - 1,q} = {\mathbf{F}}{\tilde{\mathbf{x}}}_{k - 1,q}\). We consider finding the suboptimal RPA which combines with the effective tracking quantity and target tracking accuracy. Hence, the problem is formulated as

$$\begin{aligned} & {\text{max}}\left\{ {\sum\limits_{q = 1}^{Q} {\left[ {u_{k,q} \left. {{\text{MI}}_{k,q} \left( {P_{k,q} } \right)} \right|_{{{\tilde{\mathbf{x}}}_{\left. k \right|k - 1,q} }} } \right]} } \right\} \\ & \quad {\text{s.t.}}\left\{ {\begin{array}{*{20}l} {u_{k,q} = 1,} \hfill & {\left. {{\text{MI}}_{k,q} \left( {P_{k,q} } \right)} \right|_{{{\tilde{\mathbf{x}}}_{\left. k \right|k - 1,q} }} \ge {\text{MI}}_{0} } \hfill \\ {u_{k,q} = 0,} \hfill & {\left. {{\text{MI}}_{k,q} \left( {P_{k,q} } \right)} \right|_{{{\tilde{\mathbf{x}}}_{\left. k \right|k - 1,q} }} < {\text{MI}}_{0} } \hfill \\ \end{array} } \right. \\ & \quad \quad \sum\limits_{q = 1}^{Q} {P_{k,q} = P_{{{\text{total}}}} \, } ,\quad P_{\min } \le P_{k,q} \le P_{\max } \, \\ \end{aligned}$$

where \({\text{MI}}_{0}\) is the predetermined threshold of MI when each target is effectively tracked, \(P_{{{\text{total}}}}\) denotes the predefined total power budget, and the transient power bound of \([P_{\min } ,P_{\max } ]\) is set to keep the transmitter stay in an endurable interval.

4.2 Two-step optimization algorithm based on BCR

Since the existence of the binary variable \(u_{k,q}\), (30) is a non-convex optimization problem. In order to solve (20), we propose a two-step optimization algorithm based on BCR.

Step 1: Determine the targets in order of BCR when meet the ETT condition. Firstly, calculating the threshold power \(P_{k,q}^{\min }\), which satisfies that \({\text{MI}}_{k,q} (P_{k,q}^{\min } ) = {\text{MI}}_{0}\). Then, by introducing the idea of BCR [23], the ratio of virtual power and the relative MI of each target is sorted in descending order. Thus, we have

$${\mathbf{IX}}_{k} = {\text{sort}}\left( {E_{k,1} ,E_{k,2} , \ldots ,E_{k,Q} } \right)$$

where \({\mathbf{IX}}_{k}\) is the permutation vector of the BCR in terms of power from all the moving targets. Herein, \(E_{k,q}\) denotes the BCR in power, which is expressed as

$$E_{k,q} = \frac{{{\text{MI}}_{k,q} \left( {P_{k,q}^{\min } + P_{{{\text{set}}}} } \right)}}{{P_{k,q}^{\min } + P_{{{\text{set}}}} }}$$

where \(P_{set}\) is a preset constant. Finally, calculating the maximum \(N_{k}\), which satisfies that

$$\sum\limits_{{n = {\mathbf{cel}}_{k} \left( 1 \right)}}^{{{\mathbf{cel}}_{k} \left( {N_{k} } \right)}} {P_{k,n}^{\min } \le P_{{{\text{total}}}} } , \, \sum\limits_{{n = {\mathbf{cel}}_{k} \left( 1 \right)}}^{{{\mathbf{cel}}_{k} \left( {N_{k} + 1} \right)}} {P_{k,n}^{\min } > P_{{{\text{total}}}} }$$

where \({\mathbf{cel}}_{k}\) records the indices of the targets in \({\mathbf{IX}}_{k}\).

Step 2: Maximize the MI for the given ETT information. After obtaining the quantity \(N_{k}\) and the indices of all the effective tracking targets \({\mathbf{cel}}_{k}\), combining with the threshold power \(P_{k,q}^{\min }\), (30) can be converted into

$$\begin{aligned} & {\text{max }}\left\{ {\sum\limits_{{q = {\mathbf{cel}}_{k} \left( 1 \right)}}^{{{\mathbf{cel}}_{k} \left( {N_{k} } \right)}} {\left[ {\left. {MI_{k,q} \left( {P_{k,q} } \right)} \right|_{{{\tilde{\mathbf{x}}}_{\left. k \right|k - 1,q} }} } \right]} } \right\} \\ & \quad {\text{s.t.}} P_{k,q} = P_{k,q}^{\min } + P_{k,q}^{{{\text{reset}}}} \\ & \quad \,\,\quad \sum\limits_{{q = {\mathbf{cel}}_{k} \left( 1 \right)}}^{{{\mathbf{cel}}_{k} \left( {N_{k} } \right)}} {P_{k,q} = P_{{{\text{total}}}} \, } , \, P_{\min } \le P_{k,q} \le P_{\max } \\ & \quad \,\,\quad \sum\limits_{{{\mathbf{cel}}_{k} \left( 1 \right)}}^{{{\mathbf{cel}}_{k} \left( {N_{k} } \right)}} {P_{k,q}^{{{\text{reset}}}} = P_{{{\text{total}}}} - \sum\limits_{{{\mathbf{cel}}_{k} \left( 1 \right)}}^{{{\mathbf{cel}}_{k} \left( {N_{k} } \right)}} {P_{k,q}^{\min } } \, } \\ & \quad \,\,\quad 0 \le P_{k,q}^{{{\text{reset}}}} \le P_{{{\text{total}}}} - \sum\limits_{{{\mathbf{cel}}_{k} \left( 1 \right)}}^{{{\mathbf{cel}}_{k} \left( {N_{k} } \right)}} {P_{k,q}^{\min } } \, \\ \end{aligned}$$

where \(P_{k,q}^{{{\text{reset}}}}\) denotes the allocation of the remaining transmit power to the qth target at sample interval k. Technically, (34) can be easily solved by the particle swarm optimization (PSO) [30], and the final solution set is the suboptimal power set that can satisfy the robust tracking of each target.

5 Experiments and analysis

5.1 Parameter settings

In this section, numerous results are presented to demonstrate the effectiveness of the proposed RPA strategy. Consider that a C-MIMO radar is located at the origin point, and \(Q = 8\) targets follow the CV model and are widely separated, whose initial motion parameters are shown in Table 1. Moreover, the joint configuration of the C-MIMO radar and the tracked targets is demonstrated in Fig. 2. The number of coherent accumulation pulses in one illumination is set as \(L = 200\), and the pulse repetition period is \(T_{{\text{p}}} = 1\,{\text{ms}}\). The lower and upper bounds of power constraints for the tracked targets are \(P_{\min } = 0.05P_{{{\text{total}}}}\) and \(P_{\max } = 0.8P_{{{\text{total}}}}\), respectively. In addition, the targets not assigned for tracking are monitored by radar, with allocated power \(P_{{{\text{mon}}}} = 0.01P_{{{\text{total}}}}\). The preset power compensation is \(P_{\min } = 0.1P_{{{\text{total}}}}\), and the threshold of MI is set as \({\text{MI}}_{0} = 0.6\,{\text{nats}}\). It is assumed that the RCS of all targets is the Swerling I model [31] with mean being 1, and remains constant over the measurement interval. Suppose that all targets carry with self-defense jammers and transmit oppressive signals to the C-MIMO radar. The particle population is set as \(N_{{{\text{pop}}}} = 100\), the number of iterations is \(T_{\max } = 200\), and the upper and lower bounds of the inertial weights are \(w_{\max } = 0.9\) and \(w_{\min } = 0.2\), respectively. A sequence of 20 frames with sample interval \(T_{{\text{s}}} = 1\,{\text{s}}\) are utilized in each Monte Carlo trial, while the number of trials is set as \(N_{{{\text{sim}}}} = 1{00}\). To evaluate the tracking performance, we define the sum of RMSEs of the tracked targets as

$${\text{R}}_{{{\text{MSE}},k}} = \sum\limits_{{q = {\mathbf{cel}}_{k} \left( 1 \right)}}^{{{\mathbf{cel}}_{k} \left( {N_{k} } \right)}} {\sqrt {\frac{1}{{N_{{{\text{sim}}}} }}\sum\limits_{j = 1}^{{N_{{{\text{sim}}}} }} {{\text{tr}}\left[ {{{\varvec{\Lambda}}}\left( {{\mathbf{x}}_{k,q} -{\hat{\mathbf{x}}}_{k,q}^{j} } \right)\left( {{\mathbf{x}}_{k,q} -{\hat{\mathbf{x}}}_{k,q}^{j} } \right)^{{\text{T}}} {{\varvec{\Lambda}}}} \right]} } }$$

where \({\hat{\mathbf{x}}}_{k,q}^{j}\) denotes the results of target parameters estimation in the jth trail. The matrix \({{\varvec{\Lambda}}} = {\mathbf{I}}_{2} \otimes {\text{diag}}(1,0)\).

Table 1 Target moving parameters
Fig. 2
figure 2

Joint configuration of the C-MIMO radar and the tracked targets

5.2 Results and discussions

  1. 1)

    Scenario 1: Effect of Distance

In this scenario, the interference signal intensity of each jammer is assumed to be constant and the corresponding jam-signal ratio (JSR) is set as 12 dB. Hence, since the targets SNRs are only related to the radar-target distances and the transmit power results [32, 33], the distance factor becomes the major contributor in the PA problem.

In order to demonstrate the effectiveness of the proposed algorithm, two benchmarks are used as comparison: 1) Uniform allocation; 2) Sum-max optimize allocation. In the uniform allocation scheme, all power resources are evenly distributed to \(Q = 8\) targets. As for the sum-max optimize allocation scheme, the optimization objective is to maximize the sum of MIs of all targets, thus its optimization model can be expressed as

$$\begin{aligned} & {\text{max }}\left\{ {\sum\limits_{q = 1}^{Q} {\left. {MI_{k,q} \left( {P_{k,q} } \right)} \right|_{{{\tilde{\mathbf{x}}}_{\left. k \right|k - 1,q} }} } } \right\} \\ & \quad {\text{s.t.}} \sum\limits_{q = 1}^{Q} {P_{k,q} = P_{{{\text{total}}}} \, } \\ & \quad \quad \quad P_{\min } \le P_{k,q} \le P_{\max } \, \\ \end{aligned}$$

Moreover, (36) is solved by the PSO algorithm. Figure 3 demonstrates the effective tracking quantity comparison among the three resource allocation strategy. In general, the proposed RPA strategy performs best among all the adopted methods obviously. In addition, although the effective tracking quantity obtained by the sum-max optimize allocation scheme is less than that obtained by the average allocation method in the initial period of time, this situation changes with the improvement of the overall target tracking accuracy.

Fig. 3
figure 3

Performance comparison in terms of the effective tracking quantity in scenario 1

Figure 4 further demonstrates the average PA results obtained by the proposed RPA strategy in scenario 1. Herein, the grid colors denote the ratio of allocated power towards different moving targets, which is given by

$$r_{{{\text{ave }}k,q}}^{{{\text{power}}}} = \frac{1}{{N_{{{\text{sim}}}} }}\sum\limits_{j = 1}^{{N_{{{\text{sim}}}} }} {\frac{{P_{k,q}^{j} }}{{P_{{{\text{total}}}} }}}$$

where \(P_{k,q}^{j}\) denotes the PA results in the jth trail. Moreover, the pinky white color represents that the ratio is zero, which means the corresponding target will not be tracked at this tracking interval. It should be noted that the target far from the radar is easily unable to meet the requirements of effective tracking under the given power resource budget due to the larger tracking error, so that the corresponding allocated power is less. Moreover, when the tracking error meets the effective tracking condition, the farther target from the radar is tend to allocated more power resource.

Fig. 4
figure 4

Average PA results obtained by the proposed RPA strategy in scenario 1

Figure 5 shows the targets that meet the effective tracking condition at each frame for the three PA schemes. Obviously, target 4 and target 5 can be tracked more easily by radar due to their closer proximity. In addition, the proposed RPA strategy shows good robustness as the tracking time increases.

  1. 2)

    Scenario 2: Effect of Interference Intensity

Fig. 5
figure 5

Indices of the effective tracking targets. a Uniform allocation. b Sum-max optimize allocation. c Proposed RPA strategy

In this senario, we will further study the influence of interference intensity on PA results. Therefore, we consider a time-varying JSR model. In the model, the JSR levels of target target q (q = 2, 3, 4, 5) are remain at 12 dB, which is consistent with that of scenario 1. In addition, it is assumed that the JSR levels of the rest of moving targets are time-varying, as shown in Fig. 6. As such, in addition to the distance factor, the interference intensity factor is also added to affect PA results.

Fig. 6
figure 6

Time-varying interference intensity model

The effective tracking quantity performances among the three PA schemes in scenario 2 are compared and demonstrated in Fig. 7. Obviously, it can be seen from Fig. 7 that the proposed RPA strategy still performs best among the three algorithms. Due to the stronger inteference intensity, the effective tracking quantity is smaller in scenario 2 than in scenario 1. Moreover, in the long run, the sum-max optimize allocation strategy performs better than the uniform allocation method in increasing tracking performance in scenario 2.

Fig. 7
figure 7

Performance comparison in terms of the effective tracking quantity in scenario 2

Figures 8 and 9 show the average PA results obtained by the proposed RPA strategy and the indices of effective tracking target for the three PA strategies, respectively. Compared with scenario 1, it can be noted that less power resources are allocated to targets (target 1 and target 8) with stronger interference intensity. In addition, due to the dual influence of longer radial distance and higher interference intensity, target 1 is abandoned due to the limitation of system power budget among the tracking process.

Fig. 8
figure 8

Average PA results obtained by the proposed RPA strategy in scenario 2

Fig. 9
figure 9

Indices of the effective tracking targets. a Uniform allocation. b Sum-max optimize allocation. c Proposed RPA strategy

6 Conclusions

In this paper, we proposed an RPA strategy in the C-MIMO radar system for MTG application under blanket jamming environment. Based on deriving and calculating the predicted MI and then setting the predetermined threshold of the MI, we formulated the RPA strategy as a non-convex optimization problem. In order to tackle the difficulty in solving this problem, a two-step optimization algorithm based on the BCR is utilized in the solving process. Numerical results showed that the distance from target to radar and the interference intensity have impact on the PA results. Additionally, in the proposed RPA strategy, those targets with low tracking accuracy are tend to be abandoned in exchange for higher tracking accuracy of other targets.

Availability of data and materials

Unfortunately, the data are not avaliable online. Kindly, for data requests, please contact the corresponding author.



Collocated multiple-input and multiple-output


Robust power allocation


Multiple target guidance


Effective tracking target


Mutual information


Benefit–cost ratio


Simultaneous multi-beam


Power allocation


Multiple target tracking


Posterior Cramer–Rao lower bound


Radar cross-section


Signal–noise ratio


Constant velocity


Probability density function


Maximum likelihood


Focused transmit focused receive


Parallel square-root cubature Kalman filter


Particle swarm optimization


  1. H. Liu S. Zhou, H. Zang, Y. Cao, Two waveform design criteria for colocated MIMO radar. In 2014 International Radar Conference (2014), p. 1–5

  2. E. Fishler, A.M. Haimovich, R.S. Blum et al., Spatial diversity in radars-models and detection performance. IEEE Trans. Signal Process. 54(3), 823–838 (2006)

    Article  Google Scholar 

  3. A. Gorji, R. Tharmarasa, W. Blair et al., Multiple unresolved target localization and tracking using collocated MIMO radars. IEEE Trans. Aerosp. Electron. Syst. 48(3), 2498–2517 (2012)

    Article  Google Scholar 

  4. J. Li, P. Stoica, MIMO radar with colocated antennas. IEEE Signal Process. Mag. 24(5), 106–114 (2007)

    Article  Google Scholar 

  5. H. Zhang, J. Shi, Q. Zhang et al., Antenna selection for target tracking in collocated MIMO radar. IEEE Trans. Aerosp. Electron. Syst. 57(1), 423–436 (2021)

    Article  Google Scholar 

  6. J. Yan, W. Pu, S.H. Zhou et al., Collobaborative detection and power allocation framework for target tracking in multiple radar system. Inf. Fusion 55, 173–183 (2020)

    Article  Google Scholar 

  7. J. Yan, H. Liu, B. Jiu, B. Chen, Z. Liu, Z. Bao, Simultaneous multibeam resource allocation scheme for multiple target tracking. IEEE Trans. Signal Process. 63(12), 3110–3112 (2015)

    Article  MathSciNet  Google Scholar 

  8. Z. Li, J. Xie, H. Zhang, Joint power and time width allocation in collocated MIMO radar for multi-target tracking. IET Radar Sonar Navig. 14(5), 686–693 (2020)

    Article  Google Scholar 

  9. J. Yan, H. Liu, W. Pu et al., Joint beam selection and power allocation for multiple target tracking in netted colocated MIMO radar systems. IEEE Trans. Signal Process. 64(24), 6417–6427 (2016)

    Article  MathSciNet  Google Scholar 

  10. W. Yi, Y. Yuan, R. Hoseinnezhad, L. Kong, Resource scheduling for distributed multi-target tracking in netted colocated MIMO radar systems. IEEE Trans. Signal Process. 68, 1602–1617 (2020)

    Article  MathSciNet  Google Scholar 

  11. J. Yan, W. Pu, S. Zhou, H. Liu, Z. Bao, Collaborative detection and power allocation framework for target tracking in multiple radar system. Inf. Fusion 55, 173–183 (2020)

    Article  Google Scholar 

  12. J. Yan, P. Zhang, J. Dai, H. Liu, Target capacity based simultaneous multibeam power allocation scheme for multiple target tracking application. Signal Process. 178, 107794–110778 (2021)

    Article  Google Scholar 

  13. Y. Yuan, W. Yi, R. Hoseinnezhad et al., Robust power allocation for resource-aware multi-target tracking with colocated MIMO radars. IEEE Trans. Signal Process. 68, 1602–1617 (2020)

    Article  MathSciNet  Google Scholar 

  14. C. Shi et al., Joint subcarrier assignment and power allocation strategy for integrated radar and communications systems based on power minimization. IEEE Sens. J. 19, 11167–11179 (2019)

    Article  Google Scholar 

  15. Z. Liu, X. Fang, Q. Fu, An analysis on error sources of guidance radar detection performance. In Proceedings of 2011 IEEE CIE International Conference on Radar (2011) , p. 1796–1800

  16. Y. Yuan, W. Yi, P. Varshney, Exponential mixture density based approximation to posterior Cramer–Rao lower bound for distributed target tracking. IEEE Trans. Signal Process. 70, 862–877 (2022)

    Article  MathSciNet  Google Scholar 

  17. J. Yan, W. Pu, S. Zhou, H. Liu, M. Greco, Optimal resource allocation for asynchronous multiple targets tracking in heterogeneous radar networks. IEEE Trans. Signal Process. 68, 4055–4068 (2020)

    Article  Google Scholar 

  18. H. Zhang, W. Liu, J. Shi et al., Joint detection threshold optimization and illumination time allocation strategy for cognitive tracking in a networked radar systems. IEEE Trans. Signal Process. to be published.

  19. M.R. Bell, Information theory and radar waveform design. IEEE Trans. Inf. Theory 39(5), 1578–1597 (1993)

    Article  Google Scholar 

  20. X. Song, P. Willett, S. Zhou, P.B. Luh, The MIMO radar and jammer games. IEEE Trans. Signal Process. 60(2), 687–699 (2012)

    Article  MathSciNet  Google Scholar 

  21. X. Lan, W. Li, X. Wang et al., MIMO radar and target Stacklberg game in the presence of clutter. IEEE Sens. J. 15(12), 6912–6920 (2015)

    Article  Google Scholar 

  22. L. Wang, L. Wang, Y. Zeng, M. Wang, Jamming power allocation strategy for MIMO radar based on MMSE and mutual information. IET Radar Sonar Navig. 11(7), 1081–1089 (2017)

    Article  Google Scholar 

  23. E.A. Frej, P. Ekel, A.T. Almeida, A benefit-to-cost ratio based approach for portfolio selection under multiple criteria with incomplete preference information. Inf. Sci. 545(4), 487–498 (2021)

    Article  MathSciNet  Google Scholar 

  24. S. Qiu, W. Sheng et al., A maximum likelihood method for joint DOA and polarization estimation based on manifold separation. IEEE Trans. Aerosp. Electron. Syst. 57(4), 2481–2500 (2021)

    Article  Google Scholar 

  25. H. Zhang, J. Xie, J. Shi et al., Joint beam and waveform selection for the MIMO radar target tracking. Signal Process. 156, 31–40 (2019)

    Article  Google Scholar 

  26. B. Tang, J. Li, Spectually constrained MIMO radar waveform design based on mutual information. IEEE Trans. Signal Process. 67(3), 821–834 (2019)

    Article  MathSciNet  Google Scholar 

  27. S. Sen, A. Nehorai, OFDM MIMO radar with mutual-information waveform design for low-grazing angle tracking. IEEE Trans. Signal Process. 58(6), 3152–3162 (2010)

    Article  MathSciNet  Google Scholar 

  28. C. Shi, Y. Wang, F. Wang et al., Joint optimization scheme for subcarrier selection and power allocation in multicarrier dual-function radar-communication system. IEEE Syst. J. 15(1), 947–958 (2021)

    Article  Google Scholar 

  29. L. Wang, D. Wang, H. Zeng et al., Jamming power allocation strategy for MIMO radar based on MMSE and mutual information. IET Radar Sonar Navig. 11(7), 1081–1089 (2017)

    Article  Google Scholar 

  30. S.M. Mikki, A.A. Kishk, Particle Swarm Optimization: A Physics-Based Approach (Morgan & Claypool, 2008)

    Google Scholar 

  31. J. Whitrow, Rapid evaluation of airborne intercept radar detection for Swerling I targets. IEEE Trans. Aerosp. Electron. Syst. 46(2), 905–910 (2010)

    Article  Google Scholar 

  32. J. Yan, J. Dai, W. Pu, H. Liu et al., Target capacity based resource optimization for multiple target tracking in radar networks. IEEE Trans. Signal Process. 69, 2410–2421 (2021)

    Article  MathSciNet  Google Scholar 

  33. J. Yan, J. Dai, W. Pu, S. Zhou et al., Quality of service constrained resource scheme for multiple target tracking in radar sensor network. IEEE Syst. J. 15(1), 771–779 (2021)

    Article  Google Scholar 

Download references


The authors would like to express their sincere thanks to the editors and anonymous reviewers.


This work was supported by National Nautural Science Foundation of China under Grant 62001506.

Author information

Authors and Affiliations



HZ, JX and ZL conceived and designed the expreriments; ZL performed the experiments; ZL, JG and CQ analyzed the data; ZL wrote the paper; JX administrated the project. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Haowei Zhang.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication


Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, Z., Xie, J., Zhang, H. et al. A robust power allocation strategy based on benefit–cost ratio for multiple target guidance in the C-MIMO radar system under blanket jamming. EURASIP J. Adv. Signal Process. 2022, 48 (2022).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: