Research  Open  Published:
Blind and semiblind ML detection for spacetime blockcoded OFDM wireless systems
EURASIP Journal on Advances in Signal Processingvolume 2014, Article number: 131 (2014)
Abstract
This paper investigates the joint maximum likelihood (ML) data detection and channel estimation problem for Alamouti spacetime blockcoded (STBC) orthogonal frequencydivision multiplexing (OFDM) wireless systems. The joint ML estimation and data detection is generally considered a hard combinatorial optimization problem. We propose an efficient lowcomplexity algorithm based on branchestimatebound strategy that renders exact joint ML solution. However, the computational complexity of blind algorithm becomes critical at low signaltonoise ratio (SNR) as the number of OFDM carriers and constellation size are increased especially in multipleantenna systems. To overcome this problem, a semiblind algorithm based on a new framework for reducing the complexity is proposed by relying on subcarrier reordering and decoding the carriers with different levels of confidence using a suitable reliability criterion. In addition, it is shown that by utilizing the inherent structure of Alamouti coding, the estimation performance improvement or the complexity reduction can be achieved. The proposed algorithms can reliably track the wireless Rayleigh fading channel without requiring any channel statistics. Simulation results presented against the perfect coherent detection demonstrate the effectiveness of blind and semiblind algorithms over frequencyselective channels with different fading characteristics.
1 Introduction
The increasing demand for higher data rates in recent years has called for transmissions over a broadband wireless channel which is frequency selective. The wireless channel is thus prone to intersymbol interference (ISI) which severely degrades the system performance and requires complex equalization techniques at the receiver. The orthogonal frequencydivision multiplexing (OFDM) has emerged as the most promising scheme to combat ISI and improve system performance. OFDM essentially transforms a broadband channel into a number of parallel narrowband channels using a cyclic prefix (CP) of appropriate length and renders simple onetap channel equalizer for each OFDM subcarrier [1, 2]. Due to these crucial advantages, OFDM is not only being used in many existing standards such as digital subscriber line (DSL), WLAN standards (IEEE 802.11 a/b/g), LAN standard (IEEE 802.11n) [3], MAN standards (IEEE 802.16e), digital audio broadcast (DAB), and digital video broadcast (DVB) [4] but also adopted for future wireless standards such as LTE and 4G+ [5]. Besides OFDM, the spatial dimensions in wireless communications are often exploited to further enhance the system capacity and /or improve the transmission reliability by employing multiple antennas at the transmitter and/or receiver. This offers many advantages over singleantenna systems including multiplexing gain and diversity gain [6]. Of several diversity schemes available in the literature, the Alamouti scheme [7] with two transmit and one receive antenna is the optimum in both the capacity and the diversity. Alamouti coding achieves full spatial diversity at full transmission rate for any signal (real or complex) constellation and offers very simple receiver structures. However, to decouple the signals at the receiver side via simple decoding, the Alamouti scheme requires the channel between each transmitreceive antenna to be constant over two consecutive OFDM symbols. Moreover, when dealing with frequencyselective channel, the Alamouti scheme has to be implemented over the block level.
In many wireless communications studies, it is often assumed that channel state information (CSI) is available at the receiver side for coherent data detection. This assumption is certainly not realistic. The current standards use pilot symbols to estimate the channel, thus sacrificing bandwidth which otherwise would have been available for data transmission. In highmobility wireless systems, the channels may even change so rapidly that this approach will become infeasible. Blind or semiblind detection over timevarying wireless channels has shown to enhance the system performance considerably [8, 9].
There are numerous blind estimation and equalization techniques available in the literature, namely, subspacebased methods [10, 11], secondorder statistics [12], Cholesky factorization [13], and iterative methods [14]. These methods either suffer from slow convergence, higher computational costs or assume channel to be stationary over several OFDM symbols. These drawbacks make maximum likelihood (ML)based approaches, e.g., [15, 16] more attractive due to their fast convergence despite having the higher computational cost. Usually, suboptimal techniques are employed to reduce the computational cost by restricting the search space of exhaustive ML search. Some of the suboptimal techniques are applicable to specific constant modulus constellations [17, 18]. Recently, in [19] and [20], the authors have proposed a lowcomplexity blind ML method for general constellations for singleinputmultipleoutput (SIMO) and singleinputsingleoutput (SISO) systems, respectively, which form the basis of our paper.
1.1 Contributions
In this paper, we extend the previous algorithm developed for SISO OFDM systems [20] to multipleinputsingleoutput (MISO) OFDM systems with two transmit antennas and employing Alamouti coding. Parallelizing the results and discussions therein, we first derive the exact blind ML algorithm based on branchestimateandbound strategy and then reduce its complexity using different methods. Then, a semiblind version is proposed by assuming that few training symbols are available. The proposed algorithms offers low complexity, fast convergence, works for signals drawn from general modulation constellations, and do not require any channel statistics. The contributions are listed as:

1.
Extension of blind algorithm for SISO to MISO OFDM systems and derivation of bound used to abort a certain search path in the blind search algorithm. The bound is calculated in a recursive manner offering low computations.

2.
Using semiorthogonality of the subcarriers by exploiting the structure of fast Fourier transform (FFT) matrix in order to reduce the computations for calculation of bound during the blind search.

3.
Reducing the complexity at low signaltonoise ratio (SNR) regime by employing the reliable carriers to accelerate the convergence of blind algorithm by reordering the subcarriers according to their reliability. Calculation of reliability, however, requires initial tentative channel estimates; therefore, a semiblind mode is adopted.

4.
Reducing the complexity and or improving the estimation performance of proposed algorithm by exploiting the orthogonal structure of Alamouti coding.
The framework of semiblind algorithm is based on reliably decoding the carriers by computing the vectorwise likelihood ratio first suggested in [21]. By supplying the algorithm with reordered carriers according to their reliability index, the backtracking is minimized which is the major source of complexity especially at low SNR. At higher SNR regime, it has been shown that the probability of backtracking is almost zero [20], while the number of operations is of the order of constant times the total number of OFDM subcarriers.
1.2 Organization of the paper
Section 2 describes the Alamouti coded OFDM system model with frequencyselective timevariant channels, and Section 3 describes the proposed blind ML algorithm. In Section 4, a low complexity variant of blind ML algorithm is derived by exploiting the structure of the FFT matrix. To further reduce the complexity of the algorithm, we propose a semiblind algorithm in Section 5, and in Section 6, we show how the structure of the Alamouti coding can be utilized to favour certain advantages. Simulation results are detailed in Section 7, and we conclude in Section 8.
1.3 Notation
We use lower case letters x to denote scalars, lower case boldface x to denote (column) vectors and x(i) to denote individual entries of a vector. Matrices are denoted by upper boldface letters X, whereas the calligraphic notations$\mathcal{X}$ is reserved for vectors in frequency domain. We also use x_{(i)} to represent a partial vector consisting of first i elements of x. (.)^{T}, (.)^{∗} and (.)^{H} represent transpose, conjugate and conjugate transpose (hermitian) operations, respectively. $\u3008\widehat{\mathcal{X}}(k)\u3009$ will denote the hard decoding decision that maps $\widehat{\mathcal{X}}(k)$ to $\mathcal{X}(k)$. The FFT and inverse FFT (IFFT) matrices are denoted by Q and Q^{H} respectively, where we define Q as q_{l,k}=e^{−j 2πlk/N} with l,k=0, 1,2,⋯,N−1. The notation $\parallel \mathbf{a}{\parallel}_{\mathbf{B}}^{2}$ represents weighted norm defined as $\parallel \mathbf{a}{\parallel}_{\mathbf{B}}^{2}={\mathbf{a}}^{H}\mathbf{B}\mathbf{a}$.
2 System model
Consider a singleuser OFDM system with two transmit and one receive antenna as shown in Figure 1. The frequencyselective channels from two transmit antennas to the receive antenna are modelled as finite impulse response (FIR) filters. We assume that both channels are independent Rayleighfading channels having maximum length L and that OFDM CP length is at least L−1 to avoid ISI.
Let$\mathcal{X}$ represent information symbols and that OFDM system has N subcarriers so that after IFFT operation the OFDM symbol can be written as
where Q is FFT matrix with [ Q]_{l,k}=e^{−j 2πlk/N}. Let the n th symbol of k th transmitted block from antenna i (= 1 or 2) be denoted by ${x}_{i}^{(k)}(n)$, with n=0,1,⋯,N−1. At times k=0,2,4,⋯ pair of blocks ${x}_{1}^{(k)}(n)$ and ${x}_{2}^{(k)}(n)$ are generated according to the following Alamouti coding rule [7, 22]:
where, (.)_{ N } is the modulo N operation. Each antenna transmits a data block of length N according to spacetime blockcoded (STBC) scheme after appending the CP. Adding CP eliminates interblock interference and converts linear convolution into circular convolution. In the presence of additive white Gaussian noise (AWGN), the received data blocks over two consecutive time instants after discarding the CPs can be written as
where ρ is the SNR, H_{1} and H_{2} are circular channel matrices from transmit antenna1 and transmit antenna2 to receive antenna, respectively, and n is circular symmetric AWGN with pdf: n∼C N(0,I). In (3), we also assumed that channel is static over two consecutive OFDM blocks at time instants k and k+1. Specifically, the structure of two circular channel matrices is
and where
represents the impulse response sequence of the i th channel matrix. At the receiver side, the frequency domain received symbols after FFT operations are obtained as
where ${\mathcal{X}}_{i}^{(j)}=\frac{1}{\sqrt{N}}{\mathbf{Qx}}_{i}^{(j)}$ and Λ_{ i }=Q H_{ i }Q^{H} are diagonal matrices whose entries are Npoint discrete Fourier transform (DFT) of h_{ i } after zero padding and ${\mathcal{N}}^{(j)}=\frac{1}{\sqrt{N}}{\mathbf{Qn}}^{(j)}$. Expanding (5) and using DFT properties, we get:
By stacking the received data symbols over consecutive intervals in one column and so as the DFT channel coefficients, (6) can be written in matrixvector notation as
where ${\mathcal{\mathscr{H}}}_{i}=\text{diag}({\mathbf{\Lambda}}_{i})=\mathbf{Q}\left[\begin{array}{l}{\mathbf{h}}_{i}\\ \mathbf{0}\end{array}\right]$. Let A^{H} consists of first L columns of Q, then
This allows us to rewrite (7) as
or even more compactly as
where$\mathcal{Y}$ and$\mathcal{N}$ are 2N×1 observed data and noise vectors, respectively, X_{ a } is a 2N×2L data matrix, which we shall refer to as Alamouti matrix, and h is a 2L×1 composite channel vector. This model can be easily transformed to SISOOFDM system of [20] by replacing X_{ a } with N×N square matrix $\text{diag}(\mathcal{X})$ containing N data symbols on its diagonal. Specifically, the SISO model corresponding to (10) is given by
where$\mathcal{Y}$ and$\mathcal{N}$ are Ndimensional received OFDM symbol and noise vector, respectively, while h is the length L SISO channel vector. In either case, the task of receiver is to jointly estimate the channel h and the data vector$\mathcal{X}$ given only the received data symbol$\mathcal{Y}$.
3 Joint ML/MAP solution
Considering the data model in (10), the joint ML channel estimation and data detection problem reduces to minimizing the following objective function:
where Ω^{2N} denotes all possible 2Ndimensional signal vectors. As seen from (12), the joint ML problem is a combinatorial problem involving Ω^{2N} hypothesis tests, and it is almost impossible to solve it exactly for sufficiently large Ω and N.
To solve it efficiently, we propose the following strategy. We start by decomposing the original cost function as
and define
as the partial joint ML metric up to the index i for$\mathcal{X}$, where ${\mathbf{X}}_{a(i)}=\left[\begin{array}{ll}\text{diag}\left({\mathcal{X}}_{1(i)}^{(k)}\right){\mathbf{A}}_{(i)}^{H}& \text{diag}\left({\mathcal{X}}_{2(i)}^{(k)}\right){\mathbf{A}}_{(i)}^{H}\\ \text{diag}\left({\mathcal{X}}_{2(i)}^{\ast (k)}\right){\mathbf{A}}_{(i)}^{H}& \text{diag}\left({\mathcal{X}}_{1(i)}^{\ast (k)}\right){\mathbf{A}}_{(i)}^{H}\end{array}\right]\phantom{\rule{0.3em}{0ex}}$ is a partial Alamouti matrix of dimension 2i×2L,X_{ a }(j), the 2×2L matrix is the same as X_{a(j)} with all ${\mathcal{X}}_{(j)}$ replaced by $\mathcal{X}(j)$, ${\mathcal{Y}}_{(i)}={\left[{\left[{\mathcal{Y}}_{(i)}^{(k)}\right]}^{T}{\left[{\mathcal{Y}}_{(i)}^{(k+1)}\right]}^{T}\right]}^{T}$ is the partial data vector of dimension 2i×1 and the partial matrix ${\mathbf{A}}_{(i)}^{H}$ consists of first i rows of A^{H}. It should be noted that partial Alamouti matrix X_{a(i)} is the function of the first i data points, while X_{ a }(i) is a function of the i th data point. Obviously, the solution that minimizes this partial joint ML metric is not the globally optimal. But we have the following lemma^{a}:
Lemma 1.
Let R represent the optimal value of the objective function in (12). If ${M}_{{\mathcal{X}}_{(i)}}>R$, then ${\mathcal{X}}_{(i)}$ cannot be the ML solution ${\widehat{\mathcal{X}}}_{(i)}^{\text{ML}}$ of (12). In other words, for any estimate ${\widehat{\mathcal{X}}}_{(i)}$ to correspond to the ML solution, we should have ${M}_{{\mathcal{X}}_{(i)}}<R$.
From Lemma 1, if the optimal value R of objective function (12) can be estimated, then we can adopt the following tree search procedure for joint estimation and detection: At each subcarrier i, make a guess of new value of $\mathcal{X}(i)={\left[\begin{array}{ll}{\mathcal{X}}_{1}(i)& {\mathcal{X}}_{2}(i)\end{array}\right]}^{T}$and use that along with previous estimates to construct ${\widehat{\mathcal{X}}}_{(i)}$ and ${\widehat{\mathbf{X}}}_{a(i)}$. Then, estimate h to minimize the associated cost function:
and calculate the resulting metric ${M}_{{\widehat{\mathcal{X}}}_{(i)}}$. If ${M}_{{\widehat{\mathcal{X}}}_{(i)}}<R$, then proceed to the next subcarrier i+1; otherwise, backtrack and change the guess of $\mathcal{X}(j)$ for some j≤i. We call this approach as the branchestimateandbound strategy, which reduces the search space of exhaustive ML search to those (partial) sequences that satisfy the given constraint ${M}_{{\widehat{\mathcal{X}}}_{(i)}}<R$. This approach however does not work for i≤L as X_{a(i)} will be full rank for any choice of ${\mathcal{X}}_{(i)}$, and therefore, h with 2L degrees of freedom can always be chosen by least squares (LS) to yield the trivial (zero) value for ${M}_{{\widehat{\mathcal{X}}}_{(i)}}$. To obtain a nontrivial value of ${M}_{{\widehat{\mathcal{X}}}_{(i)}}$, we have to use L pilots, but it would defeat our original motive of blind estimation. To overcome this problem, we adopt weighted regularized LS and instead of minimizing the ML objective function, J_{ML}, we minimize the maximum a posteriori (MAP) objective function:
where R_{ h } is the block diagonal autocorrelation matrix of the composite channel vector h, i.e., R_{ h }=E{h h^{H}}. The objective function in (16) can also be decomposed as
So, if we have a guess of ${\widehat{\mathcal{X}}}_{(i1)}$, the partial metric for$\mathcal{X}$ up to index i−1 can be written as
whose optimum value $\widehat{\mathbf{h}}$ and the cost can be computed [23].
3.1 Recursive derivation of bound
For our blind search strategy, the calculation of the metric or bound ${M}_{{\widehat{\mathcal{X}}}_{(i)}}$ is needed at each tree node for comparison with the optimal value of objective function, R. This bound can be derived recursively by simply expressing ${M}_{{\widehat{\mathcal{X}}}_{(i)}}$ in terms of new observation and an additional regressor ${\widehat{\mathbf{X}}}_{a}(i)$ as follows:
By invoking the block version of recursive least squares (RLS) algorithm to the cost function in (19) with the data vector of size 2×1 and the regressor matrix of dimension 2×2L, we get [23]
where
The RLS recursions are initialized by
Before introducing our algorithm, we first number the Ω^{2} combinations of the constellation points from two antennas by 1,2,…,Ω^{2} and treat them as a big constellation set Ψ, where the k th(1≤k≤Ω^{2}) vector constellation point is denoted by Ψ(k). We then perform the depthfirst search of signal tree for joint ML solution.
The algorithm essentially reduces the search space of exhaustive ML search by performing a trimmed search over the signal tree of N layers, where each tree node at the i th layer corresponds to a specific partial sequence ${\mathcal{X}}_{(i)}$ and each tree node at the intermediate layer has Ω^{2} offsprings to the next layer.
The parameter ρ can be easily determined by estimating the noise variance, whereas for R_{ h }, our simulation results indicate that we can replace it with an identity matrix with almost no effect on the performance via carrier reordering (see next section). To obtain the initial guess of search radius, we can use the strategy described in [20] to determine r that would guarantee a MAP solution with very high probability. Nevertheless, the algorithm itself takes care of the value of r, in that if it is too small such that the algorithm is not able to backtrack, then it doubles the value of r; if it is too large such that the algorithm reaches the last subcarrier too quickly, then it reduces r to the most recent value of objective function (see steps 4 and 6). Therefore, any choice of r would guarantee the MAP solution.
The complexity of the algorithm is mainly attributed to the calculation of the bound ${M}_{{\widehat{\mathcal{X}}}_{(i)}}$ (step 2) and the backtracking (step 3). The rest of the steps are simple additions and subtractions. From the RLS recursions, it can be seen that the calculation of the bound depends heavily on computation of 2 L×2L matrix P_{ i } in (25). In Section 4, we shall see how computation of P_{ i } can be avoided by exploiting the structure of the FFT matrix, while in Section 5 we shall deal with the issue of backtracking.
4 Lowcomplexity blind algorithm
As mentioned earlier, the complexity of computing the metric ${M}_{{\widehat{\mathcal{X}}}_{(i)}}$ depends heavily on computation of matrix P_{ i }. We show how we can completely avoid computing P_{ i } and hence simply discard (25) from RLS recursions. This means that RLS algorithm will reduce to least mean square (LMS) in terms of complexity.
4.1 Reducing complexity by avoiding P_{ i }
Let us assume that P_{1}=I and that a_{ i } are orthogonal for i=0,1,2,⋯,N−1, i.e. ${\mathbf{a}}_{i}^{H}{\mathbf{a}}_{j}=0$ for i≠j^{b}. First, we merge (24) and (25) to get
Now using (26) and our assumptions, it follows by induction that ${\mathbf{P}}_{i}{\widehat{\mathbf{X}}}_{a}^{H}(i+1)={\widehat{\mathbf{X}}}_{a}^{H}(i+1)$, ${\mathbf{P}}_{i}{\widehat{\mathbf{X}}}_{a}^{H}(i+2)={\widehat{\mathbf{X}}}_{a}^{H}(i+2)$ and ${\mathbf{P}}_{i+1}{\widehat{\mathbf{X}}}_{a}^{H}(i+2)={\widehat{\mathbf{X}}}_{a}^{H}(i+2)$. This means that if the successive regressors are orthogonal, we can simply replace P_{ i } with an identity matrix and hence discard Equation 25. Moreover, from the orthogonality assumptions, it also follows that
where I_{ n } represents an n×n identity matrix. Incorporating these results into the RLS recursions, the matrices Γ_{ i } and G_{ i } become independent of P_{ i } and are given by
The resulting lowcomplexity blind algorithm based on (20) to (22), (28) and (29) for metric computation, requires no matrix inversion or computation of P_{ i }.
4.2 Reducing complexity by carrier reordering
In the above approximation, we assumed that P_{1}=I and a_{ i } are orthogonal which allows us to use (27). However, a_{ i } are columns of the partial FFT matrix A_{ i }, so strictly speaking, they are not orthogonal. Hence, the successive regressor matrices would not be orthogonal, too. We will show that we can make them orthogonal or semiorthogonal by carrier reordering. To understand the idea, we compute and plot the magnitude of autocorrelation of these partial vectors given by
in Figure 2 for N=16 and L=4 and where we set l=1. It can be seen that columns 1, 5, 9, and 13 are orthogonal to each other and so are the columns 2, 6, 10, 14 and so on. If we visit the subcarriers in the order 1,5,9,13,2,6,10,14,⋯,4,8,12,16, we find that consecutive vectors will be orthogonal or approximately orthogonal. In general, it is found that with △=N/L, the vectors a_{ i },a_{i+△},a_{i+2△},∀i are approximately orthogonal. Therefore, by simple reordering the carriers, we can achieve orthogonality among different subcarriers and use that fact to reduce the complexity of our algorithm as done in section 4.1.
5 Complexity reduction by using reliable carriers
Despite avoiding P_{ i } and the carrier reordering approach, it has been observed that complexity of algorithm becomes very large in multipleantenna systems as compared to singleantenna systems especially in the low SNR regime. The major source of complexity is attributed to backtracking of the proposed algorithm. Backtracking occurs whenever the constraint ${M}_{{\widehat{\mathcal{X}}}_{(i)}}<R$ is not satisfied. The algorithm then goes back either to the nearest subcarrier or to the current subcarrier whose alphabet is not exhausted and increments the alphabet (step 3 then step 5). This issue is rigorously analysed in [20] where it is shown that the probability of backtracking is almost zero at higher SNR; however, no solution is proposed to deal with it in the lower SNR case. The fact is that backtracking is inevitable in all blind search algorithms. Although it cannot be avoided, we propose to minimize its effect by using the concept of reliable carriers. The previous methods of reducing the complexity have the following drawbacks:

1.
The proposed solutions are still very complex as they do not take into account the issue of backtracking which is a major source of complexity. They can be considered to have low complexity only in the high SNR regime, where we essentially get rid of backtracking.

2.
The proposed solutions do not work in low SNR regime and becomes infeasible for multipleantenna systems. This is due to the fact that the search space at each node grows as $\Omega {}^{{N}_{\mathrm{t}}}$ as compared to Ω for the SISO system, where Ω is the alphabet size and N _{t} is the number of transmit antennas. Thus, the complexity of the proposed algorithms due to backtracking ultimately dominates the complexity induced by computing the matrix P _{ i } (or its inverse) and becomes the real bottleneck.

3.
The proposed methods do not make use of the fact that pilots are usually present in real systems to aid in channel estimation and that the channel is usually slowly varying.
We can make use of the third point to our advantage. Specifically, the presence of some pilots and slow variation in the channel allows us to get a tentative estimate of the data. If we are able to arrange the data according to its reliability, starting with the most reliable data, then there would be a less chance that we need to backtrack. Since earlier data is reliable, there is no need to backtrack for this part. The later data might not be reliable, but by the time we start processing this data, the algorithm would have been already converged. However, measuring data reliability requires tentative channel/data estimates. Thus, the blind algorithm can be turned into a semiblind algorithm to reliably track the channel along with the data detection. The semiblind algorithm would require a short training sequence of L symbols only at the start of transmission to get a tentative estimate of the data and its reliability and no further pilots or channel statistics would be required.
5.1 Measuring the reliability
For measuring the reliability of data carriers, we borrow the idea presented in [21] by the author of the current paper, where it was used in the context of nonlinear distortion mitigation in OFDM. To minimize backtracking, the algorithm must devise a procedure to identify the reliable subcarriers from the tentative estimates of channel and the data. With receiver having an estimate of channel, the decoding process can be accomplished by rewriting (10) into the form as shown below:
where $\stackrel{~}{\mathcal{Y}}={\left[{\left[{\mathcal{Y}}^{(k)}\right]}^{T}\phantom{\rule{1em}{0ex}}{\left[{\mathcal{Y}}^{\ast (k+1)}\right]}^{T}\right]}^{T},\stackrel{~}{\mathcal{N}}={\left[{\left[{\mathcal{N}}^{(k)}\right]}^{T}{\left[{\mathcal{N}}^{\ast (k+1)}\right]}^{T}\right]}^{T}$ and H_{ a } is an Alamoutilike matrix defined as
By left multiplying both sides of (31) with $\frac{1}{\sqrt{\rho}}{\mathbf{H}}_{a}^{1}$ and rearranging the terms, we get:
with the difference vector; $\mathbf{D}\triangleq \frac{1}{\sqrt{\rho}}{\mathbf{H}}_{a}^{1}\stackrel{~}{\mathcal{N}}$. The imperfect knowledge of the channel results in an estimation error Δ H_{ a } and consequently the vector D represents the distortion due to channel estimation error and the effect of additive noise.
To assess the reliability, consider a data carrier $\widehat{\mathcal{X}}(k)$ (in scalar case) and its nearest constellation point $\u3008\widehat{\mathcal{X}}(k)\u3009$. Treating channel estimation error as noise, MLbased decoding would yield $\mathcal{X}(k)$ by mapping $\widehat{\mathcal{X}}(k)$ to the nearest constellation point $\u3008\widehat{\mathcal{X}}(k)\u3009$. Such a scheme would be very efficient at higher SNR if distortion was only due to AWGN. However, in our case, we have an additional perturbation due to channel estimation error that is independent of SNR, and therefore, we expect that part of data samples would be severely effected by the channel distortion and fall outside their actual decision regions. Clearly, there is a need to assess and identify these unreliable data coefficients for our algorithm to reduce the backtracking.
Authors in [21] have developed a rigorous method for assessing the reliability of estimated data coefficients. Intuitively, for the data carrier $\widehat{\mathcal{X}}(k)$, we can measure its reliability based on relative posterior probability that the difference term D(k) equals $\widehat{\mathcal{X}}(k)\u3008\widehat{\mathcal{X}}(k)\u3009$ to the probability that it equals some other vector $\widehat{\mathcal{X}}(k){\Omega}_{m}{\Omega}_{m}\ne \u3008\widehat{\mathcal{X}}(k)\u3009$. For example,
defines the reliability in decoding $\widehat{\mathcal{X}(k)}$ to the closest constellation point relative to decoding to the nearest neighbour ${\mathcal{X}}_{\mathit{\text{NN}}}(k)$. Figure 3 illustrates this concept, such that for instance even though $\widehat{\mathcal{X}}(1)$ and $\widehat{\mathcal{X}}(2)$ have the same distance from $\u3008\widehat{\mathcal{X}}(1)\u3009=\u3008\widehat{\mathcal{X}}(2)\u3009$, $\widehat{\mathcal{X}}(2)$ has a higher reliability than $\widehat{\mathcal{X}}(1)$ as it is farther from nearest neighbour. This suggests the dependence of phase ${\theta}_{\widehat{\mathcal{X}}(k)\u3008\widehat{\mathcal{X}}(k)\u3009}$ on the reliability in addition to the magnitude $\widehat{\mathcal{X}}(k)\u3008\widehat{\mathcal{X}}(k)\u3009$. In essence, the reliability of a measurement at each tone is a function that maps a triple (magnitude, phase and channel gain) into ${\mathcal{R}}_{[1,\infty ]}$. The exact expression for reliability is the generalization of (34) to the vectorwise likelihood ratio defined as
where f_{ D }(.) is the pdf of the distortion vector D, which by definition, can be easily seen to be Gaussian circularly symmetric with variance ${\sigma}_{D}^{2}=\frac{1}{\rho}\left({\mathbf{Ha}}^{1}\right){\left({\mathbf{Ha}}^{1}\right)}^{H}$. The above computation for exact reliability is however inefficient since it would require O(NΩ) evaluations of f_{ D }(.) which grows with constellation size Ω. Alternately, the geometricbased approximations for assessing reliability as derived in [21] may be employed with marginal loss in the performance. Ultimately, once the vector $\Re $ is computed, we can proceed to select the most reliable data tones. These reliable data tones can then be supplied to our algorithm for initial search of the ML solution. Based on the above developments, we now introduce the semiblind algorithm.
5.2 The semiblind algorithm

1.
Obtain an initial estimate of the channel vector h from L training/pilot symbols at start of transmission, then repeat the following steps over two consecutive time instants.

2.
Predict and decode the carriers $\widehat{\mathcal{X}}$ from previous channel estimate $\widehat{\mathbf{h}}$ and observation vector $\stackrel{~}{\mathcal{Y}}$.

3.
Use (35) to compute reliability of data carriers, ${\Re (k)}_{k=1}^{P}$, with 1≤P≤N and rearrange them in decreasing order of their reliability. The parameter P represents the total number of reordered carriers used by the algorithm.

4.
Run the blind algorithm proposed previously, starting with most reliable data first, to obtain the exact ML estimates of the channel and the data.
Remarks: The first three steps of the semiblind algorithm serve as preprocessing steps tailored to minimizing the backtracking of blind algorithm in step 4. One can easily obtain the channel estimates from pilots in step 1. The prediction step 2 is trivial and would suffer only little distortion as the channel does not change much in slow fading. To initiate the RLS recursions of blind algorithm in step 4, we initialize the channel vector with its previous estimate and set P_{ i }=I; thus, no channel statistics are required a priori. The blind algorithm is supplied with reordered carriers with most reliable data to start over the search (step 3), so that the there will be almost no backtracking during the convergence of the algorithm. However, we observe that the carrier reordering based on reliability measures does not ensure orthogonality of successive regressors; therefore, the low complexity variants of RLS introduced earlier cannot be employed.
6 Exploiting the structure of Alamouti coding
In our previous approaches to reducing the complexity of blind algorithm, we did not use the Alamouti coding structure, which can simplify the receiver design by linearly processing the received data samples. We will show that by exploiting the orthogonal structure of Alamouti coding, we can either achieve (a) The complexity reduction of blind algorithm for MISO OFDM system to that of SISO OFDM system of [20] with little effect on estimation performance, or (b) improve the estimation performance of our algorithm with full complexity of the MISO OFDM. To achieve the first goal, we start by multiplying (31) by ${\mathbf{H}}_{a}^{H}$ to get
where z is the new observation vector, w is the noise vector, and the block diagonal matrix B is defined as
where ⊗ represents the Kronecker product. Thus, owing to the orthogonal structure of Alamouti, the system in (36) is decoupled into two SISO OFDM systems as
where $\stackrel{~}{\mathbf{B}}=\left({\Lambda}_{1}^{\ast}{\Lambda}_{1}+{\Lambda}_{2}^{\ast}{\Lambda}_{2}\right)$. The joint ML solutions for SISO systems (42) would involve only 2Ω^{N} hypothesis tests as opposed to Ω^{2N} for the original system, which implies huge reduction in complexity. However, this comes at the cost of estimation performance as number of unknown channel parameters in (42) are increased to 2L as opposed to L is the SISO system. In a similar fashion, the estimation performance of the blind algorithm can be improved by multiplying (10) by ${\mathbf{X}}_{a}^{H}$ as follows:
where $\overline{\mathbf{z}}$ is the new observation vector, $\overline{\mathbf{w}}$ is the noise vector, and the block diagonal matrix $\overline{\mathbf{B}}$ is
We can see that the system in (40) can also be decoupled as
where $\stackrel{~}{\overline{\mathbf{B}}}=\left(\mathbf{A}\text{diag}\left({\mathcal{X}}_{1}^{(k)}{}^{2}+{\mathcal{X}}_{2}^{(k)}{}^{2}\right){\mathbf{A}}^{H}\right)$. It can be seen that the above equations correspond to the SISO systems for which the joint ML solution can be derived easily. It is worth mentioning that the search space for blind ML is not reduced due to dependence of observations and the matrix $\overline{\mathbf{B}}$ on ${\mathcal{X}}_{1}^{(k)}$ and ${\mathcal{X}}_{2}^{(k)}$. However, the number of channel parameters is reduced to half, which results in an improved estimation performance. In short, the Alamouti structure enables us to either improve upon the estimation performance or the complexity reduction but not the both.
7 Simulation results
For simulations, we first consider OFDM system with N=16 subcarriers and channel length L=4 for each transmitreceive channel and CP length of at least L−1. For blind algorithm, both channels are independent Rayleigh fading, assumed stationary over two consecutive OFDM blocks and each having an exponential power decay profile, i.e. E{h_{ i }(τ)^{2}}=e^{−0.2τ}. Information symbols are modulated using binary phase shift keying (BPSK) or quadrature amplitude modulation (QAM).
In Figure 4, we plot the results for N=16, BPSK data symbols using perfectly known channel and our exact blind algorithm, together with lowcomplexity variants, i.e. blind algorithm with (a) P_{ i }=I and (b) P_{ i }=I with subcarrier reordering. In the first case, the performance degrades and bit error rate (BER) reaches an error floor. However, with subcarrier reordering approach, we almost get the same performance as that of exact blind algorithm without requiring the channel statistics. A similar trend is observed in Figure 5, when 4QAM signal modulation is considered.
For semiblind algorithm, we adopt the AR(1) process to model the slow rayleigh fading channels, where the channel weight vector varies as [23]: h(n)=α h(n−1)+q(n) and where α=J_{0}(2π f_{d}T_{s}) and q is a complex normal vector with covariance matrix (1−α^{2})I. The product of maximum Doppler frequency f_{d} and sampling time T_{s} referred to as normalized Doppler frequency F_{D}, controls the amount of variations in the channel statistics. Two different values of normalized Doppler frequency, i.e., 0.1 and 0.001, are considered in the simulations. Results for semiblind algorithm are presented against perfect coherent detection in Figures 6 and 7 for BPSK and 4−QAM modulations respectively which show favourable performance of algorithm under different fading conditions. For higher modulations and number of subcarriers, we use the SISO OFDM systems obtained after decoupling by Alamouti structure and results are presented in Figure 8 for 16−QAM constellation with N=64.To assess the computational complexity of proposed algorithm, we compare the average number of nodes visited by the algorithm with various reliability measures in Figure 9. It is clearly observed that proposed reliability scheme offers significantly lower complexity at lower SNR values. At higher SNR, the complexity is constant, confirming the fact that there is almost no backtracking. Figure 10 shows that the performance for various degrees of reliability measures is almost identical which means that computational advantages are attained without degrading the performance. Through simulations, it has also been observed that the reliability of around 50% to 60% is enough for a good performance, although more importantly, the algorithm does not disfavour the usage of more reliable carriers.Finally, in Figure 11, the complexity of comparison for different modulation schemes such as BPSK, 4QAM and 16QAM is presented which clearly indicates the computational advantages of proposed reliabilitybased method.
8 Conclusions
In this paper, we presented a blind ML algorithm for joint channel estimation and data detection in OFDM wireless systems using Alamouti STBC coding. The simulation results show favourable performance of algorithm. As evident from simulations, our lowcomplexity blind algorithm performs equally well as exact blind algorithm. Moreover, the new algorithm does not need any prior information about channel statistics as it avoids calculating matrix P_{ i } with subcarrier reordering. Another major source of complexity in blind algorithm is the issue of backtracking which becomes more prominent in the lowSNR regime. This issue was not tackled in the previous studies. We proposed a semiblind algorithm which minimizes the probability of backtracking by supplying the blind algorithm with reordered subcarriers based on their reliability computations using a sophisticated reliability criterion. By minimizing the backtracking, significant improvement is achieved in terms of complexity without compromising the performance. Moreover, the orthogonal structure of Alamouti coding was also exploited to yield complexity reduction or improved estimation performance. The reduced complexity variant turns out to be very handy when large number of subcarriers and/or large size constellation modulations are employed.
Endnotes
^{a} This lemma was proved in [20] for SISO case; we simply extend it here to the multiantenna case.
^{b} In fact, a weaker condition that three consecutive vectors a_{ i }, a_{i+1} and a_{i+2} are orthogonal would suffice.
References
 1.
LJ Cimini: Analysis and simulation of a digital mobile channel using orthogonal frequency division multiplexing. IEEE Trans. Commun 1985, 33(7):665675. 10.1109/TCOM.1985.1096357
 2.
Bölcskei H: Principles of MIMOOFDM Wireless Systems, Chapter in CRC Handbook on Signal Processing for Mobile Communications, M. Ibnkahla, Ed. Ch. 12 2004.
 3.
Crow BP, Widjaja I, Kim JG, Sakai PT: IEEE 802.11 wireless local area networks. IEEE Commun. Mag 1997, 35(9):116126. 10.1109/35.620533
 4.
Koffman I, Roman V: Broadband wireless access solutions based on OFDM access in IEEE 802.16. IEEE Commun. Mag 2002, 40(4):96103. 10.1109/35.995857
 5.
3GPP TR 36.211: Evolved Universal Terrestrial Radio Access (EUTRA); Physical Channels and Modulation. ver. 10.5.0, 2012.
 6.
Telatar E: Capacity of multiantenna gaussian channels. Eur. Trans. Telecomm 1999, 10(6):585595. 10.1002/ett.4460100604
 7.
Alamouti S: A simple transmit diversity technique for wireless communications. IEEE J. Selected Areas Commun 1998, 16(8):14511458. 10.1109/49.730453
 8.
Stoica P, Ganesan G: Spacetime block codes: trained, blind and semiblind detection. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) . IEEE, Orlando, FL, USA; 13–17 May 2002.
 9.
Swindlehurst AL, Leus G: Blind and semiblind equalization for generalized spacetime block codes. IEEE Trans. Signal Process 2002, 50(10):24892498. 10.1109/TSP.2002.803319
 10.
Muquet B, De Courville M, Duhamel P: Subspacebased blind and semiblind channel estimation for OFDM systems. IEEE Trans. Signal Process 2002, 50(7):16991712. 10.1109/TSP.2002.1011210
 11.
Tu CC, Champagne B: Subspacebased blind channel estimation for MIMOOFDM systems with reduced time averaging. IEEE Trans. Veh. Tech 2010, 59(3):15391544.
 12.
Bolcskei H, Heath RW, Paulraj AJ: Blind equalization in OFDMbased multiantenna systems. The IEEE 2000 Adaptive Systems for Signal Processing, Communications, and Control Symposium (ASSPCC) . IEEE,, Lake Louise, Alta; 01–04 Oct 2000:5863.
 13.
Choi J, Lim CC: A Cholesky factorization based approach for blind FIR channel identification. IEEE Trans. Signal Process 2008, 56(4):17301735.
 14.
Banani SA, Vaughan RG: OFDM with iterative blind channel estimation. IEEE Trans. Veh. Tech 2010, 59(9):42984308.
 15.
Cui T, Tellambura C: Joint data detection and channel estimation for OFDM systems. IEEE Trans. Commun 2006, 54(4):670679.
 16.
Larsson EG, Stoica P, Li J: On maximumlikelihood detection and decoding for spacetime coding systems. IEEE Trans. Signal Process 2002, 50(4):937944. 10.1109/78.992142
 17.
Chang TH, Ma WK, Chi CY: Maximumlikelihood detection of orthogonal spacetime block coded OFDM in unknown block fading channels. IEEE Trans. Signal Process 2008, 56(4):16371649.
 18.
AlNaffouri TY, Quadeer AA: Cyclic prefix based enhanced data recovery in OFDM. IEEE Trans. Signal Process 2010, 58(6):34063410.
 19.
W Xu: Lowcomplexity blind maximumlikelihood detection for SIMO systems with general constellations. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) . IEEE,, Las Vegas, NV, USA; 31 Mar–4 Apr 2008:28172820.
 20.
AlNaffouri TY, Dahman AA, Sohail MS, Xu W, Hassibi B: Lowcomplexity blind equalization for OFDM systems with general constellations. IEEE Trans. Signal Process 2012, 60(12):63956407.
 21.
AlSafadi EB, TY AlNaffouri: Pilotless recovery of clipped OFDM signals by compressive sensing over reliable data carriers. In IEEE 13th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC). IEEE, Cesme; 17–20 June 2012:580584.
 22.
N AlDhahir: Singlecarrier frequencydomain equalization for spacetime blockcoded transmissions over frequencyselective fading channels. IEEE Commun. Lett 2001, 5(7):304306.
 23.
Sayed AH: Fundamentals of Adaptive Filtering. Wiley, New York; 2003.
Acknowledgements
The authors would like to acknowledge the support provided by the Deanship of Scientific Research (DSR) at King Fahd University of Petroleum & Minerals (KFUPM) for funding this work through project No. RG122CES112.
Author information
Additional information
Competing interests
The authors declare that they have no competing interests.
Authors’ original submitted files for images
Rights and permissions
About this article
Received
Accepted
Published
DOI
Keywords
 Blind ML detection
 OFDM
 Spacetime block coding