A low complexity Hopfield neural network turbo equalizer

Myburgh, Hermanus C; Olivier, Jan C

doi:10.1186/1687-6180-2013-15

Research
Open access
Published: 08 February 2013

A low complexity Hopfield neural network turbo equalizer

Hermanus C Myburgh¹ &
Jan C Olivier²

EURASIP Journal on Advances in Signal Processing volume 2013, Article number: 15 (2013) Cite this article

2449 Accesses
4 Altmetric
Metrics details

Abstract

In this article, it is proposed that a Hopfield neural network (HNN) can be used to jointly equalize and decode information transmitted over a highly dispersive Rayleigh fading multipath channel. It is shown that a HNN MLSE equalizer and a HNN MLSE decoder can be merged in order to realize a low complexity joint equalizer and decoder, or turbo equalizer, without additional computational complexity due to the decoder. The computational complexity of the Hopfield neural network turbo equalizer (HNN-TE) is almost quadratic in the coded data block length and approximately independent of the channel memory length, which makes it an attractive choice for systems with extremely long memory. Results show that the performance of the proposed HNN-TE closely matches that of a conventional turbo equalizer in systems with short channel memory, and achieves near-matched filter performance in systems with extremely large memory.

Introduction

Turbo equalization has its roots in turbo coding, first proposed in [1] for the iterative decoding of concatenated convolutional codes. In [2, 3], the idea of turbo decoding was applied to systems transmitting convolutional coded information through multipath channels, in order to improve the bit-error rate (BER) performance, with great success. Due to the computational complexity of its constituent maximum a posteriori (MAP) equalizer and MAP decoder, the computational complexity of these turbo equalizers are exponentially related to the channel impulse response (CIR) length as well as the encoder constraint length, limiting their effective use in systems where the channel memory and/or the encoder constraint length is large, with the MAP equalizer being the main culprit due to long channel delay spreads.

To mitigate the high computational complexity exhibited by the MAP equalizer, several authors have proposed suboptimal equalizers to replace the optimal MAP equalizer in the Turbo Equalizer structure, with complexity that is linearly related to the channel memory length. In [4, 5], it was shown how a minimum mean squared error (MMSE) equalizer is used in a Turbo Equalizer by modifying it to make use of prior information provided in the form of extrinsic information. Various authors have also proposed the use of decision feedback equalizers (DFE) while using extrinsic information as prior information to improve the BER performance after each iteration [6–10]. Also, in [11, 12] it was proposed that a soft interference canceler (SIC) be modified to make use of soft information in order to be used as a low complexity equalizer in a turbo equalizer, and in [13] the way in which a SIC incorporates soft information was modified to improve performance. The proposed equalizers inherently suffer from noise enhancement (MMSE) and error propagation (DFE and SIC) which limit their performance, and hence the overall performance of the turbo equalizers in which they are used. Due to the fact that none of the proposed equalizers are able to produce exact MAP estimates of the transmitted coded information, the performance of the Turbo Equalizer in which they are implemented will ultimately be worse than when an optimal MAP equalizer is utilized, due to the performance loss incurred at the output of these suboptimal equalizers. This trade-off always exists: If one gains in terms complexity, one loses in terms of performance.

In this article, we propose to combat the performance loss due to suboptimal (or non-MAP) equalizer output, by combining the equalizer and the decoder into one equalizer/decoder structure, so that all information can be processed as a whole, and not be passed between the equalizer and the decoder. This vision has successfully been implemented and demonstrated by the authors in [14] using a dynamic Bayesian network (DBN) as basis. In this paper, however, we show that using the Hopfield neural network (HNN) [15] as the underlying structure also works well, and has a number of advantages as discussed in [16].

In [16], the authors proposed a maximum likelihood sequence estimation (MLSE) equalizer which is able to equalize M-ary quadrature amplitude modulation (M-QAM) modulated signals in systems with extremely long memory. The complexity of the equalizer proposed in [16] is quadratic in the data block length and approximately independent of the channel memory length. Its superior computational complexity is due to the high parallelism of its underlying neural network structure. It uses the HNN structure which enables fast parallel processing of information between neurons, producing ML sequence estimates at the output. It was shown in [16] that the performance of the HNN MLSE equalizer closely matches that of the Viterbi MLSE equalizer in short channels, and near-optimally recombines the energy spread across the channel in order to achieve near-matched filter performance when the channel is extremely long.

The HNN has also been shown by several authors to be able to decode balanced check codes [17, 18]. These codes, together with methods for encoding and decoding, were first proposed in [19], but it was later shown in [17, 18] that single codeword decoding can also be performed using the HNN. To date, balanced codes is the only class of codes that can be decoded with the HNN. The ability of the HNN to detect binary patterns allows it to determine the ML codeword from a predefined set of codewords. In this paper it is shown that the HNN ML decoder can be extended to allow for the ML estimation of a sequence of balanced check codes. It is therefore extendable to an MLSE decoder.

In this article, a novel turbo equalizer is developed by combining the HNN MLSE equalizer developed in [16] and a HNN MLSE decoder (used to decode balanced codes, and only balanced codes), resulting in the Hopfield neural network turbo equalizer (HNN-TE), which can be used as replacement for a conventional turbo equalizer (CTE), made up of a equalizer/decoder pair, in systems with extremely long memory, where the coded symbols are interleaved before transmission through the multipath channel. The HNN-TE is able to equalize and decode (balanced codes) in systems with extremely long memory, since the computational complexity is nearly independent of the channel memory length. Like the HNN MLSE equalizer, its superior complexity characteristics are due to the high parallelism of its underlying neural network structure.

This article is structured as follows. Section 2 presents a brief discussion on Turbo Equalization. Section 3 discusses the HNN in general, while the HNN MLSE equalizer and the HNN MLSE decoder are discussed in Section 4, followed by a discussion on the fusion of the two in order to realize the HNN-TE. In Section 5, the results of a computational complexity analysis of the HNN-TE and a CTE are presented, followed by a memory requirements analysis in Section 6. Simulation results are presented in Section 7 and conclusions are drawn in Section 8.

Turbo equalization

Turbo equalizers are used in multipath communication systems that make use of encoders, usually convolutional encoders, to encoded the source symbol sequence s of length N _u (using some generator matrix G) at a rate R _c to produce coded information symbols c of length N _c = N _u / R _c, after which the coded symbols c are interleaved with a random interleaver before modulation and transmission. The interleaved coded symbols ć are transmitted through a multipath channel with a CIR length of L, causing inter-symbol interference among adjacent transmitted symbols at the receiver. At the receiver the received inter-symbol interference (ISI) corrupted coded symbols are matched filtered and used as input to the turbo equalizer. The received symbol sequence is given by

r = H \overset{́}{c} + n,

(1)

where n is a vector containing complex Gaussian noise samples and ć is the interleaved coded symbols given by

\overset{́}{c} = J G^{T} s,

(2)

where J is an N _c × N _c interleaver matrix, and H is the N _c × N _c channel matrix

H = [\begin{matrix} h_{0} & 0 & \dots & 0 & 0 & 0 & 0 \\ ⋮ & h_{0} & \dots & 0 & 0 & 0 & 0 \\ h_{L - 1} & ⋮ & ⋱ & 0 & 0 & 0 & 0 \\ 0 & h_{L - 1} & ⋱ & ⋱ & 0 & 0 & 0 \\ ⋮ & ⋮ & ⋱ & ⋱ & h_{0} & 0 & 0 \\ 0 & 0 & 0 & h_{L - 1} & \dots & h_{0} & 0 \\ 0 & 0 & 0 & 0 & h_{L - 1} & \dots & h_{0} \end{matrix}] .

(3)

The turbo equalizer uses two a maximum a posterior (MAP) algorithms, one to equalize the ISI-corrupted received symbols and one to decode the equalized coded symbols, which iteratively exchange information. With each iteration of the system, extrinsic information is exchanged between the two MAP algorithms in order to improve the ability of each algorithm to produce correct estimates. This principle was first applied to Turbo Coding, where both MAP algorithms were MAP decoders [3], but has since been applied to iterative equalization and decoding (today known as Turbo Equalization) to reduce the BER performance of the coded multipath communication system [2–5].

Figure 1 shows the structure of the Turbo Equalizer. The MAP equalizer takes as input the ISI-corrupted received symbols r and the extrinsic information $L_{e}^{D} (\hat{s})$ (where $\hat{s}$ the interleaved coded symbol estimates) and produces a sequence of posterior transmitted symbol log-likelihood ratio (LLR) estimates $L^{E} (\hat{s})$ (note that $L_{e}^{D} (\hat{s})$ is zero during the first iteration). Extrinsic information $L_{e}^{E} (\hat{s})$ is determined by

L_{e}^{E} (\hat{s}) = L^{E} (\hat{s}) - L_{e}^{D} (\hat{s}),

(4)

which is deinterleaved to produce $L_{e}^{E} ({\hat{s}}^{'})$ , which is used as input to the MAP decoder to produce a sequence of posterior coded symbol LLR estimates $L^{D} ({\hat{s}}^{'})$ . $L^{D} ({\hat{s}}^{'})$ is used together with $L_{e}^{E} ({\hat{s}}^{'})$ to determine the extrinsic information

L_{e}^{D} ({\hat{s}}^{'}) = L^{D} ({\hat{s}}^{'}) - L_{e}^{E} ({\hat{s}}^{'}),

(5)

$L_{e}^{D} ({\hat{s}}^{'})$ is interleaved to produce $L_{e}^{D} (\hat{s})$ . $L_{e}^{D} (\hat{s})$ is used together with the received symbols r in the MAP equalizer, with $L_{e}^{D} (\hat{s})$ serving to provide prior information on the received symbols. The equalizer again produces posterior information $L^{E} (\hat{s})$ of the interleaved coded symbols. This process continues until the outputs of the decoder settle, or until a predefined stop-criterion is met [3]. After termination, the output $L (\hat{u})$ of the decoder gives an estimate of the source symbols.

The proposed HNN-TE is modeled on one HNN structure, implying that there is no exchange of extrinsic information between its constituent parts. Rather, all information is intrinsically processed in an iterative fashion.

The Hopfield neural network

The HNN was first proposed in [15] and it was shown in that the HNN can be used to solve combinatorial optimization problems as well as pattern recognition problems. In [15] Tank and Hopfield derived an energy function and showed how the HNN can be used to minimize this energy function, thus producing near-ML sequence estimates at the output of the neurons. To enable the HNN to solve an optimization problem, the cost function of that problem is mapped to the HNN energy function, where after the HNN iteratively minimizes its energy function and performs near-MLSE. Also, to enable the HNN to solve a binary pattern recognition problem, the autocorrelation matrix of the set of patterns is used as the weights between the HNN neurons, while the noisy pattern to be recognized is used as the input to the HNN. Again, the HNN iteratively performs pattern recognition in order to produce the near-ML patter at the output of the HNN.

Energy function

The Hopfield energy function can be written as [16]

L = - \frac{1}{2} s^{T} Xs - I^{T} Ts,

(6)

where I is a column vector with N elements, X is an N × N matrix. Assuming that s, I, and X contain complex values, these variables can be written as [16]

\begin{matrix} s & = s_{i} + j s_{q}, \\ I & = I_{i} + j I_{q}, \\ X & = X_{i} + j X_{q}, \end{matrix}

(7)

where s and I are column vectors of length N, and X is an N × N matrix, where subscripts i and q are used to denote the respective in-phase and quadrature components. X is the cross-correlation matrix of the complex received symbols such that

X^{H} = X_{i}^{T} - j X_{q}^{T} = X_{i} + j X_{q},

(8)

implying that it is Hermitian. Therefore $X_{i}^{T} = X_{i}$ is symmetric and $X_{q}^{T} = - X_{q}$ is skew symmetric [16]. By using the symmetric properties of X _i and X _q, (6) can be expanded and rewritten as

L = - \frac{1}{2} [s_{i}^{T} X_{i} s_{i} + s_{q}^{T} X_{q} s_{q} + 2 s_{q}^{T} X_{q} s_{i}] - [s_{i}^{T} I_{i} + s_{q}^{T} I_{q}]

which in turn can be rewritten as [16]

L = - \frac{1}{2} [s_{i}^{T} | s_{q}^{T}] [\begin{matrix} X_{i} & X_{q}^{T} \\ X_{q} & X_{i} \end{matrix}] [\frac{s_{i}}{s_{q}}] - [I_{i}^{T} | I_{q}^{T}] [\frac{s_{i}}{s_{q}}] .

(9)

It is clear that (9) is in the form of (6), where the variables in (6) are substituted as follows:

\begin{matrix} s^{T} & = [s_{i}^{T} | s_{q}^{T}], \\ I^{T} & = [I_{i}^{T} | I_{q}^{T}], \\ X & = [\begin{matrix} X_{i} & X_{q}^{T} \\ X_{q} & X_{i} \end{matrix}] . \end{matrix}

(10)

Equation (9) is used to derive the HNN MLSE equalizer, decoder, and eventually the HNN-TE.

Iterative system

The HNN minimizes the energy function (6) with the following iterative system:

\begin{matrix} u^{(i)} & = T s^{(i)} + I \\ s^{(i + 1)} & = g (β (i) u^{(i)}), \end{matrix}

(11)

where u = {u ₁, u ₂, …, u _N}^T is the internal state of the HNN, s = {s ₁, s ₂, …, s _N}^T is the vector of estimated symbols, g(.) is the decision function associated with each neuron and i indicates the iteration number. β(.) is a function used for optimization as in [14].

The estimated symbol vector $[s_{i}^{T} | s_{q}^{T}]$ is updated with each iteration. $[I_{i}^{T} | I_{q}^{T}]$ contains the best blind estimate for s, and is therefore used as input to the network, while $[\begin{matrix} X_{i} & X_{q}^{T} \\ X_{q} & X_{i} \end{matrix}]$ contains the cross-correlation information of the received symbols. The system produces the MLSE estimates in s after Z iterations.

The Hopfield neural network turbo equalizer

In this section, the derivation of the HNN-TE is discussed, by first deriving its constituent parts—the HNN MLSE equalizer and the HNN MLSE decoder—and then showing how the HNN-TE is finally realized by combining the two.

HNN MLSE equalizer

The HNN MLSE equalizer was developed by the authors in [16]. The HNN MLSE equalizer was applied to single-carrier M-QAM modulated system with extremely long memory, where the CIR length was as long as L = 250, even though this is not a limit. The ability of the HNN MLSE equalizer to equalize signals in systems with highly dispersive channels is due to the fact that its complexity grows quadratically with an increase in transmitted data block size, and that it is approximately independent of the channel memory length. In the following the HNN MLSE equalizer developed in [16] will be presented, without spending time on the derivation.

It was shown in [16] that the correlation matrices X _i and X _q in (10), for a single carrier system transmitting a data block of length N through a multipath channel of length L with the data block initiated and terminated by L - 1 known tail symbols, with values 1 for BPSK modulation and $\frac{1}{\sqrt{2}} + j \frac{1}{\sqrt{2}}$ for M-QAM modulation, can be determined by

X_{i} = - [\begin{matrix} 0 & α_{1} & \dots & α_{L - 1} & \dots & 0 \\ α_{1} & 0 & α_{1} & \dots & ⋱ & ⋮ \\ ⋮ & α_{1} & 0 & ⋱ & ⋮ & α_{L - 1} \\ α_{L - 1} & ⋮ & ⋱ & ⋱ & α_{1} & ⋮ \\ ⋮ & ⋱ & \dots & α_{1} & 0 & α_{1} \\ 0 & ⋱ & α_{L - 1} & \dots & α_{1} & 0 \end{matrix}]

(12)

and

X_{q} = - [\begin{matrix} 0 & γ_{1} & \dots & γ_{L - 1} & \dots & 0 \\ γ_{1} & 0 & γ_{1} & \dots & ⋱ & ⋮ \\ ⋮ & γ_{1} & 0 & ⋱ & ⋮ & γ_{L - 1} \\ γ_{L - 1} & ⋮ & ⋱ & ⋱ & γ_{1} & ⋮ \\ ⋮ & ⋱ & \dots & γ_{1} & 0 & γ_{1} \\ 0 & ⋱ & γ_{L - 1} & \dots & γ_{1} & 0 \end{matrix}]

(13)

where α = {α ₁, α ₂, …, α _L - 1} and γ = {γ ₁, γ ₂, …, γ _L - 1} are respectively, determined by

α_{k} = \sum_{j = 0}^{L - k - 1} h_{j}^{(i)} h_{j + k}^{(i)} + \sum_{j = 0}^{L - k - 1} h_{j}^{(q)} h_{j + k}^{(q)},

(14)

and

γ_{k} = \sum_{j = 0}^{L - k - 1} h_{j}^{(q)} h_{j + k}^{(i)} - \sum_{j = 0}^{L - k - 1} h_{j}^{(i)} h_{j + k}^{(q)},

(15)

where k = 1, 2, 3, …, L - 1 and i and q denote the in-phase and quadrature components of the CIR coefficients.

Upon inspection it is easy to see from (12) through (15) that X _i and X _q can be determined using the respective in-phase and quadrature components of the N × N channel matrix, with the in-phase and quadrature components of the CIR, $h^{(i)} = {h_{0}^{(i)}, h_{1}^{(i)}, \dots, h_{L - 1}^{(i)}}^{T}$ and $h^{(q)} = {h_{0}^{(q)}, h_{1}^{(q)}, \dots, h_{L - 1}^{(q)}}^{T}$ , on the diagonals such that

H^{(i)} = [\begin{matrix} h_{0}^{(i)} & 0 & \dots & 0 & 0 & 0 & 0 \\ ⋮ & h_{0}^{(i)} & \dots & 0 & 0 & 0 & 0 \\ h_{L - 1}^{(i)} & ⋮ & ⋱ & 0 & 0 & 0 & 0 \\ 0 & h_{L - 1}^{(i)} & ⋱ & ⋱ & 0 & 0 & 0 \\ ⋮ & ⋮ & ⋱ & ⋱ & h_{0}^{(i)} & 0 & 0 \\ 0 & 0 & 0 & h_{L - 1}^{(i)} & \dots & h_{0}^{(i)} & 0 \\ 0 & 0 & 0 & 0 & h_{L - 1}^{(i)} & \dots & h_{0}^{(i)} \end{matrix}]

(16)

and

H^{(q)} = [\begin{matrix} h_{0}^{(q)} & 0 & \dots & 0 & 0 & 0 & 0 \\ ⋮ & h_{0}^{(q)} & \dots & 0 & 0 & 0 & 0 \\ h_{L - 1}^{(q)} & ⋮ & ⋱ & 0 & 0 & 0 & 0 \\ 0 & h_{L - 1}^{(q)} & ⋱ & ⋱ & 0 & 0 & 0 \\ ⋮ & ⋮ & ⋱ & ⋱ & h_{0}^{(q)} & 0 & 0 \\ 0 & 0 & 0 & h_{L - 1}^{(q)} & \dots & h_{0}^{(q)} & 0 \\ 0 & 0 & 0 & 0 & h_{L - 1}^{(q)} & \dots & h_{0}^{(q)} \end{matrix}] .

(17)

Using H ⁽ⁱ⁾ and H ^(q) the correlation matrices in (12) and (13) can be determined by

X_{i} = - (H^{(i) T} H^{(i)} + H^{(q) T} H^{(q)})

(18)

which is simply

X_{i} = - Re {H^{T} H} .

(19)

Also

X_{q} = - {(H^{(q) T} H^{(i)} - H^{(i) T} H^{(q)})}^{T},

(20)

which is

X_{q} = - Im {H^{T} H} .

(21)

X _i and X _q are then used to construct the combined correlation matrix in (10).

X = [\begin{matrix} X_{i} & X_{q}^{T} \\ X_{q} & X_{i} \end{matrix}] .

(22)

It was also shown in [16] that the input vectors I _i and I _q in (10) are determined by

I_{i} = [\begin{matrix} λ_{1} - ρ (α_{1} + γ_{1} + \dots + α_{L - 1} + γ_{L - 1}) \\ λ_{2} - ρ (α_{2} + γ_{2} + \dots + α_{L - 1} + γ_{L - 1}) \\ λ_{3} - ρ (α_{3} + γ_{3} + \dots + α_{L - 1} + γ_{L - 1}) \\ ⋮ ⋮ ⋮ \\ λ_{L - 1} - ρ (α_{L - 1} + γ_{L - 1}) \\ λ_{L} \\ ⋮ ⋮ ⋮ \\ λ_{N - L + 1} \\ λ_{N - L + 2} - ρ (α_{L - 1} - γ_{L - 1}) \\ ⋮ ⋮ ⋮ \\ λ_{N - 2} - ρ (α_{3} - γ_{3} + \dots + α_{L - 1} - γ_{L - 1}) \\ λ_{N - 1} - ρ (α_{2} - γ_{2} + \dots + α_{L - 1} - γ_{L - 1}) \\ λ_{N} - ρ (α_{1} - γ_{1} + \dots + α_{L - 1} - γ_{L - 1}) \end{matrix}]

(23)

and

I_{q} = [\begin{matrix} ω_{1} - ρ (α_{1} - γ_{1} + \dots + α_{L - 1} - γ_{L - 1}) \\ ω_{2} - ρ (α_{2} - γ_{2} + \dots + α_{L - 1} - γ_{L - 1}) \\ ω_{3} - ρ (α_{3} - γ_{3} + \dots + α_{L - 1} - γ_{L - 1}) \\ ⋮ ⋮ ⋮ \\ ω_{L - 1} - ρ (α_{L - 1} - γ_{L - 1}) \\ ω_{L} \\ ⋮ ⋮ ⋮ \\ ω_{N - L + 1} \\ ω_{N - L + 2} - ρ (α_{L - 1} + γ_{L - 1}) \\ ⋮ ⋮ ⋮ \\ ω_{N - 2} - ρ (α_{3} + γ_{3} + \dots + α_{L - 1} + γ_{L - 1}) \\ ω_{N - 1} - ρ (α_{2} + γ_{2} + \dots + α_{L - 1} + γ_{L - 1}) \\ ω_{N} - ρ (α_{1} + γ_{1} + \dots + α_{L - 1} + γ_{L - 1}) \end{matrix}],

(24)

where $ρ = 1 / \sqrt{2}$ for M-QAM modulation, ρ = 1 in I _i and ρ = 0 in I _q for BPSK modulation, and Λ = {λ ₁, λ ₂, …, λ _N} is determined by

λ_{k} = \sum_{j = 0}^{L - 1} r_{j + k}^{(i)} h_{j}^{(i)} + \sum_{j = 0}^{L - 1} r_{j + k}^{(q)} h_{j}^{(q)},

(25)

and Ω = {ω ₁, ω ₂, …, ω _N} is determined by

ω_{k} = \sum_{j = 0}^{L - 1} r_{j + k}^{(q)} h_{j}^{(i)} - \sum_{j = 0}^{L - 1} r_{j + k}^{(i)} h_{j}^{(q)},

(26)

where k = 1, 2, 3, …, N with i and q again denoting the in-phase and quadrature components of the respective elements. The combined input vector in (10) is therefore constructed as

I = [\frac{I_{i}}{I_{q}}] .

(27)

Note that Λ and Ω can easily be determined by

Λ = H^{(i) T} r^{(i)} + H^{(q) T} r^{(q)},

(28)

and

Ω = H^{(i) T} r^{(q)} - H^{(q) T} r^{(i)},

(29)

where r ⁽ⁱ⁾ and r ^(q) are the respective in-phase and quadrature components of the received symbols r = {r ₁, r ₂, …, r _{N + L - 1}}^T.

By deriving the cross-correlation matrix X and the input vector I in (10), the model in (9) is complete, and the iterative system in (11) can be used to equalize M-QAM modulated symbols transmitted through a channel with large CIR lengths. The HNN MLSE equalizer was evaluated in [16] for BPSK and 16-QAM with performance reaching the matched-filter bound in extremely long channels.

HNN MLSE decoder

The HNN has been shown to be able to decode balanced codes [17, 18]. A binary word of length m is said to be balanced if it contains exactly m / 2 ones and m / 2 zeros [19]. In addition, balanced codes have the property that no codeword is contained in another word, which simply means that positions of ones in one codeword will never be a subset of the positions of ones in another codeword [19].

The encoding process is described in [19] where the first k bits of the uncoded word is flipped in order to ensure the resulting codedword is “balanced,” whereafter the position k is appended to the balanced codeword before transmission. This encoding process is not followed here, as the set of m = 2ⁿ balanced codewords are determined before hand, after which encoding is performed by mapping a set of n bits to 2ⁿ balanced binary phase-shift keying (BPSK) symbols of length 2ⁿ, or by mapping a set of 2n bits to 2ⁿ balanced quaternary quadrature amplitude modulation (4-QAM) symbols of length 2ⁿ.

The HNN decoder developed here uses the set of predetermined codewords to determined the connection weights describing the level of connection between the neurons. It has previously been shown how a HNN can be used to decoded one balanced code at a time, but the HNN MLSE decoder we derive here is able to simultaneously decode any number of concatenated codewords in order to provide the ML transmitted sequence of codewords. After the HNN MLSE decoding, the ML BPSK or 4-QAM codewords of length 2ⁿ are demapped to n bits (or 2n bits for 4-QAM), which completes the decoding process.

Codeword selection

The authors have found that Walsh-Hadamard codes, widely used in code division multiple access (CDMA) systems [20], are desirable codes for this application, due to their seeming balance and orthogonality characteristics. Walsh-Hadamard codes are linear codes that map n bits to 2ⁿ codewords, where each set of codewords have a Hamming distance of 2^n-1 and a Hamming weight of 2^n-1.

Walsh-Hadamard codes are not “balanced” as described above. The first codeword is always all-ones, while subsets of some codewords are contained in others, violating both restrictions for balance. Instead of using the complete set of Walsh-Hadamard codes to map n bits to 2ⁿ codewords, a subset of codes in the Walsh-Hadamard matrix is selected, duplicated and modified so as to construct a new set of 2ⁿ codewords of length 2ⁿ. Consider the set of length 2ⁿ = 8 Walsh-Hadamard codes

H_{8} = [\begin{matrix} 1 & 1 & 1 & 1 & 1 & 1 & 1 & 1 \\ 1 & 0 & 1 & 0 & 1 & 0 & 1 & 0 \\ 1 & 1 & 0 & 0 & 1 & 1 & 0 & 0 \\ 1 & 0 & 0 & 1 & 1 & 0 & 0 & 1 \\ 1 & 1 & 1 & 1 & 0 & 0 & 0 & 0 \\ 1 & 0 & 1 & 0 & 0 & 1 & 0 & 1 \\ 1 & 1 & 0 & 0 & 0 & 0 & 1 & 1 \\ 1 & 0 & 0 & 1 & 0 & 1 & 1 & 0 \end{matrix}] .

(30)

To construct a set of balanced codewords from H ₈, a subset of 2^n-1 codewords is selected, which is used as the first 2^n-1 codewords in the new set of codewords. The second set of 2^n-1 codewords are constructed as follows:

1.
Reverse the order in which the first 2^n-1 codewords appear in the new set.
2.
Flip the bits of the reversed set of 2^n-1 codewords.

Assuming the subset selected from H ₈ above is the set H _8,4:7 (implying that codewords in rows 4 through 7 are selected), the resulting set of 2ⁿ balanced codewords is

C_{8} = [\begin{matrix} 1 & 0 & 0 & 1 & 1 & 0 & 0 & 1 \\ 1 & 1 & 1 & 1 & 0 & 0 & 0 & 0 \\ 1 & 0 & 1 & 0 & 0 & 1 & 0 & 1 \\ 1 & 1 & 0 & 0 & 0 & 0 & 1 & 1 \\ 0 & 0 & 1 & 1 & 1 & 1 & 0 & 0 \\ 0 & 1 & 0 & 1 & 1 & 0 & 1 & 0 \\ 0 & 0 & 0 & 0 & 1 & 1 & 1 & 1 \\ 0 & 1 & 1 & 0 & 0 & 1 & 1 & 0 \end{matrix}] .

(31)

It is clear that C ₈ is balanced in the sense that the rows (codewords) as well as the columns are balanced. It has been found that the HNN decoder performs better if the rows as well as the columns are balanced. The Hamming weight of C ₈ is still 2^n-1 = 2², while the Hamming distance increases slightly larger than 2^n-1 = 2².

By following the steps described above, any set of Walsh-Hadamard codes of length 2ⁿ can be used to create a new set of 2ⁿ balanced codes of length m = 2ⁿ.

Encoding

Encoding is performed by mapping a group of n bits to 2ⁿ BPSK symbols, or a group of 2n bits to 2ⁿ 4-QAM symbols. Before encoding, the set of codewords $C_{2^{n}}$ derived from the set of Walsh-Hadamard codes $H_{2^{n}}$ is made bipolar by converting the 0’s to -1.

BPSK encoding

When BPSK modulation is used, n bits are mapped to 2ⁿ BPSK symbols. The n bits are used to determine an index k in the range 1– 2ⁿ, which is then used to select a codeword from the set of codewords in $C_{2^{n}}$ such that the selected codeword $c = C_{2^{n}} (k)$ . Table 1 shows the number of uncoded bits, codeword length, uncoded bit to coded symbol rate R _s and the uncoded bit to coded bit rate R _c (code rate) for different n.

Table 1 Input-output relationship for BPSK encoder

Full size table

4-QAM encoding

When 4-QAM modulation is used, 2n bits are mapped to 2ⁿ 4-QAM symbols. The first and second groups of n bits (out of 2n bits) are used to determine two indices, k ⁽ⁱ⁾ and k ^(q), in the range 1– 2ⁿ, one for the in-phase part, and the other for the quaternary part of the codeword. The first index k ⁽ⁱ⁾ selects a codeword from $C_{2^{n}}^{(i)}$ , where $C_{2^{n}}^{(i)}$ is derived as before, and the second index k ^(q) selects a codeword from $C_{2^{n}}^{(q)}$ , which can be equal to $C_{2^{n}}^{(i)}$ or can be uniquely determined as explained earlier. The 4-QAM “codeword” is then calculated as $c = C_{2^{n}}^{(i)} (k^{(i)}) + j C_{2^{n}}^{(q)} (k^{(q)})$ , which is much like the result of coded modulation where groups of coded bits (in this case uncoded bits) are mapped to signal constellation points to improve spectral efficiency [20]. Table 2 shows the number of uncoded bits, codeword length, the uncoded bit to coded symbol rate R _s and code rate R _c for different 2n. Even though the code rate remains the same as with BPSK modulation, the throughput doubles as expected.

Table 2 Input-output relationship for 4-QAM encoder

Full size table

Decoder

The HNN is known to be able to recognize input patterns from a set of stored patterns [15, 21]. In the context of the HNN decoder, the patterns are the balanced codewords, and the HNN is able to determine the ML codeword from a set of codewords. This has been demonstrated before but only for one codeword at a time [17]. Therefore, if a received data block contains P codewords, the HNN will have to be applied P times in order to determine P ML codewords. However, the HNN MLSE decoder developed here is able to determine the most likely sequence of codewords using a single HNN. The HNN MLSE decoder is therefore applied once to a received data block containing any number of codewords.

After the HNN MLSE decoder has determined the sequence of most likely transmitted codewords, the codewords are demapped by calculating the Euclidean distance between each ML codeword and each codeword in $C_{2^{n}}$ for BPSK modulation, and each codeword in $C_{2^{n}}^{(i)} + j C_{2^{n}}^{(q)}$ for 4-QAM modulation. The indices(s) corresponding to the codeword(s) that have the lowest Euclidean distance/distances is/are converted to bits, which completes the decoding phase.

The derivation of the HNN MLSE decoder entails the calculation of the cross-correlation matrices X _i and X _q, and the input vectors I _i and I _q in (10). The HNN MLSE decoder is first derived for the decoding of a single codeword, after which it will be extended to enable the decoding of any number of codewords simultaneously. Derivations are performed for 4-QAM only, since the BPSK HNN MLSE decoder is a simplification of its 4-QAM counterpart.

Single codeword decoding

To enable the HNN to store a set of codewords, the average correlation between all pattern must be stored in the weights between the neurons. According to Hebb’s rule of auto-associative memory [22], the connection weight matrix, or correlation matrix, is calculated by taking the cross-correlation of the patterns to be stored. Since we are working with complex symbols, there are two weight matrices to be calculated. The cross-correlation matrices in (9) are calculated as

\begin{matrix} X_{i} & = Re {C^{T} C} \\ = C_{2^{n}}^{(i) T} C_{2^{n}}^{(i)} + C_{2^{n}}^{(q) T} C_{2^{n}}^{(q)} \end{matrix}

(32)

and

\begin{matrix} X_{q} & = Im {C^{T} C} \\ = C_{2^{n}}^{(q) T} C_{2^{n}}^{(i)} - C_{2^{n}}^{(i) T} C_{2^{n}}^{(q)}, \end{matrix}

(33)

where $C = C_{2^{n}}^{(i)} + j C_{2^{n}}^{(q)}$ , and $C_{2^{n}}^{(i)}$ and $C_{2^{n}}^{(q)}$ are the matrices containing the generated codewords as before, respectively, used for the in-phase and quadrature components of the codeword. Note the similarities between the correlation matrices in (32) and (33) and those in (18) and (20). Also, the two input vectors are simply the real and imaginary components of the noise-corrupted received codeword, such that

I_{i} = Re {c} + Re {n}

(34)

and

I_{q} = Im {c} + Im {n}

(35)

where c is of length 2ⁿ and n is a vector containing complex samples from the distribution $N (μ, σ^{2})$ , where μ in the range 1 = in the range 10 and σ is the noise standard deviation. After the ML codeword is detected, each detected codeword (of length 2ⁿ) can be mapped back to n bits for BPSK modulation and 2n bits for 4-QAM modulation.

Multiple codeword decoding

It was shown how the HNN can be used to decode single codewords, but the HNN decoder can be extended in order to detect ML transmitted sequences of codewords. This step is crucial in our quest of merging the HNN decoder with the HNN MLSE equalizer, since the HNN MLSE equalizer detects ML sequences of transmitted symbols. If the transmitted information is encoded, these sequences contain multiple codewords, and hence the HNN decoder must be extended to detect not only single codewords, but codeword sequences.

This extension is easily achieved by using the HNN parameters already derived in (32) through (35). Consider a system transmitting a sequence of P balanced codewords of length 2ⁿ, where n is the length of the uncoded bit-words. The new correlation matrix is constructed by copying X in (10) along the diagonal according to the number of transmitted codewords P, such that

X^{(P)} = [\begin{array}{l} X_{i} & X_{q}^{T} \\ X_{q} & X_{i} \\ X_{i} & X_{q}^{T} & \emptyset \\ X_{q} & X_{i} \\ ⋱ & ⋱ \\ \emptyset & ⋱ & ⋱ \\ X_{i} & X_{q}^{T} \\ X_{q} & X_{i} \end{array}],

(36)

where $X = [\begin{array}{l} X_{i} & X_{q}^{T} \\ X_{q} & X_{i} \end{array}]$ is repeated on the diagonal P times and ∅ implies that the rest of X ^(P) is empty, containing only 0’s.

Also the input vector I in (10), consisting of I _i and I _q, is also extended according to the number of transmitted codewords P such that

I^{(P)} = [\frac{I_{i}^{(P)}}{I_{q}^{(P)}}],

(37)

where

I_{i}^{(P)} = {[Re {c_{1}}, Re {c_{2}}, \dots, Re {c_{p}}]}^{T} + Re {n}

(38)

and

I_{q}^{(P)} = {[Im {c_{1}}, Im {c_{2}}, \dots, Im {c_{p}}]}^{T} + Im {n},

(39)

where c _p is the p th codeword of length 2ⁿ, where p = 1, 2, …, P, and n is of length 2ⁿ P and contains complex samples from the distribution $N (μ, σ^{2})$ , where μ = 0 and σ is the noise standard deviation.

The extended cross-correlation matrix and input vector in (36) and (37) can now be used to estimate the ML sequence of transmitted codewords, after which each detected codeword (of length 2ⁿ) can be mapped back to n bits for BPSK modulation and 2n bits for 4-QAM modulation.

HNN turbo equalizer

The HNN-TE is an amalgamation of the HNN MLSE equalizer and the HNN MLSE decoder, which were discussed in the previous sections. In this section it is explained how the HNN MLSE equalizer and the HNN MLSE decoder are combined in order to perform iterative joint equalization and decoding (turbo equalization) using a single HNN structure. The HNN-TE is able to jointly equalize and decode BPSK and 4-QAM coded modulated signals in systems with highly dispersive multipath channels, with extremely low computational complexity compared to traditional turbo equalizers which employ a MAP equalizer/decoder pair.

System model

Since we already have complete models for the HNN MLSE equalizer and decoder, the combination of the two is fairly straight-forward. In order to distinguish between equalizer and decoder parameters a number of redefinitions are in order. For the HNN MLSE equalizer the correlation matrix and input vector relating to (10), as derived in (22) and (27), are now X _E and I _E, respectively, and will henceforth be referred to as “equalizer correlation matrix” and “equalizer input vector”. Similarly the HNN MLSE decoder correlation matrix and input vector relating to (10), as derived in (36) and (37), are now X _D and I _D, respectively, and will henceforth be referred to as “decoder correlation matrix” and “decoder input vector”.

When a coded data block of length N _c is transmitted through a multipath channel, X _E and X _D are determined according to (22) and (36), where both matrices are of size N _c × N _c. Since the function of the equalizer and the decoder has to be merged, it makes sense to somehow combine X _E and X _D to enable the equalizer to perform decoding, or to enable the decoder to perform equalization. This combination is performed by first normalizing X _D with respect to X _E, because of varying energy in a multipath fading channel between received data blocks. X _D is therefore normalized with respect to X _E such that

X_{D}^{(norm)} = (\frac{∥ X_{E} ∥}{∥ X_{D} ∥}) X_{D} .

(40)

Next the new correlation matrix is determined as

X_{TE} = X_{E} + X_{D}^{(norm)} .

(41)

The rationale behind the addition of the equalizer correlation matrix and the normalized decoder correlation matrix is that the connection weights in the decoder correlation matrix should bias those of the equalizer correlation matrix. Since X _TE contains X _E offset by $X_{D}^{(norm)}$ , joint equalization and decoding is made possible.

The new input vector also needs to be calculated. I _D contains the noise-corrupted coded symbols, while I _E contains not only received coded symbol information, but also the ISI information. Note that when there is no multipath or fading (L = 1 and h ₀ = 1), I _E reduces to I _D. The new input vector used in the HNN-TE is therefore simply

I_{TE} = I_{E} .

(42)

With the new correlation matrix X _TE and input vector I _TE, the HNN-TE model is complete, and the iterative system in (11) can be used to jointly equalize and decode (turbo equalize) the transmitted coded information.

Transformation

Upon reception the received symbol vector has to be deinterleaved to restore the one-to-one relationship between each element in r and c with respect to the first coefficient h ₀ of the CIR h = {h ₀, h ₁, …, h _L-1}^T. Deinterleaving r transforms the transmission model in (1). Substituting (2) in (1) and applying the deinterleaver, which is simply the Hermitian transpose of the interleaver matrix J, gives

J^{H} r = J^{H} H H G^{H} s + J^{H} n,

(43)

which is equivalent to transmitting the coded symbol sequence c = G ^T s through a channel

Q = J^{H} H H .

(44)

Therefore (43) can be written as

J^{H} r = Q G^{H} s + J^{H} n .

(45)

Consequently the new channel matrix Q, rather than the conventional channel matrix H in (3), is used in the calculation of the equalizer correlation matrix X _E derived in (22). Due to the above transformation, Q does not contain the CIR H on the diagonal as in H. Rather, each column in Q (of length N _c) contains a unique random combination of all CIR coefficients (where the rest of the N _c - L elements in a column are equal to 0), dictated by the randomization effect exhibited in Q due to the random interleaver. This randomization effect results from first multiplying the channel H with the interleaving matrix J and then deinterleaving by multiplying the result with J ^T (see (44)). Deinterleaving places the first CIR coefficient (h ₀) on the diagonal of Q, restoring the one-to-one relationship between each element in r and each corresponding coded transmitted symbol in c.

To illustrate this concept, consider the three-dimensional representations of |H J| and |Q| in Figures 2a, b, 3a,b, 4a,b, and 5a,b, for a hypothetical system transmitting coded information through a multipath channel with CIR lengths of L = 1, L = 5, L = 10, and L = 20, respectively, with a block length N _c = 80. Figure 2a,b show |H J| and |Q| for channels of length L = 1, where Figure 2a is clearly interleaved. It is also clear that the new channel Q in Figure 2b is deinterleaved, since the first coefficient h ₀ of the CIR has been restored to the diagonal of Q. Figure 3a and 5a show the interleaved channels for L = 5, L = 10, and L = 20, where Figure 3b and 5b show the new channels Q, again with the first CIR coefficient h ₀ restored to the diagonal. Even though h ₀ is restored to the diagonal of Q, it is clear that the rest of the CIR coefficients h ₁, h ₂, …, h _L-1 are scattered throughout Q. As stated before, each column in Q contains a unique random combination of all CIR coefficients (with h ₀ on the diagonal for each column), dictated by the randomization effect exhibited in Q, where the rest of the N _c - L elements in each column are equal to 0.

Computational complexity analysis

The computational complexity of the HNN-TE is compared to that of the CTE by calculating the number of computations performed for each received data block, for a fixed set of system parameters. The number of computations are normalized by the coded data block length so as to factor out the effect of the length of the transmitted data block, which allows us to present the computational complexity in terms of the number of computations required per received coded symbol. The complexity of the HNN-TE is quadratically related to the coded data block length, so a change in N _c will still have an effect on the normalized computational complexity.

The computational complexity of the HNN-TE was calculated as

\begin{matrix} C C_{HNN - TE} = & 2 N_{c}^{2.376} + 8 (N_{c} + L - 1) + Z_{HNN - TE} ({(N_{c} M / 2)}^{2} \\ + (N_{c} M / 2)) + 4 N_{c} k^{2} + 2 {(Nc + L - 1)}^{2.376}, \end{matrix}

(46)

where N _c is the coded data block length, L is the CIR length, M is the modulation constellation alphabet size (2 for BPSK and 4 for 4-QAM), Z _HNN-TE is the number of iterations and k is the codeword length, which was chosen as k = 8 for a code rate of R _c = 3 / 8. The first term in (46) is associated with the calculation of X _i in (19) and X _q in (21). The second term is associated with the calculation of Λ in (28) and Ω in (29). The third term is for the iterative calculation of the ML coded symbols in (11) while the second to last term in (46) is for the trivial ML detection of codewords after joint iterative MLSE equalization and decoding. The last term is due to the transformation in (43) through (45). Note that in the first and last terms of (46) the exponent is 2.376. It has been shown in [23] that the complexity of multiplication of two N × N matrices can be reduced from O(N ³) to O(N ^2.376). However, due to the fact that cubic complexity matrix multiplication is still preferred in practical applications due to ease of implementation, (46) serves as a lower bound on the HNN-TE computational complexity.

Therefore, the computational complexity of the HNN-TE is approximately quadratic at best, or more realistically cubic in the coded data block length (N _c), quadratic in the modulation constellation alphabet size (M), quadratic in the codeword length k, and approximately independent of the channel memory length (L).

The complexity of the CTE was determined as

C C_{CTE} = Z_{CTE} (4 N_{c} LQ + 4 N_{c} k^{2}),

(47)

where Z _CTE is the number of iterations and Q is the number of equalizer states, determined by 2^L-1 for BPSK modulation and 4^L-1 for 4-QAM. The first term in (47) is associated with the equalizer while the second term is associated with MAP decoding. The computational complexity of the CTE is therefore linear in the coded data block length (N _c), exponential in the channel memory length (L) and quadratic in the codeword length (k).

Figure 6 and shows the normalized computational complexity of the HNN-TE and the CTE for coded data block lengths of N _c = 80, N _c = 160, N _c = 320, N _c = 640, N _c = 1280, and N _c = 2560, where Z _HNN-TE = 25 and Z _CTE = 5, for BPSK and 4-QAM modulation when O(N ^2.376) matrix multiplication complexity is considered. Figure 7 shows the same information as Figure 6, but with O(N ³) matrix multiplication complexity. It is clear that the computational complexity of the HNN-TE increases with an increase in coded data block length, but for realistic data block lengths the complexity of the HNN-TE is superior to that of the CTE for channels with long memory. The HNN-TE is computationally less complex for BSPK modulation than for 4-QAM, but only slightly so. On the other hand, the complexity of the CTE grows exponentially with and increase in modulation order. From Figure 6 it is clear that the complexity of the HNN-TE is almost quadratically related to the coded data block length and approximately independent of the channel memory length, which is more evident when L is increased. The normalized computational complexity of the HNN-TE and the CTE (for O(N ^2.376) and O(N ³) matrix multiplication complexity) for N _c = 1280 using BPSK and 4-QAM for extremely long channels is shown in Figure 8, where there is no comparison between the complexity of the HNN-TE and that of the CTE, for both BSPK and 4-QAM modulation.

Memory requirements analysis

The memory requirements of the HNN-TE and the CTE are closely related to their respective computational complexities due to the structures employed by these algorithms. Table 3 describes the memory requirements of the HNN-TE for each received data block. The total memory requirement for the HNN-TE is $2 N_{c}^{2} + 6 N_{c} + N_{c} + L - 1 + 2 {(N_{c} + L - 1)}^{2}$ where each variable is of type float, which uses 32 bits. The memory requirements of the CTE per data block is shows in Table 4. The total memory requirement of the CTE is N _c M ^L-1 + 4N _c + L. Figure 9 shows the memory requirement of the HNN-TE and the CTE in bytes (32 bits = 8 bytes) for coded data block sizes of N _c = 160, N _c = 640, and N _c = 2560 and CIR lengths increasing from L = 1 to L = 25. From Figure 9 it is clear that the memory requirement of the HNN-TE remains constant over all channel lengths and modulation alphabet sizes, with less than 1 MB of memory required for N _c = 160, 6.6 MB for N _c = 640 and 100 MB for N _c = 2560. The memory requirements of the CTE, however, grows exponentially with the channel memory length, since the size of the trellis structure used in the MAP equalizer grows according to the same measure. The break-even point between the BPSK CTE and the HNN-TE (for both BPSK and 4-QAM) is L = 10.40 for N _c = 160, L = 12.35 for N _c = 640 and L = 14.30 for N _c = 2560, beyond which the HNN-TE require less memory than the CTE. Also, the break-even point between the 4-QAM CTE and the HNN-TE is L = 5.68 for N _c = 160, L = 6.66 for N _c = 640 and L = 7.66 for N _c = 2560. The memory requirements of the HNN-TE are therefore more favorable when higher order modulation alphabets are employed.

Table 3 HNN-TE memory requirements

Full size table

Table 4 CTE memory requirements

Full size table

Simulation results

The proposed HNN-TE was evaluated in a mobile fading environment for BPSK and 4-QAM modulation at a code rate of R _c = n / k = 3 / 8. To simulated the fading effect of mobile channels, the Rayleigh fading simulator proposed in [24] was used to generate uncorrelated fading vectors. When imperfect channel state information (CSI) was assumed, least squares channel estimation was used using various amounts of training symbols in the transmitted data block. On the other hand, when perfect CSI was assumed, the CIR coefficients were “estimated” by taking the mean of the uncorrelated fading vectors. Simulations were performed for short and long channels at various mobile speeds. Simulations were also performed to compare the performance of the HNN-TE and a CTE in short mobile fading channels for BPSK modulation. For all simulations the uncoded data block length was N _u = 480 and the coded data block length was N _c = 1280. In all simulations the frequency was hopped four times during each data block in order to further reduce the BER. For the CTE the number of iterations were Z = 5, and instead of using a fixed number of iterations for the HNN-TE, we use the function $Z (E_{b} / N_{0}) = 2 (5^{(E_{b} / N_{0}) / 5})$ (which produces Z(E _b / N ₀) = {2, 4, 8, 10, 22, 55} for E _b/N ₀ = {0, 2.5, 5, 7.5, 10}) to determine the number of iterations to be used given E _b / N ₀.

Figure 10 show the performance of the HNN-TE and the CTE for channel lengths of L = 4, L = 6, and L = 8 at a fixed mobile speed of 20 km/h, assuming perfect CSI. The performance of the HNN-TE is slightly better than that of the CTE for high SNR levels.

Figure 11 shows the performance of the HNN-TE and the CTE for a channel of length L = 6 at mobile speeds of 3 km/h, 50 km/h, 80 km/h, 140 km/h, and 200 km/h, assuming perfect CSI. It is clear that the HNN-TE outperforms the CTE at mobile speeds greater than 20 km/h, with the advantage of performance increasing with an increase in mobile speeds. It seems that the HNN-TE is less affected by increasing mobile speeds, which suggests that the HNN-TE is able to perform well in fast-fading mobile environments.

Figure 12 shows the performance of the HNN-TE and the CTE for a channel of length L = 6 at a mobile speed of 20 km/h, assuming imperfect CSI. To estimate the channel training sequences of length 4L, 6L, 8L, and 10L were used. From Figure 12 it is clear that the HNN-TE is superior to the CTE at high SNR levels when perfect CSI is not available. The HNN-TE seems to be less sensitive to channel estimation errors.

It is clear from Figures 10, 11, and 12 that the performance of the HNN-TE is superior to that of a CTE in short channels at varying mobile speeds, for both perfect and imperfect CSI. The HNN-TE outperforms the CTE in short channels, but with higher computational complexity. Figure 6 shows that the HNN-TE is more computationally complex than the CTE for short channels (L<10), when the coded data block length is relatively small (N _u<1280). However, the complexity of the HNN-TE is vastly superior to that of the CTE for long channels. It might be argued that the HNN-TE will perform better than the CTE since more iterations are used, but that is not true. It is stated in [3] that the performance of the CTE cannot be improved significantly beyond Z = 3 iterations in Rayleigh fading channels, so the performance gain of the HNN-TE compared to the CTE is probably due to the fact that HNN-TE is able to process all the available information internally as a whole, without having to exchange information between the equalizer and the decoder, as is the case in a CTE.

Figure 13 shows the performance of the HNN-TE for channels of length L = 10, L = 20, L = 50, L = 100 at a fixed mobile speed of 20 km/h for BPSK and 4-QAM modulation, assuming perfect CSI. It is clear that the performance for BPSK modulation is better than the performance for 4-QAM, which is due to the fact that Gray coding cannot be applied in the encoding process described in Section 4.2.2. The performance loss is therefore warranted.

Figure 14 shows the performance of the HNN-TE for a channel of length L = 50 at mobile speeds of 20 km/h, 80 km/h, 140 km/h, and 200 km/h for BPSK and 4-QAM modulation, assuming perfect CSI. It is clear that an increase in mobile speed leads to a performance degradation, although not as much as expected. Again BPSK modulation performs better than 4-QAM modulation.

Figure 15 shows the performance of the HNN-TE for a channel of length L = 50 at a mobile speed of 20 km/h for BPSK and 4-QAM modulation, assuming imperfect CSI. To estimate the channel, training sequences of length 4L, 6L, 8L, and 10L were used. As expected, a performance loss is incurred with a decrease in the number of training symbols. Again BPSK modulation outperforms 4-QAM modulation.

Figure 16 shows the performance of the HNN-TE for a channel of length L = 25 at a mobile speed of 20 km/h for BPSK and 4-QAM modulation, assuming perfect CSI, for different numbers of iterations. The number of iterations were chosen to be Z = 5, Z = 10, Z = 20, and Z = 50. The BER performance increases with an increase in the number of iterations. Since the performance degradation due to a decrease in the number of iterations is low at low signal levels, we adopt an iteration schedule that is dependent on the signal level. As stated before, we use the following function to determine the number of iterations: $Z (E_{b} / N_{0}) = 2 (5^{(E_{b} / N_{0}) / 5})$ .

Figure 17 shows the performance of the HNN-TE for a channel of length L = 50 at a mobile speed of 20 km/h for BPSK and 4-QAM modulation, assuming perfect CSI, for different code rates. The code rates were R _c = 1 / 2 (2 / 4), R _c = 3 / 8, R _c = 1 / 4 (4/16), and R _c = 5 / 32. From Figure 17 it is clear that the performance of the HNN-TE increases with a decrease in the code rate, with 4-QAM modulation performing worse than BPSK modulation.

From Figures 13, 14, 15, 16 and 17 it is clear that the HNN-TE is able to jointly equalize and decode BPSK and 4-QAM modulated signals, transmitted trough extremely long mobile fading channels. While the data rate using 4-QAM modulation is twice that using BPSK modulation, the performance is worse for 4-QAM modulation, due to the fact that Gray coding cannot be applied during coded modulation.

Conclusion

In this article, a low complexity turbo equalizer was developed which is able to jointly equalize and decode BPSK and 4-QAM coded-modulated signals in systems transmitting interleaved information through a multipath fading channels. It uses the Hopfield neural network as framework and hence it was fittingly named the Hopfield Neural Network Turbo Equalizer, or HNN-TE. The HNN-TE is able to turbo equalize coded modulated BPSK and 4-QAM signals in short as well as long multipath channels, slightly outperforming the CTE for short channels, although at higher computational cost. However, the HNN-TE computational complexity in long channels is vastly superior to that of CTE. The computational complexity of the HNN-TE is almost quadratically related to the coded data block length, while being approximately independent of the CIR length. This enables it to turbo equalize signals in systems with multiple hundreds of multipath elements. It was also demonstrated that the HNN-TE is less susceptible than the CTE to channel estimation errors, and it also outperforms the CTE in fast fading channels. The performance of the HNN-TE for BPSK modulation is better than for 4-QAM modulation, since Gray coding cannot be employed due to the coded modulation explained in this paper, while the complexity for 4-QAM is slightly higher.

References

Berrou C, Glavieux A, Thitimajshima P: Near Shannon limit error-correction and decoding: Turbo-Codes. Int. Conf. Commun 1993, 1064-1070.
Google Scholar
Douillard C, Jezequel M, Berrou C, Picart A, Didier P, Glavieux A: Iterative correction of intersymbol intereference: turbo-equalization. Europ. Trans. Telecommun 1995, 6: 507-511. 10.1002/ett.4460060506
Article Google Scholar
Bauch G, Khorram H, Hagenauer J: Iterative equalization and decoding in mobile communication systems. Proceedings of European Personal Mobile Communications Conference (EPMCC) 1997, 307-312.
Google Scholar
Koetter R, Tuchler M, Singer AC: Turbo equalization. IEEE Signal Process. Mag 2004, 21(1):67-80. 10.1109/MSP.2004.1267050
Article Google Scholar
Koetter R, Tuchler M, Singer AC: Turbo equalization: principles and new results. IEEE Trans. Commun 2002, 50(5):754-767. 10.1109/TCOMM.2002.1006557
Article Google Scholar
Lopes RR, Barry JR: The soft feedback equalizer for turbo equalization of highly dispersive channels. IEEE Trans. Commun 2006, 54(5):783-788.
Article Google Scholar
Dual-Hallen A, Hegaard C: Delayed decision feedback sequence estimation. IEEE Trans. Commun 1989, 37(5):428-436. 10.1109/26.24594
Article Google Scholar
Eyuboglu MV, Qureshi SU: Reduced-state sequence estimation with set partitioning and decision feedback. IEEE Trans. Commun 1988, 36(1):13-20. 10.1109/26.2724
Article Google Scholar
Wu J, Leong S, Lee K, Xiao C, Olivier JC: Improved BDFE using a priori information for turbo equalization. IEEE Trans. Wirel. Commun 2008, 7(1):233-240.
Article Google Scholar
Lou H, Xiao C: Soft-decision feedback turbo equalization for multilevel modulations. IEEE Trans. Signal Process 2011, 59(1):186-195.
Article MathSciNet Google Scholar
Fijalkow I, Pirez D, Roumy A, Ronger S, Vila P: Improved interference cancellation for turbo-equalization. Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing 2000, 416-419.
Google Scholar
Wang X, Poor HV: Iterative (turbo) soft interference cancellation and decoding for coded CDMA. IEEE Trans. Commun 1999, 47(7):1046-1061. 10.1109/26.774855
Article Google Scholar
Ampeliotis D, Berberidis K: Low complexity turbo equalization for high data rate. EURASIP J. Commun. Network 2006, 2006(ID 25686):1-12.
Article Google Scholar
Myburgh HC, Olivier JC: Reduced complexity turbo equalization using a dynamic Bayesian network. EURASIP J. Adv. Signal Process 2012. (Submitted for Publication)
Google Scholar
Hopfield JJ, Tank DW: Neural computations of decisions in optimization problems. Biol. Cybern 1985, 52: 1-25. 10.1007/BF00336930
Article MathSciNet Google Scholar
Myburgh HC, Olivier JC: Low complexity MLSE equalization in highly dispersive Rayleigh fading channels. EURASIP J. Adv. Signal Process 2010., 2010(ID 874874): http://asp.eurasipjournals.com/content/2010/1/874874
Google Scholar
Wiberg N: A class of Hopfield decodable codes. Proceedings of the IEEE-SP Workshop on Neural Networks for Signal Processing 1993, 88-97.
Google Scholar
Wang Q, Bhargava VK: An error correcting neural network. IEEE Pacific Rim Conference on Communications, Computers and Signal Processing 1989, 530-533.
Google Scholar
Knuth D: Efficient balanced codes. IEEE Trans. Inf. Theory 1986, IT-32(1):530-533.
MathSciNet Google Scholar
Proakis JG: Digital Communications. New York: McGraw-Hill, International Edition; 2001.
Google Scholar
Hopfield JJ: Artificial neural networks. IEEE Circ. Dev. Mag 1988, 4(5):3-10.
Article Google Scholar
Hebb DO: The Organization of Behavior. New York: Wiley; 1949.
Google Scholar
Winograd S, Coppersmith D: Matrix multiplication via arithmetic progressions. J. Symbolic Comput 1990, 9(3):251-280. 10.1016/S0747-7171(08)80013-2
Article MATH MathSciNet Google Scholar
Zheng YR, Xiao C: Improved models for the generation of multiple uncorrelated Rayleigh fading waveforms. IEEE Commun. Lett 2002, 6: 256-258.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Electrical, Electronic and Computer Engineering, University of Pretoria, Pretoria, 0002, South Africa
Hermanus C Myburgh
School of Engineering, University of Tasmania, Hobart, 7001, Australia
Jan C Olivier

Authors

Hermanus C Myburgh
View author publications
You can also search for this author in PubMed Google Scholar
Jan C Olivier
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hermanus C Myburgh.

Additional information

Competing interests

Both authors declare that they have no competing interests.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Authors’ original file for figure 7

Authors’ original file for figure 8

Authors’ original file for figure 9

Authors’ original file for figure 10

Authors’ original file for figure 11

Authors’ original file for figure 12

Authors’ original file for figure 13

Authors’ original file for figure 14

Authors’ original file for figure 15

Authors’ original file for figure 16

Authors’ original file for figure 17

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Myburgh, H.C., Olivier, J.C. A low complexity Hopfield neural network turbo equalizer. EURASIP J. Adv. Signal Process. 2013, 15 (2013). https://doi.org/10.1186/1687-6180-2013-15

Download citation

Received: 11 June 2012
Accepted: 15 January 2013
Published: 08 February 2013
DOI: https://doi.org/10.1186/1687-6180-2013-15

A low complexity Hopfield neural network turbo equalizer

Abstract

Abstract

Introduction

Turbo equalization

The Hopfield neural network

Energy function

Iterative system

The Hopfield neural network turbo equalizer

HNN MLSE equalizer

HNN MLSE decoder

Codeword selection

Encoding

BPSK encoding

4-QAM encoding

Decoder

Single codeword decoding

Multiple codeword decoding

HNN turbo equalizer

System model

Transformation

Computational complexity analysis

Memory requirements analysis

Simulation results

Conclusion

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ original submitted files for images

Rights and permissions

About this article

Cite this article

Share this article

Keywords