Skip to main content

Doubly selective channel estimation for amplify-and-forward relay networks

Abstract

In this article, doubly selective channel estimation is considered for 1amplify-and-forward-based relay networks. The complex exponential basis expansion model is chosen to describe the time-varying channel, from which the infinite channel parameters are mapped onto finite ones. Since direct estimation of these coefficients encounters high computational complexity and large spectral cost, we develop an efficient estimator that only targets at useful channel parameters that could guarantee the later data detection. The training sequence design that can minimize the channel estimation mean-square error is also proposed. Finally, numerical results are provided to corroborate the study.

1 Introduction

Wireless relay networks have been a highly active research field ever since the pioneer work [13]. A typical relay network consists of a source node, one or several relay nodes, and a destination node. The transmission from the source node to the destination node involves two phases. In the first phase, the source node broadcasts signals to the relay nodes and possibly to the destination node. In the second phase, the relay nodes re-transmit its received signals in the first phase to the destination node under a certain relaying strategy such as amplify-and-forward (AF) and decode-and-forward (DF) [2]. It has been shown that such a relay network can enhance the system throughput [1], improve the transmission coverage [2], and increase the multiplexing gain [3]. Several standards that have been or are being specified for the next-generation mobile broadband communication systems [4] already included the relay-aided transmission, e.g., long-term evolution-advanced (LTE-A), IEEE 802.16j, and IEEE 802.16m.

Like any other wireless communication system, a relay network performs better with better channel estimates, and the quality of channel acquisition has a significant effect on the overall system performance. In addition, knowledge of channel state information is often a prerequisite for some physical layer approaches such as optimal strategy selection and the precoding design.

Assuming block fading scenarios, several channel estimation schemes were proposed for relay network with one or multiple-relay nodes. For example, the authors of [5, 6] studied the channel estimation for relay networks and pointed out that there exist many differences in channel estimation between the AF-based relay networks and the traditional point-to-point networks. Shortly later, channel estimations under frequency-selective environment were developed in [7, 8].

However, in many practical cases the source node, the relay node, and the destination node can be mobile. The relative motion between any two nodes will cause Doppler shift and thus make the channel time-varying [9]. Time-varying channel estimations for relay networks are studied in [10, 11]. It is shown in [10] that for time-varying channels in relay networks, transmission of several subblocks can obtain better estimation performance than that of a single subblock.

Furthermore, when the transmission data rates are high and nodes are mobile, the relay network is expected to operate over doubly selective channels. Canonne-Velasquez et al. [12] suggest a channel estimation algorithm for orthogonal frequency division multiplexing-based AF relay systems over doubly selective environments. However, the proposed channel estimation algorithm neglects the interference from the data symbols. Moreover, the optimal training sequence design for doubly selective AF relay channels remains an open problem. To the best of the authors’ knowledge, estimation techniques considering inter symbol interference between data and training symbols, as well as training sequence design, have not yet been developed. This motivates our current work.

The doubly selective channel can typically be represented in two ways: the autoregressive (AR) process [13] or the basis expansion model (BEM) [14]. AR models describe channel variation through a symbol-by-symbol update manner. Though second- and third-order AR models can provide excellent fits to the Jake model, the first-order AR process is usually adopted [15] due to its tractability. On the other hand, BEM describes the doubly selective channel as the superpositions of time-varying basis functions weighted by time-invariant coefficients. The candidate basis functions include complex exponential (Fourier) functions [14, 16], polynomials [17], wavelet [18], discrete prolate spheroidal sequences [19, 20], etc. BEMs can well describe the time variations of channel, and thus achieve better approximation performance than symbol-wise AR models [21].

In this article, we focus on complex exponential BEM (CE-BEM) [16] due to its popularity and clear physical meaning. The data frame structure is designed to adapt to the transmission in doubly selective channels and to facilitate both channel estimation and data detection. We first develop an estimator that targets the combined channel parameters and then propose a detection algorithm. The training sequence that can minimize the channel estimation mean-square error (MSE) is also found.

The rest of this article is organized as follows. Section 2. presents the relay system model over doubly selective channels. Section 3. discusses the channel statistics and CE-BEM approximation accuracy. Section 4. develops the channel estimator and data detector, and suggests the optimal training sequence design. Simulation results are provided in Section 5. Finally, conclusions are drawn in Section 6.

Notations: Vectors and matrices are given in boldface letters; the transpose, Hermitian, and inverse of A are AT, AH, and A−1, respectively; diag{a} is the diagonal matrix formed by a, tr(A) is the trace of the square matrix A, 0 L is the L × L matrix with all entries 0, and I L is the L×L identity matrix.

2 System model

Consider an AF relay network with one source node S , one relay node R , and one destination node D (Figure 1). Let h(i;l) denote the doubly selective channel between the source node S and the relay node R , g(i;l) denote the doubly selective channel between the relay node R and the destination node D .a Without loss of generality, we assume that the channel length of both h(i;l) and g(i:l) as L + 1, and each tap is modeled as a zero mean complex Gaussian random process with power σ h , l 2 (or σ g , l 2 ).

Figure 1
figure 1

System model for AF relay network over doubly selective channel.

We propose a new transmission scheme as shown in Figure 2. Each transmission block that containsN symbols is divided into P subblocks. Assume the kth subblock contains Nk symbols of which N s k symbols are data and are represented by s k , while N b k symbols are pilots and are represented by b k . The total number of data symbols is N s = k = 1 P N s k and the total number of pilots is N p = k = 1 P N b k . With such a structure, we can represent the whole block as a vector

x = [ s 1 T , b 1 T , , s P T , b P T ] T .
(1)
Figure 2
figure 2

Structure of one transmission block.

During the first phase, the relay node R receives

r ( i ) = l = 0 L h ( i ; l ) x ( i l ) + w 1 ( i ) ,
(2)

where w(i) is the additive complex white Gaussian noise (ACWGN) with mean zero and variance σ w 1 2 , i.e., w 1 C N ( 0 , σ w 1 2 ) . During the second phase, the relay node R amplifies r(i) with a constant factor α and then re-transmit it to the destination node D . The signal obtained by D is

y ( i ) = l = 0 L g ( i ; l ) αr ( i l ) + w 2 ( i ) = α l = 0 L g ( i ; l ) l = 0 L h ( i ; l ) x ( i l ) + α l = 0 L g ( i ; l ) w 1 ( i l ) + w 2 ( i ) w ( i ) ,
(3)

where w1(i) is the ACWGN with mean zero and variance σ w 1 2 , i.e., w 1 ( i ) C ( 0 , σ w 1 2 ) and w(i) is defined as the combined noise.

Remark 1

Suppose the average power of the source node is P 1 , i.e., E { | x i ( n ) | 2 } = P 1 and the average power of the relay node is P r . Then the amplifier factor α can be chosen as

α = P r P 1 l = 0 L σ h , l 2 + σ w 1 2 .
(4)

3 Doubly selective channel in relay networks

3.1 Relay channel statistics and CE-BEM

It was shown in [5, 22] that for relay networks, the channel statistics depend on the mobility of the three nodes. Denote fds, fdd, and fdr as the maximum Doppler shifts due to the motion of S , D , and R , respectively. The discrete autocorrelation functions for the lth tap of h(i;l) can be represented as [22, 23]

R h , l ( m ) = σ h , l 2 E ( h ( n + m ; l ) h ( n ; l ) ) = σ h , l 2 J 0 ( 2 Π f ds m T s ) J 0 ( 2 Π f dr m T s ) ,
(5)
R g , l ( m ) = σ g , l 2 E ( g ( n + m ; l ) g ( n ; l ) ) = σ g , l 2 J 0 ( 2 Π f dr m T s ) J 0 ( 2 Π f dd m T s ) ,
(6)

where J0(·) is the zeroth-order Bessel function of the first kind, and Tsis the symbol sampling duration. If one node is fixed, i.e., the corresponding Doppler shift becomes zero, then (5) and (6) reduce to the well-known Jakes model [9].

In fact, (5) and (6) reveal that the power spectra of h(i;l) and g(i;l) span over the bandwidth f d 1 = f ds + f dr and f d 2 = f dr + f dd , respectively. According to the analysis of CE-BEM in [14, 16], we can express the doubly selective channel as

h ( i ; l ) = q = 0 Q 1 h q ( l ) e j 2 Π ( q Q 1 / 2 ) i / N ,
(7)
g ( i ; l ) = q = 0 Q 2 g q ( l ) e j 2 Π ( q Q 2 / 2 ) i / N ,
(8)

where 0≤i≤N−1, 0≤l≤L, Q m ( m = 1 , 2 ) 2 f d m N T s is the number of basis. The CE-BEM coefficients hq(l) and gq(l) are assumed as zero-mean, complex Gaussian random variables with variance σ h , q , l 2 and σ g , q , l 2 , respectively [14, 16, 24].

To simplify the notation as well as the following discussion, we assume f d 1 = f d 2 = f d and Q1=Q2=Q. We further denote wq=2Π(q−Q/2)/N and define

h q = [ h q ( 0 ) , h q ( 1 ) , , h q ( L ) ] T ,
(9)
g q = [ g q ( 0 ) , g q ( 1 ) , , g q ( L ) ] T , q [ 0 , Q ] .
(10)

3.2 CE-BEM approximation accuracy

Currently, the approximation accuracy about CE-BEM to time-varying channel is only shown through simulations with the merit of MSE [16]. Here, we take one step further by deriving the theoretical MSE of the CE-BEM approximation.

Without loss of generality, let us consider the 1st tap of the channel h(i;1). Define h ̀ l = 1 = [ h ( 0 ; 1 ) , , h ( N 1 ; 1 ) ] T and h ̄ l = [ h 0 ( l ) , , h Q ( l ) ] T . From (7) we know that the approximation error is

e 1 = A h ̄ l h ̀ l = 1 ,
(11)

where

A = 1 1 1 e j w 0 e j w 1 e j w Q e j ( N 1 ) w 0 e j ( N 1 ) w 1 e j ( N 1 ) w Q .
(12)

Assume the singular value decomposition of the matrix A is A = U 0 Σ 0 V 0 H where U0 is an N×N unitary matrix, V0 is a (Q + 1)×(Q + 1) unitary matrix, and Σ0 is an N×(Q + 1) matrix with the (Q + 1) diagonal entries as a constant c0 and other entries as zero.b Let uk denotes the kth column vector of the matrix U0.

Lemma 1

The MSE of the approximation error is

V = 1 N E ( e 1 H e 1 ) = N Q 1 N k = Q + 2 N u k H R l u k ,
(13)

where Rl is the correlation function of h ̀ l = 1 .

Proof

See Appendix Appendix 1.

Lemma 1 gives the theoretical MSE of CE-BEM approximation to time-varying channels, which enables us to obtain the approximation accuracy without simulations.

A brief example about theoretical MSE of CE-BEM approximation to time-varying channels is shown in Figure 3, where the system parameters are taken as fds=fdr=50 Hz, fd1=100 Hz, Ts=100 μs, and N=200. For comparison, simulation approximation MSE is also given by averaging 100 trials. It shows that the theoretical MSE (13) agrees with the simulation MSE. It also reveals that the better approximation comes with larger Q, which indicates that CE-BEM is a good choice for the doubly selective channel.

Figure 3
figure 3

Theoretical and simulation MSE of CE-BEM approximation to the time-varying channel h(i;1).

3.3 Problem formulation

Next we apply CE-BEM (7) and (8) into (4) for channel estimation and data detection. Our tasks are (i) estimate the parameters h q and g q so that the channel h(i;l) and g(i;l) can be recovered for each time index i[0,N−1], or estimate the equivalent channel parameters that still enable the successful data detection as did in [6, 25]; (ii) find the optimal training sequence that can minimize the channel estimation error; (iii) recover the data s k k [1,P] from the estimated channel.

4 Channel estimation

Let us construct N×1 vectors r, y, and construct N×N matrices H, G from g(i;l) in the following way:

r = [ r ( 0 ) , r ( 1 ) , , r ( N 1 ) ] T ,
(14)
y = [ y ( 0 ) , y ( 1 ) , , y ( N 1 ) ] T ,
(15)
H i , j = h ( i ; i j ) , G i , j = g ( i ; i j ) ,
(16)

for i,j=1,2,…,N. We can write (2) and (4) as

r=Hx+ w 1 ,
(17)
y=αGr+ w 2 =αGHx+w,
(18)

where w i = [ w i ( 0 ) , w i ( 1 ) , , w i ( N 1 ) ] T , i = 1 , 2 and w=[w(0),w(1),…,w(N−1)]T.

4.1 Channel partition

Following the channel partition method in [24], we can split the channel matrix H into three matrices, namely, H s , H b , and H b ̄ , which are shown in Figure 4. Similarly, the channel H k , the k th (1≤kP) part of H corresponding to the k th sub-block input of s k b k , can also be partitioned into three matrices H k s , H k b , and H k b ̄ (Figure 5). After the separation of these channels, we derive two input–output relationships at the relay node

r s = H s s+ H b ̄ b ̄ + w 1 s ,
(19)
r b = H b b+ w 1 b ,
(20)
Figure 4
figure 4

Partition of the matrix H into H s , H b , and H b ̄ that are shown in dashed line on the right side of the figure.

Figure 5
figure 5

Partition of the matrix H k .

where r s = [ ( r 1 s ) T , , ( r P s ) T ] T , r b = [ ( r 1 s ) T , , ( r P s ) T ] T , b ̄ contains the first L and the last L entries of b k for all 1≤kP, while w 1 s and w 1 b denote the corresponding noise vectors.

Repeat the partition process for the channel G and G k . That is, split G into G s , G b ̄ , and G b , while split G k , the k th component of G, into G k s , G k b ̄ , and G k b . We obtain two input–output relationships at the destination node

y s =α G s r s +α G b ̄ r b ̄ + w 2 s ,
(21)
y b =α G b r b + w 2 b ,
(22)

where y s = [ ( y 1 s ) T , , ( y P s ) T ] T , y b = [ ( y 1 b ) T , , ( y P b ) T ] T , r b ̄ contains the first L and the last L entries of r k b for all 1≤KP, w 2 s , and w 2 b denote the corresponding noise vectors.

Combining (20) and (22) yields

y b = α G b H b b + α G b w 1 b + w 2 b w b
(23)

where w b is defined as the corresponding item.

It can readily be checked that (23) is equivalent to

y b = y 1 b y P b = α G 1 b H 1 b b 1 α G P b H P b b P + w b .
(24)

Note that in (24) H k b is an ( N b k L ) × N b k matrix and G k b is an ( N b k 2 L ) × ( N b k L ) matrix. To perform channel estimation, the individual channel matrix G k b should be a valid matrix. Thus, we require N b k 2 L > 0 , i.e., the training length for the k th subblock should be N b k 2 L + 1 .

4.2 Estimation algorithm

Let us define Λ M ( w q ) = diag { 1 , e j w q , , e j w q ( M 1 ) } . For any (L + 1)× 1 vector a = [ a 0 , a 1 , , a L ] T , define an M×(M + L) Toeplitz matrix as

T M + L ( a ) = a L a 0 0 0 a L a 0 M + L columns .
(25)

We provide the following two lemmas.

Lemma 2

T M + L ( a ) Λ M + L ( w q ) = Λ M ( w q ) T M + L ( μ a ) ,
(26)

where μ a = [ a 0 e j w q L , a 1 e j w q ( L 1 ) , , a L ] .

Proof

Proved from straight calculations due to the special structures of Λ M ( w q ) and T M + L ( μ a ) . □

Lemma 3

For two vectors a i = [ a i , 0 , a i , 1 , , a i , L ] T , i = 1 , 2 , there is

T M + L ( a 1 ) T M + 2 L ( a 2 ) = T M + 2 L ( a 1 a 2 ) ,
(27)

where denotes linear convolution.

Proof

Note that both T M + L ( a 1 ) and T M + 2 L ( a 2 ) are circulant matrix. Proved from straight calculations. □

According to these definitions and (7), it can readily be checked that

H = q = 0 Q Λ N ( w q ) Φ q ,
(28)

where Φ q is a lower triangular Toeplitz matrix with the first column [h q (0),…,h q (L),0,…,0]T. Furthermore, noticing that h ( i + n ; l ) = q = 0 Q h q ( l ) e j w q n e j w q i , we can find

H k b = q = 0 Q e j w q ( N s k + L + i = 1 k 1 N i ) Λ N b k L ( w q ) T N b k ( h q ) ,
(29)

where T N b k ( h q ) is ( N b k L ) × N b k Toeplitz matrix as defined in (25).

Similarly, based on (8) and g ( i + n ; l ) = q = 0 Q g q ( l ) e j w q n e j w q i we can obtain

G = q = 0 Q Λ N ( w q ) Ω q ,
(30)
G k b = q = 0 Q e j w q ( N s k + 2 L + i = 1 k 1 N i ) Λ N b k 2 L ( w q ) T N b k L ( g q ) ,
(31)

where Ω q is a lower triangular Toeplitz matrix with the first column [g q (0),…,g q (L),0,…,0]T, and T N b k L ( g q ) is an ( N b k 2 L ) × ( N b k L ) Toeplitz matrix as defined in (25).

Combining (29) and (31) gives

G k b H k b = m = 0 Q e j w m ( N s k + 2 L + i = 1 k 1 N i ) Λ N b k 2 L ( w m ) T N b k L ( g m ) × n = 0 Q e j w n ( N s k + L + i = 1 k 1 N i ) Λ N b k L ( w n ) T N b k ( h n ) = m = 0 Q n = 0 Q θ m , n , k Λ N b k 2 L ( w m ) T N b k L ( g m ) Λ N b k L ( w n ) T N b k ( h n ) Ξ m , n , k ,
(32)

where

θ m , n , k = e j w m ( N s k + 2 L + i = 1 k 1 N i ) + j w n ( N s k + L + i = 1 k 1 N i ) ,
(33)

and Ξm,n,k is defined as the corresponding item. Using Lemmas 2 and 3, Ξm,n,kcan be simplified as

Ξ m , n , k = Λ N b k 2 L ( w m ) Λ N b k 2 L ( w n ) T N b k L ( μ g m ) T N b k ( h n ) = Λ N b k 2 L ( w m + w n ) T N b k ( λ m , n ) ,
(34)

where

μ g m = [ g m ( 0 ) e j w n L , g m ( 1 ) e j w n ( L 1 ) , , g m ( L ) ] T ,
(35)
λ m , n = μ g m h n .
(36)

Since T N b k ( λ m , n ) is a Toeplitz matrix, we obtain

G k b H k b b k = m = 0 Q n = 0 Q θ m , n , k Λ N b k 2 L ( w m + w n ) T N b k ( λ m , n ) b k = m = 0 Q n = 0 Q θ m , n , k Λ N b k 2 L ( w m + w n ) B N b k ( b k ) λ m , n ,
(37)

where B N b k ( b k ) is defined as

B N b k ( b k ) = b k ( 2 L ) , , b k ( 0 ) b k ( 2 L + 1 ) , , b k ( 1 ) b k ( N b k 1 ) , , b k ( N b k 2 L 1 ) .
(38)

Unfortunately, it remains challenging to estimate λm,nfrom (37). The number of unknown elements for all λm,n(m,n[0,Q]) is (Q + 1)2(2L + 1). A direct way to estimate all λm,nrequires N b to be no less than 2 PL + ( Q + 1 ) 2 ( 2 L + 1 ) , which is too large and the transmission efficiency will be reduced. To solve this problem, we choose to estimate other type of channel information that requires smaller training length but at the same time guarantees the data detection.

Let us introduce two variables ζq,kand ϖ q as

ϖ q = w m + w n = 2 Π ( m + n Q ) / N = 2 Π ( q Q ) / N , m , n [ 0 , Q ] , q [ 0 , 2 Q ] ,
(39)
ζ q , k = e j ϖ q ( N s k + 2 L + i = 1 k 1 N i ) ,k[1,P].
(40)

It can readily be checked that θ m , n , k = ζ m + n , k e j w n L . Then we can combine those items that satisfy m + n=q in (37) and obtain

G k b H k b b k = q = 0 2 Q ζ q , k Λ N b k 2 L ( ϖ q ) B N b k ( b k ) η q ,
(41)

where

η q = m + n = q e j w n L λ m , n .
(42)

Define

η = [ η 0 T , η 1 T , , η 2 Q T ] T ,
(43)

as the parameters to be estimated. Substituting (41) into (24) provides a simple model

y b = α Ψ b η + w b ,
(44)

where Ψ b is defined as

Ψ b = ζ 0 , 1 Λ N b 1 2 L ( ϖ 0 ) B N b 1 ( b 1 ) , , ζ 2 Q , 1 Λ N b 1 2 L ( ϖ 2 Q ) B N b 1 ( b 1 ) ζ 0 , P Λ N b P 2 L ( ϖ 0 ) B N b P ( b P ) , , ζ 2 Q , P Λ N b P 2 L ( ϖ 2 Q ) B N b P ( b P ) .
(45)

Thus, instead of estimating the coefficients h q and g q , we could estimate another parameter η from

η ̂ = 1 α Ψ b H Ψ b 1 Ψ b H y b .
(46)

Moreover, η ̂ q can directly be obtained from η ̂ for each q[0,2Q].

Remark 2

Since η is a vector with (2Q + 1)(2L + 1) entries, N b should be at least (2Q + 1)(2L + 1) + 2PL.

4.3 Data detection

Substituting (19) into (21) yields

y s = α G s H s s + α G s H b ̄ b ̄ + α G s w 1 s + α G b ̄ r b ̄ + w 2 s .
(47)

Note that r b ̄ in (47) can be further decomposed as

r b ̄ = H b ̆ b ̆ + w 1 b ̄ ,
(48)

where H b ̆ contains the first L and the last L rows of every H k b , b ̆ contains the first 2L and the last 2L entries of every b k , 1≤KP and w 1 b ̄ denotes the corresponding noise. Then (47) can be written as

y s = α G s H s s + α G s H b ̄ b ̄ + α G b ̄ H b ̆ b ̆ + w s ,
(49)

where the combined noise vector w s = α G s w 1 s + α G b ̄ w 1 b ̄ + w 2 s .

Lemma 4

Among all training choices that lead to identical covariance matrix of the channel estimation error, if the training length N b k is greater than 4L + 1 and if the training has the first 2L and the last 2L entries equal to zero, then the interference to the data detection is minimized.

Proof

See Appendix Appendix 2. □

Following Lemma 4, we can simplify (49) as

y s = α G s H s s + w s ,
(50)

which is equivalent to

y s = y 1 s y P s = α G 1 s H 1 s s 1 α G P s H 1 s s P + w s .
(51)

Define U M ( h q ) as a Toeplitz matrix generated by the vector h q in the following way:

U M ( h q ) = h q ( 0 ) , , 0 h q ( L ) , , h q ( 0 ) 0 h q ( L ) M columns .
(52)

We have the following lemmas.

Lemma 5

U M ( h q ) Λ M ( w q ) = e j w q L Λ M + L ( w q ) U M ( μ h q ) ,
(53)

where μ h q = [ h q ( 0 ) e j w q L , h q ( 1 ) e j w q ( L 1 ) , , h q ( L ) ] T .

Proof

Proved from straight calculation. □

Lemma 6

U M + L ( g q ) U M ( h q ) = U M ( g q h q )
(54)

Proof

Proved from straight calculation. □

According to (7) and (8), we obtain

G k s = q = 0 Q e j w q i = 1 k 1 N i Λ N s k + 2 L ( w q ) U N s k + L ( g q ) ,
(55)
H k s = q = 0 Q e j w q i = 1 k 1 N i Λ N s k + L ( w q ) U N s k ( h q ) .
(56)

Next it can be verified that

G k s H k s = m = 0 Q n = 0 Q ϕ m , n , k Λ N s k + 2 L ( w m ) U N s k + L ( g m ) Λ N s k + L ( w n ) U N s k ( h n )
(57)

where ϕ m , n , k = e j ( w m + w n ) i = 1 k 1 N i . Using Lemmas 5 and 6, it can be derived that

U N s k + L ( g m ) Λ N s k + L ( w n ) U N s k ( h n ) = e j w n L Λ N s k + 2 L ( w n ) U N s k + L ( μ g m ) U N s k ( h n ) = e j w n L Λ N s k + 2 L ( w n ) U N s k ( λ m , n ) ,
(58)

where μ g m and λm,n are defined in (35) and (36), respectively. Substituting (58) into (57), we can obtain

G k s H k s = m = 0 Q n = 0 Q ϕ m , n , k e j w n L Λ N s k + 2 L ( w m + w n ) U N s k ( λ m , n ) = q = 0 2 Q e j ϖ q i = 1 k 1 N i Λ N s k + 2 L ( ϖ q ) U N s k ( η q ) .
(59)

Clearly, given the estimates of η q , G k s H k s can be reconstructed from (59). Hence, the data s k can be detected with the reconstructed channel information G k s H k s .

4.4 Training sequence design

The estimation error of η can be expressed as

e = η ̂ η = Ψ b H Ψ b 1 Ψ b H w b .
(60)

The correlation matrix of w b is then computed from (31) as

R w b =E w b w b H = σ w 2 2 q = 0 Q l = 0 L | g q ( l ) | 2 + σ w 1 2 I N b 2 PL .
(61)

Thus, the MSE of e is

σ e 2 = tr E ( e e H ) = C e tr Ψ b H Ψ b 1
(62)

where C e = σ w 2 2 q = 0 Q l = 0 L | g q ( l ) | 2 + σ w 1 2 / α 2 .

According to ([26], Appendix A), we know that σ e 2 in (62) is lower bounded as follows:

C e tr Ψ b H Ψ b 1 m C e [ Ψ b H Ψ b ] m , m ,
(63)

where the equality holds if and only if ( Ψ b H Ψ b ) is a diagonal matrix. We then need to design the training sequence that can diagonalize ( Ψ b H Ψ b ) .

Based on the definition of Ψ b (45), the optimal training sequence that can minimize the σ e 2 requires the following conditions to be satisfied:

k = 1 P B N b k ( b k ) H B N b k ( b k ) = P b I 2 L + 1 ,
(64)
k = 1 P B N b k ( b k ) H Λ N b k 2 L ( ϖ q 1 ) ζ q 1 , k H ζ q 2 , k Λ N b k 2 L ( ϖ q 2 ) B N b k ( b k ) = 0 2 L + 1 , q 1 q 2 , q 1 , q 2 [ 0 , 2 Q ]
(65)

where P b is the power allocated to the training sequence.

Let us first focus on (64). Observing the structure of B N b k ( b k ) , we know that (64) can be fulfilled if the following conditions are satisfied:

(C1): N b k =4L+1,k[1,P],
(66)
(C2): b k = P b / P [ 0 , , 0 , 1 , 0 , , 0 ] T .
(67)

With conditions (C1) and (C2), we can further simplify (65) as

P b P k = 1 P Λ 2 L + 1 ( ϖ q 1 ) ζ q 1 , k H ζ q 2 , k Λ 2 L + 1 ( ϖ q 2 ) = P b P k = 1 P e j 2 Π N ( q 2 q 1 ) ( N s k + 2 L + i = 1 k 1 N i ) Λ 2 L + 1 ( ϖ q 2 ϖ q 1 ) = 0 2 L + 1 , q 1 q 2 , q 1 , q 2 [ 0 , 2 Q ] .
(68)

It can readily be checked that the sufficient conditions to achieve (68) are

(C3):N=P( N s k +4L+1), N s k = N s /P,k[1,P].
(69)

Conditions (C1), (C2), and (C3) imply that the equal-spaced and equal-powered training sequence can minimize the estimation MSE. This coincides with the optimal training sequence design in the traditional point-to-point channel [24].

Remark 3

Note that Equation (68) cannot hold when P=1. It indicates that the traditional transmission frame [1, 2], i.e., sending and receiving the continuous data sequence only once, is not optimal in minimizing the estimation MSE.

4.5 Block parameters

Remark 2 shows N b ≥2PL + (2Q + 1)(2L + 1) and the optimal training design requires N b =P(4L + 1) to minimize the mean-square channel estimation error. Thus, we know that P≥(2Q + 1) and N ( N s k + 4 L + 1 ) ( 2 Q + 1 ) .

Suppose a 3-tap channel and N s k = 4 L + 1 = 9 , we can obtain

2 Q + 1 = 4 f d T s N + 1 N / 18 .
(70)

It can be found

f d T s 1 72 + 1 4 N 0 . 0139 + 1 4 N .
(71)

This is the maximum normalized Doppler shift that our estimation scheme can handle. With the following parameters

  • carrier frequency f c =900 MHz and thus the wavelength λ=1/3 m,

  • data rate 20 Kbps and thus the symbol period T s =50 μ s,

we can compute from f d =V/λ that the mobile speed V should not exceed 66 m/s, which can be satisfied in most application.

5 Simulation results

In order to evaluate the inherent performance of our algorithms, the doubly selective channels are generated directly from the CE-BEM (7) and (8). The same approach has been adopted in many other articles when testing the performance of channel estimation [24, 27]. However, the real channel will be used in date detection.

We assume that carrier frequency f c =900 MHz, one symbol period T s =50 μ s and the maximum mobility speed is 90 km/h. Thus, the maximum Doppler shift is f d =75 Hz and f d T s =3. 75×10−3. Suppose one block contains 360 symbols, i.e., N=360. Then Q=2N f d T s =4. We also assume that both doubly selective channels h(i;l) and g(i;l) has three taps, i.e., L=2. Thus, we know that P≥(2Q + 1)=9 and N b P(4L + 1)=81. The variance of each tap for channel h(i;l) is σ h , l 2 = q = 0 Q σ h , q , l 2 = e l / 10 and that for channel g(i;l) is σ g , l 2 = q = 0 Q σ g , q , l 2 = e l / 10 . The variance of the noise is taken as σ w 1 2 = σ w 2 2 = 1 . The SNR is defined as the ratio of symbol power to the noise power, i.e., E s /N0. BPSK constellation is utilized for both training and data symbols. One thousand Monte-Carlo trials are used for the averaging.

First we set the total number of trainings N b =120 and adopt three types of training: equi-powered and equi-spaced (our optimal design); equi-powered but with random length; equi-spaced but with random power. For performance comparison, the total power for each types of trainings is the same. For each type of training, we find the MSE of our specially defined channel η. The estimation MSEs versus SNR for each type are plotted in Figure 6. The lower bound of σ e 2 (63) is also plotted for comparison. It can be seen that the equi-spaced equi-powered training achieves the minimum estimation MSE among all the three trainings and its MSE almost approaches the lower bound in (63).

Figure 6
figure 6

Channel MSE versus the SNR.

Next we use the estimated channel η ̂ to perform data detection. Define bit error rate (BER) as the ratio of number of successfully decoded data symbols over N s the number of transmitted data symbols. The BER versus SNR is plotted in Figure 7. The BER curve in the case of perfectly known channel η is also plotted for comparison. It can be seen that our detection method works well, and at high SNR our BER curve approaches that of the ideal case when the channel is perfectly known at the receiver.

Figure 7
figure 7

BER versus the SNR: CE-BEM channel.

We also examine the performance of the suggested estimation and detection methods under real channel situations. That is, the channel are generated according to (5) and (6), and next our suggested estimation and detection methods are utilized. Three different number of bases Q are chosen as 4, 6, and 8, respectively, and hence the corresponding number of data symbols N s is 279, 243, and 207. The BER versus SNR is plotted in Figure 8. For comparison, the BER curve under perfect channel knowledge at the receiver is also displayed. Clearly, the proposed methods yield effective data detection. An error floor is observed in the high SNR region due to the mismatch between the BEM model and the real channels. Obviously, the place where the floor begins could be improved by increasing the number Q.

Figure 8
figure 8

BER versus the SNR: real channel.

In the last example, we choose three different number of subblocks P as 2Q + 1, 2Q + 2, and 2Q + 4, respectively, and the space left for data transmission is N s k = N P ( 4 L + 1 ) = 279, 270, and 252, respectively. Define the transmission efficiency as the ratio of the number of successfully decoded data symbols over total number of symbols, i.e., N s ×BER/N. We run the simulation process as SNR ranges from −10 to 30 dB. The transmission efficiency at different SNR for each P is plotted in Figure 9. It is shown that when the number of subblocks P equals 2Q + 1, the best transmission efficiency is achieved at all SNR. It can be explained that when P increase by one unit, the data loss will be 4L + 1, which cannot be compensated even if channel estimation performance can be improved by larger P.

Figure 9
figure 9

Transmission efficiency versus SNR.

6 Conclusion

In this article, doubly selective channel estimation was considered for AF-based relay networks. Based on the CE-BEM, we designed an efficient method to estimate the channel coefficients and detect data symbols. The optimal training sequence that can minimize the estimation MSE was also derived. Finally, extensive numerical results are provided to corroborate the proposed studies.

Appendix 1

Proof of Lemma 1

Assuming that h ̀ l = 1 is perfectly known, the best CE-BEM coefficients h ̄ l are obtained from the LS criterion as

h ̄ l = ( A H A ) 1 A H h ̀ l = 1 .
(72)

Thus, the approximation error can be simplified as

e 1 = A ( A H A ) 1 A H I N h ̀ l = 1 = U 0 M U 0 H h ̀ l = 1 ,
(73)

where M is defined as

M = 0 Q + 1 0 ( Q + 1 ) × ( N Q 1 ) 0 ( N Q 1 ) × ( Q + 1 ) I N Q 1 .
(74)

The MSE of the approximation error e1is

V = 1 N E h ̀ l = 1 H U 0 M 2 U 0 H h ̀ l = 1 = 1 N k = Q + 2 N E h ̀ l = 1 H u k u k H h ̀ l = 1 .
(75)

It can readily be checked that

E h ̀ l = 1 H u k u k H h ̀ l = 1 = E tr ( h ̀ l = 1 H u k u k H h ̀ l = 1 ) = E u k H h ̀ l = 1 h ̀ l = 1 H u k = u k H R h u k .
(76)

Substituting (76) into (75) yields (13).

Appendix 2

Proof of Lemma 4

The interference during the data detection can be expressed as

v = αΔ ( G s H s ) s + w s + αΔ ( G s H b ̄ ) b ̄ + αΔ ( G b ̄ H b ̆ ) b ̆ ,
(77)

where Δ(·) denotes the estimation error of the inside item. The correlation function of the interference v is given by

R v = α 2 P s E ( Δ ( G s H s ) Δ ( G s H s ) H ) + E ( w s w s H ) + α 2 E ( Δ ( G s H b ̄ ) b ̄ b ̄ H Δ ( G s H b ̄ ) H ) + α 2 E ( Δ ( G b ̄ H b ̆ ) b ̆ b ̆ H Δ ( G b ̄ H b ̆ ) H ) ,
(78)

where P s is the power allocated to the data sequence. We need to find the training scheme that can minimize the trace of R v .

Suppose there are two training schemes with identical E ( ( η η ̂ ) ( η η ̂ ) H ) . Thus, the first and the second items in (78) are the same for both training schemes. Clearly, the trace of the two items for both training schemes is the same. If the training scheme has the first 2L and the last 2L entries equal to zero, the third and fourth item in (78) will become zero; if the training scheme does not has such condition, then it cannot null these two semi-definite items.

Endnotes

aThere may exists a switching time at the relay node R , which results in a delay Δ in the second retransmission phase. However, our model can be well adapted by setting g(i + Δ;l)=g(i0;l) where i0 is the new starting point for the following CE-BEM approximation.bIt is because that ej 2ΠQ/(2N)A can be considered as part of a discrete Fourier transform matrix.

References

  1. Laneman J, Wornell G: Distributed space time block coded protocols for exploiting cooperative diversity in wireless networks. IEEE Trans. Inf. Theory 2003, 49(10):2415-2425. 10.1109/TIT.2003.817829

    Article  MathSciNet  Google Scholar 

  2. Laneman J, Tse D, Wornell G: Cooperative diversity in wireless networks: efficient protocols and outage behavior. IEEE Trans. Inf. Theory 2004, 50(12):3062-3080. 10.1109/TIT.2004.838089

    Article  MathSciNet  Google Scholar 

  3. Boyer J, Falconer DD, Yanikomeroglu H: Multihop diversity in wireless relaying channels. IEEE Trans. Commun 2004, 52(9):1820-1830.

    Article  Google Scholar 

  4. Yang Y, Hu H, Xu J, Mao G: Relay technologies for WiMax and LTE-advanced mobile systems. IEEE Commun. Mag 2009, 47(10):100-105.

    Article  Google Scholar 

  5. Patel CS, Stuber GL: Channel estimation for amplify and forward relay based cooperation diversity systems. IEEE Trans. Wirel. Commun 2007, 6(6):3348-3356.

    Article  Google Scholar 

  6. Gao F, Cui T, Nallanathan A: On channel estimation and optimal training design for amplify and forward relay network. IEEE Trans. Wirel. Commun 2008, 7(5):1907-1916.

    Article  Google Scholar 

  7. Lalos AS, Rontogiannis AA, Berberidis K: Frequency domain channel estimation for cooperative communication networks. IEEE Trans. Signal Process 2010, 58(6):3400-3405.

    Article  MathSciNet  Google Scholar 

  8. Song MG, Kim D, Im GH: Recursive channel estimation method for OFDM-based cooperative systems. IEEE Commun. Lett 2010, 14(11):1029-1031.

    Article  Google Scholar 

  9. Jakes WC: Microwave Mobile Communications. Wiley, New York; 1974.

    Google Scholar 

  10. Wang G, Gao F, Chen W, Tellambura C: Channel estimation and training design for two-way relay networks in time-selective fading environment. IEEE Trans. Wirel. Commun 2011, 10(8):2681-2691.

    Article  Google Scholar 

  11. He L, Wu Y-C, Ma S, Ng T-S, Poor HV: Superimposed training based channel estimation and data detection for OFDM amplify-and-forward cooperative systems under high mobility. IEEE Trans. Signal Process 2012, 60(1):274-284.

    Article  MathSciNet  Google Scholar 

  12. Canonne-Velasquez L, Nguyen-Le H, Le-Ngoc T: Doubly selective cascaded channel estimation in amplify-and-forward relay systems. In Proc. IEEE ICC. (Kyoto, Japan; 5:1-5.

    Google Scholar 

  13. Tong L, Sadler BM, Dong M: Pilot-assisted wireless transmissions. IEEE Signal Process. Mag 2004, 21(6):12-25. 10.1109/MSP.2004.1359139

    Article  Google Scholar 

  14. Giannakis GB, Tepedelenlioǧlu C: Basis expnsion models and diversity techniques for blind indentification and equalization of time-varying channels. Proc. IEEE 1998, 86: 1969-1986. 10.1109/5.720248

    Article  Google Scholar 

  15. Dong M, Tong L, Sadler B: Optimal insertion of pilot symbols for transmissions over time-varying flat fading channels. IEEE Trans. Signal Process 2004, 52(5):1403-1418. 10.1109/TSP.2004.826182

    Article  MathSciNet  Google Scholar 

  16. Ma X, Giannakis GB: Maximum-diversity transmissions over doubly-selective wireless channels. IEEE Trans. Inf. Theory 2003, 49: 1832-1840. 10.1109/TIT.2003.813485

    Article  MathSciNet  Google Scholar 

  17. Borah DK, Hart BD: Frequency-selective fading channel estimation with a polynomial time-varying channel model. IEEE Trans. Commun 1999, 47: 862-873.

    Article  Google Scholar 

  18. Martone M: Wavelet-based separating kernels for sequence estimation with unknown rapidly time-varying channels. IEEE Commun. Lett 1999, 3: 78-80.

    Article  Google Scholar 

  19. Slepian D: Prolate spheroidal wave functions, Fourier analysis, and uncertainty–V: the discrete case. Bell Syst. Technol. J 1978, 57(5):1371-1430.

    Article  Google Scholar 

  20. He S, Tugnait J: On doubly selective channel estimation using superimposed training and discrete prolate spheroidal sequences. IEEE Trans. Signal Process 2008, 56: 3214-3228.

    Article  MathSciNet  Google Scholar 

  21. Tugnait JK, He S, Kim H: Doubly selective channel estimation using exponential basis models and subblock tracking. IEEE Trans. Signal Process 2010, 58(3):1275-1289.

    Article  MathSciNet  Google Scholar 

  22. Patel CS, Stuber GL, Pratt TG: Statistical properties of amplify and forward relay fading channels. IEEE Trans. Veh. Technol 2006, 55(1):1-9. 10.1109/TVT.2005.861170

    Article  Google Scholar 

  23. Akki AS, Haber F: A statistical model for mobile-to-mobile land communication channel. IEEE Trans. Veh. Technol 1986, VT-35(1):2-7.

    Article  Google Scholar 

  24. Ma X, Giannakis GB, Ohno S: Optimal training for block transmissions over doubly selective wireless fading channels. IEEE Trans. Signal Process 2003, 41(5):1351-1366.

    MathSciNet  Google Scholar 

  25. Wang G, Gao F, Wu Y-C, Tellambura C: Joint carrier frequency offset and channel estimation for two-way relay networks. IEEE Trans. Wirel. Commun 2011, 10(2):456-465.

    Article  Google Scholar 

  26. Ohno S, Giannakis GB: Capacity maximizing MMSE-optimal pilots for wireless OFDM over frequency-selective block Rayleigh-fading channels. IEEE Trans. Inf. Theory 2004, 50(9):2138-2145. 10.1109/TIT.2004.833365

    Article  MathSciNet  Google Scholar 

  27. Zemen T, Mecklenbr ä uker CF: Time-variant channel estimation using discrete prolate spheriodal sequences. IEEE Trans. Signal Process 2005, 53: 3597-3607.

    Article  MathSciNet  Google Scholar 

Download references

Acknowledgements

We would like to thank the anonymous reviewers for their helpful and constructive comments, which significantly improve this article. This study was supported by the National Natural Science Foundation of China (Grant Nos. 61201202 and 60971124), by the Key grant Project of Chinese Ministry of education (No. 313006), by Tsinghua-Tencent Joint Laboratory for Internet Innovation Technology, and by the State Key Laboratory of Rail Traffic Control and Safety, Beijing Jiaotong University (No. RCS2009ZT006).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Gongpu Wang.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ original submitted files for images

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Wang, G., Gao, F., Xu, R. et al. Doubly selective channel estimation for amplify-and-forward relay networks. EURASIP J. Adv. Signal Process. 2012, 251 (2012). https://doi.org/10.1186/1687-6180-2012-251

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/1687-6180-2012-251

Keywords