# PARAFAC-based frequency synchronization for SC-FDMA uplink transmissions

- Malik MU Gul
^{1}Email author, - Sungeun Lee
^{1}and - Xiaoli Ma
^{1}

**2014**:146

https://doi.org/10.1186/1687-6180-2014-146

© Gul et al.; licensee Springer. 2014

**Received: **15 April 2014

**Accepted: **5 September 2014

**Published: **24 September 2014

## Abstract

Frequency synchronization is a critical requirement for single-carrier frequency division multiple-access (SC-FDMA) uplink transmissions. Based on the key observation that the received signal at the base station can be formulated as the parallel factor analysis model, we present a joint carrier frequency offset (CFO) estimation and compensation scheme for SC-FDMA uplink transmissions with interleaved sub-carrier allocation. The proposed scheme, employing identical training symbols, guarantees the identifiability of CFO estimation, allows the system to operate on full load, and it is readily extendible to multi-antenna base station (BS) reception. Comparisons with existing approaches in the literature reveal that the proposed algorithms provide lower complexity and superior performance.

### Keywords

Single-carrier frequency division multiple-access (SC-FDMA) Orthogonal frequency division multiple-access (OFDMA) Synchronization Carrier frequency offset CFO compensation## 1 Introduction

Orthogonal frequency division multiple-access (OFDMA) is a multi-carrier multi-user transmission technology used in various wireless communication systems nowadays, e.g.[1–3]. However, high peak-to-average-power ratio (PAPR) and sensitivity to carrier frequency offsets (CFOs) are the two main challenges in implementation of OFDMA systems. High PAPR forces the power amplifier to operate with a back-off, reducing the energy efficiency of the system. Energy efficiency is a major concern in uplink (UL) transmissions for battery-operated mobile users (MUs) due to limited power resources. To resolve the high PAPR issue, a variant of OFDMA termed as single-carrier frequency division multiple-access (SC-FDMA) has been proposed[4], which employs extra discrete Fourier transform (DFT) precoding at the transmitter (TX) to reduce the PAPR of the transmitted signal. Having the advantage of lower PAPR, SC-FDMA is employed in long-term evolution (LTE) UL transmissions as opposed to OFDMA[3]. However, the reduction in PAPR is also influenced by the choice of sub-carrier allocation scheme in UL transmissions[5]. Sub-carrier allocation schemes are broadly classified into three types: grouped, interleaved, and generalized sub-carrier allocations[6]. Among the three schemes, interleaved allocation scheme (IAS), in which the sub-carriers of different users are interleaved into each other, provides the lowest PAPR[5]. In addition, in a perfectly synchronized system, SC-FDMA with IAS outperforms OFDMA with minimum mean squared error (MMSE) equalization[7], which motivates the use of IAS in SC-FDMA UL transmissions.

However, being a variant of OFDMA, SC-FDMA transmissions are still sensitive to CFOs. CFOs arise in UL transmissions due to the residual errors in the downlink (DL) synchronization or Doppler shift induced by the mobility of MUs. The orthogonality of the sub-carriers is violated by CFOs, introducing inter-carrier interference (ICI). In addition, as different MUs have independent CFOs, multiple UL CFOs also introduce multi-user interference (MUI). Without precise CFO estimation and compensation at the base station (BS), corresponding to each MU, ICI, and MUI severely degrade the system performance. Thus, frequency synchronization is a critical step in SC-FDMA transmissions just as in OFDMA. As sub-carriers of different users as interleaved into each other, CFOs generate more MUI in IAS as compared to grouped or generalized allocation schemes. The superior performance of SC-FDMA with IAS in a perfectly synchronized system is lost in the case of non-zero CFOs[8] implying that SC-FDMA is more sensitive to CFOs as compared to OFDMA and motivating the need for an efficient frequency synchronization algorithm for SC-FDMA with IAS.

Due to its similarities with OFDMA, frequency synchronization schemes proposed for OFDMA are applicable to SC-FDMA as well. CFO estimation schemes proposed in the literature can be classified based on sub-carrier allocation schemes, like[9–11] for grouped allocation,[12–17] for generalized allocation, and[18–23] for IAS. These schemes can also be classified as data-aided[12, 15–17] or non-data-aided/blind schemes[9–11, 13, 18, 19, 21–23]. Data-aided schemes use specially designed training symbols[12, 16, 17] or pilot sub-carriers[15] to perform CFO estimation while blind schemes use null sub-carriers[9, 13], the inherent structure of the transmitted symbol[18, 19, 21, 22] or its statistical characteristics[10].

For CFO estimation in SC-FDMA with IAS, multiple signal classification (MUSIC) algorithm is employed in[18] for blind CFO estimation using a line search. A closed-form solution with better performance at lower signal-to-noise ratios (SNR) is proposed in[19] employing the estimation of signal parameters via rotational invariance technique (ESPRIT). As both MUSIC and ESPRIT are sub-space-based techniques, the CFO estimation schemes in[18, 19] require a non-empty noise sub-space for CFO estimation. This condition implies that the system cannot operate on full load or an extended cyclic prefix (CP) must be employed, which reduces the overall spectral efficiency. Low-complexity blind CFO estimation algorithm has been proposed in[22] while two CFO estimation methods for multi-antenna BSs, based on rank reduction and alternating projection methods are proposed in[21]. Although the schemes in[21] support fully loaded systems, the number of receive antennas must be greater than the channel order, which restricts its applicability in broadband systems. Data-aided maximum likelihood (ML) CFO estimation has been proposed in[16]. However, the complexity of the proposed scheme is too high for practical implementations. A sub-optimal algorithm with lower computational complexity but degraded performance is proposed in[17].

In addition to CFO estimation, various CFO compensation algorithms have also been proposed in the literature[24–29]. An interference cancelation scheme based on circular convolution is proposed in[24] while parallel interference cancelation is used in[25] for CFO compensation. A linear decorrelator detector is proposed in[26]. However, the method is computationally intensive as it requires inversion of a large matrix for satisfactory performance. A joint CFO compensation and channel estimation algorithm is proposed in[27] for multi-antenna SC-FDMA systems. Several time-domain CFO compensation schemes paired with successive interference cancelation and MU ordering are proposed in[24]. A low-complexity CFO compensation scheme for OFDMA transmissions has been proposed in[29] for interleaved OFDMA transmissions.

In this paper, we propose a joint CFO estimation and compensation scheme for SC-FDMA UL transmissions with IAS using parallel factor (PARAFAC) analysis method[30]. PARAFAC, an extension of matrix decomposition to tensors, has found numerous applications in signal processing including multi-user detection in CDMA systems[31, 32], CFO estimation for DL OFDM systems[33], channel equalization[34], and array signal processing[35, 36]. However, the scheme proposed in[33] is for single-user OFDM transmissions and it is not applicable to multi-user UL SC-FDMA transmissions, discussed in this paper. Employing identical training symbols and exploiting the structure of transmitted SC-FDMA signals with IAS, we show that in the presence of CFOs, the signal received at the BS can be formulated as the PARAFAC model. This key observation allows us to apply tensor decomposition methods to jointly retrieve the estimates of CFO and the MUI-free received training symbol corresponding to each user. The formulation also allows us to perform a low-complexity CFO compensation for subsequent data symbols. The proposed algorithm is semi-blind in the sense that it requires *identical* training symbols but the *actual knowledge* of the training symbols is not required. Thus, the training signals can be designed to optimize other receiver operations like timing synchronization or channel estimation. We prove that the proposed algorithm guarantees the identifiability of CFO estimation while allowing the system to operate on full load as opposed to[18, 19, 21]. In addition, the proposed algorithm can be readily extended to multi-antenna BS reception and offers better performance as compared to the existing approaches in the literature.

The rest of the paper has been organized as follows. Section 2 presents the system model and the structure of transmitted and received signals in SC-FDMA uplink transmissions. The proposed joint CFO estimation and compensation method is presented in Section 3 along with the discussion on identifiability. Section 4 presents the extension of the proposed algorithm to multi-antenna BS case while tensor decomposition algorithms and the associated computational complexity are discussed in Section 5. Section 6 presents the simulation results followed by conclusions in Section 7.

## 2 System model

*M*active users transmit to a BS using their allocated sub-carriers. The maximum number of users allowed in the system is denoted as

*M*

_{ u }. The total number of sub-carriers is denoted as

*N*while the number of sub-carriers allocated to each user is$R=\frac{N}{{M}_{u}}$. The transmission takes place in the form of frames where each frame contains a block of

*Q*identical SC-FDMA training symbols followed by data symbols. The block diagram of the SC-FDMA system is shown in Figure1.

### 2.1 Transmitted signal structure

*m*th user, 0 ≤

*m*≤

*M*- 1 is denoted as${\mathcal{I}}_{m}:=\left\{{v}_{m}+{\mathit{\text{rM}}}_{u}\mid 0\le r\le R-1\right\}$, where

*v*

_{ m }is the unique starting index of the

*m*th user. The

*q*th SC-FDMA training symbol contains

*R*training points, which are precoded by an

*R*point DFT and then mapped to the sub-carriers allocated to the

*m*th user. Thus, the

*k*th sub-carrier, 0 ≤

*k*≤

*N*- 1, of the

*q*th training symbol transmitted by the

*m*th user is given as

*Q*training symbols are identical, we drop the index

*q*in

*x*

_{ m }[

*r*

^{′}]. A

*N*point inverse discrete Fourier transform (IDFT) of

*X*

_{ m }[

*q*,

*k*], 0 ≤

*k*≤

*N*- 1, is then computed to generate the

*q*th SC-FDMA training symbol in time domain with

*n*th sample given as

*X*

_{ m }[

*q*,

*k*] is non-zero only for$k\in {\mathcal{I}}_{m}$,

where *δ*(·) is the Kronecker delta function, and (·)_{
R
} represents the modulo *R* operation. Equation 4 shows that the transmitted SC-FDMA signal of the *m* th user contains *M*_{
u
} copies of the original signal *x*_{
m
}[*r*], 0 ≤ *r* ≤ *R* - 1 scaled by the phase term corresponding to the starting index of the user. A CP of *N*_{
g
} samples is then appended at the start of each SC-FDMA symbol to combat inter-symbol interference (ISI) due to multi-path channel.

### 2.2 Received signal structure

*m*th user and the BS is assumed to be of length

*P*

_{ m }and denoted as${\stackrel{~}{h}}_{m}[p],0\le p\le {P}_{m}-1$. We assume that the channel remains constant for the

*Q*training symbols, and

*N*

_{ g }≥

*P*- 1 where

*P*= max

_{ m }{

*P*

_{ m }+

*d*

_{ m }}, and

*d*

_{ m }is the timing offset (TO) of the

*m*th user. Under this assumption, the TO of the

*m*th user can be incorporated into its channel and the effective channel is given as

Such a system is termed as quasi-synchronous system[6]. Once the TO estimate of each user is estimated at the BS, it is fed back to the user to advance its timing correspondingly and thus, the length of the CP of following symbols can be reduced.

*qth*received symbol at the BS is the sum of the

*M*SC-FDMA symbols transmitted by the MUs and given as

*f*

_{ m }is the CFO between the

*m*th user and the BS normalized by the sub-carrier spacing of the SC-FDMA system,

*N*

_{ t }=

*N*+

*N*

_{ g }, and

*w*[

*q*,

*n*] represents the corresponding additive white Gaussian noise (AWGN) sample. We assume that |

*f*

_{ m }| < 0.5, ∀

*m*implying that only fractional CFOs appear in the UL transmission. It is a reasonable assumption since UL transmissions initiate after DL synchronization, which implies that the UL CFOs can only be caused by the Doppler effect or residual DL synchronization errors. Using Equation 4, Equation 6 can be written as

*P*≤

*R*, the inner sum in Equation 7 is also periodic with period

*R*. If we denote

*x*

_{ m }[

*r*],0 ≤

*r*≤

*R*- 1, then Equation 7 can be written as

where *ϕ*_{
m
} = *f*_{
m
} + *v*_{
m
} and *ψ*_{
m
} = *f*_{
m
}*N*_{
g
}. Thus, the received signal is also periodic except for the phase terms introduced by the CFOs and the starting indices. As non-zeros CFOs introduce ICI and MUI, which severely degrade the overall system performance, CFO estimation and compensation must be performed before channel estimation, equalization, and subsequent receiver processes as shown in Figure1.

## 3 CFO estimation and compensation for SC-FDMA UL with IAS

In this section, we present the proposed CFO estimation and compensation algorithms for SC-FDMA UL transmissions. As the proposed scheme is based on PARAFAC analysis, we start by showing that the received signal in Equation 9 can be formulated as a PARAFAC model, which will allow us to apply tensor decomposition algorithms to estimate the CFO and the MUI-free signal of each user. For an introduction to PARAFAC, refer to[30, 31].

### 3.1 SC-FDMA UL as a PARAFAC model

*q*th received signal in (9) in an

*M*

_{ u }×

*R*matrix

**Y**(

*q*) can be expressed as

and **D**_{
M
}(*q*) is a *M* × *M* diagonal matrix with diagonal entries${e}^{j\frac{2\pi}{N}{f}_{m}{\mathit{\text{qN}}}_{t}}\phantom{\rule{1em}{0ex}}0\le m\le M-1$, and **W**(*q*) is an *M*_{
u
} × *R* matrix containing the corresponding AWGN samples.

*Q*training symbols are identical, if we stack all

**Y**(

*q*) matrices corresponding to the

*Q*received training symbols, the resulting

*Q*

*M*

_{ u }×

*R*matrix can be expressed as

*Q*×

*M*matrix with

*q*th row containing the diagonal entries of

**D**

_{ M }(

*q*). Similarly it can be shown that

**Y**

^{ R }is a

*R*

*Q*×

*M*

_{ u }matrix, which is a concatenation of

*R*matrices with

*r*th matrix given as

where${\mathbf{Y}}^{{M}_{u}}$ is an *M*_{
u
}*R* × *Q* matrix with${Y}^{{M}_{u}}[n,q]=y[q,n]$. In the absence of noise, Equations 14, 16, and 17 form the PARAFAC model of three-way tensors with rank *M*[30], which allows us to apply tensor decomposition algorithms to obtain estimates of **A**, **B**, and **C**, denoted as$\stackrel{\u02c6}{\mathbf{A}}$,$\stackrel{\u02c6}{\mathbf{B}}$, and$\stackrel{\u02c6}{\mathbf{C}}$, respectively. The details of the algorithms along with the discussion on computational requirements are deferred to Section 5. The CFO estimate and the MUI-free received signal estimate corresponding to each user can then be extracted from$\stackrel{\u02c6}{\mathbf{A}}$,$\stackrel{\u02c6}{\mathbf{B}}$, and$\stackrel{\u02c6}{\mathbf{C}}$ as shown in the following sections. We refer to Equations 14, 16, and 17 collectively as the PARAFAC model. A key advantage of employing the PARAFAC model is that the uniqueness of the signal decomposition in the PARAFAC model and thus, identifiability of CFO estimation can be guaranteed.

### 3.2 Identifiability of CFO estimation

We now discuss the conditions, which guarantee the identifiability of CFO estimation by using the uniqueness properties of the PARAFAC model. Identifiability refers to the uniqueness of the CFO estimates. Guaranteeing the identifiability of CFO estimation is an important requirement for channel frequency selectivity as well as MUI can affect the identifiability of CFO estimation[13, 37].

where *k*_{
A
} denotes the *k*-rank of **A**. Specifically, if rank (**A**) = *M* and every *l* ≤ *M* columns of **A** are linearly independent, then the *k*-rank of **A** is *l*[31]. We now investigate the *k*-rank of the matrices involved in the proposed PARAFAC model for the SC-FDMA UL in Equation 14. Recall that *ϕ*_{
m
} = *v*_{
m
} + *f*_{
m
} in Equation 11. As the CFO of each user is fractional, i.e., *f*_{
m
} < 0.5 as mentioned in Section 2, and the starting index *v*_{
m
} of each user is unique, *v*_{
m
}-0.5 < *ϕ*_{
m
} < *v*_{
m
} + 0.5 in Equation 11 lies in a unique interval for each user even if two or more users have CFO values close to each other. Therefore, using the fact that **A** is a Vandermonde matrix, it has full *k*-rank, i.e., *k*_{
A
} = min(*M*_{
u
},*M*) = *M*. Moreover, as the CFO is a continuous random variable, the probability that the CFOs of two different users are exactly the same is zero, or in other words, the event that the CFOs of two different users are the same occurs *almost never* ([39], p. 232). Thus, **B**, being a Vandermonde matrix as well, has a full *k*-rank, i.e., *k*_{
B
} = min(*Q*,*M*). Similarly, it can be shown that the matrix **C** in Equation 12 has full *k*-rank if the training symbols *x*_{
m
}(*r*),0 ≤ *r* ≤ *R* - 1 and channel of different users are un-correlated from each other. In that case, *k*_{
C
} = min(*R*,*M*). If *R* ≥ *M*, which is usually the case in broadband wireless systems, the condition in Equation 18 can be satisfied for *Q* ≥ 2. Thus, provided *R* ≥ *M*, **A**, **B** **,** and **C** can be uniquely estimated up to scaling and column permutation by transmission of only two identical training symbols. For BSs equipped with multiple antennas, the condition *R* ≥ *M* can be relaxed as discussed in Section 4.

**A**and

**B**is a complex exponential, any scaling of columns of$\stackrel{\u02c6}{\mathbf{A}}$ and$\stackrel{\u02c6}{\mathbf{B}}$ can be easily identified. The scaling of the columns of

**C**can then be rectified by dividing each column with the product of the scaling of respective columns of

**A**and

**B**. As

*ϕ*

_{ m }=

*v*

_{ m }+

*f*

_{ m }lies in a unique interval for each user, as discussed earlier, any column permutation can also be easily identified through$\stackrel{\u02c6}{\mathbf{A}}$. Specifically, the index of the column of$\stackrel{\u02c6}{\mathbf{A}}$ corresponding to the

*m*th user, denoted as

*c*

_{ m }is given as (see Appendix A)

### 3.3 CFO estimation using PARAFAC

**A**and

**B**depend on CFO,$\stackrel{\u02c6}{\mathbf{A}}$ and/or$\stackrel{\u02c6}{\mathbf{B}}$ can be employed for CFO estimation after correction of scaling and column permutation. We use the entries of$\stackrel{\u02c6}{\mathbf{B}}$, and the CFO of the

*m*th user is estimated as (see Appendix B)

Thus, the acquisition range of CFO estimation is${f}_{max}=\frac{N}{2{N}_{t}}$ sub-carrier spacings.

*Remark 1*. The CFO estimation schemes proposed in[18, 19, 21] cannot operate on fully loaded systems and require either CP extension or multiple antenna BSs to support full load. However, the proposed CFO estimation is independent of the number of active user and thus, provides better spectral efficiency as compared to existing algorithms.

*Remark 2*. It is now clear that the proposed CFO estimation only requires the transmissions of at least two identical training symbols. The idea of identical training signals has been employed before in timing and carrier synchronization for OFDM system, e.g., see[40]. Also, two out of four preambles employed in random access in LTE-Advanced contain two identical symbols[3]. However, it is important to note that the knowledge of actual data transmitted on the sub-carriers of the training signals is not required for CFO estimation. Thus, the proposed scheme can be referred as a semi-blind CFO estimation, and the training signal can be designed to optimize other receiver tasks such as timing synchronization and/or channel estimation.

### 3.4 CFO compensation for the training symbol

**C**, whose columns contain MUI-free received signals corresponding to each user. Thus, another advantage of PARAFAC decomposition is that it automatically decouples user’s received training signals, and CFO compensation can be performed for each user separately to recover

*y*

_{ m }[

*r*] 0 ≤

*r*≤

*R*- 1 in Equation 8. Specifically,

Once the CFO-compensated training signal of each user is available, it can be employed for channel estimation. Standard channel estimation techniques, like least squares (LS) or MMSE channel estimation[41] are applicable to the estimate of the received training signal in Equation 21.

### 3.5 CFO compensation for the data symbols

**C**replaced by a symbol-dependent matrix

**C**(

*q*) whose columns contain the MUI-free received data symbols of the users. As the receiver has already estimated the CFOs, it has the knowledge of$\stackrel{\u02c6}{\mathbf{A}}$ and${\stackrel{\u02c6}{\mathbf{D}}}_{M}(q)$ in Equation 10, and

**C**(

*q*) for the

*q*th data SC-FDMA symbol can estimated as

where${\stackrel{\u02c6}{\mathbf{A}}}^{\u2020}$ denotes the pseudo-inverse of **A**. CFO compensation for each user can then be performed separately using Equation 21. Thus, the formulation of the received signal in Equation 10, leading to the PARAFAC model, also allows us to design a time-domain CFO compensation scheme for data symbols using Equations 22 and 21. The CFO compensation scheme in[29] also formulates the received periodic signal in interleaved OFDMA transmissions as a low-dimensional matrix form similar to **Y**(*q*) for CFO compensation. However, the scheme in[29] is based on OFDMA transmissions, and thus, the scheme in[29] is not directly applicable to SC-FDMA transmissions.

#### 3.5.1 Complexity analysis of CFO compensation

We now calculate the computational requirements of the proposed CFO compensation scheme under full load and compare it with some of the existing schemes in the literature. Complexity analysis of PARAFAC decomposition for CFO estimation is discussed in Section 5. Calculation of$\stackrel{\u02c6}{\mathbf{C}}(q)$ in Equation 22 requires inversion of$\stackrel{\u02c6}{\mathbf{A}}$, which can be efficiently implemented using the fact that **A** is Vandermonde matrix and a closed-form solution for its inverse exists with a complexity of only$O\left({M}_{u}^{2}\right)$ operations[42]. Thus, the total computational load of Equations 22 and 21 is$\mathit{\text{O}}\phantom{\rule{0.3em}{0ex}}\left((N+1){M}_{u}+{M}_{u}^{2}\right)$. In contrast, the time-domain CFO compensation scheme in[28] requires the inversion of an *N* × *N* block diagonal matrix with$\mathit{\text{O}}\left({M}_{u}^{2}N+{N}^{2}\right)$ operations while the linear decorrelation-based frequency-domain CFO compensation scheme in[26] requires inversion of an *N* × *N* matrix with *O*(*N*^{3}) operations. The complexity of CFO compensation in[26] can be lowered by a banded matrix approximation at the cost of performance degradation. Thus, compared with the existing schemes, the proposed CFO compensation has the advantage of significantly reduced computational complexity.

## 4 Extension to multi-antenna BS case

*N*

_{ r }receive antennas, we denote the parameters associated with the

*t*th antenna by using the subscript ‘

*t*’. Thus,

*y*[

*q*,

*n*],

*y*

_{ m }[

*r*], and

*h*

_{ m }[

*p*] in Equations 8 and 9 are replaced by

*y*

_{ t }[

*q*,

*n*],

*y*

_{t,m}[

*r*], and

*h*

_{t,m}[

*p*], respectively. We assume that the channels between a user and different receive antennas are independent. However, receive antennas are collocated, and all receive chains are assumed to employ a common clock, which implies that the CFO across receiver chains remains constant. This allows us to employ the PARAFAC model in Equation 14 for the multi-antenna BS case with

**A**and

**B**unchanged as in Equations 11 and 15, respectively, while

**Y**

^{ Q }in Equation 14 is updated as

*Y*

_{ t }(

*q*) in Equation 10 corresponding to different receiver antennas. Correspondingly,

**C**in Equation 12 is updated as

*r*th row and

*m*th column of

**C**

_{ t }given as

Thus, the PARAFAC model is still applicable to the multi-antenna BS reception, and tensor decomposition can still be applied to jointly estimate the CFOs and MUI-free received signal corresponding to different users and receive antennas. A better CFO estimation performance is expected in this case due to more information brought by the additional receive chains at the cost of increased computational complexity.

*Remark 3*. As the dimensions of **C** in Equation 24 grow to *N*_{
r
}*R* × *M* for multi-antenna BS as opposed to *R* × *M* in Equation 12 for single-antenna BS, the *k*-rank of **C** is given as *k*_{
C
} = min(*N*_{
r
}*R*,*M*). Thus, according to the discussion in Section 3.2, identifiability of CFO estimation in multi-antenna BS case is guaranteed if *N*_{
r
}*R* ≥ *M*, which is a less restrictive condition as compared to *R* ≥ *M* for single-antenna BS. Moreover, the CFO compensation scheme presented in Section 3.5 is also applicable to multi-antenna reception. Since **A** and **B** remain un-changed,${\stackrel{\u02c6}{\mathbf{A}}}^{\u2020}$ in Equation 22 needs to be calculated only once for CFO compensation corresponding to all receive antennas. Thus, the complexity of CFO compensation increases only linearly with respect to the number of receive antennas.

## 5 Tensor decomposition algorithms and the associated computational complexity

In this section, we discuss how to estimate the matrices involved in the proposed PARAFAC model in Section 3.1. Various tensor decomposition algorithms proposed in the literature can be employed for estimation of **A**, **B** **,** and **C** matrices of the proposed PARAFAC model. The examples include alternating least squares (ALS), gradient descent algorithm, conjugate gradient algorithm, and the Levenberg-Marquardt algorithm. For a description of these algorithms in the context of PARAFAC, please refer to[30].

### 5.1 PARAFAC decomposition using alternating least squares

ALS is one of the most popular algorithms for PARAFAC, which we briefly describe here in the context of SC-FDMA CFO estimation.

**A**,

**B**

**,**and

**C**matrices can be estimated by minimizing either of the following cost functions

*i*, given${\stackrel{\u02c6}{\mathbf{A}}}^{(i-1)}$ and${\stackrel{\u02c6}{\mathbf{B}}}^{(i-1)}$,${\stackrel{\u02c6}{\mathbf{C}}}^{i}$ is calculated as the LS solution of

*J*

^{ Q }given as

respectively, and the algorithm moves on to the next iteration. The initial values, i.e.,${\stackrel{\u02c6}{\mathbf{A}}}^{(0)}$ and${\stackrel{\u02c6}{\mathbf{B}}}^{(0)}$ required for starting the algorithm are computed by setting the CFO of each user to zero, i.e., *f*_{
m
} = 0 ∀*m* in Equations 11 and 15, respectively. The effect of number of iterations on the convergence of the algorithm will be shown through simulation results in Section 6.

### 5.2 Computational complexity of PARAFAC decomposition

The computational complexity of ALS is dominated by the pseudo-inverse calculations required in Equations 25, 26, and 27 for each iteration. Thus, the computational complexity is given by *O*(*N*_{
i
}((*Q* *M*_{
u
} + *N* + *Q* *R*)(*M*^{2} + *M*) + *N* *Q* *M*)) operations[30], where *N*_{
i
} is the number of iterations of ALS. As a comparison, the complexity of data-aided ML CFO estimation in[16] is *O*(*M* *N*_{
c
}*N*_{
i
}(*P*^{3} + *P* *N*^{2})) where *N*_{
c
} is the number of candidate CFO values used in the line search, which is significantly higher than the proposed algorithm. On the other hand, the complexity of blind ESPRIT-based CFO estimation in[19] is$\mathit{\text{O}}\left({M}_{u}^{3}+{M}_{u}^{2}N\right)$, which is about the same complexity order as the proposed algorithm. However, the performance of the proposed algorithm is significantly better than[19] as shown in Section 6.

## 6 Simulation results

In this section, we evaluate the performance of the proposed algorithms through Monte Carlo simulations and compare it with some of the existing schemes in the literature. For CFO estimation, we compare with the algorithms in[16, 18, 19], and for CFO compensation, we compare with the CFO compensation scheme in[26]. The simulation parameters are as follows: the total number of sub-carriers *N* = 128, length of CP *N*_{
g
} = 16, maximum number of users *M*_{
u
} = 8, active users *M* = 4, number of sub-carrier allocated to each user$R=\frac{N}{8}=16$, and channel length for each user *P*_{
m
} = 6. For each iteration of the simulation, the CFO of each user is modeled as a random variable uniformly distributed between [-*f*_{max},*f*_{max}] with *f*_{max} = 0.4. The modulation scheme is quadrature phase shift keying (QPSK). The transmission is modeled in the form of frames, where each frame contains *Q* identical training symbols in the beginning followed by 20 SC-FDMA symbols carrying data. CFO and channel estimation is carried out using the training symbols, and the estimates are employed to decode all the data symbols in that frame. For comparison, *Q* symbols are also employed for CFO estimation using[16, 18, 19].

### 6.1 CFO estimation performance with increasing *f*_{max}

*f*

_{max}, i.e., the maximum absolute CFO at an SNR of 20 dB. The rest of the simulation parameters is given in the beginning of this section. As the maximum CFO increases, ICI and MUI increase, and we expect to see degradation in CFO estimation performance. This trend is visible in Figure3 for both values of simulated SNR as the MSE of CFO estimation increases slightly as

*f*

_{max}increases.

### 6.2 Effect of number of training symbols *Q* on CFO estimation

*Q*and the number of iterations of ALS on the MSE of CFO estimation at an SNR of 20 dB. The values of

*Q*range from 2 to 5, and the results are shown in Figure4. As shown, the MSE of CFO estimation decreases as more training symbols and/or more iterations of ALS are employed for CFO estimation. However, the price for the increase in CFO estimation accuracy is decreased spectral efficiency due to more training symbols and/or increased complexity and latency of CFO estimation due to more iteration of ALS. Thus, there exists a trade-off between CFO estimation performance and spectral efficiency/complexity. Figure4 also shows that the improvement in performance of CFO estimation is negligible after about 4 to 6 iterations of ALS. Thus, in the following simulations, only 5 iterations of ALS are employed.

### 6.3 CFO estimation performance for multi-antenna BS

*N*

_{ r }is increased from 1 to 4, about 7 dB improvement in performance is observed.

### 6.4 Comparison of CFO estimation performance with existing schemes

*Q*= 2 is employed for CFO estimation in all schemes although both schemes in[18, 19] are truly blind schemes and do not require knowledge of the training symbols. The proposed scheme is semi-blind as discussed in Section 3 because it does not require the knowledge of the training symbols and only requires that the

*Q*= 2 training symbols are identical. The scheme in[16] is data-aided and requires the knowledge of all training symbols. We also plot the modified Cramer-Rao bound (MCRB) for single-user OFDM ([43], Equation 7) as an absolute reference for CFO estimation performance. The number of candidate CFO values employed in the line search used in MUSIC and APFE is 1,000. The comparison, given in Figure6, shows that the performance of the proposed scheme is significantly better (8 to 10 dB improvement) than the MUSIC and ESPRIT schemes. The performance of ESPRIT is better than MUSIC for lower SNR while the opposite is true for higher SNR. As both MUSIC and ESPRIT are blind CFO estimation algorithms, CFO estimation accuracy can be increased by employing data symbols for CFO estimation. However, it will increase the complexity and latency of CFO estimation. Compared to APFE, the proposed algorithm offers about 1 dB performance improvement for SNR >5 dB. However, it should be noted that the complexity of the proposed algorithm is significantly less than APFE as discussed in Section 5. The complexity of CFO estimation in[16] can be lowered by using sub-optimal algorithms at the cost of degradation of CFO estimation accuracy. Compared to the MCRB, the performance of the proposed algorithm is consistently close to the absolute reference provided by MCRB.

### 6.5 Comparison of CFO estimation performance with increasing number of users

*M*) and compare it with the existing schemes in the literature. The number of users varies from a single user (

*M*= 1) to full load (

*M*= 8) while the operating SNR is 20 dB. The results are shown in Figure7.

As the number of users increases, MUI increases, and the performance of each algorithm decreases. As the sub-space-based algorithms cannot operate on full load, the performance of MUSIC and ESPRIT algorithms is severely degraded on full load. The performance of the proposed algorithm is consistently better than the other algorithms while it also supports full-load transmissions.

### 6.6 Performance of CFO compensation and channel estimation

### 6.7 Bit error rate performance

## 7 Conclusions

We have presented CFO estimation and compensation algorithms for SC-FDMA UL transmissions with interleaved sub-carrier allocation scheme. The proposed algorithm is based on PARAFAC and guarantees the identifiability of CFO estimation. Moreover, the proposed scheme allows the system to operate on full load gaining better spectral efficiency. Comparisons with the existing schemes in the literature demonstrate that the proposed algorithms have lower complexity and offer considerable performance improvements for both CFO estimation and compensation in SC-FDMA UL receivers.

## Appendix

### A Derivation of Equation 19

*ϕ*

_{ m }for a specific 0 ≤

*m*≤

*M*- 1, we can estimate

*ϕ*

_{ m }for each user by calculating

*f*

_{ m }explained in Appendix B. However, as the column of$\stackrel{\u02c6}{\mathbf{A}}$ may be permuted, the calculation of Equation 28 using

*m*th column of$\stackrel{\u02c6}{\mathbf{A}}$ may not correspond to the

*m*th user. As

*ϕ*

_{ m }=

*f*

_{ m }+

*v*

_{ m }for each

*m*lies in a unique interval as explained in Section 3.2, the column permutation can be identified by identifying which interval does the${\stackrel{\u02c6}{\varphi}}_{m}$ for a particular

*m*belongs to. This can be done by calculating the following metric:

that identifies, which${\stackrel{\u02c6}{\varphi}}_{{m}^{\prime}}0\le {m}^{\prime}\le M-1$ corresponds to the *m* th user. Plugging in the value of${\stackrel{\u02c6}{\varphi}}_{{m}^{\prime}}$ from Equation 28 in the above metric concludes to Equation 19.

### B Derivation of Equation 20

**B**in Equation 15 is a

*Q*×

*M*matrix and

*m*th, 0 ≤

*m*≤

*M*- 1 column of

**B**contains complex exponentials corresponding to the CFO of the

*m*th user. Therefore, dividing the

*q*th entry of the

*m*th column of$\stackrel{\u02c6}{\mathbf{B}}$ by the

*q*-1 entry gives an estimate of

The CFO of the *m* th user can then be estimated by calculating the angle of the estimate in Equation 31 and scaling it by$\frac{N}{2\pi {N}_{t}}$, which concludes to the Equation 20.

## Declarations

## Authors’ Affiliations

## References

- IEEE standard for information technology—telecommunications and information exchange between systems local and metropolitan area networks—specific requirements part 11: wireless LAN medium access control (MAC) and physical layer (PHY) specifications IEEE Std 802.11-2012 (Revision of IEEE Std 802.11-2007) 2012, 1-2793.Google Scholar
- IEEE standard for local and metropolitan area networks part 16: air interface for fixed and mobile broadband wireless access systems amendment 2: physical and medium access control layers for combined fixed and mobile operation in licensed bands and corrigendum 1 IEEE Std 802.16e-2005 and IEEE Std 802.16-2004/Cor 1-2005 (Amendment and Corrigendum to IEEE Std 802.16-2004) 2006, 1-822.Google Scholar
- 3rd generation partnership project (3GPP) technical specification group radio access network; physical layer aspects for evolved universal terrestrial radio access (UTRA) (Release 7) 3GPP TR 25.814, V7.1.0 2006.Google Scholar
- Myung HG, Lim J, Goodman DJ: Single carrier FDMA for uplink wireless transmission.
*IEEE Veh. Mag*2006, 1(3):30-38.View ArticleGoogle Scholar - Myung HG, Lim J, Goodman DJ: Peak-to-average power ratio of single carrier FDMA signals with pulse shaping. In
*IEEE 17th International Symposium on Personal, Indoor and Mobile Radio Communications*. Helsinki; 11–14 Sept 2006:1-5.View ArticleGoogle Scholar - Morelli M, Kuo CCJ, Pun MO: Synchronization techniques for orthogonal frequency division multiple access (OFDMA): a tutorial review.
*Proc. IEEE*2007, 95(7):1394-1427.View ArticleGoogle Scholar - Ciochina C, Sari H: A review of OFDMA and single-carrier FDMA. In
*2010 European Wireless Conference (EW)*. Lucca; 12–15 Apr 2010:706-710.View ArticleGoogle Scholar - Gul MMU, Lee S, Ma X: Which one is more sensitive to carrier frequency offsets - OFDMA or SC-FDMA? In
*IEEE Military Communications Conference, - MILCOM 2011*. Baltimore; 7–10 Nov 2011:2194-2199.Google Scholar - Barbarossa S, Pompili M, Giannakis GB: Channel-independent synchronization of orthogonal frequency division multiple access systems.
*IEEE J. Sel. Area Comm*Feb. 2002, 20(2):474-486. 10.1109/49.983375View ArticleGoogle Scholar - Yao Y, Giannakis GB: Blind carrier frequency offset estimation in SISO, MIMO, and multiuser OFDM systems.
*IEEE Trans. Comm*2005, 53(1):173-183. 10.1109/TCOMM.2004.840623MathSciNetView ArticleGoogle Scholar - van de Beek JJ, Börjesson PO, Boucheret ML, Landström D, Arenas JM, Odling P, Ostberg C, Wahlqvist M, Wilson SK: A time and frequency synchronization scheme for multiuser OFDM.
*IEEE J. Sel. Area Comm*1999, 17(11):1900-1914. 10.1109/49.806820View ArticleGoogle Scholar - Sanguinetti L, Morelli M: A low-complexity scheme for frequency estimation in uplink OFDMA systems.
*IEEE Trans. Wireless Comm*2010, 9(8):2430-2437.View ArticleGoogle Scholar - Gul M, Lee S, Ma X: Carrier frequency offset estimation for OFDMA uplink using null sub-carriers.
*Digit Signal Process*2014, 29: 127-137.MathSciNetView ArticleGoogle Scholar - Wang H, Yin Q: Multiuser carrier frequency offsets estimation for OFDMA uplink with generalized carrier assignment scheme.
*IEEE Trans. Wireless Comm*2009, 8(7):3347-3353.MathSciNetView ArticleGoogle Scholar - Sun P, Zhang L: Low complexity pilot aided frequency synchronization for OFDMA uplink transmission.
*IEEE Trans. Wireless Comm*2009, 8(7):3758-3769.View ArticleGoogle Scholar - Pun M-O, Morelli M, C-Kuo CJ: Maximum-likelihood synchronization and channel estimation for OFDMA uplink transmissions.
*IEEE Trans. Commun*2006, 54(4):726-736.View ArticleGoogle Scholar - Wang Z, Xin Y, Mathew G: Iterative carrier-frequency offset estimation for generalized OFDMA uplink transmission.
*IEEE Trans. Wireless Commun*2009, 8(3):1373-1383.View ArticleGoogle Scholar - Cao Z, Tureli U, Yao YD: Deterministic multiuser carrier-frequency offset estimation for interleaved OFDMA uplink.
*IEEE Trans. Commun*2004, 52(9):1585-1594. 10.1109/TCOMM.2004.833183View ArticleGoogle Scholar - Lee J, Lee S, Bang KJ, Cha S, Hong D: Carrier frequency offset estimation using ESPRIT for interleaved OFDMA uplink systems.
*IEEE Trans. Veh. Tech*2007, 56(5):3227-3231.View ArticleGoogle Scholar - Chang A-C, Yang S-H: An iterative MVDR CFO estimation for interleaved OFDMA uplink systems.
*Computing, Communications and IT Applications Conference (ComComAp), 2013*2013, 153-158.View ArticleGoogle Scholar - Zhang W, Gao F, Yin Q, Nallanathan A: Blind carrier frequency offset estimation for interleaved OFDMA uplink.
*IEEE Trans. Signal Process*2012, 60(7):3616-3627.MathSciNetView ArticleGoogle Scholar - Hsieh H-T, Wu WR: Blind maximum-likelihood carrier-frequency-offset estimation for interleaved OFDMA uplink systems.
*IEEE Trans. Veh. Tech*2011, 60(1):160-173.View ArticleGoogle Scholar - Cheng P, Chen Z, Guo YJ, Gui L: Distributed Bayesian compressive sensing based blind carrier-frequency offset estimation for interleaved OFDMA uplink. In
*IEEE International Symposium on Personal Indoor and Mobile Radio Communications, PIMRC 2013*. London; 8–11 Sept 2013:801-806.Google Scholar - Huang D, Letaief KB: An interference-cancellation scheme for carrier frequency offsets correction in OFDMA systems.
*IEEE Trans. Comm*2005, 53(7):1155-1165. 10.1109/TCOMM.2005.851558View ArticleGoogle Scholar - Marabissi D, Fantacci R, Papini S: Robust multiuser interference cancellation for OFDM systems with frequency offset.
*IEEE Trans. Wireless Comm*2006, 5(11):3068-3076.View ArticleGoogle Scholar - Cao Z, Tureli U, Yao Y-D: Low-complexity orthogonal spectral signal construction for generalized OFDMA uplink with frequency synchronization errors.
*IEEE Trans. Veh. Tech*2007, 56(3):1143-1154.View ArticleGoogle Scholar - Al-Kamali FS, Dessouky MI, Sallam BM, Shawki F, Al-Hanafy W, El-Samie FEA: Joint low-complexity equalization and carrier frequency offsets compensation scheme for MIMO SC-FDMA systems.
*IEEE TransWireless Comm*2012, 11(3):869-873.Google Scholar - Zhu Y, Ben Letaief K: CFO estimation and compensation in SC-IFDMA systems.
*IEEE Trans. Wireless Comm*2010, 9(10):3200-3213.View ArticleGoogle Scholar - Fan D, Cao Z: Carrier frequency offset compensation for an interleaved OFDMA uplink.
*Tsinghua SciTechnol*2008, 13(1):1-8.View ArticleGoogle Scholar - Comon P, Luciani X, De Almeida AL: Tensor decompositions, alternating least squares and other tales.
*J. Chemometr*2009, 23(7–8):393-405.View ArticleGoogle Scholar - Sidiropoulos ND, Giannakis GB, Bro R: Blind PARAFAC receivers for DS-CDMA systems.
*IEEE Trans. Signal Process*2000, 48(3):810-823. 10.1109/78.824675View ArticleGoogle Scholar - Nion D, De Lathauwer L: An enhanced line search scheme for complex-valued tensor decompositionsApplication in DS-CDMA.
*Signal Process*2008, 88(3):749-755. 10.1016/j.sigpro.2007.07.024MathSciNetView ArticleMATHGoogle Scholar - Jiang T, Sidiropoulos ND: A direct blind receiver for SIMO and MIMO OFDM systems subject to unknown frequency offset and multipath,. In 4th IEEE Workshop on Signal Processing Advances in Wireless Communications. Rome; 15:358-362.Google Scholar
- De Almeida A, Favier G, Mota J: Parafac-based unified tensor modeling for wireless communication systems with application to blind multiuser equalization.
*Signal Process*2007, 87(2):337-351. 10.1016/j.sigpro.2005.12.014View ArticleMATHGoogle Scholar - Sidiropoulos ND, Bro R, Giannakis GB: Parallel factor analysis in sensor array processing.
*IEEE Trans. Signal Process*2000, 48(8):2377-2388. 10.1109/78.852018View ArticleGoogle Scholar - De Almeida A, Favier G, Mota J: A constrained factor decomposition with application to MIMO antenna systems.
*IEEE Trans. Signal Process*2008, 56(6):2429-2442.MathSciNetView ArticleGoogle Scholar - Ma X, Tepedelenlioğlu C, Giannakis GB, Barbarossa S: Non-data-aided carrier offset estimators for OFDM with null subcarriers: identifiability, algorithms, and performance.
*IEEE J. Sel. Area Comm*2001, 19(12):2504-2515. 10.1109/49.974615View ArticleGoogle Scholar - Kruskal JB: Rank, decomposition, and uniqueness for 3-way and n-way arrays. In
*Multiway Data Analysis*. North-Holland, Amsterdam; 1989:7-18.Google Scholar - Grädel E, Kolaitis PG, Libkin L, Marx M, Spencer J, Vardi MY, Venema Y, Weinstein S:
*Finite Model Theory and Its Applications*. Springer, Heidelberg; 2007.MATHGoogle Scholar - Schmidl TM, Cox DC: Robust frequency and timing synchronization for OFDM.
*IEEE Trans. Comm*1997, 45(12):1613-1621. 10.1109/26.650240View ArticleGoogle Scholar - Ozdemir MK, Arslan H: Channel estimation for wireless OFDM systems.
*IEEE Commun. Surv. Tutorials*2007, 9(2):18-48.View ArticleGoogle Scholar - Eisinberg A, Fedele G: On the inversion of the vandermonde matrix.
*Appl. Math. Comput*2006, 174(2):1384-1397. 10.1016/j.amc.2005.06.014MathSciNetView ArticleMATHGoogle Scholar - Li Y, Minn H, Zeng J: An average Cramer-Rao bound for frequency offset estimation in frequency-selective fading channels.
*IEEE Trans. Wireless Comm*2010, 9(3):871-875.View ArticleGoogle Scholar

## Copyright

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited.