Skip to main content

Non-maximally decimated filter bank-based single-carrier receiver: a pathway to next-generation wideband communication


We present the design of a wideband digital modem based on non-maximally decimated filter bank (NMDFB) with perfect reconstruction (PR) property. The PR-NMDFB contains an analysis filter bank (AFB) and a synthesis filter bank (SFB) whose efficient polyphase forms are named as polyphase analysis channelizer (PAC) and polyphase synthesis channelizer (PSC). The waveform being processed is the legacy square root Nyquist-shaped quadrature amplitude modulation (QAM). In contrast to orthogonal frequency division multiplexing (OFDM) systems, the shaped QAM transmission has much superior performance properties in throughput, peak-to-average power ratio (PAPR), and synchronization. We will show the PR-NMDFB is capable of efficiently performing several key tasks of a digital receiver with dramatic workload reduction. This includes digital filtering, carrier recovery, and symbol timing recovery. Moreover, the nature of NMDFB allows the signal processing to operate a significantly reduced sample rate, which is a desired characteristic for replacing current FIR implementation in wideband systems.

1. Introduction

The wireless technology has experienced significant growth in the past decades; and we have seen generations of wireless communication systems increasing their bandwidth and data rates by more than an order of magnitude per generation. Current systems offer 100-Mbps data rates in 20-MHz bandwidth links. We can expect future generation wireless systems to offer 1 Gbit/s data rates with 500-MHz bandwidth links. As the demands for high data rate communication grows, the conventional methods, i.e., shaped quadrature amplitude modulation (QAM), become seemingly incapable of dealing with highly dispersive channels in a cost-effective manner. The legacy receivers often require building several time offset overlapped finite impulse response (FIR) filters operating in parallel to perform synchronization, matched filtering, as well as channel equalization. As the signal bandwidth grows comparable to the hardware's clock rate, any FIR filtering becomes extremely expensive. Meanwhile, the frequency-selective channel requires very sophisticated equalizers, which further increases the difficulty in fitting the legacy QAM waveform into today's communication systems. To overcome this problem, many broadband techniques[1] such as spread spectrum and multicarrier transmission were introduced. Although the orthogonal frequency division multiplexing (OFDM) technique[2] suffers from problems such as high peak-to-average power ratio (PAPR), sensitivity to carrier frequency offset[3], and time and frequency guard spans, it has become dominant because of its simple equalizer structure for highly frequency-selective channels. An unappreciated advantage of OFDM is its fast Fourier transform (FFT)-based block processing nature[2], which not only takes advantage of FFT's efficiency but also reduces the hardware processing speed, due to the serial/parallel conversion at the transmitter and receiver.

The modern research towards wideband communication systems can be generally categorized as follows: (1) block-based transmission with cyclic prefix-occupied guard interval, which includes the conventional OFDM system and discrete Fourier transform (DFT)-precoded OFDM single-carrier modulation[37]; (2) non-block-based continuous streaming signal format, without cyclic prefix or guard interval, which includes filter bank (FB)-based solutions such as filter bank multicarrier offset QAM[810] and filter bank single-carrier (FBSC) transmission[1114]. The FBSC transmission is not well studied compared to FBMC, but it is believed to have significant influence in future high-throughput transmissions such as backhaul modems. In this paper, we shall focus on developing a unified receiver structure for FBSC transmissions. Figure 1 shows a perfect reconstruction non-maximally decimated filter bank (PR-NMDFB)-based FBSC receiver that embeds the carrier frequency offset (CFO) removal, fractional delay filter (FDF) for finding the correct symbol timing, matched filter (MF), and equalization process in between the polyphase analysis channelizer (PAC) and polyphase synthesis channelizer (PSC). These signal processing tasks are traditionally implemented by separate filters in the time domain, and we will show that they can also be implemented in the FB-transformed domain as independent intermediate processing elements (IPEs). Since the signals presented at PAC parallel output ports are highly decimated, the IPE processing demonstrates impressive workload reduction compared to the conventional time domain methods.

Figure 1
figure 1

High-level architecture of FBMC receiver.

Several authors have studied the FBSC from equalization perspectives and have shown the equalization task is closely related to subband adaptive filtering (SAF)[15]. The authors in[12] show the preliminary examples on FB-based channel identification. The authors in[11] demonstrated the design of fractionally spaced equalizer (FSE) via NMDFBs (oversampled FBs). The authors in this paper address the design of FBSC from the synchronization perspective. In particular, we will introduce the recently proposed[16] PR-NMDFB-based digital filtering concept which allows us to carry out much more rigorous performance analysis towards FBSC compared to the past works[1114]. Note that the PR-NMDFB in practice will become near-perfect reconstruction (NPR) due to practical realization; however, the bounds on reconstruction errors require arithmetic with sufficient bit width to support the stop-band attenuation levels.

We will present the NMDFB-based carrier frequency recovery, symbol timing recovery, and matched filtering (MF) process. These independent tasks were not discussed in the past works on FBSC. The novelty of this paper is as follows: (1) we present the PR-NMDFB filtering-based formulation for FBSC and (2) propose the synchronization techniques for FBSC in the FB-transformed domain. To our best knowledge, these topics were not presented before or studied in depth.

The organization of this paper is as follows: Section 2 introduces the PR-NMDFB and its filtering property, Section 3 presents the carrier frequency recovery technique, Section 4 solves the symbol time recovery problem, Section 5 performs the complexity analysis, Section 6 presents the simulation results, and Section 7 draws the conclusion.

2. Review of PR-NMDFB and its filtering property

The basic building blocks of FBSC modems are based on oversampled or non-maximally decimated FBs with PR support, which are in general considered well studied[1719] and have numerous variations[2024]. The implementation of FBSC can be viewed using FB to perform digital filtering in the channelizer domain; filtering is achieved by altering the gain and phase over each spectral span presented at analysis filter bank (AFB) outputs. In order to accomplish this FB-based filter task, several restrictions on FB need to be met: (1) it must have PR or near-PR property. (2) The aliasing cancelation should not involve other channels; as a counter example, the cosine-modulated filter bank (CMFB)[24] requires adjacent channels to cancel aliasing thus generally cannot be used in communication systems. (3) Efficient polyphase implementation must exist, preferably with fixed channel size M and adjustable decimation factor D (D < M). One filter bank design that meets these conditions is proposed by Karp, in Figure nine of[20] under the name 'DFT polyphase realization,’ which serves as an intermediate step to derive the polyphase implementation of modified DFT (MDFT)[20, 25].

2.1 Background on PR-NMDFB

The theoretical model of a generalized DFT-based PR-NMDFB is shown in Figure 2. The AFB contains M band-pass filters (BPFs), whose Z-transforms are A m (Z), for m = 0, 1, …, M - 1. The M BPFs have equal bandwidth and are centered on digital frequencies θ m = 2 π M m , for m = 0, 1, …, M - 1. Let a(n) be the impulse response of the low-pass prototype filter (LPPF). The impulse response of the m th BPF is a m n = a n e j 2 π M mn , whose Z-transform is A m Z = A e - j 2 π M m Z = A W M m Z , and W M def ¯ ¯ e - j 2 π M . A down-sampling by a factor D (D < M), which is an integer that divides M, follows each BPF. As a standard practice, we require the AFB filtered and down-sampled signal to be centered on zero frequency, and this is done via a set of complex rotators whose values are e - j 2 π M mnD . The AFB-transformed signal is then processed by the intermediate processing element denoted as IPE at deeply decimated sampling rate. The synthesis filter bank (SFB) performs the exact inverse processes of AFB: heterodyne, up-sampling, and filtering. It can be shown[16] that the Z-transform of the output signal z(n) is

Figure 2
figure 2

Generalized M -path non-maximally decimated filter bank model.

Z Z = 1 D G 1 × M T Z K M × M A M × D Z R D × 1 Z = 1 D T K 1 × D Z R D × 1 Z = 1 D T s K Z R Z + 1 D T A K Z R ¯ Z

where G Z = G Z W M 0 G Z W M M - 1 T are the BPFs for SFB, K M × M = diag K 0 K M - 1 is the IPE complex scalar gain applied in between AFB and SFB, column vector R Z = R Z W D 0 R Z W D D - 1 T = R Z W D 0 R ¯ Z T is the modulated versions of the input signal, matrix A M × D Z is defined as

A Z = A Z W M 0 W D 0 A Z W M 0 W D D - 1 A Z W M M - 1 W D 0 A Z W M M - 1 W D D - 1 M × D , = A M × 1 | A ¯ M × D - 1 M × D

and T K 1 × D Z G 1 × M T Z K M × M A M × D Z = T s K Z T A K Z is the total transfer function (TF) for the M-path, decimated by D, AFB, and SFB. T s K Z G 1 × M T Z K M × M A M × 1 Z is the desired signal TF, whereas T A K Z G 1 × M T Z K M × M A ¯ M × D - 1 Z is the undesired aliasing TF. We require the NMDFB to have PR property[20, 25]. This condition translates to (1) zero aliasing TF A Z W D d G Z = 0 , d = 1 , , D - 1 and (2) distortionless signal TF, A(Z) G(Z) = HNYQ(Z), where HNYQ(Z) is any Nyquist pulse.

2.2 The filtering property of PR-NMDFB

The PR-NMDFB-based filtering is discussed intensively in[16]; here, we briefly review its concept and establish our notation. Given the discrete time Fourier transform (DTFT) of a target spectrum S(ω) and its finite time duration or truncated impulse responses s(n), n = 0, 1, … Nmax, the goal is to synthesize or approximate it via an M-path PR-MNDFB.

Consider the NMDFB and time domain filtering model shown in Figure 3a,b. In Figure 3a, the input signal r(n) is passed through an M-path, decimation by D, AFB/SFB with diagonal IPE matrix K M × M , whose (m M)th entry is a complex scalar denoted as K m , for m = 0, 1, …, M - 1. In Figure 3b, the same signal r(n) is fed into an FIR filter S Z = Z s n and then delayed by n D , where n D is the total delay introduced by the M-path, decimation by D, AFB/SFB. Using Equation 1, the Z-transform of the output signal zf(n) corresponding to Figure 3a is written as Z f Z = T s K Z R Z + T A K Z R ¯ Z . As mentioned earlier, the aliasing energy can be made arbitrarily small by increasing AFB/SFB's stop-band attenuation. We now assume the aliasing energy is small and ignore it in the analysis. Therefore, we rewrite Zf(Z) as

Figure 3
figure 3

NMDFB domain filtering (a) and time domain filtering (b).

Z f Z = T s K Z R Z

The Z-transform of the output signal zt(n) corresponding to the time domain filtering model shown in Figure 2b can be written as

Z t Z = S Z Z - n D R Z

The difference between the two filtering models can be defined as

ε Z = T s K Z Z n D - S Z R Z Z - n D

Following Equation 4, we can further define the error TF to be

T ε Z = T s K Z Z n D - S Z = m = 0 M - 1 K m H NYQ Z W M m Z n D - S Z

Clearly, if the error TF  T ε (Z) is zero or can be made arbitrarily small, then the NMDFB domain filtering becomes equivalent to the time domain filtering. Recall that n D is total delay of the Nyquist pulse H NYQ Z W M m . Thus, the term H NYQ Z W M m Z n D is the zero-phase, non-causal version of the chosen Nyquist pulse that has a real spectrum. Let us denote the zero-phase Nyquist channel to be H ˜ NYQ Z W M m . We can further split Equation 5 into real and imaginary parts as shown in Equation 6:

T ε ω = m = 0 M - 1 Re K m H ˜ NYQ ω - ω m - Re S ω + j m = 0 M - 1 Im K m H ˜ NYQ ω - ω m - Im S ω

Equation 6 suggests the real and imaginary part of K m can be independently designed based on the real and imaginary part of the target spectrum S(ω). In addition, H ˜ NYQ ω - ω m , is also a design parameter, since any waveforms satisfying the Nyquist channel condition can be used to design the analysis/synthesis LPPF. Note that the goal here is to approximate the target spectrum S(ω). One option, as it is shown in[16], is to set HNYQ(ω) to be a gate function, i.e., sinc function in the time domain. This selection produces a piecewise constant approximation of the target spectrum S(ω), and Equations 5 and 6 can be viewed as the nearest neighbor interpolation error. Alternatively, setting H ˜ NYQ ω to be a triangular-shaped Nyquist pulse, i.e. sinc2 in the time domain, produces a frequency domain linear interpolation between adjacent filter banks which means approximately a straight line interpolation of S(ω), and Equations 5 and 6 describe the linear interpolation error. In this case, the gain and phase distortion of the approximated spectrum are directly proportional to the second derivative of the target spectrum and inversely proportional to the squared analysis/synthesis path number M. In this paper, we shall adopt the triangular-shaped Nyquist pulse:

h ˜ NYQ n = 1 M sinc 2 1 M n , for - n d n n d
H ˜ NYQ ω - ω m = 1 - ω - ω m 2 π / M , ω ω m - 2 π M , ω m + 2 π M 0 , otherwise

In practice, as detailed in[16], this filter can be designed by using a windowed (i.e., Kaiser) sinc function.

One preliminary application of the filtering property in a FBSC receiver is to perform square root raised cosine (SRRC) shaping or matched filtering. In this case, the target spectrum S(ω) can be set equal to the ideal SRRC spectrum, and the IPE gains K m  = S(ω m ). Due to the underlying triangular pulse shape, the approximated spectrum by K m , Km + 1 for ω [ω m , ωm + 1], can be written as {(ωm + 1 - ω) K m  + ωKm + 1}. Using Equation 5 and only considering region ω [ω m , ωm + 1], we can define the path-wise error TF on the interval ω [ω m , ωm + 1] as

T ε , m ω def ¯ ¯ ω m + 1 - ω K m + ω K m + 1 - S ω

Set K m  = S(ω m ) and Km + 1 = S(ωm + 1) and assume the target spectrum is analytic and twice differentiable, we can expand S(ω) using Taylor series around = ω m ; we find

S ω = S ω m + S ˙ ω m ω - ω m + 1 2 S ¨ η ω - ω m 2

for some η [ω m , ωm + 1]. Now, the maximum gain error within region [ω m , ωm + 1] can be bounded by

T ε , m ω ω ω m , ω m + 1 Max S ¨ ω 1 2 π M 2 = B ε , m

Let ξ s , m def ¯ ¯ ω ω m - π M , ω m ± π M min S ω ; the maximum phase distortion ϕ m can be readily obtained as

ϕ m atan B ε , m ξ s , m 2 - B ε , m 2 , for ξ s , m > B ε , m

Note that, if γs,m ≤ Bε,m, the phase distortion is from - π to π. However, this also implies the target spectrum S(ω) has significant attenuation at ω m and the signal around frequency ω m may not have significant value, i.e., in the stop-band of the target spectrum.

3. PR-NMDFB-based carrier frequency recovery

The conventional maximum likelihood carrier recovery approaches[2628], as shown in Figure 4, requires implementing a pair of band-edge (BE) filters that operate in parallel with the SRRC filter. The two BE filters are centered on the received signal's left and right transition bands. The carrier recovery is achieved by balancing the signal energy observed at the output of two BE filters. The offset correction process is achieved via a digital phase locked loop (PLL) and a direct digital synthesizer (DDS). As derived in[27], the optimal BE or frequency matched filter has frequency response equal to the frequency derivative of the SRRC MF, as shown in Figure 5a. Due to the difficulty of designing optimal BEs' sharp transition band, suboptimal BE filter is often used, as shown in Figure 5b, which makes BE filters have the same length as the SRRC MF filter. As the signal's bandwidth grows, not only the power consumption of these three filters grows, but it also adds difficulty to lay out high-speed circuits running at the speed of many times the input rate.

Figure 4
figure 4

Band-edge filter frequency locked loop.

Figure 5
figure 5

Spectra of SRRC MF, corresponding frequency MF, and spectral response of their cascade (a); suboptimal BE filters (b).

Let us examine a baseband equivalent system for a QAM signal with complex notation for the in-phase (I) and quadrature (Q) branches, e.g., Re(.) and Im(.), respectively. Denote the k th complex QAM data symbol as S k , with the symbol period T in seconds. Let the transmitter's SRRC shaping pulse be h tx (t) and additive white noise n(t) = n I (t) + jn Q (t). Thus, the received and digitized signal with the sampling speed T s = 1 f s = T / 2 , i.e., two samples per symbol is written as

r n = e j ω c n + ψ k = - S k h t x n - 2 k - τ + n n

where e j ω c n + ψ is the residue CFO and phase offset after down-conversion and τ is the sampler time offset. Note, we do not consider the multipath channel effects in this paper. The equalization for FBSC is treated in[11]; it can be easily shown from PR-NMDFB filtering property that setting IPE as the linear equalizer response is equivalent to an FIR linear equalizer.

Figure 6 shows the proposed CFO recovery scheme. The CFO and constellation phase offset removal is achieved via dedicated IPEs embedded in between the receiver's PAC and PSC along with other filtering tasks such as matched filtering and equalization. The IPE-based CFO correction converts the high-speed time domain DDS into a collection of low-speed DDSs working in the channelizer-transformed domain. The BE filters can surely be implemented via the NMDFB filtering property along with the receiver's MF and equalizer. Moreover, since the BE filters[27] were oversampled (see Figure 5b) and only the BE output signal energy is needed, they can be synthesized with reduced sample rate via PSC with much fewer paths as depicted in Figure 6. The constellation phase offset removal is a decision-directed process and happens after the CFO correction. It uses the same IPE as used in the CFO loop, and it is relatively simple to design. In this paper, we only focus on developing the BE filtering and CFO correction process; we will derive the operating conditions of IPE-based CFO correction, and we will show the design of PR-NMDFB-based BE filters.

Figure 6
figure 6

FB-transformed domain CFO recovery.

3.1 CFO correction in filter -bank-transformed domain

To compensate the CFO, the signals presented at the output of PAC shall be de-rotated by e - j ω c nD . Let us only consider the CFO correction IPE and ignore other IPEs for now, i.e., setting K M × M = I . The Z-transform of the output signal z(n) after frequency shift IPE is written as

Z Z = 1 D G 1 × M T Z K M × M A M × D Z e j ω c R D × 1 Z e j ω c = 1 D T s K Z R Z e j ω c + 1 D T A K Z R ¯ Z e j ω c

where T s K Z def ¯ ¯ G 1 × M T Z K M × M A M × 1 Z e j ω c and T A K Z def ¯ ¯ G 1 × M T Z K M × M A ¯ M × D - 1 Z e j ω c . The aliasing cancelation condition is verified if T A K Z = 0 , which can be translated to: A Z W D d e j ω c G Z = 0 , d = 1 , , D . It should be noted that the aliasing cancelation comes from the NMDFB prototype filter design and is unaffected by scalar diagonal matrix. The frequency-translated PR condition is verified if T s K Z = m = 0 M - 1 A Z W M m e j ω c G Z W M m = Z - n D . This condition is met if the composite response A Z e j ω c G Z produces Nyquist pulse HNYQ(Z) or its frequency-shifted version H NYQ Z e j ω c . We can design either A(Z) or G(Z) to be a Nyquist pulse and let the other filter be a low-pass filter that passes the Nyquist pulse unaltered. Equation 12 shows the impact of the two options, and we can see both choices produce the frequency-shifted version of the input signal when K M × M = I as along as the PR and aliasing cancelation conditions are met.

A Z e j ω c G Z = H NYQ Z e j ω c T s K Z = m = 0 M - 1 K m H NYQ Z W M m e j ω c
A Z e j ω c G Z = H NYQ Z T s K Z = m = 0 M - 1 K m H NYQ Z W M m

Figure 7 depicts the example of the analysis and synthesis LPPF for an M-path, decimation by D = M / 3, PR-NMDFB that matches Equation 12b. The analysis LPPF is assigned to be a low-pass filter, while the synthesis LPPF is chosen to be a triangular-shaped Nyquist pulse. For D = M/3 case, the PAC outputs have sample rate 3 × f s M or range from - 3 π M to 3 π M given f s  = 2π. As seen from Figure 7, one can frequency shift the analysis LPPF without raising aliasing energy inside the 'aliasing free shift region.’ However, the aliasing error will occur if we shift into the 'aliasing penalty shift region,’ i.e., CFO ω c   approaches π M . Clearly, this aliasing error is due to the transition band of the analysis LPPF as depicted in Figure 7. Note that a maximum frequency shift by π M is all we need to achieve full-range center frequency change, i.e., ω c - f s 2 , f s 2 in PR-NMDFB-based FBMC receiver. This is because one can cyclic shift the PAC outputs to account for CFO greater than half of the NMDFB channel spacing, recall the path/channel spacing is 2 π M in an M-path PR-NMDFB. Furthermore, it can be easily prove that the aliasing penalty shift region can be completely eliminated by having D = M/4, yet reducing the decimation rate for this option also raises the system's workload.

Figure 7
figure 7

Analysis and synthesis low-pass prototype filter (LPPF).

3.2 CFO detection in FB-transformed domain

The CFO detection is based on a pair of BE filters centered on the left and right of the received signal's transition bands. They can be efficiently integrated in the NMDFB structure via spectral shaping IPEs. As shown in[27], the target spectrum of the two BE filters is the frequency derivative of the SRRC shaping filter's left and right transition bands. To synthesize an NMDFB-based BE filter, one simply sets the target spectrum S(ω) to be the ideal BE filter spectrum with IPE gain K m  = S(ω m ). Unlike the conventional time domain implementation (Figure 5b) in which suboptimal, symmetrically extended BE filters have to be built to avoid designing sharp filter transition bands, the PR-NMDFB-based filter allows us to directly synthesize the ideal BE filter spectrum and produce near-optimal frequency response as shown in Figure 5a. We will further demonstrate this in the simulation section.

Serving as the error detector within a PLL, the input signal to the BE filter must already be frequency-shifted or error-compensated as shown in Figure 4. Re-examine Equation 12 and consider a non-trivial IPE, i.e., K M × M I , Equation 12a corresponds to filter the input data and then deliver the heterodyned output, whereas Equation 12b maps to first heterodyne the input signal and then apply the filtering. Clearly, the arrangement in Equation 12b matches the requirement of a loop control system. Therefore, the NMDFB implementation requires the analysis LPPF to be a low-pass filter and synthesis LPPF to be any Nyquist pulse, i.e., Figure 7. Another detail which is worth noting is that the conventional BE design produces an oversampled filter (see Figure 5). The oversampling ratio is related to the SRRC filter's transition bandwidth or the roll-off factor. Mapping to the NMDFB implementation, this means only a certain group of entries of BE filter's IPE matrix K M × M is non-zero. Equivalent of saying, only a portion of FB channels are needed to synthesize a BE filter. Therefore, M′-path, reduced size PSC is used to only synthesize the FB channels that fall into the signal's left or right transition bands. This saves computational resources and increases the efficiency of the proposed architecture.

4. PR-NMDFB-based symbol timing recovery

The goal of a symbol timing synchronizer is to sample the MF output at optimum instant for the k th symbol. The conventional analog solution controls a sample-and-hold circuit located at the output of the analog MF to identify the optimum sampling instant. It is also well known that[29, 30] the log-likelihood function for unknown timing phase τ for equally likely S k  {-1, + 1} is

Λ τ = k ln cosh 2 E N 0 y kT + τ

where E is the symbol energy, N0 is the power spectral density of zero-mean Gaussian noise, and y(kT + τ) is the analog MF output. The estimate τ ^ that maximizes Λ(τ) is the timing phase that forces the derivative of Λ(τ) to zero.

k tanh 2 E N 0 y kT + τ d dt y kT + τ = 0

Often, the tanh(.) function is replaced by its small signal approximation tanh(x) ≈ x for low signal-to-noise ratio (SNR) and tanh(x) ≈ sign(x) for high SNR. The authors in[29] proposed full digital timing recovery techniques based on polyphase FB and have shown various timing error detection (TED) methods via FBs. In this paper, we adopt the TED formed by taking the product between the MF output and the derivative MF output, i.e., Figure ten of[29]. The derivative MF is produced by taking the derivative to the time domain SRRC impulse response; this should not be confused with BE filter which is formed by taking the frequency domain derivative of the SRRC spectra.

4.1 Timing error correction in FB-transformed domain

The digital timing error correction requires building a fractional delay filter (FDF). The FDF is conventionally implemented in the time domain either via Farrow structure[31] which is a polynomial-based interpolator or via polyphase interpolators[26]. In this section, we introduce the NMDFB-based FDF.

The time domain perspective has led us to build interpolators acting on the desired time grid between available samples. Notice that the basic Fourier transform property states that time delay is phase shift in the frequency domain, and the phase shifter has frequency response S(Ω) = ejΩτ, where Ω is the continuous frequency in Fourier transform. Using K m  = S(ω m ), for ω m = 2 π M m , m = 0 , 1 , , M - 1 , gives K m = e j 2 π M , τ 0.5 , m M 2 . Note that the DTFT must have a periodic spectrum. Therefore, we force KM/2, the channel located on the half sample rate, to be the average of its adjacent neighbors: K M / 2 = 1 2 K M / 2 - 1 + K M / 2 + 1 . Doing so, we made the K m , m = 0, …, M - 1 periodic, but we also sacrificed the half sample rate channel. The operation bandwidth for this fractional delay filter becomes ω - π + π M , π - π M .

K m = e j 2 π M , f or m M 2 1 2 K M / 2 - 1 + K M / 2 + 1 , f or m = M 2

From the spectral approximation point of view, the problem becomes using M spectral samples to approximate complex sinusoid e j 2 π M up to half a cycle. In the case of straight line approximation and M = 64 , we use Equations 7 to 9 to determine the maximum magnitude error to be -70 dB and maximum phase distortion to be 3.012 × 10-4 rad. It is clear that the spectral approximation provides a very good FDF with only 64-path filter banks.

4.2 Timing error detection in FB-transformed domain

The time domain TED, Figure 10 of[29], requires building one FIR SRRC MF and one time domain derivative SRRC MF. The product of the two filters forms the TED which then drives a PLL. The PLL then tunes the FDF to find the best symbol timing. The NMDFB implementation of the TED together with the FDF is shown in Figure 8, where the IPEs are used to synthesize the FDF, MF, and the derivative MF. Their target spectrums can be readily obtained from the previously developed NMDFB filtering property in Section 2. As seen from Figure 8, one PAC and two PSCs are used to produce the time domain MF, derivative MF, as well as the FDF. We shall show a detailed workload analysis in the next section.

Figure 8
figure 8

FB-transformed domain symbol timing recovery.

5. Full NMDFB-based FBMC receiver and complexity analysis

The complete block diagram of the FBMC receiver is shown in Figure 9. We can see the entire signal processing is embedded between the PAC- and PSC-transformed domain, and all filtering tasks are implemented based on the NMDFB filtering property via dedicated IPEs. In this section, we would like to analyze the workload for the FBMC receiver and compare it to its time domain antecedent counterpart.

Figure 9
figure 9

FBMC receiver block diagram.

Examining Figure 9, the FBSC receiver contains one M-path PAC, two M-path PSC, two reduced size M′-path PSC, five M-entry IPEs, and two reduced size M′-entry IPEs. Let all LPPFs have L taps (real coefficient) per polyphase arm, i.e., the LPPF length for M-path PAC (or M-path PSC) is M × L and the LPPF length for M′-path PSC is M′ × L. Take the M-path, decimation by D, PAC for example: a block of D pieces of data enter the PAC, and its LPPF operates once, and so does its M-point FFT. Therefore, the workload in terms of the number of multiplies per input complex (I-Q pair) sample, for an M-path, decimation by D, PAC is calculated as 1 D 2 ML + M Pn t FFT , where 〈M PntFFT〉 denotes the number of multiplies required for an M-point FFT processing complex inputs. In addition, the workload for an M-entry complex coefficient IPE is 4 M D .

For an M-path, decimation by D arrangement, every block of D pieces of data coming in all the PACs, PSCs, and IPEs operate once. Therefore, the total workload count in terms of number of multiplies per input complex sample is

ops input = 1 D [ 6 ML + 4 M L + 20 M + 8 M + 3 M Pn t FFT + 2 M Pn t FFT ]

Equation 14 is the total workload count for the proposed FBSC receiver. It should be noted that Equation 14 also includes the fractionally spaced equalizer, which is essentially another IPE that can be combined with the MF IPE[16]. The conventional time domain implementation of Figure 8 requires building at least four FIR filters: SRRC MF, two BE filters (for carrier recovery), and one derivative MF (for timing recovery)[29]. These four filters often have the same length (suboptimal BE filter designed as Figure 5b) with the SRRC filter. Therefore, the workload count for time domain implementation is

ops input = 8 N

where N is the number of taps of SRRC filter.

We now assign practical parameters to Equations 14 and 15 to make further comparison. A 65-tap, i.e., N = 64, SRRC filter with 25% roll-off factor is used for time domain implementation. Therefore, based on Equation 15, the conventional FIR approach costs 520 real multiplies per complex input. In the FBSC solution, we use M = 240, M′ = 42, and D = M 3 = 80 , triangular-shaped PR-NMDFB with LPPF length L = 8, and our observation shows it can well equalize multipath channels with normalized RMS delay spread τrms/T = 5. The 240-point and 42-point complex FFT cost 1,100 and 152 real multiplies, respectively,[32]. Plugging in those parameters into Equation 14, we find the workload count for FBSC receiver is 270 real multiplies per complex input. Those numbers show the PR-NMDFB-based FBSC solution offers 48% workload reduction only for the synchronization part, and this workload reduction did not account for the linear equalization which is required in the conventional FIR approach; notice that, on the other hand, the equalization task has already been included in Equation 14. Furthermore, as will be shown in the simulation section, the 65-tap SRRC only gives 43-dB stop-band performance; however, the PR-NMDFB offers 100-dB dynamic range, an improvement by 56 dB.

6. Simulation results

This section presents the simulation results of the proposed FBSC receiver. We shall first examine the PR-NMDFB-implemented filters and compare them with the FIR-based designs. Then, we will demonstrate the behaviors of the proposed CFO recovery loop and symbol timing recovery loop.

6.1 PR-NMDFB-implemented filters

Figure 10 shows the magnitude response of the M = 240, D = 80, triangular-shaped PR-NMDFB-synthesized SRRC filter with 25% roll-off factor overlaid with 65-tap FIR SRRC filter. It can be shown that the power consumption of the two approaches is about the same (i.e., using PR-NMDFB as SRRC filter only); however, their performance differs vastly. The PR-NMDFB-synthesized SRRC has over 50 dB advantage in stop-band performance with much smaller in-band ripple. Figure 11 shows the SRRC shaped and MF-ed QPSK constellation on the first quadrant without multipath channel and noise. The measured error vector magnitude (EVM)[33] for PR-NMDFB-based approach is -73.02 dB or 0.022%, and for FIR-based approach is -65.95 dB or 0.050%.

Figure 10
figure 10

PR-NMDFB- and FIR-implemented SRRC filter.

Figure 11
figure 11

QPSK constellation dispersion.

Figure 12 shows the FIR-based time domain implementation of the suboptimal 65-tap BE filters and the PR-NMDFB-based BE filters. Comparing the two, we find the PR-NMDFB implements extremely sharp filter transitions and achieve near-optimal BE filter design[27]. Note that the smallest transition band of a triangular-shaped PR-NMDFB is π M , half of the NMDFB channel spacing. Again, the PR-NMDFB approach gives over 50 dB in stop-band performance compared to FIR-based BE filter design.

Figure 12
figure 12

Time and channelizer domain MF and BE filters.

Figure 13 shows the result for delaying the impulse response by 0, 0.5 and -0.5 samples. The left column figures show the impulse responses, while the right column figures show the magnitude responses. Note that we have subtracted the parasitic delay n D caused by PAC/PSC from the impulse responses. Thus, we see the impulse response for delaying by 0 sample sits on index 0. The impulses for delaying by ±0.5 samples become sampled time offset Nyquist pulses. The magnitude responses for delaying by ±0.5 samples are seen to have very slightly larger in-band distortions comparing to delaying by 0 sample. This is due to the use of IPE to approximate the complex sinusoid of the phase shifters (see Equation 13). Clearly, the worst case distortion happens when delaying by ±0.5, since all one need is fractional delay within one sample.

Figure 13
figure 13

M= 240, D= 80, PR-NMDFB-implemented FDF.

6.2 CFO recovery results

The CFO and constellation phase offset removal is simulated based on QPSK signal with 25-dB Eb/N0. The CFO is set to ω c = 0.2 2 π M or 20% of the NMDFB channel spacing. Figure 14 shows the CFO tracking loop dynamics. The horizontal axis is plotted in terms of block index, recall the NMDFB receiver takes in data samples in block of D, and in this example D = 80. We can see the PLL pulls the CFO to a very small value by balancing the signal energy from the two BE filters. Upon the convergence of the CFO loop, the constellation phase offset correction loop (Figure 15) began to work, which tracks the small residue CFO based on decision-directed criterion. The signal constellations taken before and after the convergence of the phase offset correction loop are shown in Figure 16.

Figure 14
figure 14

CFO tracking loop profile.

Figure 15
figure 15

Constellation phase offset removal tracking loop profile.

Figure 16
figure 16

Constellation before and after the convergence of the constellation phase offset removal loop.

6.3 Symbol timing recovery results

The symbol timing recovery is simulated under the same setup as the carrier recovery. The initial timing offset τ is set to be 0.5T s . Figure 17 shows the phase accumulator profile of the timing recovery PLL, and we can see the steady state is reached in about 100 blocks or 8,000 samples. The lower subplot of Figure 17 shows the input and output signals at the loop filter, and we see the timing error is driven to zero by the PLL. Figure 18 shows the QPSK constellation before/after the convergence of timing loop, which further confirms the functionality of the proposed approach.

Figure 17
figure 17

Symbol timing recovery loop profile.

Figure 18
figure 18

Constellation before and after the convergence of the symbol timing recovery loop.

7. Conclusion

We have proposed the complete receiver structure for FBSC transmission with emphasis on receiver filters and synchronization. We applied the PR-NMDFB-based filtering property in designing the wideband carrier and symbol timing synchronization. This allows us to concurrently design BE filters and fractional delay filters within the entire FB-based receiver structure. The complexity analysis shows the proposed approach offers 48% workload reduction when only the synchronization tasks are considered, and further reduction in computation is expected if equalization is considered. Moreover, in contrast to the conventional FIR approach, which requires the hardware clock to run on many times of the input sample rate, the proposed FBSC receiver allows all the filtering tasks to operate on deeply decimated sample rate, which is a highly desired characteristic for the hardware implementation of fully digital receivers.


  1. Ariyavisitakul S, Falconer D, Adachi F, Sari H: Guest editorial - wireless broadband techniques. IEEE J. Select. Areas Commun. 1999, 17(10):1709-1710.

    Article  Google Scholar 

  2. Richard VN, Prasad R: OFDM Wireless Multimedia Communication. Artech House, Boston; 2000.

    Google Scholar 

  3. Benvenuto N, Dinis R, Falconer D, Tomasin S: Single carrier modulation with nonlinear frequency domain equalization: an idea whose time has come—again. Proc. IEEE 2010, 98(1):69-96.

    Article  Google Scholar 

  4. Tomasin S, Benvenuto N: Iterative design and detection of a DFE in the frequency domain. IEEE Trans. Commun. 2005, 53(11):1867-1875. 10.1109/TCOMM.2005.858666

    Article  Google Scholar 

  5. Huang G, Nix A, Armour S: Feedback reliability calculation for an iterative block decision feedback equalizer, in 2009 IEEE 70th Vehicular Technology Conference Fall (VTC 2009-Fall). Piscataway, IEEE; 2009:1-5.

    Google Scholar 

  6. Magarini M, Barletta L, Spalvieri A: Efficient computation of the feedback filter for the hybrid decision feedback equalizer in highly dispersive channels. IEEE Trans. Wireless Commun. 2012, 11(6):2245-2253.

    Article  Google Scholar 

  7. Zhang C, Wang Z, Pan C, Chen S, Hanzo L: Low-complexity iterative frequency domain decision feedback equalization. Vehicular Technol. IEEE Trans. 2011, 60(3):1295-1301.

    Article  Google Scholar 

  8. Ihalainen T, Ikhlef A, Louveaux J, Renfors M: Channel equalization for multi-antenna FBMC/OQAM receivers. IEEE Trans. Veh. Technol. 2011, 60(5):2070-2085.

    Article  Google Scholar 

  9. Caus M, Perez-Neira AI: Transmitter-receiver designs for highly frequency selective channels in MIMO FBMC systems. Trans. Sig. Proc. 2012, 60(12):6519-6532.

    Article  MathSciNet  Google Scholar 

  10. Ihalainen T, Stitz TH, Rinne M, Renfors M: Channel equalization in filter bank based multicarrier modulation for wireless communications. EURASIP J. Appl. Signal Process 2007, 2007: 140-140.

    Article  MATH  Google Scholar 

  11. Yang Y, Ihalainen T, Rinne M, Renfors M: Frequency-domain equalization in single-carrier transmission: filter bank approach. EURASIP J. Adv. Signal Process 2007, 2007: 1-6.

    MATH  Google Scholar 

  12. Weiss S, Dooley SR, Stewart RW, Nandi AK: Adaptive equalization in oversampled subbands. Electron Lett. 1998, 34(15):1452-1453. 10.1049/el:19981085

    Article  Google Scholar 

  13. Yang Y, Stitz TH, Rinne M, Renfors M: Mitigation of narrowband interference in SC transmission with filter bank equalization. In IEEE Asia Pacific Conference on Circuits and Systems, 2006. APCCAS 2006. IEEE, Piscataway; 2006:748-751.

    Chapter  Google Scholar 

  14. Yang Y, Ihalainen T, Renfors M, Rinne M: Noise predictive turbo equalization for a filter bank based receiver in a SC transmission system. In IEEE 65th Vehicular Technology Conference, 2007. VTC2007-Spring. IEEE, Piscataway; 2007:2389-2393.

    Chapter  Google Scholar 

  15. Farhang-Boroujeny B: Adaptive Filters: Theory and Applications. Wiley, Chichester; 1998.

    MATH  Google Scholar 

  16. Chen X, Harris FJ, Venosa E, Rao BD: Non-maximally decimated analysis/synthesis filter banks: applications in wideband digital filtering. IEEE Trans. Signal Process. 2014, 62(4):852-867. doi: 10.1109/TSP.2013.2295549

    Article  MathSciNet  Google Scholar 

  17. Bolcskei H, Hlawatsch F: Oversampled cosine modulated filter banks with perfect reconstruction. IEEE Trans. Circuits and Systems II 1998, 45(8):1057-1071. 10.1109/82.718813

    Article  MATH  Google Scholar 

  18. Cvelkovic Z, Vetterli M: Oversampled filter banks. IEEE Trans. Signal Process 1998, 46(5):1245-1255. 10.1109/78.668788

    Article  Google Scholar 

  19. Bolcskei H, Hlawatsch F, Feichtinger HG: Frame-theoretic analysis of oversampled filter banks. IEEE Trans. Signal Process 1998, 46(12):3256-3268. 10.1109/78.735301

    Article  Google Scholar 

  20. Karp T, Fliege NJ: Modified DFT filter banks with perfect reconstruction. IEEE Trans. Circuits and Systems II 1999, 46(11):1404-1414. 10.1109/82.803480

    Article  MATH  Google Scholar 

  21. de Haan JM: Filter bank design for subband adaptive filtering. PhD dissertation. Department of Telecommunications and Signal Processing, University of Karlskrona and Ronneby, Sweden; 2001.

    Google Scholar 

  22. Harris FJ, Chen X, Venosa E, Rao BD: Wideband 160-channel polyphase filter bank cable TV channeliser. Signal Proc. IET 2011, 5(3):325-332. 10.1049/iet-spr.2010.0031

    Article  Google Scholar 

  23. Harris FJ, Venosa E, Chen X, Rao BD: Polyphase analysis filter bank down-converts unequal channel bandwidths with arbitrary center frequencies. Analog Integr. Circ. S. 2012, 71(3):481-494. 10.1007/s10470-011-9746-y

    Article  Google Scholar 

  24. Nguyen T: Near-perfect-reconstruction pseudo-QMF banks. IEEE Trans. Signal Process 1994, 42(1):65-76. 10.1109/78.258122

    Article  Google Scholar 

  25. Fliege NJ: Multirate Digital Signal Processing. Wiley, Chichester; 1994.

    MATH  Google Scholar 

  26. Harris FJ: Multirate Signal Processing for Communication Systems. Prentice Hall, Upper Saddle River; 2004.

    Google Scholar 

  27. Harris FJ, Venosa E, Chen X, Dick C: Band edge filters perform non data-aided carrier and timing synchronization of software defined radio QAM receivers. In 2012 15th International Symposium on Wireless Personal Multimedia Communications (WPMC). IEEE, Piscataway; 2012:271-275.

    Google Scholar 

  28. Dick C, Harris FJ, Rice M: Synchronization in software radios. Carrier and timing recovery using FPGAs. In 2000 IEEE Symposium on Field-Programmable Custom Computing Machines. IEEE, Piscataway; 2012:195-204.

    Google Scholar 

  29. Harris FJ, Rice M: Multirate digital filters for symbol timing synchronization in software-defined radios. IEEE J. Sel. Areas Commun. 2001, 19(12):2346-2357. 10.1109/49.974601

    Article  Google Scholar 

  30. Franks LE: Carrier and bit synchronization in data communication: a tutorial review. IEEE Trans. Commun. 1980, 28(8):1107-1121. 10.1109/TCOM.1980.1094775

    Article  Google Scholar 

  31. Harris FJ: Performance and design of Farrow filter used for arbitrary resampling. Proc. 13th Int. Conf. DSP 1997, 2: 595-599.

    Google Scholar 

  32. Burrus CS, Parks TW: DFT/FFT and Convolution Algorithms: Theory and Implementation. Wiley, New York; 1984.

    MATH  Google Scholar 

  33. Shafik R, Rahman S, Islam AR: On the extended relationships among EVM, BER and SNR as performance metrics. In International Conference on Electrical and Computer Engineering, 2006. ICECE '06. IEEE, Piscataway; 2006.

    Google Scholar 

Download references

Author information

Authors and Affiliations


Corresponding author

Correspondence to Xiaofei Chen.

Additional information

Competing interests

The authors declare that they have no competing interests.

Xiaofei Chen, fred harris, Elettra Venosa and Bhaskar D Rao contributed equally to this work.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Authors’ original file for figure 7

Authors’ original file for figure 8

Authors’ original file for figure 9

Authors’ original file for figure 10

Authors’ original file for figure 11

Authors’ original file for figure 12

Authors’ original file for figure 13

Authors’ original file for figure 14

Authors’ original file for figure 15

Authors’ original file for figure 16

Authors’ original file for figure 17

Authors’ original file for figure 18

Authors’ original file for figure 19

Authors’ original file for figure 20

Authors’ original file for figure 21

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Chen, X., harris, f., Venosa, E. et al. Non-maximally decimated filter bank-based single-carrier receiver: a pathway to next-generation wideband communication. EURASIP J. Adv. Signal Process. 2014, 62 (2014).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: