Contender waveforms for Low-Power Wide-Area networks in a scheduled 4G OFDM framework

When designing new solutions for Low-Power Wide-Area (LPWA) networks, coexistence and integration into existing 4G frameworks should be considered to ease the deployment procedure and reduce costs. In a previous work, Turbo-FSK was proposed as a potential physical layer for LPWA networks. With its constant envelope and high energy efficiency, the scheme is a serious contender for this type of networks. We propose to study the system in the Orthogonal Frequency Division Multiplexing (OFDM) framework, currently used by existing cellular networks and also considered for the recently standardized Narrow-Band IoT (NB-IoT). Several extensions of the Turbo-FSK scheme are presented. A hybrid modulation alphabet, combining orthogonal with linear modulations, is introduced, and the substitution of Frequency Shift Keying (FSK) modulation for another orthogonal modulation based on Zadoff-Chu (ZC) sequences is considered. Simulations are run under various conditions including Rayleigh fading channel with mobility, demonstrating that the Turbo-FSK scheme is able to achieve performance close to the Turbo Coded Orthogonal Frequency Division Multiplexing (TC-OFDM) when a low rate is considered. When considering hybridation of the alphabet, limitations appear for higher data rate, which can be overcome by changing the orthogonal alphabet. The study of the variation of the envelope of the compared solutions emphasizes the crucial trade-off between performance and efficiency of the power amplifier (PA), a main concern for low-power applications. While using the proposed solutions, we demonstrate that energy consumption can be reduced by up to a factor of 2.5.


Introduction
The Internet of Things (IoT) is expected to interconnect objects using both existing communication technologies and new emerging technologies [1,2]. These new technologies should be designed in a way that facilitates integration to existing frameworks, e.g, the Orthogonal Frequency Division Multiplexing (OFDM) framework used in Long-Term Evolution (LTE) systems (4G). The framework defines the structure of the signal and the overall parameters of the physical waveforms. By considering an existing framework, deployment costs are reduced and coexistence between various technologies is facilitated.
Low-Power Wide-Area (LPWA) networks [3,4] are part of the IoT and 5G context and are predicted to *Correspondence: jean-baptiste.dore@cea.fr 2 CEA, LETI, MINATEC Campus, 38054 Grenoble, France Full list of author information is available at the end of the article approximately represent 10% of the overall IoT connections [5]. Requirements for LPWA include longrange communication and low-energy consumption at the device level. Long range can be achieved by ensuring a very low level of sensitivity at the receiver, which can be obtained by allowing the Quality of Service (QoS) to be achieved for low levels of Signal-to-Noise Ratio (SNR) [6]. The low-energy consumption requirement results from the necessity to have a long battery life for the devices (up to 10 years). Regarding the physical layer, saving energy is usually done by selecting an energy-efficient modulation or by considering low-complexity operations for data transmissions (thus reducing the consumption of the processing unit). It is also achieved by ensuring a high efficiency for the power amplifier (PA), which is the most power-consuming component of the transmission chain [7][8][9]. The efficiency of the PA is highly dependent on the peak power of the signal [10]. Constant envelope modulations are thus of prime importance, as they allow for large PA efficiency gains.
Narrow-Band IoT (NB-IoT) is a solution partly dedicated to LPWA networks and was introduced in the LTE release 13 [11][12][13][14]. The uplink transmission of this solution combines [13 15] Turbo Code (TC) [11,15] as forward error correction (FEC) and a repetition factor. It guarantees the QoS to be reached for very low levels of SNR, i.e., low levels of sensitivity are obtained. The solution is highly energy efficient and offers a large range of data rates, making it suitable to numerous applications. A solution relying on a turbo receiver, the Turbo-FSK, was introduced in previous literature [6]. It allows for very low levels of sensitivity to be achieved thanks to the use of a combination of orthogonal modulation, a convolutional code, and an iterative receiver. Additionally, the scheme employs a constant envelope modulation, the Frequency Shift Keying (FSK) modulation, potentially increasing the efficiency of the PA.
If the Turbo-FSK scheme can be easily integrated in an OFDM framework thanks to its use of the FSK (which can be implemented using the fast Fourier transform (FFT) algorithm), it nonetheless lacks the flexibility offered by the NB-IoT solution. An interesting option is therefore to associate this technique with alphabet hybridation [16][17][18][19], which will improve the flexibility of the scheme. This hybridation of the FSK modulation, so-called Coplanar FSK [20], can be done while maintaining the constant envelope property, which gives a significant advantage to the Turbo-FSK technique. Additionally, the absence of constraints on the choice of the orthogonal alphabet for Turbo-FSK enables the design of alternative solutions relying on different modulations. For example, the chirp modulation [21] gained significant interest in recent years [22], particularly with LPWA industrial solution LoRa [23][24][25]. The alphabet of this orthogonal modulation is constructed from a chirp base, which can be, for example, selected as a Zadoff-Chu (ZC) sequence [26], a Constant Amplitude Zero Auto-Correlation (CAZAC) sequence notably used for the primary synchronization signal in the LTE protocol [11]. A possible alternative to Turbo-FSK can thus be the Turbo-ZC scheme, where a chirp modulation is used instead of the FSK modulation.
There are three main objectives and contributions for this work: present the possibility to integrate the Turbo-FSK in an OFDM framework, confront the scheme to more practical scenarios, and study the potential of the extensions of the scheme. Regarding the first objective, a thorough description of the OFDM framework is proposed, and equations of the several receivers are derived for the specific architecture. For the second objective, several transmission channels are considered: an additive white Gaussian noise (AWGN) channel, a static Rayleigh fading channel, and a Rayleigh fading channel with mobility. Finally, regarding the third objective, two variations of the initial Turbo-FSK scheme are considered: Coplanar Turbo-FSK, which relies on a hybrid FSK and Phase Shift Keying (PSK) alphabet, and Turbo-ZC (and Coplanar Turbo-ZC), which uses an orthogonal alphabet constructed from ZC sequences. For all the comparisons, two scenarios with two different data rates are considered. Three other schemes, based on the [13 15] TC, are used for comparisons: Turbo Coded Orthogonal Frequency Division Multiplexing (TC-OFDM), Turbo Coded Single Carrier Frequency Division Multiple Access (TC-SC-FDMA), and Turbo Coded Frequency Shift Keying (TC-FSK). The first two schemes are directly derived from the NB-IoT solution, while the third one enables a comparison of Turbo-FSK with another turbo coded constant envelope waveform.
The paper is organized as follows. The system model, including the OFDM framework and the five compared solutions, is presented in Section 2. Performance are compared in Section 3, and Section 4 concludes the paper.

System model
This section is dedicated to the presentation of the general OFDM framework and of the various techniques that are evaluated and compared.

OFDM framework
The general principle of the OFDM framework is to construct the signal in the frequency domain. Over the N FFT available carrier frequencies, only N A are considered to transmit information. This is a form of multicarrier modulation. The sequence generation in the OFDM framework, i.e., the transmitter architecture, and the considered channel are described first. Then, assuming a generic form for the transmitted sequence, the mathematical derivations of the likelihood of a sequence are computed. The result will be used in the next sections to derive the probabilistic receiver of the considered schemes. The generic architecture of the OFDM framework is finally presented, along with the values of the parameters considered for the simulations.

Transmitter
The OFDM framework is depicted in Fig. 1. From the information packet of size Q bits, a cyclic redundancy check (CRC) is computed and appended to the information block. Encoding and modulation steps are then performed. These steps depend on the choice of the technique and are presented later. The output of this block is represented by a matrix of size N A × N s , where N A is the number of active frequencies and N s the number of time slots (or OFDM symbols). This is the time-frequency allocation which is then included in the FFT grid of size N FFT × N s . Inverse discrete Fourier transform (IDFT) of size N FFT is then applied to convert the frequency signal into a time signal consisting of N FFT × N s samples (or chips). The inverse fast Fourier transform (IFFT) algorithm is used in practice to perform the IDFT. Considering the vector X of length N FFT , the discrete-time sequence of the considered OFDM symbol x is given by which is the IDFT of the vector X, with k ∈{0, . . . , N FFT −1}.
A cyclic prefix (CP) of length N CP consisting of the last N CP samples of each OFDM symbol is then inserted at the beginning of each symbol.

Channel
The emitted signal then transits through the channel. The use of a CP allows the effect of the channel to be expressed by a circular convolution when the channel delay spread is smaller than the size of the CP. Multi-path channel is modelled by a discrete-time domain representation, i.e., a finite impulse response (FIR) filter, applied at chip-time.
The useful signal (after removing the CP) at index k is given by where h is the channel impulse response, a complex vector with L ch elements, representing the gain of each possible paths. The elements ν k of the noise vector ν are independent and follow a circularly symmetric and zero mean complex normal distribution with variance σ 2 ch . The circular convolution assumption can only be considered if N CP ≥ L ch , i.e., if the delays of the previous OFDM symbols only affect the CP of the current symbol and not its useful part.

Likelihood computation
In order to derive the probabilistic receiver for each technique, we would like to express the likelihood of observing a sequence y given that a sequence x was transmitted. A generic form for the transmitted sequence is considered, and further simplifications will occur when specific modulation techniques are considered. The likelihood of observing a sequence y given that a sequence x was transmitted is given by As normal distribution with variance σ 2 ch was assumed, the likelihood is expressed where C st is a term independent of x and ·, · is the scalar product operator. Using (1) in (4), the likelihood can be expressed with where Y (n) (resp. H(n)) is the nth dimension of the discrete Fourier transform (DFT) of size N FFT of the vector y (resp. h) and X is the complex conjugate of X.
As the form of x was purposely kept general, additional computations are required to obtain the final expression of the maximum likelihood (ML) decoder. However, the expression demonstrates the necessity for the receiver to evaluate the DFT of the received time-vector y denoted by Y and to have knowledge of H, the DFT of the channel coefficients.
The log likelihood ratio (LLR) of the bit b m is given by where X m +1 resp.X m −1 the group of sequences x which encode an information word for which the bit b m is equal to + 1 (resp. − 1). The LLR of the bits for each technique will be computed using the specific expression of the sequence x in (5) and (6).

Receiver
In order to retrieve the information that was mapped in the frequency domain at the transmitter side, a DFT of size N FFT is applied on each OFDM symbol, using the FFT algorithm. The result is the vector Y , the channel observation. By selecting only the N A used frequencies, the N s vectors of size N A transmitted at the first place are retrieved. This signal is fed to the demodulator/decoder along with H. This vector can also be interpreted as the complex gain on each frequency. Using the channel observation and the vector H, the block demodulator uses a simplified formulation of (5) to compute the LLR of the transmitted bits. The information bits are finally estimated by the decoder and the CRC is computed to evaluate if the packet contains any error.

TC-OFDM and TC-SC-FDMA
Two well-known techniques are considered for comparison: the use of OFDM associated with the [13 15] TC [27] and the TC-SC-FDMA, for which a pre-coding based on a DFT transform is applied. These schemes are multiplexing techniques, i.e., a Quadrature Phase Shift Keying (QPSK) symbol is mapped on each frequency carrier. For both techniques, the encoder/modulator block of Fig. 1 uses a TC encoder and a QPSK modulator. However, since there are N A N s available symbols, a rate matching mechanism is used to repeat or puncture some bits. The rate R m corresponds to the ratio between the number of input bits (3Q + 12) of the rate matcher and the number of output bits (2N A N s ). If R m > 1, then bits are punctured (with interleaving to ensure that the punctured bits are evenly spread over time), and when R m < 1, bits are repeated. An interleaver is used between the encoding and modulation processes. For the case of TC-SC-FDMA, pre-coding using a DFT of size N DFT is applied.
While the derivations of the receivers could be obtained using the general expression of the likelihood (5), these computations are well known from the literature [28] and are not detailed here. For TC-SC-FDMA technique, the ML receiver is too complex to be considered as likelihood computation involved the whole symbol linked with the precoding. Zero-Forcing (ZF) equalizer is used instead. For the turbo decoder, the max-log approximation is considered and CRC is used as stop criterion for the iterations of the decoder, with a maximum number of iterations set to 10.

Turbo-FSK and Coplanar Turbo-FSK
The Turbo-FSK principle was introduced in [6,20,29]. The motivation for this scheme was to operate close to the Shannon capacity [30] while using a constant envelope modulation. Turbo-FSK can be implemented using an OFDM transceiver. The technique relies on the use of FSK modulation associated with a convolutional code and an iterative decoder. FSK is a common orthogonal modulation, based on an alphabet of orthogonal waveforms equivalent to pure frequencies. When using the previously introduced OFDM framework, only one of the N A possible carriers is activated every time slot. The information is carried by the index of the activated carrier. The innovative aspect of the Turbo-FSK scheme is in the association of the encoding and modulation processes at the transmitter side. The turbo receiver exploits this association to operate at very low levels of SNR.
The design of the Turbo-FSK technique allows for flexibility in terms of choice of alphabet. In order to increase the spectral efficiency of the technique, a hybrid modulation alphabet mixing FSK modulation (orthogonal component) and PSK modulation (linear component) is considered [16]. The alphabet is constructed using N A subsets A δ , with δ ∈ {0, . . . , N A − 1} (N A represents the number of possible frequencies for the FSK part of the alphabet), and M L linear shifts given by the complex number z p = exp{j2πp/M L }, with p ∈ {0, . . . , M L −1} (z p thus represents the phase shifts from the PSK part of the alphabet). Symbols belonging to the same orthogonal subset are coplanar (they only differ by a complex multiplicative factor), hence the denomination Coplanar Turbo-FSK. A time symbol x i from the alphabet is defined as where f δ is the vector of size N A generating the orthogonal subset A δ , with The spectral efficiency of the modulation only is given by When setting M L = 1, the modulation becomes equivalent to a classic FSK modulation with M tones. Dealing with large values of size of FSK modulation leads to low spectral efficiencies [31]. However, for M L > 1, the phase of the symbols carries information. For a constant value of M, increasing the value of M L leads to a reduction of the number of required orthogonal dimensions, thus improving the spectral efficiency of the modulation. Alternatively, for a constant number of orthogonal dimensions (i.e., a constant number of active carriers), increasing the value of M L increases the size of the alphabet and the spectral efficiency.
The encoder/modulator block for the Turbo-FSK is depicted in Fig. 2. Similar to the transmitter presented in [6], the encoder consists in λ stages where each stage encodes an interleaved version of the Q information bits. The information block is divided into N q information words of q bits. Eventually, a total of λ × (N q + 1) binary words of length q + 1 are generated. Each binary word is mapped on one of the codewords of the alphabet. In order to fit the N s available symbols, the rate matcher punctures or repeats some binary words so that Unlike the other schemes, the rate matcher operates at the word level instead of the bit level. This is because entire words of size q + 1 are associated to one of the symbols from the alphabet. The alphabet size is M = 2 q+1 . If the binary word b is associated to the symbol x i as expressed in (7), then the output of the modulator is This is very similar to FSK signaling as only one frequency carries information. However, unlike FSK where all the information is on the carrier index, additional bits can be mapped on the phase shift of the active carrier. When M L = 1, the scheme is equivalent to Turbo-FSK, but when M L ≥ 2, the scheme is called Coplanar Turbo-FSK. The resulting matrix of size N A ×N s is sparse, as only N s elements are different from 0.
The demodulator/decoder block is depicted in Fig. 2. First, the likelihood of all the symbols from the alphabet is computed. When combining the general expression (5) with (11), the likelihood given that the symbol x i was sent is expressed After computing the likelihood of each sequence, a matrix of size M × N s is obtained, containing the likelihoods of all the possible symbols from the alphabet. The puncturing management block is then in charge to recover the original M × λ(N q + 1) matrix generated at the transmitter. If a symbol was repeated, columns corresponding to the repetitions of this symbol are added. If a symbol was punctured, a column of 0s is added. The resulting matrix corresponds to the channel observation of all stages.
The turbo decoder then uses the observation of the λ stages to perform the iterative decoding presented in [6] (detailed in [20,29]), using the max-log approximation. The CRC can be used as a stop criteria, as it is computed at the end of each iteration.

Turbo-ZC and Coplanar Turbo-ZC
We propose to extend the Turbo-FSK technique to another family of orthogonal alphabets, the chirp modulation [21,22]. The chirps are selected as ZC sequences  [26]. In the time expression of the alphabet symbols given in Eq. (7), the vectors f δ are now constructed using ZC sequences (instead of the pure-tone complex exponential sequences for the Turbo-FSK). However, unlike the Coplanar Turbo-FSK, where symbols of the alphabet are expressed in time, we wish to express the symbols of the Coplanar Turbo-ZC alphabet in frequency. We consider the base ZC sequence g μ which has two parameters, its length N ZC and its index μ, both positive integers. The sequence is given by where mod(·, 2) is the modulo two operator, and with n ∈ {0, . . . , N ZC − 1}. The zero-autocorrelation property of the ZC sequence allows for the construction of an orthogonal alphabet using circularly shifted versions of a base sequence. The base sequence g μ with parameters N ZC = N A and μ = 1 is selected. An interesting implementation approach for the alphabet construction is the combination of the DFT matrix with the base sequence. The vectors f δ are constructed from vectors of the DFT matrix multiplied by the base sequence g 1 . Its samples are given by with N A a power of 2. This different approach also generates ZC sequences and allows for an efficient receiver architecture, as presented later. The encoder/modulator of the Coplanar Turbo-ZC system is strictly equivalent to the one depicted in Fig. 2 for the Turbo-FSK, apart from the modulation alphabet (the vectors generating the orthogonal subsets are expressed with (14) instead of (8)). The symbols from the alphabet will be mapped in the frequency domain, before the IFFT. The output of the block assuming that the current binary word is associated to the ith element of the alphabet is given by with n ∈ {0, . . . , N A − 1}, δ the index for the orthogonal part, and p for the linear part (the PSK alphabet). When M L = 1, the denomination Coplanar Turbo-ZC is simplified to Turbo-ZC, as each orthogonal subset contains only one symbol.
Unlike the Coplanar Turbo-FSK, the output matrix, of size N A × N s , is no longer sparse. Each column consists of a delayed ZC sequence. One property of the ZC sequence is its invariance to FFT operations. However, in this case, the size of the IFFT N FFT is different from the size of the sequence N A ; thus, the resulting time signal will not be a CAZAC sequence.
The demodulator/receiver for Coplanar Turbo-ZC is very similar to the one proposed for Coplanar Turbo-FSK, depicted in Fig. 3. The only difference is the likelihood computation. With the expression Eq. (5) obtained from a generic sequence and using Eq. (15), the likelihood is given by Note that the first part of Eq. (5) is included in the constant, as it can be shown independent of the index i. In order to compute the likelihood, each received sequence of size N A is multiplied by the conjugate of the channel coefficients and by the conjugate of the base sequence g 1 used to build the alphabet. A FFT of size N A is then applied. The likelihood of the codeword i is finally obtained by multiplying the δth output of the FFT by the linear coefficient p and retrieving the real part. Compared to the Coplanar Turbo-FSK, the computation of the likelihood for the Coplanar Turbo-ZC requires two additional steps: the multiplication by the base sequence and the FFT of the size N A .

TC-FSK
The use of the FSK modulation associated with the [13 15] TC can also be considered. For this scheme, the transmitter performs turbo encoding associated with a rate matching mechanism and an interleaver. FSK modulation is applied and the output is expressed with (11). The receiver computes the LLR of the binary sequence using which is obtained by combining Eqs. (12) and (6) and the max-log approximation. Turbo decoding is then performed, similar to the TC-OFDM. Unlike Coplanar Turbo-FSK, the TC-FSK scheme does not combine the demodulation and the decoding in a joint process. It nonetheless offers a constant envelope modulation.

Results and discussion
The performance of each scheme is assessed over several types of transmission channels, and the envelope variations of each technique is evaluated. In order to restrict the study, some parameters are kept constant for all the schemes. As all the schemes were presented associated with the same FFT configuration, the parameters for this architecture can be taken equal for all the schemes. The parameters are chosen as corresponding to the mode 1.4 MHz of LTE, i.e., latest NB IoT standard [14]. Two different scenarios are considered: a low-rate (8.24 kbps) as well as a high-rate scenario (46.68 kbps). This will exhibit the impact of the parameters of the schemes on the performance, notably the size of the linear part of the alphabet for both Coplanar Turbo-FSK and Coplanar Turbo-ZC. Specific values for the parameters of the OFDM framework are selected. The selected parameters are given in Table 1, where f s is the sampling frequency, f the subcarrier spacing and f c the carrier frequency. With N A = 16 active carriers, the bandwidth of the transmitted signal is equal to B = 240 kHz. The spectral efficiency can be computed with and the data rate is given by R = Q/N s . The first scenario considers the transmission of Q = 1000 bits over N s = 1700 OFDM symbols. The spectral efficiency is equal to η = 4.29 · 10 −3 bits/s/Hz, which gives a data rate of 8.24 kbps, a rather low value. This value is consistent with the fact that long-range solutions must transmit at low data rates to provide low sensitivity levels [6,20]. Due to the specific number of symbols, the rate matcher of each technique has a different output rate. The equivalent number of repetitions given by 1/R m and the parameters for each technique is presented in Table 2. For this scenario, we consider the Turbo-FSK and Turbo-ZC (thus M L = 1) with λ = 5. According to the results found in [32], it is the value of λ that gives the lowest E b /N 0 for this size of alphabet (M = 16). For the second scenario, a higher data rate is considered. The same packet size M = 1000 is transmitted over N s = 300 OFDM symbols. This gives a spectral efficiency of η = 2.43 · 10 −2 bits/s/Hz and a data rate equal to 46.68 kbps. The TC-FSK solution is not considered for this scenario, as the value of N s would imply a very large number of punctured bits, making the solution inadequate. The parameters for the four compared techniques are presented in Table 3. Both Coplanar Turbo-FSK and Coplanar Turbo-ZC are selected with parameters M L = 32 and λ = 3, which gives an alphabet size equal to M = 512. In [32], the combination M = 512 and λ = 3 was demonstrated to minimize the required E b /N 0 , for the orthogonal case (i.e., M L = 1). However, for this specific configuration, a loss in required E b /N 0 is expected for two reasons: 22% of the symbols are punctured and the choice of M L = 32 PSK will reduce the energy efficiency.
The performance of the packet error rate (PER) is simulated for various levels of SNR and three different channels: the AWGN channel and a Rayleigh fading channel with both static and mobile conditions (with classical Jake's model [33]). Perfect time and frequency synchronizations are assumed.

Under AWGN
The AWGN channel, i.e., a channel with a single and constant propagation path, is first considered. For the first scenario, the PER performance of the five different techniques is depicted in Fig. 4a, versus the SNR. Very low levels of SNR are considered. The TC-OFDM and TC-SC-FDMA schemes are equivalent under AWGN  and offer the best performance. Turbo-FSK and Turbo-ZC are also equivalent under this channel and exhibit a 0.2-dB loss at a PER of 10 −3 . Better performance could be achieved by selecting other combinations of alphabet size and number of repetition λ [32]. The TC-FSK suffers from a loss of 2.9 dB for the same PER versus TC-OFDM. Unlike Turbo-FSK, the receiver of this scheme does not combine FSK demodulation and turbo decoding in the same procedure. Better performance is achieved using a more sophisticated receiver such as the one considered in Turbo-FSK. The PER performance for the second scenario and for the four compared techniques is depicted in Fig. 4b. The levels of required SNR are higher than those for the first scenario, due to the increase of the data rate. Similar to the first scenario and due to the choice of the AWGN channel, TC-OFDM and TC-SC-FDMA are equivalent for this scenario, as well as Coplanar Turbo-FSK and Coplanar Turbo-ZC. Their performance is less than 0.4 dB away from the TC-OFDM, for a PER of 10 −2 . With this size of alphabet and number of repetitions (M = 512 and λ = 3), the Turbo-FSK (i.e., with M L = 1) overcomes the [13 15] TC [20]. However, the use of the linear modulation of size 32 associated with the puncturing of 22% of the codewords induces a loss of performance.
These simulations include the loss in spectral efficiency incurred by the introduction of the CP and the use of the CRC. The first is necessary to avoid inter-symbol interference (ISI) and inter-carrier interference (ICI) when considering multi-path channels. The latter is used to detect errors in the decoded packet and to compute the PER. Since these two elements induce a loss in spectral efficiency, this corresponds to a loss in E b /N 0 . Using a CRC of size 16, the loss is expressed in decibel as 10 log 10 N FFT N FFT + N CP + 10 log 10 Q Q + 16 .
With the considered parameters, this loss is equal to 0.37 dB. These results demonstrate the benefits of considering a joint coding and modulation scheme with respect to the use of FSK and a classical turbo code. When compared to turbo-coded modulations with repetition scheme, the performance loss of the proposed scheme is less than a few tenth of a decibel.

Under static Rayleigh fading channel
Simulations under the static Rayleigh fading channel are considered. Static implies that there is no relative velocity between the transmitter and the receiver, i.e., the channel does not evolve during the transmission of the packet. The Extended Typical Urban (ETU) profile [34, p. 191] is considered for the path gains and delays. This profile has a channel delay spread equal to τ = 5 μs, while the CP duration is equal to 4.6875 μs. This profile thus represents difficult conditions for the OFDM system. The coherence bandwidth of the channel is equal to τ = 1/τ = 200 kHz. Since the bandwidth of the signal B = 240 kHz is superior to the coherence bandwidth, frequency diversity is guaranteed.
In order to compare the schemes, the performance over a large number of channel realizations is computed (each realization is randomly generated following the Rayleigh a b Fig. 4 Performance of the compared solutions under AWGN, for the first scenario (a) and for the second scenario (b) model). Average performance over the channel realizations is presented. Since each realization has its own gain, the SNR is estimated at the input of the receiver. In the figures, the average SNR over all the realizations is presented on the horizontal axis. All the compared techniques are stressed with the same realizations of the channel. A large number of realizations is computed, and perfect channel state information (CSI) is considered. The performance of the five schemes for the first scenario is depicted in Fig. 5a. TC-OFDM outperforms all the other schemes with a PER of 10 −2 at − 11.4 dB of SNR. Performance of Turbo-ZC and Turbo-FSK is only 0.1 and 0.2 dB respectively away from TC-OFDM. TC-FSK exhibits a loss of 3.1 dB with respect to TC-OFDM, as for the AWGN case. The performance loss compared to Turbo-FSK is due to the use of a different decoder, since the signaling technique is identical. TC-SC-FDMA exhibits the worst performance, but this can be incurred to the choice of the ZF equalizer. Other equalizers could be considered such as the minimum mean square error (MMSE) and may lead to better performance.
For the second scenario, the performance of the four schemes is depicted in Fig. 5b. As for the case of the AWGN channel, increasing the data rate also increased the range of SNR for equivalent performance. Similar to the first scenario, the TC-SC-FDMA exhibits poor performance due to the use of the ZF equalizer. TC-OFDM and Coplanar Turbo-ZC have similar performance, reaching a PER of 10 −2 at a SNR of − 3.8 dB. Unlike the other scenario, Coplanar Turbo-FSK suffers from a 1.8-dB loss compared to TC-OFDM and Coplanar Turbo-ZC for the same PER. Both Coplanar Turbo-FSK and Coplanar Turbo-ZC use the exact same parameters. The difference in performance can thus be attributed to the use of different signaling techniques. Since the ZC signaling spreads the energy of one symbol on all the active carriers, frequency diversity is better exploited. Carriers with higher gains may compensate for faded carriers. However, in the FSK case, a fading on the frequency that carries the energy will induce the loss of the symbol. The difference in performance is reduced in the other scenario, which has a lower data rate. This may suggest that the use of a higher order of linear modulation (M L = 32 for this scenario) amplifies the difference between ZC and FSK signaling.
For the low-throughput case, performance remains the same for TC-OFDM system and the proposed scheme (Turbo-FSK and Turbo-ZC). In case of high throughput, the frequency diversity provided by Turbo ZC gives an advantage with respect to Coplanar Turbo-FSK making Coplanar Turbo-ZC interesting when compared to TC-OFDM.

Under Rayleigh fading channel with mobility (Jakes-Doppler spectrum)
Relative mobility between the transmitter and the receiver is now considered, along with the ETU fading profile. The relative velocity between the transmitter and the receiver is equal to υ = 50 km/h. With the selected carrier frequency, given in Table 1, the Doppler frequency is equal to f d = 115.8 Hz, giving a coherence time of t coh = 1/f d = 8.63 ms. The coherence is thus 129 times greater than the symbol time (equal to 66.6 μs). The receiver can assume a constant vector H for each symbol, which is actually the time-average estimation of the true channel. For the first scenario, the performance of the four schemes under mobility condition is depicted in Fig. 6a. When compared to the case of the static ETU depicted in Fig. 5, mobility significantly improves the performance of all the schemes. These schemes benefit from a large amount of redundancy (for example, the encoded bits for TC-OFDM and TC-SC-FDMA are repeated more than 17 a b times, and the TC itself offers redundancy). In the mobility case, a symbol and its repetitions experience different channel gains. This is a form of time diversity and performance tends toward the AWGN case when the order of diversity is very large [31]. While the TC-SC-FDMA exhibited poor performance in the static ETU case, the time diversity offered by the repetitions overcomes the negative impact of equalization and the scheme is less than 1 dB away from TC-OFDM. The TC-FSK scheme is 3.5 dB away from the TC-OFDM technique. The difference of performance in the static ETU case was in the same order of magnitude. Thanks to the use of repetition and a low spectral efficiency modulation technique, the TC-FSK scheme also benefits from diversity. Turbo-FSK and Turbo-ZC now exhibits a 0.6 and 0.4-dB gap respectively versus TC-OFDM. Both schemes use λ = 5 repetitions, which offers a diversity comparable to the other techniques.
The performance under mobility conditions for the second scenario is depicted in Fig. 6b. Both the TC-OFDM and TC-SC-FDMA schemes benefit from diversity due to the redundancy but at a smaller extent as there are only three repetitions. The curves tend less towards the AWGN case. While the Coplanar Turbo-ZC reached the performance of TC-OFDM for the static ETU case and was 0.5 dB away at a PER of 10 −2 in the AWGN case, the gap with TC-OFDM is now equal to 2.4 dB. Coplanar Turbo-FSK exhibits an even worse performance and the gap between Coplanar Turbo-FSK and Coplanar Turbo-ZC is now equal to 5.4 dB. For these two techniques, the lack of diversity gain could be explained by the combination of two aspects: the reduction of the value of λ to 3 and the rather high number of punctured codewords. Also, with the configuration M L = 32, more information is mapped on the linear dimensions. This combination has a devastating effect for the Coplanar Turbo-FSK, but it has less effects on the Coplanar Turbo-ZC, most likely due to its better frequency diversity. The spread spectrum feature of the ZC signaling seems again beneficial, and Coplanar Turbo-ZC outperforms Coplanar Turbo-FSK. When considering an even higher speed, Coplanar Turbo-ZC, Coplanar Turbo-FSK, and TC-FSK will not suffer from ICI since there is no multiplexing (unlike TC-OFDM and TC-SC-FDMA). This will nonetheless induce a spreading of the energy over the adjacent frequencies, i.e., a loss of power of the transmitted frequency for FSK signaling (along with a gain in the adjacent frequencies) and a distortion of the ZC sequence for the ZC signaling.
In case of mobility associated with fading channel, TC-OFDM remains the best choice. The use of spreading, i.e., SC-FDMA, enables to combine low envelope variation as well as good PER performance. We see here the limitations of the proposed schemes. However, we can expect a similar performance in terms of PER when considering Coplanar Turbo-ZC with respect to SC-FDMA. However, as previously mentioned, the proposed scheme drastically reduces the envelope variation of signal. Therefore, we describe, in the next subsection, a framework to globally compare the different solutions taking into account both channel performance and power consumption.

Envelope variations
In order to study the variations of the envelope, we propose to study the Instantaneous-to-Average Power Ratio (IAPR) [35], which is equivalent to a Peak-to-Average Power Ratio (PAPR) over one sample. The study of the IAPR is more relevant than the measure of the PAPR over a larger number of samples, as it will consider all the samples that potentially reach the non-linear region of the PA [35]. Evaluating the envelope variations is critical to tune the operating point of the PA, the last and most energyconsuming elements of the transmission chain [7,9]. The efficiency of the PA will influence both the consumption of the system and its cost. Since the IAPR describes the variation of the envelope, it also indicates how to drive the input of the PA in order to avoid the saturation and non-linear region. Usually, a backoff from the saturation point is considered, to ensure the saturation to be reached for only a given probability. However, the consequence of selecting a large back-off (or equally, that high values for the IAPR are more likely) is that the average level of output power is lowered. The efficiency of the PA can be shown to be dependent on the level of output power [10]. Consequently, the variation of the envelope is a major concern regarding the consumption of the system.
For all the compared solutions, the IAPR is measured from the time signal which is sent through the channel, after the addition of the CP in Fig. 1. For a sufficient statistic, 1000 packets are simulated using the parameters of Table 1.
The Complementary Cumulative Distribution Function (CCDF) of the IAPR is depicted for the five schemes in Fig. 7. TC-OFDM has the largest variations and exhibits a probability of 10 −3 to have an IAPR larger than 8 dB. TC-SC-FDMA shows a 2-dB improvement in its variations, thanks to the use of pre-coding. As expected, TC-FSK and Coplanar Turbo-FSK have the same performance, i.e., an IAPR of 0 dB. Both techniques rely on the use of FSK modulation which exhibits a constant envelope, and the Coplanar Turbo-FSK employs phase shifts as for the coplanar value, maintaining the constant envelope property. However, even though the Coplanar Turbo-ZC was constructed using CAZAC sequences, it does not exhibit a constant envelope. This is because the sequence was mapped in the frequency domain. The use of the IFFT oversamples the sequence to give a resulting time signal convolved with the IFFT window that does not have a constant envelope. Additionally, the number of possible sequences is equal to 16 (the value of N A ), i.e., the variations of a given signal will correspond (on average) to the Fig. 7 IAPR of the five compared solutions with the considered OFDM parameters variations of the 16 possible sequences. When the ratio N FFT /N A is an integer, all the sequences have the same distribution of the IAPR. The IAPR consists in a finite number of values (hence the discrete CCDF) and could be evaluated simply by considering the base sequence. The maximum IAPR of the Coplanar Turbo-ZC appears to be around 2.6 dB, a value that depends on the value of N A and the choice of the base sequence.
The measure of the IAPR clearly emphasizes the main benefits of the proposed schemes, the constant amplitude property of Coplanar Turbo-FSK, and the low PAPR of Coplanar Turbo-ZC, offering a gain of 5.4 dB versus TC-OFDM. It also gives information on how far you must reduce the input power (Input Backoff (IBO)) of a PA in order to limit non-linearity effects that could decrease the performance of the system.

Energy consumption
In the previous section, we stress the gain/loss we have considering new modulation and coding scheme. As a synthesis, we propose to balance the gain/loss through an estimation of the energy consumption of a device. In one hand, the proposed schemes are working worse on channel with mobility. On the other hand, a low PAPR level is achieved. The relation between the IBO and PA efficiency was previously mentioned. The theoretical efficiency of the PA can actually be shown conversely dependent on the level of IAPR [10,36]. In Fig. 8, the drain efficiency versus the Input Backoff is depicted for class A amplifiers, as well as the corresponding efficiency and PAPR of the five techniques when considering a CCDF of 10 −3 for the IAPR. By selecting Coplanar Turbo-FSK instead of TC-OFDM, the theoretical efficiency of the PA can be increased by 37%. Coplanar Turbo-ZC also offers a gain in efficiency. Regarding the second scenario previously introduced, using the Coplanar Turbo-ZC seems to be an interesting trade-off versus TC-OFDM, as the PA efficiency can be increased while maintaining almost similar performance Fig. 8 Drain efficiency versus IBO for class A amplifier, and efficiencies of the five compared solutions for the given scenario (unlike the Coplanar Turbo-FSK, which exhibits a more important performance loss). In order to illustrate this effect, we propose to consider the energy consumption associated to one packet.
The energy E consumed by the transmission of the packet can be used to estimate the battery life. It is usually expressed by the product P t T, where P t is the transmission power for the given QoS and T the packet duration [9]. However, since the PA has a specific efficiency ε < 1 (which depends on the technology used), the consumed energy is actually given by where the values of P t and ε vary depending on the technology selected. For a given QoS, if the required SNR is dB lower for the scheme X than for the considered reference, then the transmitted power can be reduced by dB. The efficiency ε is given by the value of the PAPR, which is selected for a CCDF of 10 −3 . While considering the TC-OFDM as a reference, the ratio of consumed energy is given by where ε 0 is the efficiency for the TC-OFDM, ε X for the technique X, and X the difference in SNR (in linear) between the technique X and the TC-OFDM. When ρ X < 1, this means that compared to TC-OFDM, more energy is consumed, and eventually the battery life is reduced.
When ρ X > 1, battery life is increased. This quantity describes two variations, the efficiency of the PA and the difference in performance, and how they can potentially balance each other. Indeed, a technique having a very good efficiency but poor performance may have a value of ρ X inferior to 1, meaning that the loss in performance is not balanced by the increase of the efficiency. For all the scenarios and the channels, the ratio of consumed energy is depicted in Fig. 9. TC-SC-FDMA offers a small gain in energy consumption, except for the static ETU case where its low performance induces an increase of the transmitted power. It should be mentioned that better performance could be obtained by considering a different equalizer. TC-FSK offers a gain thanks to its constant envelope. Coplanar Turbo-FSK gives important gains (up to 2.5 under AWGN, i.e., an increase of the battery life by a factor 2.5) but suffers from its low performance under mobility conditions. Finally, Coplanar Turbo-ZC offers an interesting trade-off with gains included between 1 and 1.9 depending on the conditions. To conclude, both proposed schemes really improve the power consumption with respect to TC-OFDM and SC-FDMA. Coplanar Turbo-FSK is particularly interesting for static channel but suffers from poor performances in case of mobility. In the latest case, Coplanar Turbo-ZC offers a tradeoff. When high throughput and mobility are considered, one can expect the best performance with respect to TC-OFDM and SC-FDMA. In case of low throughput, performance of Coplanar Turbo-ZC remains below coplanar Turbo-FSK but the gap is quite small. This makes Coplanar Turbo-ZC a promising scheme.

Conclusions
In the context of LPWA networks, reaching low levels of sensitivity and reducing the consumption at the device level are major concerns. Nonetheless, coexistence and integration into existing frameworks should be considered to ease the deployment procedure and reduce costs. In this work, we proposed the integration of Turbo-FSK in the OFDM framework and extended the original scheme to the Coplanar Turbo-FSK, which employs a hybrid modulation alphabet, and to the Coplanar Turbo-ZC, which uses another orthogonal alphabet based on ZC sequences.
We demonstrated the possibility for the proposed techniques to use a transmitter and a receiver based on an OFDM-like architecture. Simulations demonstrated that the proposed schemes achieve performance close to TC-OFDM in AWGN conditions and Rayleigh fading channels for both static and mobility conditions. However, while TC-OFDM exhibits important envelope variations, a known effect for this modulation, the proposed techniques offer different levels of reduction of the envelope variations, which improves the efficiency of the PA. For example, using Coplanar Turbo-FSK with a class A amplifier can lead to a theoretical drain efficiency of 60%. Coplanar Turbo-FSK trades a small loss in PER for a constant envelope and is a very powerful alternative to the use of classic OFDM, potentially increasing the battery life by a factor 2.5. The presented Coplanar Turbo-ZC exhibits higher envelope variations but exploits the frequency diversity in a better way, offering performance even closer to the TC-OFDM. It appears to be an even more interesting alternative to TC-OFDM and offers a battery life gain for all the considered conditions.
The proposed solutions can be seen as promising additional modes for LPWA transmission, as they can be included in a scheduled network based on an OFDM framework. The solutions could be part of the NB-IoT standard as a new low-power mode which allows a significant reduction of the consumption at the device level.