Electrical transient modeling for appliance characterization

Nait-Meziane, Mohamed; Ravier, Philippe; Abed-Meraim, Karim; Lamarque, Guy; Bunetel, Jean-Charles Le; Raingeaud, Yves

doi:10.1186/s13634-019-0644-2

Research
Open access
Published: 20 November 2019

Electrical transient modeling for appliance characterization

Mohamed Nait-Meziane ORCID: orcid.org/0000-0002-7506-2703¹,
Philippe Ravier¹,
Karim Abed-Meraim¹,
Guy Lamarque¹,
Jean-Charles Le Bunetel² &
…
Yves Raingeaud²

EURASIP Journal on Advances in Signal Processing volume 2019, Article number: 55 (2019) Cite this article

2183 Accesses
2 Citations
Metrics details

Abstract

Transient signals are characteristic of the underlying phenomenon generating them, which makes their analysis useful in many fields. Transients occur as a sudden change between two steady state regimes, subsist for a short period, and tend to decay over time. Hence, superimposed damped sinusoids (SDS) were extensively used for transients modeling as they are adequate for describing decaying phenomena. However, SDS are not adapted for modeling the turn-on transient current of electrical appliances as it tends to decay to a steady state that is different from the one preceding it. In this paper, we propose a new and more suitable model for these signals for the purpose of characterizing appliances. We also propose an algorithm for the model parameter estimation and validate its performance on simulated and real data. Moreover, we give an example on the use of the model parameters as features for the classification of appliances using the Controlled On/Off Loads Library (COOLL) dataset. The results show that the proposed algorithm is efficient and that for real data the network fundamental frequency must be estimated to account for its variations around the nominal value. Finally, real data experiments showed that the model parameters used as features yielded a classification accuracy of 98%.

1 Introduction

Studying transient phenomena is important and useful in many fields such as biomedical research for the analysis of heart rate variability [1], the extraction of detailed information of muscle behavior [2], and the detection and classification of epileptic spikes [3]; mechanics for the study of the susceptibility of structures to vibration issues [4]; and for seismic events detection and temporal localization [5, 6]. Monitoring electrical loads and systems is particularly one of the areas where transients play a central role. We cite as applications the analysis of disturbances affecting the quality of the electric power system [7, 8], fault detection in rotary machines [9, 10], and non-intrusive load monitoring (NILM) [11–13], a field concerned with extracting individual energy consumption (e.g., of different appliances) from measured total energy consumption (e.g., at main breaker panel).

Transients embed a decay or damping characteristic as they exist for short periods, and therefore, the superimposed damped sinusoids (SDS) model [14] was extensively used to model transients in many fields. For example, it was used for modeling electric disturbances [15], transient audio signals [16], and the free induction decay observed in nuclear resonance spectroscopy [17]. Along with the model, different algorithms were proposed for its parameter estimation [18]. The most known methods are Prony’s [19], Pisarenko’s [20], matrix pencil [21], Estimation of Signal Parameters via Rotational Invariance Techniques (ESPRIT) [22], and MUltiple SIgnal Classification (MUSIC) [23]. Despite its success, the SDS model is inadequate for turn-on transient current signals. In fact, a lot of turn-on transient current signals are characterized by a quasi-stationary harmonic content (Fig. 1a) whereas the SDS model is best suited for modeling vanishing non-stationary content (Fig. 1b) because having different damping factors for each frequency produces a signal with non-stationary frequency content. Moreover, the turn-on transient current decays to a steady state that is different from the steady state preceding the turn-on of the appliance, whereas in the SDS the transient model starts from one steady state and decays to the same one afterwards. The electrical current “turn-on" transient is the current that appears with the switching-on of an electrical appliance. This corresponds to a transition from one steady state to another. For example, if we consider a single appliance on the network, then the first steady state is the state of no consumption, the second steady state is the state of steady consumption, and the transient is the part in between. Note that the transient we are interested in modeling is the one related to the electrical consumption. This transient is different from the very high frequency transient (appearing as a short pulse preceding the turn-on transient) generated by the switching devices because of the closure of the circuit [24]. Turn-on transients are appliance-dependent and last usually from few power system cycles to few seconds. A turn-on transient is typically characterized by a high current amplitude (surge) at the beginning of consumption followed by a decrease (or damping) in the amplitude of the consumed energy until reaching a stable state (Fig. 1a).

In this paper, we propose a new model for turn-on transient current signals along with an efficient algorithm for the parameter estimation. The parameters are used to characterize electrical appliances and are shown to be useful for appliance classification. Several objectives are targeted in this paper including:

Derivation of an efficient estimation algorithm for the model parameters.
Assessment of the estimation performance via the computation of the Cramér-Rao bound (CRB).
Validation of the proposed model on real transient signals and the evaluation of the modeling error when using the developed estimation algorithm.
Exploitation of the model parameters for appliance characterization and assessment of their usefulness as relevant features for a classification task example.
As a by-product, we also developed a full experimental setup (with a dedicated transient signal acquisition device) to build our dataset of real transient signals corresponding to different electrical appliances. This dataset is used for our model validation as well as for the performance assessment of the proposed appliance identification method.

The rest of the paper is organized as follows: Section 2.1 describes the proposed data model, Section 2.2 details the proposed parameter estimation algorithm, Section 3 gives the derivation of the parameters’ CRBs, Section 4.1 gives the assessment of the proposed model and algorithm on simulated and real data, Section 4.2 shows the usefulness of the model parameters through an appliance classification example, and finally, Section 5 concludes the paper.

2 Methods

2.1 Data modeling

In this section, we propose and discuss a mathematical model for turn-on transient current signals. Strictly speaking, we model the turn-on transient including a small part of the following steady state regime; mainly because the transient end is not well defined and because estimating the harmonic content is easier on the steady state part.

The shape of the turn-on transient and the related amplitudes vary from one electrical appliance to another. To take into account these variations, we propose to model the noiseless electrical current turn-on transient s(t) as the product of two signals

$$ s(t) = e(t)s_{s}(t), \quad \forall t\in [0, +\infty) $$

(1)

where e(t) represents an amplitude modulation (envelope) and s_s(t) is a sum of d sinusoids given as follows

$$ s_{s}(t) = {\sum\limits_{i = 1}^d {{a_i}{\cos\left(2\pi {f_i}t + \phi_{i}\right)}}}, \quad \forall t\in [0, +\infty) $$

(2)

where a_i (≥0), ϕ_i∈[−π,π] and f_i are the sinusoids amplitudes, phases, and frequencies, respectively. The number d of sinusoids (current harmonics) is assumed known (typically d=5 harmonics is enough to represent the sinusoidal signal s_s(t)) and the frequencies satisfy f_i=(2i−1)f₀,i=1,…,d where f₀≈50 Hz. Indeed, because of the half-wave symmetry found in electrical signals (i.e., for a periodic signal g(t) of period P, a half-wave symmetry is characterized by g(t+P/2)=−g(t)), the sinusoid frequencies f_i are odd-order harmonics of the fundamental frequency. Note that the nominal value (50 Hz) of the network fundamental frequency is a priori known, but due to its fluctuations around this value over time (i.e., f₀=50+δf), we have observed that for a correct modeling, f₀ should be considered as unknown and hence one needs to re-estimate the fundamental frequency value for each transient signal^{Footnote 1}.

The envelope e(t) is chosen of the form e^u(t)+1 such that $e^{u(t)}\xrightarrow [{t \to + \infty }]{}0$. This exponential function was chosen for its usefulness in describing damped phenomena. A classical damped model is such that u(t)=−αt with α>0. For our model, we propose to extend this classical model in order to adapt it to real current signals. Specifically, we propose to model u(t) as a polynomial function allowing more flexibility in describing the amplitude modulation of real signals

$$ e(t) = {e^{{\text{p}^{T}}\text{t}}} + 1, \quad \forall t\in [0, +\infty) $$

(3)

where p=[p₀,p₁,…,p_n]^T is a vector of n+1 polynomial coefficients and t=[1,t,…,tⁿ]^T is a time vector such that p^Tt is a polynomial of degree n allowing the model adaptation to the real signal variations^{Footnote 2}. The polynomial is such that ${\text {p}^{T}}\text {t}\mathop {\longrightarrow } \limits _{t \to + \infty } - \infty $ leading to $e(t)\mathop {\longrightarrow } \limits _{t \to + \infty } 1$ (verified if p_n<0).

We assume that the measured current signal x(t) is corrupted by an additive white Gaussian noise (AWGN) w(t) with zero mean and variance σ²

$$ x(t) = s(t)+w(t), \quad \forall t\in \mathbb{R}. $$

(4)

The passage from continuous to discrete-time notation is done using t_k=kT_s, where T_s=1/F_s is the sampling period and $k\in \mathbb {Z}$. This notation is used in the remainder of the paper.

2.2 Parameter estimation algorithm

The proposed parameter estimation algorithm proceeds in two steps:

Estimation of the fundamental frequency f₀.
Estimation of the other signal parameters using the a priori estimated frequency $\hat {f}_{0}$.

2.2.1 Fundamental frequency estimation

We assume that the fundamental frequency is unknown but quasi-constant over the transient duration, typically less than 5 s, and we propose to estimate it using the voltage signal, which is almost perfectly sinusoidal (Fig. 2). Indeed, the stability of f₀ (i.e., its rate of change) is an important issue that was discussed in depth in the literature. This can be seen from the plot in Fig. 3 (borrowed from http://wwwhome.cs.utwente.nl/~ptdeboer/misc/mains.html) which represents the “Allan deviation” of f₀ for a measurement over a period of 69 days. As explained in this reference, if the Allan deviation at an averaging duration of 10 s is 10⁻⁴, it means that if one measures the frequency during 10 s and once more during the next 10 s, these measurements will differ on average by 0.01%. Based on this, we consider the frequency variation over our 5-s measurement period as negligible. Hence, the fundamental frequency estimation problem turns into the classical problem of estimating the frequency of a monotone signal in noise. It is known that the CRB of the frequency parameter of a monotone signal decreases with a rate of 1/N³ [26]

$$ \text{var}(\hat{f}_0) \ge \frac{12}{(2\pi)^2 \eta N(N^2-1)}, $$

(5)

where η is the signal-to-noise ratio (SNR) and N the number of signal samples. This allows for highly precise frequency estimates. Practically, we use the algorithm proposed by Aboutanios and Mulgrew [27] which is shown to provide a precise frequency estimate reaching the CRB. Indeed, the voltage signal is modeled here as a pure sinusoid of frequency f₀ corrupted by an AWGN. In such a case, the optimal maximum likelihood (ML) solution coincides with the peak location estimation of the Fourier transform of the signal. This estimation is achieved by the low cost efficient numerical method in [27].

In the second step (next section), we estimate the remaining parameters using the frequency $\hat {f}_{0}$.

2.2.2 Transient current estimation (TCE) algorithm

Given the estimated frequency $\hat {f}_{0}$, the second step of our estimation algorithm operates in two phases: (i) initialization and (ii) parameter estimation.

The initialization phase provides initial estimates of the parameters p_j,a_i, and ϕ_i (j=0,…,n and i=1,…,d) to be used in the parameter estimation phase, during which these estimates will be refined. This two-phase structure of the algorithm is motivated by the difficulty and high computational cost of the nonlinear maximum-likelihood-based estimation criterion (see (15)). In such a case, we usually seek a good initial estimate and then refine it in order to alleviate the ill-convergence and high computational cost of the problem.

Note that the algorithm needs some pre-specified values for n, d, and f_i and also needs a pre-defined steady state portion used in the initialization step for the estimation of the amplitudes a_i and phases ϕ_i of the sinusoids. Hereafter, we start by discussing these “pre-specified quantities,” then we proceed to detailing the algorithm.

2.2.2.1 Pre-specified quantities

These quantities are f_i (i=1,…,d),n,d,t_ss1, and t_ss2 (“ss” stands for steady state). As mentioned in Section 2.1, f_i are odd-order harmonics of f₀. Taking into account the estimated fundamental frequency $\hat {f}_{0}$, the sinusoids’ frequencies are given by $f_{i}=(2i-1)\hat {f}_{0}$. The rest of the parameters are chosen in an ad hoc way. The polynomial degree n and the number of sinusoids d were chosen based on experimental observations made on the real data we used. The chosen values were n=3 and d=5 (see the discussion of assumption A3 in Section 4.1.2).

Quantities t_ss1 and t_ss2 define the time instants that delimit a portion, noted x_ss(t_k), of the steady state of the current signal (Fig. 4). This portion is used in the initialization phase for the estimation of the amplitudes and phases. On this steady state portion, t_k∈[t_ss1,t_ss2], we can write x(t_k)=x_ss(t_k)=s_s(t_k)+w(t_k) where s_s(t_k) is the sum of sinusoids signal (2); we neglect the envelope influence by assuming e(t)=1 on this portion.

We define t_ss1 and t_ss2 using the High Accuracy NILM Detector (HAND) algorithm [28] found in the literature. Applying HAND on a turn-on transient signal provides the time-instants $t_{\text {beg}}^{\text {on}}$ and $t_{\text {end}}^{\text {on}}$ defining the beginning and end of the turn-on transition (Fig. 4). Practically, we define t_ss1 a few (typically ten) time cycles after $t_{\text {end}}^{\text {on}}$ and t_ss2 such that the duration of x_ss(t_k) is 25 time-cycles (i.e., half a second, sufficient to get good initial estimates of amplitudes and phases).

2.2.2.2 Initialization phase

1.
Estimation of s_s(t_k): Using the least squares (LS) criterion, parameters a_i and ϕ_i are estimated using (t_k∈[t_ss1,t_ss2])
$$\begin{array}{*{20}l} x_{ss}(t_{k}) & = s_{s}(t_{k}) + w(t_{k}) \\ & = {\sum\limits_{i = 1}^{d} {{a_{i}}{\cos\left(2\pi {f_{i}}t_{k} + {\phi}_{i}\right)}}}+ w(t_{k}) \\ & = \sum\limits_{i = 1}^{d} [{a_{i}} \cos{\phi}_{i}\cos\left(2\pi {f_{i}}t_{k}\right) \\ & - {a_{i}} \sin{\phi}_{i}\sin\left(2\pi {f_{i}}t_{k}\right)] + w(t_{k}). \end{array} $$
(6)

Writing (6) in vector form gives
$$ \mathbf{x_{ss}}= \mathbf{M} \mathbf{{c}} + {\mathbf w}, $$
(7)

where x_ss=[x_ss(t_ss1),…,x_ss(t_ss2)]^T,c=[a₁ cosϕ₁,a₁ sinϕ₁,…,a_d cosϕ_d,a_d sinϕ_d]^T,w=[w(t_ss1),…,w(t_ss2)]^T is the noise vector and M s the matrix given in (8).
$$ {{} \begin{aligned} \mathbf{M} = \left[ {\begin{array}{ccccc} {\cos \left({2\pi {f_1}{t_{ss1}}} \right)} & { - \sin \left({2\pi {f_1}{t_{ss1}}} \right)} & \cdots & {\cos \left({2\pi {f_d}{t_{ss1}}} \right)} & { - \sin \left({2\pi {f_d}{t_{ss1}}} \right)} \\ \vdots & \vdots & \ddots & \vdots & \vdots \\ {\cos \left({2\pi {f_1}{t_{ss2}}} \right)} & { - \sin \left({2\pi {f_1}{t_{ss2}}} \right)} & \cdots & {\cos \left({2\pi {f_d}{t_{ss2}}} \right)} & { - \sin \left({2\pi {f_d}{t_{ss2}}} \right)} \end{array}} \right] \end{aligned}} $$
(8)

The LS criterion, used to find an estimate for c, aims to minimize the square of the Euclidean norm (∥·∥₂) of the difference between the measured signal and the data model, i.e.,
$$\begin{array}{*{20}l} &\hat{\mathbf{c}} = \underset{\mathbf{c}}{\text{arg\,min}} \frac{1}{2} \|{\mathbf{x}_{\mathbf{s}\mathbf{s}}-\mathbf{M}\mathbf{c}}\|_{2}^2 \\ &\text{subject to}\quad a_{i} \ge 0 \; \text{and} \; \phi_i \in [-\pi, \pi], \forall i. \end{array} $$
(9)

In that case, the solution $\hat {\mathbf {c}}$ is given by
$${r \hat{\mathbf{C}}l} \hat{\mathbf{c}} = \mathbf{M}^{+} \mathbf{x}_{\mathbf{s}\mathbf{s}}, $$
(10)

where M⁺=(M^TM)⁻¹M^T is the (Moore-Penrose) pseudo-inverse of M. We extract from $\hat {\boldsymbol {c}}$ two vectors $\hat {\mathbf {c}\mathbf {s}} = \left [\hat {a}_{1} \cos \hat {\phi }_{1}, \dots, \hat {a}_{d} \cos \hat {\phi }_{d}\right ]^{T}$ and $\hat {\mathbf {s}\mathbf {n}} = \left [\hat {a}_{1} \sin \hat {\phi }_{1}, \dots, \hat {a}_{d} \sin \hat {\phi }_{d}\right ]^{T}$, and we compute $\hat {a}_{i}$ and $\hat {\phi }_{i}$ as follows
$$ \hat{\mathbf{a}} = \left[ {\begin{array}{c} \hat{a}_{1} \\ \vdots \\ \hat{a}_{d} \end{array}} \right] = \sqrt{\hat{\mathbf{c}\mathbf{s}} \odot \hat{\mathbf{c}\mathbf{s}} + \hat{\mathbf{s}\mathbf{n}} \odot \hat{\mathbf{s}\mathbf{n}}}, $$
(11)

$$ \hat{\boldsymbol{\phi}} = \left[ {\begin{array}{*{20}{c}} \hat{\phi}_{1} \\ \vdots \\ \hat{\phi}_{d} \end{array}} \right] = \arctan \left(\hat{\mathbf{s}\mathbf{n}} \oslash \hat{\mathbf{c}\mathbf{s}} \right), $$
(12)

where ⊙ and ⊘ are the element-wise product and division operators, respectively.
2.
Estimation of e(t_k) :

To estimate e(t_k), we use the trust-region-reflective (TRR) algorithm [29, 30], an efficient nonlinear optimization algorithm that belongs to the “trust region” class of algorithms [31]. This algorithm allows constraints to be imposed on the values of the parameter estimates enabling us to satisfy our constraints (a_i≥0,ϕ_i∈[−π,π] and p_n<0).

Having the estimates $\hat {\mathbf {a}}$ and $\hat {\boldsymbol {\phi }}$ obtained from the previous step and the (overall) measured signal x=[x(t₀),…,x(t_N−1)]^T, we estimate an initial value of p=[p₀,…,p_n]^T (the remaining unknown) using
$$\begin{array}{*{20}l} &\hat{\mathbf{p}} = \underset{\mathbf{p}}{\text{arg\,min}} \frac{1}{2}||\mathbf{x}-(\mathbf{e} \odot \mathbf{M}_{ov} \hat{\mathbf{c}})||_{2}^2 \\ &\text{subject to} \quad p_n < 0, \end{array} $$
(13)

where e=[e(t₀),…,e(t_N−1)]^T, N is the number of samples of x, and M_ov is the matrix M (8) with t∈[t₀,…,t_N−1] instead of t∈[t_ss1,…,t_ss2]. Moreover, t₀ is practically chosen as the time-instant corresponding to the maximum current amplitude; that way, we model the damped part of the turn-on transient starting from the maximum amplitude. In the end of this phase, we obtain the initial estimated parameter vector $\boldsymbol {\hat {\theta }}_{0} = \left [\hat {\mathbf {p}}^{T}, \hat {\mathbf {a}}^{T}, \hat {\boldsymbol {\phi }}^{T} \right ]^{T}$.

2.2.2.3 Parameter estimation phase

As the estimation of a_i and ϕ_i is done using only a portion of the total measured signal x(t_k), the aim of this parameter estimation phase is to improve the estimation of all the parameters by considering all the samples of x(t_k). We use for this the same TRR algorithm considered for the estimation of e(t_k) by taking as initial value the result of the estimation phase $\hat {\boldsymbol {\theta }}_{0}$. The unknown, to be estimated this time, is the global parameter vector θ=[p^T,a^T,ϕ^T]^T estimated as

$$ {{}\begin{aligned} \boldsymbol{\hat{\theta}} = \underset{\boldsymbol{\theta}}{\text{arg\,min}} \frac{1}{2}\|{\mathbf{x}\,-\,(\mathbf{e} \odot \mathbf{M}_{ov} \mathbf{c})}\|_{2}^2 \,\,\,\,\,\, \end{aligned}} $$

(14)

The TCE algorithm is summarized in Algorithm ??.

3 Cramér-Rao bounds of the model parameters

The Cramér-Rao bound (CRB) provides a lower bound on the variance of any unbiased estimator. We show in Section 4.1.1 that this unbiasedness condition is approximately verified (at least for moderate and high SNRs) for our estimated parameters, and hence, we can use the CRB to assess the performance of our estimation.

Evaluating the performance of the estimation consists of comparing the estimated parameters’ variances with their CRB. Taking into account the dependence of s(t_k) on the parameter vector θ and N samples of x(t_k), (4) can be written using vector notation as

$$ \boldsymbol{x} = \boldsymbol{s}(\boldsymbol{\theta}) + \boldsymbol{w}, $$

(15)

where x is normally distributed with mean μ(θ)=s(θ) =[s(t₀,θ),…,s(t_N−1,θ)]^T and a covariance matrix C(θ)=C=σ²I.

The CRB is defined as the inverse of the Fisher information matrix (FIM). If we assume θ=[p^T,a^T,ϕ^T]^T=[θ₁,…,θ_K]^T,K=n+1+2d (the noise power is assumed known here) and $\mathbf {x} = [x(t_{0}), \dots, x(t_{N-1})]^{T} \in \mathbb {R}^{N}$, then for the general Gaussian case where $\boldsymbol {x} \sim \mathcal {N}(\boldsymbol {\mu }(\boldsymbol {\theta }), \boldsymbol {C}(\boldsymbol {\theta }))$, the FIM is given elementwise by [26]

$$ \begin{array}{*{20}{l}} \left[ \mathbf{F}\left(\boldsymbol{\theta} \right) \right]_{ij} = & \left(\frac{\partial \boldsymbol{\mu} \left(\boldsymbol{\theta} \right)}{\partial \theta_{i}} \right)^{T}\mathbf{C}^{- 1}\left(\boldsymbol{\theta} \right)\left(\frac{\partial \boldsymbol{\mu} \left(\boldsymbol{\theta} \right)} {\partial {\theta_{j}}} \right) \\ & + \frac{1}{2}\text{tr}\left[\mathbf{C}^{- 1}\left(\boldsymbol{\theta} \right)\frac{\partial {\mathbf{C}}\left(\boldsymbol{\theta} \right)} {\partial \theta_{i}}\mathbf{C}^{- 1}\left(\boldsymbol{\theta} \right)\frac{\partial \mathbf{C}\left(\boldsymbol{\theta} \right)} {\partial \theta_{j}} \right], \end{array} $$

(16)

where $\frac {\partial \boldsymbol {\mu } \left (\boldsymbol {\theta } \right)}{\partial {\theta _{i}}} = \left [ \frac {\partial \left [\boldsymbol {\mu } \left (\boldsymbol {\theta } \right)\right ]_{1}}{\partial {\theta _{i}}}, \dots, \frac {\partial \left [\boldsymbol {\mu } \left (\boldsymbol {\theta } \right)\right ]_{N}}{\partial {\theta _{i}}} \right ]^{T}$, and

$$\frac{\partial \mathbf{C}\left(\boldsymbol{\theta} \right)} {\partial \theta_{i}} = \left[ \begin{array}{*{20}{c}} \frac{\partial \left[ \mathbf{C}\left(\boldsymbol{\theta} \right) \right]_{11}} {\partial \theta_{i}} & \cdots & \frac{\partial \left[ \mathbf{C}\left(\boldsymbol{\theta} \right) \right]_{1N}} {\partial \theta_{i}} \\ \vdots & \ddots & \vdots \\ \frac{\partial \left[ \mathbf{C}\left(\boldsymbol{\theta} \right) \right]_{N1}} {\partial \theta_{i}} & \cdots & \frac{\partial \left[ \mathbf{C}\left(\boldsymbol{\theta} \right) \right]_{NN}} {\partial \theta_{i}} \end{array} \right]. $$

The symbol [·]_i denotes element of index i of the corresponding vector, [·]_ij denotes the element of index ij of the corresponding matrix, and tr [·] denotes the trace operator.

For our model where μ(θ)=s(θ) and C(θ)=σ²I (covariance matrix independent of θ), the (elementwise) FIM becomes

$$ \left[ {{\mathbf{F}}\left(\boldsymbol{\theta} \right)} \right]_{ij} = \frac{1}{\sigma^{2}}\left(\frac{\partial \boldsymbol{s}(\boldsymbol{\theta})}{\partial \theta_{i}}\right)^{T} \left(\frac{\partial \boldsymbol{s}(\boldsymbol{\theta})}{\partial \theta_{j}}\right), \quad i, j = 1, \dots, K. $$

(17)

Taking into account (17) and the structure of θ, the FIM can be written using matrix notation in the form of a nineblock matrix representing the partial derivatives with respect to the elements of θ as

$$ \mathbf{F}\left(\boldsymbol{\theta}\right) = \frac{1} {\sigma^{2}}\left[ \begin{array}{*{20}{c}} \left(\frac{\partial \mathbf{s}^{T}} {\partial p_{l}}\frac{\partial \mathbf{s}} {\partial p_{m}} \right) & \left(\frac{\partial \mathbf{s}^{T}} {\partial p_{l}}\frac{\partial \mathbf{s}} {\partial a_{m}} \right) & \left(\frac{\partial \mathbf{s}^{T}} {\partial p_{l}}\frac{\partial \mathbf{s}} {\partial \phi_{m}} \right) \\ \left(\frac{\partial \mathbf{s}^{T}} {\partial a_{l}}\frac{\partial \mathbf{s}} {\partial p_{m}} \right) & \left(\frac{\partial \mathbf{s}^{T}} {\partial a_{l}}\frac{\partial \mathbf{s}} {\partial a_{m}} \right) & \left(\frac{\partial \mathbf{s}^{T}} {\partial a_{l}}\frac{\partial \mathbf{s}} {\partial \phi_{m}} \right) \\ \left(\frac{\partial \mathbf{s}^{T}} {\partial \phi_{l}}\frac{\partial \mathbf{s}} {\partial p_{m}} \right) & \left(\frac{\partial \mathbf{s}^{T}} {\partial \phi_{l}}\frac{\partial \mathbf{s}} {\partial a_{m}} \right) & \left(\frac{\partial \mathbf{s}^{T}} {\partial \phi_{l}}\frac{\partial \mathbf{s}} {\partial \phi_{m}} \right) \end{array} \right], $$

(18)

where l,m∈{1,…,n+1} for the elements of p and l,m∈{1,…,d} for the elements of a and ϕ.

Since this matrix is symmetric (because of the symmetry of the second order partial derivatives), we only have to compute the following terms to find all the elements of the matrix: $\frac {\partial \mathbf {s}^{T}}{\partial p_{l}}\frac {\partial \mathbf {s}}{\partial p_{m}}, \frac {\partial \mathbf {s}^{T}}{\partial p_{l}}\frac {\partial \mathbf {s}}{\partial a_{m}}, \frac {\partial \mathbf {s}^{T}}{\partial p_{l}}\frac {\partial \mathbf {s}}{\partial \phi _{m}}, \frac {\partial \mathbf {s}^{T}}{\partial a_{l}}\frac {\partial \mathbf {s}}{\partial a_{m}}, \frac {\partial \mathbf {s}^{T}}{\partial a_{l}}\frac {\partial \mathbf {s}}{\partial \phi _{m}}, \frac {\partial \mathbf {s}^{T}}{\partial \phi _{l}}\frac {\partial \mathbf {s}}{\partial \phi _{m}}$. After straightforward derivations, we get

$$ {{} \begin{aligned} \frac{\partial \mathbf{s}^{T}}{\partial p_{l}} \frac{\partial \mathbf{s}}{\partial p_{m}} = \sum\limits_{k = 0}^{N - 1} \left(s_{s}(t_{k})e^{\mathbf{p}^{T}\mathbf{t}_{k}} \right)^{2}t_{k}^{l + m}\\ \frac{\partial \mathbf{s}^{T}}{\partial p_{l}} \frac{{\partial {\mathbf{s}}}} {{\partial {a_m}}} = \sum\limits_{k = 0}^{N - 1} {s({t_k}){e^{{{\mathbf{p}}^{T}}{{\mathbf{t}}_k}}}t_{k}^{l}\cos \left({2\pi {f_m}{t_k} + {\phi _m}} \right)}\\ \frac{{\partial {{\mathbf{s}}^{T}}}} {{\partial {p_l}}}\frac{{\partial {\mathbf{s}}}} {{\partial {\phi _m}}} = - \sum\limits_{k = 0}^{N - 1} {s({t_k}){e^{{{\mathbf{p}}^{T}}{{\mathbf{t}}_k}}}t_{k}^{l}{a_m}\sin \left({2\pi {f_m}{t_k} + {\phi _m}} \right)}\\ \frac{{\partial {{\mathbf{s}}^{T}}}} {{\partial {a_l}}}\frac{{\partial {\mathbf{s}}}} {{\partial {a_m}}} = \sum\limits_{k = 0}^{N - 1} e{{({t_k})}^{2}}\cos \left({2\pi {f_l}{t_k} + {\phi _l}} \right) \cos \left({2\pi {f_m}{t_k} + {\phi _m}} \right)\\ \frac{{\partial {{\mathbf{s}}^{T}}}} {{\partial {a_l}}}\frac{{\partial {\mathbf{s}}}} {{\partial {\phi _m}}} = - \sum\limits_{k = 0}^{N - 1} {e{{({t_k})}^{2}}{a_m}\cos \left({2\pi {f_l}{t_k} + {\phi _l}} \right)\sin \left({2\pi {f_m}{t_k} + {\phi _m}} \right)}\\ \frac{{\partial {{\mathbf{s}}^{T}}}} {{\partial {\phi _l}}}\frac{{\partial {\mathbf{s}}}} {{\partial {\phi _m}}} = \sum\limits_{k = 0}^{N - 1} {e{{({t_k})}^{2}}{a_l}{a_m}\sin \left({2\pi {f_l}{t_k} + {\phi _l}} \right)\sin \left({2\pi {f_m}{t_k} + {\phi _m}} \right)}, \\ \end{aligned}} $$

(19)

where ${\mathbf {t}_{k}} = {\left [ {1,{t_{k}}, \ldots,t_{k}^{n}} \right ]^{T}}$, and s(t_k),s_s(t_k),e(t_k) are defined in (1), (2), and (3), respectively. Finally, the CRB is equal to F⁻¹(θ) obtained after inserting expressions (19) in (18) and inverting F(θ).

In the previous CRB derivation, we assumed the noise variance σ² known so that K=n+1+2d. If we assume that σ² is also an unknown parameter to be estimated such that θ^′=[p^T,a^T,ϕ^T,σ²]^T, then the FIM F(θ^′) is equal to the FIM F(θ) augmented with one row and one column corresponding to partial derivatives with respect to σ². Using (16), we get

$$ \mathbf{F}\left(\boldsymbol{\theta'}\right) = \left[ {\begin{array}{*{20}{c}} {\mathbf{F}\left(\boldsymbol{\theta}\right)} & {\mathbf{0}} \\ {\mathbf{0}} & {\frac{N} {{2{\sigma^{4}}}}} \end{array}} \right]. $$

(20)

The CRB is then given by

$$ \text{CRB}(\boldsymbol{\theta'})=\mathbf{F}^{- 1}\left(\boldsymbol{\theta'}\right) = \left[ {\begin{array}{*{20}{c}} {{{\mathbf{F}}^{- 1}}\left(\boldsymbol{\theta}\right)} & {\mathbf{0}} \\ {\mathbf{0}} & {\frac{{2{\sigma^{4}}}} {N}} \end{array}} \right]. $$

(21)

This indicates that it is sufficient to independently compute F⁻¹(θ) and $\frac {2\sigma ^{4}}{N}$ to find F⁻¹(θ^′). It also means that the existence or lack of information about σ² does not affect the performance bound (CRB) of the other desired parameters.

4 Results and discussion

4.1 Estimation performance assessment

4.1.1 Assessment on simulated data

In this section, we present the results of the estimation performance evaluation on simulated data. Hereafter, we present (i) the simulated signal and its parameters, (ii) the bias of the estimated parameters, (iii) the estimated parameters variance and its comparison to the CRB, (iv) the CRB variation with respect to the sampling frequency, and (v) the convergence of the TCE algorithm.

4.1.1.1 Simulated signal and its parameters

Taking the considered setup (n=3 and d=5) in this section, we end up with 14 parameters for the simulated signal. So, with such large number of degrees of freedom, we decided to choose the set of parameters such that the simulated signal will resemble as much as possible real signals, and without a priori knowledge on what parameter values are appropriate, we decided to tweak the model parameters and choose the ones that gave a simulated signal “resembling” (similar waveform) typical real current waveforms from our dataset. The noiseless signal model is

$$ s(t_k) \,=\, e(t_k)s_{s}(t_k) \\ \,=\, \left(e^{{\mathbf{p}^{T}}{\boldsymbol{t}_k}} + 1\right){\sum\limits_{i = 1}^d {{a_i}{\cos\left(2\pi {f_i}t_k + \phi_{i}\right)}}}, $$

where $\mathbf {p} = {\left [ {{p_{0}},{p_{1}}, \ldots,{p_{n}}} \right ]^{T}}, {\boldsymbol {t}_{k}} = {\left [ {1,t_{k}, \ldots,{t_{k}^{n}}} \right ]^{T}}, t_{k} \in [t_{0}, t_{N-1}], a_{i}$ (≥0), ϕ_i∈[−π,π] and f_i=(2i−1)f₀,i=1,…,d with a fixed fundamental frequency f₀=50 Hz. The chosen model parameter values are

F_s=30 kHz: sampling frequency
t₀=0 s, t_N−1=3 s: specify signal duration
n=3: polynomial degree
d=5: number of harmonics
p=[1.9,− 9,8.5,− 4]^T
a=[1.8,0.5,0.2,0.1,0.05]^T
ϕ=[− 3,3,2.5,1.5,1]^T
t_ss1=2.5 s, t_ss2=3 s: define steady state portion.

The polynomial degree is chosen relatively small (we found that n=3 is sufficient to characterize the tested signals, and hence, this value will be used in the remainder of the paper). The previous transient signal is then corrupted by an additive white Gaussian noise of zero mean and variance σ² (varied such that the signal-to-noise ratio (SNR) $=\frac {1/N\sum _{k=0}^{N-1}s(t_{k})^{2}}{\sigma ^{2}}$ varies in the range [0−50] dB). The obtained simulated signal is shown in Fig. 5.

4.1.1.2 Bias of the estimated parameters

As mentioned before, the CRB applies for unbiased estimators. Our maximum likelihood estimation method based on criterion (14) is known to be asymptotically, i.e., for high SNR, unbiased [26]. Here, we evaluate our estimator bias numerically using (1000) Monte-Carlo runs.

Figure 6 gives the bias computed for the parameters estimated using the proposed algorithm versus the signal-to-noise ratio (SNR). We can see that all estimated parameters have negligible bias for SNR values greater than or equal to 30 dB. Between 10 dB and 30 dB, the estimated parameters have very small biases, and below 10 dB, we start getting some bias, nonetheless with values that are still small compared to the true parameter values.

4.1.1.3 Estimated parameters’ variance and its comparison to the CRB

Similarly to the bias computation, the variance is also computed numerically. Figure 7 shows the different parameter variances compared with their respective CRBs. We note that all the parameter variances coincide with their respective CRBs almost perfectly. Hence, our estimation is efficient (unbiased and the variance reaches the CRB).

4.1.1.4 CRB variation with respect to the sampling frequency

Due to the transient behavior of the observed phenomena, a good choice for the sampling frequency F_s of measurements is mandatory. We seek a sufficiently high sampling frequency to catch the transient behavior but not too high to avoid heavy computational load. The CRB allows us to evaluate the impact of the sampling frequency on the parameter variance lower bounds and therefore decide on the desired performance taking into account computational complexity.

Figure 8 gives the variation of the parameters’ CRB as a function of F_s (1 to 100 kHz) on a logarithmic scale. The results show that an increase in F_s results in a better estimation performance (linearly decreasing variance w.r.t. to the sample size) for all the parameters. This is expected, since a higher F_s means more data samples (on a fixed time period) and hence better performance. When considering real signals, however, this is not necessarily true. The white noise (independence) assumption initially verified for relatively low F_s (still high frequency though) might not be verified in practice for higher frequency values. At higher frequencies, the data samples become closer and might become correlated, when the time duration between two samples is too small to assume independence. In that case, the computed CRB assuming a white noise can no longer be used to evaluate the estimation performance.

Practically, finding the adequate sampling frequency is not easy since it depends on different parameters: transient waveforms of interest and their frequency contents, computational complexity, desired performance, etc. According to our experiments, and for the study of turn-on transient signals, a sampling frequency at least equal to 5 kHz is recommended (captures around 50 harmonics) whereas going beyond 100 kHz starts generating heavy data processing. A sampling frequency of 30 kHz seems to be a good compromise, since it captures around 300 harmonics and is less computationally heavy. Hence, our choice was F_s=30 kHz for simulations.

4.1.1.5 Convergence of the TCE algorithm

Since the TCE algorithm uses an optimization algorithm in both its estimation and refinement phases, it is important to check its convergence, especially if there is a need for real-time processing. Hereafter, we check the convergence of the nonlinear optimization algorithm, trust-region-reflective (TRR), for both phases.

Figure 9 gives, for different SNR values, the mean-square-error (MSE) as a function of the number of iterations in the estimation phase of the TCE algorithm. Independently of the SNR, the algorithm converges after ten iterations. The convergence of the TRR algorithm in the refinement phase is even faster. Indeed, the initialization point being better defined, it converges at most after three iterations.

4.1.2 Assessment on real data

4.1.2.1 Real data considerations

Until now, we have implicitly assumed, for simulated data, some simplifying assumptions to test the parameter estimation independently of the performance of other blocs that may condition the estimation. These assumptions are as follows: (A1) a well-defined portion of the steady state, (A2) transient starting from a maximum, and (A3) known polynomial degree n and number of harmonics d. In real situations, however, these assumptions are not necessarily verified for the following reasons: (i) the definition of the steady state portion is affected by the precision of turn-on transient (end) detection and is never perfect (depends on the detector accuracy); (ii) physically, there will always be a latency in the appliance response before the current signal reaches its maximum amplitude; and (iii) the polynomial degree as well as the harmonic numbers are only chosen parameters used to trade-off between complexity and modeling efficiency.

The easiest assumption to get around in a real situation is A2 since we only need to detect the signal maximum amplitude and model the damped part, starting from this maximum (so the portion of transient signal preceding the peak value will be disregarded). For assumption A1, we use the HAND detector [28] built specifically to allow high accuracy detection of turn-on transients. For A3, we relied on our dataset of real life signals to get our ad hoc choice of the “effective” polynomial degree n=3 and “effective” number of harmonics d=5 that have been experimentally shown to be suitable for a good modeling of the considered transient signals. As an example, Fig. 10 shows different plots comparing the real signal x(t) to its estimate $\hat {x}(t)$ for different values of the polynomial degree n (1, 3, 5 and 7). We note the improvement of the root-mean-square error (RMSE) between n=1 and n=3, hence a better estimation using n=3, and a slight improvement of the RMSE between n=3,n=5, and n=7. We consider n=3 to be a good trade-off between model complexity and the estimation performance (less than 10% of relative RMSE difference, e.g., between n=3 and n=5, we have a relative RMSE difference of $\frac {0.2473-0.2433}{0.2473}\approx 1.6\%$).

As an aside, note that the particular “two-steps” structure of the proposed parameter estimation algorithm (Section 2.2) is motivated by the highly nonlinear optimization problem involved. Such a two-step approach helps to avoid local minima by providing a good initialization point in the first step, then refining the obtained estimate in the second step. For completeness, we have also tried to improve the estimate of f₀ by jointly estimating it when (i) estimating $\hat {\boldsymbol {p}}$ (13), (ii) when refining the estimation of all parameters (14), and (iii) in both (i) and (ii). This, however, did not improve the results, indicating that the proposed approach already leads to near optimal values (due in part to the highly precise estimate of f₀ obtained using [27]). As an example, we have conducted the joint estimations described above considering the real signal used in Fig. 10 with n=3 that gave initially RMSE = 0.2473. The newly obtained results, in terms of RMSE, were 0.2473, 0.2479, and 0.2479, respectively for (i), (ii), and (iii). The joint estimation of f₀ was not considered further as it would generate more computational load without performance gain.

4.1.2.2 Estimation with TCE on a real signal of the COOLL dataset

The real signal is taken from a turn-on transient dataset we built especially for transients analysis. The dataset is called Controlled On/Off Loads Library (COOLL) [32] and is freely available on the internet (https://coolldataset.github.io/). Since the measurement system [33] (Fig. 11) used to collect the dataset’s signals allows the control over the turn-on/off, we know exactly the turn-on/off time instants and assumptions A1 and A2 hold then true. Moreover, we consider the signal starting from its maximum in order to verify A3.

The COOLL dataset signals (Table 1) are sampled at F_s=100 kHz^{Footnote 3}. The dataset consists of turn-on transient current and voltage signals of 12 different electrical appliances and each appliance has 20 signal examples. Figure 12 shows a typical histogram of the noise on a measured current signal taken from its pre-turn-on part (noise only). This shows that the noise distribution for the COOLL current signals is Gaussian with zero mean and a standard deviation of 2.2 mA (equivalent to an approximate power consumption of 0.5 W).

Table 1 COOLL dataset summary

Full size table

Next, we provide an illustrative example corresponding to a test signal of a fan (Fig. 13). The total duration of the measurement is 6 s with a 0.5 s of pre-turn-on. The estimation results of TCE on the fan signal (Fig. 13) are as follows:

$$\begin{array}{@{}rcl@{}}{l} \hat{\mathbf{p}} = \left[\begin{array}{l} -1.15\\ -0.19\\ 0.20\\ -0.32 \end{array}\right], \hat{\mathbf{a}} = \left[\begin{array}{l} 0.21\\ 7.3\times 10^{-3}\\ 6.4\times 10^{-3}\\ 4.7\times 10^{-3}\\ 1.0\times 10^{-3} \end{array}\right], \hat{\boldsymbol{\phi}}= \left[\begin{array}{l} -3.10\\ 2.94\\ 2.72\\ 0.22\\ 1.27 \end{array}\right],\\ \text{RMSE} =\sqrt{\frac{1}{N}\sum_{n=0}^{N-1} |x(t_n)-s(\hat{\boldsymbol{\theta}}, t_n)|^{2}} = 7.1\times 10^{-3}~A, \end{array} $$

where RMSE is the root-mean-square-error. The above estimation results indicate little information on the estimation quality, especially that we are applying the algorithm on a real signal. Nonetheless, the RMSE gives an idea about the estimation quality but is still without much meaning if not considered relative to some reference value. Here, we propose to compare it to the average maximum value of the steady state amplitude (around 0.2 A). We get a relative RMSE of 3.6%. Note that we got an average relative RMSE of around 8% for the whole dataset, which is acceptable considering the variability of real signals.

Figure 14 allows to get a visual feel for the estimation quality. Figure 14a and b show a good fit between the reconstructed signal and the originalzone.

4.2 Classification of cOOLL dataset’s appliances using the model parameters

Here, we propose to classify the appliances of the COOLL dataset using the model parameters. We use the classical supervised k-nearest neighbors algorithm (k-NN) [34, Chap. 13], which proceeds by taking the test example (here the vector of parameters representing the test signal) and classifying it according to a majority vote of the k-nearest examples (of the training dataset). We used the Euclidean distance as a distance metric. We assess the result using K-fold cross-validation with K=10. This validation works by first partitioning the dataset to K equal partitions (in our case each partition contains 84 example), then take one partition for testing and keep the other nine partitions for training, and we assess the performance using for example the classification accuracy (CA). This process is repeated K times, taking at each time a different partition for testing and the remaining nine for training. The final result is the average of the K accuracy results.

Note that the estimated values of the phase parameters ϕ_i are too random to be considered as features for the classification and, hence, are discarded hereafter. We apply the k-NN on the data using the estimated $\hat {p}_{j}, j = 0, \dots, 3$ and $\hat {a}_{i}, i = 1, \dots, 5$. The results are presented as a confusion matrix (Fig. 15).

The classification accuracy (CA) is given in the bottom rightmost corner. It is defined as $CA = \frac {TP}{\text {Tot}}$ where TP is the number of true positives (i.e., examples correctly classified) and Tot is the total number of considered examples. Figure 15 also gives the values of two largely used performance metrics for classification known as recall (rightmost column) and precision (bottom row). These are defined as $\text {recall} = \frac {TP}{RP}$ and $\text {precision} = \frac {TP}{CP}$ where RP is the number of relevant positives (i.e., examples belonging to the true class), and CP is the number of examples classified as positives. Note that these metrics depend on the relevant (considered) appliance class and are, hence, recomputed for each class. To illustrate this, consider the first row of the confusion matrix (Fig. 15) corresponding to the true class “drill.” Here, $\text {recall} = \frac {103}{120} = 85.8\%$ (i.e., 103 examples are correctly classified among a total of 120 relevant positives (row sum)). Similarly, for the first column corresponding to the predicted (classified) class “drill,” we have $\text {precision} = \frac {103}{121} = 85.1\%$ (i.e., 103 examples are correctly classified among 121 classified as a drill (column sum)).

We obtain a CA of 92.4%. Although the CA is higher than 92%, we expect a less variable characteristic feature capturing the envelope shape to be more relevant for the classification. In fact, the $\hat {p}_{j}$ are sensitive to the chosen origin of time (except the last parameter $\hat {p}_{n}$) and their estimated values are less stable due to the difficulty of precisely defining the origin of time for transient signals. To remedy this, we need to construct a new feature that is independent of the time origin and that still is characteristic of the envelope shape. We propose to use the minimum radius of curvature of the estimated envelope signal $\hat {e}(t)$ constructed using the $\hat {p}_{j}$ parameters. For a function f(t), the radius of curvature at point t₀ is defined as [35]

$$ R(t_{0}) = \left| \frac{(1+f'(t_{0})^{2})^{3/2}}{f\prime\prime(t_{0})} \right| $$

(22)

where f^′(t₀) and f^″(t₀) are the first and second derivatives of f(t) at point t₀, respectively. Practically, we compute this value for each sample point of $\hat {e}(t_{k})$ and take the minimum value R_min. This minimum value is inversely proportional to the maximum curvature, which is a distinctive feature of the turn-on transient signals as can be seen in Fig. 16.

It is important to notice that the phase of the grid (referred to as action delay) when switching-on the appliance might affect the shape of the transient signal envelope. To investigate the influence of the phase of the grid on the chosen minimum radius of curvature, we evaluate the variation of the latter parameter for different delays as illustrated in Fig. 17 for a drill and a vacuum cleaner. As can be seen from these plots, the minimum radius of curvature remains relatively stable w.r.t. action delay parameter values. This observation is valid for most of the electrical appliances.

The classification results using R_min and $\hat {a}_{i}$ are shown in Fig. 18. Compared to the previous result, we obtain an improvement of 5.6%, with a CA of 98.0%. Another performance metric used often in assessing classification performance is the F1 score defined as $2\frac {\text {precision}\times \text {recall}}{\text {precision}+ \text {recall}}$. This metric can be computed for each appliance and gives a single number assessing the performance, which is especially helpful when comparing different classifiers or, as is the case here, the result of two different sets of features. The F1 score results for our classification are given in Table 2. These results show an improvement in the F1 score for almost all the appliances when using the set of features $\{R_{\text {min}}, \hat {a}_{i}\}$ compared to the set of features $\{\hat {p}_{j}, \hat {a}_{i}\}$.

Table 2 Comparative F1 scores of the different COOLL appliances for the classification result using the two sets of features $\{\hat {p}_{j}, \hat {a}_{i}\}$ and $\{R_{\text {min}}, \hat {a}_{i}\}, j=0,\dots, 3$, and i=1,…,5

Full size table

5 Conclusion

We proposed in this paper a new mathematical representation suitable for modeling turn-on transient current signals and proposed an algorithm for the model parameter estimation. The efficiency of the algorithm is assessed theoretically via benchmarking its estimation error variances with respect to the CRB derived in Section 3 of this paper. Later on, the proposed parametric model is validated using real data from the COOLL dataset that we developed specifically for this research work. A “good” fitting between the proposed signal model and the real-life signals has been observed with, in particular, an average relative mean-square-error of about 8%. Note also that our experimental tests showed the need for estimating the fundamental frequency due to its deviation from the nominal value (i.e., 50 Hz). A classification method using the model parameters has been proposed. The obtained results show the usefulness of the transient signal parameters as relevant features for the characterization of electrical appliances with a correct classification accuracy of 98% in the considered context.

The proposed model is valid for a lot of electrical appliances that show a single-phase behavior during the turn-on such as incandescent light bulbs, compact fluorescent lamps, heaters, vacuum cleaners, and hairdryers. However, some electrical appliances may have a turn-on transient current signal consisting of different phases each with a distinct signal content (harmonics with different amplitudes and phases) and a distinct envelope shape corresponding to the different regimes that the appliance goes through during turn-on. For instance, the microwave turn-on transient shown in Fig. 19 has two phases (some microwaves may have more than two) each with its specific characteristics. As a perspective work, one can consider using our model to characterize each phase independently (as a single-phase appliance) and then devise some rule to identify the corresponding multi-phase appliance (e.g., considering the occurrence of the different phases in a time series).

Availability of data and materials

The dataset created and used during the current study is freely available at https://coolldataset.github.io/.

Notes

The European norm “EN 50160” [25] fixes the acceptable variation ranges for δf₀. For the synchronous grid of Continental Europe—the largest synchronous (same frequency) grid in the world linking most of Europe’s countries and some countries of north Africa—these ranges are ± 1% of f₀ (δf₀=[− 0.5,+ 0.5] Hz) 99.5% of a year and − 6%/+ 4% of f₀ (δf₀=[− 3,+ 2] Hz) 100% of the time. The latter range is made large to account for occasional high variations.
Based on our real data measurements, we have observed that a polynomial order n=3 is sufficient to model properly the considered transient signals.
After measurements were done we found that F_s=30 kHz would have been enough to capture the transient behavior of interest.

Abbreviations

CA:: Classification accuracy
COOLL:: Controlled On/Off Loads Library
CP:: Classified as positives
CRB:: Cramér-Rao bound
ESPRIT:: Estimation of Signal Parameters via Rotational Invariance Techniques
FIM:: Fisher information matrix
HAND:: High Accuracy NILM Detector
k-NN:: k-nearest neighbors
LM:: Levenberg-Marquardt
LS:: Least squares
MSE:: Mean-square-error
MUSIC:: Multiple Signal Classification
NILM:: Non-intrusive load monitoring
RMSE:: Root-mean-square-error
RP:: Relevant positives
SDS:: Superimposed damped sinusoids
SNR:: Signal-to-noise ratio
TCE:: Transient current estimation
TP:: True positives
TRR:: Trust-region-reflective

References

C. A. García, A. Otero, X. Vila, D. G. Márquez, A new algorithm for wavelet-based heart rate variability analysis. Biomed. Signal Process. Control.8(6), 542–550 (2013). https://doi.org/10.1016/j.bspc.2013.05.006.
Article Google Scholar
X. Chen, H. Wen, Q. Li, T. Wang, S. Chen, Y. -P. Zheng, Z. Zhang, Identifying transient patterns of in vivo muscle behaviors during isometric contraction by local polynomial regression. Biomed. Signal Process. Control.24:, 93–102 (2016). https://doi.org/10.1016/j.bspc.2015.09.009.
Article Google Scholar
T. P. Exarchos, A. T. Tzallas, D. I. Fotiadis, S. Konitsiotis, S. Giannopoulos, EEG transient event detection and classification using association rules. IEEE Trans. Inf. Technol. Biomed.10(3), 451–457 (2006). https://doi.org/10.1109/TITB.2006.872067.
Article Google Scholar
A. Belsak, J. Flasker, Adaptive wavelet transform method to identify cracks in gears. EURASIP J. Adv. Signal Proc.2010(1), 879875 (2010). https://doi.org/10.1155/2010/879875.
Article Google Scholar
C. Capilla, Application of the Haar wavelet transform to detect microseismic signal arrivals. J. Appl. Geophys.59(1), 36–46 (2006). https://doi.org/10.1016/j.jappgeo.2005.07.005.
Article Google Scholar
X. Li, Z. Li, E. Wang, J. Feng, L. Chen, N. Li, X. Kong, Extraction of microseismic waveforms characteristics prior to rock burst using hilbert–huang transform. Measurement. 91:, 101–113 (2016). https://doi.org/10.1016/j.measurement.2016.05.045.
Article Google Scholar
J. Seymour, T. Horsley, The seven types of power problems. White paper. 18:, 1–21 (2005).
Google Scholar
M. H. J. Bollen, E. Styvaktakis, I. Y. -H. Gu, Categorization and analysis of power system transients. IEEE Trans Power Deliv.20(3), 2298–2306 (2005). https://doi.org/10.1109/TPWRD.2004.843386.
Article Google Scholar
S. Wang, Z. K. Zhu, Y. He, W. Huang, Adaptive parameter identification based on Morlet wavelet and application in gearbox fault feature detection. EURASIP J. Adv. Signal Process.2010(1), 842879 (2010). https://doi.org/10.1155/2010/842879.
Article Google Scholar
W. Jiao, S. Qian, Y. Chang, S. Yang, Research on vibration response of a multi-faulted rotor system using LMD-based time-frequency representation. EURASIP J. Adv. Signal Process.2012(1), 73 (2012). https://doi.org/10.1186/1687-6180-2012-73.
Article Google Scholar
S. B. Leeb, S. R. Shaw, J. L. Kirtley Jr, Transient event detection in spectral envelope estimates for nonintrusive load monitoring. Power Deliv. IEEE Trans.10(3), 1200–1210 (1995).
Article Google Scholar
C. Laughman, K. Lee, R. Cox, S. Shaw, S. Leeb, L. Norford, P. Armstrong, Power signature analysis. Power Energy Mag. IEEE. 1(2), 56–63 (2003).
Article Google Scholar
H. -H. Chang, H. -T. Yang, Applying a non-intrusive energy-management system to economic dispatch for a cogeneration system and power utility. Appl. Energy. 86(11), 2335–2343 (2009).
Article Google Scholar
R. Kumaresan, D. Tufts, Estimating the parameters of exponentially damped sinusoids and pole-zero modeling in noise. IEEE Trans. Acoust. Speech. Signal Process.30(6), 833–840 (1982).
Article Google Scholar
L. Lovisolo, M. P. Tcheou, E. A. B. da Silva, M. A. M. Rodrigues, P. S. R. Diniz, Modeling of electric disturbance signals using damped sinusoids via atomic decompositions and its applications. EURASIP J. Adv. Signal Process.2007(1), 029507 (2007). https://doi.org/10.1155/2007/29507.
Article Google Scholar
R. Boyer, K. Abed-Meraim, Audio modeling based on delayed sinusoids. IEEE Trans. Speech Audio Process.12(2), 110–120 (2004). https://doi.org/10.1109/TSA.2003.819953.
Article Google Scholar
D. V. Rubtsov, J. L. Griffin, Time-domain Bayesian detection and estimation of noisy damped sinusoidal signals applied to NMR spectroscopy. J Magn. Reson. 188(2), 367–379 (2007). https://doi.org/10.1016/j.jmr.2007.08.008.
Article Google Scholar
M. A. Al-Radhawi, K. Abed-Meraim, Parameter estimation of superimposed damped sinusoids using exponential windows. Signal Process.100:, 16–22 (2014). https://doi.org/10.1016/j.sigpro.2013.12.025.
Article Google Scholar
R. Prony, Essai expérimental et analytique : sur les lois de la dilatabilité des fluides élastiques et sur celles de la force expansive de la vapeur de l’eau et de la vapeur de l’alkool, à différentes températures. J. de l’École Polytechnique Floréal et Plairial. 1(22), 24–76 (1795).
Google Scholar
V. F. Pisarenko, The retrieval of harmonics from a covariance function. Geophys. J. Int.33(3), 347–366 (1973).
Article Google Scholar
Y. Hua, T. K. Sarkar, Matrix pencil method for estimating parameters of exponentially damped/undamped sinusoids in noise. Acoust. Speech. Signal Process. IEEE Trans.38(5), 814–824 (1990).
Article MathSciNet Google Scholar
R. Roy, T. Kailath, ESPRIT—estimation of signal parameters via rotational invariance techniques. Acoust. Speech. Signal Process. IEEE Trans.37(7), 984–995 (1989).
Article Google Scholar
R. Schmidt, Multiple emitter location and signal parameter estimation. IEEE Trans. Antennas Propag.34(3), 276–280 (1986).
Article Google Scholar
E. K. Howell, How switches produce electrical noise. Electromagn. Compat. IEEE Trans.EMC-21(3), 162–170 (1979).
Article Google Scholar
CENELEC, Voltage characteristics of electricity supplied by public electricity networks. European Standard EN 50160 (2010).
S. M. Kay, Fundamentals of Statistical Signal Processing, Volume I: Estimation Theory (Prentice Hall PTR, Upper Saddle River, 1993).
MATH Google Scholar
E. Aboutanios, B. Mulgrew, Iterative frequency estimation by interpolation on fourier coefficients. IEEE Trans. Signal Process.53(4), 1237–1242 (2005).
Article MathSciNet Google Scholar
M. Nait Meziane, P. Ravier, G. Lamarque, J. -C. Le Bunetel, Y. Raingeaud, in 42nd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). High accuracy event detection for non-intrusive load monitoring (IEEE, 2017), pp. 2452–2456. https://doi.org/10.1109/ICASSP.2017.7952597.
T. F. Coleman, Y. Li, On the convergence of interior-reflective newton methods for nonlinear minimization subject to bounds. Math. Program.67(1), 189–224 (1994). https://doi.org/10.1007/BF01582221.
Article MathSciNet Google Scholar
T. F. Coleman, Y. Li, An interior trust region approach for nonlinear minimization subject to bounds. SIAM J. Optim.6(2), 418–445 (1996). https://doi.org/10.1137/0806023.
Article MathSciNet Google Scholar
Y. -x. Yuan, in Iciam, 99. A review of trust region algorithms for optimization (Citeseer, 2000), pp. 271–282.
T. Picon, M. Nait Meziane, P. Ravier, G. Lamarque, C. Novello, J. -C. Le Bunetel, Y. Raingeaud, COOLL: Controlled on/off loads library, a public dataset of high-sampled electrical signals for appliance identification. arXiv preprint arXiv:1611.05803 [cs.OH] (2016).
M. Nait Meziane, T. Picon, P. Ravier, G. Lamarque, J. -C. Le Bunetel, Y. Raingeaud, in Conference on Environment and Electrical Engineering (EEEIC), 2016 Proceedings of the 16th IEEE International. A measurement system for creating datasets of on/off-controlled electrical loads, (2016), pp. 2579–2583.
J. Friedman, T. Hastie, R. Tibshirani, The Elements of Statistical Learning. vol. 1 (Springer, New York, 2001).
MATH Google Scholar
J. D. Lawrence, A Catalog of Special Plane Curves (Courier Corporation, North Chelmsford, 1972).
MATH Google Scholar

Download references

Acknowledgements

Not applicable.

Funding

This current study was supported in part by the Région Centre-Val de Loire (France) through the project MDE-MAC3 (Contract no 2012 00073640).

Author information

Authors and Affiliations

PRISME Laboratory, University of Orléans, 12 rue de Blois, Orléans, 45067, France
Mohamed Nait-Meziane, Philippe Ravier, Karim Abed-Meraim & Guy Lamarque
GREMAN Laboratory, UMR 7347 CNRS–University of Tours, 20 avenue Monge, Tours, 37200, France
Jean-Charles Le Bunetel & Yves Raingeaud

Authors

Mohamed Nait-Meziane
View author publications
You can also search for this author in PubMed Google Scholar
Philippe Ravier
View author publications
You can also search for this author in PubMed Google Scholar
Karim Abed-Meraim
View author publications
You can also search for this author in PubMed Google Scholar
Guy Lamarque
View author publications
You can also search for this author in PubMed Google Scholar
Jean-Charles Le Bunetel
View author publications
You can also search for this author in PubMed Google Scholar
Yves Raingeaud
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

MNM, PR, KAM, and GL conceived and designed the experiments, analyzed the data, and interpreted the results. MNM performed the experiments and wrote the manuscript. MNM, PR, and KAM contributed in developing the model and parameter estimation algorithm. JCLB and YR provided their expertise for the power grid aspects of the experiments. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Mohamed Nait-Meziane.

Ethics declarations

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Nait-Meziane, M., Ravier, P., Abed-Meraim, K. et al. Electrical transient modeling for appliance characterization. EURASIP J. Adv. Signal Process. 2019, 55 (2019). https://doi.org/10.1186/s13634-019-0644-2

Download citation

Received: 27 March 2019
Accepted: 03 September 2019
Published: 20 November 2019
DOI: https://doi.org/10.1186/s13634-019-0644-2

Electrical transient modeling for appliance characterization

Abstract

1 Introduction

2 Methods

2.1 Data modeling

2.2 Parameter estimation algorithm

2.2.1 Fundamental frequency estimation

2.2.2 Transient current estimation (TCE) algorithm

2.2.2.1 Pre-specified quantities

2.2.2.2 Initialization phase

2.2.2.3 Parameter estimation phase

3 Cramér-Rao bounds of the model parameters

4 Results and discussion

4.1 Estimation performance assessment

4.1.1 Assessment on simulated data

4.1.1.1 Simulated signal and its parameters

4.1.1.2 Bias of the estimated parameters

4.1.1.3 Estimated parameters’ variance and its comparison to the CRB

4.1.1.4 CRB variation with respect to the sampling frequency

4.1.1.5 Convergence of the TCE algorithm

4.1.2 Assessment on real data

4.1.2.1 Real data considerations

4.1.2.2 Estimation with TCE on a real signal of the COOLL dataset

4.2 Classification of cOOLL dataset’s appliances using the model parameters

5 Conclusion

Availability of data and materials

Notes

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher’s Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords