Fault diagnosis of Tennessee Eastman process using signal geometry matching technique

Li, Han; Xiao, De-yun

doi:10.1186/1687-6180-2011-83

Research
Open access
Published: 10 October 2011


EURASIP Journal on Advances in Signal Processing volume 2011, Article number: 83 (2011)

Abstract

This article employs an adaptive rank-order morphological filter to develop a pattern classification algorithm for fault diagnosis in a benchmark chemical process, the Tennessee Eastman process. Rank-order filtering has the desirable properties of handling nonlinearities and preserving details in complex processes. Building on these benefits, the proposed algorithm achieves pattern matching by using a one-dimensional adaptive rank-order morphological filter to process unrecognized signals under the supervision of different standard signal patterns. The matching degree is characterized by the error between the standard signal and the filter output signal. The initial parameter settings of the algorithm can be chosen randomly and are then tuned adaptively so that the output approaches the standard signal as closely as possible. A data fusion technique is also used to combine diagnostic results from multiple sources. Different fault types in the Tennessee Eastman process are studied to demonstrate the effectiveness and advantages of the proposed method. The results show that, compared with many typical multivariate statistics based methods, the proposed algorithm performs better on the diagnosis of deterministic faults.

1. Introduction

Recent decades have seen modern large-scale processes grow in complexity and multiplicity in industries such as chemical, metallurgical, mechanical, and logistics. These processes are generally characterized by long process flows, large operating scales, and complicated mechanisms. Typical features are strong nonlinearity, long time delays, and heavy correlation among measurements [1]. Process monitoring, which aims to ensure that operations satisfy performance specifications and to indicate anomalies, has become a major challenge in practice. First, the process expertise required by model-based methods often poses difficulties for operators who do not specialize in this area; second, methods based on system identification theory need to postulate specific mathematical models, which cannot capture varied nonlinearities. In addition, because of the growing number of sensors installed in processes, the quantity of data generated under different conditions is larger by a few orders of magnitude or more than in small-scale processes [2]. The fundamental dilemma for process monitoring is that knowledge is insufficient to establish a relatively accurate mathematical description of the process, while the methodology for exploiting abundant data to reveal process mechanisms and operational status is incomplete. In large-scale processes, standard PI (proportional-integral) or PID (proportional-integral-derivative) closed-loop control schemes are often adopted to compensate for disturbances and outliers. However, excessive compensation can easily overburden the controllers, and a trivial glitch could eventually develop into a catastrophic fault.
Given practical limits, the demands of safe operation and cost optimization, and business opportunities in technical development, the problem of how to use the large amount of process data more effectively to meet the increasing demand for system reliability has received intensive attention from academics and practitioners in related areas. Among all the tasks, data-driven fault diagnosis, which uses data to detect and identify faults, is one of the most interesting research domains.

In extensively cited earlier literature, Venkatasubramanian proposed a classical division of diagnostic techniques into three subclasses: quantitative model-based methods, qualitative model-based methods, and process history based methods [3–5]. Viewed from a new perspective, data-driven fault diagnosis includes not only a large part of the process history based techniques but also some that belong to qualitative model-based methods. To treat data-driven methods as an integrated type, we re-divide fault diagnosis methods into three subclasses, namely analytical model-based methods, qualitative knowledge-based methods, and data-driven based methods (DDBM), where DDBM can be further divided into data transform based methods (DTBM) and data reasoning based methods (DRBM). Figure 1 illustrates the proposed classification. In general, DDBM are used when the information available is insufficient to form a mechanism model. These methods employ process data from the dynamic system to perform fault detection, diagnosis, identification, and location. DTBM, more specifically, adopt linear or nonlinear mathematical transforms to map the original data into another form, and the transforms are often reversible. The transformed data may have no clear physical meaning, but they are more practical to work with. The key concept of data transform lies in two attributes: a deterministic transform paradigm and the realization of data compression. With this concept, the scope of DTBM is smaller and more focused than that of DDBM, and the purpose of data utilization is more specific. DTBM also require neither in-depth knowledge of the system structure nor the accumulated experience and reasoning knowledge that DRBM need. Besides, DTBM algorithms are easy to understand and implement, but a drawback is that they may be less robust than model-based methods.
Dimension transformation (often dimension reduction), filtering, decomposition, and nonlinear mapping are recognized as common tools for data transform.

In Figure 1, signal processing is categorized as a data transform methodology that covers a wide range of techniques. Typical ones are filtering and multilayer signal decomposition, such as wavelet analysis and the Hilbert-Huang transform, both of which require preset models and carefully selected parameters. Morphological signal processing, however, offers a different viewpoint. It derives from rank-order based data sorting and modifies the geometric shape of a signal to achieve filtering [6]. This feature may offer better noise reduction and detail preservation than linear tools when treating measurements in complex processes [7]. Moreover, Salembier [8] analyzed how the performance of a rank-order based filter can be adaptively optimized in terms of the filter mask and rank value. Based on these investigations, morphological signal processing, as a nonlinear data transform tool, may be suitable for constructing a feature extractor for pattern matching.

In our previous (unpublished) work, we developed Salembier's idea [8] to adaptively adjust the flat structuring element and rank parameter for each sample, rather than adopting uniform ones for all the samples in a sampled sequence. Based on this idea, we designed a signal geometry matching approach: pattern classification using a one-dimensional adaptive rank-order morphological filter for fault diagnosis, named the PC1DARMF approach. The proposed method belongs to DTBM, and its major parameters can be chosen randomly, which is an advantage over DTBM that need predefined parameters. This article applies the PC1DARMF approach to a more complex and challenging application: the Tennessee Eastman process (TEP). TEP is a classic model of an industrial chemical process that is widely studied in the literature for validating newly developed control or process monitoring strategies. It is a typical large-scale process with the features described previously. The fact that many data-driven diagnostic methods have been applied to TEP also makes it possible to compare their performance with that of the method proposed in this article.

The remainder of this article is organized as follows. Section 2 derives the pattern classification method using the adaptive rank-order morphological filter and discusses key implementation issues. An example gives a step-by-step realization of the method to make it easier to understand. Section 3 gives a basic introduction to TEP and reviews previous TEP fault diagnosis methods. Section 4 presents the diagnosis results for different simulated TEP faults with detailed analysis. Comparisons between the proposed method and typical multivariate statistics based approaches highlight the advantages and features of PC1DARMF. The last section presents the conclusion and discussion.

2. Signal geometry matching based on adaptive rank-order morphological filter

2.1. One-dimensional adaptive rank-order morphological filter (1DARMF)

The adaptive rank-order morphological filter is derived from a nonlinear signal processing tool referred to as the rank-order based filter (ROBF). ROBF first reads a certain number of input values, then sorts them in ascending order and determines the output value according to the predefined rank parameter in the sorted set. The basic definition of the one-dimensional (1D) ROBF was first given in [9]: let x_i be a discrete sampled signal defined on a 1D space Z and M be a 1D mask containing N points (|M| = N, where |·| denotes set cardinality). Define j as an index belonging to the mask M and r as the normalized rank parameter of the filter (0 ≤ r ≤ 1). With the rank-order operator denoted by f_r,M[x_i], the output y_i of ROBF can be formulated as (1):

y_{i} = f_{r, M} [x_{i}] = Rank_{n} \{x_{i - j} \mid j \in M\}

(1)

where the elements of a set X are sorted in ascending order and Rank_n{X} denotes the n th ordered value in X (n is the nearest integer to (N - 1)r + 1), and x_i-j denotes all the points that belong to the range of mask M centered on i (e.g., if j = -3, -2, -1, 0, 1, 2, 3, then i - j = i - 3, ..., i + 3). This operation underlies both the median filter and the morphological filter with a flat structuring element [8, 9]. However, its drawback is that the selection of the filter mask and rank parameter relies heavily on practical experience and intuition. To address this, an adaptive form named the adaptive rank-order morphological filter was proposed [8, 9]. It adapts the filter mask and rank parameter in order to minimize a criterion such as the MAE (mean absolute error) or the MSE (mean squared error). The problem of designing an adaptive rank-order morphological filter can be briefly stated as follows: given a noisy signal x_i and a desired signal d_i, and adopting the ROBF f_r,M, the aim is to find, by iterative learning, the rank parameter r and filter mask M that minimize a cost function C between the output y_i and d_i. To explain the procedure of building 1DARMF, we first introduce how the operation of ROBF is formulated.
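As an illustration of definition (1), the following is a minimal Python sketch of a fixed (non-adaptive) ROBF. The function name `rank_order_filter`, the clamping of out-of-range indices at the signal ends, and the rounding of ties upward are assumptions made for this sketch; the text does not specify boundary handling or tie rounding.

```python
import numpy as np

def rank_order_filter(x, offsets, r):
    """Rank-order based filter of (1): y_i = Rank_n{x_{i-j} | j in M}.

    x       : 1D input signal
    offsets : the shifts j that form the mask M
    r       : normalized rank parameter, 0 <= r <= 1
    """
    x = np.asarray(x, dtype=float)
    offsets = np.asarray(offsets, dtype=int)
    N = len(offsets)
    # n is the nearest integer to (N - 1) r + 1 (1-based rank).
    # Assumption: exact halves are rounded upward.
    n = int(np.floor((N - 1) * r + 0.5)) + 1
    y = np.empty_like(x)
    for i in range(len(x)):
        # Assumption: indices outside the signal are clamped to its ends.
        idx = np.clip(i - offsets, 0, len(x) - 1)
        y[i] = np.sort(x[idx])[n - 1]
    return y

x = [3, 1, 4, 1, 5, 9, 2, 6]
mask = [-1, 0, 1]
print(rank_order_filter(x, mask, 0.0))  # r = 0: windowed minimum
print(rank_order_filter(x, mask, 0.5))  # r = 0.5: median filter
print(rank_order_filter(x, mask, 1.0))  # r = 1: windowed maximum
```

With a flat mask, r = 0 and r = 1 give the windowed minimum and maximum, and r = 0.5 gives the median filter, which matches the remark that this operation underlies both filter families.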

First, to overcome the optimization difficulty caused by the discrete nature of the parameters, the rank parameter r is optimized in a continuous, normalized manner, with n in Rank_n{X} taken as the nearest integer to (N - 1)r + 1. Second, for the optimization of the filter mask M, a search area A, selected to be larger than the optimum mask, is introduced and a continuous value m^(j) is assigned to every j ∈ A. The filter mask for the next iteration is then determined by comparing the set of continuous values associated with the current filter mask against a preset threshold (denoted thm_M). If the value assigned to any j ∈ A is greater than the threshold, the location associated with that j belongs to the filter mask. With the search area A and the continuous value assignments, the optimization of the filter mask M is converted from modifying binary values (belonging or not belonging to the mask) to modifying the continuous values m^(j).

With the parameters now updated continuously, we proceed to establish a mathematical relationship that involves the filter input, the output, and the parameters together. Let S be the sum of the signs of (x_i-j - y_i) over all j ∈ M. It can be expressed as

S = \sum_{j \in M} sgn (x_{i - j} - y_{i})

(2)

It is easy to see that if r = 0, y_i is the minimum of {x_i-j | j ∈ M} and S equals N - 1; if r = 0.5, y_i is the median of {x_i-j | j ∈ M} and S = 0; if r = 1, y_i is the maximum of {x_i-j | j ∈ M} and S = -(N - 1). Based on these mappings between S and r, and assuming that the two are linearly related, the general expression of S with respect to r is

S = - (2 r - 1) (N - 1)

(3)
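Relation (3) can be checked numerically. The sketch below uses a hypothetical window of N = 7 distinct values (chosen only for illustration) and compares the sign sum of (2) with the right-hand side of (3). For distinct samples and values of r for which (N - 1)r is an integer, the two sides agree exactly, since the n th ordered value has n - 1 samples below it and N - n above it.

```python
import numpy as np

def sign_sum(window, r):
    """Compute S of (2) for one window, with y taken as the rank-n value."""
    window = np.sort(np.asarray(window, dtype=float))
    N = len(window)
    # Nearest integer to (N - 1) r + 1; halves rounded upward (assumption).
    n = int(np.floor((N - 1) * r + 0.5)) + 1
    y = window[n - 1]
    return int(np.sum(np.sign(window - y)))

# Hypothetical window with distinct values, N = 7.
window = [0.3, -1.2, 2.5, 0.9, -0.4, 1.7, 0.1]
N = len(window)
for r in (0.0, 0.5, 1.0):
    lhs = sign_sum(window, r)          # S from (2)
    rhs = -(2 * r - 1) * (N - 1)       # S from (3)
    print(r, lhs, rhs)
```

When the window contains repeated values, or when (N - 1)r is not an integer, (3) holds only approximately, which is consistent with the text treating linearity as an assumption.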

With thm_M set to 0, if (sgn(m^(j) - thm_M) + 1)/2 = 1, then m^(j) > thm_M, which means j ∈ M; if (sgn(m^(j) - thm_M) + 1)/2 = 0, then m^(j) < thm_M and j ∈ M^c. Noting that every j is selected from A and taking (sgn(m^(j) - thm_M) + 1)/2 (i.e., (sgn(m^(j)) + 1)/2) as the weight, combining (2) and (3) gives

S = \sum_{j \in A} \frac{1}{2} (sgn (m^{(j)}) + 1) sgn (x_{i - j} - y_{i}) = - (2 r - 1) [\sum_{j \in A} \frac{1}{2} (sgn (m^{(j)}) + 1) - 1]

(4)

F (m^{(j)}, x_{i - j}, y_{i}, r) = \sum_{j \in A} \frac{1}{2} (sgn (m^{(j)}) + 1) [sgn (x_{i - j} - y_{i}) + 2 r - 1] + 1 - 2 r = 0

(5)

Thus, the output of ROBF is expressed through the implicit function F(m^(j), x_i-j, y_i, r). As will be shown later, this implicit function is used to take derivatives of y_i with respect to m^(j) and r in order to develop iterative formulae for the parameter updates.

In [8], an iterative algorithm similar to the LMS (least mean squares) algorithm was suggested to update m^(j) and r in the case of MSE optimization:

m^{(next, j)} = m^{(j)} + 2 α (d_{i} - y_{i}) \frac{\partial y_{i}}{\partial m^{(j)}} \forall j \in A

(6)

r^{(next)} = r + 2 β (d_{i} - y_{i}) \frac{\partial y_{i}}{\partial r}

(7)

where α and β are two predefined parameters controlling the convergence rates. The derivatives of y_i with respect to m^(j) and r are calculated using the implicit function (5). To obtain the expressions of $\frac{\partial y_{i}}{\partial m^{(j)}}$ and $\frac{\partial y_{i}}{\partial r}$, the total derivative of F with respect to m^(j) is first expressed as

\frac{d F}{d m^{(j)}} = \frac{\partial F}{\partial m^{(j)}} + (\frac{\partial F}{\partial y_{i}}) (\frac{\partial y_{i}}{\partial m^{(j)}}) = 0

(8)

That is

\frac{\partial y_{i}}{\partial m^{(j)}} = - \frac{\frac{\partial F}{\partial m^{(j)}}}{\frac{\partial F}{\partial y_{i}}}

(9)

Using (5) to take the derivative of F with respect to m^(j) gives

\begin{gathered} \frac{\partial F}{\partial m^{(j)}} = \frac{\partial sgn (m^{(j)})}{2 \partial m^{(j)}} [sgn (x_{i - j} - y_{i}) + 2 r - 1] \\ = δ (m^{(j)}) [sgn (x_{i - j} - y_{i}) + 2 r - 1] \end{gathered}

(10)

$\frac{\partial F}{\partial y_{i}}$ is also calculated by using (5):

\frac{\partial F}{\partial y_{i}} = - \sum_{j \in A} (sgn (m^{(j)}) + 1) δ (x_{i - j} - y_{i})

(11)

In (11), the term δ(x_i-j - y_i) is equal to 1 only if j equals j₀, i.e., the time shift whose corresponding x_i-j₀ equals the output y_i. Since this indicates j₀ ∈ M and sgn(m^(j₀)) = 1, (11) simplifies to

\frac{\partial F}{\partial y_{i}} = - 2

(12)

Combining (10) and (12), (9) is written as

\frac{\partial y_{i}}{\partial m^{(j)}} = \frac{1}{2} δ (m^{(j)}) [sgn (x_{i - j} - y_{i}) + 2 r - 1]

(13)

For simplification, δ(m^(j)) is replaced by δ'(m^(j)) = 1 for -1 ≤ m^(j) ≤ 1. Based on (13), (6) is then converted to

m^{(next, j)} = m^{(j)} + α (d_{i} - y_{i}) [sgn (x_{i - j} - y_{i}) + 2 r - 1]

(14)

Similar to the derivation of (9) and (13), we also have

\frac{\partial y_{i}}{\partial r} = - \frac{\frac{\partial F}{\partial r}}{\frac{\partial F}{\partial y_{i}}}

(15)

\frac{\partial F}{\partial r} = 2 [\frac{1}{2} \sum_{j \in A} (sgn (m^{(j)}) + 1) - 1] = 2 (N - 1)

(16)

Based on (12) and (16), (15) is written as

\frac{\partial y_{i}}{\partial r} = N - 1

(17)

Combined with (17), (7) is converted to

r^{(next)} = r + 2 β (d_{i} - y_{i}) (N - 1)

(18)

where N = |M| is the current length of filter mask in use.

Combining (1), (14), and (18), the parameter updating algorithm for the one-dimensional adaptive rank-order morphological filter is given as (19), where itN denotes the current iteration and itN + 1 the next. Note that the filter mask M and rank parameter r are updated separately for each sample i rather than being shared by all samples.

\begin{array}{l} y_{i}^{(itN)} = Rank_{(N_{i}^{(itN)} - 1) r_{i}^{(itN)} + 1} \{x_{i - j} \mid j \in M_{i}^{(itN)}\}, \quad | M_{i}^{(itN)} | = N_{i}^{(itN)} \\ \begin{cases} m_{i}^{(itN + 1), j} = m_{i}^{(itN), j} + α (d_{i} - y_{i}^{(itN)}) [sgn (x_{i - j} - y_{i}^{(itN)}) + 2 r_{i}^{(itN)} - 1], \forall j \in M_{i}^{(itN)} \\ M_{i}^{(itN + 1)} = \{j \mid \forall j \in M_{i}^{(itN)}, m_{i}^{(itN + 1), j} > thm_M\} \\ r_{i}^{(itN + 1)} = r_{i}^{(itN)} + 2 β (d_{i} - y_{i}^{(itN)}) (N_{i}^{(itN)} - 1) \end{cases} \end{array}

(19)
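One update of (19) for a single sample i can be sketched in Python as follows. The function name `armf_step` and its argument layout are hypothetical. Three choices are assumptions that the text does not state: out-of-range indices are clamped at the signal ends, r is clipped to [0, 1] so that the rank stays valid, and a mask is never allowed to become empty.

```python
import numpy as np

def armf_step(x, i, d_i, offsets, m, r, alpha, beta, thm=0.0):
    """One iteration of (19) for sample i.

    offsets : current mask M_i (integer shifts j)
    m       : continuous values m_i^{(j)} for those shifts
    Returns (y_i, new_offsets, new_m, new_r).
    """
    x = np.asarray(x, dtype=float)
    offsets = np.asarray(offsets, dtype=int)
    m = np.asarray(m, dtype=float)
    N = len(offsets)
    # Assumption: indices outside the signal are clamped to its ends.
    window = x[np.clip(i - offsets, 0, len(x) - 1)]
    # Rank-order output: n is the nearest integer to (N - 1) r + 1.
    n = int(np.floor((N - 1) * r + 0.5)) + 1
    y_i = np.sort(window)[n - 1]
    err = d_i - y_i
    # Mask value update, first line of the cases in (19).
    new_m = m + alpha * err * (np.sign(window - y_i) + 2 * r - 1)
    # Rank update, last line of the cases in (19).
    new_r = r + 2 * beta * err * (N - 1)
    # Assumption: keep r inside its defined range [0, 1].
    new_r = float(np.clip(new_r, 0.0, 1.0))
    # New mask: shifts whose value exceeds the threshold thm_M.
    keep = new_m > thm
    # Assumption: if every shift would be dropped, keep the old mask.
    if not keep.any():
        keep = np.ones_like(keep)
    return y_i, offsets[keep], new_m[keep], new_r

x = [0, 1, 2, 3, 4]
y_i, M1, m1, r1 = armf_step(x, i=2, d_i=3.0, offsets=[-1, 0, 1],
                            m=[0.5, 0.5, 0.5], r=0.5, alpha=0.1, beta=0.05)
print(y_i, M1, m1, r1)
```

In this small example the desired value lies above the median output, so r increases (the rank moves toward the maximum) and the value attached to the shift holding the larger sample grows, while the value attached to the smaller sample shrinks.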

To illustrate the performance of 1DARMF given by (19), an example is shown in Figure 2. Figure 2a depicts three signals: the noisy signal x (dash-dot line) as the input signal, the desired signal d (solid line) as the supervisory signal, and the output signal y (dotted line) as the recovered signal. Here x = s + n, where s is the useful signal contaminated by Gaussian noise n, and the signal-to-noise ratio SNR_x is set to 2. In this example, s = sin(t) and d is chosen equal to s in order to recover the useful signal. The initial parameters of 1DARMF in (19) are set as follows: initial 1D filter mask M⁽⁰⁾ = [-5,-4,-3,-2,-1,0,1,2,3,4,5], initial assigned value for each element in the mask m^(0,j) = 0.5 (∀j ∈ M), initial rank parameter r⁽⁰⁾ = 0, thm_M = 0, maximum number of iterations iterationNUM = 300, and convergence rates α = 1 × 10^-4 and β = 1.5 × 10^-3.

If we define the sum of squared errors between y and d as the measure of signal recovering ability, the expression is

e^{(itN)} = \sum_{i} | y_{i}^{(itN)} - d_{i} |^{2}

(20)

where i indexes the i th sample of the signal and itN denotes the current iteration. Figure 2b shows that e^(itN) converges to a steady state and oscillates in a stable manner as itN increases.
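The experiment can be approximated with the following self-contained sketch, which iterates (19) over all samples and records e^(itN) from (20). It is not the authors' code. The time axis (200 samples on [0, 4π]), the reading of SNR_x = 2 as a power ratio, the random seed, the clamping at the signal ends, and the clipping of r to [0, 1] are all assumptions, so the numbers will not reproduce Figure 2 exactly.

```python
import numpy as np

def train_1darmf(x, d, offsets0, m0, r0, alpha, beta, n_iter, thm=0.0):
    """Iterate (19) over all samples; return last output and e^(itN) of (20)."""
    x, d = np.asarray(x, dtype=float), np.asarray(d, dtype=float)
    L = len(x)
    # Each sample i keeps its own mask, mask values and rank parameter.
    masks = [np.array(offsets0, dtype=int) for _ in range(L)]
    ms = [np.full(len(offsets0), float(m0)) for _ in range(L)]
    rs = np.full(L, float(r0))
    y = np.empty(L)
    errors = []
    for _ in range(n_iter):
        for i in range(L):
            M, m, r = masks[i], ms[i], rs[i]
            N = len(M)
            # Assumption: indices outside the signal are clamped to its ends.
            window = x[np.clip(i - M, 0, L - 1)]
            n = int(np.floor((N - 1) * r + 0.5)) + 1
            y[i] = np.sort(window)[n - 1]
            err = d[i] - y[i]
            m = m + alpha * err * (np.sign(window - y[i]) + 2 * r - 1)
            # Assumption: r is clipped to [0, 1] so the rank stays valid.
            rs[i] = min(1.0, max(0.0, r + 2 * beta * err * (N - 1)))
            keep = m > thm
            # Assumption: an empty mask is never accepted.
            if not keep.any():
                keep[:] = True
            masks[i], ms[i] = M[keep], m[keep]
        errors.append(float(np.sum((y - d) ** 2)))  # e^(itN), eq. (20)
    return y, np.array(errors)

rng = np.random.default_rng(0)
t = np.linspace(0.0, 4.0 * np.pi, 200)        # assumed time axis
s = np.sin(t)
noise_std = np.sqrt(np.mean(s ** 2) / 2.0)    # assumed: SNR = 2 as power ratio
x = s + rng.normal(0.0, noise_std, size=t.shape)

# Parameter values taken from the text of the example.
y, e = train_1darmf(x, s, np.arange(-5, 6), m0=0.5, r0=0.0,
                    alpha=1e-4, beta=1.5e-3, n_iter=300)
print(e[0], e.min())
```

Starting from r⁽⁰⁾ = 0 the filter initially returns the windowed minimum, so the first error is large; as the per-sample rank parameters adapt, the error drops well below its starting value.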

2.2. Pattern classification using 1DARMF (PC1DARMF)

In Section 2.1, the general procedure for implementing 1DARMF needs the desired signal d as a supervisory signal to train the key parameters of the filter and obtain the desired output. However, for a given input x, if d is chosen differently, the iterative training process leads to a different output y. This means that under the supervision of an inappropriate or undesirable d, the output may fail to recover the useful signal from the original input x. Figure 3 compares the performance of 1DARMF under different supervisory signals to illustrate this phenomenon. With the input x and the initial parameters set as in Section 2.1, different d result in different y, as shown in Figure 3a, c, e, g, i. Figure 3b, d, f, h, j show that the corresponding e^(itN) gradually reaches a stable oscillation as the iterations increase. The most distinct common feature is that every e^(itN) eventually reaches a steady state after enough iterations. This behavior has theoretical support: Feuer and Weinstein [10] concluded that keeping the convergence rate below an upper limit is the necessary and sufficient condition for the convergence of the LMS algorithm. Therefore, with a proper selection of α in (6) and β in (7), e^(itN) is also expected to oscillate stably in the end. The selection rule is summarized in Section 2.3. This condition is the crucial prerequisite for forming our pattern classification algorithm. Table 1 also lists min(e^(itN)) to compare numerically the effect of different d on signal recovery.

Table 1. min(e^(itN)) obtained using different supervisory signals d (s = sin(t))
