A Failure Probability Calculation Method for Power

energies
Article
A Failure Probability Calculation Method for Power
Equipment Based on Multi-Characteristic Parameters
Hang Liu 1, *, Youyuan Wang 1, *, Yi Yang 2 , Ruijin Liao 1 , Yujie Geng 2 and Liwei Zhou 1
1
2
*
State Key Laboratory of Power Transmission Equipment & System Security and New Technology,
School of Electrical Engineering, Chongqing University, Chongqing 400044, China; [email protected] (R.L.);
[email protected] (L.Z.)
Shandong Electric Power Research Institute, Shandong Electric Power Company, Jinan 250002, China;
[email protected] (Y.Y.); [email protected] (Y.G.)
Correspondence: [email protected] (H.L.); [email protected] (Y.W.); Tel.: +86-23-6511-1795 (Y.W.)
Academic Editor: Issouf Fofana
Received: 30 March 2017; Accepted: 11 May 2017; Published: 17 May 2017
Abstract: Although traditional fault diagnosis methods can qualitatively identify the failure modes
for power equipment, it is difficult to evaluate the failure probability quantitatively. In this paper, a
failure probability calculation method for power equipment based on multi-characteristic parameters
is proposed. After collecting the historical data of different fault characteristic parameters, the
distribution functions and the cumulative distribution functions of each parameter, which are applied
to dispersing the parameters and calculating the differential warning values, are calculated by using
the two-parameter Weibull model. To calculate the membership functions of parameters for each
failure mode, the Apriori algorithm is chosen to mine the association rules between parameters and
failure modes. After that, the failure probability of each failure mode is obtained by integrating the
membership functions of different parameters by a weighted method, and the important weight of
each parameter is calculated by the differential warning values. According to the failure probability
calculation result, the series model is established to estimate the failure probability of the equipment.
Finally, an application example for two 220 kV transformers is presented to show the detailed process
of the method. Compared with traditional fault diagnosis methods, the calculation results not only
identify the failure modes correctly, but also reflect the failure probability changing trend of the
equipment accurately.
Keywords: failure probability; multi-characteristic parameters; the Weibull model; differential
warning value; association rule; failure modes
1. Introduction
With the development of smart grid, the Chinese power industry is entering the era of Big
Data [1–3]. Establishing the fault diagnosis models based on Big Data technology is of great importance
since it can realize the comprehensive utilization of multi-source data and improve the accuracy of
the fault diagnosis results, which is beneficial for ensuring reliability and safe operation, optimizing
the condition-based maintenance (CBM) strategies, and increasing the remaining useful life (RUL) of
power equipment.
With the development of online monitoring technology and information technology, the volume
and variety of the condition monitoring data of the fault characteristic parameters have increased
dramatically in recent years [4]. The fault diagnosis technology has become more automatic and
intelligent than before by using multi-source data. For example, evidence fusion theory [5,6], support
vector machine (SVM) [7], artificial neural network (ANN) [8–10], and other methods have been widely
Energies 2017, 10, 704; doi:10.3390/en10050704
www.mdpi.com/journal/energies
Energies 2017, 10, 704
2 of 15
applied to fault diagnosis for power equipment. Though many research achievements have been
reached, there are still some shortcomings, which are summarized as follows:
(1)
(2)
(3)
Although the volume and variety of the monitoring data are large enough, most of the data
changes in the normal range. That is to say, there are little fault data or abnormal data. However,
many existing diagnosis methods often require a large amount of fault data to build and optimize
the models. The low value density of the condition monitoring data limits the application of
those methods [11–13].
Most existing methods have failed to reveal the association between the failure modes and various
characteristic parameters. The equipment failure modes are difficult to identify correctly when
some kinds of parameters are changed. Additionally, the values of some key parameters and
factors in the models are difficult to estimate. Thus, they are always determined by empirical
values or expert experience, which may cause large errors.
The insulation performance of power equipment is degraded gradually by the long-term impact
of mechanical and electrical stress, which causes an increasing trend in the failure probability of
the equipment. Many existing fault diagnosis methods can only qualitatively identify the failure
modes of the equipment under abnormal conditions, but not quantitatively evaluate the failure
probability and the changing trends of normal equipment. In fact, the equipment under normal
conditions still has a risk of failure.
With the increasing reliability requirements of the power supply, the traditional fault diagnosis
methods cannot meet the requirements of the fault diagnosis of the power system. How to make
full use of the multi-characteristic parameters to improve the accuracy and the effectiveness when
establishing the model becomes a significant challenge. This paper presents a failure probability
calculation method for power equipment based on multi-characteristic parameters, which achieves the
accurate identification of fault modes and quantifies the change trend of the failure probability of the
power equipment.
This manuscript is organized as follows: In Section 2, the probability density functions and the
cumulative distribution functions of different fault characteristic parameters are obtained, and the steps
to disperse the parameters and calculate the differential warning values are presented. In Section 3,
the method for association rules mining and the membership function calculations for parameters are
given. In Section 4, the model to estimate the failure probability of the failure modes, as well as the
equipment’s failure probability, are introduced. In Section 5, the procedure of the method is listed in
detail. An application example for two 220 kV power transformers is presented in Section 6, and the
conclusions are drawn in Section 7.
2. The Discrete Intervals and the Differential Warning Values Calculation
2.1. The Weibull Model
The probability density function and the cumulative probability distribution function of the
two-parameter Weibull distribution are shown in Equations (1) and (2), respectively [14].
β x β −1
x β
f (x) =
exp −
(1)
α α
α
x β
(2)
F = 1 − exp −
α
where x is the value of a kind of fault characteristic parameter; α and β are the scale and shape
parameters. When these two parameters are determined, the Weibull model is uniquely determined. α
and β can be estimated by using the maximum likelihood estimation method [15], and the steps are
as follows:
Energies 2017, 10, 704
3 of 15
(1) The
parameter
Energies
2017,
10, 704
time series X = [x1, x2, …, xn] are Substituted into the logarithmic likelihood
3 of 15
function equation shown in Equation (3). The likelihood functions are established in Equation (4):
(1)
β −1
  x  β into

The parameter time series X = [x1 , x2 , . .m. ,βxn ]xiare
the logarithmic likelihood
 Substituted
ln L ( X ) = ln ∏   exp  −  i  
(3)
function equation shown in Equation (3). iThe
likelihood
functions
are
established in Equation (4):
=1 α  α 
  α  
m
xi) β−1
x β
β
∂
X
ln

(
exp − i
(3)
ln L( X ) = ln ∏
=
0

α α
α
i =1
 ∂α
(
( (XX))
∂ ln
 ∂ ln
=
=00
 ∂∂lnβ∂α(X )
=0
∂β
(4)
(4)
(2) Substitute Equation (3) into Equation (4), then α and β are calculated.
(2) Substitute Equation (3) into Equation (4), then α and β are calculated.
Taking the dissolved gas in transformer oil as an example, the dissolved gas content data in
Taking
dissolved
gas in220
transformer
oil as an
dissolved
gas
content
data in
recent three the
years
of different
kV transformers
in example,
a region the
were
collected.
The
gas includes
recent
three
years
of
different
220
kV
transformers
in
a
region
were
collected.
The
gas
includes
hydrogen, methane, ethane, ethylene, and acetylene, whose chemical formulae are H2, CH4, C2H4,
hydrogen,
ethane, ethylene,
and to
acetylene,
whose
chemical formulae
are H2 , CH
4 , C2 H
4,
C2H6, and methane,
C2H2, respectively.
According
the steps,
the parameters
of the Weibull
model
are
C
H
,
and
C
H
,
respectively.
According
to
the
steps,
the
parameters
of
the
Weibull
model
are
2
6
2
2
estimated and shown in Table 1, and the probability density function curves and frequency
estimated
shown
Tableof1,gas
andare
theshown
probability
density
function curves and frequency histograms
histogramsand
of the
fiveintypes
in Figure
1, respectively.
of the five types of gas are shown in Figure 1, respectively.
Table 1. The parameters calculation results of the Weibull model.
Table 1. The parameters calculation results of the Weibull model.
Parameters
CH4 C2H6 C2H4 C2H2
H2
Parametersα
H2 17.03 CH
4
10.26
C2.94
2 H6
C2 H41.46 C2 H2
1.68
β
17.030.81 10.26
1.06
0.81
1.06
2.94
0.79
0.79
0.811.68 0.89 1.46
0.81
0.89
α
β
(a) The content of H2 (uL/L)
(b) The content of CH4 (uL/L)
0.3
0.35
0.3
0.25
0.25
0.2
0.2
0.15
0.15
0.1
0.1
0.05
0.05
0
0
5
10
15
20
25
30
0
(c) The content of C2H6 (uL/L)
0
5
10
15
20
25
(d) The content of C2H4 (uL/L)
Figure 1.
1. Cont.
Figure
Cont.
30
Energies
Energies 2017,
2017, 10,
10, 704
704
Energies 2017, 10, 704
44 of
of 15
15
4 of 15
0.3
0.3
0.25
0.25
0.2
0.2
0.15
0.15
0.1
0.1
0.05
0.05
0
0
0
5
10
15
20
25
30
0
5
10
15
20
25
30
(e) The content of C2H2 (uL/L)
(e) The content of C2H2 (uL/L)
Figure 1. The probability density function curves and frequency histograms of the dissolved gas.
Figure 1.
1. The probability density function curves and frequency histograms of the dissolved gas.
Figure
From Figure 1, the probability density functions are consistent with the actual distribution of
From
Figureindicates
theprobability
probability
density
consistent
the actual
distribution
of
Figure
1,1,the
density
functions
areare
consistent
withwith
the actual
distribution
of each
each From
gas, which
the effectiveness
offunctions
the Weibull
model.
eachwhich
gas, which
indicates
the effectiveness
the Weibull
model.
gas,
indicates
the effectiveness
of theofWeibull
model.
2.2. The Discrete Intervals Calculation
2.2. The Discrete Intervals Calculation
The fault characteristic parameters value is continuous, but the association rule mining
fault
characteristic
parameters
value
is continuous,
the association
rule
mining
The
fault
characteristic
value
isparameters
continuous,
but the but
association
ruletomining
algorithms
algorithms
require
discreteparameters
input.
Thus,
the
should
be
discretized
different
classes
algorithms
require
discrete
input.
Thus,
the
parameters
should
be
discretized
to
different
classes
require
discrete
input.[16].
Thus,
thepaper
parameters
should
be discretized
classes
before
data
before data
mining
This
presents
a discrete
method to
fordifferent
parameters
based
on
the
before
data
mining
[16].
This
paper
presents
a
discrete
method
for
parameters
based
on
the
mining
[16].
This
paper
presents
a
discrete
method
for
parameters
based
on
the
distribution
probability
distribution probability of the equipment condition. The steps are as follows:
distribution
probability
of the
equipment
of
the equipment
condition.
The
steps are condition.
as follows:The steps are as follows:
(1) According to [17], the equipment condition is divided into normal condition, attentive condition,
(1) According
condition
isisdivided
into
attentive
(1)
Accordingto
to[17],
[17],the
theequipment
equipment
condition
divided
intonormal
normalcondition,
condition,
attentive
condition,
abnormal
condition,
and
serious
condition.
The distribution
probability
of thecondition,
different
abnormal
condition,
and
serious
condition.
The
distribution
probability
of
the
different
abnormal
condition,
and
serious
condition.
The
distribution
probability
of
the
different
conditions
P
=
p
p
p
p
[ norm atten abni ser ] are calculated after collecting the actual condition
conditions
h
P =p[ atten
pnorm ppabn
p
p calculated
] are calculated
conditions
after collecting
actual data
condition
P
=
after collecting
the actualthe
condition
of all
atten
p
norm
serabn areser
data of all of the same
type
ofpequipment.
data
of same
all of type
the same
type of equipment. F = [ F F F
of
the
of
equipment.
(2) The cumulative distribution probability
hI
II
III 1] isiobtained by accumulating the
F
=
F
F
F
[
by accumulating
the
(2) The
cumulative
distribution
probability
The cumulative
distribution
probability F = I FI IIFII IIIFIII 1] 1is obtained
is obtained
by accumulating
distribution
probability
P.
P. the inverse cumulative probability distribution function based on
probability
P.into
(3) distribution
Fthe
I, Fdistribution
II, and FIII
areprobability
brought
IIII
, ,and
into the
cumulative
probability
distribution
function
based
(3) F
FII,,FFtwo-parameter
andFFIIIIIIare
arebrought
brought
theinverse
inverse
cumulative
probability
distribution
xII xIIIoni]
the
Weibullinto
model
expressed
in Equation
(5). Three
values Xfunction
ph= [ xI
X
=
x
x
xIII ]
[
the
Weibullmodel
modelexpressed
expressedininEquation
Equation
(5).
Three
values
the two-parameter
Weibull
(5).
Three
values
X p =p 2.xI I xII II xIII
are
calculated and four
discrete intervals
s1–s4 are obtained,
as
shown
in Figure
are calculated
calculated and
and four
four discrete
discrete intervals s11–s
–s44are
areobtained,
obtained,as
asshown
shownin
inFigure
Figure2.
2.
are
1/ β
x = α − ln (1 − F ) 1/ β
(5)
xx ==αα[−
− lnln(1(1−−F F) )]1/β
(5)
Figure 2. The discrete intervals of parameters.
Figure 2. The discrete intervals of parameters.
The distribution
distribution probability
probability and
the cumulative
cumulative probability
probability of
of the
the 220
220 kV
kV transformers
transformers are
are
The
and the
The
distribution
probability
and
the
cumulative
probability
of
the
220
kV
transformers
are
collected and
andlisted
listedin
inTable
Table2,2,and
andthe
thecalculation
calculation
result
discrete
intervals
of the
types
of
collected
result
of of
thethe
discrete
intervals
of the
fivefive
types
of the
collected
and
listedininTable
Table3.2, and the calculation result of the discrete intervals of the five types of
the
gas
are
shown
gas are shown in Table 3.
the gas are shown in Table 3.
Energies 2017, 10, 704
5 of 15
Table 2. The data of 220kV transformers.
Equipment Condition
Normal
Condition
Attentive
Condition
Abnormal
Condition
Serious
Condition
Distribution probability
Cumulative probability
95.46%
95.46%
3.53%
98.99%
0.94%
99.93%
0.07%
1
Table 3. The discrete intervals of the gas (uL/L).
Gas
xI
xII
xIII
H2
CH4
C2 H6
C2 H4
C2 H2
82.44
29.75
6.79
5.19
12.21
139.89
43.22
11.08
8.10
20.12
257.84
66.57
19.54
13.55
35.84
2.3. The Differential Warning Values Calculation
The abnormal increasing of fault characteristic parameters, such as the dissolved gas in oil, will
increase the failure probability of the equipment. To ensure the safe and reliable operation of the power
equipment, the warning value of each characteristic parameter is given by standard or by guide. If the
parameter value exceeds the warning value, it indicates that the equipment is under poor condition
or has a fault. For example, the warning value of hydrogen content is 150 uL/L, and the acetylene
content is 5 uL/L for the 220 kV power transformers [17].
However, due to the operating environment, the service age and health status of each equipment
are different, and the actual warning value of the equipment may deviate from the warning value
in [17]. According to the method in [18], this paper presents a differential warning values calculation
method for the equipment. The steps are as follows:
(1)
(2)
Count the average failure probability Pave of the same types of equipment in a region.
The probability Pave is substituted into the inverse cumulative probability distribution functions
with different characteristic parameters, as shown in Equation (5), and the differential warning
value Vdi f of each parameter is obtained.
Table 4 is the differential warning value of the dissolved gas when the average failure probability
of the 220 kV transformers is 0.18.
Table 4. The differential warning value of the gas (uL/L).
Gas
Differential Warning Value
H2
CH4
C2 H6
C2 H4
C2 H2
37.52
17.06
3.27
2.67
5.80
3. Membership Function of the Fault Characteristic Parameter
3.1. The Association Rules Mining for Failure Modes and the Fault Characteristic Parameters
Different failure modes will have different effects on the change trend of the fault characteristic
parameters, which are quantified by association rule mining algorithms, such as Apriori and FP-Growth
algorithms [19,20]. The Apriori algorithm scans the whole database in each loop to calculate the
support and confidence coefficient of the candidate item sets. The association rule mining by the
Apriori algorithm includes the following steps.
Energies 2017, 10, 704
(1)
(2)
(3)
(4)
6 of 15
The fault data, which have m kinds of fault characteristic parameters and g kinds of failure modes,
are collected. All of the parameters are discretized in four classes.
The dataset of fault data is established and denoted as Df = [G, df1 , df2 , ···, df m ], where G is the
transaction of the failure modes, G = [G(1), . . . , G(g)]. The dfi = [dfi (1) dfi (2), dfi (3), dfi (4)] is the
transaction of the fault characteristic parameter i (i = 1, . . . , m).
The Df is divided into m datasets according to each parameter, namely Df1 = [G, df1 ], Df2 = [G, df2 ],
. . . , Dfm = [G, dfm ].
Set the minimum support (Supmin) the minimum confidence coefficient (Confmin), the Apriori
algorithm is applied to finding out all of the frequent two-item sets G → d f , the support
coefficient Sup G → d f , and the confidence coefficient Con f G → d f based on Equations (6)
and (7):
count G → d f
Sup G → d f =
(6)
count D f
count G → d f
Con f G → d f =
(7)
count D f
where Count(·) is the counting function of the item set.
Support is used to measure the statistical importance of association rules in the
entire dataset.
The greater the support, the more frequent the item sets or rules appear in Df . If Sup G → d f is no
less than the Supmin, G → d f is regarded as a frequent item set. Otherwise, G → d f is regarded as an
infrequent item set.
Confidence is used tomeasurethe trustworthiness ofassociation
rules. If an association rule
G → d f satisfies both Sup G → d f ≥ Supmin and Con f G → d f ≥ Con f min, G → d f will be
regarded as a strong association rule. Otherwise, it is a weak association rule.
3.2. Membership Function Calculation Method
According to association rules between the failure modes and the parameters, the distribution
probability, that is, the confidence coefficient between each item set is obtained. For any kind of failure
mode j and characteristic parameter i, Equation (8) is satisfied:
4
∑
Con f G ( j) → d f i (k) = 1
(8)
k =1
The membership function of the characteristic parameter i for the failure mode j is expressed in
Equation (9):

Cij (1)

x ∈ s1
 xI · x



C
2
(
)
C (1) + ij

x ∈ s2

xII − xI · ( x − xI )
 ij
Cij (3)
Fij = Cij (1) + Cij (2) +
(9)
x ∈ s3
xIII − xII · ( x − xII )



C
4
(
)
ij


Cij (1) + Cij (2) + Cij (3) + xM − xIII · ( x − xIII ) x ∈ s4



1
x > xM
where Cij (sk ) = con f G ( j) → d f i (sk ) , x is the value of characteristic parameters. xM is the value
under the condition that the cumulative distribution probability is (FIII + 1)/2.
Energies 2017, 10, 704
7 of 15
4. The Failure Probability of Failure Modes and the Equipment
4.1. The Failure Probability of the Failure Mode
According to the method referred to in Section 3, the membership function of different failure
modes resulting from different fault characteristic parameters can be figured out. Those different
membership functions have different contribution to the failure probability of the failure modes.
Therefore, the failure probability of each failure mode is calculated by a weighted method. For example,
the failure probability of the jth failure mode is calculated by Equation (10):
m
PG( j) = F · W =
∑ Fij · ωij
(10)
i =1
m
∑ ωij = 1
(11)
i =1
where Fij and ωij are the membership function and important weight from the ith characteristic
parameter to the jth failure mode, respectively. The important weight ωij of each fault characteristic
parameter for different failure modes is calculated in Section 4.2.
4.2. The Calculation for Important Weight
The existing weight calculation methods, such as analytic hierarchy process (AHP) [21], may lack
accuracy because of the dependence of subjective judgments from experts. To ensure the rationality of
the result, a method for weight calculation is presented according to the differential warning values of
the characteristic parameters, which is shown as follows:
(1)
(2)
Obtain the differential warning values of the different characteristic parameters.
Count the number (N) of samples in which the characteristic parameters are greater than, or equal
to, their differential warning values shown in Equation (12):



N=

N11
N21
···
Nm1
N12
N22
···
Nm2
···
···
···
···
N1m
N2m
···
Nmg





(12)
For every Nij , it is the number of samples of the ith characteristic parameter to the jth failure mode.
The important weights of all of the fault characteristic parameters for the jth failure mode are obtained
by the normalized calculation expressed in Equation (13):
ωij =
Nij
m
(13)
∑ Nij
i =1
According to Equations (12) and (13), the result of the importance weight W is obtained based
on the calculation of the fault data and the differential warning value rather than the experience of
experts so that the accuracy and rationality are ensured.
4.3. The Failure Probability of the Equipment
Every type of the failure mode will cause equipment failure, and the series network is
established between equipment failure probability and the failure probability of the failure modes [14].
The calculation method of the equipment failure probability is given by Equation (14):
Energies 2017, 10, 704
8 of 15
g
Pequ = 1 −
∑
1 − Pj
(14)
j =1
where Pequ is the equipment failure probability, and Pj is the failure probability of the jth failure mode.
Energies 2017, 10, 704
8 of 15
5. The Procedure of the Method
5. The
Procedure
of method
the Method
The
flow of the
is shown in Figure 3. The key steps are as follows:
The flow of the method is shown in Figure 3. The key steps are as follows:
(1) According to the method in Section 2, the probability density functions and cumulative
distributiontofunctions
of different
fault2,characteristic
parameters
on aand
two-parameter
(1) According
the method
in Section
the probability
density based
functions
cumulative
Weibull distribution
model
are calculated.
Based on the inverse
cumulative
function,
distribution
functions
of different
fault characteristic
parameters
based distribution
on a two-parameter
the condition
distribution
and the
average
probability
of thedistribution
equipment
Weibull
distribution
modelprobability
are calculated.
Based
on failure
the inverse
cumulative
are applied
calculating
the discrete
intervals and the differential
warning
value ofofeach
function,
thetocondition
distribution
probability
average failure
probability
the
characteristic
equipment
areparameter.
applied to calculating the discrete intervals and the differential warning value of
characteristic
parameter.
(2) each
According
to the method
in Section 3, the Apriori algorithm is applied to finding the association
(2) According
the method
in Section
3, the
Apriori
is applied
to findingand
the calculating
association
rules in thetofault
data between
failure
modes,
asalgorithm
well as different
parameters
rules
the fault data
betweenoffailure
modes, item
as well
as different
parameters
and calculating
of
of theinconfidence
coefficient
the frequent
sets.
After that,
the cumulative
probability
the
confidence
coefficient
ofcharacteristic
the frequentparameter
item sets.is After
that, the cumulative probability
distribution
function
of each
obtained.
function
of each
characteristic
parameter
obtained.
(3) distribution
The membership
function
and
the important
weight ofiseach
parameter are obtained in Section 4.
(3) The
function
weight
ofcalculated
each parameter
are obtained
in Sectionand
4.
(4)
Themembership
failure probability
of and
all ofthe
theimportant
failure modes
are
by a weighted
calculation,
(4) The
failure
probability
of
all
of
the
failure
modes
are
calculated
by
a
weighted
calculation,
and
the series model is proposed to calculate the failure probability of the equipment.
the series model is proposed to calculate the failure probability of the equipment.
1 Historical data of the fault characteristic parameters
Two-parameter Weibull function
Discrete intervals and the differential warning value
2 Fault data
The important weight of parameters;
Association rule mining by Apriori Algorism;
The membership function Fij
3 The monitoring data of the characteristic parameters
The failure probability of the failure modes ;
The failure probability of the equipment
Figure 3. The flow of the failure probability calculation.
Figure 3. The flow of the failure probability calculation.
6. Application Example
6. Application Example
6.1. Basic
6.1.
Basic Information
Information
According to
to the
thefault
faultdata
datasamples
samplesofofthe
the220
220
transformers,
four
types
of the
failure
modes
According
kVkV
transformers,
four
types
of the
failure
modes
are
are
defined,
namely,
low-temperature
overheating
(G
1), high-temperature overheating (G2), lowdefined, namely, low-temperature overheating (G1 ), high-temperature overheating (G2 ), low-energy
energy discharging (G3), and high-energy discharging (G4). The quantitative distribution of each
failure mode is shown in Figure 4.
Energies 2017, 10, 704
9 of 15
discharging (G3 ), and high-energy discharging (G4 ). The quantitative distribution of each failure mode
is shown
in10,
Figure
Energies
2017,
704 4.
9 of 15
Energies 2017, 10, 704
9 of 15
0.2
0.2
0.11
0.11
0.21
0.21
0.48
0.48
G1
G1
G2
G2
Figure4.
4.The
Thequantitative
quantitativedistribution
distributionof
ofeach
eachfailure
failure mode.
mode.
Figure
Figure 4. The quantitative distribution of each failure mode.
6.2.
6.2. The
TheFailure
Failure Probability
Probability and
and Important
Important Weight
Weight
6.2. The Failure Probability and Important Weight
After
After setting
setting the
the minimum
minimum support
support Supmin
Supmin ==0.02
0.02and
andthe
theminimum
minimumconfidence
confidenceConfmin
Confmin==0.02,
0.02,
After setting the minimum support Supmin = 0.02 and the minimum confidence Confmin = 0.02,
the
association
rules
among
all
of
the
failure
modes
and
the
different
ways
by
which
the
gas
dissolves
the association rules among all of the failure modes and the different ways by which the gas dissolves
the association rules among all of the failure modes and the different ways by which the gas dissolves
are
are calculated
calculated by
by Apriori
Apriori algorithm.
algorithm. Taking
Taking methane
methane and
and ethane
ethane as
as examples,
examples, the
the frequent
frequent item
item sets
sets
are calculated by Apriori algorithm. Taking methane and ethane as examples, the frequent item sets
and
the
confidence
results
are
shown
in
Figure
5a,b.
and the confidence results are
and the confidence results are shown in Figure 5a,b.
Confidence
Confidence
0.8
0.8
0.7
0.7
0.6
0.6
0.5
0.5
0.4
0.4
0.3
0.3
0.2
0.2
0.1
0.1
0
0
G1 →s1
G1→s1
→s1
G3
G3 →s1
0.54
0.54
0.41
0.41
0.08
0.08
G1 →s2
G1→s2
→s2
G3
G3 →s2
0.69
0.69
0.44
0.44
0.25
0.25 0.21
0.21
0.17
0.08 0.09 0.17 0.09
0.09
0.08 0.09
G1 →s3
G1→s3
→s3
G3
G3 →s3
Frequent item sets
Frequent item sets
G1 →s4
G1→s4
→s4
G3
G3 →s4
G2 →s1
G2→s1
→s1
G4
G4 →s1
0.31
0.31 0.25
0.25 0.17
0.14
0.17 0.14
0.08
0.08
G2 →s2
G2→s2
→s2
G4
G4 →s2
G2 →s3
G2→s3
→s3
G4
G4 →s3
G2 →s4
G2→s4
→s4
G4
G4 →s4
(a) The association rule between failure modes and methane.
(a) The association rule between failure modes and methane.
Confidence
Confidence
0.9
0.9
0.8
0.8
0.7
0.7
0.6
0.6
0.5
0.5
0.4
0.4
0.3
0.3
0.2
0.2
0.1
0.1
0
0
G1 →s1
G1→s1
→s1
G3
G3 →s1
0.82
0.82
0.61
0.61
0.37
0.37
0.48
0.48
0.33 0.31
0.33 0.31
0.15 0.18
0.18 0.1 0.12
0.07 0.15
0.1 0.12
0.07
G1 →s2
G1→s2
→s2
G3
G3 →s2
G1 →s3
G1→s3
→s3
G3
G3 →s3
0.17
0.17
Frequent item sets
Frequent item sets
G1 →s4
G1→s4
→s4
G3
G3 →s4
G2 →s1
G2→s1
→s1
G4
G4 →s1
0.16
0.16
0.05 0.03
0.05 0.03
G2 →s2
G2→s2
→s2
G4
G4 →s2
G2 →s3
G2→s3
→s3
G4
G4 →s3
0.05
0.05
G2 →s4
G2→s4
→s4
G4
G4 →s4
(b) The association rule between failure modes and ethane.
(b) The association rule between failure modes and ethane.
Figure 5. The calculation result.
Figure 5. The calculation result.
Figure 5. The calculation result.
The membership functions of different kinds of gas are calculated, and the curves of the
The membership functions of different kinds of gas are calculated, and the curves of the
functions
are shown infunctions
Figure 6.
The
functionsmembership
are shown in Figure 6.of different kinds of gas are calculated, and the curves of the functions
are shown in Figure 6.
10 of 15
10 of 15
1.0
1.0
0.8
0.8
Failure Probability
Failure probability
Energies 2017, 10, 704
Energies 2017, 10, 704
0.6
0.4
0.2
0.0
0
50
100
150
H2(uL/L)
200
G1
G2
G3
G4
250
G1
G2
G3
G4
0.6
0.4
0.2
0.00
300
20
40
1.0
1.0
0.8
0.8
0.6
0.4
0.2
G1
2
4
6
8
10
80
(b) CH4
Failure Probability
Failure Probability
(a) H2
0.00
60
CH4(uL/L)
12
G2
G3
G4
16
18
20
14
G1
G2
G3
G4
0.6
0.4
0.2
0.00
10
20
30
40
50
C2H4(uL/L)
C2H6(uL/L)
(c) C2H6
(d) C2H4
Failure Probability
1.0
0.8
0.6
0.4
0.2
0.00
5
10
15
20
G1
G2
G3
25
G4
30
C2H2(uL/L)
(e) C2H2
Figure 6.
6. The
The membership
membership functions
functions of
of the
the dissolved
dissolved gas.
gas.
Figure
The important weights of the five types of gas to the four failure modes are calculated and shown
The important weights of the five types of gas to the four failure modes are calculated and shown
in Figure 7.
in Figure 7.
It is obvious that the weight of acetylene is small in overheating failure modes while the alkanes
It is obvious that the weight of acetylene is small in overheating failure modes while the alkanes
is larger. Additionally, the weight of hydrogen is much larger than the weight of other gases in
is larger. Additionally, the weight of hydrogen is much larger than the weight of other gases in
discharging failure modes.
discharging failure modes.
Energies
Energies2017,
2017,10,
10,704
704
Energies 2017, 10, 704
1111ofof1515
11 of 15
H2
H2
0.5
0.5
0.4
0.4
0.3
0.3
0.2
0.2
0.1
0.1
0.0
0.0
C2H2
C2H2
G1
G1
G3
G3
G2
G2
G4
G4
CH4
CH4
C2H4
C2H4
C2H6
C2H6
Figure7.7.The
Theimportant
importantweight
weightofofeach
eachgas.
gas.
Figure
Figure 7. The important weight of each gas.
6.3.Case
CaseStudy
Study11
6.3.
6.3. Case Study 1
Shownin
inFigure
Figure88 are
are the
the time
time series
toto
30
Shown
series data,
data, which
whichinclude
include236
236samples
samplesfrom
from1 1January
January2016
2016
Shown in Figure 8 are the time series data, which include 236 samples from 1 January 2016 to 30
June
2016,
transformer (named
(named T1)
T1) where
where the
the value
valueofof
30
June
2016,ofofthe
thedissolved
dissolvedgas
gascontent
contentofofaa 220
220 kV
kV transformer
June 2016, of the dissolved gas content of a 220 kV transformer (named T1) where the value of
ethyleneisiszero.
zero.
ethylene
ethylene is zero.
120
120
90
90
60
60
30
300
0
10
108
86
64
42
20
0
H2
H2
50
50
100
100
150
150
200
200
C2 H6
C2 H6
50
50
100
100
150
150
200
200
7
7
6
6
5
5
4
4
3
3
0.5
0.5
0.4
0.4
0.3
0.3
0.2
0.2
0.1
0.1
0.0
0.0
CH4
CH4
50
50
100
100
150
150
200
200
C2 H4
C2 H4
50
50
100
100
150
150
200
200
Time(Day)
Time(Day)
Figure 8. Thecontent
content of gas(uL/L).
(uL/L).
Figure
Figure8.8.The
The contentofofgas
gas (uL/L).
The data of the five kinds of dissolved gas are brought to each kind of membership function,
Thedata
dataofofthe
thefive
five
kinds
dissolved
brought
to each
of membership
function,
The
kinds
of of
dissolved
gasgas
areare
brought
to each
kindkind
of membership
function,
and
and the failure probability of the four failure modes and the failure probability of T1 are calculated
andfailure
the failure
probability
the four
failure
modes
andfailure
the failure
probability
T1 calculated
are calculated
the
probability
of theoffour
failure
modes
and the
probability
of T1ofare
by
by Equations (10) and (14). The calculation result is shown in Figure 9.
by Equations
calculation
result
is shown
in Figure
Equations
(10) (10)
and and
(14).(14).
The The
calculation
result
is shown
in Figure
9. 9.
It is clear from Figure 9 that the hydrogen content in T1 abnormally increased from the 45th day
clearfrom
fromFigure
Figure99that
thatthe
thehydrogen
hydrogencontent
contentin
inT1
T1abnormally
abnormallyincreased
increasedfrom
fromthe
the45th
45thday
day
ItItisisclear
to the 63rd day, and surpassed the warning value (37.52 uL/L) so that the failure probability of the
tothe
the 63rd
63rd day,
day, and
and surpassed the warning
soso
that
thethe
failure
probability
of the
to
warning value
value(37.52
(37.52uL/L)
uL/L)
that
failure
probability
of
low-energy discharging (G3) and low-temperature overheating (G1) increased, resulting in the
3) 3and
in the
the
low-energy
discharging
(G(G
the
low-energy
discharging
) andlow-temperature
low-temperatureoverheating
overheating(G
(G1)1 ) increased,
increased, resulting in
increment of the equipment failure probability. With the hydrogen content decreasing after the 63rd
increment of
of the equipment
thethe
hydrogen
content
decreasing
afterafter
the 63rd
increment
equipment failure
failureprobability.
probability.With
With
hydrogen
content
decreasing
the
day, the equipment failure probability decreased to a lower level, and remained lower than 0.3. The
day,day,
the equipment
failure
probability
decreased
to a to
lower
level,
and and
remained
lower
thanthan
0.3. 0.3.
The
63rd
the equipment
failure
probability
decreased
a lower
level,
remained
lower
calculation results are consistent with the actual situation of T1.
calculation
results
are are
consistent
with
thethe
actual
situation
of T1.
The
calculation
results
consistent
with
actual
situation
of T1.
6.4. Case Study 2
The monitoring data of a 220 kV transformer (named T2) in 2015 are listed in Table 5. The overheat
defect was detected in the oil pump on the 96th day, and it became serious after the 99th day.
The diagnosis resulting from three-ratio method is 022 which means the failure mode of T2 is
Energies 2017, 10, 704
12 of 15
1.0
0.9
Faiulre Probability
Energies 2017, 10, 704
G1
G2
G3
G4
T1
12 of 15
0.8
0.7
Faiulre Probability
0.6
high-temperature overheating
(G2 ). The content of total hydrocarbon is the sum of CH4 , C2 H6 ,
0.5
Energies
2017,C
10,
704
12 of 15
C2 H4 and
H
.
2 2
6.4. Case Study 2
0.4
0.3
1.0
0.2
0.9
0.1
0.8
0.0
0.7
G1
0
G2
G3
50
G4
T1
100
150
200
Time(Day)
0.6
0.5
0.4
Figure 9. The failure probability calculation result of T1.
0.3
0.2
0.1
The monitoring
0.0 data of a 220 kV transformer (named T2) in 2015 are listed in Table 5. The
0
50
100
150
200
overheat defect was detected in the oil pump on the 96th day, and it became serious after the 99th
Time(Day)
day. The diagnosis resulting from three-ratio method is 022 which means the failure mode of T2 is
high-temperature overheating
(GThe
2). The content of total hydrocarbon is the sum of CH4, C2H6, C2H4
Figure9.
9.
failureprobability
probability calculation
calculation result
result of
of T1.
T1.
Figure
The failure
and C2H2.
Table 5. The content of the dissolved gas in T2 (uL/L).
Table 5. The content of the dissolved gas in T2 (uL/L).
6.4. Case Study 2
The monitoring
220 kV transformer
(named
T2) in 2015
are listed in Table 5. The
Day data
H2 of aH
CH
C2CH2H
C H
Total Hydrocarbon
66
2H
2
Day
2 4 CH4
CC
2H
4 4 C2H2 2 Total
Hydrocarbon
overheat defect was detected in the oil pump on the 96th day, and it became serious after the 99th
1st
0 1.6 1.6
00
0.60.6
0
2.2
1st
0
0
2.2
day. The diagnosis
which means
the
31st
146.6 three-ratio
6.6
26 method
1.41.4 is 0022 0
34
14 from
26
31st resulting
34 failure mode of T2 is
high-temperature
(G
2). 110
The content
of190
total 0hydrocarbon
sum of CH4, C2H6, C2H4
73
110
41
0
96thoverheating
96th
73
41
190
341is the341
77
99th
374.8
99th
77120 120 44
44
210210 0.8 0.8
374.8
and C2H2.
95
406.2
105th
95130 130 55
55
220220 1.2 1.2
406.2
76
140
64
230
1.1
435.1
107th
140
64 of the
230
1.1 gas in T2 435.1
Table765. The
content
dissolved
(uL/L).
86
416.1
110th
86137 137 48
48
230230 1.1 1.1
416.1
135 H
320
634.4
Day
2230 CH
2H6
C320
2H
4
C21.4
H2 1.4
Total Hydrocarbon
146th
135
2304 C83
83
634.4
140 0140
240 1.6
736.1
1st
0.6
2.2
196st
240 085
85
410410 01.1 1.1
736.1
31st
14
6.6
26
1.4
0
34
96th
73
110
41
190
0
341
Thefailure
failureprobability
probabilitycalculation
calculationresult
resultof
ofT2
T2isisshown
shownin
inFigure
Figure10,
10,which
whichindicates
indicatesthat
thatfailure
failure
The
99th
77
120
44
210
0.8
374.8
probability
was
increasing
rapidly
from
the
31th
day
to
the
96th
day,
and
the
failure
mode
of
T2
was
probability was increasing
from the
day to1.2
the 96th day,
and the failure mode of T2 was
105th rapidly
95
130
55 31th
220
406.2
G
1
or
G
2
.
After
the
96th
day,
the
failure
probability
of
T2
approached
1,
which
means
that
T2
had
G1 or G2 . After the 96th
day, the
107th
76 failure
140 probability
64
230 of T2
1.1 approached
435.11, which means that T2 had aa
seriousoverheating
overheatingfault.
fault.
The86result
result
isconsistent
consistent
withthe
theactual
actualsituation.
situation.
110thThe
137is
48
230with
1.1
416.1
serious
146th 135 230
83
320
1.4
634.4
196st 140 240
85
410
1.1
736.1
105th
107th
110th
146th
196st
G1
G2
G3
G4
Equipment
Failure Probability
Failure Probability
1.0
The failure probability
calculation result of T2 is shown in Figure 10, which indicates that failure
0.9
probability was increasing rapidly from the 31th day to the 96th day, and the failure mode of T2 was
0.8
G1 or G2. After the 96th day, the failure probability of T2 approached 1, which means that T2 had a
0.7
serious overheating
0.6fault. The result is consistent with the actual situation.
0.5
0.4
0.3
1.0
0.2
0.9
0.1
0.8
0.0
0
0.7
0.6
0.5
0.4
0.3
0.2
0.1
0.0
0
G1
50
G2
G3
100
G4
Equipment
150
200
Time(Day)
Figure
Figure10.
10. The
The result
result of
offailure
failureprobability
probabilitycalculation.
calculation.
50
100
150
Time(Day)
Figure 10. The result of failure probability calculation.
200
Energies 2017, 10, 704
13 of 15
6.5. Comparative Analysis
To make a comparison between the method in this paper and the traditional methods, this paper
presents two types of common fault diagnosis methods based on classifier and the threshold values.
6.5.1. The Fault Diagnosis Method Based on a Classifier
Three kinds of common classifiers are presented to establish the fault diagnosis models,
respectively, namely, decision tree classifier (J48), random forest classifier (RF), and SVM. After
the training process of each model, the accuracy results of the three models are obtained and listed
in Table 6. According to the precision results, the models based on RF and SVM are selected for
fault diagnosis.
Table 6. Precision results of each model.
Day
Precision of the Model
J48
RF
SVM
0.873
0.997
0.952
6.5.2. The Fault Diagnosis Method Based on the Threshold Value
The content and the relative increase rate of the dissolved gas in oil are important criteria for fault
diagnosis. The threshold values of different kinds of gas are listed in Table 7 according to [17].
Table 7. The threshold values of the dissolved gas in oil.
Fault Severity
The Fault Criterion
Slight fault
The content of total hydrocarbon or H2 is greater than 150 µL/L.
The content of C2 H2 is greater than 5 µL/L;
Serious fault
The relative increase rate of total hydrocarbon is greater than 10% per month;
The increase rate of total hydrocarbon is greater than 10 mL per day.
The monitoring data of Case 1, shown in Figure 8, is applied to the fault diagnosis model based
on RF, SVM, and the threshold value. The results indicate the equipment is in a normal state without
any fault, which is consistent with the actual situation. Similarly, the fault diagnosis result of Case 2 is
also obtained and listed in Table 8. It indicates that the failure mode of the T2 is the high-temperature
overheating fault (G2 ), and it becomes serious after the 96th day.
Table 8. The fault diagnosis results of three models.
Day
RF
SVM
Guide
1st
31st
96th~196st
Normal
Normal
G2
Normal
Normal
G2
Normal
Normal
Serious
According to the results, three traditional fault diagnosis methods have high accuracy in
determining whether the equipment fails. However, the model of the threshold value fails to identify
the failure modes for the equipment while the method based on the classifier is unable to determine the
severity of the fault. Furthermore, any traditional methods can not quantify the failure probability of
an equipment failure and reflect the changing trend of the failure probability. Taking T2 as an example,
the fault cannot be found before the 96th day by traditional methods, so that maintenance strategy for
T2 is not be made or arranged in time. The equipment failure may cause serious power interruption,
Energies 2017, 10, 704
14 of 15
and significant losses of equipment and to the power system. These shortcomings of the traditional
models make it impossible to meet the effectiveness requirements for the power system.
7. Conclusions
In this paper, a failure probability calculation method for power equipment based on
multi-characteristic parameters is proposed. The historical data of different fault characteristic
parameters are collected to obtain the cumulative distribution function and the differential warning
value of each parameter. Then the association relations between different failure modes and the
characteristic parameter and the important weight of each parameter are obtained. Finally, the
weighted calculation and the series model are applied to calculating the failure probability of all of the
failure modes and the equipment.
The failure probability calculation method realizes the comprehensive utilization of
multi-characteristic parameters. To avoid the influence of subjective factors on the calculation results,
the process of the fault probability function calculation is driven by data, but not judgment from
experts, so that the results are ensured.
An application example for two 220 kV transformers is presented, and the procedure of the
method is described in detail. The results indicate that this novel method has obvious advantages in
accurately quantifying the failure probability, reflecting the failure changing trend of the equipment
and helping to create a timely maintenance strategy so that the remaining useful life and the reliability
of the power equipment can be markedly increased.
Acknowledgments: The work is supported by the National High-tech R and D Program of China (863 Program,
2015AA050204) and the China State Grid Corporation of Science and Technology Project (520626150032).
Author Contributions: The research presented in this paper was a collaborative effort among all authors.
Hang Liu, Youyuan Wang, and Liwei Zhou proposed the methodology of the model and contributed to the
manuscript at all stages. Ruijin Liao discussed the results and revised the manuscript critically. Yi Yang and
Yujie Geng prepared the data for the paper.
Conflicts of Interest: The authors declare no conflict of interest.
References
1.
2.
3.
4.
5.
6.
7.
8.
9.
10.
Chinese Society for Electrical Engineering Information Committee. Chinese Electric Power Big Data
Development White Paper; China Electric Power Press: Beijing, China, 2013; pp. 1–13.
Yang, Y.D.; Bi, Z.Q. Advances and future challenges in electric power big data. In Proceedings of the 2014
Second International Conference on Advanced Cloud and Big Data, Wuhan, China, 20–22 November 2014.
Zhang, P.; Yang, H.F.; Xu, Y.B. Power big data and its application scenarios in power grid. Proc. CSEE 2014,
34, 85–92. [CrossRef]
Lai, Z.T. Electric Power Big Data: Energy Internet Age Transformation and Value Creation; Machinery Industry
Press: Beijing, China, 2015; pp. 123–131. ISBN: 978-7-111-51693-4.
Otman, B.; Yuan, X.H. Engine fault diagnosis based on multi-sensor information fusion using
Dempster–Shafer evidence theory. Inf. Fusion 2007, 8, 379–386. [CrossRef]
Lu, F.; Jiang, C.Y.; Huang, J.Q. A novel data hierarchical fusion method for gas turbine engine performance
fault diagnosis. Energies 2016, 9, 828. [CrossRef]
Li, J.; Zhang, Q.; Wang, K. Optimal dissolved gas ratios selected by genetic algorithm for power transformer
fault diagnosis based on support vector machine. IEEE Trans. Dielectri. Electri. Insul. 2016, 23, 1198–1206.
[CrossRef]
Lu, J.J.; Huang, J.Q.; Lu, F. Sensor fault diagnosis for aero engine based on online sequential extreme learning
machine with memory principle. Energies 2017, 10, 39. [CrossRef]
Hazlee, A.I.; Xin, R.C.; Ab, H.A.B. Hybrid modified evolutionary particle swarm optimisation-time varying
acceleration coefficient-artificial neural network for power transformer fault diagnosis. Measurement 2016,
90, 94–102. [CrossRef]
Chen, W.G.; Pan, C.; Yun, Y.X.; Liu, Y.L. Wavelet networks in power transformers diagnosis using dissolved
gas analysis. IEEE Trans. Power Deliv. 2009, 24, 187–194. [CrossRef]
Energies 2017, 10, 704
11.
12.
13.
14.
15.
16.
17.
18.
19.
20.
21.
15 of 15
The “four Vs” of Big Data. Available online: http://www.ibmbigdatahub.com/sites/default/files/
infographic_file/4-Vs-of-big-data (accessed on 12 October 2012).
Peng, X.S.; Deng, D.Y.; Cheng, S.J.; Wen, J.Y.; Li, C.H.; Niu, L. Key technologies of electric power big data
and its application prospects in smart grid. Proc. CSEE 2015, 35, 503–511. [CrossRef]
Song, Y.Q.; Zhou, G.L.; Zhu, Y.L. Present status and challenges of big data processing in smart grid.
Power Syst. Technol. 2013, 37, 928–935. [CrossRef]
Li, W.Y. A.3.4 Weibull Distribution. In Risk Assessment of Power Systems: Models, Methods, and Applications;
John Wiley & Sons Press: New York, NY, USA, 2005; ISBN: 1118843355.
Zhong, B.; Liu, Q.S.; Liu, C.L. Mathematical Statistics; Higher Education Press: Beijing, China, 2015;
pp. 103–115.
Zheng, Y.B. Research on Ensemble Learning of Fault Diagnosis and Prediction and Maintenance
Decision-Making Models for Transformers. Ph.D. Thesis, Chongqing University, Chongqing, China,
October 2011.
The State Grid Corporation. Guide for Evaluation of Oil-immersed Power Transformers (Reactors); The State Grid
Corporation: Beijing, China, 2008.
Qi, B.; Zhang, P. Calculation Method on differentiated warning value of power transformer based on
distribution model. High Volt. Eng. 2017, 42, 2290–2298. [CrossRef]
Sheng, G.H.; Hou, H.J.; Jiang, X.C. A novel association rule mining method of big data for power transformers
state parameters based on probabilistic graph model. IEEE Trans. Smart Grid 2016, 1, 1–9. [CrossRef]
Suresh, J.; Rushyanth, P.; Trinath, C. Generating associations rule mining using Apriori and FPGrowth
Algorithms. Int. J. Comput. Trends Technol. 2013, 4, 887–892.
Xu, Z.H.; Wei, C.P. A consistency improving method in analytic hierarchy process. Eur. J. Oper. Res. 1999,
116, 443–449. [CrossRef]
© 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).