On The Use of Artificial Intelligence For High Impedance Fault Detection and Electrical Safety
On The Use of Artificial Intelligence For High Impedance Fault Detection and Electrical Safety
Abstract—Accidents caused by faults on overhead power lines performance is observed in their detection logic when facing
have been more frequently reported under extreme weather con- the HIF events. Electrical safety studies have shown that
ditions and may strongly threaten the safety and stability of the conventional protection schemes detected and cleared only
power grids, e.g., massive wildfires caused by the electrical arcs
or lines getting in touch with vegetation, relay miss-operations, 17.5% of staged HIFs [3]–[5]. Therefore, HIF detection and
etc. It has been widely recognized that the electric safety concerns localization in electrical power systems yet remain a safety-
engendered by overhead line faults have to be timely and properly threatening challenge for power system protection engineers,
addressed to minimize the subsequent risks and damages. The a fast and accurate solution to which is urgently needed to
existing monitoring devices and protective relays can barely limit the safety risks, prevent power grid operation violations,
detect high impedance faults (HIFs) and are unable to warn
the system operators until serious abnormalities or damages and save human lives [6], [7].
are observed. Aiming at avoiding the damaging consequences Many research works have been conducted in diagnosing
of HIFs, an online monitoring system embedded with machine HIFs, each with some advantages and disadvantages; reference
learning analytics is proposed that ensures a fast and accurate [8] harnessed the high-frequency content in real vegetation
detection of HIFs in power systems. The performance of the fault signatures and proposed a method for the detection of
proposed artificial intelligence engine is tested under a variety of
simulated conditions and the numerical results demonstrate its distinct and very small-current HIFs. In [9], an HIF detection
efficacy and superiority over the state-of-the-art advancements. approach for power distribution networks is suggested using
fuzzy logic control that evaluates the 3rd and 5th harmonics
Index Terms—Convolutional Neural Network (CNN); electrical
safety; event detection; high impedance fault (HIF); feature in the electrical current signals. However, due to the dif-
extraction; wavelet transform (WT). ferent HIFs characteristics compounded by the existence of
harmonics and noises in the power waveforms, their accuracy
and speed performance may be compromised in real-world
I. I NTRODUCTION scenarios. A scheme to detect HIFs using the Time-Time
Electricity has dramatically changed our daily lives and (TT) transform that analyzes and determines the fault wave
fuels our modern society. The electricity grid is constantly patterns is introduced in [10]; this scheme can handle low
exposed, and yet vulnerable, to a wide range of threats, signal to noise ratio (SNR) in power waveforms through a
some foreseeable and some unpredictable and random in threshold selection procedure using unscented transformation
nature. One safety-threatening disruption in power systems (UT). While it can be applied to microgrids with different
is recognized as High Impedance Faults (HIF), the detection ratings and structures, the detection threshold must be tailor-
of which has long remained a challenging concern in the made and adjusted appropriately. Reference [11] presented a
electric industry. HIFs can cause “arcs” or “flashover” from variable-importance-based feature selection method to identify
the wires, through the air, to the neighboring trees, other HIFs from a large pool of signal signatures; this feature
vegetation or equipment, where it can cause fires, injuries, or selection scheme utilizes the discrete Fourier transform (DFT)
even fatalities [1]. A life-threatening example is the constant and Kalman Filter (KF) for harmonics coefficient estimation
exposure and contact of a power line with a tree branch and HIF duration and magnitude measurement.
during high-wind conditions, which can threaten homes in With the rapid advancements in Artificial Intelligence (AI),
residential neighborhoods and spark wildfires in rural areas. many researchers have implemented various HIF detection
Such types of faults are commonly caused by undesired techniques through a variety of machine learning technologies.
contacts with bare energized electrical conductive parts, the References [1], [12], [13] utilized waveform pattern analysis
high-impedance nature of which significantly restricts the flow and Support Vector Machine (SVM) to classify and ultimately
of fault current to a level hard to be detected by the overcurrent detect HIFs in power grids. In [4], semi-supervised learning
protective relays [2]. In particular, the existing commercial and probabilistic learning are used for HIF detection and
microprocessor-based protective relays activate a tripping de- localization, revealing promising detection accuracy, but with
cision when the electrical measurements are observed well a compromised response time of half a second. In [14], an
beyond the detection threshold; however, an unsatisfactory HIF detection approach using empirical mode decomposition
(EMD) combined with an artificial neural network (ANN)
This work was supported in part by the National Science Foundation (NSF) is proposed, where the HIF detection and classification are
under Grant ICER-2022505. achieved through predominant harmonic signatures caused by
S. Wang and P. Dehghanian are with the Department of Electrical and
Computer Engineering, The George Washington University, Washington, DC HIFs in the electrical signals. Discrete wavelet transform is
20052, USA (e-mails: shiyuan1225@gwu.edu; payman@gwu.edu). applied in [15] to monitor the high-frequency components
Digital Object Identifier 10.1109/TIA.2020.3017698
0093-9994 © 2020 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See https://www.ieee.org/publications/rights/index.html for more information.
2
A. HIF Modeling where, RF and LF are the HIF’s resistance and inductance
in series; VF P and VF N are the positive and negative arc
A single-line diagram of an HIF event in a radial distribution
voltages during HIF events, which archives the non-linearity
system is illustrated in Fig. 1(a). The sending node is modeled
of HIFs. The simulated voltage and current waveforms are
by an ideal AC source; Z1 and Z2 are known impedance values
demonstrated in Fig. 2. It can be found that although there is
and can be estimated according to the system topology and
almost no influence on the voltage waveform, the HIF event
operating conditions; R, L and C stand for the per unit length
slightly distorts the current signal while the current magnitude
resistance, inductance, and capacitance of the line. By applying
has remained around the rated range; in other words, the HIF
Kirchhoff’s voltage law to the dashed circle in Fig. 1(a), the
event cannot be detected easily by the conventional protective
following equation is derived:
relays. One should note that, at the beginning of an HIF event,
d(in1 − ic ) this HIF model can barely match the characteristics of the
vn1 = δ R(in1 − ic ) + L + vF (1) first period due to the build-up phenomenon [24]. However,
dt
3
Fig. 3. Comparison of the STFT vs. WT with: HIF-affected waveform and harmonic-injected waveform starting at t=20ms with harmonic orders h= 3 and
5 and magnitudes of 0.08pu and 0.08pu, respectively.
online applicability of the HIF detection schemes. In order to D. Convolutional Neural Networks
archive a low-computing complexity and yet accurate design, Convolutional Neural Networks (CNNs) are among deep
we select WT alone in this paper for power waveform feature machine learning techniques that have been proven very ef-
extraction and HIF detection. fective in processing image-related tasks [38]–[40]. Therefore,
The wavelet transform can be seen as the computation once the scalograms from WT are generated, CNNs can be
process of the similarities between the signal of interest x(t) used as the HIF event detection engine through classifying
and the selected wavelet. x(t) is here the measured single- the scalograms. The CNN training process mainly involves
phase waveform. The WT calculation is defined as follows: feature/representation learning for feature detection and clas-
sification. In general, the execution of CNN for HIF detection
∞
1 t−b and classification is achieved through a set of cross-correlation
X(ω|a, b) = √ x(t)Ψ* ( )dt (6)
a −∞ a assessments as follows:
sp (m, n) = I u (m + v, n + w)K p (v, w), (8)
where a, b are the scaling factor and time shift; Ψ(t) is the
u v w
selected wavelet (called mother wavelet when a = 1 and
p
b = 0), and “*” denotes the complex conjugate operator. where s (m, n) stands for the convolutional layer’s output at
With different values of a and b selected, Ψ( t−b position (m, n) and p-th channel; the u-th convolutional kernel
a ) becomes
the ”daughter wavelets” of Ψ(t) [35]–[37], and each value of is marked as K u ; and I u denotes the image/data volume in
a corresponds to a pseudo frequency. Typically, the WT uses the u-th channel. A complex convolutional layer is comprised
discrete scaling factors ak =2i , where i is integer. Thus, the of a set of simple layers [41], as expressed in the following:
extracted feature from each pseudo frequency is finite. To have
I l = pool (σ(s)) , (9)
more information extracted and waveform feature redundancy
in the scalogram, a set of linearly-increasing real numbers can where I l stands for the l-th layer’s output volume; σ(·) stands
be assigned to i, and the WT then becomes pseudo-continuous for the nonlinear operation of the active function; and pool(·)
(PCWT). Here, the PCWT is defined as follows: is a pooling (down-sample) operation in the pooling layer.
The abstraction ability of the network generally increases with
W −1
1 nTs − bk the number of stacked convolutional layers [42]. The final
X[ω|ak , bk ] = √ x[n]Ψ* [ ] (7) representations in a CNN’s last layer are usually reshaped to
a n=0 ak
vectors and fed into the fully-connected (FC) layers.
where γ is a constant. Accordingly, and based on (10), the Fs 7.68kHz Line Length 2km
length of the Gaussian window in Gabor wavelet also adapts SLoad 0.5 - 1.5 pu pfload 0.8 - 1
different frequencies. HIF Model Setting
The proposed Gabor wavelet in the discrete time domain is
Parameter Range Parameter Range
ωc Ts (n − bk ) Ts2 (n − bk )2 ZFP 0.02 pu - 1.5 pu ZFN 0.02 pu - 1.5 pu
Ψ[n|ak , bk ] = exp j exp − ,
ak a2k α02 pfFP 0-1 pfFN 0-1
(16)
VFP 0.08 pu - 0.65 pu VFN 0.08 pu - 0.65 pu
Applying different discrete scaling factors ak and time shift
Fault Location 1% - 99%
bk , we achieve the proposed PCQ-WT as follows,
* :A signal to noise ratio of 40dB is added to the AC voltage source.
W
−1
Ts (n − bk )
X(ωk |ak , bk ) = x[n]Ψ* [ ]
n=0
ak
W −1 feature extraction will not be compromised. The scalograms
ωc T 2 (n − bk )2 fed into the CNN are cropped from 10ms to 50ms (of the total
= x[n]exp −j Ts (n − bk ) − s 2 2 .
n=0
ak ak α 0 60ms WT output)—i.e., observation+ wavelet length— which
(17) has 40ms (308 samples) duration.
The proposed compact CNN configuration for scalo-
If the pseudo frequencies of interest and the Gabor wavelet
gram classification is as follows: Input layer (256×308);
bank are designed properly, a set of PCQ-WTs in form of a
Convolution (Conv.) layer (32×5×11); Max-pooling layer
vector Xω can be generated conveying waveform features in a
(3×3); Conv. layer(32×5×5); Max-pooling layer(3×3);
certain frequency range. During both transient and steady state
Conv. layer(32×5×5); Full-connected (FC) layer(200×1); FC
operating modes, WT time-frequency analysis is conducted
layer(3×1). Conventional images have homogeneous units on
along time, and a scalogram stream can be then achieved.
the horizontal and vertical axes, while the scalograms axes
carry different information regarding the HIF events on either
C. Proposed CNN Configuration for HIF Detection time or frequency. Therefore, a wide kernel in the first Conv.
With the PCQ-WT extracted features in form of scalo- layer that can extract more information from the transitions
grams available, the HIF detection problem is converted to along the time axis is used. The stride of the first layer
a supervised scalograms classification problem. However, the is (2,3), and the remaining Conv. layers use strides with
classification of the high-dimensional 2-D scalograms is chal- a size of (1,1). Besides, batch normalization [47] is used
lenging. specifically, every frame of the obtained scalogram in the last FC layer. In the last Conv. layer and the first
has hundreds by hundreds (scales × time) pixels; it is very FC layer, Dropout [48] is used to prevent over-fitting. All
challenging to process such high dimensional data through activation functions in the CNN are Rectified Linear Unit
the conventional pattern classification approaches. Here, we (ReLU). We choose cross-entropy as the loss function. One
convert the PCQ-WT scalograms into 2-D images and propose can see that our proposed CNN is not that ”deep” compared to
a compact CNN architecture to classify the HIFs concealed in the regular image classification CNNs; the suggested compact
the scalograms by the PCQ-WT. The proposed CNN has a CNN architecture further reduces the computing complexity
simple architecture for HIF detection, yet achieving a very in HIF detection.
fast processing time for online applications.
IV. C ASE S TUDY AND N UMERICAL E XPERIMENTS
D. PCWT and CNN Parameter Setting
A. Test Scenarios Configuration
The sampling rate F s for the signal and the feature ex-
traction is 7680Hz which provides 128 samples per nominal The parameter specifications used for generating the test
fundamental cycle. The observation window for the PCQ- waveforms are listed in Table. I. We, in particular, focus on
WT is set to 308 samples (40ms). The time shift b for the HIF event detection since conventional faults can typically be
modified Gabor daughter wavelets is 10ms (77 samples) for detected by existing protective devices. Three test scenarios
simplicity. The scaling factor a for the proposed PCQ-WT (HIF event, load change event, and normal operation event)
is chosen as 2i , where i is sampled uniformly from 0 to 8. are simulated. For each HIF simulation, all parameters are
The central frequency ωc = 1152 and the pseudo frequency randomly selected in the designated ranges. In each simulated
will roughly reach up to the 19th order of harmonics. All waveform, only one event occurs at a random point in time.
the Gabor wavelets have 20ms duration. By this design, the The waveform generation system is developed according to
computational complexity in computing the scalograms is Fig. 1(a) and the improved HIF model in Fig. 5. Gaussian
reduced, while the pseudo frequency bandwidth coverage for noises with SNR of 40dB are added to the AC voltage
7
(a) HIF with SNR 30dB (b) Load Increase with SNR 30dB
Fig. 6. Test waveform simulation results: (a) HIF event; (b) Load increase event; both simulated events start at t=20ms.
Fig. 8. Online HIF detection on a simulated single-phase current waveform: detected result (top) and original waveform (bottom).
[17] S. Wang, P. Dehghanian, L. Li, and B. Wang, “A machine learning [38] A. Krizhevsky, I. Sutskever, and G. E. Hinton, “Imagenet classification
approach to detection of geomagnetically induced currents in power with deep convolutional neural networks,” in Advances in neural infor-
grids,” IEEE Transactions on Industry Applications, vol. 56, no. 2, pp. mation processing systems, 2012, pp. 1097–1105.
1098–1106, 2019. [39] K. Simonyan and A. Zisserman, “Very deep convolutional networks for
[18] S. Wang, P. Dehghanian, and L. Li, “Power grid online surveillance large-scale image recognition,” arXiv preprint arXiv:1409.1556, 2014.
through PMU-embedded convolutional neural networks,” IEEE Trans- [40] L. Li, M. Doroslovački, and M. H. Loew, “Discriminant analysis deep
actions on Industry Applications, vol. 56, no. 2, pp. 1146–1155, 2019. neural networks,” in 53rd Annual Conference on Information Sciences
[19] M. H. Rezaeian Koochi, P. Dehghanian, S. Esmaeili, P. Dehghanian, and and Systems, March 2019, pp. 1–6.
S. Wang, “A synchrophasor-based decision tree approach for identifica- [41] I. Goodfellow, Y. Bengio, and A. Courville, Deep Learning. MIT Press,
tion of most coherent generating units,” in Annual Conference of the 2016, http://www.deeplearningbook.org.
IEEE Industrial Electronics Society, Oct 2018, pp. 71–76. [42] Y. Bengio, A. Courville, and P. Vincent, “Representation learning: A
[20] S. Wang, P. Dehghanian, L. Li, and B. Wang, “A machine learning review and new perspectives,” IEEE transactions on pattern analysis
approach to detection of geomagnetically induced currents in power and machine intelligence, vol. 35, no. 8, pp. 1798–1828, 2013.
grids,” in IEEE Industry Applications Society Annual Meeting (IAS), [43] J. V. B. Soares, J. J. G. Leandro, R. M. Cesar, H. F. Jelinek, and
2019, pp. 1–7. M. J. Cree, “Retinal vessel segmentation using the 2-D Gabor wavelet
[21] M. Babakmehr, F. Harirchi, M. Nazir, S. Wang, P. Dehghanian, and and supervised classification,” IEEE Transactions on Medical Imaging,
J. Enslin, “Sparse representation-based classification of geomagnetically vol. 25, no. 9, pp. 1214–1222, Sept 2006.
induced currents,” in Clemson University Power Systems Conference, [44] Z. Zhang, M. Lyons, M. Schuster, and S. Akamatsu, “Comparison
March. 2020, pp. 1–7. between geometry-based and Gabor-wavelets-based facial expression
recognition using multi-layer perceptron,” in Proceedings Third IEEE
[22] B. Shinde, S. Wang, P. Dehghanian, and M. Babakmehr, “Real-time
International Conference on Automatic Face and Gesture Recognition,
detection of critical generators in power systems: A deep learning hcp
April 1998, pp. 454–459.
approach,” in 2020 IEEE Texas Power and Energy Conference (TPEC).
[45] B. S. Manjunath and W. Y. Ma, “Texture features for browsing and
IEEE, 2020, pp. 1–6.
retrieval of image data,” IEEE Transactions on Pattern Analysis and
[23] A. Ghaderi, H. L. Ginn III, and H. A. Mohammadpour, “High impedance Machine Intelligence, vol. 18, no. 8, pp. 837–842, Aug 1996.
fault detection: A review,” Electric Power Systems Research, vol. 143, [46] J. Hubbard, “Calculation of partition functions,” Physical Review Letters,
pp. 376–388, 2017. vol. 3, no. 2, p. 77, 1959.
[24] W. C. dos Santos, B. A. de Souza, N. S. D. Brito, F. B. Costa, and [47] S. Ioffe and C. Szegedy, “Batch normalization: Accelerating deep
M. R. C. Paes, “High impedance faults: From field tests to modeling,” network training by reducing internal covariate shift,” arXiv preprint
Journal of Control, Automation and Electrical Systems, vol. 24, no. 6, arXiv:1502.03167, 2015.
pp. 885–896, 2013. [48] N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever, and R. Salakhut-
[25] A. von Meier, E. Stewart, A. McEachern, M. Andersen, and dinov, “Dropout: a simple way to prevent neural networks from over-
L. Mehrmanesh, “Precision micro-synchrophasors for distribution sys- fitting,” The Journal of Machine Learning Research, vol. 15, no. 1, pp.
tems: A summary of applications,” IEEE Transactions on Smart Grid, 1929–1958, 2014.
vol. 8, no. 6, pp. 2926–2936, 2017. [49] D. P. Kingma and J. Ba, “Adam: A method for stochastic optimization,”
[26] R. F. Nuqui and A. G. Phadke, “Phasor measurement unit placement arXiv preprint arXiv:1412.6980, 2014.
techniques for complete and incomplete observability,” IEEE Transac-
tions on Power Delivery, vol. 20, no. 4, pp. 2381–2388, 2005.
[27] T. Nguyen, S. Wang, M. Alhazmi, M. Nazemi, A. Estebsari, and
P. Dehghanian, “Electric power grid resilience to cyber adversaries: State
of the art,” IEEE Access, vol. 8, pp. 87 592–87 608, 2020.
[28] E. Mangueira Lima, C. M. dos Santos Junqueira, N. Silva Dantas
Brito, B. A. de Souza, R. de Almeida Coelho, and H. Gayoso Meira Shiyuan Wang (S’18) received the B.Eng. degree in
Suassuna de Medeiros, “High impedance fault detection method based mechanical engineering from University of Science
on the short-time fourier transform,” IET Generation, Transmission and Technology Beijing, China, in 2012; the M.Sc.
Distribution, vol. 12, no. 11, pp. 2577–2584, 2018. and Ph.D. degrees in electrical engineering from The
[29] Y.-C. Su, K.-L. Lian, and H.-H. Chang, “Feature selection of non- George Washington University, Washington, DC,
intrusive load monitoring system using stft and wavelet transform,” in e- USA, in 2014 and 2020, respectively. His research
Business Engineering (ICEBE), 2011 IEEE 8th International Conference interests include power system reliability and re-
on. IEEE, 2011, pp. 293–298. siliency, smart grid and renewable energy, power
grid harmonic analysis, and application of signal
[30] S.-H. Ni, K.-F. Lo, L. Lehmann, and Y.-H. Huang, “Time–frequency
processing in energy analytics.
analyses of pile-integrity testing using wavelet transform,” Computers
and Geotechnics, vol. 35, no. 4, pp. 600–607, 2008.
[31] S. Wang, P. Dehghanian, and B. Zhang, “A data-driven algorithm for
online power grid topology change identification with PMUs,” in 2019
IEEE Power & Energy Society General Meeting. IEEE, 2019, pp. 1–5.
[32] S. Wang, P. Dehghanian, and Y. Gu, “A novel multi-resolution wavelet
transform for online power grid waveform classification,” in The 1st
IEEE International Conference on Smart Grid Synchronized Measure- Payman Dehghanian (S11, M17, SM20) received
ments and Analytics (SGSMA), May 2019, pp. 1–8. the B.Sc. degree in electrical engineering from the
[33] J. C. Chen, B. T. Phung, H. W. Wu, D. M. Zhang, and T. Blackburn, University of Tehran, Tehran, Iran, in 2009, the
“Detection of high impedance faults using wavelet transform,” in 2014 M.Sc. degree in electrical engineering from the
Australasian Universities Power Engineering Conference, 2014, pp. 1–6. Sharif University of Technology, Tehran, Iran, in
[34] Shyh-Jier Huang and Cheng-Tao Hsieh, “High-impedance fault detection 2011, and the Ph.D. degree in electrical engineering
utilizing a morlet wavelet transform approach,” IEEE Transactions on from Texas A&M University, Texas, USA, in 2017.
Power Delivery, vol. 14, no. 4, pp. 1401–1410, 1999. He is currently an Assistant Professor with the
[35] D. P. Mishra, S. R. Samantaray, and G. Joos, “A combined wavelet and Department of Electrical and Computer Engineer-
data-mining based intelligent protection scheme for microgrid,” IEEE ing, The George Washington University, Washing-
Transactions on Smart Grid, vol. 7, no. 5, pp. 2295–2304, Sept 2016. ton, DC, USA. His research interests include power
system protection and control, power system reliability and resiliency, asset
[36] K. Thirumala, M. S. Prasad, T. Jain, and A. C. Umarikar, “Tunable-q
management, and smart electricity grid applications.
wavelet transform and dual multiclass svm for online automatic detection
Dr. Dehghanian was a recipient of the 2013 IEEE Iran Section Best M.Sc.
of power quality disturbances,” IEEE Transactions on Smart Grid, vol. 9,
Thesis Award in Electrical Engineering, the 2014 and 2015 IEEE Region
no. 4, pp. 3018–3028, July 2018.
5 Outstanding Professional Achievement Awards, and the 2015 IEEE-HKN
[37] S. Wang, L. Li, and P. Dehghanian, “Power grid online surveillance Outstanding Young Professional Award.
through PMU-embedded convolutional neural networks,” in IEEE In-
dustry Applications Society (IAS) Annual Meeting, 2019, pp. 1–7.