Optimized Dynamic Mode Decomposition via Non-Convex ... - MDPI

4 downloads 0 Views 5MB Size Report
Feb 27, 2018 - Zhang Dang 1,2,3 ID , Yong Lv 1,2,*, Yourong Li 1,2 and Cancan Yi 1,2 ...... Han, T.; Liu, X.; Tan, A. Fault Diagnosis of Rolling Element Bearings ...
entropy Article

Optimized Dynamic Mode Decomposition via Non-Convex Regularization and Multiscale Permutation Entropy Zhang Dang 1,2,3 1

2 3

*

ID

, Yong Lv 1,2, *, Yourong Li 1,2 and Cancan Yi 1,2

Key Laboratory of Metallurgical Equipment and Control Technology, Ministry of Education, Wuhan University of Science and Technology, Wuhan 430081, China; [email protected] (Z.D.); [email protected] (Y.L.); [email protected] (C.Y.) Hubei Key Laboratory of Mechanical Transmission and Manufacturing Engineering, Wuhan University of Science and Technology, Wuhan 430081, China National Demonstration Center for Experimental Mechanical Education, Wuhan University of Science and Technology, Wuhan 430081, China Correspondence: [email protected]; Tel.: +86-027-6886-2857; Fax: +86-027-6886-2212

Received: 22 January 2018; Accepted: 23 February 2018; Published: 27 February 2018

Abstract: Dynamic mode decomposition (DMD) is essentially a hybrid algorithm based on mode decomposition and singular value decomposition, and it inevitably inherits the drawbacks of these two algorithms, including the selection strategy of truncated rank order and wanted mode components. A novel denoising and feature extraction algorithm for multi-component coupled noisy mechanical signals is proposed based on the standard DMD algorithm, which provides a new method solving the two intractable problems above. Firstly, a sparse optimization method of non-convex penalty function is adopted to determine the optimal dimensionality reduction space in the process of DMD, obtaining a series of optimal DMD modes. Then, multiscale permutation entropy calculation is performed to calculate the complexity of each DMD mode. Modes corresponding to the noise components are discarded by threshold technology, and we reconstruct the modes whose entropies are smaller than a threshold to recover the signal. By applying the algorithm to rolling bearing simulation signals and comparing with the result of wavelet transform, the effectiveness of the proposed method can be verified. Finally, the proposed method is applied to the experimental rolling bearing signals. Results demonstrated that the proposed approach has a good application prospect in noise reduction and fault feature extraction. Keywords: dynamic mode decomposition; sparse optimization; non-convex regularization; multiscale permutation entropy; feature extraction

1. Introduction Generally, amplitude-modulated and/or frequency-modulated (AM-FM) multi-component signals collected by sensors are interfered with by background noise, resulting lower signal-to-noise ratio (SNR) [1]. The reason for the low SNR signals obtained by a determinate acquisition system are usually as follows: firstly, when a rolling bearing is in the infantile fault period, the fault feature is inconspicuous and buried in the background noise. Secondly, signal energy as well as incipient failure features will attenuate because of the long distance between the fault source and the sensor, which usually is located on the bearing pedestal. Moreover, the characteristic signal will also be attenuated when there is a long transmission path from the sensor to the acquisition device [2]. Thirdly, the signals collected by sensors contain strong background noise. The main transmission system of a machine is a complex system composed of multiple components such as motors, couplings, multiple

Entropy 2018, 20, 152; doi:10.3390/e20030152

www.mdpi.com/journal/entropy

Entropy 2018, 20, 152

2 of 19

reduction gearbox. The vibration of all components will be collected by the sensor. Noisy signals make it difficult to identify the fault features in signal processing, therefore, it is necessary to perform denoising and characterize fault feature accurately on the low SNR vibration signal. In the last few decades, many theoretical denoising methods have been proposed and are widely used in medical signaling, image processing and mechanical engineering signal processing [3–5]. Donolo and John-stone [6], taking the advantage of the spatial diversity of wavelet basis functions, proposed a denoising method with a threshold based on wavelet transform. Their methods concentrate the energy of the ideal signal, benefiting from the effective wavelet, guaranteeing a certain de-noising effect. However, the selection of threshold functions is the main problem with wavelet de-noising algorithms, which usually generate a deviation between the calculated result and the real one. Empirical mode decomposition (EMD) [7] can be treated as a typical adaptive signal decomposition method of frequency-selection and filtering. The signal is decomposed into a series of intrinsic mode functions (IMFs) arranged from high frequency to low frequency. The source signal can be processed by abandoning high-frequency components for the consideration that noise components are generally distributed in the high-frequency domain. However, as EMD has the defects of modal aliasing, the ideal signal and noise coexist in the selected IMFs. This would result in the loss of useful information. Ensemble empirical mode decomposition (EEMD) [8] was developed to solve the mode-mixing problem of EMD by adding noise to the source signal. It is more accurate and effective for data decomposition than EMD, but it prompts new trouble by being time consuming. Variable mode decomposition (VMD) [9,10] is a signal decomposition method based on wiener filtering, one dimensional Hilbert transform and heterodyne demodulation analysis, but the decomposition results are restricted by the selection of penalty parameters and the number of components. Singular value decomposition (SVD) [11], as a data-processing method without prior knowledge, has been successfully applied to signal denoising and proved to be effective. However, there exist two pivotal problems: one is that how to determine the effective rank order after decomposition, and the other one is that how to achieve the reconstruction matrix. Meanwhile, the noise reduction effect of SVD is not prominent when analyzing the low SNR signal. Dynamic mode decomposition (DMD) [12] is an equation-free Koopman frequency analysis technique based on the theory of SVD and mode decomposition. It can extract the spatiotemporal coherent characteristics by decomposing a dynamic model into a series of single-frequency non-orthogonal modes based on its inherent space-time. The reconstructed signal can characterize the useful information of the original signal perfectly by superposing the filtered modes according to time coefficients. The algorithm is suitable for extracting dynamic information of nonlinear systems [13]. The instantaneous components and high-frequency noise components can be effectively removed, due to the dynamic delay processing in the algorithm [12]. The DMD algorithm has been well applied in the fluid mechanics community. Rowley [12] applied DMD in the jet cross flow field. The results show that DMD theory can extract the shedding frequency while the linear stability analysis method cannot. In addition, DMD can separate the shear layer and wall vortices, whereas the proper orthogonal decomposition (POD) algorithm is ineffective. Schmidt [13] applied DMD to the flow field in a variety of environmental conditions, indicating that the DMD theory is effective for feature extraction with numerical simulation and measured velocity field data. Moreover, DMD can separate video frames into a background and multiple foregrounds successfully when applied to video processing [14]. DMD can also be used for biometrics to detect fraudulent samples [15]. In the field of robotics and neuroscience, DMD has been used to estimate the perturbation of human-robot interactions [16], and the coherent modes in large-scale neural recording have been extracted [17]. Nowadays, theoretical research with DMD mainly focuses on two aspects. In the first aspect, a variety of improved methods are proposed based on the standard DMD to solve the problem that DMD cannot accurately extract the dynamic information due to the influence of signal noise. These strategies range from noise-corrected dynamic mode decomposition with control [18], total least-squares DMD [19], compressed sensing DMD [20] to sparsity-promoting DMD [21]. The improved DMD algorithm can extract fluid dynamic information

Entropy 2018, 20, 152

3 of 19

more accurately in varying degrees. In the second aspect, in view of the large data processing problems, researchers aim at reducing the computational complexity and the program storage space in either pre-processing or post-processing, using strategies including random DMD [22], random low-rank DMD [23], and tensor-based DMD [24]. The standard DMD and its improved algorithm are more suitable in analyzing periodic or quasi-periodic signals, but it is prone to generate some false mode components. Therefore, the theory of fault vibration signal processing and fault mode recognition with DMD needs further study. Essentially, DMD is a hybrid algorithm based on mode decomposition and SVD, and it inevitably inherits the drawbacks of these two algorithms, including the selection strategy of truncated rank order and DMD modes. The number of truncation rank for the similar matrix needs to be artificially and blindly selected in advance; whether the number value is appropriate or not is of crucial importance for accurately representing the dynamic information of reconstructed signal. The measured vibration signal with the characteristic of non-linear and non-stationary is coupled with multi-components and contain strong environmental noise. To achieve the purpose of fault feature extraction with DMD, it requires separation of the fault feature components effectively from the DMD modes. It is necessary to establish an adaptive optimization algorithm based on DMD to achieve an accurate truncation of rank and screen the single-frequency non-orthogonal modes effectively. In this paper, an optimized DMD algorithm is proposed for extracting the fault features drowned in the multi-component coupled and noisy mechanical signal. The idea of sparse optimization is adopted to select the optimal rank for the solution of truncated rank order selection problem. By introducing the constraint of non-convex regulation to the rank function, the similar matrix has low rank attribute, making the ideal components (signals) more centralized. As for the fact that the noise’s multiscale permutation entropy (MPE) is larger than the ideal component’s, the low rank modes are selected corresponding to faults and inherent features by a threshold, solving the difficulty in selecting useful DMD modes. Then the signal is reconstructed by the low rank DMD modes, and Fourier transform (FT) is conducted to achieve the purpose of denoising and fault diagnosis of the noisy mechanical signal. The structure of this paper is as follows. Section 2 introduces the basic theory, including the standard algorithm of DMD, the sparse optimization algorithm with non-convex optimization regulation, and MPE. In Section 3, the proposed algorithm is applied to the normal and faulty bearing noisy simulation signals to verify its effectiveness. Analysis results of two experimental bearing fault signals are described in Section 4. Conclusions are summarized in Section 5. 2. Methodology In this part, the standard DMD algorithm is introduced based on time series in fault diagnosis. To solve the problem of selecting the optimal rank, a technique based on sparse optimization is proposed. The details are described as in Section 2.2. Then, we introduce the basic algorithm of MPE, and show the basic flow chart in the form of illustration. 2.1. DMD with Time Series Signal Supposing that a signal S is acquired by numerical simulation or a sensor with equal intervals between two successive sampling points, S = ( x1 , x2 . . . , xi , . . . x N ), xi ∈ R, while ∆t = xi+1 − xi . Hankel matrix X can be obtained by a sliding window over the corresponding vector S.

Entropy 2018, 20, 152

4 of 19

     X=   

x1 x2 ··· xi ··· xm

x2 x3

··· ···

x i +1

···

x m +1

···

xn



x n +1      = [ X1 , X2 , · · · , X n ] , X ∈ R m × n x n + i −1   

(1)

xN

where Xi = ( xi , xi+1 . . . xs+i−1 ) T , m + n − 1 = N. Decomposing the Hankel matrix into two shift-stack matrices as in the form of Equation (2). Xt = [ X1 , X2 , · · · , Xn−1 ]; Xt+1 = [ X2 , X3 , · · · , Xn ]; Xt , Xt+1 ∈ Rm×(n−1)

(2)

Assume that the two continuous sequence matrices satisfy a mapping relation with optimal linear operator. DMD algorithm utilizes a low rank expression of matrix A to capture the potential dynamic characteristics. Xt+1 = AXt (3) A can be also regarded as an operator to approximate the dynamic characteristics when the data are produced in nonlinear systems [13]. Matrix A usually contains many complex entries if S is presented in high-dimensional space. The DMD algorithm finds an optimal similarity matrix F ∈ Cr×r by projecting the matrix A with its r-th order eigenvectors according to POD. A = UFU∗

(4)

where U∗ is the complex conjugate transpose of POD mode U, and U is obtained by SVD of matrix Xt . Xt ≈ UΣV∗

(5)

where U ∈ Cm×r , V ∈ Rr×(n−1) are known as the left and right eigenvectors respectively, U∗ U = I, V∗ V = I, The symbol ∗ denotes the complex conjugate transpose. Σr ∈ Rr×r is a diagonal matrix containing r non-zero eigenvalues {σ1 , σ2 , · · · σr } on its diagonal. The matrix F is determined by minimizing the Frobenius norm of the error between Xt+1 and AXt . Ψ = argminkXt+1 − UFΣV∗ k2F

(6)

F

Optimal solution of the above formula can be got directly. FDMD = U∗ Xt+1 VΣ−1

(7)

Equation (7) was proposed by Schmid [13], while the literature also introduced the application of DMD algorithm. Then a matrix FDMD ∈ Rr×r with lower dimension is obtained. After the similar transformation above, FDMD and A are almost equivalent with each other. Hence, the eigenvectors and eigenvalues of A are replaced by the eigenvectors and eigenvalues of FDMD . Matrix FDMD optimizes the low-dimensional approximation of the internal data mapping matrix A by the POD subspace modes of Xt . The dynamic information of the r-dimensional subspace is expressed as: Xt+1 = FDMD Xt (8)

Entropy 2018, 20, 152

5 of 19

Perform eigenvalue decomposition on the similarity matrix FDMD : FDMD = WΛW−1

(9)

where W = [ω1 , ω2 , · · · , ωr ] ∈ Rr×r are the eigenvectors of similarity matrix FDMD , and Λ = diag([λ1 , λ2 , · · · , λr ]) ∈ Rr×r is a diagonal matrix containing the corresponding complex eigenvalue λi . The evolution of the signal characteristics can be characterized by similarity matrix. In addition, the i-th eigenvector of original operator is presented by relevant feature vector of similarity matrix. φi = Xt+1 Vi Σi−1 Wi

(10)

Equation (10), the project DMD approximate solution, is often called standard DMD mode. By firstly rewriting for convenience vi = ln(λi )/∆t, the approximate solution of reconstruction matrix of DMD is then given by: r

XDMD =

∑ φi exp(vi ∆t)bi = Φ exp(Ωt)b, XDMD ∈ Rm×n

(11)

i =1

where Φ is a matrix consisted of DMD modes φi , Ω = diag(λi ) is a diagonal matrix whose entries are continuous-time eigenvalues vi of the similarity matrix FDMD , b is a vector containing the initial amplitude of each mode, b = ΦΓ xt , Γ denotes the Moore-Penrose pseudoinverse. 2.2. Sparse Optimized DMD via Non-Convex Regularization As described in Section 2.1, r-th non-zero eigenvalues of the similarity matrix FDMD are retained to reconstruct the analytical signal, and r is the truncation rank order for the SVD of Xt . In the process of standard DMD, the rank of r needs to be manually preset. It plays a crucial role in accurately expressing the dynamic features of the original signal. If r is too large, it may reconstruct some unnecessary sparse components, such as noise, which cannot play a good denoising effect. Moreover, it increases the storage space and decreases the calculating efficiency. On the contrary, if r is too small, it will inevitably filter out useful low-rank components that symbolize the system dynamic information, failing to extract the signal physical characteristics. The difficulty lies in the determination of optimized truncation rank r. Recently, many sparse optimization algorithms have been proposed in the fields of signal and image processing [16], compressive sensing [17], machine learning [25], and data mining [26] for consideration that the optimization variables have some sparse structures. Sparse optimization makes it possible to reconstruct high-dimensional signals and extract potential information from a small amount of data. Meanwhile, sparse optimization can greatly accelerate the calculation speed in large-scale optimization problems. Based on DMD theory, this paper introduces a parametric sparse optimization framework for the optimal low-rank approximation of Xt . If the shift-stack Hankel matrix Xt take the form of ∼

Xt = Xt + Nt

(12)



where Xt is the ideal clean matrix containing all the dynamical characteristics of the vibration system. Nt is a random noisy matrix. A typical sparse optimization framework with minimal low rank matrix approximation is defined as follows. ∼ 2 1 X∗ = argmin{ kXt − Xt k F + λΠ} 2 F

where λ > 0 is a regularization coefficient, Π is a sparse regularization.

(13)

Entropy 2018, 20, 152

6 of 19

The above minimum low-rank approximation problem is transformed into a standard problem k

of minimizing nuclear penalty when Π = ∑ |σi (Xt )|. Theoretically, the nuclear norm optimization i =1

framework is convex, and it can be directly solved by taking traditional SVD on the shift-stack Hankel matrix Xt . However, the solution of convex norm-based nuclear norm optimization often leads to underestimate the rank of the original matrix [27]. Selesnick [28] proposed a sparse optimization method based on non-convex regularization to denoise a noisy signal. The calculation process can guarantee the strict convexity of Equation (13) and generates a unique solution. Inspired by the idea of [28,29], this paper adopts a sparse framework of non-convex optimization to obtain the best truncated rank of the shift-stack matrix. Non-convex optimization can approximate the non-zero singular value of the calculation matrix more accurately than nuclear norm optimization framework. Moreover, the non-convex method is more robust to strong background noise then nuclear norm optimization. Let us define Π =

k

∑ φ(σi ( Xt ); α), φ : R → R corresponding to the non-convex sparse

i =1

regularization of sparse optimization. Non-convex penalty function must meet the following five properties: (1) (2) (3) (4) (5)

φ is symmetric and continuous in R space. i.e., φ(− x; a) = φ( x; a), which is continuous derivable in R; φ0 ( x; a) > 0, ∀ x > 0; φ00 ( x; a) ≤ 0, ∀ x > 0; φ0 (0+ ; a) = 1; inf φ00 ( x; a) = φ00 (0+ ; a) = − a; x 6 =0

Chen [30] listed various penalty functions satisfying the above assumption in Table II. Here we employ the partly quadratic penalty function [27], defined as: ( φ( x; a) =

| x | − 2a x2 , | x | < 1 |x| ≥ 2a ,

1 a 1 a

(14)

where α is a non-convex penalty parameter and controls the degree of non-convexity of φ, satisfying 0 ≤ α ≤ λ1 . Finally, the sparse optimization objective function based on non-convex regularization is defined as Equation (15). The optimal truncation rank r of singular value which should be determined in the process of DMD algorithm is obtained by the non-convex optimization framework. k ∼ 2 1 X∗ = argmin{ kXt − Xt k F + λ ∑ φ(σi (Xt ); α)} 2 F i =1

(15)

2.3. Optimized DMD Modes via MPE The result of the DMD algorithm obtains a series of single-frequency modes that represent the dynamics of the original system. DMD modes are similar to the IMFs obtained by EMD, while each DMD mode corresponds to a single frequency. It can also be regarded as the vibration modes in mechanical vibration. All DMD modes express the inherent characteristics, fault information and noise background components (introduced by the shift-stack matrix Xt+1 ) of the original system information, making it difficult to diagnose mechanical fault features effectively. Approximate entropy (AE) [31] reflects the time series’ complexity by calculating the approximate degree of two points in the given dimension and the probability of new mode when the dimension changes. AE has the characteristics of stable calculation results with short data sequence. Sample entropy [32,33] is an improved algorithm, and it performs better than AE to maintain the consistency of the calculation results. Permutation entropy (PE) [33] is an algorithm based on time series phase space reconstruction

Entropy 2018, 20, 152

7 of 19

and neighborhood value comparison, which can effectively describe the complexity of the system. PE has the advantages of simple algorithm, fast computing speed, and reliable calculation results [34]. MPE is an improved algorithm based on PE. In addition, it is especially suitable for nonlinear data. In this paper, MPE threshold is used to identify and remove the modes of noise components. The remaining low-rank components are reconstructed by DMD to achieve the purpose of noise reduction and fault feature extraction for noisy fault signals. The basic idea is that it first gets the time series coarse-grained in time-scale, then calculates PE on the coarse-grained data. The process of MPE is briefly described here. For more detailed information about MPE, it can be referred to in [33,35]. First, a given time series { xk , k = 1, 2, · · · , N } is coarse-grained. js

x sj =

N 1 ∑ xk (1 ≤ j ≤ [ s ]) s i=( j− 1) s +1

(16)

where S is the scale factor, S = 1, 2, . . .,h when S = 1, the coarse-grained sequence is the original i sequence, and the result of MPE is PE. Ns means rounding Ns . If S ≥ 2, the original sequence is h i coarse-grained with the length of Ns . Secondly, the phase space reconstruction is carried on the coarse-grained time series. xid = [ x(i) , x(i+τ ) , · · · , x(i+(m−1)τ ) ]

(17)

where i = 1, 2, · · · , N. d ≥ 2 is the embedding dimension; τ is the delay time; Each row vector in the reconstructed phase space matrix is a reconstruction vector, and there are d reconstruction vectors in total. Considering every two data in the form of ascending or descending order, the j-th reconstruction vector has d! possible ordinal patterns. Finally, the MPE value H is defined as: H ( N, d, τ ) = −

1 d! pi p ln( i ) ∑ ln(d!) i=1 N N

(18)

where pi represents the frequency of a particular sequence pattern in all reconstruction vectors. Figure 1 shows the calculation process of pi with a few capacity sample data sequences. The regular parameter ln(d!) ensures that the maximum value of H ( N, d, τ ) is 1. MPE is a measure of system complexity. The larger the entropy, the more complex the system components, and the closer the system is to the stochastic signal. Conversely, the smaller the entropy value, the simpler the system components appears. For the fixed-length analysis sequence, the result of MPE dependent on the scale factor s and the embedding dimension d. Generally, if d is too small, the reconstructed vector contains little information, the algorithm loses its validity and cannot detect the kinetic mutation of the sequence. If d is too large, the reconstruction of phase will homogenize the time series, which is not only time-consuming but also unable to reflect the small changes of the sequence. Bandt [33] and Zunino [34] suggested that d should be in the range 3–7. Time delay τ has little effect on the MPE of time series [35,36], and in most literatures [36–38], authors employ τ = 1. In this paper, we employ s = 5, d = 6, τ = 1 in the calculation of MPE. In [36,38,39], the length N of time series are suggested as N > 5d!, thus the sample points of the simulation signal should be larger than 3600. Ultimately, the flowchart of the algorithm based on DMD via non-convex regularization and MPE is shown in Figure 2.

Entropy 2018, 20, 152 Entropy 2018, 20, x

8 of 19 8 of 19

0.25

(a)

0.1

-0.1 0 0.2

(b)

5

10

15

20

25

30

0.1 0

S =3

1 0.2

(c)

2

3

4

5

6

7

8

9

10

0.1

d = 3,τ = 1

0 1

2

3

4

5

6

7

8

(d) 1 0.8 0.6

(e)

0.4 0.2 0 0

1

2

3

4

5

6

Figure1.1. An An example example for the schedule of calculating withMPE. MPE.(a) (a)noisy noisytime timeseries serieswith withlength length Figure calculating ppi iwith of coarse-grained time series within of NN==30; 30(b) ; (b) coarse-grained time series withinnon-overlapping non-overlappingtime timeslice slicereferred referredtotoas asthe thescale scale factor data segments with embedded dimension d = 3d and = 1; (d) factor s s==3;3(c) ; (c) data segments with embedded dimension = 3time anddelay timeτdelay τ =ordinal 1 ; (d) patterns accordingaccording to the data (e) The frequency of each particular ordinal patterns all the ordinal patterns to segments; the data segments; (e) The frequency of each particular ordinalofpatterns data segments. of all Entropy 2018, 20, the x data segments. 9 of 19

MPE is a measure of system complexity. The larger the entropy, the more complex the system components, and the closer the system is to the stochastic signal. Conversely, the smaller the entropy value, the simpler the system components appears. For the fixed-length analysis sequence, the result of MPE dependent on the scale factor s and the embedding dimension d . Generally, if d is too small, the reconstructed vector contains little information, the algorithm loses its validity and cannot detect the kinetic mutation of the sequence. If d is too large, the reconstruction of phase will homogenize the time series, which is not only time-consuming but also unable to reflect the small changes of the sequence. Bandt [33] and Zunino [34] suggested that d should be in the range 3–7. Time delay τ has littleFigure effect2.on the chart MPEofofthe time seriesalgorithm. [35,36], and in most literatures [36–38], A flow proposed Figure 2. A flow chart of the proposed algorithm.

authors employ τ = 1 . In this paper, we employ s = 5, d = 6, τ = 1 in the calculation of MPE. In 3. Algorithm Application in Fault Bearing Simulation Signal theApplication length N of areSimulation suggested Signal as N > 5d ! , thus the sample points of the 3.[36,38,39], Algorithm in time Faultseries Bearing Here we adopt the dynamic model of bearing fault signal established by Randall [40]. The simulation signal should be larger than 3600. Here we adopt the dynamic model of bearing fault signal established by Randall [40]. mathematical modeltheofflowchart fault rolling is described in Equation (19). According to theand Ultimately, of thebearing algorithm based on DMD via non-convex regularization The mathematical model of fault rolling bearing is described in Equation (19). According to the characteristics of bearing structure, instantaneous impact signal should be generated when bearing MPE is shown in Figure 2. characteristics of bearing structure, instantaneous impact signal should be generated when bearing parts pass through the fault location. The impact motivates the bearing system oscillating by its parts pass through the fault location. The impact motivates the bearing system oscillating by its natural natural frequency with high-frequency attenuation. Define T as a period of two intervals, S (t ) as frequency with high-frequency attenuation. Define T as a period of two intervals, S(t) as the natural Ak as thefrequency natural frequency oscillation the i -thofamplitude of shock response. The of oscillation function,function, and Ak asand the i-th amplitude shock response. The interference additive noise nt (zeronoise mean nadditive noise) should benoise) taken should into consideration in the interference of additive (zero background mean additive background be taken into t

consideration in the simulation signal in view of poor working environment of rolling bearings. Therefore, the mathematical model of fault rolling bearing can be described as: M  = f ( t ) Ak S (t − kT − τ k ) + n(t )  

Entropy 2018, 20, 152

9 of 19

simulation signal in view of poor working environment of rolling bearings. Therefore, the mathematical model of fault rolling bearing can be described as: M

   

f (t) = ∑ Ak S(t − kT − τk ) + n(t)

  

Ak = ak cos(2π f m t + ϕ A ) + c A S(t − kT − τk ) = exp[− B(t − kT − τk )· sin[2π f n (t − kT − τk ) + γ]

k =1

(19)

where ak is the k-th impact energy; γ and ϕ A are the initial phases; f m is the modulation frequency; B is the attenuation coefficient depended on the bearing system; τk is the time lag from its mean period due to the presence of tiny fluctuation, and c A is a random constant. Time domain simulation signals of rolling bearing with normal bearing and inner race, outer race and roller faults are respectively plotted in Figure 3. It should be pointed out that Gaussian white noise is added so as to SNR reach to 0.5 precisely. The parameters selection of Equation (19) is listed in Table 1, where f r and f c are denoted as rotational frequency and cage frequency. f i , f o , f b respectively represent the failure frequency of the inner circle, the outer ring, and the rolling element. The sampling frequency is selected as 5 KHz, and the number of total sampling points is 8000. Table 1. The parameters selection with simulation signals of rolling bearing. ak Entropy 2018, 20, 3.5x

γ

ϕA

B

τk

cA

fr (Hz)

fi (Hz)

fo (Hz)

fb (Hz)

fc (Hz)

0

0

750

0.01

1

25

180

130

80

12

10 of 19

2

(a )

0 -2

Amplitude

2

0

0.2

0.4

0.6

0.8

1

1.2

1.4

1.6

(b)

0 -2 2

0

0.2

0.4

0.6

0.8

1

1.2

1.4

1.6

(c )

0 -2

0

0.2

0.4

0.6

0.8

1

1.2

1.4

1.6

2 0 -2

(d ) 0

0.2

0.4

0.6

0.8

1

1.2

1.4

1.6

Time(s) Figure 3. Time normal Figure 3. Time domain domain simulation simulation signals signals of of rolling rolling bear bear including including noise noise with with SNR SNR 0.5 0.5 (a) (a) normal bearing; (b) inner fault; (c) outer fault; (d) cage fault.

The FT spectrum analysis of the bearing noisy signal is shown in Figure 4. There are a large The FT spectrum analysis of the bearing noisy signal is shown in Figure 4. There are a large number number of noisy spectral components, as shown by the blue dotted boxes in Figure 4a–d, which of noisy spectral components, as shown by the blue dotted boxes in Figure 4a–d, which severely affect severely affect the feature identification of the fault. Rotational frequency and its resonance the feature identification of the fault. Rotational frequency and its resonance frequencies (as marked in frequencies (as marked in blue circles in Figure 3a) should be easy to recognize in the spectrum blue circles in Figure 3a) should be easy to recognize in the spectrum domain when the bearing is under domain when the bearing is under healthy conditions. However, numbers of noise frequencies healthy conditions. However, numbers of noise frequencies revolve around the center frequencies, revolve around the center frequencies, as marked in the brown dotted box in Figure 3a, resulting in as marked in the brown dotted box in Figure 3a, resulting in the fact that we cannot interpret the the fact that we cannot interpret the signal properly. The same situation appears in the bearing inner signal properly. The same situation appears in the bearing inner race fault signal spectrum, as shown race fault signal spectrum, as shown in Figure 3c. In general, the three common types of faults on the in Figure 3c. In general, the three common types of faults on the inner race, the outer race and the inner the outer race and theasrolling can be modeled as f = f , f = 0, f m = f c , rollingrace, elements can be modeled f m = felements r , f m = 0, f m = f c , respectively.mThatr is tomsay, rotational respectively. That is to say, rotational frequency, 0, cage frequency should modulate their center frequency, respectively. In Figure 3b,d, the center frequency belongs to the inner race and balls can be identified, but it is difficult to identify their respective modulation frequency as they are submerged by the noise components.

domain when the bearing is under healthy conditions. However, numbers of noise frequencies revolve around the center frequencies, as marked in the brown dotted box in Figure 3a, resulting in the fact that we cannot interpret the signal properly. The same situation appears in the bearing inner race fault signal spectrum, as shown in Figure 3c. In general, the three common types of faults on the Entropy 2018, 20, 152 inner race, the outer race and the rolling elements can be modeled as f m = f r , f m = 0, f m10=off c19,

respectively. That is to say, rotational frequency, 0, cage frequency should modulate their center frequency, respectively. In Figure 3b,d, the center frequency belongs to respectively. the inner raceInand balls3b,d, can 0, cage frequency should modulate their center frequency, Figure be but it belongs is difficult identify theirballs respective modulation astothey are the identified, center frequency to thetoinner race and can be identified, butfrequency it is difficult identify submerged by the noise components. their respective modulation frequency as they are submerged by the noise components. 0.01

(a )

0.005 0

0

50

100

150

200

250

300

350

400

450

500

Amplitude

0.02

(b )

0.01 0

0

50

100

150

200

250

300

350

400

450

500

(c )

0.02 0.01 0 0.02

0

50

100

150

200

250

300

350

400

450

500

0.01 0

(d ) 0

50

100

150

200

250

300

Frequency(Hz)

350

400

450

500

Figure 4. Entropy 2018, 20, 11 of 19 Figure 4. xFrequency Frequency spectrum spectrum of of rolling rolling bear bear simulation simulation signals signals with with noisy noisy (a) (a) normal normal bearing; bearing; (b) (b) inner inner fault; fault; (c) (c) outer outer fault; fault; (d) (d) cage cage fault. fault.

In this paper, the parameters

λ, a

are selected as 1.2, 0.5 respectively in the process of sparse

optimization usingthe non-convex optimization regularization. MPE threshold is selected as 0.7 In this paper, parameters λ, a are selected as 1.2, 0.5The respectively in the process of sparse (refer to the conclusion section).optimization We recover the signals by DMD modes whoseisMPE values are(refer less optimization using non-convex regularization. The MPE threshold selected as 0.7 than threshold. Spectrums were obtained FTwhose on the reconstruction signals, as to thethe conclusion section). We recover the signalsby byperforming DMD modes MPE values are less than the shown in Figure 5. were obtained by performing FT on the reconstruction signals, as shown in Figure 5. threshold. Spectrums 0.01

fr

2 fr

3 fr

19 f r



(a )

0.005 0

Amplitude

0.04

0

0.02 0 0.04

50

100

150

fr

0

fi − f r

50

100

150

200 fi

250

300

350

400

450

500

2 fi 2 fi − f r

fi + f r

200

fo

250

300

350

2 fo

(b )

2 fi + f r

400

450

500

3 fo

(c )

0.02 0

0

50

0.02

fb

0.01 0

100 fb + fc

0

50

100

150

200

250

300

3 fb 2 fb 2 fb − f c 2 fb + f c 3 fb − f c 3 fb + f c

150

200

250

350

400

300

350

500 6 fb



Frequency(Hz)

450

6 fb -f c 6 f b +f c (d )

400

450

500

Figure 5. Frequency spectrum of the noisy signals provided by the proposed algorithm. (a) Normal Figure 5. Frequency spectrum of the noisy signals provided by the proposed algorithm. (a) Normal bearing; bearing; (b) (b) inner inner fault; fault; (c) (c) outer outer fault; fault; (d) (d) cage cage fault. fault.

Meanwhile, in order to illustrate the effectiveness of our method, multi-scale wavelet transform Meanwhile, in order to illustrate the effectiveness of our method, multi-scale wavelet transform is is applied on the four noisy signals by taking the wavelet function db1, WT results are shown in applied on the four noisy signals by taking the wavelet function db1, WT results are shown in Figure 6. Figure 6. Though it is obvious that the harmonic frequencies of fr and fo can be identified in Though it is obvious that the harmonic frequencies of f r and f o can be identified in Figure 6a,c, there are Figure 6a,c, there areattributed still somebyfrequencies attributed by the involving around midstill some frequencies the noise involving around thenoise mid-frequencies, as can the we see in frequencies, as can we see in the brown dotted box. Meanwhile the spectra of the proposed algorithm are cleaner and more readable then the results of FT and WT. Not only can we pick out their center and modulation frequencies in Figure 5, but also there is few noise frequencies surrounding them. Our method has tremendous advantages in both de-noising and fault feature extraction compared with FT and WT.

bearing; (b) inner fault; (c) outer fault; (d) cage fault.

Meanwhile, in order to illustrate the effectiveness of our method, multi-scale wavelet transform is applied on the four noisy signals by taking the wavelet function db1, WT results are shown in Figure 6. Though it is obvious that the harmonic frequencies of fr and fo can be identified in

Entropy 2018, 20, 152

11 of 19

Figure 6a,c, there are still some frequencies attributed by the noise involving around the midfrequencies, as can we see in the brown dotted box. Meanwhile the spectra of the proposed algorithm the box. readable Meanwhile thethe spectra of of theFT proposed algorithm cleaner andout more readable are brown cleanerdotted and more then results and WT. Not onlyare can we pick their center then the results of FT and WT. Not only can also we pick theirnoise center and modulation frequencies and modulation frequencies in Figure 5, but thereout is few frequencies surrounding them. in Figure 5, but there is few noise frequencies surrounding them. method has tremendous Our method hasalso tremendous advantages in both de-noising and faultOur feature extraction compared advantages both de-noising and fault feature extraction compared with FT and WT. with FT andinWT. x 10

fr

5 0

-3

0

2 fr

50

3 fr

19 f r



100

150

200

250

(a ) 300

350

Amplitude

0.02 2 fi − f r

0.01 0

0

50

100

0.04

150

200

fo

250

300

350

2 fo

400

450

500

2 fi 2 fi + fr

400

(b ) 450

500

3 fo

(c )

0.02 0

0

50

0.02 0.01 0

fb − fc

0

50

100

150

200

250

300

3 fb 2 fb fb f + f 2 f − f 2 fb + fc 3 fb − f c 3 fb + fc b c b c

100

150

200

250

350

400

350

500

6 fb 6 fb -f c 6 f b +f c



300

450

400

450

(d )

500

Frequency(Hz) Figure 6. 6. Frequency Frequencyspectrum spectrumofofthe the noisy signals provided by WT. (a) Normal bearing; (b) inner Figure noisy signals provided by WT. (a) Normal bearing; (b) inner fault; fault; (c) fault; outer (d) fault; (d)fault. cage fault. (c) outer cage

4. Algorithm Application in Experimental Signals of Rolling Bear Signals of bearing systems collected by sensors from practice spots are much more complex than the simulation signals, which contain multi-component signals and background noises transmitted by other components. In order to verify the effectiveness of the proposed algorithm in signal de-noising and fault diagnosis, we apply the proposed algorithm to the actual bearing signals. We test the bearing signals from two experimental environments. First, the bearing signal is from the published bearing test data at the university of Cincinnati [41]; the second comes from our rotating machinery vibration analysis test rig. The Intelligent Maintenance Systems of Cincinnati is shown in Figure 7. Four Rexnord ZA-2115 double row bearings were installed on a shaft, which was driven by an AC motor at the speed of 2000 RPM via sever rub belts. Each row of the bearing has Z = 16 roller elements. The circle diameter of the bearing bitch and roller elements are respectively D = 28.15 mm, d = 3.31 mm, and the bearing’s contact angle is α = 15.17◦ . They utilized high sensitivity quartz ICP accelerometers (PCB 353B33), installed on the bearing housing, collecting the vibration signals and NI DAQ Card 6062E collecting the signals. We take their experimental data of fourth channel in Set No. 3 to verify our method. The total number of the time series data is 20,480, and the sampling frequency is 20 KHz. Finally, the bearing failed because of outer race fault. The rotation frequency and outer ring fault frequency can be respectively calculated as f r = 33.33 Hz, f o = 236.17 Hz by formulas [42]. The time domain and the frequency domain of the selected signals are shown in Figure 8. Obviously, it is hard to identify whether there is a bearing fault from Figure 8b for such large amounts of interference frequency presented in the frequency spectrum. Then we apply our algorithm on the chosen signal with chosen parameters λ = 1.2, a = 0.5, s = 5, d = 6, τ = 1. The truncation rank is calculated as 484 by applying non-convex optimization regularization on the rank function. The same number of single-frequency DMD modes are obtained correspondingly. The MPE values corresponding to each DMD modes are shown

collecting the signals. machinery vibration analysis test rig. The Intelligent Maintenance Systems of Cincinnati is shown in Figure 7. Four Rexnord ZA-2115 double row bearings were installed on a shaft, which was driven by an AC motor at the speed of 2000 RPM via sever rub belts. Each row of the bearing has Z = 16 roller elements. The circle diameter of Entropy 2018, 20, 152 12 of 19 the bearing bitch and roller elements are respectively D = 28.15mm , d = 3.31mm , and the

bearing’s contact angle is α = 15.17 . They utilized high sensitivity quartz ICP accelerometers (PCB in Figureinstalled 9. We recover the wanted signals by DMD whose MPEand values are less than 0.7. 353B33), on the bearing housing, collecting themodes vibration signals NI DAQ Card 6062E Spectrums are obtained by performing FT on the reconstruction signal, as shown in Figure 10. collecting the signals.

(b)

(a)

Figure 7. The Intelligent Maintenance Systems of Cincinnati: (a) the schematic diagram of the bearing test rig; (b) spot photo of the bearing test rig.

We take their experimental data of fourth channel in Set No. 3 to verify our method. The total number of the time series data is 20480, and the sampling frequency is 20 KHz. Finally, the bearing failed because of outer race fault. The rotation frequency and outer ring fault frequency can be respectively calculated as f r = 33.33Hz, f o = 236.17Hz by formulas (b) [42]. (a) The time domain and the frequency domain of the selected signals are shown in Figure 8. Figure Intelligent Maintenance (a) diagram of bearing Figure 7. 7. The The Intelligent Maintenance Systems of Cincinnati: (a) the the schematic schematic diagram of the the bearing Obviously, it is hard to identify whetherSystems there isof a Cincinnati: bearing fault from Figure 8b for such large amounts test rig; (b) spot photo of the bearing test rig. test rig; (b) frequency spot photo of the bearing of interference presented in test the rig. frequency spectrum. We take their experimental data of fourth channel in Set No. 3 to verify our method. The total number of the time series data is 20480, and the sampling frequency is 20 KHz. Finally, the bearing ) V ( ) failed and outer ring fault frequency can be e because of outer race fault. The rotation frequency V d 0.3

4.5

0.2

u t i l p m A

x 10

-3

4

3.5

0.1

respectively calculated as 0

( e d u t i l p m A

fr = 33.33Hz, fo = 236.17Hz

Entropy 2018, 20, x -0.1

3

by formulas [42].

2.5

13 of 19

2

The time domain and the frequency domain of the selected signals are shown in Figure 8. Obviously, it is hard to identify whether there is a bearing fault from Figure 8b for such large amounts Then we apply our algorithm on the chosen signal with chosen parameters of interference frequency presented in the frequency spectrum. λ = 1.2, a = 0.5, s = 5, d = 6, τ = 1. The truncation rank is calculated as 484 by applying nonTime (Sec) Frequency(Hz) convex optimization regularization on the rank function. The same number of single-frequency DMD ) V (a) (b) ( ) e V corresponding to each DMD modes are shown d modes are obtained correspondingly. The MPE values ( u e t d i u l Figure 8. The outer race fault signal of the Intelligent t Maintenance Systems of in p in Figure 9.8.We the fault wanted signals DMD modes whose MPE values are less than 0.7. i Therecover outer race signal of the by Intelligent Maintenance Systems of Cincinnati Cincinnati in time time m Figure l A p m domain and frequency: (a) Time domain; and (b) Frequency domain of FT. A Spectrums are obtained by performing FT on the reconstruction signal, as shown in Figure 10. domain and frequency: (a) Time domain; and (b) Frequency domain of FT. 1.5

-0.2

1

-0.3

-0.4 0.3 0

0.5

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

-3 0 0x 10 4.5

1

50

100

150

200

50

100

150

200

250

300

350

400

450

500

250

300

350

400

450

500

4

0.2

3.5

0.1

3

0

2.5

2

-0.1

1.5

-0.2

1

-0.3 -0.4

1 0

0.1

0.5

0.2

0.3

0.4

0.5

0.95

0.6

0.7

0.8

0.9

0

1

0

Time (Sec)

Frequency(Hz)

(a)

0.9

(b)

0.8 0.75 0.7 0.65

0

50

100

150

200

250

300

350

400

450

Number of DMD modes Figure Figure 9. 9. MPE MPE values values of of DMD DMD modes, modes, the the original original signal is from Cincinnati. 1.2

x 10

-3

fo 1

itude(V)

MPE value

Figure 8. The outer race fault signal of the Intelligent Maintenance Systems of Cincinnati in time 0.85 domain and frequency: (a) Time domain; and (b) Frequency domain of FT.

0.8

0.6

4 fr

500

0.65

0

50

100

150

200

250

300

350

400

450

500

Number of DMD modes Entropy 2018, 20, 152

13 of 19

Figure 9. MPE values of DMD modes, the original signal is from Cincinnati. 1.2

x 10

-3

fo

Amplitude(V)

1

0.8

0.6

4 fr

2 fo

10 f r

8 fr

0.4

0.2

0

0

50

100

150

200

250

300

350

400

450

500

Frequency(Hz) Figure Figure10. 10.The Theresult resultprovided providedby byour ouralgorithm, algorithm,the theoriginal original signal signal is is from from Cincinnati. Cincinnati.

As As shown shown in in Figure Figure 10 10 there there are are aa few few characteristic characteristic frequencies frequencies located located in in the the frequency frequency f spectrogramprovided providedby byour ouralgorithm. algorithm.Outer Outerrace racefailure failure frequency of o and and its its resonant resonant frequency frequency spectrogram frequency 2 f can be easily captured. Also, some resonant frequencies of the rotation frequency f are presented r 2 foo can be easily captured. Also, some resonant frequencies of the rotation frequency fr in the spectrum. However, regrettably, some of the frequencies we cannot account for are stillare in presented the spectrum. However, regrettably, some of thethe frequencies we cannot account system for are Figure 10,inwhich are shown in the red box, revealing that signal of the actual bearing still in Figure Yet 10, which aregot shown the red box, revealing the signal of the actual system is complex. we still the in failure frequency of thethat bearing test signal, whichbearing is in line with is complex. Yet we still got the failure frequency of the bearing test signal, which is in line with experimental results. experimental Wavelet results. packet decomposition and EEMD are applied on the experimental signal, as shown Wavelet and EEMD are applied on transformed the experimental signal, packet, as shown in Entropy 2018,11. 20, xpacket 14 with of 19 in Figure Figure decomposition 11a is the spectrum of the recover signal by wavelet Figure 11. Figure 11a is the spectrum of the recover signal transformed by wavelet packet, with db15 db15 and the default threshold. The optimal amplitude of the noise is set as 0.2 and the ensemble 100 EEMD. Figure spectrum of which the0.2 largest correlation with and the default threshold. Theshows optimal amplitude ofIMF3 the noise is has setwhich as andthe the ensemble sizethe is sizefor is the 100 for the EEMD.11b Figure 11bthe shows the spectrum of IMF3 has largest correlation original signal. A large amount of unexplainable interference frequencies, visible in the spectrogram, with the original signal. A large amount of unexplainable interference frequencies, visible in the seriously affectseriously the recognition the rotation and outer ring frequency. the spectrogram, affect theofrecognition offrequency the rotation frequency andfault outer ring fault Thus, frequency. optimized DMD outperforms wavelet packet decomposition and EEMD in the mode decomposition Thus, the optimized DMD outperforms wavelet packet decomposition and EEMD in the mode of rolling bearing signal. decomposition ofexperimental rolling bearing experimental signal. 4

x 10

-3

8 fr

3 2

2 fo

fo

4 fr

(a )

Amplitude(V)

1 0

1.5

0 x 10

50

100

150

200

250

300

350

400

450

500

-3

fo

8 fr

1

10 f r

(b )

0.5 0

0

50

100

150

200

250

300

350

400

450

500

Frequency(Hz) Figure11. 11.Frequency Frequencyspectrogram spectrogramprovided providedby bywavelet waveletpacket packetdecomposition decompositionand andEEMD. EEMD.(a) (a)Results Results Figure ofwavelet waveletpacket packetdecomposition; decomposition;(b) (b)Frequency Frequencyspectrogram spectrogramof of IMF3 IMF3 decomposed decomposed by by EEMD. EEMD. of

Next, we apply the algorithm on the experimental signal measured by our rotating machinery vibration analysis test rig. The physical structure of the experimental rig is shown in Figure 12. The whole device consists of a variable speed drive motor, shaft, gear box, deflection wheel, bearing and governor. A picture of the sensor in Figure 12 refers to the position where the single channel vibration acceleration sensor (PCB-352C33, NY, USA) is installed. A cylinder roller bearing NU205 with

0

0

50

100

150

200

250

300

350

400

450

500

Frequency(Hz) Figure 11. Frequency spectrogram provided by wavelet packet decomposition and EEMD. (a) results Entropyof2018, 20, 152 wavelet packet decomposition; (b) Frequency spectrogram of IMF3 decomposed by EEMD.

14 of 19

Next, we apply the algorithm on the experimental signal measured by our rotating machinery Next, we apply the algorithm on the experimental signal measured by our rotating machinery vibration analysis test rig. The physical structure of the experimental rig is shown in Figure 12. The vibration analysis test rig. The physical structure of the experimental rig is shown in Figure 12. whole device consists of a variable speed drive motor, shaft, gear box, deflection wheel, bearing and The whole device consists of a variable speed drive motor, shaft, gear box, deflection wheel, bearing governor. A picture of the sensor in Figure 12 refers to the position where the single channel vibration and governor. A picture of the sensor in Figure 12 refers to the position where the single channel acceleration sensor (PCB-352C33, NY, USA) is installed. A cylinder roller bearing NU205 with vibration acceleration sensor (PCB-352C33, NY, USA) is installed. A cylinder roller bearing NU205 artificial defects on the outside surface of the inner race was mounted in the housing. The fault with artificial defects on the outside surface of the inner race was mounted in the housing. The fault bearing physical diagram is shown in Figure 13. The number of rolling elements is = 13, and the bearing physical diagram is shown in Figure 13. The number of rolling elements is Z = 13, and the cylinder roller diameter is = 7.5 mm, pitch diameter of the testing bearing is = 39 mm. We set cylinder roller diameter is d = 7.5 mm, pitch diameter of the testing bearing is D = 39 mm. We set the the motor at the speed of 980RPM. So, rotation frequency and outer ring fault frequency can be motor at the speed of 980 RPM. So, rotation frequency and outer ring fault frequency can be calculated 16.3Hz, f = 126.3Hz , respectively [42]. calculated as f r = 16.3asHz,f r f= i = 126.3 Hz,i respectively [42].

Figure 12. 12. Rotating structure diagram of test device; (b) Figure Rotatingmachinery machineryvibration vibrationanalysis analysistest testrig. rig.(a)(a) Structure diagram of test device; physical object of the device; 1—motor, 2—belt pulley, 3—drive shaft support, 4—coupler, 5—gear (b) physical object of the device; 1—motor, 2—belt pulley, 3—drive shaft support, 4—coupler, 5—gear box, 6—bearing 6—bearingpedestal, pedestal, with replaceable rolling bearing NU205, 7—belt 8—radial force box, with replaceable rolling bearing NU205, 7—belt pulley,pulley, 8—radial force device, device, 9—bearing pedestal, with rolling bearing not replaced. 9—bearing pedestal, with rolling bearing not replaced. Entropy 2018, 20, x 15 of 19

Figure 13. Physical diagram of the fault bearing. Figure 13. Physical diagram of the fault bearing.

A total number of 4000 sample points, shown in Figure 14a in time domain, are taken into A total number of 4000 algorithm. sample points, in Figure 14aisinshown time domain, are14b. taken into analysis analysis with the proposed Theshown frequency domain in Figure with the proposed algorithm. The frequency domain is shown in Figure 14b. As is shown in Figure 14b, rotational frequency and its resonant frequencies are presented in 0.045 f the spectrum diagram. The center frequency of the inner race can also be easily found out, but the 0.04 modulation components are not obvious. Meanwhile,0.035 there are large amounts of interference frequency f f 2 f 3 signal 0.03 test f presented in the frequency spectral. The spectrum of the is not intuitionistic, which makes it 5f 6f 0.025 11 f 12 f difficult to diagnose the fault in view of the technicians. 16 f 3

r

Amplitude(V)

Amplitude(V)

2

1

0

-1

i

r

r

i

r

r

r

4 fr

0.02

0.015

7 fr

r

8 fr

r

13 f r

0.01 -2

0.005 -3

0

0.2

0.4

0.6

0.8

1

1.2

Time (Sec)

(a)

1.4

1.6

1.8

2

0

0

50

100

150

200

250

300

350

400

Frequency(Hz)

(b)

Figure 14. The measured signal in time domain and frequency, the original signal is from our test rig.

Figure 13. Physical diagram of the fault bearing.

A total number of 4000 sample points, shown in Figure 14a in time domain, are taken into proposed algorithm. The frequency domain is shown in Figure 14b. 15 of 19

analysis with Entropy 2018, 20, the 152

0.045

3

fr

0.04 2

Amplitude(V)

Amplitude(V)

0.035 1

0

-1

2 fr 3 f

0.03

fi 5 fr

r

0.025

4 fr

0.02 0.015

fi

6 fr

11 f r

7 fr

8 fr

12 f r

16 f r

13 f r

0.01 -2

0.005 -3

0

0.2

0.4

0.6

0.8

1

1.2

1.4

1.6

1.8

2

0

0

50

100

150

200

250

300

350

400

Frequency(Hz)

Time (Sec)

(a)

(b)

Figure 14. The measured signal in time time domain domain and and frequency, frequency, the original signal is is from from our our test test rig. rig. Time domain; domain; and and (b) (b) Frequency Frequency domain domain of of FT. FT. (a) Time

As is shown inthe Figure 14b, rotational frequency and frequencies aresame presented in the Subsequently, proposed algorithm is applied to its theresonant tested signal with the parameters spectrum diagram. The center frequency of the inner race can also be easily found out, but the λ = 1.2, a = 0.5, s = 5, d = 6, τ = 1. 682 single-frequency DMD modes are obtained by sparse modulation components are not obvious. there large amounts interference optimization algorithm. Figure 15 shows the Meanwhile, MPE values of eachare DMD modes. DMDofmodes whose frequency presented in the frequency spectral. The spectrum of the test signal is not intuitionistic, MPE values are less than 0.82 are reconstructed to form a recovery signal. Spectrums are obtained by which makes difficult to diagnose the faultasinshown view of technicians. performing FTiton the reconstruction signal, in the Figure 16. Subsequently, the proposed algorithm is applied to the tested signal with the same parameters Wavelet packet decomposition and EEMD are also employed on our experimental signal, as λ = 1.2, a = 0.5, s = 5, d = 6, τ = 1 . 682 single-frequency DMD modes are obtained shown in Figure 17. The methods and parameters used in the analysis are consistent with by the sparse signal optimization algorithm. Figure 15 shows the MPE values of each DMD modes. DMD modes whose processing procedure of Cincinnati. Here the spectrum of IMF1 decomposed by EEMD is selected MPE values arethe lesslargest than 0.82 are reconstructed to form asignal, recovery are obtained by because it has correlation with the original as signal. shown Spectrums in Figure 17b. Obviously, performing FT on the reconstruction signal, as shown in Figure 16. Figure 16 is more readable than Figures 13b and 17. The rotational frequency f r and its harmonics decomposition and EEMD arecould also the employed on our experimental signal, as (2 f r , 3Wavelet f r , 4 f r 5 fpacket easily identified. Not only center frequency be pointed out clearly r , · · · ) are shown Figure 17. The methods parameters used in the with the but alsointhe modulation frequencyand of the inner face. That is toanalysis say, weare areconsistent more confident to signal judge processing procedure of Cincinnati. Here the spectrum of IMF1 decomposed by EEMD is selected that the inner race of the testing rolling bearing has encountered failure, which is highly consistent because has the largest correlation with the signal, shown in Figure with the itactual situation. The basic reason theoriginal frequencies areas easy to read is that 17b. thereObviously, are a few

Entropy 2018, 20, x

16 of 19

f r and itsand interference nearby Therefore, algorithm of noise reduction feature Figure 16 is frequencies more readable thanthem. Figures 13b andthe 17.proposed The rotational frequency harmonics extraction is proved toisbe practical and to outperform other modeother decompositions such as wavelet and proved to be practical and toonly outperform mode decompositions 4 fr 5 fr ,  ( 2 f rfeature , 3 f r , extraction ) are easily identified. Not could the center frequency be pointedsuch out packet decomposition and EEMD. as wavelet packet decomposition and EEMD. clearly but also the modulation frequency of the inner face. That is to say, we are more confident to judge that the 0.95 inner race of the testing rolling bearing has encountered failure, which is highly consistent with the actual situation. The basic reason the frequencies are easy to read is that there are 0.9 a few interference frequencies nearby them. Therefore, the proposed algorithm of noise reduction MPE value

0.85 0.8 0.75 0.7 0.65 0.6 0.55

0

100

200

300

400

500

600

Number of DMD modes Figure Figure 15. 15. MPE MPE values values of of DMD DMD modes, modes, the the original original signal signal is is from from our our test test rig. rig. 0.035

fr

0.03

Amplitude(V)

0.025

fi − f r fi

0.02 0.015

2 fr

3f

2 fi − f r

700

0.65 0.6 0.6 0.55 0.55 0 0

100 100

200 200

300 300 Number

400

500

600 600

500 of400 DMD modes Number of DMD modes

Entropy 2018, 20, 152

700 700

16 of 19

Figure 15. MPE values of DMD modes, the original signal is from our test rig. Figure 15. MPE values of DMD modes, the original signal is from our test rig. 0.035 0.035

fr fr

Amplitude(V) Amplitude(V)

0.03 0.03 0.025 0.025

fi − f r f fi − f r fii

0.02 0.02 0.015 0.015

2 fr 2 fr 3 fr 3 fr 4 fr 5 fr 4 fr 5 fr 6 fr 7 fr 6 f r 7 f r 88 ff r

0.01 0.01

2 fi − f r 2 f i − f2r f i 2 fi 2 fi + f r 2 fi + f r

fi + fr r fi + fr

0.005 0.005 0 00 0

50 50

100 100

150 150

200 200

250 250

3 fi 3 f3i f + f i r 3 fi − f r 3 fi − f r 3 fi + f r 300 300

350 350

400 400

Frequency(Hz) Frequency(Hz) Figure signal is is from from our our test test rig. rig. Figure 16. 16. The The result result provided provided by by the the proposed proposed algorithm, algorithm, the the original original signal Figure 16. The result provided by the proposed algorithm, the original signal is from our test rig. 0.06 0.06

fr 2 fr fr 2 fr

Amplitude(V) Amplitude(V)

0.02 0.02 0 00 0 0.03 0.03 0.02 0.02

4 fr 4 fr 50 50

fr fr

100 100

2 fr 4 fr 5 fr 6 f 2 f r 3 f r4 f r 5 f r 6 f rr 3 fr

50 50

2 fi − f r 2 f 2 fi − f r 2 f ii 2 fi + f r 2 fi + f r

fi + fr fi + fr

0.01 0.01 0 00 0

fi fi

fi − fr fi − fr

0.04 0.04

100 100

150 150

fi fi

200 200

250 250

(a ) (a )

3 fi − f r 3 fi − f r 3 fi 3 fi 300 300

350 350

400 400

11 f r 11 f r

150 200 150 200 Frequency(Hz)

(b ) (b ) 250 250

300 300

350 350

400 400

Frequency(Hz)

Figure 17. Frequency spectrogram provided by wavelet packet decomposition and EEMD. (a) Results Figure Figure 17. 17. Frequency Frequency spectrogram spectrogram provided provided by by wavelet wavelet packet packet decomposition decomposition and and EEMD. EEMD. (a) (a) Results Results of wavelet packet decomposition; (b) Frequency spectrogram of IMF1 decomposed by EEMD. of of wavelet wavelet packet decomposition; (b) Frequency spectrogram of IMF1 decomposed by EEMD.

5. Conclusions As a powerful tool for Koopman spectrum analysis technology with the characteristic of equation-free and data driven, DMD decomposes the time series into a series of single-frequency modes without any assumption. Theoretically, dynamic features of the original system including fault features and noise components exist in these DMD modes only if the truncation rank order r is appropriate. The truncated rank order needs to be preset artificially in the standard DMD algorithm, which is a crucial problem. Another puzzle with the application of standard DMD is of how to define a selection strategy to filter the undesired DMD modes in the process of signal reconstruction. By proposing solutions for the selection strategy of truncated rank order and DMD modes, this paper puts forward a novel denoising and feature extraction algorithm for multi-component coupled noisy mechanical signals. Non-convex penalty regularization is adopted to the rank function of the shift-stack matrix, and a series of optimal DMD modes are obtained by this sparse optimization method. MPE is performed as a tool of optimal mode selection. After calculating the MPE of each DMD mode, signal is reconstructed by the modes whose entropies are smaller than a threshold. The proposed algorithm is successfully applied in rolling bearing simulation and experimental signals, demonstrating that it has a good application prospect in noise reduction and fault feature extraction.

Entropy 2018, 20, 152

17 of 19

DMD algorithm has been widely used in the field of fluid mechanics and video processing. This paper presents an optimized DMD algorithm and applies it to mechanical vibration signal. The parameters of non-convex penalty regularization and MPE in our algorithm are proved to be appropriate by repeated verification. The non-convex optimization method is used to extract the dynamic characteristics of the original system more comprehensively than selecting truncation rank with a “elbow” of the singular value. The dynamic characteristics, fault features and noise components of the original signal can be identified by using MPE method. However, for filtering the noise components, entropy threshold needs to be set by multiple attempts in our algorithm. For example, when dealing with the experimental signal of Cincinnati, we firstly set the threshold values of 0.7, 0.75, 0.8, etc., and then observed the effect of the recovery signal through the spectrum diagram respectively. We found that when the threshold value was 0.7, the fault characteristics were more obvious, and the interference frequencies were less. Therefore, 0.7 was selected as the criterion. Research on different vibration signals, various transmission systems and different degrees of fault damage to select a certain MPE threshold is our future work. Acknowledgments: This work was supported by the National Natural Science Foundation of China under Grants No. 51475339, the Natural Science Foundation of Hubei province under Grant No. 2016CFA042. The Applied Basic Research Programs of Wuhan science and technology bureau under Grants No. 2017010201010115. The authors also appreciate the free download of the original bearing failure data and one photo picture provided by the intelligent maintenance system center of University of Cincinnati. Author Contributions: Zhang dang was in charge of contributing the analysis method and the program codes, analyzing the data, and writing the paper. Yong Lv and Yourong Li were in charge of the supervision of this research and helping in revising. Cancan Yi was responsible for the experimental design and data analysis. All authors have read and approved the final manuscript. Conflicts of Interest: The authors declare no conflict of interest.

References 1. 2. 3. 4. 5. 6. 7. 8. 9. 10.

11. 12.

Golafshan, R.; Sanliturk, K.Y. SVD and Hankel matrix based de-noising approach for ball bearing fault detection and its assessment using artificial faults. Mech. Syst. Signal Process. 2016, 70–71, 36–50. [CrossRef] Yi, C.; Lv, Y.; Xiao, H.; Huang, T.; You, G. Multisensor signal denoising based on matching synchrosqueezing wavelet transform for mechanical fault condition assessment. Meas. Sci. Technol. 2018. [CrossRef] AlMahamdy, M.; Riley, H.B. Performance study of different denoising methods for ECG signals. Procedia Comput. Sci. 2014, 37, 325–332. [CrossRef] Schlüter, S.; Sheppard, A.; Brown, K.; Wildenschild, D. Image processing of multiphase images obtained via X-ray microtomography: A review. Water Resour. Res. 2014, 50, 3615–3639. [CrossRef] Rai, A.; Upadhyay, S.H. A review on signal processing techniques utilized in the fault diagnosis of rolling element bearings. Tribol. Int. 2016, 96, 289–306. [CrossRef] Donoho, D.; Johnstone, I. Adapting to Unknown Smoothness via Wavelet Shrinkage. Publ. Am. Stat. Assoc. 1995, 90, 1200–1224. [CrossRef] Lv, Y.; Yuan, R.; Song, G. Multivariate empirical mode decomposition and its application to fault diagnosis of rolling bearing. Mech. Syst. Signal Process. 2016, 81, 219–234. [CrossRef] Yang, Z.X.; Zhong, J.H. A hybrid EEMD-based SampEn and SVD for acoustic signal processing and fault diagnosis. Entropy 2016, 18, 112. [CrossRef] Dragomiretskiy, K.; Zosso, D. Variational Mode Decomposition. IEEE Trans. Signal Process. 2014, 62, 531–544. [CrossRef] Wang, X.B.; Yang, Z.X.; Yan, X.A. Novel particle swarm optimization-based variational mode decomposition method for the fault diagnosis of complex rotating machinery. IEEE/ASME Trans. Mechatron. 2018, 23, 68–79. [CrossRef] Lv, Y.; He, B.; Yi, C.; Dang, Z. A novel scheme on multi-channel mechanical fault signal diagnosis based on augmented quaternion singular spectrum analysis. J. Vibroeng. 2017, 19, 955–966. Rowley, C.W.; Mezic, I.; Bagheri, S.; Schlatter, P.; Henningson, D. Spectral analysis of nonlinear flows. J. Fluid Mech. 2009, 641, 115–127. [CrossRef]

Entropy 2018, 20, 152

13. 14. 15. 16.

17.

18. 19. 20. 21. 22. 23. 24. 25.

26. 27. 28. 29. 30. 31. 32. 33. 34. 35.

36.

18 of 19

Schmid, P.J. Dynamic mode decomposition of numerical and experimental data. J. Fluid Mech. 2010, 656, 5–28. [CrossRef] Grosek, J.; Kutz, J.N. Dynamic mode decomposition for real-time background/foreground separation in video. arXiv 2014, arXiv:1404.7592. Tirunagari, S.; Poh, N.; Windridge, D.; Ho, A.T.S. Detection of Face Spoofing Using Visual Dynamics. IEEE Trans. Inf. Forensics Secur. 2015, 10, 762–777. [CrossRef] Berger, E.; Sastuba, M.; Vogt, D.; Jung, B. Dynamic Mode Decomposition for perturbation estimation in human robot interaction. In Proceedings of the Ro-Man IEEE International Symposium on Robot and Human Interactive Communication, Edinburgh, UK, 25–29 August 2016; pp. 593–600. Brunton, B.W.; Johnson, L.A.; Ojemann, J.G.; Kutz, J.N. Extracting spatial–temporal coherent patterns in large-scale neural recordings using dynamic mode decomposition. J. Neurosci. Methods 2015, 258, 1–15. [CrossRef] [PubMed] Dawson, S.; Hemati, M.; Williams, M.; Rowley, C.W. Characterizing and correcting for the effect of sensor noise in the dynamic mode decomposition. Exp. Fluids 2016, 57, 1–19. [CrossRef] Hemati, M.S.; Rowley, C.W.; Deem, E.A.; Cattafesta, L.N. De-Biasing the Dynamic Mode Decomposition for Applied Koopman Spectral Analysis. J. Nonlinear Sci. 2015, 25, 1–40. Brunton, S.L.; Proctor, J.L.; Kutz, J.N. Compressive sampling and dynamic mode decomposition. arXiv 2013, arXiv:1312.5186. Jovanovic, M.; Schmid, P.; Nichols, J.W. Sparsity-promoting Dynamic Mode Decomposition. Phys. Fluids 2013, 26, 561–571. [CrossRef] Erichson, N.B.; Steven, L.B.; Kutz, J.N. Randomized Dynamic Mode Decomposition. arXiv 2017, arXiv:1702.02912. Erichson, N.B.; Donovan, C. Randomized low-rank Dynamic Mode Decomposition for motion detection. Comput. Vis. Image Underst. 2016, 146, 40–50. [CrossRef] Klus, S.; Gelß, P.; Peitz, S.; Schütte, C. Tensor-based dynamic mode decomposition. arXiv 2016, arXiv:1606.06625. Zhao, X.; Li, W.; Zhou, L.; Song, G.; Ba, Q. Application of support vector machine for pattern classification of active thermometry-based pipeline scour monitoring. Struct. Control Health Monit. 2015, 22, 903–918. [CrossRef] Metzler, C.A.; Maleki, A.; Baraniuk, R.G. From Denoising to Compressed Sensing. IEEE Trans. Inf. Theory 2016, 62, 5117–5144. [CrossRef] Parekh, A.; Selesnick, I.W. Enhanced Low-Rank Matrix Approximation. IEEE Signal Process. Lett. 2015, 23, 493–497. [CrossRef] ˙ Selesnick, I.W.; Ilker, B. Sparse Signal Estimation by Maximally Sparse Convex Optimization. IEEE Trans. Signal Process. 2014, 62, 1078–1092. [CrossRef] Chen, P.Y.; Selesnick, I.W. Translation-invariant shrinkage/thresholding of group sparse signals. Signal Process. 2013, 94, 476–489. [CrossRef] Chen, P.Y.; Selesnick, I.W. Group-sparse signal denoising: Nonconvex regularization, convex optimization. IEEE Trans. Signal Process. 2014, 62, 3464–3478. [CrossRef] Pincus, S.M. Approximate entropy as a measure of system complexity. Proc. Natl. Acad. Sci. USA 1991, 88, 2297–2301. [CrossRef] [PubMed] Richman, J.S.; Moorman, J.R. Physiological time-series analysis using approximate entropy and sample entropy. Am. J. Physiol. Heart Circ. Physiol. 2000, 278, H2039–H2049. [CrossRef] [PubMed] Bandt, C.; Pompe, B. Permutation entropy: A natural complexity measure for time series. Phys. Rev. Lett. 2002, 88, 174102. [CrossRef] [PubMed] Zunino, L.; Olivares, F.; Scholkmann, F.; Rosso, O.A. Permutation entropy based time series analysis: Equalities in the input signal can lead to false conclusions. Phys. Lett. A 2017, 381, 1883–1892. [CrossRef] Li, Y.; Xu, M.; Wei, Y.; Huang, W. A new rolling bearing fault diagnosis method based on multiscale permutation entropy and improved support vector machine based binary tree. Measurement 2016, 77, 80–94. [CrossRef] An, X.; Pan, L. Bearing fault diagnosis of a wind turbine based on variational mode decomposition and permutation entropy. Proc. Inst. Mech. Eng. Part O J. Risk Reliab. 2017, 231, 200–206. [CrossRef]

Entropy 2018, 20, 152

37. 38. 39. 40.

41. 42.

19 of 19

Wu, S.D.; Wu, P.H.; Wu, C.W.; Ding, J.J.; Wang, C.C. Bearing fault diagnosis based on multiscale permutation entropy and support vector machine. Entropy 2012, 14, 1343–1356. [CrossRef] Riedl, M.; Müller, A.; Wessel, N. Practical considerations of permutation entropy. Eur. Phys. J. Spec. Top. 2013, 222, 249–262. [CrossRef] Amigó, J.M.; Zambrano, S.; Sanjuán, M.A. Combinatorial detection of determinism in noisy time series. Europhys. Lett. 2008, 83, 60005. [CrossRef] Randall, R.B.; Antoni, J.; Chobsaard, S. The relationship between spectral correlation and envelope analysis in the diagnostics of bearing faults and other cyclostationary machine signals. Mech. Syst. Signal Process. 2001, 15, 945–962. [CrossRef] Intelligent Systems Division. Available online: https://ti.arc.nasa.gov/tech/dash/pcoe/prognostic-datarepository/#bearing (accessed on 27 August 2017). Han, T.; Liu, X.; Tan, A. Fault Diagnosis of Rolling Element Bearings Based on Multiscale Dynamic Time Warping. Measurement 2017, 95, 355–366. [CrossRef] © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Suggest Documents