<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.1 20151215//EN" "http://jats.nlm.nih.gov/publishing/1.1/JATS-journalpublishing1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="1.1">
<front>
<journal-meta>
<journal-id journal-id-type="pmc">CMC</journal-id>
<journal-id journal-id-type="nlm-ta">CMC</journal-id>
<journal-id journal-id-type="publisher-id">CMC</journal-id>
<journal-title-group>
<journal-title>Computers, Materials &#x0026; Continua</journal-title>
</journal-title-group>
<issn pub-type="epub">1546-2226</issn>
<issn pub-type="ppub">1546-2218</issn>
<publisher>
<publisher-name>Tech Science Press</publisher-name>
<publisher-loc>USA</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">14941</article-id>
<article-id pub-id-type="doi">10.32604/cmc.2021.014941</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>A Novel Method Based on UNET for Bearing Fault Diagnosis</article-title>
<alt-title alt-title-type="left-running-head">A Novel Method Based on UNET for Bearing Fault Diagnosis</alt-title>
<alt-title alt-title-type="right-running-head">A Novel Method Based on UNET for Bearing Fault Diagnosis</alt-title>
</title-group>
<contrib-group content-type="authors">
<contrib id="author-1" contrib-type="author" corresp="yes">
<name name-style="western">
<surname>Kumar</surname>
<given-names>Dileep</given-names>
</name>
<xref ref-type="aff" rid="aff-1">1</xref>
<email>dileepkalani1994@gmail.com</email>
</contrib>
<contrib id="author-2" contrib-type="author">
<name name-style="western">
<surname>Kalwar</surname>
<given-names>Imtiaz Hussain</given-names>
</name>
<xref ref-type="aff" rid="aff-2">2</xref>
</contrib>
<contrib id="author-3" contrib-type="author">
<name name-style="western">
<surname>Hussain</surname>
<given-names>Tanweer</given-names>
</name>
<xref ref-type="aff" rid="aff-1">1</xref>
</contrib>
<contrib id="author-4" contrib-type="author">
<name name-style="western">
<surname>Chowdhry</surname>
<given-names>Bhawani Shankar</given-names>
</name>
<xref ref-type="aff" rid="aff-1">1</xref>
</contrib>
<contrib id="author-5" contrib-type="author">
<name name-style="western">
<surname>Ujjan</surname>
<given-names>Sanaullah Mehran</given-names>
</name>
<xref ref-type="aff" rid="aff-1">1</xref>
</contrib>
<contrib id="author-6" contrib-type="author">
<name name-style="western">
<surname>Memon</surname>
<given-names>Tayab Din</given-names>
</name>
<xref ref-type="aff" rid="aff-3">3</xref>
</contrib>
<aff id="aff-1"><label>1</label><institution>National Centre of Robotics and Automation, HHCMS Lab, Mehran University of Engineering &#x0026; Technology</institution>, <addr-line>Jamshoro, 76020, Sindh</addr-line>, <country>Pakistan</country></aff>
<aff id="aff-2"><label>2</label><institution>Department of Electrical Engineering, DHA SUFFA University</institution>, <addr-line>Karachi, Sindh</addr-line>, <country>Pakistan</country></aff>
<aff id="aff-3"><label>3</label><institution>Department of Electronic Engineering, Mehran University of Engineering and Technology</institution>, <addr-line>Jamshoro, 76020, Sindh</addr-line>, <country>Pakistan</country></aff>
</contrib-group>
<author-notes>
<corresp id="cor1"><label>&#x002A;</label>Corresponding Author: Dileep Kumar. Email: <email>dileepkalani1994@gmail.com</email></corresp>
</author-notes>
<pub-date date-type="pub" publication-format="electronic"><day>31</day><month>5</month><year>2021</year></pub-date>
<volume>69</volume>
<issue>1</issue>
<fpage>393</fpage>
<lpage>408</lpage>
<history>
<date date-type="received"><day>28</day><month>10</month><year>2020</year></date>
<date date-type="accepted"><day>25</day><month>2</month><year>2021</year></date>
</history>
<permissions>
<copyright-statement>&#x00A9; 2021 Soother et al.</copyright-statement>
<copyright-year>2021</copyright-year>
<copyright-holder>Soother et al.</copyright-holder>
<license xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>This work is licensed under a <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution 4.0 International License</ext-link>, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
</license>
</permissions>
<self-uri content-type="pdf" xlink:href="TSP_CMC_14941.pdf"></self-uri>
<abstract>
<p>The reliability of rotating machines depends heavily on the smooth rolling of their bearings. Monitoring the working condition of bearings with suitable fault diagnosis and condition monitoring approaches is therefore essential for reliable operation. In recent years, Deep Learning (DL) has been widely applied to the condition monitoring of rotating machines owing to its performance. This paper proposes a novel bearing fault diagnosis method based on the processing and analysis of vibration images. The proposed method uses the UNET model, a recent development among DL models. The model is applied to 2D vibration images obtained by transforming the normalized amplitudes of time-series vibration data samples. Owing to its architecture, the UNET model performs pixel-level feature learning on the vibration images. The results demonstrate that the model can perform dense predictions without the loss of label information that is generally caused by the sliding-window labelling method. A comparative analysis with other DL models confirmed the superiority of the UNET model, which achieved a maximum accuracy of 98.91% and an F1-score of 99%.</p>
</abstract>
<kwd-group kwd-group-type="author">
<kwd>Condition monitoring</kwd>
<kwd>deep learning</kwd>
<kwd>fault diagnosis</kwd>
<kwd>rotating machines</kwd>
<kwd>vibration</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec id="s1">
<label>1</label>
<title>Introduction</title>
<p>Bearings are critical components of rotating machines used in applications such as automotive manufacturing, chemical industries, railways, and water pumping stations. Their health has a significant impact on the operation of industrial systems and on the economics of industry: 45%&#x2013;55% of faults in industrial motors are caused by bearing faults [<xref ref-type="bibr" rid="ref-1">1</xref>,<xref ref-type="bibr" rid="ref-2">2</xref>]. Effective fault diagnosis of bearings can therefore significantly increase the reliability and life span of machines in various industrial applications, which in turn reduces maintenance and operation costs [<xref ref-type="bibr" rid="ref-3">3</xref>,<xref ref-type="bibr" rid="ref-4">4</xref>]. Considering the importance of bearings in rotating machines, researchers have applied various methods to monitor the bearing condition effectively and to avoid downtime caused by bearing failure.</p>
<p>Vibration is considered an early and highly accurate indicator of various faults in mechanical equipment, and its measurement and analysis are widely employed with Artificial Intelligence (AI) based fault diagnosis and prognosis methods [<xref ref-type="bibr" rid="ref-5">5</xref>&#x2013;<xref ref-type="bibr" rid="ref-7">7</xref>]. Typically, a fault diagnosis procedure comprises three steps: data acquisition, feature extraction, and fault diagnosis. First, the data acquisition step for machinery fault detection involves signals such as vibration, current, voltage, temperature, and acoustic emissions. The second step extracts time-domain features (root mean square, skewness, kurtosis, and gap factor), frequency-domain features (Fourier transform), and time-frequency domain features (wavelet transform, short-time Fourier transform, empirical mode decomposition, and Hilbert-Huang transform) [<xref ref-type="bibr" rid="ref-8">8</xref>&#x2013;<xref ref-type="bibr" rid="ref-13">13</xref>]. Lastly, the fault diagnosis step has been widely explored by researchers using different model-based and AI-based techniques. Machine Learning (ML) based methods are the foundation of effective fault diagnosis in the arena of AI. Various studies in the literature have employed these techniques for fault diagnosis, for instance, Support Vector Machines (SVM), K-Nearest Neighbors (KNN), and Random Forest (RF) [<xref ref-type="bibr" rid="ref-14">14</xref>&#x2013;<xref ref-type="bibr" rid="ref-16">16</xref>].</p>
<p>ML methods have proven effective for fault diagnosis; however, they have inherent limitations, and their performance relies heavily on hand-designed features. Combined bearing fault detection often requires advanced signal processing techniques [<xref ref-type="bibr" rid="ref-17">17</xref>]. Moreover, these methods are not effective in practical industrial environments, where background noise and interference between signal components are inevitable [<xref ref-type="bibr" rid="ref-18">18</xref>]. Progress in AI has led to the emergence of DL, which has been widely exploited by researchers in various fields. DL is a subfield of AI that provides an efficient way of learning representative features automatically, even from raw and noisy input data, without human intervention [<xref ref-type="bibr" rid="ref-19">19</xref>,<xref ref-type="bibr" rid="ref-20">20</xref>]. Compared to ML methods, DL methods automatically learn rich features from raw data using high-performance computing hardware [<xref ref-type="bibr" rid="ref-7">7</xref>,<xref ref-type="bibr" rid="ref-21">21</xref>&#x2013;<xref ref-type="bibr" rid="ref-24">24</xref>]. Various DL architectures have been exploited for efficient bearing fault detection, including Convolutional Neural Networks (CNN), Long Short-Term Memory (LSTM), Auto-Encoders (AE), Deep Belief Networks (DBN), and Deep Boltzmann Machines (DBM) [<xref ref-type="bibr" rid="ref-3">3</xref>,<xref ref-type="bibr" rid="ref-25">25</xref>&#x2013;<xref ref-type="bibr" rid="ref-28">28</xref>]. Among these DL models, the CNN has been the most representative and has demonstrated robust and remarkable performance as a supervised learning approach [<xref ref-type="bibr" rid="ref-29">29</xref>,<xref ref-type="bibr" rid="ref-30">30</xref>]. Furthermore, Fully Convolutional Networks (FCN) can perform pixel-level semantic segmentation of image data owing to their rich hierarchical architecture. They have also made substantial progress in building higher-level feature hierarchies from lower-level features. These models learn features through complex mappings that extract spectral information from individual pixels [<xref ref-type="bibr" rid="ref-3">3</xref>,<xref ref-type="bibr" rid="ref-6">6</xref>]. Owing to these merits, the CNN model has been widely used to diagnose rolling bearing faults from 1D (one-dimensional) and 2D (two-dimensional) data with 1D-CNN and 2D-CNN configurations, respectively. Guo et al. [<xref ref-type="bibr" rid="ref-31">31</xref>] used an adaptive CNN (ADCNN) to classify four conditions of rolling bearings using the Case Western Reserve University (CWRU) bearing data. The model consists of LeNet-5 layers followed by stacked hierarchical layers that determine two output components: fault type and fault size. The adaptive learning rate maintained a trade-off between accuracy and training speed. This model performed competitively with earlier models.</p>
<p>In [<xref ref-type="bibr" rid="ref-32">32</xref>], a deep CNN model was tested on multisensory vibration data to classify nine bearing conditions. The model was able to learn features from the raw data and avoided overfitting through dropout regularization. Li et al. [<xref ref-type="bibr" rid="ref-33">33</xref>] employed a deep CNN with data augmentation. The model was trained with 400 data samples of the CWRU bearing dataset covering ten bearing conditions. Raw vibration data was fed to the CNN layers, and residual layers were then stacked for better feature learning. The network was trained with batch-normalization and dropout layers for fast training, and data augmentation was used to avoid overfitting. Although these DL architectures have demonstrated effective performance in bearing fault diagnosis, they face the multiclass window problem and a loss of information owing to the sliding-window labelling technique. These models are therefore limited in performing dense predictions.</p>
<p>Unlike the above-cited works, this paper presents a novel bearing fault diagnosis method based on UNET and vibration images. The proposed approach overcomes the multiclass window problem that occurs in DL models and can predict a label for each input data sample in the bearing data. The UNET model achieves pixel-level dense predictions through the down-sampling and up-sampling layers in its architecture. This model has been successfully applied in various domains and has shown excellent performance in terms of dense predictions [<xref ref-type="bibr" rid="ref-34">34</xref>&#x2013;<xref ref-type="bibr" rid="ref-36">36</xref>]. The contribution of this research article is twofold. The primary contribution is a novel UNET-based rolling bearing fault diagnosis method, which can perform dense predictions by overcoming the multiclass window problem caused by the sliding-window labelling technique. To the best of the authors&#x2019; knowledge, this is the first time the UNET network has been applied to diagnose bearing faults. The second contribution is the use of densely labelled 2D vibration images, without any data augmentation, as input to the UNET model for sample-based bearing condition classification.</p>
<p>The remainder of the paper is organized as follows: Section 2 discusses dense labelling and fault classification using UNET, Section 3 explains the UNET architecture and training process, Section 4 describes the dataset, experiment configuration, and experiment evaluation, Section 5 reports and discusses the results, and Section 6 concludes the investigation.</p>
</sec>
<sec id="s2">
<label>2</label>
<title>Dense Predictions Using UNET</title>
<p>Conventionally, time-series or sequential signals are divided into fixed-length windows using a sliding-window technique, and every sample in a window is assigned the same label. This label assignment leads to a multiclass window problem. There are two sliding-window labelling techniques: the first assigns the label of the most frequent sample class in the window, and the second assigns the label of the last sample in the window. With either technique, the classifier receives incorrect label information whenever a window spans more than one class, which decreases accuracy. The dense labelling technique solves this problem and increases classifier accuracy by providing correct label information. It assigns a label to each sample of the dataset rather than to each sliding window, and hence preserves all the label information in the dataset. <xref ref-type="fig" rid="fig-1">Fig. 1</xref> illustrates both labelling techniques. The current sample and the next sample are denoted S1 and S2. In the sliding-window labelling method, the window is assigned the label S1 of the most frequently appearing class. However, the window contains information from both classes. The model therefore learns incorrect information, which in turn lowers the recognition accuracy.</p>
<fig id="fig-1">
<label>Figure 1</label>
<caption>
<title>Sliding-window labelling and dense labelling</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CMC_14941-fig-1.tif"/></fig>
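<p>The difference between the two sliding-window rules and dense labelling can be illustrated with a minimal sketch. The function names, the window length, and the label sequence below are illustrative assumptions and are not taken from the experiments reported in this paper.</p>
<code language="python">
```python
from collections import Counter


def sliding_window_labels(labels, window, step, rule="majority"):
    """Assign one label per window using the majority or last-sample rule."""
    out = []
    for start in range(0, len(labels) - window + 1, step):
        chunk = labels[start:start + window]
        if rule == "majority":
            # Label of the most frequent class inside the window.
            out.append(Counter(chunk).most_common(1)[0][0])
        else:
            # "last" rule: label of the final sample in the window.
            out.append(chunk[-1])
    return out


def dense_labels(labels):
    """Dense labelling keeps one label per sample, so nothing is discarded."""
    return list(labels)


# Hypothetical per-sample classes: 0 = normal, 1 = faulty.
sample_labels = [0, 0, 0, 0, 0, 1, 1, 1]
majority = sliding_window_labels(sample_labels, window=8, step=8, rule="majority")
last = sliding_window_labels(sample_labels, window=8, step=8, rule="last")
dense = dense_labels(sample_labels)

# Samples whose true class differs from the single window label.
mislabelled_by_majority = sum(1 for c in sample_labels if c != majority[0])
```
</code>
<p>In this toy window the majority rule labels all eight samples as class 0 and the last-sample rule labels them all as class 1, so each rule mislabels part of the window, whereas the dense labels retain both classes.</p>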
<p>A traditional CNN restricts the implementation of dense prediction because its max-pooling operations reduce the resolution of the top-level output. In image recognition tasks, this pooling reduces sensitivity to image shift. In time-series recognition, however, it causes a mismatch between the input data length and the output label length. The UNET architecture overcomes this problem by adding up-convolution or up-sampling layers to the CNN architecture. It allows the use of dense labelling and preserves the same length for the input series and the output labels. UNET can therefore be applied to dense prediction, because it addresses the multiclass window problem that traditional CNNs often face.</p>
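<p>The length mismatch and its repair can be sketched on a one-dimensional toy signal. This is only an illustration under stated assumptions: the helper names and values are hypothetical, and nearest-neighbour repetition stands in for the learned up-convolution layers of UNET.</p>
<code language="python">
```python
def max_pool_1d(values, size=2):
    """Non-overlapping max-pooling: the output is shorter than the input."""
    return [max(values[i:i + size])
            for i in range(0, len(values) - size + 1, size)]


def upsample_1d(values, factor=2):
    """Nearest-neighbour up-sampling: repeat each value `factor` times."""
    return [v for v in values for _ in range(factor)]


signal = [0.1, 0.4, 0.3, 0.9, 0.2, 0.2, 0.7, 0.5]   # 8 samples, 8 dense labels
pooled = max_pool_1d(signal)                         # 4 values after pooling
restored = upsample_1d(pooled)                       # back to 8 values
```
</code>
<p>After pooling, only four outputs remain for eight labelled samples. Up-sampling restores one output position per input sample, which is what makes a per-sample label possible.</p>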
</sec>
<sec id="s3">
<label>3</label>
<title>Bearing Fault Classification Using UNET</title>
<sec id="s3_1">
<label>3.1</label>
<title>UNET Architecture</title>
<p>UNET is a recent development of the CNN that uses a deep layer architecture for automatic feature learning from input data. The multilayer architecture allows the network to learn more abstract features and to perform classification at the pixel level. To achieve pixel-level classification, an end-to-end UNET architecture was proposed by Ronneberger et al. [<xref ref-type="bibr" rid="ref-37">37</xref>] for semantic segmentation of biomedical images. The main objectives were to improve the segmentation precision and the localization of neuronal structures in microscopic images. The goal behind the development of the UNET model was to realize pixel-level classification through in-depth feature processing.</p>
<p>The UNET architecture consists of two paths, a contracting path and an expansive path, which are symmetric to each other and yield a U-shaped architecture, as shown in <xref ref-type="fig" rid="fig-2">Fig. 2</xref> (hence the name UNET). The network on the left side is the contracting path, which resembles a traditional CNN with convolution and pooling layers and activations, and thus learns the image contents. On the right side of <xref ref-type="fig" rid="fig-2">Fig. 2</xref> is the expansive path, which includes stacked up-sampling layers that correspond to the convolutional layers on the left side. The two network paths are merged to compensate for the loss of information caused by the pooling operation. As a result, the architecture preserves the same image resolution as in the input network layer.</p>
<fig id="fig-2">
<label>Figure 2</label>
<caption>
<title>UNET architecture</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CMC_14941-fig-2.tif"/>
</fig>
<p><xref ref-type="fig" rid="fig-2">Fig. 2</xref> shows that each block of the contracting network consists of two repeated, unpadded 3 &#x00D7; 3 convolution layers, each followed by a ReLU (Rectified Linear Unit) activation, and a 2 &#x00D7; 2 max-pooling layer with a stride of 2 for down-sampling. The expansive network consists of the same number of up-sampling layers as there are down-sampling layers. At the end, a fully connected layer and a softmax classifier map the input feature vectors to the corresponding output class. In the contracting network, the convolutional and pooling layers transform high-resolution images into low-resolution images, which is called down-sampling. In the expansive network, the transposed convolutional layers transform low-resolution images into high-resolution images, which is known as up-sampling. Transposed convolution operates in the opposite direction to normal convolution and is therefore often referred to as de-convolution. It up-samples the images using parameters learned through backpropagation.</p>
<p>In this paper, a 9-level UNET architecture is employed for efficient pixel-level feature learning in bearing fault classification. The architecture comprises three parts: an encoding network, a decoding network, and a bridge that connects the two networks, as given in <xref ref-type="table" rid="table-1">Tab. 1</xref>. The encoding network transforms vibration images into compact representations, and the decoding network recovers the transformed representations as pixel-wise categorizations. The complete network is constructed using 3 &#x00D7; 3 convolution layers, pooling layers, and the ReLU activation function.</p>
<table-wrap id="table-1">
<label>Table 1</label>
<caption>
<title>Structure of the UNET network</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th></th>
<th>Unit level</th>
<th>Layer</th>
<th>Filter</th>
<th>Stride</th>
<th>Output size</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td>Input</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>32 &#x00D7; 32 &#x00D7; 1</td>
</tr>
<tr>
<td rowspan="8">Encoding network</td>
<td rowspan="2">Level-1</td>
<td>Conv-1</td>
<td>3 &#x00D7; 3/16</td>
<td>2</td>
<td>32 &#x00D7; 32 &#x00D7; 16</td>
</tr>
<tr>
<td>Conv-2</td>
<td>3 &#x00D7; 3/16</td>
<td>2</td>
<td>32 &#x00D7; 32 &#x00D7; 16</td>
</tr>
<tr>
<td rowspan="2">Level-2</td>
<td>Conv-3</td>
<td>3 &#x00D7; 3/32</td>
<td>2</td>
<td>16 &#x00D7; 16 &#x00D7; 32</td>
</tr>
<tr>
<td>Conv-4</td>
<td>3 &#x00D7; 3/32</td>
<td>2</td>
<td>16 &#x00D7; 16 &#x00D7; 32</td>
</tr>
<tr>
<td rowspan="2">Level-3</td>
<td>Conv-5</td>
<td>3 &#x00D7; 3/64</td>
<td>2</td>
<td>8 &#x00D7; 8 &#x00D7; 64</td>
</tr>
<tr>
<td>Conv-6</td>
<td>3 &#x00D7; 3/64</td>
<td>2</td>
<td>8 &#x00D7; 8 &#x00D7; 64</td>
</tr>
<tr>
<td rowspan="2">Level-4</td>
<td>Conv-7</td>
<td>3 &#x00D7; 3/128</td>
<td>2</td>
<td>4 &#x00D7; 4 &#x00D7; 128</td>
</tr>
<tr>
<td>Conv-8</td>
<td>3 &#x00D7; 3/128</td>
<td>2</td>
<td>4 &#x00D7; 4 &#x00D7; 128</td>
</tr>
<tr>
<td rowspan="2">Bridge</td>
<td rowspan="2">Level-5</td>
<td>Conv-9</td>
<td>3 &#x00D7; 3/256</td>
<td>2</td>
<td>2 &#x00D7; 2 &#x00D7; 256</td>
</tr>
<tr>
<td>Conv-10</td>
<td>3 &#x00D7; 3/256</td>
<td>2</td>
<td>2 &#x00D7; 2 &#x00D7; 256</td>
</tr>
<tr>
<td rowspan="8">Decoding network</td>
<td rowspan="2">Level-6</td>
<td>Conv-11</td>
<td>3 &#x00D7; 3/128</td>
<td>2</td>
<td>4 &#x00D7; 4 &#x00D7; 128</td>
</tr>
<tr>
<td>Conv-12</td>
<td>3 &#x00D7; 3/128</td>
<td>2</td>
<td>4 &#x00D7; 4 &#x00D7; 128</td>
</tr>
<tr>
<td rowspan="2">Level-7</td>
<td>Conv-13</td>
<td>3 &#x00D7; 3/64</td>
<td>2</td>
<td>8 &#x00D7; 8 &#x00D7; 64</td>
</tr>
<tr>
<td>Conv-14</td>
<td>3 &#x00D7; 3/64</td>
<td>2</td>
<td>8 &#x00D7; 8 &#x00D7; 64</td>
</tr>
<tr>
<td rowspan="2">Level-8</td>
<td>Conv-15</td>
<td>3 &#x00D7; 3/32</td>
<td>2</td>
<td>16 &#x00D7; 16 &#x00D7; 32</td>
</tr>
<tr>
<td>Conv-16</td>
<td>3 &#x00D7; 3/32</td>
<td>2</td>
<td>16 &#x00D7; 16 &#x00D7; 32</td>
</tr>
<tr>
<td rowspan="2">Level-9</td>
<td>Conv-17</td>
<td>3 &#x00D7; 3/16</td>
<td>2</td>
<td>32 &#x00D7; 32 &#x00D7; 16</td>
</tr>
<tr>
<td>Conv-18</td>
<td>3 &#x00D7; 3/16</td>
<td>2</td>
<td>32 &#x00D7; 32 &#x00D7; 16</td>
</tr>
<tr>
<td>Output</td>
<td></td>
<td>Softmax</td>
<td></td>
<td></td>
<td>1 &#x00D7; 10</td>
</tr>
</tbody>
</table>
</table-wrap>
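<p>The output sizes listed in <xref ref-type="table" rid="table-1">Tab. 1</xref> can be reproduced with a short bookkeeping sketch. It is not the training code used in this paper. It assumes that every convolution preserves the spatial size, that the grid is halved between encoder levels, and that it is doubled between decoder levels; the function name and its arguments are illustrative.</p>
<code language="python">
```python
def unet_level_shapes(input_size=32, base_filters=16, depth=4):
    """Return (level, height, width, filters) for encoder, bridge and decoder.

    Assumes "same" padding in every convolution, a halving of the grid
    between encoder levels, and a doubling between decoder levels.
    """
    shapes = []
    size, filters = input_size, base_filters
    for level in range(1, depth + 1):                 # encoding network
        shapes.append((level, size, size, filters))
        size //= 2                                    # down-sampling halves the grid
        filters *= 2
    shapes.append((depth + 1, size, size, filters))   # bridge
    for level in range(depth + 2, 2 * depth + 2):     # decoding network
        size *= 2                                     # up-sampling doubles the grid
        filters //= 2
        shapes.append((level, size, size, filters))
    return shapes


shapes = unet_level_shapes()
for level, height, width, filters in shapes:
    print(f"Level-{level}: {height} x {width} x {filters}")
```
</code>
<p>With the default arguments the sketch lists nine levels, from 32 &#x00D7; 32 &#x00D7; 16 at Level-1 down to 2 &#x00D7; 2 &#x00D7; 256 at the bridge and back to 32 &#x00D7; 32 &#x00D7; 16 at Level-9, which agrees with the table.</p>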
</sec>
<sec id="s3_2">
<label>3.2</label>
<title>Network Training</title>
<p>The employed network architecture is shown in <xref ref-type="fig" rid="fig-2">Fig. 2</xref>. The network receives input vibration images of size (N, N, C), where N denotes the number of sampling points and C denotes the number of channels. The architecture consists of a total of 18 layers, including the contracting and expanding network layers. The input nodes <inline-formula id="ieqn-1"><mml:math id="mml-ieqn-1"><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> receive a three-dimensional input <inline-formula id="ieqn-2"><mml:math id="mml-ieqn-2"><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mi>l</mml:mi></mml:mrow></mml:msub><mml:mo>&#x00D7;</mml:mo><mml:msub><mml:mi>w</mml:mi><mml:mrow><mml:mi>l</mml:mi></mml:mrow></mml:msub><mml:mo>&#x00D7;</mml:mo><mml:msub><mml:mi>f</mml:mi><mml:mrow><mml:mi>l</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>, where <inline-formula id="ieqn-3"><mml:math id="mml-ieqn-3"><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mi>l</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula id="ieqn-4"><mml:math id="mml-ieqn-4"><mml:msub><mml:mi>w</mml:mi><mml:mrow><mml:mi>l</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> are the height and width of the input vector at a location (i, j) and <inline-formula id="ieqn-5"><mml:math id="mml-ieqn-5"><mml:msub><mml:mi>f</mml:mi><mml:mrow><mml:mi>l</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is the number of input feature maps in layer <inline-formula id="ieqn-6"><mml:math id="mml-ieqn-6"><mml:mi>l</mml:mi></mml:math></inline-formula>. 
The output of layer <inline-formula id="ieqn-7"><mml:math id="mml-ieqn-7"><mml:mi>l</mml:mi></mml:math></inline-formula> is represented by <inline-formula id="ieqn-8"><mml:math id="mml-ieqn-8"><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and can be found by the formula given in <xref ref-type="disp-formula" rid="eqn-1">Eq. (1)</xref>:</p>
<p><disp-formula id="eqn-1">
<label>(1)</label>
<mml:math id="mml-eqn-1" display="block"><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>f</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mrow><mml:mo>{</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>+</mml:mo><mml:msup><mml:mi>i</mml:mi><mml:mo>&#x2032;</mml:mo></mml:msup><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>+</mml:mo><mml:msup><mml:mi>j</mml:mi><mml:mo>&#x2032;</mml:mo></mml:msup></mml:mrow></mml:msub><mml:mo>}</mml:mo></mml:mrow><mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mstyle displaystyle="true" scriptlevel="0"><mml:mfrac><mml:mrow><mml:msub><mml:mi>k</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mn>2</mml:mn></mml:mfrac><mml:mo>&#x2264;</mml:mo><mml:msup><mml:mi>i</mml:mi><mml:mo>&#x2032;</mml:mo></mml:msup><mml:mo>&#x2264;</mml:mo><mml:mstyle displaystyle="true" scriptlevel="0"><mml:mfrac><mml:mrow><mml:msub><mml:mi>k</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mn>2</mml:mn></mml:mfrac><mml:mo>,</mml:mo><mml:mo>&#x2212;</mml:mo><mml:mstyle displaystyle="true" scriptlevel="0"><mml:mfrac><mml:mrow><mml:msub><mml:mi>k</mml:mi><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mn>2</mml:mn></mml:mfrac><mml:mo>&#x2264;</mml:mo><mml:msup><mml:mi>j</mml:mi><mml:mo>&#x2032;</mml:mo></mml:msup><mml:mo>&#x2264;</mml:mo><mml:mstyle displaystyle="true" scriptlevel="0"><mml:mfrac><mml:mrow><mml:msub><mml:mi>k</mml:mi><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mn>2</mml:mn></mml:mfrac></mml:mstyle></mml:mstyle></mml:mstyle></mml:mstyle></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math>
</disp-formula></p>
<p>where <inline-formula id="ieqn-9"><mml:math id="mml-ieqn-9"><mml:msub><mml:mi>k</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula id="ieqn-10"><mml:math id="mml-ieqn-10"><mml:msub><mml:mi>k</mml:mi><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> denote the sizes of the convolution kernel and <inline-formula id="ieqn-11"><mml:math id="mml-ieqn-11"><mml:mi>f</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mo>&#x22C5;</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> denotes the operation performed by the layer (for example, a non-linear activation, the matrix multiplication of a convolution layer, or the max-pooling operation of a pooling layer). The output of layer <inline-formula id="ieqn-12"><mml:math id="mml-ieqn-12"><mml:mi>l</mml:mi></mml:math></inline-formula> becomes the input of layer <inline-formula id="ieqn-13"><mml:math id="mml-ieqn-13"><mml:mi>l</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn></mml:math></inline-formula>. 
Its size is given by <inline-formula id="ieqn-14"><mml:math id="mml-ieqn-14"><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mi>l</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>&#x00D7;</mml:mo><mml:msub><mml:mi>w</mml:mi><mml:mrow><mml:mi>l</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>&#x00D7;</mml:mo><mml:msub><mml:mi>f</mml:mi><mml:mrow><mml:mi>l</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>; where <inline-formula id="ieqn-15"><mml:math id="mml-ieqn-15"><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mi>l</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo></mml:math></inline-formula> (<inline-formula id="ieqn-16"><mml:math id="mml-ieqn-16"><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mi>l</mml:mi></mml:mrow></mml:msub><mml:mo>&#x2212;</mml:mo><mml:msub><mml:mi>k</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mrow><mml:mo>/</mml:mo></mml:mrow><mml:msub><mml:mi>s</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:mn>1</mml:mn></mml:math></inline-formula> and <inline-formula id="ieqn-17"><mml:math id="mml-ieqn-17"><mml:msub><mml:mi>w</mml:mi><mml:mrow><mml:mi>l</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo></mml:math></inline-formula> (<inline-formula id="ieqn-18"><mml:math id="mml-ieqn-18"><mml:msub><mml:mi>w</mml:mi><mml:mrow><mml:mi>l</mml:mi></mml:mrow></mml:msub><mml:mo>&#x2212;</mml:mo><mml:msub><mml:mi>k</mml:mi><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mrow><mml:mo>/</mml:mo></mml:mrow><mml:msub><mml:mi>s</mml:mi><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:mn>1</mml:mn></mml:math></inline-formula>. 
The strides are denoted by <inline-formula id="ieqn-19"><mml:math id="mml-ieqn-19"><mml:msub><mml:mi>s</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula id="ieqn-20"><mml:math id="mml-ieqn-20"><mml:msub><mml:mi>s</mml:mi><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>. Applying the filling (padding) operation to the input feature map keeps the resolution of the output feature map the same as that of the input feature map.</p>
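<p>The size relation above can be checked numerically with a small helper. The function name is hypothetical, and the padding argument is an added assumption: it uses the standard zero-padding extension of the formula, which reduces to the expression in the text when the padding is zero.</p>
<code language="python">
```python
def conv_output_size(in_size, kernel, stride=1, padding=0):
    """Output size along one axis: (in_size + 2 * padding - kernel) // stride + 1.

    With padding = 0 this is the (h - k) / s + 1 relation given in the text.
    """
    return (in_size + 2 * padding - kernel) // stride + 1


unpadded = conv_output_size(32, kernel=3)              # 3 x 3 kernel, no padding
padded = conv_output_size(32, kernel=3, padding=1)     # one-pixel border of zeros
pooled = conv_output_size(32, kernel=2, stride=2)      # 2 x 2 pooling, stride 2
```
</code>
<p>For a 32-pixel axis, an unpadded 3 &#x00D7; 3 convolution gives 30 outputs, a one-pixel padding restores 32, and 2 &#x00D7; 2 pooling with a stride of 2 gives 16.</p>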
<p>To achieve effective and accurate dense predictions using UNET, the network is trained to estimate appropriate parameters (W, b) from a given dataset and its corresponding labels. Accurate dense predictions are obtained by minimizing the loss over all the samples in the training dataset. The loss function is expressed in <xref ref-type="disp-formula" rid="eqn-2">Eq. (2)</xref>:</p>
<p><disp-formula id="eqn-2">
<label>(2)</label>
<mml:math id="mml-eqn-2" display="block"><mml:mi>l</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>,</mml:mo><mml:mi>y</mml:mi><mml:mo>;</mml:mo><mml:mi>W</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi>J</mml:mi></mml:mrow><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:munderover><mml:mi>l</mml:mi><mml:mrow><mml:msup><mml:mi></mml:mi><mml:mo>&#x2032;</mml:mo></mml:msup></mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mi>x</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>;</mml:mo><mml:mi>W</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math>
</disp-formula></p>
<p>where <inline-formula id="ieqn-21"><mml:math id="mml-ieqn-21"><mml:msup><mml:mi>l</mml:mi><mml:mo>&#x2032;</mml:mo></mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>;</mml:mo><mml:mi>W</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mo>&#x2212;</mml:mo><mml:mi>log</mml:mi><mml:mo>&#x2061;</mml:mo><mml:mi>p</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mi>x</mml:mi><mml:mo>,</mml:mo><mml:mi>W</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> denotes the negative log-likelihood loss of the j-th sample in a batch.</p>
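<p>To make the batch loss concrete, the following sketch sums the negative logarithm of the probability assigned to the true class over the samples of a batch. It assumes that the class probabilities come from a softmax over raw network outputs; the function names and the example values are hypothetical and serve only as an illustration.</p>
<p><code language="python">
```python
import math


def softmax(logits):
    """Turn raw scores into probabilities (shifted by the maximum for stability)."""
    shift = max(logits)
    exps = [math.exp(v - shift) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def batch_loss(batch_logits, labels):
    """Sum over the batch of -log p(y_j | x, W, b) for the true label y_j."""
    loss = 0.0
    for logits, label in zip(batch_logits, labels):
        prob_true = softmax(logits)[label]
        loss += -math.log(prob_true)
    return loss


# Hypothetical batch of two samples with three classes each.
example_logits = [[2.0, 0.5, 0.1], [0.2, 0.1, 3.0]]
example_labels = [0, 2]
print(batch_loss(example_logits, example_labels))
```
</code></p>
<p>A confident and correct prediction contributes a value close to zero, whereas a prediction that assigns low probability to the true class is penalized heavily.</p>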
</sec>
</sec>
<sec id="s4">
<label>4</label>
<title>Experimental Study</title>
<p>This section reports the experiments performed on a rolling bearing dataset using the UNET model and describes how its performance is investigated and evaluated. It also compares the performance of the UNET model with that of other DL architectures employed in recent research on bearing fault diagnosis. First, the dataset employed in this research is explained; then, the experimental configuration and evaluation metrics are discussed.</p>
<sec id="s4_1">
<label>4.1</label>
<title>Dataset and Vibration Image Construction</title>
<p>In this investigation, the CWRU bearing dataset is used to evaluate the performance of the proposed model [<xref ref-type="bibr" rid="ref-38">38</xref>]. This dataset is chosen because it is the benchmark most widely employed by researchers for bearing fault diagnosis [<xref ref-type="bibr" rid="ref-4">4</xref>]. Furthermore, it is openly accessible, which allows the research community to evaluate the performance of proposed algorithms. The experimental setup includes a dynamometer (right), a 2 HP Reliance electric motor (left), and a torque encoder (center), as depicted in <xref ref-type="fig" rid="fig-3">Fig. 3</xref>. The setup also includes a control system for proper operation, which is not shown in the figure. The test bearings support the motor shaft. Single-point artificial faults of different sizes are introduced into the rolling bearings using electric discharge machining. The vibration data is collected using accelerometers attached to the fan-end, the drive-end, and the base of the motor. These accelerometers are attached using magnetic bases at the 12 o&#x2019;clock position. The data is acquired using a 16-channel data recorder. The setup is operated under four load conditions, from 0 to 3 HP, applied by the dynamometer, with motor speeds ranging between 1720 and 1797 rpm.</p>
<fig id="fig-3">
<label>Figure 3</label>
<caption>
<title>Experimental setup</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CMC_14941-fig-3.tif"/>
</fig>
<p>In this paper, a balanced dataset is used, which includes different bearing conditions from the CWRU bearing dataset at 1 HP load. These conditions comprise the healthy condition and nine bearing faults of different sizes (0.007, 0.014, and 0.021 inch). The dataset thus includes a total of 10 classes of vibration data, collected from the drive-end of the setup at a sampling rate of 48 kHz. The number of samples for each condition is given in <xref ref-type="table" rid="table-2">Tab. 2</xref> together with the class labels.</p>
<table-wrap id="table-2">
<label>Table 2</label>
<caption>
<title>Bearing fault class labels and data samples</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Bearing condition</th>
<th>Class label</th>
<th>No. of samples</th>
</tr>
</thead>
<tbody>
<tr>
<td>Ball fault (0.007 inch)</td>
<td>Ball_007</td>
<td>460</td>
</tr>
<tr>
<td>Ball fault (0.014 inch)</td>
<td>Ball_014</td>
<td>460</td>
</tr>
<tr>
<td>Ball fault (0.021 inch)</td>
<td>Ball_021</td>
<td>460</td>
</tr>
<tr>
<td>Inner race fault (0.007 inch)</td>
<td>IR_007</td>
<td>460</td>
</tr>
<tr>
<td>Inner race fault (0.014 inch)</td>
<td>IR_014</td>
<td>460</td>
</tr>
<tr>
<td>Inner race fault (0.021 inch)</td>
<td>IR_021</td>
<td>460</td>
</tr>
<tr>
<td>Normal bearing</td>
<td>Normal</td>
<td>460</td>
</tr>
<tr>
<td>Outer race fault (0.007 inch)</td>
<td>OR_007</td>
<td>460</td>
</tr>
<tr>
<td>Outer race fault (0.014 inch)</td>
<td>OR_014</td>
<td>460</td>
</tr>
<tr>
<td>Outer race fault (0.021 inch)</td>
<td>OR_021</td>
<td>460</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>The vibration signatures of the bearings, obtained as 1D data, can be transformed into 2D images. The process first normalizes each sample of the vibration signal into the range [&#x2212;1, 1]. Then, the normalized amplitude of each sample is transformed into the corresponding pixel of the vibration image [<xref ref-type="bibr" rid="ref-39">39</xref>]. This transformation between the normalized amplitude of the vibration signal and the pixels is expressed in <xref ref-type="disp-formula" rid="eqn-3">Eq. (3)</xref>:</p>
<p><disp-formula id="eqn-3">
<label>(3)</label>
<mml:math id="mml-eqn-3" display="block"><mml:mi>P</mml:mi><mml:mrow><mml:mo>[</mml:mo><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>]</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mi>A</mml:mi><mml:mo stretchy="false">[</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow><mml:mi>M</mml:mi><mml:mo>+</mml:mo><mml:mi>j</mml:mi><mml:mo stretchy="false">]</mml:mo></mml:math>
</disp-formula></p>
<p>where P[i, j] is the intensity of the corresponding pixel in the M &#x00D7; N vibration image, i &#x003D; 1:N, and j &#x003D; 1:M. A[&#x00B7;] represents the normalized amplitude of each vibration sample. The samples of a vibration signal are thus transformed into an equal number of pixels in the vibration image.</p>
<p>The dataset including these bearing conditions is transformed into vibration images with the minimum resolution. The transformation resulted in a total of 4600 vibration images (460 per class) of size 32 &#x00D7; 32 &#x00D7; 1, so that each image holds 1024 vibration samples. <xref ref-type="fig" rid="fig-4">Fig. 4</xref> shows the transformed vibration images of the ten bearing conditions with the appropriate class labels.</p>
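<p>The image construction can be sketched in a few lines of Python. The sketch assumes that the pixels are filled row by row, so that with 0-based indices pixel (i, j) takes sample i &#x00D7; M + j, and that each segment is scaled to [&#x2212;1, 1] by min-max normalization; neither detail is specified beyond the description above, and all function names are hypothetical.</p>
<p><code language="python">
```python
def normalize(signal):
    """Scale samples linearly to [-1, 1]; min-max scaling is an assumption here."""
    lo, hi = min(signal), max(signal)
    if hi == lo:
        return [0.0 for _ in signal]
    return [2.0 * (v - lo) / (hi - lo) - 1.0 for v in signal]


def signal_to_image(signal, rows=32, cols=32):
    """Fill a rows x cols image row by row with the normalized amplitudes."""
    if len(signal) != rows * cols:
        raise ValueError("signal length must equal rows * cols")
    amplitudes = normalize(signal)
    return [amplitudes[i * cols:(i + 1) * cols] for i in range(rows)]


def segment(signal, length=1024):
    """Split a long record into non-overlapping segments (also an assumption)."""
    count = len(signal) // length
    return [signal[k * length:(k + 1) * length] for k in range(count)]


# Hypothetical record of 2048 samples: two segments, hence two 32 x 32 images.
record = [float((7 * n) % 13) for n in range(2048)]
images = [signal_to_image(part) for part in segment(record)]
print(len(images), len(images[0]), len(images[0][0]))
```
</code></p>
<p>Each 1024-sample segment maps to exactly one 32 &#x00D7; 32 image, so no sample is dropped or duplicated within a segment.</p>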
<fig id="fig-4">
<label>Figure 4</label>
<caption>
<title>Vibration images of the bearing conditions</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CMC_14941-fig-4.tif"/>
</fig>
</sec>
<sec id="s4_2">
<label>4.2</label>
<title>Experiment Configuration</title>
<p>To train the UNET network, the vibration images of the different bearing conditions are given as input. The size of each vibration image is 32 &#x00D7; 32 &#x00D7; 1, denoting height, width, and number of channels, respectively. The model is trained with the Adam optimizer, a batch size of 128, and a dropout rate of 0.2. The learning rate is set adaptively with a minimum value of 1 &#x00D7; 10<sup>&#x2212;8</sup>. The adaptive learning rate callback reduces the learning rate by a factor of 0.1 if the validation loss does not improve for five iterations. The early stopping callback avoids overfitting by stopping the training if the validation loss does not improve for ten consecutive iterations, after which the learned model parameters are saved. Finally, the softmax function is used to classify the ten bearing conditions. The dataset is randomly divided into training, test, and validation sets in proportions of 70%, 20%, and 10%, respectively.</p>
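<p>The interplay of the adaptive learning rate and early stopping can be illustrated with a framework-independent sketch. It mimics the behaviour described above in plain Python and is not the callback implementation of any DL framework; the class name, the initial learning rate of 0.001, and the example loss values are assumptions made for illustration.</p>
<p><code language="python">
```python
class PlateauSchedule:
    """Tracks the validation loss, lowers the learning rate, and signals a stop."""

    def __init__(self, lr=1e-3, factor=0.1, lr_patience=5,
                 stop_patience=10, min_lr=1e-8):
        self.lr = lr                    # initial value is an assumption
        self.factor = factor            # multiplicative reduction (0.1)
        self.lr_patience = lr_patience  # checks without improvement before reducing
        self.stop_patience = stop_patience
        self.min_lr = min_lr            # lower bound of the learning rate
        self.best = float("inf")
        self.since_best = 0             # checks since the best validation loss
        self.since_lr_drop = 0          # checks since the last improvement or reduction

    def update(self, val_loss):
        """Register one validation loss; return True when training should stop."""
        if self.best > val_loss:
            self.best = val_loss
            self.since_best = 0
            self.since_lr_drop = 0
            return False
        self.since_best += 1
        self.since_lr_drop += 1
        if self.since_lr_drop >= self.lr_patience:
            self.lr = max(self.lr * self.factor, self.min_lr)
            self.since_lr_drop = 0
        return self.since_best >= self.stop_patience


# Hypothetical run: three improving checks followed by a plateau.
schedule = PlateauSchedule()
history = [1.0, 0.8, 0.7] + [0.7] * 10
stops = [schedule.update(loss) for loss in history]
print(stops.index(True) + 1, schedule.lr)
```
</code></p>
<p>In this run the learning rate is reduced twice during the plateau before the stop signal is raised at the tenth check without improvement.</p>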
<p>To benchmark the proposed model, three DL models that were investigated in recent research on this dataset are employed: FCN [<xref ref-type="bibr" rid="ref-40">40</xref>], LeNet-5 [<xref ref-type="bibr" rid="ref-41">41</xref>], and ResNet-50 [<xref ref-type="bibr" rid="ref-42">42</xref>]. These models are described briefly below.
<list list-type="bullet">
<list-item>
<p>FCN: This model consists of six layers: two convolutional layers, two max-pooling layers, and two fully connected (FC) layers. The model is employed for bearing condition diagnosis with a dropout rate of 0.2, a batch size of 128, and zero padding. A softmax classifier is stacked at the final stage.</p></list-item>
<list-item>
<p>LeNet-5: This is a 2D network comprising nine layers: three convolutional layers, three max-pooling layers, and three FC layers. A softmax layer is added at the end of the model to classify the bearing conditions. The dropout rate is set to 0.2 and the batch size to 128.</p></list-item>
<list-item>
<p>ResNet-50: This model optimizes its parameters through a &#x201C;shortcut connection or residual unit&#x201D; in each convolutional block, which also helps to avoid overfitting without loss of important information. The model consists of two types of blocks: stacked structure blocks and two FC blocks. The stacked structure blocks keep the number of training parameters low by reducing the size of the feature map, which in turn lowers the hardware requirements. Each stacked block consists of a convolution layer, a max-pooling layer, and batch normalization, while the FC blocks classify the bearing conditions. Here, this model is used with a dropout rate of 0.2 and a batch size of 128.</p></list-item>
</list></p>
<p>The experiments are run on a system equipped with a GeForce RTX 2060 GPU. The models are implemented in Python 3.7.1 using the Keras and TensorFlow DL frameworks.</p>
</sec>
<sec id="s4_3">
<label>4.3</label>
<title>Experiment Evaluation</title>
<p>To compare the performance of the UNET model with that of the comparative models, the evaluation indexes given in <xref ref-type="disp-formula" rid="eqn-4">Eqs. (4)</xref>&#x2013;<xref ref-type="disp-formula" rid="eqn-7">(7)</xref> are used:</p>
<p><disp-formula id="eqn-4">
<label>(4)</label>
<mml:math id="mml-eqn-4" display="block"><mml:mtext>Accuracy</mml:mtext><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mrow><mml:mi mathvariant="normal">T</mml:mi><mml:mi mathvariant="normal">P</mml:mi></mml:mrow><mml:mo>+</mml:mo><mml:mrow><mml:mi mathvariant="normal">T</mml:mi><mml:mi mathvariant="normal">N</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mi mathvariant="normal">m</mml:mi></mml:mrow></mml:mfrac><mml:mo>&#x00D7;</mml:mo><mml:mn>100</mml:mn></mml:math>
</disp-formula></p>
<p><disp-formula id="eqn-5">
<label>(5)</label>
<mml:math id="mml-eqn-5" display="block"><mml:mtext>Precision</mml:mtext><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mi mathvariant="normal">T</mml:mi><mml:mi mathvariant="normal">P</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">T</mml:mi><mml:mi mathvariant="normal">P</mml:mi></mml:mrow><mml:mo>+</mml:mo><mml:mrow><mml:mi mathvariant="normal">F</mml:mi><mml:mi mathvariant="normal">P</mml:mi></mml:mrow></mml:mrow></mml:mfrac></mml:math>
</disp-formula></p>
<p><disp-formula id="eqn-6">
<label>(6)</label>
<mml:math id="mml-eqn-6" display="block"><mml:mtext>Recall</mml:mtext><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mi mathvariant="normal">T</mml:mi><mml:mi mathvariant="normal">P</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">T</mml:mi><mml:mi mathvariant="normal">P</mml:mi></mml:mrow><mml:mo>+</mml:mo><mml:mrow><mml:mi mathvariant="normal">F</mml:mi><mml:mi mathvariant="normal">N</mml:mi></mml:mrow></mml:mrow></mml:mfrac></mml:math>
</disp-formula></p>
<p><disp-formula id="eqn-7">
<label>(7)</label>
<mml:math id="mml-eqn-7" display="block"><mml:mrow><mml:mi mathvariant="normal">F</mml:mi></mml:mrow><mml:mn>1</mml:mn><mml:mtext>-Score</mml:mtext><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mn>2</mml:mn><mml:mo stretchy="false">(</mml:mo><mml:mtext>Precision&#xA0;</mml:mtext><mml:mo>&#x00D7;</mml:mo><mml:mtext>Recall</mml:mtext><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mrow><mml:mtext>Precision</mml:mtext><mml:mo>+</mml:mo><mml:mtext>Recall</mml:mtext></mml:mrow></mml:mfrac></mml:math>
</disp-formula></p>
<p>where TP denotes the true positives, TN the true negatives, FP the false positives, FN the false negatives, and m the number of examples in the data.</p>
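<p>For a multiclass problem, these indexes can be computed per class from a confusion matrix by treating one class as positive and all others as negative. The sketch below follows this one-vs-rest reading, which is an assumption, as is the convention that rows hold the true classes and columns the predicted classes; the function name and the example matrix are hypothetical.</p>
<p><code language="python">
```python
def per_class_metrics(confusion, cls):
    """One-vs-rest indexes for class cls; confusion[t][p] counts true t, predicted p."""
    m = sum(sum(row) for row in confusion)            # number of examples
    tp = confusion[cls][cls]
    fn = sum(confusion[cls]) - tp                     # true cls, predicted otherwise
    fp = sum(row[cls] for row in confusion) - tp      # predicted cls, true otherwise
    tn = m - tp - fn - fp
    accuracy = (tp + tn) / m * 100
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Hypothetical two-class confusion matrix with 20 examples.
example = [[8, 2],
           [1, 9]]
print(per_class_metrics(example, 0))
```
</code></p>
<p>Averaging the per-class values over all classes gives a single score per model; which averaging scheme is used for the reported results is not stated here.</p>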
</sec>
</sec>
<sec id="s5">
<label>5</label>
<title>Results and Discussions</title>
<p>In this section, the performance of the four models is compared and summarized based on the accuracy and F1-Score indexes. The best results obtained with the UNET and the comparative methods are shown in <xref ref-type="table" rid="table-3">Tab. 3</xref>. The results indicate the effectiveness of the UNET method, which outperforms the comparative DL methods owing to its capability of dense prediction. The UNET achieved the best accuracy of 98.91% and the highest F1-Score of 99% among all the DL models employed on the dataset in this research. FCN demonstrated the second-best performance with an accuracy of 97.61% and an F1-Score of 97.6%, followed by LeNet-5 with an accuracy and F1-Score of 96.74% and 96.8%, respectively. ResNet-50 demonstrated the lowest performance in diagnosing the rolling bearing conditions, with an accuracy of 95.43% and an F1-Score of 95.5%. The comparative models yielded lower accuracies than the UNET owing to the loss of useful information caused by the sliding window labelling technique. In contrast, the UNET model preserves useful information from the vibration images owing to its hierarchical network architecture, which allowed it to perform better than the comparative models. These results show the UNET model to be the most effective of the DL models used in this investigation.</p>
<table-wrap id="table-3">
<label>Table 3</label>
<caption>
<title>The evaluation indexes of the UNET and the comparative methods on CWRU bearing dataset</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Indicator</th>
<th>UNET (%)</th>
<th>LeNet-5 (%)</th>
<th>ResNet-50 (%)</th>
<th>FCN (%)</th>
</tr>
</thead>
<tbody>
<tr>
<td>Accuracy</td>
<td>98.91</td>
<td>96.74</td>
<td>95.43</td>
<td>97.61</td>
</tr>
<tr>
<td>F1-Score</td>
<td>99.00</td>
<td>96.80</td>
<td>95.50</td>
<td>97.60</td>
</tr>
</tbody>
</table>
</table-wrap>
<p><xref ref-type="fig" rid="fig-5">Fig. 5</xref> shows the F1-Score of the UNET model and the comparative DL models for each bearing condition. The UNET model has the best F1-Score for each bearing condition.</p>
<fig id="fig-5">
<label>Figure 5</label>
<caption>
<title>F1-Score of the UNET and comparative methods for each bearing condition</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CMC_14941-fig-5.tif"/>
</fig>
<p><xref ref-type="fig" rid="fig-6">Fig. 6</xref> shows the confusion matrices of the UNET and the comparative DL methods for each bearing condition. As shown in <xref ref-type="fig" rid="fig-6">Fig. 6a</xref>, the accuracy of the UNET model for the individual classes varies between 95.65% and 100%. Similarly, the individual class accuracy of LeNet-5 is shown in <xref ref-type="fig" rid="fig-6">Fig. 6b</xref>, with a minimum of 84.75% and a maximum of 100%. <xref ref-type="fig" rid="fig-6">Fig. 6c</xref> shows the individual class accuracy of the ResNet-50 model, which varies between 82.61% and 100%. Lastly, the confusion matrix of the FCN model in <xref ref-type="fig" rid="fig-6">Fig. 6d</xref> shows that the individual class accuracy varies between 82.61% and 100%. These confusion matrices confirm the UNET model as the best of the employed DL models for classifying the individual bearing conditions.</p>
<fig id="fig-6">
<label>Figure 6</label>
<caption>
<title>Confusion matrix of the (a) UNET, (b) LeNet-5, (c) ResNet-50, and (d) FCN</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CMC_14941-fig-6.tif"/></fig>
<p><xref ref-type="fig" rid="fig-7">Fig. 7</xref> shows the training accuracy and loss of the employed DL models. These models were trained with the early stopping callback, which stops the training when a DL model achieves its maximum accuracy without overfitting. Thus, the number of epochs varies for each DL model. <xref ref-type="fig" rid="fig-7">Fig. 7a</xref> shows that the accuracy of the UNET model stabilizes after the 30th epoch, reaching 98.91%, and that its loss improves after the 25th epoch. The LeNet-5 model achieves an accuracy of 96.74%, but its response keeps fluctuating, as shown in <xref ref-type="fig" rid="fig-7">Fig. 7b</xref>. Similarly, the ResNet-50 model achieves an accuracy of 95.43% with a fluctuating response for both the accuracy and the loss, as depicted in <xref ref-type="fig" rid="fig-7">Fig. 7c</xref>. Lastly, <xref ref-type="fig" rid="fig-7">Fig. 7d</xref> shows that the accuracy of the FCN is around 97.61% and that its accuracy and loss improve after 20 epochs. The figures show that the UNET model has the most stable classification performance with the highest classification accuracy.</p>
<fig id="fig-7">
<label>Figure 7</label>
<caption>
<title>Accuracy and loss of the (a) UNET, (b) LeNet-5, (c) ResNet-50, and (d) FCN</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CMC_14941-fig-7a.tif"/>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CMC_14941-fig-7b.tif"/>
</fig>
<p>Although the UNET model was fed with vibration images of a resolution of only 32 &#x00D7; 32 &#x00D7; 1, it yielded excellent performance in classifying the bearing conditions owing to its pixel-level learning capabilities. Moreover, with the adaptive learning rate, the model learned features hierarchically through adjusting the learning rate whenever its performance did not improve for a certain number of iterations. Early stopping, in turn, helped to avoid overfitting by halting the training when the model performance stopped improving. The UNET model showed robustness in the training process, as shown in <xref ref-type="fig" rid="fig-7">Fig. 7a</xref>.</p>
<p>The UNET model was able to predict sample-based rolling bearing conditions owing to its inherent property of dense prediction. Its comparison with the other popular DL models revealed that the UNET model has superior and robust performance in rolling bearing fault diagnosis. The results indicate that the proposed algorithm has excellent potential for fault diagnosis of rotating machines in various industrial applications.</p>
</sec>
<sec id="s6">
<label>6</label>
<title>Conclusion</title>
<p>In this paper, a novel DL method, namely UNET, is proposed for rolling bearing fault classification based on vibration images. Compared to the existing DL methods, this model overcomes the multiclass window problem that is inherent in the sliding window labelling method. The UNET model used in this investigation effectively performed dense predictions of the bearing conditions with an accuracy of 98.91% and an F1-Score of 99%. The obtained results confirm that the UNET model outperforms the comparative DL models, namely FCN, LeNet-5, and ResNet-50. The model yielded more robust and better results than the comparative methods on short-term feature recognition. The robust classification results of the UNET model on this bearing dataset indicate its excellent potential for applications in other domains.</p>
</sec>
</body>
<back>
<fn-group>
<fn fn-type="other">
<p><bold>Funding Statement:</bold> The authors would like to acknowledge the support of the &#x2018;Haptics, Human Robotics, and Condition Monitoring Lab&#x2019; established at Mehran University of Engineering and Technology, Jamshoro, under the umbrella of the National Center of Robotics and Automation. This work was supported by the Higher Education Commission Pakistan (Grant No. 2(1076)/HEC/M&#x0026;E/2018/704).</p>
</fn>
<fn fn-type="conflict">
<p><bold>Conflicts of Interest:</bold> The authors declare that they have no conflicts of interest to report regarding this research work.</p>
</fn>
</fn-group>
<ref-list content-type="authoryear">
<title>References</title>
<ref id="ref-1"><label>[1]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>E.</given-names> <surname>Karim</surname></string-name>, <string-name><given-names>T.</given-names> <surname>Memon</surname></string-name> and <string-name><given-names>I.</given-names> <surname>Hussain</surname></string-name></person-group>, &#x201C;<article-title>FPGA based on-line fault diagnostic of induction motors using electrical signature analysis</article-title>,&#x201D; <source>International Journal of Information Technology</source>, vol. <volume>11</volume>, no. <issue>2</issue>, pp. <fpage>165</fpage>&#x2013;<lpage>169</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-2"><label>[2]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>M.</given-names> <surname>Cerrada</surname></string-name>, <string-name><given-names>R. V.</given-names> <surname>S&#x00E1;nchez</surname></string-name>, <string-name><given-names>C.</given-names> <surname>Li</surname></string-name>, <string-name><given-names>F.</given-names> <surname>Pacheco</surname></string-name>, <string-name><given-names>D.</given-names> <surname>Cabrera</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>A review on data-driven fault severity assessment in rolling bearings</article-title>,&#x201D; <source>Mechanical Systems and Signal Processing</source>, vol. <volume>99</volume>, pp. <fpage>169</fpage>&#x2013;<lpage>196</lpage>, <year>2018</year>.</mixed-citation></ref>
<ref id="ref-3"><label>[3]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><given-names>N.</given-names> <surname>Enshaei</surname></string-name> and <string-name><given-names>F.</given-names> <surname>Naderkhani</surname></string-name></person-group>, &#x201C;<article-title>Application of deep learning for fault diagnostic in induction machine&#x2019;s bearings</article-title>,&#x201D; in <conf-name>Proc. IEEE Int. Conf. on Prognostics and Health Management</conf-name>, <conf-loc>San Francisco, CA, USA</conf-loc>, pp. <fpage>1</fpage>&#x2013;<lpage>7</lpage>, <year>2019</year>. </mixed-citation></ref>
<ref id="ref-4"><label>[4]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>Y.</given-names> <surname>Yang</surname></string-name>, <string-name><given-names>P.</given-names> <surname>Fu</surname></string-name> and <string-name><given-names>Y.</given-names> <surname>He</surname></string-name></person-group>, &#x201C;<article-title>Bearing fault automatic classification based on deep learning</article-title>,&#x201D; <source>IEEE Access</source>, vol. <volume>6</volume>, pp. <fpage>71540</fpage>&#x2013;<lpage>71554</lpage>, <year>2018</year>.</mixed-citation></ref>
<ref id="ref-5"><label>[5]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>D. K.</given-names> <surname>Soother</surname></string-name> and <string-name><given-names>J.</given-names> <surname>Daudpoto</surname></string-name></person-group>, &#x201C;<article-title>A brief review of condition monitoring techniques for the induction motor</article-title>,&#x201D; <source>Transactions of the Canadian Society for Mechanical Engineering</source>, vol. <volume>43</volume>, no. <issue>4</issue>, pp. <fpage>499</fpage>&#x2013;<lpage>508</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-6"><label>[6]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>G.</given-names> <surname>Toh</surname></string-name> and <string-name><given-names>J.</given-names> <surname>Park</surname></string-name></person-group>, &#x201C;<article-title>Review of vibration-based structural health monitoring using deep learning</article-title>,&#x201D; <source>Applied Sciences</source>, vol. <volume>10</volume>, no. <issue>5</issue>, pp. <fpage>1</fpage>&#x2013;<lpage>24</lpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-7"><label>[7]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>M. D.</given-names> <surname>Prieto</surname></string-name>, <string-name><given-names>G.</given-names> <surname>Cirrincione</surname></string-name>, <string-name><given-names>A. G.</given-names> <surname>Espinosa</surname></string-name>, <string-name><given-names>J. A.</given-names> <surname>Ortega</surname></string-name> and <string-name><given-names>H.</given-names> <surname>Henao</surname></string-name></person-group>, &#x201C;<article-title>Bearing fault detection by a novel condition-monitoring scheme based on statistical-time features and neural networks</article-title>,&#x201D; <source>IEEE Transactions on Industrial Electronics</source>, vol. <volume>60</volume>, no. <issue>8</issue>, pp. <fpage>3398</fpage>&#x2013;<lpage>3407</lpage>, <year>2012</year>.</mixed-citation></ref>
<ref id="ref-8"><label>[8]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>A.</given-names> <surname>Malhi</surname></string-name> and <string-name><given-names>R.</given-names> <surname>Gao</surname></string-name></person-group>, &#x201C;<article-title>PCA-based feature selection scheme for machine defect classification</article-title>,&#x201D; <source>IEEE Transactions on Instrumentation and Measurement</source>, vol. <volume>53</volume>, no. <issue>6</issue>, pp. <fpage>1517</fpage>&#x2013;<lpage>1525</lpage>, <year>2004</year>.</mixed-citation></ref>
<ref id="ref-9"><label>[9]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>A. F.</given-names> <surname>Aimer</surname></string-name>, <string-name><given-names>A. H.</given-names> <surname>Boudinar</surname></string-name>, <string-name><given-names>N.</given-names> <surname>Benouzza</surname></string-name> and <string-name><given-names>A.</given-names> <surname>Bendiabdellah</surname></string-name></person-group>, &#x201C;<article-title>Bearing fault diagnosis of a PWM inverter fed-induction motor using an improved short time Fourier transform</article-title>,&#x201D; <source>Journal of Electrical Engineering and Technology</source>, vol. <volume>14</volume>, no. <issue>3</issue>, pp. <fpage>1201</fpage>&#x2013;<lpage>1210</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-10"><label>[10]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>P.</given-names> <surname>Konar</surname></string-name> and <string-name><given-names>P.</given-names> <surname>Chattopadhyay</surname></string-name></person-group>, &#x201C;<article-title>Bearing fault detection of induction motor using wavelet and support vector machines (SVMs)</article-title>,&#x201D; <source>Applied Soft Computing</source>, vol. <volume>11</volume>, no. <issue>6</issue>, pp. <fpage>4203</fpage>&#x2013;<lpage>4211</lpage>, <year>2011</year>.</mixed-citation></ref>
<ref id="ref-11"><label>[11]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>D. S.</given-names> <surname>Singh</surname></string-name> and <string-name><given-names>Q.</given-names> <surname>Zhao</surname></string-name></person-group>, &#x201C;<article-title>Pseudo-fault signal assisted EMD for fault detection and isolation in rotating machines</article-title>,&#x201D; <source>Mechanical Systems and Signal Processing</source>, vol. <volume>81</volume>, no. <issue>1971</issue>, pp. <fpage>202</fpage>&#x2013;<lpage>218</lpage>, <year>2016</year>.</mixed-citation></ref>
<ref id="ref-12"><label>[12]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>S.</given-names> <surname>Osman</surname></string-name> and <string-name><given-names>W.</given-names> <surname>Wang</surname></string-name></person-group>, &#x201C;<article-title>A morphological Hilbert&#x2013;Huang transform technique for bearing fault detection</article-title>,&#x201D; <source>IEEE Transactions on Instrumentation and Measurement</source>, vol. <volume>65</volume>, no. <issue>11</issue>, pp. <fpage>2646</fpage>&#x2013;<lpage>2656</lpage>, <year>2016</year>.</mixed-citation></ref>
<ref id="ref-13"><label>[13]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>J.</given-names> <surname>Tian</surname></string-name>, <string-name><given-names>C.</given-names> <surname>Morillo</surname></string-name>, <string-name><given-names>M. H.</given-names> <surname>Azarian</surname></string-name> and <string-name><given-names>M.</given-names> <surname>Pecht</surname></string-name></person-group>, &#x201C;<article-title>Motor bearing fault detection using spectral kurtosis-based feature extraction coupled with k-nearest neighbor distance analysis</article-title>,&#x201D; <source>IEEE Transactions on Industrial Electronics</source>, vol. <volume>63</volume>, no. <issue>3</issue>, pp. <fpage>1793</fpage>&#x2013;<lpage>1803</lpage>, <year>2015</year>.</mixed-citation></ref>
<ref id="ref-14"><label>[14]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>T.</given-names> <surname>Han</surname></string-name>, <string-name><given-names>D.</given-names> <surname>Jiang</surname></string-name>, <string-name><given-names>Q.</given-names> <surname>Zhao</surname></string-name>, <string-name><given-names>L.</given-names> <surname>Wang</surname></string-name> and <string-name><given-names>K.</given-names> <surname>Yin</surname></string-name></person-group>, &#x201C;<article-title>Comparison of random forest, artificial neural networks and support vector machine for intelligent diagnosis of rotating machinery</article-title>,&#x201D; <source>Transactions of the Institute of Measurement and Control</source>, vol. <volume>40</volume>, no. <issue>8</issue>, pp. <fpage>2681</fpage>&#x2013;<lpage>2693</lpage>, <year>2018</year>.</mixed-citation></ref>
<ref id="ref-15"><label>[15]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>X.</given-names> <surname>Yan</surname></string-name> and <string-name><given-names>M.</given-names> <surname>Jia</surname></string-name></person-group>, &#x201C;<article-title>A novel optimized SVM classification algorithm with multi-domain feature and its application to fault diagnosis of rolling bearing</article-title>,&#x201D; <source>Neurocomputing</source>, vol. <volume>313</volume>, pp. <fpage>47</fpage>&#x2013;<lpage>64</lpage>, <year>2018</year>.</mixed-citation></ref>
<ref id="ref-16"><label>[16]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>F.</given-names> <surname>Jia</surname></string-name>, <string-name><given-names>Y.</given-names> <surname>Lei</surname></string-name>, <string-name><given-names>J.</given-names> <surname>Lin</surname></string-name>, <string-name><given-names>X.</given-names> <surname>Zhou</surname></string-name> and <string-name><given-names>N.</given-names> <surname>Lu</surname></string-name></person-group>, &#x201C;<article-title>Deep neural networks: A promising tool for fault characteristic mining and intelligent diagnosis of rotating machinery with massive data</article-title>,&#x201D; <source>Mechanical Systems and Signal Processing</source>, vol. <volume>72</volume>, pp. <fpage>303</fpage>&#x2013;<lpage>315</lpage>, <year>2016</year>.</mixed-citation></ref>
<ref id="ref-17"><label>[17]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>H.</given-names> <surname>Shao</surname></string-name>, <string-name><given-names>H.</given-names> <surname>Jiang</surname></string-name>, <string-name><given-names>H.</given-names> <surname>Zhang</surname></string-name> and <string-name><given-names>T.</given-names> <surname>Liang</surname></string-name></person-group>, &#x201C;<article-title>Electric locomotive bearing fault diagnosis using a novel convolutional deep belief network</article-title>,&#x201D; <source>IEEE Transactions on Industrial Electronics</source>, vol. <volume>65</volume>, no. <issue>3</issue>, pp. <fpage>2727</fpage>&#x2013;<lpage>2736</lpage>, <year>2017</year>.</mixed-citation></ref>
<ref id="ref-18"><label>[18]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>H.</given-names> <surname>Zhu</surname></string-name>, <string-name><given-names>J.</given-names> <surname>Cheng</surname></string-name>, <string-name><given-names>C.</given-names> <surname>Zhang</surname></string-name>, <string-name><given-names>J.</given-names> <surname>Wu</surname></string-name> and <string-name><given-names>X.</given-names> <surname>Shao</surname></string-name></person-group>, &#x201C;<article-title>Stacked pruning sparse denoising autoencoder based intelligent fault diagnosis of rolling bearings</article-title>,&#x201D; <source>Applied Soft Computing</source>, vol. <volume>88</volume>, no. <issue>99</issue>, pp. <fpage>106060</fpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-19"><label>[19]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>Y.</given-names> <surname>Yuan</surname></string-name>, <string-name><given-names>G.</given-names> <surname>Ma</surname></string-name>, <string-name><given-names>C.</given-names> <surname>Cheng</surname></string-name>, <string-name><given-names>B.</given-names> <surname>Zhou</surname></string-name>, <string-name><given-names>H.</given-names> <surname>Zhao</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>A general end-to-end diagnosis framework for manufacturing systems</article-title>,&#x201D; <source>National Science Review</source>, vol. <volume>7</volume>, no. <issue>2</issue>, pp. <fpage>418</fpage>&#x2013;<lpage>429</lpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-20"><label>[20]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><given-names>K.</given-names> <surname>Dev</surname></string-name>, <string-name><given-names>S. A.</given-names> <surname>Khowaja</surname></string-name>, <string-name><given-names>A.</given-names> <surname>Jaiswal</surname></string-name>, <string-name><given-names>A. S.</given-names> <surname>Bist</surname></string-name>, <string-name><given-names>V.</given-names> <surname>Siani</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>Triage of potential COVID-19 patients from chest x-ray images using hierarchical convolutional networks</article-title>,&#x201D; <comment>arXiv preprint arXiv:2011.00618</comment>, <year>2020</year>. [Online]. Available: <ext-link ext-link-type="uri" xlink:href="https://arxiv.org/abs/2011.00618">https://arxiv.org/abs/2011.00618</ext-link>, <comment>(Accessed 02 November 2020)</comment>.</mixed-citation></ref>
<ref id="ref-21"><label>[21]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>D. T.</given-names> <surname>Hoang</surname></string-name> and <string-name><given-names>H. J.</given-names> <surname>Kang</surname></string-name></person-group>, &#x201C;<article-title>A survey on deep learning based bearing fault diagnosis</article-title>,&#x201D; <source>Neurocomputing</source>, vol. <volume>335</volume>, no. <issue>7</issue>, pp. <fpage>327</fpage>&#x2013;<lpage>335</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-22"><label>[22]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>S.</given-names> <surname>Khan</surname></string-name> and <string-name><given-names>T.</given-names> <surname>Yairi</surname></string-name></person-group>, &#x201C;<article-title>A review on the application of deep learning in system health management</article-title>,&#x201D; <source>Mechanical Systems and Signal Processing</source>, vol. <volume>107</volume>, no. <issue>2</issue>, pp. <fpage>241</fpage>&#x2013;<lpage>265</lpage>, <year>2018</year>.</mixed-citation></ref>
<ref id="ref-23"><label>[23]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>T.</given-names> <surname>Chen</surname></string-name>, <string-name><given-names>Z.</given-names> <surname>Wang</surname></string-name>, <string-name><given-names>X.</given-names> <surname>Yang</surname></string-name> and <string-name><given-names>K.</given-names> <surname>Jiang</surname></string-name></person-group>, &#x201C;<article-title>A deep capsule neural network with stochastic delta rule for bearing fault diagnosis on raw vibration signals</article-title>,&#x201D; <source>Measurement</source>, vol. <volume>148</volume>, pp. <fpage>106857</fpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-24"><label>[24]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><given-names>M.</given-names> <surname>Ma</surname></string-name>, <string-name><given-names>X.</given-names> <surname>Chen</surname></string-name>, <string-name><given-names>S.</given-names> <surname>Wang</surname></string-name>, <string-name><given-names>Y.</given-names> <surname>Liu</surname></string-name> and <string-name><given-names>W.</given-names> <surname>Li</surname></string-name></person-group>, &#x201C;<article-title>Bearing degradation assessment based on Weibull distribution and deep belief network</article-title>,&#x201D; in <conf-name>Proc. Int. Symp. on Flexible Automation</conf-name>, <conf-loc>Cleveland, OH, USA</conf-loc>, pp. <fpage>382</fpage>&#x2013;<lpage>385</lpage>, <year>2016</year>.</mixed-citation></ref>
<ref id="ref-25"><label>[25]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>X.</given-names> <surname>Guo</surname></string-name>, <string-name><given-names>L.</given-names> <surname>Chen</surname></string-name> and <string-name><given-names>C.</given-names> <surname>Shen</surname></string-name></person-group>, &#x201C;<article-title>Hierarchical adaptive deep convolution neural network and its application to bearing fault diagnosis</article-title>,&#x201D; <source>Measurement</source>, vol. <volume>93</volume>, no. <issue>4</issue>, pp. <fpage>490</fpage>&#x2013;<lpage>502</lpage>, <year>2016</year>.</mixed-citation></ref>
<ref id="ref-26"><label>[26]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>L.</given-names> <surname>Yu</surname></string-name>, <string-name><given-names>J.</given-names> <surname>Qu</surname></string-name>, <string-name><given-names>F.</given-names> <surname>Gao</surname></string-name> and <string-name><given-names>Y.</given-names> <surname>Tian</surname></string-name></person-group>, &#x201C;<article-title>A novel hierarchical algorithm for bearing fault diagnosis based on stacked LSTM</article-title>,&#x201D; <source>Shock and Vibration</source>, vol. <volume>2019</volume>, pp. <fpage>1</fpage>&#x2013;<lpage>10</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-27"><label>[27]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><given-names>S.</given-names> <surname>Deng</surname></string-name>, <string-name><given-names>Z.</given-names> <surname>Cheng</surname></string-name>, <string-name><given-names>C.</given-names> <surname>Li</surname></string-name>, <string-name><given-names>X.</given-names> <surname>Yao</surname></string-name>, <string-name><given-names>Z.</given-names> <surname>Chen</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>Rolling bearing fault diagnosis based on deep Boltzmann machines</article-title>,&#x201D; in <conf-name>Proc. Prognostics and System Health Management Conf.</conf-name>, <publisher-loc>Chengdu</publisher-loc>, pp. <fpage>1</fpage>&#x2013;<lpage>6</lpage>, <year>2016</year>.</mixed-citation></ref>
<ref id="ref-28"><label>[28]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>H.</given-names> <surname>Shao</surname></string-name>, <string-name><given-names>H.</given-names> <surname>Jiang</surname></string-name>, <string-name><given-names>X.</given-names> <surname>Zhang</surname></string-name> and <string-name><given-names>M.</given-names> <surname>Niu</surname></string-name></person-group>, &#x201C;<article-title>Rolling bearing fault diagnosis using an optimization deep belief network</article-title>,&#x201D; <source>Measurement Science and Technology</source>, vol. <volume>26</volume>, no. <issue>11</issue>, pp. <fpage>115002</fpage>, <year>2015</year>.</mixed-citation></ref>
<ref id="ref-29"><label>[29]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>Z.</given-names> <surname>Chen</surname></string-name>, <string-name><given-names>A.</given-names> <surname>Mauricio</surname></string-name>, <string-name><given-names>W.</given-names> <surname>Li</surname></string-name> and <string-name><given-names>K.</given-names> <surname>Gryllias</surname></string-name></person-group>, &#x201C;<article-title>A deep learning method for bearing fault diagnosis based on cyclic spectral coherence and convolutional neural networks</article-title>,&#x201D; <source>Mechanical Systems and Signal Processing</source>, vol. <volume>140</volume>, pp. <fpage>106683</fpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-30"><label>[30]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>Q.</given-names> <surname>Zhou</surname></string-name>, <string-name><given-names>Y.</given-names> <surname>Li</surname></string-name>, <string-name><given-names>Y.</given-names> <surname>Tian</surname></string-name> and <string-name><given-names>L.</given-names> <surname>Jiang</surname></string-name></person-group>, &#x201C;<article-title>A novel method based on nonlinear auto-regression neural network and convolutional neural network for imbalanced fault diagnosis of rotating machinery</article-title>,&#x201D; <source>Measurement</source>, vol. <volume>161</volume>, pp. <fpage>107880</fpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-31"><label>[31]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>X.</given-names> <surname>Guo</surname></string-name>, <string-name><given-names>L.</given-names> <surname>Chen</surname></string-name> and <string-name><given-names>C.</given-names> <surname>Shen</surname></string-name></person-group>, &#x201C;<article-title>Hierarchical adaptive deep convolution neural network and its application to bearing fault diagnosis</article-title>,&#x201D; <source>Measurement</source>, vol. <volume>93</volume>, no. <issue>4</issue>, pp. <fpage>490</fpage>&#x2013;<lpage>502</lpage>, <year>2016</year>.</mixed-citation></ref>
<ref id="ref-32"><label>[32]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>M.</given-names> <surname>Xia</surname></string-name>, <string-name><given-names>T.</given-names> <surname>Li</surname></string-name>, <string-name><given-names>L.</given-names> <surname>Xu</surname></string-name>, <string-name><given-names>L.</given-names> <surname>Liu</surname></string-name> and <string-name><given-names>C. W.</given-names> <surname>de Silva</surname></string-name></person-group>, &#x201C;<article-title>Fault diagnosis for rotating machinery using multiple sensors and convolutional neural networks</article-title>,&#x201D; <source>IEEE/ASME Transactions on Mechatronics</source>, vol. <volume>23</volume>, no. <issue>1</issue>, pp. <fpage>101</fpage>&#x2013;<lpage>110</lpage>, <year>2017</year>.</mixed-citation></ref>
<ref id="ref-33"><label>[33]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>X.</given-names> <surname>Li</surname></string-name>, <string-name><given-names>W.</given-names> <surname>Zhang</surname></string-name>, <string-name><given-names>Q.</given-names> <surname>Ding</surname></string-name> and <string-name><given-names>J. Q.</given-names> <surname>Sun</surname></string-name></person-group>, &#x201C;<article-title>Intelligent rotating machinery fault diagnosis based on deep learning using data augmentation</article-title>,&#x201D; <source>Journal of Intelligent Manufacturing</source>, vol. <volume>31</volume>, no. <issue>2</issue>, pp. <fpage>433</fpage>&#x2013;<lpage>452</lpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-34"><label>[34]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>Y.</given-names> <surname>Zhang</surname></string-name>, <string-name><given-names>Z.</given-names> <surname>Zhang</surname></string-name>, <string-name><given-names>Y.</given-names> <surname>Zhang</surname></string-name>, <string-name><given-names>J.</given-names> <surname>Bao</surname></string-name>, <string-name><given-names>Y.</given-names> <surname>Zhang</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>Human activity recognition based on motion sensor using U-Net</article-title>,&#x201D; <source>IEEE Access</source>, vol. <volume>7</volume>, pp. <fpage>75213</fpage>&#x2013;<lpage>75226</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-35"><label>[35]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>S.</given-names> <surname>Wei</surname></string-name>, <string-name><given-names>H.</given-names> <surname>Zhang</surname></string-name>, <string-name><given-names>C.</given-names> <surname>Wang</surname></string-name>, <string-name><given-names>Y.</given-names> <surname>Wang</surname></string-name> and <string-name><given-names>L.</given-names> <surname>Xu</surname></string-name></person-group>, &#x201C;<article-title>Multi-temporal SAR data large-scale crop mapping based on U-Net model</article-title>,&#x201D; <source>Remote Sensing</source>, vol. <volume>11</volume>, no. <issue>1</issue>, pp. <fpage>1</fpage>&#x2013;<lpage>18</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-36"><label>[36]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>Y.</given-names> <surname>Luo</surname></string-name>, <string-name><given-names>X.</given-names> <surname>Li</surname></string-name>, <string-name><given-names>C.</given-names> <surname>Luo</surname></string-name>, <string-name><given-names>F.</given-names> <surname>Wang</surname></string-name>, <string-name><given-names>X.</given-names> <surname>Wu</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>Tissue segmentation in nasopharyngeal CT images using two stage learning</article-title>,&#x201D; <source>Computers, Materials &#x0026; Continua</source>, vol. <volume>65</volume>, no. <issue>2</issue>, pp. <fpage>1771</fpage>&#x2013;<lpage>1780</lpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-37"><label>[37]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><given-names>O.</given-names> <surname>Ronneberger</surname></string-name>, <string-name><given-names>P.</given-names> <surname>Fischer</surname></string-name> and <string-name><given-names>T.</given-names> <surname>Brox</surname></string-name></person-group>, &#x201C;<article-title>U-net: Convolutional networks for biomedical image segmentation</article-title>,&#x201D; in <conf-name>Proc. Int. Conf. on Medical Image Computing and Computer-Assisted Intervention</conf-name>, <conf-loc>Munich, Germany</conf-loc>, pp. <fpage>234</fpage>&#x2013;<lpage>241</lpage>, <year>2015</year>.</mixed-citation></ref>
<ref id="ref-38"><label>[38]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><given-names>K.</given-names> <surname>Loparo</surname></string-name></person-group>, &#x201C;<article-title>Case Western Reserve University Bearing Data Center website</article-title>,&#x201D; <year>2012</year>. <comment>[Online]. Available: <ext-link ext-link-type="uri" xlink:href="https://csegroups.case.edu/bearingdatacenter/pages/welcome-case-western-reserve-university-bearing-data-center-website">https://csegroups.case.edu/bearingdatacenter/pages/welcome-case-western-reserve-university-bearing-data-center-website</ext-link></comment>.</mixed-citation></ref>
<ref id="ref-39"><label>[39]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>D.</given-names> <surname>Nguyen</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Kang</surname></string-name>, <string-name><given-names>C. H.</given-names> <surname>Kim</surname></string-name> and <string-name><given-names>J. M.</given-names> <surname>Kim</surname></string-name></person-group>, &#x201C;<article-title>Highly reliable state monitoring system for induction motors using dominant features in a two-dimension vibration signal</article-title>,&#x201D; <source>New Review of Hypermedia and Multimedia</source>, vol. <volume>19</volume>, no. <issue>3&#x2013;4</issue>, pp. <fpage>248</fpage>&#x2013;<lpage>258</lpage>, <year>2013</year>.</mixed-citation></ref>
<ref id="ref-40"><label>[40]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>D. T.</given-names> <surname>Hoang</surname></string-name> and <string-name><given-names>H. J.</given-names> <surname>Kang</surname></string-name></person-group>, &#x201C;<article-title>Rolling element bearing fault diagnosis using convolutional neural network and vibration image</article-title>,&#x201D; <source>Cognitive Systems Research</source>, vol. <volume>53</volume>, no. <issue>6</issue>, pp. <fpage>42</fpage>&#x2013;<lpage>50</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-41"><label>[41]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>L.</given-names> <surname>Wan</surname></string-name>, <string-name><given-names>Y.</given-names> <surname>Chen</surname></string-name>, <string-name><given-names>H.</given-names> <surname>Li</surname></string-name> and <string-name><given-names>C.</given-names> <surname>Li</surname></string-name></person-group>, &#x201C;<article-title>Rolling-element bearing fault diagnosis using improved LeNet-5 network</article-title>,&#x201D; <source>Sensors</source>, vol. <volume>20</volume>, no. <issue>6</issue>, pp. <fpage>1</fpage>&#x2013;<lpage>23</lpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-42"><label>[42]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>J.</given-names> <surname>Duan</surname></string-name>, <string-name><given-names>T.</given-names> <surname>Shi</surname></string-name>, <string-name><given-names>H.</given-names> <surname>Zhou</surname></string-name>, <string-name><given-names>J.</given-names> <surname>Xuan</surname></string-name> and <string-name><given-names>S. J.</given-names> <surname>Wang</surname></string-name></person-group>, &#x201C;<article-title>A novel ResNet-based model structure and its applications in machine health monitoring</article-title>,&#x201D; <source>Journal of Vibration and Control</source>, vol. <volume>2020</volume>, pp. <fpage>1</fpage>&#x2013;<lpage>15</lpage>, <year>2020</year>.</mixed-citation></ref>
</ref-list>
</back>
</article>
