<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.1 20151215//EN" "http://jats.nlm.nih.gov/publishing/1.1/JATS-journalpublishing1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="1.1">
<front>
<journal-meta>
<journal-id journal-id-type="pmc">CMC</journal-id>
<journal-id journal-id-type="nlm-ta">CMC</journal-id>
<journal-id journal-id-type="publisher-id">CMC</journal-id>
<journal-title-group>
<journal-title>Computers, Materials &#x0026; Continua</journal-title>
</journal-title-group>
<issn pub-type="epub">1546-2226</issn>
<issn pub-type="ppub">1546-2218</issn>
<publisher>
<publisher-name>Tech Science Press</publisher-name>
<publisher-loc>USA</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">18040</article-id>
<article-id pub-id-type="doi">10.32604/cmc.2021.018040</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Pseudo Zernike Moment and Deep Stacked Sparse Autoencoder for COVID-19 Diagnosis</article-title>
<alt-title alt-title-type="left-running-head">Pseudo Zernike Moment and Deep Stacked Sparse Autoencoder for COVID-19 Diagnosis</alt-title>
<alt-title alt-title-type="right-running-head">Pseudo Zernike Moment and Deep Stacked Sparse Autoencoder for COVID-19 Diagnosis</alt-title>
</title-group>
<contrib-group content-type="authors">
<contrib id="author-1" contrib-type="author">
<name name-style="western">
<surname>Zhang</surname>
<given-names>Yu-Dong</given-names>
</name>
<xref ref-type="aff" rid="aff-1">1</xref>
</contrib>
<contrib id="author-2" contrib-type="author">
<name name-style="western">
<surname>Khan</surname>
<given-names>Muhammad Attique</given-names>
</name>
<xref ref-type="aff" rid="aff-2">2</xref>
</contrib>
<contrib id="author-3" contrib-type="author">
<name name-style="western">
<surname>Zhu</surname>
<given-names>Ziquan</given-names>
</name>
<xref ref-type="aff" rid="aff-3">3</xref>
</contrib>
<contrib id="author-4" contrib-type="author" corresp="yes">
<name name-style="western">
<surname>Wang</surname>
<given-names>Shui-Hua</given-names>
</name>
<xref ref-type="aff" rid="aff-4">4</xref>
<email>shuihuawang@ieee.org</email>
</contrib>
<aff id="aff-1"><label>1</label><institution>School of Informatics, University of Leicester</institution>, <addr-line>Leicester, LE1 7RH</addr-line>, <country>UK</country></aff>
<aff id="aff-2"><label>2</label><institution>Department of Computer Science, HITEC University Taxila</institution>, <addr-line>Taxila</addr-line>, <country>Pakistan</country></aff>
<aff id="aff-3"><label>3</label><institution>Science in Civil Engineering, University of Florida</institution>, <addr-line>Gainesville, Florida, FL 32608, Gainesville</addr-line>, <country>USA</country></aff>
<aff id="aff-4"><label>4</label><institution>School of Mathematics and Actuarial Science, University of Leicester</institution>, <addr-line>LE1 7RH</addr-line>, <country>UK</country></aff>
</contrib-group>
<author-notes>
<corresp id="cor1">&#x002A;Corresponding Author: Shui-Hua Wang. Email: <email>shuihuawang@ieee.org</email></corresp>
</author-notes>
<pub-date pub-type="epub" date-type="pub" iso-8601-date="2021-08-23">
<day>23</day>
<month>08</month>
<year>2021</year>
</pub-date>
<volume>69</volume>
<issue>3</issue>
<fpage>3145</fpage>
<lpage>3162</lpage>
<history>
<date date-type="received">
<day>22</day>
<month>2</month>
<year>2021</year>
</date>
<date date-type="accepted">
<day>07</day>
<month>4</month>
<year>2021</year>
</date>
</history>
<permissions>
<copyright-statement>&#x00A9; 2021 Zhang et al.</copyright-statement>
<copyright-year>2021</copyright-year>
<copyright-holder>Zhang et al.</copyright-holder>
<license xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>This work is licensed under a <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution 4.0 International License</ext-link>, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
</license>
</permissions>
<self-uri content-type="pdf" xlink:href="TSP_CMC_18040.pdf"></self-uri>
<abstract>
<p>(<italic>Aim</italic>) COVID-19 is an ongoing infectious disease. It has caused more than 107.45 m confirmed cases and 2.35 m deaths till 11/Feb/2021. Traditional computer vision methods have achieved promising results on the automatic smart diagnosis. (<italic>Method</italic>) This study aims to propose a novel deep learning method that can obtain better performance. We use the pseudo-Zernike moment (PZM), derived from Zernike moment, as the extracted features. Two settings are introducing: (i) image plane over unit circle; and (ii) image plane inside the unit circle. Afterward, we use a deep-stacked sparse autoencoder (DSSAE) as the classifier. Besides, multiple-way data augmentation is chosen to overcome overfitting. The multiple-way data augmentation is based on Gaussian noise, salt-and-pepper noise, speckle noise, horizontal and vertical shear, rotation, Gamma correction, random translation and scaling. (<italic>Results</italic>) 10 runs of 10-fold cross validation shows that our PZM-DSSAE method achieves a sensitivity of 92.06% &#x00B1; 1.54%, a specificity of 92.56% &#x00B1; 1.06%, a precision of 92.53% &#x00B1; 1.03%, and an accuracy of 92.31% &#x00B1; 1.08%. Its F1 score, MCC, and FMI arrive at 92.29% &#x00B1;1.10%, 84.64% &#x00B1; 2.15%, and 92.29% &#x00B1; 1.10%, respectively. The AUC of our model is 0.9576. (<italic>Conclusion</italic>) We demonstrate &#x201C;image plane over unit circle&#x201D; can get better results than &#x201C;image plane inside a unit circle.&#x201D; Besides, this proposed PZM-DSSAE model is better than eight state-of-the-art approaches.</p>
</abstract>
<kwd-group kwd-group-type="author">
<kwd>Pseudo Zernike moment</kwd>
<kwd>stacked sparse autoencoder</kwd>
<kwd>deep learning</kwd>
<kwd>COVID-19</kwd>
<kwd>multiple-way data augmentation</kwd>
<kwd>medical image analysis</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec id="s1">
<label>1</label>
<title>Introduction</title>
<p>COVID-19 has caused more than 107.45 m confirmed cases and 2.35 m deaths till 11/Feb/2021 in about 192 countries/regions and 26 cruise/naval ships [<xref ref-type="bibr" rid="ref-1">1</xref>]. <?A3B2 "fig1",5,"anchor"?><xref ref-type="fig" rid="fig-1">Fig. 1</xref> shows the top 10 countries of cumulative confirmed cases and deaths, respectively. The main symptoms of COVID-19 are low fever, a new and ongoing cough, a loss or change to taste and smell [<xref ref-type="bibr" rid="ref-2">2</xref>]. In the UK, three vaccines are formally approved as Pfizer/BioNTech, Oxford/AstraZeneca, and Moderna. Two COVID-19 diagnosis methods are available. The former is viral testing to test the existence of viral RNA fragments [<xref ref-type="bibr" rid="ref-3">3</xref>]. The swab test shortcomings are two folds: (i) the swab samples may be contaminated, and (ii) it needs to wait from several hours to several days to get the test results. The latter is chest imaging. There are two main chest imaging available: chest computed tomography (CCT) [<xref ref-type="bibr" rid="ref-4">4</xref>] and chest X-ray (CXR) [<xref ref-type="bibr" rid="ref-5">5</xref>].</p>
<fig id="fig-1">
<label>Figure 1</label>
<caption>
<title>Data till 11/Feb/2021 (a) Cumulative confirmed cases (b) Cumulative deaths</title>
</caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="CMC_18040-fig-1.png"/>
</fig>
<p>CCT is one of the best chest imaging [<xref ref-type="bibr" rid="ref-6">6</xref>] techniques since it provides the finest resolution and can recognize extremely small nodules in the chest region. CCT employs computer-processed combinations of multiple X-ray observations taken from different angles [<xref ref-type="bibr" rid="ref-7">7</xref>] to produce high-quality 3D tomographic images (virtual slices). In contrast, CXR only provides one 2D image, which performs poorly on soft tissue contrast. This study focuses on the CCT images [<xref ref-type="bibr" rid="ref-8">8</xref>].</p>
<p>Currently, numerous studies are working on using machine learning (ML) and deep learning (DL) technologies [<xref ref-type="bibr" rid="ref-9">9</xref>,<xref ref-type="bibr" rid="ref-10">10</xref>]. For example, Guo et al. [<xref ref-type="bibr" rid="ref-11">11</xref>] employed ResNet-18 for classifying thyroid images. Lu [<xref ref-type="bibr" rid="ref-12">12</xref>] utilized an extreme learning machine (ELM) trained by bat algorithm (BA). Those two approaches were not developing for COVID-19, but they can be transferred to the COVID-19 dataset easily and used as comparison basis approaches in our experiments. For COVID-19 researches, Yao [<xref ref-type="bibr" rid="ref-13">13</xref>] proposed a wavelet entropy biogeography-based optimization (WEBBO) method for COVID-19 diagnosis. Wu [<xref ref-type="bibr" rid="ref-14">14</xref>] presented three-segment biogeography-based optimization (3SBBO) for recognizing COVID-19 patients. Wang et al. [<xref ref-type="bibr" rid="ref-15">15</xref>] presented a DeCovNet. Their accuracy achieved 90.1%. El-kenawy et al. [<xref ref-type="bibr" rid="ref-16">16</xref>] presented a novel feature selection voting classifier (FSVC) method for COVID-19 classification. Yu et al. [<xref ref-type="bibr" rid="ref-17">17</xref>] presented a GoogleNet-COD method to detect COVID-19. Chen [<xref ref-type="bibr" rid="ref-18">18</xref>] designed a gray-level co-occurrence matrix and support vector machine (GLCMSVM) method to classify COVID-19 images [<xref ref-type="bibr" rid="ref-19">19</xref>].</p>
<p>To further improve the performance of automatic COVID-19 diagnosis, this paper proposes a novel method that combines the traditional ML approach with the recent DL approach. We use the pseudo-Zernike moment (PZM) as the extracted features, and we use a deep-stacked sparse autoencoder (<italic>i.e</italic>., one of the deep neural networks) as the classifier. The combination achieves excellent results that overperform eight state-of-the-art approaches. The novelties of our paper lie in the following aspects
<list list-type="bullet">
<list-item>
<p>We are the first to apply a pseudo-Zernike moment to COVID-19 image analysis.</p></list-item>
<list-item>
<p>Deep stacked sparse autoencoder (DSSAE) works better than traditional classifiers.</p></list-item>
<list-item>
<p>Our proposed &#x201C;PZM-DSSAE&#x201D; model is better than eight state-of-the-art approaches.</p></list-item>
</list></p>
</sec>
<sec id="s2">
<label>2</label>
<title>Dataset</title>
<p>We use the dataset in reference [<xref ref-type="bibr" rid="ref-20">20</xref>], which contains 148 COVID-19 patients and 148 healthy control (HC) subjects. Slice level selection [<xref ref-type="bibr" rid="ref-20">20</xref>] was employed to generate <inline-formula id="ieqn-1"><mml:math id="mml-ieqn-1"><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>320</mml:mn></mml:math></inline-formula> COVID-19 images and <inline-formula id="ieqn-2"><mml:math id="mml-ieqn-2"><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>320</mml:mn></mml:math></inline-formula> HC images. The raw images are with sizes of <inline-formula id="ieqn-3"><mml:math id="mml-ieqn-3"><mml:mn>1024</mml:mn><mml:mo>&#x00D7;</mml:mo><mml:mn>1024</mml:mn><mml:mo>&#x00D7;</mml:mo><mml:mn>3</mml:mn></mml:math></inline-formula>. A four-step preprocessing was used on this dataset. First, the images are converted to grayscale to save storage amount. Second, histogram stretch is used to enhance the contrast. Third, border pixels are removed, which contains the text and ruler in the right side, and the check-up bed in the bottom. Finally, downsampling to width <inline-formula id="ieqn-4"><mml:math id="mml-ieqn-4"><mml:mi>W</mml:mi></mml:math></inline-formula> and height <inline-formula id="ieqn-5"><mml:math id="mml-ieqn-5"><mml:mi>H</mml:mi></mml:math></inline-formula> is carried out to further reduce the storage of the dataset. <?A3B2 "fig2",5,"anchor"?><xref ref-type="fig" rid="fig-2">Fig. 2</xref> display one example of COVID-19 patient and one sample of HC subject. Algorithm 1 itemizes the pseudocode of preprocessing.</p>
<table-wrap id="table-6">
<caption>
<title>Algorithm 1: Pseudocode of preprocessing</title>
</caption>
<table>
<colgroup>
<col/>
</colgroup>
<tbody>
<tr>
<td>Input Import <inline-formula id="ieqn-6"><mml:math id="mml-ieqn-6"><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> raw COVID-19 and <inline-formula id="ieqn-7"><mml:math id="mml-ieqn-7"><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> raw HC images.</td>
</tr>
<tr>
<td>Step 1 Grayscale. All the images were converted to grayscale.</td>
</tr>
<tr>
<td>Step 2 Histogram Stretch: The minimum and maximum grayscale values are mapped to <inline-formula id="ieqn-8"><mml:math id="mml-ieqn-8"><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>g</mml:mi><mml:mrow><mml:mi>m</mml:mi><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msub><mml:mi>g</mml:mi><mml:mrow><mml:mi>m</mml:mi><mml:mi>a</mml:mi><mml:mi>x</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>.</td>
</tr>
<tr>
<td>Step 3 Crop: <inline-formula id="ieqn-9"><mml:math id="mml-ieqn-9"><mml:mrow><mml:mo>(</mml:mo><mml:msubsup><mml:mi>m</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msubsup><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msubsup><mml:mi>m</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msubsup><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msubsup><mml:mi>m</mml:mi><mml:mrow><mml:mn>3</mml:mn></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msubsup><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msubsup><mml:mi>m</mml:mi><mml:mrow><mml:mn>4</mml:mn></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msubsup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> Pixels are removed from the top, bottom, left, and right sides.</td>
</tr>
<tr>
<td>Step 4 Downsampling: All images are downscaled to the size of <inline-formula id="ieqn-10"><mml:math id="mml-ieqn-10"><mml:mrow><mml:mo>(</mml:mo><mml:mi>W</mml:mi><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mi>H</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>.</td>
</tr>
<tr>
<td>Output <inline-formula id="ieqn-11"><mml:math id="mml-ieqn-11"><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> preprocessed COVID-19 image and <inline-formula id="ieqn-12"><mml:math id="mml-ieqn-12"><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> preprocessed HC images.</td>
</tr>
</tbody>
</table>
</table-wrap>
<fig id="fig-2">
<label>Figure 2</label>
<caption>
<title>Example of preprocessed images (a) COVID-19 (b) HC</title>
</caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="CMC_18040-fig-2.png"/>
</fig>
</sec>
<sec id="s3">
<label>3</label>
<title>Methodology</title>
<sec id="s3_1">
<label>3.1</label>
<title>Pseudo Zernike Moment</title>
<p><xref ref-type="table" rid="table-1">Tab. 1</xref> displays the abbreviation list Image moment was firstly introduced by Hu [<xref ref-type="bibr" rid="ref-21">21</xref>], who used geometric moments to generate a set of invariants. Hu&#x2019;s moments have been widely used in knee osteoarthritis classification [<xref ref-type="bibr" rid="ref-22">22</xref>], brain tumor classification [<xref ref-type="bibr" rid="ref-23">23</xref>], etc. However, geometric moments are sensitive to noise. Thus, Teague [<xref ref-type="bibr" rid="ref-24">24</xref>] introduced Zernike moments (ZMs) based on orthogonal Zernike polynomials. The orthogonal moments have been proven to be more robust in noisy conditions, and they can achieve a near-zero value of redundancy measure [<xref ref-type="bibr" rid="ref-25">25</xref>].</p>
<table-wrap id="table-1">
<label>Table 1</label>
<caption>
<title>Abbreviation list</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Abbreviation</th>
<th>Meaning</th>
<th>Abbreviation</th>
<th>Meaning</th>
</tr>
</thead>
<tbody>
<tr>
<td>AE</td>
<td>Autoencoder</td>
<td>HC</td>
<td>Healthy control</td>
</tr>
<tr>
<td>AUC</td>
<td>Area under the curve</td>
<td>IP</td>
<td>Image plane</td>
</tr>
<tr>
<td>CCT</td>
<td>Chest computed tomography</td>
<td>MCC</td>
<td>Matthews correlation coefficient</td>
</tr>
<tr>
<td>CM</td>
<td>Confusion matrix</td>
<td>ML</td>
<td>Machine learning</td>
</tr>
<tr>
<td>CXR</td>
<td>Chest X-ray</td>
<td>PZM</td>
<td>Pseudo Zernike moment</td>
</tr>
<tr>
<td>DL</td>
<td>Deep learning</td>
<td>ROC</td>
<td>Receiver operating characteristic</td>
</tr>
<tr>
<td>DSSAE</td>
<td>Deep stacked sparse autoencoder</td>
<td>UC</td>
<td>Unit circle</td>
</tr>
<tr>
<td>FMI</td>
<td>Fowlkes&#x2013;Mallows index</td>
<td>ZM</td>
<td>Zernike moment</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Later, pseudo Zernike moment (PZM) is derived from Zernike moment. PZMs have been proven to give better performances than other moment functions such as Hu moments, Zernike moments, etc. For example, for an order <inline-formula id="ieqn-13"><mml:math id="mml-ieqn-13"><mml:mi>p</mml:mi></mml:math></inline-formula>, there are <inline-formula id="ieqn-14"><mml:math id="mml-ieqn-14"><mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mi>p</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> linearly independent pseudo-Zernike polynomials of orders <inline-formula id="ieqn-15"><mml:math id="mml-ieqn-15"><mml:mo>&#x2264;</mml:mo><mml:mi>p</mml:mi></mml:math></inline-formula>, while there are only <inline-formula id="ieqn-16"><mml:math id="mml-ieqn-16"><mml:mstyle displaystyle="true" scriptlevel="0"><mml:mfrac><mml:mn>1</mml:mn><mml:mn>2</mml:mn></mml:mfrac><mml:mrow><mml:mo>(</mml:mo><mml:mi>p</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>p</mml:mi><mml:mo>+</mml:mo><mml:mn>2</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:mstyle></mml:math></inline-formula> Zernike polynomials. Hence, PZM is more expressive and offers more feature vectors than ZM.</p>
<p>The kernel of PZMs is a set of orthogonal pseudo-Zernike polynomials defined over the polar coordinate inside a unit circle (UC). The 2D PZM of order <inline-formula id="ieqn-17"><mml:math id="mml-ieqn-17"><mml:mi>p</mml:mi></mml:math></inline-formula> with repetition <inline-formula id="ieqn-18"><mml:math id="mml-ieqn-18"><mml:mi>q</mml:mi></mml:math></inline-formula> of an image <inline-formula id="ieqn-19"><mml:math id="mml-ieqn-19"><mml:mi>g</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mi>&#x03B8;</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is defined as [<xref ref-type="bibr" rid="ref-26">26</xref>]</p>
<p><disp-formula id="eqn-1">
<label>(1)</label>
<mml:math id="mml-eqn-1" display="block"><mml:msub><mml:mi>Z</mml:mi><mml:mrow><mml:mi>p</mml:mi><mml:mi>q</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mi>p</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>&#x03C0;</mml:mi></mml:mfrac><mml:msubsup><mml:mo>&#x222B;</mml:mo><mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mi>&#x03C0;</mml:mi></mml:mrow><mml:mrow><mml:mi>&#x03C0;</mml:mi></mml:mrow></mml:msubsup><mml:msubsup><mml:mo>&#x222B;</mml:mo><mml:mrow><mml:mn>0</mml:mn></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msubsup><mml:msubsup><mml:mi>W</mml:mi><mml:mrow><mml:mi>p</mml:mi><mml:mi>q</mml:mi></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msubsup><mml:mrow><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mi>&#x03B8;</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mi>g</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mi>&#x03B8;</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mi mathvariant="italic">r</mml:mi></mml:mrow><mml:mspace width="thinmathspace" /><mml:mrow><mml:mi mathvariant="normal">d</mml:mi></mml:mrow><mml:mi>r</mml:mi><mml:mspace width="thinmathspace" /><mml:mrow><mml:mi mathvariant="normal">d</mml:mi></mml:mrow><mml:mi>&#x03B8;</mml:mi><mml:mo>,</mml:mo></mml:math>
</disp-formula></p>
<p>where the pseudo-Zernike polynomials <inline-formula id="ieqn-20"><mml:math id="mml-ieqn-20"><mml:msub><mml:mi>W</mml:mi><mml:mrow><mml:mi>p</mml:mi><mml:mi>q</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mi>&#x03B8;</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> of order <inline-formula id="ieqn-21"><mml:math id="mml-ieqn-21"><mml:mi>p</mml:mi></mml:math></inline-formula> are defined as</p>
<p><disp-formula id="eqn-2">
<label>(2)</label>
<mml:math id="mml-eqn-2" display="block"><mml:msub><mml:mi>W</mml:mi><mml:mrow><mml:mi>p</mml:mi><mml:mi>q</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mi>&#x03B8;</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mi>p</mml:mi><mml:mi>q</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:msup><mml:mi>e</mml:mi><mml:mrow><mml:mi>j</mml:mi><mml:mi>q</mml:mi><mml:mi>&#x03B8;</mml:mi></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:mspace width="1em" /><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:msqrt><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:msqrt></mml:math>
</disp-formula></p>
<p><disp-formula id="eqn-3">
<label>(3)</label>
<mml:math id="mml-eqn-3" display="block"><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mi>p</mml:mi><mml:mi>q</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn>0</mml:mn></mml:mrow><mml:mrow><mml:mi>p</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mrow><mml:mo>|</mml:mo><mml:mi>q</mml:mi><mml:mo>|</mml:mo></mml:mrow></mml:mrow></mml:munderover><mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msup><mml:mfrac><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mn>2</mml:mn><mml:mi>p</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn><mml:mo>&#x2212;</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>!</mml:mo></mml:mrow><mml:mrow><mml:mi>k</mml:mi><mml:mo>!</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mi>p</mml:mi><mml:mo>+</mml:mo><mml:mrow><mml:mo>|</mml:mo><mml:mi>q</mml:mi><mml:mo>|</mml:mo></mml:mrow><mml:mo>+</mml:mo><mml:mn>1</mml:mn><mml:mo>&#x2212;</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>!</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mi>p</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mrow><mml:mo>|</mml:mo><mml:mi>q</mml:mi><mml:mo>|</mml:mo></mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>!</mml:mo></mml:mrow></mml:mfrac><mml:msup><mml:mi>r</mml:mi><mml:mrow><mml:mi>p</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mi>k</mml:mi></mml:mrow></mml:msup></mml:math>
</disp-formula></p>
<p>where <inline-formula id="ieqn-22"><mml:math id="mml-ieqn-22"><mml:mn>0</mml:mn><mml:mo>&#x2264;</mml:mo><mml:mrow><mml:mo>|</mml:mo><mml:mi>q</mml:mi><mml:mo>|</mml:mo></mml:mrow><mml:mo>&#x2264;</mml:mo><mml:mi>p</mml:mi></mml:math></inline-formula>. In practice, pseudo Zernike functions (<uri xlink:href="https://www.mathworks.com/matlabcentral/fileexchange/33644-pseudo-zernike-functions">https://www.mathworks.com/matlabcentral/</uri> <uri xlink:href="https://www.mathworks.com/matlabcentral/fileexchange/33644-pseudo-zernike-functions">fileexchange/33644-pseudo-zernike-functions</uri>) are used for simplicity and fast calculation. <?A3B2 "fig3",5,"anchor"?><xref ref-type="fig" rid="fig-3">Fig. 3</xref> displays pseudo Zernike functions of orders <inline-formula id="ieqn-23"><mml:math id="mml-ieqn-23"><mml:mi>p</mml:mi><mml:mo>&#x2264;</mml:mo><mml:mn>5</mml:mn></mml:math></inline-formula>.</p>
<p>Note that PZM are defined in terms of polar coordinates <inline-formula id="ieqn-25"><mml:math id="mml-ieqn-25"><mml:mrow><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mi>&#x03B8;</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> with <inline-formula id="ieqn-26"><mml:math id="mml-ieqn-26"><mml:mrow><mml:mo>|</mml:mo><mml:mi>r</mml:mi><mml:mo>|</mml:mo></mml:mrow><mml:mo>&#x2264;</mml:mo><mml:mn>1</mml:mn></mml:math></inline-formula>. Therefore, the computation of PZM requires a linear transformation of the image plane (IP) coordinates <inline-formula id="ieqn-27"><mml:math id="mml-ieqn-27"><mml:mrow><mml:mo>(</mml:mo><mml:mi>w</mml:mi><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mi>h</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mn>1</mml:mn><mml:mo>&#x2264;</mml:mo><mml:mi>w</mml:mi><mml:mo>&#x2264;</mml:mo><mml:mi>W</mml:mi><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mn>1</mml:mn><mml:mo>&#x2264;</mml:mo><mml:mi>h</mml:mi><mml:mo>&#x2264;</mml:mo><mml:mi>H</mml:mi></mml:math></inline-formula> to the UC domain <inline-formula id="ieqn-28"><mml:math id="mml-ieqn-28"><mml:mrow><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mi>y</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x2208;</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="double-struck">R</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>. There are two commonly used transformations as shown in <?A3B2 "fig4",5,"anchor"?><xref ref-type="fig" rid="fig-4">Fig. 4</xref>: (i) IP over UC; and (ii) IP inside UC. In this study, we use the former (IP over UC), because the lesions will not occur within the four corners of the CCT image.</p>
<fig id="fig-3">
<label>Figure 3</label>
<caption>
<title>Pseudo Zernike functions of orders <inline-formula id="ieqn-24"><mml:math id="mml-ieqn-24"><mml:mi mathvariant="bold-italic">p</mml:mi><mml:mo>&#x2264;</mml:mo><mml:mrow><mml:mn mathvariant="bold">5</mml:mn></mml:mrow></mml:math></inline-formula></title>
</caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="CMC_18040-fig-3.png"/>
</fig>
<fig id="fig-4">
<label>Figure 4</label>
<caption>
<title>Two transformation (IP: image plane; UC: unit circle) (a) Raw image plane <inline-formula id="ieqn-29"><mml:math id="mml-ieqn-29"><mml:mi>W</mml:mi><mml:mo>&#x00D7;</mml:mo><mml:mi>H</mml:mi></mml:math></inline-formula> (b) IP over UC (c) IP inside UC</title>
</caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="CMC_18040-fig-4.png"/>
</fig>
</sec>
<sec id="s3_2">
<label>3.2</label>
<title>Autoencoder</title>
<p>Traditionally, <inline-formula id="ieqn-30"><mml:math id="mml-ieqn-30"><mml:mi>p</mml:mi></mml:math></inline-formula>-order PZMs are sent into shallow classifiers, such as multi-layer perceptron [<xref ref-type="bibr" rid="ref-27">27</xref>], adaptive differential evolution wavelet neural network (Ada-DEWNN) [<xref ref-type="bibr" rid="ref-28">28</xref>], linear regression classifier (LRC) [<xref ref-type="bibr" rid="ref-29">29</xref>], kernel support vector machine (KSVM) [<xref ref-type="bibr" rid="ref-30">30</xref>]. In this study, we introduced a customized deep-stacked sparse autoencoder (DSSAE). DSSAE is a type of deep neural network technologies, and we expect DSSAE to achieve better performances than shallow models.</p>
<p>The fundamental element of DSSAE in the autoencoder (AE), which is a typical shallow neural network that learns to map its input <inline-formula id="ieqn-31"><mml:math id="mml-ieqn-31"><mml:mi>X</mml:mi></mml:math></inline-formula> to output <inline-formula id="ieqn-32"><mml:math id="mml-ieqn-32"><mml:mi>Y</mml:mi></mml:math></inline-formula>. There is an internal code output <inline-formula id="ieqn-33"><mml:math id="mml-ieqn-33"><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> That represents the input <inline-formula id="ieqn-34"><mml:math id="mml-ieqn-34"><mml:mi>X</mml:mi></mml:math></inline-formula>. The whole AE can be divided into two parts: An encoder part <inline-formula id="ieqn-35"><mml:math id="mml-ieqn-35"><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mi>X</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msub><mml:mi>B</mml:mi><mml:mrow><mml:mi>X</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> that maps the input <inline-formula id="ieqn-36"><mml:math id="mml-ieqn-36"><mml:mi>X</mml:mi></mml:math></inline-formula> to the code <inline-formula id="ieqn-37"><mml:math id="mml-ieqn-37"><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>, and a decoder part <inline-formula id="ieqn-38"><mml:math id="mml-ieqn-38"><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mi>Y</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msub><mml:mi>B</mml:mi><mml:mrow><mml:mi>Y</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> that maps the code to a reconstructed data <inline-formula id="ieqn-39"><mml:math id="mml-ieqn-39"><mml:mi>Y</mml:mi></mml:math></inline-formula>.</p>
<p>The structure of AE is displayed in <?A3B2 "fig5",5,"anchor"?><xref ref-type="fig" rid="fig-5">Fig. 5</xref>, where the encoder part is with weight <inline-formula id="ieqn-40"><mml:math id="mml-ieqn-40"><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mi>X</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and bias <inline-formula id="ieqn-41"><mml:math id="mml-ieqn-41"><mml:msub><mml:mi>B</mml:mi><mml:mrow><mml:mi>X</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>, and the decoder part is with weights <inline-formula id="ieqn-42"><mml:math id="mml-ieqn-42"><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mi>Y</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and bias <inline-formula id="ieqn-43"><mml:math id="mml-ieqn-43"><mml:msub><mml:mi>B</mml:mi><mml:mrow><mml:mi>Y</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>. We have</p>
<p><disp-formula id="eqn-4">
<label>(4)</label>
<mml:math id="mml-eqn-4" display="block"><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>z</mml:mi><mml:mrow><mml:mi>L</mml:mi><mml:mi>S</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mi>X</mml:mi></mml:mrow></mml:msub><mml:mi>X</mml:mi><mml:mo>+</mml:mo><mml:msub><mml:mi>B</mml:mi><mml:mrow><mml:mi>X</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>,</mml:mo></mml:math>
</disp-formula></p>
<p><disp-formula id="eqn-5">
<label>(5)</label>
<mml:math id="mml-eqn-5" display="block"><mml:mi>Y</mml:mi><mml:mo>=</mml:mo><mml:msub><mml:mi>z</mml:mi><mml:mrow><mml:mi>L</mml:mi><mml:mi>S</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mi>Y</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>B</mml:mi><mml:mrow><mml:mi>Y</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>,</mml:mo></mml:math>
</disp-formula></p>
<p>where the output <inline-formula id="ieqn-44"><mml:math id="mml-ieqn-44"><mml:mi>Y</mml:mi></mml:math></inline-formula> is an estimate of input <inline-formula id="ieqn-45"><mml:math id="mml-ieqn-45"><mml:mi>X</mml:mi></mml:math></inline-formula>, and <inline-formula id="ieqn-46"><mml:math id="mml-ieqn-46"><mml:msub><mml:mi>z</mml:mi><mml:mrow><mml:mi>L</mml:mi><mml:mi>S</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is the log sigmoid function</p>
<p><disp-formula id="eqn-6">
<label>(6)</label>
<mml:math id="mml-eqn-6" display="block"><mml:msub><mml:mi>z</mml:mi><mml:mrow><mml:mi>L</mml:mi><mml:mi>S</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mfrac><mml:mn>1</mml:mn><mml:mrow><mml:mn>1</mml:mn><mml:mo>+</mml:mo><mml:mi>exp</mml:mi><mml:mo>&#x2061;</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mo>&#x2212;</mml:mo><mml:mi>x</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mfrac><mml:mo>.</mml:mo></mml:math>
</disp-formula></p>
<fig id="fig-5">
<label>Figure 5</label>
<caption>
<title>Structure of an AE</title>
</caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="CMC_18040-fig-5.png"/>
</fig>
</sec>
<sec id="s3_3">
<label>3.3</label>
<title>Sparse Autoencoder</title>
<p>The sparse autoencoder (SAE) is a variant of AE. SAE encourages sparsity into AE. SAE only allows a small fraction of the hidden neurons to be active at the same time. To minimize the error between the input vector <inline-formula id="ieqn-47"><mml:math id="mml-ieqn-47"><mml:mi>X</mml:mi></mml:math></inline-formula> and the output <italic>Y</italic>, the raw loss function <inline-formula id="ieqn-48"><mml:math id="mml-ieqn-48"><mml:msub><mml:mi>J</mml:mi><mml:mrow><mml:mi>b</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> of AE is deduced as:</p>
<p><disp-formula id="eqn-7">
<label>(7)</label>
<mml:math id="mml-eqn-7" display="block"><mml:msub><mml:mi>J</mml:mi><mml:mrow><mml:mi>b</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mi>X</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mi>Y</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msub><mml:mi>B</mml:mi><mml:mrow><mml:mi>X</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msub><mml:mi>B</mml:mi><mml:mrow><mml:mi>Y</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mfrac><mml:mn>1</mml:mn><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mi>S</mml:mi></mml:mrow></mml:msub></mml:mfrac><mml:msup><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:mi>Y</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mi>X</mml:mi><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mo>,</mml:mo></mml:math>
</disp-formula></p>
<p>where <inline-formula id="ieqn-49"><mml:math id="mml-ieqn-49"><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mi>S</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> means the number of training samples. From <xref ref-type="disp-formula" rid="eqn-4">Eqs. (4)</xref> and <xref ref-type="disp-formula" rid="eqn-5">(5)</xref>, we find the output <inline-formula id="ieqn-50"><mml:math id="mml-ieqn-50"><mml:mi>Y</mml:mi></mml:math></inline-formula> can be expressed in the way of</p>
<p><disp-formula id="eqn-8">
<label>(8)</label>
<mml:math id="mml-eqn-8" display="block"><mml:mi>Y</mml:mi><mml:mo>=</mml:mo><mml:msub><mml:mi>z</mml:mi><mml:mrow><mml:mi>A</mml:mi><mml:mi>E</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>X</mml:mi><mml:mo>&#x2223;</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mi>X</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mi>Y</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msub><mml:mi>B</mml:mi><mml:mrow><mml:mi>X</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msub><mml:mi>B</mml:mi><mml:mrow><mml:mi>Y</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>,</mml:mo></mml:math>
</disp-formula></p>
<p>where <inline-formula id="ieqn-51"><mml:math id="mml-ieqn-51"><mml:msub><mml:mi>z</mml:mi><mml:mrow><mml:mi>A</mml:mi><mml:mi>E</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is the abstract of AE function [<xref ref-type="bibr" rid="ref-31">31</xref>]. Hence, <xref ref-type="disp-formula" rid="eqn-7">Eq. (7)</xref> can be revised as</p>
<p><disp-formula id="eqn-9">
<label>(9)</label>
<mml:math id="mml-eqn-9" display="block"><mml:msub><mml:mi>J</mml:mi><mml:mrow><mml:mi>b</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mi>X</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mi>Y</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msub><mml:mi>B</mml:mi><mml:mrow><mml:mi>X</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msub><mml:mi>B</mml:mi><mml:mrow><mml:mi>Y</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mfrac><mml:mn>1</mml:mn><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mi>S</mml:mi></mml:mrow></mml:msub></mml:mfrac><mml:msup><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:msub><mml:mi>z</mml:mi><mml:mrow><mml:mi>A</mml:mi><mml:mi>E</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>X</mml:mi><mml:mo>&#x2223;</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mi>X</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mi>Y</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msub><mml:mi>B</mml:mi><mml:mrow><mml:mi>X</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msub><mml:mi>B</mml:mi><mml:mrow><mml:mi>Y</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mi>X</mml:mi><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mo>.</mml:mo></mml:math>
</disp-formula></p>
<p>To avoid over-complete mapping or learn a trivial mapping, we define one <inline-formula id="ieqn-52"><mml:math id="mml-ieqn-52"><mml:msub><mml:mi>L</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> regularization term <inline-formula id="ieqn-53"><mml:math id="mml-ieqn-53"><mml:msub><mml:mi mathvariant="normal">&#x0393;</mml:mi><mml:mrow><mml:mi>A</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> of the weights <inline-formula id="ieqn-54"><mml:math id="mml-ieqn-54"><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mi>X</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mi>Y</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> and one regularization term <inline-formula id="ieqn-55"><mml:math id="mml-ieqn-55"><mml:msub><mml:mi mathvariant="normal">&#x0393;</mml:mi><mml:mrow><mml:mi>s</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> of the sparsity constraint. Therefore, the loss function <inline-formula id="ieqn-56"><mml:math id="mml-ieqn-56"><mml:msub><mml:mi>J</mml:mi><mml:mrow><mml:mi>l</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> of SAE is derived as:</p>
<p><disp-formula id="eqn-10">
<label>(10)</label>
<mml:math id="mml-eqn-10" display="block"><mml:msub><mml:mi>J</mml:mi><mml:mrow><mml:mi>l</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mi>X</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mi>Y</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msub><mml:mi>B</mml:mi><mml:mrow><mml:mi>X</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msub><mml:mi>B</mml:mi><mml:mrow><mml:mi>Y</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mfrac><mml:mn>1</mml:mn><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mi>S</mml:mi></mml:mrow></mml:msub></mml:mfrac><mml:msup><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:msub><mml:mi>z</mml:mi><mml:mrow><mml:mi>A</mml:mi><mml:mi>E</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>X</mml:mi><mml:mo>&#x2223;</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mi>X</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mi>Y</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msub><mml:mi>B</mml:mi><mml:mrow><mml:mi>X</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msub><mml:mi>B</mml:mi><mml:mrow><mml:mi>Y</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mi>X</mml:mi><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mo>+</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mrow><mml:mi>s</mml:mi></mml:mrow></mml:msub><mml:mo>&#x00D7;</mml:mo><mml:msub><mml:mi mathvariant="normal">&#x0393;</mml:mi><mml:mrow><mml:mi>s</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mrow><mml:mi>A</mml:mi></mml:mrow></mml:msub><mml:mo>&#x00D7;</mml:mo><mml:msub><mml:mi mathvariant="normal">&#x0393;</mml:mi><mml:mrow><mml:mi>A</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo></mml:math>
</disp-formula></p>
<p>where <inline-formula id="ieqn-57"><mml:math id="mml-ieqn-57"><mml:msub><mml:mi>a</mml:mi><mml:mrow><mml:mi>s</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> stands for the sparsity regulation factor, and <inline-formula id="ieqn-58"><mml:math id="mml-ieqn-58"><mml:msub><mml:mi>a</mml:mi><mml:mrow><mml:mi>A</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> the weight regulation factor. The sparsity regularization term <inline-formula id="ieqn-59"><mml:math id="mml-ieqn-59"><mml:msub><mml:mi mathvariant="normal">&#x0393;</mml:mi><mml:mrow><mml:mi>s</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is defined as:</p>
<p><disp-formula id="eqn-11">
<label>(11)</label>
<mml:math id="mml-eqn-11" display="block"><mml:msub><mml:mi mathvariant="normal">&#x0393;</mml:mi><mml:mrow><mml:mi>s</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi>m</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mrow><mml:mo>|</mml:mo><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msub><mml:mo>|</mml:mo></mml:mrow></mml:mrow></mml:munderover><mml:msub><mml:mi>z</mml:mi><mml:mrow><mml:mi>K</mml:mi><mml:mi>L</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>&#x03C1;</mml:mi><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msub><mml:mrow><mml:mover><mml:mi>&#x03C1;</mml:mi><mml:mo stretchy="false">&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi>m</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mrow><mml:mo>|</mml:mo><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msub><mml:mo>|</mml:mo></mml:mrow></mml:mrow></mml:munderover><mml:mi>&#x03C1;</mml:mi><mml:mi>log</mml:mi><mml:mo>&#x2061;</mml:mo><mml:mfrac><mml:mi>&#x03C1;</mml:mi><mml:msub><mml:mrow><mml:mover><mml:mi>&#x03C1;</mml:mi><mml:mo stretchy="false">&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub></mml:mfrac><mml:mo>+</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mn>1</mml:mn><mml:mo>&#x2212;</mml:mo><mml:mi>&#x03C1;</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mi>log</mml:mi><mml:mo>&#x2061;</mml:mo><mml:mfrac><mml:mrow><mml:mn>1</mml:mn><mml:mo>&#x2212;</mml:mo><mml:mi>&#x03C1;</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn><mml:mo>&#x2212;</mml:mo><mml:msub><mml:mrow><mml:mover><mml:mi>&#x03C1;</mml:mi><mml:mo stretchy="false">&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfrac><mml:mo>,</mml:mo></mml:math>
</disp-formula></p>
<p>where <inline-formula id="ieqn-60"><mml:math id="mml-ieqn-60"><mml:msub><mml:mi>z</mml:mi><mml:mrow><mml:mi>K</mml:mi><mml:mi>L</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> stands for the Kullback&#x2013;Leibler divergence [<xref ref-type="bibr" rid="ref-32">32</xref>] function, <inline-formula id="ieqn-61"><mml:math id="mml-ieqn-61"><mml:mrow><mml:mo>|</mml:mo><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msub><mml:mo>|</mml:mo></mml:mrow></mml:math></inline-formula> is the number of elements of internal code output <inline-formula id="ieqn-62"><mml:math id="mml-ieqn-62"><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>, <inline-formula id="ieqn-63"><mml:math id="mml-ieqn-63"><mml:msub><mml:mrow><mml:mover><mml:mi>&#x03C1;</mml:mi><mml:mo stretchy="false">&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is the <inline-formula id="ieqn-64"><mml:math id="mml-ieqn-64"><mml:mi>m</mml:mi></mml:math></inline-formula>-th neuron&#x2019;s average activation value over all <inline-formula id="ieqn-65"><mml:math id="mml-ieqn-65"><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mi>S</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> training samples, and <inline-formula id="ieqn-66"><mml:math id="mml-ieqn-66"><mml:mi>&#x03C1;</mml:mi></mml:math></inline-formula> is its desired value, viz., sparsity proportion factor. The weight regularization term <inline-formula id="ieqn-67"><mml:math id="mml-ieqn-67"><mml:msub><mml:mi mathvariant="normal">&#x0393;</mml:mi><mml:mrow><mml:mi>A</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is defined as</p>
<p><disp-formula id="eqn-12">
<label>(12)</label>
<mml:math id="mml-eqn-12" display="block"><mml:msub><mml:mi mathvariant="normal">&#x0393;</mml:mi><mml:mrow><mml:mi>A</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mfrac><mml:mn>1</mml:mn><mml:mn>2</mml:mn></mml:mfrac><mml:mo>&#x00D7;</mml:mo><mml:msubsup><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:mtable columnalign="left left" rowspacing="4pt" columnspacing="1em"><mml:mtr><mml:mtd><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mi>X</mml:mi></mml:mrow></mml:msub></mml:mtd><mml:mtd><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mi>Y</mml:mi></mml:mrow></mml:msub></mml:mtd></mml:mtr></mml:mtable><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo>.</mml:mo></mml:math>
</disp-formula></p>
<p>The training procedure is set to scaled conjugate gradient descent (SCGD) method.</p>
</sec>
<sec id="s3_4">
<label>3.4</label>
<title>Deep Stacked Sparse Autoencoder</title>
<p>We use SAE as the building block and establish the final deep-stacked sparse autoencoder (DSSAE) classifier by following three operations: (i) We include input layer, preprocessing layer, PZM layer; (ii) We stack four SAEs; (iii) We append softmax layer at the bottom of our AI model. The details of this proposed PZM-DSSAE model are listed in <?A3B2 "tbl2",5,"anchor"?><xref ref-type="table" rid="table-2">Tab. 2</xref> and illustrated in <?A3B2 "fig6",5,"anchor"?><xref ref-type="fig" rid="fig-6">Fig. 6</xref>. After processing, all the CCT images are normalized to fixed grayscaled images with the size of <inline-formula id="ieqn-68"><mml:math id="mml-ieqn-68"><mml:mi>W</mml:mi><mml:mo>&#x00D7;</mml:mo><mml:mi>H</mml:mi></mml:math></inline-formula>. Then, PZM is applied to obtain feature vector with size of <inline-formula id="ieqn-69"><mml:math id="mml-ieqn-69"><mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mi>p</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mo>&#x00D7;</mml:mo><mml:mn>1</mml:mn></mml:math></inline-formula>. In the classification stage, four SAE blocks with number of neurons of <inline-formula id="ieqn-70"><mml:math id="mml-ieqn-70"><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> are employed. Finally, a softmax layer with neurons of <inline-formula id="ieqn-71"><mml:math id="mml-ieqn-71"><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is appended, where <inline-formula id="ieqn-72"><mml:math id="mml-ieqn-72"><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> means the number of categories to be identified.</p>
<table-wrap id="table-2">
<label>Table 2</label>
<caption>
<title>Layer details of proposed PZM-DSSAE model</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Layer</th>
<th>Trainable weights</th>
<th>Size</th>
</tr>
</thead>
<tbody>
<tr>
<td>Input</td>
<td>None</td>
<td><inline-formula id="ieqn-73"><mml:math id="mml-ieqn-73"><mml:mn>1024</mml:mn><mml:mo>&#x00D7;</mml:mo><mml:mn>1024</mml:mn><mml:mo>&#x00D7;</mml:mo><mml:mn>3</mml:mn></mml:math></inline-formula></td>
</tr>
<tr>
<td>Preprocessing</td>
<td>None</td>
<td><inline-formula id="ieqn-74"><mml:math id="mml-ieqn-74"><mml:mi>W</mml:mi><mml:mo>&#x00D7;</mml:mo><mml:mi>H</mml:mi></mml:math></inline-formula></td>
</tr>
<tr>
<td>PZM</td>
<td>None</td>
<td><inline-formula id="ieqn-75"><mml:math id="mml-ieqn-75"><mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mi>p</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mo>&#x00D7;</mml:mo><mml:mn>1</mml:mn></mml:math></inline-formula></td>
</tr>
<tr>
<td>1st SAE</td>
<td><inline-formula id="ieqn-76"><mml:math id="mml-ieqn-76"><mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mi>p</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mo>&#x00D7;</mml:mo><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula></td>
<td><inline-formula id="ieqn-77"><mml:math id="mml-ieqn-77"><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula></td>
</tr>
<tr>
<td>2nd SAE</td>
<td><inline-formula id="ieqn-78"><mml:math id="mml-ieqn-78"><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>&#x00D7;</mml:mo><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula></td>
<td><inline-formula id="ieqn-79"><mml:math id="mml-ieqn-79"><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula></td>
</tr>
<tr>
<td>3rd SAE</td>
<td><inline-formula id="ieqn-80"><mml:math id="mml-ieqn-80"><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>&#x00D7;</mml:mo><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula></td>
<td><inline-formula id="ieqn-81"><mml:math id="mml-ieqn-81"><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula></td>
</tr>
<tr>
<td>4th SAE</td>
<td><inline-formula id="ieqn-82"><mml:math id="mml-ieqn-82"><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub><mml:mo>&#x00D7;</mml:mo><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula></td>
<td><inline-formula id="ieqn-83"><mml:math id="mml-ieqn-83"><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula></td>
</tr>
<tr>
<td>Softmax</td>
<td><inline-formula id="ieqn-84"><mml:math id="mml-ieqn-84"><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msub><mml:mo>&#x00D7;</mml:mo><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula></td>
<td><inline-formula id="ieqn-85"><mml:math id="mml-ieqn-85"><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msub><mml:mo>&#x00D7;</mml:mo><mml:mn>1</mml:mn></mml:math></inline-formula></td>
</tr>
</tbody>
</table>
</table-wrap>
<fig id="fig-6">
<label>Figure 6</label>
<caption>
<title>Structure of proposed PZM-DSSAE model</title>
</caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="CMC_18040-fig-6.png"/>
</fig>
</sec>
<sec id="s3_5">
<label>3.5</label>
<title>18-Way Data Augmentation</title>
<p>The small size of training images causes overfitting, one solution to data augmentation (DA) that creates fake training images. Multiple-way DA (MDA) is an enhanced method of DA. Wang [<xref ref-type="bibr" rid="ref-33">33</xref>] proposed a 14-way data augmentation, in which they employed seven different DA techniques on <inline-formula id="ieqn-86"><mml:math id="mml-ieqn-86"><mml:mi>k</mml:mi></mml:math></inline-formula>-th training image <inline-formula id="ieqn-87"><mml:math id="mml-ieqn-87"><mml:mi>g</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> and its mirrored image <inline-formula id="ieqn-88"><mml:math id="mml-ieqn-88"><mml:msup><mml:mi>g</mml:mi><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>m</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>.</p>
<p>In this study, we add two new DA techniques, speckle noise (SN) [<xref ref-type="bibr" rid="ref-34">34</xref>] and salt-and-pepper noise (SAPN). SN altered image is defined as</p>
<p><disp-formula id="eqn-13">
<label>(13)</label>
<mml:math id="mml-eqn-13" display="block"><mml:msup><mml:mi>g</mml:mi><mml:mrow><mml:mi>S</mml:mi><mml:mi>N</mml:mi></mml:mrow></mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mi>g</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>+</mml:mo><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mi>S</mml:mi><mml:mi>N</mml:mi></mml:mrow></mml:msub><mml:mo>&#x2217;</mml:mo><mml:mi>g</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>,</mml:mo></mml:math>
</disp-formula></p>
<p>where <inline-formula id="ieqn-89"><mml:math id="mml-ieqn-89"><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mi>S</mml:mi><mml:mi>N</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is uniformly distributed random noise. The mean and variance of <inline-formula id="ieqn-90"><mml:math id="mml-ieqn-90"><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mi>S</mml:mi><mml:mi>N</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is set to <inline-formula id="ieqn-91"><mml:math id="mml-ieqn-91"><mml:msub><mml:mi>m</mml:mi><mml:mrow><mml:mi>S</mml:mi><mml:mi>N</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula id="ieqn-92"><mml:math id="mml-ieqn-92"><mml:msub><mml:mi>v</mml:mi><mml:mrow><mml:mi>S</mml:mi><mml:mi>N</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>, respectively.</p>
<p>For the <inline-formula id="ieqn-93"><mml:math id="mml-ieqn-93"><mml:mi>k</mml:mi></mml:math></inline-formula>-th training image <inline-formula id="ieqn-94"><mml:math id="mml-ieqn-94"><mml:mi>g</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>k</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>, the SAPN altered image [<xref ref-type="bibr" rid="ref-35">35</xref>] is defined as <inline-formula id="ieqn-95"><mml:math id="mml-ieqn-95"><mml:msup><mml:mi>g</mml:mi><mml:mrow><mml:mi>S</mml:mi><mml:mi>A</mml:mi><mml:mi>P</mml:mi><mml:mi>N</mml:mi></mml:mrow></mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> with its values are set as</p>
<p><disp-formula id="eqn-14">
<label>(14)</label>
<mml:math id="mml-eqn-14" display="block"><mml:mrow><mml:mo>{</mml:mo><mml:mtable columnalign="left left" rowspacing="1.4em 1.8em 0.2em" columnspacing="1em" displaystyle="false"><mml:mtr><mml:mtd><mml:mrow><mml:mi mathvariant="normal">P</mml:mi></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:msup><mml:mi>g</mml:mi><mml:mrow><mml:mi>S</mml:mi><mml:mi>A</mml:mi><mml:mi>P</mml:mi><mml:mi>N</mml:mi></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:mi>g</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>&#x2212;</mml:mo><mml:msubsup><mml:mi>&#x03B3;</mml:mi><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>s</mml:mi><mml:mi>a</mml:mi><mml:mi>p</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msubsup><mml:mo>,</mml:mo></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:mi mathvariant="normal">P</mml:mi></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:msup><mml:mi>g</mml:mi><mml:mrow><mml:mi>S</mml:mi><mml:mi>A</mml:mi><mml:mi>P</mml:mi><mml:mi>N</mml:mi></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:msub><mml:mi>g</mml:mi><mml:mrow><mml:mo movablelimits="true" form="prefix">min</mml:mo></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mstyle displaystyle="true" scriptlevel="0"><mml:mfrac><mml:msubsup><mml:mi>&#x03B3;</mml:mi><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>s</mml:mi><mml:mi>a</mml:mi><mml:mi>p</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msubsup><mml:mn>2</mml:mn></mml:mfrac><mml:mo>,</mml:mo></mml:mstyle></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:mi mathvariant="normal">P</mml:mi></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:msup><mml:mi>g</mml:mi><mml:mrow><mml:mi>S</mml:mi><mml:mi>A</mml:mi><mml:mi>P</mml:mi><mml:mi>N</mml:mi></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:msub><mml:mi>g</mml:mi><mml:mrow><mml:mo movablelimits="true" form="prefix">max</mml:mo></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mstyle displaystyle="true" scriptlevel="0"><mml:mfrac><mml:msubsup><mml:mi>&#x03B3;</mml:mi><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>s</mml:mi><mml:mi>a</mml:mi><mml:mi>p</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msubsup><mml:mn>2</mml:mn></mml:mfrac><mml:mo>,</mml:mo></mml:mstyle></mml:mtd></mml:mtr></mml:mtable><mml:mo fence="true" stretchy="true" symmetric="true"></mml:mo></mml:mrow></mml:math>
</disp-formula></p>
<p>where <inline-formula id="ieqn-96"><mml:math id="mml-ieqn-96"><mml:msubsup><mml:mi>&#x03B3;</mml:mi><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>s</mml:mi><mml:mi>a</mml:mi><mml:mi>p</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msubsup></mml:math></inline-formula> stands for noise density, and <inline-formula id="ieqn-97"><mml:math id="mml-ieqn-97"><mml:mrow><mml:mi mathvariant="normal">P</mml:mi></mml:mrow></mml:math></inline-formula> the probability function. <inline-formula id="ieqn-98"><mml:math id="mml-ieqn-98"><mml:msub><mml:mi>g</mml:mi><mml:mrow><mml:mo movablelimits="true" form="prefix">min</mml:mo></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula id="ieqn-99"><mml:math id="mml-ieqn-99"><mml:msub><mml:mi>g</mml:mi><mml:mrow><mml:mo movablelimits="true" form="prefix">max</mml:mo></mml:mrow></mml:msub></mml:math></inline-formula> correspond to black and white colors, respectively. The definitions of <inline-formula id="ieqn-100"><mml:math id="mml-ieqn-100"><mml:msub><mml:mi>g</mml:mi><mml:mrow><mml:mi>m</mml:mi><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula id="ieqn-101"><mml:math id="mml-ieqn-101"><mml:msub><mml:mi>g</mml:mi><mml:mrow><mml:mi>m</mml:mi><mml:mi>a</mml:mi><mml:mi>x</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> can be found in Algorithm 1.</p>
<p>First, <inline-formula id="ieqn-102"><mml:math id="mml-ieqn-102"><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>D</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula> different DA methods as shown in <?A3B2 "fig7",5,"anchor"?><xref ref-type="fig" rid="fig-7">Fig. 7</xref> are applied to <inline-formula id="ieqn-103"><mml:math id="mml-ieqn-103"><mml:mi>g</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. Let <inline-formula id="ieqn-104"><mml:math id="mml-ieqn-104"><mml:msub><mml:mrow><mml:mi mathvariant="bold">H</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mi>m</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>D</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula> denote each DA operation, we have the augmented dataset on raw image <inline-formula id="ieqn-105"><mml:math id="mml-ieqn-105"><mml:mi>g</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> as:</p>
<p><disp-formula id="eqn-15">
<label>(15)</label>
<mml:math id="mml-eqn-15" display="block"><mml:msub><mml:mrow><mml:mi mathvariant="bold">H</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>[</mml:mo><mml:mi>g</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>]</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:mspace width="1em" /><mml:mi>m</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>D</mml:mi></mml:mrow></mml:msup><mml:mo>.</mml:mo></mml:math>
</disp-formula></p>
<fig id="fig-7">
<label>Figure 7</label>
<caption>
<title>Diagram of proposed 16-way DA</title>
</caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="CMC_18040-fig-7.png"/>
</fig>
<p>Suppose <inline-formula id="ieqn-106"><mml:math id="mml-ieqn-106"><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula> stands for the size of generated new images for each DA method, we have</p>
<p><disp-formula id="eqn-16">
<label>(16)</label>
<mml:math id="mml-eqn-16" display="block"><mml:mrow><mml:mo>|</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold">H</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>[</mml:mo><mml:mi>g</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>]</mml:mo></mml:mrow><mml:mo>|</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msup><mml:mo>.</mml:mo></mml:math>
</disp-formula></p>
<p>Second, horizontal mirrored image is generated as:</p>
<p><disp-formula id="eqn-17">
<label>(17)</label>
<mml:math id="mml-eqn-17" display="block"><mml:msup><mml:mi>g</mml:mi><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>m</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:msub><mml:mi>z</mml:mi><mml:mrow><mml:mi>b</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>[</mml:mo><mml:mi>g</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:math>
</disp-formula></p>
<p>where <inline-formula id="ieqn-107"><mml:math id="mml-ieqn-107"><mml:msub><mml:mi>z</mml:mi><mml:mrow><mml:mi>b</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> stands for horizontal mirror function.</p>
<p>Third, all the <inline-formula id="ieqn-108"><mml:math id="mml-ieqn-108"><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>D</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula> different DA methods are performed on the mirror image <inline-formula id="ieqn-109"><mml:math id="mml-ieqn-109"><mml:msub><mml:mi>p</mml:mi><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, and generate <inline-formula id="ieqn-110"><mml:math id="mml-ieqn-110"><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>D</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula> different dataset.</p>
<p><disp-formula id="eqn-18">
<label>(18)</label>
<mml:math id="mml-eqn-18" display="block"><mml:mrow><mml:mo>{</mml:mo><mml:mtable columnalign="left left" rowspacing="1.2em 0.2em" columnspacing="1em" displaystyle="false"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi mathvariant="bold">H</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>[</mml:mo><mml:msup><mml:mi>g</mml:mi><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>m</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>]</mml:mo></mml:mrow><mml:mo>,</mml:mo></mml:mtd><mml:mtd><mml:mi>m</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>D</mml:mi></mml:mrow></mml:msup></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:mo>|</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold">H</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>[</mml:mo><mml:msup><mml:mi>g</mml:mi><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>m</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>]</mml:mo></mml:mrow><mml:mo>|</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msup><mml:mo>,</mml:mo></mml:mtd><mml:mtd><mml:mi>m</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>D</mml:mi></mml:mrow></mml:msup></mml:mtd></mml:mtr></mml:mtable><mml:mo fence="true" stretchy="true" symmetric="true"></mml:mo></mml:mrow></mml:math>
</disp-formula></p>
<p>Fourth, the raw image <inline-formula id="ieqn-111"><mml:math id="mml-ieqn-111"><mml:mi>g</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, the mirrored image <inline-formula id="ieqn-112"><mml:math id="mml-ieqn-112"><mml:msup><mml:mi>g</mml:mi><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>m</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, all the above <inline-formula id="ieqn-113"><mml:math id="mml-ieqn-113"><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>D</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula>-way results of raw image <inline-formula id="ieqn-114"><mml:math id="mml-ieqn-114"><mml:msub><mml:mrow><mml:mi mathvariant="bold">H</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>[</mml:mo><mml:mi>g</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula>, and <inline-formula id="ieqn-115"><mml:math id="mml-ieqn-115"><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>D</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula>-way DA results of horizontal mirrored image <inline-formula id="ieqn-116"><mml:math id="mml-ieqn-116"><mml:msub><mml:mrow><mml:mi mathvariant="bold">H</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>[</mml:mo><mml:msup><mml:mi>g</mml:mi><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>m</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula>, are combined together. The final generated dataset from <inline-formula id="ieqn-117"><mml:math id="mml-ieqn-117"><mml:mi>g</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is defined as <inline-formula id="ieqn-118"><mml:math id="mml-ieqn-118"><mml:mrow><mml:mi mathvariant="bold">F</mml:mi></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>:</p>
<p><disp-formula id="eqn-19">
<label>(19)</label>
<mml:math id="mml-eqn-19" display="block"><mml:mi>g</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo stretchy="false">&#x21A6;</mml:mo><mml:mrow><mml:mi mathvariant="bold">F</mml:mi></mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mi>k</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:msub><mml:mi>z</mml:mi><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>{</mml:mo><mml:mtable columnalign="left left" rowspacing="1.4em 2em 1.4em 0.4em" columnspacing="1em"><mml:mtr><mml:mtd><mml:mi>g</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:msup><mml:mi>g</mml:mi><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>m</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:munder><mml:mrow><mml:munder><mml:mrow><mml:msub><mml:mrow><mml:mi mathvariant="bold">H</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mo>[</mml:mo><mml:mi>g</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:mrow><mml:mo>&#x23DF;</mml:mo></mml:munder></mml:mrow><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msup></mml:munder></mml:mtd><mml:mtd><mml:munder><mml:mrow><mml:munder><mml:mrow><mml:msub><mml:mrow><mml:mi mathvariant="bold">H</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mo>[</mml:mo><mml:msup><mml:mi>g</mml:mi><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>m</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:mrow><mml:mo>&#x23DF;</mml:mo></mml:munder></mml:mrow><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msup></mml:munder></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mo>&#x22EF;</mml:mo></mml:mtd><mml:mtd><mml:mo>&#x22EF;</mml:mo></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:munder><mml:mrow><mml:munder><mml:mrow><mml:msub><mml:mrow><mml:mi mathvariant="bold">H</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>D</mml:mi></mml:mrow></mml:msup></mml:mrow></mml:mrow></mml:msub><mml:mrow><mml:mo>[</mml:mo><mml:mi>g</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:mrow><mml:mo>&#x23DF;</mml:mo></mml:munder></mml:mrow><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msup></mml:munder></mml:mtd><mml:mtd><mml:munder><mml:mrow><mml:munder><mml:mrow><mml:msub><mml:mrow><mml:mi mathvariant="bold">H</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>D</mml:mi></mml:mrow></mml:msup></mml:mrow></mml:mrow></mml:msub><mml:mrow><mml:mo>[</mml:mo><mml:msup><mml:mi>g</mml:mi><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>m</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:mrow><mml:mo>&#x23DF;</mml:mo></mml:munder></mml:mrow><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msup></mml:munder></mml:mtd></mml:mtr></mml:mtable><mml:mo>}</mml:mo></mml:mrow></mml:math>
</disp-formula></p>
<p>where <inline-formula id="ieqn-119"><mml:math id="mml-ieqn-119"><mml:msub><mml:mi>z</mml:mi><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> stands for the concatenation function. Suppose augmentation factor is <inline-formula id="ieqn-120"><mml:math id="mml-ieqn-120"><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>A</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula>, which stands for the number of images in <inline-formula id="ieqn-121"><mml:math id="mml-ieqn-121"><mml:mrow><mml:mi mathvariant="bold">F</mml:mi></mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mi>k</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>, we have</p>
<p><disp-formula id="eqn-20">
<label>(20)</label>
<mml:math id="mml-eqn-20" display="block"><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>A</mml:mi></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mo>|</mml:mo><mml:mrow><mml:mi mathvariant="bold">F</mml:mi></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>|</mml:mo></mml:mrow><mml:mrow><mml:mo>|</mml:mo><mml:mi>g</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>|</mml:mo></mml:mrow></mml:mfrac><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mn>1</mml:mn><mml:mo>+</mml:mo><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>D</mml:mi></mml:mrow></mml:msup><mml:mo>&#x00D7;</mml:mo><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x00D7;</mml:mo><mml:mn>2</mml:mn></mml:mrow><mml:mn>1</mml:mn></mml:mfrac><mml:mo>=</mml:mo><mml:mn>2</mml:mn><mml:mo>&#x00D7;</mml:mo><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>D</mml:mi></mml:mrow></mml:msup><mml:mo>&#x00D7;</mml:mo><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msup><mml:mo>+</mml:mo><mml:mn>2</mml:mn></mml:math>
</disp-formula></p>
<p>Algorithm 2 summarizes the pseudocode of proposed 18-way DA method.</p>
<table-wrap id="table-7">
<caption>
<title>Algorithm 2: Pseudocode of proposed 18-way data augmentation on <italic>k</italic>-th training image</title>
</caption>
<table>
<colgroup>
<col/>
</colgroup>
<tbody>
<tr>
<td>Step 1 Import raw preprocessed <inline-formula id="ieqn-122"><mml:math id="mml-ieqn-122"><mml:mi>k</mml:mi></mml:math></inline-formula>-th training image <inline-formula id="ieqn-123"><mml:math id="mml-ieqn-123"><mml:mi>g</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>.</td>
</tr>
<tr>
<td>Step 2 <inline-formula id="ieqn-124"><mml:math id="mml-ieqn-124"><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>D</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula> geometric or photometric or noise-injection DA transforms <inline-formula id="ieqn-125"><mml:math id="mml-ieqn-125"><mml:msub><mml:mrow><mml:mi mathvariant="bold">H</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> are utilized on <inline-formula id="ieqn-126"><mml:math id="mml-ieqn-126"><mml:mi>g</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. We obtain <inline-formula id="ieqn-127"><mml:math id="mml-ieqn-127"><mml:msub><mml:mrow><mml:mi mathvariant="bold">H</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>[</mml:mo><mml:mi>g</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>]</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mi>m</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>D</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula>. See <xref ref-type="disp-formula" rid="eqn-15">Eq. (15)</xref>. Each enhanced dataset contains <inline-formula id="ieqn-128"><mml:math id="mml-ieqn-128"><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula> new images. See <xref ref-type="disp-formula" rid="eqn-16">Eq. (16)</xref>.</td>
</tr>
<tr>
<td>Step 3 A horizontal mirror image is generated as <inline-formula id="ieqn-129"><mml:math id="mml-ieqn-129"><mml:msup><mml:mi>g</mml:mi><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>m</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:msub><mml:mi>z</mml:mi><mml:mrow><mml:mi>b</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>[</mml:mo><mml:mi>g</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula>. See <xref ref-type="disp-formula" rid="eqn-17">Eq. (17)</xref>.</td>
</tr>
<tr>
<td>Step 4 <inline-formula id="ieqn-130"><mml:math id="mml-ieqn-130"><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>D</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula>-way data augmentation methods are carried out on <inline-formula id="ieqn-131"><mml:math id="mml-ieqn-131"><mml:msup><mml:mi>g</mml:mi><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>m</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, we obtain <inline-formula id="ieqn-132"><mml:math id="mml-ieqn-132"><mml:msub><mml:mrow><mml:mi mathvariant="bold">H</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>[</mml:mo><mml:msup><mml:mi>g</mml:mi><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>m</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>]</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mi>m</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>D</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula>. See <xref ref-type="disp-formula" rid="eqn-18">Eq. (18)</xref>.</td>
</tr>
<tr>
<td>Step 5 The raw image, the mirrored image, all the above <inline-formula id="ieqn-133"><mml:math id="mml-ieqn-133"><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>D</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula>-way DA results of raw image, and <inline-formula id="ieqn-134"><mml:math id="mml-ieqn-134"><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>D</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula>-way DA results of a horizontal mirrored image are combined <italic>via</italic> <inline-formula id="ieqn-135"><mml:math id="mml-ieqn-135"><mml:msub><mml:mi>z</mml:mi><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>. See <xref ref-type="disp-formula" rid="eqn-19">Eq. (19)</xref>.</td>
</tr>
<tr>
<td>Step 6 A new dataset <inline-formula id="ieqn-136"><mml:math id="mml-ieqn-136"><mml:mrow><mml:mi mathvariant="bold">F</mml:mi></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is generated with number of images as <inline-formula id="ieqn-137"><mml:math id="mml-ieqn-137"><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>A</mml:mi></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:mn>2</mml:mn><mml:mo>&#x00D7;</mml:mo><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>D</mml:mi></mml:mrow></mml:msup><mml:mo>&#x00D7;</mml:mo><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msup><mml:mo>+</mml:mo><mml:mn>2</mml:mn></mml:math></inline-formula>. See <xref ref-type="disp-formula" rid="eqn-20">Eq. (20)</xref>.</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s3_6">
<label>3.6</label>
<title>Cross-Validation</title>
<p><inline-formula id="ieqn-138"><mml:math id="mml-ieqn-138"><mml:mi>F</mml:mi></mml:math></inline-formula>-fold cross-validation was used in this study. The whole dataset is divided into <inline-formula id="ieqn-139"><mml:math id="mml-ieqn-139"><mml:mi>F</mml:mi></mml:math></inline-formula> folds. At <inline-formula id="ieqn-140"><mml:math id="mml-ieqn-140"><mml:mi>f</mml:mi></mml:math></inline-formula>-th trial, <inline-formula id="ieqn-141"><mml:math id="mml-ieqn-141"><mml:mn>1</mml:mn><mml:mo>&#x2264;</mml:mo><mml:mi>f</mml:mi><mml:mo>&#x2264;</mml:mo><mml:mi>F</mml:mi></mml:math></inline-formula>, the <inline-formula id="ieqn-142"><mml:math id="mml-ieqn-142"><mml:mi>f</mml:mi></mml:math></inline-formula>-th fold is selected as the test, and the rest <inline-formula id="ieqn-143"><mml:math id="mml-ieqn-143"><mml:mi>F</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:math></inline-formula> folds [<xref ref-type="bibr" rid="ref-36">36</xref>]: <inline-formula id="ieqn-144"><mml:math id="mml-ieqn-144"><mml:mrow><mml:mo>[</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mi>f</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mi>f</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mi>F</mml:mi><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> are selected as training set (<?A3B2 "fig8",5,"anchor"?><xref ref-type="fig" rid="fig-8">Fig. 8</xref>). In this study, suppose <inline-formula id="ieqn-145"><mml:math id="mml-ieqn-145"><mml:mi>F</mml:mi><mml:mo>=</mml:mo><mml:mn>10</mml:mn></mml:math></inline-formula>, then each fold will contain 32 COVID-19 images and 32 HC images.</p>
<fig id="fig-8">
<label>Figure 8</label>
<caption>
<title><italic>F</italic>-fold cross validation</title>
</caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="CMC_18040-fig-8.png"/>
</fig>
</sec>
<sec id="s3_7">
<label>3.7</label>
<title>Evaluation</title>
<p>To avoid randomness, we run the whole above procedure <inline-formula id="ieqn-146"><mml:math id="mml-ieqn-146"><mml:msup><mml:mi>N</mml:mi><mml:mrow><mml:mi>R</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula> times with different initial random seeds and different cross-validation partitions. The ideal confusion matrix (CM) <inline-formula id="ieqn-147"><mml:math id="mml-ieqn-147"><mml:msup><mml:mi>R</mml:mi><mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">d</mml:mi><mml:mi mathvariant="italic">e</mml:mi><mml:mi mathvariant="italic">a</mml:mi><mml:mi mathvariant="italic">l</mml:mi></mml:mrow></mml:mrow></mml:msup></mml:math></inline-formula> is defined as</p>
<p><disp-formula id="eqn-21">
<label>(21)</label>
<mml:math id="mml-eqn-21" display="block"><mml:msup><mml:mi>R</mml:mi><mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">d</mml:mi><mml:mi mathvariant="italic">e</mml:mi><mml:mi mathvariant="italic">a</mml:mi><mml:mi mathvariant="italic">l</mml:mi></mml:mrow></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:msup><mml:mi>r</mml:mi><mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">d</mml:mi><mml:mi mathvariant="italic">e</mml:mi><mml:mi mathvariant="italic">a</mml:mi><mml:mi mathvariant="italic">l</mml:mi></mml:mrow></mml:mrow></mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mi>m</mml:mi><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mi>n</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>}</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:msup><mml:mi>N</mml:mi><mml:mrow><mml:mi>R</mml:mi></mml:mrow></mml:msup><mml:mo>&#x00D7;</mml:mo><mml:mrow><mml:mo>[</mml:mo><mml:mtable columnalign="left left" rowspacing="1em 0.4em" columnspacing="1em"><mml:mtr><mml:mtd><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:mtd></mml:mtr></mml:mtable><mml:mo>]</mml:mo></mml:mrow><mml:mo>,</mml:mo></mml:math>
</disp-formula></p>
<p>Note here the off-diagonal entries of <inline-formula id="ieqn-148"><mml:math id="mml-ieqn-148"><mml:msup><mml:mi>R</mml:mi><mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">d</mml:mi><mml:mi mathvariant="italic">e</mml:mi><mml:mi mathvariant="italic">a</mml:mi><mml:mi mathvariant="italic">l</mml:mi></mml:mrow></mml:mrow></mml:msup></mml:math></inline-formula> are all zero, viz., <inline-formula id="ieqn-149"><mml:math id="mml-ieqn-149"><mml:msup><mml:mi>r</mml:mi><mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">d</mml:mi><mml:mi mathvariant="italic">e</mml:mi><mml:mi mathvariant="italic">a</mml:mi><mml:mi mathvariant="italic">l</mml:mi></mml:mrow></mml:mrow></mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mi>m</mml:mi><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mi>n</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mi mathvariant="normal">&#x2200;</mml:mi><mml:mspace width="thinmathspace" /><mml:mi>m</mml:mi><mml:mo>&#x2260;</mml:mo><mml:mi>n</mml:mi></mml:math></inline-formula>. <inline-formula id="ieqn-150"><mml:math id="mml-ieqn-150"><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula id="ieqn-151"><mml:math id="mml-ieqn-151"><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> are the number of samples of each category, which can be found in Algorithm 1. Seven measures are defined based on realistic CM [<xref ref-type="bibr" rid="ref-37">37</xref>] defined as:</p>
<p><disp-formula id="eqn-22">
<label>(22)</label>
<mml:math id="mml-eqn-22" display="block"><mml:msup><mml:mi>R</mml:mi><mml:mrow><mml:mi>r</mml:mi><mml:mi>e</mml:mi><mml:mi>a</mml:mi><mml:mi>l</mml:mi></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:msup><mml:mi>r</mml:mi><mml:mrow><mml:mi>r</mml:mi><mml:mi>e</mml:mi><mml:mi>a</mml:mi><mml:mi>l</mml:mi></mml:mrow></mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mi>m</mml:mi><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mi>n</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>}</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mrow><mml:mo>[</mml:mo><mml:mtable columnalign="left left" rowspacing="1.4em 0.4em" columnspacing="1em"><mml:mtr><mml:mtd><mml:mi>T</mml:mi><mml:mi>P</mml:mi></mml:mtd><mml:mtd><mml:mi>F</mml:mi><mml:mi>N</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mi>F</mml:mi><mml:mi>P</mml:mi></mml:mtd><mml:mtd><mml:mi>T</mml:mi><mml:mi>N</mml:mi></mml:mtd></mml:mtr></mml:mtable><mml:mo>]</mml:mo></mml:mrow><mml:mo>,</mml:mo></mml:math>
</disp-formula></p>
<p>The first four measures are sensitivity, specificity, precision and accuracy, common in most pattern recognition papers. The last three measures are F1 score, Matthews correlation coefficient (MCC) [<xref ref-type="bibr" rid="ref-38">38</xref>], and Fowlkes&#x2013;Mallows index (FMI) [<xref ref-type="bibr" rid="ref-39">39</xref>]. They are defined as:</p>
<p><disp-formula id="eqn-23">
<label>(23)</label>
<mml:math id="mml-eqn-23" display="block"><mml:mrow><mml:mo>{</mml:mo><mml:mtable columnalign="left left" rowspacing="1.8em 1.8em 0.2em" columnspacing="1em" displaystyle="false"><mml:mtr><mml:mtd><mml:mi>F</mml:mi><mml:mn>1</mml:mn><mml:mo>=</mml:mo><mml:mstyle displaystyle="true" scriptlevel="0"><mml:mfrac><mml:mrow><mml:mn>2</mml:mn><mml:mi>T</mml:mi><mml:mi>P</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn><mml:mi>T</mml:mi><mml:mi>P</mml:mi><mml:mo>+</mml:mo><mml:mi>F</mml:mi><mml:mi>P</mml:mi><mml:mo>+</mml:mo><mml:mi>F</mml:mi><mml:mi>N</mml:mi></mml:mrow></mml:mfrac></mml:mstyle></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mi>M</mml:mi><mml:mi>C</mml:mi><mml:mi>C</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="true" scriptlevel="0"><mml:mfrac><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi><mml:mo>&#x00D7;</mml:mo><mml:mi>T</mml:mi><mml:mi>N</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mi>F</mml:mi><mml:mi>P</mml:mi><mml:mo>&#x00D7;</mml:mo><mml:mi>F</mml:mi><mml:mi>N</mml:mi></mml:mrow><mml:msqrt><mml:mrow><mml:mo>(</mml:mo><mml:mi>T</mml:mi><mml:mi>P</mml:mi><mml:mo>+</mml:mo><mml:mi>F</mml:mi><mml:mi>P</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x00D7;</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mi>T</mml:mi><mml:mi>P</mml:mi><mml:mo>+</mml:mo><mml:mi>F</mml:mi><mml:mi>N</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x00D7;</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mi>T</mml:mi><mml:mi>N</mml:mi><mml:mo>+</mml:mo><mml:mi>F</mml:mi><mml:mi>P</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x00D7;</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mi>T</mml:mi><mml:mi>N</mml:mi><mml:mo>+</mml:mo><mml:mi>F</mml:mi><mml:mi>N</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msqrt></mml:mfrac><mml:mo>,</mml:mo></mml:mstyle></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mi>F</mml:mi><mml:mi>M</mml:mi><mml:mi>I</mml:mi><mml:mo>=</mml:mo><mml:msqrt><mml:mstyle displaystyle="true" scriptlevel="0"><mml:mfrac><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi></mml:mrow><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi><mml:mo>+</mml:mo><mml:mi>F</mml:mi><mml:mi>P</mml:mi></mml:mrow></mml:mfrac><mml:mo>&#x00D7;</mml:mo><mml:mfrac><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi></mml:mrow><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi><mml:mo>+</mml:mo><mml:mi>F</mml:mi><mml:mi>N</mml:mi></mml:mrow></mml:mfrac></mml:mstyle></mml:msqrt></mml:mtd></mml:mtr></mml:mtable><mml:mo fence="true" stretchy="true" symmetric="true"></mml:mo></mml:mrow></mml:math>
</disp-formula></p>
<p>Besides, the receiver operating characteristic (ROC) curve [<xref ref-type="bibr" rid="ref-40">40</xref>] is used to provide a graphical plot of our model. ROC curve is created by plotting the true positive rate against the false-positive rate at various threshold settings. The area under the curve (AUC) is also calculated.</p>
</sec>
</sec>
<sec id="s4">
<label>4</label>
<title>Experimental Results</title>
<sec id="s4_1">
<label>4.1</label>
<title>Parameter Setting</title>
<p><?A3B2 "tbl3",5,"anchor"?><xref ref-type="table" rid="table-3">Tab. 3</xref> displays the parameter setting of this study. The number of samples of each class is 320. The minimum and maximum grayscale values are set to <inline-formula id="ieqn-152"><mml:math id="mml-ieqn-152"><mml:mrow><mml:mo>(</mml:mo><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mn>255</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. For the crop operation, 200 pixels are removed from all four sides. The preprocessed image is with size of <inline-formula id="ieqn-153"><mml:math id="mml-ieqn-153"><mml:mn>256</mml:mn><mml:mo>&#x00D7;</mml:mo><mml:mn>256</mml:mn></mml:math></inline-formula>. The max order of PZM is set to 19, so we have <inline-formula id="ieqn-154"><mml:math id="mml-ieqn-154"><mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mn>19</mml:mn><mml:mo>+</mml:mo><mml:mn>1</mml:mn><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:mn>400</mml:mn></mml:math></inline-formula> PZM features. The weight regularization factor <inline-formula id="ieqn-155"><mml:math id="mml-ieqn-155"><mml:msub><mml:mi>a</mml:mi><mml:mrow><mml:mi>A</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>0.001</mml:mn></mml:math></inline-formula>, the sparsity regulation factor <inline-formula id="ieqn-156"><mml:math id="mml-ieqn-156"><mml:msub><mml:mi>a</mml:mi><mml:mrow><mml:mi>s</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1.1</mml:mn></mml:math></inline-formula>, and the sparsity proportion factor is <inline-formula id="ieqn-157"><mml:math id="mml-ieqn-157"><mml:mi>&#x03C1;</mml:mi><mml:mo>=</mml:mo><mml:mn>0.05</mml:mn></mml:math></inline-formula>. The neurons of four SAEs are 300, 200, 100, and 50, respectively. The number of classes to be classified is set to 2. The number of folds in cross-validation is set to 10. The mean and variance of uniformly distributed random noise in SN are set to 0 and 0.05, respectively. The noise density of SAPN is set to 0.05. The number of different DA methods is set to 9, and the number of the newly generated image is set to 30. The augmentation factor is obtained as <inline-formula id="ieqn-158"><mml:math id="mml-ieqn-158"><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>A</mml:mi></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:mn>542</mml:mn></mml:math></inline-formula> (See Algorithm 2). The number of runs is set to 10.</p>
<table-wrap id="table-3">
<label>Table 3</label>
<caption>
<title>Parameter setting</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Parameter</th>
<th>Value</th>
<th>Parameter</th>
<th>Value</th>
<th>Parameter</th>
<th>Value</th>
</tr>
</thead>
<tbody>
<tr>
<td><inline-formula id="ieqn-159"><mml:math id="mml-ieqn-159"><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula></td>
<td><inline-formula id="ieqn-160"><mml:math id="mml-ieqn-160"><mml:mrow><mml:mo>(</mml:mo><mml:mn>320</mml:mn><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mn>320</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula></td>
<td><inline-formula id="ieqn-161"><mml:math id="mml-ieqn-161"><mml:mi>&#x03C1;</mml:mi></mml:math></inline-formula></td>
<td>0.05</td>
<td><inline-formula id="ieqn-162"><mml:math id="mml-ieqn-162"><mml:msub><mml:mi>m</mml:mi><mml:mrow><mml:mi>S</mml:mi><mml:mi>N</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula></td>
<td>0</td>
</tr>
<tr>
<td><inline-formula id="ieqn-163"><mml:math id="mml-ieqn-163"><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>g</mml:mi><mml:mrow><mml:mi>m</mml:mi><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msub><mml:mi>g</mml:mi><mml:mrow><mml:mi>m</mml:mi><mml:mi>a</mml:mi><mml:mi>x</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula></td>
<td><inline-formula id="ieqn-164"><mml:math id="mml-ieqn-164"><mml:mrow><mml:mo>(</mml:mo><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mn>255</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula></td>
<td><inline-formula id="ieqn-165"><mml:math id="mml-ieqn-165"><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula></td>
<td>300</td>
<td><inline-formula id="ieqn-166"><mml:math id="mml-ieqn-166"><mml:msub><mml:mi>v</mml:mi><mml:mrow><mml:mi>S</mml:mi><mml:mi>N</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula></td>
<td>0.05</td>
</tr>
<tr>
<td><inline-formula id="ieqn-167"><mml:math id="mml-ieqn-167"><mml:mrow><mml:mo>(</mml:mo><mml:msubsup><mml:mi>m</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msubsup><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msubsup><mml:mi>m</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msubsup><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msubsup><mml:mi>m</mml:mi><mml:mrow><mml:mn>3</mml:mn></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msubsup><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:msubsup><mml:mi>m</mml:mi><mml:mrow><mml:mn>4</mml:mn></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msubsup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula></td>
<td><inline-formula id="ieqn-168"><mml:math id="mml-ieqn-168"><mml:mrow><mml:mo>(</mml:mo><mml:mn>200</mml:mn><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mn>200</mml:mn><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mn>200</mml:mn><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mn>200</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula></td>
<td><inline-formula id="ieqn-169"><mml:math id="mml-ieqn-169"><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula></td>
<td>200</td>
<td><inline-formula id="ieqn-170"><mml:math id="mml-ieqn-170"><mml:msubsup><mml:mi>&#x03B3;</mml:mi><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>s</mml:mi><mml:mi>a</mml:mi><mml:mi>p</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msubsup></mml:math></inline-formula></td>
<td>0.05</td>
</tr>
<tr>
<td><inline-formula id="ieqn-171"><mml:math id="mml-ieqn-171"><mml:mrow><mml:mo>(</mml:mo><mml:mi>W</mml:mi><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mi>H</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula></td>
<td><inline-formula id="ieqn-172"><mml:math id="mml-ieqn-172"><mml:mrow><mml:mo>(</mml:mo><mml:mn>256</mml:mn><mml:mo>,</mml:mo><mml:mspace width="thinmathspace" /><mml:mn>256</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula></td>
<td><inline-formula id="ieqn-173"><mml:math id="mml-ieqn-173"><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula></td>
<td>100</td>
<td><inline-formula id="ieqn-174"><mml:math id="mml-ieqn-174"><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>D</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula></td>
<td>9</td>
</tr>
<tr>
<td><inline-formula id="ieqn-175"><mml:math id="mml-ieqn-175"><mml:mi>p</mml:mi></mml:math></inline-formula></td>
<td>19</td>
<td><inline-formula id="ieqn-176"><mml:math id="mml-ieqn-176"><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula></td>
<td>50</td>
<td><inline-formula id="ieqn-177"><mml:math id="mml-ieqn-177"><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula></td>
<td>30</td>
</tr>
<tr>
<td><inline-formula id="ieqn-178"><mml:math id="mml-ieqn-178"><mml:msub><mml:mi>a</mml:mi><mml:mrow><mml:mi>A</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula></td>
<td>0.001</td>
<td><inline-formula id="ieqn-179"><mml:math id="mml-ieqn-179"><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula></td>
<td>2</td>
<td><inline-formula id="ieqn-180"><mml:math id="mml-ieqn-180"><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>A</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula></td>
<td>542</td>
</tr>
<tr>
<td><inline-formula id="ieqn-181"><mml:math id="mml-ieqn-181"><mml:msub><mml:mi>a</mml:mi><mml:mrow><mml:mi>s</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula></td>
<td>1.1</td>
<td><inline-formula id="ieqn-182"><mml:math id="mml-ieqn-182"><mml:mi>F</mml:mi></mml:math></inline-formula></td>
<td>10</td>
<td><inline-formula id="ieqn-183"><mml:math id="mml-ieqn-183"><mml:msup><mml:mi>N</mml:mi><mml:mrow><mml:mi>R</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula></td>
<td>10</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s4_2">
<label>4.2</label>
<title>Illustration of 18-Way Data Augmentation</title>
<p><?A3B2 "fig9",5,"anchor"?><xref ref-type="fig" rid="fig-9">Fig. 9</xref> shows the <inline-formula id="ieqn-184"><mml:math id="mml-ieqn-184"><mml:msup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>D</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula>-way DA to the raw image. Due to the page limit, the mirrored image and its corresponding DA results are not displayed. As can be observed in <xref ref-type="fig" rid="fig-9">Fig. 9</xref>, the multiple-way DA can increase our training images&#x2019; diversity.</p>
<fig id="fig-9">
<label>Figure 9</label>
<caption>
<title><inline-formula id="ieqn-185"><mml:math id="mml-ieqn-185"><mml:msup><mml:mi mathvariant="bold-italic">Q</mml:mi><mml:mrow><mml:mi mathvariant="bold-italic">D</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula>-way DA results of raw image (a) Gaussian noise (b) SAPN (c) SN (d) Horizontal shear (e) Vertical shear (f) Rotation (g) Gamma correction (h) Random translation (i) Scaling</title>
</caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="CMC_18040-fig-9a.png"/>
<graphic mimetype="image" mime-subtype="png" xlink:href="CMC_18040-fig-9b.png"/>
</fig>
</sec>
<sec id="s4_3">
<label>4.3</label>
<title>Statistical Analysis and Transformation Comparison</title>
<p><?A3B2 "tbl4",5,"anchor"?><xref ref-type="table" rid="table-4">Tab. 4</xref> gives the 10 runs of 10-fold cross-validation, where we can see our method achieves a sensitivity of 92.06% &#x00B1; 1.54%, a specificity of 92.56% &#x00B1; 1.06%, a precision of 92.53% &#x00B1; 1.03%, and an accuracy of 92.31% &#x00B1; 1.08%. Its F1 score, MCC, and FMI arrive at 92.29% &#x00B1; 1.10%, 84.64% &#x00B1; 2.15%, and 92.29% &#x00B1; 1.10%, respectively. The AUC is 0.9576.</p>
<p>In addition, we compared the two transformation settings: IP over UC against IP inside UC (See <xref ref-type="fig" rid="fig-4">Fig. 4</xref>). The IP inside the UC setting achieves a sensitivity of 91.84% &#x00B1; 2.18%, a specificity of 92.44% &#x00B1; 1.31%, and an accuracy of 92.14% &#x00B1; 1.12%, which are worse than IP over UC setting. This comparison result demonstrates the reason why we choose IP over UC in this study. Particularly, the receiver operating characteristics (ROC) curves of both settings are displayed in <?A3B2 "fig10",5,"anchor"?><xref ref-type="fig" rid="fig-10">Fig. 10</xref>.</p>
<table-wrap id="table-4">
<label>Table 4</label>
<caption>
<title>10 Runs of statistical analysis of proposed PZM-DSSAE method</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Run</th>
<th>Sen</th>
<th>Spc</th>
<th>Prc</th>
<th>Acc</th>
<th>F1</th>
<th>MCC</th>
<th>FMI</th>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>90.94</td>
<td>91.88</td>
<td>91.80</td>
<td>91.41</td>
<td>91.37</td>
<td>82.82</td>
<td>91.37</td>
</tr>
<tr>
<td>2</td>
<td>91.56</td>
<td>91.56</td>
<td>91.56</td>
<td>91.56</td>
<td>91.56</td>
<td>83.13</td>
<td>91.56</td>
</tr>
<tr>
<td>3</td>
<td>91.88</td>
<td>92.50</td>
<td>92.45</td>
<td>92.19</td>
<td>92.16</td>
<td>84.38</td>
<td>92.16</td>
</tr>
<tr>
<td>4</td>
<td>92.19</td>
<td>91.56</td>
<td>91.61</td>
<td>91.88</td>
<td>91.90</td>
<td>83.75</td>
<td>91.90</td>
</tr>
<tr>
<td>5</td>
<td>91.25</td>
<td>92.19</td>
<td>92.11</td>
<td>91.72</td>
<td>91.68</td>
<td>83.44</td>
<td>91.68</td>
</tr>
<tr>
<td>6</td>
<td>90.63</td>
<td>92.19</td>
<td>92.06</td>
<td>91.41</td>
<td>91.34</td>
<td>82.82</td>
<td>91.34</td>
</tr>
<tr>
<td>7</td>
<td>93.75</td>
<td>93.13</td>
<td>93.17</td>
<td>93.44</td>
<td>93.46</td>
<td>86.88</td>
<td>93.46</td>
</tr>
<tr>
<td>8</td>
<td>95.31</td>
<td>94.38</td>
<td>94.43</td>
<td>94.84</td>
<td>94.87</td>
<td>89.69</td>
<td>94.87</td>
</tr>
<tr>
<td>9</td>
<td>92.81</td>
<td>91.88</td>
<td>91.95</td>
<td>92.34</td>
<td>92.38</td>
<td>84.69</td>
<td>92.38</td>
</tr>
<tr>
<td>10</td>
<td>90.31</td>
<td>94.38</td>
<td>94.14</td>
<td>92.34</td>
<td>92.19</td>
<td>84.76</td>
<td>92.20</td>
</tr>
<tr>
<td>MSD</td>
<td>92.06 &#x00B1; 1.54</td>
<td>92.56 &#x00B1; 1.06</td>
<td>92.53 &#x00B1; 1.03</td>
<td>92.31 &#x00B1; 1.08</td>
<td>92.29 &#x00B1; 1.10</td>
<td>84.64 &#x00B1; 2.15</td>
<td>92.29 &#x00B1; 1.10</td>
</tr>
</tbody>
</table>
</table-wrap>
<fig id="fig-10">
<label>Figure 10</label>
<caption>
<title>ROC curves of two settings (a) IP over UC (b) IP inside UC</title>
</caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="CMC_18040-fig-10.png"/>
</fig>
</sec>
<sec id="s4_4">
<label>4.4</label>
<title>Comparison to State-of-the-Art Methods</title>
<p>This proposed PZM-DSSAE method is compared with 8 state-of-the-art methods. The comparison results are carried out on the same dataset <italic>via</italic> 10 runs of 10-fold cross-validation, and the results are displayed in <?A3B2 "tbl5",5,"anchor"?><xref ref-type="table" rid="table-5">Tab. 5</xref>. <?A3B2 "fig11",5,"anchor"?><xref ref-type="fig" rid="fig-11">Fig. 11</xref> displays the error bar of the proposed method against 8 state-of-the-art methods. We can see that the proposed PZM-DSSAE gives the best performance among all the methods. The reason is three folds: (i) We try to use PZM as the feature descriptors, (ii) DSSAE is used as the classifier, (iii) 18-way DA is employed to solve the overfitting problem.</p>
<table-wrap id="table-5">
<label>Table 5</label>
<caption>
<title>Comparison to state-of-the-art methods</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Method</th>
<th>Sen</th>
<th>Spc</th>
<th>Prc</th>
<th>Acc</th>
<th>F1</th>
<th>MCC</th>
<th>FMI</th>
</tr>
</thead>
<tbody>
<tr>
<td>ResNet-18 [<xref ref-type="bibr" rid="ref-11">11</xref>]</td>
<td>78.88 &#x00B1; 2.57</td>
<td>89.28 &#x00B1; 0.90</td>
<td>88.05 &#x00B1; 0.79</td>
<td>84.08 &#x00B1; 1.14</td>
<td>83.19 &#x00B1; 1.44</td>
<td>68.55 &#x00B1; 2.10</td>
<td>83.32 &#x00B1; 1.36</td>
</tr>
<tr>
<td>ELM-BA [<xref ref-type="bibr" rid="ref-12">12</xref>]</td>
<td>56.91 &#x00B1; 1.21</td>
<td>71.94 &#x00B1; 2.17</td>
<td>67.01 &#x00B1; 1.52</td>
<td>64.42 &#x00B1; 0.88</td>
<td>61.53 &#x00B1; 0.77</td>
<td>29.19 &#x00B1; 1.88</td>
<td>61.74 &#x00B1; 0.77</td>
</tr>
<tr>
<td>WEBBO [<xref ref-type="bibr" rid="ref-13">13</xref>]</td>
<td>72.94 &#x00B1; 0.96</td>
<td>73.97 &#x00B1; 1.02</td>
<td>73.70 &#x00B1; 0.79</td>
<td>73.45 &#x00B1; 0.69</td>
<td>73.31 &#x00B1; 0.71</td>
<td>46.91 &#x00B1; 1.38</td>
<td>73.32 &#x00B1; 0.71</td>
</tr>
<tr>
<td>3SBBO [<xref ref-type="bibr" rid="ref-14">14</xref>]</td>
<td>85.94 &#x00B1; 1.68</td>
<td>84.75 &#x00B1; 2.42</td>
<td>84.96 &#x00B1; 2.16</td>
<td>85.34 &#x00B1; 1.81</td>
<td>85.44 &#x00B1; 1.74</td>
<td>70.71 &#x00B1; 3.61</td>
<td>85.44 &#x00B1; 1.73</td>
</tr>
<tr>
<td>DeCovNet [<xref ref-type="bibr" rid="ref-15">15</xref>]</td>
<td>90.03 &#x00B1; 1.22</td>
<td>90.34 &#x00B1; 1.25</td>
<td>90.33 &#x00B1; 1.07</td>
<td>90.19 &#x00B1; 0.68</td>
<td>90.17 &#x00B1; 0.69</td>
<td>80.39 &#x00B1; 1.35</td>
<td>90.18 &#x00B1; 0.68</td>
</tr>
<tr>
<td>FSVC [<xref ref-type="bibr" rid="ref-16">16</xref>]</td>
<td>90.25 &#x00B1; 1.27</td>
<td>90.03 &#x00B1; 0.80</td>
<td>90.06 &#x00B1; 0.72</td>
<td>90.14 &#x00B1; 0.70</td>
<td>90.15 &#x00B1; 0.73</td>
<td>80.29 &#x00B1; 1.41</td>
<td>90.15 &#x00B1; 0.74</td>
</tr>
<tr>
<td>GoogleNet-COD [<xref ref-type="bibr" rid="ref-17">17</xref>]</td>
<td>89.44 &#x00B1; 1.59</td>
<td>82.91 &#x00B1; 1.64</td>
<td>83.98 &#x00B1; 1.16</td>
<td>86.17 &#x00B1; 0.67</td>
<td>86.61 &#x00B1; 0.68</td>
<td>72.53 &#x00B1; 1.32</td>
<td>86.66 &#x00B1; 0.68</td>
</tr>
<tr>
<td>GLCMSVM [<xref ref-type="bibr" rid="ref-18">18</xref>]</td>
<td>72.38 &#x00B1; 2.68</td>
<td>77.38 &#x00B1; 1.96</td>
<td>76.22 &#x00B1; 1.21</td>
<td>74.88 &#x00B1; 0.86</td>
<td>74.21 &#x00B1; 1.25</td>
<td>49.85 &#x00B1; 1.70</td>
<td>74.25 &#x00B1; 1.21</td>
</tr>
<tr>
<td>PZM-DSSAE (Ours)</td>
<td>92.06 &#x00B1; 1.54</td>
<td>92.56 &#x00B1; 1.06</td>
<td>92.53 &#x00B1; 1.03</td>
<td>92.31 &#x00B1; 1.08</td>
<td>92.29 &#x00B1; 1.10</td>
<td>84.64 &#x00B1; 2.15</td>
<td>92.29 &#x00B1; 1.10</td>
</tr>
</tbody>
</table>
</table-wrap>
<fig id="fig-11">
<label>Figure 11</label>
<caption>
<title>Error bar plot of method comparison</title>
</caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="CMC_18040-fig-11.png"/>
</fig>
</sec>
</sec>
<sec id="s5">
<label>5</label>
<title>Conclusion</title>
<p>This study proposed a novel PZM-DSSAE system for COVID-19 diagnosis. As far as the authors&#x2019; best known, we are the first to apply PZM to COVID-19 image analysis. Also, two other improvements are carried out: (i) DSSAE is used as the classifier, and (ii) multiple-way data augmentation is employed to generalize the classifier. Our model yields a sensitivity of 92.06% &#x00B1; 1.54%, a specificity of 92.56% &#x00B1; 1.06%, an accuracy of 92.31% &#x00B1; 1.08%, and an AUC of 0.9576.</p>
<p>In the future, we shall collect more COVID-19 images from more patients and multiple modalities. Also, other advanced AI models will be tested, such as graph neural networks and attention networks.</p>
</sec>
</body>
<back>
<fn-group>
<fn fn-type="other">
<p><bold>Funding Statement:</bold> This study was supported by Royal Society International Exchanges Cost Share Award, UK (RP202G0230); Medical Research Council Confidence in Concept Award, UK (MC_PC_17171); Hope Foundation for Cancer Research, UK (RM60G0680); Global Challenges Research Fund (GCRF), UK (P202PF11)</p>
</fn>
<fn fn-type="conflict">
<p><bold>Conflicts of Interest:</bold> The authors declare that they have no conflicts of interest to report regarding the present study.</p>
</fn>
</fn-group>
<ref-list content-type="authoryear">
<title>References</title>
<ref id="ref-1"><label>1</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>B.</given-names> <surname>Zemrani</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Gehri</surname></string-name>, <string-name><given-names>E.</given-names> <surname>Masserey</surname></string-name>, <string-name><given-names>C.</given-names> <surname>Knob</surname></string-name> and <string-name><given-names>R.</given-names> <surname>Pellaton</surname></string-name></person-group>, &#x201C;<article-title>A hidden side of the COVID-19 pandemic in children: The double burden of undernutrition and overnutrition</article-title>,&#x201D; <source>International Journal for Equity in Health</source>, vol. <volume>20</volume>, no. <issue>1</issue>, pp. <fpage>4</fpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-2"><label>2</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>M.</given-names> <surname>Bassi</surname></string-name>, <string-name><given-names>L.</given-names> <surname>Negri</surname></string-name>, <string-name><given-names>A.</given-names> <surname>Delle Fave</surname></string-name> and <string-name><given-names>R.</given-names> <surname>Accardi</surname></string-name></person-group>, &#x201C;<article-title>The relationship between post-traumatic stress and positive mental health symptoms among health workers during COVID-19 pandemic in Lombardy</article-title>,&#x201D; <source>Italy Journal of Affective Disorders</source>, vol. <volume>280</volume>, pp. <fpage>1</fpage>&#x2013;<lpage>6</lpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-3"><label>3</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>D.</given-names> <surname>Simiies</surname></string-name>, <string-name><given-names>A. R.</given-names> <surname>Stengaard</surname></string-name>, <string-name><given-names>L.</given-names> <surname>Combs</surname></string-name>, <string-name><given-names>D.</given-names> <surname>Raben</surname></string-name> and <string-name><given-names>T. C. I. A.</given-names> <surname>Euro</surname></string-name></person-group>, &#x201C;<article-title>Impact of the COVID-19 pandemic on testing services for HIV, viral hepatitis and sexually transmitted infections in the WHO European Region, march to august 2020</article-title>,&#x201D; <source>Eurosurveillance</source>, vol. <volume>25</volume>, pp. <fpage>7</fpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-4"><label>4</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>F.</given-names> <surname>Salahshour</surname></string-name>, <string-name><given-names>M.-M.</given-names> <surname>Mehrabinejad</surname></string-name>, <string-name><given-names>M. N.</given-names> <surname>Toosi</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Gity</surname></string-name>, <string-name><given-names>H.</given-names> <surname>Ghanaati</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>Clinical and chest CT features as a predictive tool for COVID-19 clinical progress: Introducing a novel semi-quantitative scoring system</article-title>,&#x201D; <source>European Radiology</source>, vol. <volume>11</volume>, pp. <fpage>1</fpage>&#x2013;<lpage>11</lpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-5"><label>5</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>A. M.</given-names> <surname>Ismael</surname></string-name> and <string-name><given-names>A.</given-names> <surname>Sengur</surname></string-name></person-group>, &#x201C;<article-title>The investigation of multiresolution approaches for chest X-ray image based COVID-19 detection</article-title>,&#x201D; <source>Health Information Science and Systems</source>, vol. <volume>8</volume>, no. <issue>1</issue>, pp. <fpage>1</fpage>&#x2013;<lpage>13</lpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-6"><label>6</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>A.</given-names> <surname>Hata</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Yanagawa</surname></string-name>, <string-name><given-names>Y.</given-names> <surname>Yoshida</surname></string-name>, <string-name><given-names>T.</given-names> <surname>Miyata</surname></string-name>, <string-name><given-names>N.</given-names> <surname>Kikuchi</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>The image quality of deep-learning image reconstruction of chest CT images on a mediastinal window setting</article-title>,&#x201D; <source>Clinical Radiology</source>, vol. <volume>76</volume>, no. <issue>2</issue>, pp. <fpage>9</fpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-7"><label>7</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>Y.</given-names> <surname>Satoh</surname></string-name>, <string-name><given-names>U.</given-names> <surname>Motosugi</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Imai</surname></string-name>, <string-name><given-names>Y.</given-names> <surname>Omiya</surname></string-name> and <string-name><given-names>H.</given-names> <surname>Onishi</surname></string-name></person-group>, &#x201C;<article-title>Evaluation of image quality at the detector&#x2019;s edge of dedicated breast positron emission tomography</article-title>,&#x201D; <source>Ejnmmi Physics</source>, vol. <volume>8</volume>, no. <issue>1</issue>, pp. <fpage>14</fpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-8"><label>8</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>M. A.</given-names> <surname>Khan</surname></string-name>, <string-name><given-names>S.</given-names> <surname>Kadry</surname></string-name>, <string-name><given-names>Y.-D.</given-names> <surname>Zhang</surname></string-name>, <string-name><given-names>T.</given-names> <surname>Akram</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Sharif</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>Prediction of COVID-19-pneumonia based on selected deep features and one class kernel extreme learning machine</article-title>,&#x201D; <source>Computers &#x0026; Electrical Engineering</source>, vol. <volume>90</volume>, pp. <fpage>106960</fpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-9"><label>9</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>T.</given-names> <surname>Akram</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Attique</surname></string-name>, <string-name><given-names>S.</given-names> <surname>Gul</surname></string-name>, <string-name><given-names>A.</given-names> <surname>Shahzad</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Altaf</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>A novel framework for rapid diagnosis of COVID-19 on computed tomography scans</article-title>,&#x201D; <source>Pattern Analysis and Applications</source>, pp. <fpage>1</fpage>&#x2013;<lpage>14</lpage>, <year>2021</year>. <uri xlink:href="https://doi.org/10.1007/s10044-020-00950-0">https://doi.org/10.1007/s10044-020-00950-0</uri>.</mixed-citation></ref>
<ref id="ref-10"><label>10</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>H. T.</given-names> <surname>Rauf</surname></string-name>, <string-name><given-names>M. I. U.</given-names> <surname>Lali</surname></string-name>, <string-name><given-names>M. A.</given-names> <surname>Khan</surname></string-name>, <string-name><given-names>S.</given-names> <surname>Kadry</surname></string-name>, <string-name><given-names>H.</given-names> <surname>Alolaiyan</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>Time series forecasting of COVID-19 transmission in Asia Pacific countries using deep neural networks</article-title>,&#x201D; <source>Personal and Ubiquitous Computing</source>, vol. <volume>8</volume>, pp. <fpage>1</fpage>&#x2013;<lpage>18</lpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-11"><label>11</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><given-names>M. H.</given-names> <surname>Guo</surname></string-name> and <string-name><given-names>Y. Z.</given-names> <surname>Du</surname></string-name></person-group>, &#x201C;<article-title>Classification of thyroid ultrasound standard plane images using ResNet-18 networks</article-title>,&#x201D; in <conf-name>IEEE 13th Int. Conf. on Anti-Counterfeiting, Security, and Identification</conf-name>, <publisher-loc>Xiamen, China</publisher-loc>, pp. <fpage>324</fpage>&#x2013;<lpage>328</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-12"><label>12</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>S.</given-names> <surname>Lu</surname></string-name></person-group>, &#x201C;<article-title>A pathological brain detection system based on extreme learning machine optimized by Bat algorithm</article-title>,&#x201D; <source>CNS &#x0026; Neurological Disorders-Drug Targets</source>, vol. <volume>16</volume>, no. <issue>1</issue>, pp. <fpage>23</fpage>&#x2013;<lpage>29</lpage>, <year>2017</year>.</mixed-citation></ref>
<ref id="ref-13"><label>13</label><mixed-citation publication-type="book"><person-group person-group-type="author"><string-name><given-names>X.</given-names> <surname>Yao</surname></string-name></person-group>, &#x201C;<chapter-title>COVID-19 detection via wavelet entropy and biogeography-based optimization</chapter-title>,&#x201D; in <source>{COVID-19}: Prediction, Decision-Making, and Its Impacts</source>, <person-group person-group-type="editor"><string-name><given-names>K. C.</given-names> <surname>Santosh</surname></string-name>, <string-name><given-names>A.</given-names> <surname>Joshi</surname></string-name> (Eds.),</person-group> <publisher-loc>Cham</publisher-loc>: <publisher-name>Springer</publisher-name>, pp. <fpage>69</fpage>&#x2013;<lpage>76</lpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-14"><label>14</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>X.</given-names> <surname>Wu</surname></string-name></person-group>, &#x201C;<article-title>Diagnosis of COVID-19 by wavelet renyi entropy and three-segment biogeography-based optimization</article-title>,&#x201D; <source>International Journal of Computational Intelligence Systems</source>, vol. <volume>13</volume>, no. <issue>1</issue>, pp. <fpage>1332</fpage>&#x2013;<lpage>1344</lpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-15"><label>15</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>X. G.</given-names> <surname>Wang</surname></string-name>, <string-name><given-names>X. B.</given-names> <surname>Deng</surname></string-name>, <string-name><given-names>Q.</given-names> <surname>Fu</surname></string-name>, <string-name><given-names>Q.</given-names> <surname>Zhou</surname></string-name>, <string-name><given-names>J. P.</given-names> <surname>Feng</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>A weakly-supervised framework for COVID-19 classification and lesion localization from chest CT</article-title>,&#x201D; <source>IEEE Transactions on Medical Imaging</source>, vol. <volume>39</volume>, no. <issue>8</issue>, pp. <fpage>2615</fpage>&#x2013;<lpage>2625</lpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-16"><label>16</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>E. S. M.</given-names> <surname>El-kenawy</surname></string-name>, <string-name><given-names>A.</given-names> <surname>Ibrahim</surname></string-name>, <string-name><given-names>S.</given-names> <surname>Mirjalili</surname></string-name>, <string-name><given-names>M. M.</given-names> <surname>Eid</surname></string-name> and <string-name><given-names>S. E.</given-names> <surname>Hussein</surname></string-name></person-group>, &#x201C;<article-title>Novel feature selection and voting classifier algorithms for COVID-19 classification in CT images</article-title>,&#x201D; <source>IEEE Access</source>, vol. <volume>8</volume>, pp. <fpage>179317</fpage>&#x2013;<lpage>179335</lpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-17"><label>17</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><given-names>X.</given-names> <surname>Yu</surname></string-name>, <string-name><given-names>S.-H.</given-names> <surname>Wang</surname></string-name>, <string-name><given-names>X.</given-names> <surname>Zhang</surname></string-name> and <string-name><given-names>Y.-D.</given-names> <surname>Zhang</surname></string-name></person-group>, &#x201C;<article-title>Detection of COVID-19 by GoogLeNet-COD</article-title>,&#x201D; in <conf-name>Int. Conf. on Intelligent Computing</conf-name>, <publisher-loc>Bari, Italy</publisher-loc>, pp. <fpage>499</fpage>&#x2013;<lpage>509</lpage>, <year>2020</year>. </mixed-citation></ref>
<ref id="ref-18"><label>18</label><mixed-citation publication-type="book"><person-group person-group-type="author"><string-name><given-names>Y.</given-names> <surname>Chen</surname></string-name></person-group>, &#x201C;<chapter-title>Covid-19 classification based on gray-level co-occurrence matrix and support vector machine</chapter-title>,&#x201D; in <source>COVID-19: Prediction, Decision-Making, and Its Impacts</source>, <person-group person-group-type="editor"><string-name><given-names>K. C.</given-names> <surname>Santosh</surname></string-name>, <string-name><given-names>A.</given-names> <surname>Joshi</surname></string-name></person-group> (Eds.), <publisher-loc>Singapore</publisher-loc>: <publisher-name>Springer, Cham</publisher-name>, pp. <fpage>47</fpage>&#x2013;<lpage>55</lpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-19"><label>19</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>M. A.</given-names> <surname>Khan</surname></string-name>, <string-name><given-names>N.</given-names> <surname>Hussain</surname></string-name>, <string-name><given-names>A.</given-names> <surname>Majid</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Alhaisoni</surname></string-name>, <string-name><given-names>S. A. C.</given-names> <surname>Bukhari</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>Classification of positive COVID-19 CT scans using deep learning</article-title>,&#x201D; <source>Computers, Materials and Continua</source>, vol. <volume>66</volume>, no. <issue>2</issue>, pp. <fpage>1</fpage>&#x2013;<lpage>15</lpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-20"><label>20</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>Y. D.</given-names> <surname>Zhang</surname></string-name></person-group>, &#x201C;<article-title>A seven-layer convolutional neural network for chest CT based COVID-19 diagnosis using stochastic pooling</article-title>,&#x201D; <source>IEEE Sensors Journal</source>, vol. <volume>1</volume>, pp. <fpage>1</fpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-21"><label>21</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>M.-K.</given-names> <surname>Hu</surname></string-name></person-group>, &#x201C;<article-title>Visual pattern recognition by moment invariants</article-title>,&#x201D; <source>IRE Transactions on Information Theory</source>, vol. <volume>8</volume>, pp. <fpage>179</fpage>&#x2013;<lpage>187</lpage>, <year>1962</year>.</mixed-citation></ref>
<ref id="ref-22"><label>22</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>S. S.</given-names> <surname>Gornale</surname></string-name>, <string-name><given-names>P. U.</given-names> <surname>Patravali</surname></string-name> and <string-name><given-names>P. S.</given-names> <surname>Hiremath</surname></string-name></person-group>, &#x201C;<article-title>Automatic detection and classification of knee osteoarthritis using Hu&#x2019;s invariant moments</article-title>,&#x201D; <source>Frontiers in Robotics and AI</source>, vol. <volume>7</volume>, pp. <fpage>8</fpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-23"><label>23</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>S.</given-names> <surname>Ouchtati</surname></string-name>, <string-name><given-names>A.</given-names> <surname>Chergui</surname></string-name>, <string-name><given-names>S.</given-names> <surname>Mavromatis</surname></string-name>, <string-name><given-names>B.</given-names> <surname>Aissa</surname></string-name> and <string-name><given-names>J.</given-names> <surname>Sequeira</surname></string-name></person-group>, &#x201C;<article-title>Novel method for brain tumor classification based on use of image entropy and seven Hu&#x2019;s invariant moments</article-title>,&#x201D; <source>Traitement Du Signal</source>, vol. <volume>36</volume>, no. <issue>6</issue>, pp. <fpage>483</fpage>&#x2013;<lpage>491</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-24"><label>24</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>M. R.</given-names> <surname>Teague</surname></string-name></person-group>, &#x201C;<article-title>Image analysis via the general theory of moments</article-title>,&#x201D; <source>Journal Optical Soceity America</source>, vol. <volume>70</volume>, no. <issue>8</issue>, pp. <fpage>920</fpage>&#x2013;<lpage>930</lpage>, <year>1980</year>.</mixed-citation></ref>
<ref id="ref-25"><label>25</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>A.</given-names> <surname>Grosso</surname></string-name> and <string-name><given-names>T.</given-names> <surname>Scharf</surname></string-name></person-group>, &#x201C;<article-title>Scalar analytical expressions for the field dependence of Zernike polynomials in asymmetric optical systems with circular symmetric surfaces</article-title>,&#x201D; <source>OSA Continuum</source>, vol. <volume>3</volume>, no. <issue>10</issue>, pp. <fpage>2749</fpage>&#x2013;<lpage>2765</lpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-26"><label>26</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>C. W.</given-names> <surname>Chong</surname></string-name>, <string-name><given-names>P.</given-names> <surname>Raveendran</surname></string-name> and <string-name><given-names>R.</given-names> <surname>Mukundan</surname></string-name></person-group>, &#x201C;<article-title>The scale invariants of pseudo-Zernike moments</article-title>,&#x201D; <source>Pattern Analysis and Applications</source>, vol. <volume>6</volume>, no. <issue>3</issue>, pp. <fpage>176</fpage>&#x2013;<lpage>184</lpage>, <year>2003</year>.</mixed-citation></ref>
<ref id="ref-27"><label>27</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><given-names>N.</given-names> <surname>Borzue</surname></string-name>, <string-name><given-names>K.</given-names> <surname>Faez</surname></string-name> and <collab>IEEE</collab></person-group>, &#x201C;<article-title>Object contour detecting using pseudo zernike moment and multi-layer perceptron</article-title>,&#x201D; in <conf-name>22nd Iranian Conf. on Biomedical Engineering</conf-name>, <publisher-loc>Tehran, Iran</publisher-loc>, pp. <fpage>304</fpage>&#x2013;<lpage>308</lpage>, <year>2015</year>. </mixed-citation></ref>
<ref id="ref-28"><label>28</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>S. P.</given-names> <surname>Singh</surname></string-name> and <string-name><given-names>S.</given-names> <surname>Urooj</surname></string-name></person-group>, &#x201C;<article-title>An improved CAD system for breast cancer diagnosis based on generalized pseudo zernike moment and Ada-DEWNN classifier</article-title>,&#x201D; <source>Journal of Medical Systems</source>, vol. <volume>40</volume>, no. <issue>4</issue>, pp. <fpage>1</fpage>&#x2013;<lpage>23</lpage>, <year>2016</year>.</mixed-citation></ref>
<ref id="ref-29"><label>29</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>S.</given-names> <surname>Du</surname></string-name></person-group>, &#x201C;<article-title>Alzheimer&#x2019;s disease detection by pseudo zernike moment and linear regression classification</article-title>,&#x201D; <source>CNS &#x0026; Neurological Disorders-Drug Targets</source>, vol. <volume>16</volume>, no. <issue>1</issue>, pp. <fpage>11</fpage>&#x2013;<lpage>15</lpage>, <year>2017</year>.</mixed-citation></ref>
<ref id="ref-30"><label>30</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>Y.</given-names> <surname>Jiang</surname></string-name></person-group>, &#x201C;<article-title>Exploring a smart pathological brain detection method on pseudo Zernike moment</article-title>,&#x201D; <source>Multimedia Tools and Applications</source>, vol. <volume>77</volume>, no. <issue>17</issue>, pp. <fpage>22589</fpage>&#x2013;<lpage>22604</lpage>, <year>2018</year>.</mixed-citation></ref>
<ref id="ref-31"><label>31</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>D. K.</given-names> <surname>Mohanty</surname></string-name>, <string-name><given-names>A. K.</given-names> <surname>Parida</surname></string-name> and <string-name><given-names>S.</given-names> <surname>Suman</surname></string-name></person-group>, &#x201C;<article-title>Financial market prediction under deep learning framework using auto encoder and kernel extreme learning machine</article-title>,&#x201D; <source>Applied Soft Computing</source>, vol. <volume>99</volume>, no. <issue>3</issue>, pp. <fpage>14</fpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-32"><label>32</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>J.</given-names> <surname>Lequesne</surname></string-name> and <string-name><given-names>P.</given-names> <surname>Regnault</surname></string-name></person-group>, &#x201C;<article-title>vsgoftest: An R package for goodness-of-fit testing based on Kullback&#x2013;Leibler divergence</article-title>,&#x201D; <source>Journal of Statistical Software</source>, vol. <volume>96</volume>, no. <issue>Code Snippet 1</issue>, pp. <fpage>26</fpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-33"><label>33</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>S.-H.</given-names> <surname>Wang</surname></string-name></person-group>, &#x201C;<article-title>Covid-19 classification by FGCNet with deep feature fusion from graph convolutional network and convolutional neural network</article-title>,&#x201D; <source>Information Fusion</source>, vol. <volume>67</volume>, pp. <fpage>208</fpage>&#x2013;<lpage>229</lpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-34"><label>34</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>C.</given-names> <surname>Buitrago-Duque</surname></string-name>, <string-name><given-names>R.</given-names> <surname>Castaneda</surname></string-name> and <string-name><given-names>J.</given-names> <surname>Garcia-Sucerquia</surname></string-name></person-group>, &#x201C;<article-title>Pointwise phasor tuning for single-shot speckle noise reduction in phase wave fields</article-title>,&#x201D; <source>Optics and Lasers in Engineering</source>, vol. <volume>137</volume>, pp. <fpage>5</fpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-35"><label>35</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>K.</given-names> <surname>Vasanth</surname></string-name> and <string-name><given-names>R.</given-names> <surname>Varatharajan</surname></string-name></person-group>, &#x201C;<article-title>An adaptive content based closer proximity pixel replacement algorithm for high density salt and pepper noise removal in images</article-title>,&#x201D; <source>Journal of Ambient Intelligence and Humanized Computing</source>, vol. <volume>3</volume>, pp. <fpage>1</fpage>&#x2013;<lpage>15</lpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-36"><label>36</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>D.</given-names> <surname>Muller</surname></string-name> and <string-name><given-names>F.</given-names> <surname>Kramer</surname></string-name></person-group>, &#x201C;<article-title>MIScnn: A framework for medical image segmentation with convolutional neural networks and deep learning</article-title>,&#x201D; <source>BMC Medical Imaging</source>, vol. <volume>21</volume>, pp. <fpage>11</fpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-37"><label>37</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>J.</given-names> <surname>Radoux</surname></string-name> and <string-name><given-names>P.</given-names> <surname>Bogaert</surname></string-name></person-group>, &#x201C;<article-title>About the pitfall of erroneous validation data in the estimation of confusion matrices</article-title>,&#x201D; <source>Remote Sensing</source>, vol. <volume>12</volume>, pp. <fpage>23</fpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-38"><label>38</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>D.</given-names> <surname>Dreizin</surname></string-name>, <string-name><given-names>F.</given-names> <surname>Goldmann</surname></string-name>, <string-name><given-names>C.</given-names> <surname>LeBedis</surname></string-name>, <string-name><given-names>A.</given-names> <surname>Boscak</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Dattwyler</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>An automated deep learning method for tile AO/OTA pelvic fracture severity grading from trauma whole-body CT</article-title>,&#x201D; <source>Journal of Digital Imaging</source>, vol. <volume>21</volume>, pp. <fpage>1</fpage>&#x2013;<lpage>13</lpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-39"><label>39</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>R.</given-names> <surname>Jena</surname></string-name>, <string-name><given-names>B.</given-names> <surname>Pradhan</surname></string-name> and <string-name><given-names>A. M.</given-names> <surname>Alamri</surname></string-name></person-group>, &#x201C;<article-title>Susceptibility to seismic amplification and earthquake probability estimation using recurrent neural network model in Odisha</article-title>,&#x201D; <source>India Applied Sciences-Basel</source>, vol. <volume>10</volume>, pp. <fpage>18</fpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-40"><label>40</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>A. T.</given-names> <surname>Young</surname></string-name>, <string-name><given-names>K.</given-names> <surname>Fernandez</surname></string-name>, <string-name><given-names>J.</given-names> <surname>Pfau</surname></string-name>, <string-name><given-names>R.</given-names> <surname>Reddy</surname></string-name>, <string-name><given-names>N. A.</given-names> <surname>Cao</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>Stress testing reveals gaps in clinic readiness of image-based diagnostic artificial intelligence models</article-title>,&#x201D; <source>NPJ Digital Medicine</source>, vol. <volume>4</volume>, pp. <fpage>8</fpage>, <year>2021</year>.</mixed-citation></ref>
</ref-list>
</back>
</article>