<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.1 20151215//EN" "http://jats.nlm.nih.gov/publishing/1.1/JATS-journalpublishing1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="1.1">
<front>
<journal-meta>
<journal-id journal-id-type="pmc">BIOCELL</journal-id>
<journal-id journal-id-type="nlm-ta">BIOCELL</journal-id>
<journal-id journal-id-type="publisher-id">BIOCELL</journal-id>
<journal-title-group>
<journal-title>BIOCELL</journal-title>
</journal-title-group>
<issn pub-type="epub">1667-5746</issn>
<issn pub-type="ppub">0327-9545</issn>
<publisher>
<publisher-name>Tech Science Press</publisher-name>
<publisher-loc>USA</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">16655</article-id>
<article-id pub-id-type="doi">10.32604/biocell.2022.016655</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Tissue specific prediction of N<sup>6</sup>-methyladenine sites based on an ensemble of multi-input hybrid neural network</article-title><alt-title alt-title-type="left-running-head">Tissue specific prediction of N<sup>6</sup>-methyladenine sites based on an ensemble of multi-input hybrid neural network</alt-title><alt-title alt-title-type="right-running-head">Tissue specific prediction of N<sup>6</sup>-methyladenine sites based on an ensemble of multi-input hybrid neural network</alt-title>
</title-group>
<contrib-group content-type="authors">
<contrib id="author-1" contrib-type="author">
<name name-style="western"><surname>JIA</surname><given-names>CANGZHI</given-names></name>
<xref ref-type="aff" rid="aff-1">1</xref>
</contrib>
<contrib id="author-2" contrib-type="author">
<name name-style="western"><surname>JIN</surname><given-names>DONG</given-names></name>
<xref ref-type="aff" rid="aff-1">1</xref>
</contrib>
<contrib id="author-3" contrib-type="author">
<name name-style="western"><surname>WANG</surname><given-names>XIN</given-names></name>
<xref ref-type="aff" rid="aff-1">1</xref>
</contrib>
<contrib id="author-4" contrib-type="author">
<name name-style="western"><surname>ZHAO</surname><given-names>QI</given-names></name>
<xref ref-type="aff" rid="aff-2">2</xref>
<email>zhaoqi@lnu.edu.cn</email>
</contrib>
<aff id="aff-1"><label>1</label><institution>School of Science, Dalian Maritime University</institution>, <addr-line>Dalian, 116026</addr-line>, <country>China</country></aff>
<aff id="aff-2"><label>2</label><institution>School of Computer Science and Software Engineering, University of Science and Technology Liaoning</institution>, <addr-line>Anshan, 114051</addr-line>, <country>China</country></aff>
</contrib-group><author-notes><corresp id="cor1">&#x002A;Address correspondence to: Qi Zhao, <email>zhaoqi@lnu.edu.cn</email></corresp></author-notes>
<pub-date pub-type="epub" date-type="pub" iso-8601-date="2021-12-14"><day>14</day>
<month>12</month>
<year>2021</year></pub-date>
<volume>46</volume>
<issue>4</issue>
<fpage>1105</fpage>
<lpage>1121</lpage>
<history>
<date date-type="received"><day>12</day><month>4</month><year>2021</year></date>
<date date-type="accepted"><day>02</day><month>7</month><year>2021</year></date>
</history>
<permissions>
<copyright-statement>&#x00A9; 2022 Jia et al.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Jia et al.</copyright-holder>
<license xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>This work is licensed under a <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution 4.0 International License</ext-link>, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
</license>
</permissions>
<self-uri content-type="pdf" xlink:href="TSP_BIOCELL_16655.pdf"></self-uri>
<abstract>
<p>N<sup>6</sup>-Methyladenine is a dynamic and reversible post translational modification, which plays an essential role in various biological processes. Because of the current inability to identify m6A-containing mRNAs, computational approaches have been developed to identify m6A sites in DNA sequences. Aiming to improve prediction performance, we introduced a novel ensemble computational approach based on three hybrid deep neural networks, including a convolutional neural network, a capsule network, and a bidirectional gated recurrent unit (BiGRU) with the self-attention mechanism, to identify m6A sites in four tissues of three species. Across a total of 11 datasets, we selected different feature subsets, after optimized from 4933 dimensional features, as input for the deep hybrid neural networks. In addition, to solve the deviation caused by the relatively small number of experimentally verified samples, we constructed an ensemble model through integrating five sub-classifiers based on different training datasets. When compared through 5-fold cross-validation and independent tests, our model showed its superiority to previous methods, im6A-TS-CNN and iRNA-m6A.</p>
</abstract>
<kwd-group kwd-group-type="author">
<kwd>M6A sites</kwd>
<kwd>Deep hybrid neural networks</kwd>
<kwd>Ensemble model</kwd>
<kwd>Feature selection</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec id="s1">
<title>Introduction</title>
<p>There are more than 160 identified types of RNA post-transcriptional modifications. Among them, the 5&#x2019; cap and 3&#x2019; poly modifications play important roles in transcriptional regulation, while the function of internal modification is maintaining the stability of mRNA in eukaryotes (<xref ref-type="bibr" rid="ref-3">Cao <italic>et al</italic>., 2016</xref>; <xref ref-type="bibr" rid="ref-26">Yan <italic>et al</italic>., 2021</xref>). One of the most common internal modifications is N6-Methyladenine (m6A). Since discovered in the 1970s, it has been observed in a wide range of eukaryotes, including <italic>yeast</italic>, <italic>Arabidopsis thaliana</italic>, <italic>Drosophila</italic>, and mammals, as well as in the RNA of viruses (<xref ref-type="bibr" rid="ref-3">Cao <italic>et al</italic>., 2016</xref>; <xref ref-type="bibr" rid="ref-27">Yang <italic>et al</italic>., 2020</xref>). N<sup>6</sup>-Methyladenine is a dynamic, reversible post translational modification, and is essential in post transcriptional regulation, regulating gene expression, splicing, editing RNA and maintaining genomic stability (<xref ref-type="bibr" rid="ref-3">Cao <italic>et al</italic>., 2016</xref>). However, m6A modifications were considered static and unalterable, owing to both the ignorance of m6A demethylating enzymes and the short lifetime of most RNA species (median mammalian RNA half-lives are approximately 5 h) (<xref ref-type="bibr" rid="ref-3">Cao <italic>et al</italic>., 2016</xref>; <xref ref-type="bibr" rid="ref-26">Yan <italic>et al</italic>., 2021</xref>). The inability to identify m6A-containing mRNAs has also hindered investigation into their biological roles.</p>
<p>Developing computational tools for predicting m6A sites from DNA sequences could help overcome above-mentioned problems (<xref ref-type="bibr" rid="ref-30">Zhao <italic>et al</italic>., 2019</xref>; <xref ref-type="bibr" rid="ref-9">Li <italic>et al</italic>., 2018</xref>; <xref ref-type="bibr" rid="ref-22">Wei <italic>et al</italic>., 2018</xref>; <xref ref-type="bibr" rid="ref-5">Chen <italic>et al</italic>., 2017</xref>; <xref ref-type="bibr" rid="ref-25">Xing <italic>et al</italic>., 2017</xref>; <xref ref-type="bibr" rid="ref-18">Shahid and Maqsood, 2018</xref>; <xref ref-type="bibr" rid="ref-23">Wei <italic>et al</italic>., 2016</xref>; <xref ref-type="bibr" rid="ref-15">Qi <italic>et al</italic>., 2019</xref>; <xref ref-type="bibr" rid="ref-12">Liu <italic>et al</italic>., 2016</xref>; <xref ref-type="bibr" rid="ref-4">Chen <italic>et al</italic>., 2018</xref>). Computational methods to identify m6A sites can be classified as either shallow or deep learning, according to the classification algorithm adopted.</p>
<p>There are several representative examples of classification models based on shallow learning. <xref ref-type="bibr" rid="ref-7">Feng <italic>et al</italic>. (2019)</xref> integrated nucleotide physicochemical properties into PseKNC (Pseudo K-tuple Nucleotide Composition) and SVM (Support vector machine) so as to build a prediction tool called iDNA6mA-PseKNC. Another prediction model, SDM6A, was developed by <xref ref-type="bibr" rid="ref-1">Basith <italic>et al</italic>. (2019)</xref> to identify m6A sites in the rice genome. <xref ref-type="bibr" rid="ref-1">Basith <italic>et al</italic>. (2019)</xref> used numerical representations of nucleotides, mono-nucleotide binary encoding, di-nucleotide binary encoding, local position-specific di-nucleotide frequency, ring-function hydrogen chemical properties and K-nearest neighbor in order to select features by F-score. Then, they tried four traditional machine learning (ML) classifiers, namely SVM, ERT (extremely randomized tree), RF (random forest) and XGB (extreme gradient boosting), to predict DNA N6-methyladenine. Finally, two classifiers were integrated to construct the model. <xref ref-type="bibr" rid="ref-8">Hasan <italic>et al</italic>. (2020)</xref> implemented five encoding schemes (mono-nucleotide binary, din-ucleotide binary, k-space spectral nucleotide, k-mer, and electron&#x2013;ion interaction pseudo potential compositions) to build five single-encoding RF models for identifying the DNA m6A sites in the Rosaceae genome. They then combined the prediction probability scores of these five RF models and used a linear regression model to construct an i6mA-Fuse classifier.</p>
<p>Several predictors based on deep learning algorithms have also been developed. <xref ref-type="bibr" rid="ref-20">Tahir <italic>et al</italic>. (2019)</xref> built a deep learning automatic computing model, iDNA6mA, which could predict m6A sites by integrating one-hot encoding and a convolutional neural network (CNN). <xref ref-type="bibr" rid="ref-14">Nazari <italic>et al</italic>. (2019)</xref> used not only a convolutional neural network but also the natural language processing model Word2Vec in order to extract features from sequences automaticly, and succeeded in constructing the iN6-Methyl model, which was able to identify m6A sites in multiple species.</p>
<p>Moreover, with the deepening understanding of the spatial specificity of gene expression, there were two studies offering insight into distinguishing m6A modification sites in various tissues of <italic>human</italic>, <italic>mouse</italic> and <italic>rat</italic>. <xref ref-type="bibr" rid="ref-6">Dao <italic>et al</italic>. (2020)</xref> extracted three kinds of features, containing physical-chemical property, mono-nucleotide binary encoding and nucleotide chemical properties, and combined them with SVM to construct a predictor called iRNA-m6A. In another study, <xref ref-type="bibr" rid="ref-10">Liu <italic>et al</italic>. (2020)</xref> proposed a predictor called im6A-TS-CNN which employed one-hot encoding and CNN. But neither model gave satisfactory performance because of the limitation in the feature extraction and classifier architecture designation. These two studies did not consider location and context information, and did not pay attention to redundant information as well. In addition, the deep network architecture should be further explored and designed so that its deep feature learning ability should be promising.</p>
<p>To address these limitations, we proposed a novel computation model, considering three kinds of feature descriptors: one-hot encoding, sequence features derived from <italic>iLearn</italic>, and K-tuple nucleotide frequency pattern (KNFP), to characterize the nucleic acid sequence. To scale down the information noise caused by excessive unrelated features, we used the F-score and reduced feature dimension through a series of triple 5-fold cross-validation tests. Following this, we used a hierarchical deep learning network composed of a multi-channel convolutional neural network, a capsule network, and a bidirectional gated recurrent unit (BiGRU) with the self-attention mechanism to learn local and contextual information. Moreover, we randomly divided the positive and negative training datasets into five mutually exclusive parts of similar size, and then selected four parts combined as new training datasets, with the remaining one part adopted as cross-validation test set to optimize the model at each time. Finally, we built an ensemble model and gave the forecast labels according to the majority voting strategy. To evaluate the effectiveness of the ensemble model, we compared its performance with im6A-TS-CNN and iRNA-m6A through a 5-fold cross validation and an independent test. For all the 11 datasets, our model gave the best performance with measures of accuracy and Matthews correlation coefficient. In addition, we visualized the analysis results of the brain in <italic>human</italic>, <italic>mouse</italic> and <italic>rat</italic> using t-distributed Stochastic Neighbor Embedding (t-SNE). <xref ref-type="fig" rid="fig-1">Fig. 1</xref> demonstrates the design and optimization process of our model.</p>
<fig id="fig-1">
<label>Figure 1</label>
<caption>
<title>The framework of our model.</title></caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="BIOCELL_16655-fig-1.png"/>
</fig>
</sec>
<sec id="s2">
<title>Materials and Methods</title>
<sec id="s2_1">
<title>Datasets</title>
<p>In this study, we trained and evaluated our model on the benchmark datasets containing a total of 11 training and 11 testing datasets from <italic>human</italic>, <italic>mouse</italic> and <italic>rat</italic>, which were also used in iRNA-m6A and im6A-TS-CNN models (<xref ref-type="bibr" rid="ref-6">Dao <italic>et al</italic>., 2020</xref>; <xref ref-type="bibr" rid="ref-10">Liu <italic>et al</italic>., 2020</xref>). Each dataset contains the same number of positive and negative samples. Each sample is a 41nt-length RNA sequence with Adenine in the center. Detailed information about these datasets can be found in the work of <xref ref-type="bibr" rid="ref-6">Dao <italic>et al</italic>. (2020)</xref>.</p>
</sec>
<sec id="s2_2">
<title>Feature extraction and feature selection</title>
<p>It is vital to extract efficacious features when developing new computational model based on machine or deep learning algorithms (<xref ref-type="bibr" rid="ref-28">Zhang and Liu, 2019</xref>). In this study, we extracted three categories of features from the sequence: one-hot encoding, sequence features, and order features.</p>
</sec>
<sec id="s2_3">
<title>One-hot encoding</title>
<p>Given a DNA sequence D, its intuitive expression is</p>
<p><disp-formula id="eqn-1"><label>(1)</label>
<mml:math id="mml-eqn-1" display="block"><mml:mrow><mml:mrow><mml:mi mathvariant="normal">D</mml:mi></mml:mrow></mml:mrow><mml:mo>=</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="normal">R</mml:mi></mml:mrow></mml:mrow><mml:mn>1</mml:mn></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="normal">R</mml:mi></mml:mrow></mml:mrow><mml:mn>2</mml:mn></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="normal">R</mml:mi></mml:mrow></mml:mrow><mml:mn>3</mml:mn></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="normal">R</mml:mi></mml:mrow></mml:mrow><mml:mn>4</mml:mn></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="normal">R</mml:mi></mml:mrow></mml:mrow><mml:mn>5</mml:mn></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="normal">R</mml:mi></mml:mrow></mml:mrow><mml:mn>6</mml:mn></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="normal">R</mml:mi></mml:mrow></mml:mrow><mml:mn>7</mml:mn></mml:msub></mml:mrow><mml:mo>&#x22EF;</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="normal">R</mml:mi></mml:mrow></mml:mrow><mml:mi>L</mml:mi></mml:msub></mml:mrow></mml:math>
</disp-formula></p>
<p>where <inline-formula id="ieqn-1">
<mml:math id="mml-ieqn-1"><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="normal">R</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">I</mml:mi></mml:mrow></mml:mrow></mml:msub></mml:mrow></mml:math>
</inline-formula> represents the <inline-formula id="ieqn-2">
<mml:math id="mml-ieqn-2"><mml:mi>i</mml:mi></mml:math>
</inline-formula>-th nucleic acid residue at position <inline-formula id="ieqn-3">
<mml:math id="mml-ieqn-3"><mml:mi>i</mml:mi></mml:math>
</inline-formula> in the DNA sequence.</p>
<p>Each sequence with 41 nt is represented with a 41 &#x00D7; 41 vector, in which (1, 0, 0, 0) stands for G, (0, 1, 0, 0) stands for C, (0, 0, 1, 0) stands for U, and (0, 0, 0, 1) stands for A.</p>
</sec>
<sec id="s2_4">
<title>Sequence features</title>
<p>Transforming a DNA sequence sample into a vector based on its sequence characteristic composition is a simple but universal strategy, which can capture significant biological information (<xref ref-type="bibr" rid="ref-31">Zhen <italic>et al</italic>., 2020</xref>; <xref ref-type="bibr" rid="ref-32">Zou <italic>et al</italic>., 2019</xref>). <italic>iLearn</italic> is a comprehensive and versatile Python-based toolkit including a variety of descriptors for DNA, RNA and proteins (<xref ref-type="bibr" rid="ref-31">Zhen <italic>et al</italic>., 2020</xref>). We used <italic>iLearn</italic> to calculate and extract four types of features: nucleic acid composition, binary electron-ion interaction pseudopotentials, autocorrelation and cross-covariance, pseudo nucleic acid composition, and achieved a total of 1325 features. The names and dimensions of features used in this section are listed in <xref ref-type="table" rid="table-1">Table 1</xref>. As for the specific definitions of these features, please refer to (<xref ref-type="bibr" rid="ref-31">Zhen <italic>et al</italic>., 2020</xref>; <xref ref-type="bibr" rid="ref-32">Zou <italic>et al</italic>., 2019</xref>).</p>
</sec>
<sec id="s2_5">
<title>K-tuple nucleotide frequency pattern</title>
<p>KNFP integrates the information from K-mer as well as one-hot encoding, and can compensate for insufficient short-range or local sequence order information effectively (<xref ref-type="bibr" rid="ref-27">Yang <italic>et al</italic>., 2020</xref>). It has been used to identify protein-RNA binding sites and protein-circRNA interaction sites (<xref ref-type="bibr" rid="ref-27">Yang <italic>et al</italic>., 2020</xref>). K-mer can map any DNA sequence to a vector with <inline-formula id="ieqn-4">
<mml:math id="mml-ieqn-4"><mml:mrow><mml:msup><mml:mn>4</mml:mn><mml:mrow><mml:mrow><mml:mi mathvariant="normal">k</mml:mi></mml:mrow></mml:mrow></mml:msup></mml:mrow></mml:math>
</inline-formula> dimensions as follows:</p>
<p><disp-formula id="eqn-2"><label>(2)</label>
<mml:math id="mml-eqn-2" display="block"><mml:mrow><mml:mrow><mml:mi mathvariant="normal">R</mml:mi></mml:mrow></mml:mrow><mml:mo>=</mml:mo><mml:mo stretchy="false">[</mml:mo><mml:msub><mml:mi>&#x03C6;</mml:mi><mml:mn>1</mml:mn></mml:msub><mml:msub><mml:mi>&#x03C6;</mml:mi><mml:mn>2</mml:mn></mml:msub><mml:mo>&#x22EF;</mml:mo><mml:msub><mml:mi>&#x03C6;</mml:mi><mml:mi>u</mml:mi></mml:msub><mml:mo>&#x22EF;</mml:mo><mml:msub><mml:mi>&#x03C6;</mml:mi><mml:mn>4</mml:mn></mml:msub><mml:mi>k</mml:mi><mml:msup><mml:mo stretchy="false">]</mml:mo><mml:mrow><mml:mi mathvariant="normal">T</mml:mi></mml:mrow></mml:msup></mml:math>
</disp-formula></p>
<p>where <inline-formula id="ieqn-5">
<mml:math id="mml-ieqn-5"><mml:mrow><mml:msub><mml:mi>&#x03C6;</mml:mi><mml:mrow><mml:mrow><mml:mi mathvariant="normal">u</mml:mi></mml:mrow></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">u</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:mrow><mml:mo>,</mml:mo><mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:mrow><mml:mo>,</mml:mo><mml:mo>&#x22EF;</mml:mo><mml:mo>,</mml:mo><mml:mrow><mml:msup><mml:mrow><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">k</mml:mi></mml:mrow></mml:mrow></mml:msup></mml:mrow></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:math>
</inline-formula> is the frequency of the <italic>u</italic>-th k-mer along the sequence (k &#x003D; 1,2,3 in this work).</p>
<p>On one hand, <inline-formula id="ieqn-6">
<mml:math id="mml-ieqn-6"><mml:mrow><mml:mrow><mml:mi mathvariant="normal">R</mml:mi></mml:mrow></mml:mrow></mml:math>
</inline-formula> can be transformed to a diagonal matrix <bold>R</bold><sub><bold>D</bold></sub>, if multiplied by the identity matrix. On the other hand, for a DNA sequence D of length <italic>L</italic>, the number of <italic>k</italic>-mer is <italic>L</italic><inline-formula id="ieqn-7">
<mml:math id="mml-ieqn-7"><mml:mo>&#x2212;</mml:mo></mml:math>
</inline-formula><italic>k &#x002B; 1</italic>. Each k-mer can be encoded as a one-hot vector with dimension of <inline-formula id="ieqn-8">
<mml:math id="mml-ieqn-8"><mml:mrow><mml:msup><mml:mn>4</mml:mn><mml:mrow><mml:mrow><mml:mi mathvariant="normal">k</mml:mi></mml:mrow></mml:mrow></mml:msup></mml:mrow></mml:math>
</inline-formula>. The product of the <inline-formula id="ieqn-9">
<mml:math id="mml-ieqn-9"><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mi>L</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mi>k</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x2217;</mml:mo><mml:mrow><mml:msup><mml:mn>4</mml:mn><mml:mrow><mml:mrow><mml:mi mathvariant="normal">k</mml:mi></mml:mrow></mml:mrow></mml:msup></mml:mrow></mml:math>
</inline-formula> matrix (M) and <bold>R</bold><sub><bold>D</bold></sub> is KNFP.</p>
<table-wrap id="table-1"><label>Table 1</label>
<caption>
<title>The information of features set in this study</title></caption>
<table><colgroup>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Descriptor groups</th>
<th>Descriptor</th>
<th>Dimension</th>
</tr>
</thead>
<tbody>
<tr>
<td>Nucleic acid composition</td>
<td>Nucleic Acid Composition (NAC)</td>
<td>4</td>
</tr>
<tr>
<td></td>
<td>Enhanced Nucleic Acid Composition (ENAC)</td>
<td>148</td>
</tr>
<tr>
<td></td>
<td>Di-Nucleotide Composition (DNC)</td>
<td>16</td>
</tr>
<tr>
<td></td>
<td>Tri-Nucleotide Composition (TNC)</td>
<td>64</td>
</tr>
<tr>
<td></td>
<td>Composition of k-spaced Nucleic Acid Pairs (CKSNAP)</td>
<td>64</td>
</tr>
<tr>
<td></td>
<td>Basic kmer (Kmer)</td>
<td>84</td>
</tr>
<tr>
<td></td>
<td>Reverse Compliment Kmer (RCKmer)</td>
<td>12</td>
</tr>
<tr>
<td>Binary</td>
<td>Binary (Binary)</td>
<td>164</td>
</tr>
<tr>
<td>Electron-ion interaction pseudopotentials</td>
<td>Electron-ion interaction pseudopotentials of trinucleotide (EIIP),</td>
<td>41</td>
</tr>
<tr>
<td></td>
<td>Electron-ion interaction pseudopotentials of trinucleotide (PseEIIP)</td>
<td>64</td>
</tr>
<tr>
<td>Autocorrelation and cross-covariance</td>
<td>Dinucleotide-based Auto Covariance (DAC)</td>
<td>30</td>
</tr>
<tr>
<td></td>
<td>Dinucleotide-based Cross Covariance (DCC)</td>
<td>150</td>
</tr>
<tr>
<td></td>
<td>Dinucleotide-based Auto-Cross Covariance (DACC)</td>
<td>180</td>
</tr>
<tr>
<td></td>
<td>Trinucleotide-based Auto Covariance (TAC)</td>
<td>10</td>
</tr>
<tr>
<td></td>
<td>Trinucleotide-based Cross Covariance (TCC)</td>
<td>10</td>
</tr>
<tr>
<td></td>
<td>Trinucleotide-based Auto-Cross Covariance (TACC)</td>
<td>20</td>
</tr>
<tr>
<td>Pseudo nucleic acid composition</td>
<td>Pseudo Dinucleotide Composition (PseDNC)</td>
<td>18</td>
</tr>
<tr>
<td></td>
<td>Pseudo k-tupler Composition (PseKNC)</td>
<td>66</td>
</tr>
<tr>
<td></td>
<td>Parallel Correlation Pseudo Dinucleotide Composition (PCPseDNC)</td>
<td>18</td>
</tr>
<tr>
<td></td>
<td>Parallel Correlation Pseudo Trinucleotide Composition (PCPseTNC)</td>
<td>66</td>
</tr>
<tr>
<td></td>
<td>Series Correlation Pseudo Dinucleotide Composition (SCPseDNC)</td>
<td>28</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>For example, given a sequence <bold>S</bold> &#x003D; &#x2018;ACGACGAA&#x2019;, 1-mer is encoded as one-hot vectors: G &#x003D; (1, 0, 0, 0), C &#x003D; (0, 1, 0, 0), U &#x003D; (0, 0, 1, 0), and A &#x003D; (0, 0, 0, 1). Then according to the position information, <bold>S</bold> can be transformed to a matrix as follows:</p>
<p><disp-formula id="eqn-16">
<mml:math id="mml-eqn-16" display="block"><mml:mrow><mml:mrow><mml:mi mathvariant="normal">M</mml:mi></mml:mrow></mml:mrow><mml:mo>=</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mtable rowspacing="4pt" columnspacing="1em"><mml:mtr><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>1</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>1</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>1</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>1</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>1</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>1</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>1</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>1</mml:mn></mml:mtd></mml:mtr></mml:mtable></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:math>
</disp-formula></p>
<p>For <bold>S</bold>, the frequency vector of 1-mer (G, C, U, A) is R &#x003D; (0.25, 0.25, 0, 0.5). Then through multiplying R by an identity matrix, it is converted to a diagonal matrix as follows:</p>
<p><disp-formula id="eqn-17">
<mml:math id="mml-eqn-17" display="block"><mml:mrow><mml:mrow><mml:msub><mml:mi mathvariant="bold">R</mml:mi><mml:mi mathvariant="bold">D</mml:mi></mml:msub></mml:mrow></mml:mrow><mml:mo>=</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mtable rowspacing="4pt" columnspacing="1em"><mml:mtr><mml:mtd><mml:mrow><mml:mn>0.25</mml:mn></mml:mrow></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mrow><mml:mn>0.25</mml:mn></mml:mrow></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mrow><mml:mn>0.5</mml:mn></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:math>
</disp-formula></p>
<p>The KNFP <inline-formula id="ieqn-10">
<mml:math id="mml-ieqn-10"><mml:mo>=</mml:mo><mml:mrow><mml:mrow><mml:mi mathvariant="normal">M</mml:mi></mml:mrow></mml:mrow><mml:mo>&#x00D7;</mml:mo><mml:mrow><mml:mrow><mml:mi mathvariant="bold">R</mml:mi><mml:mi mathvariant="bold">D</mml:mi></mml:mrow></mml:mrow></mml:math>
</inline-formula> is given as</p>
<p><disp-formula id="eqn-3">
<mml:math id="mml-eqn-3" display="block"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mtable rowspacing="4pt" columnspacing="1em"><mml:mtr><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>1</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>1</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>1</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>1</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>1</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>1</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>1</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>1</mml:mn></mml:mtd></mml:mtr></mml:mtable></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mtable rowspacing="4pt" columnspacing="1em"><mml:mtr><mml:mtd><mml:mrow><mml:mn>0.25</mml:mn></mml:mrow></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mrow><mml:mn>0.25</mml:mn></mml:mrow></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mrow><mml:mn>0.5</mml:mn></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mo>=</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mtable><mml:mtr><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mrow><mml:mn>0.5</mml:mn></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mrow><mml:mn>0.25</mml:mn></mml:mrow></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:mn>0.25</mml:mn></mml:mrow></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mrow><mml:mn>0.5</mml:mn></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mrow><mml:mn>0.25</mml:mn></mml:mrow></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:mn>0.25</mml:mn></mml:mrow></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mrow><mml:mn>0.5</mml:mn></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mrow><mml:mn>0.5</mml:mn></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math>
</disp-formula></p>
</sec>
<sec id="s2_6">
<title>Feature optimization</title>
<p>Experiments have shown that excessive feature information can interfere with the performance of classifiers. Therefore, feature selection methods should be applied to find the most informative features for training classifiers and thus reduce the dimensionality of the feature vector. F-score has been extensively applied in bioinformatics because of its effectiveness in balancing accuracy and stability (<xref ref-type="bibr" rid="ref-2">Bui <italic>et al</italic>., 2016</xref>; <xref ref-type="bibr" rid="ref-9">Li <italic>et al</italic>., 2018</xref>).</p>
<p>The F-score of the <italic>j</italic>-th feature is defined as</p>
<p><disp-formula id="eqn-18"><label>(3)</label>
<mml:math id="mml-eqn-18" display="block"><mml:mtable columnalign="left" rowspacing=".5em" columnspacing="thickmathspace" displaystyle="true"><mml:mtr><mml:mtd><mml:mrow><mml:mrow><mml:mi mathvariant="normal">F</mml:mi></mml:mrow></mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mrow><mml:mrow><mml:mi mathvariant="normal">s</mml:mi><mml:mi mathvariant="normal">c</mml:mi><mml:mi mathvariant="normal">o</mml:mi><mml:mi mathvariant="normal">r</mml:mi><mml:mi mathvariant="normal">e</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>j</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo></mml:mtd><mml:mtd><mml:mrow><mml:mfrac><mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mover><mml:mi>x</mml:mi><mml:mo stretchy="false">&#x00AF;</mml:mo></mml:mover></mml:mrow><mml:mi>j</mml:mi><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mo>+</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msubsup><mml:mo>&#x2212;</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mover><mml:mi>x</mml:mi><mml:mo stretchy="false">&#x00AF;</mml:mo></mml:mover></mml:mrow></mml:mrow><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow><mml:mn>2</mml:mn></mml:msup></mml:mrow><mml:mo>+</mml:mo><mml:mrow><mml:msup><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mover><mml:mi>x</mml:mi><mml:mo stretchy="false">&#x00AF;</mml:mo></mml:mover></mml:mrow><mml:mi>j</mml:mi><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mo>&#x2212;</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msubsup><mml:mo>&#x2212;</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mover><mml:mi>x</mml:mi><mml:mo stretchy="false">&#x00AF;</mml:mo></mml:mover></mml:mrow></mml:mrow><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow><mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mfrac><mml:mn>1</mml:mn><mml:mrow><mml:mrow><mml:msup><mml:mi>m</mml:mi><mml:mo>+</mml:mo></mml:msup></mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:mfrac></mml:mrow><mml:munderover><mml:mo movablelimits="false">&#x2211;</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mrow><mml:msup><mml:mi>m</mml:mi><mml:mo>+</mml:mo></mml:msup></mml:mrow></mml:mrow></mml:munderover><mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mover><mml:mi>x</mml:mi><mml:mo stretchy="false">&#x00AF;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mo>+</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msubsup><mml:mo>&#x2212;</mml:mo><mml:msubsup><mml:mrow><mml:mover><mml:mi>x</mml:mi><mml:mo stretchy="false">&#x00AF;</mml:mo></mml:mover></mml:mrow><mml:mi>j</mml:mi><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mo>+</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msubsup></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow><mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:mrow><mml:mo>+</mml:mo><mml:mrow><mml:mfrac><mml:mn>1</mml:mn><mml:mrow><mml:mrow><mml:msup><mml:mi>m</mml:mi><mml:mo>&#x2212;</mml:mo></mml:msup></mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:mfrac></mml:mrow><mml:munderover><mml:mo movablelimits="false">&#x2211;</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mrow><mml:msup><mml:mi>m</mml:mi><mml:mo>&#x2212;</mml:mo></mml:msup></mml:mrow></mml:mrow></mml:munderover><mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mover><mml:mi>x</mml:mi><mml:mo stretchy="false">&#x00AF;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mo>&#x2212;</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msubsup><mml:mo>&#x2212;</mml:mo><mml:msubsup><mml:mrow><mml:mover><mml:mi>x</mml:mi><mml:mo stretchy="false">&#x00AF;</mml:mo></mml:mover></mml:mrow><mml:mi>j</mml:mi><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mo>&#x2212;</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msubsup></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow><mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:mrow></mml:mrow></mml:mfrac></mml:mrow><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math>
</disp-formula></p>
<p>where <inline-formula id="ieqn-11">
<mml:math id="mml-ieqn-11"><mml:mrow><mml:msub><mml:mrow><mml:mover><mml:mi>x</mml:mi><mml:mo stretchy="false">&#x00AF;</mml:mo></mml:mover></mml:mrow><mml:mi>j</mml:mi></mml:msub></mml:mrow><mml:mo>,</mml:mo><mml:msubsup><mml:mrow><mml:mover><mml:mi>x</mml:mi><mml:mo stretchy="false">&#x00AF;</mml:mo></mml:mover></mml:mrow><mml:mi>j</mml:mi><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mo>+</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msubsup></mml:math>
</inline-formula> and <inline-formula id="ieqn-12">
<mml:math id="mml-ieqn-12"><mml:msubsup><mml:mrow><mml:mover><mml:mi>x</mml:mi><mml:mo stretchy="false">&#x00AF;</mml:mo></mml:mover></mml:mrow><mml:mi>j</mml:mi><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mo>&#x2212;</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msubsup></mml:math>
</inline-formula> denote average values of the <italic>j</italic>-th feature in the combined positive and negative training datasets, the positive datasets, and the negative datasets, respectively. <inline-formula id="ieqn-13">
<mml:math id="mml-ieqn-13"><mml:mrow><mml:msup><mml:mi>m</mml:mi><mml:mo>+</mml:mo></mml:msup></mml:mrow></mml:math>
</inline-formula> is the total number of positive samples; <inline-formula id="ieqn-14">
<mml:math id="mml-ieqn-14"><mml:mrow><mml:msup><mml:mi>m</mml:mi><mml:mo>&#x2212;</mml:mo></mml:msup></mml:mrow></mml:math>
</inline-formula> is the total number of negative samples; <inline-formula id="ieqn-15">
<mml:math id="mml-ieqn-15"><mml:msubsup><mml:mrow><mml:mover><mml:mi>x</mml:mi><mml:mo stretchy="false">&#x00AF;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mo>+</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msubsup></mml:math>
</inline-formula> represents the <italic>j</italic>-th feature of the <italic>k</italic>-th positive sample and <inline-formula id="ieqn-16">
<mml:math id="mml-ieqn-16"><mml:msubsup><mml:mrow><mml:mover><mml:mi>x</mml:mi><mml:mo stretchy="false">&#x00AF;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mo>&#x2212;</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msubsup></mml:math>
</inline-formula> represents the <italic>j</italic>-th feature of the <italic>k</italic>-th negative sample. The higher the F-score is, the more useful the corresponding feature is for the classification.</p>
<fig id="fig-2">
<label>Figure 2</label>
<caption>
<title>The structure of the multi-input hybrid neural network.</title></caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="BIOCELL_16655-fig-2.png"/>
</fig>
</sec>
<sec id="s2_7">
<title>Multi-input hybrid neural network</title>
<p>The model architecture consisted of a multi-channel CNN, a capsule network and a BiGRU network. Each of these three networks has, respectively, been shown to be effective in object detection (<xref ref-type="bibr" rid="ref-9">Li <italic>et al</italic>., 2018</xref>), protein post-translational modification site prediction (<xref ref-type="bibr" rid="ref-21">Wang <italic>et al</italic>., 2019</xref>) and social bots detection (<xref ref-type="bibr" rid="ref-24">Wu <italic>et al</italic>., 2020</xref>). The structure of the multi-input hybrid neural network is shown in <xref ref-type="fig" rid="fig-2">Fig. 2</xref>.</p>
</sec>
<sec id="s2_8">
<title>Multi-channel CNN</title>
<p>The input of the multi-input hybrid neural network is the one-hot encoding, sequence features and KNFP, respectively. For each type of features, we applied 32 convolution filters and performed batch normalization to readjust the data distribution. The input of a batch in the neural network is <inline-formula id="ieqn-17">
<mml:math id="mml-ieqn-17"><mml:mrow><mml:mrow><mml:mi mathvariant="normal">X</mml:mi></mml:mrow></mml:mrow><mml:mo>=</mml:mo><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn>1</mml:mn></mml:msub></mml:mrow><mml:mo>,</mml:mo><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn>2</mml:mn></mml:msub></mml:mrow><mml:mo>,</mml:mo><mml:mo>&#x22EF;</mml:mo><mml:mo>,</mml:mo><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:math>
</inline-formula>, where <inline-formula id="ieqn-18">
<mml:math id="mml-ieqn-18"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math>
</inline-formula> represents a sample and <italic>n</italic> is the batch size.</p>
<p>The mean value of elements in each batch is obtained by:</p>
<p><disp-formula id="eqn-4"><label>(4)</label>
<mml:math id="mml-eqn-4" display="block"><mml:mrow><mml:msub><mml:mi>&#x03BC;</mml:mi><mml:mi>B</mml:mi></mml:msub></mml:mrow><mml:mo>=</mml:mo><mml:mrow><mml:mfrac><mml:mn>1</mml:mn><mml:mi>n</mml:mi></mml:mfrac></mml:mrow><mml:munderover><mml:mo movablelimits="false">&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:munderover><mml:mrow><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mrow></mml:math>
</disp-formula></p>
<p>Then, the variance of a batch is calculated by</p>
<p><disp-formula id="eqn-5"><label>(5)</label>
<mml:math id="mml-eqn-5" display="block"><mml:msubsup><mml:mi>&#x03C3;</mml:mi><mml:mi>B</mml:mi><mml:mn>2</mml:mn></mml:msubsup><mml:mo>=</mml:mo><mml:mrow><mml:mfrac><mml:mn>1</mml:mn><mml:mi>n</mml:mi></mml:mfrac></mml:mrow><mml:munderover><mml:mo movablelimits="false">&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:munderover><mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mrow><mml:msub><mml:mi>&#x03BC;</mml:mi><mml:mi>B</mml:mi></mml:msub></mml:mrow></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow><mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:mrow></mml:math>
</disp-formula></p>
<p>This allows us to normalize each element:</p>
<p><disp-formula id="eqn-6"><label>(6)</label>
<mml:math id="mml-eqn-6" display="block"><mml:msubsup><mml:mi>x</mml:mi><mml:mi>i</mml:mi><mml:mi mathvariant="normal">&#x2032;</mml:mi></mml:msubsup><mml:mo>=</mml:mo><mml:mrow><mml:mfrac><mml:mrow><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mrow><mml:msub><mml:mi>&#x03BC;</mml:mi><mml:mi>B</mml:mi></mml:msub></mml:mrow></mml:mrow><mml:mrow><mml:msqrt><mml:msubsup><mml:mi>&#x03C3;</mml:mi><mml:mi>B</mml:mi><mml:mn>2</mml:mn></mml:msubsup><mml:mo>+</mml:mo><mml:mi>&#x03F5;</mml:mi></mml:msqrt></mml:mrow></mml:mfrac></mml:mrow></mml:math>
</disp-formula></p>
<p>The final output of the network is given by</p>
<p><disp-formula id="eqn-7"><label>(7)</label>
<mml:math id="mml-eqn-7" display="block"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow><mml:mo>=</mml:mo><mml:mrow><mml:msub><mml:mi>&#x03B3;</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow><mml:mo>&#x22C5;</mml:mo><mml:msubsup><mml:mi>x</mml:mi><mml:mi>i</mml:mi><mml:mi mathvariant="normal">&#x2032;</mml:mi></mml:msubsup><mml:mo>+</mml:mo><mml:mrow><mml:msub><mml:mi>&#x03B2;</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math>
</disp-formula></p>
<p>where <inline-formula id="ieqn-19">
<mml:math id="mml-ieqn-19"><mml:mi>&#x03F5;</mml:mi></mml:math>
</inline-formula> is a small positive number used to prevent the denominator from being 0.</p>
<p>Finally, we merge three outputs using the Swish activation function <inline-formula id="ieqn-20">
<mml:math id="mml-ieqn-20"><mml:mi>&#x03C3;</mml:mi></mml:math>
</inline-formula>, which is defined as follows:</p>
<p><disp-formula id="eqn-8"><label>(8)</label>
<mml:math id="mml-eqn-8" display="block"><mml:mi>&#x03C3;</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mrow><mml:mo>/</mml:mo></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mn>1</mml:mn><mml:mo>+</mml:mo><mml:mi>exp</mml:mi><mml:mo>&#x2061;</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mi>x</mml:mi></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:math>
</disp-formula></p>
<p>Because of the limited size of our datasets, we add a 1 &#x00D7; 1 multi-channel CNN to enhance representational capabilities of the model. The 1 &#x00D7; 1 convolution is used to maintain the size of the feature map and integrate the information by linearly weighting the input feature map of each channel. With additional layers of such 1 &#x00D7; 1 convolution, the final extracted features would become more concise.</p>
</sec>
<sec id="s2_9">
<title>Capsule network</title>
<p>The capsule network (CapsNet) was proposed by <xref ref-type="bibr" rid="ref-17">Sabour <italic>et al</italic>. (2017)</xref> and applied in stance detection (<xref ref-type="bibr" rid="ref-29">Zhao and Yang, 2020</xref>), image recognition (<xref ref-type="bibr" rid="ref-16">Qian <italic>et al</italic>., 2020</xref>) and automated classification (<xref ref-type="bibr" rid="ref-13">Mobiny <italic>et al</italic>., 2019</xref>). Since the capsule network collects location information, it can learn a good representation from a small amount of data. We use the capsule network and focus on the hierarchical relationship of local features. The output of the multi-channel CNN is adopted as the input vector of the capsule network. We make an affine transformation of the input vector as follows:</p>
<p><disp-formula id="eqn-9"><label>(9)</label>
<mml:math id="mml-eqn-9" display="block"><mml:mrow><mml:msub><mml:mrow><mml:mover><mml:mrow><mml:mi mathvariant="normal">u</mml:mi></mml:mrow><mml:mo>&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">j</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">i</mml:mi></mml:mrow></mml:mrow></mml:mrow></mml:msub></mml:mrow><mml:mo>=</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="normal">W</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">i</mml:mi><mml:mi mathvariant="normal">j</mml:mi></mml:mrow></mml:mrow></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="normal">u</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">i</mml:mi></mml:mrow></mml:mrow><mml:mo>,</mml:mo></mml:mrow></mml:msub></mml:mrow></mml:math>
</disp-formula></p>
<p>where <inline-formula id="ieqn-21">
<mml:math id="mml-ieqn-21"><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="normal">W</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">i</mml:mi><mml:mi mathvariant="normal">j</mml:mi></mml:mrow></mml:mrow></mml:mrow></mml:msub></mml:mrow></mml:math>
</inline-formula> is the weight matrix that needs to be trained and <inline-formula id="ieqn-22">
<mml:math id="mml-ieqn-22"><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="normal">u</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">I</mml:mi></mml:mrow></mml:mrow></mml:msub></mml:mrow></mml:math>
</inline-formula> is the input vector of the capsule neural network.</p>
<p>Next, the weighted sum is applied to all the prediction vectors as follows:</p>
<p><disp-formula id="eqn-10"><label>(10)</label>
<mml:math id="mml-eqn-10" display="block"><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="normal">s</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">j</mml:mi></mml:mrow></mml:mrow></mml:msub></mml:mrow><mml:mo>=</mml:mo><mml:munder><mml:mo movablelimits="false">&#x2211;</mml:mo><mml:mrow><mml:mrow><mml:mi mathvariant="normal">i</mml:mi></mml:mrow></mml:mrow></mml:munder><mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="normal">c</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">i</mml:mi><mml:mi mathvariant="normal">j</mml:mi></mml:mrow></mml:mrow></mml:mrow></mml:msub></mml:mrow></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mover><mml:mrow><mml:mi mathvariant="normal">u</mml:mi></mml:mrow><mml:mo>&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">j</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">i</mml:mi></mml:mrow></mml:mrow></mml:mrow></mml:msub></mml:mrow><mml:mo>,</mml:mo></mml:math>
</disp-formula></p>
<p>where <inline-formula id="ieqn-23">
<mml:math id="mml-ieqn-23"><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="normal">c</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">i</mml:mi><mml:mi mathvariant="normal">j</mml:mi></mml:mrow></mml:mrow></mml:mrow></mml:msub></mml:mrow></mml:math>
</inline-formula> is the coupling coefficient in the dynamic routing process.</p>
<p>Finally, we obtain output vectors through a non-linear activation function as follows:</p>
<p><disp-formula id="eqn-11"><label>(11)</label>
<mml:math id="mml-eqn-11" display="block"><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="normal">v</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">j</mml:mi></mml:mrow></mml:mrow></mml:msub></mml:mrow><mml:mo>=</mml:mo><mml:mrow><mml:mfrac><mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="normal">s</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">j</mml:mi></mml:mrow></mml:mrow></mml:msub></mml:mrow></mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow></mml:mrow><mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:mrow><mml:mrow><mml:mn>1</mml:mn><mml:mo>+</mml:mo><mml:mrow><mml:msup><mml:mrow><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="normal">s</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">j</mml:mi></mml:mrow></mml:mrow></mml:msub></mml:mrow></mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow></mml:mrow><mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:mrow></mml:mfrac></mml:mrow><mml:mrow><mml:mfrac><mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="normal">s</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">j</mml:mi></mml:mrow></mml:mrow></mml:msub></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="normal">s</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">j</mml:mi></mml:mrow></mml:mrow></mml:msub></mml:mrow></mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow></mml:mrow></mml:mfrac></mml:mrow><mml:mo>.</mml:mo></mml:math>
</disp-formula></p>
</sec>
<sec id="s2_10">
<title>BiGRU network</title>
<p>The third segment of this model is the BiGRU network, which helps to extract deep-level features of sequences. The current hidden layer state of the BiGRU is determined by three factors: the current input <inline-formula id="ieqn-24">
<mml:math id="mml-ieqn-24"><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="normal">x</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">t</mml:mi></mml:mrow></mml:mrow></mml:msub></mml:mrow></mml:math>
</inline-formula>, the output of the forward hidden layer state at time-step (t&#x2013;1) <inline-formula id="ieqn-25">
<mml:math id="mml-ieqn-25"><mml:mover><mml:mrow><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mo>&#x2192;</mml:mo></mml:mover></mml:math>
</inline-formula> and the output of the reverse hidden layer state <inline-formula id="ieqn-26">
<mml:math id="mml-ieqn-26"><mml:mover><mml:mrow><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mo>&#x2190;</mml:mo></mml:mover></mml:math>
</inline-formula>. BiGRU can be regarded as two GRUs, so the state of hidden layer <inline-formula id="ieqn-27">
<mml:math id="mml-ieqn-27"><mml:mrow><mml:msub><mml:mi>h</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math>
</inline-formula> at time-step <italic>t</italic> can be obtained by the weighted sum of the forward hidden layer state <inline-formula id="ieqn-28">
<mml:math id="mml-ieqn-28"><mml:mover><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="normal">h</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">t</mml:mi></mml:mrow></mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mo>&#x2192;</mml:mo></mml:mover></mml:math>
</inline-formula> and reverse hidden layer state <inline-formula id="ieqn-29">
<mml:math id="mml-ieqn-29"><mml:mover><mml:mrow><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mo>&#x2190;</mml:mo></mml:mover></mml:math>
</inline-formula>:</p>
<p><disp-formula id="eqn-12"><label>(12)</label>
<mml:math id="mml-eqn-12" display="block"><mml:mover><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="normal">h</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">t</mml:mi></mml:mrow></mml:mrow></mml:msub></mml:mrow><mml:mo>&#x2192;</mml:mo></mml:mover><mml:mo>=</mml:mo><mml:mspace width="thickmathspace" /><mml:mrow><mml:mrow><mml:mi mathvariant="normal">G</mml:mi><mml:mi mathvariant="normal">R</mml:mi><mml:mi mathvariant="normal">U</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="normal">x</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">t</mml:mi></mml:mrow></mml:mrow></mml:msub></mml:mrow><mml:mo>,</mml:mo><mml:mover><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="normal">h</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">t</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:mrow></mml:mrow></mml:msub></mml:mrow><mml:mo>&#x2192;</mml:mo></mml:mover></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:math>
</disp-formula></p>
<p><disp-formula id="eqn-13"><label>(13)</label>
<mml:math id="mml-eqn-13" display="block"><mml:mrow><mml:mover accent='true'><mml:mrow><mml:msub><mml:mtext>h</mml:mtext><mml:mtext>t</mml:mtext></mml:msub></mml:mrow><mml:mo stretchy='true'>&#x2192;</mml:mo></mml:mover><mml:mo>=</mml:mo><mml:mtext>&#x205F;GRU</mml:mtext><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msub><mml:mtext>x</mml:mtext><mml:mtext>t</mml:mtext></mml:msub><mml:mo>,</mml:mo><mml:mover accent='true'><mml:mrow><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy='true'>&#x2190;</mml:mo></mml:mover></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:math>
</disp-formula></p>
<p><disp-formula id="eqn-14"><label>(14)</label>
<mml:math id="mml-eqn-14" display="block"><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="normal">h</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">t</mml:mi></mml:mrow></mml:mrow></mml:msub></mml:mrow><mml:mo>=</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="normal">w</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">t</mml:mi></mml:mrow></mml:mrow></mml:msub></mml:mrow><mml:mo>&#x22C5;</mml:mo><mml:mover><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="normal">h</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">t</mml:mi></mml:mrow></mml:mrow></mml:msub></mml:mrow><mml:mo>&#x2192;</mml:mo></mml:mover><mml:mo>+</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="normal">v</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">t</mml:mi></mml:mrow></mml:mrow></mml:msub></mml:mrow><mml:mo>&#x22C5;</mml:mo><mml:mrow><mml:msub><mml:mover><mml:mrow><mml:mi mathvariant="normal">h</mml:mi></mml:mrow><mml:mo>&#x2190;</mml:mo></mml:mover><mml:mrow><mml:mrow><mml:mi mathvariant="normal">t</mml:mi></mml:mrow></mml:mrow></mml:msub></mml:mrow><mml:mo>+</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="normal">b</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">t</mml:mi></mml:mrow></mml:mrow><mml:mo>,</mml:mo></mml:mrow></mml:msub></mml:mrow></mml:math>
</disp-formula></p>
<p>where GRU<inline-formula id="ieqn-30">
<mml:math id="mml-ieqn-30"><mml:mrow><mml:mo>(</mml:mo><mml:mrow></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:math>
</inline-formula> represents a nonlinear transformation of the input word vector; <inline-formula id="ieqn-31">
<mml:math id="mml-ieqn-31"><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="normal">w</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">t</mml:mi></mml:mrow></mml:mrow></mml:msub></mml:mrow></mml:math>
</inline-formula> and <inline-formula id="ieqn-32">
<mml:math id="mml-ieqn-32"><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="normal">v</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">t</mml:mi></mml:mrow></mml:mrow></mml:msub></mml:mrow></mml:math>
</inline-formula> are the weighed matrices; and <inline-formula id="ieqn-33">
<mml:math id="mml-ieqn-33"><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="normal">b</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">t</mml:mi></mml:mrow></mml:mrow></mml:msub></mml:mrow></mml:math>
</inline-formula> is the bias term.</p>
</sec>
<sec id="s2_11">
<title>Parameter setting</title>
<p>Considering the number of datasets and the precision of the model, three feature maps were obtained from batch normalization and 1D convolution with 32 filters (kernel size &#x003D; 3). The multi-channel CNN contained three 1 &#x00D7; 1 convolution layers and took the Swish as activation function. Considering time cost, we only employed one capsule layer with 14 num_capsule (dim_capsule &#x003D; 41, routings &#x003D; 3). The BiGRU had 32 hidden units followed by a fully connected layer and used the ReLU activation function. We also used dropout with a keep probability of 0.3 to prevent the model from over fitting. For stochastic gradient descent, we selected the Adam optimization algorithm. The entire program was written in Python 3.6.</p>
</sec>
<sec id="s2_12">
<title>Performance assessment</title>
<p>To evaluate the performance of our prediction model, we used four measurements including accuracy (Acc), sensitivity (Sn), specificity (Sp), and Matthew&#x2019;s correlation coefficient (MCC) on 5-fold cross-validation and independent dataset tests. The formulas are provided as follows:</p>
<p><disp-formula id="eqn-15"><label>(15)</label>
<mml:math id="mml-eqn-15" display="block"><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:mtable rowspacing="4pt" columnspacing="1em"><mml:mtr><mml:mtd columnalign="left"><mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">S</mml:mi><mml:mi mathvariant="normal">p</mml:mi></mml:mrow></mml:mrow><mml:mspace width="thickmathspace" /><mml:mrow><mml:mrow><mml:mo>=</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mfrac><mml:mrow><mml:mrow><mml:mrow><mml:mi>T</mml:mi><mml:mi>N</mml:mi></mml:mrow></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mrow><mml:mi>T</mml:mi><mml:mi>N</mml:mi><mml:mo>+</mml:mo><mml:mi>F</mml:mi><mml:mi>P</mml:mi></mml:mrow></mml:mrow></mml:mrow></mml:mfrac></mml:mrow></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd columnalign="left"><mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">S</mml:mi><mml:mi mathvariant="normal">n</mml:mi></mml:mrow></mml:mrow><mml:mspace width="thickmathspace" /><mml:mrow><mml:mrow><mml:mo>=</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mfrac><mml:mrow><mml:mrow><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi></mml:mrow></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mrow><mml:mi>F</mml:mi><mml:mi>N</mml:mi><mml:mo>+</mml:mo><mml:mi>T</mml:mi><mml:mi>P</mml:mi></mml:mrow></mml:mrow></mml:mrow></mml:mfrac></mml:mrow></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd columnalign="left"><mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">A</mml:mi><mml:mi mathvariant="normal">c</mml:mi><mml:mi mathvariant="normal">c</mml:mi></mml:mrow></mml:mrow><mml:mspace width="thickmathspace" /><mml:mrow><mml:mrow><mml:mo>=</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mfrac><mml:mrow><mml:mrow><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi><mml:mo>+</mml:mo><mml:mi>T</mml:mi><mml:mi>N</mml:mi></mml:mrow></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi><mml:mo>+</mml:mo><mml:mi>T</mml:mi><mml:mi>N</mml:mi><mml:mo>+</mml:mo><mml:mi>F</mml:mi><mml:mi>N</mml:mi><mml:mo>+</mml:mo><mml:mi>T</mml:mi><mml:mi>P</mml:mi></mml:mrow></mml:mrow></mml:mrow></mml:mfrac></mml:mrow></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd columnalign="left"><mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">M</mml:mi><mml:mi mathvariant="normal">C</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:mrow></mml:mrow><mml:mspace width="thickmathspace" /><mml:mrow><mml:mrow><mml:mo>=</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mfrac><mml:mrow><mml:mrow><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi></mml:mrow></mml:mrow><mml:mo>&#x00D7;</mml:mo><mml:mrow><mml:mrow><mml:mi>T</mml:mi><mml:mi>N</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mi>F</mml:mi><mml:mi>P</mml:mi></mml:mrow></mml:mrow><mml:mo>&#x00D7;</mml:mo><mml:mrow><mml:mrow><mml:mi>F</mml:mi><mml:mi>N</mml:mi></mml:mrow></mml:mrow></mml:mrow><mml:mrow><mml:msqrt><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mrow><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi><mml:mo>+</mml:mo><mml:mi>F</mml:mi><mml:mi>N</mml:mi></mml:mrow></mml:mrow></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mrow><mml:mrow><mml:mi>F</mml:mi><mml:mi>P</mml:mi><mml:mo>+</mml:mo><mml:mi>T</mml:mi><mml:mi>N</mml:mi></mml:mrow></mml:mrow></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mrow><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi><mml:mo>+</mml:mo><mml:mi>F</mml:mi><mml:mi>P</mml:mi></mml:mrow></mml:mrow></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mrow><mml:mrow><mml:mi>F</mml:mi><mml:mi>N</mml:mi><mml:mo>+</mml:mo><mml:mi>T</mml:mi><mml:mi>N</mml:mi></mml:mrow></mml:mrow></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:msqrt></mml:mrow></mml:mfrac></mml:mrow></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow><mml:mo fence="true" stretchy="true" symmetric="true"></mml:mo></mml:mrow></mml:math>
</disp-formula></p>
<p>where TP, TN, FP, and FN represent the number of true positives, true negatives, false positives, and false negatives, respectively.</p>
</sec>
</sec>
<sec id="s3">
<title>Results</title>
<p>We observed a deviation in the process of 5-fold cross-validation test (<xref ref-type="table" rid="table-4">Table S1</xref>), probably due to the limited number of experimentally verified samples. A general strategy to solve the problem of insufficient samples is to construct an ensemble prediction model. On this purpose, we randomly divided the positive and negative training datasets into five mutually exclusive parts of similar size. And then, we selected the combination of four parts as a new training dataset, while the remaining one part was adopted as validation test dataset to train and optimize the three-layer hybrid neural network at each time. Thus, we got five sub models which were then integrated into a novel ensemble prediction model based on a majority voting strategy.</p>
<p>In order to verify the effectiveness of the ensemble model, we compared it with two previous prediction methods: im6A-TS-CNN and iRNA-m6A. The same training and independent datasets were used for our model, im6A-TS-CNN and iRNA-m6A; therefore, both 5-fold cross-validation and independent test could be used to evaluate these methods objectively. There was a total of 132 results from 11 datasets involving four indicators (Sn, Sp, Acc, MCC), among which our model achieved superior predictive performance as measured by average MCC and Acc. Specifically, on the 5-fold cross validation, for <italic>Homo sapiens</italic>, our model gave MCC &#x003D; 0.581, <italic>vs.</italic> 0.550 for the second-placed im6A-TS-CNN; for <italic>Musmusculus</italic>, our model gave MCC &#x003D; 0.558 <italic>vs.</italic> 0.517 for the second-placed im6A-TS-CNN; for <italic>Rattusnorvegicus</italic>, our model reached MCC &#x003D; 0.626 <italic>vs.</italic> 0.600 for the second-placed im6A-TS-CNN. On the independent dataset, for <italic>Homo sapiens</italic>, our model showed MCC &#x003D; 0.572 <italic>vs.</italic> 0.547 for the second-placed im6A-TS-CNN; for <italic>Musmusculus</italic>, our model gave MCC &#x003D; 0.546 <italic>vs.</italic> 0.525 for the second-placed im6A-TS-CNN; for <italic>Rattusnorvegicus</italic>, our model reached MCC &#x003D; 0.617 <italic>vs.</italic> 0.604 for the second-placed im6A-TS-CNN.</p>
<p>To observe the comparison results intuitively, we showed MCC values of these three models in <xref ref-type="fig" rid="fig-3">Figs. 3</xref> and <xref ref-type="fig" rid="fig-4">4</xref>. Moreover, the comparison results measured with other indicators are provided in <xref ref-type="table" rid="table-5">Tables S2</xref> and <xref ref-type="table" rid="table-6">S3</xref>.</p>
<fig id="fig-3">
<label>Figure 3</label>
<caption>
<title>Comparison with MCC measure of different models on 5-fold cross validation test.</title></caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="BIOCELL_16655-fig-3.png"/>
</fig>
<fig id="fig-4">
<label>Figure 4</label>
<caption>
<title>Comparison with MCC measure of different models on independent tests.</title></caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="BIOCELL_16655-fig-4.png"/>
</fig>
</sec>
<sec id="s4">
<title>Discussion</title>
<sec id="s4_1">
<title>Effectiveness of feature selection</title>
<p>If too many features are extracted, the generalizability of the model will be weakened. Thus it is important to determine the appropriate step size for feature selection. As the dimension of the input matrix was 41 &#x00D7; N, we chose the step size as 41 and evaluated the performance of our model with feature matrices of different dimensions (41 &#x00D7; N, <inline-formula id="ieqn-34">
<mml:math id="mml-ieqn-34"><mml:mi>N</mml:mi><mml:mo>&#x2208;</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mn>5</mml:mn><mml:mo>,</mml:mo><mml:mn>6</mml:mn><mml:mo>,</mml:mo><mml:mn>7</mml:mn><mml:mo>,</mml:mo><mml:mo>&#x22EF;</mml:mo><mml:mo>,</mml:mo><mml:mn>24</mml:mn><mml:mo>,</mml:mo><mml:mn>25</mml:mn></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:math>
</inline-formula>) on 5-fold cross-validation tests successively. To reduce the deviation caused by the fluctuation of the neural network, we ran each test three times for the 11 datasets from the brain, liver, kidney, heart, and testis of <italic>Homo sapiens</italic>, <italic>Musmusculus</italic>, and <italic>Rattusnorvegicus</italic>. The optimal feature subset was finalized according to the average accuracy. The detailed feature selection results are shown in <xref ref-type="fig" rid="fig-5">Fig. 5</xref>. As should be noticed, dimensions of optimal features were various for different datasets; their corresponding dimensions are listed in <xref ref-type="table" rid="table-2">Table 2</xref>. These results indicate that it is necessary to establish a specialized model for each tissue type in each species.</p>
<table-wrap id="table-2"><label>Table 2</label>
<caption>
<title>The dimensions of optimal features and prediction accuracy for different datasets</title></caption>
<table><colgroup>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Datasets</th>
<th>Feature Dimension</th>
<th>Acc</th>
</tr>
</thead>
<tbody>
<tr>
<td>h_b</td>
<td>41 &#x00D7; 19</td>
<td>73.91%</td>
</tr>
<tr>
<td>h_k</td>
<td>41 &#x00D7; 5</td>
<td>80.76%</td>
</tr>
<tr>
<td>h_l</td>
<td>41 &#x00D7; 5</td>
<td>81.49%</td>
</tr>
<tr>
<td>m_b</td>
<td>41 &#x00D7; 5</td>
<td>79.86%</td>
</tr>
<tr>
<td>m_h</td>
<td>41 &#x00D7; 12</td>
<td>75.87%</td>
</tr>
<tr>
<td>m_k</td>
<td>41 &#x00D7; 12</td>
<td>81.87%</td>
</tr>
<tr>
<td>m_l</td>
<td>41 &#x00D7; 11</td>
<td>73.43%</td>
</tr>
<tr>
<td>m_t</td>
<td>41 &#x00D7; 15</td>
<td>77.23%</td>
</tr>
<tr>
<td>r_b</td>
<td>41 &#x00D7; 15</td>
<td>78.23%</td>
</tr>
<tr>
<td>r_k</td>
<td>41 &#x00D7; 13</td>
<td>83.25%</td>
</tr>
<tr>
<td>r_l</td>
<td>41 &#x00D7; 7</td>
<td>82.43%</td>
</tr>
</tbody>
</table>
</table-wrap>
<fig id="fig-5">
<label>Figure 5</label>
<caption>
<title>The accuracy comparison of different feature dimension on 11 datasets of three species.</title></caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="BIOCELL_16655-fig-5.png"/>
</fig>
</sec>
<sec id="s4_2">
<title>Parameter selection</title>
<p>Generally, there are two ways to select parameters, i.e., empirical choice and Bayesian optimization. With <italic>Homo sapiens</italic> as an example, we tried both methods to find the most suitable parameters. The initial parameters were set based on a previous work to compare the prediction results roughly. Then, if the prediction model was under fitting, we attempted to add more convolution kernels and neurons; otherwise if the prediction model was over fitting, we attempted to reduce the number of convolution kernels and neurons. In addition, batch normalization, dropout, and regularization were introduced to avoid over fitting during the optimizing process. Alternatively, Bayesian optimization (<xref ref-type="bibr" rid="ref-19">Snoek <italic>et al</italic>., 2012</xref>) was used to tune the key parameters including batch size, dropout rate, filter1, filter2, pool_size and <italic>etc</italic>. Finally, the optimal parameters wereselected according to the AUC value. The Baysian optimization of parameters in models for <italic>Homo sapiens</italic> and corresponding AUC values are provided in <xref ref-type="table" rid="table-7">Table S4</xref> and <xref ref-type="fig" rid="fig-6">Fig. 6A</xref>. We compared the best results obtained by the two methods and concluded the result given by empirical adjustment parameters showed more advantages (<xref ref-type="fig" rid="fig-6">Fig. 6B</xref>). Thus, for <italic>Musmusculus</italic> and <italic>Rattusnorvegicus</italic>, we used the empirical method to determine the parameters in neural network.</p>
</sec>
<sec id="s4_3">
<title>Comparison of different classifiers</title>
<p>To verify the effectiveness of hybrid neural network, we compared its prediction performance with several traditional classification algorithms including logistic regression, decision tree, support vector machine (SVM), random forest (RF), gradient boosting decision tree (GBDT), extreme gradient boosting (XGBoost) and light gradient boosting machine (LightGBM) for <italic>Homo sapiens</italic>. The average accuracies of 5-fold cross-validation tests obtained by the seven algorithms are listed in <xref ref-type="table" rid="table-3">Table 3</xref>. On three datasets of <italic>Homo sapiens</italic>, our model achieves the mean accuracy of 74.29%, 1.28% higher than the second best algorithm, LightGBM, which demonstrates that hybrid neural network is capable of improving the recognition performance for m6A sites of various tissues in different species.</p>
<fig id="fig-6">
<label>Figure 6</label>
<caption>
<title>(A) Boxplot comparison results among the Baysian optimization of parameters of the models for Homo sapiens measurements. (B)
The comparison results of empirical choice and Bayesian optimization. (C) The t-SNE comparison of different stage for brain tissue.</title></caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="BIOCELL_16655-fig-6a.png"/>
<graphic mimetype="image" mime-subtype="png" xlink:href="BIOCELL_16655-fig-6b.png"/>
</fig>
</sec>
<sec id="s4_4">
<title>Visualization of feature representations</title>
<p>To observe the effectiveness of extracted features intuitively, we applied t-distributed stochastic neighbor embedding (t-SNE) to visualize the feature representations. We took the brain tissue as an example and demonstrated the features after mapped through the concatenate layer and the attention layer. Each dot in <xref ref-type="fig" rid="fig-6">Fig. 6C</xref> represents a sample, with purple dots representing m6A sites and yellow dots representing non-m6A sites. The overlapping of the two sample types on the left side of <xref ref-type="fig" rid="fig-6">Fig. 6C</xref> indicates that it is difficult to distinguish m6A sites from non-m6A sites. However, the features were relatively separated, as shown on the right side of <xref ref-type="fig" rid="fig-6">Fig. 6C</xref>, after selected and processed by the deep hierarchical network, which was, multi-channel CNN, capsule network and BiGRU network with the self-attention mechanism. The t-SNE plots indicated that the hybrid deep hierarchical networks could learn sequence motif information from selected features. But there are still some overlaps between the two types, indicating that our model is not completely specific for all m6A sites. This fact is consistent with the need to establish specialized models for each tissue type.</p>
<table-wrap id="table-3"><label>Table 3</label>
<caption>
<title>Comparison of different classifiers for identifying m6A sites on 5-fold cross-validation</title></caption>
<table><colgroup>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Method</th>
<th>h_b</th>
<th>h_k</th>
<th>h_l</th>
</tr>
</thead>
<tbody>
<tr>
<td>Support Vector Machines</td>
<td>71.82</td>
<td>79.03</td>
<td>79.90</td>
</tr>
<tr>
<td>Decision Tree</td>
<td>62.75</td>
<td>71.29</td>
<td>71.24</td>
</tr>
<tr>
<td>Logistic Regression</td>
<td>65.81</td>
<td>71.33</td>
<td>71.91</td>
</tr>
<tr>
<td>Random forest</td>
<td>70.92</td>
<td>78.49</td>
<td>79.65</td>
</tr>
<tr>
<td>GBDT</td>
<td>73.55</td>
<td>80.31</td>
<td>80.77</td>
</tr>
<tr>
<td>XGBoost</td>
<td>72.13</td>
<td>79.14</td>
<td>80.16</td>
</tr>
<tr>
<td>LightGBM</td>
<td>73.01</td>
<td>80.00</td>
<td>80.68</td>
</tr>
<tr>
<td><bold>Our model</bold></td>
<td><bold>74.29</bold></td>
<td><bold>80.75</bold></td>
<td><bold>81.80</bold></td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
</sec>
<sec id="s5">
<title>Conclusion</title>
<p>In this work, we introduced a novel ensemble computational approach to identify m6A sites, based on three hybrid neural networks. For different tissues of different species, we selected different optimized feature subsets from 4933 features as multi-input for the deep hybrid neural networks. Our predication model consisted of a multi-channel CNN, a capsule network and a BiGRU network with the self-attention mechanism, and was evaluated on 11 datasets. To solve the deviation caused by the relatively small number of experimentally verified samples, we constructed an ensemble model through integrating five sub-classifiers based on different training datasets. To estimate the performance of this model, comparisons were made on 11 datasets by 5-fold cross validation and independent test datasets. Results of all tests revealed that when measured with Acc and MCC, our model is superior to two previous tools, iRNA-m6A and im6A-TS-CNN. However, the specificity of our model is not satisfactory on h_b, h_k, m_h, m_k and r_b datasets. In future work, we will extract more types of information and further optimize these models.</p>
</sec>
</body>
<back><fn-group>
<fn fn-type="other">
<p><bold>Availability of Data and Materials:</bold> The data sets and source code used in this study are freely available at https://github.com/Dong7777/im6A.</p>
</fn>
<fn fn-type="other">
<p><bold>Author Contribution:</bold> QZ and XW conceived the project, developed the prediction method, designed and implemented the experiments, analyzed the results, and wrote the paper. DJ and CZJ implemented the experiments, analyzed the results, and wrote the paper. All authors read and approved the final manuscript.</p>
</fn>
<fn fn-type="other">
<p><bold>Funding Statement:</bold> This work was supported by the National Natural Science Foundation of China (Nos. 62071079 and 61803065).</p>
</fn>
<fn fn-type="conflict">
<p><bold>Conflicts of Interest:</bold> The authors declare that they have no conflicts of interest to report regarding the present study.</p>
</fn>
</fn-group>
<ref-list content-type="authoryear">
<title>References</title>
<ref id="ref-1"><label>Basith <italic>et al</italic>. (2019)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Basith</surname> <given-names>S</given-names></string-name>, <string-name><surname>Manavalan</surname> <given-names>B</given-names></string-name>, <string-name><surname>Shin</surname> <given-names>TH</given-names></string-name>, <string-name><surname>Lee</surname> <given-names>G</given-names></string-name></person-group> (<year>2019</year>). <article-title>SDM6A: A web-based integrative machine-learning framework for predicting 6mA sites in the rice genome</article-title>. <source>Molecular Therapy Nucleic Acids</source> <volume>18</volume>: <fpage>131</fpage>&#x2013;<lpage>141</lpage>. DOI <pub-id pub-id-type="doi">10.1016/j.omtn.2019.08.011</pub-id>.</mixed-citation></ref>
<ref id="ref-2"><label>Bui <italic>et al</italic>. (2016)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Bui</surname> <given-names>VM</given-names></string-name>, <string-name><surname>Weng</surname> <given-names>SL</given-names></string-name>, <string-name><surname>Lu</surname> <given-names>CT</given-names></string-name>, <string-name><surname>Chang</surname> <given-names>TH</given-names></string-name>, <string-name><surname>Weng</surname> <given-names>JT</given-names></string-name>, <string-name><surname>Lee</surname> <given-names>TY</given-names></string-name></person-group> (<year>2016</year>). <article-title>SOHSite: Incorporating evolutionary information and physicochemical properties to identify protein S-sulfenylation sites</article-title>. <source>BMC Genomics</source> <volume>17</volume>: <fpage>1</fpage>&#x2013;<lpage>9</lpage>. DOI <pub-id pub-id-type="doi">10.1186/s12864-015-2299-1</pub-id>.</mixed-citation></ref>
<ref id="ref-3"><label>Cao <italic>et al</italic>. (2016)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Cao</surname> <given-names>G</given-names></string-name>, <string-name><surname>Li</surname> <given-names>HB</given-names></string-name>, <string-name><surname>Yin</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Flavell</surname> <given-names>RA</given-names></string-name></person-group> (<year>2016</year>). <article-title>Recent advances in dynamic m6A RNA modification</article-title>. <source>Open Biology</source> <volume>6</volume>: <fpage>160003</fpage>. DOI <pub-id pub-id-type="doi">10.1098/rsob.160003</pub-id>.</mixed-citation></ref>
<ref id="ref-4"><label>Chen <italic>et al</italic>. (2018)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Chen</surname> <given-names>W</given-names></string-name>, <string-name><surname>Feng</surname> <given-names>P</given-names></string-name>, <string-name><surname>Yang</surname> <given-names>H</given-names></string-name>, <string-name><surname>Ding</surname> <given-names>H</given-names></string-name>, <string-name><surname>Lin</surname> <given-names>H</given-names></string-name>, <string-name><surname>Chou</surname> <given-names>KC</given-names></string-name></person-group> (<year>2018</year>). <article-title>iRNA-3typeA: Identifying three types of modification at RNA&#x2019;s adenosine sites, molecular therapy</article-title>. <source>Molecular Therapy Nucleic Acids</source> <volume>11</volume>: <fpage>468</fpage>&#x2013;<lpage>474</lpage>. DOI <pub-id pub-id-type="doi">10.1016/j.omtn.2018.03.012</pub-id>.</mixed-citation></ref>
<ref id="ref-5"><label>Chen <italic>et al</italic>. (2017)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Chen</surname> <given-names>W</given-names></string-name>, <string-name><surname>Xing</surname> <given-names>P</given-names></string-name>, <string-name><surname>Zou</surname> <given-names>Q</given-names></string-name></person-group> (<year>2017</year>). <article-title>Detecting N6-methyladenosine sites from RNA transcriptomes using ensemble support vector machines</article-title>. <source>Scientific Reports</source> <volume>7</volume>: <fpage>40242</fpage>. DOI <pub-id pub-id-type="doi">10.1038/srep40242</pub-id>.</mixed-citation></ref>
<ref id="ref-6"><label>Dao <italic>et al</italic>. (2020)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Dao</surname> <given-names>FY</given-names></string-name>, <string-name><surname>Lv</surname> <given-names>H</given-names></string-name>, <string-name><surname>Yang</surname> <given-names>YH</given-names></string-name>, <string-name><surname>Zulfiqar</surname> <given-names>H</given-names></string-name>, <string-name><surname>Lin</surname> <given-names>H</given-names></string-name></person-group> (<year>2020</year>). <article-title>Computational identification of N6-methyladenosine sites in multiple tissues of mammals</article-title>. <source>Computational and Structural Biotechnology Journal</source> <volume>18</volume>: <fpage>1084</fpage>&#x2013;<lpage>1091</lpage>. DOI <pub-id pub-id-type="doi">10.1016/j.csbj.2020.04.015</pub-id>.</mixed-citation></ref>
<ref id="ref-7"><label>Feng et al. (2019)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Feng</surname> <given-names>P</given-names></string-name>, <string-name><surname>Yang</surname> <given-names>H</given-names></string-name>, <string-name><surname>Ding</surname> <given-names>H</given-names></string-name>, <string-name><surname>Lin</surname> <given-names>H</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>W</given-names></string-name> <etal>et al.</etal></person-group> (<year>2019</year>). <article-title>iDNA6mA-PseKNC: Identifying DNA n6-methyladenosine sites by incorporating nucleotide physicochemical properties into PseKNC</article-title>. <source>Genomics</source> <volume>111</volume>: <fpage>96</fpage>&#x2013;<lpage>102</lpage>. DOI <pub-id pub-id-type="doi">10.1016/j.ygeno.2018.01.005</pub-id>.</mixed-citation></ref>
<ref id="ref-8"><label>Hasan et al. (2020)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Hasan</surname> <given-names>MM</given-names></string-name>, <string-name><surname>Manavalan</surname> <given-names>B</given-names></string-name>, <string-name><surname>Shoombuatong</surname> <given-names>W</given-names></string-name>, <string-name><surname>Khatun</surname> <given-names>MS</given-names></string-name>, <string-name><surname>Kurata</surname> <given-names>H</given-names></string-name></person-group> (<year>2020</year>). <article-title>i6mA-Fuse: Improved and robust prediction of DNA 6mA sites in the Rosaceae genome by fusing multiple feature representation</article-title>. <source>Plant Molecular Biology</source> <volume>103</volume>: <fpage>225</fpage>&#x2013;<lpage>234</lpage>. DOI <pub-id pub-id-type="doi">10.1007/s11103-020-00988-y</pub-id>.</mixed-citation></ref>
<ref id="ref-9"><label>Li <italic>et al</italic>. (2018)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Li</surname> <given-names>QX</given-names></string-name>, <string-name><surname>Rong</surname> <given-names>CH</given-names></string-name>, <string-name><surname>Cai</surname> <given-names>YX</given-names></string-name>, <string-name><surname>Ran</surname> <given-names>S</given-names></string-name>, <string-name><surname>Yi</surname> <given-names>WL</given-names></string-name></person-group> (<year>2018</year>). <article-title>M6AMRFS: Robust prediction of N6-methyladenosine sites with sequence-based features in multiple species</article-title>. <source>Frontiers in Genetics</source> <volume>9</volume>: <fpage>495</fpage>. <comment>DOI 10.3389/fgene.2018.00495</comment>.</mixed-citation></ref>
<ref id="ref-10"><label>Liu et al. (2020)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Liu</surname> <given-names>K</given-names></string-name>, <string-name><surname>Cao</surname> <given-names>L</given-names></string-name>, <string-name><surname>Du</surname> <given-names>P</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>W</given-names></string-name></person-group> (<year>2020</year>). <article-title>im6A-TS-CNN: Identifying the N(6)-methyladenine site in multiple tissues by using the convolutional neural network</article-title>. <source>Molecular Therapy Nucleic Acids</source> <volume>21</volume>: <fpage>1044</fpage>&#x2013;<lpage>1049</lpage>. DOI <pub-id pub-id-type="doi">10.1016/j.omtn.2020.07.034</pub-id>.</mixed-citation></ref>
<ref id="ref-11"><label>Liu and Liu (2017)</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><surname>Liu</surname> <given-names>S</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>Z</given-names></string-name></person-group> (<year>2017</year>). <article-title>Multi-channel CNN-based object detection for enhanced situation awareness</article-title>. <italic>Sensors and Electronics Technology (SET) Panel Symposium SET-241 on 9th NATO Military Sensing Symposium</italic>.</mixed-citation></ref>
<ref id="ref-12"><label>Liu <italic>et al</italic>. (2016)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Liu</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Xiao</surname> <given-names>X</given-names></string-name>, <string-name><surname>Yu</surname> <given-names>DJ</given-names></string-name>, <string-name><surname>Jia</surname> <given-names>J</given-names></string-name>, <string-name><surname>Qiu</surname> <given-names>WR</given-names></string-name>, <string-name><surname>Chou</surname> <given-names>KC</given-names></string-name></person-group> (<year>2016</year>). <article-title>pRNAm-PC: Predicting N(6)-methyladenosine sites in RNA sequences via physical-chemical properties</article-title>. <source>Analytical Biochemistry</source> <volume>497</volume>: <fpage>60</fpage>&#x2013;<lpage>67</lpage>. DOI <pub-id pub-id-type="doi">10.1016/j.ab.2015.12.017</pub-id>.</mixed-citation></ref>
<ref id="ref-13"><label>Mobiny <italic>et al</italic>. (2019)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Mobiny</surname> <given-names>A</given-names></string-name>, <string-name><surname>Lu</surname> <given-names>H</given-names></string-name>, <string-name><surname>Nguyen</surname> <given-names>HV</given-names></string-name>, <string-name><surname>Roysam</surname> <given-names>B</given-names></string-name>, <string-name><surname>Varadarajan</surname> <given-names>N</given-names></string-name></person-group> (<year>2019</year>). <article-title>Automated classification of apoptosis in phase contrast microscopy using capsule network</article-title>. <source>IEEE Transactions on Medical Imaging</source> <volume>39</volume>: <fpage>1</fpage>&#x2013;<lpage>10</lpage>. DOI <pub-id pub-id-type="doi">10.1109/TMI.2019.2918181</pub-id>.</mixed-citation></ref>
<ref id="ref-14"><label>Nazari et al. (2019)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Nazari </surname> <given-names>I</given-names></string-name>, <string-name><surname>Tahir </surname> <given-names>M</given-names></string-name>, <string-name><surname>Tayara </surname> <given-names>H</given-names></string-name>, <string-name><surname>Chong</surname> <given-names>KT</given-names></string-name></person-group> (<year>2019</year>). <article-title>iN6-methyl (5-step): Identifying RNA N6-methyladenosine sites using deep learning mode via chou&#x2019;s 5-step rules and chou&#x2019;s general PseKNC</article-title>. <source>Chemometrics and Intelligent Laboratory Systems</source> <volume>193</volume>: <fpage>103811</fpage>. DOI <pub-id pub-id-type="doi">10.1016/j.chemolab.2019.103811</pub-id>.</mixed-citation></ref>
<ref id="ref-15"><label>Qi <italic>et al</italic>. (2019)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Qi</surname> <given-names>CK</given-names></string-name>, <string-name><surname>Zhen</surname> <given-names>W</given-names></string-name>, <string-name><surname>Qing</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Yu</surname> <given-names>WX</given-names></string-name>, <string-name><surname>Rong</surname> <given-names>R</given-names></string-name>, <string-name><surname>Liang</surname> <given-names>LZ</given-names></string-name>, <string-name><surname>Long</surname> <given-names>SJ</given-names></string-name>, <string-name><surname>Jo&#x00E3;o</surname> <given-names>MP</given-names></string-name>, <string-name><surname>Daniel</surname> <given-names>JR</given-names></string-name>, <string-name><surname>Jia</surname> <given-names>M</given-names></string-name></person-group> (<year>2019</year>). <article-title>WHISTLE: A high-accuracy map of the human N6-methyladenosine (m6A) epitranscriptome predicted using a machine learning approach</article-title>. <source>Nucleic Acids Research</source> <volume>47</volume>: <fpage>e41</fpage>. DOI <pub-id pub-id-type="doi">10.1093/nar/gkz074</pub-id>.</mixed-citation></ref>
<ref id="ref-16"><label>Qian <italic>et al</italic>. (2020)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Qian</surname> <given-names>K</given-names></string-name>, <string-name><surname>Tian</surname> <given-names>L</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Wen</surname> <given-names>X</given-names></string-name>, <string-name><surname>Bao</surname> <given-names>J</given-names></string-name></person-group> (<year>2020</year>). <article-title>Image robust recognition based on feature-entropy-oriented differential fusion capsule network</article-title>. <source>Applied Intelligence</source> <volume>51</volume>: <fpage>1108</fpage>&#x2013;<lpage>1117</lpage>. <comment>DOI 10.1007/s10489-020-01873-3</comment>.</mixed-citation></ref>
<ref id="ref-17"><label>Sabour et al. (2017)</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><surname>Sabour</surname> <given-names>S</given-names></string-name>, <string-name><surname>Frosst</surname> <given-names>N</given-names></string-name>, <string-name><surname>Hinton</surname> <given-names>GE</given-names></string-name></person-group> (<year>2017</year>). <article-title>Dynamic routing between capsules</article-title>. <source>31st Conference on Neural Information Processing Systems (NIPS 2017)</source>, Long Beach, CA, USA.</mixed-citation></ref>
<ref id="ref-18"><label>Shahid and Maqsood (2018)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Shahid</surname> <given-names>A</given-names></string-name>, <string-name><surname>Maqsood</surname> <given-names>H</given-names></string-name></person-group> (<year>2018</year>). <article-title>iMethyl-STTNC: Identification of N 6-methyladenosine sites by extending the idea of SAAC into Chou&#x2019;s PseAAC to formulate RNA sequences</article-title>. <source>Journal of Theoretical Biology</source> <volume>455</volume>: <fpage>205</fpage>&#x2013;<lpage>211</lpage>. DOI <pub-id pub-id-type="doi">10.1016/j.jtbi.2018.07.018</pub-id>.</mixed-citation></ref>
<ref id="ref-19"><label>Snoek <italic>et al</italic>. (2012)</label><mixed-citation publication-type="book"><person-group person-group-type="author"><string-name><surname>Snoek</surname> <given-names>J</given-names></string-name>, <string-name><surname>Larochelle</surname> <given-names>H</given-names></string-name>, <string-name><surname>Adams</surname> <given-names>RP</given-names></string-name></person-group> <year>(2012)</year>. <chapter-title>Practical Bayesian optimization of machine learning algorithms</chapter-title>. In: <source>Advances in Neural Information Processing Systems</source><italic>,</italic> vol. 4.</mixed-citation></ref>
<ref id="ref-20"><label>Tahir <italic>et al</italic>. (2019)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Tahir</surname> <given-names>M</given-names></string-name>, <string-name><surname>Tayara</surname> <given-names>H</given-names></string-name>, <string-name><surname>Chong</surname> <given-names>KT</given-names></string-name></person-group> (<year>2019</year>). <article-title>iDNA6mA (5-step rule): Identification of DNA N6-methyladenine sites in the rice genome by intelligent computational model via chou&#x2019;s 5-step rule</article-title>. <source>Chemometrics &#x0026; Intelligent Laboratory Systems</source> <volume>189</volume>: <fpage>96</fpage>&#x2013;<lpage>101</lpage>. DOI <pub-id pub-id-type="doi">10.1016/j.chemolab.2019.04.007</pub-id>.</mixed-citation></ref>
<ref id="ref-21"><label>Wang <italic>et al</italic>. (2019)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Wang</surname> <given-names>D</given-names></string-name>, <string-name><surname>Liang</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Xu</surname> <given-names>D</given-names></string-name></person-group> (<year>2019</year>). <article-title>Capsule network for protein post-translational modification site prediction</article-title>. <source>Bioinformatics</source> <volume>35</volume>: <fpage>2386</fpage>&#x2013;<lpage>2394</lpage>. DOI <pub-id pub-id-type="doi">10.1093/bioinformatics/bty977</pub-id>.</mixed-citation></ref>
<ref id="ref-22"><label>Wei <italic>et al</italic>. (2018)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Wei</surname> <given-names>L</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>H</given-names></string-name>, <string-name><surname>Ran</surname> <given-names>S</given-names></string-name></person-group> (<year>2018</year>). <article-title>M6APred-EL: A sequence-based predictor for identifying N6-methyladenosine sites using ensemble learning</article-title>. <source>Molecular Therapy Nucleic Acids</source> <volume>12</volume>: <fpage>635</fpage>&#x2013;<lpage>644</lpage>. DOI <pub-id pub-id-type="doi">10.1016/j.omtn.2018.07.004</pub-id>.</mixed-citation></ref>
<ref id="ref-23"><label>Wei <italic>et al</italic>. (2016)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Wei</surname> <given-names>C</given-names></string-name>, <string-name><surname>Hua</surname> <given-names>T</given-names></string-name>, <string-name><surname>Hao</surname> <given-names>L</given-names></string-name></person-group> (<year>2016</year>). <article-title>MethyRNA: A web-server for identification of N(6)-methyladenosine sites</article-title>. <source>Journal of Biomolecular Structure &#x0026; Dynamics</source> <volume>35</volume>: <fpage>1</fpage>&#x2013;<lpage>11</lpage>. DOI <pub-id pub-id-type="doi">10.1080/07391102.2016.1157761</pub-id>.</mixed-citation></ref>
<ref id="ref-24"><label>Wu <italic>et al</italic>. (2020)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Wu</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Fang</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Shang</surname> <given-names>S</given-names></string-name>, <string-name><surname>Jin</surname> <given-names>J</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>H</given-names></string-name></person-group> (<year>2020</year>). <article-title>A novel framework for detecting social bots with deep neural networks and active learning</article-title>. <source>Knowledge-Based Systems</source> <volume>211</volume>: <fpage>106525</fpage>. DOI <pub-id pub-id-type="doi">10.1016/j.knosys.2020.106525</pub-id>.</mixed-citation></ref>
<ref id="ref-25"><label>Xing <italic>et al</italic>. (2017)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Xing</surname> <given-names>P</given-names></string-name>, <string-name><surname>Su</surname> <given-names>R</given-names></string-name>, <string-name><surname>Guo</surname> <given-names>F</given-names></string-name>, <string-name><surname>Wei</surname> <given-names>L</given-names></string-name></person-group> (<year>2017</year>). <article-title>Identifying N6-methyladenosine sites using multi-interval nucleotide pair position specificity and support vector machine</article-title>. <source>Scientific Reports</source> <volume>7</volume>: <fpage>46757</fpage>. DOI <pub-id pub-id-type="doi">10.1038/srep46757</pub-id>.</mixed-citation></ref>
<ref id="ref-26"><label>Yan <italic>et al</italic>. (2021)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Yan</surname> <given-names>AC</given-names></string-name>, <string-name><surname>Liang</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Quan</surname> <given-names>Z</given-names></string-name></person-group> (<year>2021</year>). <article-title>Prediction of bio-sequence modifications and the associations with diseases</article-title>. <source>Briefings in Functional Genomics</source> <volume>20</volume>: <fpage>1</fpage>&#x2013;<lpage>18</lpage>. <comment>DOI 10.1093/bfgp/elaa023</comment>.</mixed-citation></ref>
<ref id="ref-27"><label>Yang <italic>et al</italic>. (2020)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Yang</surname> <given-names>C</given-names></string-name>, <string-name><surname>Hu</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Zhou</surname> <given-names>B</given-names></string-name>, <string-name><surname>Bo</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Lu</surname> <given-names>BY</given-names></string-name>, <string-name><surname>Bin</surname> <given-names>LZ</given-names></string-name>, <string-name><surname>Li</surname> <given-names>GC</given-names></string-name>, <string-name><surname>Huan</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Min</surname> <given-names>WS</given-names></string-name>, <string-name><surname>Feng</surname> <given-names>XY</given-names></string-name></person-group> (<year>2020</year>). <article-title>The role of m6A modification in physiology and disease</article-title>. <source>Cell Death &#x0026; Disease</source> <volume>11</volume>: <fpage>960</fpage>. DOI <pub-id pub-id-type="doi">10.1038/s41419-020-03143-z</pub-id>.</mixed-citation></ref>
<ref id="ref-28"><label>Zhang and Liu (2019)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Zhang</surname> <given-names>J</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>B</given-names></string-name></person-group> (<year>2019</year>). <article-title>A review on the recent developments of sequence-based protein feature extraction methods</article-title>. <source>Current Bioinformatics</source> <volume>14</volume>: <fpage>190</fpage>&#x2013;<lpage>199</lpage>. DOI <pub-id pub-id-type="doi">10.2174/1574893614666181212102749</pub-id>.</mixed-citation></ref>
<ref id="ref-29"><label>Zhao and Yang (2020)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Zhao</surname> <given-names>G</given-names></string-name>, <string-name><surname>Yang</surname> <given-names>P</given-names></string-name></person-group> (<year>2020</year>). <article-title>Pretrained embeddings for stance detection with hierarchical capsule network on social media</article-title>. <source>ACM Transactions on Information Systems</source> <volume>39</volume>: <fpage>1</fpage>. DOI <pub-id pub-id-type="doi">10.1145/3412362</pub-id>.</mixed-citation></ref>
<ref id="ref-30"><label>Zhao <italic>et al</italic>. (2019)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Zhao</surname> <given-names>X</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Ning</surname> <given-names>Q</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>H</given-names></string-name>, <string-name><surname>Ji</surname> <given-names>J</given-names></string-name>, <string-name><surname>Yin</surname> <given-names>M</given-names></string-name></person-group> (<year>2019</year>). <article-title>Identifying N6-methyladenosine sites using extreme gradient boosting system optimized by particle swarm optimizer</article-title>. <source>Journal of Theoretical Biology</source> <volume>467</volume>: <fpage>39</fpage>&#x2013;<lpage>47</lpage>. DOI <pub-id pub-id-type="doi">10.1016/j.jtbi.2019.01.035</pub-id>.</mixed-citation></ref>
<ref id="ref-31"><label>Zhen <italic>et al</italic>. (2020)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Zhen</surname> <given-names>C</given-names></string-name>, <string-name><surname>Pei</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Yi</surname> <given-names>LF</given-names></string-name>, <string-name><surname>Tatiana</surname> <given-names>TM</given-names></string-name>, <string-name><surname>Andr&#x00E9;</surname> <given-names>L</given-names></string-name> <etal>et al.</etal></person-group> (<year>2020</year>). <article-title>iLearn: An integrated platform and meta-learner for feature engineering, machine-learning analysis and modeling of DNA, RNA and protein sequence data</article-title>. <source>Briefings in Bioinformatics</source> <volume>21</volume>: <fpage>1047</fpage>&#x2013;<lpage>1057</lpage>. DOI <pub-id pub-id-type="doi">10.1093/bib/bbz041</pub-id>.</mixed-citation></ref>
<ref id="ref-32"><label>Zou <italic>et al</italic>. (2019)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Zou</surname> <given-names>Q</given-names></string-name>, <string-name><surname>Xing</surname> <given-names>P</given-names></string-name>, <string-name><surname>Wei</surname> <given-names>L</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>B</given-names></string-name></person-group> (<year>2019</year>). <article-title>Gene2vec: Gene subsequence embedding for prediction of mammalian <italic>N6</italic>-methyladenosine sites from mRNA</article-title>. <source>RNA</source> <volume>25</volume>: <fpage>205</fpage>&#x2013;<lpage>218</lpage>. DOI <pub-id pub-id-type="doi">10.1261/rna.069112.118</pub-id>.</mixed-citation></ref>
</ref-list><app-group><app id="app-1">
<title></title>
<sec id="s6"><title/>
<p><bold>Supplementary Tables</bold></p>
<table-wrap id="table-4"><label>Table S1</label>
<caption>
<title>The detailed results on 5-fold and independent tests of different datasets</title></caption>
<table><colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Data</th>
<th>Time</th>
<th>Method</th>
<th>Sn (%)</th>
<th>Sp (%)</th>
<th>MCC</th>
<th>Acc (%)</th>
</tr>
</thead>
<tbody>
<tr>
<td rowspan="10">h_b</td>
<td rowspan="2">1</td>
<td>5-fold</td>
<td>80.37</td>
<td>67.73</td>
<td>49.08</td>
<td>74.00</td>
</tr>
<tr>
<td>independent</td>
<td>81.78</td>
<td>66.18</td>
<td>48.55</td>
<td>73.98</td>
</tr>
<tr>
<td rowspan="2">2</td>
<td>5-fold</td>
<td>82.03</td>
<td>65.77</td>
<td>48.48</td>
<td>73.85</td>
</tr>
<tr>
<td>independent</td>
<td>82.99</td>
<td>64.68</td>
<td>48.50</td>
<td>73.84</td>
</tr>
<tr>
<td rowspan="2">3</td>
<td>5-fold</td>
<td>80.00</td>
<td>68.48</td>
<td>48.94</td>
<td>74.29</td>
</tr>
<tr>
<td>independent</td>
<td>81.82</td>
<td>67.42</td>
<td>49.76</td>
<td>74.62</td>
</tr>
<tr>
<td rowspan="2">4</td>
<td>5-fold</td>
<td>83.61</td>
<td>64.19</td>
<td>48.79</td>
<td>73.93</td>
</tr>
<tr>
<td>independent</td>
<td>85.12</td>
<td>63.18</td>
<td>49.51</td>
<td>74.15</td>
</tr>
<tr>
<td rowspan="2">5</td>
<td>5-fold</td>
<td>79.08</td>
<td>68.73</td>
<td>48.08</td>
<td>73.91</td>
</tr>
<tr>
<td>independent</td>
<td>80.30</td>
<td>68.09</td>
<td>48.76</td>
<td>74.20</td>
</tr>
<tr>
<td rowspan="7">h_k</td>
<td rowspan="2">1</td>
<td>5-fold</td>
<td>82.69</td>
<td>78.79</td>
<td>61.56</td>
<td>80.73</td>
</tr>
<tr>
<td>independent</td>
<td>80.89</td>
<td>79.23</td>
<td>60.12</td>
<td>80.06</td>
</tr>
<tr>
<td rowspan="2">2</td>
<td>5-fold</td>
<td>82.05</td>
<td>79.00</td>
<td>61.18</td>
<td>80.54</td>
</tr>
<tr>
<td>independent</td>
<td>80.52</td>
<td>79.27</td>
<td>59.79</td>
<td>79.89</td>
</tr>
<tr>
<td rowspan="2">3</td>
<td>5-fold</td>
<td>83.59</td>
<td>77.90</td>
<td>61.62</td>
<td>80.75</td>
</tr>
<tr>
<td>independent</td>
<td>82.33</td>
<td>77.35</td>
<td>59.75</td>
<td>79.84</td>
</tr>
<tr>
<td>4</td>
<td>5-fold</td>
<td>84.39</td>
<td>77.07</td>
<td>61.67</td>
<td>80.71</td>
</tr>
<tr>
<td rowspan="3"></td>
<td></td>
<td>independent</td>
<td>83.21</td>
<td>76.80</td>
<td>60.13</td>
<td>80.00</td>
</tr>
<tr>
<td rowspan="2">5</td>
<td>5-fold</td>
<td>83.25</td>
<td>77.89</td>
<td>61.25</td>
<td>80.59</td>
</tr>
<tr>
<td>independent</td>
<td>82.05</td>
<td>77.78</td>
<td>59.88</td>
<td>79.91</td>
</tr>
<tr>
<td rowspan="10">h_l</td>
<td rowspan="2">1</td>
<td>5-fold</td>
<td>84.85</td>
<td>78.74</td>
<td>63.75</td>
<td>81.80</td>
</tr>
<tr>
<td>independent</td>
<td>84.13</td>
<td>77.79</td>
<td>62.05</td>
<td>80.96</td>
</tr>
<tr>
<td rowspan="2">2</td>
<td>5-fold</td>
<td>84.79</td>
<td>77.79</td>
<td>62.85</td>
<td>81.36</td>
</tr>
<tr>
<td>independent</td>
<td>84.28</td>
<td>78.06</td>
<td>62.46</td>
<td>81.17</td>
</tr>
<tr>
<td rowspan="2">3</td>
<td>5-fold</td>
<td>84.26</td>
<td>78.50</td>
<td>62.92</td>
<td>81.42</td>
</tr>
<tr>
<td>independent</td>
<td>83.79</td>
<td>78.82</td>
<td>62.68</td>
<td>81.30</td>
</tr>
<tr>
<td rowspan="2">4</td>
<td>5-fold</td>
<td>83.77</td>
<td>79.36</td>
<td>63.22</td>
<td>81.61</td>
</tr>
<tr>
<td>independent</td>
<td>83.14</td>
<td>79.23</td>
<td>62.42</td>
<td>81.19</td>
</tr>
<tr>
<td rowspan="2">5</td>
<td>5-fold</td>
<td>85.02</td>
<td>77.76</td>
<td>81.36</td>
<td>81.36</td>
</tr>
<tr>
<td>independent</td>
<td>84.05</td>
<td>77.83</td>
<td>62.00</td>
<td>80.94</td>
</tr>
<tr>
<td rowspan="10">m_b</td>
<td rowspan="2">1</td>
<td>5-fold</td>
<td>83.09</td>
<td>76.51</td>
<td>59.75</td>
<td>79.80</td>
</tr>
<tr>
<td>independent</td>
<td>83.48</td>
<td>75.90</td>
<td>59.55</td>
<td>79.69</td>
</tr>
<tr>
<td rowspan="2">2</td>
<td>5-fold</td>
<td>82.65</td>
<td>77.06</td>
<td>59.91</td>
<td>79.91</td>
</tr>
<tr>
<td>independent</td>
<td>82.92</td>
<td>75.85</td>
<td>58.91</td>
<td>79.38</td>
</tr>
<tr>
<td rowspan="2">3</td>
<td>5-fold</td>
<td>83.81</td>
<td>75.58</td>
<td>59.64</td>
<td>79.69</td>
</tr>
<tr>
<td>independent</td>
<td>84.27</td>
<td>74.64</td>
<td>59.19</td>
<td>79.46</td>
</tr>
<tr>
<td rowspan="2">4</td>
<td>5-fold</td>
<td>82.07</td>
<td>77.52</td>
<td>59.67</td>
<td>79.79</td>
</tr>
<tr>
<td>independent</td>
<td>82.44</td>
<td>76.02</td>
<td>58.59</td>
<td>79.23</td>
</tr>
<tr>
<td rowspan="2">5</td>
<td>5-fold</td>
<td>82.96</td>
<td>76.32</td>
<td>59.49</td>
<td>79.68</td>
</tr>
<tr>
<td>independent</td>
<td>83.48</td>
<td>75.90</td>
<td>59.55</td>
<td>79.69</td>
</tr>
<tr>
<td rowspan="10">m_h</td>
<td rowspan="2">1</td>
<td>5-fold</td>
<td>79.77</td>
<td>71.26</td>
<td>51.59</td>
<td>75.62</td>
</tr>
<tr>
<td>independent</td>
<td>80.27</td>
<td>69.73</td>
<td>50.28</td>
<td>75.00</td>
</tr>
<tr>
<td rowspan="2">2</td>
<td>5-fold</td>
<td>81.56</td>
<td>70.64</td>
<td>52.53</td>
<td>76.10</td>
</tr>
<tr>
<td>independent</td>
<td>80.36</td>
<td>70.32</td>
<td>50.94</td>
<td>75.34</td>
</tr>
<tr>
<td rowspan="2">3</td>
<td>5-fold</td>
<td>79.82</td>
<td>71.97</td>
<td>52.23</td>
<td>75.90</td>
</tr>
<tr>
<td>independent</td>
<td>79.27</td>
<td>71.36</td>
<td>50.80</td>
<td>75.32</td>
</tr>
<tr>
<td rowspan="2">4</td>
<td>5-fold</td>
<td>79.53</td>
<td>72.24</td>
<td>51.96</td>
<td>75.92</td>
</tr>
<tr>
<td>independent</td>
<td>78.27</td>
<td>72.64</td>
<td>50.99</td>
<td>75.45</td>
</tr>
<tr>
<td rowspan="2">5</td>
<td>5-fold</td>
<td>79.29</td>
<td>72.36</td>
<td>51.89</td>
<td>75.87</td>
</tr>
<tr>
<td>independent</td>
<td>78.09</td>
<td>70.82</td>
<td>49.04</td>
<td>74.45</td>
</tr>
<tr>
<td rowspan="10">m_k</td>
<td rowspan="2">1</td>
<td>5-fold</td>
<td>83.41</td>
<td>79.94</td>
<td>63.40</td>
<td>81.66</td>
</tr>
<tr>
<td>independent</td>
<td>83.05</td>
<td>79.83</td>
<td>62.91</td>
<td>81.44</td>
</tr>
<tr>
<td rowspan="2">2</td>
<td>5-fold</td>
<td>67.97</td>
<td>78.93</td>
<td>63.61</td>
<td>81.74</td>
</tr>
<tr>
<td>independent</td>
<td>84.19</td>
<td>78.62</td>
<td>62.90</td>
<td>81.40</td>
</tr>
<tr>
<td rowspan="2">3</td>
<td>5-fold</td>
<td>84.26</td>
<td>79.19</td>
<td>63.56</td>
<td>81.72</td>
</tr>
<tr>
<td>independent</td>
<td>83.63</td>
<td>78.64</td>
<td>62.35</td>
<td>81.14</td>
</tr>
<tr>
<td rowspan="2">4</td>
<td>5-fold</td>
<td>83.57</td>
<td>79.45</td>
<td>63.11</td>
<td>81.51</td>
</tr>
<tr>
<td>independent</td>
<td>83.91</td>
<td>78.34</td>
<td>62.34</td>
<td>81.12</td>
</tr>
<tr>
<td rowspan="2">5</td>
<td>5-fold</td>
<td>83.97</td>
<td>78.96</td>
<td>63.05</td>
<td>81.47</td>
</tr>
<tr>
<td>independent</td>
<td>83.76</td>
<td>78.52</td>
<td>62.36</td>
<td>81.14</td>
</tr>
<tr>
<td rowspan="5">m_l</td>
<td rowspan="2">1</td>
<td>5-fold</td>
<td>77.35</td>
<td>69.45</td>
<td>47.18</td>
<td>73.37</td>
</tr>
<tr>
<td>independent</td>
<td>76.07</td>
<td>69.61</td>
<td>45.78</td>
<td>72.84</td>
</tr>
<tr>
<td rowspan="2">2</td>
<td>5-fold</td>
<td>77.76</td>
<td>69.04</td>
<td>47.06</td>
<td>73.43</td>
</tr>
<tr>
<td>independent</td>
<td>77.16</td>
<td>68.84</td>
<td>46.16</td>
<td>73.00</td>
</tr>
<tr>
<td>3</td>
<td>5-fold</td>
<td>81.37</td>
<td>65.03</td>
<td>47.28</td>
<td>73.25</td>
</tr>
<tr>
<td rowspan="5"></td>
<td></td>
<td>independent</td>
<td>81.15</td>
<td>64.77</td>
<td>46.55</td>
<td>72.96</td>
</tr>
<tr>
<td rowspan="2">4</td>
<td>5-fold</td>
<td>76.95</td>
<td>70.87</td>
<td>47.98</td>
<td>73.92</td>
</tr>
<tr>
<td>independent</td>
<td>75.34</td>
<td>70.72</td>
<td>46.12</td>
<td>73.03</td>
</tr>
<tr>
<td rowspan="2">5</td>
<td>5-fold</td>
<td>79.39</td>
<td>67.63</td>
<td>47.36</td>
<td>73.48</td>
</tr>
<tr>
<td>independent</td>
<td>78.30</td>
<td>67.70</td>
<td>46.26</td>
<td>73.00</td>
</tr>
<tr>
<td rowspan="10">m_t</td>
<td rowspan="2">1</td>
<td>5-fold</td>
<td>83.08</td>
<td>70.78</td>
<td>54.30</td>
<td>76.93</td>
</tr>
<tr>
<td>independent</td>
<td>83.60</td>
<td>70.29</td>
<td>54.37</td>
<td>76.94</td>
</tr>
<tr>
<td rowspan="2">2</td>
<td>5-fold</td>
<td>79.98</td>
<td>73.97</td>
<td>54.15</td>
<td>77.01</td>
</tr>
<tr>
<td>independent</td>
<td>80.41</td>
<td>74.20</td>
<td>54.72</td>
<td>77.31</td>
</tr>
<tr>
<td rowspan="2">3</td>
<td>5-fold</td>
<td>82.28</td>
<td>72.01</td>
<td>54.74</td>
<td>77.20</td>
</tr>
<tr>
<td>independent</td>
<td>81.68</td>
<td>72.23</td>
<td>54.15</td>
<td>76.96</td>
</tr>
<tr>
<td rowspan="2">4</td>
<td>5-fold</td>
<td>84.01</td>
<td>69.92</td>
<td>54.63</td>
<td>77.02</td>
</tr>
<tr>
<td>independent</td>
<td>84.17</td>
<td>70.17</td>
<td>54.88</td>
<td>77.17</td>
</tr>
<tr>
<td rowspan="2">5</td>
<td>5-fold</td>
<td>83.67</td>
<td>70.32</td>
<td>54.55</td>
<td>77.00</td>
</tr>
<tr>
<td>independent</td>
<td>84.06</td>
<td>70.38</td>
<td>54.96</td>
<td>77.22</td>
</tr>
<tr>
<td rowspan="10">r_b</td>
<td rowspan="2">1</td>
<td>5-fold</td>
<td>81.25</td>
<td>75.22</td>
<td>56.71</td>
<td>78.27</td>
</tr>
<tr>
<td>independent</td>
<td>79.71</td>
<td>75.29</td>
<td>55.05</td>
<td>77.50</td>
</tr>
<tr>
<td rowspan="2">2</td>
<td>5-fold</td>
<td>81.31</td>
<td>74.47</td>
<td>55.94</td>
<td>77.87</td>
</tr>
<tr>
<td>independent</td>
<td>79.80</td>
<td>74.99</td>
<td>54.85</td>
<td>77.39</td>
</tr>
<tr>
<td rowspan="2">3</td>
<td>5-fold</td>
<td>82.43</td>
<td>73.29</td>
<td>56.02</td>
<td>77.83</td>
</tr>
<tr>
<td>independent</td>
<td>81.28</td>
<td>73.37</td>
<td>54.83</td>
<td>77.33</td>
</tr>
<tr>
<td rowspan="2">4</td>
<td>5-fold</td>
<td>82.67</td>
<td>73.59</td>
<td>56.55</td>
<td>78.10</td>
</tr>
<tr>
<td>independent</td>
<td>82.18</td>
<td>72.91</td>
<td>55.32</td>
<td>77.54</td>
</tr>
<tr>
<td rowspan="2">5</td>
<td>5-fold</td>
<td>80.36</td>
<td>75.99</td>
<td>56.42</td>
<td>78.17</td>
</tr>
<tr>
<td>independent</td>
<td>79.20</td>
<td>75.71</td>
<td>54.95</td>
<td>77.46</td>
</tr>
<tr>
<td rowspan="10">r_k</td>
<td rowspan="2">1</td>
<td>5-fold</td>
<td>86.19</td>
<td>79.59</td>
<td>65.96</td>
<td>82.89</td>
</tr>
<tr>
<td>independent</td>
<td>86.98</td>
<td>79.08</td>
<td>66.26</td>
<td>83.03</td>
</tr>
<tr>
<td rowspan="2">2</td>
<td>5-fold</td>
<td>84.91</td>
<td>80.87</td>
<td>65.83</td>
<td>82.89</td>
</tr>
<tr>
<td>independent</td>
<td>85.58</td>
<td>80.91</td>
<td>66.56</td>
<td>83.25</td>
</tr>
<tr>
<td rowspan="2">3</td>
<td>5-fold</td>
<td>85.13</td>
<td>80.47</td>
<td>65.78</td>
<td>82.84</td>
</tr>
<tr>
<td>independent</td>
<td>86.54</td>
<td>80.42</td>
<td>67.08</td>
<td>83.48</td>
</tr>
<tr>
<td rowspan="2">4</td>
<td>5-fold</td>
<td>84.78</td>
<td>80.82</td>
<td>65.84</td>
<td>82.84</td>
</tr>
<tr>
<td>independent</td>
<td>85.81</td>
<td>80.54</td>
<td>66.44</td>
<td>83.17</td>
</tr>
<tr>
<td rowspan="2">5</td>
<td>5-fold</td>
<td>83.53</td>
<td>82.47</td>
<td>66.02</td>
<td>83.00</td>
</tr>
<tr>
<td>independent</td>
<td>84.82</td>
<td>82.08</td>
<td>66.92</td>
<td>83.45</td>
</tr>
<tr>
<td rowspan="10">r_l</td>
<td rowspan="2">1</td>
<td>5-fold</td>
<td>83.57</td>
<td>80.74</td>
<td>64.32</td>
<td>82.12</td>
</tr>
<tr>
<td>independent</td>
<td>84.96</td>
<td>78.49</td>
<td>63.58</td>
<td>81.73</td>
</tr>
<tr>
<td rowspan="2">2</td>
<td>5-fold</td>
<td>83.69</td>
<td>80.93</td>
<td>64.72</td>
<td>82.26</td>
</tr>
<tr>
<td>independent</td>
<td>85.24</td>
<td>77.87</td>
<td>63.28</td>
<td>81.56</td>
</tr>
<tr>
<td rowspan="2">3</td>
<td>5-fold</td>
<td>83.43</td>
<td>81.64</td>
<td>65.08</td>
<td>82.46</td>
</tr>
<tr>
<td>independent</td>
<td>84.17</td>
<td>78.83</td>
<td>63.09</td>
<td>81.50</td>
</tr>
<tr>
<td rowspan="2">4</td>
<td>5-fold</td>
<td>83.58</td>
<td>81.02</td>
<td>64.69</td>
<td>82.29</td>
</tr>
<tr>
<td>independent</td>
<td>84.62</td>
<td>79.17</td>
<td>63.89</td>
<td>81.90</td>
</tr>
<tr>
<td rowspan="2">5</td>
<td>5-fold</td>
<td>83.66</td>
<td>80.83</td>
<td>64.61</td>
<td>82.29</td>
</tr>
<tr>
<td>independent</td>
<td>84.85</td>
<td>78.60</td>
<td>63.57</td>
<td>81.73</td>
</tr>
</tbody>
</table>
</table-wrap>
<table-wrap id="table-5"><label>Table S2</label>
<caption>
<title>Comparison of our model with im6A-TS-CNN and iRNA-m6A on 5-fold cross-validation test</title></caption>
<table><colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Species</th>
<th>Methods</th>
<th>Sn (%)</th>
<th>Sp (%)</th>
<th>Acc (%)</th>
<th>MCC</th>
</tr>
</thead>
<tbody>
<tr>
<td rowspan="3">h_b</td>
<td>iRNA-m6A</td>
<td>74.79</td>
<td>66.19</td>
<td>71.26</td>
<td>0.41</td>
</tr>
<tr>
<td>im6A-TS-CNN</td>
<td>75.35</td>
<td><bold>69.71</bold></td>
<td>72.53</td>
<td>0.4523</td>
</tr>
<tr>
<td>our model</td>
<td><bold>80.00</bold></td>
<td>68.48</td>
<td><bold>74.29</bold></td>
<td><bold>0.4894</bold></td>
</tr>
<tr>
<td rowspan="3">h_k</td>
<td>iRNA-m6A</td>
<td>80.85</td>
<td>76.34</td>
<td>78.99</td>
<td>0.57</td>
</tr>
<tr>
<td>im6A-TS-CNN</td>
<td>81.70</td>
<td><bold>78.25</bold></td>
<td>79.98</td>
<td>0.6006</td>
</tr>
<tr>
<td>our model</td>
<td><bold>83.59</bold></td>
<td>77.90</td>
<td><bold>80.75</bold></td>
<td><bold>0.6162</bold></td>
</tr>
<tr>
<td rowspan="3">h_l</td>
<td>iRNA-m6A</td>
<td>81.32</td>
<td>78.13</td>
<td>80.13</td>
<td>0.59</td>
</tr>
<tr>
<td>im6A-TS-CNN</td>
<td>80.18</td>
<td><bold>79.69</bold></td>
<td>79.94</td>
<td>0.5992</td>
</tr>
<tr>
<td>our model</td>
<td><bold>84.85</bold></td>
<td>78.74</td>
<td><bold>81.80</bold></td>
<td><bold>0.6375</bold></td>
</tr>
<tr>
<td rowspan="3">m_b</td>
<td>iRNA-m6A</td>
<td>79.32</td>
<td>76.90</td>
<td>78.75</td>
<td>0.58</td>
</tr>
<tr>
<td>im6A-TS-CNN</td>
<td>81.50</td>
<td>75.85</td>
<td>78.67</td>
<td>0.5749</td>
</tr>
<tr>
<td>our model</td>
<td><bold>82.65</bold></td>
<td><bold>77.06</bold></td>
<td><bold>79.91</bold></td>
<td><bold>0.5991</bold></td>
</tr>
<tr>
<td rowspan="3">m_h</td>
<td>iRNA-m6A</td>
<td>75.24</td>
<td>68.97</td>
<td>72.76</td>
<td>0.44</td>
</tr>
<tr>
<td>im6A-TS-CNN</td>
<td>78.37</td>
<td>67.60</td>
<td>72.99</td>
<td>0.4633</td>
</tr>
<tr>
<td>our model</td>
<td><bold>81.56</bold></td>
<td><bold>70.64</bold></td>
<td><bold>76.10</bold></td>
<td><bold>0.5253</bold></td>
</tr>
<tr>
<td rowspan="3">m_k</td>
<td>iRNA-m6A</td>
<td>82.60</td>
<td>77.31</td>
<td>79.98</td>
<td>0.60</td>
</tr>
<tr>
<td>im6A-TS-CNN</td>
<td>79.91</td>
<td><bold>81.00</bold></td>
<td>80.46</td>
<td>0.6094</td>
</tr>
<tr>
<td>our model</td>
<td><bold>84.10</bold></td>
<td>78.93</td>
<td><bold>81.74</bold></td>
<td><bold>0.6361</bold></td>
</tr>
<tr>
<td rowspan="3">m_l</td>
<td>iRNA-m6A</td>
<td>74.93</td>
<td>65.59</td>
<td>70.59</td>
<td>0.41</td>
</tr>
<tr>
<td>im6A-TS-CNN</td>
<td>72.39</td>
<td>70.24</td>
<td>71.32</td>
<td>0.4288</td>
</tr>
<tr>
<td>our model</td>
<td><bold>76.95</bold></td>
<td><bold>70.87</bold></td>
<td><bold>73.92</bold></td>
<td><bold>0.4798</bold></td>
</tr>
<tr>
<td rowspan="3">m_t</td>
<td>iRNA-m6A</td>
<td>78.14</td>
<td>70.02</td>
<td>74.40</td>
<td>0.48</td>
</tr>
<tr>
<td>im6A-TS-CNN</td>
<td>75.21</td>
<td><bold>75.61</bold></td>
<td>75.41</td>
<td>0.5090</td>
</tr>
<tr>
<td>our model</td>
<td><bold>82.28</bold></td>
<td>72.00</td>
<td><bold>77.20</bold></td>
<td><bold>0.5474</bold></td>
</tr>
<tr>
<td rowspan="3">r_b</td>
<td>iRNA-m6A</td>
<td>77.00</td>
<td>73.47</td>
<td>75.96</td>
<td>0.50</td>
</tr>
<tr>
<td>im6A-TS-CNN</td>
<td>79.04</td>
<td>74.23</td>
<td>76.64</td>
<td>0.5379</td>
</tr>
<tr>
<td>our model</td>
<td><bold>81.25</bold></td>
<td><bold>75.22</bold></td>
<td><bold>78.27</bold></td>
<td><bold>0.5671</bold></td>
</tr>
<tr>
<td rowspan="3">r_k</td>
<td>iRNA-m6A</td>
<td>82.46</td>
<td>80.05</td>
<td>81.78</td>
<td>0.63</td>
</tr>
<tr>
<td>im6A-TS-CNN</td>
<td><bold>84.15</bold></td>
<td>80.77</td>
<td>82.46</td>
<td>0.6500</td>
</tr>
<tr>
<td>our model</td>
<td>83.53</td>
<td><bold>82.47</bold></td>
<td><bold>83.00</bold></td>
<td><bold>0.6602</bold></td>
</tr>
<tr>
<td rowspan="3">r_l</td>
<td>iRNA-m6A</td>
<td>83.09</td>
<td>76.33</td>
<td>80.90</td>
<td>0.60</td>
</tr>
<tr>
<td>im6A-TS-CNN</td>
<td>81.56</td>
<td>79.63</td>
<td>80.59</td>
<td>0.6126</td>
</tr>
<tr>
<td>our model</td>
<td><bold>83.43</bold></td>
<td><bold>81.64</bold></td>
<td><bold>82.46</bold></td>
<td><bold>0.6508</bold></td>
</tr>
</tbody>
</table>
</table-wrap>
<table-wrap id="table-6"><label>Table S3</label>
<caption>
<title>Comparison of our model with im6A-TS-CNN and iRNA-m6A on independent test</title></caption>
<table><colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Species</th>
<th>Methods</th>
<th>Sn (%)</th>
<th>Sp (%)</th>
<th>Acc (%)</th>
<th>MCC</th>
</tr>
</thead>
<tbody>
<tr>
<td rowspan="3">h_b</td>
<td>iRNA-m6A</td>
<td>69.50</td>
<td><bold>72.98</bold></td>
<td>71.10</td>
<td>0.42</td>
</tr>
<tr>
<td>im6A-TS-CNN</td>
<td>75.17</td>
<td>70.20</td>
<td>72.69</td>
<td>0.4543</td>
</tr>
<tr>
<td>our model</td>
<td><bold>81.82</bold></td>
<td>67.42</td>
<td><bold>74.62</bold></td>
<td><bold>0.4976</bold></td>
</tr>
<tr>
<td rowspan="3">h_k</td>
<td>iRNA-m6A</td>
<td>77.13</td>
<td>78.42</td>
<td>77.76</td>
<td>0.56</td>
</tr>
<tr>
<td>im6A-TS-CNN</td>
<td>79.95</td>
<td><bold>78.53</bold></td>
<td>79.24</td>
<td>0.5848</td>
</tr>
<tr>
<td>our model</td>
<td><bold>82.33</bold></td>
<td>77.35</td>
<td><bold>79.84</bold></td>
<td><bold>0.5975</bold></td>
</tr>
<tr>
<td rowspan="3">h_l</td>
<td>iRNA-m6A</td>
<td>78.19</td>
<td>79.87</td>
<td>79.01</td>
<td>0.58</td>
</tr>
<tr>
<td>im6A-TS-CNN</td>
<td><bold>84.81</bold></td>
<td>75.02</td>
<td>79.92</td>
<td>0.6012</td>
</tr>
<tr>
<td>our model</td>
<td>84.13</td>
<td><bold>77.79</bold></td>
<td><bold>80.96</bold></td>
<td><bold>0.6205</bold></td>
</tr>
<tr>
<td rowspan="3">m_b</td>
<td>iRNA-m6A</td>
<td>77.20</td>
<td>79.41</td>
<td>78.26</td>
<td>0.57</td>
</tr>
<tr>
<td>im6A-TS-CNN</td>
<td><bold>86.22</bold></td>
<td>70.74</td>
<td>78.48</td>
<td>0.5765</td>
</tr>
<tr>
<td>our model</td>
<td>82.92</td>
<td><bold>75.85</bold></td>
<td><bold>79.38</bold></td>
<td><bold>0.5891</bold></td>
</tr>
<tr>
<td rowspan="3">m_h</td>
<td>iRNA-m6A</td>
<td>70.52</td>
<td><bold>72.13</bold></td>
<td>71.30</td>
<td>0.43</td>
</tr>
<tr>
<td>im6A-TS-CNN</td>
<td>75.82</td>
<td>71.36</td>
<td>73.59</td>
<td>0.4723</td>
</tr>
<tr>
<td>our model</td>
<td><bold>80.36</bold></td>
<td>70.32</td>
<td><bold>75.34</bold></td>
<td><bold>0.5094</bold></td>
</tr>
<tr>
<td rowspan="3">m_k</td>
<td>iRNA-m6A</td>
<td>78.37</td>
<td>80.32</td>
<td>79.31</td>
<td>0.59</td>
</tr>
<tr>
<td>im6A-TS-CNN</td>
<td>80.52</td>
<td><bold>81.00</bold></td>
<td>80.76</td>
<td>0.6151</td>
</tr>
<tr>
<td>our model</td>
<td><bold>84.19</bold></td>
<td>78.62</td>
<td><bold>81.40</bold></td>
<td><bold>0.6290</bold></td>
</tr>
<tr>
<td rowspan="3">m_l</td>
<td>iRNA-m6A</td>
<td>67.82</td>
<td>69.86</td>
<td>68.79</td>
<td>0.38</td>
</tr>
<tr>
<td>im6A-TS-CNN</td>
<td><bold>75.56</bold></td>
<td>67.58</td>
<td>71.57</td>
<td>0.4328</td>
</tr>
<tr>
<td>our model</td>
<td>75.34</td>
<td><bold>70.72</bold></td>
<td><bold>73.03</bold></td>
<td><bold>0.4611</bold></td>
</tr>
<tr>
<td rowspan="3">m_t</td>
<td>iRNA-m6A</td>
<td>72.19</td>
<td>75.08</td>
<td>73.54</td>
<td>0.47</td>
</tr>
<tr>
<td>im6A-TS-CNN</td>
<td><bold>83.45</bold></td>
<td>68.87</td>
<td>76.16</td>
<td>0.5288</td>
</tr>
<tr>
<td>our model</td>
<td>81.68</td>
<td><bold>72.23</bold></td>
<td><bold>76.96</bold></td>
<td><bold>0.5415</bold></td>
</tr>
<tr>
<td rowspan="3">r_b</td>
<td>iRNA-m6A</td>
<td>73.93</td>
<td><bold>76.48</bold></td>
<td>75.14</td>
<td>0.50</td>
</tr>
<tr>
<td>im6A-TS-CNN</td>
<td>78.05</td>
<td>75.84</td>
<td>76.95</td>
<td>0.5391</td>
</tr>
<tr>
<td>our model</td>
<td><bold>79.71</bold></td>
<td>75.29</td>
<td><bold>77.50</bold></td>
<td><bold>0.5505</bold></td>
</tr>
<tr>
<td rowspan="3">r_k</td>
<td>iRNA-m6A</td>
<td>80.18</td>
<td>82.77</td>
<td>81.42</td>
<td>0.63</td>
</tr>
<tr>
<td>im6A-TS-CNN</td>
<td><bold>84.85</bold></td>
<td>80.59</td>
<td>82.72</td>
<td>0.6550</td>
</tr>
<tr>
<td>our model</td>
<td>84.82</td>
<td><bold>82.08</bold></td>
<td><bold>83.45</bold></td>
<td><bold>0.6692</bold></td>
</tr>
<tr>
<td rowspan="3">r_l</td>
<td>iRNA-m6A</td>
<td>77.74</td>
<td>82.31</td>
<td>79.85</td>
<td>0.60</td>
</tr>
<tr>
<td>im6A-TS-CNN</td>
<td><bold>84.51</bold></td>
<td>75.94</td>
<td>80.22</td>
<td>0.6067</td>
</tr>
<tr>
<td>our model</td>
<td>84.17</td>
<td><bold>78.83</bold></td>
<td><bold>81.50</bold></td>
<td><bold>0.6309</bold></td>
</tr>
</tbody>
</table>
</table-wrap>
<table-wrap id="table-7"><label>Table S4</label>
<caption>
<title>The Baysian optimization of parameters of the models for <italic>Homo sapiens</italic> and their corresponding AUC values</title></caption>
<table><colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Site</th>
<th>Iter</th>
<th>AUC</th>
<th>BatchSize</th>
<th>Dropout</th>
<th>Filter1</th>
<th>Filter2</th>
<th>Pool_size</th>
</tr>
</thead>
<tbody>
<tr>
<td rowspan="6">h_b</td>
<td>1</td>
<td>0.8179</td>
<td>58.26</td>
<td>0.4733</td>
<td>36.93</td>
<td>18.32</td>
<td>4.421</td>
</tr>
<tr>
<td>2</td>
<td>0.8138</td>
<td>28.18</td>
<td>0.3217</td>
<td>49.46</td>
<td>59.49</td>
<td>1.396</td>
</tr>
<tr>
<td>3</td>
<td>0.8151</td>
<td>59.94</td>
<td>0.4248</td>
<td>53.48</td>
<td>52.05</td>
<td>1.087</td>
</tr>
<tr>
<td>4</td>
<td>0.8147</td>
<td>25.95</td>
<td>0.262</td>
<td>35.09</td>
<td>23.58</td>
<td>3.233</td>
</tr>
<tr>
<td>5</td>
<td>0.8083</td>
<td>37.67</td>
<td>0.7697</td>
<td>45.2</td>
<td>54.46</td>
<td>4.385</td>
</tr>
<tr>
<td>6</td>
<td>0.8148</td>
<td>59.45</td>
<td>0.404</td>
<td>40.12</td>
<td>16.06</td>
<td>1.833</td>
</tr>
<tr>
<td rowspan="24"></td>
<td><bold>7</bold></td>
<td><bold>0.8281</bold></td>
<td><bold>54.73</bold></td>
<td><bold>0.1725</bold></td>
<td><bold>35.69</bold></td>
<td><bold>20.81</bold></td>
<td><bold>5</bold></td>
</tr>
<tr>
<td>8</td>
<td>0.812</td>
<td>54.37</td>
<td>0.3625</td>
<td>35.45</td>
<td>23.15</td>
<td>4.707</td>
</tr>
<tr>
<td>9</td>
<td>0.7931</td>
<td>57.46</td>
<td>0.8763</td>
<td>37.05</td>
<td>18.55</td>
<td>3.137</td>
</tr>
<tr>
<td>10</td>
<td>0.8198</td>
<td>41.53</td>
<td>0.3397</td>
<td>18.51</td>
<td>57.52</td>
<td>3.572</td>
</tr>
<tr>
<td>11</td>
<td>0.8185</td>
<td>58.46</td>
<td>0.5256</td>
<td>36.27</td>
<td>18.39</td>
<td>4.917</td>
</tr>
<tr>
<td>12</td>
<td>0.8166</td>
<td>42.22</td>
<td>0.6375</td>
<td>19</td>
<td>55.76</td>
<td>3.65</td>
</tr>
<tr>
<td>13</td>
<td>0.8099</td>
<td>40.6</td>
<td>0.1195</td>
<td>19.13</td>
<td>57.7</td>
<td>4.218</td>
</tr>
<tr>
<td>14</td>
<td>0.7949</td>
<td>42.41</td>
<td>0.8663</td>
<td>18.45</td>
<td>58.83</td>
<td>3.549</td>
</tr>
<tr>
<td>15</td>
<td>0.8108</td>
<td>37.71</td>
<td>0.1253</td>
<td>60.34</td>
<td>20.41</td>
<td>1.787</td>
</tr>
<tr>
<td>16</td>
<td>0.8083</td>
<td>26.62</td>
<td>0.7123</td>
<td>35.1</td>
<td>23.44</td>
<td>2.947</td>
</tr>
<tr>
<td>17</td>
<td>0.7872</td>
<td>58.62</td>
<td>0.8791</td>
<td>36.55</td>
<td>18.45</td>
<td>4.474</td>
</tr>
<tr>
<td>18</td>
<td>0.8067</td>
<td>47.71</td>
<td>0.7671</td>
<td>46.4</td>
<td>62.58</td>
<td>3.441</td>
</tr>
<tr>
<td>19</td>
<td>0.8004</td>
<td>56.99</td>
<td>0.2105</td>
<td>32.53</td>
<td>34.94</td>
<td>1.373</td>
</tr>
<tr>
<td>20</td>
<td>0.815</td>
<td>31.8</td>
<td>0.5697</td>
<td>42.38</td>
<td>37.04</td>
<td>4.057</td>
</tr>
<tr>
<td>21</td>
<td>0.8191</td>
<td>33.78</td>
<td>0.5886</td>
<td>20.38</td>
<td>35.46</td>
<td>1.81</td>
</tr>
<tr>
<td>22</td>
<td>0.8116</td>
<td>29.97</td>
<td>0.7187</td>
<td>34.7</td>
<td>44.49</td>
<td>4.749</td>
</tr>
<tr>
<td>23</td>
<td>0.8158</td>
<td>61.54</td>
<td>0.6018</td>
<td>37.22</td>
<td>17.1</td>
<td>3.574</td>
</tr>
<tr>
<td>24</td>
<td>0.8104</td>
<td>39.34</td>
<td>0.1123</td>
<td>18.12</td>
<td>26.62</td>
<td>4.678</td>
</tr>
<tr>
<td>25</td>
<td>0.808</td>
<td>51.52</td>
<td>0.7274</td>
<td>21.13</td>
<td>35.18</td>
<td>1.646</td>
</tr>
<tr>
<td>26</td>
<td>0.8146</td>
<td>30.16</td>
<td>0.7064</td>
<td>28.67</td>
<td>28.26</td>
<td>3.524</td>
</tr>
<tr>
<td>27</td>
<td>0.807</td>
<td>20.28</td>
<td>0.6955</td>
<td>63.51</td>
<td>60.14</td>
<td>1.745</td>
</tr>
<tr>
<td>28</td>
<td>0.8017</td>
<td>54.06</td>
<td>0.1049</td>
<td>20.94</td>
<td>20.32</td>
<td>2.206</td>
</tr>
<tr>
<td>29</td>
<td>0.7925</td>
<td>47.68</td>
<td>0.8692</td>
<td>48.22</td>
<td>43.16</td>
<td>4.589</td>
</tr>
<tr>
<td>30</td>
<td>0.7836</td>
<td>52.37</td>
<td>0.8856</td>
<td>26.76</td>
<td>51.14</td>
<td>4.636</td>
</tr>
<tr>
<td rowspan="16">h_k</td>
<td>1</td>
<td>0.8865</td>
<td>27.62</td>
<td>0.1822</td>
<td>63.49</td>
<td>57.64</td>
<td>1.022</td>
</tr>
<tr>
<td>2</td>
<td>0.8862</td>
<td>17.25</td>
<td>0.8519</td>
<td>19.04</td>
<td>18.11</td>
<td>4.207</td>
</tr>
<tr>
<td>3</td>
<td>0.8852</td>
<td>30.91</td>
<td>0.8505</td>
<td>55.14</td>
<td>60.01</td>
<td>1.146</td>
</tr>
<tr>
<td>4</td>
<td>0.8957</td>
<td>24.91</td>
<td>0.4471</td>
<td>55.84</td>
<td>58.81</td>
<td>3.029</td>
</tr>
<tr>
<td>5</td>
<td>0.8892</td>
<td>62.59</td>
<td>0.6936</td>
<td>35.71</td>
<td>58.66</td>
<td>1.323</td>
</tr>
<tr>
<td>6</td>
<td>0.8892</td>
<td>36.86</td>
<td>0.8033</td>
<td>53.87</td>
<td>32.8</td>
<td>1.581</td>
</tr>
<tr>
<td>7</td>
<td>0.8959</td>
<td>29.23</td>
<td>0.4374</td>
<td>16.45</td>
<td>27.14</td>
<td>4.664</td>
</tr>
<tr>
<td>8</td>
<td>0.8877</td>
<td>25.2</td>
<td>0.8051</td>
<td>61.32</td>
<td>46.61</td>
<td>2.356</td>
</tr>
<tr>
<td>9</td>
<td>0.8898</td>
<td>21.21</td>
<td>0.1127</td>
<td>62.65</td>
<td>32.66</td>
<td>4.801</td>
</tr>
<tr>
<td>10</td>
<td>0.8932</td>
<td>26.14</td>
<td>0.2924</td>
<td>55.64</td>
<td>59.26</td>
<td>3.719</td>
</tr>
<tr>
<td>11</td>
<td>0.8881</td>
<td>28.12</td>
<td>0.7712</td>
<td>16.7</td>
<td>26.53</td>
<td>2.585</td>
</tr>
<tr>
<td>12</td>
<td>0.8935</td>
<td>29.44</td>
<td>0.6128</td>
<td>17.31</td>
<td>26.67</td>
<td>4.812</td>
</tr>
<tr>
<td>13</td>
<td>0.8901</td>
<td>24.43</td>
<td>0.2625</td>
<td>55.98</td>
<td>58.03</td>
<td>4.167</td>
</tr>
<tr>
<td>14</td>
<td>0.8927</td>
<td>23.66</td>
<td>0.6515</td>
<td>56.06</td>
<td>59.04</td>
<td>2.588</td>
</tr>
<tr>
<td>15</td>
<td>0.8923</td>
<td>31</td>
<td>0.7073</td>
<td>16.62</td>
<td>25.86</td>
<td>4.236</td>
</tr>
<tr>
<td>16</td>
<td>0.8917</td>
<td>25.56</td>
<td>0.1618</td>
<td>55.95</td>
<td>60.06</td>
<td>2.138</td>
</tr>
<tr>
<td rowspan="14"></td>
<td>17</td>
<td>0.8923</td>
<td>25.66</td>
<td>0.7022</td>
<td>55.78</td>
<td>57.91</td>
<td>2.251</td>
</tr>
<tr>
<td>18</td>
<td>0.8805</td>
<td>24.92</td>
<td>0.9</td>
<td>54.85</td>
<td>59.18</td>
<td>3.011</td>
</tr>
<tr>
<td>19</td>
<td>0.8958</td>
<td>59.21</td>
<td>0.5495</td>
<td>16.44</td>
<td>31.61</td>
<td>4.712</td>
</tr>
<tr>
<td>20</td>
<td>0.8887</td>
<td>53.69</td>
<td>0.2379</td>
<td>52.76</td>
<td>47.04</td>
<td>1.124</td>
</tr>
<tr>
<td>21</td>
<td>0.8937</td>
<td>44.33</td>
<td>0.3335</td>
<td>33.72</td>
<td>46.8</td>
<td>1.279</td>
</tr>
<tr>
<td>22</td>
<td>0.8934</td>
<td>51.61</td>
<td>0.5804</td>
<td>50.05</td>
<td>62.51</td>
<td>4.767</td>
</tr>
<tr>
<td>23</td>
<td>0.8877</td>
<td>19.55</td>
<td>0.8194</td>
<td>34.65</td>
<td>57.58</td>
<td>4.131</td>
</tr>
<tr>
<td>24</td>
<td>0.8924</td>
<td>18.18</td>
<td>0.2217</td>
<td>18.58</td>
<td>33.29</td>
<td>3.279</td>
</tr>
<tr>
<td>25</td>
<td>0.8866</td>
<td>26.36</td>
<td>0.8189</td>
<td>55.52</td>
<td>59.83</td>
<td>4.239</td>
</tr>
<tr>
<td>26</td>
<td>0.8922</td>
<td>53.93</td>
<td>0.6832</td>
<td>27.44</td>
<td>45.65</td>
<td>2.076</td>
</tr>
<tr>
<td><bold>27</bold></td>
<td><bold>0.8966</bold></td>
<td><bold>63.69</bold></td>
<td><bold>0.2681</bold></td>
<td><bold>32.11</bold></td>
<td><bold>57.99</bold></td>
<td><bold>3.167</bold></td>
</tr>
<tr>
<td>28</td>
<td>0.8817</td>
<td>53.69</td>
<td>0.8958</td>
<td>36.72</td>
<td>24.32</td>
<td>1.576</td>
</tr>
<tr>
<td>29</td>
<td>0.8883</td>
<td>58.26</td>
<td>0.7604</td>
<td>32.55</td>
<td>48.2</td>
<td>2.216</td>
</tr>
<tr>
<td>30</td>
<td>0.8958</td>
<td>38.95</td>
<td>0.3098</td>
<td>31.45</td>
<td>25.63</td>
<td>3.056</td>
</tr>
<tr>
<td rowspan="26">h_l</td>
<td>1</td>
<td>0.8916</td>
<td>53.6</td>
<td>0.311</td>
<td>31.46</td>
<td>22.42</td>
<td>2.128</td>
</tr>
<tr>
<td>2</td>
<td>0.8839</td>
<td>17.65</td>
<td>0.2256</td>
<td>17.22</td>
<td>56.2</td>
<td>2.226</td>
</tr>
<tr>
<td>3</td>
<td>0.8885</td>
<td>60.54</td>
<td>0.304</td>
<td>51.54</td>
<td>60.88</td>
<td>4.216</td>
</tr>
<tr>
<td>4</td>
<td>0.8955</td>
<td>52.81</td>
<td>0.4069</td>
<td>30</td>
<td>43.98</td>
<td>2.922</td>
</tr>
<tr>
<td>5</td>
<td>0.8864</td>
<td>34.16</td>
<td>0.2884</td>
<td>25.94</td>
<td>54.17</td>
<td>1.926</td>
</tr>
<tr>
<td>6</td>
<td>0.8938</td>
<td>64</td>
<td>0.5</td>
<td>24.79</td>
<td>40.7</td>
<td>5</td>
</tr>
<tr>
<td>7</td>
<td>0.8952</td>
<td>63.53</td>
<td>0.1774</td>
<td>23.88</td>
<td>39.39</td>
<td>4.683</td>
</tr>
<tr>
<td>8</td>
<td>0.8914</td>
<td>54.86</td>
<td>0.1</td>
<td>21.84</td>
<td>36.41</td>
<td>1.494</td>
</tr>
<tr>
<td>9</td>
<td>0.8889</td>
<td>57.41</td>
<td>0.1</td>
<td>31.49</td>
<td>37.73</td>
<td>4.523</td>
</tr>
<tr>
<td>10</td>
<td>0.8953</td>
<td>55.69</td>
<td>0.3392</td>
<td>26.53</td>
<td>46.24</td>
<td>2.447</td>
</tr>
<tr>
<td>11</td>
<td>0.8825</td>
<td>51.8</td>
<td>0.1</td>
<td>30.6</td>
<td>50.1</td>
<td>1</td>
</tr>
<tr>
<td>12</td>
<td>0.8939</td>
<td>54.38</td>
<td>0.5</td>
<td>26.92</td>
<td>42.96</td>
<td>3.386</td>
</tr>
<tr>
<td>13</td>
<td>0.8878</td>
<td>57.71</td>
<td>0.109</td>
<td>30.3</td>
<td>44.69</td>
<td>3.078</td>
</tr>
<tr>
<td>14</td>
<td>0.8951</td>
<td>51.12</td>
<td>0.2503</td>
<td>26.74</td>
<td>44.41</td>
<td>4.487</td>
</tr>
<tr>
<td>15</td>
<td>0.8948</td>
<td>50.52</td>
<td>0.5</td>
<td>28.24</td>
<td>42.28</td>
<td>1.554</td>
</tr>
<tr>
<td>16</td>
<td>0.8915</td>
<td>53.2</td>
<td>0.4991</td>
<td>22.75</td>
<td>45.03</td>
<td>1.885</td>
</tr>
<tr>
<td>17</td>
<td>0.8963</td>
<td>50.22</td>
<td>0.5</td>
<td>30.3</td>
<td>42.07</td>
<td>5</td>
</tr>
<tr>
<td>18</td>
<td>0.8926</td>
<td>63.98</td>
<td>0.4724</td>
<td>22.37</td>
<td>36.93</td>
<td>1.27</td>
</tr>
<tr>
<td>19</td>
<td>0.8932</td>
<td>48.51</td>
<td>0.3468</td>
<td>32.98</td>
<td>39.46</td>
<td>1.988</td>
</tr>
<tr>
<td>20</td>
<td>0.8958</td>
<td>61.73</td>
<td>0.3842</td>
<td>19.5</td>
<td>39.64</td>
<td>4.381</td>
</tr>
<tr>
<td>21</td>
<td>0.8881</td>
<td>60.1</td>
<td>0.1462</td>
<td>20.13</td>
<td>44.58</td>
<td>3.484</td>
</tr>
<tr>
<td>22</td>
<td>0.8929</td>
<td>64</td>
<td>0.1</td>
<td>18.74</td>
<td>36.75</td>
<td>5</td>
</tr>
<tr>
<td>23</td>
<td>0.8951</td>
<td>46.45</td>
<td>0.1</td>
<td>28.38</td>
<td>42.1</td>
<td>5</td>
</tr>
<tr>
<td>24</td>
<td>0.897</td>
<td>45.1</td>
<td>0.4244</td>
<td>33.07</td>
<td>43.02</td>
<td>4.536</td>
</tr>
<tr>
<td>25</td>
<td>0.8958</td>
<td>41.83</td>
<td>0.265</td>
<td>33.43</td>
<td>41.34</td>
<td>4.753</td>
</tr>
<tr>
<td>26</td>
<td>0.8931</td>
<td>43.31</td>
<td>0.1812</td>
<td>37.88</td>
<td>43.07</td>
<td>4.694</td>
</tr>
<tr>
<td rowspan="4"></td>
<td><bold>27</bold></td>
<td><bold>0.8972</bold></td>
<td><bold>42.26</bold></td>
<td><bold>0.5</bold></td>
<td><bold>30.96</bold></td>
<td><bold>45.29</bold></td>
<td><bold>5</bold></td>
</tr>
<tr>
<td>28</td>
<td>0.8935</td>
<td>43.09</td>
<td>0.3186</td>
<td>30.57</td>
<td>43.63</td>
<td>2.289</td>
</tr>
<tr>
<td>29</td>
<td>0.8903</td>
<td>42.32</td>
<td>0.2771</td>
<td>34.9</td>
<td>47.72</td>
<td>4.69</td>
</tr>
<tr>
<td>30</td>
<td>0.896</td>
<td>38.85</td>
<td>0.5</td>
<td>29.44</td>
<td>43.49</td>
<td>5</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec></app></app-group>
</back>
</article>