<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.1 20151215//EN" "http://jats.nlm.nih.gov/publishing/1.1/JATS-journalpublishing1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xml:lang="en" article-type="research-article" dtd-version="1.1">
<front>
<journal-meta>
<journal-id journal-id-type="pmc">JAI</journal-id>
<journal-id journal-id-type="nlm-ta">JAI</journal-id>
<journal-id journal-id-type="publisher-id">JAI</journal-id>
<journal-title-group>
<journal-title>Journal on Artificial Intelligence</journal-title>
</journal-title-group>
<issn pub-type="epub">2579-003X</issn>
<issn pub-type="ppub">2579-0021</issn>
<publisher>
<publisher-name>Tech Science Press</publisher-name>
<publisher-loc>USA</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">49083</article-id>
<article-id pub-id-type="doi">10.32604/jai.2024.049083</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Causality-Driven Common and Label-Specific Features Learning</article-title>
<alt-title alt-title-type="left-running-head">Causality-Driven Common and Label-Specific Features Learning</alt-title>
<alt-title alt-title-type="right-running-head">Causality-Driven Common and Label-Specific Features Learning</alt-title>
</title-group>
<contrib-group>
<contrib id="author-1" contrib-type="author" corresp="yes">
<name name-style="western"><surname>Xu</surname><given-names>Yuting</given-names></name><xref ref-type="aff" rid="aff-1">1</xref><email>xytingerny@163.com</email></contrib>
<contrib id="author-2" contrib-type="author">
<name name-style="western"><surname>Zhang</surname><given-names>Deqing</given-names></name><xref ref-type="aff" rid="aff-1">1</xref></contrib>
<contrib id="author-3" contrib-type="author">
<name name-style="western"><surname>Guo</surname><given-names>Huaibei</given-names></name><xref ref-type="aff" rid="aff-2">2</xref></contrib>
<contrib id="author-4" contrib-type="author">
<name name-style="western"><surname>Wang</surname><given-names>Mengyue</given-names></name><xref ref-type="aff" rid="aff-1">1</xref></contrib>
<aff id="aff-1"><label>1</label><institution>School of Intelligent Transportation Modern Industry, Anhui Sanlian University</institution>, <addr-line>Hefei, 230601</addr-line>, <country>China</country></aff>
<aff id="aff-2"><label>2</label><institution>Heyetang Middle School</institution>, <addr-line>Jinhua, 322010</addr-line>, <country>China</country></aff>
</contrib-group>
<author-notes>
<corresp id="cor1"><label>&#x002A;</label>Corresponding Author: Yuting Xu. Email: <email>xytingerny@163.com</email></corresp>
</author-notes>
<pub-date date-type="collection" publication-format="electronic"><year>2024</year></pub-date>
<pub-date date-type="pub" publication-format="electronic"><day>05</day><month>4</month><year>2024</year></pub-date>
<volume>6</volume>
<issue>0</issue>
<fpage>53</fpage>
<lpage>69</lpage>
<history>
<date date-type="received">
<day>27</day>
<month>12</month>
<year>2023</year>
</date>
<date date-type="accepted">
<day>04</day>
<month>3</month>
<year>2024</year>
</date>
</history>
<permissions>
<copyright-statement>&#x00A9; 2024 Xu et al.</copyright-statement>
<copyright-year>2024</copyright-year>
<copyright-holder>Xu et al.</copyright-holder>
<license xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>This work is licensed under a <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution 4.0 International License</ext-link>, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
</license>
</permissions>
<self-uri content-type="pdf" xlink:href="TSP_JAI_49083.pdf"></self-uri>
<abstract>
<p>In multi-label learning, the label-specific features learning framework can effectively solve the dimensional catastrophe problem brought by high-dimensional data. The classification performance and robustness of the model are effectively improved. Most existing label-specific features learning utilizes the cosine similarity method to measure label correlation. It is well known that the correlation between labels is asymmetric. However, existing label-specific features learning only considers the private features of labels in classification and does not take into account the common features of labels. Based on this, this paper proposes a Causality-driven Common and Label-specific Features Learning, named CCSF algorithm. Firstly, the causal learning algorithm GSBN is used to calculate the asymmetric correlation between labels. Then, in the optimization, both <inline-formula id="ieqn-1"><mml:math id="mml-ieqn-1"><mml:msub><mml:mi>l</mml:mi><mml:mrow><mml:mn>2</mml:mn><mml:mo>,</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>-norm and <inline-formula id="ieqn-2"><mml:math id="mml-ieqn-2"><mml:msub><mml:mi>l</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>-norm are used to select the corresponding features, respectively. Finally, it is compared with six state-of-the-art algorithms on nine datasets. The experimental results prove the effectiveness of the algorithm in this paper.</p>
</abstract>
<kwd-group kwd-group-type="author">
<kwd>Label-specific features learning</kwd>
<kwd>causal learning</kwd>
<kwd>asymmetric label correlation</kwd>
<kwd>common features</kwd>
</kwd-group>
<funding-group>
<award-group id="awg1">
<funding-source>2022 University Research Priorities</funding-source>
<award-id>2022AH051989</award-id>
</award-group>
</funding-group>
</article-meta>
</front>
<body>
<sec id="s1">
<label>1</label>
<title>Introduction</title>
<p>Multi-label learning [<xref ref-type="bibr" rid="ref-1">1</xref>] (MLL) is one of the hot research areas in machine learning, which alleviates the problem that instances covering multiple concepts or semantics in numerous real-world application scenarios cannot be accurately handled by traditional single-label algorithms. In real life, MLL has also long been applied in several domains, such as text classification [<xref ref-type="bibr" rid="ref-2">2</xref>], image annotation [<xref ref-type="bibr" rid="ref-3">3</xref>], protein function detection [<xref ref-type="bibr" rid="ref-4">4</xref>] and personalized recommendation [<xref ref-type="bibr" rid="ref-5">5</xref>], to name a few. With the rapid development of the Internet, data is gradually characterized by high dimensional distribution [<xref ref-type="bibr" rid="ref-6">6</xref>]. This can lead to the problem of dimensional catastrophe suffered by multi-label algorithms for data learning.</p>
<p>Label-specific feature (LSF) learning can effectively solve this problem, which is to establish the label-specific relation between labels and features by learning the connection between features and labels. The core idea is that each label should have a specific feature corresponding to it, i.e., the specific features of the label are learned. In multi-label learning, <inline-formula id="ieqn-3"><mml:math id="mml-ieqn-3"><mml:msub><mml:mi>l</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>-norm can attain feature sparsity and extract label-specific features, which we call private features of labels. The <inline-formula id="ieqn-4"><mml:math id="mml-ieqn-4"><mml:msub><mml:mi>l</mml:mi><mml:mrow><mml:mn>2</mml:mn><mml:mo>,</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>-norm can also achieve feature sparsity and extract more relevant features of the labels, which we call the common features of the labels.</p>
<p>Label correlation [<xref ref-type="bibr" rid="ref-7">7</xref>] (LC) has long been commonly used in LSF learning, which effectively improves the classification performance of LSF learning algorithms. However, the correlation calculated by cosine similarity is symmetric, and ignore the asymmetric correlation may introduce redundant information in the model. Cosine similarity is also highly susceptible to dimensional catastrophe. As the amount of data increases, the Euclidean distance metric deteriorates. In the process of calculation, the label relevance calculated by cosine similarity is highly susceptible to the a priori knowledge of the labels. Most of the labels in multi-label datasets rely on manual expert marking. With the increase of data volume and the influence of experts&#x2019; experience, it is inevitable that there will be omission and miss labeling in the process of marking. For such incomplete datasets, the LC computed by cosine similarity methods are inevitably mixed with many spurious correlations. Therefore, it is necessary to adopt the causal learning [<xref ref-type="bibr" rid="ref-8">8</xref>] algorithm to measure the asymmetric correlation between labels.</p>
<p>In LSF learning, most algorithms only consider the private features of labels and do not consider the common features of labels [<xref ref-type="bibr" rid="ref-9">9</xref>]. However, when we classify two similar labels, the LC of the similar labels are also similarly strongly correlated, but the computed weight matrices are not necessarily similar. As shown in <xref ref-type="fig" rid="fig-1">Fig. 1</xref>. The labels <inline-formula id="ieqn-5"><mml:math id="mml-ieqn-5"><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula id="ieqn-6"><mml:math id="mml-ieqn-6"><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> are strongly correlated labels, yet the learned weight coefficients are really different. This indicates that we should fully consider the common and private features of labels in the process of classification. Only in this way, the LSF learning can obtain more accurate classification performance.</p>
<fig id="fig-1">
<label>Figure 1</label>
<caption>
<title>The process of addressing the label-specific feature</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="JAI_49083-fig-1.tif"/>
</fig>
<p>Based on the above analysis, we propose a causality-driven common and LSF learning. The main contributions of this paper are as follows:</p>
<p>1) We propose a novel CCSF method, which use <inline-formula id="ieqn-7"><mml:math id="mml-ieqn-7"><mml:msub><mml:mi>l</mml:mi><mml:mrow><mml:mn>2</mml:mn><mml:mo>,</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>-norm and <inline-formula id="ieqn-8"><mml:math id="mml-ieqn-8"><mml:msub><mml:mi>l</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>-norm to learn the common and private features of labels, respectively. Thereby, more correlated features are extracted for classification.</p>
<p>2) We use a causal learning algorithm to compute asymmetric label correlations, discarding the traditional way of combining correlation matrix and neighbor matrix, which reduces the influence of original labels.</p>
<p>The remaining sections are organized as follows. <xref ref-type="sec" rid="s2">Section 2</xref> summarizes some state-of-the-art domestic and international research. The proposed framework and model optimization of CCSF are presented in <xref ref-type="sec" rid="s3">Section 3</xref>. <xref ref-type="sec" rid="s4">Section 4</xref> analyzes the experimental results and other related experiments. Finally, the conclusion is presented in <xref ref-type="sec" rid="s5">Section 5</xref>.</p>
</sec>
<sec id="s2">
<label>2</label>
<title>Related Work</title>
<p>Traditional MLL considers that all labels are distinguished based on the same features. However, this categorization is not reasonable and brings a lot of redundant information in the process of categorization, and the classification results are often sub-optimal. Zhang et al. proposed the LSF learning algorithm LIFT [<xref ref-type="bibr" rid="ref-10">10</xref>], which considers that each label is classified based on specific features. Compared with the traditional classification methods, it effectively improves the classification performance of MLL algorithm. But the algorithm does not take into account the correlation between labels. We consider that each label does not exist independently, but has a strong or weak correlation with other labels. The LLSF [<xref ref-type="bibr" rid="ref-11">11</xref>] algorithm proposed by Huang et al. uses the cosine similarity method to measure the correlation between labels. Two strongly correlated labels, whose LSF are also strongly correlated, which further improves the performance of the LSF learning algorithm. By different methods to measure the correlation between labels, Cheng et al. proposed the FF-MLLA [<xref ref-type="bibr" rid="ref-12">12</xref>] algorithm, which utilizes the Minkowski distance to measure the inter-sample similarity based on LC, and uses the singular value decomposition and the limit learning machine to classify multiple labels. The LF-LPLC [<xref ref-type="bibr" rid="ref-13">13</xref>] algorithm proposed by Weng et al. uses the nearest-neighbor technique to consider the local correlation of labels on the basis of the LSF learning algorithm. The algorithm not only enriches the semantic information of labels, but also solves the imbalance problem of labels. The MLFC [<xref ref-type="bibr" rid="ref-14">14</xref>] algorithm proposed by Zhang et al. further improves the performance of the LSF learning algorithm by uniting LSF learning and LC to obtain LSF for each label. For the missing label problem occurring in LSF learning algorithms, the LSML [<xref ref-type="bibr" rid="ref-15">15</xref>] algorithm proposed by Huang et al. utilizes the correlation between labels and has better experimental results not only on the complete dataset, but also on the missing label dataset. Zhao et al. proposed the LSGL [<xref ref-type="bibr" rid="ref-16">16</xref>] algorithm, which considers not only global but also local correlations between labels. LSGL algorithm, based on the assumption that both global and local correlations coexist, has more accurate classification performance than the LSF learning algorithm, which only considers local correlations.</p>
<p>However, most of the above algorithms use cosine similarity to measure out symmetric correlations in the learning of LSF. In fact, the correlation between labels is mostly asymmetric. As the data dimension increases, the Euclidean distance metric becomes less effective. ACML [<xref ref-type="bibr" rid="ref-17">17</xref>] algorithm proposed by Bao et al. and CCSRMC [<xref ref-type="bibr" rid="ref-18">18</xref>] algorithm proposed by Zhang et al. measure the asymmetric correlation between labels using the DC algorithm in causal learning, which are both effective in improving the classification performance of MLL. Luo et al. proposed the MLDL [<xref ref-type="bibr" rid="ref-19">19</xref>] algorithm to fully utilize the structural relationship between features and labels. Not only does it use bi-Laplace regularization to mine the local information of the labels, but it also employs a causal learning algorithm to explore the intrinsic causal relationships between the labels. The BDLS [<xref ref-type="bibr" rid="ref-20">20</xref>] algorithm proposed by Tan et al. introduces a bi-mapping learning framework in LSF learning and uses a causal learning algorithm to calculate the asymmetric correlation between labels, which also effectively improves the classification performance of the LSF learning algorithm. However, the above LSF learning only considers the private features of labels and not the common features of labels. CLML [<xref ref-type="bibr" rid="ref-9">9</xref>] algorithm proposed by Li et al. first uses a norm in the LSF framework to extract the common features of the labels. Subsequently, the GLFS [<xref ref-type="bibr" rid="ref-21">21</xref>] algorithm proposed by Zhang et al. builds a group-preserving optimization framework for feature selection by learning the common features of similar labels and the private features of each label using K-means clustering. Based on the above analysis, we adopt a causal learning algorithm to learn asymmetric LC among labels in LSF learning framework. The <inline-formula id="ieqn-9"><mml:math id="mml-ieqn-9"><mml:msub><mml:mi>l</mml:mi><mml:mrow><mml:mn>2</mml:mn><mml:mo>,</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>-norm and <inline-formula id="ieqn-10"><mml:math id="mml-ieqn-10"><mml:msub><mml:mi>l</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>-norm used to extract the common and private features of labels, respectively. The effectiveness of the algorithm in this paper is proved through a large number of experiments.</p>
</sec>
<sec id="s3">
<label>3</label>
<title>CCSF Model Construction and Optimization</title>
<sec id="s3_1">
<label>3.1</label>
<title>CCSF Model Construction</title>
<p>In MLL, <inline-formula id="ieqn-11"><mml:math id="mml-ieqn-11"><mml:mi mathvariant="bold-italic">X</mml:mi></mml:math></inline-formula> denotes the feature matrix, <inline-formula id="ieqn-12"><mml:math id="mml-ieqn-12"><mml:mi>Y</mml:mi></mml:math></inline-formula> represents the label matrix, and the dataset <inline-formula id="ieqn-13"><mml:math id="mml-ieqn-13"><mml:mi mathvariant="bold-italic">D</mml:mi><mml:mo>=</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mo>}</mml:mo></mml:mrow></mml:math></inline-formula>, where <inline-formula id="ieqn-14"><mml:math id="mml-ieqn-14"><mml:mi mathvariant="bold-italic">X</mml:mi><mml:mo>&#x2208;</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="double-struck">R</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mo>&#x00D7;</mml:mo><mml:mi>d</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula>, <inline-formula id="ieqn-15"><mml:math id="mml-ieqn-15"><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>&#x2208;</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="double-struck">R</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mo>&#x00D7;</mml:mo><mml:mrow><mml:mi mathvariant="fraktur">l</mml:mi></mml:mrow></mml:mrow></mml:msup></mml:math></inline-formula>, <inline-formula id="ieqn-16"><mml:math id="mml-ieqn-16"><mml:mrow><mml:mi mathvariant="fraktur">l</mml:mi></mml:mrow></mml:math></inline-formula> is the number of labels, <inline-formula id="ieqn-17"><mml:math id="mml-ieqn-17"><mml:mi>n</mml:mi></mml:math></inline-formula> is the number of samples, <inline-formula id="ieqn-18"><mml:math id="mml-ieqn-18"><mml:mi>d</mml:mi></mml:math></inline-formula> is the number of features. <inline-formula id="ieqn-19"><mml:math id="mml-ieqn-19"><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mi>d</mml:mi></mml:mrow></mml:msub><mml:mo>}</mml:mo></mml:mrow></mml:math></inline-formula> and <inline-formula id="ieqn-20"><mml:math id="mml-ieqn-20"><mml:msub><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mrow><mml:mi mathvariant="fraktur">l</mml:mi></mml:mrow></mml:mrow></mml:msub><mml:mo>}</mml:mo></mml:mrow></mml:math></inline-formula> denote the feature and label vectors. The basic model of CCSF in conjunction with the LLSF [<xref ref-type="bibr" rid="ref-10">10</xref>] algorithm proposed by Huang et al. can be written as:
<disp-formula id="eqn-1"><label>(1)</label><mml:math id="mml-eqn-1" display="block"><mml:munder><mml:mo movablelimits="true" form="prefix">min</mml:mo><mml:mrow><mml:mi mathvariant="bold-italic">W</mml:mi></mml:mrow></mml:munder><mml:mfrac><mml:mn>1</mml:mn><mml:mn>2</mml:mn></mml:mfrac><mml:msubsup><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:mrow><mml:mi mathvariant="bold-italic">X</mml:mi><mml:mi mathvariant="bold-italic">W</mml:mi></mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mi>F</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo>+</mml:mo><mml:mi>&#x03B1;</mml:mi><mml:msub><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></disp-formula>where <inline-formula id="ieqn-21"><mml:math id="mml-ieqn-21"><mml:mi>&#x03B1;</mml:mi></mml:math></inline-formula> is the feature sparse parameter, <inline-formula id="ieqn-22"><mml:math id="mml-ieqn-22"><mml:mi mathvariant="bold-italic">W</mml:mi></mml:math></inline-formula> is the weight coefficient and <inline-formula id="ieqn-23"><mml:math id="mml-ieqn-23"><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo>=</mml:mo><mml:mrow><mml:mo>[</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">w</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mtext mathvariant="bold">w</mml:mtext></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mtext mathvariant="bold">w</mml:mtext></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mtext mathvariant="bold">w</mml:mtext></mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="fraktur">l</mml:mi></mml:mrow></mml:mrow></mml:msub><mml:mo>]</mml:mo></mml:mrow><mml:mo>&#x2208;</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="double-struck">R</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi><mml:mo>&#x00D7;</mml:mo><mml:mrow><mml:mi mathvariant="fraktur">l</mml:mi></mml:mrow></mml:mrow></mml:msup></mml:math></inline-formula>, and <inline-formula id="ieqn-24"><mml:math id="mml-ieqn-24"><mml:msub><mml:mrow><mml:mtext mathvariant="bold">w</mml:mtext></mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="fraktur">l</mml:mi></mml:mrow></mml:mrow></mml:msub><mml:mo>&#x2208;</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="double-struck">R</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula> denotes the LSF of each label. However, <xref ref-type="disp-formula" rid="eqn-1">Eq. (1)</xref> only adopts the <inline-formula id="ieqn-25"><mml:math id="mml-ieqn-25"><mml:msub><mml:mi>l</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>-norm, which can only extract the private features of the label, but not the shared features of the label. So, we put <inline-formula id="ieqn-26"><mml:math id="mml-ieqn-26"><mml:msub><mml:mi>l</mml:mi><mml:mrow><mml:mn>2</mml:mn><mml:mo>,</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>-norm in <xref ref-type="disp-formula" rid="eqn-1">Eq. (1)</xref> to extract the common features of labels, and <xref ref-type="disp-formula" rid="eqn-2">Eq. (2)</xref> can be written as:
<disp-formula id="eqn-2"><label>(2)</label><mml:math id="mml-eqn-2" display="block"><mml:munder><mml:mo movablelimits="true" form="prefix">min</mml:mo><mml:mrow><mml:mi mathvariant="bold-italic">W</mml:mi></mml:mrow></mml:munder><mml:mfrac><mml:mn>1</mml:mn><mml:mn>2</mml:mn></mml:mfrac><mml:msubsup><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:mrow><mml:mi mathvariant="bold-italic">X</mml:mi><mml:mi mathvariant="bold-italic">W</mml:mi></mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mi>F</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo>+</mml:mo><mml:mi>&#x03B1;</mml:mi><mml:msub><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:mi>&#x03B2;</mml:mi><mml:msub><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn><mml:mo>,</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></disp-formula>where <inline-formula id="ieqn-27"><mml:math id="mml-ieqn-27"><mml:mi>&#x03B2;</mml:mi></mml:math></inline-formula> is the feature sparse parameter.</p>
<p>LC has been widely used in LSF learning algorithms, which can effectively improve the classification performance of MLL algorithms. But cosine similarity [<xref ref-type="bibr" rid="ref-22">22</xref>] all calculates symmetric correlations. Indeed, correlations between labels are asymmetric [<xref ref-type="bibr" rid="ref-23">23</xref>]. In this paper, we use a globally structured causal learning algorithm GSBN [<xref ref-type="bibr" rid="ref-24">24</xref>]. First, Markov Blanket (MB) or Parent and Child (PC) part-to-whole structure learning for each label is obtained. Then a directed acyclic graph (DAG) framework is constructed using MB or PC learning.</p>
<p>With the constraint of causal LC, assuming that <inline-formula id="ieqn-28"><mml:math id="mml-ieqn-28"><mml:mi mathvariant="bold-italic">C</mml:mi></mml:math></inline-formula> is the causal LC matrix and <inline-formula id="ieqn-29"><mml:math id="mml-ieqn-29"><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> denotes the causal relationship between labels <inline-formula id="ieqn-30"><mml:math id="mml-ieqn-30"><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula id="ieqn-31"><mml:math id="mml-ieqn-31"><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>. We improve the learning efficiency of LSF by calculating the Euclidean distance between <inline-formula id="ieqn-32"><mml:math id="mml-ieqn-32"><mml:msub><mml:mi mathvariant="bold-italic">w</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula id="ieqn-33"><mml:math id="mml-ieqn-33"><mml:msub><mml:mi mathvariant="bold-italic">w</mml:mi><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>, <inline-formula id="ieqn-34"><mml:math id="mml-ieqn-34"><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:msubsup><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">w</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x2212;</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">w</mml:mi><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:math></inline-formula>. When the labels are causally related, the features are similar. Accordingly, <inline-formula id="ieqn-35"><mml:math id="mml-ieqn-35"><mml:msub><mml:mi mathvariant="bold-italic">w</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> will be closer to <inline-formula id="ieqn-36"><mml:math id="mml-ieqn-36"><mml:msub><mml:mi mathvariant="bold-italic">w</mml:mi><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>. The causal correlation matrix <inline-formula id="ieqn-37"><mml:math id="mml-ieqn-37"><mml:mi mathvariant="bold-italic">C</mml:mi></mml:math></inline-formula> is defined as follows:
<disp-formula id="eqn-3"><label>(3)</label><mml:math id="mml-eqn-3" display="block"><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:mtable columnalign="left left" rowspacing=".2em" columnspacing="1em" displaystyle="false"><mml:mtr><mml:mtd><mml:mn>1</mml:mn></mml:mtd><mml:mtd><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">&#x2192;</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x219B;</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mtd></mml:mtr></mml:mtable><mml:mo fence="true" stretchy="true" symmetric="true"></mml:mo></mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>&#x2208;</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:mrow><mml:mi mathvariant="fraktur">l</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:math></disp-formula>where <inline-formula id="ieqn-38"><mml:math id="mml-ieqn-38"><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">&#x2192;</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> indicates that the label <inline-formula id="ieqn-39"><mml:math id="mml-ieqn-39"><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is causally related to <inline-formula id="ieqn-40"><mml:math id="mml-ieqn-40"><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula id="ieqn-41"><mml:math id="mml-ieqn-41"><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:math></inline-formula>. Conversely <inline-formula id="ieqn-42"><mml:math id="mml-ieqn-42"><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x219B;</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> indicates that the label <inline-formula id="ieqn-43"><mml:math id="mml-ieqn-43"><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is not causally related to <inline-formula id="ieqn-44"><mml:math id="mml-ieqn-44"><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula id="ieqn-45"><mml:math id="mml-ieqn-45"><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>0</mml:mn></mml:math></inline-formula>.</p>
<p>Therefore, we add causal constraints based on <xref ref-type="disp-formula" rid="eqn-2">Eq. (2)</xref>. The core formula of the CCSF algorithm can be written as:</p>
<disp-formula id="eqn-4"><label>(4)</label><mml:math id="mml-eqn-4" display="block"><mml:munder><mml:mo movablelimits="true" form="prefix">min</mml:mo><mml:mrow><mml:mi mathvariant="bold-italic">W</mml:mi></mml:mrow></mml:munder><mml:mfrac><mml:mn>1</mml:mn><mml:mn>2</mml:mn></mml:mfrac><mml:msubsup><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:mrow><mml:mi mathvariant="bold-italic">X</mml:mi><mml:mi mathvariant="bold-italic">W</mml:mi></mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mi>F</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo>+</mml:mo><mml:mi>&#x03B1;</mml:mi><mml:msub><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:mi>&#x03B2;</mml:mi><mml:msub><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn><mml:mo>,</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:mi>&#x03B3;</mml:mi><mml:mrow><mml:mtext>tr</mml:mtext></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mi mathvariant="bold-italic">C</mml:mi></mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mrow><mml:mrow><mml:mtext>T</mml:mtext></mml:mrow></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:math></disp-formula>
<p>where <inline-formula id="ieqn-46"><mml:math id="mml-ieqn-46"><mml:mi>&#x03B3;</mml:mi></mml:math></inline-formula> is the hyperparameter.</p>
</sec>
<sec id="s3_2">
<label>3.2</label>
<title>CCSF Model Optimization</title>
<p>Considering the non-smoothness of the <inline-formula id="ieqn-47"><mml:math id="mml-ieqn-47"><mml:msub><mml:mi>l</mml:mi><mml:mrow><mml:mn>2</mml:mn><mml:mo>,</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>-norm, we use the technique in the literature [<xref ref-type="bibr" rid="ref-25">25</xref>] to deal with the non-smoothness.
<disp-formula id="eqn-5"><label>(5)</label><mml:math id="mml-eqn-5" display="block"><mml:mfrac><mml:mrow><mml:mi mathvariant="normal">&#x2202;</mml:mi><mml:msub><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn><mml:mo>,</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mi mathvariant="normal">&#x2202;</mml:mi><mml:msub><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfrac><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mi mathvariant="normal">&#x2202;</mml:mi><mml:mi>T</mml:mi><mml:mi>r</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msubsup><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msubsup><mml:msub><mml:mi mathvariant="bold-italic">A</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi mathvariant="normal">&#x2202;</mml:mi><mml:msub><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfrac><mml:mo>=</mml:mo><mml:mn>2</mml:mn><mml:msub><mml:mi mathvariant="bold-italic">A</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></disp-formula>where <inline-formula id="ieqn-48"><mml:math id="mml-ieqn-48"><mml:msub><mml:mi mathvariant="bold-italic">A</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x2208;</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="double-struck">R</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="fraktur">l</mml:mi><mml:mi mathvariant="fraktur">l</mml:mi></mml:mrow></mml:mrow></mml:msup></mml:math></inline-formula> is a diagonal matrix with the jth diagonal element <inline-formula id="ieqn-49"><mml:math id="mml-ieqn-49"><mml:msubsup><mml:mi>A</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:mstyle displaystyle="true" scriptlevel="0"><mml:mfrac><mml:mrow><mml:mtext mathvariant="bold">1</mml:mtext></mml:mrow><mml:mrow><mml:mrow><mml:mtext mathvariant="bold">2</mml:mtext></mml:mrow><mml:msub><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:msubsup><mml:mi>w</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msubsup><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mrow><mml:mtext mathvariant="bold">2</mml:mtext></mml:mrow></mml:mrow></mml:msub></mml:mrow></mml:mfrac></mml:mstyle></mml:math></inline-formula>. If <inline-formula id="ieqn-50"><mml:math id="mml-ieqn-50"><mml:msubsup><mml:mi>w</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:mn>0</mml:mn></mml:math></inline-formula>, then <inline-formula id="ieqn-51"><mml:math id="mml-ieqn-51"><mml:msubsup><mml:mi>A</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msubsup><mml:mo>&#x2208;</mml:mo><mml:mi mathvariant="normal">&#x2202;</mml:mi></mml:math></inline-formula>.</p>
<p>The CCSF model is a convex optimization problem. Due to the non-smoothness of the <inline-formula id="ieqn-52"><mml:math id="mml-ieqn-52"><mml:msub><mml:mi>l</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>-norm, this paper adopts the accelerated proximal gradient descent method [<xref ref-type="bibr" rid="ref-26">26</xref>] to solve the non-smoothness of the weight matrix W by alternating iterations. The objective function is:
<disp-formula id="eqn-6"><label>(6)</label><mml:math id="mml-eqn-6" display="block"><mml:munder><mml:mo movablelimits="true" form="prefix">min</mml:mo><mml:mrow><mml:mi>W</mml:mi><mml:mo>&#x2208;</mml:mo><mml:mrow><mml:mi mathvariant="fraktur">H</mml:mi></mml:mrow></mml:mrow></mml:munder><mml:mi>F</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mi>f</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>+</mml:mo><mml:mi>g</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></disp-formula>where <inline-formula id="ieqn-53"><mml:math id="mml-ieqn-53"><mml:mrow><mml:mi mathvariant="fraktur">H</mml:mi></mml:mrow></mml:math></inline-formula> is the Hilbert space. The expressions for <inline-formula id="ieqn-54"><mml:math id="mml-ieqn-54"><mml:mi>f</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> and <inline-formula id="ieqn-55"><mml:math id="mml-ieqn-55"><mml:mi>g</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> are shown in <xref ref-type="disp-formula" rid="eqn-7">Eqs. (7)</xref> and <xref ref-type="disp-formula" rid="eqn-8">(8)</xref>, which are both convex functions and satisfy the Lipschitz condition.
<disp-formula id="eqn-7"><label>(7)</label><mml:math id="mml-eqn-7" display="block"><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:munder><mml:mo movablelimits="true" form="prefix">min</mml:mo><mml:mrow><mml:mi mathvariant="bold-italic">W</mml:mi></mml:mrow></mml:munder><mml:mfrac><mml:mn>1</mml:mn><mml:mn>2</mml:mn></mml:mfrac><mml:msubsup><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:mrow><mml:mi mathvariant="bold-italic">X</mml:mi><mml:mi mathvariant="bold-italic">W</mml:mi></mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mi>F</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo>+</mml:mo><mml:mi>&#x03B3;</mml:mi><mml:mrow><mml:mtext>tr</mml:mtext></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mi mathvariant="bold-italic">C</mml:mi></mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mrow><mml:mrow><mml:mtext>T</mml:mtext></mml:mrow></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow><mml:mo>+</mml:mo><mml:mi>&#x03B2;</mml:mi><mml:msub><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn><mml:mo>,</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></disp-formula>
<disp-formula id="eqn-8"><label>(8)</label><mml:math id="mml-eqn-8" display="block"><mml:mi>g</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mi>&#x03B1;</mml:mi><mml:msub><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></disp-formula>
<disp-formula id="eqn-9"><label>(9)</label><mml:math id="mml-eqn-9" display="block"><mml:mi mathvariant="normal">&#x2207;</mml:mi><mml:mi>f</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">X</mml:mi><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:mrow><mml:mi mathvariant="bold-italic">X</mml:mi><mml:mi mathvariant="bold-italic">W</mml:mi></mml:mrow><mml:mo>&#x2212;</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">X</mml:mi><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>+</mml:mo><mml:mn>2</mml:mn><mml:mi>&#x03B3;</mml:mi><mml:mrow><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mi mathvariant="bold-italic">C</mml:mi></mml:mrow><mml:mo>+</mml:mo><mml:mrow><mml:mtext mathvariant="bold">2</mml:mtext></mml:mrow><mml:mrow><mml:mi mathvariant="bold-italic">A</mml:mi><mml:mi mathvariant="bold-italic">W</mml:mi></mml:mrow></mml:math></disp-formula></p>
<p>For any matrices <inline-formula id="ieqn-56"><mml:math id="mml-ieqn-56"><mml:msub><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>, <inline-formula id="ieqn-57"><mml:math id="mml-ieqn-57"><mml:msub><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>, there is:
<disp-formula id="eqn-10"><label>(10)</label><mml:math id="mml-eqn-10" display="block"><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:mi mathvariant="normal">&#x2207;</mml:mi><mml:mi>f</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mi mathvariant="normal">&#x2207;</mml:mi><mml:mi>f</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mo>&#x2264;</mml:mo><mml:msub><mml:mi>L</mml:mi><mml:mrow><mml:mi>g</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:mrow><mml:mi mathvariant="normal">&#x0394;</mml:mi></mml:mrow><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow></mml:math></disp-formula>where <inline-formula id="ieqn-58"><mml:math id="mml-ieqn-58"><mml:msub><mml:mi>L</mml:mi><mml:mrow><mml:mi>g</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is the Lipschitz constant and <inline-formula id="ieqn-59"><mml:math id="mml-ieqn-59"><mml:mrow><mml:mi mathvariant="normal">&#x0394;</mml:mi></mml:mrow><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>&#x2212;</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>. Introducing the quadratic approximation <inline-formula id="ieqn-60"><mml:math id="mml-ieqn-60"><mml:mi>F</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> for <inline-formula id="ieqn-61"><mml:math id="mml-ieqn-61"><mml:mi mathvariant="bold-italic">Q</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo>,</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, then
<disp-formula id="eqn-11"><label>(11)</label><mml:math id="mml-eqn-11" display="block"><mml:mi>Q</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo>,</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mi>f</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow><mml:mo>+</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="normal">&#x2207;</mml:mi><mml:mi>f</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo>&#x2212;</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow><mml:mo>+</mml:mo><mml:mfrac><mml:msub><mml:mi>L</mml:mi><mml:mrow><mml:mi>g</mml:mi></mml:mrow></mml:msub><mml:mn>2</mml:mn></mml:mfrac><mml:msubsup><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo>&#x2212;</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mi>F</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo>+</mml:mo><mml:mi>g</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></disp-formula></p>
<p>Let <inline-formula id="ieqn-62"><mml:math id="mml-ieqn-62"><mml:msub><mml:mi mathvariant="bold-italic">q</mml:mi><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>&#x2212;</mml:mo><mml:mstyle displaystyle="true" scriptlevel="0"><mml:mfrac><mml:mn>1</mml:mn><mml:msub><mml:mi>L</mml:mi><mml:mrow><mml:mi>g</mml:mi></mml:mrow></mml:msub></mml:mfrac></mml:mstyle><mml:mi mathvariant="normal">&#x2207;</mml:mi><mml:mi>f</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, then
<disp-formula id="eqn-12"><label>(12)</label><mml:math id="mml-eqn-12" display="block"><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo>=</mml:mo><mml:mi>arg</mml:mi><mml:mo>&#x2061;</mml:mo><mml:munder><mml:mo movablelimits="true" form="prefix">min</mml:mo><mml:mrow><mml:mi>W</mml:mi></mml:mrow></mml:munder><mml:mi mathvariant="bold-italic">Q</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo>,</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mi>arg</mml:mi><mml:mo>&#x2061;</mml:mo><mml:munder><mml:mo movablelimits="true" form="prefix">min</mml:mo><mml:mrow><mml:mi>W</mml:mi></mml:mrow></mml:munder><mml:mfrac><mml:mn>1</mml:mn><mml:mn>2</mml:mn></mml:mfrac><mml:msubsup><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo>&#x2212;</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">q</mml:mi><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mi>F</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo>+</mml:mo><mml:mfrac><mml:mi>&#x03B1;</mml:mi><mml:msub><mml:mi>L</mml:mi><mml:mrow><mml:mi>g</mml:mi></mml:mrow></mml:msub></mml:mfrac><mml:msub><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></disp-formula></p>
<p>The optimization algorithm proposed by Lin et al. [<xref ref-type="bibr" rid="ref-27">27</xref>] points out that
<disp-formula id="eqn-13"><label>(13)</label><mml:math id="mml-eqn-13" display="block"><mml:msup><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mrow><mml:mi mathvariant="bold-italic">t</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:mfrac><mml:mrow><mml:msub><mml:mi>&#x03B8;</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:msub><mml:mi>&#x03B8;</mml:mi><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:mfrac><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>&#x2212;</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></disp-formula></p>
<p>In <xref ref-type="disp-formula" rid="eqn-13">Eq. (13)</xref>, <inline-formula id="ieqn-63"><mml:math id="mml-ieqn-63"><mml:msub><mml:mi>b</mml:mi><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> satisfies <inline-formula id="ieqn-64"><mml:math id="mml-ieqn-64"><mml:msubsup><mml:mi>b</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo>&#x2212;</mml:mo><mml:msub><mml:mi>b</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>&#x2264;</mml:mo><mml:msubsup><mml:mi>b</mml:mi><mml:mrow><mml:mi>t</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:math></inline-formula>. Meanwhile, the convergence rate of <inline-formula id="ieqn-65"><mml:math id="mml-ieqn-65"><mml:mi mathvariant="bold-italic">O</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msup><mml:mi>t</mml:mi><mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is improved, and <inline-formula id="ieqn-66"><mml:math id="mml-ieqn-66"><mml:msub><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is the result of the <inline-formula id="ieqn-67"><mml:math id="mml-ieqn-67"><mml:mi>t</mml:mi></mml:math></inline-formula>th iteration. The soft threshold function for performing the iterative operation is shown in <xref ref-type="disp-formula" rid="eqn-14">Eq. (14)</xref>.
<disp-formula id="eqn-14"><label>(14)</label><mml:math id="mml-eqn-14" display="block"><mml:msub><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">S</mml:mi><mml:mrow><mml:mi>&#x03B5;</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>[</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">q</mml:mi><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:mo>]</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mi>arg</mml:mi><mml:mo>&#x2061;</mml:mo><mml:munder><mml:mo movablelimits="true" form="prefix">min</mml:mo><mml:mrow><mml:mi>W</mml:mi></mml:mrow></mml:munder><mml:mi>&#x03B5;</mml:mi><mml:msub><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:mfrac><mml:mn>1</mml:mn><mml:mn>2</mml:mn></mml:mfrac><mml:msubsup><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo>&#x2212;</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">q</mml:mi><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mi>F</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:math></disp-formula>where <inline-formula id="ieqn-68"><mml:math id="mml-ieqn-68"><mml:msub><mml:mi mathvariant="bold-italic">S</mml:mi><mml:mrow><mml:mi>&#x03B5;</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>[</mml:mo><mml:mo>&#x22C5;</mml:mo><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> is the soft threshold operator. For any one parameter <inline-formula id="ieqn-69"><mml:math id="mml-ieqn-69"><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula id="ieqn-70"><mml:math id="mml-ieqn-70"><mml:mi>&#x03B5;</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="true" scriptlevel="0"><mml:mfrac><mml:mi>&#x03B1;</mml:mi><mml:msub><mml:mi>L</mml:mi><mml:mrow><mml:mi>g</mml:mi></mml:mrow></mml:msub></mml:mfrac></mml:mstyle></mml:math></inline-formula>, we have
<disp-formula id="eqn-15"><label>(15)</label><mml:math id="mml-eqn-15" display="block"><mml:msub><mml:mi mathvariant="bold-italic">S</mml:mi><mml:mrow><mml:mi>&#x03B5;</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:mtable columnalign="left left" rowspacing=".2em" columnspacing="1em" displaystyle="false"><mml:mtr><mml:mtd><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>&#x2212;</mml:mo><mml:mi>&#x03B5;</mml:mi></mml:mtd><mml:mtd><mml:mi>w</mml:mi><mml:mi>h</mml:mi><mml:mi>e</mml:mi><mml:mi>n</mml:mi><mml:mtext>&#x00A0;</mml:mtext><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>&#x003E;</mml:mo><mml:mi>&#x03B5;</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mtable rowspacing="4pt" columnspacing="1em"><mml:mtr><mml:mtd><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:mi>&#x03B5;</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0</mml:mn></mml:mtd></mml:mtr></mml:mtable></mml:mtd><mml:mtd><mml:mtable rowspacing="4pt" columnspacing="1em"><mml:mtr><mml:mtd><mml:mi>w</mml:mi><mml:mi>h</mml:mi><mml:mi>e</mml:mi><mml:mi>n</mml:mi><mml:mtext>&#x00A0;</mml:mtext><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>&#x003C;</mml:mo><mml:mo>&#x2212;</mml:mo><mml:mi>&#x03B5;</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:mi>o</mml:mi><mml:mi>t</mml:mi><mml:mi>h</mml:mi><mml:mi>e</mml:mi><mml:mi>r</mml:mi></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mtd></mml:mtr></mml:mtable><mml:mo fence="true" stretchy="true" symmetric="true"></mml:mo></mml:mrow></mml:math></disp-formula></p>
<p>According to <inline-formula id="ieqn-71"><mml:math id="mml-ieqn-71"><mml:mi>f</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, the Lipschitz constant is calculated as:
<disp-formula id="eqn-16"><label>(16)</label><mml:math id="mml-eqn-16" display="block"><mml:mtable columnalign="right left right left right left right left right left right left" rowspacing="3pt" columnspacing="0em 2em 0em 2em 0em 2em 0em 2em 0em 2em 0em" displaystyle="true"><mml:mtr><mml:mtd><mml:msubsup><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:mi>f</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mi>f</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mi>F</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:mtd><mml:mtd><mml:mi></mml:mi><mml:mo>=</mml:mo><mml:msubsup><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">X</mml:mi><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:mi mathvariant="bold-italic">X</mml:mi><mml:mi mathvariant="normal">&#x0394;</mml:mi><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mi>F</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo>+</mml:mo><mml:msubsup><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:mn>2</mml:mn><mml:mi>&#x03B3;</mml:mi><mml:mi mathvariant="normal">&#x0394;</mml:mi><mml:mrow><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mi mathvariant="bold-italic">R</mml:mi></mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mi>F</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo>+</mml:mo><mml:msubsup><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:mn>2</mml:mn><mml:mi>&#x03B2;</mml:mi><mml:mi mathvariant="normal">&#x0394;</mml:mi><mml:mrow><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mi mathvariant="bold-italic">A</mml:mi></mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mi>F</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:mtd></mml:mtr><mml:mtr><mml:mtd /><mml:mtd><mml:mi></mml:mi><mml:mspace width="1em" /><mml:mo>&#x2264;</mml:mo><mml:mn>2</mml:mn><mml:msubsup><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">X</mml:mi><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:mi mathvariant="bold-italic">X</mml:mi><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:msubsup><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:mi mathvariant="normal">&#x0394;</mml:mi><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mi>F</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo>+</mml:mo><mml:mn>4</mml:mn><mml:mi>&#x03B3;</mml:mi><mml:msubsup><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:mi mathvariant="bold-italic">C</mml:mi><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:msubsup><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:mi mathvariant="normal">&#x0394;</mml:mi><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mi>F</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo>+</mml:mo><mml:mn>4</mml:mn><mml:mi>&#x03B2;</mml:mi><mml:msubsup><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:mi mathvariant="bold-italic">A</mml:mi><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:msubsup><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:mi mathvariant="normal">&#x0394;</mml:mi><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mi>F</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula></p>
<p>Therefore, the Lipschitz constant for the CCSF model is:
<disp-formula id="eqn-17"><label>(17)</label><mml:math id="mml-eqn-17" display="block"><mml:msub><mml:mi>L</mml:mi><mml:mrow><mml:mi>g</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msqrt><mml:mn>2</mml:mn><mml:mrow><mml:mo>(</mml:mo><mml:msubsup><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">X</mml:mi><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:mi mathvariant="bold-italic">X</mml:mi><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo>+</mml:mo><mml:mn>2</mml:mn><mml:mi>&#x03B3;</mml:mi><mml:msubsup><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:mi mathvariant="bold-italic">C</mml:mi><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo>+</mml:mo><mml:mn>2</mml:mn><mml:mi>&#x03B2;</mml:mi><mml:msubsup><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:mi mathvariant="bold-italic">A</mml:mi><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo>)</mml:mo></mml:mrow></mml:msqrt></mml:math></disp-formula></p>
<p>The CCSF algorithm framework is as following:</p>
<fig id="fig-6">
<graphic mimetype="image" mime-subtype="tif" xlink:href="JAI_49083-fig-6.tif"/>
</fig>
<p>The validation method is as follows. <inline-formula id="ieqn-86"><mml:math id="mml-ieqn-86"><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mi>e</mml:mi><mml:mi>s</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> stands for testing dataset. The matrix dimension <inline-formula id="ieqn-87"><mml:math id="mml-ieqn-87"><mml:mi>m</mml:mi></mml:math></inline-formula> is the sample size of the remainder of the test set. <inline-formula id="ieqn-88"><mml:math id="mml-ieqn-88"><mml:msub><mml:mi>Y</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mi>e</mml:mi><mml:mi>s</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> represents predictive matrix. <inline-formula id="ieqn-89"><mml:math id="mml-ieqn-89"><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mi>e</mml:mi><mml:mi>s</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> represents score matrix.</p>
<fig id="fig-7">
<graphic mimetype="image" mime-subtype="tif" xlink:href="JAI_49083-fig-7.tif"/>
</fig>
</sec>
<sec id="s3_3">
<label>3.3</label>
<title>Complexity Analysis</title>
<p>The time complexity analysis of CCSF and comparison algorithms is shown in <xref ref-type="table" rid="table-1">Table 1</xref>, where <inline-formula id="ieqn-96"><mml:math id="mml-ieqn-96"><mml:mi>n</mml:mi></mml:math></inline-formula> represents the number of samples, <inline-formula id="ieqn-97"><mml:math id="mml-ieqn-97"><mml:mi>d</mml:mi></mml:math></inline-formula> represents the number of features, and <inline-formula id="ieqn-98"><mml:math id="mml-ieqn-98"><mml:mi>l</mml:mi></mml:math></inline-formula> represents the number of labels. The time complexity of CCSF consists of computing the asymmetric correlation matrix and accelerated gradient descent method, which results in <inline-formula id="ieqn-99"><mml:math id="mml-ieqn-99"><mml:mi>O</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msup><mml:mi>d</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mi>l</mml:mi><mml:mo>+</mml:mo><mml:mi>n</mml:mi><mml:mi>d</mml:mi><mml:mi>l</mml:mi><mml:mo>+</mml:mo><mml:mrow><mml:mtext>d</mml:mtext></mml:mrow><mml:msup><mml:mi>l</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. According to <xref ref-type="table" rid="table-1">Table 1</xref>, it can be seen that the time complexity of LLSF is lower than that of CCSF, which is <inline-formula id="ieqn-100"><mml:math id="mml-ieqn-100"><mml:mi>O</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msup><mml:mi>d</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mo>+</mml:mo><mml:mi>d</mml:mi><mml:mi>l</mml:mi><mml:mo>+</mml:mo><mml:msup><mml:mi>l</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mo>+</mml:mo><mml:mi>n</mml:mi><mml:mi>d</mml:mi><mml:mo>+</mml:mo><mml:mi>n</mml:mi><mml:mi>l</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, but the classification effect is not as good as that of CCSF. The time complexity of FF-MLLA is not given in the article. The rest of the algorithms have higher time complexity than that of CCSF.</p>
<table-wrap id="table-1">
<label>Table 1</label>
<caption>
<title>Time complexity of the algorithms</title>
</caption>
<table frame="hsides">
<colgroup>
<col align="left"/>
<col align="left"/>
</colgroup>
<thead>
<tr>
<th>Methods</th>
<th>Complexity analysis</th>
</tr>
</thead>
<tbody>
<tr>
<td>LSGL</td>
<td><inline-formula id="ieqn-101"><mml:math id="mml-ieqn-101"><mml:mi>O</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>d</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msup><mml:mi>d</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mo>+</mml:mo><mml:mi>n</mml:mi><mml:mi>l</mml:mi><mml:mo>+</mml:mo><mml:msup><mml:mi>l</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow><mml:mo>+</mml:mo><mml:mi>l</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msup><mml:mi>l</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mo>+</mml:mo><mml:msup><mml:mi>d</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow><mml:mo>+</mml:mo><mml:mi>n</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msup><mml:mi>l</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mo>+</mml:mo><mml:msup><mml:mi>d</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula></td>
</tr>
<tr>
<td>ACML</td>
<td><inline-formula id="ieqn-102"><mml:math id="mml-ieqn-102"><mml:mi>O</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msup><mml:mi>d</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mi>n</mml:mi><mml:mi>l</mml:mi><mml:mo>+</mml:mo><mml:mi>n</mml:mi><mml:mo>+</mml:mo><mml:mi>l</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>+</mml:mo><mml:msup><mml:mi>l</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mi>n</mml:mi><mml:mo>+</mml:mo><mml:mi>d</mml:mi><mml:mo>+</mml:mo><mml:mn>3</mml:mn><mml:mrow><mml:mo>/</mml:mo></mml:mrow><mml:mn>2</mml:mn><mml:mo>)</mml:mo></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula></td>
</tr>
<tr>
<td>LSML</td>
<td><inline-formula id="ieqn-103"><mml:math id="mml-ieqn-103"><mml:mi>O</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mi>n</mml:mi><mml:mo>+</mml:mo><mml:mi>l</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:msup><mml:mi>d</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mo>+</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mi>n</mml:mi><mml:mo>+</mml:mo><mml:mi>d</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:msup><mml:mi>l</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mo>+</mml:mo><mml:mi>n</mml:mi><mml:mi>d</mml:mi><mml:mi>l</mml:mi><mml:mo>+</mml:mo><mml:msup><mml:mi>d</mml:mi><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msup><mml:mo>+</mml:mo><mml:msup><mml:mi>l</mml:mi><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula></td>
</tr>
<tr>
<td>LLSF</td>
<td><inline-formula id="ieqn-104"><mml:math id="mml-ieqn-104"><mml:mi>O</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msup><mml:mi>d</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mo>+</mml:mo><mml:mi>d</mml:mi><mml:mi>l</mml:mi><mml:mo>+</mml:mo><mml:msup><mml:mi>l</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mo>+</mml:mo><mml:mi>n</mml:mi><mml:mi>d</mml:mi><mml:mo>+</mml:mo><mml:mi>n</mml:mi><mml:mi>l</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula></td>
</tr>
<tr>
<td>LSI-CI</td>
<td><inline-formula id="ieqn-105"><mml:math id="mml-ieqn-105"><mml:mi>O</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>n</mml:mi><mml:msup><mml:mi>d</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mo>+</mml:mo><mml:mi>n</mml:mi><mml:mi>d</mml:mi><mml:mo>+</mml:mo><mml:mi>n</mml:mi><mml:mi>d</mml:mi><mml:mi>l</mml:mi><mml:mo>+</mml:mo><mml:mi>l</mml:mi><mml:msup><mml:mi>g</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mo>+</mml:mo><mml:msup><mml:mi>d</mml:mi><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msup><mml:mo>+</mml:mo><mml:msup><mml:mi>d</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mi>l</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula></td>
</tr>
<tr>
<td>CCSF</td>
<td><inline-formula id="ieqn-106"><mml:math id="mml-ieqn-106"><mml:mi>O</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msup><mml:mi>d</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mi>l</mml:mi><mml:mo>+</mml:mo><mml:mi>n</mml:mi><mml:mi>d</mml:mi><mml:mi>l</mml:mi><mml:mo>+</mml:mo><mml:mrow><mml:mtext>d</mml:mtext></mml:mrow><mml:msup><mml:mi>l</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula></td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
</sec>
<sec id="s4">
<label>4</label>
<title>Experiment</title>
<sec id="s4_1">
<label>4.1</label>
<title>Datasets</title>
<p>To validate the effectiveness of the algorithm proposed in this paper, five cross-validations were performed on nine multi-label benchmark datasets. The datasets are from different domains, the details of which are shown in <xref ref-type="table" rid="table-2">Table 2</xref>.</p>
<table-wrap id="table-2">
<label>Table 2</label>
<caption>
<title>Multi-label datasets</title>
</caption>
<table frame="hsides">
<colgroup>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
</colgroup>
<thead>
<tr>
<th>Datasets</th>
<th>Instance</th>
<th>Feature</th>
<th>Label</th>
<th>Cardinality</th>
<th>Domain</th>
</tr>
</thead>
<tbody>
<tr>
<td>Birds <sup>[2]</sup></td>
<td>645</td>
<td>260</td>
<td>20</td>
<td>1.471</td>
<td>Images</td>
</tr>
<tr>
<td>Arts <sup>[1]</sup></td>
<td>5000</td>
<td>462</td>
<td>26</td>
<td>1.636</td>
<td>Text</td>
</tr>
<tr>
<td>Computer <sup>[1]</sup></td>
<td>5000</td>
<td>681</td>
<td>33</td>
<td>1.508</td>
<td>Text</td>
</tr>
<tr>
<td>Education <sup>[1]</sup></td>
<td>5000</td>
<td>550</td>
<td>33</td>
<td>1.461</td>
<td>Text</td>
</tr>
<tr>
<td>Entertainment <sup>[1]</sup></td>
<td>5000</td>
<td>640</td>
<td>21</td>
<td>1.640</td>
<td>News</td>
</tr>
<tr>
<td>Business <sup>[1]</sup></td>
<td>5000</td>
<td>438</td>
<td>30</td>
<td>1.438</td>
<td>News</td>
</tr>
<tr>
<td>Recreation <sup>[1]</sup></td>
<td>5000</td>
<td>606</td>
<td>22</td>
<td>1.606</td>
<td>News</td>
</tr>
<tr>
<td>Reference <sup>[1]</sup></td>
<td>5000</td>
<td>793</td>
<td>33</td>
<td>1.793</td>
<td>Text</td>
</tr>
<tr>
<td>Science <sup>[1]</sup></td>
<td>5000</td>
<td>743</td>
<td>40</td>
<td>1.451</td>
<td>Text</td>
</tr>
</tbody>
</table>
<table-wrap-foot><fn><p>Note: [1] <ext-link ext-link-type="uri" xlink:href="http://www.uco.es/kdis/mllresources/">http://www.uco.es/kdis/mllresources/</ext-link>. [2] <ext-link ext-link-type="uri" xlink:href="http://mulan.sourceforge.net/datasets-mlc.html">http://mulan.sourceforge.net/datasets-mlc.html</ext-link>.</p>
</fn>
</table-wrap-foot>
</table-wrap>
</sec>
<sec id="s4_2">
<label>4.2</label>
<title>Results and Comparison Algorithms</title>
<p>The experimental codes are implemented in MatlabR2021a, with a hardware environment of IntelCore (TM) i5-11600KF 3.90 GHz CPU, 32 G RAM, and an operating system of Windows 10.</p>
<p>In order to compare the effectiveness of CCSF algorithms, six commonly used evaluation metrics in MLL are selected in this paper, which are Hamming Loss (HL), Average Precision (AP), One Error (OE), Ranking Loss (RL), Coverage (CV), and AUC (AUC). Among them, the smaller the HL, OE, RL, CV metrics the better, the larger the AP and AUC metrics the better the experimental effect. Specific formulas and meanings can be found in the literature [<xref ref-type="bibr" rid="ref-28">28</xref>,<xref ref-type="bibr" rid="ref-29">29</xref>]. The parameters of the comparison algorithm are set as follows:</p>
<p>1) In LSGL [<xref ref-type="bibr" rid="ref-16">16</xref>] algorithm, <inline-formula id="ieqn-107"><mml:math id="mml-ieqn-107"><mml:msub><mml:mi>&#x03BB;</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>&#x2208;</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:msup><mml:mn>10</mml:mn><mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mn>3</mml:mn></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:msup><mml:mn>10</mml:mn><mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:msup><mml:mn>10</mml:mn><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msup><mml:mo>}</mml:mo></mml:mrow></mml:math></inline-formula>, <inline-formula id="ieqn-108"><mml:math id="mml-ieqn-108"><mml:msub><mml:mi>&#x03BB;</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>&#x03BB;</mml:mi><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>&#x03BB;</mml:mi><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>&#x03BB;</mml:mi><mml:mrow><mml:mn>5</mml:mn></mml:mrow></mml:msub><mml:mo>&#x2208;</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:msup><mml:mn>10</mml:mn><mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mn>3</mml:mn></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:msup><mml:mn>10</mml:mn><mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:msup><mml:mn>10</mml:mn><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msup><mml:mo>}</mml:mo></mml:mrow></mml:math></inline-formula>;</p>
<p>2) The parameters interval of the ACML [<xref ref-type="bibr" rid="ref-17">17</xref>] algorithm are <inline-formula id="ieqn-109"><mml:math id="mml-ieqn-109"><mml:mi>&#x03B1;</mml:mi><mml:mo>&#x2208;</mml:mo><mml:mrow><mml:mo>[</mml:mo><mml:msup><mml:mn>2</mml:mn><mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mn>10</mml:mn></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:msup><mml:mn>2</mml:mn><mml:mrow><mml:mn>10</mml:mn></mml:mrow></mml:msup><mml:mo>]</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:mtext>&#x00A0;</mml:mtext><mml:mi>&#x03B2;</mml:mi><mml:mo>&#x2208;</mml:mo><mml:mrow><mml:mo>[</mml:mo><mml:msup><mml:mn>2</mml:mn><mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mn>10</mml:mn></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:msup><mml:mn>2</mml:mn><mml:mrow><mml:mn>10</mml:mn></mml:mrow></mml:msup><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula>;</p>
<p>3) Numbers of nearest neighbors in the FF-MLLA [<xref ref-type="bibr" rid="ref-12">12</xref>] algorithm are <inline-formula id="ieqn-110"><mml:math id="mml-ieqn-110"><mml:mrow><mml:mtext>k</mml:mtext></mml:mrow><mml:mo>=</mml:mo><mml:mn>15</mml:mn></mml:math></inline-formula>, <inline-formula id="ieqn-111"><mml:math id="mml-ieqn-111"><mml:mi>&#x03B2;</mml:mi><mml:mo>=</mml:mo></mml:math></inline-formula>1, <inline-formula id="ieqn-112"><mml:math id="mml-ieqn-112"><mml:mrow><mml:mtext>KRBF</mml:mtext></mml:mrow><mml:mo>=</mml:mo><mml:mn>100</mml:mn></mml:math></inline-formula>;</p>
<p>4) The parameters of LSML [<xref ref-type="bibr" rid="ref-15">15</xref>] are set as follows <inline-formula id="ieqn-113"><mml:math id="mml-ieqn-113"><mml:msub><mml:mi>&#x03BB;</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msup><mml:mn>10</mml:mn><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:mtext>&#x00A0;</mml:mtext><mml:msub><mml:mi>&#x03BB;</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msup><mml:mn>10</mml:mn><mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mn>5</mml:mn></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:mtext>&#x00A0;</mml:mtext><mml:msub><mml:mi>&#x03BB;</mml:mi><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msup><mml:mn>10</mml:mn><mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mn>3</mml:mn></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:mtext>&#x00A0;</mml:mtext><mml:msub><mml:mi>&#x03BB;</mml:mi><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msup><mml:mn>10</mml:mn><mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mn>5</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>;</p>
<p>5) The parameters of LLSF [<xref ref-type="bibr" rid="ref-11">11</xref>] are set to <inline-formula id="ieqn-114"><mml:math id="mml-ieqn-114"><mml:mi>&#x03B1;</mml:mi><mml:mo>=</mml:mo><mml:msup><mml:mn>2</mml:mn><mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mn>4</mml:mn></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:mtext>&#x00A0;</mml:mtext><mml:mi>&#x03B2;</mml:mi><mml:mo>=</mml:mo><mml:msup><mml:mn>2</mml:mn><mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mn>6</mml:mn></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:mtext>&#x00A0;</mml:mtext><mml:mi>&#x03B3;</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:math></inline-formula>;</p>
<p>6) The parameters of LSI-CI [<xref ref-type="bibr" rid="ref-30">30</xref>] are set to <inline-formula id="ieqn-115"><mml:math id="mml-ieqn-115"><mml:mi>&#x03B1;</mml:mi><mml:mo>=</mml:mo><mml:msup><mml:mn>2</mml:mn><mml:mrow><mml:mn>10</mml:mn></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:mtext>&#x00A0;</mml:mtext><mml:mi>&#x03B2;</mml:mi><mml:mo>=</mml:mo><mml:msup><mml:mn>2</mml:mn><mml:mrow><mml:mn>8</mml:mn></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:mtext>&#x00A0;</mml:mtext><mml:mi>&#x03B3;</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mtext>&#x00A0;</mml:mtext><mml:mi>&#x03B8;</mml:mi><mml:mo>=</mml:mo><mml:msup><mml:mn>2</mml:mn><mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mn>8</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>;</p>
<p>7) The parameters of CCSF are set as <inline-formula id="ieqn-116"><mml:math id="mml-ieqn-116"><mml:mi>&#x03B1;</mml:mi><mml:mo>,</mml:mo><mml:mtext>&#x00A0;</mml:mtext><mml:mi>&#x03B2;</mml:mi><mml:mo>,</mml:mo><mml:mtext>&#x00A0;</mml:mtext><mml:mi>&#x03B3;</mml:mi><mml:mo>&#x2208;</mml:mo><mml:mrow><mml:mo>[</mml:mo><mml:msup><mml:mn>2</mml:mn><mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mn>10</mml:mn></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:mtext>&#x00A0;</mml:mtext><mml:msup><mml:mn>2</mml:mn><mml:mrow><mml:mn>10</mml:mn></mml:mrow></mml:msup><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula>.</p>
<p>The experimental results of the CCSF algorithm on 9 datasets with 6 state-of-the-art algorithms under 6 different metrics are given in <xref ref-type="table" rid="table-2">Table 2</xref>, where &#x201C;<inline-formula id="ieqn-117"><mml:math id="mml-ieqn-117"><mml:mo stretchy="false">&#x2191;</mml:mo></mml:math></inline-formula>&#x201D; (&#x201C;<inline-formula id="ieqn-118"><mml:math id="mml-ieqn-118"><mml:mo stretchy="false">&#x2193;</mml:mo></mml:math></inline-formula>&#x201D;) indicates that higher (lower) values of the metrics are better, and the experimental results that are dominant are bolded. The details are as follows.</p>
<p>1) As can be seen from <xref ref-type="table" rid="table-3">Table 3</xref>, out of the 54 sets of experimental results, the CCSF algorithm is superior in 49 sets, with a superiority rate of 90.74%. The CCSF algorithm significantly outperforms the other compared algorithms on all 8 datasets. The variance of the CCSF algorithm is smaller, which also proves that the CCSF algorithm is more stable. On the Birds dataset, the CCSF algorithm and the ACML algorithm are equally dominant, due to the fact that both algorithms use causal learning algorithms to compute asymmetric correlations between labels. While the Birds dataset is small, it is difficult to extract more common features of the labels, and the experimental effect dominance is not obvious compared to the larger dataset.</p>
<table-wrap id="table-3">
<label>Table 3</label>
<caption>
<title>Test results of each algorithms on six evaluation metrics (mean &#x00B1; std)</title>
</caption>
<table frame="hsides">
<colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Data sets</th>
<th>Metrics</th>
<th>CCSF</th>
<th>LSGL</th>
<th>ACML</th>
<th>FF-MLLA</th>
<th>LSML</th>
<th>LLSF</th>
<th>LSF-CI</th>
</tr>
</thead>
<tbody>
<tr>
<td rowspan="6">Birds</td>
<td>HL&#x2193;</td>
<td><bold>0.0470 &#x00B1; 0.0056</bold></td>
<td>0.0505 &#x00B1; 0.0047</td>
<td>0.0512 &#x00B1; 0.0023</td>
<td>0.0476 &#x00B1; 0.0040</td>
<td>0.0595 &#x00B1; 0.0046</td>
<td>0.0506 &#x00B1; 0.0041</td>
<td>0.0647 &#x00B1; 0.0063</td>
</tr>
<tr>
<td>AP&#x2191;</td>
<td><bold>0.7710 &#x00B1; 0.0314</bold></td>
<td>0.7664 &#x00B1; 0.0128</td>
<td>0.7648 &#x00B1; 0.0275</td>
<td>0.7517 &#x00B1; 0.0247</td>
<td>0.7596 &#x00B1; 0.0219</td>
<td>0.7582 &#x00B1; 0.0350</td>
<td>0.6302 &#x00B1; 0.0165</td>
</tr>
<tr>
<td>OE&#x2193;</td>
<td><bold>0.2742 &#x00B1; 0.0503</bold></td>
<td>0.2806 &#x00B1; 0.0186</td>
<td>0.2821 &#x00B1; 0.0301</td>
<td>0.3007 &#x00B1; 0.0358</td>
<td>0.2884 &#x00B1; 0.0458</td>
<td>0.2915 &#x00B1; 0.0385</td>
<td>0.4062 &#x00B1; 0.0233</td>
</tr>
<tr>
<td>RL&#x2193;</td>
<td>0.0915 &#x00B1; 0.0234</td>
<td>0.0948 &#x00B1; 0.0128</td>
<td><bold>0.0891 &#x00B1; 0.0150</bold></td>
<td>0.1067 &#x00B1; 0.0116</td>
<td>0.0972 &#x00B1; 0.0061</td>
<td>0.0962 &#x00B1; 0.0232</td>
<td>0.2206 &#x00B1; 0.0168</td>
</tr>
<tr>
<td>CV&#x2193;</td>
<td>0.1425 &#x00B1; 0.0285</td>
<td>0.1486 &#x00B1; 0.0229</td>
<td><bold>0.1391 &#x00B1; 0.0210</bold></td>
<td>0.1512 &#x00B1; 0.0206</td>
<td>0.1483 &#x00B1; 0.0162</td>
<td>0.1471 &#x00B1; 0.0322</td>
<td>0.2739 &#x00B1; 0.0156</td>
</tr>
<tr>
<td>AUC&#x2191;</td>
<td>0.8787 &#x00B1; 0.0204</td>
<td>0.8702 &#x00B1; 0.0184</td>
<td><bold>0.8953 &#x00B1; 0.0460</bold></td>
<td>0.7714 &#x00B1; 0.0145</td>
<td>0.6498 &#x00B1; 0.0040</td>
<td>0.7690 &#x00B1; 0.0178</td>
<td>0.6978 &#x00B1; 0.0145</td>
</tr>
<tr>
<td rowspan="6">Arts</td>
<td>HL&#x2193;</td>
<td><bold>0.0525 &#x00B1; 0.0014</bold></td>
<td>0.0529 &#x00B1; 0.0009</td>
<td>0.0536 &#x00B1; 0.0007</td>
<td>0.0588 &#x00B1; 0.0015</td>
<td>0.0582 &#x00B1; 0.0011</td>
<td>0.0566 &#x00B1; 0.0009</td>
<td>0.0561 &#x00B1; 0.0013</td>
</tr>
<tr>
<td>AP&#x2191;</td>
<td><bold>0.6367 &#x00B1; 0.0112</bold></td>
<td>0.6340 &#x00B1; 0.0069</td>
<td>0.6241 &#x00B1; 0.0141</td>
<td>0.5211 &#x00B1; 0.0101</td>
<td>0.5932 &#x00B1; 0.0069</td>
<td>0.5852 &#x00B1; 0.0147</td>
<td>0.5451 &#x00B1; 0.0100</td>
</tr>
<tr>
<td>OE&#x2193;</td>
<td><bold>0.4402 &#x00B1; 0.0157</bold></td>
<td>0.4454 &#x00B1; 0.0118</td>
<td>0.4524 &#x00B1; 0.0179</td>
<td>0.6070 &#x00B1; 0.0191</td>
<td>0.4762 &#x00B1; 0.0088</td>
<td>0.4900 &#x00B1; 0.0181</td>
<td>0.5090 &#x00B1; 0.0180</td>
</tr>
<tr>
<td>RL&#x2193;</td>
<td><bold>0.1098 &#x00B1; 0.0052</bold></td>
<td>0.1263 &#x00B1; 0.0029</td>
<td>0.1405 &#x00B1; 0.0074</td>
<td>0.1571 &#x00B1; 0.0031</td>
<td>0.1770 &#x00B1; 0.0058</td>
<td>0.1841 &#x00B1; 0.0106</td>
<td>0.2621 &#x00B1; 0.0106</td>
</tr>
<tr>
<td>CV&#x2193;</td>
<td><bold>0.1686 &#x00B1; 0.002</bold></td>
<td>0.1970 &#x00B1; 0.0034</td>
<td>0.2141 &#x00B1; 0.0080</td>
<td>0.2212 &#x00B1; 0.0050</td>
<td>0.2567 &#x00B1; 0.0060</td>
<td>0.2650 &#x00B1; 0.0117</td>
<td>0.3448 &#x00B1; 0.0107</td>
</tr>
<tr>
<td>AUC&#x2191;</td>
<td><bold>0.8573 &#x00B1; 0.0053</bold></td>
<td>0.8316 &#x00B1; 0.0026</td>
<td>0.7832 &#x00B1; 0.0891</td>
<td>0.5558 &#x00B1; 0.0059</td>
<td>0.6723 &#x00B1; 0.0178</td>
<td>0.6916 &#x00B1; 0.0114</td>
<td>0.7102 &#x00B1; 0.0007</td>
</tr>
<tr>
<td rowspan="6">Computers</td>
<td>HL&#x2193;</td>
<td><bold>0.0324 &#x00B1; 0.0017</bold></td>
<td>0.0332 &#x00B1; 0.001</td>
<td>0.0339 &#x00B1; 0.0009</td>
<td>0.0383 &#x00B1; 0.0010</td>
<td>0.0391 &#x00B1; 0.0008</td>
<td>0.0389 &#x00B1; 0.0010</td>
<td>0.0415 &#x00B1; 0.0018</td>
</tr>
<tr>
<td>AP&#x2191;</td>
<td><bold>0.7193 &#x00B1; 0.014</bold></td>
<td>0.7171 &#x00B1; 0.0121</td>
<td>0.7093 &#x00B1; 0.0163</td>
<td>0.6424 &#x00B1; 0.0041</td>
<td>0.6915 &#x00B1; 0.0059</td>
<td>0.6575 &#x00B1; 0.0064</td>
<td>0.5839 &#x00B1; 0.0059</td>
</tr>
<tr>
<td>OE&#x2193;</td>
<td><bold>0.3366 &#x00B1; 0.0177</bold></td>
<td>0.3416 &#x00B1; 0.0216</td>
<td>0.3466 &#x00B1; 0.0171</td>
<td>0.4302 &#x00B1; 0.0072</td>
<td>0.3608 &#x00B1; 0.0060</td>
<td>0.4080 &#x00B1; 0.0094</td>
<td>0.4614 &#x00B1; 0.0072</td>
</tr>
<tr>
<td>RL&#x2193;</td>
<td><bold>0.0655 &#x00B1; 0.005</bold></td>
<td>0.0841 &#x00B1; 0.0051</td>
<td>0.0980 &#x00B1; 0.0086</td>
<td>0.0974 &#x00B1; 0.0044</td>
<td>0.1230 &#x00B1; 0.0059</td>
<td>0.1229 &#x00B1; 0.0059</td>
<td>0.2299 &#x00B1; 0.0126</td>
</tr>
<tr>
<td>CV&#x2193;</td>
<td><bold>0.1017 &#x00B1; 0.0082</bold></td>
<td>0.1261 &#x00B1; 0.0044</td>
<td>0.1406 &#x00B1; 0.0106</td>
<td>0.1402 &#x00B1; 0.0045</td>
<td>0.1725 &#x00B1; 0.0051</td>
<td>0.1720 &#x00B1; 0.0088</td>
<td>0.2888 &#x00B1; 0.0170</td>
</tr>
<tr>
<td>AUC&#x2191;</td>
<td><bold>0.9078 &#x00B1; 0.007</bold></td>
<td>0.8878 &#x00B1; 0.0065</td>
<td>0.7828 &#x00B1; 0.0916</td>
<td>0.6764 &#x00B1; 0.0041</td>
<td>0.7505 &#x00B1; 0.0165</td>
<td>0.6810 &#x00B1; 0.0136</td>
<td>0.7813 &#x00B1; 0.0068</td>
</tr>
<tr>
<td rowspan="6">Education</td>
<td>HL&#x2193;</td>
<td><bold>0.0369 &#x00B1; 0.0017</bold></td>
<td>0.0369 &#x00B1; 0.0010</td>
<td>0.0371 &#x00B1; 0.0008</td>
<td>0.0407 &#x00B1; 0.0002</td>
<td>0.0411 &#x00B1; 0.0003</td>
<td>0.0414 &#x00B1; 0.0007</td>
<td>0.0418 &#x00B1; 0.0012</td>
</tr>
<tr>
<td>AP&#x2191;</td>
<td><bold>0.6437 &#x00B1; 0.0192</bold></td>
<td><bold>0.6437 &#x00B1; 0.0083</bold></td>
<td>0.6337 &#x00B1; 0.0153</td>
<td>0.5497 &#x00B1; 0.0050</td>
<td>0.6033 &#x00B1; 0.0082</td>
<td>0.5805 &#x00B1; 0.0069</td>
<td>0.5290 &#x00B1; 0.0166</td>
</tr>
<tr>
<td>OE&#x2193;</td>
<td>0.4622 &#x00B1; 0.0274</td>
<td>0.4542 &#x00B1; 0.0154</td>
<td><bold>0.4606 &#x00B1;</bold> 0.0203</td>
<td>0.5868 &#x00B1; 0.0079</td>
<td>0.4826 &#x00B1; 0.0178</td>
<td>0.5090 &#x00B1; 0.0070</td>
<td>0.5290 &#x00B1; 0.0166</td>
</tr>
<tr>
<td>RL&#x2193;</td>
<td><bold>0.0700 &#x00B1; 0.0054</bold></td>
<td>0.0953 &#x00B1; 0.0054</td>
<td>0.1089 &#x00B1; 0.0057</td>
<td>0.1001 &#x00B1; 0.0053</td>
<td>0.1526 &#x00B1; 0.0068</td>
<td>0.1642 &#x00B1; 0.0065</td>
<td>0.2486 &#x00B1; 0.0081</td>
</tr>
<tr>
<td>CV&#x2193;</td>
<td><bold>0.0981 &#x00B1; 0.0062</bold></td>
<td>0.1425 &#x00B1; 0.0081</td>
<td>0.1592 &#x00B1; 0.0086</td>
<td>0.1323 &#x00B1; 0.0067</td>
<td>0.2123 &#x00B1; 0.0077</td>
<td>0.2215 &#x00B1; 0.0068</td>
<td>0.3133 &#x00B1; 0.0115</td>
</tr>
<tr>
<td>AUC&#x2191;</td>
<td><bold>0.9165 &#x00B1; 0.0056</bold></td>
<td>0.8807 &#x00B1; 0.0082</td>
<td>0.8709 &#x00B1; 0.0190</td>
<td>0.5612 &#x00B1; 0.0037</td>
<td>0.6435 &#x00B1; 0.0003</td>
<td>0.6660 &#x00B1; 0.0160</td>
<td>0.6784 &#x00B1; 0.0532</td>
</tr>
<tr>
<td rowspan="6" >Entertain</td>
<td>HL&#x2193;</td>
<td><bold>0.0491 &#x00B1; 0.0021</bold></td>
<td>0.0505 &#x00B1; 0.0017</td>
<td>0.0508 &#x00B1; 0.0014</td>
<td>0.0589 &#x00B1; 0.0005</td>
<td>0.0570 &#x00B1; 0.0006</td>
<td>0.0550 &#x00B1; 0.0015</td>
<td>0.0550 &#x00B1; 0.0014</td>
</tr>
<tr>
<td>AP&#x2191;</td>
<td><bold>0.7052 &#x00B1; 0.0134</bold></td>
<td>0.7002 &#x00B1; 0.0108</td>
<td>0.6925 &#x00B1; 0.0067</td>
<td>0.5777 &#x00B1; 0.0110</td>
<td>0.6731 &#x00B1; 0.0089</td>
<td>0.6669 &#x00B1; 0.0071</td>
<td>0.6351 &#x00B1; 0.0076</td>
</tr>
<tr>
<td>OE&#x2193;</td>
<td><bold>0.3772 &#x00B1; 0.0231</bold></td>
<td>0.3848 &#x00B1; 0.0157</td>
<td>0.3912 &#x00B1; 0.0106</td>
<td>0.5668 &#x00B1; 0.0178</td>
<td>0.4072 &#x00B1; 0.0113</td>
<td>0.4092 &#x00B1; 0.0070</td>
<td>0.4166 &#x00B1; 0.0063</td>
</tr>
<tr>
<td>RL&#x2193;</td>
<td><bold>0.0875 &#x00B1; 0.0036</bold></td>
<td>0.1019 &#x00B1; 0.0053</td>
<td>0.1163 &#x00B1; 0.0035</td>
<td>0.1284 &#x00B1; 0.0047</td>
<td>0.1422 &#x00B1; 0.0040</td>
<td>0.1460 &#x00B1; 0.0110</td>
<td>0.2215 &#x00B1; 0.0096</td>
</tr>
<tr>
<td>CV&#x2193;</td>
<td><bold>0.1234 &#x00B1; 0.0060</bold></td>
<td>0.1456 &#x00B1; 0.0073</td>
<td>0.1605 &#x00B1; 0.0066</td>
<td>0.1661 &#x00B1; 0.0052</td>
<td>0.1897 &#x00B1; 0.0044</td>
<td>0.1918 &#x00B1; 0.0114</td>
<td>0.2717 &#x00B1; 0.0109</td>
</tr>
<tr>
<td>AUC&#x2191;</td>
<td><bold>0.8844 &#x00B1; 0.0062</bold></td>
<td>0.8648 &#x00B1; 0.0056</td>
<td>0.8013 &#x00B1; 0.0009</td>
<td>0.5879 &#x00B1; 0.0035</td>
<td>0.6128 &#x00B1; 0.0034</td>
<td>0.7600 &#x00B1; 0.1127</td>
<td>0.5699 &#x00B1; 0.0901</td>
</tr>
<tr>
<td rowspan="6">Business</td>
<td>HL&#x2193;</td>
<td><bold>0.0239 &#x00B1; 0.0013</bold></td>
<td>0.0245 &#x00B1; 0.0005</td>
<td>0.0266 &#x00B1; 0.0004</td>
<td>0.0261 &#x00B1; 0.0016</td>
<td>0.0287 &#x00B1; 0.0010</td>
<td>0.0295 &#x00B1; 0.0009</td>
<td>0.0398 &#x00B1; 0.0008</td>
</tr>
<tr>
<td>AP&#x2191;</td>
<td><bold>0.8978 &#x00B1; 0.0082</bold></td>
<td>0.8898 &#x00B1; 0.0068</td>
<td>0.8809 &#x00B1; 0.0072</td>
<td>0.8805 &#x00B1; 0.0110</td>
<td>0.8798 &#x00B1; 0.0085</td>
<td>0.8484 &#x00B1; 0.0090</td>
<td>0.7825 &#x00B1; 0.0116</td>
</tr>
<tr>
<td>OE&#x2193;</td>
<td><bold>0.1024 &#x00B1; 0.0144</bold></td>
<td>0.1086 &#x00B1; 0.0102</td>
<td>0.1158 &#x00B1; 0.0074</td>
<td>0.1140 &#x00B1; 0.0101</td>
<td>0.1104 &#x00B1; 0.0105</td>
<td>0.1452 &#x00B1; 0.0112</td>
<td>0.2240 &#x00B1; 0.0185</td>
</tr>
<tr>
<td>RL&#x2193;</td>
<td><bold>0.0304 &#x00B1; 0.0037</bold></td>
<td>0.0386 &#x00B1; 0.0014</td>
<td>0.0443 &#x00B1; 0.0048</td>
<td>0.0452 &#x00B1; 0.0049</td>
<td>0.0485 &#x00B1; 0.0048</td>
<td>0.0635 &#x00B1; 0.0041</td>
<td>0.1036 &#x00B1; 0.0071</td>
</tr>
<tr>
<td>CV&#x2193;</td>
<td><bold>0.0651 &#x00B1; 0.0057</bold></td>
<td>0.0794 &#x00B1; 0.0019</td>
<td>0.0895 &#x00B1; 0.0078</td>
<td>0.0833 &#x00B1; 0.0068</td>
<td>0.0967 &#x00B1; 0.0097</td>
<td>0.1096 &#x00B1; 0.0064</td>
<td>0.1559 &#x00B1; 0.0092</td>
</tr>
<tr>
<td>AUC&#x2191;</td>
<td><bold>0.9503 &#x00B1; 0.0060</bold></td>
<td>0.9371 &#x00B1; 0.0028</td>
<td>0.8971 &#x00B1; 0.0142</td>
<td>0.8520 &#x00B1; 0.0075</td>
<td>0.7990 &#x00B1; 0.0005</td>
<td>0.7168 &#x00B1; 0.0093</td>
<td>0.7009 &#x00B1; 0.0141</td>
</tr>
<tr>
<td/>
<td>HL&#x2193;</td>
<td><bold>0.0239 &#x00B1; 0.0010</bold></td>
<td>0.0251 &#x00B1; 0.0007</td>
<td>0.0257 &#x00B1; 0.0008</td>
<td>0.0292 &#x00B1; 0.0004</td>
<td>0.0294 &#x00B1; 0.0010</td>
<td>0.0280 &#x00B1; 0.0006</td>
<td>0.0298 &#x00B1; 0.0014</td>
</tr>
<tr>
<td/>
<td>AP&#x2191;</td>
<td><bold>0.7269 &#x00B1; 0.0154</bold></td>
<td>0.7249 &#x00B1; 0.0051</td>
<td>0.7135 &#x00B1; 0.0033</td>
<td>0.6301 &#x00B1; 0.0078</td>
<td>0.7052 &#x00B1; 0.0072</td>
<td>0.6634 &#x00B1; 0.0129</td>
<td>0.5929 &#x00B1; 0.0173</td>
</tr>
<tr>
<td/>
<td>OE&#x2193;</td>
<td><bold>0.3534 &#x00B1; 0.0205</bold></td>
<td>0.3582 &#x00B1; 0.0062</td>
<td>0.3642 &#x00B1; 0.0090</td>
<td>0.4658 &#x00B1; 0.0084</td>
<td>0.3666 &#x00B1; 0.0088</td>
<td>0.4020 &#x00B1; 0.0144</td>
<td>0.4692 &#x00B1; 0.0178</td>
</tr>
<tr>
<td>Reference</td>
<td>RL&#x2193;</td>
<td><bold>0.0573 &#x00B1; 0.0046</bold></td>
<td>0.0709 &#x00B1; 0.0049</td>
<td>0.0930 &#x00B1; 0.0047</td>
<td>0.0934 &#x00B1; 0.0047</td>
<td>0.1070 &#x00B1; 0.0060</td>
<td>0.1398 &#x00B1; 0.0087</td>
<td>0.2426 &#x00B1; 0.0149</td>
</tr>
<tr>
<td/>
<td>CV&#x2193;</td>
<td><bold>0.0716 &#x00B1; 0.0045</bold></td>
<td>0.0923 &#x00B1; 0.0067</td>
<td>0.1194 &#x00B1; 0.0079</td>
<td>0.1100 &#x00B1; 0.0057</td>
<td>0.1354 &#x00B1; 0.0072</td>
<td>0.1705 &#x00B1; 0.0104</td>
<td>0.2745 &#x00B1; 0.0171</td>
</tr>
<tr>
<td/>
<td>AUC&#x2191;</td>
<td><bold>0.9277 &#x00B1; 0.0046</bold></td>
<td>0.9086 &#x00B1; 0.0072</td>
<td>0.7505 &#x00B1; 0.056</td>
<td>0.6461 &#x00B1; 0.0035</td>
<td>0.7234 &#x00B1; 0.0093</td>
<td>0.6983 &#x00B1; 0.0023</td>
<td>0.6728 &#x00B1; 0.0003</td>
</tr>
<tr>
<td rowspan="6">Recreation</td>
<td>HL&#x2193;</td>
<td><bold>0.0517 &#x00B1; 0.0019</bold></td>
<td>0.0541 &#x00B1; 0.0010</td>
<td>0.0535 &#x00B1; 0.0011</td>
<td>0.9361 &#x00B1; 0.0016</td>
<td>0.0578 &#x00B1; 0.0008</td>
<td>0.0571 &#x00B1; 0.0013</td>
<td>0.0565 &#x00B1; 0.0003</td>
</tr>
<tr>
<td>AP&#x2191;</td>
<td><bold>0.6539 &#x00B1; 0.0110</bold></td>
<td>0.6509 &#x00B1; 0.0072</td>
<td>0.6391 &#x00B1; 0.0043</td>
<td>0.4892 &#x00B1; 0.0039</td>
<td>0.6185 &#x00B1; 0.0097</td>
<td>0.5985 &#x00B1; 0.0148</td>
<td>0.5692 &#x00B1; 0.0097</td>
</tr>
<tr>
<td>OE&#x2193;</td>
<td><bold>0.4334 &#x00B1; 0.0181</bold></td>
<td>0.4408 &#x00B1; 0.0099</td>
<td>0.4444 &#x00B1; 0.0092</td>
<td>0.6616 &#x00B1; 0.0071</td>
<td>0.4614 &#x00B1; 0.0102</td>
<td>0.4890 &#x00B1; 0.0248</td>
<td>0.5056 &#x00B1; 0.0113</td>
</tr>
<tr>
<td>RL&#x2193;</td>
<td><bold>0.1194 &#x00B1; 0.0045</bold></td>
<td>0.1291 &#x00B1; 0.0032</td>
<td>0.1485 &#x00B1; 0.0037</td>
<td>0.1830 &#x00B1; 0.0015</td>
<td>0.1741 &#x00B1; 0.0107</td>
<td>0.1868 &#x00B1; 0.0045</td>
<td>0.2446 &#x00B1; 0.0075</td>
</tr>
<tr>
<td>CV&#x2193;</td>
<td><bold>0.1620 &#x00B1; 0.0052</bold></td>
<td>0.1776 &#x00B1; 0.0049</td>
<td>0.1992 &#x00B1; 0.0059</td>
<td>0.2221 &#x00B1; 0.0028</td>
<td>0.2277 &#x00B1; 0.0123</td>
<td>0.2392 &#x00B1; 0.0044</td>
<td>0.2968 &#x00B1; 0.0088</td>
</tr>
<tr>
<td>AUC&#x2191;</td>
<td><bold>0.8448 &#x00B1; 0.0071</bold></td>
<td>0.8305 &#x00B1; 0.0045</td>
<td>0.8022 &#x00B1; 0.0071</td>
<td>0.5339 &#x00B1; 0.0017</td>
<td>0.6991 &#x00B1; 0.0189</td>
<td>0.7764 &#x00B1; 0.0080</td>
<td>0.6101 &#x00B1; 0.0019</td>
</tr>
<tr>
<td rowspan="6">Science</td>
<td>HL&#x2193;</td>
<td><bold>0.0302 &#x00B1; 0.0008</bold></td>
<td>0.0311 &#x00B1; 0.0006</td>
<td>0.0311 &#x00B1; 0.0006</td>
<td>0.0348 &#x00B1; 0.0008</td>
<td>0.0333 &#x00B1; 0.0007</td>
<td>0.0348 &#x00B1; 0.0007</td>
<td>0.0348 &#x00B1; 0.0007</td>
</tr>
<tr>
<td>AP&#x2191;</td>
<td><bold>0.6144 &#x00B1; 0.0189</bold></td>
<td><bold>0.6144 &#x00B1; 0.0084</bold></td>
<td>0.6077 &#x00B1; 0.0081</td>
<td>0.4556 &#x00B1; 0.0135</td>
<td>0.5890 &#x00B1; 0.0158</td>
<td>0.5521 &#x00B1; 0.0105</td>
<td>0.5166 &#x00B1; 0.0116</td>
</tr>
<tr>
<td>OE&#x2193;</td>
<td><bold>0.4746 &#x00B1; 0.027</bold></td>
<td>0.4768 &#x00B1; 0.0095</td>
<td>0.4772 &#x00B1; 0.0108</td>
<td>0.6694 &#x00B1; 0.0146</td>
<td>0.4884 &#x00B1; 0.0207</td>
<td>0.5274 &#x00B1; 0.0075</td>
<td>0.5512 &#x00B1; 0.0191</td>
</tr>
<tr>
<td>RL&#x2193;</td>
<td><bold>0.0933 &#x00B1; 0.009</bold></td>
<td>0.1142 &#x00B1; 0.0088</td>
<td>0.1289 &#x00B1; 0.0046</td>
<td>0.1568 &#x00B1; 0.0025</td>
<td>0.1530 &#x00B1; 0.0106</td>
<td>0.1774 &#x00B1; 0.0083</td>
<td>0.2473 &#x00B1; 0.0056</td>
</tr>
<tr>
<td>CV&#x2193;</td>
<td><bold>0.1280 &#x00B1; 0.0109</bold></td>
<td>0.1605 &#x00B1; 0.0128</td>
<td>0.1778 &#x00B1; 0.0058</td>
<td>0.1978 &#x00B1; 0.0057</td>
<td>0.2048 &#x00B1; 0.0143</td>
<td>0.2296 &#x00B1; 0.0087</td>
<td>0.3027 &#x00B1; 0.0065</td>
</tr>
<tr>
<td>AUC&#x2191;</td>
<td>0.8829 &#x00B1; 0.0098</td>
<td>0.8557 &#x00B1; 0.0109</td>
<td>0.8600 &#x00B1; 0.0003</td>
<td>0.5346 &#x00B1; 0.0028</td>
<td>0.6415 &#x00B1; 0.0340</td>
<td><bold>0.8962 &#x00B1; 0.0031</bold></td>
<td>0.7624 &#x00B1; 0.0007</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>2) The CCSF algorithm significantly outperforms the ACML algorithm on these 54 sets of experimental results. This is because the ACML algorithm only takes into account the asymmetric relationship between the labels and does not take into account the fact that the common features of the labels also have a very significant role in multi-label classification.</p>
<p>3) The CCSF algorithm significantly outperforms the traditional LLSF algorithm and the LSGL algorithm. The reason is that the LLSF algorithm only considers the global correlation of labels. The LSGL algorithm is superior to the LLSF algorithm, which is because the LSGL algorithm not only considers the global correlation of labels, but also considers the local correlation of labels. Both of them do not consider the causal relationship between the labels and do not take into account that the common features of labels can effectively improve the performance of multi-label classification algorithms. However, we adopt a global causality and do not consider the local causality between labels, which is also a defect of the algorithm in this paper.</p>
<p>4) The experimental results of the CCSF algorithm for the average ranking of six evaluation metrics on nine datasets are demonstrated in <xref ref-type="table" rid="table-4">Table 4</xref>, which also fully proves that the adoption of causal correlation and common features of labels can effectively improve the classification performance of the LSF model.</p>
<table-wrap id="table-4">
<label>Table 4</label>
<caption>
<title>AVG results of each algorithms on five evaluation metrics</title>
</caption>
<table frame="hsides">
<colgroup>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
</colgroup>
<thead>
<tr>
<th>Metrics</th>
<th align="center" colspan="7">Average ranking</th>
</tr>
<tr>
<th/>
<th>CCSF</th>
<th>LSGL</th>
<th>ACML</th>
<th>FF-MLLA</th>
<th>LSML</th>
<th>LLSF</th>
<th>LSF-CI</th>
</tr>
</thead>
<tbody>
<tr>
<td>HL&#x2193;</td>
<td><bold>1.0556</bold></td>
<td>2.1667</td>
<td>3.2222</td>
<td>4.8889</td>
<td>5.5556</td>
<td>5.0000</td>
<td>6.1111</td>
</tr>
<tr>
<td>AP&#x2191;</td>
<td><bold>1.0556</bold></td>
<td>1.9444</td>
<td>3.0000</td>
<td>6.2222</td>
<td>4.1111</td>
<td>5.1111</td>
<td>6.5556</td>
</tr>
<tr>
<td>OE&#x2193;</td>
<td><bold>1.2222</bold></td>
<td>1.8889</td>
<td>2.8889</td>
<td>6.3333</td>
<td>4.1111</td>
<td>5.1111</td>
<td>6.4444</td>
</tr>
<tr>
<td>RL&#x2193;</td>
<td><bold>1.1111</bold></td>
<td>2.1111</td>
<td>3.0000</td>
<td>4.2222</td>
<td>4.8889</td>
<td>5.6667</td>
<td>7.0000</td>
</tr>
<tr>
<td>CV&#x2193;</td>
<td><bold>1.1111</bold></td>
<td>2.4444</td>
<td>3.2222</td>
<td>3.6667</td>
<td>5.0000</td>
<td>5.5556</td>
<td>7.0000</td>
</tr>
<tr>
<td>AUC&#x2191;</td>
<td><bold>1.2222</bold></td>
<td>2.3333</td>
<td>2.7778</td>
<td>6.2222</td>
<td>5.4444</td>
<td>4.5556</td>
<td>5.4444</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s4_3">
<label>4.3</label>
<title>Parameter Sensitivity Analysis</title>
<p>The CCSF algorithm has three main hyperparameters. <inline-formula id="ieqn-119"><mml:math id="mml-ieqn-119"><mml:mi>&#x03B1;</mml:mi></mml:math></inline-formula> and <inline-formula id="ieqn-120"><mml:math id="mml-ieqn-120"><mml:mi>&#x03B2;</mml:mi></mml:math></inline-formula> jointly adjust the contribution of the matrix <inline-formula id="ieqn-121"><mml:math id="mml-ieqn-121"><mml:mi mathvariant="bold-italic">W</mml:mi></mml:math></inline-formula>, where <inline-formula id="ieqn-122"><mml:math id="mml-ieqn-122"><mml:mi>&#x03B1;</mml:mi></mml:math></inline-formula> controls the contribution of the private features of the labels and <inline-formula id="ieqn-123"><mml:math id="mml-ieqn-123"><mml:mi>&#x03B2;</mml:mi></mml:math></inline-formula> controls the contribution of the common features of the labels. <inline-formula id="ieqn-124"><mml:math id="mml-ieqn-124"><mml:mi>&#x03B3;</mml:mi></mml:math></inline-formula> controls the effect of asymmetric LC on the model. In order to test the sensitivity of the CCSF model, we control the other two parameters unchanged and adjust one parameter at <inline-formula id="ieqn-125"><mml:math id="mml-ieqn-125"><mml:mrow><mml:mo>[</mml:mo><mml:msup><mml:mn>2</mml:mn><mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mn>10</mml:mn></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:msup><mml:mn>2</mml:mn><mml:mrow><mml:mn>10</mml:mn></mml:mrow></mml:msup><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> for the experiment, respectively, and the experimental results are shown in <xref ref-type="fig" rid="fig-2">Fig. 2</xref>. <inline-formula id="ieqn-126"><mml:math id="mml-ieqn-126"><mml:mi>&#x03C7;</mml:mi><mml:mo>=</mml:mo><mml:msup><mml:mn>2</mml:mn><mml:mrow><mml:mi>x</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula> denotes the log function of log with base 2. As shown in the figure, our algorithms all have better experimental results in general, although there are some fluctuations in <inline-formula id="ieqn-127"><mml:math id="mml-ieqn-127"><mml:mrow><mml:mo>[</mml:mo><mml:msup><mml:mn>2</mml:mn><mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mn>10</mml:mn></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:msup><mml:mn>2</mml:mn><mml:mrow><mml:mn>10</mml:mn></mml:mrow></mml:msup><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula>, which may also be due to the small intervals set by our algorithms. We suggest setting the parameters <inline-formula id="ieqn-128"><mml:math id="mml-ieqn-128"><mml:mi>&#x03B1;</mml:mi><mml:mo>=</mml:mo><mml:msup><mml:mn>2</mml:mn><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:mi>&#x03B2;</mml:mi><mml:mo>=</mml:mo><mml:msup><mml:mn>2</mml:mn><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:mi>&#x03B3;</mml:mi><mml:mo>=</mml:mo><mml:msup><mml:mn>2</mml:mn><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>.</p>
<fig id="fig-2">
<label>Figure 2</label>
<caption>
<title>Parameter sensitivity analysis on the Birds dataset</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="JAI_49083-fig-2.tif"/>
</fig>
</sec>
<sec id="s4_4">
<label>4.4</label>
<title>Component Analysis</title>
<p>In order to verify that introducing common features of labels in the model can effectively improve the performance of multi-label LSF learning algorithms. We conducted component analysis experiments on nine datasets. We compare the CCSF algorithm, which combines the common and private features of label, with the CSF algorithm, which considers only the private features of label. The experimental results are shown in <xref ref-type="fig" rid="fig-3">Fig. 3</xref>, where the CCSF algorithm outperforms the CSF algorithm on multiple datasets. This indicates that considering the common and private features of labels can effectively improve the performance of LSF algorithm. It also demonstrates that common feature learning of labels introduced into multi-label classification algorithms can improve the accuracy of the algorithms.</p>
<fig id="fig-3">
<label>Figure 3</label>
<caption>
<title>Component analysis on nine datasets</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="JAI_49083-fig-3.tif"/>
</fig>
</sec>
<sec id="s4_5">
<label>4.5</label>
<title>Statistical Hypothesis Testing</title>
<p>The statistical hypothesis tests in this paper are all based on a significance level of <inline-formula id="ieqn-129"><mml:math id="mml-ieqn-129"><mml:mi>&#x03B8;</mml:mi><mml:mo>=</mml:mo><mml:mn>0.05</mml:mn></mml:math></inline-formula>. The Friedman test [<xref ref-type="bibr" rid="ref-31">31</xref>] was first used to evaluate the comprehensive performance of the CCSF algorithm on all datasets. The obtained <inline-formula id="ieqn-130"><mml:math id="mml-ieqn-130"><mml:msub><mml:mi>F</mml:mi><mml:mrow><mml:mi>F</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is compared with the critical value of the F-test. If it is greater, the original hypothesis is rejected, and vice versa. The experimental results are shown in <xref ref-type="table" rid="table-5">Table 5</xref>. The <inline-formula id="ieqn-131"><mml:math id="mml-ieqn-131"><mml:msub><mml:mi>F</mml:mi><mml:mrow><mml:mi>F</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> of the CCSF algorithm is greater than the critical value for all evaluation metrics, so the original hypothesis is rejected for all of them.</p>
<table-wrap id="table-5">
<label>Table 5</label>
<caption>
<title>The Friedman statistics <inline-formula id="ieqn-132"><mml:math id="mml-ieqn-132"><mml:msub><mml:mi>F</mml:mi><mml:mrow><mml:mi>F</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> of the critical value and each evaluation metric</title>
</caption>
<table frame="hsides">
<colgroup>
<col align="left"/>
<col align="left"/>
<col align="left"/>
</colgroup>
<thead>
<tr>
<th>Metrics</th>
<th><inline-formula id="ieqn-133"><mml:math id="mml-ieqn-133"><mml:msub><mml:mi>F</mml:mi><mml:mrow><mml:mi>F</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula></th>
<th>Critical value</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td>HL</td>
<td>25.4452</td>
<td rowspan="6">2.2950</td>
</tr>
<tr>
<td>AP</td>
<td>153.2800</td>
</tr>
<tr>
<td>OE</td>
<td>108.3077</td>
</tr>
<tr>
<td>RL</td>
<td>82.7200</td>
</tr>
<tr>
<td>CV</td>
<td>46.6504</td>
</tr>
<tr>
<td>AUC</td>
<td>25.9775</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Nemenyi test [<xref ref-type="bibr" rid="ref-32">32</xref>] is then used to compare the CCSF algorithm with the other six algorithms on all datasets. A significant difference exists when the difference between the average rankings of the two algorithms on all datasets is greater than the Critical Difference (CD) and vice versa. CD value is calculated as follows:</p>
<p><disp-formula id="eqn-18"><label>(18)</label><mml:math id="mml-eqn-18" display="block"><mml:mi>C</mml:mi><mml:mi>D</mml:mi><mml:mo>=</mml:mo><mml:msub><mml:mi>q</mml:mi><mml:mrow><mml:mi>&#x03B8;</mml:mi></mml:mrow></mml:msub><mml:msqrt><mml:mfrac><mml:mrow><mml:mi>K</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>K</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mn>6</mml:mn><mml:mi>N</mml:mi></mml:mrow></mml:mfrac></mml:msqrt></mml:math></disp-formula>where <inline-formula id="ieqn-134"><mml:math id="mml-ieqn-134"><mml:mrow><mml:mtext>K</mml:mtext></mml:mrow><mml:mo>=</mml:mo><mml:mn>7</mml:mn></mml:math></inline-formula>, <inline-formula id="ieqn-135"><mml:math id="mml-ieqn-135"><mml:mrow><mml:mtext>N</mml:mtext></mml:mrow><mml:mo>=</mml:mo><mml:mn>9</mml:mn></mml:math></inline-formula>, <inline-formula id="ieqn-136"><mml:math id="mml-ieqn-136"><mml:msub><mml:mi>q</mml:mi><mml:mrow><mml:mi>&#x03B8;</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>2.9480</mml:mn></mml:math></inline-formula>, <inline-formula id="ieqn-137"><mml:math id="mml-ieqn-137"><mml:mrow><mml:mtext>CD</mml:mtext></mml:mrow><mml:mo>=</mml:mo><mml:mn>3.0021</mml:mn></mml:math></inline-formula>. <xref ref-type="fig" rid="fig-4">Fig. 4</xref> demonstrates the CCSF algorithm compared to other algorithms on six evaluation metrics. The algorithm performance decreases in this way from left to right. There is no significant difference between CCSF algorithm and LSGL and ACML algorithms on HL, AP, RL, CV, AUC metrics, and there is no significant difference between CCSF algorithm and LSGL, ACML, LSML algorithms on OE metrics. Other than, there is a significant difference between the CCSF algorithm and the other algorithms in six evaluation metrics. The effectiveness of the algorithm proposed in this paper can be seen from these two statistical hypothesis tests.</p>
<fig id="fig-4">
<label>Figure 4</label>
<caption>
<title>Performance comparison of the CCSF algorithm and the comparison algorithm</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="JAI_49083-fig-4.tif"/>
</fig>
</sec>
<sec id="s4_6">
<label>4.6</label>
<title>Convergence of CCSF</title>
<p>In this paper, the sentiment dataset and the yeast dataset are selected for convergence analysis. As can be seen in <xref ref-type="fig" rid="fig-5">Fig. 5</xref>, after about forty iterations, the experimental results tend to converge. We conducted the same experiment on other datasets. The convergence results are also similar.</p>
<fig id="fig-5">
<label>Figure 5</label>
<caption>
<title>Convergence of CCSF</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="JAI_49083-fig-5.tif"/>
</fig>
</sec>
</sec>
<sec id="s5">
<label>5</label>
<title>Conclusion</title>
<p>In response to the fact that most of the current LSF learning does not consider the common features of the labels. And only symmetric LC is considered in the calculation of LC. The result is the introduction of much redundant information when classification is performed, which reduces the classification performance of MLL algorithms. Based on the above problem, we use <inline-formula id="ieqn-138"><mml:math id="mml-ieqn-138"><mml:msub><mml:mi>l</mml:mi><mml:mrow><mml:mn>2</mml:mn><mml:mo>,</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>-norm and <inline-formula id="ieqn-139"><mml:math id="mml-ieqn-139"><mml:msub><mml:mi>l</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>-norm to extract the common and private features of the labels, respectively. And the asymmetric correlation between labels is calculated utilizing the causal learning algorithm. A large number of experiments are conducted on nine datasets using six evaluation metrics, and the results prove the effectiveness of the algorithm in this paper. But at the same time, we find some problems. We use a global-based causal learning algorithm, which computes the global LC. However, some labels are only associated with local labels and only have local correlation. To minimize the complexity of the model, we also did not utilize instance correlation to improve the classification accuracy of the model. To minimize the complexity of the model, we also did not utilize instance correlation to improve the classification accuracy of the model. In the future, we will try to compute the local correlation of labels using causal learning algorithms and perform experiments in conjunction with instance correlation. We observe the results of the experiments on the complete dataset and try to solve the missing label problem.</p>
</sec>
</body>
<back>
<ack>
<p>None.</p>
</ack>
<sec><title>Funding Statement</title>
<p>2022 University Research Priorities, No. 2022AH051989.</p>
</sec>
<sec><title>Author Contributions</title>
<p>The authors confirm contribution to the paper as follows: study conception and design: Y. T. Xu and D. Q. Zhang; analysis and interpretation of results: H. B. Guo and Y. T. Xu; draft manuscript preparation: Y. T. Xu and M. Y. Wang. All authors reviewed the results and approved the final version of the manuscript.</p>
</sec>
<sec sec-type="data-availability"><title>Availability of Data and Materials</title>
<p>All datasets are publicly available for download. The download URL is in <xref ref-type="sec" rid="s4_1">Section 4.1</xref>.</p>
</sec>
<sec sec-type="COI-statement"><title>Conflicts of Interest</title>
<p>The authors declare that they have no conflicts of interest to report regarding the present study.</p>
</sec>
<ref-list content-type="authoryear">
<title>References</title>
<ref id="ref-1"><label>[1]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>M. L.</given-names> <surname>Zhang</surname></string-name> and <string-name><given-names>Z. H.</given-names> <surname>Zhou</surname></string-name></person-group>, &#x201C;<article-title>A review on multi-label learning algorithms</article-title>,&#x201D; <source>IEEE Trans. Knowl. Data Eng.</source>, vol. <volume>26</volume>, no. <issue>8</issue>, pp. <fpage>1819</fpage>&#x2013;<lpage>1837</lpage>, <year>2013</year>. doi: <pub-id pub-id-type="doi">10.1109/TKDE.2013.39</pub-id>.</mixed-citation></ref>
<ref id="ref-2"><label>[2]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>W.</given-names> <surname>Wei</surname></string-name> <etal>et al.</etal></person-group>, &#x201C;<article-title>Automatic image annotation based on an improved nearest neighbor technique with tag semantic extension model</article-title>,&#x201D; <source>Procedia Comput. Sci.</source>, vol. <volume>183</volume>, no. <issue>24</issue>, pp. <fpage>616</fpage>&#x2013;<lpage>623</lpage>, <year>2021</year>. doi: <pub-id pub-id-type="doi">10.1016/j.procs.2021.02.105</pub-id>.</mixed-citation></ref>
<ref id="ref-3"><label>[3]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>T.</given-names> <surname>Qian</surname></string-name>, <string-name><given-names>F.</given-names> <surname>Li</surname></string-name>, <string-name><given-names>M. S.</given-names> <surname>Zhang</surname></string-name>, <string-name><given-names>G. N.</given-names> <surname>Jin</surname></string-name>, <string-name><given-names>P.</given-names> <surname>Fan</surname></string-name> and <string-name><given-names>W.</given-names> <surname>Dai</surname></string-name></person-group>, &#x201C;<article-title>Contrastive learning from label distribution: A case study on text classification</article-title>,&#x201D; <source>Neurocomput.</source>, vol. <volume>507</volume>, no. <issue>7</issue>, pp. <fpage>208</fpage>&#x2013;<lpage>220</lpage>, <year>2022</year>. doi: <pub-id pub-id-type="doi">10.1016/j.neucom.2022.07.076</pub-id>.</mixed-citation></ref>
<ref id="ref-4"><label>[4]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>W. Q.</given-names> <surname>Xia</surname></string-name> <etal>et al.</etal></person-group>, &#x201C;<article-title>PFmulDL: A novel strategy enabling multi-class and multi-label protein function annotation by integrating diverse deep learning methods</article-title>,&#x201D; <source>Comput. Biol. Med.</source>, vol. <volume>145</volume>, pp. <fpage>105465</fpage>, <year>2022</year>. doi: <pub-id pub-id-type="doi">10.1016/j.compbiomed.2022.105465</pub-id>; <pub-id pub-id-type="pmid">35366467</pub-id></mixed-citation></ref>
<ref id="ref-5"><label>[5]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><given-names>S. H.</given-names> <surname>Liu</surname></string-name>, <string-name><given-names>B.</given-names> <surname>Wang</surname></string-name>, <string-name><given-names>B.</given-names> <surname>Liu</surname></string-name>, and <string-name><given-names>L. T.</given-names> <surname>Yang</surname></string-name></person-group>, &#x201C;<article-title>Multi-community graph convolution networks with decision fusion for personalized recommendation</article-title>,&#x201D; in <conf-name>Pacific-Asia Conf. Knowl. Discov. Data Min.</conf-name>, <publisher-loc>Chengdu, China</publisher-loc>, <year>2022</year>, pp. <fpage>16</fpage>&#x2013;<lpage>28</lpage>.</mixed-citation></ref>
<ref id="ref-6"><label>[6]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>J. L.</given-names> <surname>Miu</surname></string-name>, <string-name><given-names>Y. B.</given-names> <surname>Wang</surname></string-name>, <string-name><given-names>Y. S.</given-names> <surname>Cheng</surname></string-name>, and <string-name><given-names>F.</given-names> <surname>Chen</surname></string-name></person-group>, &#x201C;<article-title>Parallel dual&#x2014;channel multi-label feature selection</article-title>,&#x201D; <source>Soft Comput.</source>, vol. <volume>27</volume>, no. <issue>11</issue>, pp. <fpage>7115</fpage>&#x2013;<lpage>7130</lpage>, <year>2023</year>. doi: <pub-id pub-id-type="doi">10.1007/s00500-023-07916-4</pub-id>.</mixed-citation></ref>
<ref id="ref-7"><label>[7]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>Y. B.</given-names> <surname>Wang</surname></string-name>, <string-name><given-names>W. X.</given-names> <surname>Ge</surname></string-name>, <string-name><given-names>Y. S.</given-names> <surname>Cheng</surname></string-name>, and <string-name><given-names>H. F.</given-names> <surname>Wu</surname></string-name></person-group>, &#x201C;<article-title>Weak-label-specific features learning based on multidimensional correlation</article-title>,&#x201D; <source>J. Nanjing Univ. (Natural Sci.)</source>, vol. <volume>59</volume>, no. <issue>4</issue>, pp. <fpage>690</fpage>&#x2013;<lpage>704</lpage>, <year>2023</year> (In Chinese).</mixed-citation></ref>
<ref id="ref-8"><label>[8]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>K.</given-names> <surname>Yu</surname></string-name> <etal>et al.</etal></person-group>, &#x201C;<article-title>Causality-based feature selection: Methods and evaluations</article-title>,&#x201D; <source>ACM Comput. Surv.</source>, vol. <volume>53</volume>, no. <issue>5</issue>, pp. <fpage>1</fpage>&#x2013;<lpage>36</lpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-9"><label>[9]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>J. H.</given-names> <surname>Li</surname></string-name>, <string-name><given-names>P. P.</given-names> <surname>Li</surname></string-name>, <string-name><given-names>X. G.</given-names> <surname>Hu</surname></string-name>, and <string-name><given-names>K.</given-names> <surname>Yu</surname></string-name></person-group>, &#x201C;<article-title>Learning common and label-specific features for multi-Label classification with correlation information</article-title>,&#x201D; <source>Pattern Recogn.</source>, vol. <volume>121</volume>, no. <issue>8</issue>, pp. <fpage>108257</fpage>, <year>2022</year>. doi: <pub-id pub-id-type="doi">10.1016/j.patcog.2021.108259</pub-id>.</mixed-citation></ref>
<ref id="ref-10"><label>[10]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>M. L.</given-names> <surname>Zhang</surname></string-name> and <string-name><given-names>L.</given-names> <surname>Wu</surname></string-name></person-group>, &#x201C;<article-title>LIFT: Multi-label learning with label-specific features</article-title>,&#x201D; <source>IEEE Trans. Pattern Anal. Mach. Intell.</source>, vol. <volume>37</volume>, no. <issue>1</issue>, pp. <fpage>107</fpage>&#x2013;<lpage>120</lpage>, <year>2015</year>. doi: <pub-id pub-id-type="doi">10.1109/TPAMI.2014.2339815</pub-id>; <pub-id pub-id-type="pmid">26353212</pub-id></mixed-citation></ref>
<ref id="ref-11"><label>[11]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><given-names>J.</given-names> <surname>Huang</surname></string-name>, <string-name><given-names>G.</given-names> <surname>Li</surname></string-name>, <string-name><given-names>Q.</given-names> <surname>Huang</surname></string-name>, and <string-name><given-names>X. D.</given-names> <surname>Wu</surname></string-name></person-group>, &#x201C;<article-title>Learning label specific features for multi-label classification</article-title>,&#x201D; in <conf-name>2015 IEEE Int. Conf. Data Min.</conf-name>, <publisher-loc>Atlantic City, NJ, USA</publisher-loc>, <year>2015</year>, pp. <fpage>181</fpage>&#x2013;<lpage>190</lpage>.</mixed-citation></ref>
<ref id="ref-12"><label>[12]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>Y. S.</given-names> <surname>Cheng</surname></string-name>, <string-name><given-names>K.</given-names> <surname>Qian</surname></string-name>, <string-name><given-names>Y. B.</given-names> <surname>Wang</surname></string-name>, and <string-name><given-names>D. W.</given-names> <surname>Zhao</surname></string-name></person-group>, &#x201C;<article-title>Multi-label lazy learning approach based on firefly method</article-title>,&#x201D; <source>J. Comput. Appl.</source>, vol. <volume>39</volume>, no. <issue>5</issue>, pp. <fpage>1305</fpage>&#x2013;<lpage>1311</lpage>, <year>2019</year> (In Chinese).</mixed-citation></ref>
<ref id="ref-13"><label>[13]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>W.</given-names> <surname>Weng</surname></string-name>, <string-name><given-names>Y. J.</given-names> <surname>Lin</surname></string-name>, <string-name><given-names>S. X.</given-names> <surname>Wu</surname></string-name>, <string-name><given-names>Y. W.</given-names> <surname>Li</surname></string-name>, and <string-name><given-names>Y.</given-names> <surname>Kang</surname></string-name></person-group>, &#x201C;<article-title>Multi-label learning based on label-specific features and local pairwise label correlation</article-title>,&#x201D; <source>Neurocomput.</source>, vol. <volume>273</volume>, no. <issue>9</issue>, pp. <fpage>385</fpage>&#x2013;<lpage>394</lpage>, <year>2018</year>. doi: <pub-id pub-id-type="doi">10.1016/j.neucom.2017.07.044</pub-id>.</mixed-citation></ref>
<ref id="ref-14"><label>[14]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>J.</given-names> <surname>Zhang</surname></string-name> <etal>et al.</etal></person-group>, &#x201C;<article-title>Multi label learning with label-specific features by resolving label correlation</article-title>,&#x201D; <source>Knowl.-Based Syst.</source>, vol. <volume>159</volume>, no. <issue>8</issue>, pp. <fpage>148</fpage>&#x2013;<lpage>157</lpage>, <year>2018</year>. doi: <pub-id pub-id-type="doi">10.1016/j.knosys.2018.07.003</pub-id>.</mixed-citation></ref>
<ref id="ref-15"><label>[15]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>J.</given-names> <surname>Huang</surname></string-name> <etal>et al.</etal></person-group>, &#x201C;<article-title>Improving multi-label classification with missing labels by learning label-specific features</article-title>,&#x201D; <source>Inf. Sci.</source>, vol. <volume>492</volume>, no. <issue>1</issue>, pp. <fpage>124</fpage>&#x2013;<lpage>146</lpage>, <year>2019</year>. doi: <pub-id pub-id-type="doi">10.1016/j.ins.2019.04.021</pub-id>.</mixed-citation></ref>
<ref id="ref-16"><label>[16]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>D. W.</given-names> <surname>Zhao</surname></string-name>, <string-name><given-names>Q. W.</given-names> <surname>Gao</surname></string-name>, <string-name><given-names>Y. X.</given-names> <surname>Lu</surname></string-name>, and <string-name><given-names>D.</given-names> <surname>Sun</surname></string-name></person-group>, &#x201C;<article-title>Learning multi-label label-specific features via global and local label correlations</article-title>,&#x201D; <source>Soft Comput.</source>, vol. <volume>26</volume>, no. <issue>5</issue>, pp. <fpage>2225</fpage>&#x2013;<lpage>2239</lpage>, <year>2022</year>. doi: <pub-id pub-id-type="doi">10.1007/s00500-021-06645-w</pub-id>.</mixed-citation></ref>
<ref id="ref-17"><label>[17]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>J. C.</given-names> <surname>Bao</surname></string-name>, <string-name><given-names>Y. B.</given-names> <surname>Wang</surname></string-name>, and <string-name><given-names>Y. S.</given-names> <surname>Cheng</surname></string-name></person-group>, &#x201C;<article-title>Asymmetry label correlation for multi-label learning</article-title>,&#x201D; <source>Appl. Intell.</source>, vol. <volume>55</volume>, no. <issue>6</issue>, pp. <fpage>6093</fpage>&#x2013;<lpage>6105</lpage>, <year>2022</year>. doi: <pub-id pub-id-type="doi">10.1007/s10489-021-02725-4</pub-id>.</mixed-citation></ref>
<ref id="ref-18"><label>[18]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>C.</given-names> <surname>Zhang</surname></string-name>, <string-name><given-names>Y. S.</given-names> <surname>Cheng</surname></string-name>, <string-name><given-names>Y. B.</given-names> <surname>Wang</surname></string-name>, and <string-name><given-names>Y. T.</given-names> <surname>Xu</surname></string-name></person-group>, &#x201C;<article-title>Interactive causal correlation space reshape for multi-label classification</article-title>,&#x201D; <source>Int. J. Interact. Multimed. Artif. Intell.</source>, vol. <volume>7</volume>, no. <issue>5</issue>, pp. <fpage>107</fpage>&#x2013;<lpage>120</lpage>, <year>2022</year>. doi: <pub-id pub-id-type="doi">10.9781/ijimai.2022.08.007</pub-id></mixed-citation></ref>
<ref id="ref-19"><label>[19]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>J.</given-names> <surname>Luo</surname></string-name>, <string-name><given-names>Q. W.</given-names> <surname>Gao</surname></string-name>, <string-name><given-names>Y.</given-names> <surname>Tan</surname></string-name>, <string-name><given-names>D. W.</given-names> <surname>Zhao</surname></string-name>, <string-name><given-names>Y. X.</given-names> <surname>Lu</surname></string-name> and <string-name><given-names>D.</given-names> <surname>Sun</surname></string-name></person-group>, &#x201C;<article-title>Multi label learning based on double Laplace regularization and causal inference</article-title>,&#x201D; <source>Comput. Eng.</source>, vol. <volume>49</volume>, pp. <fpage>49</fpage>&#x2013;<lpage>60</lpage>, <year>2023</year> (In Chinese).</mixed-citation></ref>
<ref id="ref-20"><label>[20]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>Y.</given-names> <surname>Tan</surname></string-name>, <string-name><given-names>D.</given-names> <surname>Sun</surname></string-name>, <string-name><given-names>Y.</given-names> <surname>Shi</surname></string-name>, <string-name><given-names>L.</given-names> <surname>Gao</surname></string-name>, <string-name><given-names>Q.</given-names> <surname>Gao</surname></string-name> and <string-name><given-names>Y.</given-names> <surname>Lu</surname></string-name></person-group>, &#x201C;<article-title>Bi-directional mapping for multi-label learning of label-specific features</article-title>,&#x201D; <source>Appl. Intell.</source>, vol. <volume>52</volume>, no. <issue>7</issue>, pp. <fpage>8147</fpage>&#x2013;<lpage>8166</lpage>, <year>2022</year>. doi: <pub-id pub-id-type="doi">10.1007/s10489-021-02868-4</pub-id>.</mixed-citation></ref>
<ref id="ref-21"><label>[21]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>J.</given-names> <surname>Zhang</surname></string-name> <etal>et al.</etal></person-group>, &#x201C;<article-title>Group-preserving label-specific feature selection for multi-label learning</article-title>,&#x201D; <source>Expert. Syst. Appl.</source>, vol. <volume>213</volume>, pp. <fpage>118861</fpage>, <year>2023</year>. doi: <pub-id pub-id-type="doi">10.1016/j.eswa.2022.118861</pub-id>.</mixed-citation></ref>
<ref id="ref-22"><label>[22]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>L. L.</given-names> <surname>Zhang</surname></string-name>, <string-name><given-names>Y. S.</given-names> <surname>Cheng</surname></string-name>, <string-name><given-names>Y. B.</given-names> <surname>Wang</surname></string-name>, and <string-name><given-names>G. S.</given-names> <surname>Pei</surname></string-name></person-group>, &#x201C;<article-title>Feature-label dual-mapping for missing label-specific features learning</article-title>,&#x201D; <source>Soft Comput.</source>, vol. <volume>25</volume>, no. <issue>14</issue>, pp. <fpage>9307</fpage>&#x2013;<lpage>9323</lpage>, <year>2021</year>. doi: <pub-id pub-id-type="doi">10.1007/s00500-021-05884-1</pub-id>.</mixed-citation></ref>
<ref id="ref-23"><label>[23]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>P.</given-names> <surname>Zhao</surname></string-name>, <string-name><given-names>S. Y.</given-names> <surname>Zhao</surname></string-name>, <string-name><given-names>X. Y.</given-names> <surname>Zhao</surname></string-name>, <string-name><given-names>H. T.</given-names> <surname>Liu</surname></string-name>, and <string-name><given-names>X.</given-names> <surname>Jia</surname></string-name></person-group>, &#x201C;<article-title>Partial multi-label learning based on sparse asymmetric label correlations</article-title>,&#x201D; <source>Knowl.-Based Syst.</source>, vol. <volume>245</volume>, pp. <fpage>108601</fpage>, <year>2022</year>. doi: <pub-id pub-id-type="doi">10.1016/j.knosys.2022.108601</pub-id>.</mixed-citation></ref>
<ref id="ref-24"><label>[24]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><given-names>D.</given-names> <surname>Margaritis</surname></string-name> and <string-name><given-names>S.</given-names> <surname>Thrun</surname></string-name></person-group>, &#x201C;<article-title>Bayesian network induction via local neighborhoods</article-title>,&#x201D; in <conf-name>Proc. Conf. Neural Inf. Process. Syst.</conf-name>, <publisher-loc>Harrahs and Harveys, Lake Tahoe, USA</publisher-loc>, <year>2000</year>, pp. <fpage>505</fpage>&#x2013;<lpage>511</lpage>.</mixed-citation></ref>
<ref id="ref-25"><label>[25]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><given-names>A.</given-names> <surname>Argyriou</surname></string-name>, <string-name><given-names>T.</given-names> <surname>Evgeniou</surname></string-name>, and <string-name><given-names>M.</given-names> <surname>Pontil</surname></string-name></person-group>, &#x201C;<article-title>Multi-task feature learning</article-title>,&#x201D; in <conf-name>Annual Conf. Neural Inf. Process. Syst.</conf-name>, <publisher-loc>Vancouver, British Columbia, Canada</publisher-loc>, <year>2006</year>, pp. <fpage>41</fpage>&#x2013;<lpage>48</lpage>.</mixed-citation></ref>
<ref id="ref-26"><label>[26]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>A.</given-names> <surname>Beck</surname></string-name> and <string-name><given-names>M.</given-names> <surname>Teboulle</surname></string-name></person-group>, &#x201C;<article-title>A fast iterative shrinkage-thresholding algorithm for linear inverse problems</article-title>,&#x201D; <source>SIAM J. Imaging Sci.</source>, vol. <volume>2</volume>, no. <issue>1</issue>, pp. <fpage>183</fpage>&#x2013;<lpage>202</lpage>, <year>2009</year>. doi: <pub-id pub-id-type="doi">10.1137/080716542</pub-id>.</mixed-citation></ref>
<ref id="ref-27"><label>[27]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>Z. C.</given-names> <surname>Lin</surname></string-name>, <string-name><given-names>A.</given-names> <surname>Ganesh</surname></string-name>, <string-name><given-names>J.</given-names> <surname>Wright</surname></string-name>, <string-name><given-names>L. Q.</given-names> <surname>Wu</surname></string-name>, <string-name><given-names>M. M.</given-names> <surname>Chen</surname></string-name> and <string-name><given-names>Y.</given-names> <surname>Ma</surname></string-name></person-group>, &#x201C;<article-title>Fast convex optimization algorithms for exact recovery of a corrupted low-rank matrix</article-title>,&#x201D; <source>Coordinated Sci. Lab. Report</source>, vol. <volume>246</volume>, pp. <fpage>2214</fpage>, <year>2009</year>.</mixed-citation></ref>
<ref id="ref-28"><label>[28]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>D. W.</given-names> <surname>Zhao</surname></string-name>, <string-name><given-names>Q. W.</given-names> <surname>Gao</surname></string-name>, <string-name><given-names>Y. X.</given-names> <surname>Lu</surname></string-name>, and <string-name><given-names>D.</given-names> <surname>Sun</surname></string-name></person-group>, &#x201C;<article-title>Learning view-specific labels and label-feature dependence maximization for multi-view multi-label classification</article-title>,&#x201D; <source>Appl. Soft Comput.</source>, vol. <volume>124</volume>, no. <issue>8</issue>, pp. <fpage>109071</fpage>, <year>2022</year>. doi: <pub-id pub-id-type="doi">10.1016/j.asoc.2022.109071</pub-id>.</mixed-citation></ref>
<ref id="ref-29"><label>[29]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>K.</given-names> <surname>Qian</surname></string-name>, <string-name><given-names>X. Y.</given-names> <surname>Min</surname></string-name>, <string-name><given-names>Y. S.</given-names> <surname>Cheng</surname></string-name>, and <string-name><given-names>F.</given-names> <surname>Min</surname></string-name></person-group>, &#x201C;<article-title>Weight matrix sharing for multi-label learning</article-title>,&#x201D; <source>Pattern Recogn.</source>, vol. <volume>136</volume>, pp. <fpage>109156</fpage>, <year>2023</year>. doi: <pub-id pub-id-type="doi">10.1016/j.patcog.2022.109156</pub-id>.</mixed-citation></ref>
<ref id="ref-30"><label>[30]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>H. R.</given-names> <surname>Han</surname></string-name>, <string-name><given-names>M. X.</given-names> <surname>Huang</surname></string-name>, <string-name><given-names>Y.</given-names> <surname>Zhang</surname></string-name>, <string-name><given-names>X. G.</given-names> <surname>Yang</surname></string-name>, and <string-name><given-names>W. G.</given-names> <surname>Feng</surname></string-name></person-group>, &#x201C;<article-title>Multi-label learning with label specific features using correlation information</article-title>,&#x201D; <source>IEEE Access</source>, vol. <volume>7</volume>, pp. <fpage>11474</fpage>&#x2013;<lpage>11484</lpage>, <year>2019</year>. doi: <pub-id pub-id-type="doi">10.1109/ACCESS.2019.2891611</pub-id>.</mixed-citation></ref>
<ref id="ref-31"><label>[31]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>J.</given-names> <surname>Demsar</surname></string-name></person-group>, &#x201C;<article-title>Statistical comparisons of classifiers over multiple data sets</article-title>,&#x201D; <source>J. Mach Learn. Res.</source>, vol. <volume>7</volume>, no. <issue>1</issue>, pp. <fpage>1</fpage>&#x2013;<lpage>30</lpage>, <year>2006</year>.</mixed-citation></ref>
<ref id="ref-32"><label>[32]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>D.</given-names> <surname>Zhao</surname></string-name>, <string-name><given-names>H.</given-names> <surname>Li</surname></string-name>, <string-name><given-names>Y.</given-names> <surname>Lu</surname></string-name>, <string-name><given-names>D.</given-names> <surname>Sun</surname></string-name>, <string-name><given-names>D.</given-names> <surname>Zhu</surname></string-name> and <string-name><given-names>Q.</given-names> <surname>Gao</surname></string-name></person-group>, &#x201C;<article-title>Multi label weak-label learning via semantic reconstruction and label correlations</article-title>,&#x201D; <source>Inf. Sci.</source>, vol. <volume>623</volume>, no. <issue>8</issue>, pp. <fpage>379</fpage>&#x2013;<lpage>401</lpage>, <year>2023</year>. doi: <pub-id pub-id-type="doi">10.1016/j.ins.2022.12.047</pub-id>.</mixed-citation></ref>
</ref-list>
</back></article>